ISEc21
- Family IS110
- Group
Isoform Synonym(s) ISCfr5
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Escherichia coli | Citrobacter freundii Escherichia coli O127:H6 E2348/69 |
DNA section
IS Length : 1374 bp
Ends
Left end : AGTAATAATGCCGGTATCAGTTTTTATCATCACTCTGTTTGCTGTTTAACCAGACTGGTGTGATTACTGATGCAGTGAAGACCTTCCCGCATCCTGACTC
Right end : CTGAAGCGCTATATCTCACGCGAAGTTTATACATTACTGCGTAATCAAAACAGGCAGATCAACAGCATCCCGATAACGGCTTGACTCTTAGAAGGGCGTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCACAGCGAT | CAGGGAAGCC | 0 | |
CCACAGCGAT | CAGGGCAGCC | 0 | |
CCATAGCGAC | CAGGGAAGCC | 0 | |
CCACAGTGAT | CAAGGATGGC | 0 |
DNA sequence
AGTAATAATGCCGGTATCAGTTTTTATCATCACTCTGTTTGCTGTTTAACCAGACTGGTGTGATTACTGATGCAGTGAAGACCTTCCCGCATCCTGACTC
ACACAGCGATCGACCCTTTGTGTCCTGCCCTGGACCTGTCGGTTGCCGGAAGCGCCTTCATGCGAGGCGTCTCCTCACCGATGCGCGTGACTCAAGAAGG
GCCTGACGGTTTGTCTCGTTACTGTCCTGTCCGGGTTATCTGTCTGGAGATTCAACTCTGTTTCCTCACAGGAGCTCTGTTATGGCAGGTAAAGTTACGG
AAACCGCTGTTGTGGGTGGCGTGGATACACATAAAGATCTGCACGTTGCCGCTGTCGTAGATCAGAACAATAAAGTTCTGGGGACCCAGTTTTTCTCCAC
AACACGGCAAGGTTACCGGCAGATGCTGGCATGGATGACTTCGTTTGGGGCATTAAAGCGAATTGGTGTTGAGTGTACAGGCACCTATGGATCAGGTCTG
CTTCGCTATTTACAGAATGCCGGGTTAGACGTTCTTGAGGTGACTGCGCCAGATCGGATGGAGCGACGCAAACGGGGTAAAAGTGACACGATTGATGCTG
AATGTGCCGCTCACGCCGCATTCTCCGGAATAAGAACCGTCACACCCAAAACGCGCAATGGCATGATTGAGTCTCTGCGGGTATTAAAAACTTGCCGAAA
AACAGCAATATCAGCCCGCAGAGTCGCTCTCCAGATTATCCATTCCAATATTATCTCTGCCCCGGATGAATTACGTGAACAGCTCAGAAATATGACGCGC
ATGCAGCTCATCAGGACTCTGGGATCCTGGCGGCCTGATGCCAGTGAATACCGCAATGTTACCAACGTTTATCGTATTTCATTAAAGTCCCTTGCCCGAC
GCTATCTCGAGTTACATGACGAAATCGCTGATTTGGATGTCATGATTGCGGCAATTGTCGATGAGCTGGCGCCTGAACTGATTAAACGTAATGCTATTGG
ATACGAAAGCGCTTCGCAGTTGCTGATCACGGCAGGAGACAATCCCCAACGATTAAGATCAGAATCAGGTTTTGCGGCACTGTGTGGTGTCAGCCCTGTT
CCCGTATCTTCAGGAAAAACGAATCGTTATCGACTTAACCGGGGTGGAGATCGTGCTGCAAATAGTGCACTTCACATCATTGCCATCGGACGTTTGCGAA
CTGACGATAAAACGAAGGAATATGTCGCCAGACGAGTAGCGGAAGGGCATACAAAAATGGAAGCAATACGCTGCCTGAAGCGCTATATCTCACGCGAAGT
TTATACATTACTGCGTAATCAAAACAGGCAGATCAACAGCATCCCGATAACGGCTTGACTCTTAGAAGGGCGTC
ACACAGCGATCGACCCTTTGTGTCCTGCCCTGGACCTGTCGGTTGCCGGAAGCGCCTTCATGCGAGGCGTCTCCTCACCGATGCGCGTGACTCAAGAAGG
GCCTGACGGTTTGTCTCGTTACTGTCCTGTCCGGGTTATCTGTCTGGAGATTCAACTCTGTTTCCTCACAGGAGCTCTGTTATGGCAGGTAAAGTTACGG
AAACCGCTGTTGTGGGTGGCGTGGATACACATAAAGATCTGCACGTTGCCGCTGTCGTAGATCAGAACAATAAAGTTCTGGGGACCCAGTTTTTCTCCAC
AACACGGCAAGGTTACCGGCAGATGCTGGCATGGATGACTTCGTTTGGGGCATTAAAGCGAATTGGTGTTGAGTGTACAGGCACCTATGGATCAGGTCTG
CTTCGCTATTTACAGAATGCCGGGTTAGACGTTCTTGAGGTGACTGCGCCAGATCGGATGGAGCGACGCAAACGGGGTAAAAGTGACACGATTGATGCTG
AATGTGCCGCTCACGCCGCATTCTCCGGAATAAGAACCGTCACACCCAAAACGCGCAATGGCATGATTGAGTCTCTGCGGGTATTAAAAACTTGCCGAAA
AACAGCAATATCAGCCCGCAGAGTCGCTCTCCAGATTATCCATTCCAATATTATCTCTGCCCCGGATGAATTACGTGAACAGCTCAGAAATATGACGCGC
ATGCAGCTCATCAGGACTCTGGGATCCTGGCGGCCTGATGCCAGTGAATACCGCAATGTTACCAACGTTTATCGTATTTCATTAAAGTCCCTTGCCCGAC
GCTATCTCGAGTTACATGACGAAATCGCTGATTTGGATGTCATGATTGCGGCAATTGTCGATGAGCTGGCGCCTGAACTGATTAAACGTAATGCTATTGG
ATACGAAAGCGCTTCGCAGTTGCTGATCACGGCAGGAGACAATCCCCAACGATTAAGATCAGAATCAGGTTTTGCGGCACTGTGTGGTGTCAGCCCTGTT
CCCGTATCTTCAGGAAAAACGAATCGTTATCGACTTAACCGGGGTGGAGATCGTGCTGCAAATAGTGCACTTCACATCATTGCCATCGGACGTTTGCGAA
CTGACGATAAAACGAAGGAATATGTCGCCAGACGAGTAGCGGAAGGGCATACAAAAATGGAAGCAATACGCTGCCTGAAGCGCTATATCTCACGCGAAGT
TTATACATTACTGCGTAATCAAAACAGGCAGATCAACAGCATCCCGATAACGGCTTGACTCTTAGAAGGGCGTC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1077 bp | 358 aa | 282 | 1358 | + | No |
Chemistry : DEDD
ORF sequence :
MAGKVTETAVVGGVDTHKDLHVAAVVDQNNKVLGTQFFSTTRQGYRQMLAWMTSFGALKRIGVECTGTYGSGLLRYLQNAGLDVLEVTAPDRMERRKRGK
SDTIDAECAAHAAFSGIRTVTPKTRNGMIESLRVLKTCRKTAISARRVALQIIHSNIISAPDELREQLRNMTRMQLIRTLGSWRPDASEYRNVTNVYRIS
LKSLARRYLELHDEIADLDVMIAAIVDELAPELIKRNAIGYESASQLLITAGDNPQRLRSESGFAALCGVSPVPVSSGKTNRYRLNRGGDRAANSALHII
AIGRLRTDDKTKEYVARRVAEGHTKMEAIRCLKRYISREVYTLLRNQNRQINSIPITA
SDTIDAECAAHAAFSGIRTVTPKTRNGMIESLRVLKTCRKTAISARRVALQIIHSNIISAPDELREQLRNMTRMQLIRTLGSWRPDASEYRNVTNVYRIS
LKSLARRYLELHDEIADLDVMIAAIVDELAPELIKRNAIGYESASQLLITAGDNPQRLRSESGFAALCGVSPVPVSSGKTNRYRLNRGGDRAANSALHII
AIGRLRTDDKTKEYVARRVAEGHTKMEAIRCLKRYISREVYTLLRNQNRQINSIPITA
Blast result :
Comments
5 identical copies in E.coli E2348/69 chromosome. An example of a complete ISEc21 is found in the E.coli E2348/69 genome sequence at co-ordinates 4558241-4559614. ISEc21 is 56% aa similar to ISLxx2.
References
1] Atsushi Iguchi, Tetsuya Hayashi (2008) Direct submission.