ISEc10
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004431 | ND | Escherichia coli | Escherichia coli CFT073 |
DNA section
IS Length : 2410 bp
Ends
IR Length : 34/48
IRL : TATGAGACGACAATAACTCAGTCCGCCTGGCGCCGATTATTCTGGCCGGT
IRR : TATTAGACGTCAACTAGTTTTGCCGACTCGCGCCAACTATACTGGCCGCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AAAAGCGAAG | AGCAAC | TATGAGACGACAATA | 6 |
AGTTGACGTCTAATA | AGCAAC | TCATTAACGA | 6 |
CGATAATTTT | ACTGCC | TATGAGACGACAATA | 6 |
AGTTGACGTCTAATA | ACTGCC | AAATTTGCAG | 6 |
TTTATCTTCT | GTCTTC | TATGAGACGACAATA | 6 |
AGTTGACGTCTAATA | GTCTTC | AGAGGTATCC | 6 |
ATGGTCCGGA | CAAAC | TATGAGACGACAATA | 5 |
AGTTGACGTCTAATA | CAAAC | GGAGCTCCCC | 5 |
GTGCAGAGGT | GACCT | TATGAGACGACAATA | 5 |
AGTTGACGTCTAATA | GACCT | GCTTTATCAA | 5 |
TCGAGCGTCA | GAGTC | TATGAGACGACAATA | 5 |
AGTTGACGTCTAATA | GAGTC | AGTTTCATCG | 5 |
DNA sequence
TATGAGACGACAATAACTCAGTCCGCCTGGCGCCGATTATTCTGGCCGGTTTTGTAACCTGAGTGTCATTTACATGATCCAACAGGAAATGGCGTGACGC
TCAATACTTCTCAGGTCAGTTACTATATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAAGGCTGGTATCTCAGTCCGTTCTGGTCG
TCGGATCGAAAAAGGAGAGTGGGCAAAAAACAGTGTTCGGCACTGGCGCACACGCAAAGATCCTCTGGAAGCTGTGTGGGACAGCATGCTTGTTCCTCTG
TTGAAAGAGAGGCCGGCTCTGACACCAACAACTCTGCTGGAGATGCTACAGGATAAATATCCCGGCCAGTACCCCAACAGCCTTCGAAGAACAATGCAAC
GGCGGGTCCGCGAATGGAAGCTACAGTATGGTGCAGAGCAGGAGGTCATGTTCCGCCAGCGACATCAGCCCGGTCTGCGAGGTCTGTCGGACTTTACTGA
ACTGAAAGGTGTAGTTGTCACCATCGCCGGTAAGTTGTTGGCGCATAAGTTGTATCACTTCCGTCTGGAATGGAGCCACTGGAGCTGGATGCGGGTTGTG
CTGGGTGGTGAGAGCTTCTCTGCTCTGGCTGAAGGTCTGCAGGAAGCCCTCGGACAACTGGGCGGAGTGCCGGTAGAACATAAAACGGACAGCCTGAGGG
CAGCATGGAAACAACAGGGCGAAGATGGACGCCGCGAGCTGACTGAGCGTTATGCTGCTCTCTGTCAGCACTACGGAATGCAGGGCGTACACAATAATGC
CGGTCGGGGCCACGAAAATGGCTCGGTTGAAAGTGCCCACGGACATCTGAAAAGGCGTATCTGTCAGGCGCTGATACTGCGGGGCAGTAACGACTTCAGC
ACCATAGAAGAATATCAGGCCTTCATCACTCAGCAGGTTATGCGGCACAACCGTAACAATCAGGATCTGGTCAAGGAAGAACGTCTTCATCTGAAACCGC
TGCCGCTTCGTCGCAGTGCTGACTATGATGAGCTGACTGTGAGGGTTAGCCGCAGCAGTACCATCAATGTGAAGCACGTCGTCTACAGCGTACCTTCCCG
GCTTGTAGGTCAACTGTTACGGGTCCGGTTATGGGACGATCGTCTGAGCTGTTACGTTGGCAGCAGCGAGGTCATGAGCTGCCCACGTGTCAGACCAGAA
AAAGGGAAGACGCGGGCCCGTCGTATCGACTTCCGACATGTGATCGACAGTCTGGCAAAAAAGCCCGGTGCGTTCTGCCATGCAACGCTGAGAAATGACA
TCCTGCCAGACGATGAATGGCGGAGGCTGTGGCGTCGCTTATGTAATCATCTGGAACCCGACATGGCAGGCAGGCTGATGGTACATGCTCTGAAACTGGC
TGCAGGATACGACGATATCTCAGTCGTGGCAAAAGGTATGGAGCAGATGCTGAATACCCCGGGAAACGTGGATCTGCACCGGCTGATGCGCTTCCTGGGT
ATAAAGGAAAAGGCGTTGCCGGTAGTCAATGTGAAACAGCATAACCTGAGCAGTTATGAGCAACTACTGCGTGGCAAGGGAGGTTCGCAGTGAGCAATAT
CCATCACCTTGAACGCAGCCTGCGTAAACTACGCCTGACACGAGTTGGAGCTGAATGGCACGCTCTGGAAAAACGAGCACTGGCAGAAGGCTGGACACCA
TCGCGCTATCTTCTGACGCTATGCAATGAAGAACTCCTGTGGCGCGAGAGTGAAAAACTGCGTCGTTATAAAAAGGAGGCCCGGTTGCCAGTTGCCAAAA
CGCTAAGCGAATACGACTTCAGTCAGGTGCCGGAACTGAATGGAGCTCAGTTCCGGCAACTCTGTGAAACGACAGACTGGGTTGATGCAGGAGAAAACGT
TCTGCTGTTCGGAGCCAGCGGGTTGGGGAAAAGCCATCTGGCGGCAGCGATCGTGGATGGCGTAGTAGGCCAGGGCTACCGGGCCCGGTTCTACAGCGCA
GGAGAGTTGTTGCAGGAACTACGTAAAGCCAGAGCGCAGTTGAAACTGAATGAGCTGCTACTGAAACTGGATCGCTACCGGGTGATAGTGGTGGATGATC
TTGGCTATGTCAAACGCGACAGCGCCGAAACGGGAGTACTGTTCGAGTTAATAGCGCATCGCTATGAACGTGGGAGCCTGGTGATAACCAGTAACCATCC
GTTCAGCATGTGGGGCAGCATCTTCGTGGATGAGACTATGGCGGTGGCGGCGGCAGACCGGCTGATCCATCACGGATATATGTTCGAACTGAAAGGTGAA
AGCTACAGGAAAAAGACAGCGAAGGCAGTAACAAGCGCGACTTGATGTCGCACTGAAGGGTGCGGCCAGTATAGTTGGCGCGAGTCGGCAAAACTAGTTG
ACGTCTAATA
TCAATACTTCTCAGGTCAGTTACTATATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAAGGCTGGTATCTCAGTCCGTTCTGGTCG
TCGGATCGAAAAAGGAGAGTGGGCAAAAAACAGTGTTCGGCACTGGCGCACACGCAAAGATCCTCTGGAAGCTGTGTGGGACAGCATGCTTGTTCCTCTG
TTGAAAGAGAGGCCGGCTCTGACACCAACAACTCTGCTGGAGATGCTACAGGATAAATATCCCGGCCAGTACCCCAACAGCCTTCGAAGAACAATGCAAC
GGCGGGTCCGCGAATGGAAGCTACAGTATGGTGCAGAGCAGGAGGTCATGTTCCGCCAGCGACATCAGCCCGGTCTGCGAGGTCTGTCGGACTTTACTGA
ACTGAAAGGTGTAGTTGTCACCATCGCCGGTAAGTTGTTGGCGCATAAGTTGTATCACTTCCGTCTGGAATGGAGCCACTGGAGCTGGATGCGGGTTGTG
CTGGGTGGTGAGAGCTTCTCTGCTCTGGCTGAAGGTCTGCAGGAAGCCCTCGGACAACTGGGCGGAGTGCCGGTAGAACATAAAACGGACAGCCTGAGGG
CAGCATGGAAACAACAGGGCGAAGATGGACGCCGCGAGCTGACTGAGCGTTATGCTGCTCTCTGTCAGCACTACGGAATGCAGGGCGTACACAATAATGC
CGGTCGGGGCCACGAAAATGGCTCGGTTGAAAGTGCCCACGGACATCTGAAAAGGCGTATCTGTCAGGCGCTGATACTGCGGGGCAGTAACGACTTCAGC
ACCATAGAAGAATATCAGGCCTTCATCACTCAGCAGGTTATGCGGCACAACCGTAACAATCAGGATCTGGTCAAGGAAGAACGTCTTCATCTGAAACCGC
TGCCGCTTCGTCGCAGTGCTGACTATGATGAGCTGACTGTGAGGGTTAGCCGCAGCAGTACCATCAATGTGAAGCACGTCGTCTACAGCGTACCTTCCCG
GCTTGTAGGTCAACTGTTACGGGTCCGGTTATGGGACGATCGTCTGAGCTGTTACGTTGGCAGCAGCGAGGTCATGAGCTGCCCACGTGTCAGACCAGAA
AAAGGGAAGACGCGGGCCCGTCGTATCGACTTCCGACATGTGATCGACAGTCTGGCAAAAAAGCCCGGTGCGTTCTGCCATGCAACGCTGAGAAATGACA
TCCTGCCAGACGATGAATGGCGGAGGCTGTGGCGTCGCTTATGTAATCATCTGGAACCCGACATGGCAGGCAGGCTGATGGTACATGCTCTGAAACTGGC
TGCAGGATACGACGATATCTCAGTCGTGGCAAAAGGTATGGAGCAGATGCTGAATACCCCGGGAAACGTGGATCTGCACCGGCTGATGCGCTTCCTGGGT
ATAAAGGAAAAGGCGTTGCCGGTAGTCAATGTGAAACAGCATAACCTGAGCAGTTATGAGCAACTACTGCGTGGCAAGGGAGGTTCGCAGTGAGCAATAT
CCATCACCTTGAACGCAGCCTGCGTAAACTACGCCTGACACGAGTTGGAGCTGAATGGCACGCTCTGGAAAAACGAGCACTGGCAGAAGGCTGGACACCA
TCGCGCTATCTTCTGACGCTATGCAATGAAGAACTCCTGTGGCGCGAGAGTGAAAAACTGCGTCGTTATAAAAAGGAGGCCCGGTTGCCAGTTGCCAAAA
CGCTAAGCGAATACGACTTCAGTCAGGTGCCGGAACTGAATGGAGCTCAGTTCCGGCAACTCTGTGAAACGACAGACTGGGTTGATGCAGGAGAAAACGT
TCTGCTGTTCGGAGCCAGCGGGTTGGGGAAAAGCCATCTGGCGGCAGCGATCGTGGATGGCGTAGTAGGCCAGGGCTACCGGGCCCGGTTCTACAGCGCA
GGAGAGTTGTTGCAGGAACTACGTAAAGCCAGAGCGCAGTTGAAACTGAATGAGCTGCTACTGAAACTGGATCGCTACCGGGTGATAGTGGTGGATGATC
TTGGCTATGTCAAACGCGACAGCGCCGAAACGGGAGTACTGTTCGAGTTAATAGCGCATCGCTATGAACGTGGGAGCCTGGTGATAACCAGTAACCATCC
GTTCAGCATGTGGGGCAGCATCTTCGTGGATGAGACTATGGCGGTGGCGGCGGCAGACCGGCTGATCCATCACGGATATATGTTCGAACTGAAAGGTGAA
AGCTACAGGAAAAAGACAGCGAAGGCAGTAACAAGCGCGACTTGATGTCGCACTGAAGGGTGCGGCCAGTATAGTTGGCGCGAGTCGGCAAAACTAGTTG
ACGTCTAATA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1500 bp | 499 aa | 94 | 1593 | + | No |
Chemistry : DDE
ORF sequence :
MTLNTSQVSYYMTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRHWRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPGQYPNSLRRT
MQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLAHKLYHFRLEWSHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDS
LRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHGHLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHL
KPLPLRRSADYDELTVRVSRSSTINVKHVVYSVPSRLVGQLLRVRLWDDRLSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGMEQMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQLLRGKGGSQ
MQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLAHKLYHFRLEWSHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDS
LRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHGHLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHL
KPLPLRRSADYDELTVRVSRSSTINVKHVVYSVPSRLVGQLLRVRLWDDRLSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGMEQMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQLLRGKGGSQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
756 bp | 251 aa | 1590 | 2345 | + | No |
AG : IS21 helper
ORF sequence :
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELLWRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSAT
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSAT
Blast result :
Comments
There are 6 intact copies of the element in the CFT073 genome sequence, as well as a seventh partial copy with both terminal and internal deletions. [1]
A copy of this element was sequenced previously from the same strain [GenBank accession number AF081285, bases 8645..11051; 2], but with 3 single-base sequence differences which disrupted the amino-terminal part of ORF2. Those authors designated the C-terminal 488 aa of ORF1 as "R6" (they selected a downstream start codon), and the C-terminal 195 aa of ORF2 as "R5" or "orfB".
A partial copy (nucleotides 1730-2410) of the element is present in the large virulence plasmids of Shigella flexneri, pWR100 [AL391753; 3] and pWR501 [AF348706; 4].
The ORFs have strong similarities (40% identity) to the corresponding ORFs from an IS element in Bradyrhizobium elkanii [AB062279; 5].
A copy of this element was sequenced previously from the same strain [GenBank accession number AF081285, bases 8645..11051; 2], but with 3 single-base sequence differences which disrupted the amino-terminal part of ORF2. Those authors designated the C-terminal 488 aa of ORF1 as "R6" (they selected a downstream start codon), and the C-terminal 195 aa of ORF2 as "R5" or "orfB".
A partial copy (nucleotides 1730-2410) of the element is present in the large virulence plasmids of Shigella flexneri, pWR100 [AL391753; 3] and pWR501 [AF348706; 4].
The ORFs have strong similarities (40% identity) to the corresponding ORFs from an IS element in Bradyrhizobium elkanii [AB062279; 5].
References
1] Welch,R.A., Burland,V., Plunkett,G.,III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R. (2002) Proc. Natl. Acad. Sci. USA 99 (26), 17020-17024
2]Guyer,D.M., Kao,J.S. and Mobley,H.L.T. (1998) Infect. Immun. 66 (9), 4411-4417
3]Buchrieser,C., Glaser,P., Rusniok,C., Nedjari,H., D'Hauteville,H., Kunst,F., Sansonetti,P. and Parsot,C. (2000) Mol. Microbiol. 38 (4),
760-771 [PubMed: 11115111]
4]Venkatesan,M.M., Goldberg,M.B., Rose,D.J., Grotbeck,E.J., Burland,V.
and Blattner,F.R. (2001) Infect. Immun. 69 (5),3271-3285
5]Yasuta,T., Okazaki,S., Mitsui,H., Yuhashi,K., Ezura,H. and Minamisawa,K. (2001) Appl. Environ. Microbiol. 67 (11), 4999-5009
2]Guyer,D.M., Kao,J.S. and Mobley,H.L.T. (1998) Infect. Immun. 66 (9), 4411-4417
3]Buchrieser,C., Glaser,P., Rusniok,C., Nedjari,H., D'Hauteville,H., Kunst,F., Sansonetti,P. and Parsot,C. (2000) Mol. Microbiol. 38 (4),
760-771 [PubMed: 11115111]
4]Venkatesan,M.M., Goldberg,M.B., Rose,D.J., Grotbeck,E.J., Burland,V.
and Blattner,F.R. (2001) Infect. Immun. 69 (5),3271-3285
5]Yasuta,T., Okazaki,S., Mitsui,H., Yuhashi,K., Ezura,H. and Minamisawa,K. (2001) Appl. Environ. Microbiol. 67 (11), 4999-5009