ISEc10

  • Family IS21
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
NC_004431 ND Escherichia coli
Escherichia coli CFT073
DNA section
IS Length : 2410 bp

Ends


IR Length : 34/48

IRL : TATGAGACGACAATAACTCAGTCCGCCTGGCGCCGATTATTCTGGCCGGT
IRR : TATTAGACGTCAACTAGTTTTGCCGACTCGCGCCAACTATACTGGCCGCA

Insertion site


Left flankDirect repeatRight flankDR Length
AAAAGCGAAGAGCAACTATGAGACGACAATA6
AGTTGACGTCTAATAAGCAACTCATTAACGA6
CGATAATTTTACTGCCTATGAGACGACAATA6
AGTTGACGTCTAATAACTGCCAAATTTGCAG6
TTTATCTTCTGTCTTCTATGAGACGACAATA6
AGTTGACGTCTAATAGTCTTCAGAGGTATCC6
ATGGTCCGGACAAACTATGAGACGACAATA5
AGTTGACGTCTAATACAAACGGAGCTCCCC5
GTGCAGAGGTGACCTTATGAGACGACAATA5
AGTTGACGTCTAATAGACCTGCTTTATCAA5
TCGAGCGTCAGAGTCTATGAGACGACAATA5
AGTTGACGTCTAATAGAGTCAGTTTCATCG5

DNA sequence

TATGAGACGACAATAACTCAGTCCGCCTGGCGCCGATTATTCTGGCCGGTTTTGTAACCTGAGTGTCATTTACATGATCCAACAGGAAATGGCGTGACGC
TCAATACTTCTCAGGTCAGTTACTATATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAAGGCTGGTATCTCAGTCCGTTCTGGTCG
TCGGATCGAAAAAGGAGAGTGGGCAAAAAACAGTGTTCGGCACTGGCGCACACGCAAAGATCCTCTGGAAGCTGTGTGGGACAGCATGCTTGTTCCTCTG
TTGAAAGAGAGGCCGGCTCTGACACCAACAACTCTGCTGGAGATGCTACAGGATAAATATCCCGGCCAGTACCCCAACAGCCTTCGAAGAACAATGCAAC
GGCGGGTCCGCGAATGGAAGCTACAGTATGGTGCAGAGCAGGAGGTCATGTTCCGCCAGCGACATCAGCCCGGTCTGCGAGGTCTGTCGGACTTTACTGA
ACTGAAAGGTGTAGTTGTCACCATCGCCGGTAAGTTGTTGGCGCATAAGTTGTATCACTTCCGTCTGGAATGGAGCCACTGGAGCTGGATGCGGGTTGTG
CTGGGTGGTGAGAGCTTCTCTGCTCTGGCTGAAGGTCTGCAGGAAGCCCTCGGACAACTGGGCGGAGTGCCGGTAGAACATAAAACGGACAGCCTGAGGG
CAGCATGGAAACAACAGGGCGAAGATGGACGCCGCGAGCTGACTGAGCGTTATGCTGCTCTCTGTCAGCACTACGGAATGCAGGGCGTACACAATAATGC
CGGTCGGGGCCACGAAAATGGCTCGGTTGAAAGTGCCCACGGACATCTGAAAAGGCGTATCTGTCAGGCGCTGATACTGCGGGGCAGTAACGACTTCAGC
ACCATAGAAGAATATCAGGCCTTCATCACTCAGCAGGTTATGCGGCACAACCGTAACAATCAGGATCTGGTCAAGGAAGAACGTCTTCATCTGAAACCGC
TGCCGCTTCGTCGCAGTGCTGACTATGATGAGCTGACTGTGAGGGTTAGCCGCAGCAGTACCATCAATGTGAAGCACGTCGTCTACAGCGTACCTTCCCG
GCTTGTAGGTCAACTGTTACGGGTCCGGTTATGGGACGATCGTCTGAGCTGTTACGTTGGCAGCAGCGAGGTCATGAGCTGCCCACGTGTCAGACCAGAA
AAAGGGAAGACGCGGGCCCGTCGTATCGACTTCCGACATGTGATCGACAGTCTGGCAAAAAAGCCCGGTGCGTTCTGCCATGCAACGCTGAGAAATGACA
TCCTGCCAGACGATGAATGGCGGAGGCTGTGGCGTCGCTTATGTAATCATCTGGAACCCGACATGGCAGGCAGGCTGATGGTACATGCTCTGAAACTGGC
TGCAGGATACGACGATATCTCAGTCGTGGCAAAAGGTATGGAGCAGATGCTGAATACCCCGGGAAACGTGGATCTGCACCGGCTGATGCGCTTCCTGGGT
ATAAAGGAAAAGGCGTTGCCGGTAGTCAATGTGAAACAGCATAACCTGAGCAGTTATGAGCAACTACTGCGTGGCAAGGGAGGTTCGCAGTGAGCAATAT
CCATCACCTTGAACGCAGCCTGCGTAAACTACGCCTGACACGAGTTGGAGCTGAATGGCACGCTCTGGAAAAACGAGCACTGGCAGAAGGCTGGACACCA
TCGCGCTATCTTCTGACGCTATGCAATGAAGAACTCCTGTGGCGCGAGAGTGAAAAACTGCGTCGTTATAAAAAGGAGGCCCGGTTGCCAGTTGCCAAAA
CGCTAAGCGAATACGACTTCAGTCAGGTGCCGGAACTGAATGGAGCTCAGTTCCGGCAACTCTGTGAAACGACAGACTGGGTTGATGCAGGAGAAAACGT
TCTGCTGTTCGGAGCCAGCGGGTTGGGGAAAAGCCATCTGGCGGCAGCGATCGTGGATGGCGTAGTAGGCCAGGGCTACCGGGCCCGGTTCTACAGCGCA
GGAGAGTTGTTGCAGGAACTACGTAAAGCCAGAGCGCAGTTGAAACTGAATGAGCTGCTACTGAAACTGGATCGCTACCGGGTGATAGTGGTGGATGATC
TTGGCTATGTCAAACGCGACAGCGCCGAAACGGGAGTACTGTTCGAGTTAATAGCGCATCGCTATGAACGTGGGAGCCTGGTGATAACCAGTAACCATCC
GTTCAGCATGTGGGGCAGCATCTTCGTGGATGAGACTATGGCGGTGGCGGCGGCAGACCGGCTGATCCATCACGGATATATGTTCGAACTGAAAGGTGAA
AGCTACAGGAAAAAGACAGCGAAGGCAGTAACAAGCGCGACTTGATGTCGCACTGAAGGGTGCGGCCAGTATAGTTGGCGCGAGTCGGCAAAACTAGTTG
ACGTCTAATA
Protein section
ORF number : 2

 

ORF 1
LengthBeginEndStrandFusion ORF
1500 bp499 aa941593+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MTLNTSQVSYYMTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRHWRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPGQYPNSLRRT
MQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLAHKLYHFRLEWSHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDS
LRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHGHLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHL
KPLPLRRSADYDELTVRVSRSSTINVKHVVYSVPSRLVGQLLRVRLWDDRLSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGMEQMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQLLRGKGGSQ

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
756 bp251 aa15902345+No
ORF function : Accessory Gene
AG : IS21 helper

ORF sequence :

MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELLWRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQLKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSAT

 

Blast result :
Comments
There are 6 intact copies of the element in the CFT073 genome sequence, as well as a seventh partial copy with both terminal and internal deletions. [1]
A copy of this element was sequenced previously from the same strain [GenBank accession number AF081285, bases 8645..11051; 2], but with 3 single-base sequence differences which disrupted the amino-terminal part of ORF2. Those authors designated the C-terminal 488 aa of ORF1 as "R6" (they selected a downstream start codon), and the C-terminal 195 aa of ORF2 as "R5" or "orfB".
A partial copy (nucleotides 1730-2410) of the element is present in the large virulence plasmids of Shigella flexneri, pWR100 [AL391753; 3] and pWR501 [AF348706; 4].
The ORFs have strong similarities (40% identity) to the corresponding ORFs from an IS element in Bradyrhizobium elkanii [AB062279; 5].
References
1] Welch,R.A., Burland,V., Plunkett,G.,III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R. (2002) Proc. Natl. Acad. Sci. USA 99 (26), 17020-17024
2]Guyer,D.M., Kao,J.S. and Mobley,H.L.T. (1998) Infect. Immun. 66 (9), 4411-4417
3]Buchrieser,C., Glaser,P., Rusniok,C., Nedjari,H., D'Hauteville,H., Kunst,F., Sansonetti,P. and Parsot,C. (2000) Mol. Microbiol. 38 (4),
760-771 [PubMed: 11115111]
4]Venkatesan,M.M., Goldberg,M.B., Rose,D.J., Grotbeck,E.J., Burland,V.
and Blattner,F.R. (2001) Infect. Immun. 69 (5),3271-3285
5]Yasuta,T., Okazaki,S., Mitsui,H., Yuhashi,K., Ezura,H. and Minamisawa,K. (2001) Appl. Environ. Microbiol. 67 (11), 4999-5009