ISEc8
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004431 | ND | Escherichia coli | Escherichia coli O157:H7 EDL933 |
DNA section
IS Length : 2442 bp
Ends
IR Length : 18/22
IRL : GTAAGCGTACAGCCTGAACCGTCTGGTCAGAATCTGACGAATTAGACAAA
IRR : GTAAGCGTACAGCGAGGGCCGTATTGACGGGGATGTGTTATTCAGCTGGC
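The "18/22" notation is compact, so a small check may help. The following Python sketch assumes it means 18 identical positions within the first 22 bp of the two repeats, and that IRR is written as the reverse complement of the element's right end. The helper names and the `RIGHT_END` constant (the last 50 bases of the DNA sequence in this entry) are illustrative and not part of the record.

```python
# Terminal inverted repeats as listed in this entry.
IRL = "GTAAGCGTACAGCCTGAACCGTCTGGTCAGAATCTGACGAATTAGACAAA"
IRR = "GTAAGCGTACAGCGAGGGCCGTATTGACGGGGATGTGTTATTCAGCTGGC"

# Last 50 bases of the DNA sequence given in this entry (assumed transcription).
RIGHT_END = "GCCAGCTGAATAACACATCCCCGTCAATACGGCCCTCGCTGTACGCTTAC"

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def count_matches(left, right, window):
    """Count identical positions in the first `window` bases of two repeats."""
    return sum(a == b for a, b in zip(left[:window], right[:window]))


matches = count_matches(IRL, IRR, 22)
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR

print(f"IR identity: {matches}/22")
print(f"IRR equals reverse complement of right end: {irr_matches_right_end}")
```

Under these assumptions the count is 18 of 22, in agreement with the IR Length field, and IRR is the right end of the element read on the opposite strand.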
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTCCCGACGT | CAACAAAA | CCCCTTTTAA | 8 |
CGCCTGGATG | AGTGATTG | ATGATAGTTT | 8 |
GTGAAGTTCA | GCGTAATC | GGGGCAGACG | 8 |
CTTAAAGACA | CGAAAAAC | AACTTTTTAT | 8 |
GCTACTGCCG | GAACAACC | AGCAAACGGG | 8 |
TTGATACATC | AGTAGCTC | TGGGTATGAA | 8 |
GCGCCGGTGA | GACATCAG | CGTCATACAG | 8 |
DNA sequence
GTAAGCGTACAGCCTGAACCGTCTGGTCAGAATCTGACGAATTAGACAAAGTGGTGTCCACCAAATAAGTAGTGGGAACCAAAGTATCAGATATGCAGAA
AAATGTGACTCCCGGCAGGCGAAAAGGCTGCCCTAATTATCCTCCCGAATTTAAACAGCAGCTCGTTGCTGCCTCCTGTGAACCCGGGATATCCATCTCA
AAACTTGCTCTTGAAAATGGCATTAACGCCAATCTGTTGTTCAAATGGCGACAACAATGGCGCGAGGGAAAGCTGCTATTACCTTCTTCAGAGAGCCCCC
AGCTACTTCCTGTGACTCTCGATGCAGCTGCCGAACAGCCAGAATCGCTCGCAGAGGACCCGGAAACCCTCAGTATCAGCTGTGAGGTAACGTTCCGGCA
CGGGACGCTCCGCTTCAATGGCAATGTCAGCGAAAAGCTCCTGACTCTGCTGATACAGGAACTGAAGCGATGATCCCGTTACCTTCCGGGACCAAAATTT
GGCTGGTTGCCGGTATCACCGATATGAGAAATGGCTTCAACGGCCTGGCTGCGAAAGTACAAACGGCGCTGAAAGACGATCCCATGTCCGGCCATGTTTT
CATTTTCCGGGGCCGCAGCGGCAGTCAGGTTAAACTGCTGTGGTCCACCGGTGACGGACTGTGCCTCCTGACCAAACGGCTGGAGCGTGGGCGCTTCGCC
TGGCCGTCAGCCCGTGATGGCAAAGTGTTCCTTACGCAGGCGCAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGACAGCCTAAGCGGCTGCTGACCT
CCCTGACCATGCTGTAAATCTCTTTATCCTGGTTGTCACAGAATAAGCCCGGTAAAATACGGGCTTATGAACGACATCTCTTCTGACGACATCTTCCTGC
TGAAACAGCGCCTGGCCGAACAGGAAGCGCTGATCCACGCCCTGCAGGAAAAGCTGAGCAACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAGCTGGA
TAAACTCCGCCGGATGAACTTCGGCAGTCGTTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGAT
ACGCTGACTGGTAGGGTGTATGACCCGGCAGTACAGCGTCCGTTGCGTCAGACCCGCACCCGTAAGCCGTTCCCTGAATCACTACCCCGTGACGAAAAGC
GACTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATACCGCCGAACAGCTGGAGTTGATGCGTAGCGCCTT
CCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCC
GGACCGGGGCTGCTGGCCCGCGTGCTGACCTCGAAGTATGCAGAGCACACCCCGCTGTATCGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGA
GGCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGCGCTTCATGGCTATGTCATGACTGACGGCAAACTCCA
TGCCGATGATACCCCGGTCCAGGTACTGCTGCCGGGTAATAAGAAGACGAAGACCGGGCGGTTGTGGGCGTATGTTCGTGATGACCGCAATGCAGGGTCA
GCGTTGGCACCTGCAGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACTCATCTTGCCTGCTTCAGCGGTGTGCTGCAAGCGGATG
CGTACGCCGGGTTCAACGAGCTGTATCGCAATGGTGGGATAACGGAAGCTGCCTGCTGGGCTCATGCCCGCCGAAAGATCCACGATGTGCACGTCCGCAT
CCCGTCAGCACTGACGGAAGAAGCCCTGGAGCAGATCGGTCAGTTGTACGCCATAGAGGCGGATATAAGGGGAATGCCGGCAGAGCAGCGGCTTGCTGAA
CGTCAGCGAAAAACGAAACCGTTGTTGAAATCCCTGGAAAGCTGGTTGCGTGAAAAGATGAAGACCCTGTCGCGACACTCAGAGTTGGCGAAGGCGTTCG
CGTACGCACTTAACCAGTGGCCGGCACTGACGTACTATGCGAACGATGGCTGGGTGGAAATCGACAACAACATCGCTGAAAATGCCCTGCGGGCGGTCAG
TCTGGGTCGTAAAAACTTCCTGTTCTTCGGCTCTGATCATGGTGGTGAGCGGGGAGCGCTACTGTACAGCCTGATCGGGACGTGCAAACTGAATGACGTG
GATCCAGAAAGCTACCTTCGCCATGTGCTTGGCGTCATAGCAGACTGGCCGGTCAACCGGGTCAGCGAACTGCTTCCGTGGCGCATAGCACTGCCAGCTG
AATAACACATCCCCGTCAATACGGCCCTCGCTGTACGCTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
402 bp | 133 aa | 72 | 473 | + | No |
AG : IS66 TnpA
ORF sequence :
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 470 | 817 | + | No |
AG : IS66 TnpB
ORF sequence :
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1539 bp | 512 aa | 867 | 2405 | + | No |
Chemistry : DDE
ORF sequence :
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
Blast result :
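The coordinates in the ORF tables can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates on the IS sequence and that each nucleotide length includes the terminal stop codon. The tuple layout and function names are hypothetical.

```python
# (name, begin, end, listed_bp, listed_aa) as given in the ORF tables above.
ORFS = [
    ("ORF 1", 72, 473, 402, 133),
    ("ORF 2", 470, 817, 348, 115),
    ("ORF 3", 867, 2405, 1539, 512),
]


def orf_length_bp(begin, end):
    """Length in bp for 1-based inclusive coordinates."""
    return end - begin + 1


def protein_length_aa(length_bp):
    """Residue count, assuming the last codon of the ORF is a stop codon."""
    return length_bp // 3 - 1


def overlap_bp(begin_a, end_a, begin_b, end_b):
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(end_a, end_b) - max(begin_a, begin_b) + 1)


for name, begin, end, listed_bp, listed_aa in ORFS:
    bp = orf_length_bp(begin, end)
    aa = protein_length_aa(bp)
    consistent = (bp == listed_bp) and (aa == listed_aa)
    print(f"{name}: {bp} bp, {aa} aa, consistent with table: {consistent}")

orf1_orf2_overlap = overlap_bp(72, 473, 470, 817)
orf2_orf3_overlap = overlap_bp(470, 817, 867, 2405)
```

Under these assumptions the three rows are internally consistent. ORF 1 and ORF 2 share 4 bp (positions 470 to 473), while ORF 2 and ORF 3 do not overlap.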
Comments
This element was first sequenced as part of the region adjacent to the locus of enterocyte effacement (LEE) pathogenicity island of Escherichia coli EDL933 [AF071034, bases 6012..8453; 1], but it was not recognized as an IS element. It was subsequently identified based on its similarity to the Sinorhizobium meliloti IS element ISRm14 [2]. ISEc8 is present in 7 intact copies and 4 partial copies in the complete genome of the E. coli O157:H7 Sakai strain [3], while strain EDL933 [4] has an additional copy in the duplicated urease-tellurite island.
References
[1] Perna,N.T., Mayhew,G.F., Posfai,G., Elliott,S., Donnenberg,M.S., Kaper,J.B. and Blattner,F.R. (1998) Infect. Immun. 66 (8), 3810-3817
[2] Schneiker,S., Kosier,B., Puhler,A. and Selbitschka,W. (1999) Curr. Microbiol. 39 (5), 274-281
[3] Hayashi,T., Makino,K., Ohnishi,M., Kurokawa,K., Ishii,K., Yokoyama,K., Han,C.-G., Ohtsubo,E., Nakayama,K., Murata,T., Tanaka,M., Tobe,T., Iida,T., Takami,H., Honda,T., Sasakawa,C., Ogasawara,N., Yasunaga,T., Kuhara,S., Shiba,T., Hattori,M. and Shinagawa,H. (2001) DNA Res. 8 (1), 11-22
[4] Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K., Apodaca,J., Anantharaman,T.S., Lin,J., Yen,G., Schwartz,D.C., Welch,R.A. and Blattner,F.R. (2001) Nature 409 (6819), 529-533