ISEc22
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Escherichia coli | Escherichia coli O26:H11 11368 |
DNA section
IS Length : 2454 bp
Ends
IR Length : 15/22
IRL : GTAAGCGTCGTCTCAGCACCGTCTGGCAGATCCTGAAATTCCTGAGAGAA
IRR : GTAAGCGTGCAGCAAGAACCGTATTGACGGGGATGTGTTATTCAGTCGGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTTTATTGCA | GAAAAGCG | AGAGGTAATT | 8 |
GGGAGCCATA | TCGTATCT | GTATTGCGTA | 8 |
AGCATACTGA | TATTTACT | CCTCATGCCG | 8 |
AGGGTTTCCA | TATTGTCT | TCAAGATCAG | 8 |
ATTTATGCAC | CAGTGAAC | ATCTGCTGGT | 8 |
DNA sequence
GTAAGCGTCGTCTCAGCACCGTCTGGCAGATCCTGAAATTCCTGAGAGAATAGTGGACACCAAATATGGTGGACGCTATCCATGAAATCATTAACCGCAG
TGCGTAAAAAAAGCCCTAATTATCCCGTTGAGTTCAAAATCAAAATGGTTGAACTCTCGCATCGACCAGAGATCTCCGTAGCGCAACTCGCTCGTGAGCA
TGGGATCAACGATAATTTGCTGTTCAAGTGGCGCCAGTACTGGCGCGAAGGAAAACTACGTCCTCCTTCAACAACAGAAAACAACGTGCCTGAGCTGCTC
CCGATAACACTTGATGCCGAAGATGTTGTCCCTACAACCTCCCCCCGGTCACAACCTGTAGCTGCTGCGACACCTGAATCACTCAATATCAGCTGTGAAG
TGACGTTCCGGCACGGATCACTCCGTCTGAATGGTGCCATCAGCGAAAATATCCTGAACCTGCTGATACGGGAGCTCAAACGTTGATCCCATTACCATCA
GGGACAAAGATCTGGCTGGTCGCTGGCATCACCGATATGAGAAACGGCTTCAACGGCCTGGCGGCAAAGGTGCAGACGACGCTGAAAGACGATCCGATGT
CAGGTCACGTTTTTATCTTCCGTGGGCGTAATGGCAGTCAGGTAAAGCTCCTCTGGTCTACCGGCGATGGACTGTGTCTGCTGACCAAACGGCTGGAGCG
CGGCCGCTTCGCCTGGCCGTCAGCCCGGGATGGCAAAGCGTTCCTCACACCGGCACAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGGCAGCCTAAA
AGACTGCTTACGTCCCTGACTATGTTGTAAGCCTCTTTATCCTGGTCGACGCTGAATGAGCCTGGTAATATACCCGGTATGAGCAGCTCACTTCCTGACG
ATATCAATGCACTGAAACGTCTCCTTGCCGAACAGGAGGCGCTGAACCGTGCCCTGCTGGAAAAGCTGAACGAGCGTGAACGCGAAATAGACCATCTGCA
GGCACAGCTGGATAAGCTGCGCCGGATGAACTTCGGCAGCCGCTCCGAAAAAGTCTCCCGTCGTATCGCACAGATGGAAGCTGACCTGAAGGCACTTCAG
AAAGAAAGTGATACCCTTACCGGTCGGGTTGACGACCCGGCCGTGCAGCGCCCGCTGCGTCAAACCCGCACCCGCAAACCGTTCCCCGAATCACTCCCCC
GCGATGAAAAACGGCTGCTGCCGGCAGCATCATGCTGCCCGGAATGTGGAGGCTCACTGAGCTATCTGGGTGAGGATGCCGCCGAACAGCTGGAGTTGAT
GCGCAGCGCCTTCCGGGTTATCCGGACTGTACGTGAAAAGCATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCCCCCGCGCCTTCACGGCCCATCGAG
CGGGGTATCGCAGGACCGGGGCTGCTGGCCCGCGTGCTGATCTCAAAGTATGCAGAGCACACCCCGCTGTACCGCCAGTCTGAAATGTACGGCCGCCAGG
GCGTGGAGCTGAGTCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTACTGTCACCGCTGGAAGAAGCGCTTCAGGACTATGTGCTGACTGA
CGGTAAGCTCCATGCTGATGACACGCCTGTCCCGGTGCTGTTGCCAGGCAATAAGAAAACGAAGACCGGGCGGTTATGGACCTACGTTCGTGACGACCGT
AACGCCGGGTCAACGCTGGCGCCGGCGGTGTTGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACCCATCTTGCGGGGTTCAGTGGTGTAC
TGCAGGCGGATGCATACGCCGGGTTCAACGAGCTGTACCGGGATGGCCGGATAACGGAAGCCGCCTGTTGGGCTCACGCCCGCCGTAAAATCCACGATGT
GCACGTTCGCACCCCGTCAGCCCTGACGGAGGAAGCGCTGAAACGGATCGGCGAACTGTACGCCATCGAGGCAGAGATAAGGGGAATGACGGCGGAGCAG
CGCCTTGCCGAACGTCAGTTGAAAACGAAACCGCTGCTGAAATCCCTGGAAAGCTGGCTGCGTGAAAAGATGAAAACCCTGTCGCGACACTCAGAACTGG
CGAAAGCGTTCGCATACGCCCTGAACCAGTGGCCGGCGCTGACGTACTATGCAGATGATGGCTGGGCTGAGGCGGACAATAACATCGCTGAAAATGCGTT
GCGGATGGTCAGTCTGGGCCGCAAAAACTACCTGTTCTTCGGTTCGGATCATGGAGGAGAGCGGGGAGCGCTGCTGTACAGCCTGATCGGGACGTGCAAA
CTGAACGGAGTGGAGCCAGAAAGCTACCTCCGCTATGTCCTTGACGTCATAGCCGACTGGCCGATAAACCGGGTCGGCGAACTGCTCCCCTGGCGCGTAG
CACTGCCGACTGAATAACACATCCCCGTCAATACGGTTCTTGCTGCACGCTTAC
TGCGTAAAAAAAGCCCTAATTATCCCGTTGAGTTCAAAATCAAAATGGTTGAACTCTCGCATCGACCAGAGATCTCCGTAGCGCAACTCGCTCGTGAGCA
TGGGATCAACGATAATTTGCTGTTCAAGTGGCGCCAGTACTGGCGCGAAGGAAAACTACGTCCTCCTTCAACAACAGAAAACAACGTGCCTGAGCTGCTC
CCGATAACACTTGATGCCGAAGATGTTGTCCCTACAACCTCCCCCCGGTCACAACCTGTAGCTGCTGCGACACCTGAATCACTCAATATCAGCTGTGAAG
TGACGTTCCGGCACGGATCACTCCGTCTGAATGGTGCCATCAGCGAAAATATCCTGAACCTGCTGATACGGGAGCTCAAACGTTGATCCCATTACCATCA
GGGACAAAGATCTGGCTGGTCGCTGGCATCACCGATATGAGAAACGGCTTCAACGGCCTGGCGGCAAAGGTGCAGACGACGCTGAAAGACGATCCGATGT
CAGGTCACGTTTTTATCTTCCGTGGGCGTAATGGCAGTCAGGTAAAGCTCCTCTGGTCTACCGGCGATGGACTGTGTCTGCTGACCAAACGGCTGGAGCG
CGGCCGCTTCGCCTGGCCGTCAGCCCGGGATGGCAAAGCGTTCCTCACACCGGCACAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGGCAGCCTAAA
AGACTGCTTACGTCCCTGACTATGTTGTAAGCCTCTTTATCCTGGTCGACGCTGAATGAGCCTGGTAATATACCCGGTATGAGCAGCTCACTTCCTGACG
ATATCAATGCACTGAAACGTCTCCTTGCCGAACAGGAGGCGCTGAACCGTGCCCTGCTGGAAAAGCTGAACGAGCGTGAACGCGAAATAGACCATCTGCA
GGCACAGCTGGATAAGCTGCGCCGGATGAACTTCGGCAGCCGCTCCGAAAAAGTCTCCCGTCGTATCGCACAGATGGAAGCTGACCTGAAGGCACTTCAG
AAAGAAAGTGATACCCTTACCGGTCGGGTTGACGACCCGGCCGTGCAGCGCCCGCTGCGTCAAACCCGCACCCGCAAACCGTTCCCCGAATCACTCCCCC
GCGATGAAAAACGGCTGCTGCCGGCAGCATCATGCTGCCCGGAATGTGGAGGCTCACTGAGCTATCTGGGTGAGGATGCCGCCGAACAGCTGGAGTTGAT
GCGCAGCGCCTTCCGGGTTATCCGGACTGTACGTGAAAAGCATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCCCCCGCGCCTTCACGGCCCATCGAG
CGGGGTATCGCAGGACCGGGGCTGCTGGCCCGCGTGCTGATCTCAAAGTATGCAGAGCACACCCCGCTGTACCGCCAGTCTGAAATGTACGGCCGCCAGG
GCGTGGAGCTGAGTCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTACTGTCACCGCTGGAAGAAGCGCTTCAGGACTATGTGCTGACTGA
CGGTAAGCTCCATGCTGATGACACGCCTGTCCCGGTGCTGTTGCCAGGCAATAAGAAAACGAAGACCGGGCGGTTATGGACCTACGTTCGTGACGACCGT
AACGCCGGGTCAACGCTGGCGCCGGCGGTGTTGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACCCATCTTGCGGGGTTCAGTGGTGTAC
TGCAGGCGGATGCATACGCCGGGTTCAACGAGCTGTACCGGGATGGCCGGATAACGGAAGCCGCCTGTTGGGCTCACGCCCGCCGTAAAATCCACGATGT
GCACGTTCGCACCCCGTCAGCCCTGACGGAGGAAGCGCTGAAACGGATCGGCGAACTGTACGCCATCGAGGCAGAGATAAGGGGAATGACGGCGGAGCAG
CGCCTTGCCGAACGTCAGTTGAAAACGAAACCGCTGCTGAAATCCCTGGAAAGCTGGCTGCGTGAAAAGATGAAAACCCTGTCGCGACACTCAGAACTGG
CGAAAGCGTTCGCATACGCCCTGAACCAGTGGCCGGCGCTGACGTACTATGCAGATGATGGCTGGGCTGAGGCGGACAATAACATCGCTGAAAATGCGTT
GCGGATGGTCAGTCTGGGCCGCAAAAACTACCTGTTCTTCGGTTCGGATCATGGAGGAGAGCGGGGAGCGCTGCTGTACAGCCTGATCGGGACGTGCAAA
CTGAACGGAGTGGAGCCAGAAAGCTACCTCCGCTATGTCCTTGACGTCATAGCCGACTGGCCGATAAACCGGGTCGGCGAACTGCTCCCCTGGCGCGTAG
CACTGCCGACTGAATAACACATCCCCGTCAATACGGTTCTTGCTGCACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
405 bp | 134 aa | 82 | 486 | + | No |
AG : IS66 TnpA
ORF sequence :
MKSLTAVRKKSPNYPVEFKIKMVELSHRPEISVAQLAREHGINDNLLFKWRQYWREGKLRPPSTTENNVPELLPITLDAEDVVPTTSPRSQPVAAATPES
LNISCEVTFRHGSLRLNGAISENILNLLIRELKR
LNISCEVTFRHGSLRLNGAISENILNLLIRELKR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
294 bp | 97 aa | 537 | 830 | + | No |
AG : IS66 TnpB
ORF sequence :
MRNGFNGLAAKVQTTLKDDPMSGHVFIFRGRNGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKAFLTPAQLAMLLEGIDWRQPKRLLTSLTML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1539 bp | 512 aa | 879 | 2417 | + | No |
Chemistry : DDE
ORF sequence :
MSSSLPDDINALKRLLAEQEALNRALLEKLNEREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLKALQKESDTLTGRVDDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAASCCPECGGSLSYLGEDAAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLISKYAEHTPLYRQ
SEMYGRQGVELSRSLLSGWVDACCRLLSPLEEALQDYVLTDGKLHADDTPVPVLLPGNKKTKTGRLWTYVRDDRNAGSTLAPAVLFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHDVHVRTPSALTEEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKNYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVG
ELLPWRVALPTE
PFPESLPRDEKRLLPAASCCPECGGSLSYLGEDAAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLISKYAEHTPLYRQ
SEMYGRQGVELSRSLLSGWVDACCRLLSPLEEALQDYVLTDGKLHADDTPVPVLLPGNKKTKTGRLWTYVRDDRNAGSTLAPAVLFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHDVHVRTPSALTEEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKNYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVG
ELLPWRVALPTE
Blast result :
Comments
6 identical copies in E.coli 11368 chromosome, plus 1 remnant. An example of a complete ISEc22 is found in the E.coli 11368 genome sequence at co-ordinates 919248-921701.
ISEc22 is 55% (ORF1) aa similar to ISEc8, 95% (ORF2) to ISEc8 and 89% (ORF3) to ISEc8.
ISEc22 is 55% (ORF1) aa similar to ISEc8, 95% (ORF2) to ISEc8 and 89% (ORF3) to ISEc8.
References
1] Tadasuke Ooka, Tetsuya Hayashi (2008) Direct submission.