ISEc22

  • Family IS66
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number :
Transposition : ND
Origin : Escherichia coli
Host : Escherichia coli O26:H11 11368
DNA section
IS Length : 2454 bp

Ends


IR Length : 15/22

IRL : GTAAGCGTCGTCTCAGCACCGTCTGGCAGATCCTGAAATTCCTGAGAGAA
IRR : GTAAGCGTGCAGCAAGAACCGTATTGACGGGGATGTGTTATTCAGTCGGC
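
The "15/22" figure can be read as 15 identical positions within the first 22 bases of the two ends. That reading is an interpretation, not something the record states. A minimal Python sketch under that assumption, with IRL and IRR copied from the lines above (the helper names count_matches and reverse_complement are illustrative only):

```python
# Terminal inverted repeat sequences copied from this record.
IRL = "GTAAGCGTCGTCTCAGCACCGTCTGGCAGATCCTGAAATTCCTGAGAGAA"
IRR = "GTAAGCGTGCAGCAAGAACCGTATTGACGGGGATGTGTTATTCAGTCGGC"


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Assumption: "15/22" means 15 matching bases in a 22-base window.
matches = count_matches(IRL, IRR, 22)
print(f"{matches}/22 terminal positions match")

# IRR is listed as read inward from the right end, so its reverse
# complement should correspond to the last bases of the DNA sequence.
print(reverse_complement(IRR)[-10:])
```

The last ten bases printed by the second call can be compared by eye with the end of the DNA sequence given further down.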

Insertion site


Left flank   Direct repeat   Right flank   DR Length
TTTTATTGCA   GAAAAGCG        AGAGGTAATT    8
GGGAGCCATA   TCGTATCT        GTATTGCGTA    8
AGCATACTGA   TATTTACT        CCTCATGCCG    8
AGGGTTTCCA   TATTGTCT        TCAAGATCAG    8
ATTTATGCAC   CAGTGAAC        ATCTGCTGGT    8

DNA sequence

GTAAGCGTCGTCTCAGCACCGTCTGGCAGATCCTGAAATTCCTGAGAGAATAGTGGACACCAAATATGGTGGACGCTATCCATGAAATCATTAACCGCAG
TGCGTAAAAAAAGCCCTAATTATCCCGTTGAGTTCAAAATCAAAATGGTTGAACTCTCGCATCGACCAGAGATCTCCGTAGCGCAACTCGCTCGTGAGCA
TGGGATCAACGATAATTTGCTGTTCAAGTGGCGCCAGTACTGGCGCGAAGGAAAACTACGTCCTCCTTCAACAACAGAAAACAACGTGCCTGAGCTGCTC
CCGATAACACTTGATGCCGAAGATGTTGTCCCTACAACCTCCCCCCGGTCACAACCTGTAGCTGCTGCGACACCTGAATCACTCAATATCAGCTGTGAAG
TGACGTTCCGGCACGGATCACTCCGTCTGAATGGTGCCATCAGCGAAAATATCCTGAACCTGCTGATACGGGAGCTCAAACGTTGATCCCATTACCATCA
GGGACAAAGATCTGGCTGGTCGCTGGCATCACCGATATGAGAAACGGCTTCAACGGCCTGGCGGCAAAGGTGCAGACGACGCTGAAAGACGATCCGATGT
CAGGTCACGTTTTTATCTTCCGTGGGCGTAATGGCAGTCAGGTAAAGCTCCTCTGGTCTACCGGCGATGGACTGTGTCTGCTGACCAAACGGCTGGAGCG
CGGCCGCTTCGCCTGGCCGTCAGCCCGGGATGGCAAAGCGTTCCTCACACCGGCACAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGGCAGCCTAAA
AGACTGCTTACGTCCCTGACTATGTTGTAAGCCTCTTTATCCTGGTCGACGCTGAATGAGCCTGGTAATATACCCGGTATGAGCAGCTCACTTCCTGACG
ATATCAATGCACTGAAACGTCTCCTTGCCGAACAGGAGGCGCTGAACCGTGCCCTGCTGGAAAAGCTGAACGAGCGTGAACGCGAAATAGACCATCTGCA
GGCACAGCTGGATAAGCTGCGCCGGATGAACTTCGGCAGCCGCTCCGAAAAAGTCTCCCGTCGTATCGCACAGATGGAAGCTGACCTGAAGGCACTTCAG
AAAGAAAGTGATACCCTTACCGGTCGGGTTGACGACCCGGCCGTGCAGCGCCCGCTGCGTCAAACCCGCACCCGCAAACCGTTCCCCGAATCACTCCCCC
GCGATGAAAAACGGCTGCTGCCGGCAGCATCATGCTGCCCGGAATGTGGAGGCTCACTGAGCTATCTGGGTGAGGATGCCGCCGAACAGCTGGAGTTGAT
GCGCAGCGCCTTCCGGGTTATCCGGACTGTACGTGAAAAGCATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCCCCCGCGCCTTCACGGCCCATCGAG
CGGGGTATCGCAGGACCGGGGCTGCTGGCCCGCGTGCTGATCTCAAAGTATGCAGAGCACACCCCGCTGTACCGCCAGTCTGAAATGTACGGCCGCCAGG
GCGTGGAGCTGAGTCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTACTGTCACCGCTGGAAGAAGCGCTTCAGGACTATGTGCTGACTGA
CGGTAAGCTCCATGCTGATGACACGCCTGTCCCGGTGCTGTTGCCAGGCAATAAGAAAACGAAGACCGGGCGGTTATGGACCTACGTTCGTGACGACCGT
AACGCCGGGTCAACGCTGGCGCCGGCGGTGTTGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACCCATCTTGCGGGGTTCAGTGGTGTAC
TGCAGGCGGATGCATACGCCGGGTTCAACGAGCTGTACCGGGATGGCCGGATAACGGAAGCCGCCTGTTGGGCTCACGCCCGCCGTAAAATCCACGATGT
GCACGTTCGCACCCCGTCAGCCCTGACGGAGGAAGCGCTGAAACGGATCGGCGAACTGTACGCCATCGAGGCAGAGATAAGGGGAATGACGGCGGAGCAG
CGCCTTGCCGAACGTCAGTTGAAAACGAAACCGCTGCTGAAATCCCTGGAAAGCTGGCTGCGTGAAAAGATGAAAACCCTGTCGCGACACTCAGAACTGG
CGAAAGCGTTCGCATACGCCCTGAACCAGTGGCCGGCGCTGACGTACTATGCAGATGATGGCTGGGCTGAGGCGGACAATAACATCGCTGAAAATGCGTT
GCGGATGGTCAGTCTGGGCCGCAAAAACTACCTGTTCTTCGGTTCGGATCATGGAGGAGAGCGGGGAGCGCTGCTGTACAGCCTGATCGGGACGTGCAAA
CTGAACGGAGTGGAGCCAGAAAGCTACCTCCGCTATGTCCTTGACGTCATAGCCGACTGGCCGATAAACCGGGTCGGCGAACTGCTCCCCTGGCGCGTAG
CACTGCCGACTGAATAACACATCCCCGTCAATACGGTTCTTGCTGCACGCTTAC
Protein section
ORF number : 3

 

ORF 1
Length : 405 bp (134 aa)
Begin : 82
End : 486
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MKSLTAVRKKSPNYPVEFKIKMVELSHRPEISVAQLAREHGINDNLLFKWRQYWREGKLRPPSTTENNVPELLPITLDAEDVVPTTSPRSQPVAAATPES
LNISCEVTFRHGSLRLNGAISENILNLLIRELKR
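
The start of this ORF can be cross-checked against the DNA sequence. The sketch below translates the six codons beginning at position 82, using only the first 100 bp of the DNA sequence section. It assumes the standard genetic code and 1-based coordinates; the names CODON_TABLE and translate are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# First 100 bp of the element, copied from the DNA sequence section.
FIRST_100_BP = (
    "GTAAGCGTCGTCTCAGCACCGTCTGGCAGATCCTGAAATTCCTGAGAGAA"
    "TAGTGGACACCAAATATGGTGGACGCTATCCATGAAATCATTAACCGCAG"
)


def translate(dna: str, begin: int, n_codons: int) -> str:
    """Translate n_codons codons starting at 1-based position `begin`."""
    start = begin - 1  # convert to 0-based indexing
    codons = (dna[i:i + 3] for i in range(start, start + 3 * n_codons, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# Assumption: "Begin : 82" is the 1-based position of the start codon.
print(translate(FIRST_100_BP, 82, 6))
```

The printed peptide can be compared with the first six residues of the ORF sequence above.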

 

ORF 2
Length : 294 bp (97 aa)
Begin : 537
End : 830
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MRNGFNGLAAKVQTTLKDDPMSGHVFIFRGRNGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKAFLTPAQLAMLLEGIDWRQPKRLLTSLTML

 

ORF 3
Length : 1539 bp (512 aa)
Begin : 879
End : 2417
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MSSSLPDDINALKRLLAEQEALNRALLEKLNEREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLKALQKESDTLTGRVDDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAASCCPECGGSLSYLGEDAAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLISKYAEHTPLYRQ
SEMYGRQGVELSRSLLSGWVDACCRLLSPLEEALQDYVLTDGKLHADDTPVPVLLPGNKKTKTGRLWTYVRDDRNAGSTLAPAVLFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHDVHVRTPSALTEEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKNYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVG
ELLPWRVALPTE

 

Comments
There are 6 identical copies in the E. coli 11368 chromosome, plus 1 remnant. An example of a complete ISEc22 is found in the E. coli 11368 genome sequence at co-ordinates 919248-921701.
At the amino acid level, ISEc22 is 55% (ORF1), 95% (ORF2) and 89% (ORF3) similar to ISEc8.
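
The coordinates quoted in this record are internally consistent, which can be confirmed with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and that each ORF length in bp includes the stop codon. Both are assumptions that happen to fit the numbers rather than statements made by the record, and the dictionary layout and function names are illustrative.

```python
# Values copied from the ORF entries and the Comments of this record.
IS_LENGTH_BP = 2454
GENOME_COPY = (919248, 921701)  # complete copy in E. coli 11368
ORFS = {
    "ORF 1": {"begin": 82, "end": 486, "bp": 405, "aa": 134},
    "ORF 2": {"begin": 537, "end": 830, "bp": 294, "aa": 97},
    "ORF 3": {"begin": 879, "end": 2417, "bp": 1539, "aa": 512},
}


def span_length(begin: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - begin + 1


def residue_count(bp: int) -> int:
    """Residues encoded by `bp` bases, assuming one stop codon is included."""
    return bp // 3 - 1


for name, orf in ORFS.items():
    ok_bp = span_length(orf["begin"], orf["end"]) == orf["bp"]
    ok_aa = residue_count(orf["bp"]) == orf["aa"]
    print(name, "bp consistent:", ok_bp, "aa consistent:", ok_aa)

print("genome copy length:", span_length(*GENOME_COPY), "bp")
```

If the genome copy length printed on the last line equals the IS length, the chromosomal co-ordinates describe a full-length element.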
References
[1] Tadasuke Ooka, Tetsuya Hayashi (2008) Direct submission.