ISEc23

  • Family IS66
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
ND Escherichia coli
Escherichia coli O150:H5 SE15
DNA section
IS Length : 2532 bp

Ends


IR Length : 20/24

IRL : GTAAGCGTAAACTGACCGCCGTATGTAGCCATCAGACGAGAATTGGTAAC
IRR : GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTACTA

Insertion site


Left flankDirect repeatRight flankDR Length
TTATTGTGAATATAATGGCGTGACCGCT8
AAATCCTTTACGTTACTCTCTGATGACT8
CATAGTTATTCACACTCCCTTCACTTAC8
ATGTGCGGAAATTAAATCCTGTCGTTTC8

DNA sequence

GTAAGCGTAAACTGACCGCCGTATGTAGCCATCAGACGAGAATTGGTAACTTAGACGCCCATCTGATATAGACGGACATCTAAGTATGGAATTACAGGAC
TGGCGAAAAGAACCTCGTAAAAACTATTCGAATGAATTCAAACTTCGTATGGTGGAACTGGCATCACAACCTGGAGCTTGTGTTGCACAGATTGCACGTG
AAAATGGCGTCAATGATAATGTTATTTTCAAATGGCTCAGGCTCTGGCAGAACGAAGGGCGTGTTTCGCGGCGTCTTCCGGTAACGACCTCTTCTGACAC
TGGCGTTGAATTATTACCTGTAGAAATAACGCCGGATGAGCAGAAAGAACCTGTGGCGGCCATTGCGCCGTCTTTATCCACTTCCACTCAGACCAGAGTC
AGTGCCAGTTCCTGCAAGGTGGAATTCCGTCACGGTAACATGACGCTGGAAAATCCATCGCCAGAGCTGCTCACAGTGTTGATCCGTGAACTGACCGGGA
GGGGAAGATGATCTCACTCCCATCAGGTACCCGTATCTGGCTCGTTGCCGGCGTTACCGATATGCGTAAATCCTTCAACGGACTGGGAGAACAGGTACAA
CATGTGCTGAATGATAATCCCTTCTCCGGTCACCTGTTTATCTTCCGTGGCCGACGGGGTGACACCGTCAAAATTCTTTGGGCTGATGCTGATGGTCTGT
GCCTGTTCACCAAACGCCTGGAGGAAGGCCAGTTTATCTGGCCTGCGGTACGTGACGGCAAGGTATCCATTACCCGCTCGCAACTGGCAATGCTCCTCGA
TAAGCTGGACTGGCGTCAGCCAAAAACATCCAGCCGTAACTCACTGACAATGTTGTAAAAAACTCCTGACCGCATTATAAAAACGGTCATGAGTCAGAAA
TACCTCATTCGCATCGCAGAGCTGGAAAGGTTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAGAGACGGAAG
CCTTCCTGCGCTCTGCACTGACACGTGCCGAAGAAAAGATCGAAGAAGATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGAT
GCTGTTCGGTACCCGTTCTGAAAAACTGCGTCGTGAAGTTGAACTGGCTGAGGCTCTGCTGAAACAACGTGAACAGGACAGCGATCGTTACAGTGGGCGG
GAAGACGATCCTCAGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACACCTTCCCCGTGAAATACACCGCCTGGAGCCAGAAG
AAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCTGAACAGCTGGAACTGGTGAGCAGTGCCCTGAAAGTGATCCGCAC
AGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGTATTGTTGAAGCACCGGCGCCGTCCCGCCCGATAGAGCGTGGTATCGCGGGCCCCGGATTACTT
GCCCGCGTGTTAACGGGAAAATACTGCGAACATCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGCCAGGGTGTCGAACTGAGCCGGGCCTTACTCT
CCAACTGGGTTGACGCGTGCTGCCAGTTAATGACACCGGTGAATGATGCCCTGTACCGTTATGTAATGAATACCCGCAAGGTTCACACTGATGACACACC
GGTAAAGGTACTGGCACCGGGTCAGAAAAAGGCGAAAACAGGGCGTATCTGGACGTATGTCCGGGATGATCGCAATGTGGGTTCGTCATCTCCTCCAGCG
GTCTGGTTCGCGTACTCGCCGAACCGGCAGGGGAAACACCCGGAGCAACACCTCCGCCCCTTCCGGGGTATCCTGCAGGCGGATGCGTTCACAGGTTACG
ACAGGTTGTTCAGTGCAGAACGTGAAGGTGGTGCACTGACAGAAGTTGCGTGCTGGGCCCATGCCCGGCGAAAAATCCACGATGTATACATCAGCAGCAA
AAGTGCGACGGCAGAAGAAGCCCTGAAGCGAATCAGTGAACTGTACGCCATCGAGGATGAAATACGGGGATTACCGGAGTCAGAGCGTCTTGCCGTCAGG
CAGCAGCGAAGCAAAGTGTTACTGACGTCGCTGCATGAATGGATGGTGGAGAAGAATGGTACGCTGTCGAAAAAATCCAGACTGGGCGAAGCGTTCAGCT
ATGTACTGAATCAGTGGGATGCCCTCTGTTATTACAGTGATGACGGTCTGGCGGAGGCGGATAATAATGCTGCGGAAAGAGCGCTTCGTGCAGTCTGTCT
CGGAAAGAAAAACTTTATGTTCTTTGGCAGCGATCACGGCGGCGAGCGTGGAGCACTGTTGTACGGGCTGATCGGCACCTGCCGTCTGAACGGTATCGAT
CCGGAAGCGTATCTGCGCCATATCCTGAGCGTACTGCCGGAATGGCCTTCCAACCGAGTTGACGAACTCCTGCCATGGAACGTAGTACTCACCAATAAAT
AAGCGTCAATACGGTGCTCCGTTGACGCTTAC
Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
426 bp141 aa86511+No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MELQDWRKEPRKNYSNEFKLRMVELASQPGACVAQIARENGVNDNVIFKWLRLWQNEGRVSRRLPVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTS
TQTRVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
351 bp116 aa508858+No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDTVKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSSRNSLTML

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
1614 bp537 aa8892502+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALTRAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVELAEALLKQREQDSDR
YSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAG
PGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPVNDALYRYVMNTRKVHTDDTPVKVLAPGQKKAKTGRIWTYVRDDRNVGSS
SPPAVWFAYSPNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGGALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK

 

Blast result :
Comments
4 identical copies in E.coli SE15 chromosome, plus 1 remnant. An example of a complete ISEc23 is found in the E.coli SE15 genome sequence at co-ordinates 1184983-1187514.
orf1 is 60% aa similar to ISEc22 (orf1)
orf2 is 86% aa similar to ISCro1 (orf2)
orf3 is 76% aa similar to ISEc22 (orf3)
References
1] Tadasuke Ooka, Tetsuya Hayashi (2008) Direct submission.