ISEc83
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
| | ND | Escherichia coli | Escherichia coli Escherichia coli O121:H19 51104 |
DNA section
IS Length : 2709 bp
Ends
IR Length : 24/34
IRL : GTAAGCGCCCCATCTGCGACGTCTTGTGAAAATTgtcctgtctggcaaca
IRR : gtaagcgcaccgtgaaggacgtggggtaaaaaTTAGTTTACAGATTGAGT
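The "24/34" figure can be reproduced with a short, ungapped comparison of the two ends. This is a minimal sketch: it assumes the value means 24 identical positions over the first 34 bp, and that IRR is listed as the reverse complement of the right end so both repeats read outward-in. The function name `ir_identity` is illustrative only.

```python
# Terminal repeats copied from the record (case is ignored in the comparison).
IRL = "GTAAGCGCCCCATCTGCGACGTCTTGTGAAAATTgtcctgtctggcaaca"
IRR = "gtaagcgcaccgtgaaggacgtggggtaaaaaTTAGTTTACAGATTGAGT"


def ir_identity(left: str, right: str, span: int) -> int:
    """Count identical positions in the first `span` bases (no gaps allowed)."""
    pairs = zip(left[:span].upper(), right[:span].upper())
    return sum(a == b for a, b in pairs)


matches = ir_identity(IRL, IRR, 34)
print(f"{matches}/34")  # prints "24/34"
```

A gapped alignment could give a different count; the ungapped comparison happens to agree with the value in the record.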
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ggttcagacc | cttttttt | aatgatgatg | 8 |
tttacctgta | tgacctac | gccgcatgga | 8 |
agacagtgac | ggatgttg | tcaagatatt | 8 |
ggacatgcca | ttgttttc | tgactgttgg | 8 |
cggcaactga | cagaaatc | tcagcaatga | 8 |
ctgttgttca | aatggcga | caacaatggc | 8 |
tagagtgcga | ttgctgtg | aacaacagac | 8 |
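The table can be read as a record of target site duplication: the direct repeat occurs once in the empty target and twice, flanking the element, after insertion. The sketch below illustrates that relationship under this assumption; `empty_site` and `occupied_site` are hypothetical helper names, and the element is passed in as a plain string.

```python
# Rows copied from the insertion-site table:
# (left flank, direct repeat, right flank, DR length)
SITES = [
    ("ggttcagacc", "cttttttt", "aatgatgatg", 8),
    ("tttacctgta", "tgacctac", "gccgcatgga", 8),
    ("agacagtgac", "ggatgttg", "tcaagatatt", 8),
    ("ggacatgcca", "ttgttttc", "tgactgttgg", 8),
    ("cggcaactga", "cagaaatc", "tcagcaatga", 8),
    ("ctgttgttca", "aatggcga", "caacaatggc", 8),
    ("tagagtgcga", "ttgctgtg", "aacaacagac", 8),
]


def empty_site(left: str, dr: str, right: str) -> str:
    """Target before insertion: the repeat sequence is present once."""
    return left + dr + right


def occupied_site(left: str, dr: str, right: str, element: str) -> str:
    """Target after insertion: the element is flanked by two copies of the repeat."""
    return left + dr + element + dr + right


# Each listed repeat should have the stated DR length.
for left, dr, right, dr_len in SITES:
    assert len(dr) == dr_len
```

Under this model, an occupied site is longer than the empty site by the element length plus one copy of the direct repeat.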
DNA sequence
GTAAGCGCCCCATCTGCGACGTCTTGTGAAAATTgtcctgtctggcaacaatcgcgcccatctatatagatggacacgaacgATGAATTCCCAGACAAAA
AAAGATATTCCCTGCTTCCGTTCTTATTTGCCTGATGCTCTGCGTTTAAGATTTGAAGATAAACTGACCATCCGGGCCATCGCTCAGCGTCTAGGTCTCA
GTCATTCCACAATACATACGCTTTTTCAGCGATTTATTGCATCCGGTATCGCATGGCCATTGCCCGATTCAGTTTCATTAGCTCAGCTTGATGCCATCCT
TTATGCCAACAGAAAGAAGGAATTAACAGAGCCTCAAATCAGCGAAGGCACATGGCGAAAAGAACGGCGAGCCAGCTACAGCCGTGAATTTAAGGTCCGT
CTGGCTAAGCAGGCATTACAGCCCGGTGCTGTTGTTGCCCGGATCGCCAGAGAGCACGGTATCAATGATAACCTGCTGTTTAAATGGAAAAGCCAGTACG
AGGACGGCTTACTGAGCGATGATGACATACAGGAATGCATGCCTGTCCCGGTGGCTCTGACTGATACGCCAGAGCCGACCAGACCAGTTACAAATCCCTT
CTGGCGTAACAAGCCTGATGAGTGCCCTGAGAGTGATCCCGGAAACGTCCCACGGTGCGAGCTGCATCTTAAATCAGGTGTGGTAAAACTGTTTGACCCT
CTCACTCCGGAAATGTTACGGGCGCTAATCCGCGAAATGAAAGGAGGTACCCGATGAtaacgctgccaaccggtaccagaatctggatcatcgctggcat
cacagatatgcgttgtggcttcaacggcctggcttcgaaggtgcagaacacgctgaaagatgacccgttctccgggcatatcttcgtcttccggggccgc
agtggcaaaatggtgaaaatactgtgggccgatcgtgacgggttatgcctgttcgccaaacgcctggaacggggccgcttcgtctggccggtaacccggg
aagggaaagtgcacctgacgccagctcagttatccatgctactggaggggatcgcgtggcaacatcccaaacggacagaacggcctggcatccggataTA
AcccgtgataaaacaggggaATGAACAACACACTCCCCGACGACATCGAGCAACTGAAGGCCCTGCTGATCGCACAGCAGGCTGTTATCGTCCGTCTGTC
TGGTGAAATAACCGGCTATGCCCGCGAGATCAGCTCACTCAGAGCGCTGGTCGCTAAACTGCAGAGAATGTTGTTCGGTCGCAGCAGCGAGAAAAGCCGC
GAGAAGACAGAAAAGAAGATCGCACGGGCAGAAACGCGTATAACCGAGCTCCAGAACAGGCTTGGTGAGGCGCAGTTGCAACTCACCTCAATGGCCGGAG
AGACAGCGCCGAAAACATCAGACTCTCCCGTCCGCAAAGCACTTCCGGCAACACTTCCCCGTGACAGGCAGGTTATCTCCCCGGCAGAAACCGAATGCCC
CGTCTGCAGCGGCAAACTGAAACCGCTGGGAGAAAGCATCTCTGAACAACTGGATATCATCAACACCGCGTTCAGGGTAATCGAAACGGTTCGCCCAAAA
CTGGCCTGCAGCCGGTGCGACTGTATAGTTCAGGCTCCGCAGCCACCAAAACCCATCGAGCGCAGTTACGCCAGTCCGGCTCTGCTGGCCCGCATAATCA
TGGCTAAGTTCGCCGAGCATCTGCCGCTGTACCGTCAGTCGGAAATCTATGCCCGCCAGGGCGTGGAGCTGCACCGCAATACGATGGGGCGCTGGGTTGA
CATCATGGGAGAGCAGCTTCGCCCGCTGTATGATGAACTGAAGCACTATGTGCTGATGGCGGGTAAAGTGCATGCCGATGACACGCCGGTAAATGTACTG
GAGCCGGGTCAGGGTAAAACCCGTACCGGACGGCTGTGGGTCTATGTTCGTGACGATCGCAACGCCGGTTCGACCATGCCGGCAGCGGTGTGGTTCTCAT
ACTCTCCCGACCGCAAAGGCATCCACCCACAGCAACATCTGGCGGACTACAGAGGTATCCTGCAGGCCGATGCATATGCGGGTTACAATGCTCTTTACGA
AAGCGGTCAGGTAACCGAAGCGGCTTGTATGGCACATGCCCGACGCAAGATCCACGATGTACATGTCCGCCATCCAACGACAGTAACGGGAGAAGCGCTC
CGTCGTATCGGGGAACTGTACGCTATCGAGGCTGAGATCCGCGGCAGTCCGGCAGAAGAGCGACTGGCGGTCAGAAAAGCCAGAACGGTACCGCTAATGC
AGTCGTTGTATGAGTGGCTCCAGGGGCAGATGAGCACGCTGTCGCGCCACTCGGATACAGCGAAAGCGTTCACCTATCTGCTGAAGCAATGGGACGCTCT
GAACGAATACTGCCGCAATGGCTGGGTGGAGATCGACAATAACCTGTGTGAAAACGCCCTCCGGGTAGTTGCACTGGGGCGGCGTAACTACATGTTCTTC
GGCTCTGATGGTGGAGGCGAGAGTGCGGCAGTGATGTACAGCCTGATCGGTAGCTGCAAACTTAACGGAATCGAGCCGGAAACGTGGCTACGCCACGTGA
TCAGTGTAATCAACACCTGGCCTGCCAACCGCGTGAAAGAGTTGTTGCCCTGGAATGTCACTCAATCTGTAAACTAAtttttaccccacgtccttcacgg
tgcgcttac
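The uppercase stretches in the DNA sequence correspond to the ORFs listed in the protein section. As a hedged illustration, the sketch below translates the first ten codons starting at position 83 (the Begin value given for ORF 1) using the standard genetic code. Only the first 112 bp of the sequence are included, and the names `CODON_TABLE`, `translate` and `SEQ_START` are illustrative.

```python
# Standard genetic code, built in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 112 bp of the element, copied from the DNA sequence above.
SEQ_START = (
    "GTAAGCGCCCCATCTGCGACGTCTTGTGAAAATTgtcctgtctggcaacaatcgcgcccatctatatag"
    "atggacacgaacgATGAATTCCCAGACAAAAAAAGATATTCCC"
)

begin = 83  # 1-based start of ORF 1, taken from the protein section
print(translate(SEQ_START[begin - 1:begin - 1 + 30]))  # prints "MNSQTKKDIP"
```

The result agrees with the first ten residues of the ORF 1 sequence given in the protein section.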
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
675 bp | 224 aa | 83 | 757 | + | No |
AG : IS66 TnpA
ORF sequence :
MNSQTKKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQRFIASGIAWPLPDSVSLAQLDAILYANRKKELTEPQISEGTWRKERRASYS
REFKVRLAKQALQPGAVVARIAREHGINDNLLFKWKSQYEDGLLSDDDIQECMPVPVALTDTPEPTRPVTNPFWRNKPDECPESDPGNVPRCELHLKSGV
VKLFDPLTPEMLRALIREMKGGTR
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 754 | 1101 | + | No |
AG : IS66 TnpB
ORF sequence :
MITLPTGTRIWIIAGITDMRCGFNGLASKVQNTLKDDPFSGHIFVFRGRSGKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI
AWQHPKRTERPGIRI
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1557 bp | 518 aa | 1121 | 2677 | + | No |
Chemistry : DDE
ORF sequence :
MNNTLPDDIEQLKALLIAQQAVIVRLSGEITGYAREISSLRALVAKLQRMLFGRSSEKSREKTEKKIARAETRITELQNRLGEAQLQLTSMAGETAPKTS
DSPVRKALPATLPRDRQVISPAETECPVCSGKLKPLGESISEQLDIINTAFRVIETVRPKLACSRCDCIVQAPQPPKPIERSYASPALLARIIMAKFAEH
LPLYRQSEIYARQGVELHRNTMGRWVDIMGEQLRPLYDELKHYVLMAGKVHADDTPVNVLEPGQGKTRTGRLWVYVRDDRNAGSTMPAAVWFSYSPDRKG
IHPQQHLADYRGILQADAYAGYNALYESGQVTEAACMAHARRKIHDVHVRHPTTVTGEALRRIGELYAIEAEIRGSPAEERLAVRKARTVPLMQSLYEWL
QGQMSTLSRHSDTAKAFTYLLKQWDALNEYCRNGWVEIDNNLCENALRVVALGRRNYMFFGSDGGGESAAVMYSLIGSCKLNGIEPETWLRHVISVINTW
PANRVKELLPWNVTQSVN
Blast result :
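The ORF tables can be cross-checked with simple coordinate arithmetic. The sketch below assumes that coordinates are 1-based and inclusive and that each nucleotide length includes the stop codon; both assumptions are consistent with the numbers in the record but are not stated there. The helper names are illustrative.

```python
# Values copied from the three ORF tables.
ORFS = {
    "ORF 1": {"bp": 675, "aa": 224, "begin": 83, "end": 757},
    "ORF 2": {"bp": 348, "aa": 115, "begin": 754, "end": 1101},
    "ORF 3": {"bp": 1557, "aa": 518, "begin": 1121, "end": 2677},
}


def is_consistent(orf: dict) -> bool:
    """Span must equal the bp length, and bp must equal 3 * (aa + stop codon)."""
    span = orf["end"] - orf["begin"] + 1
    return span == orf["bp"] and orf["bp"] == 3 * (orf["aa"] + 1)


def overlap(a: dict, b: dict) -> int:
    """Number of positions shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a["end"], b["end"]) - max(a["begin"], b["begin"]) + 1)


def frame(orf: dict) -> int:
    """Reading frame (0, 1 or 2) relative to position 1 of the element."""
    return (orf["begin"] - 1) % 3


for name, orf in ORFS.items():
    print(name, is_consistent(orf), "frame", frame(orf))

print("ORF 1 / ORF 2 overlap:", overlap(ORFS["ORF 1"], ORFS["ORF 2"]), "bp")
```

With these values, all three ORFs are internally consistent, ORF 1 and ORF 2 share 4 bp (the ATGA at positions 754 to 757), and ORF 2 lies in a different frame from ORF 1 and ORF 3.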
Comments
The ISEc83 transposase shares 79% amino acid similarity with that of ISSfl3.
References
[1] Keiji Nakamura (2018) Direct submission.