ISEc43
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008460 | ND | Escherichia coli | Escherichia coli CFT073; Escherichia coli 042; Escherichia coli plasmid pO86A1 |
DNA section
IS Length : 2541 bp
Ends
IR Length : 14/22
IRL : GTAATCGTAAAGCCAATGCCGTCTGTAACACCTGCTCCTTGCAGACTAAA
IRR : GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTAGTGAGTACTA
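The IR Length value can be read against the two terminal sequences above. The following is a minimal Python sketch under two assumptions that this record does not state explicitly: that IRR is displayed as the reverse complement of the element's right end (so that it aligns with IRL), and that "14/22" means 14 identical positions within the first 22 aligned bases. The helper names are illustrative only.

```python
# Terminal sequences copied from the Ends section of this record.
IRL = "GTAATCGTAAAGCCAATGCCGTCTGTAACACCTGCTCCTTGCAGACTAAA"
IRR = "GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTAGTGAGTACTA"

# Last 50 bases of the DNA sequence in this record (top strand, 5'->3').
RIGHT_END = "TAGTACTCACTAATAAATAAGCGTCAATACGGTGCTCCGTTGACGCTTAC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def identical_positions(a: str, b: str, window: int) -> int:
    """Count identical bases over the first `window` aligned positions."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# Assumption 1: IRR is the reverse complement of the right end.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR

# Assumption 2: "14/22" = identical positions / compared length.
ir_matches = identical_positions(IRL, IRR, 22)

print(irr_matches_right_end, ir_matches)
```

Under these assumptions the two values line up with the record: the displayed IRR is the reverse complement of the last 50 bases, and 14 of the first 22 positions are identical between IRL and IRR.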
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATGAAAAGAT | AAATAATA | CCAGTGGAAT | 8 |
TGGACACCTG | | ACGCACCTGC | 0 |
GCCTCTATGG | | GACTATTACT | 0 |
DNA sequence
GTAATCGTAAAGCCAATGCCGTCTGTAACACCTGCTCCTTGCAGACTAAATTAGAGCTCCTTCTAAATTAGACGGAGTTCTATTGATGGATAAACCTACA
GACTGGCGCTCCGGAACCCGCCGGATATTTTCTAATGAATTTAAACTTCATATGGTTGAACTGGCTTCGAAACCAAATGCCAATGTTGCACAACTGGCCC
GGGAACATGGCGTTGATAACAACCTGATTTTTAAATGGCTACGCCTCTGGCAAAGAGAAGGACGTATTTCTCGTAGAATGCCTCCAACTATTGTAGGCCC
TACAGTATCACAATCTTTTCCGGCCTCTCCGACTCTGGTTCCAGTGGAACTTATCGACACCCCGCGCTGTGCTACAGATGCTCCTGCTCCGGAGGCATTA
TCAGTTGCTTGTGCAGCTTCCTGCCATGTGGAATTCCATTACGGTAAAATGATGCTGGAAAATCCTTCACCAGAGCTGCTCACGGTGTTGATCCGTGAGC
TGACCGGGAGGGGACGATGATTTCACTCCCATCAGGTACCCGTATCTGGCTCGTTGCCGGCGTTACTGATATGCGTAAATCCTTCAACGGTCTGGGGGAG
CAGATACAGCATGTGCTGGATGATAACCCCTTCTCCGGTCACCTGTTTATCTTCCGTGGCCGACGGGGAGACACGATTAAAATCCTGTGGGCTGATGCTG
ATGGTCTGTGCCTGTTCACCAAACGCCTTGAGGAAGGTCAGTTTATCTGGCCTGCGGTGCGTGACGGTAAGATATCCATTACCCGCTCGCAACTGGCAAT
GCTCCTCGATAAGCTGGACTGGCGTCAGCCAAAAACATCCCGCCTTAACGCACTGACAATGTTGTAAAAAACTCCTGACCGCATTATAAAAACGGCCATG
AGTCAGAAATACCTCATTCGCATCGCTGAGCTGGAAAGGCTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAG
AGACGGAGGCCTTCCTGCGCTCTGCACTGGCACGTGCCGAAGAAAAGATCGAAGAAGATGAACGGGAAATAGAGCATCTGCGGGCTCAGATAGAAAAACT
GCGCCGGATGCTGTTCGGAACCCGTTCTGAAAAACTGCGTCGTGAGGTTGAACAGGCTGAAGCCCTGCTGAAACAACGCGAGCAGGAAAGCGATCGTTAC
AGTGGGCGTGAGGATGACCCGCTGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACATCTCCCCCGTGAAATATACCGCCTGG
AGCCTGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCCCTGAAAGT
GATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGCATCGTTGAAGCACCGGCACCATCCCGTCCGATAGAGCGTGGTATCGCGGGCCCG
GGGTTACTTGCCCGCGTGTTAACGGGAAAATACTGCGAACACCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGCCAGGGTGTCGAACTGAGCCGTG
CCTTACTCTCCAACTGGGTTGATGCGTGCTGCCAGTTAATGACGCCGCTGAATGATGCCCTGTACAGTTATGTGATGAACACCCGCAAGGTTCACACTGA
TGACACACCAGTAAAAGTACTGGCACCGGGCAGGAAGAAGGCGAAAACAGGATATATCTGGACGTATGTCCGGGATGACCGAAATGCCGGTTCGCCAGAA
CCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCATCAGGGTAAACATCCGGAGCAACACCTTCGTCCCTTCCGGGGTATCCTGCAGGCAGATGCGTTCG
CAGGTTACGATCGGCTGTTCAGTGCCGAACGTGAAGGCGGCGCGTTGACGGAAGCAGGATGCTGGGCTCATGCGCGGCGCAAAATCCACGATGTATATAT
CAGTACCAAAAGCGCGACGGCGGAAGAAGCACTGAAACTAATCGGCGAACTGTACGCCATTGAGCACGAAATACGCGGGTTGCCGGTGTCTGAACGCCTG
GCGGTCAGGCAAATGCAGAGTAAACCGCTACTGACTTCCCTGTATAAGCTGATGCAGGAGAAAGAACACACGTTATCGAAAAAATGCCGTCTGAGAGATG
CGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTGTGCAACTTCTGTGATGACGGTCTGGCGGAGGCGGACAATAACACAGCGGAAAGAGCGCTTCGTGC
AGTCTGTCTCGGAAAGAAAAATTACGTGTTCTTCGGTAGCGATCACGGCGGCGAGCGTGGTGCACTGTTGTACGGGCTGATCGGCACCTGCCGGTTGAAT
GGTATCGATCCGGAAGCGTATCTGCGCCATATCCTGAGCGTACTGCCGGAATGGCCTTCCAACCGAGTTGACGAACTCCTGCCATGGAACGTAGTACTCA
CTAATAAATAAGCGTCAATACGGTGCTCCGTTGACGCTTAC
Protein section
ORF number : 3
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
435 bp | 144 aa | 86 | 520 | + | No |
AG : IS66 TnpA
ORF sequence :
MDKPTDWRSGTRRIFSNEFKLHMVELASKPNANVAQLAREHGVDNNLIFKWLRLWQREGRISRRMPPTIVGPTVSQSFPASPTLVPVELIDTPRCATDAP
APEALSVACAASCHVEFHYGKMMLENPSPELLTVLIRELTGRGR
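As a spot check on the ORF 1 coordinates, the first codons starting at position 86 of the DNA sequence can be translated and compared with the start of the ORF sequence above. This is a minimal Python sketch, assuming 1-based coordinates and the standard genetic code (the bacterial table assigns the same amino acids to these codons). The function name and the five-codon window are illustrative choices.

```python
# First 100 bases of the ISEc43 DNA sequence, copied from this record.
FIRST_100 = (
    "GTAATCGTAAAGCCAATGCCGTCTGTAACACCTGCTCCTTGCAGACTAAA"
    "TTAGAGCTCCTTCTAAATTAGACGGAGTTCTATTGATGGATAAACCTACA"
)

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, begin: int, n_codons: int) -> str:
    """Translate n_codons codons starting at 1-based position `begin`."""
    start = begin - 1  # convert to 0-based index
    codons = [dna[start + 3 * i:start + 3 * i + 3] for i in range(n_codons)]
    return "".join(CODON_TABLE[codon] for codon in codons)


# ORF 1 is listed as beginning at position 86 on the + strand.
orf1_start = translate(FIRST_100, begin=86, n_codons=5)
print(orf1_start)
```

The result can be compared directly with the first five residues of the ORF 1 sequence (MDKPT).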
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
351 bp | 116 aa | 517 | 867 | + | No |
AG : IS66 TnpB
ORF sequence :
MISLPSGTRIWLVAGVTDMRKSFNGLGEQIQHVLDDNPFSGHLFIFRGRRGDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKISITRSQLAMLLDKL
DWRQPKTSRLNALTML
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1614 bp | 537 aa | 898 | 2511 | + | No |
Chemistry : DDE
ORF sequence :
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQESDR
YSGREDDPLVPRQLRQSRHRRPLPAHLPREIYRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAG
PGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYSYVMNTRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSP
EPPAVWFAYSPDHQGKHPEQHLRPFRGILQADAFAGYDRLFSAEREGGALTEAGCWAHARRKIHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSER
LAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFCDDGLAEADNNTAERALRAVCLGKKNYVFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
Blast result :
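The ORF coordinates listed in the Protein section can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based and inclusive, and that the length in bp includes the stop codon (so the protein length is one less than the codon count). These conventions are inferred from the numbers, not stated in the record. It also reports how ORF 1 and ORF 2 overlap and how far ORF 3 lies from ORF 2.

```python
# (name, length_bp, length_aa, begin, end) as listed in this record.
ORFS = [
    ("ORF 1", 435, 144, 86, 520),
    ("ORF 2", 351, 116, 517, 867),
    ("ORF 3", 1614, 537, 898, 2511),
]
IS_LENGTH = 2541  # "IS Length" from the DNA section


def check_orf(length_bp: int, length_aa: int, begin: int, end: int) -> bool:
    """Check that the coordinates, bp length and aa length agree."""
    span = end - begin + 1  # inclusive 1-based coordinates
    whole_codons = span % 3 == 0
    # Assumption: the final codon is a stop codon and is not counted in aa.
    return span == length_bp and whole_codons and span // 3 - 1 == length_aa


results = {name: check_orf(bp, aa, b, e) for name, bp, aa, b, e in ORFS}
all_inside = all(1 <= b <= e <= IS_LENGTH for _, _, _, b, e in ORFS)

# Bases shared by ORF 1 and ORF 2, and bases between ORF 2 and ORF 3.
overlap_1_2 = ORFS[0][4] - ORFS[1][3] + 1
gap_2_3 = ORFS[2][3] - ORFS[1][4] - 1

print(results, all_inside, overlap_1_2, gap_2_3)
```

With the values in this record, all three ORFs are internally consistent and lie within the 2541 bp element. ORF 1 and ORF 2 share 4 bp, and 30 bp separate ORF 2 from ORF 3.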
Comments
ISEc43 shares 68% (orfA), 100% (orfB) and 93% (orfC) amino acid similarity with IS682.
References
[1] ISfinder annotation (2012)
[2] Yamamoto, T. (2006) Direct submission to GenBank