ISEsp1
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP012999 | ND | Enterobacter sp. | Enterobacter sp. E20 |
DNA section
IS Length : 2494 bp
Ends
IR Length : 16/21
IRL : GTAAGCGTCAAGCCACTGCCGCCTGTTGCGATTACTAACGATTGACGATG
IRR : GTAAGCGTTAAGTGAGCGCCGTATTGACGGCTATTTATTGGTGAGATCTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTAAACCGGG | GATACTTTGC | 0 |
DNA sequence
GTAAGCGTCAAGCCACTGCCGCCTGTTGCGATTACTAACGATTGACGATGATAGAGTCCTCTTATTAACGTTAATGGACTCTATCAATGTCAAACACTCT
TCAGCCCCGCAGGGCGCGGGCGTCCTACTCAATGGACTTTAAGCTGGCTCTCGTCGAAAAGTCATATCAGCCTGGAGCCTGTGTTGCCCGGTTGGCGCGG
GATAATGGAATTAATGACAATCTGCTGTTTACCTGGCGCCAGCGTTACAGACATCTTCTGCCCGATGAAATACAACGGTCAATCAGAGAGCAAGACTCTG
TTATCCCCGTTGTCCTGCCTGATATGGCCCTGTCACACCATGCTGAGCCGCACTATGAACCCGCCGCTCCAGCCTGCCGCGAGGCCATGACATGCGAGGT
GACTGTCGGCGGTGCCAGCCTGCGTCTGTCCGGGGATTTATCACCTGCACTTCTGAAAACGCTGATCCGCGAGCTGACCGGGAGGAGCCGATGATACCCT
TACCGTCAGGCACTCGTATCTGGCTGGTTGCCGGGGTCACCGATATGCGTAAGTCCTTCAATGGTCTGGGCGAACTGGTCCAGCATGTTCTTGATGACAA
TCCGTTCTCCGGCCACCTGTTTATCTTCCGTGGTCGTAAAGGTGACACCGTGAGGATCCTCTGGGCTGATGCTGACGGTCTGTGTCTGTTTACCAAACGT
CTGGAAGAGGGACAGTTCGTCTGGCCTGCTGTACGCGACGGCAAAATCGCCATCACCCGCTCACAACTCGCCATGCTCCTCGATAAGCTGGACTGGCGGC
AACCTAAAACTGCACGCCTTAACTCACTGACGATGTTGTAAAAAGCGCATGACCGCATTATAAATGGGGTCATGAGTCAGGACTATCTCGCCCGTATCGC
TGCGCTGGAAGACGCGCTTCGCCAGAAAGACAGCCAGCTCAGTCTCGTTGCTGAGACTGAGTCGTTCCTGCGTTCGGCGCTGGCCCGCGCAGAAGAGAAA
ATAGAGAACGAAGAGCGTGAAATAGAATATCTGCGGGCTCAGATAGAAAAACTGCGCCGAATGCTGTTCGGTACCCGTTCAGAAAAGCTACGCCGGCAGG
TCGAAGAAGCCGAAGCCCTGCTGAAACAGCAGGAGCAGCAAAGCGATCGTTACAACGGCCGGGACAATGATCAGCAGGTTCCGCGTCAGTTGCGCCAGTC
CCGTCATCGTCGCCCGTTACCGGAACATCTTCCCCGCGAAATAAACAGACTGGAGCCAGCTGAAACCAGCTGTCCTGGTTGCGGTAGTGATATGGCCTAT
CTCAGCGAAGTCAGCGCGGAGCAACTGGAGCTGGTCTCCAGCGCCCTGAAAGTGATCCGCACGGTCAGAGTGAAAAAGGCCTGTACCCGATGCGACTGCG
TCGTTGAAGCGCCAGCGCCCTCACGTCCTATCGACCGGGGCATCGCCGGGCCGGGTCTGCTGGCCCGCGTGTTAACGGCCAAATACTGTGAACACCTGCC
GCTGTATCGCCAGTGCGAAATCTTTGCCCGTCAGGGTGTGGATCTGAGTCGTGCGCTGCTCTCCAACTGGGTGGATGCGTGCTGCCGGTTAATGGCCCCG
CTGGATGAAGCCCTCTACCACTACGTGATGGACTGCCGCAAACTGCATACGGATGACACTCCGGTGCCCGTGCTGGCGCCGGGCAGAAAGAAGACGAAAA
CCGGGCGTATCTGGACATATGTCCGTGATAACAGAAGCGCGGGTTCATCAGATCCGCCAGCGGCATGGTTCGCCTTCTCACCGGACCGACAGGGGAAACA
CCCTCAGCAACATCTTCGGCACTATCATGGCGTGCTGCAGGCAGATGCCTTCGCAGGGTACGACAGGTTGTTCAGCGCAGAGCGTGAAGGTGGCCCGTTG
ACAGAAGCGGCATGCTGGGCTCATGCGCGGCGCAAAATCCATGACGTCTATATCAGCACCCGGACGGCCACAGCAGAGGAGGCTCTGAAGCGCATCAGTG
AGTTATACGCGATAGAAGAGGAAATACGCGGCCTTCCGGCATCTCAGCGGCTGGCCGCCAGACGGTCCCGAAGTAAACCGTTGCTGATATCCCTGCATGA
CTGGTTGGTGGAGAAAAGAGCCACTCTGTCGAAAAAATCCCGGTTAGGCGAGGCGTTCGCTTATGCACTGAACCAGTGGGATGCCCTGTGTTACTACTGC
GATGATGGTCTGGCAGAGCCGGATAATAACGCTGCTGAGCGCGCGCTACGAGCGGTCTGTCTGGGCAAGAAAAACTACATCTTCTTCGGCAGTGATCATG
GTGGTGAACGTGGTGCCCTGCTGTATGGTCTGATCGGAACGTGCAGGCTGAACGGTATCGATCCAGAGGGTTACCTTCGCCATATCCTGAGCGTATTGCC
GGAGTGGCCCATCAACAAAGTGGCCGAACTGCTGCCATGGAACGTAGATCTCACCAATAAATAGCCGTCAATACGGCGCTCACTTAACGCTTAC
TCAGCCCCGCAGGGCGCGGGCGTCCTACTCAATGGACTTTAAGCTGGCTCTCGTCGAAAAGTCATATCAGCCTGGAGCCTGTGTTGCCCGGTTGGCGCGG
GATAATGGAATTAATGACAATCTGCTGTTTACCTGGCGCCAGCGTTACAGACATCTTCTGCCCGATGAAATACAACGGTCAATCAGAGAGCAAGACTCTG
TTATCCCCGTTGTCCTGCCTGATATGGCCCTGTCACACCATGCTGAGCCGCACTATGAACCCGCCGCTCCAGCCTGCCGCGAGGCCATGACATGCGAGGT
GACTGTCGGCGGTGCCAGCCTGCGTCTGTCCGGGGATTTATCACCTGCACTTCTGAAAACGCTGATCCGCGAGCTGACCGGGAGGAGCCGATGATACCCT
TACCGTCAGGCACTCGTATCTGGCTGGTTGCCGGGGTCACCGATATGCGTAAGTCCTTCAATGGTCTGGGCGAACTGGTCCAGCATGTTCTTGATGACAA
TCCGTTCTCCGGCCACCTGTTTATCTTCCGTGGTCGTAAAGGTGACACCGTGAGGATCCTCTGGGCTGATGCTGACGGTCTGTGTCTGTTTACCAAACGT
CTGGAAGAGGGACAGTTCGTCTGGCCTGCTGTACGCGACGGCAAAATCGCCATCACCCGCTCACAACTCGCCATGCTCCTCGATAAGCTGGACTGGCGGC
AACCTAAAACTGCACGCCTTAACTCACTGACGATGTTGTAAAAAGCGCATGACCGCATTATAAATGGGGTCATGAGTCAGGACTATCTCGCCCGTATCGC
TGCGCTGGAAGACGCGCTTCGCCAGAAAGACAGCCAGCTCAGTCTCGTTGCTGAGACTGAGTCGTTCCTGCGTTCGGCGCTGGCCCGCGCAGAAGAGAAA
ATAGAGAACGAAGAGCGTGAAATAGAATATCTGCGGGCTCAGATAGAAAAACTGCGCCGAATGCTGTTCGGTACCCGTTCAGAAAAGCTACGCCGGCAGG
TCGAAGAAGCCGAAGCCCTGCTGAAACAGCAGGAGCAGCAAAGCGATCGTTACAACGGCCGGGACAATGATCAGCAGGTTCCGCGTCAGTTGCGCCAGTC
CCGTCATCGTCGCCCGTTACCGGAACATCTTCCCCGCGAAATAAACAGACTGGAGCCAGCTGAAACCAGCTGTCCTGGTTGCGGTAGTGATATGGCCTAT
CTCAGCGAAGTCAGCGCGGAGCAACTGGAGCTGGTCTCCAGCGCCCTGAAAGTGATCCGCACGGTCAGAGTGAAAAAGGCCTGTACCCGATGCGACTGCG
TCGTTGAAGCGCCAGCGCCCTCACGTCCTATCGACCGGGGCATCGCCGGGCCGGGTCTGCTGGCCCGCGTGTTAACGGCCAAATACTGTGAACACCTGCC
GCTGTATCGCCAGTGCGAAATCTTTGCCCGTCAGGGTGTGGATCTGAGTCGTGCGCTGCTCTCCAACTGGGTGGATGCGTGCTGCCGGTTAATGGCCCCG
CTGGATGAAGCCCTCTACCACTACGTGATGGACTGCCGCAAACTGCATACGGATGACACTCCGGTGCCCGTGCTGGCGCCGGGCAGAAAGAAGACGAAAA
CCGGGCGTATCTGGACATATGTCCGTGATAACAGAAGCGCGGGTTCATCAGATCCGCCAGCGGCATGGTTCGCCTTCTCACCGGACCGACAGGGGAAACA
CCCTCAGCAACATCTTCGGCACTATCATGGCGTGCTGCAGGCAGATGCCTTCGCAGGGTACGACAGGTTGTTCAGCGCAGAGCGTGAAGGTGGCCCGTTG
ACAGAAGCGGCATGCTGGGCTCATGCGCGGCGCAAAATCCATGACGTCTATATCAGCACCCGGACGGCCACAGCAGAGGAGGCTCTGAAGCGCATCAGTG
AGTTATACGCGATAGAAGAGGAAATACGCGGCCTTCCGGCATCTCAGCGGCTGGCCGCCAGACGGTCCCGAAGTAAACCGTTGCTGATATCCCTGCATGA
CTGGTTGGTGGAGAAAAGAGCCACTCTGTCGAAAAAATCCCGGTTAGGCGAGGCGTTCGCTTATGCACTGAACCAGTGGGATGCCCTGTGTTACTACTGC
GATGATGGTCTGGCAGAGCCGGATAATAACGCTGCTGAGCGCGCGCTACGAGCGGTCTGTCTGGGCAAGAAAAACTACATCTTCTTCGGCAGTGATCATG
GTGGTGAACGTGGTGCCCTGCTGTATGGTCTGATCGGAACGTGCAGGCTGAACGGTATCGATCCAGAGGGTTACCTTCGCCATATCCTGAGCGTATTGCC
GGAGTGGCCCATCAACAAAGTGGCCGAACTGCTGCCATGGAACGTAGATCTCACCAATAAATAGCCGTCAATACGGCGCTCACTTAACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
408 bp | 135 aa | 87 | 494 | + | No |
AG : IS66 TnpA
ORF sequence :
MSNTLQPRRARASYSMDFKLALVEKSYQPGACVARLARDNGINDNLLFTWRQRYRHLLPDEIQRSIREQDSVIPVVLPDMALSHHAEPHYEPAAPACREA
MTCEVTVGGASLRLSGDLSPALLKTLIRELTGRSR
MTCEVTVGGASLRLSGDLSPALLKTLIRELTGRSR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
351 bp | 116 aa | 491 | 841 | + | No |
AG : IS66 TnpB
ORF sequence :
MIPLPSGTRIWLVAGVTDMRKSFNGLGELVQHVLDDNPFSGHLFIFRGRKGDTVRILWADADGLCLFTKRLEEGQFVWPAVRDGKIAITRSQLAMLLDKL
DWRQPKTARLNSLTML
DWRQPKTARLNSLTML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1593 bp | 530 aa | 872 | 2464 | + | No |
Chemistry : DDE
ORF sequence :
MSQDYLARIAALEDALRQKDSQLSLVAETESFLRSALARAEEKIENEEREIEYLRAQIEKLRRMLFGTRSEKLRRQVEEAEALLKQQEQQSDRYNGRDND
QQVPRQLRQSRHRRPLPEHLPREINRLEPAETSCPGCGSDMAYLSEVSAEQLELVSSALKVIRTVRVKKACTRCDCVVEAPAPSRPIDRGIAGPGLLARV
LTAKYCEHLPLYRQCEIFARQGVDLSRALLSNWVDACCRLMAPLDEALYHYVMDCRKLHTDDTPVPVLAPGRKKTKTGRIWTYVRDNRSAGSSDPPAAWF
AFSPDRQGKHPQQHLRHYHGVLQADAFAGYDRLFSAEREGGPLTEAACWAHARRKIHDVYISTRTATAEEALKRISELYAIEEEIRGLPASQRLAARRSR
SKPLLISLHDWLVEKRATLSKKSRLGEAFAYALNQWDALCYYCDDGLAEPDNNAAERALRAVCLGKKNYIFFGSDHGGERGALLYGLIGTCRLNGIDPEG
YLRHILSVLPEWPINKVAELLPWNVDLTNK
QQVPRQLRQSRHRRPLPEHLPREINRLEPAETSCPGCGSDMAYLSEVSAEQLELVSSALKVIRTVRVKKACTRCDCVVEAPAPSRPIDRGIAGPGLLARV
LTAKYCEHLPLYRQCEIFARQGVDLSRALLSNWVDACCRLMAPLDEALYHYVMDCRKLHTDDTPVPVLAPGRKKTKTGRIWTYVRDNRSAGSSDPPAAWF
AFSPDRQGKHPQQHLRHYHGVLQADAFAGYDRLFSAEREGGPLTEAACWAHARRKIHDVYISTRTATAEEALKRISELYAIEEEIRGLPASQRLAARRSR
SKPLLISLHDWLVEKRATLSKKSRLGEAFAYALNQWDALCYYCDDGLAEPDNNAAERALRAVCLGKKNYIFFGSDHGGERGALLYGLIGTCRLNGIDPEG
YLRHILSVLPEWPINKVAELLPWNVDLTNK
Blast result :
Comments
ISEsp1 is 96% aa (transposase) similar to ISEcl6.
References
1] Dongsheng Zhou (2020) Direct submission.
2] Liu,Y., Cao,G., Chen,R., Ren,Z., Du,J., Xiang,C., Song,L., Wang,J., Liu,G. and Wang,G. (2019) Direct GenBank submission.
2] Liu,Y., Cao,G., Chen,R., Ren,Z., Du,J., Xiang,C., Song,L., Wang,J., Liu,G. and Wang,G. (2019) Direct GenBank submission.