ISEc78
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
MF156714 | ND | Escherichia coli | Escherichia coli strain 15061806 plasmid p61806-dfrA |
DNA section
IS Length : 2437 bp
Ends
IR Length : 17/22
IRL : GTAAGCGTCGTCTGAGTACCGTCTGGTCCCGAATCCAGGATCCTGTAACT
IRR : GTAAGCGTGCAGTGAGAACCGTATTGACGAAGTGAAGCTGGTCAGGCAGG
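The 17/22 figure can be checked against the record itself. The sketch below assumes, as the shared GTAAGCGT start of both repeats suggests, that IRR is listed as the reverse complement of the element's right end, so the two repeats can be compared position by position over the first 22 bases. `RIGHT_END` is the last 50 bases of the DNA sequence in this record; the helper names are illustrative and not part of ISfinder.

```python
IRL = "GTAAGCGTCGTCTGAGTACCGTCTGGTCCCGAATCCAGGATCCTGTAACT"
IRR = "GTAAGCGTGCAGTGAGAACCGTATTGACGAAGTGAAGCTGGTCAGGCAGG"
# Last 50 bases of the 2437 bp DNA sequence (top strand, as printed in the record)
RIGHT_END = "CCTGCCTGACCAGCTTCACTTCGTCAATACGGTTCTCACTGCACGCTTAC"


def reverse_complement(seq):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_matches(a, b, window):
    """Count identical positions in the first `window` bases of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# IRR as listed equals the reverse complement of the right end
print(reverse_complement(RIGHT_END) == IRR)  # True
# Matching positions within the first 22 bases of the two repeats
print(count_matches(IRL, IRR, 22))  # 17
```

Under this reading, "17/22" means 17 identical positions within a 22 bp terminal window.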
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGAATCGTCA | | GGCTCTTACG | 0 |
GCCTTTGATT | AGAAGGCG | CACAATGTTA | 8 |
AACAGCGCTG | GCCTATGC | TTGGAAACTT | 8 |
CCCGGGAAAA | GGTTCTGC | CGATGCCCGG | 8 |
DNA sequence
GTAAGCGTCGTCTGAGTACCGTCTGGTCCCGAATCCAGGATCCTGTAACTTAAGCACCCACTTATTTTGCAGGTGGACACCTCATGCGCGCTAAAGAAAG
ACTTCCCCGGAAACACTATTCCCCTGAATTCAAAATGGAACTGGTCAGGCTGGCTCTTGAAGAAGAAGGCAGTATTGCCGCACTGGCCCGGAGACATGAC
GTCAATGATAACCTGCTCTTTAAATGGATAAGGCTCTGGCAGCGTGAAGGGCGGGTCTGTCGGCCCCGAAAAAACTCATCGTCGCTTCCTGCCCTGATAC
CCGTGCAGCTTCAGGCGGGCTCTTCTCTCCCAACCTTCGAACTACCATCCTGCTCTCCTCCTGCTACCTGCCACATAAAATGTCGGGGAGGAGAAATAAC
ACTGACTCATCCCTCTGCTGAACTCATGAGCACTGTCCTGCGCGAACTGATGCGGGGGCCGGTATGATAAATCTTCCCGCAGGCACAAAAATCTGGCTGG
TTGCCGGTATCACCGATATGCGCAACGGCTTCAATGGCCTCGCCGCAAAGATGCAGACCGCGCTGAAAGATGACCCGATGTCCGGCCACGTCTTCATCTT
CCGGGGACGCAGCGGCAGTCAGGTAAAACTGCTCTGGTCCACCGGCGATGGTCTGTGCCTGCTGACAAAGCGACTGGAACGTGGTCGCTTCGCCTGGCCC
TCAGCCCGCGATGGCAAAGTGTTCCTGACGCCGGCGCAACTGGCGATGCTGATGGAAGGTATCGACTGGCGACAGCCAAAGCGGTTACTGACATCCCTGA
CCATGTTGTAGCGCTCTTTATCCTGGTTGTCGCAGAATAAGCCTGGTAAAATACAGGCTTATGAACGACACCTCTTCTGACGACATCCTTCTGCTGAAAC
AGCGCCTGGCCGAACAGGAAGCGTTGATCCACGCCCTGCAGGAAAAGCTGAGCAACCGGGAACGCGAAATAGACCATCTGCAGGCGCAACTGGATAAGCT
GCGCCGGATGAACTTCGGCAGTCGTTCCGAAAAGGTATCCCGCCGTATCGCGCAAATGGAAGCTGACCTGAACCTGCTGCAGCAGGAAAGCGATACGCTG
ACCGGCCGGGTGTATGACCCGGCAGTGCAGCGCCCGCTGCGTCAGACCCGCACCCGCAAACCGTTCCCTGAATCACTACCCCGTGACGAAAAGCGACTGC
TGCCGACTGAGCCGTGTTGCCCGGAGTGCGGTGGTTCGCTGAGTTACCTGGGTGAGGATGCCGCCGAACAGCTGGAGCTGATGCGCAGCGCCTTCCGGGT
TATCCGGACGGTACGTGAAAAACATGCCTGCCGTCGGTGCGATCGCATCGTTCAGGCCCCGGCTCCTTCGCGCCCCATCGAGCGGGGTATCGCCGGACCG
GGGCTGCTGGCCCGAGTGCTGACCTCAAAGTATGCAGAGCACACACCGCTGTATCGCCAGTCGGAGATCTATGCCCGCCAGGGTGTAGTGTTGAGTCGTT
CTGTACTGTCGGGCTGGGTGGATGCGTGTTGTCGTCTGCTGGCACCGCTGGATGAAGCCCTTCAGCACTATGTCCTGACCGACGGCAAACTCCATGCTGA
CGATACGCCTGTCCCGGTGCTGTTGCCGGGCAATAAGAAGACGAAGACCGGGCGCTTATGGACGTACGTTCGAGACGACCGCAACGCCGGATCAGCGCTG
GCCCCCGCAGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGTATCCACCCTCAGACCCATCTTGCAGGCTTCAGTGGCGTACTGCAGGCTGATGCCTACG
CCGGGTTCAACGAGCTCTACCGCGACGGCCATATAAAGGAAGCCGCGTGCTGGGCCCATGCCCGCCGAAAAATCCACGATGTTCACGTCCGTACCCCGTC
AGCCCTGACGGAGGAGGCGCTGAAACGGATCGGCGAGCTGTATGCCATCGAGTCGGAGCTCAGAGGTAAAAGAGCAGAGGAACGGCAGGCAGTCCGGCAC
CAAAAAGTGCTGCCGCTGCTGGCGTCACTGGAGGGGTGGCTGCGGGAGAAACAGAAAACCCTCTCAAGGCACTCAGAACTGGCGAAGGCGTTCGGGTATG
CGCTGAACCAGTGGCCGGCGCTGACCCGCTATGCAGAAGATGGCTGGGTGGAGGTGGATAATAACATTGCCGAAAACGCCCTACGACTGGTCAGTCTGGG
GAGAAAAAACTGGCTGTTCTTCGGCTCGGATCATGGTGGTGAGCGCGGTGCGTCACTGTACAGTCTTATCGGGACGTGCAAATTAAACGGCGTGGATCCG
GAACGCTATCTGCATTATGTACTTGATGTCATCGCCGACTGGCCTGTGAACCGGGTGGGTGCTCTGCTCCCGTGGCGGGTTACTCTGCCTGCCTGACCAG
CTTCACTTCGTCAATACGGTTCTCACTGCACGCTTAC
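As a consistency check between the DNA and protein sections, the first codons of ORF 1 can be translated from the sequence above. The sketch below assumes the standard genetic code and uses only the first 100 bases of the sequence; `translate` is a hypothetical helper written for this illustration, not an ISfinder tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 100 bases of the ISEc78 DNA sequence
FIRST_100 = (
    "GTAAGCGTCGTCTGAGTACCGTCTGGTCCCGAATCCAGGATCCTGTAACT"
    "TAAGCACCCACTTATTTTGCAGGTGGACACCTCATGCGCGCTAAAGAAAG"
)


def translate(dna, begin):
    """Translate the complete codons of `dna` from 1-based position `begin`."""
    start = begin - 1
    # Drop any trailing bases that do not form a full codon
    usable = len(dna) - (len(dna) - start) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(start, usable, 3))


# ORF 1 is annotated as beginning at position 84
print(translate(FIRST_100, 84))  # MRAKE
```

The result matches the first five residues of the ORF 1 protein sequence listed in the protein section.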
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
384 bp | 127 aa | 84 | 467 | + | No |
AG : IS66 TnpA
ORF sequence :
MRAKERLPRKHYSPEFKMELVRLALEEEGSIAALARRHDVNDNLLFKWIRLWQREGRVCRPRKNSSSLPALIPVQLQAGSSLPTFELPSCSPPATCHIKC
RGGEITLTHPSAELMSTVLRELMRGPV
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 464 | 811 | + | No |
AG : IS66 TnpB
ORF sequence :
MINLPAGTKIWLVAGITDMRNGFNGLAAKMQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTPAQLAMLMEGI
DWRQPKRLLTSLTML
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1536 bp | 511 aa | 861 | 2396 | + | No |
Chemistry : DDE
ORF sequence :
MNDTSSDDILLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNLLQQESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPTEPCCPECGGSLSYLGEDAAEQLELMRSAFRVIRTVREKHACRRCDRIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYARQGVVLSRSVLSGWVDACCRLLAPLDEALQHYVLTDGKLHADDTPVPVLLPGNKKTKTGRLWTYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGHIKEAACWAHARRKIHDVHVRTPSALTEEALKRIGELYAIESELRGKRAEERQAVRHQKVLPLLASLEGWLREKQKT
LSRHSELAKAFGYALNQWPALTRYAEDGWVEVDNNIAENALRLVSLGRKNWLFFGSDHGGERGASLYSLIGTCKLNGVDPERYLHYVLDVIADWPVNRVG
ALLPWRVTLPA
Blast result :
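The ORF coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions and that the length in bp includes the stop codon, which is consistent with the numbers in this record; the function names are illustrative.

```python
# (name, begin, end, length_bp, length_aa) as listed in the protein section
ORFS = [
    ("ORF 1", 84, 467, 384, 127),
    ("ORF 2", 464, 811, 348, 115),
    ("ORF 3", 861, 2396, 1536, 511),
]


def check_orf(begin, end, length_bp, length_aa):
    """True if the span matches the bp length and (codons - stop) matches aa."""
    span = end - begin + 1
    return span == length_bp and span % 3 == 0 and span // 3 - 1 == length_aa


def overlap_bp(first, second):
    """Number of bases shared by two ORF records (0 if they do not overlap)."""
    return max(0, min(first[2], second[2]) - max(first[1], second[1]) + 1)


for orf in ORFS:
    print(orf[0], check_orf(*orf[1:]))  # each ORF prints True

# ORF 1 (ends at 467) and ORF 2 (begins at 464) share 4 bases
print(overlap_bp(ORFS[0], ORFS[1]))  # 4
```

All three ORFs pass the check, and the 4 bp overlap places the start of ORF 2 over the end of ORF 1.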
Comments
The ISEc78 transposase shares 94% amino acid similarity with the transposase of ISSgsp1.
References
[1] Zhou, D. (2017) Direct submission to ISfinder
[2] Li, P., Feng, J., Zeng, L., Jiang, X., Zhan, Z., Luo, W., Wang, J. and Zhou, D. (2017) Direct submission to GenBank