ISEc44
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_010498 | ND | Escherichia coli | Escherichia coli SMS-3-5 |
DNA section
IS Length : 1879 bp
Ends
Left end : CTTCCCCTCCAAAGGGTACTCCCGCCGACATAAGGCGGGAGGGGAATTTGGACGGGCTTGAAAAGCCCGAAGGGTGATCAGCCACGCTGATGTTGAATGT II struct. : Yes
Right end : CTCGATGAAGCAGGAACCAGACGCTGAGCAATCAGCCGTCACCCACTAATGGGAATCCCCCTCCTGAGTCACGAAGTGCGAAGGTGGGGGAGGATGTCAA II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
CCTGTTTTTCA | CCAT | GTAGCCTTCGT | TCAA |
DNA sequence
CTTCCCCTCCAAAGGGTACTCCCGCCGACATAAGGCGGGAGGGGAATTTGGACGGGCTTGAAAAGCCCGAAGGGTGATCAGCCACGCTGATGTTGAATGT
ATTGCTTCACAACTTCCAGCGGCGCACCTCCGCAGGAGCCAGCAAAGTATGACCTCGACCAGAGCACGGCTTTTCCGTATGCCCCCCGCAAATCAAGAAA
TTCGTTTCGCAGACGGCGGGATGTCACTGCTTTCAGTGAGTTAACCAGTACCGAAAGCTGCACTGTTGGTGGGTATTCGATCAGCATGTGAACATGATCT
ACGTCTCCGTTACTTTCTTTTAGTTCTGCCCCAAAATCACGGCAGACTTCTGCTGCATACTGATGGAATGCTGCACAGTGTAACTCTCCCAGTATCTTTC
GACGGTATTTTGTTACAAAGACAAGATGAACATGCAAAAGGAAGGCTGCATGTCTTGAACGATTGATTTTATATTTTTGCATTGAGGTTGGTCGTAAAAG
CAGTAATATTGCAAGAAAATCTACTACACACTGAATGATGATGCTAATCCTGAAAGCCTACAAATTCAGACTGGAACCAACGCATGAGCAGTCGCAGCGT
TTGCGGCAGTTATGTGGTTGTGCCCGTTTTGTCTGGAATTTAGGTCTTGCGGAGACAAAGCGCATACTTGGCTCAGGCGAAAAGTTACCTTCGGCTTTCG
AGTTGAATCGGATGATTACAGTGTGGAAAAAAATGCCGGAATACATCTTCTTACAGGATGCTTATACCGACAATCTGCAACAAAAGCTGAAAGACCTGCA
TACCGCATGGAAACGTTGTTTTGATAAAAAGCTCGCAGCTAAGGCTCCGGTATGGAAACGAAAAAATGAGGGCAGAGACTCAATCCGTTTTGTGAACTTT
GAGAAATATTGCTGCCTTGAAAATCGCAGAGTGAAGCTACCGTCAGGTCTTGGGTGGGTAAAATTCCGGCAATCTCAACGTGTGAACGGTAAAATCAAAA
ATGCGACAATCAGTCAGTTAGCGGGACAGTGGTATATCTCGTTTCAGGTTGAAATTGAAACGGCAGAACCAAATCACACAAGCACAACGATAGTCGGACT
GGATGCAGGCGTGGCTAAACTTGCCACGCTGTCAGATGGCACAGTCTTTGAGCCTGTAAACAGTTTTCAGAAAAATCAGAAGAAGCTGGCGAGACTCCAG
CGACAGTTAAGCCGCAAGGTCAAATTCAGCAACAACTGGCAGAAGCAGAAACGCAAAATACAGCGACTGCATTCCTGTATCGCAAATATCCGCAGGGACT
ACCTTCACAAAGTCACAACGACCGTCAGCAAAAACCACGCAATGATAGTCATTGAGGATTTGAAGGTCAGCAACATGTCAAAGTCGGCAGCGGGTACGGT
CAGCCAGCCGGGGCGCAATGTCCGGGCAAAATCAGGTTTAAACCGTTCGATACTGGATCAGGGCTGGTATGAAATGCGCCGCCAGCTTGAGTACAAACAG
CTCTGGCGTGGTGGTCATGTAGAGGCGGTAAATCCGGCATACACAAGCCAGCGTTGTTCGTGTTGCGGTCATACGGAAAAAGCAAATCGTCGCACACAAA
GTAAGTTTGAGTGCAAAGCATGTGGGTATGCTGAAAATGCGGACGTAAACGCAGCACGAAACATTTTAGCGACGTGGCACGCTCAAATGGCTACAAGTAC
CGCGGGACACGCGGAAACCGGGAGTCTGTCTCTGGGATAGACTTCCTACGCCTGTGGAGAGGTCGGTGCAGTAAGACCGCTCGATGAAGCAGGAACCAGA
CGCTGAGCAATCAGCCGTCACCCACTAATGGGAATCCCCCTCCTGAGTCACGAAGTGCGAAGGTGGGGGAGGATGTCAA
ATTGCTTCACAACTTCCAGCGGCGCACCTCCGCAGGAGCCAGCAAAGTATGACCTCGACCAGAGCACGGCTTTTCCGTATGCCCCCCGCAAATCAAGAAA
TTCGTTTCGCAGACGGCGGGATGTCACTGCTTTCAGTGAGTTAACCAGTACCGAAAGCTGCACTGTTGGTGGGTATTCGATCAGCATGTGAACATGATCT
ACGTCTCCGTTACTTTCTTTTAGTTCTGCCCCAAAATCACGGCAGACTTCTGCTGCATACTGATGGAATGCTGCACAGTGTAACTCTCCCAGTATCTTTC
GACGGTATTTTGTTACAAAGACAAGATGAACATGCAAAAGGAAGGCTGCATGTCTTGAACGATTGATTTTATATTTTTGCATTGAGGTTGGTCGTAAAAG
CAGTAATATTGCAAGAAAATCTACTACACACTGAATGATGATGCTAATCCTGAAAGCCTACAAATTCAGACTGGAACCAACGCATGAGCAGTCGCAGCGT
TTGCGGCAGTTATGTGGTTGTGCCCGTTTTGTCTGGAATTTAGGTCTTGCGGAGACAAAGCGCATACTTGGCTCAGGCGAAAAGTTACCTTCGGCTTTCG
AGTTGAATCGGATGATTACAGTGTGGAAAAAAATGCCGGAATACATCTTCTTACAGGATGCTTATACCGACAATCTGCAACAAAAGCTGAAAGACCTGCA
TACCGCATGGAAACGTTGTTTTGATAAAAAGCTCGCAGCTAAGGCTCCGGTATGGAAACGAAAAAATGAGGGCAGAGACTCAATCCGTTTTGTGAACTTT
GAGAAATATTGCTGCCTTGAAAATCGCAGAGTGAAGCTACCGTCAGGTCTTGGGTGGGTAAAATTCCGGCAATCTCAACGTGTGAACGGTAAAATCAAAA
ATGCGACAATCAGTCAGTTAGCGGGACAGTGGTATATCTCGTTTCAGGTTGAAATTGAAACGGCAGAACCAAATCACACAAGCACAACGATAGTCGGACT
GGATGCAGGCGTGGCTAAACTTGCCACGCTGTCAGATGGCACAGTCTTTGAGCCTGTAAACAGTTTTCAGAAAAATCAGAAGAAGCTGGCGAGACTCCAG
CGACAGTTAAGCCGCAAGGTCAAATTCAGCAACAACTGGCAGAAGCAGAAACGCAAAATACAGCGACTGCATTCCTGTATCGCAAATATCCGCAGGGACT
ACCTTCACAAAGTCACAACGACCGTCAGCAAAAACCACGCAATGATAGTCATTGAGGATTTGAAGGTCAGCAACATGTCAAAGTCGGCAGCGGGTACGGT
CAGCCAGCCGGGGCGCAATGTCCGGGCAAAATCAGGTTTAAACCGTTCGATACTGGATCAGGGCTGGTATGAAATGCGCCGCCAGCTTGAGTACAAACAG
CTCTGGCGTGGTGGTCATGTAGAGGCGGTAAATCCGGCATACACAAGCCAGCGTTGTTCGTGTTGCGGTCATACGGAAAAAGCAAATCGTCGCACACAAA
GTAAGTTTGAGTGCAAAGCATGTGGGTATGCTGAAAATGCGGACGTAAACGCAGCACGAAACATTTTAGCGACGTGGCACGCTCAAATGGCTACAAGTAC
CGCGGGACACGCGGAAACCGGGAGTCTGTCTCTGGGATAGACTTCCTACGCCTGTGGAGAGGTCGGTGCAGTAAGACCGCTCGATGAAGCAGGAACCAGA
CGCTGAGCAATCAGCCGTCACCCACTAATGGGAATCCCCCTCCTGAGTCACGAAGTGCGAAGGTGGGGGAGGATGTCAA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
405 bp | 134 aa | 482 | 78 | - | No |
Chemistry : Y1
ORF sequence :
MQKYKINRSRHAAFLLHVHLVFVTKYRRKILGELHCAAFHQYAAEVCRDFGAELKESNGDVDHVHMLIEYPPTVQLSVLVNSLKAVTSRRLRNEFLDLRG
AYGKAVLWSRSYFAGSCGGAPLEVVKQYIQHQRG
AYGKAVLWSRSYFAGSCGGAPLEVVKQYIQHQRG
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1200 bp | 399 aa | 541 | 1740 | + | No |
AG : TnpB
ORF sequence :
MLILKAYKFRLEPTHEQSQRLRQLCGCARFVWNLGLAETKRILGSGEKLPSAFELNRMITVWKKMPEYIFLQDAYTDNLQQKLKDLHTAWKRCFDKKLAA
KAPVWKRKNEGRDSIRFVNFEKYCCLENRRVKLPSGLGWVKFRQSQRVNGKIKNATISQLAGQWYISFQVEIETAEPNHTSTTIVGLDAGVAKLATLSDG
TVFEPVNSFQKNQKKLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGL
NRSILDQGWYEMRRQLEYKQLWRGGHVEAVNPAYTSQRCSCCGHTEKANRRTQSKFECKACGYAENADVNAARNILATWHAQMATSTAGHAETGSLSLG
KAPVWKRKNEGRDSIRFVNFEKYCCLENRRVKLPSGLGWVKFRQSQRVNGKIKNATISQLAGQWYISFQVEIETAEPNHTSTTIVGLDAGVAKLATLSDG
TVFEPVNSFQKNQKKLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANIRRDYLHKVTTTVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGL
NRSILDQGWYEMRRQLEYKQLWRGGHVEAVNPAYTSQRCSCCGHTEKANRRTQSKFECKACGYAENADVNAARNILATWHAQMATSTAGHAETGSLSLG
Blast result :
Comments
ISEc44 is 98% (transposase) aa similar to ISSen6.
References
1] ISfinder annotation (2018)
2] Fricke,W.F., Wright,M.S., Lindell,A.H., Harkins,D.M., Baker-Austin,C., Ravel,J. and Stepanauskas,R. (2008) J. Bacteriol.
2] Fricke,W.F., Wright,M.S., Lindell,A.H., Harkins,D.M., Baker-Austin,C., Ravel,J. and Stepanauskas,R. (2008) J. Bacteriol.