ISSmi2
- Family IS1182
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_013853 | ND | Streptococcus mitis | Streptococcus mitis B6 |
DNA section
IS Length : 1798 bp
Ends
IR Length : 25/26
IRL : GAGGCTGGGCAAAAACTCGCTTCTAACTATTAAAAAAACGAGCATTTCCG
IRR : GAGGCTGGGCAAAAACTAGCTTCTAAATAAAACCACACAGTGATTCCAGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATAAAAGTAA | TTACTGATCT | 0 | |
GAAAGAACAA | TT | TATTTTTTTG | 2 |
AAAAAGTAAA | TTTATTTCGT | 0 | |
CACAAAGGAT | GTCTTTTCCT | 0 |
DNA sequence
GAGGCTGGGCAAAAACTCGCTTCTAACTATTAAAAAAACGAGCATTTCCGTAGTTGGAGATGCTCGTTTTCTTATGTCATGGTTTATAATTAAAATAACA
ACAAAACAATTAGAAAGCCATGACAATGTATAAAGATTATAACACAAATCAGTTAAGTTTGGAAGTAAATCTTGCCTACGATATTCCATCAAATCACGAA
GCAAGAATGATTAGCCTCTTTGTAGACAGTATCCCTAGTCAAGTTCTATTGGAAGAGACCTCTCACACTGGTCGCCCTGCCTTTCATCCAGCCATGCTCA
TGAAGATGACCCTGTTCGCTTACGCTCGTCAAGTGTTCTCTGGACGTAAAATCGTTCAAATGAACTCAGAAGTCATTCCCATGAAATGGTTGAGCCAAGA
TACCTATGTTAGCTATAAAACGATTAATAACTTCCGCTCCAGTAAACATGCCAACAACCTAATCAAAACTGCTTTTATTTACTTCACTTTACTCATGCGT
GAGAATGGGATGATTGAGGATGATGCCTTGTTCATTGATGGAACAAAACTAGAAGCTGATGCCAACCTCTATTCTTTCACTTGGAAAAGAGCTATTGACA
AATATGAAGCAGCCTTGAATGGGAAAATTTCCCAGCTCTATGACCAGCTGATTCAAGAAGGAGTAAATCTCGCCTTATCTAAAGAAGAACGGGAAACCAG
TCAGGGTTTGGAGGAGCTACTAGAAAAAACGGAGGCTACCTTAGTTGAAATAGAACAAGCTATTGAAGAAGAGCCTAAAGTGATAAAGGGTGGGTCTGCC
AATAGACAAAAACGCCGTCGGATAAAGAAAATCCGTAAGCAATTAAAGGATGATTTCCTTCCTCGTAAACAGCGATATGAAGAAGCTAGAAAAATTTTAG
AAGACCGCAACTCTTTCTCAAAAACGGATCATGATGCCACCTTTATGCGAATGAAGGAAGACCATATGCAAAATGGGCAACTGAAACCTGGTTATAATGT
CCAAGCGGCTACCAATGGTCAGTATGTCCTTGATTTTGATATCTTTCCAAATCCAACAGACACTCGCACTCTGAAACCTTTCCTCCAATCGATTCAAACC
CTAGACTTATTCAACTATATCGTAGCAGATGCTGGTTATGGTAGCGAAGAAAATTATCGTTTTATCATTGATGAATTGGAAAAAATACCGCTTATTCCCT
ACACCATGTATCAGAAGGAATTGACTAAAAAGTACCAGACGAGCACAGATAATCCAATGAACTGGGAATATCTTGAGGACACGGACCAGTTTATAAAGCC
CGACGGCGTAGTTTATTCTTTTAAGAACTACTCTAGTCGAACTGACAAGTATGGCTTTCAACGTGATTTCAAGATTTATGAGGCTGATAAAGTTCAAGAC
ACTCCAGAGTTAGAACAGTTGGCTAAAACAGATAGTGGAAACCAGAAACAAATCCATTATAATCCAACTTGGAATTACTTTAAGGAACTCCTTAAGCAAA
CATTACATAGTGAAGAAGGCTCTCGAATCTATGCCAAACGGAAAATCGATGTTGAACCCGTTTTTGGTAGATTGAAGAGTGTTTTTGGCGTGCGCAGAGT
GCATGTCAGAGGGAAACAAGCTGTCTCAACAGAAATAGGATTCATCTTTATGAGCATGAATCTCACCAAGTTGGCCAAGAATCTAGCCTCTAAAACATCC
ACTATTCAAAAACCACACAGTATTTCTTTCAGCTTGATTGGATTCAAGACTGGAATCACTGTGTGGTTTTATTTAGAAGCTAGTTTTTGCCCAGCCTC
ACAAAACAATTAGAAAGCCATGACAATGTATAAAGATTATAACACAAATCAGTTAAGTTTGGAAGTAAATCTTGCCTACGATATTCCATCAAATCACGAA
GCAAGAATGATTAGCCTCTTTGTAGACAGTATCCCTAGTCAAGTTCTATTGGAAGAGACCTCTCACACTGGTCGCCCTGCCTTTCATCCAGCCATGCTCA
TGAAGATGACCCTGTTCGCTTACGCTCGTCAAGTGTTCTCTGGACGTAAAATCGTTCAAATGAACTCAGAAGTCATTCCCATGAAATGGTTGAGCCAAGA
TACCTATGTTAGCTATAAAACGATTAATAACTTCCGCTCCAGTAAACATGCCAACAACCTAATCAAAACTGCTTTTATTTACTTCACTTTACTCATGCGT
GAGAATGGGATGATTGAGGATGATGCCTTGTTCATTGATGGAACAAAACTAGAAGCTGATGCCAACCTCTATTCTTTCACTTGGAAAAGAGCTATTGACA
AATATGAAGCAGCCTTGAATGGGAAAATTTCCCAGCTCTATGACCAGCTGATTCAAGAAGGAGTAAATCTCGCCTTATCTAAAGAAGAACGGGAAACCAG
TCAGGGTTTGGAGGAGCTACTAGAAAAAACGGAGGCTACCTTAGTTGAAATAGAACAAGCTATTGAAGAAGAGCCTAAAGTGATAAAGGGTGGGTCTGCC
AATAGACAAAAACGCCGTCGGATAAAGAAAATCCGTAAGCAATTAAAGGATGATTTCCTTCCTCGTAAACAGCGATATGAAGAAGCTAGAAAAATTTTAG
AAGACCGCAACTCTTTCTCAAAAACGGATCATGATGCCACCTTTATGCGAATGAAGGAAGACCATATGCAAAATGGGCAACTGAAACCTGGTTATAATGT
CCAAGCGGCTACCAATGGTCAGTATGTCCTTGATTTTGATATCTTTCCAAATCCAACAGACACTCGCACTCTGAAACCTTTCCTCCAATCGATTCAAACC
CTAGACTTATTCAACTATATCGTAGCAGATGCTGGTTATGGTAGCGAAGAAAATTATCGTTTTATCATTGATGAATTGGAAAAAATACCGCTTATTCCCT
ACACCATGTATCAGAAGGAATTGACTAAAAAGTACCAGACGAGCACAGATAATCCAATGAACTGGGAATATCTTGAGGACACGGACCAGTTTATAAAGCC
CGACGGCGTAGTTTATTCTTTTAAGAACTACTCTAGTCGAACTGACAAGTATGGCTTTCAACGTGATTTCAAGATTTATGAGGCTGATAAAGTTCAAGAC
ACTCCAGAGTTAGAACAGTTGGCTAAAACAGATAGTGGAAACCAGAAACAAATCCATTATAATCCAACTTGGAATTACTTTAAGGAACTCCTTAAGCAAA
CATTACATAGTGAAGAAGGCTCTCGAATCTATGCCAAACGGAAAATCGATGTTGAACCCGTTTTTGGTAGATTGAAGAGTGTTTTTGGCGTGCGCAGAGT
GCATGTCAGAGGGAAACAAGCTGTCTCAACAGAAATAGGATTCATCTTTATGAGCATGAATCTCACCAAGTTGGCCAAGAATCTAGCCTCTAAAACATCC
ACTATTCAAAAACCACACAGTATTTCTTTCAGCTTGATTGGATTCAAGACTGGAATCACTGTGTGGTTTTATTTAGAAGCTAGTTTTTGCCCAGCCTC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1680 bp | 559 aa | 126 | 1805 | + | No |
Chemistry : DDE
ORF sequence :
MYKDYNTNQLSLEVNLAYDIPSNHEARMISLFVDSIPSQVLLEETSHTGRPAFHPAMLMKMTLFAYARQVFSGRKIVQMNSEVIPMKWLSQDTYVSYKTI
NNFRSSKHANNLIKTAFIYFTLLMRENGMIEDDALFIDGTKLEADANLYSFTWKRAIDKYEAALNGKISQLYDQLIQEGVNLALSKEERETSQGLEELLE
KTEATLVEIEQAIEEEPKVIKGGSANRQKRRRIKKIRKQLKDDFLPRKQRYEEARKILEDRNSFSKTDHDATFMRMKEDHMQNGQLKPGYNVQAATNGQY
VLDFDIFPNPTDTRTLKPFLQSIQTLDLFNYIVADAGYGSEENYRFIIDELEKIPLIPYTMYQKELTKKYQTSTDNPMNWEYLEDTDQFIKPDGVVYSFK
NYSSRTDKYGFQRDFKIYEADKVQDTPELEQLAKTDSGNQKQIHYNPTWNYFKELLKQTLHSEEGSRIYAKRKIDVEPVFGRLKSVFGVRRVHVRGKQAV
STEIGFIFMSMNLTKLAKNLASKTSTIQKPHSISFSLIGFKTGITVWFYLEASFCPAS
NNFRSSKHANNLIKTAFIYFTLLMRENGMIEDDALFIDGTKLEADANLYSFTWKRAIDKYEAALNGKISQLYDQLIQEGVNLALSKEERETSQGLEELLE
KTEATLVEIEQAIEEEPKVIKGGSANRQKRRRIKKIRKQLKDDFLPRKQRYEEARKILEDRNSFSKTDHDATFMRMKEDHMQNGQLKPGYNVQAATNGQY
VLDFDIFPNPTDTRTLKPFLQSIQTLDLFNYIVADAGYGSEENYRFIIDELEKIPLIPYTMYQKELTKKYQTSTDNPMNWEYLEDTDQFIKPDGVVYSFK
NYSSRTDKYGFQRDFKIYEADKVQDTPELEQLAKTDSGNQKQIHYNPTWNYFKELLKQTLHSEEGSRIYAKRKIDVEPVFGRLKSVFGVRRVHVRGKQAV
STEIGFIFMSMNLTKLAKNLASKTSTIQKPHSISFSLIGFKTGITVWFYLEASFCPAS
Blast result :
Comments
ISSmi2 is 74% aa similar to ISLac1.
There is no stop codon at the end of the element, the transposase use the first Stop codon after the end of the IRR.
The transposase in the database is cut at the end of the IS.
There is no stop codon at the end of the element, the transposase use the first Stop codon after the end of the IRR.
The transposase in the database is cut at the end of the IS.
References
1] Submitted by ISfinder (2010)
2] Denapaite,D., Bruckner,R., Nuhn,M., Reichmann,P., Henrich,B., Maurer,P., Schahle,Y., Selbmann,P., Zimmermann,W., Wambutt,R. and Hakenbeck,R. (2010) PLoS ONE 5 (2), E9426
2] Denapaite,D., Bruckner,R., Nuhn,M., Reichmann,P., Henrich,B., Maurer,P., Schahle,Y., Selbmann,P., Zimmermann,W., Wambutt,R. and Hakenbeck,R. (2010) PLoS ONE 5 (2), E9426