IS120
- Family IS3
- Group IS150
Isoform Synonym(s) IS1447
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | Y | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 Clostridium thermocellum |
DNA section
IS Length : 1447 bp
Ends
IR Length : 10/12
IRL : TATAATGATACCAAAAATTAAGACAGACAAAACAGCCCAAATAAGTTAGA
IRR : TATAATGCTCCCCATTGTCAAGACAGTTTTTTAGCTTTTCTAAGTTAGTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
NNNGGCTTGG | TTA | TGCCGGTACT | 3 |
CAAAAAGCGTATTGACA | TAA | AGTATATGAAGACAAAA | 3 |
CAATACATTTGAAGAGA | TAC | CGTCAAATAAAGAGTAA | 3 |
TTTATTACCCAATTCAGGAA | CTTAAAGCATTTGACGAGTC | 0 | |
TCTGCAGGACGGCCTCA | TAC | AACCTCTGGGCTGTAAT | 3 |
TAGGTGAATTTGCAGCT | CTA | TGTACATTTATATTTGT | 3 |
ATCTGCGATTTTTGAAT | CTT | TATCTGAACGTCAGGAT | 3 |
CTTATCAAAAAACGAAC | CCG | TTTCTATATACATCTAT | 3 |
GTATATTGAGCTCTTTG | CAG | AGCCTGTACACCTTTTT | 3 |
CGCGGTTACCTTAGATT | TAT | CTTCCTGATTGATTTTA | 3 |
AAAGTTTGTACACATA | ATAG | TATATTAAAGGGACCT | 4 |
TTTCTATAAGAAAAAAG | CAT | CGGCAGAATTTAATTCC | 3 |
TGTAGCATTAAGTGGTA | TTT | CTCTATATTTCATACCT | 3 |
AGTCCACTCTTTCGGCT | GTA | ACGATAATTGTTTTAAC | 3 |
GAATAGCAGTTATAGAAGGA | TATAATGGTCTTTAAGCCTA | 0 | |
TGTCGTGGGAAATTTGA | AAT | CATAGTATAATAATACT | 3 |
GCATCCACAATGAAGTG | CTT | CGTTGCAGGCACGCTGA | 3 |
AGCATGACGGTCAGGAA | AAT | TGCACACCATTTTTTTA | 3 |
TCAATTACGGTCGGCTT | TTC | TGTCATTAAAAGATAAC | 3 |
DNA sequence
TATAATGATACCAAAAATTAAGACAGACAAAACAGCCCAAATAAGTTAGAATGGAACTATGGAAAAGAGAAAACATTTTACACCTGAACAAAAAGCAAAA
ATAGTGATTGAGGTCATCAAGGGAGAAAGAACGCTGAATGAGATTGCTGCAGAATATGGAATTCATCCAAACCTGTTAAGTCGCTGGAAGACTGAATTCA
TAAGCAATGCGGGCAGAGTATTCAGCAAGGAAACTGATGAAGTAGAGAAGGTCAAACAGTCGTATGAAAAGGAGAAGGACGAACTGCTTAAGCAAATTGG
TCAACTATCATATGAGGTTGCCTGGCTTAAAAAAAAATCTGGCCTCCTCTAAATCCCGAGAAGACCGCATGAAAATGATTGATAGAAATGAGAAGAAACT
CAGCATAACAAGGCAAGCAGAATTATTGAGCTTAAACCGTACGAGCGTTTACTACAAGCCTGCTCCGGTAAATGAGGAGGAATACCTGATTAAGCGTATC
ATTGATGAAATTTACGCGTCTTATCCGGAATATGGCTATCGCAGGATGACAAGTATATTGAACAAGGATTATCACATTCATATCAATCGAAAACGGACCC
GGCGTTATATGAGGGAAATGGGCATACATGGATTCTGTCCTGGCCCCAACCTCAGCAAACGAATACATGGTAAGAATTTGTATCCATATCTGTTGAGAAA
CTTGAAAATTGATCATCCTAATCAGGTATGGTCCATAGATGTGACCTATTGCCGAATGAAACGCGGTTTCATGTATATGGTTGCAATAATAGACTGGTAT
TCTCGGTATATTGTTGGGTTTGAACTATCAAACACTCTTGATAAGACATTCGTCATAGAAGCAATCCAAAAGGCCATAAAGCGATATGGCAAGCCTGAAA
TCATGAACAGTGATCAAGGCTCACAGTTTACCAGTGATGATTACATAAATCTATTAAAAAATAACGGTATCAAAATATCTATGGATGGAAAAGGAAGAGC
ATTAGACAACCAAAGGATAGAACGATTTTTCCGTTCCTACAAGTGGGAGAAACTTTATCTTGAAGAGTGCGAAACGGTACAACAACTTAGACAAATCACA
AAGGAATATGTGGAGCACTATAACCATAGGAGACCGCACCAGTCATTGGATTACAAAACACCGGCAGAGTATTACTTTGGAGGATATGACCAGCTACTGG
CAGTTGTATAGAATTATGGGGCTCCGCCCCAAACCCCGTCCTCACCGGAAGGCAGCCGGTCTGTCATAACAGACCGGAAAGCAAAAGGATATATGTCCAA
AGGATGTCAAGGGTCAAGATGAACTCGCTTACGCTCGCCCTTGACATCCTCCAACAGAGTGCACAGTTGTAAAGATTATACAAAATTAAGAAAGGAGAAC
TAACTTAGAAAAGCTAAAAAACTGTCTTGACAATGGGGAGCATTATA
ATAGTGATTGAGGTCATCAAGGGAGAAAGAACGCTGAATGAGATTGCTGCAGAATATGGAATTCATCCAAACCTGTTAAGTCGCTGGAAGACTGAATTCA
TAAGCAATGCGGGCAGAGTATTCAGCAAGGAAACTGATGAAGTAGAGAAGGTCAAACAGTCGTATGAAAAGGAGAAGGACGAACTGCTTAAGCAAATTGG
TCAACTATCATATGAGGTTGCCTGGCTTAAAAAAAAATCTGGCCTCCTCTAAATCCCGAGAAGACCGCATGAAAATGATTGATAGAAATGAGAAGAAACT
CAGCATAACAAGGCAAGCAGAATTATTGAGCTTAAACCGTACGAGCGTTTACTACAAGCCTGCTCCGGTAAATGAGGAGGAATACCTGATTAAGCGTATC
ATTGATGAAATTTACGCGTCTTATCCGGAATATGGCTATCGCAGGATGACAAGTATATTGAACAAGGATTATCACATTCATATCAATCGAAAACGGACCC
GGCGTTATATGAGGGAAATGGGCATACATGGATTCTGTCCTGGCCCCAACCTCAGCAAACGAATACATGGTAAGAATTTGTATCCATATCTGTTGAGAAA
CTTGAAAATTGATCATCCTAATCAGGTATGGTCCATAGATGTGACCTATTGCCGAATGAAACGCGGTTTCATGTATATGGTTGCAATAATAGACTGGTAT
TCTCGGTATATTGTTGGGTTTGAACTATCAAACACTCTTGATAAGACATTCGTCATAGAAGCAATCCAAAAGGCCATAAAGCGATATGGCAAGCCTGAAA
TCATGAACAGTGATCAAGGCTCACAGTTTACCAGTGATGATTACATAAATCTATTAAAAAATAACGGTATCAAAATATCTATGGATGGAAAAGGAAGAGC
ATTAGACAACCAAAGGATAGAACGATTTTTCCGTTCCTACAAGTGGGAGAAACTTTATCTTGAAGAGTGCGAAACGGTACAACAACTTAGACAAATCACA
AAGGAATATGTGGAGCACTATAACCATAGGAGACCGCACCAGTCATTGGATTACAAAACACCGGCAGAGTATTACTTTGGAGGATATGACCAGCTACTGG
CAGTTGTATAGAATTATGGGGCTCCGCCCCAAACCCCGTCCTCACCGGAAGGCAGCCGGTCTGTCATAACAGACCGGAAAGCAAAAGGATATATGTCCAA
AGGATGTCAAGGGTCAAGATGAACTCGCTTACGCTCGCCCTTGACATCCTCCAACAGAGTGCACAGTTGTAAAGATTATACAAAATTAAGAAAGGAGAAC
TAACTTAGAAAAGCTAAAAAACTGTCTTGACAATGGGGAGCATTATA
Recoding section
- Recoding by frameshift
- Frame +1
- Type translational
- Experimentally demonstrated Yes
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
294 bp | 97 aa | 59 | 352 | + | No |
Description : First part of the transposase
ORF sequence :
MEKRKHFTPEQKAKIVIEVIKGERTLNEIAAEYGIHPNLLSRWKTEFISNAGRVFSKETDEVEKVKQSYEKEKDELLKQIGQLSYEVAWLKKKSGLL
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
966 bp | 321 aa | 246 | 1211 | + | No |
Description : Second part of the transposase
ORF sequence :
RRSNSRMKRRRTNCLSKLVNYHMRLPGLKKNLASSKSREDRMKMIDRNEKKLSITRQAELLSLNRTSVYYKPAPVNEEEYLIKRIIDEIYASYPEYGYRR
MTSILNKDYHIHINRKRTRRYMREMGIHGFCPGPNLSKRIHGKNLYPYLLRNLKIDHPNQVWSIDVTYCRMKRGFMYMVAIIDWYSRYIVGFELSNTLDK
TFVIEAIQKAIKRYGKPEIMNSDQGSQFTSDDYINLLKNNGIKISMDGKGRALDNQRIERFFRSYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPHQS
LDYKTPAEYYFGGYDQLLAVV
MTSILNKDYHIHINRKRTRRYMREMGIHGFCPGPNLSKRIHGKNLYPYLLRNLKIDHPNQVWSIDVTYCRMKRGFMYMVAIIDWYSRYIVGFELSNTLDK
TFVIEAIQKAIKRYGKPEIMNSDQGSQFTSDDYINLLKNNGIKISMDGKGRALDNQRIERFFRSYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPHQS
LDYKTPAEYYFGGYDQLLAVV
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1153 bp | 384 aa | 59 | 1211 | + | Yes |
Chemistry : DDE
ORF sequence :
MEKRKHFTPEQKAKIVIEVIKGERTLNEIAAEYGIHPNLLSRWKTEFISNAGRVFSKETDEVEKVKQSYEKEKDELLKQIGQLSYEVAWLKKKNLASSKS
REDRMKMIDRNEKKLSITRQAELLSLNRTSVYYKPAPVNEEEYLIKRIIDEIYASYPEYGYRRMTSILNKDYHIHINRKRTRRYMREMGIHGFCPGPNLS
KRIHGKNLYPYLLRNLKIDHPNQVWSIDVTYCRMKRGFMYMVAIIDWYSRYIVGFELSNTLDKTFVIEAIQKAIKRYGKPEIMNSDQGSQFTSDDYINLL
KNNGIKISMDGKGRALDNQRIERFFRSYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPHQSLDYKTPAEYYFGGYDQLLAVV
REDRMKMIDRNEKKLSITRQAELLSLNRTSVYYKPAPVNEEEYLIKRIIDEIYASYPEYGYRRMTSILNKDYHIHINRKRTRRYMREMGIHGFCPGPNLS
KRIHGKNLYPYLLRNLKIDHPNQVWSIDVTYCRMKRGFMYMVAIIDWYSRYIVGFELSNTLDKTFVIEAIQKAIKRYGKPEIMNSDQGSQFTSDDYINLL
KNNGIKISMDGKGRALDNQRIERFFRSYKWEKLYLEECETVQQLRQITKEYVEHYNHRRPHQSLDYKTPAEYYFGGYDQLLAVV
Blast result :
Comments
Unlike the other IS3 family members, fusion between ORFA and ORFB of IS120 requires a +1 translational frameshifting. IS120 ORF2 is 30 % identical to that of IS150.
The third ORF is a putative ORFAB reconstructed in silico.
The third ORF is a putative ORFAB reconstructed in silico.
References
1] Snedecor, B., Chen, E., Gomez, R.F. (1983) in Proc. IVth Int. Symp. Genet. Industr. Microorg., 1982, pp. 356-360.
2] Chandler, M., and Fayet, O. (1993) Mol. Microbiol. 7, 497-503.
3] Zverlov,V.V., Klupp,M., Krauss,J. and Schwarz,W.H. (2008) J. Bacteriol. 190 (12), 4321-4327.
2] Chandler, M., and Fayet, O. (1993) Mol. Microbiol. 7, 497-503.
3] Zverlov,V.V., Klupp,M., Krauss,J. and Schwarz,W.H. (2008) J. Bacteriol. 190 (12), 4321-4327.