ISCth4
- Family IS256
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | ND | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 |
DNA section
IS Length : 1513 bp
Ends
IR Length : 26/33
IRL : GAGATTGTAAAATATTTTATGTAAACTAATTGACTCCTTCTATGATAGAA
IRR : GGCAGTGTAAATATTTTTGTGTAAACTAATTTTCCTTCTATGTTAGTATT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TACCTCCTCATTAAAGACT | TTATGAGGAGGTAAAATGCT | 0 | |
TCCCCTTTTTAG | AAATGAAT | TTATGAGCTTTT | 8 |
AATCGTAACAAA | AAATAATA | ATAAAATAAAAA | 8 |
TTTTGCCTGTGT | TTTTATTT | TGCGGCATGCAA | 8 |
GGACAGAGGAGT | ATTTTAAC | AATTATCCCGTT | 8 |
ACTGTCCCCAAA | AATTTATC | CTATTTTGAGAC | 8 |
GCTGATTAATAA | GGTTAACT | GTTGAAATTGAT | 8 |
ACAGTTCCGTTA | CTTTTTTG | AGAGGGAACGCA | 8 |
GACTGCGTCTTG | CCTGACGG | CAACCCTTGACT | 8 |
AAATGATAAAG | TAGAAATCA | TTTTACACCAC | 9 |
TATAATTCTTTACAAGCTAA | AATATTTTTATAAAAGCCAT | 0 | |
TATAATTCTTTACAAGCTAA | AATATTTTTATAAAAGCCAT | 0 | |
GTATTAAGTGGG | ACTGAAAG/A | AGGCAAAACATA | 10 |
AGTCAAGGGTTAT | CGATAGA | TAAGACAAAGTCT | 7 |
GAGTTAATGCCG | CTTATTAT | TGAATATTTCAT | 8 |
DNA sequence
GAGATTGTAAAATATTTTATGTAAACTAATTGACTCCTTCTATGATAGAATAATAAAGTATAGGAGGAGTTTTTTATGGCAAGAAAAAGGATAATAACAC
CAGAAAAGAAAGAGCTTATCAGAAATCTCATTTCTGAGTACAACATTACTTCAGCAAAGGATTTGCAGGAAGCATTGAAGGATCTGCTCGGAGATACGAT
ACAAAATATGTTGGAAGCAGAGCTGGATGAACATCTCGGATATGAAAAGTACGAATCAACTGAAGAAGCGAAATCAAATTACCGTAACGGGTACACATCA
AAAACATTAAAGTCAAGTGTAGGGCAAGTGGAAATAGATATCCCGCGGGACCGGAATGCAGAATTCGAGCCGAAAATTGTTCCCAGGTATAAAAGGGACA
TTTCAGAAATTGAAAATAAAATAATAGCAATGTATGCGCGGGGGATGTCTACCAGAGAAATCAACGAGCAGATACAGGAAATCTACGGATTTGAAGTATC
TGCCGAGATGGTAAGTAAGATCACTGATAAAATACTACCTGAGATAGAAGAGTGGCAGAAAAGGCCTCTGGGAGAGGTTTATCCGATAGTATTTATTGAC
GCAATTCATTTTTCAGTAAAAAATGACGGCATTGTTGGGAAGAAGGCCGTATATATTGTGCTGGCGATTGATATAGAAGGGCAGAAAGATGTTATCGGTA
TTTATGTAGGAGAAAATGAGAGCTCAAAATTCTGGCTGAGTGTCTTAAATGACCTTAAAAACAGGGGTGTTAAAGACATTCTGATTCTCTGTGCTGATGC
ACTTTCAGGGATAAAGGATGCAATCAATGCGGCTTTTCCGAATACTGAATATCAGAGGTGTATAGTACACCAGATAAGAAACACGCTAAAGTATGTGTCA
GATAAAGACCGAAAGGAATTTGCCAGGGACTTGAAACGGATATATACGGCTCCGAATGAGAAGGCAGGGTACGACCAGATGCTTGAGGTTTCAGAGAAAT
GGGAGAAGAAATACCCGGCAGCTATGAAGAGCTGGAAGAGCAATTGGGATGTTATTTGTCCATTTTTTAAGTATTCGGAGGAACTACGTAAAATCATGTA
TACGACCAATACTATTGAGAGCCTGAATAGCAGTTATAGAAGGATAAACAAATCAAGGACAGTATTTCCTGGCGACCAGTCACTTTTAAAGAGCATATAT
TTAGCTACAGTGAAGATTACTTCAAAATGGACGATGCGTTACAAAAACTGGGGTTTGATACTGGGACAGCTACAGATTATGTTCGAAGGGCGTATATAGT
ATAATCAAGGGGTTCAGGGAAACTCTGGTTTCCTGAACCCCCCTGTAATATTTTATACGCATCATCTTTGTCAAATTTTTCTTTTGTAAGAATTTCTTAT
CAGAAATTCTTTTGAATCTATAAAATTTGACAAAGCAGACTTTCGAGTCTTAAAATATATTTAAATACTAACATAGAAGGAAAATTAGTTTACACAAAAA
TATTTACACTGCC
CAGAAAAGAAAGAGCTTATCAGAAATCTCATTTCTGAGTACAACATTACTTCAGCAAAGGATTTGCAGGAAGCATTGAAGGATCTGCTCGGAGATACGAT
ACAAAATATGTTGGAAGCAGAGCTGGATGAACATCTCGGATATGAAAAGTACGAATCAACTGAAGAAGCGAAATCAAATTACCGTAACGGGTACACATCA
AAAACATTAAAGTCAAGTGTAGGGCAAGTGGAAATAGATATCCCGCGGGACCGGAATGCAGAATTCGAGCCGAAAATTGTTCCCAGGTATAAAAGGGACA
TTTCAGAAATTGAAAATAAAATAATAGCAATGTATGCGCGGGGGATGTCTACCAGAGAAATCAACGAGCAGATACAGGAAATCTACGGATTTGAAGTATC
TGCCGAGATGGTAAGTAAGATCACTGATAAAATACTACCTGAGATAGAAGAGTGGCAGAAAAGGCCTCTGGGAGAGGTTTATCCGATAGTATTTATTGAC
GCAATTCATTTTTCAGTAAAAAATGACGGCATTGTTGGGAAGAAGGCCGTATATATTGTGCTGGCGATTGATATAGAAGGGCAGAAAGATGTTATCGGTA
TTTATGTAGGAGAAAATGAGAGCTCAAAATTCTGGCTGAGTGTCTTAAATGACCTTAAAAACAGGGGTGTTAAAGACATTCTGATTCTCTGTGCTGATGC
ACTTTCAGGGATAAAGGATGCAATCAATGCGGCTTTTCCGAATACTGAATATCAGAGGTGTATAGTACACCAGATAAGAAACACGCTAAAGTATGTGTCA
GATAAAGACCGAAAGGAATTTGCCAGGGACTTGAAACGGATATATACGGCTCCGAATGAGAAGGCAGGGTACGACCAGATGCTTGAGGTTTCAGAGAAAT
GGGAGAAGAAATACCCGGCAGCTATGAAGAGCTGGAAGAGCAATTGGGATGTTATTTGTCCATTTTTTAAGTATTCGGAGGAACTACGTAAAATCATGTA
TACGACCAATACTATTGAGAGCCTGAATAGCAGTTATAGAAGGATAAACAAATCAAGGACAGTATTTCCTGGCGACCAGTCACTTTTAAAGAGCATATAT
TTAGCTACAGTGAAGATTACTTCAAAATGGACGATGCGTTACAAAAACTGGGGTTTGATACTGGGACAGCTACAGATTATGTTCGAAGGGCGTATATAGT
ATAATCAAGGGGTTCAGGGAAACTCTGGTTTCCTGAACCCCCCTGTAATATTTTATACGCATCATCTTTGTCAAATTTTTCTTTTGTAAGAATTTCTTAT
CAGAAATTCTTTTGAATCTATAAAATTTGACAAAGCAGACTTTCGAGTCTTAAAATATATTTAAATACTAACATAGAAGGAAAATTAGTTTACACAAAAA
TATTTACACTGCC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1224 bp | 407 aa | 76 | 1299 | + | No |
Chemistry : DDE
ORF sequence :
MARKRIITPEKKELIRNLISEYNITSAKDLQEALKDLLGDTIQNMLEAELDEHLGYEKYESTEEAKSNYRNGYTSKTLKSSVGQVEIDIPRDRNAEFEPK
IVPRYKRDISEIENKIIAMYARGMSTREINEQIQEIYGFEVSAEMVSKITDKILPEIEEWQKRPLGEVYPIVFIDAIHFSVKNDGIVGKKAVYIVLAIDI
EGQKDVIGIYVGENESSKFWLSVLNDLKNRGVKDILILCADALSGIKDAINAAFPNTEYQRCIVHQIRNTLKYVSDKDRKEFARDLKRIYTAPNEKAGYD
QMLEVSEKWEKKYPAAMKSWKSNWDVICPFFKYSEELRKIMYTTNTIESLNSSYRRINKSRTVFPGDQSLLKSIYLATVKITSKWTMRYKNWGLILGQLQ
IMFEGRI
IVPRYKRDISEIENKIIAMYARGMSTREINEQIQEIYGFEVSAEMVSKITDKILPEIEEWQKRPLGEVYPIVFIDAIHFSVKNDGIVGKKAVYIVLAIDI
EGQKDVIGIYVGENESSKFWLSVLNDLKNRGVKDILILCADALSGIKDAINAAFPNTEYQRCIVHQIRNTLKYVSDKDRKEFARDLKRIYTAPNEKAGYD
QMLEVSEKWEKKYPAAMKSWKSNWDVICPFFKYSEELRKIMYTTNTIESLNSSYRRINKSRTVFPGDQSLLKSIYLATVKITSKWTMRYKNWGLILGQLQ
IMFEGRI
Blast result :
Comments
ISCth4 is 74% aa similar to ISDet4. There are 15 copies in the genome.
References
1] Miriam Land (2009) Direct submission
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.