ISCth3
- Family IS30
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | ND | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 |
DNA section
IS Length : 1386 bp
Ends
IR Length : 20/26
IRL : GGTTCGTTGTAAAATTAAATGCAACAGTAAAATGAAATAAAAAATTGACA
IRR : TGTAAATTGCAATATTAAATGCAACACCTATAGTGGAAATCATCCATAAT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTCTTTGAATTTTAACTAAA | TAGCGAAAAAAATTT | TTAAATTTTTATAA | 15 |
AATCAACAAAAAAATTG | AAATTATATTTTT | CCATGGTATTTAGCCTT | 13 |
TCGAAAATCAAGGATAA | AATAAAAATTAAT | TCATATTTTTGGTTTAT | 13 |
TTAATTAAATAAATATA | CATATTAATTATA | TTTTAAATACATATTTA | 13 |
GGATTAAAATGAAAAAA | AATATTTTTTTTA | AGAAGATTGGGATTATT | 13 |
CAAATTAATTTATTTAT | ATTTTTTTATAAA | AAAAATTTATAAAGAAA | 13 |
AAAGGCTGTGGTCCTT | TTTTTATTTTTACA | TCATAAACATTTGGGG | 14 |
AGCAGCCATTAAGTTTA | CAAAATTCCTTTT | TCAAATTTCAAGAAATT | 13 |
TAAAAAAGAAAATGTCA | TAAAGATATTTTT | CCATGCTTTCCATACAA | 13 |
AGCAAAATGCAACAAGA | TGAACAGGAAATT | TAATAAGCTACAAAAAA | 13 |
TTTCTCACAATAAAAA | TAAAAAAAATAAAA | TTAAGACACACTTGAA | 14 |
ATGATAACGGCAAGAAA | ATAGATTTCAAAT | GCTTTGTTGCGGGTACG | 13 |
CTAAAATTAAAAATATA | TTATAACATTAAA | CAATAATTAAAAAATTA | 13 |
TCAATTTCTTTATCA | TTTTTGCGAAATTTT | GATTAGTATGCTTTT | 15 |
TTTTAATTAATTTAATA | ATAATTTAATAAA | AAGAGTTGTACATGTAA | 13 |
ATATTACGGCATTTTATTTTACAACTTCCC | AAAGAAAATTGTTTGAATTTTTATTTTTTT | 0 | |
TTCAAGTTTTTCATAT | AATTCTTTAGTTTT | ATCAATAATAATATCT | 14 |
DNA sequence
GGTTCGTTGTAAAATTAAATGCAACAGTAAAATGAAATAAAAAATTGACAACCATAGTGAGAAATCATGTATGATTTAGGTGTCCAATCCAAACATACAG
GAGGTTCTCAACTATGGCTGTACAATATAAGTCTACCACAACTGAGCATAAGTTTAAACACTTAAGTGTTTATGAAAGAGGGCAGATTGCAGCTCTTTTA
AAAGAAGGAAAGAGTCAACGTTATATTGCTAATAAACTAGGTCGCTCGCCAAGTACAATTAGCCGTGAAATTAAAAGAGGGACAACAATGCAGATGAGAA
CTGATTTATCGACATACAAAGTATATTTTCCTGAAACAGGGCAGGCAGTTTATGAGAAAAATCGTATGAATTGCGGAGCAAAGCGTAAATTGGCTCAAGT
TGAAGATTTTCTTAAGTTTGCAGAAGATAAGATACTACGCGAAAAATGGTCTCCAGATGCAGTTGTTGGTTTATGTAGGAGAGACCCCAAGTGGCAAAAT
TCTACTATTGTATGTACCAAAACACTGTATAATTATATAGACCTGGGACTCATAAAAGTACGAAATATAGATTTAAATCTTAAACTACGTTTAAAATCTA
AAATAAAAAGGATACGTCAAAACAAACGGGTTGTAGGGAAAAGCATTGATCAAAGGCCGGAAGAAGTACAATCACGTCAAACCTTTGGGCATTGGGAAAT
TGATACGGTAACAGGCAAAAAGTCTAACGATTCAGTAATTTTAACCTTAACTGAACGAAAAACCCGCTACGAGTTATTGTTTCTTTTGGACGCAAAAGAC
AGTAATACTGTTAACGAGGCACTTTCAGAACTTAAGAATTGTTATGGTAAGGATGTTTCAAATGTATTTCGCACTATAACGGCAGACAATGGTTCTGAAT
TTAGTAGACTATCCGAAATGTTACAAGGGCTAGGAATTGAAGCTTATTTCACTCATCCTTATTCCTCATGGGAGAGAGGAACTAATGAACGTCATAATGG
ACTTATTAGGCGTTTTATTCCTAAAGGAAAGGCTATAAAAGATTTTTCTGAAGAAACGATAAAACGGATACAACAATGGTTAAACAGCCTTCCACGAAGG
ATATTAGGTTACAAAACACCTGAAGAATGTTTTAATGAAGAGATACATAACCTGGTAAACAAAAATATATCAGCAATAGCCTGAGCCCTTCATCCAAGAT
ATTTTAAAGCTCATTTGAGGGTAGTCAAGGGTAAAGACTTTGTCTTATCTATCGATAACCCTTGACTACCCTCTGCACTCGCTCAAGATAGTAGTTAAGA
AGGGCTAAAAGATGATGTTGTACTTTAACACCTAAAATTATGGATGATTTCCACTATAGGTGTTGCATTTAATATTGCAATTTACA
GAGGTTCTCAACTATGGCTGTACAATATAAGTCTACCACAACTGAGCATAAGTTTAAACACTTAAGTGTTTATGAAAGAGGGCAGATTGCAGCTCTTTTA
AAAGAAGGAAAGAGTCAACGTTATATTGCTAATAAACTAGGTCGCTCGCCAAGTACAATTAGCCGTGAAATTAAAAGAGGGACAACAATGCAGATGAGAA
CTGATTTATCGACATACAAAGTATATTTTCCTGAAACAGGGCAGGCAGTTTATGAGAAAAATCGTATGAATTGCGGAGCAAAGCGTAAATTGGCTCAAGT
TGAAGATTTTCTTAAGTTTGCAGAAGATAAGATACTACGCGAAAAATGGTCTCCAGATGCAGTTGTTGGTTTATGTAGGAGAGACCCCAAGTGGCAAAAT
TCTACTATTGTATGTACCAAAACACTGTATAATTATATAGACCTGGGACTCATAAAAGTACGAAATATAGATTTAAATCTTAAACTACGTTTAAAATCTA
AAATAAAAAGGATACGTCAAAACAAACGGGTTGTAGGGAAAAGCATTGATCAAAGGCCGGAAGAAGTACAATCACGTCAAACCTTTGGGCATTGGGAAAT
TGATACGGTAACAGGCAAAAAGTCTAACGATTCAGTAATTTTAACCTTAACTGAACGAAAAACCCGCTACGAGTTATTGTTTCTTTTGGACGCAAAAGAC
AGTAATACTGTTAACGAGGCACTTTCAGAACTTAAGAATTGTTATGGTAAGGATGTTTCAAATGTATTTCGCACTATAACGGCAGACAATGGTTCTGAAT
TTAGTAGACTATCCGAAATGTTACAAGGGCTAGGAATTGAAGCTTATTTCACTCATCCTTATTCCTCATGGGAGAGAGGAACTAATGAACGTCATAATGG
ACTTATTAGGCGTTTTATTCCTAAAGGAAAGGCTATAAAAGATTTTTCTGAAGAAACGATAAAACGGATACAACAATGGTTAAACAGCCTTCCACGAAGG
ATATTAGGTTACAAAACACCTGAAGAATGTTTTAATGAAGAGATACATAACCTGGTAAACAAAAATATATCAGCAATAGCCTGAGCCCTTCATCCAAGAT
ATTTTAAAGCTCATTTGAGGGTAGTCAAGGGTAAAGACTTTGTCTTATCTATCGATAACCCTTGACTACCCTCTGCACTCGCTCAAGATAGTAGTTAAGA
AGGGCTAAAAGATGATGTTGTACTTTAACACCTAAAATTATGGATGATTTCCACTATAGGTGTTGCATTTAATATTGCAATTTACA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1071 bp | 356 aa | 114 | 1184 | + | No |
Chemistry : DDE
ORF sequence :
MAVQYKSTTTEHKFKHLSVYERGQIAALLKEGKSQRYIANKLGRSPSTISREIKRGTTMQMRTDLSTYKVYFPETGQAVYEKNRMNCGAKRKLAQVEDFL
KFAEDKILREKWSPDAVVGLCRRDPKWQNSTIVCTKTLYNYIDLGLIKVRNIDLNLKLRLKSKIKRIRQNKRVVGKSIDQRPEEVQSRQTFGHWEIDTVT
GKKSNDSVILTLTERKTRYELLFLLDAKDSNTVNEALSELKNCYGKDVSNVFRTITADNGSEFSRLSEMLQGLGIEVYFTHPYSSWERGTNERHNGLIRR
FIPKGKAIKDFSEETIKRIQQWLNSLPRRILGYKTPEECFNEEIHNLVNKNISAIA
KFAEDKILREKWSPDAVVGLCRRDPKWQNSTIVCTKTLYNYIDLGLIKVRNIDLNLKLRLKSKIKRIRQNKRVVGKSIDQRPEEVQSRQTFGHWEIDTVT
GKKSNDSVILTLTERKTRYELLFLLDAKDSNTVNEALSELKNCYGKDVSNVFRTITADNGSEFSRLSEMLQGLGIEVYFTHPYSSWERGTNERHNGLIRR
FIPKGKAIKDFSEETIKRIQQWLNSLPRRILGYKTPEECFNEEIHNLVNKNISAIA
Blast result :
Comments
ISCth3 is 67% aa similar to IS1470. There are 18 copies in the genome.
References
1] Miriam Land (2009) Direct submission
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.