ISCth12
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | ND | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 |
DNA section
IS Length : 2497 bp
Ends
IR Length : 25/38
IRL : TATTTATTGCAAAATAGAAACGGACAAAGTTGCAAAAAAAATTCCCCAGT
IRR : TGTTTTTTGCAAGTGAGATCTCCCCACTTTTGCAAAATTTTCCCCAGGCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTTTCATTTAAAA | GAAATATG | TGGTAATATGTA | 8 |
ATTATTTTTCTGC | TTTCAA | TAATACTTTTTAAT | 6 |
DNA sequence
TATTTATTGCAAAATAGAAACGGACAAAGTTGCAAAAAAAATTCCCCAGTTTTGCAACTATGAATTCCCCACTATTGCAAAAGGCCATTGAAGGCACCAC
ACCAAAATAATATCCTTGTGATTGATCGACCAAATCAGTTACAAGGAGGATTGGAGGATGCTATCAATGACCCAAATCAAGGATATCAGAAAAATGTATT
TTGAGGAAGGGAAAAATATCAGCCAGATAGCCAGAGAAACCGGCCATGATCGCAAAACGGTGAGAGCATATCTTGACAAAGTGGACTGGAACCAGAAGCC
ACCGAAAGTGAAGAAGGAAACAGCCTTTCCGAAACTTAATCCATACAAGGATGACATTGACACATGGCTAAACGAGGATAAAAAGGCCAGGCGCAAGCAA
AGACATACAGCAAAACGAATATACAACCGGCTGGTGGAAAAGTACGGAGAACGCTTCAACTGTTCCTACAGGACCGTAGCAGGATATGTAGCTGTGAAGA
AAAAAGAGATATTCAACGCAAGGGAAGGATTCCTGCCTTTAGAGCACGTACCAGGTGAAGCCCAGGCAGACTTTGGCGATGCTGACTTTTATGAAAATGG
CAGGCACTACAGGGGTAAAAGTCTGACTTTATCATTTCCCCACAGCAACAAAGGATATACCCAGCTATTCAAGGGAGAGAACCAGGAATGCCTGTTCGAG
GGCTTGAAGGCGATATTTGAGCACATAGGTGGAGTGCCGCCAAGGATATGGTTTGATAATGCCAGCACCATAGTAGCTAAGGTAATAAAGGGCGGAGGCA
GGAACCTGACAGATGATTTCATGCGTTTCATGGAGCATTACCGTTTCAAAGCAGTATTCTGCAATGTAGATGCCGGGCATGAAAAAGGCAATGTGGAGAA
CAAGGTCGGCTATCACAGGAGAAACATGCTGGTGCCGGTACCACGTTTTGAAGACATTAGTGAATTCAACAAAGAACTCCTGATTAGGTGTGAAGAAGAT
GCCAAAAGGCAGCATTACCGAAAGAACGGTACGATCGAAGAACTATACAGGGATGATAAGGCAGCCCTGCTGGAGCTGCCCAAGACAACTTTTGATACAA
GCAAATACATAACAGTGAAGACAAACGGATATGGCAAATTTCTGCTCAACAAAGGCCTGCACGAATATTCCTCAGCGCCAAAATTCGCAAACAAATATGT
ACTGGTCAGGCTGACTGCCTTTCATGTAACAGTGCTTGACGAAAGCCATCGGGAGATAGTGCGTCATGAGAGACTCTACGGCGACTACAAGCAGCAAAGC
ATGCAATGGCTGCCATATCTGACTCAGCTGGCACGGCGACCGGGGGCATTGAAATACACAGGTATATATCAGATGCTGCCACAGCCTGTGAAAGAATACA
TGGAAGAGCTAAGCAAGCAAGACAGAGGGAAAGTATTAAGAGTAATTGCTGATCTGACACAGAAGAGCAGCTTCGAAAAGGCCATTAAGACTGTCAGTAC
TGCCCTGTCCTATGGTGCTGCCGATGTGGACAGCCTGATAAATCTGCACAGATATTTGTATGAAAAAGTGCTGCAGCTGGAGCCGATACATTTGCCCGAG
CATATACCTCACTTAAACAGATATGTGCCTGATTTTATGGCATATGACAGAAGTCTCAAGGCAGGTGAAGAAAAATGCTGACATCTGATATTGCAGCATG
TTGCAGAAGGCTTCGCCTCAGCCGGAACATAGTGGAGATGTCAGGTAAAATACAGGCAGTCAGCCACCAGGAATACCTGCTTAAACTTCTTCAATCAGAG
ATCCGGCATCGTGAAGAACTGAAGAAAGACAAACTGCTGAAAAAAGCAGGTTTCTATACCATAAAGACATTTGAAAGCTTCCGGTTTGATGAAGTAAAGC
TGCCCAGTGGCGTTACTCCGGAATATCTTAAAGAGTGCGAGTTTATCGAAAACAAACACAATATCGTCATGTACGGCAATGTGGGCACAGGAAAGACGCA
TCTTTCAATTGCTTTAGGTGTAGAGGCCTGCAAGAAGGGATTGGAGGTGAGATTTTTCAGGACATCAGCGCTCGTGAACAGGCTGGCGGAACAAAAGAAA
GCCGGAACATTGTCAGGCTTCTTGAAGGACCTGAATAAAGCAGATCTTTTGATCTGTGACGAATGGGGCTACGTCCCCCTCGACCGTATAGGAGCGCAGT
TATTGTTTGAGGTCATATCCGAGTGTTATGAACGCAAATCGGTGATCATCAATACCAACATAGAATTTTCAAGGTGGGTGAACGTGTTCTATGACGAGCA
GATGACAGGCGCCATTATCGACCGGCTACTTCATCACTGTCACTTGCTACTGTTTCCCGGCCAGAGCAACAGGATGCGCGAAGCTGTACTAAACACATAA
AAAACTTACGGGAAAGTCACTGGGTTTACATAAAAATTCCCCACCGGTGCCTGGGGAAAATTTTGCAAAAGTGGGGAGATCTCACTTGCAAAAAACA
ACCAAAATAATATCCTTGTGATTGATCGACCAAATCAGTTACAAGGAGGATTGGAGGATGCTATCAATGACCCAAATCAAGGATATCAGAAAAATGTATT
TTGAGGAAGGGAAAAATATCAGCCAGATAGCCAGAGAAACCGGCCATGATCGCAAAACGGTGAGAGCATATCTTGACAAAGTGGACTGGAACCAGAAGCC
ACCGAAAGTGAAGAAGGAAACAGCCTTTCCGAAACTTAATCCATACAAGGATGACATTGACACATGGCTAAACGAGGATAAAAAGGCCAGGCGCAAGCAA
AGACATACAGCAAAACGAATATACAACCGGCTGGTGGAAAAGTACGGAGAACGCTTCAACTGTTCCTACAGGACCGTAGCAGGATATGTAGCTGTGAAGA
AAAAAGAGATATTCAACGCAAGGGAAGGATTCCTGCCTTTAGAGCACGTACCAGGTGAAGCCCAGGCAGACTTTGGCGATGCTGACTTTTATGAAAATGG
CAGGCACTACAGGGGTAAAAGTCTGACTTTATCATTTCCCCACAGCAACAAAGGATATACCCAGCTATTCAAGGGAGAGAACCAGGAATGCCTGTTCGAG
GGCTTGAAGGCGATATTTGAGCACATAGGTGGAGTGCCGCCAAGGATATGGTTTGATAATGCCAGCACCATAGTAGCTAAGGTAATAAAGGGCGGAGGCA
GGAACCTGACAGATGATTTCATGCGTTTCATGGAGCATTACCGTTTCAAAGCAGTATTCTGCAATGTAGATGCCGGGCATGAAAAAGGCAATGTGGAGAA
CAAGGTCGGCTATCACAGGAGAAACATGCTGGTGCCGGTACCACGTTTTGAAGACATTAGTGAATTCAACAAAGAACTCCTGATTAGGTGTGAAGAAGAT
GCCAAAAGGCAGCATTACCGAAAGAACGGTACGATCGAAGAACTATACAGGGATGATAAGGCAGCCCTGCTGGAGCTGCCCAAGACAACTTTTGATACAA
GCAAATACATAACAGTGAAGACAAACGGATATGGCAAATTTCTGCTCAACAAAGGCCTGCACGAATATTCCTCAGCGCCAAAATTCGCAAACAAATATGT
ACTGGTCAGGCTGACTGCCTTTCATGTAACAGTGCTTGACGAAAGCCATCGGGAGATAGTGCGTCATGAGAGACTCTACGGCGACTACAAGCAGCAAAGC
ATGCAATGGCTGCCATATCTGACTCAGCTGGCACGGCGACCGGGGGCATTGAAATACACAGGTATATATCAGATGCTGCCACAGCCTGTGAAAGAATACA
TGGAAGAGCTAAGCAAGCAAGACAGAGGGAAAGTATTAAGAGTAATTGCTGATCTGACACAGAAGAGCAGCTTCGAAAAGGCCATTAAGACTGTCAGTAC
TGCCCTGTCCTATGGTGCTGCCGATGTGGACAGCCTGATAAATCTGCACAGATATTTGTATGAAAAAGTGCTGCAGCTGGAGCCGATACATTTGCCCGAG
CATATACCTCACTTAAACAGATATGTGCCTGATTTTATGGCATATGACAGAAGTCTCAAGGCAGGTGAAGAAAAATGCTGACATCTGATATTGCAGCATG
TTGCAGAAGGCTTCGCCTCAGCCGGAACATAGTGGAGATGTCAGGTAAAATACAGGCAGTCAGCCACCAGGAATACCTGCTTAAACTTCTTCAATCAGAG
ATCCGGCATCGTGAAGAACTGAAGAAAGACAAACTGCTGAAAAAAGCAGGTTTCTATACCATAAAGACATTTGAAAGCTTCCGGTTTGATGAAGTAAAGC
TGCCCAGTGGCGTTACTCCGGAATATCTTAAAGAGTGCGAGTTTATCGAAAACAAACACAATATCGTCATGTACGGCAATGTGGGCACAGGAAAGACGCA
TCTTTCAATTGCTTTAGGTGTAGAGGCCTGCAAGAAGGGATTGGAGGTGAGATTTTTCAGGACATCAGCGCTCGTGAACAGGCTGGCGGAACAAAAGAAA
GCCGGAACATTGTCAGGCTTCTTGAAGGACCTGAATAAAGCAGATCTTTTGATCTGTGACGAATGGGGCTACGTCCCCCTCGACCGTATAGGAGCGCAGT
TATTGTTTGAGGTCATATCCGAGTGTTATGAACGCAAATCGGTGATCATCAATACCAACATAGAATTTTCAAGGTGGGTGAACGTGTTCTATGACGAGCA
GATGACAGGCGCCATTATCGACCGGCTACTTCATCACTGTCACTTGCTACTGTTTCCCGGCCAGAGCAACAGGATGCGCGAAGCTGTACTAAACACATAA
AAAACTTACGGGAAAGTCACTGGGTTTACATAAAAATTCCCCACCGGTGCCTGGGGAAAATTTTGCAAAAGTGGGGAGATCTCACTTGCAAAAAACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1515 bp | 504 aa | 167 | 1681 | + | No |
Chemistry : DDE
ORF sequence :
MTQIKDIRKMYFEEGKNISQIARETGHDRKTVRAYLDKVDWNQKPPKVKKETAFPKLNPYKDDIDTWLNEDKKARRKQRHTAKRIYNRLVEKYGERFNCS
YRTVAGYVAVKKKEIFNAREGFLPLEHVPGEAQADFGDADFYENGRHYRGKSLTLSFPHSNKGYTQLFKGENQECLFEGLKAIFEHIGGVPPRIWFDNAS
TIVAKVIKGGGRNLTDDFMRFMEHYRFKAVFCNVDAGHEKGNVENKVGYHRRNMLVPVPRFEDISEFNKELLIRCEEDAKRQHYRKNGTIEELYRDDKAA
LLELPKTTFDTSKYITVKTNGYGKFLLNKGLHEYSSAPKFANKYVLVRLTAFHVTVLDESHREIVRHERLYGDYKQQSMQWLPYLTQLARRPGALKYTGI
YQMLPQPVKEYMEELSKQDRGKVLRVIADLTQKSSFEKAIKTVSTALSYGAADVDSLINLHRYLYEKVLQLEPIHLPEHIPHLNRYVPDFMAYDRSLKAG
EEKC
YRTVAGYVAVKKKEIFNAREGFLPLEHVPGEAQADFGDADFYENGRHYRGKSLTLSFPHSNKGYTQLFKGENQECLFEGLKAIFEHIGGVPPRIWFDNAS
TIVAKVIKGGGRNLTDDFMRFMEHYRFKAVFCNVDAGHEKGNVENKVGYHRRNMLVPVPRFEDISEFNKELLIRCEEDAKRQHYRKNGTIEELYRDDKAA
LLELPKTTFDTSKYITVKTNGYGKFLLNKGLHEYSSAPKFANKYVLVRLTAFHVTVLDESHREIVRHERLYGDYKQQSMQWLPYLTQLARRPGALKYTGI
YQMLPQPVKEYMEELSKQDRGKVLRVIADLTQKSSFEKAIKTVSTALSYGAADVDSLINLHRYLYEKVLQLEPIHLPEHIPHLNRYVPDFMAYDRSLKAG
EEKC
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
726 bp | 241 aa | 1675 | 2400 | + | No |
AG : IS21 helper
ORF sequence :
MLTSDIAACCRRLRLSRNIVEMSGKIQAVSHQEYLLKLLQSEIRHREELKKDKLLKKAGFYTIKTFESFRFDEVKLPSGVTPEYLKECEFIENKHNIVMY
GNVGTGKTHLSIALGVEACKKGLEVRFFRTSALVNRLAEQKKAGTLSGFLKDLNKADLLICDEWGYVPLDRIGAQLLFEVISECYERKSVIINTNIEFSR
WVNVFYDEQMTGAIIDRLLHHCHLLLFPGQSNRMREAVLNT
GNVGTGKTHLSIALGVEACKKGLEVRFFRTSALVNRLAEQKKAGTLSGFLKDLNKADLLICDEWGYVPLDRIGAQLLFEVISECYERKSVIINTNIEFSR
WVNVFYDEQMTGAIIDRLLHHCHLLLFPGQSNRMREAVLNT
Blast result :
Comments
ISCth12 is 55% (ORFA, the transposase) and 67% (istB) aa similar to ISBlo2.
References
1] ISfinder annotation (2009)
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.