ISCth11
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | ND | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 |
DNA section
IS Length : 2530 bp
Ends
IR Length : 28/30
IRL : GTAAGCGTCCAATAAGACAGTGTATAGAAATTCCCCCTGGACTATGAGAC
IRR : GTAAGCGTCTAATAACACGGTGTATAGAAAAAAAGCAAAAACTCATATAA
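The IRR above appears to be written as the reverse complement of the element's right end, so that it can be read in the same orientation as the IRL. The Python sketch below checks that reading against the last 50 nt of the DNA sequence in this record and counts position-by-position matches between the two repeats. The function names and the 30 nt window are illustrative choices, not part of the ISfinder record, and an ungapped count like this need not reproduce the 28/30 figure, which may come from a different alignment.

```python
# Complement table for unambiguous DNA bases only (assumption: no IUPAC codes).
COMPLEMENT = str.maketrans("ACGT", "TGCA")

IRL = "GTAAGCGTCCAATAAGACAGTGTATAGAAATTCCCCCTGGACTATGAGAC"
IRR = "GTAAGCGTCTAATAACACGGTGTATAGAAAAAAAGCAAAAACTCATATAA"

# Last 50 nt of the element: the final 20 nt of the penultimate sequence
# line plus the 30 nt of the last line, copied from this record.
RIGHT_END = "TTATATGAGTTTTTGCTTTT" + "TTTCTATACACCGTGTTATTAGACGCTTAC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


def ungapped_matches(a: str, b: str, window: int) -> int:
    """Count identical positions in the first `window` bases, without gaps."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# True if the listed IRR is the reverse complement of the right end.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR

# Identical positions over the first 30 nt of both repeats (hypothetical window).
matches_in_30 = ungapped_matches(IRL, IRR, 30)

print(irr_matches_right_end, matches_in_30)
```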
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCTTCACAAG | GTTTTTAC | AAACCGAGTC | 8 |
DNA sequence
GTAAGCGTCCAATAAGACAGTGTATAGAAATTCCCCCTGGACTATGAGACAATACTCTAAATCTAGATTTTTTAGAGTTTTTCGACTAACTTGAAAAACT
CCAAAGGAGGATTTAGAGTTGAATACCAGAGCAGTTACCCAAAAGTACAGATTAAATAAGTGGACCGACATCATACGTGAATGCCGGAGCAGTGGCCAGA
CAGTTACTGCATGGTGTACAGAACACGATATCAATCCAAAGAGTTACTACTACTGGTTGAAAAAAGTCCGATTAGCTGCATGTGAAGCCCTTACGCCCAT
AAATTCAGAAAATAGCCTAATAGTACCAATCGACATTTCAACGCAAACTACACACAGTAATTCCGAAATAAAGAGCATGTCATCTGATATAGTTATCCGC
ATCGGCTCAATAACTCTTGAACTAAGCAATAACGCATCTGCTGCGTTGATTGAGAATACACTGAGGGCGATACAGAATGTTAGGTGACATCTCCAAGGCT
GAAAAGATCTATATAGCCTGCGGGTATACAGACATGCGGAAAGCTATTGACGGACTGGCAGCTATTGTTCAACAGAACTTTCAGTTGAACCCATTCCAGA
ACAGTCTGTTTCTCTTTTGTGGCCGGCGGCGGGATCGGATGAAAGCGCTCTATTGGGAAGGCGACGGTTTTGTTCTCCTATACAAGCGCCTGGAAAACGG
AAAATTTCAATGGCCCATGAATGCTGATGCGGTTCGGGCACTCACCCCCCAGGAATTCCGTTGGCTGCTTGAAGGCCTCTCAATTGACCAGCCTAAAGCA
GTAAAAAAGATAGAGATAGGAGCCTGCATTTAACAGGAAAAAAATGTCGCCAGGCCGCATATTTACTGGATTTCCTGGCTATTTCGTGATATAATTGATA
TATCAAAAAACAGCGAGGAAACAGTATTTTATGAGCACAGCAGAGCAGATAGCAACCCTGGAAAACCGCATAAACGAGTTGGAGCTGGAAAACAAACGGC
TTCATGAAACAGTTGCTTATCTGACTCGTAAGCTGTACGGCAGAAGTTCTGAGAAGACATCAGCCCTTTCTGTGGGGCAGGTGTCTCTTTTTGATGAAGC
AGAGGTTTATGCTGTTCCGCAGGCACCGGAGCCTGATCTTAAAGAAGTACAGGGCTACATTAGAAGGAAGTACAAGGGCCAGAGGACTGATCTTTTAAAA
GACATCCCTCATGACAAACGTCTCTGTACACTTGCAGAAGAAGACCGCTATTGTGAGGCTTGCGGAACAGACCTCGTTTCTGTCGGAAAAGAATTCATCC
GCACTGAGATCGAATTCATTCCTGCTAAGATCCGGGTAATCGACTATTACCGTGAAACCTTTGAATGCCGTACCTGTCGCAAAAATGGAGAGCCATATAT
GGAAAAGTCGCCAATGCCATATCCTGTGATTCAGCATTCTATGGCATCTCCTTCTACTGTAGCATGGATTATGCATCAGAAGTTTGTAAACCATCTCCCT
CTTTACCGCCAGGAAAATGAGTGGAAGATGCTGGGTGTCAATTTAAAGCGGGAGACTATGTCCAACTGGATTCTGGCTGCAGCTCGTGACTGGCTGATGC
CATTGGTGGATTTGATGCATAAAAAACTCCTGCAGGAAAAATACCTGCATGCCGACGAAACCACGGTTCAGGTGCTAAATGAGGAAGGCCGGAGCAACAC
CACGAACTCATACATGTGGGTATACAGTAGCGGGAAGTACTGTAAAAAGCAGATCAGGCTCTTCCAGTACCAGCCCGGGCGTAATGGTAAATATCCTCAG
GAATTCCTTAAAGGGTTCAGTGGATTTCTACATACAGATGCTTACTCCGGGTATAAGAAAGTTCCGGAGATTACAAGGTGTATGTGTTGGACACATCTTC
GGCGATATTTCCGGGATGCACTTCCGAAAGATACCCAGAGTCCGGAAGCAACCATTCCAAGCCAGGGAATAAGATTCTGCAACAAGCTGTTTGAAATTGA
AGAGACTCTTGAAAAACTTACTCAGGAGCAGCGAAGATTGGAGCGTCTGAAACAGGAAACACCCGTTTTAGAGGCCTTTTGGTCGTGGGTTGATTCGGTT
AAAGACAAGGTCCTGCCAAAGTCTAAAATAGGTGAAGCCATTCAATATGCCCTGAATAACAAGGAAGACTTCATGAACTATCTTTTAGACGGTAACTGCT
CCATATCTAATAACCTCTCGGAGAACAGCATTCGTCCCTTTACCCTGGGAAGAAAAAACTGGCTGTTCAGCGGAAGCCCGAGAGGAGCGGATGCAAGCGC
TGCTGTTTATAGCATTGTCGAAAGTGCTAAGGCTAACGATATTAACCCATATAAATATCTTTATTACATCTTTAGCGAACTACCGGGTGTGCAGTTCGGC
CAGAATCCTGAATTCCTGGAAGATTATCTCCCATGGAGTCCCGATGTACAAGCCGCCTGTAAATAGTGCAAAAACTTATTTTATATGAGTTTTTGCTTTT
TTTCTATACACCGTGTTATTAGACGCTTAC
Protein section
ORF number : 3
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
369 bp | 122 aa | 119 | 487 | + | No |
AG : IS66 TnpA
ORF sequence :
MNTRAVTQKYRLNKWTDIIRECRSSGQTVTAWCTEHDINPKSYYYWLKKVRLAACEALTPINSENSLIVPIDISTQTTHSNSEIKSMSSDIVIRIGSITL
ELSNNASAALIENTLRAIQNVR
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
357 bp | 118 aa | 477 | 833 | + | No |
AG : IS66 TnpB
ORF sequence :
MLGDISKAEKIYIACGYTDMRKAIDGLAAIVQQNFQLNPFQNSLFLFCGRRRDRMKALYWEGDGFVLLYKRLENGKFQWPMNADAVRALTPQEFRWLLEG
LSIDQPKAVKKIEIGACI
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1536 bp | 511 aa | 931 | 2466 | + | No |
Chemistry : DDE
ORF sequence :
MSTAEQIATLENRINELELENKRLHETVAYLTRKLYGRSSEKTSALSVGQVSLFDEAEVYAVPQAPEPDLKEVQGYIRRKYKGQRTDLLKDIPHDKRLCT
LAEEDRYCEACGTDLVSVGKEFIRTEIEFIPAKIRVIDYYRETFECRTCRKNGEPYMEKSPMPYPVIQHSMASPSTVAWIMHQKFVNHLPLYRQENEWKM
LGVNLKRETMSNWILAAARDWLMPLVDLMHKKLLQEKYLHADETTVQVLNEEGRSNTTNSYMWVYSSGKYCKKQIRLFQYQPGRNGKYPQEFLKGFSGFL
HTDAYSGYKKVPEITRCMCWTHLRRYFRDALPKDTQSPEATIPSQGIRFCNKLFEIEETLEKLTQEQRRLERLKQETPVLEAFWSWVDSVKDKVLPKSKI
GEAIQYALNNKEDFMNYLLDGNCSISNNLSENSIRPFTLGRKNWLFSGSPRGADASAAVYSIVESAKANDINPYKYLYYIFSELPGVQFGQNPEFLEDYL
PWSPDVQAACK
Blast result :
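The three ORF tables can be cross-checked with simple coordinate arithmetic. The sketch below assumes that Begin and End are 1-based and inclusive, and that each DNA length includes the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The `Orf` type and the helper names are hypothetical.

```python
from typing import NamedTuple


class Orf(NamedTuple):
    name: str
    begin: int  # 1-based, inclusive (assumed)
    end: int    # 1-based, inclusive (assumed)
    aa: int     # protein length listed in the record


# Coordinates and protein lengths copied from the ORF tables.
ORFS = [
    Orf("ORF 1", 119, 487, 122),
    Orf("ORF 2", 477, 833, 118),
    Orf("ORF 3", 931, 2466, 511),
]


def length_bp(orf: Orf) -> int:
    """Span of the ORF in base pairs, counting both end positions."""
    return orf.end - orf.begin + 1


def residues_from_coords(orf: Orf) -> int:
    """Codons in the span minus one, assuming the stop codon is included."""
    return length_bp(orf) // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


for orf in ORFS:
    print(orf.name, length_bp(orf), residues_from_coords(orf))
print("ORF 1 / ORF 2 overlap:", overlap_bp(ORFS[0], ORFS[1]))
```

Under these assumptions the computed lengths agree with the tables, and the coordinates of ORF 1 and ORF 2 share a short stretch of positions, while ORF 2 and ORF 3 do not overlap.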
Comments
ISCth11 shows 59% amino acid similarity to ISSba7 (ORFA), 70% to ISVme1 (ORFB) and 57% to ISDpr4 (ORFC, the transposase). Only one copy is present in the genome.
References
1] ISfinder annotation (2009)
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.