ISCth5
- Family IS256
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | ND | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 |
DNA section
IS Length : 1395 bp
Ends
IR Length : 18/26
IRL : GTTATAGTCTCAAGTAGTGGTGTAAATTTCGTGGAGGCTAATTGACAAAA
IRR : GATAGTGTCTTGTCAAGTGGTGTAAAATGATTTCTACTTTATCATTTTTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GACAGTAATGATTATATAAG | CTTATAGGAGGGAAATAGTG | 0 | |
ATTTAAGAAGTA | AATTGTAC/T | TGTACAATTTGT | 10 |
AAGAATTGGCA | ACAA/-GAAAA | TTCTATGTAAAG | 11 |
TGCTGCATCCGGCTCTGCTT | CAAATTTGCTATACTAATCC | 0 | |
TTATTCTCAATA | CAATATTG | TCTTTCCAAAAA | 8 |
CACTTCTTTATC | CAACATTG | ATTTCAGAACCT | 8 |
ATATATTAAGAAGAGATATA | TGGAGAGAATTTAGATAATA | 0 | |
TGACAAATTTGACTCTCTGA | ATAATGTATCACCAATATCG | 0 | |
TGAGATAATACT | CTCAACTC | AATTACATGTTT | 8 |
TAGGGGGGTAGAT | GTTTCTG | AATGAAAAAGAA | 7 |
DNA sequence
GTTATAGTCTCAAGTAGTGGTGTAAATTTCGTGGAGGCTAATTGACAAAAGTAACCACGATATGATTTCTACTTTATTAGGGAAACAAAACAAAACAAAT
AAAGTAGAGGAGGTCATACCGTGGCTACTAATAATAGAATGGCACTTTTAGAACAACTTAGCAAGTATGTTGTTGAAAAAGATAAAGATTTTTTAAAAGA
AGCATTAACATTACTCATTAATGCCCTAATGGATGCGGAAGTTACATCAATAATAGGTGCTGAAAAGTATGAAAGAAATAATAATAGAAACAACTATCGC
AATGGATATCGTCTAAGAGAATGGGATACTCGAGTAGGAACATTACAGTTAAGCATTCCCAAGTTACGTCACGGAAGTTATTTTCCAAGTCTTTTAGAAC
CGAGGAAAATGTCAGAGAAAGCATTATTGAATGTAGTTCAGGAAGCCTATGTTCATGGAGTAAGTACCAGGAAGGTGGATGAACTTGTAGAAGCTCTTGG
AATGAAAGGGATTGATAAAAGCGAAGTATCAAGAATCAGTAAGCAACTGGATGAATTTGTAGAAGAATTTAAAAACCGTAGACTGGAAGGAGAATATCCT
TACCTTTGGCTTGATGCCACTTTCCCCAAGGTTCGGGAAGGAGGCAGGGTATGCAGTATGGCACTAGTTATAGCAGTAGGAGTTAATCAACAAGGTGAAC
GGGAAATATTAGGTTTTGATGTAGGGATGAGTGAAGACGGGGCTTTTTGGGAGGAGTTTTTAAGAAGGCTGGTAGCAAGGGGTCTAAAAGGTGTAAGGCT
TGTAATCAGTGATGCACATGAAGGGCTGAAGGCTGCAATAAAGAAGATTTTAACGGGAAGTGCATGGCAAAGATGCCGTGTACATTTTATGAGAAACGTA
TTAAGCCAGGTACCAAAGCATTATCAGGGAATGGTATCATCGATAATACGGACAATATTTGCCCAGAATGATCAGGAATCTGCGAGGGAACAGTTAAGGC
ATGTAGTAGATGAGCTTAAAAATCGTTTTCCAAAAGCAATGAAAATTCTTGAAGAAGCAGAAGAAGAAATCCTGGCATATATGGCTTTTCCCCGTGAGCA
TTGGGCACAGATACACTCCACCAATCCTCTTGAGAGACTTAACCGGGAAATTCGCCGTCGAACGGATGTTGTTTGCATATTTCCAAATCGTGAGGCGGTA
ATCCGATTGGTAGGAGCAATGCTCATGGAACAAAATGATGAATGGAAAGTAGGGCGGCGCTATTTCAGTCTGGAATCAATGTCAAAGATTACATCGATAA
ATGAATTTACATTGACACCAGTAGCTTTATTACATAAATGAGGTGAAAAAATGATAAAGTAGAAATCATTTTACACCACTTGACAAGACACTATC
AAAGTAGAGGAGGTCATACCGTGGCTACTAATAATAGAATGGCACTTTTAGAACAACTTAGCAAGTATGTTGTTGAAAAAGATAAAGATTTTTTAAAAGA
AGCATTAACATTACTCATTAATGCCCTAATGGATGCGGAAGTTACATCAATAATAGGTGCTGAAAAGTATGAAAGAAATAATAATAGAAACAACTATCGC
AATGGATATCGTCTAAGAGAATGGGATACTCGAGTAGGAACATTACAGTTAAGCATTCCCAAGTTACGTCACGGAAGTTATTTTCCAAGTCTTTTAGAAC
CGAGGAAAATGTCAGAGAAAGCATTATTGAATGTAGTTCAGGAAGCCTATGTTCATGGAGTAAGTACCAGGAAGGTGGATGAACTTGTAGAAGCTCTTGG
AATGAAAGGGATTGATAAAAGCGAAGTATCAAGAATCAGTAAGCAACTGGATGAATTTGTAGAAGAATTTAAAAACCGTAGACTGGAAGGAGAATATCCT
TACCTTTGGCTTGATGCCACTTTCCCCAAGGTTCGGGAAGGAGGCAGGGTATGCAGTATGGCACTAGTTATAGCAGTAGGAGTTAATCAACAAGGTGAAC
GGGAAATATTAGGTTTTGATGTAGGGATGAGTGAAGACGGGGCTTTTTGGGAGGAGTTTTTAAGAAGGCTGGTAGCAAGGGGTCTAAAAGGTGTAAGGCT
TGTAATCAGTGATGCACATGAAGGGCTGAAGGCTGCAATAAAGAAGATTTTAACGGGAAGTGCATGGCAAAGATGCCGTGTACATTTTATGAGAAACGTA
TTAAGCCAGGTACCAAAGCATTATCAGGGAATGGTATCATCGATAATACGGACAATATTTGCCCAGAATGATCAGGAATCTGCGAGGGAACAGTTAAGGC
ATGTAGTAGATGAGCTTAAAAATCGTTTTCCAAAAGCAATGAAAATTCTTGAAGAAGCAGAAGAAGAAATCCTGGCATATATGGCTTTTCCCCGTGAGCA
TTGGGCACAGATACACTCCACCAATCCTCTTGAGAGACTTAACCGGGAAATTCGCCGTCGAACGGATGTTGTTTGCATATTTCCAAATCGTGAGGCGGTA
ATCCGATTGGTAGGAGCAATGCTCATGGAACAAAATGATGAATGGAAAGTAGGGCGGCGCTATTTCAGTCTGGAATCAATGTCAAAGATTACATCGATAA
ATGAATTTACATTGACACCAGTAGCTTTATTACATAAATGAGGTGAAAAAATGATAAAGTAGAAATCATTTTACACCACTTGACAAGACACTATC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1221 bp | 406 aa | 121 | 1341 | + | No |
Chemistry : DDE
ORF sequence :
MATNNRMALLEQLSKYVVEKDKDFLKEALTLLINALMDAEVTSIIGAEKYERNNNRNNYRNGYRLREWDTRVGTLQLSIPKLRHGSYFPSLLEPRKMSEK
ALLNVVQEAYVHGVSTRKVDELVEALGMKGIDKSEVSRISKQLDEFVEEFKNRRLEGEYPYLWLDATFPKVREGGRVCSMALVIAVGVNQQGEREILGFD
VGMSEDGAFWEEFLRRLVARGLKGVRLVISDAHEGLKAAIKKILTGSAWQRCRVHFMRNVLSQVPKHYQGMVSSIIRTIFAQNDQESAREQLRHVVDELK
NRFPKAMKILEEAEEEILAYMAFPREHWAQIHSTNPLERLNREIRRRTDVVCIFPNREAVIRLVGAMLMEQNDEWKVGRRYFSLESMSKITSINEFTLTP
VALLHK
ALLNVVQEAYVHGVSTRKVDELVEALGMKGIDKSEVSRISKQLDEFVEEFKNRRLEGEYPYLWLDATFPKVREGGRVCSMALVIAVGVNQQGEREILGFD
VGMSEDGAFWEEFLRRLVARGLKGVRLVISDAHEGLKAAIKKILTGSAWQRCRVHFMRNVLSQVPKHYQGMVSSIIRTIFAQNDQESAREQLRHVVDELK
NRFPKAMKILEEAEEEILAYMAFPREHWAQIHSTNPLERLNREIRRRTDVVCIFPNREAVIRLVGAMLMEQNDEWKVGRRYFSLESMSKITSINEFTLTP
VALLHK
Blast result :
Comments
ISCth5 is 71% aa similar to ISGdi8. There are 10 copies in the genome.
References
1] Miriam Land (2009) Direct submission
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission GenBank.