ISCth9
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009012 | ND | Clostridium thermocellum | Clostridium thermocellum ATCC 27405 |
DNA section
IS Length : 2419 bp
Ends
IR Length : 36/50
IRL : TGTTAATGCTACTCTAAAAATGTACCATATACCGGCAAAAAAATAGGCCA
IRR : TGTTAATGTCAATCAAAATTTAGGCCACTTACCGGGGTAAAATTAGGCCA
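The two terminal repeats listed above can be compared position by position. The following is a minimal sketch, assuming both repeats are written outer end first (as the shared TGTTAATG start suggests) and that no gaps are allowed; the helper names `ungapped_identity` and `shared_prefix_length` are illustrative and not part of the record. An ungapped count need not reproduce the recorded 36/50 if that figure was derived from a gapped alignment, which the record does not specify.

```python
# Terminal inverted repeats copied from the record.
IRL = "TGTTAATGCTACTCTAAAAATGTACCATATACCGGCAAAAAAATAGGCCA"
IRR = "TGTTAATGTCAATCAAAATTTAGGCCACTTACCGGGGTAAAATTAGGCCA"


def ungapped_identity(a: str, b: str) -> tuple[int, int]:
    """Return (identical positions, compared positions) with no gaps allowed."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches, len(a)


def shared_prefix_length(a: str, b: str) -> int:
    """Length of the uninterrupted identical run starting at the outer end."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


matches, total = ungapped_identity(IRL, IRR)
print(f"ungapped identity: {matches}/{total}")
print(f"identical terminal run: {shared_prefix_length(IRL, IRR)} bp")
```

Counting without gaps keeps the comparison transparent; a gapped aligner would be needed to explore alternative pairings of the internal positions.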
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AAGGTTGTCTTGAT | TACCTC | TCCATTGATTGTCA | 6 |
GAAGGTACGGTAAA | GGTTTT | TCTGTACCATCCGA | 6 |
TTCGTTTTCAAGGT | AAGTAA | TGTGATTTAACAAC | 6 |
AGCGTGTTTCTTAT | CTGGTG | TACTATACACCTCT | 6 |
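The insertion-site table can be checked for internal consistency, that is, whether each direct repeat string has the length given in the DR Length column. This is a small sketch with the four rows transcribed by hand; `check_sites` is a hypothetical helper name.

```python
# Rows copied from the insertion-site table:
# (left flank, direct repeat, right flank, stated DR length)
SITES = [
    ("AAGGTTGTCTTGAT", "TACCTC", "TCCATTGATTGTCA", 6),
    ("GAAGGTACGGTAAA", "GGTTTT", "TCTGTACCATCCGA", 6),
    ("TTCGTTTTCAAGGT", "AAGTAA", "TGTGATTTAACAAC", 6),
    ("AGCGTGTTTCTTAT", "CTGGTG", "TACTATACACCTCT", 6),
]


def check_sites(sites):
    """Return rows whose direct-repeat string disagrees with the stated length."""
    return [row for row in sites if len(row[1]) != row[3]]


inconsistent = check_sites(SITES)
distinct_repeats = {dr for _, dr, _, _ in SITES}

print("inconsistent rows:", inconsistent)
print("distinct direct repeats:", len(distinct_repeats))
```

The four repeats differ from one another, so each row describes a separate insertion site.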
DNA sequence
TGTTAATGCTACTCTAAAAATGTACCATATACCGGCAAAAAAATAGGCCACCTTAATCAAATTTCCTGTATGATTTTGTTGAACAGACACTATCGTACAG
GAGGAAAGGATATGATTAAGATGGCTCAATTAGAGGATATCAGAAAAATGTACTTCATGGAAGGCTTAAGTATCAGGGAAATTAACAGGAGGACTGGGAT
ACATAGGGATACAATCTCAAAATATATTTCGCTGGAGGAACCAAAACCACCTAAGTACAAGTTGACAAAGGAAAGAACGCATCCGGTATTAGGGCCGTAC
ATACCAATGATCAAACAGATAATAGAAGATGATAAAACCAGACACCGCAAACAACGCCATACAGGGACAAAAATATTTGAGACACTTAAAAAAGAAGGCT
TTTCAGGCGGCTACAACACTGTAATGGATTACCTGAGAAAGGAATACCGAAAACAAAGGGAAGCTTTCCTGCCACTGGAGTTCGAGTTGGGAGCATATGC
AGAAGTAGATTGGACAGAAGCATATTTTTATCTAAAAGGCAAAGAAACCAAGGCACATTTGTTTGTAATGAAGTTGAGAGGATCAGGCGGATTCTACGTA
AGAGCATACCCTTTTGAGAAACAGGAGGCGTTCTTTGATGGCCATATCAAATGCTTTGAGTTCATGAACGGTGTACCATACAAGATAGCATACGACAATC
TGAAAACGGCAGTGAAGAAGATACTCGAAGGCAGCAACAGAGAAGAGCAGGAGCAGTTTATCGCTTTACGAACCCATTACCTTTATGAATCTTCATTCTG
CCGGCCGGCAAAAGGGAGCGATAAAGGTGGTGTAGAGAATGCGGGCAAAGAGGCTGTGCGAAGGTTCTTCGTTCCCTACCCCGAGGTTGATTCATTTGAG
GAGTTGAATGAATATCTGCACAACGAATGCATAAAGCTTTTGGAAAGCAATCCGAAATGGGAAGCGGAAAGGGCAGCTTTGAGGCCATTACCGGCGGTAA
GGTTTGATGGTGCGAGGTATAAAGAGGCAAAGGTCAACCGCTATTCTATGGTACAGTTTGAAACTAACCGATACTCTGTTCCCACGATATATGTGGGAGA
GAAAGTCACTGTTAAAGCTACTGCGGATGAAGTAAAAATACTAAACAAAGGAACAATGATAGCAAGCCATCCAAGGATATACGGACGCTACCAGGAGCAG
ATAAAGCTTGATCACTATCTGGAATTGCTGCTGCAAAAATCACGCGCCCTGGGCAACACAAAAGTATATAAACCTCAGATGCTGGCACCCGTTTATGAGC
AGTATCGTCGAAGCTTAAATGCCAGAAGTCCGAGAGGCAACAGGGAATTCGTAAAAATACTCATGCTGCACAGGGATTACCCTACGGCACTGGTGACAGA
AGCTATTGAAATAGCTATGGCATACAATGTATACAGTTATGACGGTGTATTTAACATATTAGGACAGCTACTGGTCTCAGGCAGTCCTAAGACGGCTCCT
GTCAGCAAAGACAAGCTTCAGGGCATCCCCGAGGTTGTTGTAATACCTCCTGATCTCAGCAAATACAGCGCTCTCATGTCAGGAGGTGGACAATAATGCC
GGTCAATAAAATGCTTATCGAAACTTACATGAAGAAGCTAAAGATGCCACAGGTGGCAAAAACCTATGAATCCCTGGCAAGAGAAGCCGCAGACAATAAT
CTGGATTATGAAGAATACCTGCTGTGTGTGCTGGAACAGGAAGTACATCAGCGGGAGAATAACCGGATCCAGAGAGGGATCCGGCAAGCAGGCTTTCCTG
TTATCAAAACGATTGAAAGCTTTGACTTCCTTGCCATACCTTCTTTGAACAAACCGCGGGTATTGAAACTCATGCAGGGAGAATATATCCGAAGAAGAGA
AAATGTCATTTTGATAGGCAACTCCGGAGTAGGGAAAACCCATATTGCAACTGCGCTCGGTTACGAGGCTTGTCGGCAGGGTATGAAGGTCAAATTCTAT
ACGGCAGCTGGTTTGATAAATGAATTGCTTGCAGCACAGCAGGAATATCGTCTTAATAAGCTTGAAAAGCAATGGCTGGCGCCGCATTTAGTGATCCTTG
ATGAATTAGGCTATGTGCCTTTCAGTAAAATCGGAGCAGAATTGTTGTTCCAGTTCTGCTCTTCCCGATATGAGAGGGGCAGCCTGATCATAACTACAAA
CTTAGAATTTCCAAGATGGACGGAGGTGTTAGGCGATGAGCAAATGACAGCCGCCCTGCTTGACCGCTTGACCCATAATGCGCACATTCTGAACATCAAT
GGTGAAAGCTACAGGTTTAAGCAGGCTCTTTCCAAGCAGGCAAATAATGACTGATTTTTTATGAAATGGTGGCCTAATTTTACCCCGGTAAGTGGCCTAA
ATTTTGATTGACATTAACA
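As a quick orientation check, the ends of the sequence above can be compared with the IRL and IRR entries. The sketch below assumes the sequence is laid out as 24 lines of 100 nt plus a final line of 19 nt, and that IRR is reported as the reverse complement of the right end; `LEFT_END` and `RIGHT_END` are the first and last 50 nt transcribed from the sequence, and `reverse_complement` is an illustrative helper.

```python
# First and last 50 nt of the DNA sequence, transcribed from the record.
LEFT_END = "TGTTAATGCTACTCTAAAAATGTACCATATACCGGCAAAAAAATAGGCCA"
RIGHT_END = "TGGCCTAATTTTACCCCGGTAAGTGGCCTAAATTTTGATTGACATTAACA"

# Inverted repeats as listed under "Ends".
IRL = "TGTTAATGCTACTCTAAAAATGTACCATATACCGGCAAAAAAATAGGCCA"
IRR = "TGTTAATGTCAATCAAAATTTAGGCCACTTACCGGGGTAAAATTAGGCCA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Length implied by the assumed line layout of the record.
total_length = 24 * 100 + 19

print("left end equals IRL:", LEFT_END == IRL)
print("reverse complement of right end equals IRR:",
      reverse_complement(RIGHT_END) == IRR)
print("sequence length from layout:", total_length, "bp")
```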
Protein section
ORF number : 2
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1485 bp | 494 aa | 112 | 1596 | + | No |
Chemistry : DDE
ORF sequence :
MIKMAQLEDIRKMYFMEGLSIREINRRTGIHRDTISKYISLEEPKPPKYKLTKERTHPVLGPYIPMIKQIIEDDKTRHRKQRHTGTKIFETLKKEGFSGG
YNTVMDYLRKEYRKQREAFLPLEFELGAYAEVDWTEAYFYLKGKETKAHLFVMKLRGSGGFYVRAYPFEKQEAFFDGHIKCFEFMNGVPYKIAYDNLKTA
VKKILEGSNREEQEQFIALRTHYLYESSFCRPAKGSDKGGVENAGKEAVRRFFVPYPEVDSFEELNEYLHNECIKLLESNPKWEAERAALRPLPAVRFDG
ARYKEAKVNRYSMVQFETNRYSVPTIYVGEKVTVKATADEVKILNKGTMIASHPRIYGRYQEQIKLDHYLELLLQKSRALGNTKVYKPQMLAPVYEQYRR
SLNARSPRGNREFVKILMLHRDYPTALVTEAIEIAMAYNVYSYDGVFNILGQLLVSGSPKTAPVSKDKLQGIPEVVVIPPDLSKYSALMSGGGQ
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
759 bp | 252 aa | 1596 | 2354 | + | No |
AG : IS21 helper
ORF sequence :
MPVNKMLIETYMKKLKMPQVAKTYESLAREAADNNLDYEEYLLCVLEQEVHQRENNRIQRGIRQAGFPVIKTIESFDFLAIPSLNKPRVLKLMQGEYIRR
RENVILIGNSGVGKTHIATALGYEACRQGMKVKFYTAAGLINELLAAQQEYRLNKLEKQWLAPHLVILDELGYVPFSKIGAELLFQFCSSRYERGSLIIT
TNLEFPRWTEVLGDEQMTAALLDRLTHNAHILNINGESYRFKQALSKQANND
Blast result :
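The ORF coordinates listed above can be cross-checked against the stated lengths. This is a minimal sketch, assuming that Begin and End are 1-based inclusive positions and that End includes the stop codon (an assumption that happens to agree with the listed numbers); the `Orf` class and helper names are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int      # 1-based, inclusive
    end: int        # 1-based, inclusive (assumed to include the stop codon)
    length_bp: int  # as stated in the record
    length_aa: int  # as stated in the record


ORFS = [
    Orf("ORF 1", 112, 1596, 1485, 494),
    Orf("ORF 2", 1596, 2354, 759, 252),
]


def span_bp(orf: Orf) -> int:
    """Number of nucleotides covered by the ORF coordinates."""
    return orf.end - orf.begin + 1


def residues(orf: Orf) -> int:
    """Codons in the span minus one stop codon."""
    return span_bp(orf) // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Nucleotides shared by two coordinate ranges (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


for orf in ORFS:
    print(orf.name, span_bp(orf), "bp,", residues(orf), "aa")
print("overlap between ORF 1 and ORF 2:", overlap_bp(ORFS[0], ORFS[1]), "bp")
```

Under these assumptions the two reading frames share a single nucleotide at position 1596, where the end of ORF 1 meets the start of ORF 2.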
Comments
The ISCth9 ORFA product shares 52% amino acid similarity with that of ISFlsp1, and the ORFB product shares 71% with that of ISMex8. There are 4 copies in the genome.
References
[1] Miriam Land (2009) Direct submission.
[2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Wu,J.H.D., Newcomb,M. and Richardson,P. (2007) Direct submission, GenBank.