ISSto1
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_003106 | ND | Sulfolobus tokodaii | Sulfolobus tokodaii |
DNA section
IS Length : 1799 bp
Ends
Left end : AAAACGCGGGGCAGCCTCGCCTTTTAAGGCGGGGTAAAGTTTAAAAATTATTTTGAATACTTTAAAGTGTGGAATACAAATCAACTAGGCATGCAAAGTA II struct. : Yes
Right end : CACCCACTAACTATGAAGTTGTGAGGATGAAGGTGGTAAACCACAAACCTATGAACCGCCCTAAGGGAACCCTCGCCCTTTAGGGCGAGGAGGAGGTCAG II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
ACTTCTTTACACTTTTTAA | TGAC | TTAGTCGATAATAGAGTACTAGGG | TCAG |
ATAGAATCCTATACGGATC | TGAC | TCAGTATGCAATATGGAATCCCAAATG | TCAG |
DNA sequence
AAAACGCGGGGCAGCCTCGCCTTTTAAGGCGGGGTAAAGTTTAAAAATTATTTTGAATACTTTAAAGTGTGGAATACAAATCAACTAGGCATGCAAAGTA
CCTCTGCAACTACCACTTCGTATGGATACCGAAATACCGTAGGAAAGTGTTAACAGGCGAAATAGCTGAATACACTAAAGAGGTACTAAGAACCATAGCA
GAAGAGTTGGGCTGTGAAGTATTAGCCCTAGAAGTAATGCCAGACCACATCCACCTATTCGTTAACTGCCCACCGAGATACGCACCGTCATATTTAGCAA
ACTACTTCAAAGGAAAATCAGCAAGACTAATACTCAAGAAGTTCCCAGAGCTGAAGAAAGCTACTAACGGGAAGCTCTGGACTAGAAGCTACTTCGTATC
GACTTCTGGTAACATATCCAGCGAGACGATAAAGAAGTACATTGAAGAACAGTGGGTGAAAGAGGGTGAAGAGGACTAACGTAGTTAAACTAATAGTAGA
CAAGCAAACCCACGAAAGACTGAAGGAGTTGGCTATTACCACAGCGAAGTGTTGGAACGAAGTGAACTGGTTAAGGATGCAACAGTACAAGGAAGGGGAG
AGAGTAGACTTTGCTAAGACAGAAAAGGAGGTGTACGAGAAGTACAAACACGTGTTAAAGGTTAACGTTCAACAAGTTGCTAGAAAGAACGCTGAGGACT
GGAGAAGCTTCTTCTCCCTAATCGAAGAGAAGAAGGAGGGAAAATTACCAAAGTGGTTTAAACCAAGACCTCCAGGGTATTGGAAAGATAAAAGTGGAAA
TTATAAACTAATCCTAATCATTAGGAACGATCGTTATGAGGTGGATGAGAACAAGAGGATCATCTATTTGAAGGACTTCAAACTCTCTCTGAGGTTTAAG
GGGAAACTCAAGTGGCACGGGAAACAAGGTAGATTAGAAATAATTTACAACGAAGCTAAGAGGAGTTGGTATGCTCATATTCCAGTGGAGGTTGAAAGTG
AAACGAAAGTTGAGGGCAATCTGAGGGCTTCTGTTGATTTAGGAATCATCAATTTAGCAACAGTTTACGTAGAAGATGGTAATTGGTACCTTTTCAAGGG
TGGTAGTGTTCTTTCACAGTATGAGTATTATAGCAAGAAGATACGAATCGTTCAGAAGACCTTAGCGAGGCATAAGCAGAATAGGAGTAAGAAGCTGAAA
CTCCTCTACGAGAATAGGAGTAGGTTCCTCAAACACGCTTTAAACAGTATGGTTAGGAAAGTAGTGGAGCTGTTGAAGGACAAACGGGTAAGCGAGGTTG
TTATAGGTTATCCTAAAGAGATTAATAGGAACCACGGTAATAAACTCACTGTAAACTTCTGGAACTACAGGTATGTTATTAAGCGTTTTGAGGAGATTGG
GGAAGAACTAGGAATTAGGGTTGTTAAAGTGGATGAATCTTATACTTCTAAGACGTGCTCCCTATGCGGGGAAGCCCACGAAAGTGGGCGTGTTAAACGC
GGCCTATTTAAGTGTCCCCGCATAGGGAAAGTGATAAACGCAGACCTGAATGGAGCGATAAATATCCTACATATCCCCGAGTCCCTAGGATCTGGGAGTG
GAGGGCAACTCCCAGTGAGGGATAGGGGTAATGGGCTGAAGACCCAGCCCGTGGTCTACCGCTGGACGAACGGAGCGGGGTGGGTGCTTTACGCACCCAC
TAACTATGAAGTTGTGAGGATGAAGGTGGTAAACCACAAACCTATGAACCGCCCTAAGGGAACCCTCGCCCTTTAGGGCGAGGAGGAGGTCAG
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
411 bp | 136 aa | 75 | 485 | + | No |
Chemistry : Y1
ORF sequence :
MEYKSTRHAKYLCNYHFVWIPKYRRKVLTGEIAEYTKEVLRTIAEELGCEVLALEVMPDHIHLFVNCPPRYAPSYLANYFKGKSARLILKKFPELKKATN
GKLWTRSYFVSTSGNISSETIKKYIEEQWVKEGEED
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1311 bp | 436 aa | 472 | 1782 | + | No |
AG : TnpB
ORF sequence :
MKRTNVVKLIVDKQTHERLKELAITTAKCWNEVNWLRMQQYKEGERVDFAKTEKEVYEKYKHVLKVNVQQVARKNAEDWRSFFSLIEEKKEGKLPKWFKP
RPPGYWKDKSGNYKLILIIRNDRYEVDENKRIIYLKDFKLSLRFKGKLKWHGKQGRLEIIYNEAKRSWYAHIPVEVESETKVEGNLRASVDLGIINLATV
YVEDGNWYLFKGGSVLSQYEYYSKKIRIVQKTLARHKQNRSKKLKLLYENRSRFLKHALNSMVRKVVELLKDKRVSEVVIGYPKEINRNHGNKLTVNFWN
YRYVIKRFEEIGEELGIRVVKVDESYTSKTCSLCGEAHESGRVKRGLFKCPRIGKVINADLNGAINILHIPESLGSGSGGQLPVRDRGNGLKTQPVVYRW
TNGAGWVLYAPTNYEVVRMKVVNHKPMNRPKGTLAL
Blast result :
Comments
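The ORF coordinates listed in the Protein section can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Begin and End are 1-based inclusive positions and that each bp length includes one stop codon. The record does not state either assumption, but both are consistent with its numbers. The names `Orf`, `span_bp`, `codons_match` and `overlap_bp` are hypothetical helpers and not part of any database tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    length_bp: int  # as listed in the record
    length_aa: int  # as listed in the record

    def span_bp(self) -> int:
        """Number of bases covered by Begin..End."""
        return self.end - self.begin + 1

    def codons_match(self) -> bool:
        """True if span, listed bp length and aa length agree,
        assuming the bp length includes one stop codon."""
        return self.span_bp() == self.length_bp == 3 * (self.length_aa + 1)


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


IS_LENGTH = 1799  # "IS Length" field of the record

# Values copied from the ORF 1 and ORF 2 tables.
ORF1 = Orf("ORF 1", begin=75, end=485, length_bp=411, length_aa=136)
ORF2 = Orf("ORF 2", begin=472, end=1782, length_bp=1311, length_aa=436)

# ORF 1 protein sequence as listed in the record (two lines joined).
ORF1_PROTEIN = (
    "MEYKSTRHAKYLCNYHFVWIPKYRRKVLTGEIAEYTKEVLRTIAEELGCEVLALEVMPDHIHLFVNCPPR"
    "YAPSYLANYFKGKSARLILKKFPELKKATN"
    "GKLWTRSYFVSTSGNISSETIKKYIEEQWVKEGEED"
)

if __name__ == "__main__":
    for orf in (ORF1, ORF2):
        print(orf.name, orf.span_bp(), "bp, consistent:", orf.codons_match())
    print("ORF 1 / ORF 2 overlap:", overlap_bp(ORF1, ORF2), "bp")
```

Under these assumptions both ORFs are internally consistent, and the listed coordinates imply that the end of ORF 1 overlaps the start of ORF 2 (TnpB) on the + strand.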
References
[1] Kawarabayasi,Y., Hino,Y., Horikawa,H., Jin-no,K., Takahashi,M., Sekine,M., Baba,S., Ankai,A., Kosugi,H., Hosoyama,A., Fukui,S., Nagai,Y., Nishijima,K., Otsuka,R., Nakazawa,H., Takamiya,M., Kato,Y., Yoshizawa,T., Tanaka,T., Kudoh,Y., Yamazaki,J., Kushida,N., Oguchi,A., Aoki,K., Masuda,S., Yanagii,M., Nishimura,M., Yamagishi,A., Oshima,T. and Kikuchi,H. (2001) DNA Res. 8 (4), 123-140