ISTsi3
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_012883 | ND | Thermococcus sibiricus | Thermococcus sibiricus MM 739 |
DNA section
IS Length : 1883 bp
Ends
Left end : TTTTTCTTTTGAATGCCTTGTCCTTTAGGGCGGGGATGCAGTAATTCCAAAACAAGTCCTCCACCTATGAAAGCGAAAAACATTTAAACCTCAAGTGCGT II struct. : Yes
Right end : CCCCACTTTATCTTGGGTCCGAATGAGACCCCTCAAACCTCCCCGTCAATGGCGAGGGATTAAGCCGAACCCTCGCCCTTCAGGGCGGGGAGGAGGTCAG II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
CTCATCCTTCT | TTAC | TTTGAAATCTCT | tcag |
DNA sequence
TTTTTCTTTTGAATGCCTTGTCCTTTAGGGCGGGGATGCAGTAATTCCAAAACAAGTCCTCCACCTATGAAAGCGAAAAACATTTAAACCTCAAGTGCGT
ACCATACATTAGAGGGTTCAATATGGAGACCAGAATACCAGCATTTAGCTCAACGAGACACGTAAAACACTTCCTCGCCTACCACTTCGTATGGATACCA
AAATACCGAAGGGACATCCTCACCGGGAAAGTCGCTGAAAGATTAAAACAAATGCTCAAAGAATTCGCCGGAGAAATCGGGTGTGAAGTGATCTCCCTTG
AAGTAATTCCCGACCACGTTCACGTCTTCCTCAGAGCAAAACCCGACCTCGCCCCAGCAAGAATAATCAACCACCTGAAGGGGAAGAGTGCAAGAAAACT
CCTCCAAGAATTTCCAGAATTGAGAACAAAAACTGCCCACGGGAGGTTATGGTCACGCTCCTATTTTGTAGCCTCGGCCGGATACATAACCGACGAGATC
GTAAAACACTACGTAGAAACCCAATGGGAGCGTGAGTTAAAACGAAGAGGACAGTAAGGATAAAACTCCAGCCCTCAAAAGAACAAGAGAAAGCCCTCTT
CGAGCTAGCTGACGCTGGTGCCAAAGTCTGGAATGAAGTGAATTACCTTCGCAGGCAACAGTTCTTCAACCACGAACCTGTTGATTTTAACAGGACTGAA
AAAATCGTTTATGAGAAGTACAAGGGTGAGATTGGCTCTGCGACAGTTCAGCAGATAGCGAGGAAGAACGCTGAAGCTTGGAGGAGCTTCTTCTCCCTCA
TAAAGAAGCGAAAAGAACTTCCCAAGTGGCTGAGGCCAAAACCACCGAACTACCTGAAAGAAGAGGGAAAGAGGAAGCCCTTAATTGTCCTCAGGAACGA
CCAGTACAGGATTGATGGGAACAAGGTAATTCTCAAGGGGCTTGGGAAATTCAAAAAGCTGGAAGTCCAGTTTAAGGGGCGGATACACCTGAAAGGCAAG
CAAGGGCGCTTGGAAATCATCTACGACGACGTGAAGAGGAAGTGGTACGCCCACATCAGCTACACCGTCGAAGAGAAGCTGGAAGGCAATTCTTGGGTTA
AACTCCCAAGAACGCCGAAAGGCAACCTCGTGGCTGGCATTGACCTAGGAGTGAACAATTTAATGGCCGTTTACGTTGAGAATGGGGAGAGTTTTCTGGT
CAATGGTAGGCCGTTGAAGAGCATTGGCTTTTACTGGCAGAAGAGGATTGCTGAGTATCAGTCAAAACTCAACAAGAGTGGGGCTAAGGCGAGCAGAAAG
CTCAAGAGAATGCACGAGAAGGCAAAACTCCAAGCGGGACACTACATTAACACCGCAGTCAGGCAGACGGTTAGGAGGCTTTACGAGCTGGGAGTTTCGA
GGATTGTAGTTGGCTATCCAAAGGAGATTTCAAGAGAACCAGACAAGGGCAGAAAGCAGAATTTCGTCCTTTCTCACGTTTGGCGGTTTAATTACGTGAT
TAAACGTTTAACAGAGGTTGCTGAGGAGTATGGTATTCAGGTCGTGGTTGTGAATGAGGCTTTCACTTCTCAGACGTGCCCTCTCTGCGGGAAGCCTCAT
AAAGGGGCGAGGTTTGTCCGTGGGCTGTATATGTGTCCCGTGGAGGGGCTTGTGTTCAATGCTGATTTAGTTGGTGCTTTCAACATTTTGAGGAAGGCCG
TGAAAACGATAACCCCGAATCTGAGCGGTCTTTATGCTCAGGGGAGGGGTAACGGGCCTGAGACCGGGCCAGAGGGGTTGAAGCCCCACTTTATCTTGGG
TCCGAATGAGACCCCTCAAACCTCCCCGTCAATGGCGAGGGATTAAGCCGAACCCTCGCCCTTCAGGGCGGGGAGGAGGTCAG
ACCATACATTAGAGGGTTCAATATGGAGACCAGAATACCAGCATTTAGCTCAACGAGACACGTAAAACACTTCCTCGCCTACCACTTCGTATGGATACCA
AAATACCGAAGGGACATCCTCACCGGGAAAGTCGCTGAAAGATTAAAACAAATGCTCAAAGAATTCGCCGGAGAAATCGGGTGTGAAGTGATCTCCCTTG
AAGTAATTCCCGACCACGTTCACGTCTTCCTCAGAGCAAAACCCGACCTCGCCCCAGCAAGAATAATCAACCACCTGAAGGGGAAGAGTGCAAGAAAACT
CCTCCAAGAATTTCCAGAATTGAGAACAAAAACTGCCCACGGGAGGTTATGGTCACGCTCCTATTTTGTAGCCTCGGCCGGATACATAACCGACGAGATC
GTAAAACACTACGTAGAAACCCAATGGGAGCGTGAGTTAAAACGAAGAGGACAGTAAGGATAAAACTCCAGCCCTCAAAAGAACAAGAGAAAGCCCTCTT
CGAGCTAGCTGACGCTGGTGCCAAAGTCTGGAATGAAGTGAATTACCTTCGCAGGCAACAGTTCTTCAACCACGAACCTGTTGATTTTAACAGGACTGAA
AAAATCGTTTATGAGAAGTACAAGGGTGAGATTGGCTCTGCGACAGTTCAGCAGATAGCGAGGAAGAACGCTGAAGCTTGGAGGAGCTTCTTCTCCCTCA
TAAAGAAGCGAAAAGAACTTCCCAAGTGGCTGAGGCCAAAACCACCGAACTACCTGAAAGAAGAGGGAAAGAGGAAGCCCTTAATTGTCCTCAGGAACGA
CCAGTACAGGATTGATGGGAACAAGGTAATTCTCAAGGGGCTTGGGAAATTCAAAAAGCTGGAAGTCCAGTTTAAGGGGCGGATACACCTGAAAGGCAAG
CAAGGGCGCTTGGAAATCATCTACGACGACGTGAAGAGGAAGTGGTACGCCCACATCAGCTACACCGTCGAAGAGAAGCTGGAAGGCAATTCTTGGGTTA
AACTCCCAAGAACGCCGAAAGGCAACCTCGTGGCTGGCATTGACCTAGGAGTGAACAATTTAATGGCCGTTTACGTTGAGAATGGGGAGAGTTTTCTGGT
CAATGGTAGGCCGTTGAAGAGCATTGGCTTTTACTGGCAGAAGAGGATTGCTGAGTATCAGTCAAAACTCAACAAGAGTGGGGCTAAGGCGAGCAGAAAG
CTCAAGAGAATGCACGAGAAGGCAAAACTCCAAGCGGGACACTACATTAACACCGCAGTCAGGCAGACGGTTAGGAGGCTTTACGAGCTGGGAGTTTCGA
GGATTGTAGTTGGCTATCCAAAGGAGATTTCAAGAGAACCAGACAAGGGCAGAAAGCAGAATTTCGTCCTTTCTCACGTTTGGCGGTTTAATTACGTGAT
TAAACGTTTAACAGAGGTTGCTGAGGAGTATGGTATTCAGGTCGTGGTTGTGAATGAGGCTTTCACTTCTCAGACGTGCCCTCTCTGCGGGAAGCCTCAT
AAAGGGGCGAGGTTTGTCCGTGGGCTGTATATGTGTCCCGTGGAGGGGCTTGTGTTCAATGCTGATTTAGTTGGTGCTTTCAACATTTTGAGGAAGGCCG
TGAAAACGATAACCCCGAATCTGAGCGGTCTTTATGCTCAGGGGAGGGGTAACGGGCCTGAGACCGGGCCAGAGGGGTTGAAGCCCCACTTTATCTTGGG
TCCGAATGAGACCCCTCAAACCTCCCCGTCAATGGCGAGGGATTAAGCCGAACCCTCGCCCTTCAGGGCGGGGAGGAGGTCAG
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
435 bp | 144 aa | 123 | 557 | + | No |
Chemistry : Y1
ORF sequence :
METRIPAFSSTRHVKHFLAYHFVWIPKYRRDILTGKVAERLKQMLKEFAGEIGCEVISLEVIPDHVHVFLRAKPDLAPARIINHLKGKSARKLLQEFPEL
RTKTAHGRLWSRSYFVASAGYITDEIVKHYVETQWERELKRRGQ
RTKTAHGRLWSRSYFVASAGYITDEIVKHYVETQWERELKRRGQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1304 bp | 434 aa | 543 | 1846 | + | No |
AG : TnpB
ORF sequence :
MKRTVRIKLQPSKEQEKALFELADAGAKVWNEVNYLRRQQFFNHEPVDFNRTEKIVYEKYKGEIGSATVQQIARKNAEAWRSFFSLIKKRKELPKWLRPK
PPNYLKEEGKRKPLIVLRNDQYRIDGNKVILKGLGKFKKLEVQFKGRIHLKGKQGRLEIIYDDVKRKWYAHISYTVEEKLEGNSWVKLPRTPKGNLVAGI
DLGVNNLMAVYVENGESFLVNGRPLKSIGFYWQKRIAEYQSKLNKSGAKASRKLKRMHEKAKLQAGHYINTAVRQTVRRLYELGVSRIVVGYPKEISREP
DKGRKQNFVLSHVWRFNYVIKRLTEVAEEYGIQVVVVNEAFTSQTCPLCGKPHKGARFVRGLYMCPVEGLVFNADLVGAFNILRKAVKTITPNLSGLYAQ
GRGNGPETGPEGLKPHFILGPNETPQTSPSMARD
PPNYLKEEGKRKPLIVLRNDQYRIDGNKVILKGLGKFKKLEVQFKGRIHLKGKQGRLEIIYDDVKRKWYAHISYTVEEKLEGNSWVKLPRTPKGNLVAGI
DLGVNNLMAVYVENGESFLVNGRPLKSIGFYWQKRIAEYQSKLNKSGAKASRKLKRMHEKAKLQAGHYINTAVRQTVRRLYELGVSRIVVGYPKEISREP
DKGRKQNFVLSHVWRFNYVIKRLTEVAEEYGIQVVVVNEAFTSQTCPLCGKPHKGARFVRGLYMCPVEGLVFNADLVGAFNILRKAVKTITPNLSGLYAQ
GRGNGPETGPEGLKPHFILGPNETPQTSPSMARD
Blast result :
Comments
ISTsi3 is 77% (ORFA: the transposase) and 66% (ORFB) aa similar to ISDka2.
References
1] Mardanov A.V., Ravin N.V., Svetlitchnyi V.A., Beletsky A.V., Miroshnichenko M.L., Bonch-Osmolovskaya E.A., Skryabin K.G. (2009) Appl. Environ. Microbiol., 75, 4580-4588.