ISTsi1
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_012883 | ND | Thermococcus sibiricus | Thermococcus sibiricus MM739 |
DNA section
IS Length : 1854 bp
Ends
IR Length : 0
IRL : GGGCTGTTTCATAAATAATAACAAGCGAAACTTTTAAATATTTTTGGCAC
IRR : CGTTCTGGTCAGCCGTAATTTGTGTAAAGTTCATAAACATCGCCACGGCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTTAACTCCCATCGCTCTAT | GGACGACTATGGTCGTTATT | 0 | |
CAACTCTCGCTTCGAGGCAG | GGTCTGCTGAGGTAGAACTC | 0 |
DNA sequence
GGGCTGTTTCATAAATAATAACAAGCGAAACTTTTAAATATTTTTGGCACTAATTTCTTTTGGGCGAGAGTGGTGAGGCTTTACCGGACTGGTGAAGCAT
CAAAGAAACTCGGAGTTTCCAAGATGACAATCCTCCGCTGGATAAAATCGGGTAAACTCAAAGCCCACCGCATTGGAAAAGAATACAGAGTTCCAGAGAG
CGAAATCAAGAAGATTCTTGAAGGCAAAATCCCCGACCAAGTTGCCATTTACGCCAGAGTATCTAGTCAAAACCAAAAAGAAGACTTGGAAAGGCAAGTC
GAATACCTCAAAAACTACTGCTCGGCAAAAGGCTACCAGGTTGCTAAAATCATAACAGACATCTCCTCCGGCCTGAACGAGAACAGAAAAGGACTAAAAC
AACTCTTCAAACTCGTTGAGAATGGAGAAATAACCAAAGTCGTGATAACATACAAGGACAGGCTCACCCGCTTCGGCTTCAAATACCTCGAACAGTACTT
CAACTCCCACGGCGTTGAAATCGAAGTAATCTTTGACGACGAGGAGAAGACTCTAGAAGAAAAAGAACTTGTCGAAGACTTGTTGGCCATTGTAACTTCC
TTCGCTGGAAAACTTTATGGAGCTCGCTCCCACAAGAAAAAACGCCTCGTCGAGGCGGTAAAGAATGCCCTCAGAGACGATTAAACTCGCCTCAAAATTC
AAGCTGAAGGAAACTCCCGAAGGGTTAAACGAGCTATTCTCGACCTACCGTGATATTGTGAACTTCCTAATCACCCACGCTTTTGAGAACAACATAACCA
GCTTTTACCGCCTGAAAAAAGAGATATACAAGAGCCTCAGGAAAGAATATCCAGAACTCCCGAGCCATTACATTTACACGGCCTGTCAAATGGCTGCTTC
AATCTACAAGAGCTACCGGAAAAGGAAGCGGAGGGGAAAAGCCAGTGGAAGACCAGTGTTCAAGAAAGAAGCCATAATGCTCGACGACCACCTGTTCAAG
CTTGATCTTGAAAAGGGAATAATCAAACTCTCCACTCCAAACGGGAGAATAACTCTGAAATTTTACCCGGCAAAGCACCACGAGAAGTTCAAGAACTGGA
AGGTTGGCCAAGCGTGGCTCGTGAGAACGCCTAAGGGCGTCTTCATCAATGTCGTCTTCTCGAAGGAAGTCGAGGTTAAAGAACCCGAAGATTTTGTTGG
CGTGGACTTGAACGAGAACAATGTAACACTCAGCCTTTCAGACGGGGAGTTTGTTCAGATAATCACTCACGAAAAGGAAATTAGGACTGGTTACTTCGTG
AAGCGGAGAAAAATCCAGAAGAAAGTCAAAGTCGGCAAGAAGAGGCAAGAACTCCTCGAAAAGTACGGTGAGAGAGAAAGGAACAGGCTGAACGACCTTT
ACCACAAGCTTGCCAACAAAATTGTTGAACTGGCCGAGAAGTACGGCGGGATTGCTTTGGAGGATTTGACGGAAATCCGGAATTCGATTAGATACTCCGC
CGAGATGAATGGTCGTCTTCACAGGTGGAGTTTTAGGAAGCTTCAGTCAATCATCGAGTACAAGGCGAAGTTAAAAGGTGTTGAGGTTGTTTTTGTTGAT
CCAGCTTACACTTCCTCCCTGTGCCCGGTATGTGGGGAGAAGTTAAGCCCGAATGGGCACAGGGTTTTGAAGTGTTTGAATTGCGGTTTTGAGGCCGACA
GGGATGTTGTTGGCTCTTGGAATGTTCGTTTGAGAGCCCTGAAGATGTGGGGAGTTTCCGTTCCCCCCGAAAGCCCTCCGATGAAGATGGGAGGAGGGAA
GGCCAGCCGTGGCGATGTTTATGAACTTTACACAAATTACGGCTGACCAGAACG
CAAAGAAACTCGGAGTTTCCAAGATGACAATCCTCCGCTGGATAAAATCGGGTAAACTCAAAGCCCACCGCATTGGAAAAGAATACAGAGTTCCAGAGAG
CGAAATCAAGAAGATTCTTGAAGGCAAAATCCCCGACCAAGTTGCCATTTACGCCAGAGTATCTAGTCAAAACCAAAAAGAAGACTTGGAAAGGCAAGTC
GAATACCTCAAAAACTACTGCTCGGCAAAAGGCTACCAGGTTGCTAAAATCATAACAGACATCTCCTCCGGCCTGAACGAGAACAGAAAAGGACTAAAAC
AACTCTTCAAACTCGTTGAGAATGGAGAAATAACCAAAGTCGTGATAACATACAAGGACAGGCTCACCCGCTTCGGCTTCAAATACCTCGAACAGTACTT
CAACTCCCACGGCGTTGAAATCGAAGTAATCTTTGACGACGAGGAGAAGACTCTAGAAGAAAAAGAACTTGTCGAAGACTTGTTGGCCATTGTAACTTCC
TTCGCTGGAAAACTTTATGGAGCTCGCTCCCACAAGAAAAAACGCCTCGTCGAGGCGGTAAAGAATGCCCTCAGAGACGATTAAACTCGCCTCAAAATTC
AAGCTGAAGGAAACTCCCGAAGGGTTAAACGAGCTATTCTCGACCTACCGTGATATTGTGAACTTCCTAATCACCCACGCTTTTGAGAACAACATAACCA
GCTTTTACCGCCTGAAAAAAGAGATATACAAGAGCCTCAGGAAAGAATATCCAGAACTCCCGAGCCATTACATTTACACGGCCTGTCAAATGGCTGCTTC
AATCTACAAGAGCTACCGGAAAAGGAAGCGGAGGGGAAAAGCCAGTGGAAGACCAGTGTTCAAGAAAGAAGCCATAATGCTCGACGACCACCTGTTCAAG
CTTGATCTTGAAAAGGGAATAATCAAACTCTCCACTCCAAACGGGAGAATAACTCTGAAATTTTACCCGGCAAAGCACCACGAGAAGTTCAAGAACTGGA
AGGTTGGCCAAGCGTGGCTCGTGAGAACGCCTAAGGGCGTCTTCATCAATGTCGTCTTCTCGAAGGAAGTCGAGGTTAAAGAACCCGAAGATTTTGTTGG
CGTGGACTTGAACGAGAACAATGTAACACTCAGCCTTTCAGACGGGGAGTTTGTTCAGATAATCACTCACGAAAAGGAAATTAGGACTGGTTACTTCGTG
AAGCGGAGAAAAATCCAGAAGAAAGTCAAAGTCGGCAAGAAGAGGCAAGAACTCCTCGAAAAGTACGGTGAGAGAGAAAGGAACAGGCTGAACGACCTTT
ACCACAAGCTTGCCAACAAAATTGTTGAACTGGCCGAGAAGTACGGCGGGATTGCTTTGGAGGATTTGACGGAAATCCGGAATTCGATTAGATACTCCGC
CGAGATGAATGGTCGTCTTCACAGGTGGAGTTTTAGGAAGCTTCAGTCAATCATCGAGTACAAGGCGAAGTTAAAAGGTGTTGAGGTTGTTTTTGTTGAT
CCAGCTTACACTTCCTCCCTGTGCCCGGTATGTGGGGAGAAGTTAAGCCCGAATGGGCACAGGGTTTTGAAGTGTTTGAATTGCGGTTTTGAGGCCGACA
GGGATGTTGTTGGCTCTTGGAATGTTCGTTTGAGAGCCCTGAAGATGTGGGGAGTTTCCGTTCCCCCCGAAAGCCCTCCGATGAAGATGGGAGGAGGGAA
GGCCAGCCGTGGCGATGTTTATGAACTTTACACAAATTACGGCTGACCAGAACG
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
612 bp | 203 aa | 73 | 684 | + | No |
Chemistry : Serine
ORF sequence :
MVRLYRTGEASKKLGVSKMTILRWIKSGKLKAHRIGKEYRVPESEIKKILEGKIPDQVAIYARVSSQNQKEDLERQVEYLKNYCSAKGYQVAKIITDISS
GLNENRKGLKQLFKLVENGEITKVVITYKDRLTRFGFKYLEQYFNSHGVEIEVIFDDEEKTLEEKELVEDLLAIVTSFAGKLYGARSHKKKRLVEAVKNA
LRDD
GLNENRKGLKQLFKLVENGEITKVVITYKDRLTRFGFKYLEQYFNSHGVEIEVIFDDEEKTLEEKELVEDLLAIVTSFAGKLYGARSHKKKRLVEAVKNA
LRDD
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1182 bp | 393 aa | 665 | 1846 | + | No |
AG : TnpB
ORF sequence :
MPSETIKLASKFKLKETPEGLNELFSTYRDIVNFLITHAFENNITSFYRLKKEIYKSLRKEYPELPSHYIYTACQMAASIYKSYRKRKRRGKASGRPVFK
KEAIMLDDHLFKLDLEKGIIKLSTPNGRITLKFYPAKHHEKFKNWKVGQAWLVRTPKGVFINVVFSKEVEVKEPEDFVGVDLNENNVTLSLSDGEFVQII
THEKEIRTGYFVKRRKIQKKVKVGKKRQELLEKYGERERNRLNDLYHKLANKIVELAEKYGGIALEDLTEIRNSIRYSAEMNGRLHRWSFRKLQSIIEYK
AKLKGVEVVFVDPAYTSSLCPVCGEKLSPNGHRVLKCLNCGFEADRDVVGSWNVRLRALKMWGVSVPPESPPMKMGGGKASRGDVYELYTNYG
KEAIMLDDHLFKLDLEKGIIKLSTPNGRITLKFYPAKHHEKFKNWKVGQAWLVRTPKGVFINVVFSKEVEVKEPEDFVGVDLNENNVTLSLSDGEFVQII
THEKEIRTGYFVKRRKIQKKVKVGKKRQELLEKYGERERNRLNDLYHKLANKIVELAEKYGGIALEDLTEIRNSIRYSAEMNGRLHRWSFRKLQSIIEYK
AKLKGVEVVFVDPAYTSSLCPVCGEKLSPNGHRVLKCLNCGFEADRDVVGSWNVRLRALKMWGVSVPPESPPMKMGGGKASRGDVYELYTNYG
Blast result :
Comments
Two almost identical copies of ISTsi1 are present in the genome of archaeon Thermococcus sibiricus.
ISTsi1 is 75% (ORFA : the resolvase) and 56% (ORFB) aa similar to ISSto13.
ISTsi1 is 75% (ORFA : the resolvase) and 56% (ORFB) aa similar to ISSto13.
References
1] Mardanov A.V., Ravin N.V., Svetlitchnyi V.A., Beletsky A.V., Miroshnichenko M.L., Bonch-Osmolovskaya E.A., Skryabin K.G. (2009) . Appl. Environ. Microbiol., 75, 4580-4588.