ISC1316
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_002754 | ND | Sulfolobus solfataricus | Sulfolobus solfataricus P2 |
DNA section
IS Length : 1317 bp
Ends
Left end : AACAGACGGGGGTTCAAAGGGGGCGGAAAACTTCGCCTCTTTGAGATGGATCCCTTTGTAGAAAAGTTAAAAATAGGAAAATAACGAAATACACATCAAT II struct. : Yes
Right end : TCGGAGCGACCCTTAGTGTGGAGCTCCGCCCTCTACCAGTACTCTGGCAAGGTGGGGCTGTGAAGCAGGAAGCTCCCTCATTTATGAGGGGGTAGCTCAC II struct. : Yes
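The two end sequences listed above should coincide with the first and last 100 bp of the full element given under "DNA sequence". A minimal Python sketch of that consistency check follows; the function name `ends_match` and the toy sequence are illustrative assumptions and are not part of the record.

```python
def ends_match(element: str, left_end: str, right_end: str) -> bool:
    """Return True if `element` starts with `left_end` and ends with `right_end`.

    The comparison is case-insensitive and ignores surrounding whitespace.
    """
    seq = element.strip().upper()
    left = left_end.strip().upper()
    right = right_end.strip().upper()
    return seq.startswith(left) and seq.endswith(right)


# Toy data (hypothetical, much shorter than the ISC1316 sequence).
toy = "AACAGACGGGTTTTCCCCTAGCTCAC"
print(ends_match(toy, "AACAGACG", "tagctcac"))  # True
print(ends_match(toy, "AACAGACG", "GGGG"))      # False
```

To apply it to this entry, pass the joined DNA sequence together with the "Left end" and "Right end" strings.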
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
AATGAGAACTT | TTAT | AGGCGATGAGGAATAAA | tcac |
DNA sequence
AACAGACGGGGGTTCAAAGGGGGCGGAAAACTTCGCCTCTTTGAGATGGATCCCTTTGTAGAAAAGTTAAAAATAGGAAAATAACGAAATACACATCAAT
GCCCACCTTAGGGTTTCGCTTCCGTGCATACACTGACGAACAAACCCTTAGGGCGTTAAAAGCCCAGTTGAAGTTAACATGTGAAATCTACAACACCTTA
AGGTGGGCAGACATATACTTTTACCAAAGGGATGGGAAAGGACTTACGCAGACTGAGTTAAGACAGTTGGCTCTAGATCTGAGAAAACAAGATGATGAGT
ATAAGCAACTCTACTCGCAAGTGGTTCAACAAGTAGCTGACCGTTATTCCGAAGCTAAGAAGAGGTTTTTTGAAGGTTTAGCACGTTTCCCAAAAGAAAA
GAAACCTCATAAATACTACTCCCTTGTCTATACGCAAAGCGGTTGGAAAATACTTCACGTTAGAGAAATAAGAAAAGGCAAGAAGAATAAGAAGAAACTA
ATAACGCTTAAACTATCAAATCTTGGTACGTTCAAGGTAATAGTTCACCGAGACTTTCCCCTTGACAAAGTAAAGAGGGTAGTAGTGAAGCTAACAAGAT
CTGAGAGGATTTACATCACTTTCGTAGTTGATCACGAATTCCCCAAGTTACCTAACACGGGTAAGGTAGTGGCGATAGATGTTGGTGTAGAAAAGTTGTT
AATAACGTCGGATGGTGAGTATTTTCCTAATTTGAGACCTTACGAGAAAGCGTTATGGAAAGTGAAGCATATACACAGAGAACTTTCAAGGAAGAAGTTC
CTCTCTAATAATTGGTTTAAGGCTAAGGTTAAGCTTGCTAGGGCTTATGAGCATTTGAAGAATCTAAGAACGGATCTTTACATGAAGTTGGGCAAGTGGT
TTGCTGAGCATTATGACGTTGTGGTGATGGAAGGTATTCACGCTAAACAGCTTGTGGGTAAGTCCTTGAGGTCTCTGAGGAGGAGATTGAGTGATGTGGG
ATTTGGTGAGTTGAGGGGTGTGCTGAAGTATCAGCTGGAAAAATACGGTAAGAAACTCATCCTAGTTAATCCTGCATACACTTCCAAAACTTGTGCTAGG
TGCGGGTATGTGAAAAATGACTTGTCTCTATCTGATCGTGTTTTCGTTTGTCCCAACTGTGGTTGGATTGCAGATCGTGACTATAATGCTTCTCTTAACA
TCTTACGTGGATCGGGGTCGGAGCGACCCTTAGTGTGGAGCTCCGCCCTCTACCAGTACTCTGGCAAGGTGGGGCTGTGAAGCAGGAAGCTCCCTCATTT
ATGAGGGGGTAGCTCAC
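The sequence is stored as wrapped lines, so a length check against the declared "IS Length" requires joining them first. The sketch below shows one way to do that in Python; `join_wrapped` and `length_matches` are hypothetical helper names, and the example input is toy data, not the record itself.

```python
def join_wrapped(lines):
    """Concatenate wrapped sequence lines, dropping whitespace and blank lines."""
    return "".join(line.strip() for line in lines if line.strip())


def length_matches(lines, declared_bp):
    """Return True if the joined sequence contains only A/C/G/T
    and its length equals the declared length in bp."""
    seq = join_wrapped(lines).upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return len(seq) == declared_bp


# Toy example: two wrapped lines, 10 + 4 = 14 bp in total.
toy_lines = ["AACAGACGGG\n", "GGTT\n"]
print(length_matches(toy_lines, 14))  # True
```

For this entry, the declared value to pass would be 1317.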
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1182 bp | 393 aa | 99 | 1280 | + | No |
AG : TnpB
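The ORF 1 coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions and that the End coordinate includes the stop codon; both are assumptions about the record's conventions, and `orf_lengths` is a hypothetical helper name.

```python
def orf_lengths(begin: int, end: int) -> tuple:
    """Return (nucleotide length, amino-acid length) of an ORF.

    Assumes 1-based inclusive coordinates and that the last codon
    is a stop codon that is not counted as a residue.
    """
    nt_len = end - begin + 1
    if nt_len <= 0 or nt_len % 3 != 0:
        raise ValueError("coordinates do not span a whole number of codons")
    aa_len = nt_len // 3 - 1  # subtract the stop codon
    return nt_len, aa_len


print(orf_lengths(99, 1280))  # (1182, 393)
```

Under those assumptions, the result agrees with the 1182 bp and 393 aa given for ORF 1.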
ORF sequence :
MPTLGFRFRAYTDEQTLRALKAQLKLTCEIYNTLRWADIYFYQRDGKGLTQTELRQLALDLRKQDDEYKQLYSQVVQQVADRYSEAKKRFFEGLARFPKE
KKPHKYYSLVYTQSGWKILHVREIRKGKKNKKKLITLKLSNLGTFKVIVHRDFPLDKVKRVVVKLTRSERIYITFVVDHEFPKLPNTGKVVAIDVGVEKL
LITSDGEYFPNLRPYEKALWKVKHIHRELSRKKFLSNNWFKAKVKLARAYEHLKNLRTDLYMKLGKWFAEHYDVVVMEGIHAKQLVGKSLRSLRRRLSDV
GFGELRGVLKYQLEKYGKKLILVNPAYTSKTCARCGYVKNDLSLSDRVFVCPNCGWIADRDYNASLNILRGSGSERPLVWSSALYQYSGKVGL
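The ORF sequence can be compared with a translation of the DNA sequence from position 99 onward. The following is a minimal sketch that assumes the standard genetic code amino-acid assignments; `translate` is a hypothetical helper. The sample input is the first 30 nt of the ORF (positions 99 to 128 of the DNA sequence above).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'.

    With to_stop=True, translation ends at the first stop codon,
    which is not included in the output.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of ORF 1 (positions 99-128 of the element).
print(translate("ATGCCCACCTTAGGGTTTCGCTTCCGTGCA"))  # MPTLGFRFRA
```

The output matches the first ten residues of the ORF sequence listed above.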
Blast result :
Comments
This IS is present as ten isoforms in the genome.
References
[1] She, Q., Singh, R.K., Confalonieri, F. et al. (2001) Proc. Natl. Acad. Sci. U.S.A. 98(14), 7835-7840