ISHwa17
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM180088 | ND | Haloquadratum walsbyi | Haloquadratum walsbyi DSM 16790 |
DNA section
IS Length : 2033 bp
Ends
Left end : CAGAGAAAGCAGCCAGAGTGCCTCGGGGCTTTGATATTGATGTGACCCCGAGGCTGAATGCCGTAAGCTCCAGGAGACGTGTTCAGTACGTTCGATATAC II struct. : Yes
Right end : GCCTCTGATACCGCTCAAGGGCACCATCCTGATGGTATCCGAGGCGTGTCCGACCAATCCACGGGAAGAAGCTCCGGGGCTTGACCCCGGAGCAGTTCAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GCATATGATG | TCTA | ATTGCAGAAC | tcac |
CAGTCGTATA | TCTA | TGGGGAATAC | tcac |
DNA sequence
CAGAGAAAGCAGCCAGAGTGCCTCGGGGCTTTGATATTGATGTGACCCCGAGGCTGAATGCCGTAAGCTCCAGGAGACGTGTTCAGTACGTTCGATATAC
TGCTCAATCGTCTGGCTCGATACTTCACCAGCCGTTCCGATGTAGTATGATTCTTCCCAGAATCCCCCACCCCACAAGTGGTTTTCCAAGAGGTGTTCGT
GTTGCTGCCACATCTCTCTGGCTGTGATGCTCTTGATTTTTCGTGCAATCTCGCTTGTGGAATGTTTCGGATGTGCTGAAAGAAACAACTGAACGTGGTC
TGGCGAGATATGCAACGACAATATCTCGTAATTGTACTCGTCACACACCGCTCTGAACGATGTTTCTCACGACTGTTCTATCTCACCAAGCACTGAGCGG
CGGTACTTCCGACATTGCAAATCTCCGATTTGCAAGCAGTCAATCTCCGATTGACAACACCACACAAAGTGGTAGTTGATGTTATACACAGTGTGATTTG
ACCGTGTTTCGCCCATATCGGCTAGAGTCATGCCCAAAAGTTAAGCATAATTGTGTCTAAGTATATACTGTAGAGATGACCAAACGCCTCACCGTAGACT
TCGAGGACGATTTGTACAAGGAGTTCTCGAAGAAATGCATCGACGCTGAACGTCCAAAGTCAGAAGTCGTGCGCGAACTCGTCCAAGACTGGGTAACCGA
TCAAGAATGACGCAGACACAGGCTCTCACCAAGACGCTCGTGTTTCCGCTGGACGTGCAGAGTGGCAACGAGAGCCTGCTTCACGACGCCCGCTTGGAGT
GTCGCCGCGTGTTCACCGAGGTACTGCGCCTCAACTACGACGGGTGGGACTGGAATAGGATTGAGGATGTTGTTGAGCAGAACGCCGACCTCGTACAAAA
CACCGCACAGCGCATCATCGACAAAGCCTTCGAAGCACTCACGCAATACTACGACTACGACGACTGGGGCAGACCGTGGTACAAACACGAGACGTTCCCA
CTGCGGATGAACTACGGCGAAGGCTACAACCTCTTTCTCGAAGACGAAACAGTACGGTTCCGCATCAGTGCGAAACCGTACAACCATGTCAAAGGCGAAC
TGCGTGGCACACAAGACCAGTTTGACCTGCTCGCACAGGCAATCGAAGATGATGACTGGCACGTTGGTACTGCCGAAGCACTCTGCCGAAACGGACGCGA
AGAATTGCACGTTACCGTGACGAACGAGACAGCCGAAGTCGGGGCAAAAACAGCGGCAGAAACGGTCATCGGTGTCGATATTAACGAAGACTGCATTGCA
CTGGCGGCACTAACCGAACACGGCATTGAAGATTCAATTGTTATCGACTACCCAGAAATCAAGGAAGAACGACACCGGTACTTCACGATGCGGAAACGGA
TGCAAGAAGCCGGACAGACCGCGTTTAACGACGTGTTCCGCGACAAGGAACAACGGTTCGTTCACGACCAGCTACACACGGTGTCACGTCGCGTAGTCGA
GTGGATTCAACGGTTACGGTTCGACAGTCCTGTAATTGTCTTTGAAGACCTCACAGACATGAGAGACGACATTGAGTACGGGACTCGTATGAACCGGCGA
CTCCATTCATTACCGTTCGCCAAACTTCGGGATTTCATCACGTACAAAGCCGCATGGCACGGGATTCCGTCAGATGACGTTGACCCGGAGTACACCAGCC
AACAGTGTCCGGTCTGTGGACACACAGAACGTGCAAATCGCCACAAGAAGCGGTTCAAATGCCGTGAGTGCGAGCATCAAGACCACGCTGACCGTGGCGC
TGGTGTCAGTGTCGGGCAAAAGTGGCTGAGGAAGCAAAACAATCGAAATGTGCCTGCTCTCAACACACTGCCACAGGTGCGGAAGTGGGAGTTGCGACGG
CAGGCATCGGGGCCTGTGGACGGCCCGACCGTGGCCTCTGATACCGCTCAAGGGCACCATCCTGATGGTATCCGAGGCGTGTCCGACCAATCCACGGGAA
GAAGCTCCGGGGCTTGACCCCGGAGCAGTTCAC
TGCTCAATCGTCTGGCTCGATACTTCACCAGCCGTTCCGATGTAGTATGATTCTTCCCAGAATCCCCCACCCCACAAGTGGTTTTCCAAGAGGTGTTCGT
GTTGCTGCCACATCTCTCTGGCTGTGATGCTCTTGATTTTTCGTGCAATCTCGCTTGTGGAATGTTTCGGATGTGCTGAAAGAAACAACTGAACGTGGTC
TGGCGAGATATGCAACGACAATATCTCGTAATTGTACTCGTCACACACCGCTCTGAACGATGTTTCTCACGACTGTTCTATCTCACCAAGCACTGAGCGG
CGGTACTTCCGACATTGCAAATCTCCGATTTGCAAGCAGTCAATCTCCGATTGACAACACCACACAAAGTGGTAGTTGATGTTATACACAGTGTGATTTG
ACCGTGTTTCGCCCATATCGGCTAGAGTCATGCCCAAAAGTTAAGCATAATTGTGTCTAAGTATATACTGTAGAGATGACCAAACGCCTCACCGTAGACT
TCGAGGACGATTTGTACAAGGAGTTCTCGAAGAAATGCATCGACGCTGAACGTCCAAAGTCAGAAGTCGTGCGCGAACTCGTCCAAGACTGGGTAACCGA
TCAAGAATGACGCAGACACAGGCTCTCACCAAGACGCTCGTGTTTCCGCTGGACGTGCAGAGTGGCAACGAGAGCCTGCTTCACGACGCCCGCTTGGAGT
GTCGCCGCGTGTTCACCGAGGTACTGCGCCTCAACTACGACGGGTGGGACTGGAATAGGATTGAGGATGTTGTTGAGCAGAACGCCGACCTCGTACAAAA
CACCGCACAGCGCATCATCGACAAAGCCTTCGAAGCACTCACGCAATACTACGACTACGACGACTGGGGCAGACCGTGGTACAAACACGAGACGTTCCCA
CTGCGGATGAACTACGGCGAAGGCTACAACCTCTTTCTCGAAGACGAAACAGTACGGTTCCGCATCAGTGCGAAACCGTACAACCATGTCAAAGGCGAAC
TGCGTGGCACACAAGACCAGTTTGACCTGCTCGCACAGGCAATCGAAGATGATGACTGGCACGTTGGTACTGCCGAAGCACTCTGCCGAAACGGACGCGA
AGAATTGCACGTTACCGTGACGAACGAGACAGCCGAAGTCGGGGCAAAAACAGCGGCAGAAACGGTCATCGGTGTCGATATTAACGAAGACTGCATTGCA
CTGGCGGCACTAACCGAACACGGCATTGAAGATTCAATTGTTATCGACTACCCAGAAATCAAGGAAGAACGACACCGGTACTTCACGATGCGGAAACGGA
TGCAAGAAGCCGGACAGACCGCGTTTAACGACGTGTTCCGCGACAAGGAACAACGGTTCGTTCACGACCAGCTACACACGGTGTCACGTCGCGTAGTCGA
GTGGATTCAACGGTTACGGTTCGACAGTCCTGTAATTGTCTTTGAAGACCTCACAGACATGAGAGACGACATTGAGTACGGGACTCGTATGAACCGGCGA
CTCCATTCATTACCGTTCGCCAAACTTCGGGATTTCATCACGTACAAAGCCGCATGGCACGGGATTCCGTCAGATGACGTTGACCCGGAGTACACCAGCC
AACAGTGTCCGGTCTGTGGACACACAGAACGTGCAAATCGCCACAAGAAGCGGTTCAAATGCCGTGAGTGCGAGCATCAAGACCACGCTGACCGTGGCGC
TGGTGTCAGTGTCGGGCAAAAGTGGCTGAGGAAGCAAAACAATCGAAATGTGCCTGCTCTCAACACACTGCCACAGGTGCGGAAGTGGGAGTTGCGACGG
CAGGCATCGGGGCCTGTGGACGGCCCGACCGTGGCCTCTGATACCGCTCAAGGGCACCATCCTGATGGTATCCGAGGCGTGTCCGACCAATCCACGGGAA
GAAGCTCCGGGGCTTGACCCCGGAGCAGTTCAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
444 bp | 148 aa | 516 | 73 | - | No |
Chemistry : Y1
ORF sequence :
MGETRSNHTVYNINYHFVWCCQSEIDCLQIGDLQCRKYRRSVLGEIEQS*ETSFRAVCDEYNYEILSLHISPDHVQLFLSAHPKHSTSEIARKIKSITAR
EMWQQHEHLLENHLWGGGFWEESYYIGTAGEVSSQTIEQYIERTEHVS
EMWQQHEHLLENHLWGGGFWEESYYIGTAGEVSSQTIEQYIERTEHVS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1311 bp | 436 aa | 707 | 2017 | + | No |
AG : TnpB
ORF sequence :
MTQTQALTKTLVFPLDVQSGNESLLHDARLECRRVFTEVLRLNYDGWDWNRIEDVVEQNADLVQNTAQRIIDKAFEALTQYYDYDDWGRPWYKHETFPLR
MNYGEGYNLFLEDETVRFRISAKPYNHVKGELRGTQDQFDLLAQAIEDDDWHVGTAEALCRNGREELHVTVTNETAEVGAKTAAETVIGVDINEDCIALA
ALTEHGIEDSIVIDYPEIKEERHRYFTMRKRMQEAGQTAFNDVFRDKEQRFVHDQLHTVSRRVVEWIQRLRFDSPVIVFEDLTDMRDDIEYGTRMNRRLH
SLPFAKLRDFITYKAAWHGIPSDDVDPEYTSQQCPVCGHTERANRHKKRFKCRECEHQDHADRGAGVSVGQKWLRKQNNRNVPALNTLPQVRKWELRRQA
SGPVDGPTVASDTAQGHHPDGIRGVSDQSTGRSSGA
MNYGEGYNLFLEDETVRFRISAKPYNHVKGELRGTQDQFDLLAQAIEDDDWHVGTAEALCRNGREELHVTVTNETAEVGAKTAAETVIGVDINEDCIALA
ALTEHGIEDSIVIDYPEIKEERHRYFTMRKRMQEAGQTAFNDVFRDKEQRFVHDQLHTVSRRVVEWIQRLRFDSPVIVFEDLTDMRDDIEYGTRMNRRLH
SLPFAKLRDFITYKAAWHGIPSDDVDPEYTSQQCPVCGHTERANRHKKRFKCRECEHQDHADRGAGVSVGQKWLRKQNNRNVPALNTLPQVRKWELRRQA
SGPVDGPTVASDTAQGHHPDGIRGVSDQSTGRSSGA
Blast result :
Comments
ISHwa17 is 85% (TnpA : the transposase) and 94% (TnpB) aa similar to ISNph7.
The transposase is not active : there is a STOP codon in frame and the STOP codon is missing at the end of the sequence.
The transposase is not active : there is a STOP codon in frame and the STOP codon is missing at the end of the sequence.
References
1] Bolhuis, H., Palm, P., Wende, A., Falb, M., Rampp, M., Rodriguez-Valera, F., Pfeiffer, F., and Oesterhelt, D. (2006)BMC Genomics 7, 169
2] Dyall-Smith,M.L., Pfeiffer,F., Klee,K., Palm,P., Gross,K., Schuster,S.C., Rampp, M., and Oesterhelt,D. (2011) PLoS One 6: e20968
2] Dyall-Smith,M.L., Pfeiffer,F., Klee,K., Palm,P., Gross,K., Schuster,S.C., Rampp, M., and Oesterhelt,D. (2011) PLoS One 6: e20968