ISHwa15
- Family ISNCY
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
FR746099 | ND | Haloquadratum walsbyi | Haloquadratum walsbyi |
DNA section
IS Length : 1845 bp
Ends
IR Length : 13/14
IRL : AGGAGTTCGGAACGACAGATTTTGTGCACGAGACGGGTGGCTAACCAGGC
IRR : AGGAGTTCGGAATGTGCATCAACTGCTGGTGGGTGGTAGTTAATATCACC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTGTGTCTGC | GCCTCGTC | GTCGCAATCA | 8 |
DNA sequence
AGGAGTTCGGAACGACAGATTTTGTGCACGAGACGGGTGGCTAACCAGGCTTGGTTACCAGTGTTGAGAGGATGTCACAGAAGACACCCGAGTTCGAGCC
GGTCATCAGGCGGCTCGAACGCCAAGCTGCCGAGTCGATATATAAAGACACCGATCTCCTCACCGTTGTCACTGAGTTCGCGTTTACTGATACGCCCGTT
TCTGGCACGGCTGCAAAGGAATGGGAGGCGAAGTACGGCTATCGACCGATGGTATGCGCGTTCTACTGCAAGGAACTTGCGGGATTCATGACTACAGGAC
TCTACGAGTATCTTGCCGACGCCGAGCGTGCCTGCGCCCGCACGCTCGGCTTCGATCCCGAGCAATTCGCTCCTGACAAGACTGCTCCAGACCGAACTAC
ACTTGGCCGCACATGGCGTGATCGCTTCTCTGACCGGCTCAAATCGTTCATTACACAGTCTGCACAGCGTATCCTTGCGGTTGCCCACGAAATGGGCAAC
CCGCTCGGAATGCGCGCACTCAAACCAACCAACAAGACCGACTGTTCGAACCGGACGGAGCGACGCTACGTCACCGAGAAGGCCAAGGATGTGACGGATG
CACTCTGTCAGATTGTCTTTCCTGCAATCGATCTTCAGAGTTATCTCGGATTGACCGGCACAGCTGCCAATCAAGGCAGCCAGATGTTCGACGAAGAAAC
GACTCGCGAAGCGGGTGGACCCGCTGGTGATACGCATCTCCGGTATATCAAGCAACTCGACGTGATGGAGATCGCGTCGATGATCAATGGTGCAATTGGT
GAGATGATTTGGGCTGCCGAGCAGTACTATTCGATAGATCGCCACGTCGACGTGGCGATCGACATCACATACGTCGCGTACTACGGTAACCGCGATGAGT
TCCAGATGAGCACCGGCGCTCCACCGAGCAAGTCTTATTCGTGGTGTTACAAGATGGCAACAATCTCGATCGTTGGTGAGGAAGTGAAATTTACGCTCGG
GATGCGGCCGCTTCGCGGCTACATCCCTCGGAGCGTGCTCGTGGAACAACTGATCGATATAGCAAGCGATCATGTCTCGCTCGAGACAATGTATGCCGAT
GCTGAATTCGACAGCATCGGCGTCATTGACGCTCTCGAAGAGAAGGGCCTCTCGTACCTGATTCGCAAATCTTCGGACGACCGCGTTGATCGGTTTGTCG
TCGACATGGATCACGATGTAGCGGTCAAGCAATCCCACGAGATGAAGAAGACGAGCGGGGGCGAATCTGTGACGGTCACCCCGACCCTGGTCGGGGTTCC
CCCAACCCGGAAGGAAGATGAAACGGTGACGTTTATGACGAATCTCCAGGTGAGCGATAGCACGAAAGCAGCACGCGGGCGTACCCGCCGCATCATGGGG
CGGTACGCTCGGAGGTGGGGGATTGAGAACAGCTACAAGTCAATCAAAGATTTCCTGGCATGGACGACATCACGGAATACGGCAGTACGAGTATTCTACT
TCAGCTTTGTGGTGATTCTCTATGATATGTGGTTGGTGGTGGATCTGCTGGTACAGATCAGTCTCACCGTCAAGCAGCGACTGAAACCGCGTGTGCCTGC
CCGGACGTTCCTGAATATCGTGCGCAAGGAGATGCCCGTGACGTAGGCGTTCCGCCGCGCCGGGCGGAACGCCGACTCGTGAGATACATCTCTCTGCAGT
CCGGTCATTTTCGCCCCACTATTCGTCGAAATCAGTAGTTCGTTCGATTATTTCCCCAGGCAAAATGTAGTCCAATATTATTCCACCAATATATTGGTGA
TATTAACTACCACCCACCAGCAGTTGATGCACATTCCGAACTCCT
GGTCATCAGGCGGCTCGAACGCCAAGCTGCCGAGTCGATATATAAAGACACCGATCTCCTCACCGTTGTCACTGAGTTCGCGTTTACTGATACGCCCGTT
TCTGGCACGGCTGCAAAGGAATGGGAGGCGAAGTACGGCTATCGACCGATGGTATGCGCGTTCTACTGCAAGGAACTTGCGGGATTCATGACTACAGGAC
TCTACGAGTATCTTGCCGACGCCGAGCGTGCCTGCGCCCGCACGCTCGGCTTCGATCCCGAGCAATTCGCTCCTGACAAGACTGCTCCAGACCGAACTAC
ACTTGGCCGCACATGGCGTGATCGCTTCTCTGACCGGCTCAAATCGTTCATTACACAGTCTGCACAGCGTATCCTTGCGGTTGCCCACGAAATGGGCAAC
CCGCTCGGAATGCGCGCACTCAAACCAACCAACAAGACCGACTGTTCGAACCGGACGGAGCGACGCTACGTCACCGAGAAGGCCAAGGATGTGACGGATG
CACTCTGTCAGATTGTCTTTCCTGCAATCGATCTTCAGAGTTATCTCGGATTGACCGGCACAGCTGCCAATCAAGGCAGCCAGATGTTCGACGAAGAAAC
GACTCGCGAAGCGGGTGGACCCGCTGGTGATACGCATCTCCGGTATATCAAGCAACTCGACGTGATGGAGATCGCGTCGATGATCAATGGTGCAATTGGT
GAGATGATTTGGGCTGCCGAGCAGTACTATTCGATAGATCGCCACGTCGACGTGGCGATCGACATCACATACGTCGCGTACTACGGTAACCGCGATGAGT
TCCAGATGAGCACCGGCGCTCCACCGAGCAAGTCTTATTCGTGGTGTTACAAGATGGCAACAATCTCGATCGTTGGTGAGGAAGTGAAATTTACGCTCGG
GATGCGGCCGCTTCGCGGCTACATCCCTCGGAGCGTGCTCGTGGAACAACTGATCGATATAGCAAGCGATCATGTCTCGCTCGAGACAATGTATGCCGAT
GCTGAATTCGACAGCATCGGCGTCATTGACGCTCTCGAAGAGAAGGGCCTCTCGTACCTGATTCGCAAATCTTCGGACGACCGCGTTGATCGGTTTGTCG
TCGACATGGATCACGATGTAGCGGTCAAGCAATCCCACGAGATGAAGAAGACGAGCGGGGGCGAATCTGTGACGGTCACCCCGACCCTGGTCGGGGTTCC
CCCAACCCGGAAGGAAGATGAAACGGTGACGTTTATGACGAATCTCCAGGTGAGCGATAGCACGAAAGCAGCACGCGGGCGTACCCGCCGCATCATGGGG
CGGTACGCTCGGAGGTGGGGGATTGAGAACAGCTACAAGTCAATCAAAGATTTCCTGGCATGGACGACATCACGGAATACGGCAGTACGAGTATTCTACT
TCAGCTTTGTGGTGATTCTCTATGATATGTGGTTGGTGGTGGATCTGCTGGTACAGATCAGTCTCACCGTCAAGCAGCGACTGAAACCGCGTGTGCCTGC
CCGGACGTTCCTGAATATCGTGCGCAAGGAGATGCCCGTGACGTAGGCGTTCCGCCGCGCCGGGCGGAACGCCGACTCGTGAGATACATCTCTCTGCAGT
CCGGTCATTTTCGCCCCACTATTCGTCGAAATCAGTAGTTCGTTCGATTATTTCCCCAGGCAAAATGTAGTCCAATATTATTCCACCAATATATTGGTGA
TATTAACTACCACCCACCAGCAGTTGATGCACATTCCGAACTCCT
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1575 bp | 524 aa | 72 | 1646 | + | No |
Chemistry : Unknow
ORF sequence :
MSQKTPEFEPVIRRLERQAAESIYKDTDLLTVVTEFAFTDTPVSGTAAKEWEAKYGYRPMVCAFYCKELAGFMTTGLYEYLADAERACARTLGFDPEQFA
PDKTAPDRTTLGRTWRDRFSDRLKSFITQSAQRILAVAHEMGNPLGMRALKPTNKTDCSNRTERRYVTEKAKDVTDALCQIVFPAIDLQSYLGLTGTAAN
QGSQMFDEETTREAGGPAGDTHLRYIKQLDVMEIASMINGAIGEMIWAAEQYYSIDRHVDVAIDITYVAYYGNRDEFQMSTGAPPSKSYSWCYKMATISI
VGEEVKFTLGMRPLRGYIPRSVLVEQLIDIASDHVSLETMYADAEFDSIGVIDALEEKGLSYLIRKSSDDRVDRFVVDMDHDVAVKQSHEMKKTSGGESV
TVTPTLVGVPPTRKEDETVTFMTNLQVSDSTKAARGRTRRIMGRYARRWGIENSYKSIKDFLAWTTSRNTAVRVFYFSFVVILYDMWLVVDLLVQISLTV
KQRLKPRVPARTFLNIVRKEMPVT
PDKTAPDRTTLGRTWRDRFSDRLKSFITQSAQRILAVAHEMGNPLGMRALKPTNKTDCSNRTERRYVTEKAKDVTDALCQIVFPAIDLQSYLGLTGTAAN
QGSQMFDEETTREAGGPAGDTHLRYIKQLDVMEIASMINGAIGEMIWAAEQYYSIDRHVDVAIDITYVAYYGNRDEFQMSTGAPPSKSYSWCYKMATISI
VGEEVKFTLGMRPLRGYIPRSVLVEQLIDIASDHVSLETMYADAEFDSIGVIDALEEKGLSYLIRKSSDDRVDRFVVDMDHDVAVKQSHEMKKTSGGESV
TVTPTLVGVPPTRKEDETVTFMTNLQVSDSTKAARGRTRRIMGRYARRWGIENSYKSIKDFLAWTTSRNTAVRVFYFSFVVILYDMWLVVDLLVQISLTV
KQRLKPRVPARTFLNIVRKEMPVT
Blast result :
Comments
In the genome sequence, this IS is interrupted by a 108 bp repeat after base 848 (after transposase amino acid 259).
ISHwa15 is 40% aa similar to ISH7B.
ISHwa15 is 40% aa similar to ISH7B.
References
1] Dyall-Smith,M.L., Pfeiffer,F., Klee,K., Palm,P., Gross,K., Schuster,S.C., Rampp, M., and Oesterhelt,D. (2011) PLoS One 6: e20968