ISHwa8
- Family IS4
- Group IS4Sa
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM180088 | ND | Haloquadratum walsbyi | Haloquadratum walsbyi C23; Haloquadratum walsbyi HBSQ001 |
DNA section
IS Length : 1509 bp
Ends
IR Length : 20/23
IRL : CTGTCGCGTTCTGAGTATATAAAAACGGCGTGAGCCGTTAGTGACGACCC
IRR : CTGTCGCGTTCTAAGTTGATAAAGCCCGGCGAAATTCTCTCCCAACGAAC
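The "20/23" figure can be reproduced from the two terminal sequences above. The following is a minimal sketch, assuming that 23 is the length of the compared span and 20 the number of identical positions, and that IRR is listed as the reverse complement of the right end (both ends read from the outside inwards). The helper names and the `RIGHT_END` constant (the last 23 nt of the DNA sequence in this entry) are illustrative only.

```python
# Terminal sequences copied from the Ends section of this entry.
IRL = "CTGTCGCGTTCTGAGTATATAAAAACGGCGTGAGCCGTTAGTGACGACCC"
IRR = "CTGTCGCGTTCTAAGTTGATAAAGCCCGGCGAAATTCTCTCCCAACGAAC"

# Last 23 nt of the DNA sequence as listed in this entry (top strand).
RIGHT_END = "TTTATCAACTTAGAACGCGACAG"

# Assumption: the denominator of "IR Length : 20/23" is the compared span.
IR_SPAN = 23


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` nucleotides."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


matches = count_matches(IRL, IRR, IR_SPAN)
print(f"{matches}/{IR_SPAN}")  # prints 20/23

# The listed IRR should equal the reverse complement of the right end.
right_end_ok = reverse_complement(RIGHT_END) == IRR[:IR_SPAN]
print(right_end_ok)  # prints True
```

The three mismatches fall at positions 13, 17 and 18 of the compared span.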
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCCCATATCTTG | CAATTGTG | TTCTTATCTCTT | 8 |
TGAGCCAGTGTG | GATAAGTT | TGAATATACACC | 8 |
TGTCTGGTCGTT | GTCACCGTA | TGAAACGCTAGA | 9 |
CTTTGAGTTTCA | AATAGGTC | CCTGCCTCTCGT | 8 |
TATCATTTCGAG | GCTTGAAG | TCGAAACTGTCC | 8 |
GTGATGTATTTGTCGC | | AACGTGGCAACGACCA | 0 |
TCTTCATGCCTGTCAC | | TGAATATCCCTCCTTC | 0 |
CGCTTATATGCT | CTCTATA | AGTCAATTATTT | 7 |
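The DR Length column can be recomputed from the Direct repeat column. The sketch below tallies the lengths. It assumes that the two rows with a DR length of 0 have an empty direct repeat, and the variable names are illustrative only.

```python
from collections import Counter

# Direct repeats in table order; the two empty strings stand for the rows
# whose DR length is 0 (assumed to carry no direct repeat).
direct_repeats = [
    "CAATTGTG",
    "GATAAGTT",
    "GTCACCGTA",
    "AATAGGTC",
    "GCTTGAAG",
    "",
    "",
    "CTCTATA",
]

lengths = [len(dr) for dr in direct_repeats]
length_counts = Counter(lengths)

print(lengths)                       # prints [8, 8, 9, 8, 8, 0, 0, 7]
print(length_counts.most_common(1))  # prints [(8, 4)]
```

In this set of eight insertions, 8 bp is the most frequent direct repeat length (four occurrences).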
DNA sequence
CTGTCGCGTTCTGAGTATATAAAAACGGCGTGAGCCGTTAGTGACGACCCGTAATGTCTGTGAATTTGATTCAGCAAACAGGCCGTCAGAAGACGTAACG
GACGCCTCTAACATCAATTCTACTGATCTCGTCGCTGACGAACTGTACTCGCTGCTCGACGAAGTCGACAGCGAGTCGATTGCCGATGAGTTCAAGATTG
GGCTCCATTCCGACAAACACGACTTCACAACCCACGTCAAGACGGCTGTTCGTGAGGGTCTCGACCCCTCCAGCTCACTCGCTGAACTTGAGGATAAGAC
CATTGCCGACGGCTCACTCGAACACATGCCTAAATCACGGTTTTCAGAACTCACGAACGACCGCGACTACTGCGCGGTCGTTCAGCTCCTCTTCGAAGTA
CTGCACACACCACAGTTGTATCATCAACGTGGAGTTCAACGAAAACGACTGGAGTGGATGACACGAGATGTCGTCGCTGTTGATGCCACAAACCTCGAAC
TTACGCGCTCTGTTGTCGTCTCAGACGAGTTCGTCGGAGACGACGACAAGGTCTACAAGATCGACACAGACGATGGTGGTCTCGAACTTCACTGCGCAGC
ACGTGTGGATGGAGAAAATAAACATCCACTCGACGCTACTGTTACAGAGGGCGATACCCACGAAAGCCCGCAGTTCGATCTCCTCAAGGAAGATGTCGAG
GTCTTTGCAGACCTCGACTCGGTAATCTGGGTCTGTGACCGCGCATACACGCGATATCTACGGTTCTGTGAGATCAAGCATAGTGACAATGACTTCGTCA
CACTGATGTACTCAGACGCTCGGTTTGAACTCACTGAGACACTCGAAGAGTTCGAGGTCACCGTCTCAGGGAACAACGCCGCTCAGCCGACTCACTCAGA
TGAGGAATCAACACGACGGGTTCGTGATGAACGAATTGAGTTAGCTGAGACTGGTGAAGAGTTCCGCAGGATTGTGCTGGAAACGCCCGATGGAGAAGAG
ATTGAGTACCTGACGACGCTGGCGTCGTCAGAGTACGATCCAATCGACGTGATCAACATCTACACACTACGGACAGTGATTGAGATCCTCTTCCGAGAGT
GGAAACAGTACCTCAACATCGAGAACTTTCACTCGAAATCGCTGAACGGCGTGTTATTCGAGCTGTTCTGTGCATTGATTGGATATATGCTGGTCGTATG
GTTTCGCCAACGCCACCCAGTCAAGGGTGGCGTGGCGTGTGCTATCCAGAAAGTTCGCACCTTCTGGAATGAGACGCTAGATTCATTCGGCTAATCAGTC
TTCACTTCCCGGCGACAGCCGTCACCCACCAGCAACGGCTGTCGCCTGTGCAGCGATCATTCATCATTTCTTTCCCTGAACTGCTATTTCTCTTGGCTCA
ACACGGATTTCACCGACAACTGTTCGATGAGTGAACTCCCATACGATTCTTACCCAGCAGTTCGTTGGGAGAGAATTTCGCCGGGCTTTATCAACTTAGA
ACGCGACAG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1254 bp | 417 aa | 41 | 1294 | + | No |
Chemistry : DDE
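The ORF coordinates and lengths above are mutually consistent, which the following sketch checks. It assumes 1-based, inclusive coordinates on the + strand and that the last codon of the ORF is a stop codon, so it is not counted in the amino acid length. The two constants are the first line (positions 1 to 100) and the thirteenth line (positions 1201 to 1300) of the DNA sequence in this entry. The names are illustrative only.

```python
BEGIN, END = 41, 1294  # ORF 1 coordinates from the table (1-based, inclusive)

# Positions 1-100 of the DNA sequence in this entry.
SEQ_1_100 = (
    "CTGTCGCGTTCTGAGTATATAAAAACGGCGTGAGCCGTTAGTGACGACCC"
    "GTAATGTCTGTGAATTTGATTCAGCAAACAGGCCGTCAGAAGACGTAACG"
)
# Positions 1201-1300 of the DNA sequence in this entry.
SEQ_1201_1300 = (
    "GTTTCGCCAACGCCACCCAGTCAAGGGTGGCGTGGCGTGTGCTATCCAGA"
    "AAGTTCGCACCTTCTGGAATGAGACGCTAGATTCATTCGGCTAATCAGTC"
)
OFFSET = 1200  # number of bases preceding SEQ_1201_1300

orf_length_bp = END - BEGIN + 1          # inclusive span
codons = orf_length_bp // 3
protein_length_aa = codons - 1           # assumption: final codon is a stop

start_codon = SEQ_1_100[BEGIN - 1:BEGIN + 2]
stop_codon = SEQ_1201_1300[END - 3 - OFFSET:END - OFFSET]

print(orf_length_bp, codons, protein_length_aa)  # prints 1254 418 417
print(start_codon, stop_codon)                   # prints GTG TAA
```

The ORF starts with GTG, an alternative initiation codon that is read as methionine at the start of translation, which agrees with the M at the beginning of the ORF sequence.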
ORF sequence :
MTTRNVCEFDSANRPSEDVTDASNINSTDLVADELYSLLDEVDSESIADEFKIGLHSDKHDFTTHVKTAVREGLDPSSSLAELEDKTIADGSLEHMPKSR
FSELTNDRDYCAVVQLLFEVLHTPQLYHQRGVQRKRLEWMTRDVVAVDATNLELTRSVVVSDEFVGDDDKVYKIDTDDGGLELHCAARVDGENKHPLDAT
VTEGDTHESPQFDLLKEDVEVFADLDSVIWVCDRAYTRYLRFCEIKHSDNDFVTLMYSDARFELTETLEEFEVTVSGNNAAQPTHSDEESTRRVRDERIE
LAETGEEFRRIVLETPDGEEIEYLTTLASSEYDPIDVINIYTLRTVIEILFREWKQYLNIENFHSKSLNGVLFELFCALIGYMLVVWFRQRHPVKGGVAC
AIQKVRTFWNETLDSFG
Blast result :
Comments
The integration sites are from two different strains of Haloquadratum walsbyi, HBSQ001 and C23.
ISHwa8 is 44% aa similar to ISFac10.
Additional accession number: FR746099.
References
[1] Bolhuis, H., Palm, P., Wende, A., Falb, M., Rampp, M., Rodriguez-Valera, F., Pfeiffer, F., and Oesterhelt, D. (2006) BMC Genomics 7, 169.
[2] Dyall-Smith, M.L., Pfeiffer, F., Klee, K., Palm, P., Gross, K., Schuster, S.C., Rampp, M., and Oesterhelt, D. (2011) PLoS One 6, e20968.