ISHal1
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AOLL01000028 | ND | Haloferax alexandrinus | Haloferax volcanii DS2 Haloferax alexandrinus JCM 10717 |
DNA section
IS Length : 1846 bp
Ends
Left end : TTTCAAGGCGAGAATCCCGCCGTTTACGGCGGGCGTGAAGCCGACAACTACCTCACGTTCTACCGTCGGTTGCAGGCCGAATACCCCACGACTACGAGCC II struct. : Yes
Right end : GCTCACGGCCTACGAAGAGTCGAAACCCTCTGCCAGCGACTACTGACTGGTATTCCCCATGCGTGGGAAGCCTCGCCGTTTACGGCGAGGAGGATGTCAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GACCTGCTAT | CTTT | CGACGATTAC | TCAC |
CACTCGCTTC | CTTT | CAGACGGGTC | TCAc |
DNA sequence
TTTCAAGGCGAGAATCCCGCCGTTTACGGCGGGCGTGAAGCCGACAACTACCTCACGTTCTACCGTCGGTTGCAGGCCGAATACCCCACGACTACGAGCC
TTTATTAGAATTGGGTGTGTAGAAACTAGTACGGCCAAATGAAGTACAACCTTGAAACCGGGTCGCACACGGTCTACGCGCTCCAATATCACTTCGTGAC
CGTCACGAAGTACCGCGCAGACCTCCTCACCGACGAAATCGCAGAGCGCATCGGTGAGATTGCCAGCGACATCTCCGAGGACTTCGGCGTGAACATCCAG
AACGTCAACGGCGGAAGCGACCACGTTCACATCCTCTTCACGGCGAAGCCAACGACCGACCTCACCAAGTTCATCAACTCACTCAAGGGCGTTACGTCCC
GCAAAATCCGTGACGAGAACCCCGAGGTTCGGCAGGCGCCCGACAAAGCGTTCTGGCAACCGGGATACTTCCTCGCCACCACCGGCCAAGTGAGTATCGA
CGTACTCATGAAGTACGTGGAGGAGCAGTAGCGTGTCTCCGACCGTCACGAAGACGTTGCAGGCGACGTTCGCGCCACCCACCGCGCACAAGCAGTCGAA
ACTCAACGACCTGCTGGAAACCTACCGTGACGGTCTGCAAGAGGCGTTCGACGCCGGGGCGAGTACCATGTCGGCGGTGAGTGAAATCGTGACGCCCTAC
GACCTGCCGTATCAGGCCAAAGCCGCGCTCTGCAACTACGTCCCGAAACTCCGCAAGACGTACAGCGCGAAGGAGTTGGACGACAGCCACCCGATACGGC
TCACGAACCAAGCCGCGAAGTTCGACCACTCCGCCGAACGCGACTACGAGTTCACGTGGTGGGTTCCCCGTCCCGGTCGGGGAACGAACTTCTGGATACC
GCTCCGTATCAACCCCGAACAGGAAGGCCTCTGGCACGACCTCGTATCGGAGGACGCGAAGGCGGGTGAGATACGGCTTCAGAAGCATCGGAAGAACTGG
GTACTGCACGTCACCGTCGAGTACCCGGTCGAAGAACCAGCGACGGACGGTGACGCCACGCACATCGGCTTAGACATCGGAGAAACCGCTCTCATCACGG
GCTGTGCCCTCAAGGATGGTTCTCCGACTGATCCGTTCGTGTGTAGCGGAAGCAGAGCGAAGCATCTCCGCAAAGAGATGCACACGACCCTGAAACGACT
CCAAGAGCGTGACGCATCCGAGTGGCGGATTGAAGACCGATTCAGCCACTACCAGAACGCGCTCACCGACATCGTGGAGAAAGTGTCTCGGCAAGCCGTC
GAGTACGCCCAACAGTTCGAGAACCCGGTGTTGGTGATGGAGGACTTAACGTACATCCGTGAACGGCTTGACTACGGGAAGTACATGAACCGTCGCCTTC
ACTCGTGGGCGTTCGCCCGACTCCAAGGGCGCATCGAGGACAAGGCGACGGAAGCAGGCATTCCGGTCGAGTACGTGAATCCGGCGTACACCTCGCAGAC
GTGCCACTCGTGCCACCGTATCGGTCGGCGGGACTCACAAGCCGAGTTCCGGTGTCCGAACGACGACTGCCACGTTTCGACGTTTCAGGCCGACATCAAC
GCTTCCGCTAATATCGCACGACGGGTTGACCCGTGGGGAGAGAGCGTCCCGCTTGACAAGGCGGAACGCGATGACTCGCCACGGGATGGGAGCAGTTGTG
ACACCGCCACGACTCACCGTGAGACGAGCGTACCAGCGCAGATGACGCTCACGGCCTACGAAGAGTCGAAACCCTCTGCCAGCGACTACTGACTGGTATT
CCCCATGCGTGGGAAGCCTCGCCGTTTACGGCGAGGAGGATGTCAC
TTTATTAGAATTGGGTGTGTAGAAACTAGTACGGCCAAATGAAGTACAACCTTGAAACCGGGTCGCACACGGTCTACGCGCTCCAATATCACTTCGTGAC
CGTCACGAAGTACCGCGCAGACCTCCTCACCGACGAAATCGCAGAGCGCATCGGTGAGATTGCCAGCGACATCTCCGAGGACTTCGGCGTGAACATCCAG
AACGTCAACGGCGGAAGCGACCACGTTCACATCCTCTTCACGGCGAAGCCAACGACCGACCTCACCAAGTTCATCAACTCACTCAAGGGCGTTACGTCCC
GCAAAATCCGTGACGAGAACCCCGAGGTTCGGCAGGCGCCCGACAAAGCGTTCTGGCAACCGGGATACTTCCTCGCCACCACCGGCCAAGTGAGTATCGA
CGTACTCATGAAGTACGTGGAGGAGCAGTAGCGTGTCTCCGACCGTCACGAAGACGTTGCAGGCGACGTTCGCGCCACCCACCGCGCACAAGCAGTCGAA
ACTCAACGACCTGCTGGAAACCTACCGTGACGGTCTGCAAGAGGCGTTCGACGCCGGGGCGAGTACCATGTCGGCGGTGAGTGAAATCGTGACGCCCTAC
GACCTGCCGTATCAGGCCAAAGCCGCGCTCTGCAACTACGTCCCGAAACTCCGCAAGACGTACAGCGCGAAGGAGTTGGACGACAGCCACCCGATACGGC
TCACGAACCAAGCCGCGAAGTTCGACCACTCCGCCGAACGCGACTACGAGTTCACGTGGTGGGTTCCCCGTCCCGGTCGGGGAACGAACTTCTGGATACC
GCTCCGTATCAACCCCGAACAGGAAGGCCTCTGGCACGACCTCGTATCGGAGGACGCGAAGGCGGGTGAGATACGGCTTCAGAAGCATCGGAAGAACTGG
GTACTGCACGTCACCGTCGAGTACCCGGTCGAAGAACCAGCGACGGACGGTGACGCCACGCACATCGGCTTAGACATCGGAGAAACCGCTCTCATCACGG
GCTGTGCCCTCAAGGATGGTTCTCCGACTGATCCGTTCGTGTGTAGCGGAAGCAGAGCGAAGCATCTCCGCAAAGAGATGCACACGACCCTGAAACGACT
CCAAGAGCGTGACGCATCCGAGTGGCGGATTGAAGACCGATTCAGCCACTACCAGAACGCGCTCACCGACATCGTGGAGAAAGTGTCTCGGCAAGCCGTC
GAGTACGCCCAACAGTTCGAGAACCCGGTGTTGGTGATGGAGGACTTAACGTACATCCGTGAACGGCTTGACTACGGGAAGTACATGAACCGTCGCCTTC
ACTCGTGGGCGTTCGCCCGACTCCAAGGGCGCATCGAGGACAAGGCGACGGAAGCAGGCATTCCGGTCGAGTACGTGAATCCGGCGTACACCTCGCAGAC
GTGCCACTCGTGCCACCGTATCGGTCGGCGGGACTCACAAGCCGAGTTCCGGTGTCCGAACGACGACTGCCACGTTTCGACGTTTCAGGCCGACATCAAC
GCTTCCGCTAATATCGCACGACGGGTTGACCCGTGGGGAGAGAGCGTCCCGCTTGACAAGGCGGAACGCGATGACTCGCCACGGGATGGGAGCAGTTGTG
ACACCGCCACGACTCACCGTGAGACGAGCGTACCAGCGCAGATGACGCTCACGGCCTACGAAGAGTCGAAACCCTCTGCCAGCGACTACTGACTGGTATT
CCCCATGCGTGGGAAGCCTCGCCGTTTACGGCGAGGAGGATGTCAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
393 bp | 130 aa | 139 | 531 | + | No |
Chemistry : Y1
ORF sequence :
MKYNLETGSHTVYALQYHFVTVTKYRADLLTDEIAERIGEIASDISEDFGVNIQNVNGGSDHVHILFTAKPTTDLTKFINSLKGVTSRKIRDENPEVRQA
PDKAFWQPGYFLATTGQVSIDVLMKYVEEQ
PDKAFWQPGYFLATTGQVSIDVLMKYVEEQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1260 bp | 419 aa | 533 | 1792 | + | No |
AG : TnpB
ORF sequence :
MSPTVTKTLQATFAPPTAHKQSKLNDLLETYRDGLQEAFDAGASTMSAVSEIVTPYDLPYQAKAALCNYVPKLRKTYSAKELDDSHPIRLTNQAAKFDHS
AERDYEFTWWVPRPGRGTNFWIPLRINPEQEGLWHDLVSEDAKAGEIRLQKHRKNWVLHVTVEYPVEEPATDGDATHIGLDIGETALITGCALKDGSPTD
PFVCSGSRAKHLRKEMHTTLKRLQERDASEWRIEDRFSHYQNALTDIVEKVSRQAVEYAQQFENPVLVMEDLTYIRERLDYGKYMNRRLHSWAFARLQGR
IEDKATEAGIPVEYVNPAYTSQTCHSCHRIGRRDSQAEFRCPNDDCHVSTFQADINASANIARRVDPWGESVPLDKAERDDSPRDGSSCDTATTHRETSV
PAQMTLTAYEESKPSASDY
AERDYEFTWWVPRPGRGTNFWIPLRINPEQEGLWHDLVSEDAKAGEIRLQKHRKNWVLHVTVEYPVEEPATDGDATHIGLDIGETALITGCALKDGSPTD
PFVCSGSRAKHLRKEMHTTLKRLQERDASEWRIEDRFSHYQNALTDIVEKVSRQAVEYAQQFENPVLVMEDLTYIRERLDYGKYMNRRLHSWAFARLQGR
IEDKATEAGIPVEYVNPAYTSQTCHSCHRIGRRDSQAEFRCPNDDCHVSTFQADINASANIARRVDPWGESVPLDKAERDDSPRDGSSCDTATTHRETSV
PAQMTLTAYEESKPSASDY
Blast result :
Comments
ISHal1 is 89% (TnpA : the transposase) and 82% (TnpB) aa similar to ISNph5.
References
1] Hartman, A.L., Norais, C., Badger, J.H., Delmas, S., Haldenby, S., Madupu, R., Robinson, J., Khouri, H., Ren, Q., Lowe, T.M., Maupin-Furlow, J., Pohlschroder, M., Daniels, C., Pfeiffer, F.,
Allers, T., Eisen, J.A. (2010) PLoS ONE 5, E9605-E9605
Allers, T., Eisen, J.A. (2010) PLoS ONE 5, E9605-E9605