ISHahl1
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AP017372.2 | ND | Halorhodospira halochloris | Halorhodospira halochloris; Halorhodospira halochloris DSM 1059 |
DNA section
IS Length : 1813 bp
Ends
Left end : CAAAACGCGCCGCAGTGCTCCGCCTATAGCCGCGCAGCGGCTTAGGGGGTGGTGAAGCGGCGCATGGCGCGCAGCGCCATTACTCTGGGGCTTCCTGTTG II struct. : Yes
Right end : CGTGGAGGGAGGTCAGACCGGGTCATCCTTTGGATGAATCGGCACTCCCGTCGAAGCGCGAAACTCACGCCATAGCGTAAGCTTGGCGGGAGTAGTTCAT II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GGAGGGTAGGTGAGTC | CTAC | CGGGCACTGCCGGATGC | TCAT |
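As a quick consistency check on the ends and cleavage sites listed above, the following minimal Python sketch compares the end sequences with the stated cleavage tetranucleotides. The helper names (`is_dna`, `ends_with_site`, `starts_with_site`) are hypothetical, and the check is plain string comparison on values copied from this record, not an ISfinder tool.

```python
# Values copied from the "Ends" and "Insertion site" entries of this record.
LEFT_END = "CAAAACGCGCCGCAGTGCTCCGCCTATAGCCGCGCAGCGGCTTAGGGGGTGGTGAAGCGGCGCATGGCGCGCAGCGCCATTACTCTGGGGCTTCCTGTTG"
RIGHT_END = "CGTGGAGGGAGGTCAGACCGGGTCATCCTTTGGATGAATCGGCACTCCCGTCGAAGCGCGAAACTCACGCCATAGCGTAAGCTTGGCGGGAGTAGTTCAT"
LE_CLEAVAGE_SITE = "CTAC"
RE_CLEAVAGE_SITE = "TCAT"


def is_dna(seq: str) -> bool:
    """Return True if seq is non-empty and contains only A, C, G, T."""
    return bool(seq) and set(seq.upper()) <= set("ACGT")


def ends_with_site(end_seq: str, site: str) -> bool:
    """Return True if the last bases of end_seq equal the cleavage site."""
    return end_seq.upper().endswith(site.upper())


def starts_with_site(end_seq: str, site: str) -> bool:
    """Return True if the first bases of end_seq equal the cleavage site."""
    return end_seq.upper().startswith(site.upper())


if __name__ == "__main__":
    print("ends are plain DNA:", is_dna(LEFT_END) and is_dna(RIGHT_END))
    print("RE site closes the right end:", ends_with_site(RIGHT_END, RE_CLEAVAGE_SITE))
    print("LE site opens the left end:", starts_with_site(LEFT_END, LE_CLEAVAGE_SITE))
```

With the values in this record, the RE cleavage site matches the last four bases of the right end, while the LE cleavage site does not occur at the start of the left end. This would be consistent with the LE site lying in the flanking DNA rather than inside the element, although the record itself does not say so.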
DNA sequence
CAAAACGCGCCGCAGTGCTCCGCCTATAGCCGCGCAGCGGCTTAGGGGGTGGTGAAGCGGCGCATGGCGCGCAGCGCCATTACTCTGGGGCTTCCTGTTG
TTCGATGTACTGCTTCAAGACCGACAGCGGCGCACCACCGACGGTAATGATGCAGTAGCTGCGACTCCAGAATACCGGTTTGCGCCAGTAATACTGTTGA
AGATGCTCCCCGAACTCCTTGCGGATGAGCCGGGAGGTCACCGTCTTTAGATTGTTGACGAAGGCGGAGGGGGCGGTGCGCGGCGTCAACTCCAAGAGCA
GGTGCAGATGGTCGGGCTCGCCGTTGAACTCCAGCACCTCGCCTTCCCACTTTTCGGTCGTCGCCCGAGCGATTGCCTCTAAGCGGGAGAGCATCGGTTG
GGTGATGCACGGGCGTCGATACTTGGTAACCACAACCAAGTGGTATTGAAGTCGGAAGGTGCAGTGATATAGCTGCTTTAGTGGTTGCGATTCCATAACA
ACCAAGTATACTGGAAACATGATCCAACGCAAAGCCACCTATCGACTTTACCCAACAGCTGAGCAGCTTGCCGCCTTGGAGCATCAGCTGTGGGTTCACT
GCCTGCTGTGGAACGAGGCGCTCGCCGAGCGGCGCTGGGCTTGGAGCGAGCGGCAGGAGTCGTTGGGGTTCTCGGCACAGTGCAAGCGGCTGACCGAGTG
GCGTAGCCAATCGAAACTTCTGAGTGGACTCAATGCCCAGTCTGAGCAGGTCACCCTCAAACGGTTGGATCTGGCGTTTCAGCACTTCTTTCGCCGGGTG
AAGCAGGGCGAGACGCCGGGCTTCCCCCGCTTCAAGTCCCTGCACCGGTTCAAGGGCTGGGGGTACAAGACCCACGGCGACGGCTGGCGGCTGTTGACCA
ACGAGCAGATGCAGCACGGCACCCTGCGTCTCTCCGGCGTTGGTGCGGTCGCCGTTCGGGGCAGACCCCGCACCCCTGGCACACCCAAGACTTGCGAGAT
CATCAAACGCGGTGATCGCTGGTATGCCAGCATTACGCTGAACTGCGAGCCGGTGCGAGAGTGTGGCGAGAAGATGGGCGGGCTCGACTGGGGCTTGGAG
ACCTTCGCCACTGTGGCGACCGCGCAAGGTGCCGAGACCATCGATAACCCCCGCCACCTTGCCGGCACCTTGGATGCCATCCGTGGCGTACAGCGCGAGA
TCTCCCGCAAGGAGGAGGCGGCCAAGAAGGCGTCGGGTAGGAGTCGGGGCTTCCCCATCTCCAACCGGTTGCGCAAGGCGTACGATGAGTTGCGCCGGCT
GCATCGCAAGGTGGCCAATCAGCGCCACGACTTTCTCCATCAGGTCTCTGCCCGACTGATCAAAGAGTTCAGCGCTTTGGGCCTGGAGGCGCTGAGCATT
CGGAACATGACCGCCAAAGGCGGTCAGCATAAGAGGGGCTTGAACCGCGGTATTCTTGATGCCGCTGGCGGGGCCTTCCATCAATTGCTCGGGTACAAAG
CGGAAGAGGCTGGTGCTTGGGCGGTGGAGGCCCCGACGCGGCAGATCAAACCCTCGCAGACCTGCCACGCCTGCGGGCAGCAAGAGAAGAAACCGCTCGG
CCAGCGCTGGCACAGCTGCCCTTGCGGCGCGTCCTGTTCACGGGACGAGAACGCCGCTCGGGTGCTGTTGGCTTGGCTGGAGCGGAGTTTATCCGGTCGG
GAACCGGCCGATGCGTGGAGGGAGGTCAGACCGGGTCATCCTTTGGATGAATCGGCACTCCCGTCGAAGCGCGAAACTCACGCCATAGCGTAAGCTTGGC
GGGAGTAGTTCAT
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
315 | 104 | 394 | 80 | - | No |
Chemistry : Y1
ORF sequence :
MLSRLEAIARATTEKWEGEVLEFNGEPDHLHLLLELTPRTAPSAFVNNLKTVTSRLIRKEFGEHLQQYYWRKPVFWSRSYCIITVGGAPLSVLKQYIEQQ
EAPE
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1275 | 424 | 519 | 1793 | + | No |
AG : TnpB
ORF sequence :
MIQRKATYRLYPTAEQLAALEHQLWVHCLLWNEALAERRWAWSERQESLGFSAQCKRLTEWRSQSKLLSGLNAQSEQVTLKRLDLAFQHFFRRVKQGETP
GFPRFKSLHRFKGWGYKTHGDGWRLLTNEQMQHGTLRLSGVGAVAVRGRPRTPGTPKTCEIIKRGDRWYASITLNCEPVRECGEKMGGLDWGLETFATVA
TAQGAETIDNPRHLAGTLDAIRGVQREISRKEEAAKKASGRSRGFPISNRLRKAYDELRRLHRKVANQRHDFLHQVSARLIKEFSALGLEALSIRNMTAK
GGQHKRGLNRGILDAAGGAFHQLLGYKAEEAGAWAVEAPTRQIKPSQTCHACGQQEKKPLGQRWHSCPCGASCSRDENAARVLLAWLERSLSGREPADAW
REVRPGHPLDESALPSKRETHAIA
Blast result :
Comments : 56% identity with ISCARN6 TnpB
Comments
ISHahl1 shares 88% amino acid similarity with ISCARN6.
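The ORF coordinates listed in the protein section can be cross-checked against the stated lengths with simple arithmetic. The sketch below is a minimal illustration in Python, under two assumptions that the record does not state explicitly: Begin/End are 1-based inclusive positions on the IS sequence, and the length in bp includes the stop codon. All function names are hypothetical.

```python
def orf_span_bp(begin: int, end: int) -> int:
    """Inclusive length in bp between 1-based Begin and End (either order)."""
    return abs(end - begin) + 1


def expected_residues(length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of length_bp nucleotides."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError(f"{length_bp} bp is not a whole number of codons")
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def extract_orf(seq: str, begin: int, end: int, strand: str) -> str:
    """Return the coding-strand sequence of an ORF from 1-based coordinates."""
    low, high = sorted((begin, end))
    fragment = seq[low - 1:high]
    if strand == "-":
        # Minus-strand ORFs are read from the reverse complement.
        return fragment.translate(_COMPLEMENT)[::-1]
    return fragment


# Values copied from the ORF tables of this record.
ORFS = {
    "ORF 1": {"begin": 394, "end": 80, "strand": "-", "bp": 315, "aa": 104},
    "ORF 2": {"begin": 519, "end": 1793, "strand": "+", "bp": 1275, "aa": 424},
}


def check_orfs(orfs: dict) -> dict:
    """Map each ORF name to True if coordinates, bp and aa agree."""
    results = {}
    for name, orf in orfs.items():
        span = orf_span_bp(orf["begin"], orf["end"])
        results[name] = span == orf["bp"] and expected_residues(span) == orf["aa"]
    return results


if __name__ == "__main__":
    for name, ok in check_orfs(ORFS).items():
        print(name, "consistent" if ok else "inconsistent")
```

Under these assumptions both ORFs are internally consistent: 394 - 80 + 1 = 315 bp (105 codons, 104 residues plus stop) and 1793 - 519 + 1 = 1275 bp (425 codons, 424 residues plus stop). `extract_orf` is included only to show how a minus-strand entry such as ORF 1 would be read from the reverse complement of the listed DNA sequence.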
References
[1] Sarah Sonbol (2020) Direct submission.
[2] Tsukatani, Y. and Hirose, Y. (2016) Direct GenBank submission.