ISHgi15
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Haloferax gibbonsii | Haloferax gibbonsii plasmid pHGLR1 |
DNA section
IS Length : 1866 bp
Ends
Left end : AGAAGACAGCGAGAGTTCCACGGGTCACCTGACCCGTGGGTGAATCGCGTACGCTTGCCTAATGTTCAGTTTCCTCGATGTACCGACGAACCACATCCTC II struct. : Yes
Right end : CGCCCATCCTCATCCCGCGAAATCGCTTCGCGTGGGGTGCTTACAACGTAGGCTGAGGGAACAACAAAAGCCACGGGTCACCCGACCCGTGGTCGTTTAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
AGTCCTACAATAT | CTAA | ATTCTCTCTAAT | TTAC |
DNA sequence
AGAAGACAGCGAGAGTTCCACGGGTCACCTGACCCGTGGGTGAATCGCGTACGCTTGCCTAATGTTCAGTTTCCTCGATGTACCGACGAACCACATCCTC
GGACACCACACCAGTCGTCCCAACGTAGTATCCCTCCTTCCACAACCCACTCCCCCAGAAGTACCGCTGTTTGATTTCTGGGTGACGATTGAGAATTGTT
CGCCCCGAGTATCCCTTGAATTGCTTGGCAATCTCGGCAGGACTCCATTTTGGGTCGGCCTCCACGAACAAGTGAACGTGGTCGGTAGCAATCTCCATCG
ACACAATCTCGTGACCGAACCGTTCGGCGGTTCCTCGGAACAACTCTGCAAGTTCGTCTCGAACTAACTCCAACATCCCGTGCCGATACTTCGGACACCA
CACGAAATGATACTTACACAAACTAATCGAATGTCCATGACTTCTGTACTCTCCCACCCTAATCCCTAATTTTATTATCATTTGATTCGATAGTCAAGGT
ATGGGTATCGAAGAAGTCACGAAAACCGCACGCACTCGACTCTGCATAGAGTCTGGTGAGCGGTCGTGGCTCAAAGATGCCCGATTCACCGCACGAGACA
TCTCCAACGACACACTCCGTCTCAAACAAGACGGCTACAACAAAACCGAGATTCAACGCGAGGTTGACCGCGAGGACTTCCTCCGGAACAACAAGTGTGC
GGTCGTTGCTAAGGCTCTGCAAGCGTGGAACTCTTACAAAGAACTCCTCAACTGGTGGTACGACCAAGATGACACTACCGTCGGGAAACCGTCGCCACCA
GCCACGGACAAGAAAGGAGCCTACCCACTCGTTATGGCACACACCGAAGGTTACCGCCTCACCCACAACGACGAAGAAGGTAGGATTCGGTTCCGTGTGA
GTCCGAAGCCGTACAAGAAGGTGAAAGGACACCTTCGAGGGCGACCGGAAGATCTCAACCTCATCAAGTCAGCCTTGACTGAAGATGAGTGGTCGTTGGG
ACAGGCCGAACTTCTGTACCGAGATGGCGTGTATTACCTACACGTCACGGTCAAAACAGAAGTCGAAGTACCTGAACCCGAAGACGCTGACACACTCGTC
GGCGTCGATATTAACGAGCGCAACATCGCTCTCACCGCTCTTAACCGTGGGACGATGGAAACGCTCGGAACACTCGTGCTTGACTACGGTTCGGTGAAAG
CCGAACGCCAACGCTACCACACCATCACGAAACGCTGTCAAGAACACGGCCAACATTCCATCCACAACCAACTCGGAGACAAAGAAGAACGCTACACAGA
ATGGGTTCTTCACAGAATGTCCCGAGTCGTGGTCGAGTTCGCCCAACAGTTCCCGAACCCCGTTATGGTGTTCGAGGACTTGAGTGGAATTCGAGACGCC
ATCAAGTACGGCACGTACATGAATCGGCGTCTGCACAAACTCCCGTTCCACAAGTTCGAGCAACAGGTTCGTTACAAAGCGACGTGGAACCAGATTCCGT
GTGAGACGGTCGAGTCTCCGTACAACTCGAAGTCATGTTCGTGCTGTGGTCATCGGGGATACCGTCAAGGACGACGGTTCCGGTGTACAAATGATTCGTG
TGCGGTTCATCAAGACCATGCTGACCGGAACGCGAGTGTGAATGTGGCGTGGCGAGTGTGGGCGAAACACGCTGGCGTAGATGTTGAATCGGTTAATTAC
CGGACTCGCAAAACCCAACCAAGTGTTCGGAAGGTGAGCCTGTCTGGGTCGGGGCGCTCTGTAAACCGCCCATCCTCATCCCGCGAAATCGCTTCGCGTG
GGGTGCTTACAACGTAGGCTGAGGGAACAACAAAAGCCACGGGTCACCCGACCCGTGGTCGTTTAC
GGACACCACACCAGTCGTCCCAACGTAGTATCCCTCCTTCCACAACCCACTCCCCCAGAAGTACCGCTGTTTGATTTCTGGGTGACGATTGAGAATTGTT
CGCCCCGAGTATCCCTTGAATTGCTTGGCAATCTCGGCAGGACTCCATTTTGGGTCGGCCTCCACGAACAAGTGAACGTGGTCGGTAGCAATCTCCATCG
ACACAATCTCGTGACCGAACCGTTCGGCGGTTCCTCGGAACAACTCTGCAAGTTCGTCTCGAACTAACTCCAACATCCCGTGCCGATACTTCGGACACCA
CACGAAATGATACTTACACAAACTAATCGAATGTCCATGACTTCTGTACTCTCCCACCCTAATCCCTAATTTTATTATCATTTGATTCGATAGTCAAGGT
ATGGGTATCGAAGAAGTCACGAAAACCGCACGCACTCGACTCTGCATAGAGTCTGGTGAGCGGTCGTGGCTCAAAGATGCCCGATTCACCGCACGAGACA
TCTCCAACGACACACTCCGTCTCAAACAAGACGGCTACAACAAAACCGAGATTCAACGCGAGGTTGACCGCGAGGACTTCCTCCGGAACAACAAGTGTGC
GGTCGTTGCTAAGGCTCTGCAAGCGTGGAACTCTTACAAAGAACTCCTCAACTGGTGGTACGACCAAGATGACACTACCGTCGGGAAACCGTCGCCACCA
GCCACGGACAAGAAAGGAGCCTACCCACTCGTTATGGCACACACCGAAGGTTACCGCCTCACCCACAACGACGAAGAAGGTAGGATTCGGTTCCGTGTGA
GTCCGAAGCCGTACAAGAAGGTGAAAGGACACCTTCGAGGGCGACCGGAAGATCTCAACCTCATCAAGTCAGCCTTGACTGAAGATGAGTGGTCGTTGGG
ACAGGCCGAACTTCTGTACCGAGATGGCGTGTATTACCTACACGTCACGGTCAAAACAGAAGTCGAAGTACCTGAACCCGAAGACGCTGACACACTCGTC
GGCGTCGATATTAACGAGCGCAACATCGCTCTCACCGCTCTTAACCGTGGGACGATGGAAACGCTCGGAACACTCGTGCTTGACTACGGTTCGGTGAAAG
CCGAACGCCAACGCTACCACACCATCACGAAACGCTGTCAAGAACACGGCCAACATTCCATCCACAACCAACTCGGAGACAAAGAAGAACGCTACACAGA
ATGGGTTCTTCACAGAATGTCCCGAGTCGTGGTCGAGTTCGCCCAACAGTTCCCGAACCCCGTTATGGTGTTCGAGGACTTGAGTGGAATTCGAGACGCC
ATCAAGTACGGCACGTACATGAATCGGCGTCTGCACAAACTCCCGTTCCACAAGTTCGAGCAACAGGTTCGTTACAAAGCGACGTGGAACCAGATTCCGT
GTGAGACGGTCGAGTCTCCGTACAACTCGAAGTCATGTTCGTGCTGTGGTCATCGGGGATACCGTCAAGGACGACGGTTCCGGTGTACAAATGATTCGTG
TGCGGTTCATCAAGACCATGCTGACCGGAACGCGAGTGTGAATGTGGCGTGGCGAGTGTGGGCGAAACACGCTGGCGTAGATGTTGAATCGGTTAATTAC
CGGACTCGCAAAACCCAACCAAGTGTTCGGAAGGTGAGCCTGTCTGGGTCGGGGCGCTCTGTAAACCGCCCATCCTCATCCCGCGAAATCGCTTCGCGTG
GGGTGCTTACAACGTAGGCTGAGGGAACAACAAAAGCCACGGGTCACCCGACCCGTGGTCGTTTAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
423 bp | 140 aa | 481 | 59 | - | No |
Chemistry : Y1
ORF sequence :
MIIKLGIRVGEYRSHGHSISLCKYHFVWCPKYRHGMLELVRDELAELFRGTAERFGHEIVSMEIATDHVHLFVEADPKWSPAEIAKQFKGYSGRTILNRH
PEIKQRYFWGSGLWKEGYYVGTTGVVSEDVVRRYIEETEH
PEIKQRYFWGSGLWKEGYYVGTTGVVSEDVVRRYIEETEH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1317 bp | 438 aa | 501 | 1817 | + | No |
AG : TnpB
ORF sequence :
MGIEEVTKTARTRLCIESGERSWLKDARFTARDISNDTLRLKQDGYNKTEIQREVDREDFLRNNKCAVVAKALQAWNSYKELLNWWYDQDDTTVGKPSPP
ATDKKGAYPLVMAHTEGYRLTHNDEEGRIRFRVSPKPYKKVKGHLRGRPEDLNLIKSALTEDEWSLGQAELLYRDGVYYLHVTVKTEVEVPEPEDADTLV
GVDINERNIALTALNRGTMETLGTLVLDYGSVKAERQRYHTITKRCQEHGQHSIHNQLGDKEERYTEWVLHRMSRVVVEFAQQFPNPVMVFEDLSGIRDA
IKYGTYMNRRLHKLPFHKFEQQVRYKATWNQIPCETVESPYNSKSCSCCGHRGYRQGRRFRCTNDSCAVHQDHADRNASVNVAWRVWAKHAGVDVESVNY
RTRKTQPSVRKVSLSGSGRSVNRPSSSREIASRGVLTT
ATDKKGAYPLVMAHTEGYRLTHNDEEGRIRFRVSPKPYKKVKGHLRGRPEDLNLIKSALTEDEWSLGQAELLYRDGVYYLHVTVKTEVEVPEPEDADTLV
GVDINERNIALTALNRGTMETLGTLVLDYGSVKAERQRYHTITKRCQEHGQHSIHNQLGDKEERYTEWVLHRMSRVVVEFAQQFPNPVMVFEDLSGIRDA
IKYGTYMNRRLHKLPFHKFEQQVRYKATWNQIPCETVESPYNSKSCSCCGHRGYRQGRRFRCTNDSCAVHQDHADRNASVNVAWRVWAKHAGVDVESVNY
RTRKTQPSVRKVSLSGSGRSVNRPSSSREIASRGVLTT
Blast result :
Comments
ISHgi15 is 96% aa (transposase) similar to ISHall1.
References
1] Pfeiffer, F. (2020) Direct submission.
2] Tittes, C., Schwarzer, S., Pfeiffer, F., Dyall-Smith, M., Oksanen, H., Quax, T. (2020) (in preparation)
2] Tittes, C., Schwarzer, S., Pfeiffer, F., Dyall-Smith, M., Oksanen, H., Quax, T. (2020) (in preparation)