ISHgi7
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Haloferax gibbonsii | Haloferax gibbonsii plasmid pHGLR1 |
DNA section
IS Length : 1886 bp
Ends
Left end : TTACGAAAAATTCGAGGCGAAAGCCTCGCCCTTCAGGGCGGGGATGAAGTCGACCAATAGTATTCAACCGCCGACGACAGCACAGACGGGGTTCCAACGC II struct. : No
Right end : TCCCAAAGAAGTGCGCACAAACCCGCAAGTTGCCTCCGTGGGTCGGTAGCCGAACCCCCAATGGAGGAATCCTCGCGCTTTAGCGCGGGGAGGATGTCAA II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
TATGGACCGGTC | TTAC | GCCTCGGTTAGG | TCAA |
GTTTCGACCCGA | TTAC | GAGTCGGTCGGT | TCAA |
DNA sequence
GAAAAATTCGAGGCGAAAGCCTCGCCCTTCAGGGCGGGGATGAAGTCGACCAATAGTATTCAACCGCCGACGACAGCACAGACGGGGTTCCAACGCAATC
TTTAACCCGCCCGGTTGTGATAGTATGCATAGATGGTAAAGAGTACCCGTCACGCGAAATACGAACTCTACTACCACATAGTGTTCGTGCCGAAATATCG
GCGTTCGAACCTGACGGGGAAGACAAAGGAACGTCTCGAAGCCATCTTCGCGGAAATCTGTGAGGACAAAGACCTCGAACTGGCCGAGTCCGAGGTCATG
CCCGACCACGTACACCTGTTCATCGGGAGTCCACCCAAGAACGCACCGTCACTCATCGTCAACTGGGTCAAGGGCATCTCGGCGCGGAAGTATAACCAGC
GGTACGACGACCGCGTGAAGTGGACTCGTTCCTACTACGTCGGAACAGCGGGAAGCGCCTCGAAGGGCGCTGTCGAACAGTATATCGCTGAACAGGAAGG
TAGCGACGAATGAAGCGCGTCAACACCTTCGAGGTACTTCCACAGACCGAGAACGACAAAGAGTGCCTTCTACGGCTCCTCGACGCCTCCGCTTCCCTGT
GGAACGAACTGACCTACGAACGTCGCCAGAACTACTTCGGTGACGGCGACGTGTGGGACACCTCCGAGTACCGTGGACAGTACAACGGCGTCGTCGGAAG
CGCGACCGTCCAACAGGTCACGCGCAAGAACAGCGAAGCGTGGCGGTCGTTCTTTGCCAAGGAGAAAGGCGAGTACGCCAACCCACCATCGTACTGGGGC
AACGAAGAGGACGGACGCGAACTCCGCACCTACATCCGATGCAACCAGTACACGATTGAGTGGGGGAAGCGTAGTCGTCTCGAAATCCCTGTCGGGCAAG
AACTGAAAGACGAATACGGACTCGGCTACCACGAACGACTCCGCCTCGAAGTCCGAGGCAACCCGAAATGGGACGGCAAACAGGGTCGTCTGGAACTTGA
GTACGACGAGGTTAGCGACACGTTCAGGGCTTTCCAACCAGTCACCGTACCTGATTCTCGACTGGGTTCACCACTGGCTTCTCACGAAGCCGCCCTCGAC
GTTGGCGCGAACAATCTCGTCGCCTGTTCCACGACCACTGGGAACCAGTACCTCTACGACGGCCGGGAGTTGTTCAGACGGTTCCGCGAGACGACAGACG
AAATCGCCCGCCTACAGTCGAAATTGCCCGAGAGACGGAGTCTCTCGGAATCACGGAAGACGGAGTCTTCCGGACGACTCCGAGAGGGGCGCTACTCCTC
GAATCGGATTCGACGGCTGTACCGGCAGCGAACGAAACGCCGTGACCACGCACAGAACGCGCTGGTGCGCGACCTCGTTGAACGGCTGTACGACGAGGGC
GTGGCGACGGTGTACGTGGGCGATTTGACCGACGTGCTGGAAACGCATTGGTCGGTCAGGGTGAACGAGAAGACGCACAACTTCTGGGCGTTCAAGAAGT
TCATCCACCGTCTCGCGTGTGTCTGTGAGGAGTACGGCATCTCTCTCGAAGCCGAGTCGGAAGCGTGGACGAGTCAGACGTGTCCCGAGTGTGGCGACCA
CGAGGAGACGGTTCGCCACGGGGATACGCTGACGTGTCCGTGTGGTTTCGAGGGACACGCCGACCTCACGGCGTCAGAGACGTTCCTTCGGGAAAACAGC
GATTGCGAAATCAGGCCGATGGCACGGCCCGTGCGATTTGAGTGGGACGACCACGACTGGTCGGGGAAACCACACCCTCACGAAAGTCCCAAAGAAGTGC
GCACAAACCCGCAAGTTGCCTCCGTGGGTCGGTAGCCGAACCCCCAATGGAGGAATCCTCGCGCTTTAGCGCGGGGAGGATGTCAA
TTTAACCCGCCCGGTTGTGATAGTATGCATAGATGGTAAAGAGTACCCGTCACGCGAAATACGAACTCTACTACCACATAGTGTTCGTGCCGAAATATCG
GCGTTCGAACCTGACGGGGAAGACAAAGGAACGTCTCGAAGCCATCTTCGCGGAAATCTGTGAGGACAAAGACCTCGAACTGGCCGAGTCCGAGGTCATG
CCCGACCACGTACACCTGTTCATCGGGAGTCCACCCAAGAACGCACCGTCACTCATCGTCAACTGGGTCAAGGGCATCTCGGCGCGGAAGTATAACCAGC
GGTACGACGACCGCGTGAAGTGGACTCGTTCCTACTACGTCGGAACAGCGGGAAGCGCCTCGAAGGGCGCTGTCGAACAGTATATCGCTGAACAGGAAGG
TAGCGACGAATGAAGCGCGTCAACACCTTCGAGGTACTTCCACAGACCGAGAACGACAAAGAGTGCCTTCTACGGCTCCTCGACGCCTCCGCTTCCCTGT
GGAACGAACTGACCTACGAACGTCGCCAGAACTACTTCGGTGACGGCGACGTGTGGGACACCTCCGAGTACCGTGGACAGTACAACGGCGTCGTCGGAAG
CGCGACCGTCCAACAGGTCACGCGCAAGAACAGCGAAGCGTGGCGGTCGTTCTTTGCCAAGGAGAAAGGCGAGTACGCCAACCCACCATCGTACTGGGGC
AACGAAGAGGACGGACGCGAACTCCGCACCTACATCCGATGCAACCAGTACACGATTGAGTGGGGGAAGCGTAGTCGTCTCGAAATCCCTGTCGGGCAAG
AACTGAAAGACGAATACGGACTCGGCTACCACGAACGACTCCGCCTCGAAGTCCGAGGCAACCCGAAATGGGACGGCAAACAGGGTCGTCTGGAACTTGA
GTACGACGAGGTTAGCGACACGTTCAGGGCTTTCCAACCAGTCACCGTACCTGATTCTCGACTGGGTTCACCACTGGCTTCTCACGAAGCCGCCCTCGAC
GTTGGCGCGAACAATCTCGTCGCCTGTTCCACGACCACTGGGAACCAGTACCTCTACGACGGCCGGGAGTTGTTCAGACGGTTCCGCGAGACGACAGACG
AAATCGCCCGCCTACAGTCGAAATTGCCCGAGAGACGGAGTCTCTCGGAATCACGGAAGACGGAGTCTTCCGGACGACTCCGAGAGGGGCGCTACTCCTC
GAATCGGATTCGACGGCTGTACCGGCAGCGAACGAAACGCCGTGACCACGCACAGAACGCGCTGGTGCGCGACCTCGTTGAACGGCTGTACGACGAGGGC
GTGGCGACGGTGTACGTGGGCGATTTGACCGACGTGCTGGAAACGCATTGGTCGGTCAGGGTGAACGAGAAGACGCACAACTTCTGGGCGTTCAAGAAGT
TCATCCACCGTCTCGCGTGTGTCTGTGAGGAGTACGGCATCTCTCTCGAAGCCGAGTCGGAAGCGTGGACGAGTCAGACGTGTCCCGAGTGTGGCGACCA
CGAGGAGACGGTTCGCCACGGGGATACGCTGACGTGTCCGTGTGGTTTCGAGGGACACGCCGACCTCACGGCGTCAGAGACGTTCCTTCGGGAAAACAGC
GATTGCGAAATCAGGCCGATGGCACGGCCCGTGCGATTTGAGTGGGACGACCACGACTGGTCGGGGAAACCACACCCTCACGAAAGTCCCAAAGAAGTGC
GCACAAACCCGCAAGTTGCCTCCGTGGGTCGGTAGCCGAACCCCCAATGGAGGAATCCTCGCGCTTTAGCGCGGGGAGGATGTCAA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
381 bp | 126 aa | 133 | 513 | + | No |
Chemistry : Y1
ORF sequence :
MVKSTRHAKYELYYHIVFVPKYRRSNLTGKTKERLEAIFAEICEDKDLELAESEVMPDHVHLFIGSPPKNAPSLIVNWVKGISARKYNQRYDDRVKWTRS
YYVGTAGSASKGAVEQYIAEQEGSDE
YYVGTAGSASKGAVEQYIAEQEGSDE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1326 bp | 441 aa | 510 | 1835 | + | No |
AG : TnpB
ORF sequence :
MKRVNTFEVLPQTENDKECLLRLLDASASLWNELTYERRQNYFGDGDVWDTSEYRGQYNGVVGSATVQQVTRKNSEAWRSFFAKEKGEYANPPSYWGNEE
DGRELRTYIRCNQYTIEWGKRSRLEIPVGQELKDEYGLGYHERLRLEVRGNPKWDGKQGRLELEYDEVSDTFRAFQPVTVPDSRLGSPLASHEAALDVGA
NNLVACSTTTGNQYLYDGRELFRRFRETTDEIARLQSKLPERRSLSESRKTESSGRLREGRYSSNRIRRLYRQRTKRRDHAQNALVRDLVERLYDEGVAT
VYVGDLTDVLETHWSVRVNEKTHNFWAFKKFIHRLACVCEEYGISLEAESEAWTSQTCPECGDHEETVRHGDTLTCPCGFEGHADLTASETFLRENSDCE
IRPMARPVRFEWDDHDWSGKPHPHESPKEVRTNPQVASVGR
DGRELRTYIRCNQYTIEWGKRSRLEIPVGQELKDEYGLGYHERLRLEVRGNPKWDGKQGRLELEYDEVSDTFRAFQPVTVPDSRLGSPLASHEAALDVGA
NNLVACSTTTGNQYLYDGRELFRRFRETTDEIARLQSKLPERRSLSESRKTESSGRLREGRYSSNRIRRLYRQRTKRRDHAQNALVRDLVERLYDEGVAT
VYVGDLTDVLETHWSVRVNEKTHNFWAFKKFIHRLACVCEEYGISLEAESEAWTSQTCPECGDHEETVRHGDTLTCPCGFEGHADLTASETFLRENSDCE
IRPMARPVRFEWDDHDWSGKPHPHESPKEVRTNPQVASVGR
Blast result :
Comments
ISHgi7 is 98% (transposase) aa similar to ISHla16.
References
1] Friedhelm Pfeiffer (2020) Direct submission.
2] Tittes, C., Schwarzer, S., Pfeiffer, F., Dyall-Smith, M., Oksanen, H., Quax, T. (in preparation)
2] Tittes, C., Schwarzer, S., Pfeiffer, F., Dyall-Smith, M., Oksanen, H., Quax, T. (in preparation)