ISHgi6
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Haloferax gibbonsii | Haloferax gibbonsii |
DNA section
IS Length : 2425 bp
Ends
IR Length : 0
IRL : TAGTGCACCTGCTACGTAATTCGACATTTATAAGTAGTGAACAGCAGTAA
IRR : ACCGTTCCTATTTTCGGCAGTAAACAGCAGTCTGCTTAACGAATTGAGAC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTTCGCGGGG | TCAGGTCGCCCT | 0 |
DNA sequence
TAGTGCACCTGCTACGTAATTCGACATTTATAAGTAGTGAACAGCAGTAAACAACAGTAATGACAAAGGTGTACTCCATCGGAGAGTTCGCAGATGAACT
TGGAGTCCACCCAGAAACCGTCAAAAAGTGGTGTCGCGAAGACCAAATTCAATATTCTCGAACTCCCGGTGGGCACCGACGGATTCCACATACCGAACTC
CAACGACTCACAGGCAAACCCTCACGGAGAACCGATAAAGTTGCCATCTACGGGAGAGTCTCAGGTCATGCCCAGAAACAGGACGGCAACCTCGACCGCC
AACTAAACTCTCTCACCCAATACGCCCACGACCACGGGTGGAGTATCGAAAACAAATACTCCGACGTGGGAAGTGGCCTCAACGAGAATCGGCGGGGACT
GAACAAACTCTTGGACGACGCAGAGAACGCCGACTACGGGCGCGTCCTCGTCACGTATCAAGACCGACTCACCCGATTTGGATTCTCCTACATCGAACGC
CTCCTCGACGAATATGGCGTCGAAATCACCGTCATCAACGAGGAAACCGACAAGACTGCACACGAAGAACTCGTGGACGATCTCTTGCAACTCGTCGCCA
GTTTCGCAGGCAAACTCTACGGGGTGCGTTCTTCAAAACGCAAGCGGATCGTCGAATCGGTGGAAGCAGAGGTGGAAACAGATGAGTAGCCACCTCTCAC
TCCCGGTACGACTCCCATTCGATGCGCGAGAGCGCCACGCCCGGTTAGATGTACTCACGAAACTCGTTGCAAATACGCTCATCGAGCGATACTGGACGCC
CGAACATCTCACCGGGATTGACGACTATTCCTATCAAGCGTGGAAATACTTTGACGAAAACGAAGCGTTCGCAGACGTTGACTTGTACCTCCCGAGCCGG
TACAAACGGTGCGTGATGCAGAAAGTCGGAGAAACACTCCGAAGCCACGCCGACAAACAAGAAGCGTTTCAATCCATCCAATCCGTCCTTCCAGACCACA
AGATTCGCCGTATCCACACCCGGAGAATCAAAGAACAACTGTGGGACTCAGAGGAATACATCAAATCTGGGTACGTTGAGCTACTCATCGGCCAACTCAA
CTCGTACTACGACCGTCACGGTCGGTTCCCTGACTCGTATTTCGACATGCAGGACTGCCCCACGTACTCGAAAGGCGTGTTGCCGTACTCCGCAGACGAC
GGGCCAACGAGTGGACAGGCTGTCAAATACGAGTACACCCAAGACACACAGACGCTCACAGTCAAACTCAAAACCCCCGACACACTCGAACCTGAGACTC
GTGGGGATTGGACGTGGACAAAACACGAACTTGAGGGATACGAAGCGTTCCACGAACTCCTCGATCATGGCAGTCTCTCCGCACCCTCCTTCCACCCCAC
ACAGACCAAAACGGGGAGCGACTACCACGAACTCTCGTTCTCAATCGAAGTCGAACACCAAGAGAAATCAGACGACGTGAAAACCGTTTTAGCGATTGAC
GGAGGTCTCCGCAAGGACGCAACTGCTATCGTTGTGAACGAGGATGGCGGGCAACTCTCTGTTCCGTATTTCATCCAGAATACGGAGCGAGAACAGATGC
GGAACCTTGCTCGAGAACGTAATCAACTCAACTCCAAACTCGCGTACCTGCGTCGGAATGGACGCAACCACAGAGACTCGTTCAGACACGTCCAAGCAGA
GTACGAGCGAGTGAACAACAAGATTCGGCACAAGCGAGAGCAGTTAGTTCACGACGTGGCGAATCAAGTCCTCGCACTCGCGTTGGTGTACGATGTGGAC
GCGATTGTTCACGAAGACCTGCGAAGTCTGTCACCACCTCGTGGTGAGGGGCAACTCTCGTGGGAACTCAGTTCGTGGGCACGTCGTGAGATAATTTCGA
AACTCGAGTATCGGGCTGAAATCGCCGGACTGCACGTGGAGAAAGTGTGTCCGGGAAACACGTCTCGGTCGTGCCCCCGATGTGGCGCGACGGGTCACAC
TACGAAGTCTCCCGACCACTCTTTTGAGGTGTGGTGGGGAGGGCACTTCCGCTGTGACAACGCTCGGTGTGGCTTCCAAGCCGACCGAGACTACGTTGGT
GCAGTCAACGTAGCTCGCGTGTTCTTCAGCGAGACGGCAACGCTGGAACACGGTTTCACGTCTTCCTATACGGGGGATGTTGAAATCGTGCCAGCAAGCC
GTTCCGCTGGCCCGCGTCTCGCGTTCGGTGAAGCACCAATTGTGTTTACCGGACAGTCGAAAGTGGTGACTGCTGGTGGCGGGTCGTGCTTCATAGCGCC
CGCTGTCACCCCGACAGAGACGAAAACCAATAGCAGTAACACCAATTCAGTGTCAAGCCCAGCGACACACAGTGCGTCTCAATTCGTTAAGCAGACTGCT
GTTTACTGCCGAAAATAGGAACGGT
TGGAGTCCACCCAGAAACCGTCAAAAAGTGGTGTCGCGAAGACCAAATTCAATATTCTCGAACTCCCGGTGGGCACCGACGGATTCCACATACCGAACTC
CAACGACTCACAGGCAAACCCTCACGGAGAACCGATAAAGTTGCCATCTACGGGAGAGTCTCAGGTCATGCCCAGAAACAGGACGGCAACCTCGACCGCC
AACTAAACTCTCTCACCCAATACGCCCACGACCACGGGTGGAGTATCGAAAACAAATACTCCGACGTGGGAAGTGGCCTCAACGAGAATCGGCGGGGACT
GAACAAACTCTTGGACGACGCAGAGAACGCCGACTACGGGCGCGTCCTCGTCACGTATCAAGACCGACTCACCCGATTTGGATTCTCCTACATCGAACGC
CTCCTCGACGAATATGGCGTCGAAATCACCGTCATCAACGAGGAAACCGACAAGACTGCACACGAAGAACTCGTGGACGATCTCTTGCAACTCGTCGCCA
GTTTCGCAGGCAAACTCTACGGGGTGCGTTCTTCAAAACGCAAGCGGATCGTCGAATCGGTGGAAGCAGAGGTGGAAACAGATGAGTAGCCACCTCTCAC
TCCCGGTACGACTCCCATTCGATGCGCGAGAGCGCCACGCCCGGTTAGATGTACTCACGAAACTCGTTGCAAATACGCTCATCGAGCGATACTGGACGCC
CGAACATCTCACCGGGATTGACGACTATTCCTATCAAGCGTGGAAATACTTTGACGAAAACGAAGCGTTCGCAGACGTTGACTTGTACCTCCCGAGCCGG
TACAAACGGTGCGTGATGCAGAAAGTCGGAGAAACACTCCGAAGCCACGCCGACAAACAAGAAGCGTTTCAATCCATCCAATCCGTCCTTCCAGACCACA
AGATTCGCCGTATCCACACCCGGAGAATCAAAGAACAACTGTGGGACTCAGAGGAATACATCAAATCTGGGTACGTTGAGCTACTCATCGGCCAACTCAA
CTCGTACTACGACCGTCACGGTCGGTTCCCTGACTCGTATTTCGACATGCAGGACTGCCCCACGTACTCGAAAGGCGTGTTGCCGTACTCCGCAGACGAC
GGGCCAACGAGTGGACAGGCTGTCAAATACGAGTACACCCAAGACACACAGACGCTCACAGTCAAACTCAAAACCCCCGACACACTCGAACCTGAGACTC
GTGGGGATTGGACGTGGACAAAACACGAACTTGAGGGATACGAAGCGTTCCACGAACTCCTCGATCATGGCAGTCTCTCCGCACCCTCCTTCCACCCCAC
ACAGACCAAAACGGGGAGCGACTACCACGAACTCTCGTTCTCAATCGAAGTCGAACACCAAGAGAAATCAGACGACGTGAAAACCGTTTTAGCGATTGAC
GGAGGTCTCCGCAAGGACGCAACTGCTATCGTTGTGAACGAGGATGGCGGGCAACTCTCTGTTCCGTATTTCATCCAGAATACGGAGCGAGAACAGATGC
GGAACCTTGCTCGAGAACGTAATCAACTCAACTCCAAACTCGCGTACCTGCGTCGGAATGGACGCAACCACAGAGACTCGTTCAGACACGTCCAAGCAGA
GTACGAGCGAGTGAACAACAAGATTCGGCACAAGCGAGAGCAGTTAGTTCACGACGTGGCGAATCAAGTCCTCGCACTCGCGTTGGTGTACGATGTGGAC
GCGATTGTTCACGAAGACCTGCGAAGTCTGTCACCACCTCGTGGTGAGGGGCAACTCTCGTGGGAACTCAGTTCGTGGGCACGTCGTGAGATAATTTCGA
AACTCGAGTATCGGGCTGAAATCGCCGGACTGCACGTGGAGAAAGTGTGTCCGGGAAACACGTCTCGGTCGTGCCCCCGATGTGGCGCGACGGGTCACAC
TACGAAGTCTCCCGACCACTCTTTTGAGGTGTGGTGGGGAGGGCACTTCCGCTGTGACAACGCTCGGTGTGGCTTCCAAGCCGACCGAGACTACGTTGGT
GCAGTCAACGTAGCTCGCGTGTTCTTCAGCGAGACGGCAACGCTGGAACACGGTTTCACGTCTTCCTATACGGGGGATGTTGAAATCGTGCCAGCAAGCC
GTTCCGCTGGCCCGCGTCTCGCGTTCGGTGAAGCACCAATTGTGTTTACCGGACAGTCGAAAGTGGTGACTGCTGGTGGCGGGTCGTGCTTCATAGCGCC
CGCTGTCACCCCGACAGAGACGAAAACCAATAGCAGTAACACCAATTCAGTGTCAAGCCCAGCGACACACAGTGCGTCTCAATTCGTTAAGCAGACTGCT
GTTTACTGCCGAAAATAGGAACGGT
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
630 bp | 209 aa | 60 | 689 | + | No |
Chemistry : Serine
ORF sequence :
MTKVYSIGEFADELGVHPETVKKWCREDQIQYSRTPGGHRRIPHTELQRLTGKPSRRTDKVAIYGRVSGHAQKQDGNLDRQLNSLTQYAHDHGWSIENKY
SDVGSGLNENRRGLNKLLDDAENADYGRVLVTYQDRLTRFGFSYIERLLDEYGVEITVINEETDKTAHEELVDDLLQLVASFAGKLYGVRSSKRKRIVES
VEAEVETDE
SDVGSGLNENRRGLNKLLDDAENADYGRVLVTYQDRLTRFGFSYIERLLDEYGVEITVINEETDKTAHEELVDDLLQLVASFAGKLYGVRSSKRKRIVES
VEAEVETDE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1737 bp | 578 aa | 682 | 2418 | + | No |
AG : TnpB
ORF sequence :
MSSHLSLPVRLPFDARERHARLDVLTKLVANTLIERYWTPEHLTGIDDYSYQAWKYFDENEAFADVDLYLPSRYKRCVMQKVGETLRSHADKQEAFQSIQ
SVLPDHKIRRIHTRRIKEQLWDSEEYIKSGYVELLIGQLNSYYDRHGRFPDSYFDMQDCPTYSKGVLPYSADDGPTSGQAVKYEYTQDTQTLTVKLKTPD
TLEPETRGDWTWTKHELEGYEAFHELLDHGSLSAPSFHPTQTKTGSDYHELSFSIEVEHQEKSDDVKTVLAIDGGLRKDATAIVVNEDGGQLSVPYFIQN
TEREQMRNLARERNQLNSKLAYLRRNGRNHRDSFRHVQAEYERVNNKIRHKREQLVHDVANQVLALALVYDVDAIVHEDLRSLSPPRGEGQLSWELSSWA
RREIISKLEYRAEIAGLHVEKVCPGNTSRSCPRCGATGHTTKSPDHSFEVWWGGHFRCDNARCGFQADRDYVGAVNVARVFFSETATLEHGFTSSYTGDV
EIVPASRSAGPRLAFGEAPIVFTGQSKVVTAGGGSCFIAPAVTPTETKTNSSNTNSVSSPATHSASQFVKQTAVYCRK
SVLPDHKIRRIHTRRIKEQLWDSEEYIKSGYVELLIGQLNSYYDRHGRFPDSYFDMQDCPTYSKGVLPYSADDGPTSGQAVKYEYTQDTQTLTVKLKTPD
TLEPETRGDWTWTKHELEGYEAFHELLDHGSLSAPSFHPTQTKTGSDYHELSFSIEVEHQEKSDDVKTVLAIDGGLRKDATAIVVNEDGGQLSVPYFIQN
TEREQMRNLARERNQLNSKLAYLRRNGRNHRDSFRHVQAEYERVNNKIRHKREQLVHDVANQVLALALVYDVDAIVHEDLRSLSPPRGEGQLSWELSSWA
RREIISKLEYRAEIAGLHVEKVCPGNTSRSCPRCGATGHTTKSPDHSFEVWWGGHFRCDNARCGFQADRDYVGAVNVARVFFSETATLEHGFTSSYTGDV
EIVPASRSAGPRLAFGEAPIVFTGQSKVVTAGGGSCFIAPAVTPTETKTNSSNTNSVSSPATHSASQFVKQTAVYCRK
Blast result :
Comments
ISHgi6 is 84% aa (transposase) similar to ISNma20.
References
1] Friedhelm Pfeiffer (2020) Direct submission.
2] Tittes, C., Schwarzer, S., Pfeiffer, F., Dyall-Smith, M., Oksanen, H., Quax, T. (in preparation)
2] Tittes, C., Schwarzer, S., Pfeiffer, F., Dyall-Smith, M., Oksanen, H., Quax, T. (in preparation)