ISHahy4
- Family IS3
- Group IS407
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Halanaerobium hydrogeniformans | Halanaerobium hydrogeniformans |
DNA section
IS Length : 1411 bp
Ends
IR Length : 11/16
IRL : CTATATAGCCCCCTAACAGAATTGGAGAAAAAGTTTGAAAGTGATATACT
IRR : CGGTTCAACCCCCTAAATATGGAAAATATGTTTGTTAAGCTCGCTCATTC
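The 11/16 value can be reproduced by comparing the two ends position by position. This is a minimal sketch, assuming that "11/16" means 11 identical bases within the first 16 positions and that IRR is listed as the reverse complement of the right end (which matches the last bases of the DNA sequence in this record). The function name `count_matches` is illustrative only.

```python
# Terminal sequences copied from the IRL / IRR lines of this record.
IRL = "CTATATAGCCCCCTAACAGAATTGGAGAAAAAGTTTGAAAGTGATATACT"
IRR = "CGGTTCAACCCCCTAAATATGGAAAATATGTTTGTTAAGCTCGCTCATTC"


def count_matches(left: str, right: str, window: int) -> int:
    """Count identical positions in the first `window` bases of two ends."""
    return sum(a == b for a, b in zip(left[:window], right[:window]))


matches = count_matches(IRL, IRR, 16)
print(f"{matches}/16")  # prints 11/16
```

Most of the identity comes from the shared CCCCCTAA block at positions 9 to 16.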
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GACCCATTTTTCACCCATATCAACCTGC | AT | CTTTCTCTGTGAGTCATTGCTACTCTTA | 2 |
TATGATGAATACGAGATAGATTTAAGAG | AG | ATGACTTATGCTGATGTTAAGGATTTAA | 2 |
ACAACTATTTAATATTTTAAAAGGTG | AGAT | GAGTTTTGTTGGGCCAAGACCACCAC | 4 |
ACATACTTCTAAAACCTACGAGGATTATTT |  | ATAGAAGCTGCTCAAATTGAAAAAATTTGG | 0 |
TCAAAAGTAATAATTTATTTTCCCA | TCCCT | TTAATACATTCCTTACAAACAGACT | 5 |
TAGCGGGTGGCACTATTAACCTCCAC | ATCA | GACTTACACTAACTAGCAAATATTCA | 4 |
ATACTCTAATTACAAGGGGGAAACACCAAT |  | TTCATACATCTAAACTATATAATATTGTTA | 0 |
GAATGGAAATGATTTGAAACGTCATG | GTAA | AATCAAAAATTTGCCTAATGAACAAT | 4 |
CCGATTCCCAATTAAGAATATAATAG | ATTT | CATGTTTTTTCTCCTCGGATTCTTTT | 4 |
TTTCACAAATAATCGGGAAATATCAA | AAAA | GACATATAGGAGAGGATGATTGAAAT | 4 |
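The DR Length column can be cross-checked against the repeats themselves. The sketch below copies the direct repeats from the table (an empty string stands for the two rows with no repeat) and tallies the lengths. The variable names are illustrative.

```python
from collections import Counter

# Direct repeats in table order; "" marks rows where no repeat is reported.
direct_repeats = ["AT", "AG", "AGAT", "", "TCCCT", "ATCA", "", "GTAA", "ATTT", "AAAA"]
# Values of the "DR Length" column, in the same order.
reported_lengths = [2, 2, 4, 0, 5, 4, 0, 4, 4, 4]

computed = [len(dr) for dr in direct_repeats]
length_counts = Counter(computed)

print(computed == reported_lengths)   # prints True
print(length_counts.most_common(1))   # prints [(4, 5)]
```

In this set of ten insertions, a 4 bp repeat is the most frequent outcome (five cases).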
DNA sequence
CTATATAGCCCCCTAACAGAATTGGAGAAAAAGTTTGAAAGTGATATACTCTAATTACAAGGGGGAAACACCAATGGCAAACAGAAAATATTCCGATGAA
ACTAAAGAACAAATTGTAAAAGAATGTCGCGAAATAGGTAACACAGCTCTTGTAGCAAGACGACATAATATTTCTAAGCATACTGTTTACAGCTGGGTCA
AAAAAGCTAAAGAAACAGGATCAGTTAGATCTCTTCCTAAAGATGAAAAAAAGCAAATGAAAGAGATAGAAAATAGATTAAGTAAAATGAGCGATGAAAA
TGATAAGCTCAAAAAAATTGTAGCAGAAAAAGAATTAGAATTAGCGATTTTAAGGGAGTTGAGAGATAAAGTAAACCCCCGATAGCCCTCAAAGTTCAGA
TTGCATCAAAGTGGATAAATAAAGGGTATAAAATCTCTATTGTTTTAGACTTTGTTGGGCTTAATTCTTCCACTTACTACAGTAATATAAATAGAAAAAC
TGAGAGTGAAAGTACTAATAGCAGCAATTCCAATAATCCTCAAGGAAGACCTGTCCCTGGGTATTCTCTAACTGAATCAGGTGAAAAAATATCTGATGAA
CAGATTAAAGAATGGCTCTTAGAACTGGTTGCAGGAGATGGCTTCCCTTATGGTTACAGGAAACTTACAGTCTGTTTAAAAGAAGACTATAACTTGAAAA
TAAATAAGAAAAAAGTATACAGGTTATGCAAAGAACTGGATATATTAAGATCGCAAAGAAAAATCAAAAAATTTAGACCTAAAAAGATTGCAAAACAGGA
AGAAATTACAGAACCAAATCAACTCTGGCAGATGGATTTAAAATACGGCTACATAAATGGAACAGATCAGTTCTTTTTCCAGATGTCAGTAATTGATGTC
TTTGATAAGACTGTTATAGATTATCACCTGGGACTAAGCTGTAAAGCTAAAGATACCTGCAGGGTATTAAAGGCTGCTTTAAATAAAAGAAAGCTGTATA
AAGGCATGAATTTGCCTAAAATTAGAACAGATAATGGACCACAATTTGTCTCTAAATTATTTGGAGACACCTGTGAAAAACTGGGGGTAGAGCATCAGAG
AATTCCAGTTAGAACACCTAATATGAATGCTCATATAGAATCATTTCATTCGGTTTTAGAAAAAGATTGTTATTCAATTAATGAATTCAGTAGTTTTATT
GACGCCTATAAAAAAGTCAGTGAGTATATGAATTATTATAACAACAGATACCGTCATGGCAGTCTTAATGATATGCCTCCAGCAAAATTTTATAAACTGG
CTAAAGCAGAAAAAATAGTTGCTGAACCAGTACTCGCCTAAATCAAAAAATGAGAAACTAAGAATGAGCGAGCTTAACAAACATATTTTCCATATTTAGG
GGGTTGAACCG
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
312 bp | 103 aa | 74 | 385 | + | No |
Description : First part of the transposase
ORF sequence :
MANRKYSDETKEQIVKECREIGNTALVARRHNISKHTVYSWVKKAKETGSVRSLPKDEKKQMKEIENRLSKMSDENDKLKKIVAEKELELAILRELRDKV
NPR
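The start of ORF 1 can be checked against the DNA sequence. The sketch below translates from position 74 (1-based) using only the first 100 bp of the element, assuming the standard codon assignments (shared by the bacterial code, NCBI table 11). `translate` is a small helper written for this example, not an ISfinder tool.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First 100 bp of the ISHahy4 DNA sequence (first line of the DNA section).
FIRST_100_BP = (
    "CTATATAGCCCCCTAACAGAATTGGAGAAAAAGTTTGAAAGTGATATACT"
    "CTAATTACAAGGGGGAAACACCAATGGCAAACAGAAAATATTCCGATGAA"
)


def translate(dna: str) -> str:
    """Translate complete codons of `dna`, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


orf1_begin = 74  # 1-based "Begin" coordinate of ORF 1
peptide = translate(FIRST_100_BP[orf1_begin - 1:])
print(peptide)  # prints MANRKYSDE
```

The nine residues obtained match the first nine residues of the ORF 1 sequence.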
Blast result :
Comments : First part of the transposase
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
972 bp | 323 aa | 370 | 1341 | + | No |
Chemistry : DDE
ORF sequence :
SKPPIALKVQIASKWINKGYKISIVLDFVGLNSSTYYSNINRKTESESTNSSNSNNPQGRPVPGYSLTESGEKISDEQIKEWLLELVAGDGFPYGYRKLT
VCLKEDYNLKINKKKVYRLCKELDILRSQRKIKKFRPKKIAKQEEITEPNQLWQMDLKYGYINGTDQFFFQMSVIDVFDKTVIDYHLGLSCKAKDTCRVL
KAALNKRKLYKGMNLPKIRTDNGPQFVSKLFGDTCEKLGVEHQRIPVRTPNMNAHIESFHSVLEKDCYSINEFSSFIDAYKKVSEYMNYYNNRYRHGSLN
DMPPAKFYKLAKAEKIVAEPVLA
Blast result :
Comments : Second part of the transposase
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1268 bp | 422 aa | 74 | 1341 | + | Yes |
Chemistry : DDE
ORF sequence :
MANRKYSDETKEQIVKECREIGNTALVARRHNISKHTVYSWVKKAKETGSVRSLPKDEKKQMKEIENRLSKMSDENDKLKKIVAEKELELAILRELRDKV
NPPIALKVQIASKWINKGYKISIVLDFVGLNSSTYYSNINRKTESESTNSSNSNNPQGRPVPGYSLTESGEKISDEQIKEWLLELVAGDGFPYGYRKLTV
CLKEDYNLKINKKKVYRLCKELDILRSQRKIKKFRPKKIAKQEEITEPNQLWQMDLKYGYINGTDQFFFQMSVIDVFDKTVIDYHLGLSCKAKDTCRVLK
AALNKRKLYKGMNLPKIRTDNGPQFVSKLFGDTCEKLGVEHQRIPVRTPNMNAHIESFHSVLEKDCYSINEFSSFIDAYKKVSEYMNYYNNRYRHGSLND
MPPAKFYKLAKAEKIVAEPVLA
Blast result :
Comments
The third ORF is a putative ORFAB transposase reconstructed in silico by a possible -1 frameshift.
The ends as defined are not typical of the IS3 family (TG).
ISHahy4 is 46% aa similar to ISKol9.
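The ORF coordinates listed in the protein section are consistent with a -1 frameshift, as the following sketch illustrates. It assumes 1-based inclusive coordinates and that each nucleotide length includes the stop codon. The dictionary layout and helper names are illustrative.

```python
# Coordinates and lengths as listed for the three ORFs of this record.
ORFS = {
    "ORF 1": {"begin": 74, "end": 385, "bp": 312, "aa": 103},
    "ORF 2": {"begin": 370, "end": 1341, "bp": 972, "aa": 323},
    "ORF 3": {"begin": 74, "end": 1341, "bp": 1268, "aa": 422},
}


def span(orf: dict) -> int:
    """Nucleotide length from 1-based inclusive coordinates."""
    return orf["end"] - orf["begin"] + 1


def frame(orf: dict) -> int:
    """Reading frame (0, 1 or 2) of the first codon."""
    return (orf["begin"] - 1) % 3


orf1, orf2, orf3 = ORFS["ORF 1"], ORFS["ORF 2"], ORFS["ORF 3"]

# ORF 1 and ORF 2 overlap, and ORF 2 sits one frame behind ORF 1.
overlap = orf1["end"] - orf2["begin"] + 1
shift = (frame(orf2) - frame(orf1) + 1) % 3 - 1  # normalised to -1, 0 or +1

# In a -1 frameshift one nucleotide is read twice, so the fused ORF
# translates span + 1 nucleotides; one codon is the stop.
fused_codons = (span(orf3) + 1) // 3

print(overlap, shift, fused_codons - 1)  # prints 16 -1 422
```

The 16 bp overlap between ORF 1 and ORF 2 is the region in which the frameshift would have to occur.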
References
[1] Frank R. (2016) Direct submission to ISfinder
[2] Brown S.D., Begemann M.B., Mormile M.R., Wall J.D., Han C.S., Goodwin L.A., Pitluck S., Land M.L., Hauser L.J. and Elias D.A. (2011) J. Bacteriol. 193, 3682-3683