ISHma14
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY596292 | ND | Haloarcula marismortui | Haloarcula marismortui ATCC 43049 plasmid pNG300 |
DNA section
IS Length : 1705 bp
Ends
Left end : CTTGATTCAGCGAGAGAATCCCGCCCTTCCCGTGAGTGACGAGCGTTCCGACGCTTAGTCGGAACCGAGGAGCGAACAGGGCGGGAGTGAATCGCGTAAG II struct. : Yes
Right end : GAGTACCGTCCACCGTGACGGGAATGTTGCTTCCGGGGAATCAGCATAGCCGAGACTCCTACAGAGGAAACCGCGCCGTTCACGGCGCGGAGGATGTCAT II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GTCGCGGCAGTA | CTAT | TGATTGGTCGTC | tcat |
DNA sequence
CTTGATTCAGCGAGAGAATCCCGCCCTTCCCGTGAGTGACGAGCGTTCCGACGCTTAGTCGGAACCGAGGAGCGAACAGGGCGGGAGTGAATCGCGTAAG
CTCCGGTAAGAAACCGCCATGTCACTGCTGGTCTTGAGTCTCCAACACTTCCAGACCAGCTTCTAACACTTCTGCATAGGCTTCCGAGAGGTTCAGGTCG
TTCGCTTCTGCGTAGTCTTTGATTCGTCCGTGGATTGCCCAGTCAATGTCGATGTTTGGCCTCATCGGTGGACTCATTAGACATTGGGACTGATTATACT
TTGGATGAAGCGCACCAACACGTTCGCCGTGCGACCGCTCACCGACGATGGTGAGCAGGTGCTACGGGATCTGTTGGACGCTTCTGCCGCTCTCTGGAAC
GAGATTAACTACCAGCGCCTCATGCGCTACAACGACGAGAACGGCTTTGAGGGCGAAGACGTGTGGGACGCCGATACCGGCGCTCTTGAAGGCAAATACA
AAGGCGTTCTCGGCGCGTCTACCGCTCAAACTGTCCGGCGAGCAAACACCGAAGCGTGGCGGTCGTTCTTCGAGAACAAGAAGGCGTATCACGACAAATC
GAACACGTCGGTCACGGAACACCCGGAACCGCCGGGCTTCCGTGGCAACGAAGATGACGGACGTGTTCTCAAAGGCGTCGTCCGAAAGGACGCCTACACC
GTTGAATGGGGCGACCGCTCTCGACTTGAGATTATCGTCGGCAAAGAACTCCGAGACAGGCACAACAGCCCGAAAAGCCGTCTTCGGCTCGAAATCGTTG
GCGACCCGAACTGGCCTGACTACGAGGACCAAGGCCGGTTGGAACTGTGGTACGACGAGACTGACAGCACCTTCCGAGCTTCGCAACCCGTCACTGTTTC
TGACGATGCACGGGGGACTCCACTGGCCGACCACAAGGCCGCTCTGGACATTGGTGCGAACAATCTCGTCGCCTGTACCACGACGACCGGCAAACAATAC
CTGTACGAAGGCCGCGAGTTGTTTCAGCGATTCCGTGAGACGACACGAGAAATCGCCCGGTTACAGTCCAAGCTACGGGAAGGCCGATACAGTAGCGAGC
GTATTCGGCGGTTGTACCGAAAGCGAACCCGTCGCCGCGACCACGCACAAGAGGCACTGTGTCGTGACCTGCTCGAACGACTCTACGAGGACGGTGTGGA
CACGGTGTATATCGGAGGCTTGGCCGACGTGCTGGACACACACTGGTCGGTCGAGACGAACGCCAAGACGCACAACTTCTGGGCGTTCAAGCAGTTCACC
GAGCGACTGGCGTGTACCGCAGACGAATACGGTATCTCGGTCGAAGTCCGGTCGGAAGCGTGGACCAGTCAAGAGTGCCCACAGTGTGGTTCGACAGACC
GGACGACACGGCATCAGGACACGCTCACCTGTCCGTGTGGATTCGAGGGGCACGCCGACCTTACGGCGTCAGAAACGTTCCTGAAGCGGCATACGAGCAA
AGAAGTCAGGCCGATGGCACGGCCCGTGCGGTTCGAGTGGGACGACCACGAATGGTCGGAGTCACCACGCTCTCTCGAAAGTCCCAAAGAACAGCGCACA
GACCCGAGTACCGTCCACCGTGACGGGAATGTTGCTTCCGGGGAATCAGCATAGCCGAGACTCCTACAGAGGAAACCGCGCCGTTCACGGCGCGGAGGAT
GTCAT
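A record like this can be sanity-checked by joining the wrapped sequence lines and comparing the result with the stated IS length and the listed left and right ends. The following is a minimal sketch in Python. The helper names `join_wrapped` and `check_record` are hypothetical, and the toy input only reuses the first and last few bases of the sequence above rather than the full 1705 bp.

```python
def join_wrapped(lines):
    """Concatenate wrapped sequence lines into one uppercase string."""
    return "".join(line.strip() for line in lines).upper()


def check_record(seq, expected_length, left_end, right_end):
    """Run simple consistency checks on an IS record (hypothetical helper).

    Compares the sequence length with the stated length and checks that
    the sequence starts with the left end and stops with the right end.
    """
    return {
        "length_ok": len(seq) == expected_length,
        "left_ok": seq.startswith(left_end.upper()),
        "right_ok": seq.endswith(right_end.upper()),
    }


# Toy input: three short wrapped lines (10 + 10 + 5 = 25 bases).
toy_lines = ["CTTGATTCAG", "CGAGAGAATC", "GTCAT"]
toy_seq = join_wrapped(toy_lines)
print(check_record(toy_seq, 25, "CTTGA", "GTCAT"))
```

For the real record, the same call would take the full sequence, the stated length of 1705, and the two 100-base end sequences listed under Ends.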
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
144 bp | 47 aa | 265 | 122 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MRPNIDIDWAIHGRIKDYAEANDLNLSEAYAEVLEAGLEVLETQDQQ
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1350 bp | 449 aa | 305 | 1654 | + | No |
AG : TnpB
ORF sequence :
MKRTNTFAVRPLTDDGEQVLRDLLDASAALWNEINYQRLMRYNDENGFEGEDVWDADTGALEGKYKGVLGASTAQTVRRANTEAWRSFFENKKAYHDKSN
TSVTEHPEPPGFRGNEDDGRVLKGVVRKDAYTVEWGDRSRLEIIVGKELRDRHNSPKSRLRLEIVGDPNWPDYEDQGRLELWYDETDSTFRASQPVTVSD
DARGTPLADHKAALDIGANNLVACTTTTGKQYLYEGRELFQRFRETTREIARLQSKLREGRYSSERIRRLYRKRTRRRDHAQEALCRDLLERLYEDGVDT
VYIGGLADVLDTHWSVETNAKTHNFWAFKQFTERLACTADEYGISVEVRSEAWTSQECPQCGSTDRTTRHQDTLTCPCGFEGHADLTASETFLKRHTSKE
VRPMARPVRFEWDDHEWSESPRSLESPKEQRTDPSTVHRDGNVASGESA
Blast result :
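The ORF coordinates in the tables above can be cross-checked against the stated lengths. The sketch below is illustrative only and makes two assumptions: coordinates are 1-based and inclusive, and each ORF span includes a single stop codon. Both ORF rows are consistent with these assumptions, but the record does not state them. The function names are hypothetical.

```python
# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def orf_length_bp(begin, end):
    """Length in bp of a span given 1-based inclusive coordinates."""
    return abs(end - begin) + 1


def expected_aa(length_bp):
    """Residue count, assuming the span includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return length_bp // 3 - 1


def extract_orf(seq, begin, end):
    """Return the coding-strand nucleotides of an ORF.

    begin > end is taken to mean the ORF lies on the minus strand,
    so the region is reverse-complemented.
    """
    lo, hi = sorted((begin, end))
    region = seq[lo - 1:hi]
    if begin > end:
        region = region.translate(_COMPLEMENT)[::-1]
    return region


# Values from the ORF tables in this record.
print(orf_length_bp(265, 122), expected_aa(144))    # ORF 1, minus strand
print(orf_length_bp(305, 1654), expected_aa(1350))  # ORF 2, plus strand

# Toy sequence: positions 3..5 read "CAT", i.e. "ATG" on the minus strand.
print(extract_orf("AACATGG", 5, 3))
```

With the full 1705 bp sequence, `extract_orf(seq, 265, 122)` would return the minus-strand coding sequence of ORF 1, which could then be translated with any standard codon table.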
Comments
The TnpB protein of ISHma14 shares 91% amino acid similarity with that of ISNamo22. The first ORF is a passenger gene annotated as a hypothetical protein.
References
[1] F. Pfeiffer (2013) Direct submission
[2] Baliga,N.S., Bonneau,R., Facciotti,M.T., Pan,M., Glusman,G., Deutsch,E.W., Shannon,P., Chiu,Y., Weng,R.S., Gan,R.R., Hung,P., Date,S.V., Marcotte,E., Hood,L. and Ng,W.V. (2004) Genome Res. 14 (11), 2221-2234