ISHma17
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY596297 | ND | Haloarcula marismortui | Haloarcula marismortui ATCC 43049 |
DNA section
IS Length : 2009 bp
Ends
Left end : AGAGATACGAGCGCATACCCCCGCCTTCCCGTGAGTGGCGAGTGTTCCGACGCTCTGTCGGACCGAGGAGCGAACAGGCGGGGGTCGAGCGGACAGCATA II struct. : No
Right end : GACGCGTGGAACGTATGCGTCTTGAAACCAACAGGTCGCTACACTCGGCCTGAAATCCCGTGGTGGGATTCCTCCGCGTTCACGCGGAGGAAGAGGTCAA II struct. : No
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GAGGTGGTCCGC | TGAT | ATGCGGTATCAT |
DNA sequence
AGAGATACGAGCGCATACCCCCGCCTTCCCGTGAGTGGCGAGTGTTCCGACGCTCTGTCGGACCGAGGAGCGAACAGGCGGGGGTCGAGCGGACAGCATA
GTGTGACAGTCCACCGTTCTATTGCAGGGCTGGATTTCCTACCGTTTCCGCAAGACATTTGCCTGACAGGACCATAGTGTGTAGCACGGGTGAAGACCAC
ACGGCACGCAACCTACAACCTCAACTACCACATAGTGTGGCTCCCGAAGTCTCGCAGAACCGACGGTTCTGCGGATATCGCGGAACGCAGTTCCGCTCAA
TACCGTAAGGCGGTACTGACCGGAGAGGTTGCCGACCGTGTGGAATCCATCCTCCACGAAATTGCCGACGAGAAGGGCTTGGACATTCAAAACCTGACTG
TTCAGCCCGACCACGTTCATTTGTTCGTCAGTAGCCCGCCCAAACACAGCCCATCGCTGCTCGCCAACTGGTTCAAGCGCATTTCCTCACGGAAGTACAA
TCACCGCCACACCGACCACGACGGCGAGAAAATCCGGTGGGCACGCGGCTACTACGCCGGCACAGCCGGGCACGTCTCCAGCGAAACTGTCGAGAACTAC
ATCGACCGACACAAGGAGGTCCAAGCGTGACCGAACTCACGAAAACGCTCGAACTGAAGCTTGTGGACCCGAACACCCACAAGCGACAGAAGCTTCGTGA
GACACAGGACGCGTACCAGCAGGCACTCCAAGCCGCTTTCGCCGCTGGCTGTGACACACAGTCAGCGGCCAACGACGTGGTTGTTGAGTACGACCTGAGC
GGCTACGCGAAAAACGCCCTCAAGAAGTACGTCCCACAACTCTGTGGGAACAGCTATGACGCCGACGAGCTTCACGACAACCACCCGGTGCGGTTCACCA
ATGAGGGATTGCAACTTGACCACCAGCCACAGAACGCCATCGAGTGGTACGTCAAAATCCCGCACCACGAGGATTACCACCTCTGGATTCCAGCACGCGC
TAATCCCGAACAGCGGGAGTGGCTCGAAGCGTTAGACGCAGACGACGCCGAGATGGGTGAAAGTCGGCTGTTTGAGCGGGACGGAACGTGGTATCTCCAC
ATCACCGTTACCCGCGACGTGGAGGACCAATCTGAGGCGTCCGCCGACGAGCGGACACCGCCAGACGAGTCTGGCGAGCCCTGTCTCGCTGGACTCGACA
GGACGCCGATAGGCGTGGATATTGGAGAAGCGAGTCTCGTCACGGTGTGTCACCGCGACGGCTCTGGTTCTCCGGTTCGCCCCCGCCTGTGGGCCGACGA
CGGTAAAGCCGTTCGTCGGCTCCGCAAAACCTACTTCACCGCCAAGCGACGGCTTCAGCAGCGCGGCAGTGAGCGAATCGCGGAGTCCTACGGCGACTCG
CTGTGGGACCAGATTGACGACGTGTTCCACCGTGTAACCCGTGAGGTCGTGGAGTACGCCGAGTCTGTCGAGAACCCAGTGTTGGTGCTGGAAGACCTGA
CGTACATCCGCGAGAACATGGACTACGGCGAGTACATGAACCGCCGGTTGCACGGATGGGGGTTTGCCAAGCTCCACGCACAGATTCGCTACAAAGCCAC
AGAGAAGGGGATTCCCGTCGAGACGGTGAATCCCCGGAACACGTCGAAGGCGTTCCATGCCTGCGGTGAACACGGCTCCCGGCCACGACAGGCGACGTTC
AGATGCTCGAACGACGACTGCTGGCTCGGTGAGTATCAAGCCGACGTGAACGGGGCGATAAATATTGCAGACCGCTACCGTAGTGGAGAGAGTCACCGCC
GAAGCGACCGGAGTTCCCGGCAGAAGGCCGGTGACGATGACTCGGCTACGGATGGGGCCTCTTTGACCGGGCCACAAGACAGCCACGCCGATGCTGGAAC
CCAGCAGGAGACGCGTGGAACGTATGCGTCTTGAAACCAACAGGTCGCTACACTCGGCCTGAAATCCCGTGGTGGGATTCCTCCGCGTTCACGCGGAGGA
AGAGGTCAA
GTGTGACAGTCCACCGTTCTATTGCAGGGCTGGATTTCCTACCGTTTCCGCAAGACATTTGCCTGACAGGACCATAGTGTGTAGCACGGGTGAAGACCAC
ACGGCACGCAACCTACAACCTCAACTACCACATAGTGTGGCTCCCGAAGTCTCGCAGAACCGACGGTTCTGCGGATATCGCGGAACGCAGTTCCGCTCAA
TACCGTAAGGCGGTACTGACCGGAGAGGTTGCCGACCGTGTGGAATCCATCCTCCACGAAATTGCCGACGAGAAGGGCTTGGACATTCAAAACCTGACTG
TTCAGCCCGACCACGTTCATTTGTTCGTCAGTAGCCCGCCCAAACACAGCCCATCGCTGCTCGCCAACTGGTTCAAGCGCATTTCCTCACGGAAGTACAA
TCACCGCCACACCGACCACGACGGCGAGAAAATCCGGTGGGCACGCGGCTACTACGCCGGCACAGCCGGGCACGTCTCCAGCGAAACTGTCGAGAACTAC
ATCGACCGACACAAGGAGGTCCAAGCGTGACCGAACTCACGAAAACGCTCGAACTGAAGCTTGTGGACCCGAACACCCACAAGCGACAGAAGCTTCGTGA
GACACAGGACGCGTACCAGCAGGCACTCCAAGCCGCTTTCGCCGCTGGCTGTGACACACAGTCAGCGGCCAACGACGTGGTTGTTGAGTACGACCTGAGC
GGCTACGCGAAAAACGCCCTCAAGAAGTACGTCCCACAACTCTGTGGGAACAGCTATGACGCCGACGAGCTTCACGACAACCACCCGGTGCGGTTCACCA
ATGAGGGATTGCAACTTGACCACCAGCCACAGAACGCCATCGAGTGGTACGTCAAAATCCCGCACCACGAGGATTACCACCTCTGGATTCCAGCACGCGC
TAATCCCGAACAGCGGGAGTGGCTCGAAGCGTTAGACGCAGACGACGCCGAGATGGGTGAAAGTCGGCTGTTTGAGCGGGACGGAACGTGGTATCTCCAC
ATCACCGTTACCCGCGACGTGGAGGACCAATCTGAGGCGTCCGCCGACGAGCGGACACCGCCAGACGAGTCTGGCGAGCCCTGTCTCGCTGGACTCGACA
GGACGCCGATAGGCGTGGATATTGGAGAAGCGAGTCTCGTCACGGTGTGTCACCGCGACGGCTCTGGTTCTCCGGTTCGCCCCCGCCTGTGGGCCGACGA
CGGTAAAGCCGTTCGTCGGCTCCGCAAAACCTACTTCACCGCCAAGCGACGGCTTCAGCAGCGCGGCAGTGAGCGAATCGCGGAGTCCTACGGCGACTCG
CTGTGGGACCAGATTGACGACGTGTTCCACCGTGTAACCCGTGAGGTCGTGGAGTACGCCGAGTCTGTCGAGAACCCAGTGTTGGTGCTGGAAGACCTGA
CGTACATCCGCGAGAACATGGACTACGGCGAGTACATGAACCGCCGGTTGCACGGATGGGGGTTTGCCAAGCTCCACGCACAGATTCGCTACAAAGCCAC
AGAGAAGGGGATTCCCGTCGAGACGGTGAATCCCCGGAACACGTCGAAGGCGTTCCATGCCTGCGGTGAACACGGCTCCCGGCCACGACAGGCGACGTTC
AGATGCTCGAACGACGACTGCTGGCTCGGTGAGTATCAAGCCGACGTGAACGGGGCGATAAATATTGCAGACCGCTACCGTAGTGGAGAGAGTCACCGCC
GAAGCGACCGGAGTTCCCGGCAGAAGGCCGGTGACGATGACTCGGCTACGGATGGGGCCTCTTTGACCGGGCCACAAGACAGCCACGCCGATGCTGGAAC
CCAGCAGGAGACGCGTGGAACGTATGCGTCTTGAAACCAACAGGTCGCTACACTCGGCCTGAAATCCCGTGGTGGGATTCCTCCGCGTTCACGCGGAGGA
AGAGGTCAA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
441 bp | 146 aa | 190 | 630 | + | No |
Chemistry : Y1
ORF sequence :
MKTTRHATYNLNYHIVWLPKSRRTDGSADIAERSSAQYRKAVLTGEVADRVESILHEIADEKGLDIQNLTVQPDHVHLFVSSPPKHSPSLLANWFKRISS
RKYNHRHTDHDGEKIRWARGYYAGTAGHVSSETVENYIDRHKEVQA
RKYNHRHTDHDGEKIRWARGYYAGTAGHVSSETVENYIDRHKEVQA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1308 bp | 435 aa | 627 | 1934 | + | No |
AG : TnpB
ORF sequence :
MTELTKTLELKLVDPNTHKRQKLRETQDAYQQALQAAFAAGCDTQSAANDVVVEYDLSGYAKNALKKYVPQLCGNSYDADELHDNHPVRFTNEGLQLDHQ
PQNAIEWYVKIPHHEDYHLWIPARANPEQREWLEALDADDAEMGESRLFERDGTWYLHITVTRDVEDQSEASADERTPPDESGEPCLAGLDRTPIGVDIG
EASLVTVCHRDGSGSPVRPRLWADDGKAVRRLRKTYFTAKRRLQQRGSERIAESYGDSLWDQIDDVFHRVTREVVEYAESVENPVLVLEDLTYIRENMDY
GEYMNRRLHGWGFAKLHAQIRYKATEKGIPVETVNPRNTSKAFHACGEHGSRPRQATFRCSNDDCWLGEYQADVNGAINIADRYRSGESHRRSDRSSRQK
AGDDDSATDGASLTGPQDSHADAGTQQETRGTYAS
PQNAIEWYVKIPHHEDYHLWIPARANPEQREWLEALDADDAEMGESRLFERDGTWYLHITVTRDVEDQSEASADERTPPDESGEPCLAGLDRTPIGVDIG
EASLVTVCHRDGSGSPVRPRLWADDGKAVRRLRKTYFTAKRRLQQRGSERIAESYGDSLWDQIDDVFHRVTREVVEYAESVENPVLVLEDLTYIRENMDY
GEYMNRRLHGWGFAKLHAQIRYKATEKGIPVETVNPRNTSKAFHACGEHGSRPRQATFRCSNDDCWLGEYQADVNGAINIADRYRSGESHRRSDRSSRQK
AGDDDSATDGASLTGPQDSHADAGTQQETRGTYAS
Blast result :
Comments
ISHma17 is 79% (TnpA : the transposase) aa similar to ISNpe10 and 91% (TnpB) aa similar to ISNma24.
References
1] Friedhelm Pfeiffer (2015) Direct submission
2] Baliga,N.S., Bonneau,R., Facciotti,M.T., Pan,M., Glusman,G., Deutsch,E.W., Shannon,P., Chiu,Y., Weng,R.S., Gan,R.R., Hung,P., Date,S.V., Marcotte,E., Hood,L. and Ng,W.V. (2004) Genome Res. 14 (11), 2221-2234
2] Baliga,N.S., Bonneau,R., Facciotti,M.T., Pan,M., Glusman,G., Deutsch,E.W., Shannon,P., Chiu,Y., Weng,R.S., Gan,R.R., Hung,P., Date,S.V., Marcotte,E., Hood,L. and Ng,W.V. (2004) Genome Res. 14 (11), 2221-2234