ISHma1
- Family IS4
- Group ISH8
Isoform Synonym(s) ISHma3
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_005125 | ND | Haloarcula marismortui | Haloarcula marismortui ATCC 43049 |
DNA section
IS Length : 1403 bp
Ends
IR Length : 17/19
IRL : CATCTGTCTTTAGCTAAGAGACGAACCACGTGACAGCAGCGGCTTATCGA
IRR : CATTCGTCTTTAGCTAAGACTCACACCTCGGTTGTGTAGCGGTAGCGAGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGTTCAAACA | ATTCATCAGAT | ATATGCGGCC | 11 |
CAGCCGTTTT | CGGGTCACAC | TTTTTCAGCA | 10 |
TGCCGACTAA | GTTTGAGTGTACGTATG | ATGAGCTAAC | 17 |
CTTCACCTCA | CTTTACCCTT | TTACCCCCCT | 10 |
DNA sequence
CATCTGTCTTTAGCTAAGAGACGAACCACGTGACAGCAGCGGCTTATCGAGGCTACTTTTCGAGAGGAATGAGGAGGAGTCCGCTGTGCACACTGACACC
TCCTCGTCGAGAATCATGCGTCGGCTCACTACACTGTTTCCCTCCGAGTTCCTCGAAGAGCACGCCGAGGAACTCGGCGTGGTCGAGCGAGAGGGCAAGC
TTCAGATTCCTGTCCTCGTGTGGGCGCTCGTGTTCGGCTTCGCCGCAGGCGAGAGCCGAACACTCGCTGGGTTTAGACGCTGCTACAACGCTACAGCTGA
CGAACCGATCTCTTCTGGCGGTTTCTATCACCGGTTGACGCCTACTCTTGCAGAGTATCTCCGCGACCTCGTCGAGGCCGCGCTCGACGAGGTCGCTGTC
CCTGATGCTGTTGACGCTGATATCGACCGATTCAGGGACGTGATGATCGCTGATGGAACCGTGTTGCGGTTGCACGAGTTCCTCTCTGATGAGTTTCAAG
CCCGCCACGAGGAGCAGGCTGGAGCGAAGCTCCACCTGCTCCACAACGCCACCGACCAGACGATTGAACGGATCGATGTGACTGATGAGAAAACGCACGA
CAGCGCGCTGTTCAAGACGGGATCGTGGCTGCAAGGACGACTGGTTCTATTTGATCGGGCGTACTTCAAATACCGCCGCTTTGCGTTGATCGACGAGAAC
GACGGCTACTTTGTGAGTCGGCTGAAGCAGAACGCAAATCCGGTGATAACGGCGGAATTACGGGAATGGCGCGGCCGCGCCATTCCCTTGGAGGGCAAGC
AGATCCACGACGTGGTCAATGATCTCTCGCGGAAGTACATCGACGTAGAGGTGGAAGCAGAATTCAAGCGCGGCCAGTACGAGGGAACTCGGTCGCTGGA
CACGAAGCGGTTTCGCGTCGTCGGCGTCCGCAACGAGGACGCCGACGACTACCACCTGTACATCACGAATCTGCCGAGAGAGGAGTTTCTCCCGGCGGAT
CTAGCGACGCTGTATCGGTGTCGATGGGAGGTAGAAACGTTGTTCCGTGAGTTGAAAACGCAGTACGAACTGGACGAATTCGACACGAACAACCCTGATG
TCGTGAAAATTCTACTATACGCTGCGTTGCTGTCACTGCTGGTGAGCCGAGAGTTGTTGGATCTGGTCACCGAGCAGGCCGACGATGAGATCGTGTTTCC
GCCGGAACGCTGGGCGGCGACCTTCCGGTCGCACGCCCAGCTCATCCTCCACGAACTCGGTGAGTACCTCGGCTACTCGCCACCGCCGTTGCTGGAGCGG
CTGATCGAGGATGCTCAGAAGATCCACCAACAACGACCGATCTTACAAGAGACGCTCGCTACCGCTACACAACCGAGGTGTGAGTCTTAGCTAAAGACGA
ATG
TCCTCGTCGAGAATCATGCGTCGGCTCACTACACTGTTTCCCTCCGAGTTCCTCGAAGAGCACGCCGAGGAACTCGGCGTGGTCGAGCGAGAGGGCAAGC
TTCAGATTCCTGTCCTCGTGTGGGCGCTCGTGTTCGGCTTCGCCGCAGGCGAGAGCCGAACACTCGCTGGGTTTAGACGCTGCTACAACGCTACAGCTGA
CGAACCGATCTCTTCTGGCGGTTTCTATCACCGGTTGACGCCTACTCTTGCAGAGTATCTCCGCGACCTCGTCGAGGCCGCGCTCGACGAGGTCGCTGTC
CCTGATGCTGTTGACGCTGATATCGACCGATTCAGGGACGTGATGATCGCTGATGGAACCGTGTTGCGGTTGCACGAGTTCCTCTCTGATGAGTTTCAAG
CCCGCCACGAGGAGCAGGCTGGAGCGAAGCTCCACCTGCTCCACAACGCCACCGACCAGACGATTGAACGGATCGATGTGACTGATGAGAAAACGCACGA
CAGCGCGCTGTTCAAGACGGGATCGTGGCTGCAAGGACGACTGGTTCTATTTGATCGGGCGTACTTCAAATACCGCCGCTTTGCGTTGATCGACGAGAAC
GACGGCTACTTTGTGAGTCGGCTGAAGCAGAACGCAAATCCGGTGATAACGGCGGAATTACGGGAATGGCGCGGCCGCGCCATTCCCTTGGAGGGCAAGC
AGATCCACGACGTGGTCAATGATCTCTCGCGGAAGTACATCGACGTAGAGGTGGAAGCAGAATTCAAGCGCGGCCAGTACGAGGGAACTCGGTCGCTGGA
CACGAAGCGGTTTCGCGTCGTCGGCGTCCGCAACGAGGACGCCGACGACTACCACCTGTACATCACGAATCTGCCGAGAGAGGAGTTTCTCCCGGCGGAT
CTAGCGACGCTGTATCGGTGTCGATGGGAGGTAGAAACGTTGTTCCGTGAGTTGAAAACGCAGTACGAACTGGACGAATTCGACACGAACAACCCTGATG
TCGTGAAAATTCTACTATACGCTGCGTTGCTGTCACTGCTGGTGAGCCGAGAGTTGTTGGATCTGGTCACCGAGCAGGCCGACGATGAGATCGTGTTTCC
GCCGGAACGCTGGGCGGCGACCTTCCGGTCGCACGCCCAGCTCATCCTCCACGAACTCGGTGAGTACCTCGGCTACTCGCCACCGCCGTTGCTGGAGCGG
CTGATCGAGGATGCTCAGAAGATCCACCAACAACGACCGATCTTACAAGAGACGCTCGCTACCGCTACACAACCGAGGTGTGAGTCTTAGCTAAAGACGA
ATG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1275 bp | 424 aa | 116 | 1390 | + | No |
Chemistry : DDE
ORF sequence :
MRRLTTLFPSEFLEEHAEELGVVEREGKLQIPVLVWALVFGFAAGESRTLAGFRRCYNATADEPISSGGFYHRLTPTLAEYLRDLVEAALDEVAVPDAVD
ADIDRFRDVMIADGTVLRLHEFLSDEFQARHEEQAGAKLHLLHNATDQTIERIDVTDEKTHDSALFKTGSWLQGRLVLFDRAYFKYRRFALIDENDGYFV
SRLKQNANPVITAELREWRGRAIPLEGKQIHDVVNDLSRKYIDVEVEAEFKRGQYEGTRSLDTKRFRVVGVRNEDADDYHLYITNLPREEFLPADLATLY
RCRWEVETLFRELKTQYELDEFDTNNPDVVKILLYAALLSLLVSRELLDLVTEQADDEIVFPPERWAATFRSHAQLILHELGEYLGYSPPPLLERLIEDA
QKIHQQRPILQETLATATQPRCES
ADIDRFRDVMIADGTVLRLHEFLSDEFQARHEEQAGAKLHLLHNATDQTIERIDVTDEKTHDSALFKTGSWLQGRLVLFDRAYFKYRRFALIDENDGYFV
SRLKQNANPVITAELREWRGRAIPLEGKQIHDVVNDLSRKYIDVEVEAEFKRGQYEGTRSLDTKRFRVVGVRNEDADDYHLYITNLPREEFLPADLATLY
RCRWEVETLFRELKTQYELDEFDTNNPDVVKILLYAALLSLLVSRELLDLVTEQADDEIVFPPERWAATFRSHAQLILHELGEYLGYSPPPLLERLIEDA
QKIHQQRPILQETLATATQPRCES
Blast result :
Comments
ISHma1 was found by screening completely sequenced genomes for seqences homologous to the ISPpu8 transposase using BLASTP. Multiple alignments revealed a DDE motif : D(N2)-66-D(N3)-126-E(C1). The copy number in Haloarcula marismortui ATCC 43049 is 1 on chromosome I, 1 on chromosome II, 1 on plasmid pNG400 and 1 on plamsid pNG500.
References
1] Baliga,N.S., Bonneau,R., Facciotti,M.T., Pan,M., Glusman,G., Deutsch,E.W., Shannon,P., Chiu,Y., Weng,R.S., Gan,R.R., Hung,P., Date,S.V., Marcotte,E., Hood,L. and Ng,W.V.(2004)Genome Res. 14,2221-2234