ISHma7
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006396 | ND | Haloarcula marismortui | Haloarcula marismortui chromosome I |
DNA section
IS Length : 2008 bp
Ends
Left end : AGGCGGAGCAGGCCGAGTGCCTCGGGGCTTGACCCCGAGGGTGAAGGCCGTAAGCCCCGTTAAACGTGTTCCGTTCGCTCGATATACTGCTCAATCGTGT
II struct. : Yes
Right end : GTGACCCAACCGACCGTTCGAGGCTATCAGGCCGATGGTCGGATGGGAGTGTCCGACTAAACCACGGGAAGCCTCGGGGCTTGACCCCGAGGCGGTTCAC
II struct. : Yes
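Both ends are flagged as having secondary structure. As a rough illustration only, the Python sketch below scans the two end sequences listed above for perfect stem-loops (a stem followed, after a short loop, by its reverse complement). The function names, the 8 bp stem length, and the 3-8 nt loop window are assumptions chosen for this example and are not part of the record; the scan ignores mismatches, G-U pairing, and folding energy, so it is not a substitute for a real structure prediction.

```python
# Minimal sketch: look for perfect stem-loops in the listed IS ends.
# Stem length and loop window are illustrative assumptions, not values
# taken from the database record.

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def find_hairpins(seq: str, stem: int = 8, min_loop: int = 3, max_loop: int = 8):
    """Return (start_index, stem_sequence, loop_sequence) for every
    position where a stem is followed by its reverse complement after
    a loop of min_loop..max_loop nucleotides (0-based start index)."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * stem - min_loop + 1):
        left = seq[i:i + stem]
        target = reverse_complement(left)
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop
            if seq[j:j + stem] == target:
                hits.append((i, left, seq[i + stem:j]))
    return hits


# End sequences copied from the "Ends" fields of this record.
LEFT_END = (
    "AGGCGGAGCAGGCCGAGTGCCTCGGGGCTTGACCCCGAGGGTGAAGGCCGTAAGCCCCGTT"
    "AAACGTGTTCCGTTCGCTCGATATACTGCTCAATCGTGT"
)
RIGHT_END = (
    "GTGACCCAACCGACCGTTCGAGGCTATCAGGCCGATGGTCGGATGGGAGTGTCCGACTAAA"
    "CCACGGGAAGCCTCGGGGCTTGACCCCGAGGCGGTTCAC"
)

if __name__ == "__main__":
    for name, end in (("left", LEFT_END), ("right", RIGHT_END)):
        for start, stem_seq, loop_seq in find_hairpins(end):
            print(name, start, stem_seq, loop_seq)
```

With these settings, both ends contain the same candidate stem-loop, CCTCGGGG / CTTGA / CCCCGAGG; other overlapping hits may also be reported.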
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
CGAAATCAACAGACAGACCT | CTAC | CTCGTGCTTGAGGGCGAC | tcac |
DNA sequence
AGGCGGAGCAGGCCGAGTGCCTCGGGGCTTGACCCCGAGGGTGAAGGCCGTAAGCCCCGTTAAACGTGTTCCGTTCGCTCGATATACTGCTCAATCGTGT
CGGTCGAAACATCACCCGCCGTCCCCACGTAGTACGATTCCTCCCAGAACCCACCTCCCCACAGATACTCCTCCAAGTACGACTCGTACTGTTCCCACAT
CTCCCGCGCCGTGATGCTCTTGACCGTTCGCACAATCTCGCTCGGCGCATGCTTCGGGTGGGCTGACAGGAACAGGTGTACGTGGTCGGGTGAGATGTGG
AGCGACAGTATCTCGTAGCCGTACTCGTCGCACATGGCACGGAAACTCGCTTCCAGCGAATCCTCGATTGGTTCGAGAATGGTGTGGCGGTACTTCGGAC
ACCACACGAAGTGGTAGTTAATGTTGTACACCGTGTGGTTCGACCGCTTCTCGCCCATACATACCAATACAACGACCAATCGAAAGTATTCTCTGAAGTG
ACTCACAGACTACCACAAGTGAACTTTTAACTGTGGAAGAACAAGGGGTAGACGTGACGAAACAACTCAAGGTATCCGACACCGTGTACGACGACCTTGA
CGAACTCAAGGACGAGGAGGGCCACACGAGTTTCGACAGCGTACTCCGCACGCTCCTCCTCCACTACCGTCACGTCCAACCCAACGAGCAGGAATGACCG
ACTCACAGGCACTCGTCAAGACGCTGGACTTCCAACTCGACATCCAGAGTGACAACGAGAGCCTGCTGTACGACGCCACCCTCGAAGCGCGGTCGGTGTA
CAACGAAACCATCCGTCTCGCCAAGCAAGGCGTCGACTGGGACGCGATTCCCGACCGTGTAGCCGACGACGCCAACCTCGTGAAAAACACGACACAGCGC
GTCGTCGCCAAAGCACTCGGCGCGATGGAGAACTACTACGAGTACGACGACTTCGGCAAACCCAGTCACACCAAGGACGGCGCGTACCCCCTCCGAGCGA
ACTACGAGGAGGGGTACAACCTGTCGCTCACCGACGACGGCGACGTGGCGTTCCGCATCAGCGCGAAGCCGTACAAGCACGTCACGGGCGTCCTCAAAGG
GAGTGACGCCAACCTCGACATTCTCCAGACCGCACTCGAAAGCGATGAGTGGAAGATTGGGACGGCGGAAGCCCTGTTCCACAACGACAACGCTGAGTTG
CACGTCAACGTCACCAACACCGAACAGACCGTTCGAGACAAGCAGGACTCGCGGACGGTCGTCGGTGTGGACGTGAACGAAGACAACGTGGCTCTCACCG
CTCTCTCCGAGGATGGCATTGAGGACTCGTTGGTTATTGACTTCCCCGAAATCAAGTTCGAGCGCCACCGCTACTTCACGATGCGGAAGCGCGTCCAGAA
CGCGGGGAAAGACAGCATCCACGACACGCTGGAAGGGCGTGAGGAACGGTTCGTCCGTGACCGACTCCACAAGGTGTCTCGACACATCGTGGAGTGGAGT
CGTCAGTTCGAGAAGCCGTGCATCGTCTTTGAAGACCTCAAAGAGATGCGCGACAGTATCGACTACGGCACGCGGATGAACCGACGCTTGCACCACCTTC
CGTTCCGCGCCCTCCAGTTCTATACGTCGTACAAGGCGTCGTTCGAGGGTATCCCGACTGCGTGGATTAACCCCGAGTACACGAGCCAACGGTGTCCGAT
GTGCGGACACACGGAACGTGCGAACCGTAACAAGAAACGGTTCAATTGTTGGGACTGTGGGCATCAAGACCACAGCGACCGTGGTGCAAGCGTCAACATC
GCCGTGAAAGGCGTGAAGAAACTCGATTGGAATGTGCCTGCTCTCAACAGCCTTCCCGTTGTTCGGAAGGTGCGACGGCGGGCATCGGGGGCCGTGGACG
CCCCGACCGTGACCCAACCGACCGTTCGAGGCTATCAGGCCGATGGTCGGATGGGAGTGTCCGACTAAACCACGGGAAGCCTCGGGGCTTGACCCCGAGG
CGGTTCAC
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
399 | 132 | 60 | 458 | - | No |
Chemistry : Y1
ORF sequence :
MGEKRSNHTVYNINYHFVWCPKYRHTILEPIEDSLEASFRAMCDEYGYEILSLHISPDHVHLFLSAHPKHAPSEIVRTVKSITAREMWEQYESYLEEYLW
GGGFWEESYYVGTAGDVSTDTIEQYIERTEHV
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1275 | 424 | 694 | 1968 | + | No |
AG : TnpB
ORF sequence :
MTDSQALVKTLDFQLDIQSDNESLLYDATLEARSVYNETIRLAKQGVDWDAIPDRVADDANLVKNTTQRVVAKALGAMENYYEYDDFGKPSHTKDGAYPL
RANYEEGYNLSLTDDGDVAFRISAKPYKHVTGVLKGSDANLDILQTALESDEWKIGTAEALFHNDNAELHVNVTNTEQTVRDKQDSRTVVGVDVNEDNVA
LTALSEDGIEDSLVIDFPEIKFERHRYFTMRKRVQNAGKDSIHDTLEGREERFVRDRLHKVSRHIVEWSRQFEKPCIVFEDLKEMRDSIDYGTRMNRRLH
HLPFRALQFYTSYKASFEGIPTAWINPEYTSQRCPMCGHTERANRNKKRFNCWDCGHQDHSDRGASVNIAVKGVKKLDWNVPALNSLPVVRKVRRRASGA
VDAPTVTQPTVRGYQADGRMGVSD
Blast result :
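The ORF coordinates in this section can be cross-checked with simple arithmetic. The sketch below is a hypothetical helper, not part of the record: it assumes coordinates are 1-based and inclusive, and that the nucleotide length includes the stop codon (an inference from the listed numbers, since 399 = 3 × (132 + 1) and 1275 = 3 × (424 + 1)). The `Orf` class and `check_orf` function are names invented for this example.

```python
# Minimal consistency check for the ORF table values in this record.
# Assumptions (not stated in the record): 1-based inclusive coordinates,
# and a nucleotide length that counts the stop codon.
from dataclasses import dataclass

IS_LENGTH = 2008  # "IS Length" field of this record, in bp


@dataclass
class Orf:
    name: str
    length_bp: int
    length_aa: int
    begin: int
    end: int
    strand: str


def check_orf(orf: Orf, is_length: int = IS_LENGTH) -> dict:
    """Return a dict of named boolean checks for one ORF."""
    span = orf.end - orf.begin + 1  # inclusive coordinate span
    return {
        "span_matches_bp": span == orf.length_bp,
        "bp_matches_aa_plus_stop": orf.length_bp == 3 * (orf.length_aa + 1),
        "inside_element": 1 <= orf.begin <= orf.end <= is_length,
    }


ORFS = [
    Orf("ORF 1", 399, 132, 60, 458, "-"),
    Orf("ORF 2", 1275, 424, 694, 1968, "+"),
]

if __name__ == "__main__":
    for orf in ORFS:
        print(orf.name, check_orf(orf))
```

Under these assumptions, all three checks hold for both ORFs, and the two reading frames do not overlap (ORF 1 ends at 458, ORF 2 begins at 694).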
Comments
ISHma7 shows 58% amino acid similarity to ISDra2 (ORF A) and 47% to ISH12 (ORF B).
References
[1] Baliga, N.S., Bonneau, R., Facciotti, M.T., Pan, M., Glusman, G., Deutsch, E.W., Shannon, P., Chiu, Y., Weng, R.S., Gan, R.R., Hung, P., Date, S.V., Marcotte, E., Hood, L. and Ng, W.V. (2004) Genome Res. 14(11), 2221-2234