ISAcma16
- Family IS4
- Group IS4
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009925 | ND | Acaryochloris marina | Acaryochloris marina MBIC11017 |
DNA section
IS Length : 1483 bp
Ends
IR Length : 18 bp
IRL : CAATGGCACTGACTTAAGAAGTCAGGAAAAAGCTACAAGTCTTGATAGAT
IRR : CAATGGCACTGACTTAAGCTCAATCAAACAGCTAGCTTAGCTTTGAGAGC
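The reported IR length can be cross-checked against the two terminal sequences listed above. The following is a minimal sketch, assuming the inverted repeat is counted as the run of identical bases at the start of IRL and IRR, both written 5'→3' from the outer end. The helper name `shared_prefix_length` is illustrative and not part of any database tool. The record may derive its IR length by a different criterion, for example one that tolerates mismatches.

```python
# Terminal sequences copied from the "Ends" fields of this record.
IRL = "CAATGGCACTGACTTAAGAAGTCAGGAAAAAGCTACAAGTCTTGATAGAT"
IRR = "CAATGGCACTGACTTAAGCTCAATCAAACAGCTAGCTTAGCTTTGAGAGC"


def shared_prefix_length(a: str, b: str) -> int:
    """Count the leading positions at which both sequences carry the same base."""
    count = 0
    for base_a, base_b in zip(a, b):
        if base_a != base_b:
            break
        count += 1
    return count


if __name__ == "__main__":
    # Hypothetical check: compare with the "IR Length" field of the record.
    print("Perfectly matching terminal bases:", shared_prefix_length(IRL, IRR))
```

Under this assumption, the number of perfectly matching terminal bases agrees with the IR length given in the record.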
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCAGATCAATCACATCCCCC | ACCCCCAGAT | AGCGCACCTGGCTCGCCCTC | 10 |
AGCAATGAGGTATTGACT | ATCCCTTGGAAT | ATCTTTCTCCTTGACAAC | 12 |
CTACCATTGGGTCGATGGCT | AGCTATAGTT | TATTCGCTCTTTCTAGCCAG | 10 |
AAATATTGAAAGGGAGAAAG | CCCCATAGAT | TGCTTGAACAGACGGCTAAA | 10 |
AGAAGGCGAATCTATCGCT | ATTGCATCGTT | CAATCTCTCAACGCTGAAT | 11 |
GTCGGTATTTCCAGATACG | TCCCTATGGGT | TGCCATCCATAGGGAATAC | 11 |
CCCAATGTTCAAGAGAGCT | ATCCCAACAGT | GGTTGTTCAAGAATGAACT | 11 |
CTCAATTCCCTGCCTGATCA | GTCTATAGAT | GCTTTGAAAATTCAGTAATT | 10 |
GTTTGCTGAGTCGTTTAAGC | AGTCCTGATT | CGCCCAGAATATCTTCCGGT | 10 |
ATCACCGCCTTGGGATAAAC | TATTAAAGTT | GGCTTGACCTGAATCGGGCA | 10 |
GGGAAATCCTTTCCCAAGTT | ATTTATGGAT | TCCTGCAGTTTTTGCCCAAA | 10 |
AGTTCATTCTTGAACAACC | ACTGTTGGGAT | AGCTCTCTTGAACATTGGG | 11 |
AGGAAACGCTGCATCTGTC | ATCTAACGGAT | AGGTCTCAGGGGTGCGAGA | 11 |
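The DR Length column in the table above can be checked against the direct-repeat sequences themselves. The sketch below is an illustration only. The list `ROWS` is transcribed by hand from the table, and the variable names are assumptions, not part of the record.

```python
from collections import Counter

# (direct repeat, reported DR length) pairs transcribed from the insertion-site table.
ROWS = [
    ("ACCCCCAGAT", 10),
    ("ATCCCTTGGAAT", 12),
    ("AGCTATAGTT", 10),
    ("CCCCATAGAT", 10),
    ("ATTGCATCGTT", 11),
    ("TCCCTATGGGT", 11),
    ("ATCCCAACAGT", 11),
    ("GTCTATAGAT", 10),
    ("AGTCCTGATT", 10),
    ("TATTAAAGTT", 10),
    ("ATTTATGGAT", 10),
    ("ACTGTTGGGAT", 11),
    ("ATCTAACGGAT", 11),
]

# Rows whose sequence length disagrees with the reported length.
mismatches = [(dr, reported) for dr, reported in ROWS if len(dr) != reported]

# Distribution of observed direct-repeat lengths across all insertion sites.
length_counts = Counter(len(dr) for dr, _ in ROWS)

if __name__ == "__main__":
    print("Inconsistent rows:", mismatches)
    for length in sorted(length_counts):
        print(f"{length} bp: {length_counts[length]} site(s)")
```

A tally like this makes it easy to see that the direct repeats in this record range from 10 to 12 bp, with 10 bp the most frequent.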
DNA sequence
CAATGGCACTGACTTAAGAAGTCAGGAAAAAGCTACAAGTCTTGATAGATATAGGATCAAGACTTGCACGCCATAAACTTCTGCCTTTCACTATGTCTGA
GTCAGCCGAAATTCTCAAGCAGCAATTTTCCCAAAGCCTGGGTTTACCGTGGACAGATATCCTTCCAGCCTCAAGGTTAGAGGAACTCCTGAAAGAAGAA
GCATTCTCCTATCGCAACCGGATATATAGCCCCATCGTTACGCTGTGGGCCATGCTTTACCAAGTGCTATCAGCCGATAAAAGCCTTCGTAACACAGTCA
AATGCATCACGACTTGGCTCACAGCGGCTGGCATCCAACCGCCCTCATCTGATACCGGAGCCTACAGCAAAGCCAGAAGTCGCTTTCCAGAGTCACTGTT
GCAACGTTTAATTCCCGAAAGCGCTGAGTGCTTAGCGCAACCCCTCTCCCCAGAGCACCTCTGGTGTGGTCGGCCCGTCAAGGTCTACGACGGAACCACC
GTGTTGATGGCTGATAGTGCGGCCAACCAAGCATCATATCCACAACATGGCAATCAAACAGCAGGCTGTGGTTTTCCCATCGCTCGCTTGGTAGTGTTCT
TCTGTTTGGTTACCGGTGCAGTGGCGTCAGCTTGTATTGCCTCCTGGGACACCAGTGAAATTGTCATGAGTCGTTTGCTCTATCAAGACCTTGAGGTCGG
TGATGTGGTCATGGCGGACCAAGCTTATGGCAGCTATGTTGATCTAGCCATCATTCAACAACACAGGGCTGATGGAGTGTTGCGTAAACATCATGCTCGC
AAGACTGATTTCCGCAAAGGCAACAAGCATGGCATTGGTGATCATCAGGTGACATGGCATAAGCCAGCCCAACGGCCTGAGCACATGAGTGAGCAAGATT
TTGCCCTGATTCCTCAAACATTGGTCGTTAGAGAGGTGTGTTTGCGCTTATCCCTTAAGGGCTTTCGCGACCAGCACATTATTGTGGTGACGACGCTGCT
GGATGCTCAACGCTACAGCGCTGGGCAACTGACTCGCTTGTATGGCTGGCGTTGGCCAGTGGCGGAAGTCAATCTGCGCCATCTCAAAACCACCTTAAAA
ATGGAGATGCTCAGTGCCAAAACTCCGGATATGGTGCGCAAGGACATTTGGGTACATTTGTTGGGCTATAACCTACTCAGAAGTCTCATGGAACTTGCGG
CACCGCTAGCAGATAATGCTAGAACTCAACTGTCTGTGCAAGGAGCACGACAACACTTCAATCAGATGCTTGCTTTGTTGGCGACAGCCAACCGTGCGAC
CAGAAAGCGGTTGTTTACTCATCTACTTGAGACCATGGCAGCCGATCTATTACCCTCTCGACCGAATCGGCACGAACCGAGAGTCGTCAAACGCAGACCC
AAATCTTTCCCGCGAATGCGACAACCTCGCTCTGCTCTCAAAGCTAAGCTAGCTGTTTGATTGAGCTTAAGTCAGTGCCATTG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1419 | 472 | 42 | 1460 | + | No |
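The ORF coordinates and the two lengths in the table above are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the End position includes the terminal stop codon. Both are assumptions about the record's convention, although they are consistent with the values listed. The function name `orf_lengths` is hypothetical.

```python
def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, amino-acid length) for an ORF.

    Assumes 1-based inclusive coordinates and a stop codon included
    in the coordinate range, so the protein is one codon shorter.
    """
    nt_length = end - begin + 1
    if nt_length % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    aa_length = nt_length // 3 - 1  # subtract the stop codon
    return nt_length, aa_length


if __name__ == "__main__":
    # Coordinates taken from the ORF 1 table of this record.
    nt, aa = orf_lengths(42, 1460)
    print(f"{nt} bp, {aa} aa")
```

With Begin 42 and End 1460 this gives 1419 bp and 472 aa, matching the lengths in the table.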
Chemistry : DDE
ORF sequence :
MIDIGSRLARHKLLPFTMSESAEILKQQFSQSLGLPWTDILPASRLEELLKEEAFSYRNRIYSPIVTLWAMLYQVLSADKSLRNTVKCITTWLTAAGIQP
PSSDTGAYSKARSRFPESLLQRLIPESAECLAQPLSPEHLWCGRPVKVYDGTTVLMADSAANQASYPQHGNQTAGCGFPIARLVVFFCLVTGAVASACIA
SWDTSEIVMSRLLYQDLEVGDVVMADQAYGSYVDLAIIQQHRADGVLRKHHARKTDFRKGNKHGIGDHQVTWHKPAQRPEHMSEQDFALIPQTLVVREVC
LRLSLKGFRDQHIIVVTTLLDAQRYSAGQLTRLYGWRWPVAEVNLRHLKTTLKMEMLSAKTPDMVRKDIWVHLLGYNLLRSLMELAAPLADNARTQLSVQ
GARQHFNQMLALLATANRATRKRLFTHLLETMAADLLPSRPNRHEPRVVKRRPKSFPRMRQPRSALKAKLAV
Blast result :
Comments
ISAcma15 is 57% aa similar to ISArch10.
References
[1] Swingley,W.D., Chen,M., Cheung,P.C., Conrad,A.L., Dejesa,L.C., Hao,J., Honchak,B.M., Karbach,L.E., Kurdoglu,A., Lahiri,S., Mastrian,S.D., Miyashita,H., Page,L., Ramakrishna,P., Satoh,S., Sattley,W.M., Shimada,Y., Taylor,H.L., Tomo,T., Tsuchiya,T., Wang,Z.T., Raymond,J., Mimuro,M., Blankenship,R.E. and Touchman,J.W. (2008) Proc. Natl. Acad. Sci. U.S.A.
[2] ISfinder annotation (2009)