ISAcma17
- Family IS5
- Group IS5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009925 | ND | Acaryochloris marina | Acaryochloris marina MBIC11017 |
DNA section
IS Length : 1697 bp
Ends
IR Length : 13/17
IRL : GGGCTTGCTTCGGAAGTCTAAGTTCCAATATTTGCAGTGTATCCAAGTGT
IRR : GGGTTTGCTGAGAAAGTAAATTGAGCAAATTCCATGCATCAAATCGTACA
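The 13/17 figure can be reproduced from the two terminal sequences listed above. IRR as given is the reverse complement of the right end of the element, so it can be compared with IRL position by position. The following is a minimal sketch; the helper name `count_ir_matches` is hypothetical and not part of any ISfinder tooling.

```python
# Terminal inverted repeats copied from the record above.
IRL = "GGGCTTGCTTCGGAAGTCTAAGTTCCAATATTTGCAGTGTATCCAAGTGT"
IRR = "GGGTTTGCTGAGAAAGTAAATTGAGCAAATTCCATGCATCAAATCGTACA"


def count_ir_matches(left: str, right: str, span: int) -> int:
    """Count identical positions within the first `span` bases of both ends."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))


matches = count_ir_matches(IRL, IRR, 17)
print(f"IR Length : {matches}/17")
```

Only the first 17 positions are compared because that is the span the record reports; extending the span would also count chance matches further inside the element.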
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTTTTATTTAGATTAATTTTTAA | TACATA | AGCTCAGTTCAACCTAAGCACAT | 6 |
TCTTGAGTGGAAACTTGTCGAAC | TAAATA | TCGGTGAAGTTTGGCACGATGGA | 6 |
TTCTCACAGAATAGCTGTGAACT | TATTTA | GCAGTACTTTTCAGACTTCGAAT | 6 |
TCATACAGGAGAACAAGCTCAAA | TATCTTA | AACCTAGTCATTGATACAGACA | 7 |
GAGTTCCATCTTGAAATATCTCG | TACTTA | TACAGTTCTTGTGGACAGGTTTG | 6 |
AGATTCACAAGTCATCATTAATC | TAATAA | AAGGGTTAACATCTCACCAAGAC | 6 |
TAGGGGGCTGAGATAATCTCACC | TAACTA | CCTGAGTAAGATACAACCGTTAT | 6 |
GAACGCTATCGTTCTGAATCGCCA | TACTTA | AATTCCCAAACACCGTTGCTAGAG | 6 |
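The direct repeats in the table can be summarised with a short tally. This is only an illustrative sketch over the eight repeats listed above; the variable names are assumptions introduced here.

```python
from collections import Counter

# Direct repeats copied from the insertion site table, in row order.
direct_repeats = [
    "TACATA", "TAAATA", "TATTTA", "TATCTTA",
    "TACTTA", "TAATAA", "TAACTA", "TACTTA",
]

# How many insertions produced each duplication length.
length_counts = Counter(len(dr) for dr in direct_repeats)

# Whether every duplicated target begins with the dinucleotide TA.
all_start_with_ta = all(dr.startswith("TA") for dr in direct_repeats)

print(dict(length_counts))
print(all_start_with_ta)
```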
DNA sequence
GGGCTTGCTTCGGAAGTCTAAGTTCCAATATTTGCAGTGTATCCAAGTGTTGAGGGCCCTTGAGAAGTGCAAGGAAAATCATTCAAAATATTTAAAACCC
TTGCACCTGCAACCCGATGTATCGCCGTCCATCCCCCGGTCAACTGTCCTTTGAGAACTTCTACTTGCCCTTTGGAGGGAAACTTTCAGGAGAGAATCGC
TGGGTCAAACTAGCCGAGTTGATTCCTTGGGAGAGCTTTGAGTCAGAATATGCAGACCAATTCAGTCAGACCATGGGGGCACCTGCCAAACCCTTTCGTG
TGGCCTTAGGGACCTTAATCATCAAAGAGAAACTGGGTATATCCGACGCAGAGACCGTAGAACAGATTCGAGAGAACCCCTACTTGCAGTACTTTCTGGG
GTTTAGTGAATATCGAGAGAGTGCTCCGTTTGATGCCTCCATGCTGGTTCACTTTCGCCAACGGTTGAATCTAGAGCTGCTAGCTCAAGTCAATGACGCA
GTCGTGGACTCTGTGCTGGCATCCGAGGGGTCTCCTGTGGCGATGTCCCCTGAGTCATCTCCTCCCTCTGACGATGATGAAGATGAGCCTCCTGCCCCTC
CCAACCAGGGCCAATTGTTGATAGATACGACCTGTGCACCAGCCGATATTCGCTATCCGACAGACTTGGATTTGCTCAATGATGCCCGCAAAGCAAGCGA
AGCCATCCTCGATGCTTTATATGCTCAAGTCATGCTGTCCCTCAAGTGCAAACCTCGCACCTACCGCCAGAAGGCCCGCAAAGCCTATCTGGCCATTGCC
AAAAAACGTCAACCGAGCCGCAAGCTGATTCGTAAAGGGATTCGTCAGCAGTTGGGCTATGTGCGTCGCAATCTAGCCCATATTGATGGGTTGATAACAT
TGGGGGCTTCCCTATCCCAATTGTCCCGGCGGCAGTACCGTCTGTTGCTGGTGATTCATGAAGTCTTTCGACAGCAAGAATGGATGTATCAACAGTCAGT
ACGTCGAGTGGACGACCGGATTGTCAGTATGAGTCAGCCTCATGTCCGACCGATGGTCCGAGGCAAAGCTGGAACACCCGTTGAGTTTGGAGCCAAGCTA
TCCCTCAGTTGTGTGCGTGGATGTGTGTTTCTGGAGCGCTTGAGTTGGGATGCGTTCAATGAATCCCAGCATCTTCAACATCAAGTGGAAGCCTTTCGTC
GTCGCTTTGGCCATTATCCTTTCTCAGTCCATGCTGACCAAATCTATCGGACTCGGGCTAATCGACGATGGTGTAAAGCTCGGGGGATTCGCTTGAGTGG
ACCGCCGTTAGGAAGACCTCAACAAGACCCACAAGTACAGACCCAACTCAAGCAACAAGCGTGTGAGGATGAAAAGGTGAGGGTCCAGATTGAGGGTAAG
TTTGGGCAGGCCAAGCGACGGTTTGGCTTGAATCGAGTGATGGCGAAGTTAGCGGATACGGCTGCCAGTGCGATTGCTATCACGTTCTTGGTCATGAATC
TGGAGCGGTGGCTCAGCCACTTTTTATCGTTCATTTTTGGGGTGGGATTGCTGGTTTTGGAAAGCGTTATGGAGCTGTCAGGACTTCCTTGGGGCTTTGC
TAGGGCCAGTCAAAGGATATGGCTCTGGCCGTCACAAATGGGTTGCTTGTACGATTTGATGCATGGAATTTGCTCAATTTACTTTCTCAGCAAACCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1581 bp | 526 aa | 117 | 1697 | + | No |
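The ORF coordinates can be checked against the reported nucleotide length with simple arithmetic. A minimal sketch, assuming 1-based inclusive coordinates (an assumption; the record does not state its coordinate convention):

```python
# Coordinates copied from the ORF table above.
begin, end = 117, 1697
reported_length_bp = 1581

# With 1-based inclusive coordinates the span is end - begin + 1.
orf_length_bp = end - begin + 1

# A coding span should be a whole number of codons.
codons, remainder = divmod(orf_length_bp, 3)

print(orf_length_bp == reported_length_bp)
print(codons, remainder)
```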
Chemistry : DDE
ORF sequence :
MYRRPSPGQLSFENFYLPFGGKLSGENRWVKLAELIPWESFESEYADQFSQTMGAPAKPFRVALGTLIIKEKLGISDAETVEQIRENPYLQYFLGFSEYR
ESAPFDASMLVHFRQRLNLELLAQVNDAVVDSVLASEGSPVAMSPESSPPSDDDEDEPPAPPNQGQLLIDTTCAPADIRYPTDLDLLNDARKASEAILDA
LYAQVMLSLKCKPRTYRQKARKAYLAIAKKRQPSRKLIRKGIRQQLGYVRRNLAHIDGLITLGASLSQLSRRQYRLLLVIHEVFRQQEWMYQQSVRRVDD
RIVSMSQPHVRPMVRGKAGTPVEFGAKLSLSCVRGCVFLERLSWDAFNESQHLQHQVEAFRRRFGHYPFSVHADQIYRTRANRRWCKARGIRLSGPPLGR
PQQDPQVQTQLKQQACEDEKVRVQIEGKFGQAKRRFGLNRVMAKLADTAASAIAITFLVMNLERWLSHFLSFIFGVGLLVLESVMELSGLPWGFARASQR
IWLWPSQMGCLYDLMHGICSIYFLSKP
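The start of the ORF sequence can be cross-checked against the DNA sequence by translating the first codons from position 117. The sketch below assumes the standard genetic code (the record does not name a translation table) and uses a hypothetical helper `translate`; the 30-nucleotide string is copied from positions 117 to 146 of the DNA sequence.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt of the ORF (positions 117-146 of the element).
orf_start = "ATGTATCGCCGTCCATCCCCCGGTCAACTG"
print(translate(orf_start))
```

The output can be compared with the first ten residues of the ORF sequence listed above.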
Blast result :
Comments
ISAcma17 shows 42% amino acid similarity to ISPg7.
References
[1] Swingley, W.D., Chen, M., Cheung, P.C., Conrad, A.L., Dejesa, L.C., Hao, J., Honchak, B.M., Karbach, L.E., Kurdoglu, A., Lahiri, S., Mastrian, S.D., Miyashita, H., Page, L., Ramakrishna, P., Satoh, S., Sattley, W.M., Shimada, Y., Taylor, H.L., Tomo, T., Tsuchiya, T., Wang, Z.T., Raymond, J., Mimuro, M., Blankenship, R.E. and Touchman, J.W. (2008) Proc. Natl. Acad. Sci. U.S.A.
[2] ISfinder annotation (2009)