ISSoc3
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007775 | ND | Synechococcus sp. | Synechococcus sp. JA-3-3Ab |
DNA section
IS Length : 1806 bp
Ends
Left end : CAGAACCTCCGGGGTACTCCCGCCACAGCGCAGCTTGGCGGGTGGGGGATAGGCGGTGAGACGAAGGGGTTCAATGCCCCTGAGTCTCACTATCTGGGCG II struct. : Yes
Right end : GGATCCTGGTTTACCAGTCCGTTGAAGCAGCAACTCCGGGAAGTGATTCTCGGAAGCTCCCGTCTACAGCCCGTCAGGGCCTAGCGGGAGAGGATGTCAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
AGCCGTGACT | CCAT | TGGGTGCAAA | TCAC |
TGCAACGAAC | CCAT | TTTCTCAGCC | TCAC |
GCTAACGGCG | CCAT | GTTTCGGCGT | TCAC |
GGGCGGATTT | CCAT | ACTCTTGTCT | TCAC |
TCCAAGTATT | CCAT | CACGCTCCTC | TCAC |
TGGCCACAGT | CCAT | TCTCAGCTTG | TCAC |
TCCCTCGCCT | CCAT | TTCAGTCGGC | TCAC |
TCTGTTTTCC | CCAT | CGAGGAGGAG | TCAC |
GCTCTGGAAT | CCAT | GAGCTGCCAA | TCAC |
CATCCACCAG | CCAT | GCCTCATTGC | TCAC |
TCCCGACTCC | CCAT | TTGGATACGC | TCAC |
GAAAAAGCTG | CCAT | TCAAGCGAGC | TCAC |
CCAAGTTCGG | CCAT | TGCAAGCAAA | TCAC |
AGGAGGTTGT | CCAT | TTTCACTTTT | TCAC |
GCTATATTCT | CCAT | TTTAGGGGGC | TCAC |
CCGCTCCTGC | CCAT | AGGACGTATG | TCAC |
DNA sequence
CAGAACCTCCGGGGTACTCCCGCCACAGCGCAGCTTGGCGGGTGGGGGATAGGCGGTGAGACGAAGGGGTTCAATGCCCCTGAGTCTCACTATCTGGGCG
ATGCCTGCTGCTCAACATACTTCTTCAACTCCTCAACGGTGACCCCACCACAAGAGGCAACAAAATAGGCCCCTGTCCAAAATACAGGCTTGCTGTAGAA
CCGTGCCACCTCTGTGGCGAACTCTTTGCGAATCAACCGGCTGGAGACTGTTTTCAGGTTGTTCACCAGCTTCGAGACCTGAACATCCGGCGGAAAACTC
ACCAACAGATGCACATGGTCCGCCTCACCGTCGAACTCCACCAAGGAACAGCGCCACTTTTGGCAGGTCGCTCGAAATATATCTTCCAGCCTCTGCAACA
TTGGAGCAGTTATCACCCGACGACGGTACTTTGTCACCAGCACCAAGTGGATTTGTAGGCTGTAAACAGAACGATGGCCTATGTTGTAGCTCATTGCGCT
AAATGCTAGAACCGTCTCATGATTATCACCCACGAGTACCGGATCCTGCCCAGCGACGACCAAGCCGCTCTGATGACCGAGTGGCTGGAATTGTTGCGGC
GGCAGTGGAACGACGCTCTGGGGCAGAGACTGGACTGGCTGACCGCAACCCGTTGCCCAATTGACCGCTGCAGTCTTGTCTCTTGTCCGTTGCCTGTGTC
AGAACCCCCGCTGGAGCCGAATTATTATCGGCAGGCGGGATCCCTCAAACAAATCAAGCAACTGTTCCCGGCCTACCGGGGCATTTACGCCGAGGTGCTG
CAGCAAAACTTGATGCGGCTGGACAAGGCGTGGAAAGCGTGGCGGGAGCCGGATAGCACAGGCAAGCGGCGGGGGCGGCCTCGCTTCAAAAAAGCGGGGG
AGTTGAGATCCTTCACATTCCCCCGCATCAATTGCCCCAAGGCGGGAGCGCATCTGGAAGGGGAGACTCTGCGGCTGAGCAAGATTGGCTCGATGCCTGT
GGTGCTGCACCGCCCCTTACCAGAGGGGTTTGTGCCCAAAACCTGCACAGTGGTGCGCAAGGCCGATGGGTGGTATGTCTGCATTACTTTGGAGGACAAA
AGCGTCCCTTCCCCAGAGCCTGTGCCGATCAAAAAGGCGGTGGGCATTGATGTGGGATTGGATAGGTTTCTCACCACCAGCGATGGGGAGGTGGTGCCTA
TCCCGCGGCACTACCGCCGAGCTCAAAAGCACTTGGCCCGACAGCAGCGGCAACTGAGCCGCAAAGTGAAGGGGTCCGCCAACTGGAAGAGACAAGCCAC
GAAAGTTGCTTGTTTGCAGTTGCACGTTGCCCGACAACGCAAAGCGTTCCACTACCAAGTGGCGCACTGGCTGGTGGAGCAATACGACCTGTTGGTGGTG
GAGGATCTCAACGTCCGAGGGCTGGCACGGACTCGGTTGGCTAAATCGATTTTGGATGCGGCTTGGGGACGATTTCTTGACATTCTGACAGCAGTGGCGG
TCAAACGCGGCAAACAGGTGTTGAGAGTGGATCCCCGTGGTACGTCCCAAAATTGTTGTGTTTGTGAGGAGCGTGTTCCCAAGACCTTGTCGGAACGGGT
GCATGATTGCCCCCGTTGCGGGTCGTGGGACAGAGACTTGAACGCTGCTATCGAGATTTTGAAGCGAGGACTCAGGGCGGTGGGACTGCCGCTCTCTGGC
TGTGGAGGATCCTGGTTTACCAGTCCGTTGAAGCAGCAACTCCGGGAAGTGATTCTCGGAAGCTCCCGTCTACAGCCCGTCAGGGCCTAGCGGGAGAGGA
TGTCAC
ATGCCTGCTGCTCAACATACTTCTTCAACTCCTCAACGGTGACCCCACCACAAGAGGCAACAAAATAGGCCCCTGTCCAAAATACAGGCTTGCTGTAGAA
CCGTGCCACCTCTGTGGCGAACTCTTTGCGAATCAACCGGCTGGAGACTGTTTTCAGGTTGTTCACCAGCTTCGAGACCTGAACATCCGGCGGAAAACTC
ACCAACAGATGCACATGGTCCGCCTCACCGTCGAACTCCACCAAGGAACAGCGCCACTTTTGGCAGGTCGCTCGAAATATATCTTCCAGCCTCTGCAACA
TTGGAGCAGTTATCACCCGACGACGGTACTTTGTCACCAGCACCAAGTGGATTTGTAGGCTGTAAACAGAACGATGGCCTATGTTGTAGCTCATTGCGCT
AAATGCTAGAACCGTCTCATGATTATCACCCACGAGTACCGGATCCTGCCCAGCGACGACCAAGCCGCTCTGATGACCGAGTGGCTGGAATTGTTGCGGC
GGCAGTGGAACGACGCTCTGGGGCAGAGACTGGACTGGCTGACCGCAACCCGTTGCCCAATTGACCGCTGCAGTCTTGTCTCTTGTCCGTTGCCTGTGTC
AGAACCCCCGCTGGAGCCGAATTATTATCGGCAGGCGGGATCCCTCAAACAAATCAAGCAACTGTTCCCGGCCTACCGGGGCATTTACGCCGAGGTGCTG
CAGCAAAACTTGATGCGGCTGGACAAGGCGTGGAAAGCGTGGCGGGAGCCGGATAGCACAGGCAAGCGGCGGGGGCGGCCTCGCTTCAAAAAAGCGGGGG
AGTTGAGATCCTTCACATTCCCCCGCATCAATTGCCCCAAGGCGGGAGCGCATCTGGAAGGGGAGACTCTGCGGCTGAGCAAGATTGGCTCGATGCCTGT
GGTGCTGCACCGCCCCTTACCAGAGGGGTTTGTGCCCAAAACCTGCACAGTGGTGCGCAAGGCCGATGGGTGGTATGTCTGCATTACTTTGGAGGACAAA
AGCGTCCCTTCCCCAGAGCCTGTGCCGATCAAAAAGGCGGTGGGCATTGATGTGGGATTGGATAGGTTTCTCACCACCAGCGATGGGGAGGTGGTGCCTA
TCCCGCGGCACTACCGCCGAGCTCAAAAGCACTTGGCCCGACAGCAGCGGCAACTGAGCCGCAAAGTGAAGGGGTCCGCCAACTGGAAGAGACAAGCCAC
GAAAGTTGCTTGTTTGCAGTTGCACGTTGCCCGACAACGCAAAGCGTTCCACTACCAAGTGGCGCACTGGCTGGTGGAGCAATACGACCTGTTGGTGGTG
GAGGATCTCAACGTCCGAGGGCTGGCACGGACTCGGTTGGCTAAATCGATTTTGGATGCGGCTTGGGGACGATTTCTTGACATTCTGACAGCAGTGGCGG
TCAAACGCGGCAAACAGGTGTTGAGAGTGGATCCCCGTGGTACGTCCCAAAATTGTTGTGTTTGTGAGGAGCGTGTTCCCAAGACCTTGTCGGAACGGGT
GCATGATTGCCCCCGTTGCGGGTCGTGGGACAGAGACTTGAACGCTGCTATCGAGATTTTGAAGCGAGGACTCAGGGCGGTGGGACTGCCGCTCTCTGGC
TGTGGAGGATCCTGGTTTACCAGTCCGTTGAAGCAGCAACTCCGGGAAGTGATTCTCGGAAGCTCCCGTCTACAGCCCGTCAGGGCCTAGCGGGAGAGGA
TGTCAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
474 bp | 157 aa | 563 | 90 | - | No |
Chemistry : Y1
ORF sequence :
MVVAGQDPVLVGDNHETVLAFSAMSYNIGHRSVYSLQIHLVLVTKYRRRVITAPMLQRLEDIFRATCQKWRCSLVEFDGEADHVHLLVSFPPDVQVSKLV
NNLKTVSSRLIRKEFATEVARFYSKPVFWTGAYFVASCGGVTVEELKKYVEQQASPR
NNLKTVSSRLIRKEFATEVARFYSKPVFWTGAYFVASCGGVTVEELKKYVEQQASPR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1272 bp | 423 aa | 519 | 1790 | + | No |
AG : TnpB
ORF sequence :
MIITHEYRILPSDDQAALMTEWLELLRRQWNDALGQRLDWLTATRCPIDRCSLVSCPLPVSEPPLEPNYYRQAGSLKQIKQLFPAYRGIYAEVLQQNLMR
LDKAWKAWREPDSTGKRRGRPRFKKAGELRSFTFPRINCPKAGAHLEGETLRLSKIGSMPVVLHRPLPEGFVPKTCTVVRKADGWYVCITLEDKSVPSPE
PVPIKKAVGIDVGLDRFLTTSDGEVVPIPRHYRRAQKHLARQQRQLSRKVKGSANWKRQATKVACLQLHVARQRKAFHYQVAHWLVEQYDLLVVEDLNVR
GLARTRLAKSILDAAWGRFLDILTAVAVKRGKQVLRVDPRGTSQNCCVCEERVPKTLSERVHDCPRCGSWDRDLNAAIEILKRGLRAVGLPLSGCGGSWF
TSPLKQQLREVILGSSRLQPVRA
LDKAWKAWREPDSTGKRRGRPRFKKAGELRSFTFPRINCPKAGAHLEGETLRLSKIGSMPVVLHRPLPEGFVPKTCTVVRKADGWYVCITLEDKSVPSPE
PVPIKKAVGIDVGLDRFLTTSDGEVVPIPRHYRRAQKHLARQQRQLSRKVKGSANWKRQATKVACLQLHVARQRKAFHYQVAHWLVEQYDLLVVEDLNVR
GLARTRLAKSILDAAWGRFLDILTAVAVKRGKQVLRVDPRGTSQNCCVCEERVPKTLSERVHDCPRCGSWDRDLNAAIEILKRGLRAVGLPLSGCGGSWF
TSPLKQQLREVILGSSRLQPVRA
Blast result :
Comments
ISSoc3 is 74% (ORFA, the transposase) aa similar to IS1253A and 53% (ORFB) to ISMma22.
References
1] ISfinder annotation (2009)
2] Bhaya,D., Grossman,A.R., Steunou,A.S., Khuri,N., Cohan,F.M., Hamamura,N., Melendrez,M.C., Bateson,M.M., Ward,D.M. and Heidelberg,J.F. (2007) ISME J 1 (8), 703-713
3] Allewalt,J.P., Bateson,M.M., Revsbech,N.P., Slack,K. and Ward,D.M. (2006) Appl. Environ. Microbiol. 72 (1), 544-550
2] Bhaya,D., Grossman,A.R., Steunou,A.S., Khuri,N., Cohan,F.M., Hamamura,N., Melendrez,M.C., Bateson,M.M., Ward,D.M. and Heidelberg,J.F. (2007) ISME J 1 (8), 703-713
3] Allewalt,J.P., Bateson,M.M., Revsbech,N.P., Slack,K. and Ward,D.M. (2006) Appl. Environ. Microbiol. 72 (1), 544-550