ISSoc2
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007775 | ND | Synechococcus sp. | Synechococcus sp. JA-3-3Ab Synechococcus sp. JA-2-3B'a(2-13) |
DNA section
IS Length : 1832 bp
Ends
IR Length : 0
IRL : CATGACTCTAGTCAATGATATGCTGGACTATACTGAAAATAATGTACAGT
IRR : GATACATTTACGCTGAACACTGCTGAGCATTACTGCTCAGCGGTGAAGCG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATGAGTTGAG | AG | CTTGCTACAA | 2 |
TGCTGAGGTG | AG | GCACGGATAT | 2 |
TGGCCTAGGG | AG | AATGGGGGTC | 2 |
CCTGGGTGGG | AG | TGTCAATAGG | 2 |
TCCCCAGAAG | AG | CACGAGCTGC | 2 |
AGGGCGAGGG | AG | GTTGTTGACA | 2 |
GCTCATCGAG | AG | TTTGCAGCTA | 2 |
CTTGTGTAAG | AG | CAACGTGGCT | 2 |
CTAGCCCAGG | AG | CTCGGTTGCT | 2 |
GCCGTAGGAG | AG | CTAAAAGTTC | 2 |
TCGGGCTGGG | AG | AAAGCCATTT | 2 |
CTGGTTTGAG | AG | GTCGAAAACT | 2 |
CATCTGTGGA | AG | TAGCCGGCTA | 2 |
ATGTCAAGGG | AG | TACCGGGCAG | 2 |
AGCCAGGTGG | AG | GATCCTGTAG | 2 |
GGCTCCAAAG | AG | CTTGCCCGTC | 2 |
CCCCTGGGAG | AG | CTAGGCAAAT | 2 |
CTGCGAGCAG | AG | GTTGGTGGTG | 2 |
CTTAGGTTGG | AG | AAAAAACCCA | 2 |
DNA sequence
CATGACTCTAGTCAATGATATGCTGGACTATACTGAAAATAATGTACAGTATGTTCAGTATGGCCGCTTGGTATGGCTAGATATGTAAAACCGAGAGAAG
CGGCGGATTATTTTGGGGTGTGTCTCCACACTTTGAGGCGATGGGAACAGAAGGGCTGGATCCATGCAGTACGTACACCATCTGGTAGAGCGAGAAGGTA
TGACCTCGACAGCTACATTGGCCTCCGGCCCGCTGACGCGAACAGAGCGCCCAAGAAGGACAAACGAGTCGTTTTGTACGCCCGAGTCAGCAGTCGAGGG
CAGAAACCAGACTTGGAGAGACAGATTGCAAGACTGGTTGACCTCTATCCTGGAGCCGAAGTGGTCGGAGAGGTTGGCGGCGGTCTCGACTTCAAAAGAC
CAAAGTTCCTTGCCTTATTGGAACGAGTTCGTGCAGGAGATATCGGAACGATTGTGGTCGCTCACCGGGATCGACTCTGCCGGTTTGGATTTGAGTTCGT
TGAGTGGTACTGCCGTCAATACGGGTGCGAAATCTTGGTTCTCGATGACGATCACCTTTCTCCCCAACAGGAACTGGTTGAGGATATCCTCACCATCCTG
CACTGCTTCAGCAGTCGGCTCTACGGACTCAGAAAATACCGGGCTGCAATCGAGAAAGATACGGATTTATCCGGAGCCAGCGCTGGCTAAAGTTTGGAAG
CGGTGGCAAGCGGCGTGCCGGTACTGCTACAACCAAGCGATTGCCTATCAGCGCCAGCATGGTGCCCCAAAAACGGCCAGAAAGCTGCGGGACATCATCC
TGCGCTCCGACCTGCCCGGGTGGGTGAAGGACGTCCCTTGCCACATCAAGCAGAACGCGGTCGTCGAGGCGTGGTTGGCGTTCCGCCGAAGCAAAGACGC
GAGGTTTCGCAGTGTGCGGGACAGGTCGCATACGCTGCAATTCAACGCCGGCAACTTTCGCAATGGGACGTGGTATCCGAAACTCACCCGAGGTTTGGCG
TTCCGTGCATCCGAGGAGATGCCCAGAGAATGGGCACGCGGAACTGAGCTAATGCGGGTGAAAGACAGATGGTATGCCATCTTTCCTGAGCCCGTGAACG
AGCAGTGTTCGTTAGCAAAGGGGGTGATCGCACTTGATCCTGGAGTAAGAAGCTTCCTTACAGGGTTTGATGGGGCGGGCTTTGTAGATATCGCCAAGGG
GGACTTTGGCAGGATCGTTCGGCTGTGCTACCACCTAGACGATCTGCAATCTAGGCTGAGTAAAGCACCGAGATCCAAGCGTAGGCGAATGCGGCAAGCG
GCGTTTCGTCTGCGGGAGAGAATCAGGAACTTGGTGGACGAGTGCCATCGCAAGGTAGCGGCGTTCCTAACGGATAACTACCGATTGATATTCCTCCCCA
CTTTCGAGTCAGCCAAGATGGTTGCCAAGGCAGGGAGGAAGTTTGGTAGCAAGACAGCAAGGGCGATGCTCACCTGGGCGCACTACCGGTTCAAGCAGTT
CCTGAAGTTTCAAGCCAAGAAGAAAAACGTGGTTGTCGTGGAAGTATCGGAAGCGTACACCAGCAAAACCTGTACCAAGTGCGGGCACATCCACACTAAG
TTGGGTGGCGCAAAGGTGTTTCGATGCCCAAAGTGCAACCATAGGCTACCACGAGATTGGCAAGGCGCTCTGGGCGTTATGCTCAGGGCTTTGCGGGATA
CCGCCTTTCTGTTTGGACTCCGTCCCGCTAACGCGGTCGCGTCAGCATCGCGTAGTAACAACGGAAACGGACAGAATGCTATCGCTTCACCGCTGAGCAG
TAATGCTCAGCAGTGTTCAGCGTAAATGTATC
CGGCGGATTATTTTGGGGTGTGTCTCCACACTTTGAGGCGATGGGAACAGAAGGGCTGGATCCATGCAGTACGTACACCATCTGGTAGAGCGAGAAGGTA
TGACCTCGACAGCTACATTGGCCTCCGGCCCGCTGACGCGAACAGAGCGCCCAAGAAGGACAAACGAGTCGTTTTGTACGCCCGAGTCAGCAGTCGAGGG
CAGAAACCAGACTTGGAGAGACAGATTGCAAGACTGGTTGACCTCTATCCTGGAGCCGAAGTGGTCGGAGAGGTTGGCGGCGGTCTCGACTTCAAAAGAC
CAAAGTTCCTTGCCTTATTGGAACGAGTTCGTGCAGGAGATATCGGAACGATTGTGGTCGCTCACCGGGATCGACTCTGCCGGTTTGGATTTGAGTTCGT
TGAGTGGTACTGCCGTCAATACGGGTGCGAAATCTTGGTTCTCGATGACGATCACCTTTCTCCCCAACAGGAACTGGTTGAGGATATCCTCACCATCCTG
CACTGCTTCAGCAGTCGGCTCTACGGACTCAGAAAATACCGGGCTGCAATCGAGAAAGATACGGATTTATCCGGAGCCAGCGCTGGCTAAAGTTTGGAAG
CGGTGGCAAGCGGCGTGCCGGTACTGCTACAACCAAGCGATTGCCTATCAGCGCCAGCATGGTGCCCCAAAAACGGCCAGAAAGCTGCGGGACATCATCC
TGCGCTCCGACCTGCCCGGGTGGGTGAAGGACGTCCCTTGCCACATCAAGCAGAACGCGGTCGTCGAGGCGTGGTTGGCGTTCCGCCGAAGCAAAGACGC
GAGGTTTCGCAGTGTGCGGGACAGGTCGCATACGCTGCAATTCAACGCCGGCAACTTTCGCAATGGGACGTGGTATCCGAAACTCACCCGAGGTTTGGCG
TTCCGTGCATCCGAGGAGATGCCCAGAGAATGGGCACGCGGAACTGAGCTAATGCGGGTGAAAGACAGATGGTATGCCATCTTTCCTGAGCCCGTGAACG
AGCAGTGTTCGTTAGCAAAGGGGGTGATCGCACTTGATCCTGGAGTAAGAAGCTTCCTTACAGGGTTTGATGGGGCGGGCTTTGTAGATATCGCCAAGGG
GGACTTTGGCAGGATCGTTCGGCTGTGCTACCACCTAGACGATCTGCAATCTAGGCTGAGTAAAGCACCGAGATCCAAGCGTAGGCGAATGCGGCAAGCG
GCGTTTCGTCTGCGGGAGAGAATCAGGAACTTGGTGGACGAGTGCCATCGCAAGGTAGCGGCGTTCCTAACGGATAACTACCGATTGATATTCCTCCCCA
CTTTCGAGTCAGCCAAGATGGTTGCCAAGGCAGGGAGGAAGTTTGGTAGCAAGACAGCAAGGGCGATGCTCACCTGGGCGCACTACCGGTTCAAGCAGTT
CCTGAAGTTTCAAGCCAAGAAGAAAAACGTGGTTGTCGTGGAAGTATCGGAAGCGTACACCAGCAAAACCTGTACCAAGTGCGGGCACATCCACACTAAG
TTGGGTGGCGCAAAGGTGTTTCGATGCCCAAAGTGCAACCATAGGCTACCACGAGATTGGCAAGGCGCTCTGGGCGTTATGCTCAGGGCTTTGCGGGATA
CCGCCTTTCTGTTTGGACTCCGTCCCGCTAACGCGGTCGCGTCAGCATCGCGTAGTAACAACGGAAACGGACAGAATGCTATCGCTTCACCGCTGAGCAG
TAATGCTCAGCAGTGTTCAGCGTAAATGTATC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
618 bp | 205 aa | 73 | 690 | + | No |
Chemistry : Serine
ORF sequence :
MARYVKPREAADYFGVCLHTLRRWEQKGWIHAVRTPSGRARRYDLDSYIGLRPADANRAPKKDKRVVLYARVSSRGQKPDLERQIARLVDLYPGAEVVGE
VGGGLDFKRPKFLALLERVRAGDIGTIVVAHRDRLCRFGFEFVEWYCRQYGCEILVLDDDHLSPQQELVEDILTILHCFSSRLYGLRKYRAAIEKDTDLS
GASAG
VGGGLDFKRPKFLALLERVRAGDIGTIVVAHRDRLCRFGFEFVEWYCRQYGCEILVLDDDHLSPQQELVEDILTILHCFSSRLYGLRKYRAAIEKDTDLS
GASAG
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1281 bp | 426 aa | 545 | 1825 | + | No |
AG : TnpB
ORF sequence :
MTITFLPNRNWLRISSPSCTASAVGSTDSENTGLQSRKIRIYPEPALAKVWKRWQAACRYCYNQAIAYQRQHGAPKTARKLRDIILRSDLPGWVKDVPCH
IKQNAVVEAWLAFRRSKDARFRSVRDRSHTLQFNAGNFRNGTWYPKLTRGLAFRASEEMPREWARGTELMRVKDRWYAIFPEPVNEQCSLAKGVIALDPG
VRSFLTGFDGAGFVDIAKGDFGRIVRLCYHLDDLQSRLSKAPRSKRRRMRQAAFRLRERIRNLVDECHRKVAAFLTDNYRLIFLPTFESAKMVAKAGRKF
GSKTARAMLTWAHYRFKQFLKFQAKKKNVVVVEVSEAYTSKTCTKCGHIHTKLGGAKVFRCPKCNHRLPRDWQGALGVMLRALRDTAFLFGLRPANAVAS
ASRSNNGNGQNAIASPLSSNAQQCSA
IKQNAVVEAWLAFRRSKDARFRSVRDRSHTLQFNAGNFRNGTWYPKLTRGLAFRASEEMPREWARGTELMRVKDRWYAIFPEPVNEQCSLAKGVIALDPG
VRSFLTGFDGAGFVDIAKGDFGRIVRLCYHLDDLQSRLSKAPRSKRRRMRQAAFRLRERIRNLVDECHRKVAAFLTDNYRLIFLPTFESAKMVAKAGRKF
GSKTARAMLTWAHYRFKQFLKFQAKKKNVVVVEVSEAYTSKTCTKCGHIHTKLGGAKVFRCPKCNHRLPRDWQGALGVMLRALRDTAFLFGLRPANAVAS
ASRSNNGNGQNAIASPLSSNAQQCSA
Blast result :
Comments
ISSoc2 is 62% (ORFA, the transposase) aa similar to ISAfe11 and 44% (ORFB) to ISvNY2A. Some copies have one or two deletions in the ORFB.
References
1] ISfinder annotation (2009)
2] Bhaya,D., Grossman,A.R., Steunou,A.S., Khuri,N., Cohan,F.M., Hamamura,N., Melendrez,M.C., Bateson,M.M., Ward,D.M. and Heidelberg,J.F. (2007) ISME J 1 (8), 703-713
3] Allewalt,J.P., Bateson,M.M., Revsbech,N.P., Slack,K. and Ward,D.M. (2006) Appl. Environ. Microbiol. 72 (1), 544-550
4] Nelson, W.C., Wollerman, L., Bhaya, D., Heidelberg, J.F. (2011)
2] Bhaya,D., Grossman,A.R., Steunou,A.S., Khuri,N., Cohan,F.M., Hamamura,N., Melendrez,M.C., Bateson,M.M., Ward,D.M. and Heidelberg,J.F. (2007) ISME J 1 (8), 703-713
3] Allewalt,J.P., Bateson,M.M., Revsbech,N.P., Slack,K. and Ward,D.M. (2006) Appl. Environ. Microbiol. 72 (1), 544-550
4] Nelson, W.C., Wollerman, L., Bhaya, D., Heidelberg, J.F. (2011)