ISSoc10
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP000240 | ND | Synechococcus sp. | Synechococcus sp. JA-2-3B'a(2-13) |
DNA section
IS Length : 1829 bp
Ends
Left end : CTAAACCGCCGAGAAGCCCCGCCCTCTGTCCGTAGGACGTATGGCCTACGGCCCGCTGAAGCGAACGTGCTAGCGGCGCGGAGCGCTTGGGTCGGGATGA II struct. : Yes
Right end : CTCAGGAAACGTCCTTCACAGGAGTCACTGAGAAGCCCGCACTCTTTCGCTTTAGCGGCGCGGAGCGCTTTGCGTCAGCATGAGTGCGGGAGTATGTCAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
AAAGATCTCC | TTAG | ACCGCAAAAT | tcac |
DNA sequence
CTAAACCGCCGAGAAGCCCCGCCCTCTGTCCGTAGGACGTATGGCCTACGGCCCGCTGAAGCGAACGTGCTAGCGGCGCGGAGCGCTTGGGTCGGGATGA
GAGGCGGGAGTACCTTGGCTTTCAATGTACTGGCTCAATGCCGAGACTGTTACCCCACCACATGAAGCGATGAAATAAGATTCATTCCACAGCACATTTT
TCCGGTATATCTTGCTGATGTAATCCGCAAACTCGCTTCTCATCCTGCGACTAGAGACGGATTTGATGTTGTTTACCAATTTGGACAGATCCACCTGCGG
GTAGTACTGGAACAGCAGATGAACATGGTCTGCCTCGCCGTTGAACTCAATAAGTTTGCAACCCCACTTTTGGCAAAGGTCATACATAACTTCATGTAAT
CGCTGCAGCATTTCGCTGGTAAATACCTTGCGACGGTACTTGGTGGTTAAGACCAAGTGCGCCTTCAAATCAGATACAGCTCTGCCGCTGGAAACAAAAT
CATGCTTCATGGTCGCTAATCCCTGCTGGCGCTATAATCAGTGTAGCGGTTTCTGAGTACGCTTAAGATGAGGACTGCCTATCAGTACCGCCTGCGGCCT
ACGCCCAGTCAAGTTGTCCTGATGGAGCAATGGCTGGAGCTGTTGCGCAAGCAGTACAACTATCGGTTGGCAGAGCGGCTTAACTGGTGGGAACAAAACC
GCTGTAATATAAACGCCTGCCCCTTAATCTGTCATTTGCCTGAGCTGAAAGACAGGCCGGATGTCTACTCCCAAAAACGAGACTTGGTGAATACCAAAGT
AAGGTTCCCCGAATACCAAGCAATCCATTCGCAAGTCCTGCAAAACTGCATAGAGCGGGTGCACAAAGCATTCGACCGCTGGCTGAAAGGCGACCGTAAC
GGCAAGAGGGCAGGAAGGCCGAGATTCAAAGGGGCGGGCCGGTATCGTTCCTTCACCTTTCCCCAGATGAAGCAGGACTGCATTCAAGGGAAGTTTATTC
ATCTGCCCAAAATCGGAGCAATTAAGCTGATTCAGCACCGTCCGTTGCCTGACGGGTTTGTTATCAAGACTGCCACCATCACCCGTAAGGCTGATGGATG
GTATGTGACATTAAGCCTGCAAGATGCGTCCGTCCCCGAGTTTGCTCCTGAGCCTGCAACACTGGAGAACACGATAGGGATCGACATGGGGCTGGACTCC
TTTTTGGTGACTGATACAGGGGAGTCCGTCCCAGTGCCCCAGTATTACCGCCGCGCCCAAAAGCGCCTGGAGAGGCTGCAGCGGTCATTATCCCGTAAGC
AGAAAGGGTCAAACCGCCGCAAGAAAGCCTTAGGGCGAGTGGCTAAGGCACACCTCAAGGTGGCCAATCAGCGTAAAGATTTCCACTACAAAGTAGCCAA
GAAGCTAGTAAGCAAAGGGAAGCATATTGCGTATGAGGCGCTCAACACCAAAGGTATTGCGAGAACCCGACTAGCGAAATCCATCTATGACGCTGGGTGG
GGGCAATTCCTGCAAATTCTCGCAGTCAAGGCTGAAAGAGCTGGGCTGAGGGCGATTGCAGTGGATCCAAGAGGCACGAGCCAAGACTGTTCTCAGTGCG
GTCAAAAAGTACCGAAAACCATTCGAGACAGATGGCATTCTTGCCCGCATTGTGGACTGGAACTAGGCCGAGACCACAACGCGGCGATAAACATCAAATA
CAGAGCGGTGGGGCATCCCGTTCTCAAAGCTCAGGAAACGTCCTTCACAGGAGTCACTGAGAAGCCCGCACTCTTTCGCTTTAGCGGCGCGGAGCGCTTT
GCGTCAGCATGAGTGCGGGAGTATGTCAC
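The DNA section can be cross-checked against the fields listed earlier in the record. In this entry the left and right ends appear to be the first and last 100 bp of the DNA sequence, and the RE cleavage site (tcac) appears to match its last four bases. The following is a minimal Python sketch of such a check, not part of the record itself. The function name `check_is_record` and the toy 22 bp sequence are hypothetical and serve only as illustration.

```python
def check_is_record(seq, left_end, right_end, declared_length, re_cleavage=None):
    """Return a dict of simple consistency checks for one IS record.

    seq may be wrapped over several lines, as in the DNA sequence field.
    """
    # Join wrapped lines and normalise case before comparing.
    seq = "".join(seq.split()).upper()
    checks = {
        "length_matches": len(seq) == declared_length,
        "left_end_is_prefix": seq.startswith(left_end.upper()),
        "right_end_is_suffix": seq.endswith(right_end.upper()),
    }
    if re_cleavage is not None:
        # Assumption: the RE cleavage site is given as the last bases of the IS.
        checks["re_cleavage_is_suffix"] = seq.endswith(re_cleavage.upper())
    return checks


# Toy input (made-up 22 bp sequence wrapped over two lines, not the real IS).
toy = """CTAAACCGCC
GGAGTATGTCAC"""

print(check_is_record(toy, "CTAAAC", "ATGTCAC", 22, re_cleavage="tcac"))
```

To apply it to this entry, pass the full DNA sequence, the two 100 bp ends, the declared length of 1829 and the RE cleavage site.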
Protein section
ORF number : 2
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
441 bp | 146 aa | 510 | 70 | - | No |
Chemistry : Y1
ORF sequence :
MKHDFVSSGRAVSDLKAHLVLTTKYRRKVFTSEMLQRLHEVMYDLCQKWGCKLIEFNGEADHVHLLFQYYPQVDLSKLVNNIKSVSSRRMRSEFADYISK
IYRKNVLWNESYFIASCGGVTVSALSQYIESQGTPASHPDPSAPRR
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1304 bp | 414 aa | 509 | 1812 | + | No |
AG : TnpB
ORF sequence :
MRTAYQYRLRPTPSQVVLMEQWLELLRKQYNYRLAERLNWWEQNRCNINACPLICHLPELKDRPDVYSQKRDLVNTKVRFPEYQAIHSQVLQNCIERVHK
AFDRWLKGDRNGKRAGRPRFKGAGRYRSFTFPQMKQDCIQGKFIHLPKIGAIKLIQHRPLPDGFVIKTATITRKADGWYVTLSLQDASVPEFAPEPATLE
NTIGIDMGLDSFLVTDTGESVPVPQYYRRAQKRLERLQRSLSRKQKGSNRRKKALGRVAKAHLKVANQRKDFHYKVAKKLVSKGKHIAYEALNTKGIART
RLAKSIYDAGWGQFLQILAVKAERAGLRAIAVDPRGTSQDCSQCGQKVPKTIRDRWHSCPHCGLELGRDHNAAINIKYRAVGHPVLKAQETSFTGVTEKP
ALFRFSGAERFASA
Blast result :
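The ORF tables give 1-based coordinates on the IS sequence, with Begin greater than End for an ORF on the minus strand. The following is a minimal Python sketch, assuming the standard genetic code, of how the listed coordinates relate to the DNA length and the ORF sequence. The helper names (`span_length`, `orf_dna`, `translate`) are hypothetical. The 21 bp fragment in the usage example is taken to be positions 490 to 510 of the DNA sequence above, assuming 100 bases per line.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def span_length(begin, end):
    """Length in bp of a 1-based, inclusive coordinate pair."""
    return abs(end - begin) + 1


def orf_dna(seq, begin, end, strand):
    """Extract the coding strand for 1-based inclusive coordinates."""
    lo, hi = sorted((begin, end))
    sub = seq[lo - 1:hi].upper()
    return reverse_complement(sub) if strand == "-" else sub


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# ORF 1 as listed: 510 -> 70 on the minus strand.
print(span_length(510, 70))            # 441
print(span_length(510, 70) // 3 - 1)   # 146 (one codon is the stop)

# Fragment assumed to be positions 490-510 of the IS sequence.
fragment = "GGAAACAAAATCATGCTTCAT"
print(translate(orf_dna(fragment, 21, 1, "-")))  # MKHDFVS
```

The translated fragment matches the first seven residues of the ORF 1 sequence. The same arithmetic on the listed ORF 2 coordinates (509 to 1812) gives 1304 bp, which is not a multiple of three, so the listed Begin may not be the first base of the start codon of the 414 aa product.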
Comments
ISSoc10 shares 73% amino acid similarity with ISSoc3 for TnpA (the transposase) and 73% amino acid similarity with ISSoc3 for TnpB.
References
[1] Bhaya, D., Grossman, A.R., Steunou, A.-S., Khuri, N., Cohan, F.M., Hamamura, N., Melendrez, M.C., Bateson, M.M., Ward, D.M., Heidelberg, J.F. (2007) ISME J 1, 703-713.