ISSoc1
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP000239 | ND | Synechococcus sp. | Synechococcus sp. JA-3-3Ab Synechococcus sp. JA-2-3B'a(2-13) |
DNA section
IS Length : 1273 bp
Ends
Left end : CTAGAAATCGCAGGAGCCCCGCACTGTACCCGTAGGGTCAGTGTCGGGAGGAATGCGGAAGTTATGATAGAACCATGACCCAAGTCCTGACTGTAGCTTG II struct. : Yes
Right end : GAGGTTCGGGCCTGTTTTGTTCTCTGGTGGAGCAGAGCAGGCTCAGGGCTACTGAAAGCCCGCTCCGTACCGCTTTAGCGGTCGGAGTCGGGTAGTTTAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
TCCGCT | TCAG | GACTTGAGCA | TTAC |
TGTTCG | TCAG | AGCAGAAAGT | TTAC |
AGTAGA | TCAG | TGCTCTCTAT | TTAC |
GCGTCT | TCAG | CCATATAAGT | TTAC |
AGCCCG | TCAG | TAGAGGCGCT | TTAC |
GATAAC | TCAG | ATGGTTCCCG | TTAC |
GAAAGT | TCAG | AAAGTGCCAA | TTAC |
CCTTAT | TCAG | TTGCCCATCT | TTAC |
CTGTTT | TCAG | TTCAAGCTGG | TTAC |
CCGCCG | TCAG | TCGGCTCTTT | TTAC |
AGTTTG | TCAG | GCCCAAGTGT | TTAC |
GCAGAG | TCAG | GTCGGGGCGA | TTAC |
TCTGCT | TCAG | TCTTTGCTAC | TTAC |
GAATTT | TCAG | AACCATTCTG | TTAC |
CTTCTG | TCAG | GTAGATCATC | TTAC |
CTAGCT | TCAG | GCACAAATGC | TTAC |
ACTTCT | TCAG | GTTTTCACTT | TTAC |
GCGCTT | TCAG | ATTCCCCTCT | TTAC |
ATACTC | TCAG | CACTCACCTC | TTAC |
CATCTG | TCAG | TGGATCCAGA | TTAC |
AAAATT | TCAG | AGCCTCTATA | TTAC |
GTCCCT | TCAG | AAGTGACTTT | TTAC |
GGTGTG | TCAG | GCTTTGCCGC | TTAC |
GAGGGA | TCAG | CCAAATTCCT | TTAC |
GCATCT | TCAG | ACGCCCGTCC | TTAC |
GCGATA | TCAG | GGGTATAGCA | TTAC |
CGGGGA | TCAG | CTTTCGACCT | TTAC |
ATAGAT | TCAG | AGTAACTGCA | TTAC |
ACCCTC | TCAG | TCAGGTTAAT | TTAC |
CGGTAT | TCAG | TATCTCTCCA | TTAC |
GTACTC | TCAG | TTGCCCCGAT | TTAC |
TTGAGT | TCAG | TTTAAGTTTG | TTAC |
CCAGGT | TCAG | CTTTCAACCA | TTAC |
CTCGGT | TCAG | CTCCCACCAT | TTAC |
AGCCAG | TCAG | CTCTACACCC | TTAC |
GGGAGT | TCAG | GGCTTCGTCG | TTAC |
ATAAAG | TCAG | TTGATGCAAT | TTAC |
ACTGCC | TCAG | TTTGTTGTTG | TTAC |
DNA sequence
CTAGAAATCGCAGGAGCCCCGCACTGTACCCGTAGGGTCAGTGTCGGGAGGAATGCGGAAGTTATGATAGAACCATGACCCAAGTCCTGACTGTAGCTTG
CAAACTCAAGGTGTCCCAGTCGCAAGCCGCCAAATTGGACGCGACAATGGATGCCTTTGTGCAGGCGTTGAACTGGGTCAACCAAAACACACCAGAAAAA
GTAGTCAACGCAGTCAAACTCCAATCCCTTTGCTATTACCAGATTCGCGCTCGGTTTGGCTTGTCCAGTAATCTGGCCCAGCAGGTCTGCAGACGGGTGG
CAGGCGCTCGCAAAGTGGCGAGACAGCGCAACCGTCCGGTTAAAGAGTTCAAGCGTAGGTTCGTTACCTACGATGCACGTATCTTCTCGTTTCGCGAGAA
AGACTGGACAGTGTCGCTTGCCACGGTTGATGGAAGAGAACGCTTTGAGCTAGCCATTGGCAACTACCAGAGGGGAATGCTGGCTGGCTCTAACCCCAAA
TCGGCCACCCTAGTCCAGCGGAAGGACGGCTCCTACTCCATCCAGATTTGTGTCGAGGCAGAGCCCCCCAAGCAACAGAACACCGACAAGGTGATTGGTG
TTGACCTGGGACGGCGAGACATTGCACATACCTCCGAAGGAGACAACTGGAATGGACAGCAGCTGAACAAAGTCCGAGATCACTACTGTCGGTTAAGAGC
GTCTCTTCAGCGCAAGGCCAGTAAGGGCACCCGCAGTTGCAGGCGGCGATGCCGTCAACTGCTGCAACGGCTGTCTGGCAAAGAGAGACGCTTTCAAGCT
TGGGTGAACCACAGTATCGCCAAGCGCATCGTTAAAACAGCAAAATCTCTTTCTGCCTGCATTGGGATTGAGGACTTGAGCGGCATCCGGGGGCGTACCA
ACCGGCAGCCCCGCAGTCGTGAGGAGCGGCGGCGTAGCAATAGCTGGGCGTTCTACCAGCTGCGGCGGTTTATCGAATACAAGGCTGTGAGGGCGGGTGT
CAAAGTGGTAGTTGTCCCTGCGGCCTATACCTCGCAGATCTGCCACAAGTGCCCGCATATTCATCCCGACTCCGCTCAATCCTATCGCAAGGGTAAGCAG
TTCAAATGCGGGCACTGCGGTTGGGAAGGGGATGCGGATTTTAACGGTGCGAAGGTGATTGCGCTTTTGGGGGCTGCCGTAAACCAGCCTAGAGGTTCGG
GCCTGTTTTGTTCTCTGGTGGAGCAGAGCAGGCTCAGGGCTACTGAAAGCCCGCTCCGTACCGCTTTAGCGGTCGGAGTCGGGTAGTTTAC
CAAACTCAAGGTGTCCCAGTCGCAAGCCGCCAAATTGGACGCGACAATGGATGCCTTTGTGCAGGCGTTGAACTGGGTCAACCAAAACACACCAGAAAAA
GTAGTCAACGCAGTCAAACTCCAATCCCTTTGCTATTACCAGATTCGCGCTCGGTTTGGCTTGTCCAGTAATCTGGCCCAGCAGGTCTGCAGACGGGTGG
CAGGCGCTCGCAAAGTGGCGAGACAGCGCAACCGTCCGGTTAAAGAGTTCAAGCGTAGGTTCGTTACCTACGATGCACGTATCTTCTCGTTTCGCGAGAA
AGACTGGACAGTGTCGCTTGCCACGGTTGATGGAAGAGAACGCTTTGAGCTAGCCATTGGCAACTACCAGAGGGGAATGCTGGCTGGCTCTAACCCCAAA
TCGGCCACCCTAGTCCAGCGGAAGGACGGCTCCTACTCCATCCAGATTTGTGTCGAGGCAGAGCCCCCCAAGCAACAGAACACCGACAAGGTGATTGGTG
TTGACCTGGGACGGCGAGACATTGCACATACCTCCGAAGGAGACAACTGGAATGGACAGCAGCTGAACAAAGTCCGAGATCACTACTGTCGGTTAAGAGC
GTCTCTTCAGCGCAAGGCCAGTAAGGGCACCCGCAGTTGCAGGCGGCGATGCCGTCAACTGCTGCAACGGCTGTCTGGCAAAGAGAGACGCTTTCAAGCT
TGGGTGAACCACAGTATCGCCAAGCGCATCGTTAAAACAGCAAAATCTCTTTCTGCCTGCATTGGGATTGAGGACTTGAGCGGCATCCGGGGGCGTACCA
ACCGGCAGCCCCGCAGTCGTGAGGAGCGGCGGCGTAGCAATAGCTGGGCGTTCTACCAGCTGCGGCGGTTTATCGAATACAAGGCTGTGAGGGCGGGTGT
CAAAGTGGTAGTTGTCCCTGCGGCCTATACCTCGCAGATCTGCCACAAGTGCCCGCATATTCATCCCGACTCCGCTCAATCCTATCGCAAGGGTAAGCAG
TTCAAATGCGGGCACTGCGGTTGGGAAGGGGATGCGGATTTTAACGGTGCGAAGGTGATTGCGCTTTTGGGGGCTGCCGTAAACCAGCCTAGAGGTTCGG
GCCTGTTTTGTTCTCTGGTGGAGCAGAGCAGGCTCAGGGCTACTGAAAGCCCGCTCCGTACCGCTTTAGCGGTCGGAGTCGGGTAGTTTAC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1212 bp | 403 aa | 75 | 1286 | + | No |
AG : TnpB
ORF sequence :
MTQVLTVACKLKVSQSQAAKLDATMDAFVQALNWVNQNTPEKVVNAVKLQSLCYYQIRARFGLSSNLAQQVCRRVAGARKVARQRNRPVKEFKRRFVTYD
ARIFSFREKDWTVSLATVDGRERFELAIGNYQRGMLAGSNPKSATLVQRKDGSYSIQICVEAEPPKQQNTDKVIGVDLGRRDIAHTSEGDNWNGQQLNKV
RDHYCRLRASLQRKASKGTRSCRRRCRQLLQRLSGKERRFQAWVNHSIAKRIVKTAKSLSACIGIEDLSGIRGRTNRQPRSREERRRSNSWAFYQLRRFI
EYKAVRAGVKVVVVPAAYTSQICHKCPHIHPDSAQSYRKGKQFKCGHCGWEGDADFNGAKVIALLGAAVNQPRGSGLFCSLVEQSRLRATESPLRTALAV
GVG
ARIFSFREKDWTVSLATVDGRERFELAIGNYQRGMLAGSNPKSATLVQRKDGSYSIQICVEAEPPKQQNTDKVIGVDLGRRDIAHTSEGDNWNGQQLNKV
RDHYCRLRASLQRKASKGTRSCRRRCRQLLQRLSGKERRFQAWVNHSIAKRIVKTAKSLSACIGIEDLSGIRGRTNRQPRSREERRRSNSWAFYQLRRFI
EYKAVRAGVKVVVVPAAYTSQICHKCPHIHPDSAQSYRKGKQFKCGHCGWEGDADFNGAKVIALLGAAVNQPRGSGLFCSLVEQSRLRATESPLRTALAV
GVG
Blast result :
Comments
ISSoc1 is 51% (TnpB) aa similar to ISHhu5.
References
1] Bhaya, D., Grossman, A.R., Steunou, A.-S., Khuri, N., Cohan, F.M., Hamamura, N., Melendrez, M.C., Bateson, M.M., Ward, D.M., Heidelber, J.F. (2007) ISME J 1, 703-713.