ISBce5
- Family IS4
- Group IS231
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006274 | ND | Bacillus cereus | Bacillus cereus E33L |
DNA section
IS Length : 1653 bp
Ends
IR Length : 16/17
IRL : CATCGCTATCAAGCTAACAAAACAAAATACCCCCTAAAATAAAAAAATAC
IRR : CATTGCTATCAAGCTAAGCTCATGTAACGTCAACGATAAGCGAGATTGTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATCTAACCAG | GGGTAGCACCA | CATGCCCATC | 11 |
GATAGCGATG | TGGGTTTACCC | CTATTAAGTA | 11 |
GGTAGTATCA | TGGTGCTACC | 0 | |
TTTAAGAGAG | GGGTACAGCCA | CATGCCCATC | 11 |
GATGGATATG | TGGTGTGACCC | CAATATATAA | 11 |
DNA sequence
CATCGCTATCAAGCTAACAAAACAAAATACCCCCTAAAATAAAAAAATACGCTATTCTTTCTGTAGAATCATAGGAAAGGATGGCGTTTTTATATGTCTA
TTTCTGTATCTGATGAATTACAATTATTTGCTCAAGAAATTCAAAGCTTTTTATCCCCAAATACCTTACGGAATCTTGCTAGAGATGTTGGTTTTGTTCA
ACGAACCAGTAAGTACCAAGCAAAAGATCTAGTCGCCTTATGTGTATGGATGAACCAAAATGTCGCTACGACGTCTTTAACTCAGTTATCTAGCTGTTTA
GAAGCATCAACGGAAGTACTCATCAGTCCTGAAGGACTGAATCAACGATTTAACCAAGCAGCCGTCCAATTTTTACAACACATACTAGCTGAACTACTAA
ACCAAAAATTGGTCTCATCTATGCCCATTTCTTCTCCATATACTTCTATTTTCAAGCGTATCCGTATTTTAGATTCAACTGCATTTCAACTCCCAGATCC
CTTTTCATTCGTTTATCCAGGTGCAGGTGGATGCAGCCATACAGCGGGGGTGAAGATTCAACTTGAGTATGATTTATTAAGTGGACAATTCCTTCATATT
CATACAGGTCCAGGTAAACAACATGATCGAACCTACGGTTCCCTGTGTGTCCCAACTGTAACAGCGAATGATTTATGTATCCGAGATTTGGGTTATTTTC
ATTTAAAAGATCTTCAACATATACAAGATAAAAAGGCTTATTATATCTCACGTATTAAATCGAATACACGTATTTATCAAAGAAATCCTAACCCTGATTA
TTTTCAAGATGGCAGAATCAAGAAATGTACAGAGTATATCCAGATAGATATGGAGGTTTTAATGAACTCTCTTCAACCAGGACAAACATGTGAAATATCC
AATGCTTATGTAGGAATGACTGATAAAGTCCCAACTCGTGTGATTGTTCATCGACTAACTAAAGAACAACAACAAAAACGATTACAGGATCAGGCTGTAC
GAGAAAAAAAGAAAGGAATGAAATATTCTCCTCGTAGTAAACGACTTAGTGGTATCAATGTTTATATGACCAACACTTCTGCAGATATTGTTCCGATGGA
ACAAGTACATGATTGGTATTCTTTACGTTGGCAAATCGAAATTTTATTTAAAACGTGGAAATCATTCTTTCACATTCATCATTGTAAAAAAATAAAACGA
GAACGATTGGAATGCCATTTGTATGGGCAACTGATTGCCATTCTACTCTGTTCTTCTACCATGTTTCAAATGCGGCAATTACTTCTTATGAAAAAGAAAC
GAGAACTGAGTGAATATAAGGCCATATATATGATTAAAGATTACTTTCTTCTTCTTTTCCAAGCTATACAGAAAGACACCCAAGGGCTATCAAAGATTCT
AATTCGCCTGTTCAACCTCCTACAGCAAAACGGGAGAAAATCTCATCGATATGAGAAGAAAACAGTCTTTGATATATTAGGTGTCGTTTACAATTGTACC
ATGTCTGATAATCAAGCGGCTTAATTCAAAAAATGAAACCCGTTAGGGTTTATTTCGTATGCAAATCTTTAAAGCACCTACATATAGAATTTCGAAAAAA
AGAAACAATCTCGCTTATCGTTGACGTTACATGAGCTTAGCTTGATAGCAATG
TTTCTGTATCTGATGAATTACAATTATTTGCTCAAGAAATTCAAAGCTTTTTATCCCCAAATACCTTACGGAATCTTGCTAGAGATGTTGGTTTTGTTCA
ACGAACCAGTAAGTACCAAGCAAAAGATCTAGTCGCCTTATGTGTATGGATGAACCAAAATGTCGCTACGACGTCTTTAACTCAGTTATCTAGCTGTTTA
GAAGCATCAACGGAAGTACTCATCAGTCCTGAAGGACTGAATCAACGATTTAACCAAGCAGCCGTCCAATTTTTACAACACATACTAGCTGAACTACTAA
ACCAAAAATTGGTCTCATCTATGCCCATTTCTTCTCCATATACTTCTATTTTCAAGCGTATCCGTATTTTAGATTCAACTGCATTTCAACTCCCAGATCC
CTTTTCATTCGTTTATCCAGGTGCAGGTGGATGCAGCCATACAGCGGGGGTGAAGATTCAACTTGAGTATGATTTATTAAGTGGACAATTCCTTCATATT
CATACAGGTCCAGGTAAACAACATGATCGAACCTACGGTTCCCTGTGTGTCCCAACTGTAACAGCGAATGATTTATGTATCCGAGATTTGGGTTATTTTC
ATTTAAAAGATCTTCAACATATACAAGATAAAAAGGCTTATTATATCTCACGTATTAAATCGAATACACGTATTTATCAAAGAAATCCTAACCCTGATTA
TTTTCAAGATGGCAGAATCAAGAAATGTACAGAGTATATCCAGATAGATATGGAGGTTTTAATGAACTCTCTTCAACCAGGACAAACATGTGAAATATCC
AATGCTTATGTAGGAATGACTGATAAAGTCCCAACTCGTGTGATTGTTCATCGACTAACTAAAGAACAACAACAAAAACGATTACAGGATCAGGCTGTAC
GAGAAAAAAAGAAAGGAATGAAATATTCTCCTCGTAGTAAACGACTTAGTGGTATCAATGTTTATATGACCAACACTTCTGCAGATATTGTTCCGATGGA
ACAAGTACATGATTGGTATTCTTTACGTTGGCAAATCGAAATTTTATTTAAAACGTGGAAATCATTCTTTCACATTCATCATTGTAAAAAAATAAAACGA
GAACGATTGGAATGCCATTTGTATGGGCAACTGATTGCCATTCTACTCTGTTCTTCTACCATGTTTCAAATGCGGCAATTACTTCTTATGAAAAAGAAAC
GAGAACTGAGTGAATATAAGGCCATATATATGATTAAAGATTACTTTCTTCTTCTTTTCCAAGCTATACAGAAAGACACCCAAGGGCTATCAAAGATTCT
AATTCGCCTGTTCAACCTCCTACAGCAAAACGGGAGAAAATCTCATCGATATGAGAAGAAAACAGTCTTTGATATATTAGGTGTCGTTTACAATTGTACC
ATGTCTGATAATCAAGCGGCTTAATTCAAAAAATGAAACCCGTTAGGGTTTATTTCGTATGCAAATCTTTAAAGCACCTACATATAGAATTTCGAAAAAA
AGAAACAATCTCGCTTATCGTTGACGTTACATGAGCTTAGCTTGATAGCAATG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1428 bp | 476 aa | 94 | 1521 | + | No |
Chemistry : DDE
ORF sequence :
MSISVSDELQLFAQEIQSFLSPNTLRNLARDVGFVQRTSKYQAKDLVALCVWMNQNVATTSLTQLSSCLEASTEVLISPEGLNQRFNQAAVQFLQHILAE
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHTAGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQIDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEILFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKSHRYEKKTVFDILGVVYNCTMSDNQAA
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHTAGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQIDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEILFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKSHRYEKKTVFDILGVVYNCTMSDNQAA
Blast result :
Comments
ISBce5 is 95% aa similar to IS231Y.
ISBce5 was found by screening completely sequenced genomes for sequences homologous to the IS4 transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-70-D(N3)-150-E(C1). The copy number in the Bacillus cereus E33L genome is 5 (4 on pE33L466 and 1 on the chromosome).
ISBce5 was found by screening completely sequenced genomes for sequences homologous to the IS4 transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-70-D(N3)-150-E(C1). The copy number in the Bacillus cereus E33L genome is 5 (4 on pE33L466 and 1 on the chromosome).
References
1] Brettin,T.S., Bruce,D., Challacombe,J.F., Gilna,P., Han,C., Hill,K., Hitchcock,P., Jackson,P., Keim,P., Longmire,J., Lucas,S., Okinaka,R., Richardson,P., Rubin,E. and Tice,H.(2004)Unpublished
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18