ISCbt3
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007581 | ND | Clostridium botulinum | Clostridium botulinum type C C-Stockholm Bacteriophage c-st |
DNA section
IS Length : 1986 bp
Ends
IR Length : 0
IRL : TGATAAACTATAAATAAAAACATTGACAAACATAGACAATAGTGGTAATA
IRR : CTGCTACTCAAAATATGGACAAACATACCCATATCGAGATTGTTACTGTT
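The two listed ends can be checked against the DNA sequence of this record. The sketch below assumes that IRL is reported as the left terminus read 5' to 3' and IRR as the reverse complement of the right terminus, which is what the sequences here suggest but is not stated in the record. The helper `reverse_complement` and all variable names are illustrative.

```python
# Terminal sequences copied from the "Ends" block of this record.
IRL = "TGATAAACTATAAATAAAAACATTGACAAACATAGACAATAGTGGTAATA"
IRR = "CTGCTACTCAAAATATGGACAAACATACCCATATCGAGATTGTTACTGTT"

# First and last 50 bp of the DNA sequence given in this record.
LEFT_TERMINUS = "TGATAAACTATAAATAAAAACATTGACAAACATAGACAATAGTGGTAATA"
RIGHT_TERMINUS = "AACAGTAACAATCTCGATATGGGTATGTTTGTCCATATTTTGAGTAGCAG"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


# Assumption being tested: IRL is the left end as written,
# IRR is the right end read on the opposite strand.
irl_matches_left = IRL == LEFT_TERMINUS
irr_matches_right = reverse_complement(IRR) == RIGHT_TERMINUS
print(irl_matches_left, irr_matches_right)
```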
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TATTGAAAATGTGGACAAGGAC | | TGGTAATATGAATATAAAGA | 0 |
CAACGTGACTTTAAAATTGAAAGGAG | | AAATAAAAAGTATGGTTAAAA | 0 |
DNA sequence
TGATAAACTATAAATAAAAACATTGACAAACATAGACAATAGTGGTAATATGTATACATAAGGAGTTGAAATTATGAATACATATAAACCAAAAGATTTT
GCAGAAATGATAGGAGTATCAGTAAAAACACTACAAAGATGGGATAACGAAGGTAAATTAAAGGCATATAGAAACCCATCTAATAGAAGATATTATACTC
ATAATCAATATGTAGAATATATGGGTAAAATAGTACAAGATAAAGACAAAAGAAAAACTATTATATATGCAAGGGTTTCAAGTAATAGTCAAAAAGATGA
TTTAAAAAATCAAGTAGAATTTCTTAAACAATATGCTAACGCTAAAGGAATGATAGTAGATGAAATCTTTGAAGATGTAGGAAGTGGATTAAATTATAAT
CGTAAGAAATGGAATAAATTATTAGAAGATTGTATGTTAGGAGCAATAAAGACAATTATAGTATCTCATAAAGATAGATTTATTAGATTTGGATTTGATT
GGTTTGAAAGATTCGTTAAATCTAACGGTGTAGAATTAATAGTAGTAAATAATGAAAGTTTATCTCCACAAGAAGAAATGATTCAAGACTTAATTTCTAT
AATTCATGTATTTAGTTGTCGTATATATGGATTAAGAAAATATAAGAAAAAAATTAAGGAGGATGATGAAATTGTTAAGAGCTTACAAAGTGGAGATAAA
ACCAACTCAAGAACAAAGTATAAAAATTCATAAAACTATTGGAGTTAGTAGATTTATTTATAATTTTTATATAGCTCATAATAAAGCAATTTATGAAAAA
GAAAATAAATTTATAAGTGGTATGCAATTTTCTAAATGGTTAAATAATGAATATATTCCTAATAATCAAGATAAAATTTGGATTAAAGAAGTTTCTTCAA
AAGCTACCAAACAATCTATAATGAATGGAGAAAAAGCATTTAAGAAGTTTTTCAAAGGCTTAAGTGGATTCCCTAAATTTAAGAAAAAGAAAAACCAAGA
TGTTAAAGCTTATTTTCCAAAGAACAATAAAACTGATTGGACTATTGAAAGGTATAGAGTAAAAATACCTACTCTTGGTTGGATGAGATTAAAAGAGTTT
GGATATATACCAACAAATGTGAAAGTTAAAAGTGGCACAGTTGGTTACAAAGCTAATAGGTACTATGTATCTATACTAGTTGAAGAAATGGATATAGTAG
TTCCAAACCCTACAAACGAGGGAATTGGGATTGACTTAGGACTAAAAGATTTTGCAATATGTTCAAATGGAGATAAGTTTAAGAATATAAATAAAACATC
AAGAGTTAGAAAAGTAGAAAAGAAATTGAAAAGAGAACAAAGAAAACTTTCAAGGAAATATGAGAGTTTGAAAACAAGAAATAAAAATATAAAAGGAGGT
AAAGCTACTAGACAAAATATCCAAAAACAAATAGTCAAGGTACAAAAACTTCATGAAAAACTTACTAATATAAGAACTGATTATATAAATAAAACTGTAA
GTGAAATAGTTAAGCAAAAACCAAGCTATATAACTATTGAAGATTTAAATATTAGTGGAATGATGAAGAATAAACATCTAGCTAAAGCAGTAGCACAACA
AAAATTTTATGAATTTAGAACTAAATTAACCTCTAAGTGTAATCAAAATAATATTGAACTTAGAGTAGTTGATAGATTTTATCCTAGTAGTAAAACTTGT
AGTTGTTGTGGATCTATTAAGAAAGATTTAAAACTATCTGATAGAGTTTATAAATGTAATTGTGGACTTGTTATTGATAGAGATTTAAATGCAAGTATTA
ATTTAGCTAATGTTAAAAAATATAAGATAGCTTAATTAAACAAGTTCTTATATATGTACCGAAGGCTAATTCGGGAATTAACGACTGTGGAGTGCTATAC
AAACTGTAGTAGCTTAGGCAAGACAGGGTACGTTGAAACAGTAACAATCTCGATATGGGTATGTTTGTCCATATTTTGAGTAGCAG
Protein section
ORF number : 2
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
660 bp | 219 aa | 74 | 733 | + | No |
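The lengths in this row follow from the coordinates. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and that the interval includes the stop codon (the function name `orf_lengths` is illustrative):

```python
from typing import Tuple


def orf_lengths(begin: int, end: int) -> Tuple[int, int]:
    """Return (nucleotide length, amino-acid length) for 1-based inclusive coordinates."""
    nt = end - begin + 1
    if nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    # Assumption: the last codon is the stop codon and is not counted as a residue.
    return nt, nt // 3 - 1


nt_len, aa_len = orf_lengths(74, 733)
print(nt_len, aa_len)  # 660 219
```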
Chemistry : Serine
ORF sequence :
MNTYKPKDFAEMIGVSVKTLQRWDNEGKLKAYRNPSNRRYYTHNQYVEYMGKIVQDKDKRKTIIYARVSSNSQKDDLKNQVEFLKQYANAKGMIVDEIFE
DVGSGLNYNRKKWNKLLEDCMLGAIKTIIVSHKDRFIRFGFDWFERFVKSNGVELIVVNNESLSPQEEMIQDLISIIHVFSCRIYGLRKYKKKIKEDDEI
VKSLQSGDKTNSRTKYKNS
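As a consistency check, the start of this protein can be reproduced by translating the DNA sequence from position 74. The sketch below assumes the standard codon assignments and uses only the first 100 bp of the element; `translate` and the constant names are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, ordered with the bases T, C, A, G
# and the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Positions 1-100 of the DNA sequence in this record.
FIRST_100_BP = (
    "TGATAAACTATAAATAAAAACATTGACAAACATAGACAATAGTGGTAATA"
    "TGTATACATAAGGAGTTGAAATTATGAATACATATAAACCAAAAGATTTT"
)


def translate(dna: str, begin: int) -> str:
    """Translate from 1-based position `begin`, stopping at a stop codon
    or when fewer than three bases remain."""
    residues = []
    for i in range(begin - 1, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


n_terminus = translate(FIRST_100_BP, 74)
print(n_terminus)  # MNTYKPKDF
```

The nine residues obtained this way agree with the first nine residues of the ORF 1 sequence listed above.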
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1170 bp | 389 aa | 666 | 1835 | + | No |
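The coordinates of the two ORFs imply that they overlap and are read in different frames. A small sketch of that calculation, assuming 1-based inclusive coordinates on the same strand (function names are illustrative):

```python
from typing import Tuple

Interval = Tuple[int, int]

ORF1: Interval = (74, 733)    # Begin, End from the ORF 1 table
ORF2: Interval = (666, 1835)  # Begin, End from the ORF 2 table


def overlap_bp(a: Interval, b: Interval) -> int:
    """Number of positions shared by two inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def frame_offset(a: Interval, b: Interval) -> int:
    """Reading-frame shift of b relative to a (0 means same frame)."""
    return (b[0] - a[0]) % 3


print(overlap_bp(ORF1, ORF2))    # 68
print(frame_offset(ORF1, ORF2))  # 1
```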
AG : TnpB
ORF sequence :
MKLLRAYKVEIKPTQEQSIKIHKTIGVSRFIYNFYIAHNKAIYEKENKFISGMQFSKWLNNEYIPNNQDKIWIKEVSSKATKQSIMNGEKAFKKFFKGLS
GFPKFKKKKNQDVKAYFPKNNKTDWTIERYRVKIPTLGWMRLKEFGYIPTNVKVKSGTVGYKANRYYVSILVEEMDIVVPNPTNEGIGIDLGLKDFAICS
NGDKFKNINKTSRVRKVEKKLKREQRKLSRKYESLKTRNKNIKGGKATRQNIQKQIVKVQKLHEKLTNIRTDYINKTVSEIVKQKPSYITIEDLNISGMM
KNKHLAKAVAQQKFYEFRTKLTSKCNQNNIELRVVDRFYPSSKTCSCCGSIKKDLKLSDRVYKCNCGLVIDRDLNASINLANVKKYKIA
Blast result :
Comments
ISCbt3 shows 62% amino acid similarity to IS1921 (ORF A) and 57% to ISEfa4 (ORF B).
There are 2 complete copies in Bacteriophage c-st. The ends of this IS were defined by comparing the sequences of the IS-inserted regions in c-st with those of the analogous unoccupied regions in other BoNTX phages (Sakaguchi, Y. et al., 2005). ISCbt3 inserts site-specifically into the AAGGAG sequence.
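One way to inspect the reported target preference is to compare the last six bases of each left flank from the insertion site table with AAGGAG. This is only a sketch: it assumes the target lies immediately upstream of the element, which the record does not state explicitly, and the names used are illustrative.

```python
TARGET = "AAGGAG"

# Left flanks copied from the "Insertion site" table of this record.
LEFT_FLANKS = [
    "TATTGAAAATGTGGACAAGGAC",
    "CAACGTGACTTTAAAATTGAAAGGAG",
]


def mismatches(a: str, b: str) -> int:
    """Count positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


# Assumption: the target occupies the 3' end of each left flank.
flank_mismatches = [mismatches(f[-len(TARGET):], TARGET) for f in LEFT_FLANKS]
print(flank_mismatches)  # [1, 0]
```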
References
[1] Sakaguchi, Y., Hayashi, T., Kurokawa, K., Nakayama, K., Oshima, K., Fujinaga, Y., Ohnishi, M., Ohtsubo, E., Hattori, M. and Oguma, K. (2005) Proc. Natl. Acad. Sci. U.S.A. 102 (48), 17472-17477