ISCbt4
- Family IS607
- Group
Isoform Synonym(s) ISCbo11, ISCbo12
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007581 | ND | Clostridium botulinum | Clostridium botulinum type C C-Stockholm Bacteriophage c-st Clostridium botulinum V891 plasmid p1CbV891 |
DNA section
IS Length : 2554 bp
Ends
IR Length : 0
IRL : AAACTAAACTGTTATGACAAATCTAAACAAAACTATACAAATATGTATAG
IRR : CCGATACGAAAAATATATACATTTCTATCTAAATGTAGACATTTATACTA
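The two end sequences can be checked against the DNA sequence given in this record: IRL should match the first 50 bp, and IRR should be the reverse complement of the last 50 bp. The sketch below uses only the first 50 bp and the final line of that sequence. The helper `reverse_complement` and all variable names are illustrative and are not part of the database.

```python
# Minimal sketch: compare the annotated ends with the termini of the IS.
# Only the sequences are taken from this record; all names are illustrative.

IRL = "AAACTAAACTGTTATGACAAATCTAAACAAAACTATACAAATATGTATAG"
IRR = "CCGATACGAAAAATATATACATTTCTATCTAAATGTAGACATTTATACTA"

# First 50 bp and final line (54 bp) of the DNA sequence in this record.
SEQ_START = "AAACTAAACTGTTATGACAAATCTAAACAAAACTATACAAATATGTATAG"
SEQ_END = "GTCATAGTATAAATGTCTACATTTAGATAGAAATGTATATATTTTTCGTATCGG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# IRL is read directly from the left end of the element.
irl_matches = SEQ_START.startswith(IRL)

# IRR is read from the right end on the opposite strand.
irr_matches = reverse_complement(SEQ_END[-len(IRR):]) == IRR

print("IRL matches left end:", irl_matches)
print("IRR matches right end:", irr_matches)
```

If both checks hold, the two ends are listed in the same inward-reading orientation, which makes them easy to compare even though the annotated IR length is 0.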
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CAAAGAAGGTGAACAAGTTGGG | | ATGAATTACATCGCACAATGGTT | 0 |
DNA sequence
AAACTAAACTGTTATGACAAATCTAAACAAAACTATACAAATATGTATAGAAAAATAGATTTATAACAGGAAGCCCTGTGTATCATGTGAATGGTGCATA
CTCGGTGTTAATTGCTTTTAACTCCTAAAGCCTTACAACCAAAGCGAAGTTGGAAACGACAAGCGTAACGGTGCGAAAGCTGAAAAAATAGTAAGGATGG
CATAAGCTGAAATAAAATCCAATTAGGTTCTATAAAAAGGTAGTCGTGACATATAGAAAACGTTGGACGCTAAATGCTGTAATAATGGATGTTTAGCAGG
GAAATTCCTAAGTCTTTTAAGATATGGAAAACCTTCAACGACTATCTCCTTGAGGGAGAGTAAAACCACAAGCCAATGGTGGAAGAAAAATACCGCATCT
CTTTTGAGATGAAGATATAGTCTACGCTCATATGAAAGTATGAGAGGTCTACTCGTGAGAGAAAGACTGCTATAGGTGTTGCGACTTATAGTGAATAAGA
AAAATATATACAAATATAAAAAAATATTGACATATCTCTCTAAAAAGTGTAATATATAATTATACTAAAGGGAGGTGAGTACATGGAATTAATGTCCATT
GGTAAATTTGCTAAACGTGTAGGCGTTAATGTTGTTACTCTGAGAAGAATGGAGGCTAAGGGAGAGTTTCTTCCAGCTCATGTTTCTTCTGGAGGAACTA
GGTATTATTCAACGGATCAGTTGAAATACTTTGGAAAAAAAAGAAATGCACATAAGTTAGCTGTAGGGTACTGTCGTGTTAACACACCAAGTCAAAAGGA
CGACTTAGAGAATCAAGTAAACAATGTGAAATCTTATATGATTGCCAAAGGCTATCAGTTTGAAATAATCAAAGACATTGGTTCAGGAATTAATTATAAG
AAAAAAGGCTTAAAAGAATTGATAGATAAGATAAATAACCAAGAAGTAAGTAGGGTGGTTATCTTATATAAAGATAGATTAATTCGTTTTGGCTTTGAAT
TAATTGAATATTTATGTCAAATAAACAACGTTGAACTTGAAATTATAGATCATTCCGAAAAGTCTAAAGAGGAAGAACTAACAGATGATTTAATTCAAAT
CATTACAGTTTTTGCAAATAGATTATATGGTCAAAGGTCTAAAAAAACTAAGCGTTTAATTGAGGAAGTGAAAAAACAATGATAGTAGCAATTAAAATAA
AATTAAAGCCAACTAAAGAACAAGAGATTTTATTTTGGAAATCGGCAGGAGTTGCAAGGTGGTCTTATAACTATTTTTTATCAGAAAGTGAAAGACACTA
CCAAGAGTATCTTGAAGGTAAACAAGACAAGAAAACCATAAAAGAAAGCGAAGTCAGAAAATATATTAATAATGTATTAAAAAAGACTACTCATACATGG
CTTAAAGAAGTTGGAAGCAATGTAATGAAACAAGCAGTAAAAGATGCTGACATAGCTAGAAAAAGATGGTTTGATGGTGTTGCAAATAAGCCGAAATTCA
AAAGTAAGCGAAAAAGTAAAGTTAGTTTTTATGTGAATTATGAAAGTCTAAAGCGAACAAATAATGGCTTTAGAGGTGAAAAAATCGGTGTAGTAAAAAC
ATATCAAGCATTACCAAAGTTAAAGAAATGTGAAAAATATTCAAATCCACGAATATCATTTGATGGTATGAATTGGTTTATATCTCTAGGATATAATAAG
GAATTTAAAGCTGTTAAATTAACTGATGTTAGTTTAGGAATTGACGTTGGTATAAAAGAATTAGCTGTGTGTTCAGACGGTCAATTTAAGAAAAATATAA
ATAAAACTAAAAGAGTTAGATTCCTTGAAAAGAAATTGAAACGTGAACAGCGAAAGCTAAGTCATAAACTTGAAGCTAATATTAAAAGTTATGATAAGAA
TAGAAAACCGATTTATAAAAGACCTTTAAGGGATATGAAAAATATCCAAAAACAAAACAGAATAATTCGTAATTTATATAAAAAACTTGAAAATATTCGC
ACAAATCATTTGCATCAATGTTCCAATGAGATAGTGAAAACCAAGCCTTCTCGAATTGTGATGGAAACATTGAATATAAAAGGTATGATGAAGAATAAGC
ATTTATCTAAAGCTATAGCTAATCAAAAGTTATATGAATTTAAAAGACAAATTCAATACAAATGTAAGAAGTATGGAATTAAATTTGTTGAAGCTGATAA
ATGGTATCCATCATCAAAAACTTGTAGTTGTTGTGGTCAAGTTAAATCAGATTTAAAGCTTAAAGACAGATTATATATATGTAGTTGTGGCTTAAAGATG
GATAGAGATTTGAACGCAAGTATTAATTTAGCAAATTACCAAATTCAAAGTGCTTAACCTAAACTAAACTTTGAATATGTACGTAACGATACTACGGAAT
TTAAGCCTGTTAAGAGTCATATCAAATGGTAGTAGCTTCGGCAAACCCAGATTCGTTGAAGCAGGAAGACACAAGACGAAAATCATAAGATATAAGTCGT
GTCATAGTATAAATGTCTACATTTAGATAGAAATGTATATATTTTTCGTATCGG
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
600 | 199 | 583 | 1182 | + | No |
Chemistry : Serine
ORF sequence :
MELMSIGKFAKRVGVNVVTLRRMEAKGEFLPAHVSSGGTRYYSTDQLKYFGKKRNAHKLAVGYCRVNTPSQKDDLENQVNNVKSYMIAKGYQFEIIKDIG
SGINYKKKGLKELIDKINNQEVSRVVILYKDRLIRFGFELIEYLCQINNVELEIIDHSEKSKEEELTDDLIQIITVFANRLYGQRSKKTKRLIEEVKKQ
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1179 | 392 | 1179 | 2357 | + | No |
AG : TnpB
ORF sequence :
MIVAIKIKLKPTKEQEILFWKSAGVARWSYNYFLSESERHYQEYLEGKQDKKTIKESEVRKYINNVLKKTTHTWLKEVGSNVMKQAVKDADIARKRWFDG
VANKPKFKSKRKSKVSFYVNYESLKRTNNGFRGEKIGVVKTYQALPKLKKCEKYSNPRISFDGMNWFISLGYNKEFKAVKLTDVSLGIDVGIKELAVCSD
GQFKKNINKTKRVRFLEKKLKREQRKLSHKLEANIKSYDKNRKPIYKRPLRDMKNIQKQNRIIRNLYKKLENIRTNHLHQCSNEIVKTKPSRIVMETLNI
KGMMKNKHLSKAIANQKLYEFKRQIQYKCKKYGIKFVEADKWYPSSKTCSCCGQVKSDLKLKDRLYICSCGLKMDRDLNASINLANYQIQSA
Blast result :
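The ORF coordinates listed above can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based and inclusive and that each ORF length in bp includes the stop codon. Both are assumptions, although they are consistent with the values in this record. The function names `orf_lengths` and `overlap_bp` are illustrative.

```python
# Minimal sketch: derive ORF lengths from 1-based, inclusive coordinates.
# Assumption: the bp length includes the terminal stop codon.
from typing import Tuple


def orf_lengths(begin: int, end: int) -> Tuple[int, int]:
    """Return (length in bp, length in aa) for an ORF spanning begin..end."""
    length_bp = end - begin + 1
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    length_aa = length_bp // 3 - 1  # the stop codon encodes no residue
    return length_bp, length_aa


def overlap_bp(first_end: int, second_begin: int) -> int:
    """Return the number of bases shared by two consecutive ORFs."""
    return max(0, first_end - second_begin + 1)


orf1 = orf_lengths(583, 1182)    # ORF 1 coordinates from this record
orf2 = orf_lengths(1179, 2357)   # ORF 2 coordinates from this record
shared = overlap_bp(1182, 1179)  # end of ORF 1 versus start of ORF 2

print("ORF 1 (bp, aa):", orf1)
print("ORF 2 (bp, aa):", orf2)
print("Overlap (bp):", shared)
```

With these coordinates the computed lengths agree with the tabulated 600 bp / 199 aa and 1179 bp / 392 aa, and the two ORFs overlap by 4 bp.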
Comments
ORF A of ISCbt4 shows 66% amino acid similarity to IS607, and ORF B shows 50% amino acid similarity to ISEfa4.
There is 1 complete copy in Bacteriophage c-st. The ends of this IS are defined by comparing the sequences of IS-inserted regions in c-st with those of analogous unoccupied regions in other BoNTX phages (Sakaguchi, Y. et al., 2005). ISCbt4 inserts site-specifically into the TTGGG sequence.
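The stated target specificity can be compared with the flanking sequences in the Insertion site table. The sketch below simply tests whether the left flank ends with TTGGG. The function name `flank_ends_with_target` is illustrative, and reading "inserted into TTGGG" as "the left flank ends with TTGGG" is an assumption made only for this check.

```python
# Minimal sketch: check the recorded flanks against the stated target site.
# Sequences come from this record; the interpretation is an assumption.

TARGET = "TTGGG"
LEFT_FLANK = "CAAAGAAGGTGAACAAGTTGGG"
RIGHT_FLANK = "ATGAATTACATCGCACAATGGTT"


def flank_ends_with_target(flank: str, target: str = TARGET) -> bool:
    """Return True if the flank terminates with the target sequence."""
    return flank.upper().endswith(target.upper())


left_has_target = flank_ends_with_target(LEFT_FLANK)
right_has_target = flank_ends_with_target(RIGHT_FLANK)

print("Left flank ends with TTGGG:", left_has_target)
print("Right flank ends with TTGGG:", right_has_target)
```

For the single copy recorded here, only the left flank ends with the target sequence.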
References
[1] Sakaguchi, Y., Hayashi, T., Kurokawa, K., Nakayama, K., Oshima, K., Fujinaga, Y., Ohnishi, M., Ohtsubo, E., Hattori, M. and Oguma, K. (2005) Proc. Natl. Acad. Sci. U.S.A. 102 (48), 17472-17477