ISBthe3
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004663 | ND | Bacteroides thetaiotaomicron | Bacteroides thetaiotaomicron VPI-5482 |
DNA section
IS Length : 1458 bp
Ends
IR Length : 19
IRL : CTACCCTTTATACACAAGTATATATTATTTTTTTTATCTTTGCTTGATGT
IRR : CTACCCTTTATACACAAGTCTTAGCTGATTAACTCATACAGTTCAAACTG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACCACTCCTC | CTTCCTGAAG | GAGGAGAATG | 10 |
GGTACTCCTC | CTTCCCGAAG | GAGGAGAGTT | 10 |
AATTCTCCTC | CTTCGGGAAG | GAGGAGTACC | 10 |
GCCACCCCTC | CTTCCTGAAG | GAGGGGAGTT | 10 |
DNA sequence
CTACCCTTTATACACAAGTATATATTATTTTTTTTATCTTTGCTTGATGTTATTAGAGACAGATCAAATTCGTGATCCTCGGCTTTTGCGGCGTTTAAAC
TTAATTTGCTCTCAGATGGTTGTCCATCAAAGTGCTATAGTGAATCAATTTAGTAAGGAACATAAAGAAAAGATGGGTGCTTATAGATTTTTGAACAATT
CCTCAGTCAGCTCTGATGCTATCTTATCAGGTCTGATACACACTTGCTGTAAGAATGCTTCCGGTCGTCAGCATTTACTGTGTATTCAAGATACCTCGGA
GATAAACTATGAGGCTCATGTTGAGCGAATGAAGAAAAAAACAGCCAGTCCCGGCATTGTCGGTCAAAAGCAATGTGGTACTTTTTTGCATCCTGTCTTG
GTAGTGGATGCCTCCAGTCATATTCCTATAGGCTTTTCTTCGGTTAAGCAGTGGAATCGCTCACCGGCTGCTTTAAGTCGTGAAGAACGTAATTACAGAT
ATCAGCCTATAGAAGAAAAAGAGTCATATCGTTGGATAGAAAGTGGAATGGCTGCCAGTGAGCAAATGCCCCGGGATGCAGTTAAAACGATCATCGGTGA
CCGTGAAGCGGACATTTTCGAGCTTTTCAGCCGTATCCCTACTGATAATGTTCACTTGCTGATACGTTCTGTTCATGAAAGGAATTGCCGGTTGGATGAT
CCAGACTGTTCTGTCCATCTGAATACATTAATGGAGCAGGCTGTTCTACGGGCAGAGTATAGCTTTGAAGTGCTCCCGGGAAGCGGACGTAAGAAACGGG
TAGCGTGCATGGAACTTCGCTTTGAAAGAGTCACCTTGTGCGCTCCTGTTAACGGTCCGGCAAAGGGCAGTCCCCCTGTTAGTCTTTATTGCATACATGT
TAAAGAAAAATCTTCCAGTACACCGGTAAATGAGAGCCCTATTGAATGGAGACTGCTAACTACACATGTGGTGGAGACTGTAGAACAAGCAATTGAATGT
ATCGGTTGGTATCGTTGTAGATGGCTGATTGAAGAGTTGTTCAGAGTGCTCAAAAGAAAGGGATTCATGATTGAGGACGCACAGTTGGAAACAGTTTCGG
CATTACAAAAACTAATCTTAATTTCCTTGCAGGCAGCCCTGCAGGTGATGGTACTCAAACTTTCTTTTGATAAAGAAGATGAAAAACTCTCTTCAGAAAT
CTACTTTACAAGCAAAGAAATAGCATTATTACATATAGTAGGAAAAAAGAGTGAGGGAAATACAAAAATACAGCAAAATCCATATAAAAAAGAATCAATG
GCATGGGCAGCATGGATTATTGCAAGGTTAGGAGCATGGAGTGCATACAAGAGCCAGTCCATTCCAGGATATATTACCTTTAAGAATGGACTGGATAGAT
TTTATACACAGTTTGAACTGTATGAGTTAATCAGCTAAGACTTGTGTATAAAGGGTAG
TTAATTTGCTCTCAGATGGTTGTCCATCAAAGTGCTATAGTGAATCAATTTAGTAAGGAACATAAAGAAAAGATGGGTGCTTATAGATTTTTGAACAATT
CCTCAGTCAGCTCTGATGCTATCTTATCAGGTCTGATACACACTTGCTGTAAGAATGCTTCCGGTCGTCAGCATTTACTGTGTATTCAAGATACCTCGGA
GATAAACTATGAGGCTCATGTTGAGCGAATGAAGAAAAAAACAGCCAGTCCCGGCATTGTCGGTCAAAAGCAATGTGGTACTTTTTTGCATCCTGTCTTG
GTAGTGGATGCCTCCAGTCATATTCCTATAGGCTTTTCTTCGGTTAAGCAGTGGAATCGCTCACCGGCTGCTTTAAGTCGTGAAGAACGTAATTACAGAT
ATCAGCCTATAGAAGAAAAAGAGTCATATCGTTGGATAGAAAGTGGAATGGCTGCCAGTGAGCAAATGCCCCGGGATGCAGTTAAAACGATCATCGGTGA
CCGTGAAGCGGACATTTTCGAGCTTTTCAGCCGTATCCCTACTGATAATGTTCACTTGCTGATACGTTCTGTTCATGAAAGGAATTGCCGGTTGGATGAT
CCAGACTGTTCTGTCCATCTGAATACATTAATGGAGCAGGCTGTTCTACGGGCAGAGTATAGCTTTGAAGTGCTCCCGGGAAGCGGACGTAAGAAACGGG
TAGCGTGCATGGAACTTCGCTTTGAAAGAGTCACCTTGTGCGCTCCTGTTAACGGTCCGGCAAAGGGCAGTCCCCCTGTTAGTCTTTATTGCATACATGT
TAAAGAAAAATCTTCCAGTACACCGGTAAATGAGAGCCCTATTGAATGGAGACTGCTAACTACACATGTGGTGGAGACTGTAGAACAAGCAATTGAATGT
ATCGGTTGGTATCGTTGTAGATGGCTGATTGAAGAGTTGTTCAGAGTGCTCAAAAGAAAGGGATTCATGATTGAGGACGCACAGTTGGAAACAGTTTCGG
CATTACAAAAACTAATCTTAATTTCCTTGCAGGCAGCCCTGCAGGTGATGGTACTCAAACTTTCTTTTGATAAAGAAGATGAAAAACTCTCTTCAGAAAT
CTACTTTACAAGCAAAGAAATAGCATTATTACATATAGTAGGAAAAAAGAGTGAGGGAAATACAAAAATACAGCAAAATCCATATAAAAAAGAATCAATG
GCATGGGCAGCATGGATTATTGCAAGGTTAGGAGCATGGAGTGCATACAAGAGCCAGTCCATTCCAGGATATATTACCTTTAAGAATGGACTGGATAGAT
TTTATACACAGTTTGAACTGTATGAGTTAATCAGCTAAGACTTGTGTATAAAGGGTAG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1320 bp | 4406 aa | 116 | 1435 | + | No |
Chemistry : DDE
ORF sequence :
MVVHQSAIVNQFSKEHKEKMGAYRFLNNSSVSSDAILSGLIHTCCKNASGRQHLLCIQDTSEINYEAHVERMKKKTASPGIVGQKQCGTFLHPVLVVDAS
SHIPIGFSSVKQWNRSPAALSREERNYRYQPIEEKESYRWIESGMAASEQMPRDAVKTIIGDREADIFELFSRIPTDNVHLLIRSVHERNCRLDDPDCSV
HLNTLMEQAVLRAEYSFEVLPGSGRKKRVACMELRFERVTLCAPVNGPAKGSPPVSLYCIHVKEKSSSTPVNESPIEWRLLTTHVVETVEQAIECIGWYR
CRWLIEELFRVLKRKGFMIEDAQLETVSALQKLILISLQAALQVMVLKLSFDKEDEKLSSEIYFTSKEIALLHIVGKKSEGNTKIQQNPYKKESMAWAAW
IIARLGAWSAYKSQSIPGYITFKNGLDRFYTQFELYELIS
SHIPIGFSSVKQWNRSPAALSREERNYRYQPIEEKESYRWIESGMAASEQMPRDAVKTIIGDREADIFELFSRIPTDNVHLLIRSVHERNCRLDDPDCSV
HLNTLMEQAVLRAEYSFEVLPGSGRKKRVACMELRFERVTLCAPVNGPAKGSPPVSLYCIHVKEKSSSTPVNESPIEWRLLTTHVVETVEQAIECIGWYR
CRWLIEELFRVLKRKGFMIEDAQLETVSALQKLILISLQAALQVMVLKLSFDKEDEKLSSEIYFTSKEIALLHIVGKKSEGNTKIQQNPYKKESMAWAAW
IIARLGAWSAYKSQSIPGYITFKNGLDRFYTQFELYELIS
Blast result :
Comments
ISBthe3 is 43% aa similar to ISEcl13.
ISBthe3 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-126-D(N3)-143-E(C1). The copy number in Bacteroides thetaiotaomicron VPI-5482 is 4.
ISBthe3 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-126-D(N3)-143-E(C1). The copy number in Bacteroides thetaiotaomicron VPI-5482 is 4.
References
1] Xu,J., Bjursell,M.K., Himrod,J., Deng,S., Carmichael,L.K., Chiang,H.C., Hooper,L.V. and Gordon,J.I.(2003)Science 299(5615), 2074-2076.
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18