ISBthe5
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004663 | ND | Bacteroides thetaiotaomicron | Bacteroides thetaiotaomicron VPI-5482 |
DNA section
IS Length : 2537 bp
Ends
IR Length : 21/24
IRL : GTAAGCCCCCGATAAAGTACCCGTTGTCTAGTATCTTTTTTCGCTCGATC
IRR : GTAAGCTCCCGATAAAGTCCTCGTATAAATTATAGAATCGCTATATACAT
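The "21/24" figure can be checked directly from the two terminal sequences listed above. The following is a minimal sketch, assuming that IRR is written as the reverse complement of the right end (so both ends read outward-in) and that 24 is the span compared. The names `reverse_complement`, `count_matches` and `IR_SPAN` are illustrative and not part of the record.

```python
# Terminal sequences copied from the IRL and IRR fields above.
IRL = "GTAAGCCCCCGATAAAGTACCCGTTGTCTAGTATCTTTTTTCGCTCGATC"
IRR = "GTAAGCTCCCGATAAAGTCCTCGTATAAATTATAGAATCGCTATATACAT"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a, b, span):
    """Count identical bases over the first `span` positions of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


IR_SPAN = 24  # assumption: the denominator of "21/24" is the compared span
matches = count_matches(IRL, IRR, IR_SPAN)
mismatch_positions = [i + 1 for i in range(IR_SPAN) if IRL[i] != IRR[i]]

# Orientation of the right end as it appears on the top strand of the element.
right_end = reverse_complement(IRR)

print(f"{matches}/{IR_SPAN} identical; mismatches at {mismatch_positions}")
```

Under these assumptions the comparison gives 21 identical positions out of 24, with mismatches at positions 7, 19 and 21. The value of `right_end` should coincide with the last 50 bases of the DNA sequence.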
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTATAAAATG | | AGTTGAATAG | 0 |
DNA sequence
GTAAGCCCCCGATAAAGTACCCGTTGTCTAGTATCTTTTTTCGCTCGATCTTTGCGTAAAAAAGATTATGCGAAAAATGAGCAAAGAAGAATTCTTAGAA
ATTCTATCCCGTCAGCAACGAAGCGGTTTGACGATAAAAGACTTCTGTGTCAATGAAGCCTATACCGAATCTAGTTTTTATTATTGGAAAGGAAAGTTTG
GTCTCTCACGACCTTATCATGGGGAAAGATCCTCTTCCGGAGAGTTCGCTCCTATCAATTTGACCTCCCCGTCCACATCCAACCCGTCTTATGATAGAGT
AGCTATGGGATCCGGGGAGATTCAAATAGAGTTTCCTGGTGGCATAATAGCCCGCTTCAGTGGTATGGCTGAATCCCATGCCGCCATGCAATTACTCACT
CAAATTTACGGTCATCATGTTTTGCCTGAATGATACGATGCGATATTTCCTTTGCCCGGGTAAGACAGATATGCGCAAAGGCATGAATTCATTATGTGGA
GTCATTCATAACAAGATGGGCTACGACGTCCGTTTAGGTGACGTATTCATTTTCATAAACCGGCAGCGGACCACGATGAAACTTCTGCATGCGGAAGATG
GCGATCTTGTCTTGTATATAAAAAGGCTGGAGGAAGGAACCTTCCGCTTGCCCGAATATGATCAGCAAAGCAGATCATACCCCATGGAGTGGCGTGATCT
GGTCATGATGGTGAAAGGAATCAATAACGGATCCGCTAAGAGGCTCAAGCGATTGAAGGCGTTACGAAAAAGTGATATATAACAGTGAAACTCAATGGAC
CAGACCTTGTTTAAGTAAGAGTTTTTTCTTATTTTTGTATTGTTCAAAATCCTCATAAATAATGACTCGGACAGAAACATTAGAACTTTTGGTAGCCACA
CTCCAACAGACTAACGCCAGTCAGTCTGAGTCCATCAGACAGCTGACCCGGCAGAATGAACAGTTACAAAACAAACTGGATGAACTGCTGGCTCAGGTAG
CATGGCTGAACCGACAACTTTTTGGTCGTAAGAGTGAAAAGTTATCCCGTCTAGACCCAAATCAGCTATCCCTGTTTGAACAGCCGGTCCAATCCTTAGA
ACCGGAACCTTTGGAAGAAACTGTAGTTGAACAAACTACAACACCTATGGTTACTAAAAAGAAAGAACGGCAGAACCGTAAACTGTTGGAAGGACTTCCG
GTTGTGGAAGTTGTCATAGAACCGCAAGACCTGGATCTTACAAAGTATAAACGGATTGGCGAGGAACATACACGTACACTTGAGTTTGAACCGGGTAAAT
TATATGTAAAGGAAATCATACGTCCCAAATATGGACTGAAAGACAATACCGCTTTACCCCAGGAATATCAGGATGGTATCGTAATAGCCGATCTTCCACT
GTTGCCTATTTATAAGGGACTTCCGGGTGCCAGTCTGCTGGCAGAAATTCTCCTGCAAAAGTATGAGTATCACGTTCCATTCTACCGTCAGATACGGGAG
TTCCATCACTTAGGTTTGAAAATCCCGGAAAACACGCTTCATGGCTGGTTCAAACCCGCCTGTGAGCTGCTAAAGCCTTTGTATGATGAACTCAAGAAGC
AGGTATTGGCAGTCGATTACATCCAGGTGGATGAAACCACATTGCCTATCATCAACAAGCAAAGCCACAAAGCTGTCAGGGAATATCTGTGGATGGTCAG
GGCGGTAACCAGCGGATTAGTATTCTTTCATTATGATGACGGTTCCCGTTCACAGGAAACGGCACGGAACTTATTGGAACCATTTAAAGGATATCTTCAA
AGTGACGGTTATGCGGCCTACAACGTCTTTGAAGGTAAGGAAAGGGTGTGCCTTGTGGGTTGCCTTGTCCATATCAGGCGTCACTATGAGACCGCCAAAG
AGGAAAATAAGTCTCAGGCTAAGTACGTTCTGGCCAAAATACAGGAACTCTACCGGATAGAGCAGGCGGCCGACATACAGGGAATATCCCCTGAAATGCG
AATGTCAAAAAGGCAGAAACAGGCTCTTCCTATTCTTGACGAGCTGGAACAATGGATGGAAACAACCTATCCGAAAGTGCTTCCTAAAAGCCAGATGGGA
CAAGCCATTGCCTATGCCTATACCCTTTGGCCACGCATGAGAAATTACCTCAAAGACGGCAAGCTCAAAATTGATAATAATCTGGCTGAAAATGCAATCC
GCCCAATAGCTTTATCGAGAAAAAACTTCTTGTTTTGTGGAAATCATGAAGCGGCGGAAAACACAGATGTCATCTGTTCATTATTGGCATCATGTAAAGC
ATCTCAAGTTAACCCTAGAGAATGGCTTACCGAAATAATTGCCTGGTTGCCATATTACACAAGGGATAAGGGAAAAGACCTAAAAGAGTTGCTACCCAAT
TACTGGAAACTGAGAAGATCCAAAGAAATCTAACAATACTCTAATGAAAAGCATATTATAAACAGAGAATCGCTATAAACTTATTAGATGTATATAGCGA
TTCTATAATTTATACGAGGACTTTATCGGGAGCTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
366 bp | 121 aa | 68 | 433 | + | No |
AG : IS66 TnpA
ORF sequence :
MRKMSKEEFLEILSRQQRSGLTIKDFCVNEAYTESSFYYWKGKFGLSRPYHGERSSSGEFAPINLTSPSTSNPSYDRVAMGSGEIQIEFPGGIIARFSGM
AESHAAMQLLTQIYGHHVLPE
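The link between the ORF 1 coordinates and its protein sequence can be illustrated by translating the start of the reading frame. This is a sketch only: it assumes that `Begin` is a 1-based coordinate on the strand shown in the DNA section, and it uses the standard genetic code, which is an assumption about how the ORF was translated. Only the first 100 bases of the element are included, so only the first residues are produced.

```python
# First 100 bases of the DNA sequence (the first sequence line of the record).
FIRST_LINE = (
    "GTAAGCCCCCGATAAAGTACCCGTTGTCTAGTATCTTTTTTCGCTCGATC"
    "TTTGCGTAAAAAAGATTATGCGAAAAATGAGCAAAGAAGAATTCTTAGAA"
)

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate the complete codons of `dna`, starting at index 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


ORF1_BEGIN = 68  # 1-based start coordinate from the ORF 1 table

# Convert the 1-based coordinate to a 0-based slice before translating.
peptide = translate(FIRST_LINE[ORF1_BEGIN - 1:])
print(peptide)
```

If the coordinate convention is as assumed, the printed peptide should match the first 11 residues of the ORF 1 sequence, MRKMSKEEFLE.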
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
345 bp | 114 aa | 438 | 782 | + | No |
AG : IS66 TnpB
ORF sequence :
MRYFLCPGKTDMRKGMNSLCGVIHNKMGYDVRLGDVFIFINRQRTTMKLLHAEDGDLVLYIKRLEEGTFRLPEYDQQSRSYPMEWRDLVMMVKGINNGSA
KRLKRLKALRKSDI
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1572 bp | 523 aa | 862 | 2433 | + | No |
Chemistry : DDE
ORF sequence :
MTRTETLELLVATLQQTNASQSESIRQLTRQNEQLQNKLDELLAQVAWLNRQLFGRKSEKLSRLDPNQLSLFEQPVQSLEPEPLEETVVEQTTTPMVTKK
KERQNRKLLEGLPVVEVVIEPQDLDLTKYKRIGEEHTRTLEFEPGKLYVKEIIRPKYGLKDNTALPQEYQDGIVIADLPLLPIYKGLPGASLLAEILLQK
YEYHVPFYRQIREFHHLGLKIPENTLHGWFKPACELLKPLYDELKKQVLAVDYIQVDETTLPIINKQSHKAVREYLWMVRAVTSGLVFFHYDDGSRSQET
ARNLLEPFKGYLQSDGYAAYNVFEGKERVCLVGCLVHIRRHYETAKEENKSQAKYVLAKIQELYRIEQAADIQGISPEMRMSKRQKQALPILDELEQWME
TTYPKVLPKSQMGQAIAYAYTLWPRMRNYLKDGKLKIDNNLAENAIRPIALSRKNFLFCGNHEAAENTDVICSLLASCKASQVNPREWLTEIIAWLPYYT
RDKGKDLKELLPNYWKLRRSKEI
Blast result :
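The lengths reported for the three ORFs follow from their Begin and End coordinates. The sketch below recomputes them, assuming the coordinates are 1-based and inclusive and that each interval includes the stop codon (which would explain why the residue count is one less than the codon count). The helper `orf_lengths` is hypothetical and shown only to make the arithmetic explicit.

```python
# (name, begin, end, reported_bp, reported_aa) copied from the ORF tables.
ORFS = [
    ("ORF 1", 68, 433, 366, 121),
    ("ORF 2", 438, 782, 345, 114),
    ("ORF 3", 862, 2433, 1572, 523),
]
IS_LENGTH = 2537  # from the "IS Length" field


def orf_lengths(begin, end):
    """Return (nucleotides, residues) for an inclusive 1-based interval.

    Assumes the interval ends with a stop codon, which is not translated.
    """
    nt = end - begin + 1
    if nt % 3:
        raise ValueError("interval is not a whole number of codons")
    return nt, nt // 3 - 1


for name, begin, end, bp, aa in ORFS:
    nt, residues = orf_lengths(begin, end)
    ok = (nt, residues) == (bp, aa) and end <= IS_LENGTH
    print(f"{name}: {nt} bp, {residues} aa ({'consistent' if ok else 'check'})")
```

For example, ORF 1 spans 433 - 68 + 1 = 366 bp, which is 122 codons, or 121 residues once the stop codon is excluded.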
Comments
ISBthe5 (orf1) is 84% aa similar to ISBthe6.
ISBthe5 (orf2) is 51% aa similar to ISCro1 (orf2).
ISBthe5 (orf3) is 50% aa similar to ISPpu19 (orf3).
References
[1] Xu,J., Bjursell,M.K., Himrod,J., Deng,S., Carmichael,L.K., Chiang,H.C., Hooper,L.V. and Gordon,J.I. (2003) Science 299 (5615), 2074-2076.
[2] Xu,J., Bjursell,M.K., Himrod,J., Deng,S., Carmichael,L.K., Chiang,H.C., Hooper,L.V. and Gordon,J.I. (2002) Direct Submission GenBank.
[3] ISfinder annotation (2008).