ISBthe6
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004663 | ND | Bacteroides thetaiotaomicron | Bacteroides thetaiotaomicron VPI-5482 |
DNA section
IS Length : 2544 bp
Ends
IR Length : 24
IRL : GTAAGCCCCCGATAAAGTACCCGTTGTCTTTGTATCTTTAAGTCTCGATC
IRR : GTAAGCCCCCGATAAAGTACCCGTATAAAAAAGAATCGCTATAGCAAATC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTCAATACAG | ATAACAAT | ATTGCCATTC | 8 |
TGACAAGGTT | ATTGCTTG | TAGTATTTTC | 8 |
DNA sequence
GTAAGCCCCCGATAAAGTACCCGTTGTCTTTGTATCTTTAAGTCTCGATCTTTGCCAGAAAAAAGACTATGCAAAAAATGAGTAAAGAAGAATTCATTGA
AATCCTGTCTCGTCAACAGCGTAGCGGTTTGACGATAAAAGACTTCTGTATAAATGAAGCCTATACCGAATCAAGCTTTTACTATTGGAAAGGAAAGTTT
GGCCTATCGAGGCGTTATCATATGGATAGGCATTCTTCCTCTTTAGAGGAGTTTGCACCGGTTAGCCTCACCTCTTCACCTGCTTCCCACTCTGCCTGTG
ATAGTGGCGCTATACAAACGGGTGAGATCCGGATAGAGTTCCCCGGTGGTATTATCGCCCACTTCAGTGGTATGGCTGAATCCCAGGCGGCCATGCAATT
GCTCACTCAACTCTGCAATCGCCATGTTTTGCCTGAATGACACGATGCGCTACTTTCTCTGTCCGGGTAAGACAGATATGCGTAAAGGTATGAACTCATT
ATGCGGGGTTGTTCATGATAAAATGGGATATGATGTCCGTTTAGGTGATGTGTTTATCTTTATCAACCGCCAGCGCACTACAATGAAACTGCTACACGCG
GAAGATGGCGGACTGGTTTTGTACATAAAACGACTTGAAGAAGGCACTTTCCGCCTTCCCTCTTATGACAAAGAAAGCAAGTCGTACCCTATGCAGTGGC
GTGACCTGGTTCTGATGGTAGAGGGAATCAACGACGAACCTTCCAAAAGGCTCAAACGTCTGAAAGCGTTACGAAAAAGTGACATGCAGTACTGAAAAAC
AGCAGGTGAAACCTTGATTATACGGTTGCTTTTCTGTATTTTTGTATTGTTCAAAACAGGTATGACGAATGACTCAGACAGAAACATTAGAACTTTTGGT
GGCCACACTCCAGCAGGCTAATAGCTCCCAGTCTGAGTCAATCGAGCGTCTGACCCGGCAGAACGAACAGCTGCAAAATAAGCTTCAGGAACTGCTGGCT
CAGGTAGCCTGGTTAAATCGTCAGTTATTCGGCCGTAAGAGCGAGAAATTAGCCCATCTGGATCCGAACCAGCTATCACTGTTCGATCCACCTGTTCAGC
CGTTGGAACACGAAATACCGGAAGAAGCCGCAGCCCAAGAACCTGTCTGCTCGACGACTCCCAAAAAGAAAGTGCGTCAGAACCGCAACATGTTGGATGG
CCTTCCTGTAGTGGAGATTGTCATAGAACCCGAAGGAGTCGATCCGGATAAATACAAGCGCATAGGCGAGGAACGCACACGTACACTTGAATTTGAACCG
GGCAAATTATACGTAAAAGAGATCATACGTCCCAAGTATGGCCTGAAAGACAATATAAGTTTGCCTCAGGGGCATCAGGGCAGTGTTATTATAGCCCCTC
TTCCGCTATTGCCTATCTACAAGGGACTTCCCGGTGCCAGCCTGCTCACTGAAATCCTCTTGCAAAAATATGAATATCATGTACCATTCTATCGTCAGGT
GCGTGAGTTTCACCATTTAGGCCTGAAGATCTCGGAAAACACGCTTCAGGGGTGGTTCAAACCTGCCTGTGAATTACTTAAACCTCTCTATGAAGAGTTG
AAGAAACAGGTATTGAAGGCCGACTATATCCAGGTGGACGAAACGACATTACCGGTTATCAACAAACAGAATCATAAAGCGGTTAAGGAATACTTGTGGA
TAGTCAGGGCGGTTATGGATGGATTGGTCTTCTTTCATTATGATGACGGTTCCCGCTCACAGGAAACAGCCTGGAAATTATTACAAACCTTCAAAGGATA
TCTTCAAAGTGACGGCTATGCGGCCTACAACATCTTTGAGGGTAAGAAAGAGGTGTGCCTTGTCGGATGCCTTGCCCACATAAGGCGACATTACGAGGTT
GCCAAAGAAGAGAATGAATCCCTGGCCGGATATGTTCTGGCTCAAATACAGCAACTCTATCGGATCGAACAGATTGCCGACCAGGAGGAACTCACTTATG
AGCAACGCATGCTTAGAAGACAGGAACAGGCACTTCCCATACTTGAGCAACTGGAAAAATGGATGGAAACAGCCTATCCGAAAGTGCTTCCTAAAAGCCG
GATGGGGCAAGCTATCGCTTACGCGTATCAACTTTGGCCACGTATGAGGAATTATCTGAAAGACGGCAGGCTTAAAATAGATAATAATCTGGCCGAAAAT
GCGATTCGTCCGATAGCCTTATCAAGAAAAAACTTCTTATTTTGTGGAAACCATGAAGCCGCGCAGAACACTGCCATAATCTGTTCGCTCTTGGCATCAT
GCAAAGCCTCCAACATTAACCCCCGGGAATGGCTCACGGAGGTGATTGCACTATTGCCGTATTATGCAGCCAACAAGGAGAAAGACCTAAAAGAGCTGCT
ACCCCATTGCTGGGAATCGGGAAACTCCAAAGAACTCTAATAATACTCTAACGAAAAACAAGAATATAAATACAGAATCACTATAGCTTTGTTCGATTTG
CTATAGCGATTCTTTTTTATACGGGTACTTTATCGGGGGCTTAC
AATCCTGTCTCGTCAACAGCGTAGCGGTTTGACGATAAAAGACTTCTGTATAAATGAAGCCTATACCGAATCAAGCTTTTACTATTGGAAAGGAAAGTTT
GGCCTATCGAGGCGTTATCATATGGATAGGCATTCTTCCTCTTTAGAGGAGTTTGCACCGGTTAGCCTCACCTCTTCACCTGCTTCCCACTCTGCCTGTG
ATAGTGGCGCTATACAAACGGGTGAGATCCGGATAGAGTTCCCCGGTGGTATTATCGCCCACTTCAGTGGTATGGCTGAATCCCAGGCGGCCATGCAATT
GCTCACTCAACTCTGCAATCGCCATGTTTTGCCTGAATGACACGATGCGCTACTTTCTCTGTCCGGGTAAGACAGATATGCGTAAAGGTATGAACTCATT
ATGCGGGGTTGTTCATGATAAAATGGGATATGATGTCCGTTTAGGTGATGTGTTTATCTTTATCAACCGCCAGCGCACTACAATGAAACTGCTACACGCG
GAAGATGGCGGACTGGTTTTGTACATAAAACGACTTGAAGAAGGCACTTTCCGCCTTCCCTCTTATGACAAAGAAAGCAAGTCGTACCCTATGCAGTGGC
GTGACCTGGTTCTGATGGTAGAGGGAATCAACGACGAACCTTCCAAAAGGCTCAAACGTCTGAAAGCGTTACGAAAAAGTGACATGCAGTACTGAAAAAC
AGCAGGTGAAACCTTGATTATACGGTTGCTTTTCTGTATTTTTGTATTGTTCAAAACAGGTATGACGAATGACTCAGACAGAAACATTAGAACTTTTGGT
GGCCACACTCCAGCAGGCTAATAGCTCCCAGTCTGAGTCAATCGAGCGTCTGACCCGGCAGAACGAACAGCTGCAAAATAAGCTTCAGGAACTGCTGGCT
CAGGTAGCCTGGTTAAATCGTCAGTTATTCGGCCGTAAGAGCGAGAAATTAGCCCATCTGGATCCGAACCAGCTATCACTGTTCGATCCACCTGTTCAGC
CGTTGGAACACGAAATACCGGAAGAAGCCGCAGCCCAAGAACCTGTCTGCTCGACGACTCCCAAAAAGAAAGTGCGTCAGAACCGCAACATGTTGGATGG
CCTTCCTGTAGTGGAGATTGTCATAGAACCCGAAGGAGTCGATCCGGATAAATACAAGCGCATAGGCGAGGAACGCACACGTACACTTGAATTTGAACCG
GGCAAATTATACGTAAAAGAGATCATACGTCCCAAGTATGGCCTGAAAGACAATATAAGTTTGCCTCAGGGGCATCAGGGCAGTGTTATTATAGCCCCTC
TTCCGCTATTGCCTATCTACAAGGGACTTCCCGGTGCCAGCCTGCTCACTGAAATCCTCTTGCAAAAATATGAATATCATGTACCATTCTATCGTCAGGT
GCGTGAGTTTCACCATTTAGGCCTGAAGATCTCGGAAAACACGCTTCAGGGGTGGTTCAAACCTGCCTGTGAATTACTTAAACCTCTCTATGAAGAGTTG
AAGAAACAGGTATTGAAGGCCGACTATATCCAGGTGGACGAAACGACATTACCGGTTATCAACAAACAGAATCATAAAGCGGTTAAGGAATACTTGTGGA
TAGTCAGGGCGGTTATGGATGGATTGGTCTTCTTTCATTATGATGACGGTTCCCGCTCACAGGAAACAGCCTGGAAATTATTACAAACCTTCAAAGGATA
TCTTCAAAGTGACGGCTATGCGGCCTACAACATCTTTGAGGGTAAGAAAGAGGTGTGCCTTGTCGGATGCCTTGCCCACATAAGGCGACATTACGAGGTT
GCCAAAGAAGAGAATGAATCCCTGGCCGGATATGTTCTGGCTCAAATACAGCAACTCTATCGGATCGAACAGATTGCCGACCAGGAGGAACTCACTTATG
AGCAACGCATGCTTAGAAGACAGGAACAGGCACTTCCCATACTTGAGCAACTGGAAAAATGGATGGAAACAGCCTATCCGAAAGTGCTTCCTAAAAGCCG
GATGGGGCAAGCTATCGCTTACGCGTATCAACTTTGGCCACGTATGAGGAATTATCTGAAAGACGGCAGGCTTAAAATAGATAATAATCTGGCCGAAAAT
GCGATTCGTCCGATAGCCTTATCAAGAAAAAACTTCTTATTTTGTGGAAACCATGAAGCCGCGCAGAACACTGCCATAATCTGTTCGCTCTTGGCATCAT
GCAAAGCCTCCAACATTAACCCCCGGGAATGGCTCACGGAGGTGATTGCACTATTGCCGTATTATGCAGCCAACAAGGAGAAAGACCTAAAAGAGCTGCT
ACCCCATTGCTGGGAATCGGGAAACTCCAAAGAACTCTAATAATACTCTAACGAAAAACAAGAATATAAATACAGAATCACTATAGCTTTGTTCGATTTG
CTATAGCGATTCTTTTTTATACGGGTACTTTATCGGGGGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
372 bp | 123 aa | 69 | 440 | + | No |
AG : IS66 TnpA
ORF sequence :
MQKMSKEEFIEILSRQQRSGLTIKDFCINEAYTESSFYYWKGKFGLSRRYHMDRHSSSLEEFAPVSLTSSPASHSACDSGAIQTGEIRIEFPGGIIAHFS
GMAESQAAMQLLTQLCNRHVLPE
GMAESQAAMQLLTQLCNRHVLPE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
351 bp | 116 aa | 445 | 795 | + | No |
AG : IS66 TnpB
ORF sequence :
MRYFLCPGKTDMRKGMNSLCGVVHDKMGYDVRLGDVFIFINRQRTTMKLLHAEDGGLVLYIKRLEEGTFRLPSYDKESKSYPMQWRDLVLMVEGINDEPS
KRLKRLKALRKSDMQY
KRLKRLKALRKSDMQY
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1572 bp | 523 aa | 869 | 2440 | + | No |
Chemistry : DDE
ORF sequence :
MTQTETLELLVATLQQANSSQSESIERLTRQNEQLQNKLQELLAQVAWLNRQLFGRKSEKLAHLDPNQLSLFDPPVQPLEHEIPEEAAAQEPVCSTTPKK
KVRQNRNMLDGLPVVEIVIEPEGVDPDKYKRIGEERTRTLEFEPGKLYVKEIIRPKYGLKDNISLPQGHQGSVIIAPLPLLPIYKGLPGASLLTEILLQK
YEYHVPFYRQVREFHHLGLKISENTLQGWFKPACELLKPLYEELKKQVLKADYIQVDETTLPVINKQNHKAVKEYLWIVRAVMDGLVFFHYDDGSRSQET
AWKLLQTFKGYLQSDGYAAYNIFEGKKEVCLVGCLAHIRRHYEVAKEENESLAGYVLAQIQQLYRIEQIADQEELTYEQRMLRRQEQALPILEQLEKWME
TAYPKVLPKSRMGQAIAYAYQLWPRMRNYLKDGRLKIDNNLAENAIRPIALSRKNFLFCGNHEAAQNTAIICSLLASCKASNINPREWLTEVIALLPYYA
ANKEKDLKELLPHCWESGNSKEL
KVRQNRNMLDGLPVVEIVIEPEGVDPDKYKRIGEERTRTLEFEPGKLYVKEIIRPKYGLKDNISLPQGHQGSVIIAPLPLLPIYKGLPGASLLTEILLQK
YEYHVPFYRQVREFHHLGLKISENTLQGWFKPACELLKPLYEELKKQVLKADYIQVDETTLPVINKQNHKAVKEYLWIVRAVMDGLVFFHYDDGSRSQET
AWKLLQTFKGYLQSDGYAAYNIFEGKKEVCLVGCLAHIRRHYEVAKEENESLAGYVLAQIQQLYRIEQIADQEELTYEQRMLRRQEQALPILEQLEKWME
TAYPKVLPKSRMGQAIAYAYQLWPRMRNYLKDGRLKIDNNLAENAIRPIALSRKNFLFCGNHEAAQNTAIICSLLASCKASNINPREWLTEVIALLPYYA
ANKEKDLKELLPHCWESGNSKEL
Blast result :
Comments
ISBthe6 (orf1) is 84% aa similar to ISBthe5 (orf1).
ISBthe6 (orf2) is 96% aa similar to ISBthe5 (orf2).
ISBthe6 (orf3) is 87% aa similar to ISBthe5 (orf3).
ISBthe6 (orf2) is 96% aa similar to ISBthe5 (orf2).
ISBthe6 (orf3) is 87% aa similar to ISBthe5 (orf3).
References
1] Xu,J., Bjursell,M.K., Himrod,J., Deng,S., Carmichael,L.K., Chiang,H.C., Hooper,L.V. and Gordon,J.I. (2003) Science 299 (5615), 2074-2076.
2] Xu,J., Bjursell,M.K., Himrod,J., Deng,S., Carmichael,L.K., Chiang,H.C., Hooper,L.V. and Gordon,J.I. (2002) Direct Submission GenBank.
3] ISfinder annotation (2008).
2] Xu,J., Bjursell,M.K., Himrod,J., Deng,S., Carmichael,L.K., Chiang,H.C., Hooper,L.V. and Gordon,J.I. (2002) Direct Submission GenBank.
3] ISfinder annotation (2008).