ISBf10
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006347 | ND | Bacteroides fragilis | Bacteroides fragilis YCH46 |
DNA section
IS Length : 2939 bp
Ends
IR Length : 20/21
IRL : GTAAGCATTCGATAAACTCCCCCTTTCATTCTAAAAATGTGATATTACTT
IRR : GTAAGTATTCGATAAACTCCCTCGTATATTGGGTAACGTATGTTCATAAT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGGAGATACC | TTTGCTTT | CGTCTTTAGT | 8 |
GAAAGACTGT | TTTATTTC | AATCTTGTTT | 8 |
TTGATTGTAA | AATACTTC | ATTGCTTCTT | 8 |
TGATGTCTTA | CCTTGAAC | GCAATTCACC | 8 |
DNA sequence
GTAAGCATTCGATAAACTCCCCCTTTCATTCTAAAAATGTGATATTACTTTTGTAATCAAAGTAAAAAAGCTCAATTGTTATGTCACATCAATGGACCAT
GGAAGATTTTGAATCTATTTATTCCCGTTTCAAGTCGAGCGGACTGTCAGTTATGGATTTTTGTTCGAATGAATGTATTCGTCCCAAACGTTTCTACGAG
TGGCGTTCCAAGCTGTTGCGCAAAGGCGGCTTTATCCCGGTAAAGGTAAACAGTAAGGGCCAGGTCAGTCTTCCCCATAAGGAGAAATCCCTGCTGTCTG
CCCCTCCGGTCAGCCCGTCGCCAATTCCCCAGCCGCTATGTGAGATCTCCTATCCCAATGGCGTCACAGTCCGCTTGAACAGCCCTTTGTCACCGGAGGT
ATTGCAAACCCTGATATTTTTGAATTCAAACCGTTAGCCTATGTTTTCTTTGAATGAATCCAACAGATATTATCTTTACCCGTATCCGACAGACATGCGT
AAGAGTTTTTATACGCTTAGCGGCATCGTGACCAACCAGATGGGAAAGAATGTACGGGACGGTGACGCTTTCATTTTTATCAATGCGAATTGTACCTGTA
TGAAAATCCTCCATATGGAATATGGTGGTCTGGTGATATACCATATGAGGCTGGAACATGGGCACTTCCATCTGCCGGTCATAAATACGGAGGAAGGTCG
GATTAAAGCGATTGAAACCTTCTGGAATGACCTTGTGATGATGGTTCAAGGAATGGACGGCAGCAAGGTCAGGCGTTATAAAAGGAGTGGTTTCCATGGG
TTGTAGTTGTAACAAATTATAACATATACAATGCCGTATTCCGGCTGTTTTTTAGTATCTTAGCATCATATTTAAACGATTATGCCAACCGATAAGGAAT
TACTGATAAAGGATCTCATGCACAAATGCGACTGTCTGTATAGAGAAAACAGCCGGTTGAAGGAGATGGTGTCCACGCAGCCCCTTAAAGCTGCGGACAA
AGAGGTGTATGAGGCTTTATTGTCTGACAAGGATGCCATAATCGCCCAAAAGGAAGCAAAAATCAACAGTCTGGAGCAACGTGTATCCTATCTTGAACGG
CAGTTGTACGGTAAGAAGGCGGAAAAGTTCATAAAGCCTGACGCCCAGGACCGCTGGCTGGATTTTGAAGGCTTTGACATGCTTCCCCAGGAGGCTGAAG
CCGCAGAAGAAGCAGAAAAGGAGTTGAAGGCTACCAGAGAAGCGATCATCGCCCGTAAAAAAGCGGGGAAGCAGCATCCTGCGAGAAAATCCCTTCCGGA
GAATCTTGAGCGTGAGGTGGTCCATATATATCCCGAAGGGTATAATCCGGAAGAGTGGACGCTCCTTCCCGGAGAGGAAGTGACCGAGATCCTTATGCAC
GAGCCCGAGAAGTTCTATATCCGCAGGATAGTGCGCCATACAGCCAAACGGAAGGGTACAAACGAGTTCAAGACCGGCCCGCTTCCCGTCATGCCAATAG
CAAAGAGCTATGCCTCTGCCTCATTGCTGGCAGACATGATGATAGGGAAATATGTGGATCACATACCGTTCCACAGGCAACTGGAACAGTTCAAACGTGT
GGGGGTACATCTCCCCGCGTCAACGGTCAACGACTGGTTCAAGGATGTGGCGGATTTGCTAAGGCCCCTTTACTTCCGATTATGGGAGCTGGTGATGCAG
ACTGACTACATACAGTCGGATGAAACGACAATCCCCGTGATGAATGACGAGAGACACAAGACGGTCAAAGGTTATATCTGGCTTGTGCGCAGTGTCATGA
CCGGACGTCAGTTCTTTTACTATGACAAGGGTTCCCGAAGTGGAAAGGTGGTGCTGAAACTCTTCGGCAAGTTCCGGGGAGCCATACAGACGGACGGATA
CGAAAGGTACGAGATGCTGGACGCCAAGAAAGGTATTATCCTTCTTGGCTGTTGGGCCCATGCACGCAGACATTTTTGGGAGGCAAGAAAGAATGACATG
CAGCGTGCCGACTATGCGCTCGCACAGATACAGTTGCTTTATGACGTGGAGCGTAAGGCTGACGATGAACGCCTGACTTACGAACAGAGGGCTGAACTTA
GGGCACGTCTTGCATATCCCATACTTGTGCGCTTCGAGAAATGGCTGGTCAATGAATATCCCAAAGTAATGAAAGACAGTCCGATCGGAAAAGCCATAAA
ATATACATACGGAAGGTTCGACAAACTCTCCAGGTACCATCTGGACGGTCGTTACAGACCGGATAATAACGAGATAGAAAATAAAGTGCGGCCTGTCGCG
TGCGGCAGACGTAATTACCTGTTCTGTGGCAATAATGATGCCGCTGAAGATGCGGCGGTGCTCTACTCGTTCTTCGGCTGCTGCAAGGCTGCCGGGGCTG
ACTTCCGCACGTGGCTGATCTACTTCCTTGAACATATACATGATTATGACGATGACTACTCGATGGATCTGGCCGAATTGTTACCTGACAATCTGTTATC
CAAGGGCAAGATATTATCCGTTACGTCACCGGAATCTCCCAAGAAAGACTCCTGATACCTTCGTAAATCTCGCCTAAACACGGGAAAACTACCGTTTTAT
GCCAATCAAGACGGGAAAATAGGGAATAAAGTGCCGTTTTTATCTTCATTATCCCCGATAAATAGGGAAAATAAGGGAAAGAATACCCTTTTATACCAAT
TATGGCGGGAAAGTAGGGATAAAAATATCGCTTTATCACATTTGTCACAGGAAAATAAGGGGCAAAGGGGAAAAACTTCCTTTTATCCGCTCCATTAGTA
GGGATAATATCAATTACTATAGAACCCGCACATCTTTGGGTTGAATTGATGTAACAATAAATCATAGAACGATAAAATGAATAAACGACATTATGAACAT
ACGTTACCCAATATACGAGGGAGTTTATCGAATACTTAC
GGAAGATTTTGAATCTATTTATTCCCGTTTCAAGTCGAGCGGACTGTCAGTTATGGATTTTTGTTCGAATGAATGTATTCGTCCCAAACGTTTCTACGAG
TGGCGTTCCAAGCTGTTGCGCAAAGGCGGCTTTATCCCGGTAAAGGTAAACAGTAAGGGCCAGGTCAGTCTTCCCCATAAGGAGAAATCCCTGCTGTCTG
CCCCTCCGGTCAGCCCGTCGCCAATTCCCCAGCCGCTATGTGAGATCTCCTATCCCAATGGCGTCACAGTCCGCTTGAACAGCCCTTTGTCACCGGAGGT
ATTGCAAACCCTGATATTTTTGAATTCAAACCGTTAGCCTATGTTTTCTTTGAATGAATCCAACAGATATTATCTTTACCCGTATCCGACAGACATGCGT
AAGAGTTTTTATACGCTTAGCGGCATCGTGACCAACCAGATGGGAAAGAATGTACGGGACGGTGACGCTTTCATTTTTATCAATGCGAATTGTACCTGTA
TGAAAATCCTCCATATGGAATATGGTGGTCTGGTGATATACCATATGAGGCTGGAACATGGGCACTTCCATCTGCCGGTCATAAATACGGAGGAAGGTCG
GATTAAAGCGATTGAAACCTTCTGGAATGACCTTGTGATGATGGTTCAAGGAATGGACGGCAGCAAGGTCAGGCGTTATAAAAGGAGTGGTTTCCATGGG
TTGTAGTTGTAACAAATTATAACATATACAATGCCGTATTCCGGCTGTTTTTTAGTATCTTAGCATCATATTTAAACGATTATGCCAACCGATAAGGAAT
TACTGATAAAGGATCTCATGCACAAATGCGACTGTCTGTATAGAGAAAACAGCCGGTTGAAGGAGATGGTGTCCACGCAGCCCCTTAAAGCTGCGGACAA
AGAGGTGTATGAGGCTTTATTGTCTGACAAGGATGCCATAATCGCCCAAAAGGAAGCAAAAATCAACAGTCTGGAGCAACGTGTATCCTATCTTGAACGG
CAGTTGTACGGTAAGAAGGCGGAAAAGTTCATAAAGCCTGACGCCCAGGACCGCTGGCTGGATTTTGAAGGCTTTGACATGCTTCCCCAGGAGGCTGAAG
CCGCAGAAGAAGCAGAAAAGGAGTTGAAGGCTACCAGAGAAGCGATCATCGCCCGTAAAAAAGCGGGGAAGCAGCATCCTGCGAGAAAATCCCTTCCGGA
GAATCTTGAGCGTGAGGTGGTCCATATATATCCCGAAGGGTATAATCCGGAAGAGTGGACGCTCCTTCCCGGAGAGGAAGTGACCGAGATCCTTATGCAC
GAGCCCGAGAAGTTCTATATCCGCAGGATAGTGCGCCATACAGCCAAACGGAAGGGTACAAACGAGTTCAAGACCGGCCCGCTTCCCGTCATGCCAATAG
CAAAGAGCTATGCCTCTGCCTCATTGCTGGCAGACATGATGATAGGGAAATATGTGGATCACATACCGTTCCACAGGCAACTGGAACAGTTCAAACGTGT
GGGGGTACATCTCCCCGCGTCAACGGTCAACGACTGGTTCAAGGATGTGGCGGATTTGCTAAGGCCCCTTTACTTCCGATTATGGGAGCTGGTGATGCAG
ACTGACTACATACAGTCGGATGAAACGACAATCCCCGTGATGAATGACGAGAGACACAAGACGGTCAAAGGTTATATCTGGCTTGTGCGCAGTGTCATGA
CCGGACGTCAGTTCTTTTACTATGACAAGGGTTCCCGAAGTGGAAAGGTGGTGCTGAAACTCTTCGGCAAGTTCCGGGGAGCCATACAGACGGACGGATA
CGAAAGGTACGAGATGCTGGACGCCAAGAAAGGTATTATCCTTCTTGGCTGTTGGGCCCATGCACGCAGACATTTTTGGGAGGCAAGAAAGAATGACATG
CAGCGTGCCGACTATGCGCTCGCACAGATACAGTTGCTTTATGACGTGGAGCGTAAGGCTGACGATGAACGCCTGACTTACGAACAGAGGGCTGAACTTA
GGGCACGTCTTGCATATCCCATACTTGTGCGCTTCGAGAAATGGCTGGTCAATGAATATCCCAAAGTAATGAAAGACAGTCCGATCGGAAAAGCCATAAA
ATATACATACGGAAGGTTCGACAAACTCTCCAGGTACCATCTGGACGGTCGTTACAGACCGGATAATAACGAGATAGAAAATAAAGTGCGGCCTGTCGCG
TGCGGCAGACGTAATTACCTGTTCTGTGGCAATAATGATGCCGCTGAAGATGCGGCGGTGCTCTACTCGTTCTTCGGCTGCTGCAAGGCTGCCGGGGCTG
ACTTCCGCACGTGGCTGATCTACTTCCTTGAACATATACATGATTATGACGATGACTACTCGATGGATCTGGCCGAATTGTTACCTGACAATCTGTTATC
CAAGGGCAAGATATTATCCGTTACGTCACCGGAATCTCCCAAGAAAGACTCCTGATACCTTCGTAAATCTCGCCTAAACACGGGAAAACTACCGTTTTAT
GCCAATCAAGACGGGAAAATAGGGAATAAAGTGCCGTTTTTATCTTCATTATCCCCGATAAATAGGGAAAATAAGGGAAAGAATACCCTTTTATACCAAT
TATGGCGGGAAAGTAGGGATAAAAATATCGCTTTATCACATTTGTCACAGGAAAATAAGGGGCAAAGGGGAAAAACTTCCTTTTATCCGCTCCATTAGTA
GGGATAATATCAATTACTATAGAACCCGCACATCTTTGGGTTGAATTGATGTAACAATAAATCATAGAACGATAAAATGAATAAACGACATTATGAACAT
ACGTTACCCAATATACGAGGGAGTTTATCGAATACTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 81 | 437 | + | No |
AG : IS66 TnpA
ORF sequence :
MSHQWTMEDFESIYSRFKSSGLSVMDFCSNECIRPKRFYEWRSKLLRKGGFIPVKVNSKGQVSLPHKEKSLLSAPPVSPSPIPQPLCEISYPNGVTVRLN
SPLSPEVLQTLIFLNSNR
SPLSPEVLQTLIFLNSNR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
366 bp | 121 aa | 441 | 806 | + | No |
AG : IS66 TnpB
ORF sequence :
MFSLNESNRYYLYPYPTDMRKSFYTLSGIVTNQMGKNVRDGDAFIFINANCTCMKILHMEYGGLVIYHMRLEHGHFHLPVINTEEGRIKAIETFWNDLVM
MVQGMDGSKVRRYKRSGFHGL
MVQGMDGSKVRRYKRSGFHGL
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1674 bp | 557 aa | 882 | 2555 | + | No |
Chemistry : DDE
ORF sequence :
MPTDKELLIKDLMHKCDCLYRENSRLKEMVSTQPLKAADKEVYEALLSDKDAIIAQKEAKINSLEQRVSYLERQLYGKKAEKFIKPDAQDRWLDFEGFDM
LPQEAEAAEEAEKELKATREAIIARKKAGKQHPARKSLPENLEREVVHIYPEGYNPEEWTLLPGEEVTEILMHEPEKFYIRRIVRHTAKRKGTNEFKTGP
LPVMPIAKSYASASLLADMMIGKYVDHIPFHRQLEQFKRVGVHLPASTVNDWFKDVADLLRPLYFRLWELVMQTDYIQSDETTIPVMNDERHKTVKGYIW
LVRSVMTGRQFFYYDKGSRSGKVVLKLFGKFRGAIQTDGYERYEMLDAKKGIILLGCWAHARRHFWEARKNDMQRADYALAQIQLLYDVERKADDERLTY
EQRAELRARLAYPILVRFEKWLVNEYPKVMKDSPIGKAIKYTYGRFDKLSRYHLDGRYRPDNNEIENKVRPVACGRRNYLFCGNNDAAEDAAVLYSFFGC
CKAAGADFRTWLIYFLEHIHDYDDDYSMDLAELLPDNLLSKGKILSVTSPESPKKDS
LPQEAEAAEEAEKELKATREAIIARKKAGKQHPARKSLPENLEREVVHIYPEGYNPEEWTLLPGEEVTEILMHEPEKFYIRRIVRHTAKRKGTNEFKTGP
LPVMPIAKSYASASLLADMMIGKYVDHIPFHRQLEQFKRVGVHLPASTVNDWFKDVADLLRPLYFRLWELVMQTDYIQSDETTIPVMNDERHKTVKGYIW
LVRSVMTGRQFFYYDKGSRSGKVVLKLFGKFRGAIQTDGYERYEMLDAKKGIILLGCWAHARRHFWEARKNDMQRADYALAQIQLLYDVERKADDERLTY
EQRAELRARLAYPILVRFEKWLVNEYPKVMKDSPIGKAIKYTYGRFDKLSRYHLDGRYRPDNNEIENKVRPVACGRRNYLFCGNNDAAEDAAVLYSFFGC
CKAAGADFRTWLIYFLEHIHDYDDDYSMDLAELLPDNLLSKGKILSVTSPESPKKDS
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
168 bp | 55 aa | 2631 | 2798 | + | No |
Annotation : Description :
ORF sequence :
MPFLSSLSPINRENKGKNTLLYQLWRESRDKNIALSHLSQENKGQRGKTSFYPLH
Blast result :
Comments
ISBf10 (orf1) is 43% aa similar to ISCro1 (orf1).
ISBf10 (orf2) is 48% aa similar to ISCro1 (orf2).
ISBf10 (orf3) is 48% aa similar to ISPre3 (orf3).
ISBf10 (orf2) is 48% aa similar to ISCro1 (orf2).
ISBf10 (orf3) is 48% aa similar to ISPre3 (orf3).
References
1] Kuwahara,T., Yamashita,A., Hirakawa,H., Nakayama,H., Toh,H., Okada,N., Kuhara,S., Hattori,M., Hayashi,T. and Ohnishi,Y. (2004)Proc. Natl. Acad. Sci. U.S.A. 101 (41), 14919-14924.
2] Hattori,M., Yamashita,A., Toh,H., Oshima,K. and Shiba,T. (2004) Direct Submission GenBank.
3] ISfinder annotation (2008).
2] Hattori,M., Yamashita,A., Toh,H., Oshima,K. and Shiba,T. (2004) Direct Submission GenBank.
3] ISfinder annotation (2008).