ISBvu4
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009614 | ND | Bacteroides vulgatus | Bacteroides vulgatus ATCC 8482 |
DNA section
IS Length : 2371 bp
Ends
IR Length : 30/32
IRL : GTAAGCATTCGGCCTCGTGCGTGTTTTTTTCCCCTCTTTTCCTTCCTACC
IRR : GTAAGCATTCGGCCTCGTGCGTGTTTCTCTCCAAGTTATAATAAAGTCAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATATTTATAA | AGATATGA | AAGGCATGTA | 8 |
AAAGTCCTGA | AGTTTTCC | CAGCTCGCAT | 8 |
TTACTGAAAA | GGCAGAAG | AACAAAATAC | 8 |
CTTTTTTACC | CTCTGTTA | ATGTAAGTGT | 8 |
TTTTATCACT | AGAGGAATTA | 0 |
DNA sequence
GTAAGCATTCGGCCTCGTGCGTGTTTTTTTCCCCTCTTTTCCTTCCTACCTTTGTGGCTTCATTATAATCTTAATCCATTATGTGTTCAGTATTAGAATT
TTCTAGCGAAAGTGATAAACTACGTACTCTCTTTTCCGAGTCTAACCATTTTGATTCTGTTCTTTTCATATTTGATGGCGGCCGTCGTTATAAAGTTTCG
TCTTCCAGCCTTGGTATCCATTCCCCTCAGGACCTACTTCTCCCTTTAGGTCTTGGCCTTTTCCGCAGCAGTCCTTTTTGGTCTTATTACCTTTATCCCC
AAGGTTGTAATTTCCATAAAGGTATTGACGGTCTTTGTGGCGAGGTCATCCGACACACGGGTTCCTGCGTGTCAGAGCAGAGCTGTCATATTTTTCCCGA
CCGCTCCCGTAGCAGGTTGCATATCCTTTATCGGTGCGACGACGAATACCGTCTGGAATGCCGTCGTCTGAATCGCGGTTCTTTCCTCTTGAAAAAGGAG
GAACGGAAGAAGGATTTTCTTCAAATTTCCTGGAATCGCCTGAATGAGCTGCTTACTGTTAAAAAGTATCGGAAAACAGTTGAAAAGTAATTCCCCATTC
TATAAGCCGTGCAGTTTTATTTTGATAACTTTGCCACATAATAAAAAGTGGATGTACAGTGAAAGAACCGGTTATTACCCTTACTTTGGAAGAGTACGAA
GAGTTGCGCAAGGAACATGAATGCCTTGAAAAGGAGCATGCGGAACTTCAGAGAAAATACGAAGCCTCTTTGCAGGAGTATTCCCGCCAGGTCGAAGAAA
TCTCGGCCTGTACAGCCGTCATTGCCGATCTGAGATGGAAATTGGCTGACTTGACACGCCGTTTATGGGGTAAATCCAGTGAAAAACGTCATCTTCCCGA
AGATGCCGGCCAGCTGAGCATCTGTTTCGAGTCTCCGTCCGATGTCAATGACCCGGTAGCGGAAGAACAGAAAACCGCTGGGAAATCCGTCACATCGGAG
AATGGCTACAATCGTTTCCGTAAAAGCTTCACGAAAAAGATCACTCCCCACGCCCGTAAGCCCATCGACCCGTCCCTTCCCCGTGAGGAGATTATCATTC
CCATGCCGGAAGGCCTTTCGTTGGAGGGAGCGACGAAGCTGGGAGAGGAAGTGAGCGAACAGTATGCTGTCAGTCCCGCCCGATTCTATGTGAAACGCAT
CATCCGTCCTAAATACCGGCTCGCCGACGGTCGTATCATAACCGCTCCCATGCCCGTAATGGCACATCCCCACAGCAATGCCTCGGAAAGTGTACTGGCC
CACATTGCTACCGCCAAATATTACGACCACCTGCCTCTGCACAGACAGCTGGATATCTTTGAGCGTGAAGGAATCCATCTGAGTCCTTCCACCGTAAGTA
ACTGGATGATGGCTGCCGCACAGCGTCTGGAACCAATCTATAATGAACTTCGTGAACTGGTCAAGGACAGCTATTACGTCATGGCCGATGAGACACCCCA
TCCCGTACTTGAAAGCGACCGGCCCGGTGCTCTTCACCGCGGGTATATGTGGAACTTCTATCTGCCCCGGTTCCATACCCCCTTCTTTGAATATCACAAG
GGCCGTGGCAGCAGCGGAATAGACACACTGCTGGCAGGACAGGTCCGGGTGGTACAGAGTGACGGCTTTGCAGTATATGACAAGTTCGACACGCTACCCG
GAAAGTTGCATCTGTGCTGCTGGGCGCACGTCAGGCGCAACTTTGTGGAAGCGGAAGGAAATGACCCTCCCAGAGCAAGGCATGCACTTGGAAAAATAGG
CGGACTGTATGCCGTGGAGGAGAAAATCAGAATGGAACATCTGGAAGGGGAGGCGGTGGTAAAGCTTCGCCGGGAAAAATCGTACCCTATTATCAAAGAA
CTGGAAAAATGGTGCAAGGAGGAATATGGACATACGGTCGATAAATCGCCTATTGCCAAGGCCATGTTCTATATGTACACACGCTTTGAACAACTGTCCG
GATATGTCAATGACGCACAGTTCTGCATTGACAATAATCCGGTGGAGCGTTCTATAAGGCCGTTGACTTTAAACAGAAAAAATACGCTCTTCTCCGGATC
ACATGAGGCAGCACATGCAGCGGCAATTTTCTTTTCACTGATGGGATGCTGTAGGGAAAATAAGGTGAACCCAAAACTATGGATGCAGGACGTGCTGATT
AGGGTACAGGAGAAAGAAAGAGAAGAGAAAAACGATTACACCGATTTACTGCCATTTAATTGGAAAGGATAAACAGGAAAGCTATAATAAAGTCAAAAGC
CAGACATGCTTCGTCTGGCTTTTGACTTTATTATAACTTGGAGAGAAACACGCACGAGGCCGAATGCTTAC
TTCTAGCGAAAGTGATAAACTACGTACTCTCTTTTCCGAGTCTAACCATTTTGATTCTGTTCTTTTCATATTTGATGGCGGCCGTCGTTATAAAGTTTCG
TCTTCCAGCCTTGGTATCCATTCCCCTCAGGACCTACTTCTCCCTTTAGGTCTTGGCCTTTTCCGCAGCAGTCCTTTTTGGTCTTATTACCTTTATCCCC
AAGGTTGTAATTTCCATAAAGGTATTGACGGTCTTTGTGGCGAGGTCATCCGACACACGGGTTCCTGCGTGTCAGAGCAGAGCTGTCATATTTTTCCCGA
CCGCTCCCGTAGCAGGTTGCATATCCTTTATCGGTGCGACGACGAATACCGTCTGGAATGCCGTCGTCTGAATCGCGGTTCTTTCCTCTTGAAAAAGGAG
GAACGGAAGAAGGATTTTCTTCAAATTTCCTGGAATCGCCTGAATGAGCTGCTTACTGTTAAAAAGTATCGGAAAACAGTTGAAAAGTAATTCCCCATTC
TATAAGCCGTGCAGTTTTATTTTGATAACTTTGCCACATAATAAAAAGTGGATGTACAGTGAAAGAACCGGTTATTACCCTTACTTTGGAAGAGTACGAA
GAGTTGCGCAAGGAACATGAATGCCTTGAAAAGGAGCATGCGGAACTTCAGAGAAAATACGAAGCCTCTTTGCAGGAGTATTCCCGCCAGGTCGAAGAAA
TCTCGGCCTGTACAGCCGTCATTGCCGATCTGAGATGGAAATTGGCTGACTTGACACGCCGTTTATGGGGTAAATCCAGTGAAAAACGTCATCTTCCCGA
AGATGCCGGCCAGCTGAGCATCTGTTTCGAGTCTCCGTCCGATGTCAATGACCCGGTAGCGGAAGAACAGAAAACCGCTGGGAAATCCGTCACATCGGAG
AATGGCTACAATCGTTTCCGTAAAAGCTTCACGAAAAAGATCACTCCCCACGCCCGTAAGCCCATCGACCCGTCCCTTCCCCGTGAGGAGATTATCATTC
CCATGCCGGAAGGCCTTTCGTTGGAGGGAGCGACGAAGCTGGGAGAGGAAGTGAGCGAACAGTATGCTGTCAGTCCCGCCCGATTCTATGTGAAACGCAT
CATCCGTCCTAAATACCGGCTCGCCGACGGTCGTATCATAACCGCTCCCATGCCCGTAATGGCACATCCCCACAGCAATGCCTCGGAAAGTGTACTGGCC
CACATTGCTACCGCCAAATATTACGACCACCTGCCTCTGCACAGACAGCTGGATATCTTTGAGCGTGAAGGAATCCATCTGAGTCCTTCCACCGTAAGTA
ACTGGATGATGGCTGCCGCACAGCGTCTGGAACCAATCTATAATGAACTTCGTGAACTGGTCAAGGACAGCTATTACGTCATGGCCGATGAGACACCCCA
TCCCGTACTTGAAAGCGACCGGCCCGGTGCTCTTCACCGCGGGTATATGTGGAACTTCTATCTGCCCCGGTTCCATACCCCCTTCTTTGAATATCACAAG
GGCCGTGGCAGCAGCGGAATAGACACACTGCTGGCAGGACAGGTCCGGGTGGTACAGAGTGACGGCTTTGCAGTATATGACAAGTTCGACACGCTACCCG
GAAAGTTGCATCTGTGCTGCTGGGCGCACGTCAGGCGCAACTTTGTGGAAGCGGAAGGAAATGACCCTCCCAGAGCAAGGCATGCACTTGGAAAAATAGG
CGGACTGTATGCCGTGGAGGAGAAAATCAGAATGGAACATCTGGAAGGGGAGGCGGTGGTAAAGCTTCGCCGGGAAAAATCGTACCCTATTATCAAAGAA
CTGGAAAAATGGTGCAAGGAGGAATATGGACATACGGTCGATAAATCGCCTATTGCCAAGGCCATGTTCTATATGTACACACGCTTTGAACAACTGTCCG
GATATGTCAATGACGCACAGTTCTGCATTGACAATAATCCGGTGGAGCGTTCTATAAGGCCGTTGACTTTAAACAGAAAAAATACGCTCTTCTCCGGATC
ACATGAGGCAGCACATGCAGCGGCAATTTTCTTTTCACTGATGGGATGCTGTAGGGAAAATAAGGTGAACCCAAAACTATGGATGCAGGACGTGCTGATT
AGGGTACAGGAGAAAGAAAGAGAAGAGAAAAACGATTACACCGATTTACTGCCATTTAATTGGAAAGGATAAACAGGAAAGCTATAATAAAGTCAAAAGC
CAGACATGCTTCGTCTGGCTTTTGACTTTATTATAACTTGGAGAGAAACACGCACGAGGCCGAATGCTTAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
510 bp | 169 aa | 81 | 590 | + | No |
AG : IS66 TnpB
ORF sequence :
MCSVLEFSSESDKLRTLFSESNHFDSVLFIFDGGRRYKVSSSSLGIHSPQDLLLPLGLGLFRSSPFWSYYLYPQGCNFHKGIDGLCGEVIRHTGSCVSEQ
SCHIFPDRSRSRLHILYRCDDEYRLECRRLNRGSFLLKKEERKKDFLQISWNRLNELLTVKKYRKTVEK
SCHIFPDRSRSRLHILYRCDDEYRLECRRLNRGSFLLKKEERKKDFLQISWNRLNELLTVKKYRKTVEK
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1614 bp | 537 aa | 659 | 2272 | + | No |
Chemistry : DDE
ORF sequence :
MKEPVITLTLEEYEELRKEHECLEKEHAELQRKYEASLQEYSRQVEEISACTAVIADLRWKLADLTRRLWGKSSEKRHLPEDAGQLSICFESPSDVNDPV
AEEQKTAGKSVTSENGYNRFRKSFTKKITPHARKPIDPSLPREEIIIPMPEGLSLEGATKLGEEVSEQYAVSPARFYVKRIIRPKYRLADGRIITAPMPV
MAHPHSNASESVLAHIATAKYYDHLPLHRQLDIFEREGIHLSPSTVSNWMMAAAQRLEPIYNELRELVKDSYYVMADETPHPVLESDRPGALHRGYMWNF
YLPRFHTPFFEYHKGRGSSGIDTLLAGQVRVVQSDGFAVYDKFDTLPGKLHLCCWAHVRRNFVEAEGNDPPRARHALGKIGGLYAVEEKIRMEHLEGEAV
VKLRREKSYPIIKELEKWCKEEYGHTVDKSPIAKAMFYMYTRFEQLSGYVNDAQFCIDNNPVERSIRPLTLNRKNTLFSGSHEAAHAAAIFFSLMGCCRE
NKVNPKLWMQDVLIRVQEKEREEKNDYTDLLPFNWKG
AEEQKTAGKSVTSENGYNRFRKSFTKKITPHARKPIDPSLPREEIIIPMPEGLSLEGATKLGEEVSEQYAVSPARFYVKRIIRPKYRLADGRIITAPMPV
MAHPHSNASESVLAHIATAKYYDHLPLHRQLDIFEREGIHLSPSTVSNWMMAAAQRLEPIYNELRELVKDSYYVMADETPHPVLESDRPGALHRGYMWNF
YLPRFHTPFFEYHKGRGSSGIDTLLAGQVRVVQSDGFAVYDKFDTLPGKLHLCCWAHVRRNFVEAEGNDPPRARHALGKIGGLYAVEEKIRMEHLEGEAV
VKLRREKSYPIIKELEKWCKEEYGHTVDKSPIAKAMFYMYTRFEQLSGYVNDAQFCIDNNPVERSIRPLTLNRKNTLFSGSHEAAHAAAIFFSLMGCCRE
NKVNPKLWMQDVLIRVQEKEREEKNDYTDLLPFNWKG
Blast result :
Comments
The first orf of ISBvu4 is 48% aa similar to ISBthe5 (orf2), and the second is 52% aa similar to ISBf10 (orf3).
References
1] Xu,J., Mahowald,M.A., Ley,R.E., Lozupone,C.A., Hamady,M., Martens,E.C., Henrissat,B., Coutinho,P.M., Minx,P., Latreille,P., Cordum,H., Van Brunt,A., Kim,K., Fulton,R.S., Fulton,L.A., Clifton,S.W., Wilson,R.K., Knight,R.D. and Gordon,J.I. (2007) (er) PLoS Biol. 5 (7), E156 In press
2] Xu,J., Minx,P., Latreille,P., Cordum,H., Van Brunt,A., Kim,K., Fulton,R., Fulton,L., Chinwalla,A., Clifton,S., Wilson,R., Mahowald,M., Henrissat,B., Martens,E., Ley,R. and Gordon,J. (2005) Direct Submission GenBank.
2] ISfinder annotation (2008).
2] Xu,J., Minx,P., Latreille,P., Cordum,H., Van Brunt,A., Kim,K., Fulton,R., Fulton,L., Chinwalla,A., Clifton,S., Wilson,R., Mahowald,M., Henrissat,B., Martens,E., Ley,R. and Gordon,J. (2005) Direct Submission GenBank.
2] ISfinder annotation (2008).