ISBcen14
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_011001 | ND | Burkholderia cenocepacia | Burkholderia cenocepacia J2315 |
DNA section
IS Length : 2516 bp
Ends
IR Length : 18/22
IRL : GTAAGCACGAGACCAGTACCGTCTGTGAACGCTCGGTGACGAGTGCCATG
IRR : GTAAGCGCCCGAGCAGTACCGTAGTGCAGGTTGTTGACCGCGGAGCTCCG
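The 18/22 figure can be reproduced from the two ends as listed. IRR is shown as the reverse complement of the right end of the element, so IRL and IRR align position by position. The following is a minimal Python sketch, assuming the comparison covers the first 22 positions of each end; the helper names are illustrative and not part of the record.

```python
# Terminal sequences as listed in this record.
IRL = "GTAAGCACGAGACCAGTACCGTCTGTGAACGCTCGGTGACGAGTGCCATG"
IRR = "GTAAGCGCCCGAGCAGTACCGTAGTGCAGGTTGTTGACCGCGGAGCTCCG"
# Last 16 bases of the DNA sequence (final line of the sequence block).
RIGHT_END = "CTGCTCGGGCGCTTAC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


# Assumption: the IR spans the first 22 positions of each end.
matches = count_matches(IRL, IRR, 22)
print(f"IR identity: {matches}/22")

# IRR should equal the reverse complement of the element's right terminus.
print(reverse_complement(RIGHT_END) == IRR[:16])
```

The four mismatches fall at positions 7, 9, 10 and 13 of the 22-position window.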
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGCAAAGGCA | TCCTGAAT | CACCCGATCG | 8 |
CGTTGACCTC | GGTCGGCA | CGACTTCAGG | 8 |
CGGCGGCGCC | GGTCATGC | CGGCGCCGGA | 8 |
AGTAGTGCCG | GGACTTGT | TCGGCGTCGA | 8 |
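The table above can be checked for internal consistency. This short Python sketch copies the four rows verbatim and confirms that each listed DR length matches the length of the repeat string; the function name and tuple layout are assumptions made for illustration.

```python
# Rows copied from the insertion-site table:
# (left flank, direct repeat, right flank, listed DR length).
INSERTION_SITES = [
    ("GGCAAAGGCA", "TCCTGAAT", "CACCCGATCG", 8),
    ("CGTTGACCTC", "GGTCGGCA", "CGACTTCAGG", 8),
    ("CGGCGGCGCC", "GGTCATGC", "CGGCGCCGGA", 8),
    ("AGTAGTGCCG", "GGACTTGT", "TCGGCGTCGA", 8),
]


def dr_length_mismatches(rows):
    """Return the rows whose listed DR length differs from len(direct repeat)."""
    return [row for row in rows if len(row[1]) != row[3]]


bad_rows = dr_length_mismatches(INSERTION_SITES)
distinct_repeats = {row[1] for row in INSERTION_SITES}

print("rows with inconsistent DR length:", bad_rows)
print("distinct direct repeats:", len(distinct_repeats))
```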
DNA sequence
GTAAGCACGAGACCAGTACCGTCTGTGAACGCTCGGTGACGAGTGCCATGATGTCCTGTGAAGGACACGAATGACCGGTCGTGTCCACGATAGGAGTTGT
GGACAGAGTGACTCAGACAGTCGAGATCCCGCCGAAACGGCAAACTCGCCGACATCCGACGGAATGGAAGCGGACCATCGTCGCGCTGACATTCGAGCCC
GGCGCGTCGGTCGCCCGTGTTGCTCGCGAGAACGGCATCAATGCCAATCAGGTGTGGGCATGGCGCCGCCTCCATGCGCAAGGCCTGCTAACGGACGATG
CGATTCCAGACGCAATGCTGCCGGTTGTCGTCAACGAACCGTCTCAGCCGTCGATGGCGCTCGAGGTGCCGACTGAGACCGACAACATACCGTCCGGCTC
GATCCAGATTCAGCATGGCAAGACGTCGATTCGCATCGAAGGCGCGCCGGACCCCGACGTGCTGCGTTCTGTTCTCGATCGCATTCTACGATGATCGGTC
TGCCTCAAAACACGCGGATCTGGATCGCGGCGGGCGTCACCGATATGCGATGCGGCTTCAATTCGTTGGCCGCGAAGGTACAGACGGTGCTGGAGAAAGA
TCCGTTCTCTGGACACGTGTTCGTGTTCCGGGGCAAGCGCGGTGACCTGCTGAAGTGCTTGTATTGGAGCGACGGCGGGCTATGTTTATTGGCGAAGCGG
CTCGAAAAAGGACGCTTTGCTTGGCCCCGTGCCGACTCTGGTGTGGTCGCGCTGACAACCGCGCAGCTGTCGTTGTTGCTCGAGGGCTTCGATTGGCGAC
AGCCGGTCGAAGCCGTGCGTCCACGTAGCGCGCTGTAAACCATTGTTAACTGGCGTTCGCGTGATCGCGGGCAGTCGTAAACTGTCCTCATGGAATCAAC
CTCGACCGCGCTGCCCGACGACATCAACGCTCTGAGAGCGTTGCTGCTTGAGCGTGATGCTCAGGTAGCTGAGCTGCGAAAGCAGCTCTCGTCGCGGGCT
CTCGAAATCGAGCATCTGAAGCTCACGATCGCGAAACTGCGCCGAATGCAGTTCGGCCGGAAATCGGAGAAACTAGATCTCCAGATCGAACAGCTCGAAT
TGCGGCTGGAGGATCTGCAGGCTGACGAAGGCGCCGCCGATGCGTCTGCCGCTCCCGAGGCAAAGCGGCCGCGCCGCGAAGGTGCGAGCCGCAAGCCGCT
ACCCGGACATCTTGAACGCGAAGAGCGCGTCCATCTGCCCGCCGATGATGACTGCCCCGATTGCGGTGGGCAGCTCAAGCCGTTGGGCGAAGACATCGCG
GAGCAACTCGAATACGTGCGTGCGCACTTCCGTGTGATCCGTCATCGCCGCCCGAAGCTGGCCTGCGCGCGCTGTGACCGCATCGTGCAGGCCGCGGCGC
CGAGCCGCCCAATCGATCGGGGCATCCCGGGTCCGGCGCTGCTCGCCCACATCGCCGTGTCGAAGTTCGCATACCACCTCCCGCTGCACCGCCAGGCGGT
GATGTACGCGCGTGACGGCGTCGAGATCGATCCCGGCGCGATGGGCTACTGGATGGGTAGCATCACGGCGCTGCTCGCACCGTTGGTCGACGCCGTGCGC
CGCTACACGTTGGCCTGCGGTAAGGTGCATGCGGACGACACTCCGCTACCGGTGCTGGTGCCGGGCAACGGTCGTACGAAGACAGGACGCCTCTGGGTCT
ACGTGCGCGATGATCGGCAGAGTGGCTCCGACGAACCGCCGGCGGCCTGGTTCGCCTACACGCCTGATCGCCGTGGTGAGCATCCGCAGCGACATCTCGC
TGACTTCGCCGGCGTGCTGCAGGCTGACGCGTTCGCTGGCTATGCCGAGCTGTATCTCGACGGCCGCGTCCAGGAAGCCGCATGTATGGCTCACGCGCGC
CGGAAGATCCACGACCTGCATGCGGTGCGCCCCAACGCCGTGACGGAGGAGGCGCTGCGGCGGATCGGTGCGCTCTACAAGATCGAAGAGCAGATCCGCG
GCAAGCCGCCCGACGAACGGCGCAGTGTGCGCCAAGCGCGGGCGGTGCCGCTACTCGACGACATGAAGCGCTGGTTCGAAGCGACGCTTGCCACGCTCTC
TGCAAAATCTGACACGACCAAGGCGATTCGGTACGCGCTGAATCGCTGGCCGGCGCTCGTCTACTACTGCAGCGATGGGTGTACGGAGATCGACAACCTG
ATCGCCGAGCGAGCATTGCGAGGCGTCGCTCTCGGACGGCGGAACTATCTGTTCGCCGGCGCCGACTCTGGCGGCGAACGTGCCGCCGCGATGTACAGCC
TGATCGGCACGGCACGTCTGAACGGCCTCGATCCCGAGGCGTATCTCGCCTACGTGCTCGAGCGCATCGCCGACCATCCGGCCAACCGGGTCGACGAACT
GCTCCCGTGGAACGTCGCACCATCGCTGCCGCCTACTGCTCGCGTCGAGCCCATCCGATAGCGTCGCGGAGCTCCGCGGTCAACAACCTGCACTACGGTA
CTGCTCGGGCGCTTAC
Protein section
ORF number : 3
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
396 bp | 131 aa | 99 | 494 | + | No |
AG : IS66 TnpA
ORF sequence :
VDRVTQTVEIPPKRQTRRHPTEWKRTIVALTFEPGASVARVARENGINANQVWAWRRLHAQGLLTDDAIPDAMLPVVVNEPSQPSMALEVPTETDNIPSG
SIQIQHGKTSIRIEGAPDPDVLRSVLDRILR
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 491 | 838 | + | No |
AG : IS66 TnpB
ORF sequence :
MIGLPQNTRIWIAAGVTDMRCGFNSLAAKVQTVLEKDPFSGHVFVFRGKRGDLLKCLYWSDGGLCLLAKRLEKGRFAWPRADSGVVALTTAQLSLLLEGF
DWRQPVEAVRPRSAL
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1572 bp | 523 aa | 890 | 2461 | + | No |
Chemistry : DDE
ORF sequence :
MESTSTALPDDINALRALLLERDAQVAELRKQLSSRALEIEHLKLTIAKLRRMQFGRKSEKLDLQIEQLELRLEDLQADEGAADASAAPEAKRPRREGAS
RKPLPGHLEREERVHLPADDDCPDCGGQLKPLGEDIAEQLEYVRAHFRVIRHRRPKLACARCDRIVQAAAPSRPIDRGIPGPALLAHIAVSKFAYHLPLH
RQAVMYARDGVEIDPGAMGYWMGSITALLAPLVDAVRRYTLACGKVHADDTPLPVLVPGNGRTKTGRLWVYVRDDRQSGSDEPPAAWFAYTPDRRGEHPQ
RHLADFAGVLQADAFAGYAELYLDGRVQEAACMAHARRKIHDLHAVRPNAVTEEALRRIGALYKIEEQIRGKPPDERRSVRQARAVPLLDDMKRWFEATL
ATLSAKSDTTKAIRYALNRWPALVYYCSDGCTEIDNLIAERALRGVALGRRNYLFAGADSGGERAAAMYSLIGTARLNGLDPEAYLAYVLERIADHPANR
VDELLPWNVAPSLPPTARVEPIR
Blast result :
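The ORF coordinates in the Protein section can be cross-checked with simple arithmetic. This minimal Python sketch assumes that Begin and End are 1-based, inclusive positions and that End includes the stop codon, which is consistent with the bp/aa pairs listed for each ORF. The dictionary layout and function names are illustrative only.

```python
IS_LENGTH = 2516  # bp, from the DNA section

# Values copied from the three ORF tables.
ORFS = {
    "ORF 1": {"begin": 99, "end": 494, "bp": 396, "aa": 131},
    "ORF 2": {"begin": 491, "end": 838, "bp": 348, "aa": 115},
    "ORF 3": {"begin": 890, "end": 2461, "bp": 1572, "aa": 523},
}


def span_bp(begin: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return end - begin + 1


def protein_aa(bp: int) -> int:
    """Residue count, assuming the span ends with one stop codon."""
    return bp // 3 - 1


def overlap_bp(a: dict, b: dict) -> int:
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a["end"], b["end"]) - max(a["begin"], b["begin"]) + 1)


for name, orf in ORFS.items():
    bp = span_bp(orf["begin"], orf["end"])
    consistent = (
        bp == orf["bp"]
        and bp % 3 == 0
        and protein_aa(bp) == orf["aa"]
        and orf["end"] <= IS_LENGTH
    )
    print(name, bp, "bp", protein_aa(bp), "aa", "consistent" if consistent else "CHECK")

print("ORF 1 / ORF 2 overlap:", overlap_bp(ORFS["ORF 1"], ORFS["ORF 2"]), "bp")
print("ORF 2 / ORF 3 overlap:", overlap_bp(ORFS["ORF 2"], ORFS["ORF 3"]), "bp")
```

With the listed coordinates, ORF 1 and ORF 2 share positions 491 to 494, while ORF 2 and ORF 3 do not overlap.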
Comments
4 copies within J2315. Ends defined by inverted repeats, direct repeats and comparison of multiple insertion events.
ISBcen14 is 58% (ORF1), 80% (ORF2) and 71% (ORF3) similar to IS883 at the amino acid level.
References
[1] The Wellcome Trust Sanger Institute. http://www.sanger.ac.uk
[2] Seth-Smith, H. (2006) Direct submission.