ISBcen4
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Burkholderia cenocepacia | Burkholderia cenocepacia J2315 |
DNA section
IS Length : 1349 bp
Ends
IR Length : 13
IRL : AATGGTATGGACGCCTCCTCATCGCATAATGAGGCGTCGTTCAACCAGGA
IRR : ---TATATGGACGCCTCCGGCTTGCCAAGGTTTGTTGCTCTGTCGAACAG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCTCGGCAAA | AAGACCATGG | 0 |
DNA sequence
AATGGTATGGACGCCTCCTCATCGCATAATGAGGCGTCGTTCAACCAGGAGAACGCAAATGGAACCGACAGTCGTAGGCGTAGACATCGCCAAGCGGGTC
TTCCAACTGCACTGGATCGCTGCGGACACCGGCGAGGTGTTCGACCGGCAGCTCCGCCGCAGTGATTTTCTCGAGCACTTCGCCAATCGTGAGCCCTGTC
TCATCGGCATGGAGGCCTGCGGAGGCTCACAGCACTGGGCACGCCGTCTCACCGAGCTCGGCCATCAGGTCAAATTGATGCCAGGCAAACTGGTCAAAGC
GTTCGTGACAGGAAACAAGAACGATGTCGCCGATGCGCGCGCAATCTGGGCAGCTGCACGACACCCGGGCGTGAAAGCCGTCCCGGTGAAGACCGAGGCC
CAGCAAGCGGTTCTGGCACTGCATCGTATTCGACAGCAGTTGGTTATATTTCGGCGCTCTCAATCTAACTGCCTGAGAGGCTTGCTCGGCGAGTACGGAG
AGGTCATGGGCGTGGGACGCGCAGCAATGAATCGCGCCATGCCCGACCTGCTGCTACGCCTCGAATTGCGCCTGCCGCGAGTCTTGATCGATTCCCTGCG
GGAACAGTGGCAACGCCTTGTCGACATCGACAAGCAGATAACCCTCATCGAGCAGCGACTCCGCGCGTGGCTGCATGAAAACCCGGCCTGCAGAACGATT
GCCGAAATCCCCGGTGTTGGGTTCCTGACCGCCACTGCTGCCGTAGCAGCTATGGGCGATCCCAAAGCGTTTCGGTCCGGCCGCGAATTTGCAGCATGGC
TGGGTCTCGTCCCTGCGCAGTTTGGAACTGGTGGCAAGGTCCGGCTGCTCGGCATCAGCAAGAGAGGCGATCGATACCTGCGCACGCTGATGATTCATGG
CGCACGGTCCGTCATGAAGCACGCAAAAGACCCGGGTTCCTGGGCAGTGCAGTTGAGCATGCGGCGTCCGCTCAATGTCGTAGTCGTTGCACTGGCCAAC
AAGATCGCGCGGACGATCTGGGCTCTGCTAGCTCATGGCCGCTCATACTGCGAGAGCTATCCGCCTCCAGTTACTGCAGCATGATGTGTGTCGAGATCGT
ATCGCGTTCTATCACCGATCTGTAGTCCAAGGTTGCGCAAGGTAGTGTTGTAATGGCAAACAGGTAAGACCGGGACTCATCAAACCTGCATCGTCTTCTG
GACATCACGGTCCGTGGCAAAAATGAGGCATGAGTCAGCGGATTCCATTGGGGCCAGCGGGCTTCGACCTGCATGGGCCGGATATAAAACCGCAGCCTGT
ATCTGTTCGACAGAGCAACAAACCTTGGCAAGCCGGAGGCGTCCATATA
TTCCAACTGCACTGGATCGCTGCGGACACCGGCGAGGTGTTCGACCGGCAGCTCCGCCGCAGTGATTTTCTCGAGCACTTCGCCAATCGTGAGCCCTGTC
TCATCGGCATGGAGGCCTGCGGAGGCTCACAGCACTGGGCACGCCGTCTCACCGAGCTCGGCCATCAGGTCAAATTGATGCCAGGCAAACTGGTCAAAGC
GTTCGTGACAGGAAACAAGAACGATGTCGCCGATGCGCGCGCAATCTGGGCAGCTGCACGACACCCGGGCGTGAAAGCCGTCCCGGTGAAGACCGAGGCC
CAGCAAGCGGTTCTGGCACTGCATCGTATTCGACAGCAGTTGGTTATATTTCGGCGCTCTCAATCTAACTGCCTGAGAGGCTTGCTCGGCGAGTACGGAG
AGGTCATGGGCGTGGGACGCGCAGCAATGAATCGCGCCATGCCCGACCTGCTGCTACGCCTCGAATTGCGCCTGCCGCGAGTCTTGATCGATTCCCTGCG
GGAACAGTGGCAACGCCTTGTCGACATCGACAAGCAGATAACCCTCATCGAGCAGCGACTCCGCGCGTGGCTGCATGAAAACCCGGCCTGCAGAACGATT
GCCGAAATCCCCGGTGTTGGGTTCCTGACCGCCACTGCTGCCGTAGCAGCTATGGGCGATCCCAAAGCGTTTCGGTCCGGCCGCGAATTTGCAGCATGGC
TGGGTCTCGTCCCTGCGCAGTTTGGAACTGGTGGCAAGGTCCGGCTGCTCGGCATCAGCAAGAGAGGCGATCGATACCTGCGCACGCTGATGATTCATGG
CGCACGGTCCGTCATGAAGCACGCAAAAGACCCGGGTTCCTGGGCAGTGCAGTTGAGCATGCGGCGTCCGCTCAATGTCGTAGTCGTTGCACTGGCCAAC
AAGATCGCGCGGACGATCTGGGCTCTGCTAGCTCATGGCCGCTCATACTGCGAGAGCTATCCGCCTCCAGTTACTGCAGCATGATGTGTGTCGAGATCGT
ATCGCGTTCTATCACCGATCTGTAGTCCAAGGTTGCGCAAGGTAGTGTTGTAATGGCAAACAGGTAAGACCGGGACTCATCAAACCTGCATCGTCTTCTG
GACATCACGGTCCGTGGCAAAAATGAGGCATGAGTCAGCGGATTCCATTGGGGCCAGCGGGCTTCGACCTGCATGGGCCGGATATAAAACCGCAGCCTGT
ATCTGTTCGACAGAGCAACAAACCTTGGCAAGCCGGAGGCGTCCATATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1026 bp | 341 aa | 60 | 1085 | + | No |
Chemistry : DEDD
ORF sequence :
MEPTVVGVDIAKRVFQLHWIAADTGEVFDRQLRRSDFLEHFANREPCLIGMEACGGSQHWARRLTELGHQVKLMPGKLVKAFVTGNKNDVADARAIWAAA
RHPGVKAVPVKTEAQQAVLALHRIRQQLVIFRRSQSNCLRGLLGEYGEVMGVGRAAMNRAMPDLLLRLELRLPRVLIDSLREQWQRLVDIDKQITLIEQR
LRAWLHENPACRTIAEIPGVGFLTATAAVAAMGDPKAFRSGREFAAWLGLVPAQFGTGGKVRLLGISKRGDRYLRTLMIHGARSVMKHAKDPGSWAVQLS
MRRPLNVVVVALANKIARTIWALLAHGRSYCESYPPPVTAA
RHPGVKAVPVKTEAQQAVLALHRIRQQLVIFRRSQSNCLRGLLGEYGEVMGVGRAAMNRAMPDLLLRLELRLPRVLIDSLREQWQRLVDIDKQITLIEQR
LRAWLHENPACRTIAEIPGVGFLTATAAVAAMGDPKAFRSGREFAAWLGLVPAQFGTGGKVRLLGISKRGDRYLRTLMIHGARSVMKHAKDPGSWAVQLS
MRRPLNVVVVALANKIARTIWALLAHGRSYCESYPPPVTAA
Blast result :
Comments
The transposase protein is 39% identical to that of IS1111.
Updated by Helena Seth-Smith (January 2006) :
ISBcen4 has been thoroughly annotated, and has been found to have another IS element (ISBcen6) inserted within it.
There are 13bp perfect inverted repeats: TATGGACGCCTCC. These IRs are not at the termini of the IS: this has previously been described for other IS1111 family elements.
There are one complete and one partial copy of ISBcen4. The incomplete version is due to a duplication of a segment, not an independant insertion.
Updated by Helena Seth-Smith (January 2006) :
ISBcen4 has been thoroughly annotated, and has been found to have another IS element (ISBcen6) inserted within it.
There are 13bp perfect inverted repeats: TATGGACGCCTCC. These IRs are not at the termini of the IS: this has previously been described for other IS1111 family elements.
There are one complete and one partial copy of ISBcen4. The incomplete version is due to a duplication of a segment, not an independant insertion.
References
1] The Welcome Trust Sanger Institute http://www.sanger.ac.uk
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
3] Seth-Smith Helena (2006) Direct submission
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
3] Seth-Smith Helena (2006) Direct submission