ISBcen5
- Family IS110
- Group IS1111
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| | ND | Burkholderia cenocepacia | Burkholderia cenocepacia J2315 |
DNA section
IS Length : 1425 bp
Ends
IR Length : 13
IRL : aattgaaATGAACGCGTCCCACCCGTGAACGTGAGAGCAGAAAAAGGGGT
IRR : ----tatATGAACGCGTCCCGTTTGCAACAGTTTTCGTGTGTTCAGACAG
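The relationship between the two displayed ends can be checked with a short script. The following is a minimal sketch, assuming that lower-case letters mark nucleotides lying outside the inverted repeat, that `-` is alignment padding, and that IRR is displayed as the reverse complement of the right-hand end of the DNA sequence. The helper names (`split_end`, `reverse_complement`) are illustrative and not part of the record.

```python
# Terminal sequences exactly as printed in this record.
IRL = "aattgaaATGAACGCGTCCCACCCGTGAACGTGAGAGCAGAAAAAGGGGT"
IRR = "----tatATGAACGCGTCCCGTTTGCAACAGTTTTCGTGTGTTCAGACAG"
IR_LENGTH = 13

# Last 25 nt of the DNA sequence given in this record.
RIGHT_END = "GTTGCAAACGGGACGCGTTCATATA"


def split_end(end: str, ir_length: int = IR_LENGTH):
    """Return (number of nt preceding the IR, IR core) for one displayed end."""
    end = end.replace("-", "")                   # drop alignment padding
    offset = len(end) - len(end.lstrip("acgt"))  # leading lower-case nt
    return offset, end[offset:offset + ir_length]


def reverse_complement(dna: str) -> str:
    """Reverse-complement an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


irl_offset, irl_core = split_end(IRL)
irr_offset, irr_core = split_end(IRR)

print(irl_offset, irl_core)
print(irr_offset, irr_core)
print(irl_core == irr_core)

# The displayed IRR should begin with the reverse complement of the right end.
irr_plain = IRR.replace("-", "").upper()
print(irr_plain.startswith(reverse_complement(RIGHT_END)))
```

Under these assumptions the two 13-nt cores are identical, and the offsets correspond to the nucleotides separating each IR from its end of the element.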
Insertion site
| Left flank | Direct repeat | Right flank | DR Length |
|---|---|---|---|
| GACACAGGAG | | GATCACGATG | 0 |
| GTAACAGGAG | | GGCCACAATG | 0 |
| CGAACAGGAG | | GATCACGATG | 0 |
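To see which flank positions are shared by the three listed insertion sites, a per-column comparison is enough. This is a minimal sketch that assumes the first sequence in each row is the left flank and the second is the right flank (the direct repeat being empty, consistent with a DR length of 0). The function name `conserved_positions` is illustrative only.

```python
# Flanking sequences of the three insertion sites listed in this record.
LEFT_FLANKS = ["GACACAGGAG", "GTAACAGGAG", "CGAACAGGAG"]
RIGHT_FLANKS = ["GATCACGATG", "GGCCACAATG", "GATCACGATG"]


def conserved_positions(seqs):
    """Keep a base where every sequence agrees; print '.' elsewhere."""
    return "".join(
        column[0] if len(set(column)) == 1 else "."
        for column in zip(*seqs)
    )


left_mask = conserved_positions(LEFT_FLANKS)
right_mask = conserved_positions(RIGHT_FLANKS)

print(left_mask)
print(right_mask)
```

With only three sites, the masks indicate shared positions among these examples rather than a statistically supported target consensus.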
DNA sequence
AATTGAAATGAACGCGTCCCACCCGTGAACGTGAGAGCAGAAAAAGGGGTGGACCCTCACGGGCCAACGGCATCGATGTGCCCAAGTGGAAAATGCACGC
ATTTCCCGGCAACCGGAGGATCCACGATGAACAGTATGGCCGTAGGAGTCGACGTCGCCAAGCAGGTTTTCCAAGTGCACTACGTTGATAGGGAAACCGG
CGGGATCGTGAATAAGGCAATCAAGCGGGCGAAATTCCTTGAGTTCTTCGCGAACCGCGCGGCCTGTCTTATCGGGATGGAAGCGTGTGGCGGAGCCCAC
CATTGGGCGCGGCAATTGACGCAGATGGGTCACGAGGTCAGGCTGATGCCGGCCGAGTTCGTGAAGGCGTTCAACATCCGCAACAAGAACGATGCGGCGG
ACGCACGGGCGATCTGGCTGGCGGTACAGCAACCCGGCAAGCCGGTGGCGGTGAAGACCGAGATGCAGCAGGCGATGGTAGCACTGCACCGGATGCGCGA
GCAGTTGGTGAAGTTCCGCACGATGCAGGTCAACGGATTGCGAGGTTTGCTGACGGAATACGGGGAGGTAATGAGCAAGGGGCGGGCGAAGCTGGACAAG
GAAATACCGGTCGTGCTTGGCCGAATCGCGGAGCGCTTGCCAGCGGCGCTAATCGATACGTTGCGCGAGCAATGGAACGGGTTGGCAAAGCTCGACGAGC
AGATCGCGGAAATCGAACGCCGGATGCGCGAATGGAAGAAGGAAGACAAGGCGGTGAAGGCGATTAGCGAGATACCGGGCGTAGGTTTGCTGACGGCCAC
CGCAGCGGTAGCAATGATGGGCGACCCGAAGGCGTTCAGCTCGGGACGAGAGTTCGCGGCGTGGGCCGGGTTAGTGCCGAAGCAGACCGGCTCGGGCGGT
AAGGTGAACTTACACGGGATCAGCAAGAGGGGCGACATGTATCTGCGCACGCTACTGATTCACGGGGCACGAAGCGTGCTGACGCATGCGAAAGAGCCCG
GTGAGTGGATCGAGCAGATGAAGAAGCGGCGACCGCCGAATGTGGTGATCGTCGCACTGGCCAACAAGATGGCACGGACGATCTGGGCTGTACTGGCCCA
TGACCGGCCGTACCAAAAGGGTTACGTGAGCGTGAAGCCGGCCTGATCGGTGAATGCCACGTTGAAGATCAACGTTTAACACTGGGTGAACGTCGAAAGG
TTGCGCAGGAATCGAGTGTGATGACAAGCCAGGTAGGACCGGGACTCGCTAAACCTGAATCGTGGTAAGAGCTTCGAGCTCGCCAGGAGAATGAGGCGCG
AGTCAGCGAATTTCATAGGGGCCCGCAGCGGCAGTACTGGCTGCAACAAGGCCGGATATAGAGCTGCAGCCCATCCTGTCTGTCTGAACACACGAAAACT
GTTGCAAACGGGACGCGTTCATATA
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1020 | 339 | 127 | 1146 | + | No |
Chemistry : DEDD
ORF sequence :
MNSMAVGVDVAKQVFQVHYVDRETGGIVNKAIKRAKFLEFFANRAACLIGMEACGGAHHWARQLTQMGHEVRLMPAEFVKAFNIRNKNDAADARAIWLAV
QQPGKPVAVKTEMQQAMVALHRMREQLVKFRTMQVNGLRGLLTEYGEVMSKGRAKLDKEIPVVLGRIAERLPAALIDTLREQWNGLAKLDEQIAEIERRM
REWKKEDKAVKAISEIPGVGLLTATAAVAMMGDPKAFSSGREFAAWAGLVPKQTGSGGKVNLHGISKRGDMYLRTLLIHGARSVLTHAKEPGEWIEQMKK
RRPPNVVIVALANKMARTIWAVLAHDRPYQKGYVSVKPA
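The ORF coordinates can be cross-checked against the DNA and protein sequences. The sketch below assumes 1-based inclusive coordinates on the plus strand and the standard genetic code. It uses only the first 200 nt of the DNA sequence from this record and translates from position 127 to the end of that excerpt. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 200 nt of the DNA sequence given in this record.
SEQ_HEAD = (
    "AATTGAAATGAACGCGTCCCACCCGTGAACGTGAGAGCAGAAAAAGGGGTGGACCCTCACGGGCCAACGGCATCGATGTGCCCAAGTGGAAAATGCACGC"
    "ATTTCCCGGCAACCGGAGGATCCACGATGAACAGTATGGCCGTAGGAGTCGACGTCGCCAAGCAGGTTTTCCAAGTGCACTACGTTGATAGGGAAACCGG"
)

ORF_BEGIN, ORF_END = 127, 1146  # 1-based, inclusive, as listed for ORF 1


def translate(dna: str) -> str:
    """Translate complete codons only; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


orf_nt = ORF_END - ORF_BEGIN + 1  # nucleotides, stop codon included
orf_aa = orf_nt // 3 - 1          # residues, stop codon excluded
n_terminus = translate(SEQ_HEAD[ORF_BEGIN - 1:])

print(orf_nt, orf_aa)
print(n_terminus)
```

The translated N-terminus can then be compared with the start of the ORF sequence above (`MNSMAVGVDVAKQ...`).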
Blast result :
Comments
No uninterrupted target sequence was found, so the ends of the IS have been defined by analogy with other IS1111 family elements. In the sequence given above, 7 nt separate IRL from the left-hand end of the element and 3 nt separate IRR from the right-hand end. Unlike in most IS1111 family elements, the first nt of the above sequence does not match the nt found adjacent to the left-hand end of the element.
The transposase protein is 85% identical to that of ISBfun2 and 40% identical to that of IS1111.
References
[1] The Wellcome Trust Sanger Institute, http://www.sanger.ac.uk
[2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384