ISUnCu16
- Family IS66
- Group ISBst12
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | uncultured bacterium | uncultured bacterium |
DNA section
IS Length : 1532 bp
Ends
IR Length : 23/28
IRL : GTAACGGTTCACCCAGACCATGTTGACAGGTAAAAAATGAGACGATCGTC
IRR : GTAACCGTTCACCGGGAGGATGTTGACAACAACTCTTCCGGGAGCGGGTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTAACGGTTC | GAACGGTTAC | 0 | |
GCCGAGGCGA | TCTTGGAC | CGACTGATCC | 8 |
TTCTGGCCCA | GCGCTATC | TGGGTCGCTA | 8 |
TTGACTGCCT | GTTTAGAG | GTTATCAGAA | 8 |
TATCGTACAT | GATTCCGA | GACTATATTG | 8 |
GCCGTTGGCC | GTTCATAG | CAGCAAGCGG | 8 |
DNA sequence
GTAACGGTTCACCCAGACCATGTTGACAGGTAAAAAATGAGACGATCGTCCGGTGATCCACTATTGCCTCTTGCGTGATCCTGTCAGCTTGGCACCATCA
GGGGCATGAGCACATCCATGACCCCACCGGCCGCTTCGCCGCTGACCCTTGAGCAGGCTAATGAACTTATTCAGAAGCTGATGGCGCGCATCGCCGAGCT
AGAAGACCGACTCAACCAGCACAGCGGCAATTCCTCCAAGCCTCCTTCCAGTGATGGGCCAGGCTCACCACCACCTGCTCGGCCGCGCCTTGCCTCAGGT
AAAAAACGCGGCGCCCAACCGGGCCACAAAGGCGCTCGGCGTGAGCGTCATCCAGCGGATGAACGGCTGACGGTAATACCCCATTACCCTGCCAGACAGT
GCGCCTGCTGTGGGGGCAATGTGGTATTTCATGACAAACCCTACCGCGTCCATCAGGTCTTTGACCTGCCCGAGGTCAGCTACAGGGTGACCGAACATCA
GCTCTTTAGCGGCACCTGTTGTCATTGCATCGAGACCAGCCCGGCTCCGTTGCCCGAGACGGTGAGCAGCAGCCAGATGGGGACCAACCTGCTGAGCTAC
ATCGCCCTGCAAAGTGGTCTGTTTCACCAGAGCATTAGCCAGATACAACAACAGCTTAAACAGCACTTCGGCCTGAGCTTCAGTCGCGGCGCCATCAGCG
AGGCGCAAGGGCGAGTCAGTGCCATGTTGACCCCGACGCACCAGGCCATCAAACAACAAGTGCAATCAGCCTCCTGTATCCATGCCGATGAAACACGGCA
TCAGCGTGGTGGTGAGCGGCGCTGGATGTGGCTGGCACTGAGCAAGGTGGCCGTCTGTTTCATGACGGCGTTTGGCCGGGGTCAGGATGCGGCCAAACGG
CTGCTGGGCTCAGAATTGGATGGCGTGTTGGTGACAGACCAGTACGCAGGATACCGTTTCATCGACAGCAGCCAGCGTCAGCTGTGCTGGGCCCATGTAC
TGCGCAATGTGGCAGCGATTGCCGACAGCGGCGAAAAGGTCAATCAGCCCATTGGGGCCCGACTGGTGCTTCTCGCAAATAGCGTATTCAGGGTTCGACA
TGGCTATGAACAAGGCGTGTTGAGCCAGAAACAATATCAGCGAAGGCTTGAGCGATGTCGGCAAAGCTGGCGAAAAGAGCTGGGGCGAGGCAGTTTGCTG
TGCAGCAAGCGCTACCGAGGTCGCTGTCGGCTACTGCTCAAGGATGATGAAATGCTGTGGCGTTTTCTGGAAAACGACGAGATAGCCTTGACCAATAATG
AGGCGGAACGGGCGCTACGCGGGTATGTGCTGTGGCGCAAGGGGAGCTATGGCGTTTGGTCACATCGGGGAGAGCTGTTTCGGCAGCGCATCTTGTCTTT
GATAGAGACAGCCAAGCGGTTAGGTCGCTGTCCGCAGGAATGGCTGAGGGCCGTGGTCAGTGCCTGCATAGAGAAAACAGACTACCCGCTCCCGGAAGAG
TTGTTGTCAACATCCTCCCGGTGAACGGTTAC
GGGGCATGAGCACATCCATGACCCCACCGGCCGCTTCGCCGCTGACCCTTGAGCAGGCTAATGAACTTATTCAGAAGCTGATGGCGCGCATCGCCGAGCT
AGAAGACCGACTCAACCAGCACAGCGGCAATTCCTCCAAGCCTCCTTCCAGTGATGGGCCAGGCTCACCACCACCTGCTCGGCCGCGCCTTGCCTCAGGT
AAAAAACGCGGCGCCCAACCGGGCCACAAAGGCGCTCGGCGTGAGCGTCATCCAGCGGATGAACGGCTGACGGTAATACCCCATTACCCTGCCAGACAGT
GCGCCTGCTGTGGGGGCAATGTGGTATTTCATGACAAACCCTACCGCGTCCATCAGGTCTTTGACCTGCCCGAGGTCAGCTACAGGGTGACCGAACATCA
GCTCTTTAGCGGCACCTGTTGTCATTGCATCGAGACCAGCCCGGCTCCGTTGCCCGAGACGGTGAGCAGCAGCCAGATGGGGACCAACCTGCTGAGCTAC
ATCGCCCTGCAAAGTGGTCTGTTTCACCAGAGCATTAGCCAGATACAACAACAGCTTAAACAGCACTTCGGCCTGAGCTTCAGTCGCGGCGCCATCAGCG
AGGCGCAAGGGCGAGTCAGTGCCATGTTGACCCCGACGCACCAGGCCATCAAACAACAAGTGCAATCAGCCTCCTGTATCCATGCCGATGAAACACGGCA
TCAGCGTGGTGGTGAGCGGCGCTGGATGTGGCTGGCACTGAGCAAGGTGGCCGTCTGTTTCATGACGGCGTTTGGCCGGGGTCAGGATGCGGCCAAACGG
CTGCTGGGCTCAGAATTGGATGGCGTGTTGGTGACAGACCAGTACGCAGGATACCGTTTCATCGACAGCAGCCAGCGTCAGCTGTGCTGGGCCCATGTAC
TGCGCAATGTGGCAGCGATTGCCGACAGCGGCGAAAAGGTCAATCAGCCCATTGGGGCCCGACTGGTGCTTCTCGCAAATAGCGTATTCAGGGTTCGACA
TGGCTATGAACAAGGCGTGTTGAGCCAGAAACAATATCAGCGAAGGCTTGAGCGATGTCGGCAAAGCTGGCGAAAAGAGCTGGGGCGAGGCAGTTTGCTG
TGCAGCAAGCGCTACCGAGGTCGCTGTCGGCTACTGCTCAAGGATGATGAAATGCTGTGGCGTTTTCTGGAAAACGACGAGATAGCCTTGACCAATAATG
AGGCGGAACGGGCGCTACGCGGGTATGTGCTGTGGCGCAAGGGGAGCTATGGCGTTTGGTCACATCGGGGAGAGCTGTTTCGGCAGCGCATCTTGTCTTT
GATAGAGACAGCCAAGCGGTTAGGTCGCTGTCCGCAGGAATGGCTGAGGGCCGTGGTCAGTGCCTGCATAGAGAAAACAGACTACCCGCTCCCGGAAGAG
TTGTTGTCAACATCCTCCCGGTGAACGGTTAC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1419 bp | 472 aa | 106 | 1524 | + | No |
Chemistry : DDE
ORF sequence :
MSTSMTPPAASPLTLEQANELIQKLMARIAELEDRLNQHSGNSSKPPSSDGPGSPPPARPRLASGKKRGAQPGHKGARRERHPADERLTVIPHYPARQCA
CCGGNVVFHDKPYRVHQVFDLPEVSYRVTEHQLFSGTCCHCIETSPAPLPETVSSSQMGTNLLSYIALQSGLFHQSISQIQQQLKQHFGLSFSRGAISEA
QGRVSAMLTPTHQAIKQQVQSASCIHADETRHQRGGERRWMWLALSKVAVCFMTAFGRGQDAAKRLLGSELDGVLVTDQYAGYRFIDSSQRQLCWAHVLR
NVAAIADSGEKVNQPIGARLVLLANSVFRVRHGYEQGVLSQKQYQRRLERCRQSWRKELGRGSLLCSKRYRGRCRLLLKDDEMLWRFLENDEIALTNNEA
ERALRGYVLWRKGSYGVWSHRGELFRQRILSLIETAKRLGRCPQEWLRAVVSACIEKTDYPLPEELLSTSSR
CCGGNVVFHDKPYRVHQVFDLPEVSYRVTEHQLFSGTCCHCIETSPAPLPETVSSSQMGTNLLSYIALQSGLFHQSISQIQQQLKQHFGLSFSRGAISEA
QGRVSAMLTPTHQAIKQQVQSASCIHADETRHQRGGERRWMWLALSKVAVCFMTAFGRGQDAAKRLLGSELDGVLVTDQYAGYRFIDSSQRQLCWAHVLR
NVAAIADSGEKVNQPIGARLVLLANSVFRVRHGYEQGVLSQKQYQRRLERCRQSWRKELGRGSLLCSKRYRGRCRLLLKDDEMLWRFLENDEIALTNNEA
ERALRGYVLWRKGSYGVWSHRGELFRQRILSLIETAKRLGRCPQEWLRAVVSACIEKTDYPLPEELLSTSSR
Blast result :
Comments
ISUnCu16 is 71% aa similar to ISAma4.
References
1] GIRLICH D. (2011) Direct submission