IS50R
- Family IS4
- Group IS50
Isoform Synonym(s) IS50, IS50L
Accession number | Transposition | Origin | Host |
---|---|---|---|
U15573 | Y | Escherichia coli | Escherichia coli DB729 plasmid pJR67 |
DNA section
IS Length : 1534 bp
Ends
IR Length : 8/9
IRL : CTGACTCTTATACACAAGTAGCGTCCTGAACGGAACCTTTCCCGTTTTCC
IRR : CTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCAGATCCTTGGCGGC
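The 8/9 figure can be checked directly from the two end sequences. The following is a minimal Python sketch, assuming that IRR is listed outside-in (that is, as the reverse complement of the right end of the DNA sequence) and that "8/9" means 8 matching positions over the first 9. The helper names are illustrative and not part of the record.

```python
# Complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def count_matches(a, b, n):
    """Count identical positions over the first n bases of a and b."""
    return sum(x == y for x, y in zip(a[:n], b[:n]))

# End sequences as listed in the record.
IRL = "CTGACTCTTATACACAAGTAGCGTCCTGAACGGAACCTTTCCCGTTTTCC"
IRR = "CTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCAGATCCTTGGCGGC"
# Last 34 bases of the DNA sequence given in the record.
RIGHT_END = "TGGCGCAGGGGATCAAGATCTGATCAAGAGACAG"

ir_matches = count_matches(IRL, IRR, 9)
# Assumption under test: IRR is the right end read outside-in.
irr_is_outside_in = IRR.startswith(reverse_complement(RIGHT_END))

print(f"IR matches over first 9 positions: {ir_matches}/9")
print(f"IRR equals reverse complement of right end: {irr_is_outside_in}")
```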
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
NNNCCACGCT | ATTTAACCC | TGCGCCGNNN | 9 |
NNNCCCGGCC | GCGCTGGAC | GGTAAACNNN | 9 |
NNNCGGGGCT | GGCCGACGC | TATCTGCNNN | 9 |
ACAACCACAA | GCACAGGGT | GATGGAGCAA | 9 |
CACAAGCACA | GGGTGATGG | AGCAAATGCA | 9 |
ATGCAGGACA | ACCACAAGC | ACAGGGTGAT | 9 |
NNNNNNNTAA | GCTTTAATG | CGCNNNNNNN | 9 |
NNNNNNNGCA | GTCAGGCAC | CGTNNNNNNN | 9 |
NNNNNNNTCA | CCCTGGATG | CTGNNNNNNN | 9 |
NNNNNNNCCA | GTCCTGCTC | GCTNNNNNNN | 9 |
NNNNNNNGCC | GCCCAGTCC | TGCNNNNNNN | 9 |
NNTGGCGCAG | GCTATGAAC | AGTGTGTNNN | 9 |
NNNNNNNNNN | GTCTGACGC | NNNNNNNNNN | 9 |
NNNTGGCGCA | GGCTATGAAC | AGTGTGTNNN | 10 |
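As a consistency check, the DR Length column can be recomputed from the Direct repeat column. This is a small Python sketch; the list is copied from the table above, and the variable names are illustrative only.

```python
from collections import Counter

# Direct repeat sequences copied from the insertion site table.
DIRECT_REPEATS = [
    "ATTTAACCC", "GCGCTGGAC", "GGCCGACGC", "GCACAGGGT", "GGGTGATGG",
    "ACCACAAGC", "GCTTTAATG", "GTCAGGCAC", "CCCTGGATG", "GTCCTGCTC",
    "GCCCAGTCC", "GCTATGAAC", "GTCTGACGC", "GGCTATGAAC",
]

# Tally how many repeats have each length.
dr_lengths = Counter(len(dr) for dr in DIRECT_REPEATS)

for length, count in sorted(dr_lengths.items()):
    print(f"{length} bp: {count} insertion site(s)")
```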
DNA sequence
CTGACTCTTATACACAAGTAGCGTCCTGAACGGAACCTTTCCCGTTTTCCAGGATCTGATCTTCCATGTGACCTCCTAACATGGTAACGTTCATGATAAC
TTCTGCTCTTCATCGTGCGGCCGACTGGGCTAAATCTGTGTTCTCTTCGGCGGCGCTGGGTGATCCTCGCCGTACTGCCCGCTTGGTTAACGTCGCCGCC
CAATTGGCAAAATATTCTGGTAAATCAATAACCATCTCATCAGAGGGTAGTGAAGCCATGCAGGAAGGCGCTTACCGATTTATCCGCAATCCCAACGTTT
CTGCCGAGGCGATCAGAAAGGCTGGCGCCATGCAAACAGTCAAGTTGGCTCAGGAGTTTCCCGAACTGCTGGCCATTGAGGACACCACCTCTTTGAGTTA
TCGCCACCAGGTCGCCGAAGAGCTTGGCAAGCTGGGCTCTATTCAGGATAAATCCCGCGGATGGTGGGTTCACTCCGTTCTCTTGCTCGAGGCCACCACA
TTCCGCACCGTAGGATTACTGCATCAGGAGTGGTGGATGCGCCCGGATGACCCTGCCGATGCGGATGAAAAGGAGAGTGGCAAATGGCTGGCAGCGGCCG
CAACTAGCCGGTTACGCATGGGCAGCATGATGAGCAACGTGATTGCGGTCTGTGACCGCGAAGCCGATATTCATGCTTATCTGCAGGACAAACTGGCGCA
TAACGAGCGCTTCGTGGTGCGCTCCAAGCACCCACGCAAGGACGTAGAGTCTGGGTTGTATCTGTACGACCATCTGAAGAACCAACCGGAGTTGGGTGGC
TATCAGATCAGCATTCCGCAAAAGGGCGTGGTGGATAAACGCGGTAAACGTAAAAATCGACCAGCCCGCAAGGCGAGCTTGAGCCTGCGCAGTGGGCGCA
TCACGCTAAAACAGGGGAATATCACGCTCAACGCGGTGCTGGCCGAGGAGATTAACCCGCCCAAGGGTGAGACCCCGTTGAAATGGTTGTTGCTGACCAG
CGAACCGGTCGAGTCGCTAGCCCAAGCCTTGCGCGTCATCGACATTTATACCCATCGCTGGCGGATCGAGGAGTTCCATAAGGCATGGAAAACCGGAGCA
GGAGCCGAGAGGCAACGCATGGAGGAGCCGGATAATCTGGAGCGGATGGTCTCGATCCTCTCGTTTGTTGCGGTCAGGCTGTTACAGCTCAGAGAAAGCT
TCACGCTGCCGCAAGCACTCAGGGCGCAAGGGCTGCTAAAGGAAGCGGAACACGTAGAAAGCCAGTCCGCAGAAACGGTGCTGACCCCGGATGAATGTCA
GCTACTGGGCTATCTGGACAAGGGAAAACGCAAGCGCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATGGCGATAGCTAGACTGGGCGGTTTTATG
GACAGCAAGCGAACCGGAATTGCCAGCTGGGGCGCCCTCTGGGAAGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTTGCCGCCAAGGATCTGA
TGGCGCAGGGGATCAAGATCTGATCAAGAGACAG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1431 bp | 476 aa | 93 | 1523 | + | No |
Chemistry : DDE
ORF sequence :
MITSALHRAADWAKSVFSSAALGDPRRTARLVNVAAQLAKYSGKSITISSEGSEAMQEGAYRFIRNPNVSAEAIRKAGAMQTVKLAQEFPELLAIEDTTS
LSYRHQVAEELGKLGSIQDKSRGWWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADADEKESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDK
LAHNERFVVRSKHPRKDVESGLYLYDHLKNQPELGGYQISIPQKGVVDKRGKRKNRPARKASLSLRSGRITLKQGNITLNAVLAEEINPPKGETPLKWLL
LTSEPVESLAQALRVIDIYTHRWRIEEFHKAWKTGAGAERQRMEEPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVESQSAETVLTPD
ECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSKRTGIASWGALWEGWEALQSKLDGFLAAKDLMAQGIKI
Blast result :
Comments
In the 5.8-kb Tn5 transposon, IS50L and IS50R flank the kanamycin, bleomycin and streptomycin (cryptic in Escherichia coli) resistance genes. Note that the original IS50L and IS50R sequences (Auerswald et al., 1980) have been revised: the original IS50R Accession Number (V00617) has been replaced by U15573 (Ahmed et al., 1995). IS50R produces two proteins: P1 (93-1523), the 476-aa cis-acting transposase (Tnp), and P2 (258-1520), the 421-aa trans-acting inhibitor (Inh). Most IS50R transposition events generate a 9-bp DR. However, 2 of 24 Tn5 insertions into Streptomyces avermitilis generated DRs of other lengths: one of 8 bp and one of 10 bp (Occi et al., 1993).
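The coordinates and protein lengths quoted for P1 and P2 can be reconciled with simple arithmetic. The Python sketch below assumes 1-based inclusive coordinates. It also assumes that the P1 span (93-1523) includes the stop codon while the P2 span (258-1520) does not. That assumption is inferred from the numbers, not stated in the record, and the function name is hypothetical.

```python
def orf_lengths(begin, end, includes_stop):
    """Return (nucleotides, amino acids) for a 1-based inclusive ORF span."""
    nt = end - begin + 1
    if nt % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = nt // 3
    # Assumption: when the span includes the stop codon, it is not
    # counted as an amino acid.
    aa = codons - 1 if includes_stop else codons
    return nt, aa

p1 = orf_lengths(93, 1523, includes_stop=True)    # transposase (Tnp)
p2 = orf_lengths(258, 1520, includes_stop=False)  # inhibitor (Inh)

print(f"P1: {p1[0]} bp, {p1[1]} aa")
print(f"P2: {p2[0]} bp, {p2[1]} aa")
```

Under these assumptions both proteins end at the same codon (1518-1520) and share the stop codon at 1521-1523, so P2 is P1 minus its first 55 residues.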
Found in Tn5.
September 24 2012 : the file of IS50L (isoform of IS50R) was deleted and its information added to the IS50R file.
Previous comments of IS50L file : Note that the original IS50 (L and R) sequences have been revised: the original IS50L Accession Number V00617 has been replaced by U15572 (Ahmed et al., 1995). The sole difference between IS50R and IS50L sequences is a T to G substitution at position 1443, creating a TAA-stop (rendering IS50L).
References