Transposon
Name: Tn5053
Family: Tn402
Evidence of Transposition: Yes
 Host     

Host Organism:Xanthomonas sp. W17 Molecular Source:chromosome
Place of Origin:Kirgizia, former URSS Date of Isolation:1993
Other Geographic Information:Khaidarkan mercury mine

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  8447 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCGTTTTC AGAAGACGAC CGCACCATCT GACTGGATGT AACGCCTGGT GTGCATACGG CTCCTGACAG CCCAATATCA GGAGTCGTCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTTTG GGCATGACAT CAGACACTCC ACCGATTGCC GCGCAAGGCG TGGCCACCCT GCCCGACGAG 200
GCATGGGCGC AAGCCCGGCA CCGGACGGAA ATCATCGGGC CACTGGCAGC GCTTGAGGTG GTCGGGCATG AAGCCGCCGA TGAGGCAGCC CAAGCGCTGG 300
GCCTGTCCCG GCGACAGGTA TATGTCCTGA TCCGTCGCGC CCGGCAGGGT ACTGGCCTGG TAACAGACCT GACGCCCGGC CGATCCGGCG GCGGCAAAGG 400
CAAGGGGCGC TTGCCGGAAC CGGTCGAGCG CATCATCCGC GAGCTGCTGC AAAAGCGCTT CCTGACCAAG CAGAAACGCA GCCTGGCGGC GTTCCACCGC 500
GAAGTCGCGC AGGCGTGCAA AACCCAGAAG CTGCCGGTGC CGGCGCGCAA CACCGTGGCC CAGCGGATTG CCGGACTACA CCCGGCGAAA ATAGCCCGCA 600
GCCGGGGCGG GCAGGACGCT GCCCGTCCCT TGCAAGGCGC GGGTGGCATT CCGCCCGAAG TCACCATGCC GCTGGAACAG GTGCAGATCG ACCACACCGT 700
CATCGACCTG ATCGTGGTCG ACGAGCGCGA CCGGCAACCG ATTGGCCGCC CATATTTGAC CCTCGCCATC GACGTGTTCA CGCGCTGCGT ACTCGGCATG 800
GTGGTCACGC TGGAAGCGCC GTCCGCCGTC TCGGTCGGCC TATGCCTCGC GCATGCCGCC TGCGACAAGC GGCCCTGGCT GGAAGGGCTG AGTGTGGAAA 900
TGGACTGGCC GATGAGCGGC AAGCCCAGGC TGCTCTATCT GGACAACGCG GCCGAGTTCA AAAGCGAAGC GCTGCGCCGT GGCTGCGAAC AGCATGGCAT 1000
CCGGCTGGAC TATCGCCCAC CAGGCCAGCC GCACTACGGC GGCATCGTGG AACGGATCAT CGGCACGGCG ATGCAGATGA TCCACGACGA ATTACCGGGG 1100
ACGACCTTCT CCAATCCCGG CCAGCGCGGC GAGTACGATT CCGAGAAGAT GGCCACCCTG ACGCTGCGCG AGCTGGAGCG CTGGCTCGCG TTGGCGGTAG 1200
GCACCTATCA CGGCTCCGTG CACAACGGCC TGCTCCAGCC GCCGGCCGCG CGCTGGGCCG AGGCCGTGGA GCGCGTTGGC GTCCCGGCCG TCGTTACCCG 1300
CCCCACCGCG TTTTTGGTCG ATTTCCTGCC GGTGATCCGC CGCACCCTGA CCCGCACCGG CTTTGTCATC GACCACATCC ACTACTACGC CGACGCCCTC 1400
AAGCCGTGGA TTGCCCGGCG CGAGCGCTTG CCCGCCTTCC TGATCCGGCG CGATCCGCGC GACATCAGCC GCATCTGGGT ACTGGAACCG GAAGGTCAGC 1500
ACTATCTGGA GATCCACTAC CGCACCTTGT CCCATCCGGC CGTCACCCTC TGGGAACAAC GCCAGGCGCT GGCCAAATTG CGTCAGCTCG GGCGCGAGCA 1600
GGTGGACGAG TCGGCGCTGT TCCGCATGAT CGGGCAGATG CGCGAGATCG TGACCACCGC CCAGAAGGCC ACGCGCAAGG CGCGGCGCGA CGCTGATCGC 1700
CGCCAGCACC TCAAGACGTC GGAGCCACCG GCCAAGCCCA TACCGCCGAA TGTGGACATG GCTGACCCGC AGGCAGACAA CCTGCCGCCG GCCAAACCGT 1800
TCGATCAGAT CGAGGAGTGG TAGCCGTGGA CGAATATCCC GTCATTGACC TGTCCCACCT GCTGCCAGCG GCACAGGGTT TGGCCAGGCT GCCGGCAGAC 1900
GAGCGCATCC AGCGCATTCG CGCCGACCGC TGGATCGGCT ACCCGCGCGC GGTCGAGGCG CTGAACCGGC TGGAAACTCT GTATGCGTGG CCGAACAAGC 2000
AACGCATGCC AAACCTGCTG CTGGTCGGCC CCACCAACAA CGGCAAGTCG ATGATCGTCG AGAAATTCCG GCGGGCGCAC CCGGTCGGCA CCGACGCTGA 2100
CCAAGAACAT ATCCCGGTGC TGGTCGTGCA GATGCCGTCA GAGCCATCGG TGATCCGCTT CTATGTCGCG CTGCTGGCTG CGATGGGCGC GCCGCTGCGC 2200
CCGCGCCCAC GGCTGACGGA AATGGAGCAA CTGGCGCTGG CCCTGCTGCG CATGGTCGGC GTGCGCATGC TGGTGATCGA CGAACTGCAC AATGTACTGG 2300
CTGGCAACAG CGTTAACCGG CGCGAGTTCC TCAACCTGCT GCGCTTCCTC GGCAACGAGC TGCGTATCCC CCTGGTCGGG GTCGGCACGC GCGAGGCGTA 2400
CCTGGCCATC CGTTCGGACG ATCAGTTGGA AAACCGCTTC GAGCCGATGC TGCTGCCGCC GTGGGAGGCC AATGAGGACT GCTGCTCGCT GCTGGCCAGC 2500
TTCGCGGCGT CACTCCCGCT ACGGCGGCCA TCCCCGATTG CCACGCTGGA CATGGCCCGC TACCTGCTCA CCCGGAGCGA AGGCACCATC GGCGAGCTTG 2600
CACATCTGCT GGTGGCGGCG GCCGTCGCCG CCGTGGAGAG TGGCGAGGAA GCGATCAACC ACCGCACGCT CAGCATGGCC GACTACATCG GCCCCAGTGA 2700
GCGGCGGCGA CAGTTCGAGC GGGAACTGAT GTGAAGTCCG CGCCGCGCTG GCCGCTGCAT CCGGCACCCA AGGAAGGCGA AGCCCTGTCC TCATGGCTCA 2800
ACCGGGTCGC CGCCTGCTAC CAGATGGACG TGCACGAGCT GCTGGCCCAC GATCTTGGTC ACAGCCAACT TGATGACCTG GATACCGCAC CATCCTTGTC 2900
GTTGCTGACG GCGCTCTGCC AGCGCAGTGG CGTCGAGCTG GAGCGGTTGC GCAGCATGAG TCTTGCAGGC TGGGTGCCGT GGCTGCTCGA CAGTCTCGAC 3000
GATTCGGTAC CAGCAGCCTT GGAAACCTAT ACATTCCAAT GCGCGGTGCT CCTGCCCAAG CGCACCCGCA AGGTGCGGTC CATCACTCGC TGGCGTGCCT 3100
GGCTACCGAG CCAGACGATT CGCCGGGCGT GTCCGCAGTG TCTGAACGAT CCAACGAATC AAGCTGTGCT ACTCGTCTGG CAGCTCCCCT TGATGCTGAG 3200
CTGCCCGCAG CATGGCTGCT GGCTGGAGTC CTACTGGGGC ATGCCCGGCC GGTATCTTCA GTGGGAAATC GCCGATGCTG CGCCGCGCCC TGCCGATGAC 3300
GCAATCGCCT GCATGGACCG GCGCACCTGG CAGGCGCTGA CGACGGGTTT TGTGGAGCTG CCGCGTCGGC GTGTCCACGC CGGCTTGTGG TTCCGGCTGC 3400
TGCGCACCCT GCTTGATGAG CTGAACACGC CGCTCTCGCA CTGCGGCAGT TGCTCGGCGA GCATCCGCCA TGTCTGGGAG CGCTGCGGCC ATCCGCTGCG 3500
CGCCGGGCAG AGTCTGTGGC GTCCCTACGA GATTCTGGCC CCGGCGGTGC AGTGGCAGAT GCTGGAGGCG GCGGCCACCG CCATCACGCT GATCGAGTCA 3600
AAGGCGCTGA TCCCGCGCGG GGAACAGGCC GCGCTGTTTC AGACGGAGCC GCACACTGGT TTCACCAACG GCCTGCCGGC GAAGGTGCCG AAGCCGGAAC 3700
CCATCAACCA CTGGCAACGA GCAGCCCAGG CCATCGATGA GGCCATCATT GAAGCGCGAC ACAACCCGGA GACGGCCCGC TTGCTGTTCA CACTGGCGTC 3800
CTACGGGCGA CGCGACCCCG AATCCCTGGA ACGTTTGCGC GCCACGTTCA CCAAGGAGGG CATCCCGCCG GAGTTTTTGT CACATTACGA GCCTGAATGG 3900
CCGTTTGCAG GTCTTAGACT AAATGACGGG TTAAGTGACA GTTTTTGACG GCGAGAACTT TCTGGCTCAC ACTGTCACAT AATCGAACGT ATATGTGACA 4000
GGTACGACAT GCTGATAGGC TACATGCGGG TATCGAAGGC GGACGGCTCC CAGGCTACCG ATTTGCAGCG CGACGCGCTG ATTGCCGCCG GGGTCGATCC 4100
AGTACATCTT TACGAGGACC AGGCATCCGG CATGCGCGAG GATCGGCCCG GCTTGACGAG CTGCCTGAAG GCGTTGCGAA CTGGCGACAC ACTGGTCGTG 4200
TGGAAACTGG ATCGGCTCGG ACGCGACCAG CGACATCTCA TCAACACCGT GCACGACCTG ACTGGGCGCG GCATCGGCTT GAAGGTATTA ACCGGGCACG 4300
GCGCGGCCAT CGACACCACG ACCGCCGCCG GCAAGCTGGT CTTTGGCATC TTCGCCGCCC TGGCCGAGTT CGAGCGCGAG TTGATCGCGG AGCGCACGAT 4400
TGCCGGCCTA GCCTCGGCCC GCGCGCGCGG GCGGAAAGGC GGCCGGCCGT TCAAGATGAC CGCCGCCAAG CTGCGGCTGG CGATGGCGGC AATGGGTCAG 4500
CCAGAGACCA AGGTCGGCGA CCTGTGCCAG GAACTTGGCG TCACGCGGCA GACCCTGTAT CGGCATGTTT CACCCAAGGG TGAGCTACGT CCAGATGGCG 4600
AGAAGCTACT CAGCCGAATT TGATGCCGGC ATGAGGCAAC GTAGCGACAG CGTGGTTTGT CTCAATGGGA AGCGCTCATG ATCGATCTTT GAAGGCCCGC 4700
AGCAGTCGTG TCACAGACAG GACGAACAAA CCGGTCAGCG TGAGGGCTGC GATACCCCAG TACTCTCCGA TGAACGCGCC GGCCGTCGTG CCGGCCAGCA 4800
CAATGGCGAG AATCGGCAAA TGGCAGGGAC AGGTGAGCAC GGCCAGCGCG CCCCACAGGT AGCCGGTGAT CGGTTTGTGC GTCTCGGACG GCAAGCGCTC 4900
GGGGCTGTTC ATGGCAGACT CTCCGCGTGC TGTGCCGGCT CGGTCGGCAT GGTGGCCAAC TGCACCTCCA GATCGGCCAA CGCTTCGCGC CGACGCTCGA 5000
CGAACTGGCG CAGAACGGCA AGCTGCGCGG CCGCTTCATC GCCGTCCGCA GCATCCAGCG CCCGGCACAG CCGCGCCAGC GCGTCCAGGC CGATGCCCGC 5100
CTCGAAGGCC GCCCGCACGA AGCACAGCCG TTGCAAGGCG GCATCATCGA ACAGGCCATA GCCGCCCGGG GTGCACGCCA CCGGACGCAG CAATCCGCGC 5200
AGCAGGTAGT CGCGCACGAT ATGCACGCTC ACCCCGGCAT CAAGGGCCAG CCGGGACACG GTGTAGGCGC TCATTGAAAA CCTCCTTTTT TTATCCAGCG 5300
CAGCAGGAAA GCTGCTTCAC GTCCTTGTTG AAGGTCTGCG CCGCAAGCTT CAACCCCTCG ACCATTGTCA GGTAGGGGAA CAACTGGTCG GCCAGTTCCT 5400
GCACCGTCAT GCGGTTGCGG ATGGCGAGCA CCGCCGTCTG GATCAGTTCG CCCGCTTCCG GGGCCACCGC CTGCACGCCG ATGAGCCGTC CGCTACCTTC 5500
CTCGATGACC AGCTTGATGA AGCCGCGTGT GTCGAAGTTG GCAAGCGCTC GCGGAACGTT GTCGAGTGTC AGCGTGCGAC TGTCGGTCTC GATGCCATCG 5600
TGGTGCGCTT CCGCCTCGCT GTAGCCCACG GTGGCGACTT GCGGGTCGGT GAACACCACT GCCGGCATCG CGGTCAGATT GAGGGCTGCG TCGCCGCCGG 5700
TCATGTTGAT CGCGGCACGG GTGCCGGCGG CCGCTGCCAC GTAGACGAAC TGCGGCTGGT CGGTGCAGTC GCCGGCCGCG TAGATGTTCG GGTTGCTCGT 5800
GCGCATGCCT TGGTCGATAA CGATGGCCCC TTGCGCATTG ACAGTGACCC CCGCCGCGTC CAGCGCGAGG CTGCGCGTAT TCGGTGCCCG ACCGGTGGCA 5900
ACCAGCAACT TGTCAGCGCG CAATTCACCG TGTCCGGTGG TCAGCACGAA TTCGCCGTTC ACATGGGCGA CCTGGCTGGC TTGCGTGTGC TCCAGCACCT 6000
CGATGCCCTC GGCGCGGAAA GCGGCTGTCA CGGCCTCGCC GATGGCCGGG TCTTCCCGGA AGAACAAGGT GCTGCGTGCC AGGATCGTGA CCTGGCTGCC 6100
GAGCCGGGCA AAGGCTTGCG CCAGTTCCAA CGCCACCACC GACGAACCGA TCACGGCCAG GCGTGCGGGA ATGGTGTCGC TGACAAGCGC TTCGGTGGAA 6200
GTCCAGTAGG GTGACTCTTT CAGGCCCGGA ATCGGCGGCA CGACCGGACT GGCACCGGTG GCGACCAGGC AGCGGTCGAA CGTTACCTCG CGCTCGCCAC 6300
CCTCGTTCAA ACGGACGACC AGGCTCTGGT CGTCCTTGAA ACGCGCTTCA CCGTGCAAAA CGGTGATGGC TGGATTGCCG TCCAGGATGC CTTCGTATTT 6400
GGCGTGCCGC AGTTCATCGA CACGGGCCTG CTGCTGGGCC AGCAGTTTGC TGCGGTCAAT CGCAGGCACA GTTGCCGCAA TACCGCCGTC GAACGGACTT 6500
TCCCGGCGCA GATGGGCAAT ATGGGCAGCG CGGATCATGA TCTTGGACGG CACACAGCCG ATATTGACGC AGGTGCCGCC GATGGTGCCG CGTTCGATCA 6600
GCGTGACCGT CGCGCCTTGC TCGACGGCCT TCAGCGCCGC CGCCATCGCG GCCCCGCCGC TGCCAATGAT GGCGATATGC AAACCGGCGC CCTCAAGTGC 6700
ATCACGGATT TTTGGTTCAT CTTTGAAATC ACCAACCCGG ATCGAGCCTT GATAACCCAA TGCGGCGATG GCGGCCAGCA GTTGGTTGTG GCTCACGGCG 6800
GTGTCTGCCA TGACTTGCGC GCGGCTTTCT GGATAGGACA CCACAGCGGC ATTCACGCCG GGAATCTTTT CCAAAGCATC TTTGACATGG GTGGCGCAGG 6900
ATGTGCAGGT CATGCCATTC ACGGTGATTT CGGTCATTTT TTTACTCCAT TGAATTTCGG GGTGCAGCAG GCATCGGCTT GGCGTTTTCG TTGGATGGCG 7000
TAGATGGTCA AGCCGATGAA AATCGCCAGC GCAGGCAGCA GCACATAGTC CAGATAGCCG GTCAGCGCGG ACAAGCCGAC CACACCGAGC AAAATGACCA 7100
GAACAGGGGT GAAGCAACAC AGCGCCACGA GGGTTGTGCC AATGATGCTG ACCCGCAGCA GTGTCTTCGG GTCTTTCATG ATCAGTTCTT GACTGATGAT 7200
GGGTAGCCCG CATCCTCGGT AGCCTTGGTC AGTTTCTGCA CGCTGGTCTT GGCATCATCG AAGGTGACCA CCGCTTCGCG CGTCTCGAAG GTCACGTCAA 7300
CTTTACTGAC GCCATCGACC TTGGAAATCG CCTTCTTGAC AGTGATCGGA CAGGCCGAGC AGGTCATGCC CGGTACGGAC AGCGTAACGG TCTGGGTGGC 7400
GGCCCACACG GGGGCAACAA CGGCAGCGAG GGCAAGGGCG GAAAGCAGCT TTTTCATGGT GAACTCCTGT GATCAATAGA AAAATGGCAC GACGTAGGGA 7500
AATCCGAGCG CGACCAAAAC CAGCACGGCC ACGCCCCAGA AAATGAGCTT GTAAGTAGCT CGCACTTGGG GAATCGCGCA AACCTCACCC GGTTTGCAGG 7600
CGGCTGACGG CCGGTAGATG CGCCGCCAGG CGAAGAACAA CGCCACCAGC GCCACGCCGA TAAAGATGGG GCGATAGGGT TCCAACACCG TCAAGTTGCC 7700
GATCCAAGCG CCGCTGAACC CCAAGGCGAT CAGAACCAGC GGCCCGAGGC AGCAAGCCGA GGCGAGGATG GCGGCCAGCC CGCCAGTGAA GAGCGCGCCG 7800
CGCCCGTTTT GAGGTTCAGA CATACGTTTG TCCTTTCGAA TCTGAATTGG ATAGCTTAAG CTTACTTCCG TAGTTATGTA CGGAGTCAAG CGATATGGAA 7900
AACAATTTGG AGAACCTGAC CATTGGCGTT TTCGCCAGGA CGGCCGGGGT CAATGTGGAG ACCATCCGGT TCTATCAGCG CAAGGGCTTG CTCCCGGAAC 8000
CGGACAAGCC TTACGGCAGC ATTCGCCGCT ATGGCGAGAC GGATGTAACG CGGGTGCGCT TCGTGAAATC AGCCCAGCGG TTGGGCTTCA GCCTGGATGA 8100
GATCGCCGAG CTGCTGCGGC TGGAGGATGG CACCCATTGT GAGGAAGCCA GCAGCCTGGC CGAGCACAAG CTCAAGGACG TGCGCGAGAG GATGGCTGAC 8200
CTGGCGCGCA TGGAGGCCGT GCTGTCTGAT TTGGTGTGCG CCTGCCATGC GCGGAAGGGG AACGTTTCCT GCCCGCTGAT TGCGTCACTG CAAGGGAAGA 8300
AAGAACCGCG CAGTGCGGAC GCGGTGTAGC CCGAGGGAAC TACGCCTTAG CGTGCTTTAT TTTCCGTTTT CTGAGGCGAC TCCAACGTCA GAAAAGACCG 8400
TGCGGTCGAC TTTTGATATT TCGTGCTGTC GCCTTCTGAA AGTGACA

 Recombination Sites     

Name Coordinates Gene Sequence
r6 3826-3840 15 CTGGAACGTT TGCGC
r5 3872-3885 14 AGTTTTTGTC ACAT
r4 3877-3890 14 TTGTCACATT ACGA
r3 3928-3941 14 GGGTTAAGTG ACAG
res 3969-4003 35 ACACTGTCAC ATAATCGAAC GTATATGTGA CAGGT
r2 3972-3985 14 CTGTCACATA ATCG
r1 3988-4001 14 CGTATATGTG ACAG

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA Tn5053 144-1823 Transposase   +
tniB Tn5053 1826-2734 Accessory Gene   +
tniQ Tn5053 2731-3948 Accessory Gene Target Site Selection +
tniR Tn5053 4009-4623 Accessory Gene Resolvase +
merE; urf-1 Tn5053 4676-4912 Passenger Gene Heavy Metal Resistance -
merD Tn5053 4909-5274 Passenger Gene Heavy Metal Resistance -
merA Tn5053 5291-6937 Passenger Gene Heavy Metal Resistance -
merF Tn5053 6934-7179 Passenger Gene Heavy Metal Resistance -
merP Tn5053 7182-7457 Passenger Gene Heavy Metal Resistance -
merT Tn5053 7473-7823 Passenger Gene Heavy Metal Resistance -
merR Tn5053 7895-8329 Passenger Gene Heavy Metal Resistance +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA Tn5053 1680 144-1823 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   homologous to TnsB of Tn7
Protein Sequence:  
MTSDTPPIAA QGVATLPDEA WAQARHRTEI IGPLAALEVV GHEAADEAAQ ALGLSRRQVY VLIRRARQGT GLVTDLTPGR SGGGKGKGRL PEPVERIIRE
LLQKRFLTKQ KRSLAAFHRE VAQACKTQKL PVPARNTVAQ RIAGLHPAKI ARSRGGQDAA RPLQGAGGIP PEVTMPLEQV QIDHTVIDLI VVDERDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLAHAAC DKRPWLEGLS VEMDWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPPGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPGQRGE YDSEKMATLT LRELERWLAL AVGTYHGSVH NGLLQPPAAR WAEAVERVGV PAVVTRPTAF LVDFLPVIRR
TLTRTGFVID HIHYYADALK PWIARRERLP AFLIRRDPRD ISRIWVLEPE GQHYLEIHYR TLSHPAVTLW EQRQALAKLR QLGREQVDES ALFRMIGQMR
EIVTTAQKAT RKARRDADRR QHLKTSEPPA KPIPPNVDMA DPQADNLPPA KPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB TniB Tn5053 909 1826-2734 +
Class:   Accessory Gene
Function:   is needed for transposition of Tn5053
Transpoase Chemistry:   Serine
Comment:   homologous to TnsC protein of Tn7 putative ATP-binding protein
Protein Sequence:  
MDEYPVIDLS HLLPAAQGLA RLPADERIQR IRADRWIGYP RAVEALNRLE TLYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRAHPVGTD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL TEMEQLALAL LRMVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRE AYLAIRSDDQ
LENRFEPMLL PPWEANEDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLVAAAVAAV ESGEEAINHR TLSMADYIGP SERRRQFERE
LM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniQ TniQ Tn5053 1218 2731-3948 +
Class:   Accessory Gene
Sub Class:   Target Site Selection
Function:   is needed for transposition of Tn5053
Protein Sequence:  
MKSAPRWPLH PAPKEGEALS SWLNRVAACY QMDVHELLAH DLGHSQLDDL DTAPSLSLLT ALCQRSGVEL ERLRSMSLAG WVPWLLDSLD DSVPAALETY
TFQCAVLLPK RTRKVRSITR WRAWLPSQTI RRACPQCLND PTNQAVLLVW QLPLMLSCPQ HGCWLESYWG MPGRYLQWEI ADAAPRPADD AIACMDRRTW
QALTTGFVEL PRRRVHAGLW FRLLRTLLDE LNTPLSHCGS CSASIRHVWE RCGHPLRAGQ SLWRPYEILA PAVQWQMLEA AATAITLIES KALIPRGEQA
ALFQTEPHTG FTNGLPAKVP KPEPINHWQR AAQAIDEAII EARHNPETAR LLFTLASYGR RDPESLERLR ATFTKEGIPP EFLSHYEPEW PFAGLRLNDG
LSDSF

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniR TniR Tn5053 615 4009-4623 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   also called TniC
Protein Sequence:  
MLIGYMRVSK ADGSQATDLQ RDALIAAGVD PVHLYEDQAS GMREDRPGLT SCLKALRTGD TLVVWKLDRL GRDQRHLINT VHDLTGRGIG LKVLTGHGAA
IDTTTAAGKL VFGIFAALAE FERELIAERT IAGLASARAR GRKGGRPFKM TAAKLRLAMA AMGQPETKVG DLCQELGVTR QTLYRHVSPK GELRPDGEKL
LSRI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE; urf-1 MerE Tn5053 237 4676-4912 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   Broad-spectrum mercury transporter
Protein Sequence:  
MNSPERLPSE THKPITGYLW GALAVLTCPC HLPILAIVLA GTTAGAFIGE YWGIAALTLT GLFVLSVTRL LRAFKDRS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn5053 366 4909-5274 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MSAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLAVLRQ FVERRREALA
DLEVQLATMP TEPAQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn5053 1647 5291-6937 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercuric ion reductase
Target:   Mercury
Protein Sequence:  
MTEITVNGMT CTSCATHVKD ALEKIPGVNA AVVSYPESRA QVMADTAVSH NQLLAAIAAL GYQGSIRVGD FKDEPKIRDA LEGAGLHIAI IGSGGAAMAA
ALKAVEQGAT VTLIERGTIG GTCVNIGCVP SKIMIRAAHI AHLRRESPFD GGIAATVPAI DRSKLLAQQQ ARVDELRHAK YEGILDGNPA ITVLHGEARF
KDDQSLVVRL NEGGEREVTF DRCLVATGAS PVVPPIPGLK ESPYWTSTEA LVSDTIPARL AVIGSSVVAL ELAQAFARLG SQVTILARST LFFREDPAIG
EAVTAAFRAE GIEVLEHTQA SQVAHVNGEF VLTTGHGELR ADKLLVATGR APNTRSLALD AAGVTVNAQG AIVIDQGMRT SNPNIYAAGD CTDQPQFVYV
AAAAGTRAAI NMTGGDAALN LTAMPAVVFT DPQVATVGYS EAEAHHDGIE TDSRTLTLDN VPRALANFDT RGFIKLVIEE GSGRLIGVQA VAPEAGELIQ
TAVLAIRNRM TVQELADQLF PYLTMVEGLK LAAQTFNKDV KQLSCCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merF MerF Tn5053 246 6934-7179 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercuric ion transport protein
Protein Sequence:  
MKDPKTLLRV SIIGTTLVAL CCFTPVLVIL LGVVGLSALT GYLDYVLLPA LAIFIGLTIY AIQRKRQADA CCTPKFNGVK K

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn5053 276 7182-7457 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   periplasmic mercuric ion binding protein
Protein Sequence:  
MKKLLSALAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVDGV SKVDVTFETR EAVVTFDDAK TSVQKLTKAT EDAGYPSSVK N

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn5053 351 7473-7823 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercury ion transport protein
Protein Sequence:  
MSEPQNGRGA LFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLTVLEPY RPIFIGVALV ALFFAWRRIY RPSAACKPGE VCAIPQVRAT YKLIFWGVAV
LVLVALGFPY VVPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn5053 435 7895-8329 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFARTAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGETD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVRERM
ADLARMEAVL SDLVCACHAR KGNVSCPLIA SLQGKKEPRS ADAV

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 Tn5053 9-27 TCAGAAGACG ACCGCACCA
repeat t2 Tn5053 49-67 CACACGTATG CCGAGGACT
repeat t3 Tn5053 78-96 TCAGGAGTCG TCTGCACCA
repeat t4 Tn5053 110-128 TCAATACTCG TGTGCACCA
IR Tn21-like Tn5053 8346-8383 GAATCGCACG AAATAAAAGG CAAAAGACTC CGCTGAGG
repeat i4 Tn5053 8356-8374 AAATAAAAGG CAAAAGACT
repeat i2 Tn5053 8398-8416 CCGTGCGGTC GACTTTTGA
IRi Tn5053 8420-8447 AAGCACGACA GCGGAAGACT TTCACTGT

 References