Transposon
Name: Tn5053.1       (Synonyms: Tn4671)
Family: Tn402        Group: Tn5053
Evidence of Transposition: yes
 Host     

Host Organism: Alicycliphilus denitrificans BC        Molecular Source: plasmid pALIDE02
Place of Origin: U.S.A.        Date of Isolation: 2011

 Sequence     
DNA Sequence        Length: 8448 bp
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCATTC AGCGCGGTTT GAAGCCTCTT GTGCATACGG CTCCTGACAG CCAGATATCA GGACTCCTCT GCACGAAACT 100
CGTCTATGCT CAATACTCGT GTGCACCAAA GCGAGATGAG CATGGCGACC GATACCGCAC CGATTACCGA GCACGGCGTG GCCACCCTGC CAGAACAGGC 200
ATGGGAGCGT GCGCGTCGTC GCGCGGAGAT CATTGGGCCG TTGGCGCAGT CGGAGACGGT TGGGCATGAA GCGGCCGACG CGGCAGCCCA GGCATTGGGG 300
CTGTCCCGGC GGCAGGTCTA CGTCCTGATC CGCCGTGCCC GGCTAGGATC GGGGCTGGTC ACTGACTTGG CTCTTGGGCA GTCGAGCGGT GGCAAAGGTA 400
AAGGCCGCTT GCCGGAGTCG GTCGAACGAA TCATCCGCGA GTTACTGCAA AAGCGCTTCC TGACCAAGCA GAAGCGTAGC TTGGCGGCGT TCCACCGCGA 500
AGTTGTGCGG GCGTGCAAGC TGCAAAAGCT GCGGGTGCCG GCGCGCAACA CGGTGGCTCT GCGGATCGCC GGCCTCGATC CGCGCGAGGT CACTCACCGC 600
CGGGAAGGAC AAGATGCCGC CCGCGACCTG CAAGGTGTTG GTGGTGTTCC GCCACCCGTC TCCGCGCCGC TGGAGCAGGT GCAGATCGAC CACACAGTCA 700
TCGACCTGAT CGTGGTGGAC GAGCGCGACC GGCAACCGAT TGGCCGTCCA TACCTGACCC TCGCCATCGA CGTATTCACC CGCTGCGTGG TTGGCATGGT 800
CGTTACGCTG GAAGCCCCGT CCGCCGTCTC GGTCGGCTTG TGCTTGGTGC ATGCCGCCTG CGACAAGCGC CCTTGGTTGG AAGGGTTGAA CGTAGAGATG 900
GATTGGCCGA TGAGCGGCAA GCCCAGACTG CTCTACTTGG ACAACGCGGC TGAGTTCAAG AGCGAGGCGC TGCGCCGTGG CTGCGAGCAG CATGGCATCC 1000
GGTTGGACTA TCGCCCGCTC GGGCAGCCGC ACTACGGCGG TATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATC CACGACGAAT TGCCGGGGAC 1100
GACCTTCTCC AATCCTGACC AGCGCGGGGA ATACGCCTCC GAGAAGATGG CCGCCCTGAC ACTGCGCGAG CTGGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTATCACG GCTCCGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAA GCTATCACGC GGACCGGCGT GCCAACCGTC ATCACTCGCG 1300
CTACGGCTTT TCTGGTCGAT TTTCTGCCCA TCATCCGCCG CACGCTGACC CGCACCGGCT TCGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATA GCTCGGCGGG ACCGCTTGCC TGCGTTCCTG ATCCGGCGCG ACCCGCGCGA CATCAGCCGC ATATGGGTGC TGGAACCAGA GGGGCAGCAC 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CACCCGGCTG TCACCCTATG GGAACAACGG CAGGCGCTGG CGAAATTACG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC AGCGCTGTTC CGCATGATCG GCCAGATGCG AGAAATCGTG ACCACCGCGC AGAAAGCTAC GCGCAAGGCG CGGCGCGACG CGGATCGACG 1700
CCAGCATCTC AAGGCAACGG CACCGCCTGT CAAAACCACG CCACCACCAG ATGCGGACAT GGCTGACCCA CAGGCCGACA ACCAGCCGCC GGCCAAACCG 1800
TTCGACCAGA TTGAGGAGTG GTAGCCGTGG ACGAATATCC CATCATCGAC TTGTCACACC TGCTGCCAGC GGCACAGGGG CTGGCTCGGC TGCCGGCGGA 1900
CGAGCGCATC CAGCGCCTTC GCGCCGACCG CTGGATCGGC TATCCGCGCG CGGTCGAGGC GCTGAACCGG CTGGAAACCC TGTATGCGTG GCCAAACAAG 2000
CAACGCATGC CCAACCTGCT GCTGGTCGGC CCGACCAACA ACGGCAAGTC GATGATCGTC GAGAAGTTCC GGCGCACGCA TCCGGCCAGC GCCGACGCCG 2100
ACCAGGAGCA CATTCCGGTA CTGGTCGTGC AGATGCCATC CGAACCGTCG GTAATCCGCT TCTACGTCGC GCTGCTCGCG GCGATGGGTG CGCCATTGCG 2200
ACCGCGCCCA CGGCTGCCAG AAATGGAACA ACTGGCGCTG GCACTGCTGC GCAAGGTCGG CGTGCGCATG CTGGTGATCG ACGAGTTGCA CAACGTCCTG 2300
GCCGGTAACA GCGTCAACCG CCGGGAATTT CTCAATCTCC TGCGCTTCCT CGGCAATGAA CTGCGCATCC CACTGGTCGG GGTCGGCACG CGCGACGCCT 2400
ACCTGGTCAT CCGCTCCGAT GACCAGTTGG AAAATCGCTT CGAGCCGATG ATGCTGCCGG TATGGGAGGC CAACGACGAT TGCTGCTCAC TGCTGGCCAG 2500
CTTCGCCGCT TCGCTTCCAC TGCGGCGCCC CTCGTCGATT GCCACGCTGG ACATGGCCCG CTACCTGCTC ACACGCAGCG AAGGCACCAT CGGCGAACTG 2600
ACGCACTTGC TGATGGCGGC AGCCCTCGCC GCCGTGGAGA GCGGCGAGGA AGCGATCAAC CATCGCACGC TGAGCATGGC CGATTACACC GGCCCCAGCG 2700
AGCGGCGTCG GCAATTCGAG CGGGAACTGA TGTGAAGCCA GCGCCACGCT GGCCACTGCA TCCGGCTCCC AGGGAAGGTG AAGCCTTGTC TTCGTGGCTC 2800
AACCGCGTGG CCCTTTGCTA TCACATAGAG GTGTCCGAGC TGCTGGAGCA CGATCTTGGT CACGGTCAGG TTGATGACCT GGACACCGCG CCACCACTGG 2900
CGCTGCTGGC GATGCTTTCC CAGCGGAGCG GCATCGAACT GAACCGGCTG CGTTGCATGA GCTTTGCCGG CTGGGTGCCT TGGCTACTGG ACAGCCTTGA 3000
TGATCAGATT CCAGATGCAT TGGAGACCTA TGCGTTCCAG CTCTCGGTGT TGCTGCCGAC ACACCGCCGT AAGACGCGAT CCATCACGAG CTGGCGTGCC 3100
TGGCTGCCCA GCCAGCCGAT ACACCGCGCC TGTCCGCTAT GCCTGAACGA TCCGGAGAAC CAAGCCGTAC TGCTCGCGTG GAAGCTGCCC CTGATGCTGA 3200
GCTGCCCGCT GCATGGCTGC CGGCTGGAAT CCTATTGGGG CGTGCCAGGG CGGTTTCTAG GCTGGGAGAA CGCCGACGCC GAACCGCGCA CCGCCAGCGA 3300
CGCGATTGCG GCGATGGACC AGCGTACCTG GCAGGCACTG ACGACCGGTC ACCTGGAGCT GCCGCGCCGA CGCATCCATG CCGGATTGTG GTTTCGGCTG 3400
CTACGCACGC TGCTCGATGA GCTGAACACA CCGCTTTCGG CGTGCGGAAC CTACGCGGGG TATCTCCGCC AAGTCTGGGA AGGCTGCGGG CATCCGCTGC 3500
GTGCTGGGCA AAGTCTGTGG CGACCGTATG AAACCCTGAA TCCGGCAGTA CGGTTGCAGA TGCTGGAGGC GGCGGCAACG GCAATCAGCT TGATTGAGGT 3600
GAGGTACATA AGCCCGCCAG GCGAGCAGGC AAAGCTGTTC TGGTCCGAGC CCCAAACAGG GTTCACCAGT GGCCTGCCGA CGAAAGCGCC GAAGCCCGAA 3700
CCCATCAATC ACTGGCAGCG TGCAGTCCAG GCCATCGACG AGGCCATCAT TGAAGCGCGG CACAACCCCG AGACGGCACG CTCGCTGTTC GCGTTGGCTT 3800
CCTATGGTCG GCGCGACCCC GCTTCCCTGG AACAGTTGCG CGCCACCTTC GCGAAGGAAG GCATCGCCAC GGAATTTCTG TCACATTATG AGCCTGACGG 3900
ATCCTTTGCA TGTCTTAGAC AGAATGACGG GTTAAGTGAC AAATTTTGAC GACCTTAACT TTCCGGCGCA CACTGTCACA TAATCGAACG TATATGTGAC 4000
AGGTACGACA TGCTGATAGG CTACATGCGG GTATCGAAGG CGGACGGCTC CCAGGCTACC GATTTGCAGC GCGACGCGCT GATTGCCGCC GGGGTCGATC 4100
CAGTACATCT TTACGAGGAC CAGGCATCCG GCATGCGCGA GGATCGGCCC GGCTTGACGA GCTGCCTGAA GGCGTTGCGA ACTGGCGACA CACTGGTCGT 4200
GTGGAAACTG GATCGGCTCG GACGCGACCT GCGACATCTC ATCAACACCG TGCACGACCT GACTGGGCGC GGCATCGGCT TGAAGGTATT AACCGGGCAC 4300
GGCGCGGCCA TCGACACCAC GACCGCCGCC GGCAAGCTGG TCTTTGGCAT CTTCGCCGCC CTGGCCGAGT TCGAGCGCGA GTTGATCGCG GAGCGCACGA 4400
TTGCCGGCCT AGCCTCGGCC CGCGCGCGCG GGCGGAAAGG CGGCCGGCCG TTCAAGATGA CCGCCGCCAA GCTGCGGCTG GCGATGGCGG CAATGGGTCA 4500
GCCAGAGACC AAGGTCGGCG ACCTGTGCCA GGAACTTGGC GTCACGCGGC AGACCCTGTA TCGGCATGTT TCACCCAAGG GTGAGCTACG TCCAGATGGC 4600
GAGAAGCTAC TCAGCCGAAT TTGATGCCGG CATGAGGCAA CGTAGCGACA GCGTGGTTTG TCTCAATGGG AAGCGCTCAT GATCGATCTT TGAAGGCCCG 4700
CAGCAGTCGT GTCACAGACA GGACGAACAA ACCGGTCAGC GTGAGGGCTG CGATACCCCA GTACTCTCCG ATGAACGCGC CGGCCGTCGT GCCGGCCAGC 4800
ACAATGGCGA GAATCGGCAA ATGGCAGGGA CAGGTGAGCA CGGCCAGCGC GCCCCACAGG TAGCCGGTGA TCGGTTTGTG CGTCTCGGAC GGCAAGCGCT 4900
CGGGGCTGTT CATGGCAGAC TCTCCGCGTG CTGTGCCGGC TCGGTCGGCA TGGTGGCCAA CTGCACCTCC AGATCGGCCA ACGCTTCGCG CCGACGCTCG 5000
ACGAACTGGC GCAGAACGGC AAGCTGCGCG GCCGCTTCAT CGCCGTCCGC AGCATCCAGC GCCCGGCACA GCCGCGCCAG CGCGTCCAGG CCGATGCCCG 5100
CCTCGAAGGC CGCCCGCACG AAGCACAGCC GTTGCAAGGC GGCATCATCG AACAGGCCAT AGCCGCCCGG GGTGCACGCC ACCGGACGCA GCAATCCGCG 5200
CAGCAGGTAG TCGCGCACGA TATGCACGCT CACCCCGGCA TCAAGGGCCA GCCGGGACAC GGTGTAGGCG CTCATTGAAA ACCTCCTTTT TTTATCCAGC 5300
GCAGCAGGAA AGCTGCTTCA CGTCCTTGTT GAAGGTCTGC GCCGCAAGCT TCAACCCCTC GACCATTGTC AGGTAGGGGA ACAACTGGTC GGCCAGTTCC 5400
TGCACCGTCA TGCGGTTGCG GATGGCGAGC ACCGCCGTCT GGATCAGTTC GCCCGCTTCC GGGGCCACCG CCTGCACGCC GATGAGCCGT CCGCTACCTT 5500
CCTCGATGAC CAGCTTGATG AAGCCGCGTG TGTCGAAGTT GGCAAGCGCT CGCGGAACGT TGTCGAGTGT CAGCGTGCGA CTGTCGGTCT CGATGCCATC 5600
GTGGTGCGCT TCCGCCTCGC TGTAGCCCAC GGTGGCGACT TGCGGGTCGG TGAACACCAC TGCCGGCATC GCGGTCAGAT TGAGGGCTGC GTCGCCGCCG 5700
GTCATGTTGA TCGCGGCACG GGTGCCGGCG GCCGCTGCCA CGTAGACGAA CTGCGGCTGG TCGGTGCAGT CGCCGGCCGC GTAGATGTTC GGGTTGCTCG 5800
TGCGCATGCC TTGGTCGATA ACGATGGCCC CTTGCGCATT GACAGTGACC CCCGCCGCGT CCAGCGCGAG GCTGCGCGTA TTCGGTGCCC GACCGGTGGC 5900
AACCAGCAAC TTGTCAGCGC GCAATTCACC GTGTCCGGTG GTCAGCACGA ATTCGCCGTT CACATGGGCG ACCTGGCTGG CTTGCGTGTG CTCCAGCACC 6000
TCGATGCCCT CGGCGCGGAA AGCGGCTGTC ACGGCCTCGC CGATGGCCGG GTCTTCCCGG AAGAACAAGG TGCTGCGTGC CAGGATCGTG ACCTGGCTGC 6100
CGAGCCGGGC AAAGGCTTGC GCCAGTTCCA ACGCCACCAC CGACGAACCG ATCACGGCCA GGCGTGCGGG AATGGTGTCG CTGACAAGCG CTTCGGTGGA 6200
AGTCCAGTAG GGTGACTCTT TCAGGCCCGG AATCGGCGGC ACGGCCGGAC TGGCACCGGT GGCGACCAGG CAGCGGTCGA ACGTTACCTC GCGCTCGCCA 6300
CCCTCGTTCA AACGGACGAC CAGGCTCTGG TCGTCCTTGA AACGCGCTTC ACCGTGCAAA ACGGTGATGG CTGGATTGCC GTCCAGGATG CCTTCGTATT 6400
TGGCGTGCCG CAGTTCATCG ACACGGGCCT GCTGCTGGGC CAGCAGTTTG CTGCGGTCAA TCGCAGGCAC AGTTGCCGCA ATACCGCCGT CGAACGGACT 6500
TTCCCGGCGC AGATGGGCAA TATGGGCAGC GCGGATCATG ATCTTGGACG GCACACAGCC GATATTGACG CAGGTGCCGC CGATGGTGCC GCGTTCGATC 6600
AGCGTGACCG TCGCGCCTTG CTCGACGGCC TTCAGCGCCG CCGCCATCGC GGCCCCGCCG CTGCCAATGA TGGCGATATG CAAACCGGCG CCCTCAAGTG 6700
CATCACGGAT TTTTGGTTCA TCTTTGAAAT CACCAACCCG GATCGAGCCT TGATAACCCA ATGCGGCGAT GGCGGCCAGC AGTTGGTTGT GGCTCACGGC 6800
GGTGTCTGCC ATGACTTGCG CGCGGCTTTC TGGATAGGAC ACCACAGCGG CATTCACGCC GGGAATCTTT TCCAAAGCAT CTTTGACATG GGTGGCGCAG 6900
GATGTGCAGG TCATGCCATT CACGGTGATT TCGGTCATTT TTTTACTCCA TTGAATTTCG GGGTGCAGCA GGCATCGGCT TGGCGTTTTC GTTGGATGGC 7000
GTAGATGGTC AAGCCGATGA AAATCGCCAG CGCAGGCAGC AGCACATAGT CCAGATAGCC GGTCAGCGCG GACAAGCCGA CCACACCGAG CAAAATGACC 7100
AGAACAGGGG TGAAGCAACA CAGCGCCACG AGGGTTGTGC CAATGATGCT GACCCGCAGC AGTGTCTTCG GGTCTTTCAT GATCAGTTCT TGACTGATGA 7200
TGGGTAGCCC GCATCCTCGG TAGCCTTGGT CAGTTTCTGC ACGCTGGTCT TGGCATCATC GAAGGTGACC ACCGCTTCGC GCGTCTCGAA GGTCACGTCA 7300
ACTTTACTGA CGCCATCGAC CTTGGAAATC GCCTTCTTGA CAGTGATCGG ACAGGCCGAG CAGGTCATGC CCGGTACGGA CAGCGTAACG GTCTGGGTGG 7400
CGGCCCACAC GGGGGCAACA ACGGCAGCGA GGGCAAGGGC GGAAAGCAGC TTTTTCATGG TGAACTCCTG TGATCAATAG AAAAATGGCA CGACGTAGGG 7500
AAATCCGAGC GCGACCAAAA CCAGCACGGC CACGCCCCAG AAAATGAGCT TGTAAGTAGC TCGCACTTGG GGAATCGCGC AAACCTCACC CGGTTTGCAG 7600
GCGGCTGACG GCCGGTAGAT GCGCCGCCAG GCGAAGAACA ACGCCACCAG CGCCACGCCG ATAAAGATGG GGCGATAGGG TTCCAACACC GTCAAGTTGC 7700
CGATCCAAGC GCCGCTGAAC CCCAAGGCGA TCAGAACCAG CGGCCCGAGG CAGCAAGCCG AGGCGAGGAT GGCGGCCAGC CCGCCAGTGA AGAGCGCGCC 7800
GCGCCCGTTT TGAGGTTCAG ACATACGTTT GTCCTTTCGA ATCTGAATTG GATAGCTTAA GCTTACTTCC GTAGTTATGT ACGGAGTCAA GCGATATGGA 7900
AAACAATTTG GAGAACCTGA CCATTGGCGT TTTCGCCAGG ACGGCCGGGG TCAATGTGGA GACCATCCGG TTCTATCAGC GCAAGGGCTT GCTCCCGGAA 8000
CCGGACAAGC CTTACGGCAG CATTCGCCGC TATGGCGAGA CGGATGTAAC GCGGGTGCGC TTCGTGAAAT CAGCCCAGCG GTTGGGCTTC AGCCTGGATG 8100
AGATCGCCGA GCTGCTGCGG CTGGAGGATG GCACCCATTG CGAGGAAGCC AGCAGCCTGG CCGAGCACAA GCTCAAGGAC GTGCGCGAGA GGATGGCTGA 8200
CCTGGCGCGC ATGGAGGCCG TGCTGTCTGA TTTGGTGTGC GCCTGCCATG CGCGGAAGGG GAACGTTTCC TGCCCGCTGA TTGCGTCACT GCAAGGGAAG 8300
AAAGAACCGC GCAGTGCGGA CGCGGTGTAG CCCGAGGGAA CTACGCCTTA GCGTGCTTTA TTTTCCGTTT TCTGAGGCGA CTCCAACGTC AGAAAAGACC 8400
GTGCGGTCGA CTTTTGATAT TTCGTGCTGT CGCCTTCTGA AAGTGACA
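
The listing above groups bases in blocks of 10 and ends each full row with a running position counter. A minimal sketch of how such a listing could be turned back into one contiguous string is given here; the function name `parse_sequence_block` is hypothetical, and the example text reuses only the ruler, the first row, and the last row of this record to show the format, so the result is not a contiguous region of the element.

```python
def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence listing into one string."""
    bases = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and the ruler line ("--------10 --------20 ...").
        if not line or line.startswith("-"):
            continue
        for token in line.split():
            # Drop the running position counter at the end of a row.
            if token.isdigit():
                continue
            bases.append(token.upper())
    return "".join(bases)


# Format demonstration only: ruler, first row, and last row of the record.
example = """\
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCATTC AGCGCGGTTT GAAGCCTCTT GTGCATACGG CTCCTGACAG CCAGATATCA GGACTCCTCT GCACGAAACT 100
GTGCGGTCGA CTTTTGATAT TTCGTGCTGT CGCCTTCTGA AAGTGACA
"""

sequence = parse_sequence_block(example)
print(len(sequence))  # 148 (100 bases from the first row + 48 from the last)
```

Applied to the full listing, the parsed string should have the stated length of 8448.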

 Recombination Sites     

Name Coordinates Length Sequence
r5 3873-3886 14 AATTTCTGTC ACAT
r3 3929-3942 14 GGGTTAAGTG ACAA
res 3970-4004 35 ACACTGTCAC ATAATCGAAC GTATATGTGA CAGGT
r2 3973-3986 14 CTGTCACATA ATCG
r1 3989-4002 14 CGTATATGTG ACAG

 ORFs     
ORF Summary
Gene Name     Associated TE   Coordinates   Class            Sub Class                Orientation
tniA          Tn5053.1        142-1824      Transposase                               +
tniB          Tn5053.1        1827-2735     Accessory Gene                            +
tniQ          Tn5053.1        2732-3949     Accessory Gene   Target Site Selection    +
tniR          Tn5053.1        4010-4624     Accessory Gene   Resolvase                +
merE; urf-1   Tn5053.1        4677-4913     Passenger Gene   Heavy Metal Resistance   -
merD          Tn5053.1        4910-5275     Passenger Gene   Heavy Metal Resistance   -
merA          Tn5053.1        5292-6938     Passenger Gene   Heavy Metal Resistance   -
merF          Tn5053.1        6935-7180     Passenger Gene   Heavy Metal Resistance   -
merP          Tn5053.1        7183-7458     Passenger Gene   Heavy Metal Resistance   -
merT          Tn5053.1        7474-7824     Passenger Gene   Heavy Metal Resistance   -
merR          Tn5053.1        7896-8330     Passenger Gene   Heavy Metal Resistance   +
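
The coordinates in this summary appear to be 1-based and inclusive, which is consistent with the gene lengths listed in the ORF details (for example 142-1824 for a 1683 bp tniA). A minimal sketch of that arithmetic follows; the helper names are hypothetical, and the protein length assumes the annotated range includes the stop codon.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Expected number of amino acids for a coding range.

    Assumption: the range is a whole number of codons and, by default,
    the final codon is a stop codon that is not part of the protein.
    """
    n = gene_length(start, end)
    if n % 3 != 0:
        raise ValueError("coding range is not a multiple of 3 bp")
    codons = n // 3
    return codons - 1 if includes_stop else codons


# Coordinates copied from the ORF summary of this record.
orfs = {"tniA": (142, 1824), "tniB": (1827, 2735), "merR": (7896, 8330)}

for name, (start, end) in orfs.items():
    print(name, gene_length(start, end), protein_length(start, end))
# tniA 1683 560
# tniB 909 302
# merR 435 144
```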

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA Tn5053.1 1683 142-1824 +
Class:   Transposase
Transposase Chemistry:   DDE
Comment:   can be extended upstream by 12 amino acids
Protein Sequence:  
MATDTAPITE HGVATLPEQA WERARRRAEI IGPLAQSETV GHEAADAAAQ ALGLSRRQVY VLIRRARLGS GLVTDLALGQ SSGGKGKGRL PESVERIIRE
LLQKRFLTKQ KRSLAAFHRE VVRACKLQKL RVPARNTVAL RIAGLDPREV THRREGQDAA RDLQGVGGVP PPVSAPLEQV QIDHTVIDLI VVDERDRQPI
GRPYLTLAID VFTRCVVGMV VTLEAPSAVS VGLCLVHAAC DKRPWLEGLN VEMDWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGE YASEKMAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAITRTGV PTVITRATAF LVDFLPIIRR
TLTRTGFVID HIHYYADALK PWIARRDRLP AFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVTTAQKAT RKARRDADRR QHLKATAPPV KTTPPPDADM ADPQADNQPP AKPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB TniB Tn5053.1 909 1827-2735 +
Class:   Accessory Gene
Transposase Chemistry:   Serine
Comment:   homologous to TnsC protein of Tn7; putative ATP-binding protein
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE TLYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASAD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLVIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSSIAT LDMARYLLTR SEGTIGELTH LLMAAALAAV ESGEEAINHR TLSMADYTGP SERRRQFERE
LM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniQ TniQ Tn5053.1 1218 2732-3949 +
Class:   Accessory Gene
Sub Class:   Target Site Selection
Protein Sequence:  
VKPAPRWPLH PAPREGEALS SWLNRVALCY HIEVSELLEH DLGHGQVDDL DTAPPLALLA MLSQRSGIEL NRLRCMSFAG WVPWLLDSLD DQIPDALETY
AFQLSVLLPT HRRKTRSITS WRAWLPSQPI HRACPLCLND PENQAVLLAW KLPLMLSCPL HGCRLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW
QALTTGHLEL PRRRIHAGLW FRLLRTLLDE LNTPLSACGT YAGYLRQVWE GCGHPLRAGQ SLWRPYETLN PAVRLQMLEA AATAISLIEV RYISPPGEQA
KLFWSEPQTG FTSGLPTKAP KPEPINHWQR AVQAIDEAII EARHNPETAR SLFALASYGR RDPASLEQLR ATFAKEGIAT EFLSHYEPDG SFACLRQNDG
LSDKF

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniR TniR Tn5053.1 615 4010-4624 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   also called TniC
Protein Sequence:  
MLIGYMRVSK ADGSQATDLQ RDALIAAGVD PVHLYEDQAS GMREDRPGLT SCLKALRTGD TLVVWKLDRL GRDLRHLINT VHDLTGRGIG LKVLTGHGAA
IDTTTAAGKL VFGIFAALAE FERELIAERT IAGLASARAR GRKGGRPFKM TAAKLRLAMA AMGQPETKVG DLCQELGVTR QTLYRHVSPK GELRPDGEKL
LSRI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE; urf-1 MerE Tn5053.1 237 4677-4913 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   Broad-spectrum mercury transporter
Protein Sequence:  
MNSPERLPSE THKPITGYLW GALAVLTCPC HLPILAIVLA GTTAGAFIGE YWGIAALTLT GLFVLSVTRL LRAFKDRS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn5053.1 366 4910-5275 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MSAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLAVLRQ FVERRREALA
DLEVQLATMP TEPAQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn5053.1 1647 5292-6938 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercuric ion reductase
Target:   Mercury
Protein Sequence:  
MTEITVNGMT CTSCATHVKD ALEKIPGVNA AVVSYPESRA QVMADTAVSH NQLLAAIAAL GYQGSIRVGD FKDEPKIRDA LEGAGLHIAI IGSGGAAMAA
ALKAVEQGAT VTLIERGTIG GTCVNIGCVP SKIMIRAAHI AHLRRESPFD GGIAATVPAI DRSKLLAQQQ ARVDELRHAK YEGILDGNPA ITVLHGEARF
KDDQSLVVRL NEGGEREVTF DRCLVATGAS PAVPPIPGLK ESPYWTSTEA LVSDTIPARL AVIGSSVVAL ELAQAFARLG SQVTILARST LFFREDPAIG
EAVTAAFRAE GIEVLEHTQA SQVAHVNGEF VLTTGHGELR ADKLLVATGR APNTRSLALD AAGVTVNAQG AIVIDQGMRT SNPNIYAAGD CTDQPQFVYV
AAAAGTRAAI NMTGGDAALN LTAMPAVVFT DPQVATVGYS EAEAHHDGIE TDSRTLTLDN VPRALANFDT RGFIKLVIEE GSGRLIGVQA VAPEAGELIQ
TAVLAIRNRM TVQELADQLF PYLTMVEGLK LAAQTFNKDV KQLSCCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merF MerF Tn5053.1 246 6935-7180 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercuric ion transport protein
Protein Sequence:  
MKDPKTLLRV SIIGTTLVAL CCFTPVLVIL LGVVGLSALT GYLDYVLLPA LAIFIGLTIY AIQRKRQADA CCTPKFNGVK K

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn5053.1 276 7183-7458 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   periplasmic mercuric ion binding protein
Protein Sequence:  
MKKLLSALAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVDGV SKVDVTFETR EAVVTFDDAK TSVQKLTKAT EDAGYPSSVK N

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn5053.1 351 7474-7824 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercury ion transport protein
Protein Sequence:  
MSEPQNGRGA LFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLTVLEPY RPIFIGVALV ALFFAWRRIY RPSAACKPGE VCAIPQVRAT YKLIFWGVAV
LVLVALGFPY VVPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn5053.1 435 7896-8330 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFARTAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGETD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVRERM
ADLARMEAVL SDLVCACHAR KGNVSCPLIA SLQGKKEPRS ADAV
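
Several of the mer genes above are annotated on the - strand, so their coding sequence is the reverse complement of the listed top-strand range. The following is a minimal sketch of that extraction, assuming 1-based inclusive coordinates; `extract_orf` and its `offset` parameter are hypothetical names, and the fragment is top-strand positions 4901-4920 copied from the DNA sequence of this record.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_orf(top_strand: str, start: int, end: int, strand: str,
                offset: int = 1) -> str:
    """Extract a feature from a top-strand string.

    start/end are 1-based inclusive coordinates; offset is the coordinate
    of top_strand[0], so a fragment of the element can be used directly.
    """
    sub = top_strand[start - offset : end - offset + 1]
    return reverse_complement(sub) if strand == "-" else sub


# Top-strand positions 4901-4920 of the sequence listed in this record.
fragment = "CGGGGCTGTTCATGGCAGAC"

# merE (4677-4913, - strand): its first codon lies at 4911-4913.
print(extract_orf(fragment, 4911, 4913, "-", offset=4901))  # ATG

# merD (4910-5275, - strand): its last codon lies at 4910-4912.
print(extract_orf(fragment, 4910, 4912, "-", offset=4901))  # TGA
```

With the full 8448 bp sequence in hand, the default `offset=1` lets the table coordinates be passed unchanged.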

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t4 Tn5053.1 110-128 TCAATACTCG TGTGCACCA
IR Tn21-like Tn5053.1 8347-8384 GAATCGCACG AAATAAAAGG CAAAAGACTC CGCTGAGG
repeat i4 Tn5053.1 8357-8375 AAATAAAAGG CAAAAGACT
repeat i2 Tn5053.1 8399-8417 CCGTGCGGTC GACTTTTGA
IRi Tn5053.1 8421-8448 AAGCACGACA GCGGAAGACT TTCACTGT
