Transposon
Name: Tn4378
Family: Tn3        Group: Tn21
Evidence of Transposition: no
 Host     

Host Organism: Cupriavidus metallidurans CH34
Molecular Source: plasmid pMOL28
Place of Origin: Belgium

 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp): GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG
IRR (Length: 38 bp): GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG
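
The IRR is listed in the same orientation as the IRL, which suggests it is reported as the reverse complement of the right end of the top strand. A minimal Python sketch of that check follows. The names IRL, RIGHT_END_TOP_STRAND and reverse_complement are illustrative and not part of the record; the right-end string is the last 38 bases of the DNA sequence in this entry.

```python
# First 38 bases of the element (IRL) and last 38 bases as they
# appear on the top strand of the listed DNA sequence.
IRL = "GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG"
RIGHT_END_TOP_STRAND = "CTTAGCGTGCTTTATTTTCCGTTTTCTGAGACGACCCC"

# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


irr = reverse_complement(RIGHT_END_TOP_STRAND)
mismatches = sum(a != b for a, b in zip(IRL, irr))

print(irr)                        # GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG
print("mismatches:", mismatches)  # mismatches: 0
```

With the strings above, the two 38 bp repeats are identical, so the element is flanked by perfect terminal inverted repeats.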

 Sequence     
DNA Sequence (Length: 8232 bp)
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCCGAAC CTGCCAAGCT TGCTCCACCC TGTAGTGACG CGATCAGCGG GCAGGAAACG 100
TTCCCCCTTC GCGCATGGCA GGCGCACACC AACTCAGACA GCACGGCCTC CATGCGCGCC AGGTCAGCCA TTTTCTCGCG CACGTCCTTG AGCTTGTGCT 200
CGGCCAGACT GCTGGCTTCC TCGCAATGGG TGCCATCCTC CAGCCGCAGC AGCTCGGCGA TCTCATCCAG GCTGAAGCCC AGCCGCTGGG CTGATTTCAC 300
GAAGCGCACC CGCGTTACAT CCGCCTCGCC ATAGCGGCGG ATGCTGCCAT AGGGCTTGTC AGGCTCCAGC AACAAGCCCT TGCGCTGATA GAAACGGATG 400
GTCTCCACAT TGACCCCGGC CGCCTTGGCG AAAACGCCAA TGGTCAGGTT CTCCAAATTG TTTTCCATAT CGCTTGACTC CGTACATGAG TACGGAAGTA 500
AGGTTACGCT ATCCAATTTC AATTCGAAAG GACAAGCGCA TGTCTGAACC AAAAACCGGG CGCGGCGCGC TCTTCACTGG AGGGCTTGCC GCCATCCTCG 600
CCTCGGCTTG CTGCCTCGGG CCGTTGGTTC TGATCGCCTT GGGGTTCAGC GGCGCTTGGA TCGGCAACTT GGCGGTGTTG GATCCCTATC GCCCCATCTT 700
TATCGGCGTG GCGCTGGTGG CGTTGTTCTT CGCCTGGCGG CGCATCTACC GGCAGGCAGC GGCCTGCAAA CCGGGTGAGG TCTGCGCGAT TCCCCAAGTG 800
CGAGCTACTT ACAAGCTCAT TTTCTGGATC GTGGCCGCGC TGGTTCTGGT CGCGCTCGGA TTTCCCTACG TCATGCCATT TTTCTACTGA TCGGAGTTCA 900
CCATGAAGAA ACTGTTTGCC TCCCTCGCCC TCGCCGCCGT TGTTGCCCCC GTCTGGGCCG CCACCCAGAC CGTCACGCTG TCCGTACCGG GCATGACCTG 1000
CTCCGCCTGC CCGATCACTG TCAAGAAGGC GATTTCCAAG GTCGAAGGCG TCAGCAAAGT TGACGTGACT TTCGAGACAC GCCAAGCGGT CGTCACCTTC 1100
GACGATGCCA AGACCAGCGT GCAGAAGCTG ACCAAGGCAA CCGCAGACGC GGGCTATCCG TCCAGCGTCA AGCAGTGAGT CACTGAAAAC GGCACCGCAG 1200
CACAACGGAC GTCATTGTCT GGCGCCACAA ACGATAAAGG ATCTGTTGCA TGACCCATCT AAAAATCACC GGCATGACTT GCGACTCGTG CGCGGCGCAC 1300
GTCAAGGAAG CGCTGGAAAA AGTGCCAGGC GTGCAGTCGG CGCTGGTGTC CTATCCGAAG GGCACAGCGC AACTCGCCAT CGTGCCGGGC ACATCGCCGG 1400
ACGCGCTGAC TGCCGCCGTG GCCGGACTGG GCTACAAGGC AACGCTAGCC GATGCGCCAC TGGCGGACAA CCGCGTCGGA CTGCTCGACA AGGTGCGGGG 1500
ATGGATGGCC GCCGCCGAAA AGCACAGTGG CAACGAGCCC CCGGTGCAGG TAGCGGTCAT TGGCAGCGGT GGAGCCGCGA TGGCGGCGGC GCTGAAGGCC 1600
GTCGAGCAAG GCGCGCAGGT CACGCTGATC GAGCGCGGCA CCATCGGCGG CACCTGCGTC AATGTCGGCT GTGTGCCGTC CAAGATCATG ATCCGCGCCG 1700
CCCACATCGC CCATCTGCGC CGGGAAAGCC CGTTCGATGG CGGTATTGCG GCAACTGTGC CTACGATTGA CCGCAGTAAG CTGCTGGCCC AGCAGCAGGC 1800
CCGCGTCGAC GAACTGCGGC ACGCCAAGTA CGAAGGCATC CTGGGCGGTA ATCCGGCCAT CACCGTTGTG CACGGTGAGG CGCGCTTCAA GGACGACCAG 1900
AGCCTTACCG TCCGTTTGAA CGAGGGTGGC GAGCGCGTCG TGATGTTCGA CCGCTGCCTG GTCGCCACGG GTGCCAGCCC GGCGGTCCCG CCGATTCCGG 2000
GCTTGAAAGA GTCACCCTAC TGGACTTCCA CCGAGGCCCT GGCGAGCGAC ACCATTCCCG AACGCCTTGC CGTAATCGGC TCGTCGGTGG TGGCGCTGGA 2100
GCTGGCGCAA GCCTTTGCCC GGCTGGGCAG CAAGGTCACG GTCCTGGCGC GCAATACCTT GTTCTTCCGT GAAGACCCGG CCATCGGCGA GGCGGTGACA 2200
GCCGCTTTCC GTGCCGAGGG CATCGAGGTG CTGGAGCACA CGCAAGCCAG CCAGGTCGCC CATATGGACG GTGAATTCGT GCTGACCACC ACGCACGGTG 2300
AATTGCGCGC CGACAAACTG CTGGTTGCCA CCGGTCGGAC ACCGAACACG CGCAGCCTCG CGCTGGACGC AGCGGGGGTC ACTGTCAATG CGCAAGGTGC 2400
CATCGCCATC GACCAAGGCA TGCGCACGAG CAACCCGAAC ATCTACGCGG CCGGCGACTG CACCGACCAG CCGCAGTTCG TCTATGTGGC GGCAGCGGCC 2500
GGCACCCGTG CCGCGATCAA CATGACCGGC GGCGATGCGG CGCTCGACCT GACCGCAATG CCGGCCGTGG TGTTCACCGA TCCGCAAGTG GCGACCGTGG 2600
GCTACAGCGA GGCGGAAGCC CACCACGACG GGATCGAGAC CGACAGCCGC ACCTTGACCT TGGACAACGT GCCGCGTGCG CTCGCCAACT TCGACACACG 2700
CGGCTTCATC AAGTTGGTTA TCGAGGAAGG CAGCCATCGG CTGATCGGCG TACAGGCGGT CGCGCCGGAA GCGGGTGAAC TGATCCAGAC GGCGGCTCTG 2800
GCCATTCGCA ACCGCATGAC GGTGCAGGAA CTGGCCGACC AGTTGTTCCC CTACCTGACG ATGGTCGAGG GGTTGAAGCT CGCGGCGCAG ACCTTCAACA 2900
AGGATGTGAA GCAGCTTTCC TGCTGCGCCG GGTGAGAAAA AGGAGGTGTT CAATGAACGC CTACACGGTG TCCCGGCTGG CTCTTGATGC CGGGGTGAGC 3000
GTGCATATCG TGCGCGACTA CCTGCTGCGC GGATTGCTGC GCCCGGTGGC GTGCACACCA GGCGGCTACG GCTTGTTCGA TGACGCCGCC TTGCAACGGC 3100
TGTGCTTCGT GCGGGCGGCC TTCGAGGCGG GCATCGGCCT CGACGCGCTG GCGCGGCTGT GCCGGGCGCT GGATGCGGCG GACGGCGACG AAGCGGCCGC 3200
GCAGCTTGCC CTGCTGCGTC AGTTCGTCGA GCGTCGGCGC GAAGCGTTGG CCGATCTGGA AGTGCAGTTG GCCACCCTGC CGACCGAGCC GGCACAGCAC 3300
GCGGAGAGTC TGCCATGAAC AACCCCGAGC GCTTGCCGTC CGAGACGCAC AAACCGATCA CCGGCTACCT GTGGGGCGGA CTGGCTGTGC TGACTTGCCC 3400
CTGCCACCTG CCCATCCTCG CTGTCGTGCT GGCCGGCACA ACCGCCGGTG CTTTCCTCGG CGAGCATTGG GTCATCGCGG CGCTCGGTTT GACCGGCCTG 3500
TTCCTTCTGT CCCTGTCGCG GGCGTTGCGG GCATTCAGGG AAAGAGAATG AGCGCTTTCC GGCCGGATGG ATGGACGACG CCGGAACTGG CCCAAGCGGT 3600
CGAGCGCGGG CAGCTTGAAC TGCACTACCA GCCCGTCGTC GATCTGCGCA GTGGTGGGAT TGTCGGCGCG GAAGCCCTGT TGCGCTGGCG TCATCCGACG 3700
CTTGGACTAT TGCCACCGGG CCAGTTCCTG CCCGTGGTCG AATCGTCCGG CCTGATGCCT GAAATCGGCG CTTGGGTGCT GGGCGAAGCC TGCCGCCAGA 3800
TGCGTGACTG GCGAATGCTG GCATGGCGAC CGTTCCGGCT GGCCGTCAAT GTTTCGGCGA GCCAAGTGGG ACCGGACTTC GACGGGTGGG TAAAGGGCGT 3900
GCTGGCTGAT GCCGAGTTGC CCGCCGAGTA TCTCGAAATC GAGCTGACCG AATCGGTCGC GTTTGGTGAT CCGGCGATCT TCCCCGCCCT GGACGCCTTG 4000
CGGCAGATCG GTGTGCGCTT CGCCGCCGAT GACTTCGGGA CGGGGTATTC CTGTCTGCAA CATCTGAAGT GCTGCCCAAT CAGCACGCTC AAGATCGACC 4100
AATCGTTTGT CGCCGGGCTC GCCAACGACC GCCGCGACCA AACCATCGTG CACACCGTGA TTCAGCTTGC GCACGGGCTG GGCATGGATG TGGTGGCTGA 4200
AGGCGTGGAA ACATCGGCGA GTCTTGATCT ATTGCGACAA GCGGACTGCG ACACAGGACA AGGCTTCCTG TTCGCGAAGC CAATGCCGGC GGCGGCATTC 4300
GCCGTCTTCG TCAGTCAATG GAGGGGTGCC ACCATGAATG CAAGTGACTC GACCACCACC AGTTGCTGCG TGTGCTGCAA GGAAATCCCG CTCGATGCCG 4400
CCTTCACCCC GGAAGGCGCG GAATACGTCG AGCACTTCTG CGGGTTGGAG TGTTATCAAC GCTTCGAAGC GCGTGCCAAG ACAGGGAACG AAACCGATGC 4500
CGATCCGAAC GCCTGCGACT CGCTACCGTC AGATTGAGGC ATACCCTAAC CGGATGTCAG GGTAGACTGC CTCACAACGT CAGAATAGAG TCGGTTGTGT 4600
TATTTATTGA CACTAGCTGA AAAAGGTCAT AGATTTCTTC CTGACATTTT CGTCCAGGGA GGCATCTTGC AGGGTCAACG CATCGGCTAC GTCCGGGTCA 4700
GCAGCTTCGA CCAGAACCCG GAACGGCAAC TTGAACACGT CGAAGTCGGC AAGGTGTTCA CCGACAAGGC GTCGGGCAAG GACACCCAGC GGCCCGAGCT 4800
TGATTCGCTG CTGGCCTTCG TGCGCGAAGG CGACACCGTG GTGGTTCACA GCATGGATCG CTTGGCGCGC AACCTCGATG ACTTGCGCCG CCTCGTGCAA 4900
AAGCTCACCA AGCGCGGCGT GCGCATCGAG TTCGTCAAGG AGAGCCTGAC CTTCACCGGC GAGGATTCGC CGATGGCGAA CCTGATGCTG TCGGTCATGG 5000
GGGCGTTCGC TGAATTCGAG CGGGCCTTGA TCCGCGAGCG GCAGAGGGAA GGCATCGCGC TCGCCAAGCA ACGCGGAGCC TACCGGGGCC GCAAGAAAGC 5100
GCTGTCGCCC GAACAGGTAG CCGATCTGCG GCAGCGGGCC GCCGCCGGCG AACAAAAAGC GAAGCTGGCC CGCGAGTTTG GTGTCAGCCG GGAGACCCTG 5200
TATCAATACT TGAGAGCGGA TCAGTAAATA TGCCACGTCG TTCCATTCTG TCCGCCGCCG AGCGGGAAAG CCTGTTGGCG TTGCCGGATA CCAAGGACGA 5300
CTTGATCCGA TACTACACGT TCAGCGATAC CGACCTCTCC ATCATCCGGC AACGGCGCGG GCCTGCGAAC CGCTTGGGCT TTGCAGTTCA GCTCTGCTAC 5400
CTGCGCTTTC CCGGCATCCT CCTTGGCGTC GATGAGCCGC CGTTTCCGCC CTTGCTGAAA CTGGTCGCCG ACCAGCTCAA GGTCAGTGTC GAAAGCTGGG 5500
GCGAGTACGG GCAGCGGGAG CAGACCCGGC GCGAGCATCT GGTCGAGTTG CAAACGGTGT TCGGCTTCCA GCCCTTCACC ATGAGCCACT ACCGGCAGGC 5600
CGTCCACACG CTGACCGAGC TGGCCATGCA AACCGACAAG GGCATTGTGC TGGCCAGCGC CTTGATCGAG CATCTGCGGC GGCAGTCGGT CATTCTGCCT 5700
GCCCTCAACG CCGTCGAGCG GGCGAGCGCC GAAGCGATCA CCCGCGCCAA CCGGCGCATC TACTACGCCT TGGCCGAACC ACTGTCGGAC GCGCATCGCC 5800
GCCGCCTCGA CGATCTGCTC AAGCGCCGGG ACAACGGCAA GACGACTTGG CTGGCCTGGC TGCGCCAGTC ACCCGTCAAG CCCAATTCGC GGCATATGCT 5900
GGAGCACATC GAACGACTCA AGGCATGGCA GGCGCTCGAT CTGCCTACCG GCATCGAGCG GCTGATCCAC CAAAACCGGC TGCTCAAGAT CGCCCGCGAG 6000
GGCGGCCAGA TGACACCCGC CGACCTGGCC AAGTTCGAGG CGCAGCGGCG CTACGCGACC CTGGTGGCGC TCGCCATTGA AGGCATGGCC ACCGTCACCG 6100
ACGAAATCAT CGACCTGCAC GACCGCATCC TGGGCAAGCT GTTCAACGCC GCCAAGAACA AGCATCAGCA GCAGTTCCAG GCGTCCGGCA AGGCGATCAA 6200
CGCCAAGGTG CGGCTGTTCG GGCGTATCGG TCAGGCACTG ATCGAGGCCA AGCAATCGGG CCGCGATCCG TTTGCCGCCA TCGAGGCCGT CATGTCCTGG 6300
GACGCCTTCG CCGAGAGCGT CACCGAAGCG CAGAAGCTCG CGCAGCCCGA GGACTTCGAT TTCCTGCACC GCATCGGCGA GAGCTACGCC ACGCTGCGTC 6400
GCTACGCGCC GGAATTCCTC GCCGTGCTCA AGCTGCGGGC CGCGCCCGCT GCCAAGGATG TGCTGGAGGC CATCGAAGTG CTGCGCAACA TGAACAGCGA 6500
CAACGCCCGC AAGGTGCCCG CCGACGCGCC AACCGATTTC ATCAAGCCGC GCTGGCAGAA GCTGGTGATG ACCGACACCG GCATCGATCG GCGCTACTAC 6600
GAACTGTGCG CGCTGTCGGA GATGAAAAAC GCCCTGCGCT CCGGCGACAT CTGGGTGCAG GGATCGCGCC AGTTCAAGGA CTTCGAGGAC TACCTGGTGC 6700
CACCCGCGAA ATTCGCCAGC CTCAAGCAGG CCAGCGAATT GCCGCTGGCC GTGGCCACCG ATTGCGACCA GTACCTGCAT GACCGGCTGA CGCTGCTGGA 6800
AACGCAGCTC GCCACCGTCA ACCGCATGGC GCTGGCCAAC GAGCTGCCGG ACGCCATCAT CACGGAGTCG GGCCTGAAGA TCACGCCGCT CGATGCGGCG 6900
GTGCCCGACA CCGCGCAGGC GCTGATCGAC CAGACAGCAA TGATCCTGCC GCACGTCAAG ATCACCGAAC TGCTGCTGGA GGTAGACGAA TGGACAGGCT 7000
TCACCCGGCA CTTCGCGCAC CTGAAATCGG GCGACCTGGC CAAGGACAAG AACCTGCTGC TGACCACGAT CCTGGCCGAC GCCATCAACC TGGGTCTGAC 7100
CAAGATGGCG GAGTCCTGCC CCGGAACGAC CTACGCCAAG CTCGCCTGGC TCCAAGCCTG GCATACCCGC GACGAAACCT ATTCGTCGGC GCTGGCCGAA 7200
CTGGTCAATG CGCAGTTCCG GCATCCCTTC GCCGAGCACT GGGGCGACGG CACCACGTCA TCGTCGGACG GCCAGAATTT CCGAACCGGC AGCAAGGCCG 7300
AGAGCACTGG CCACATCAAC CCGAAATATG GCAGCAGTCC TGGGCGGACT TTCTACACCC ACATCTCCGA CCAGTACGCG CCATTCCACA CCAAGGTGGT 7400
CAATGTCGGC GTGCGCGACT CGACCTATGT GCTCGACGGC TTGCTGTACC ACGAGTCCGA CCTGCGCATC GAGGAGCACT ACACCGATAC GGCAGGATTC 7500
ACCGATCATG TATTTGGCCT GATGCACCTG CTGGGCTTCC GCTTTGCGCC GCGCATCCGC GACCTGGGCG ACACCAAGCT GTTCATCCCC AAGGGCGACA 7600
CCGTCTACGA CGCGCTCAAG CCGATGATTA GCAGCGACAG ACTGAACATC AAGGCTATTC GCGCCCATTG GGATGAAATT CTACGGCTGG CCACGTCGAT 7700
CAAGCAGGGC ACGGTGACGG CTTCGCTGAT GCTGCGCAAG CTCGGCAGCT ATCCGCGCCA GAACGGCCTG GCCGTGGCCC TGCGCGAGCT GGGGCGTATC 7800
GAGCGCACGC TGTTCATCCT GGATTGGTTG CAAAGCGTGG AGCTGCGCCG TCGCGTGCAC GCTGGGCTGA ACAAGGGCGA AGCCCGCAAT GCGCTGGCCC 7900
GCGCCGTGTT CTTCAACCGT CTGGGTGAAA TCCGCGACCG CAGCTTTGAG CAGCAGCGCT ACCGTGCCAG CGGCCTCAAC CTGGTGACGG CGGCCGTCGT 8000
GCTATGGAAC ACGGTCTATC TGGAACGGGC TGCGCACGCG CTGCGGGGCA ACGGCCACGC CGTCGATGAC GCGCTGTTGC AGTACCTGTC GCCGCTCGGC 8100
TGGGAGCACA TCAACCTGAC CGGCGATTAC CTCTGGCGCA GCAGCGCCAA GATCGGCGCG GGCAAGTTCA GGCCGCTGCG GCCGTTGCAA CCTGCTTAGC 8200
GTGCTTTATT TTCCGTTTTC TGAGACGACC CC
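
The listing above uses 10-base blocks with a running coordinate at the end of each full line. As a rough sketch of how such a listing could be turned back into a plain string, the Python below parses the ruler line and the first two sequence lines of this entry. The function name parse_blocked_sequence and the RAW sample are assumptions made for illustration, not part of any database tooling.

```python
# Ruler line plus the first two sequence lines, copied from the listing.
RAW = """\
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCCGAAC CTGCCAAGCT TGCTCCACCC TGTAGTGACG CGATCAGCGG GCAGGAAACG 100
TTCCCCCTTC GCGCATGGCA GGCGCACACC AACTCAGACA GCACGGCCTC CATGCGCGCC AGGTCAGCCA TTTTCTCGCG CACGTCCTTG AGCTTGTGCT 200
"""


def parse_blocked_sequence(raw: str) -> str:
    """Join 10-base blocks and check each line's trailing coordinate."""
    blocks = []
    total = 0
    for line in raw.splitlines():
        tokens = line.split()
        if not tokens or tokens[0].startswith("-"):
            continue  # skip blank lines and the ruler line
        # A trailing all-digit token is the running coordinate.
        coordinate = int(tokens.pop()) if tokens[-1].isdigit() else None
        blocks.extend(tokens)
        total += sum(len(t) for t in tokens)
        if coordinate is not None and coordinate != total:
            raise ValueError(f"expected {coordinate} bases, counted {total}")
    return "".join(blocks).upper()


sequence = parse_blocked_sequence(RAW)
print(len(sequence))  # 200
```

A final partial line without a trailing coordinate, such as the last line of this listing, is accepted as-is because the coordinate check is skipped when no all-digit token is present.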

 Recombination Sites     

Name          Coordinates  Length  Sequence
res_site_I    4527-4557    31      CGTCAGATTG AGGCATACCC TAACCGGATG T
res_site_II   4578-4612    35      CGTCAGAATA GAGTCGGTTG TGTTATTTAT TGACA
res_site_III  4615-4646    32      AGCTGAAAAA GGTCATAGAT TTCTTCCTGA CA
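
The third column of the table above is consistent with the length of each site in base pairs (end minus start plus one). The short Python sketch below checks this against the listed sequences and reports the spacing between adjacent sites. The dictionary layout and the helper name site_spacings are illustrative assumptions.

```python
# name -> (start, end, sequence with block spaces removed)
RES_SITES = {
    "res_site_I": (4527, 4557, "CGTCAGATTGAGGCATACCCTAACCGGATGT"),
    "res_site_II": (4578, 4612, "CGTCAGAATAGAGTCGGTTGTGTTATTTATTGACA"),
    "res_site_III": (4615, 4646, "AGCTGAAAAAGGTCATAGATTTCTTCCTGACA"),
}


def site_spacings(sites):
    """Return the number of bases between consecutive sites (1-based, inclusive coordinates)."""
    ordered = sorted(sites.items(), key=lambda item: item[1][0])
    return [
        (left, right, s2 - e1 - 1)
        for (left, (_, e1, _)), (right, (s2, _, _)) in zip(ordered, ordered[1:])
    ]


for name, (start, end, seq) in RES_SITES.items():
    span = end - start + 1
    print(name, span, span == len(seq))

print(site_spacings(RES_SITES))
# [('res_site_I', 'res_site_II', 20), ('res_site_II', 'res_site_III', 2)]
```

All three sites fall between the end of urfM (4537) and the start of tnpR (4667), with res_site_I overlapping the last 11 bp of urfM.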

 ORFs     
ORF Summary
Gene Name  Associated TE  Coordinates  Class           Sub Class               Orientation
merR       Tn4378         34-468       Passenger Gene  Heavy Metal Resistance  -
merT       Tn4378         540-890      Passenger Gene  Heavy Metal Resistance  +
merP       Tn4378         903-1178     Passenger Gene  Heavy Metal Resistance  +
merA       Tn4378         1250-2935    Passenger Gene  Heavy Metal Resistance  +
merD       Tn4378         2953-3318    Passenger Gene  Heavy Metal Resistance  +
merE       Tn4378         3315-3551    Passenger Gene  Heavy Metal Resistance  +
urfM       Tn4378         3548-4537    Passenger Gene  Other                   +
tnpR       Tn4378         4667-5227    Accessory Gene  Resolvase               +
tnpA       Tn4378         5230-8199    Transposase                             +
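
The coordinates in the summary can be used to derive ORF lengths and the spacing or overlap between neighbouring genes. The Python below is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and that each ORF includes its stop codon. The list name ORFS and the helper names are illustrative.

```python
# (gene, start, end, strand) taken from the ORF summary.
ORFS = [
    ("merR", 34, 468, "-"),
    ("merT", 540, 890, "+"),
    ("merP", 903, 1178, "+"),
    ("merA", 1250, 2935, "+"),
    ("merD", 2953, 3318, "+"),
    ("merE", 3315, 3551, "+"),
    ("urfM", 3548, 4537, "+"),
    ("tnpR", 4667, 5227, "+"),
    ("tnpA", 5230, 8199, "+"),
]


def orf_length(start: int, end: int) -> int:
    """Length in bp for 1-based inclusive coordinates."""
    return end - start + 1


def neighbour_gaps(orfs):
    """Bases between consecutive ORFs; negative values mean an overlap."""
    return {
        f"{a[0]}-{b[0]}": b[1] - a[2] - 1
        for a, b in zip(orfs, orfs[1:])
    }


for gene, start, end, strand in ORFS:
    length = orf_length(start, end)
    # Expected residues if the ORF ends with a stop codon.
    print(gene, strand, length, length // 3 - 1)

print(neighbour_gaps(ORFS))
```

Under these assumptions merD/merE and merE/urfM each overlap by 4 bp (gap of -4), and tnpR and tnpA are separated by 2 bp.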

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn4378 435 34-468 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   activator-repressor of mer operon
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLLEPDKPY GSIRRYGEAD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVREKM
ADLARMEAVL SELVCACHAR RGNVSCPLIA SLQGGASLAG SAMP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn4378 351 540-890 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   cytosolic mercuric ion transport protein
Target:   Mercury
Protein Sequence:  
MSEPKTGRGA LFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLAVLDPY RPIFIGVALV ALFFAWRRIY RQAAACKPGE VCAIPQVRAT YKLIFWIVAA
LVLVALGFPY VMPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn4378 276 903-1178 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Protein Sequence:  
MKKLFASLAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVEGV SKVDVTFETR QAVVTFDDAK TSVQKLTKAT ADAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn4378 1686 1250-2935 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercuric ion reductase
Target:   Mercury
Protein Sequence:  
MTHLKITGMT CDSCAAHVKE ALEKVPGVQS ALVSYPKGTA QLAIVPGTSP DALTAAVAGL GYKATLADAP LADNRVGLLD KVRGWMAAAE KHSGNEPPVQ
VAVIGSGGAA MAAALKAVEQ GAQVTLIERG TIGGTCVNVG CVPSKIMIRA AHIAHLRRES PFDGGIAATV PTIDRSKLLA QQQARVDELR HAKYEGILGG
NPAITVVHGE ARFKDDQSLT VRLNEGGERV VMFDRCLVAT GASPAVPPIP GLKESPYWTS TEALASDTIP ERLAVIGSSV VALELAQAFA RLGSKVTVLA
RNTLFFREDP AIGEAVTAAF RAEGIEVLEH TQASQVAHMD GEFVLTTTHG ELRADKLLVA TGRTPNTRSL ALDAAGVTVN AQGAIAIDQG MRTSNPNIYA
AGDCTDQPQF VYVAAAAGTR AAINMTGGDA ALDLTAMPAV VFTDPQVATV GYSEAEAHHD GIETDSRTLT LDNVPRALAN FDTRGFIKLV IEEGSHRLIG
VQAVAPEAGE LIQTAALAIR NRMTVQELAD QLFPYLTMVE GLKLAAQTFN KDVKQLSCCA G

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn4378 366 2953-3318 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   secondary regulatory protein
Target:   Mercury
Protein Sequence:  
MNAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLALLRQ FVERRREALA
DLEVQLATLP TEPAQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn4378 237 3315-3551 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Protein Sequence:  
MNNPERLPSE THKPITGYLW GGLAVLTCPC HLPILAVVLA GTTAGAFLGE HWVIAALGLT GLFLLSLSRA LRAFRERE

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM UrfM Tn4378 990 3548-4537 +
Class:   Passenger Gene
Sub Class:   Other
Function:   possible diguanylate phosphodiesterase
Sequence Family:  EAL (Pfam:PF00563)||DUF3330 (Pfam:PF11809)
Comment:   similar to urfM from E. coli
Protein Sequence:  
MSAFRPDGWT TPELAQAVER GQLELHYQPV VDLRSGGIVG AEALLRWRHP TLGLLPPGQF LPVVESSGLM PEIGAWVLGE ACRQMRDWRM LAWRPFRLAV
NVSASQVGPD FDGWVKGVLA DAELPAEYLE IELTESVAFG DPAIFPALDA LRQIGVRFAA DDFGTGYSCL QHLKCCPIST LKIDQSFVAG LANDRRDQTI
VHTVIQLAHG LGMDVVAEGV ETSASLDLLR QADCDTGQGF LFAKPMPAAA FAVFVSQWRG ATMNASDSTT TSCCVCCKEI PLDAAFTPEG AEYVEHFCGL
ECYQRFEARA KTGNETDADP NACDSLPSD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn4378 561 4667-5227 +
Class:   Accessory Gene
Sub Class:   Resolvase
Function:   resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MQGQRIGYVR VSSFDQNPER QLEHVEVGKV FTDKASGKDT QRPELDSLLA FVREGDTVVV HSMDRLARNL DDLRRLVQKL TKRGVRIEFV KESLTFTGED
SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKALSPEQ VADLRQRAAA GEQKAKLARE FGVSRETLYQ YLRADQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn4378 2970 5230-8199 +
Class:   Transposase
Function:   transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MPRRSILSAA ERESLLALPD TKDDLIRYYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGILLGVDEP PFPPLLKLVA DQLKVSVESW GEYGQREQTR
REHLVELQTV FGFQPFTMSH YRQAVHTLTE LAMQTDKGIV LASALIEHLR RQSVILPALN AVERASAEAI TRANRRIYYA LAEPLSDAHR RRLDDLLKRR
DNGKTTWLAW LRQSPVKPNS RHMLEHIERL KAWQALDLPT GIERLIHQNR LLKIAREGGQ MTPADLAKFE AQRRYATLVA LAIEGMATVT DEIIDLHDRI
LGKLFNAAKN KHQQQFQASG KAINAKVRLF GRIGQALIEA KQSGRDPFAA IEAVMSWDAF AESVTEAQKL AQPEDFDFLH RIGESYATLR RYAPEFLAVL
KLRAAPAAKD VLEAIEVLRN MNSDNARKVP ADAPTDFIKP RWQKLVMTDT GIDRRYYELC ALSEMKNALR SGDIWVQGSR QFKDFEDYLV PPAKFASLKQ
ASELPLAVAT DCDQYLHDRL TLLETQLATV NRMALANELP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFAHLKS
GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHTRDET YSSALAELVN AQFRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY
GSSPGRTFYT HISDQYAPFH TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFGLMHLLGF RFAPRIRDLG DTKLFIPKGD TVYDALKPMI
SSDRLNIKAI RAHWDEILRL ATSIKQGTVT ASLMLRKLGS YPRQNGLAVA LRELGRIERT LFILDWLQSV ELRRRVHAGL NKGEARNALA RAVFFNRLGE
IRDRSFEQQR YRASGLNLVT AAVVLWNTVY LERAAHALRG NGHAVDDALL QYLSPLGWEH INLTGDYLWR SSAKIGAGKF RPLRPLQPA
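
As a spot check of the strand assignments above, the start of a protein sequence can be compared with a translation of the corresponding DNA coordinates. The Python sketch below does this for the first ten codons of merE (plus strand, positions 3315-3344) and the first five codons of merR (minus strand, positions 454-468 on the top strand). The DNA strings are copied from the sequence in this entry; the function names and the compact standard-code table are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Top-strand positions 3315-3344 (start of merE, plus strand).
MERE_START = "ATGAACAACCCCGAGCGCTTGCCGTCCGAG"
# Top-strand positions 454-468 (start of merR, read on the minus strand).
MERR_START_TOP = "CAAATTGTTTTCCAT"

print(translate(MERE_START))                          # MNNPERLPSE
print(translate(reverse_complement(MERR_START_TOP)))  # MENNL
```

Both results match the first residues of the MerE and MerR protein sequences listed above.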

 References     

1.Taghavi S, Mergeay M, van der Lelie D. Genetic and physical maps of the Alcaligenes eutrophus CH34 megaplasmid pMOL28 and its derivative pMOL50 obtained after temperature-induced mutagenesis and mortality. Plasmid. 1997;37(1):22-34. doi: 10.1006/plas.1996.1274. PubMed ID: 9073579
2.Van Houdt R, Monchy S, Leys N, Mergeay M. New mobile genetic elements in Cupriavidus metallidurans CH34, their possible roles and occurrence in other bacteria. Antonie Van Leeuwenhoek. 2009 Aug;96(2):205-26. doi: 10.1007/s10482-009-9345-4. Epub 2009 Apr 24. PubMed ID: 19390985