|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
|
|
|
|
|
|
|
|
|
Name: Tn4378 |
|
Family: Tn3 Group: Tn21 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Cupriavidus metallidurans CH34 | Molecular Source: | plasmid pMOL28 |
Place of Origin: | Belgium | | |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 38 bp) | | GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG |
IRR (Length: 38 bp) | | GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCCGAAC CTGCCAAGCT TGCTCCACCC TGTAGTGACG CGATCAGCGG GCAGGAAACG 100
TTCCCCCTTC GCGCATGGCA GGCGCACACC AACTCAGACA GCACGGCCTC CATGCGCGCC AGGTCAGCCA TTTTCTCGCG CACGTCCTTG AGCTTGTGCT 200
CGGCCAGACT GCTGGCTTCC TCGCAATGGG TGCCATCCTC CAGCCGCAGC AGCTCGGCGA TCTCATCCAG GCTGAAGCCC AGCCGCTGGG CTGATTTCAC 300
GAAGCGCACC CGCGTTACAT CCGCCTCGCC ATAGCGGCGG ATGCTGCCAT AGGGCTTGTC AGGCTCCAGC AACAAGCCCT TGCGCTGATA GAAACGGATG 400
GTCTCCACAT TGACCCCGGC CGCCTTGGCG AAAACGCCAA TGGTCAGGTT CTCCAAATTG TTTTCCATAT CGCTTGACTC CGTACATGAG TACGGAAGTA 500
AGGTTACGCT ATCCAATTTC AATTCGAAAG GACAAGCGCA TGTCTGAACC AAAAACCGGG CGCGGCGCGC TCTTCACTGG AGGGCTTGCC GCCATCCTCG 600
CCTCGGCTTG CTGCCTCGGG CCGTTGGTTC TGATCGCCTT GGGGTTCAGC GGCGCTTGGA TCGGCAACTT GGCGGTGTTG GATCCCTATC GCCCCATCTT 700
TATCGGCGTG GCGCTGGTGG CGTTGTTCTT CGCCTGGCGG CGCATCTACC GGCAGGCAGC GGCCTGCAAA CCGGGTGAGG TCTGCGCGAT TCCCCAAGTG 800
CGAGCTACTT ACAAGCTCAT TTTCTGGATC GTGGCCGCGC TGGTTCTGGT CGCGCTCGGA TTTCCCTACG TCATGCCATT TTTCTACTGA TCGGAGTTCA 900
CCATGAAGAA ACTGTTTGCC TCCCTCGCCC TCGCCGCCGT TGTTGCCCCC GTCTGGGCCG CCACCCAGAC CGTCACGCTG TCCGTACCGG GCATGACCTG 1000
CTCCGCCTGC CCGATCACTG TCAAGAAGGC GATTTCCAAG GTCGAAGGCG TCAGCAAAGT TGACGTGACT TTCGAGACAC GCCAAGCGGT CGTCACCTTC 1100
GACGATGCCA AGACCAGCGT GCAGAAGCTG ACCAAGGCAA CCGCAGACGC GGGCTATCCG TCCAGCGTCA AGCAGTGAGT CACTGAAAAC GGCACCGCAG 1200
CACAACGGAC GTCATTGTCT GGCGCCACAA ACGATAAAGG ATCTGTTGCA TGACCCATCT AAAAATCACC GGCATGACTT GCGACTCGTG CGCGGCGCAC 1300
GTCAAGGAAG CGCTGGAAAA AGTGCCAGGC GTGCAGTCGG CGCTGGTGTC CTATCCGAAG GGCACAGCGC AACTCGCCAT CGTGCCGGGC ACATCGCCGG 1400
ACGCGCTGAC TGCCGCCGTG GCCGGACTGG GCTACAAGGC AACGCTAGCC GATGCGCCAC TGGCGGACAA CCGCGTCGGA CTGCTCGACA AGGTGCGGGG 1500
ATGGATGGCC GCCGCCGAAA AGCACAGTGG CAACGAGCCC CCGGTGCAGG TAGCGGTCAT TGGCAGCGGT GGAGCCGCGA TGGCGGCGGC GCTGAAGGCC 1600
GTCGAGCAAG GCGCGCAGGT CACGCTGATC GAGCGCGGCA CCATCGGCGG CACCTGCGTC AATGTCGGCT GTGTGCCGTC CAAGATCATG ATCCGCGCCG 1700
CCCACATCGC CCATCTGCGC CGGGAAAGCC CGTTCGATGG CGGTATTGCG GCAACTGTGC CTACGATTGA CCGCAGTAAG CTGCTGGCCC AGCAGCAGGC 1800
CCGCGTCGAC GAACTGCGGC ACGCCAAGTA CGAAGGCATC CTGGGCGGTA ATCCGGCCAT CACCGTTGTG CACGGTGAGG CGCGCTTCAA GGACGACCAG 1900
AGCCTTACCG TCCGTTTGAA CGAGGGTGGC GAGCGCGTCG TGATGTTCGA CCGCTGCCTG GTCGCCACGG GTGCCAGCCC GGCGGTCCCG CCGATTCCGG 2000
GCTTGAAAGA GTCACCCTAC TGGACTTCCA CCGAGGCCCT GGCGAGCGAC ACCATTCCCG AACGCCTTGC CGTAATCGGC TCGTCGGTGG TGGCGCTGGA 2100
GCTGGCGCAA GCCTTTGCCC GGCTGGGCAG CAAGGTCACG GTCCTGGCGC GCAATACCTT GTTCTTCCGT GAAGACCCGG CCATCGGCGA GGCGGTGACA 2200
GCCGCTTTCC GTGCCGAGGG CATCGAGGTG CTGGAGCACA CGCAAGCCAG CCAGGTCGCC CATATGGACG GTGAATTCGT GCTGACCACC ACGCACGGTG 2300
AATTGCGCGC CGACAAACTG CTGGTTGCCA CCGGTCGGAC ACCGAACACG CGCAGCCTCG CGCTGGACGC AGCGGGGGTC ACTGTCAATG CGCAAGGTGC 2400
CATCGCCATC GACCAAGGCA TGCGCACGAG CAACCCGAAC ATCTACGCGG CCGGCGACTG CACCGACCAG CCGCAGTTCG TCTATGTGGC GGCAGCGGCC 2500
GGCACCCGTG CCGCGATCAA CATGACCGGC GGCGATGCGG CGCTCGACCT GACCGCAATG CCGGCCGTGG TGTTCACCGA TCCGCAAGTG GCGACCGTGG 2600
GCTACAGCGA GGCGGAAGCC CACCACGACG GGATCGAGAC CGACAGCCGC ACCTTGACCT TGGACAACGT GCCGCGTGCG CTCGCCAACT TCGACACACG 2700
CGGCTTCATC AAGTTGGTTA TCGAGGAAGG CAGCCATCGG CTGATCGGCG TACAGGCGGT CGCGCCGGAA GCGGGTGAAC TGATCCAGAC GGCGGCTCTG 2800
GCCATTCGCA ACCGCATGAC GGTGCAGGAA CTGGCCGACC AGTTGTTCCC CTACCTGACG ATGGTCGAGG GGTTGAAGCT CGCGGCGCAG ACCTTCAACA 2900
AGGATGTGAA GCAGCTTTCC TGCTGCGCCG GGTGAGAAAA AGGAGGTGTT CAATGAACGC CTACACGGTG TCCCGGCTGG CTCTTGATGC CGGGGTGAGC 3000
GTGCATATCG TGCGCGACTA CCTGCTGCGC GGATTGCTGC GCCCGGTGGC GTGCACACCA GGCGGCTACG GCTTGTTCGA TGACGCCGCC TTGCAACGGC 3100
TGTGCTTCGT GCGGGCGGCC TTCGAGGCGG GCATCGGCCT CGACGCGCTG GCGCGGCTGT GCCGGGCGCT GGATGCGGCG GACGGCGACG AAGCGGCCGC 3200
GCAGCTTGCC CTGCTGCGTC AGTTCGTCGA GCGTCGGCGC GAAGCGTTGG CCGATCTGGA AGTGCAGTTG GCCACCCTGC CGACCGAGCC GGCACAGCAC 3300
GCGGAGAGTC TGCCATGAAC AACCCCGAGC GCTTGCCGTC CGAGACGCAC AAACCGATCA CCGGCTACCT GTGGGGCGGA CTGGCTGTGC TGACTTGCCC 3400
CTGCCACCTG CCCATCCTCG CTGTCGTGCT GGCCGGCACA ACCGCCGGTG CTTTCCTCGG CGAGCATTGG GTCATCGCGG CGCTCGGTTT GACCGGCCTG 3500
TTCCTTCTGT CCCTGTCGCG GGCGTTGCGG GCATTCAGGG AAAGAGAATG AGCGCTTTCC GGCCGGATGG ATGGACGACG CCGGAACTGG CCCAAGCGGT 3600
CGAGCGCGGG CAGCTTGAAC TGCACTACCA GCCCGTCGTC GATCTGCGCA GTGGTGGGAT TGTCGGCGCG GAAGCCCTGT TGCGCTGGCG TCATCCGACG 3700
CTTGGACTAT TGCCACCGGG CCAGTTCCTG CCCGTGGTCG AATCGTCCGG CCTGATGCCT GAAATCGGCG CTTGGGTGCT GGGCGAAGCC TGCCGCCAGA 3800
TGCGTGACTG GCGAATGCTG GCATGGCGAC CGTTCCGGCT GGCCGTCAAT GTTTCGGCGA GCCAAGTGGG ACCGGACTTC GACGGGTGGG TAAAGGGCGT 3900
GCTGGCTGAT GCCGAGTTGC CCGCCGAGTA TCTCGAAATC GAGCTGACCG AATCGGTCGC GTTTGGTGAT CCGGCGATCT TCCCCGCCCT GGACGCCTTG 4000
CGGCAGATCG GTGTGCGCTT CGCCGCCGAT GACTTCGGGA CGGGGTATTC CTGTCTGCAA CATCTGAAGT GCTGCCCAAT CAGCACGCTC AAGATCGACC 4100
AATCGTTTGT CGCCGGGCTC GCCAACGACC GCCGCGACCA AACCATCGTG CACACCGTGA TTCAGCTTGC GCACGGGCTG GGCATGGATG TGGTGGCTGA 4200
AGGCGTGGAA ACATCGGCGA GTCTTGATCT ATTGCGACAA GCGGACTGCG ACACAGGACA AGGCTTCCTG TTCGCGAAGC CAATGCCGGC GGCGGCATTC 4300
GCCGTCTTCG TCAGTCAATG GAGGGGTGCC ACCATGAATG CAAGTGACTC GACCACCACC AGTTGCTGCG TGTGCTGCAA GGAAATCCCG CTCGATGCCG 4400
CCTTCACCCC GGAAGGCGCG GAATACGTCG AGCACTTCTG CGGGTTGGAG TGTTATCAAC GCTTCGAAGC GCGTGCCAAG ACAGGGAACG AAACCGATGC 4500
CGATCCGAAC GCCTGCGACT CGCTACCGTC AGATTGAGGC ATACCCTAAC CGGATGTCAG GGTAGACTGC CTCACAACGT CAGAATAGAG TCGGTTGTGT 4600
TATTTATTGA CACTAGCTGA AAAAGGTCAT AGATTTCTTC CTGACATTTT CGTCCAGGGA GGCATCTTGC AGGGTCAACG CATCGGCTAC GTCCGGGTCA 4700
GCAGCTTCGA CCAGAACCCG GAACGGCAAC TTGAACACGT CGAAGTCGGC AAGGTGTTCA CCGACAAGGC GTCGGGCAAG GACACCCAGC GGCCCGAGCT 4800
TGATTCGCTG CTGGCCTTCG TGCGCGAAGG CGACACCGTG GTGGTTCACA GCATGGATCG CTTGGCGCGC AACCTCGATG ACTTGCGCCG CCTCGTGCAA 4900
AAGCTCACCA AGCGCGGCGT GCGCATCGAG TTCGTCAAGG AGAGCCTGAC CTTCACCGGC GAGGATTCGC CGATGGCGAA CCTGATGCTG TCGGTCATGG 5000
GGGCGTTCGC TGAATTCGAG CGGGCCTTGA TCCGCGAGCG GCAGAGGGAA GGCATCGCGC TCGCCAAGCA ACGCGGAGCC TACCGGGGCC GCAAGAAAGC 5100
GCTGTCGCCC GAACAGGTAG CCGATCTGCG GCAGCGGGCC GCCGCCGGCG AACAAAAAGC GAAGCTGGCC CGCGAGTTTG GTGTCAGCCG GGAGACCCTG 5200
TATCAATACT TGAGAGCGGA TCAGTAAATA TGCCACGTCG TTCCATTCTG TCCGCCGCCG AGCGGGAAAG CCTGTTGGCG TTGCCGGATA CCAAGGACGA 5300
CTTGATCCGA TACTACACGT TCAGCGATAC CGACCTCTCC ATCATCCGGC AACGGCGCGG GCCTGCGAAC CGCTTGGGCT TTGCAGTTCA GCTCTGCTAC 5400
CTGCGCTTTC CCGGCATCCT CCTTGGCGTC GATGAGCCGC CGTTTCCGCC CTTGCTGAAA CTGGTCGCCG ACCAGCTCAA GGTCAGTGTC GAAAGCTGGG 5500
GCGAGTACGG GCAGCGGGAG CAGACCCGGC GCGAGCATCT GGTCGAGTTG CAAACGGTGT TCGGCTTCCA GCCCTTCACC ATGAGCCACT ACCGGCAGGC 5600
CGTCCACACG CTGACCGAGC TGGCCATGCA AACCGACAAG GGCATTGTGC TGGCCAGCGC CTTGATCGAG CATCTGCGGC GGCAGTCGGT CATTCTGCCT 5700
GCCCTCAACG CCGTCGAGCG GGCGAGCGCC GAAGCGATCA CCCGCGCCAA CCGGCGCATC TACTACGCCT TGGCCGAACC ACTGTCGGAC GCGCATCGCC 5800
GCCGCCTCGA CGATCTGCTC AAGCGCCGGG ACAACGGCAA GACGACTTGG CTGGCCTGGC TGCGCCAGTC ACCCGTCAAG CCCAATTCGC GGCATATGCT 5900
GGAGCACATC GAACGACTCA AGGCATGGCA GGCGCTCGAT CTGCCTACCG GCATCGAGCG GCTGATCCAC CAAAACCGGC TGCTCAAGAT CGCCCGCGAG 6000
GGCGGCCAGA TGACACCCGC CGACCTGGCC AAGTTCGAGG CGCAGCGGCG CTACGCGACC CTGGTGGCGC TCGCCATTGA AGGCATGGCC ACCGTCACCG 6100
ACGAAATCAT CGACCTGCAC GACCGCATCC TGGGCAAGCT GTTCAACGCC GCCAAGAACA AGCATCAGCA GCAGTTCCAG GCGTCCGGCA AGGCGATCAA 6200
CGCCAAGGTG CGGCTGTTCG GGCGTATCGG TCAGGCACTG ATCGAGGCCA AGCAATCGGG CCGCGATCCG TTTGCCGCCA TCGAGGCCGT CATGTCCTGG 6300
GACGCCTTCG CCGAGAGCGT CACCGAAGCG CAGAAGCTCG CGCAGCCCGA GGACTTCGAT TTCCTGCACC GCATCGGCGA GAGCTACGCC ACGCTGCGTC 6400
GCTACGCGCC GGAATTCCTC GCCGTGCTCA AGCTGCGGGC CGCGCCCGCT GCCAAGGATG TGCTGGAGGC CATCGAAGTG CTGCGCAACA TGAACAGCGA 6500
CAACGCCCGC AAGGTGCCCG CCGACGCGCC AACCGATTTC ATCAAGCCGC GCTGGCAGAA GCTGGTGATG ACCGACACCG GCATCGATCG GCGCTACTAC 6600
GAACTGTGCG CGCTGTCGGA GATGAAAAAC GCCCTGCGCT CCGGCGACAT CTGGGTGCAG GGATCGCGCC AGTTCAAGGA CTTCGAGGAC TACCTGGTGC 6700
CACCCGCGAA ATTCGCCAGC CTCAAGCAGG CCAGCGAATT GCCGCTGGCC GTGGCCACCG ATTGCGACCA GTACCTGCAT GACCGGCTGA CGCTGCTGGA 6800
AACGCAGCTC GCCACCGTCA ACCGCATGGC GCTGGCCAAC GAGCTGCCGG ACGCCATCAT CACGGAGTCG GGCCTGAAGA TCACGCCGCT CGATGCGGCG 6900
GTGCCCGACA CCGCGCAGGC GCTGATCGAC CAGACAGCAA TGATCCTGCC GCACGTCAAG ATCACCGAAC TGCTGCTGGA GGTAGACGAA TGGACAGGCT 7000
TCACCCGGCA CTTCGCGCAC CTGAAATCGG GCGACCTGGC CAAGGACAAG AACCTGCTGC TGACCACGAT CCTGGCCGAC GCCATCAACC TGGGTCTGAC 7100
CAAGATGGCG GAGTCCTGCC CCGGAACGAC CTACGCCAAG CTCGCCTGGC TCCAAGCCTG GCATACCCGC GACGAAACCT ATTCGTCGGC GCTGGCCGAA 7200
CTGGTCAATG CGCAGTTCCG GCATCCCTTC GCCGAGCACT GGGGCGACGG CACCACGTCA TCGTCGGACG GCCAGAATTT CCGAACCGGC AGCAAGGCCG 7300
AGAGCACTGG CCACATCAAC CCGAAATATG GCAGCAGTCC TGGGCGGACT TTCTACACCC ACATCTCCGA CCAGTACGCG CCATTCCACA CCAAGGTGGT 7400
CAATGTCGGC GTGCGCGACT CGACCTATGT GCTCGACGGC TTGCTGTACC ACGAGTCCGA CCTGCGCATC GAGGAGCACT ACACCGATAC GGCAGGATTC 7500
ACCGATCATG TATTTGGCCT GATGCACCTG CTGGGCTTCC GCTTTGCGCC GCGCATCCGC GACCTGGGCG ACACCAAGCT GTTCATCCCC AAGGGCGACA 7600
CCGTCTACGA CGCGCTCAAG CCGATGATTA GCAGCGACAG ACTGAACATC AAGGCTATTC GCGCCCATTG GGATGAAATT CTACGGCTGG CCACGTCGAT 7700
CAAGCAGGGC ACGGTGACGG CTTCGCTGAT GCTGCGCAAG CTCGGCAGCT ATCCGCGCCA GAACGGCCTG GCCGTGGCCC TGCGCGAGCT GGGGCGTATC 7800
GAGCGCACGC TGTTCATCCT GGATTGGTTG CAAAGCGTGG AGCTGCGCCG TCGCGTGCAC GCTGGGCTGA ACAAGGGCGA AGCCCGCAAT GCGCTGGCCC 7900
GCGCCGTGTT CTTCAACCGT CTGGGTGAAA TCCGCGACCG CAGCTTTGAG CAGCAGCGCT ACCGTGCCAG CGGCCTCAAC CTGGTGACGG CGGCCGTCGT 8000
GCTATGGAAC ACGGTCTATC TGGAACGGGC TGCGCACGCG CTGCGGGGCA ACGGCCACGC CGTCGATGAC GCGCTGTTGC AGTACCTGTC GCCGCTCGGC 8100
TGGGAGCACA TCAACCTGAC CGGCGATTAC CTCTGGCGCA GCAGCGCCAA GATCGGCGCG GGCAAGTTCA GGCCGCTGCG GCCGTTGCAA CCTGCTTAGC 8200
GTGCTTTATT TTCCGTTTTC TGAGACGACC CC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res_site_I |
4527-4557 |
31 |
CGTCAGATTG AGGCATACCC TAACCGGATG T |
res_site_II |
4578-4612 |
35 |
CGTCAGAATA GAGTCGGTTG TGTTATTTAT TGACA |
res_site_III |
4615-4646 |
32 |
AGCTGAAAAA GGTCATAGAT TTCTTCCTGA CA |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
merR |
Tn4378 |
34-468 |
Passenger Gene |
Heavy Metal Resistance |
- |
merT |
Tn4378 |
540-890 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merP |
Tn4378 |
903-1178 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merA |
Tn4378 |
1250-2935 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merD |
Tn4378 |
2953-3318 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merE |
Tn4378 |
3315-3551 |
Passenger Gene |
Heavy Metal Resistance |
+ |
urfM |
Tn4378 |
3548-4537 |
Passenger Gene |
Other |
+ |
tnpR |
Tn4378 |
4667-5227 |
Accessory Gene |
Resolvase |
+ |
tnpA |
Tn4378 |
5230-8199 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR |
MerR |
Tn4378 |
435 |
34-468 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | activator-repressor of mer operon |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLLEPDKPY GSIRRYGEAD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVREKM ADLARMEAVL SELVCACHAR RGNVSCPLIA SLQGGASLAG SAMP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merT |
MerT |
Tn4378 |
351 |
540-890 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | cytosolic mercuric ion transport protein |
Target: | Mercury |
Protein Sequence:
|
MSEPKTGRGA LFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLAVLDPY RPIFIGVALV ALFFAWRRIY RQAAACKPGE VCAIPQVRAT YKLIFWIVAA LVLVALGFPY VMPFFY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merP |
MerP |
Tn4378 |
276 |
903-1178 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercury transport |
Target: | Mercury |
Protein Sequence:
|
MKKLFASLAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVEGV SKVDVTFETR QAVVTFDDAK TSVQKLTKAT ADAGYPSSVK Q
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merA |
MerA |
Tn4378 |
1686 |
1250-2935 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercuric ion reductase |
Target: | Mercury |
Protein Sequence:
|
MTHLKITGMT CDSCAAHVKE ALEKVPGVQS ALVSYPKGTA QLAIVPGTSP DALTAAVAGL GYKATLADAP LADNRVGLLD KVRGWMAAAE KHSGNEPPVQ VAVIGSGGAA MAAALKAVEQ GAQVTLIERG TIGGTCVNVG CVPSKIMIRA AHIAHLRRES PFDGGIAATV PTIDRSKLLA QQQARVDELR HAKYEGILGG NPAITVVHGE ARFKDDQSLT VRLNEGGERV VMFDRCLVAT GASPAVPPIP GLKESPYWTS TEALASDTIP ERLAVIGSSV VALELAQAFA RLGSKVTVLA RNTLFFREDP AIGEAVTAAF RAEGIEVLEH TQASQVAHMD GEFVLTTTHG ELRADKLLVA TGRTPNTRSL ALDAAGVTVN AQGAIAIDQG MRTSNPNIYA AGDCTDQPQF VYVAAAAGTR AAINMTGGDA ALDLTAMPAV VFTDPQVATV GYSEAEAHHD GIETDSRTLT LDNVPRALAN FDTRGFIKLV IEEGSHRLIG VQAVAPEAGE LIQTAALAIR NRMTVQELAD QLFPYLTMVE GLKLAAQTFN KDVKQLSCCA G
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD |
MerD |
Tn4378 |
366 |
2953-3318 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | secondary regulatory protein |
Target: | Mercury |
Protein Sequence:
|
MNAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLALLRQ FVERRREALA DLEVQLATLP TEPAQHAESL P
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merE |
MerE |
Tn4378 |
237 |
3315-3551 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercury transport |
Target: | Mercury |
Protein Sequence:
|
MNNPERLPSE THKPITGYLW GGLAVLTCPC HLPILAVVLA GTTAGAFLGE HWVIAALGLT GLFLLSLSRA LRAFRERE
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
urfM |
UrfM |
Tn4378 |
990 |
3548-4537 |
+ |
Class: | Passenger Gene |
Sub Class: | Other |
Function: | possible diguanylate phosphodiesterase |
Sequence Family: | EAL (Pfam:PF00563)||DUF3330 (Pfam:PF11809) |
Comment: | similar to urfM from E.coli |
Protein Sequence:
|
MSAFRPDGWT TPELAQAVER GQLELHYQPV VDLRSGGIVG AEALLRWRHP TLGLLPPGQF LPVVESSGLM PEIGAWVLGE ACRQMRDWRM LAWRPFRLAV NVSASQVGPD FDGWVKGVLA DAELPAEYLE IELTESVAFG DPAIFPALDA LRQIGVRFAA DDFGTGYSCL QHLKCCPIST LKIDQSFVAG LANDRRDQTI VHTVIQLAHG LGMDVVAEGV ETSASLDLLR QADCDTGQGF LFAKPMPAAA FAVFVSQWRG ATMNASDSTT TSCCVCCKEI PLDAAFTPEG AEYVEHFCGL ECYQRFEARA KTGNETDADP NACDSLPSD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn4378 |
561 |
4667-5227 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Function: | resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MQGQRIGYVR VSSFDQNPER QLEHVEVGKV FTDKASGKDT QRPELDSLLA FVREGDTVVV HSMDRLARNL DDLRRLVQKL TKRGVRIEFV KESLTFTGED SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKALSPEQ VADLRQRAAA GEQKAKLARE FGVSRETLYQ YLRADQ
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn4378 |
2970 |
5230-8199 |
+ |
Class: | Transposase |
Function: | transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPRRSILSAA ERESLLALPD TKDDLIRYYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGILLGVDEP PFPPLLKLVA DQLKVSVESW GEYGQREQTR REHLVELQTV FGFQPFTMSH YRQAVHTLTE LAMQTDKGIV LASALIEHLR RQSVILPALN AVERASAEAI TRANRRIYYA LAEPLSDAHR RRLDDLLKRR DNGKTTWLAW LRQSPVKPNS RHMLEHIERL KAWQALDLPT GIERLIHQNR LLKIAREGGQ MTPADLAKFE AQRRYATLVA LAIEGMATVT DEIIDLHDRI LGKLFNAAKN KHQQQFQASG KAINAKVRLF GRIGQALIEA KQSGRDPFAA IEAVMSWDAF AESVTEAQKL AQPEDFDFLH RIGESYATLR RYAPEFLAVL KLRAAPAAKD VLEAIEVLRN MNSDNARKVP ADAPTDFIKP RWQKLVMTDT GIDRRYYELC ALSEMKNALR SGDIWVQGSR QFKDFEDYLV PPAKFASLKQ ASELPLAVAT DCDQYLHDRL TLLETQLATV NRMALANELP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFAHLKS GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHTRDET YSSALAELVN AQFRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYAPFH TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFGLMHLLGF RFAPRIRDLG DTKLFIPKGD TVYDALKPMI SSDRLNIKAI RAHWDEILRL ATSIKQGTVT ASLMLRKLGS YPRQNGLAVA LRELGRIERT LFILDWLQSV ELRRRVHAGL NKGEARNALA RAVFFNRLGE IRDRSFEQQR YRASGLNLVT AAVVLWNTVY LERAAHALRG NGHAVDDALL QYLSPLGWEH INLTGDYLWR SSAKIGAGKF RPLRPLQPA
|
|
References |
|
|
1. | Taghavi S, Mergeay M, van der Lelie D. Genetic and physical maps of the Alcaligenes eutrophus CH34 megaplasmid pMOL28 and its derivative pMOL50 obtained after temperature-induced mutagenesis and mortality. Plasmid. 1997;37(1):22-34. doi: 10.1006/plas.1996.1274. PubMed ID: 9073579
| | 2. | Taghavi S, Mergeay M, van der Lelie D. Genetic and physical maps of the Alcaligenes eutrophus CH34 megaplasmid pMOL28 and its derivative pMOL50 obtained after temperature-induced mutagenesis and mortality. Plasmid. 1997;37(1):22-34. doi: 10.1006/plas.1996.1274. PubMed ID: 9073579
| | 3. | Van Houdt R, Monchy S, Leys N, Mergeay M. New mobile genetic elements in Cupriavidus metallidurans CH34, their possible roles and occurrence in other bacteria. Antonie Van Leeuwenhoek. 2009 Aug;96(2):205-26. doi: 10.1007/s10482-009-9345-4. Epub 2009 Apr 24. PubMed ID: 19390985
| |
| | |
|
|