|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
|
|
|
|
|
|
|
|
|
|
|
Name: Tn5053 |
|
Family: Tn402 |
|
Evidence of Transposition: Yes |
|
|
Host |
|
|
Host Organism: | Xanthomonas sp. W17 | Molecular Source: | chromosome |
Place of Origin: | Kirgizia, former URSS | Date of Isolation: | 1993 |
| | Other Geographic Information: | Khaidarkan mercury mine |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCGTTTTC AGAAGACGAC CGCACCATCT GACTGGATGT AACGCCTGGT GTGCATACGG CTCCTGACAG CCCAATATCA GGAGTCGTCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTTTG GGCATGACAT CAGACACTCC ACCGATTGCC GCGCAAGGCG TGGCCACCCT GCCCGACGAG 200
GCATGGGCGC AAGCCCGGCA CCGGACGGAA ATCATCGGGC CACTGGCAGC GCTTGAGGTG GTCGGGCATG AAGCCGCCGA TGAGGCAGCC CAAGCGCTGG 300
GCCTGTCCCG GCGACAGGTA TATGTCCTGA TCCGTCGCGC CCGGCAGGGT ACTGGCCTGG TAACAGACCT GACGCCCGGC CGATCCGGCG GCGGCAAAGG 400
CAAGGGGCGC TTGCCGGAAC CGGTCGAGCG CATCATCCGC GAGCTGCTGC AAAAGCGCTT CCTGACCAAG CAGAAACGCA GCCTGGCGGC GTTCCACCGC 500
GAAGTCGCGC AGGCGTGCAA AACCCAGAAG CTGCCGGTGC CGGCGCGCAA CACCGTGGCC CAGCGGATTG CCGGACTACA CCCGGCGAAA ATAGCCCGCA 600
GCCGGGGCGG GCAGGACGCT GCCCGTCCCT TGCAAGGCGC GGGTGGCATT CCGCCCGAAG TCACCATGCC GCTGGAACAG GTGCAGATCG ACCACACCGT 700
CATCGACCTG ATCGTGGTCG ACGAGCGCGA CCGGCAACCG ATTGGCCGCC CATATTTGAC CCTCGCCATC GACGTGTTCA CGCGCTGCGT ACTCGGCATG 800
GTGGTCACGC TGGAAGCGCC GTCCGCCGTC TCGGTCGGCC TATGCCTCGC GCATGCCGCC TGCGACAAGC GGCCCTGGCT GGAAGGGCTG AGTGTGGAAA 900
TGGACTGGCC GATGAGCGGC AAGCCCAGGC TGCTCTATCT GGACAACGCG GCCGAGTTCA AAAGCGAAGC GCTGCGCCGT GGCTGCGAAC AGCATGGCAT 1000
CCGGCTGGAC TATCGCCCAC CAGGCCAGCC GCACTACGGC GGCATCGTGG AACGGATCAT CGGCACGGCG ATGCAGATGA TCCACGACGA ATTACCGGGG 1100
ACGACCTTCT CCAATCCCGG CCAGCGCGGC GAGTACGATT CCGAGAAGAT GGCCACCCTG ACGCTGCGCG AGCTGGAGCG CTGGCTCGCG TTGGCGGTAG 1200
GCACCTATCA CGGCTCCGTG CACAACGGCC TGCTCCAGCC GCCGGCCGCG CGCTGGGCCG AGGCCGTGGA GCGCGTTGGC GTCCCGGCCG TCGTTACCCG 1300
CCCCACCGCG TTTTTGGTCG ATTTCCTGCC GGTGATCCGC CGCACCCTGA CCCGCACCGG CTTTGTCATC GACCACATCC ACTACTACGC CGACGCCCTC 1400
AAGCCGTGGA TTGCCCGGCG CGAGCGCTTG CCCGCCTTCC TGATCCGGCG CGATCCGCGC GACATCAGCC GCATCTGGGT ACTGGAACCG GAAGGTCAGC 1500
ACTATCTGGA GATCCACTAC CGCACCTTGT CCCATCCGGC CGTCACCCTC TGGGAACAAC GCCAGGCGCT GGCCAAATTG CGTCAGCTCG GGCGCGAGCA 1600
GGTGGACGAG TCGGCGCTGT TCCGCATGAT CGGGCAGATG CGCGAGATCG TGACCACCGC CCAGAAGGCC ACGCGCAAGG CGCGGCGCGA CGCTGATCGC 1700
CGCCAGCACC TCAAGACGTC GGAGCCACCG GCCAAGCCCA TACCGCCGAA TGTGGACATG GCTGACCCGC AGGCAGACAA CCTGCCGCCG GCCAAACCGT 1800
TCGATCAGAT CGAGGAGTGG TAGCCGTGGA CGAATATCCC GTCATTGACC TGTCCCACCT GCTGCCAGCG GCACAGGGTT TGGCCAGGCT GCCGGCAGAC 1900
GAGCGCATCC AGCGCATTCG CGCCGACCGC TGGATCGGCT ACCCGCGCGC GGTCGAGGCG CTGAACCGGC TGGAAACTCT GTATGCGTGG CCGAACAAGC 2000
AACGCATGCC AAACCTGCTG CTGGTCGGCC CCACCAACAA CGGCAAGTCG ATGATCGTCG AGAAATTCCG GCGGGCGCAC CCGGTCGGCA CCGACGCTGA 2100
CCAAGAACAT ATCCCGGTGC TGGTCGTGCA GATGCCGTCA GAGCCATCGG TGATCCGCTT CTATGTCGCG CTGCTGGCTG CGATGGGCGC GCCGCTGCGC 2200
CCGCGCCCAC GGCTGACGGA AATGGAGCAA CTGGCGCTGG CCCTGCTGCG CATGGTCGGC GTGCGCATGC TGGTGATCGA CGAACTGCAC AATGTACTGG 2300
CTGGCAACAG CGTTAACCGG CGCGAGTTCC TCAACCTGCT GCGCTTCCTC GGCAACGAGC TGCGTATCCC CCTGGTCGGG GTCGGCACGC GCGAGGCGTA 2400
CCTGGCCATC CGTTCGGACG ATCAGTTGGA AAACCGCTTC GAGCCGATGC TGCTGCCGCC GTGGGAGGCC AATGAGGACT GCTGCTCGCT GCTGGCCAGC 2500
TTCGCGGCGT CACTCCCGCT ACGGCGGCCA TCCCCGATTG CCACGCTGGA CATGGCCCGC TACCTGCTCA CCCGGAGCGA AGGCACCATC GGCGAGCTTG 2600
CACATCTGCT GGTGGCGGCG GCCGTCGCCG CCGTGGAGAG TGGCGAGGAA GCGATCAACC ACCGCACGCT CAGCATGGCC GACTACATCG GCCCCAGTGA 2700
GCGGCGGCGA CAGTTCGAGC GGGAACTGAT GTGAAGTCCG CGCCGCGCTG GCCGCTGCAT CCGGCACCCA AGGAAGGCGA AGCCCTGTCC TCATGGCTCA 2800
ACCGGGTCGC CGCCTGCTAC CAGATGGACG TGCACGAGCT GCTGGCCCAC GATCTTGGTC ACAGCCAACT TGATGACCTG GATACCGCAC CATCCTTGTC 2900
GTTGCTGACG GCGCTCTGCC AGCGCAGTGG CGTCGAGCTG GAGCGGTTGC GCAGCATGAG TCTTGCAGGC TGGGTGCCGT GGCTGCTCGA CAGTCTCGAC 3000
GATTCGGTAC CAGCAGCCTT GGAAACCTAT ACATTCCAAT GCGCGGTGCT CCTGCCCAAG CGCACCCGCA AGGTGCGGTC CATCACTCGC TGGCGTGCCT 3100
GGCTACCGAG CCAGACGATT CGCCGGGCGT GTCCGCAGTG TCTGAACGAT CCAACGAATC AAGCTGTGCT ACTCGTCTGG CAGCTCCCCT TGATGCTGAG 3200
CTGCCCGCAG CATGGCTGCT GGCTGGAGTC CTACTGGGGC ATGCCCGGCC GGTATCTTCA GTGGGAAATC GCCGATGCTG CGCCGCGCCC TGCCGATGAC 3300
GCAATCGCCT GCATGGACCG GCGCACCTGG CAGGCGCTGA CGACGGGTTT TGTGGAGCTG CCGCGTCGGC GTGTCCACGC CGGCTTGTGG TTCCGGCTGC 3400
TGCGCACCCT GCTTGATGAG CTGAACACGC CGCTCTCGCA CTGCGGCAGT TGCTCGGCGA GCATCCGCCA TGTCTGGGAG CGCTGCGGCC ATCCGCTGCG 3500
CGCCGGGCAG AGTCTGTGGC GTCCCTACGA GATTCTGGCC CCGGCGGTGC AGTGGCAGAT GCTGGAGGCG GCGGCCACCG CCATCACGCT GATCGAGTCA 3600
AAGGCGCTGA TCCCGCGCGG GGAACAGGCC GCGCTGTTTC AGACGGAGCC GCACACTGGT TTCACCAACG GCCTGCCGGC GAAGGTGCCG AAGCCGGAAC 3700
CCATCAACCA CTGGCAACGA GCAGCCCAGG CCATCGATGA GGCCATCATT GAAGCGCGAC ACAACCCGGA GACGGCCCGC TTGCTGTTCA CACTGGCGTC 3800
CTACGGGCGA CGCGACCCCG AATCCCTGGA ACGTTTGCGC GCCACGTTCA CCAAGGAGGG CATCCCGCCG GAGTTTTTGT CACATTACGA GCCTGAATGG 3900
CCGTTTGCAG GTCTTAGACT AAATGACGGG TTAAGTGACA GTTTTTGACG GCGAGAACTT TCTGGCTCAC ACTGTCACAT AATCGAACGT ATATGTGACA 4000
GGTACGACAT GCTGATAGGC TACATGCGGG TATCGAAGGC GGACGGCTCC CAGGCTACCG ATTTGCAGCG CGACGCGCTG ATTGCCGCCG GGGTCGATCC 4100
AGTACATCTT TACGAGGACC AGGCATCCGG CATGCGCGAG GATCGGCCCG GCTTGACGAG CTGCCTGAAG GCGTTGCGAA CTGGCGACAC ACTGGTCGTG 4200
TGGAAACTGG ATCGGCTCGG ACGCGACCAG CGACATCTCA TCAACACCGT GCACGACCTG ACTGGGCGCG GCATCGGCTT GAAGGTATTA ACCGGGCACG 4300
GCGCGGCCAT CGACACCACG ACCGCCGCCG GCAAGCTGGT CTTTGGCATC TTCGCCGCCC TGGCCGAGTT CGAGCGCGAG TTGATCGCGG AGCGCACGAT 4400
TGCCGGCCTA GCCTCGGCCC GCGCGCGCGG GCGGAAAGGC GGCCGGCCGT TCAAGATGAC CGCCGCCAAG CTGCGGCTGG CGATGGCGGC AATGGGTCAG 4500
CCAGAGACCA AGGTCGGCGA CCTGTGCCAG GAACTTGGCG TCACGCGGCA GACCCTGTAT CGGCATGTTT CACCCAAGGG TGAGCTACGT CCAGATGGCG 4600
AGAAGCTACT CAGCCGAATT TGATGCCGGC ATGAGGCAAC GTAGCGACAG CGTGGTTTGT CTCAATGGGA AGCGCTCATG ATCGATCTTT GAAGGCCCGC 4700
AGCAGTCGTG TCACAGACAG GACGAACAAA CCGGTCAGCG TGAGGGCTGC GATACCCCAG TACTCTCCGA TGAACGCGCC GGCCGTCGTG CCGGCCAGCA 4800
CAATGGCGAG AATCGGCAAA TGGCAGGGAC AGGTGAGCAC GGCCAGCGCG CCCCACAGGT AGCCGGTGAT CGGTTTGTGC GTCTCGGACG GCAAGCGCTC 4900
GGGGCTGTTC ATGGCAGACT CTCCGCGTGC TGTGCCGGCT CGGTCGGCAT GGTGGCCAAC TGCACCTCCA GATCGGCCAA CGCTTCGCGC CGACGCTCGA 5000
CGAACTGGCG CAGAACGGCA AGCTGCGCGG CCGCTTCATC GCCGTCCGCA GCATCCAGCG CCCGGCACAG CCGCGCCAGC GCGTCCAGGC CGATGCCCGC 5100
CTCGAAGGCC GCCCGCACGA AGCACAGCCG TTGCAAGGCG GCATCATCGA ACAGGCCATA GCCGCCCGGG GTGCACGCCA CCGGACGCAG CAATCCGCGC 5200
AGCAGGTAGT CGCGCACGAT ATGCACGCTC ACCCCGGCAT CAAGGGCCAG CCGGGACACG GTGTAGGCGC TCATTGAAAA CCTCCTTTTT TTATCCAGCG 5300
CAGCAGGAAA GCTGCTTCAC GTCCTTGTTG AAGGTCTGCG CCGCAAGCTT CAACCCCTCG ACCATTGTCA GGTAGGGGAA CAACTGGTCG GCCAGTTCCT 5400
GCACCGTCAT GCGGTTGCGG ATGGCGAGCA CCGCCGTCTG GATCAGTTCG CCCGCTTCCG GGGCCACCGC CTGCACGCCG ATGAGCCGTC CGCTACCTTC 5500
CTCGATGACC AGCTTGATGA AGCCGCGTGT GTCGAAGTTG GCAAGCGCTC GCGGAACGTT GTCGAGTGTC AGCGTGCGAC TGTCGGTCTC GATGCCATCG 5600
TGGTGCGCTT CCGCCTCGCT GTAGCCCACG GTGGCGACTT GCGGGTCGGT GAACACCACT GCCGGCATCG CGGTCAGATT GAGGGCTGCG TCGCCGCCGG 5700
TCATGTTGAT CGCGGCACGG GTGCCGGCGG CCGCTGCCAC GTAGACGAAC TGCGGCTGGT CGGTGCAGTC GCCGGCCGCG TAGATGTTCG GGTTGCTCGT 5800
GCGCATGCCT TGGTCGATAA CGATGGCCCC TTGCGCATTG ACAGTGACCC CCGCCGCGTC CAGCGCGAGG CTGCGCGTAT TCGGTGCCCG ACCGGTGGCA 5900
ACCAGCAACT TGTCAGCGCG CAATTCACCG TGTCCGGTGG TCAGCACGAA TTCGCCGTTC ACATGGGCGA CCTGGCTGGC TTGCGTGTGC TCCAGCACCT 6000
CGATGCCCTC GGCGCGGAAA GCGGCTGTCA CGGCCTCGCC GATGGCCGGG TCTTCCCGGA AGAACAAGGT GCTGCGTGCC AGGATCGTGA CCTGGCTGCC 6100
GAGCCGGGCA AAGGCTTGCG CCAGTTCCAA CGCCACCACC GACGAACCGA TCACGGCCAG GCGTGCGGGA ATGGTGTCGC TGACAAGCGC TTCGGTGGAA 6200
GTCCAGTAGG GTGACTCTTT CAGGCCCGGA ATCGGCGGCA CGACCGGACT GGCACCGGTG GCGACCAGGC AGCGGTCGAA CGTTACCTCG CGCTCGCCAC 6300
CCTCGTTCAA ACGGACGACC AGGCTCTGGT CGTCCTTGAA ACGCGCTTCA CCGTGCAAAA CGGTGATGGC TGGATTGCCG TCCAGGATGC CTTCGTATTT 6400
GGCGTGCCGC AGTTCATCGA CACGGGCCTG CTGCTGGGCC AGCAGTTTGC TGCGGTCAAT CGCAGGCACA GTTGCCGCAA TACCGCCGTC GAACGGACTT 6500
TCCCGGCGCA GATGGGCAAT ATGGGCAGCG CGGATCATGA TCTTGGACGG CACACAGCCG ATATTGACGC AGGTGCCGCC GATGGTGCCG CGTTCGATCA 6600
GCGTGACCGT CGCGCCTTGC TCGACGGCCT TCAGCGCCGC CGCCATCGCG GCCCCGCCGC TGCCAATGAT GGCGATATGC AAACCGGCGC CCTCAAGTGC 6700
ATCACGGATT TTTGGTTCAT CTTTGAAATC ACCAACCCGG ATCGAGCCTT GATAACCCAA TGCGGCGATG GCGGCCAGCA GTTGGTTGTG GCTCACGGCG 6800
GTGTCTGCCA TGACTTGCGC GCGGCTTTCT GGATAGGACA CCACAGCGGC ATTCACGCCG GGAATCTTTT CCAAAGCATC TTTGACATGG GTGGCGCAGG 6900
ATGTGCAGGT CATGCCATTC ACGGTGATTT CGGTCATTTT TTTACTCCAT TGAATTTCGG GGTGCAGCAG GCATCGGCTT GGCGTTTTCG TTGGATGGCG 7000
TAGATGGTCA AGCCGATGAA AATCGCCAGC GCAGGCAGCA GCACATAGTC CAGATAGCCG GTCAGCGCGG ACAAGCCGAC CACACCGAGC AAAATGACCA 7100
GAACAGGGGT GAAGCAACAC AGCGCCACGA GGGTTGTGCC AATGATGCTG ACCCGCAGCA GTGTCTTCGG GTCTTTCATG ATCAGTTCTT GACTGATGAT 7200
GGGTAGCCCG CATCCTCGGT AGCCTTGGTC AGTTTCTGCA CGCTGGTCTT GGCATCATCG AAGGTGACCA CCGCTTCGCG CGTCTCGAAG GTCACGTCAA 7300
CTTTACTGAC GCCATCGACC TTGGAAATCG CCTTCTTGAC AGTGATCGGA CAGGCCGAGC AGGTCATGCC CGGTACGGAC AGCGTAACGG TCTGGGTGGC 7400
GGCCCACACG GGGGCAACAA CGGCAGCGAG GGCAAGGGCG GAAAGCAGCT TTTTCATGGT GAACTCCTGT GATCAATAGA AAAATGGCAC GACGTAGGGA 7500
AATCCGAGCG CGACCAAAAC CAGCACGGCC ACGCCCCAGA AAATGAGCTT GTAAGTAGCT CGCACTTGGG GAATCGCGCA AACCTCACCC GGTTTGCAGG 7600
CGGCTGACGG CCGGTAGATG CGCCGCCAGG CGAAGAACAA CGCCACCAGC GCCACGCCGA TAAAGATGGG GCGATAGGGT TCCAACACCG TCAAGTTGCC 7700
GATCCAAGCG CCGCTGAACC CCAAGGCGAT CAGAACCAGC GGCCCGAGGC AGCAAGCCGA GGCGAGGATG GCGGCCAGCC CGCCAGTGAA GAGCGCGCCG 7800
CGCCCGTTTT GAGGTTCAGA CATACGTTTG TCCTTTCGAA TCTGAATTGG ATAGCTTAAG CTTACTTCCG TAGTTATGTA CGGAGTCAAG CGATATGGAA 7900
AACAATTTGG AGAACCTGAC CATTGGCGTT TTCGCCAGGA CGGCCGGGGT CAATGTGGAG ACCATCCGGT TCTATCAGCG CAAGGGCTTG CTCCCGGAAC 8000
CGGACAAGCC TTACGGCAGC ATTCGCCGCT ATGGCGAGAC GGATGTAACG CGGGTGCGCT TCGTGAAATC AGCCCAGCGG TTGGGCTTCA GCCTGGATGA 8100
GATCGCCGAG CTGCTGCGGC TGGAGGATGG CACCCATTGT GAGGAAGCCA GCAGCCTGGC CGAGCACAAG CTCAAGGACG TGCGCGAGAG GATGGCTGAC 8200
CTGGCGCGCA TGGAGGCCGT GCTGTCTGAT TTGGTGTGCG CCTGCCATGC GCGGAAGGGG AACGTTTCCT GCCCGCTGAT TGCGTCACTG CAAGGGAAGA 8300
AAGAACCGCG CAGTGCGGAC GCGGTGTAGC CCGAGGGAAC TACGCCTTAG CGTGCTTTAT TTTCCGTTTT CTGAGGCGAC TCCAACGTCA GAAAAGACCG 8400
TGCGGTCGAC TTTTGATATT TCGTGCTGTC GCCTTCTGAA AGTGACA
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
r6 |
3826-3840 |
15 |
CTGGAACGTT TGCGC |
r5 |
3872-3885 |
14 |
AGTTTTTGTC ACAT |
r4 |
3877-3890 |
14 |
TTGTCACATT ACGA |
r3 |
3928-3941 |
14 |
GGGTTAAGTG ACAG |
res |
3969-4003 |
35 |
ACACTGTCAC ATAATCGAAC GTATATGTGA CAGGT |
r2 |
3972-3985 |
14 |
CTGTCACATA ATCG |
r1 |
3988-4001 |
14 |
CGTATATGTG ACAG |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tniA |
Tn5053 |
144-1823 |
Transposase |
|
+ |
tniB |
Tn5053 |
1826-2734 |
Accessory Gene |
|
+ |
tniQ |
Tn5053 |
2731-3948 |
Accessory Gene |
Target Site Selection |
+ |
tniR |
Tn5053 |
4009-4623 |
Accessory Gene |
Resolvase |
+ |
merE; urf-1 |
Tn5053 |
4676-4912 |
Passenger Gene |
Heavy Metal Resistance |
- |
merD |
Tn5053 |
4909-5274 |
Passenger Gene |
Heavy Metal Resistance |
- |
merA |
Tn5053 |
5291-6937 |
Passenger Gene |
Heavy Metal Resistance |
- |
merF |
Tn5053 |
6934-7179 |
Passenger Gene |
Heavy Metal Resistance |
- |
merP |
Tn5053 |
7182-7457 |
Passenger Gene |
Heavy Metal Resistance |
- |
merT |
Tn5053 |
7473-7823 |
Passenger Gene |
Heavy Metal Resistance |
- |
merR |
Tn5053 |
7895-8329 |
Passenger Gene |
Heavy Metal Resistance |
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniA |
TniA |
Tn5053 |
1680 |
144-1823 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | homologous to TnsB of Tn7 |
Protein Sequence:
|
MTSDTPPIAA QGVATLPDEA WAQARHRTEI IGPLAALEVV GHEAADEAAQ ALGLSRRQVY VLIRRARQGT GLVTDLTPGR SGGGKGKGRL PEPVERIIRE LLQKRFLTKQ KRSLAAFHRE VAQACKTQKL PVPARNTVAQ RIAGLHPAKI ARSRGGQDAA RPLQGAGGIP PEVTMPLEQV QIDHTVIDLI VVDERDRQPI GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLAHAAC DKRPWLEGLS VEMDWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPPGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPGQRGE YDSEKMATLT LRELERWLAL AVGTYHGSVH NGLLQPPAAR WAEAVERVGV PAVVTRPTAF LVDFLPVIRR TLTRTGFVID HIHYYADALK PWIARRERLP AFLIRRDPRD ISRIWVLEPE GQHYLEIHYR TLSHPAVTLW EQRQALAKLR QLGREQVDES ALFRMIGQMR EIVTTAQKAT RKARRDADRR QHLKTSEPPA KPIPPNVDMA DPQADNLPPA KPFDQIEEW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB |
TniB |
Tn5053 |
909 |
1826-2734 |
+ |
Class: | Accessory Gene |
Function: | is needed for transposition of Tn5053 |
Transpoase Chemistry: | Serine |
Comment: | homologous to TnsC protein of Tn7 putative ATP-binding protein |
Protein Sequence:
|
MDEYPVIDLS HLLPAAQGLA RLPADERIQR IRADRWIGYP RAVEALNRLE TLYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRAHPVGTD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL TEMEQLALAL LRMVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRE AYLAIRSDDQ LENRFEPMLL PPWEANEDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLVAAAVAAV ESGEEAINHR TLSMADYIGP SERRRQFERE LM
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniQ |
TniQ |
Tn5053 |
1218 |
2731-3948 |
+ |
Class: | Accessory Gene |
Sub Class: | Target Site Selection |
Function: | is needed for transposition of Tn5053 |
Protein Sequence:
|
MKSAPRWPLH PAPKEGEALS SWLNRVAACY QMDVHELLAH DLGHSQLDDL DTAPSLSLLT ALCQRSGVEL ERLRSMSLAG WVPWLLDSLD DSVPAALETY TFQCAVLLPK RTRKVRSITR WRAWLPSQTI RRACPQCLND PTNQAVLLVW QLPLMLSCPQ HGCWLESYWG MPGRYLQWEI ADAAPRPADD AIACMDRRTW QALTTGFVEL PRRRVHAGLW FRLLRTLLDE LNTPLSHCGS CSASIRHVWE RCGHPLRAGQ SLWRPYEILA PAVQWQMLEA AATAITLIES KALIPRGEQA ALFQTEPHTG FTNGLPAKVP KPEPINHWQR AAQAIDEAII EARHNPETAR LLFTLASYGR RDPESLERLR ATFTKEGIPP EFLSHYEPEW PFAGLRLNDG LSDSF
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniR |
TniR |
Tn5053 |
615 |
4009-4623 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Comment: | also called TniC |
Protein Sequence:
|
MLIGYMRVSK ADGSQATDLQ RDALIAAGVD PVHLYEDQAS GMREDRPGLT SCLKALRTGD TLVVWKLDRL GRDQRHLINT VHDLTGRGIG LKVLTGHGAA IDTTTAAGKL VFGIFAALAE FERELIAERT IAGLASARAR GRKGGRPFKM TAAKLRLAMA AMGQPETKVG DLCQELGVTR QTLYRHVSPK GELRPDGEKL LSRI
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merE; urf-1 |
MerE |
Tn5053 |
237 |
4676-4912 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | Broad-spectrum mercury transporter |
Protein Sequence:
|
MNSPERLPSE THKPITGYLW GALAVLTCPC HLPILAIVLA GTTAGAFIGE YWGIAALTLT GLFVLSVTRL LRAFKDRS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD |
MerD |
Tn5053 |
366 |
4909-5274 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MSAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLAVLRQ FVERRREALA DLEVQLATMP TEPAQHAESL P
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merA |
MerA |
Tn5053 |
1647 |
5291-6937 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercuric ion reductase |
Target: | Mercury |
Protein Sequence:
|
MTEITVNGMT CTSCATHVKD ALEKIPGVNA AVVSYPESRA QVMADTAVSH NQLLAAIAAL GYQGSIRVGD FKDEPKIRDA LEGAGLHIAI IGSGGAAMAA ALKAVEQGAT VTLIERGTIG GTCVNIGCVP SKIMIRAAHI AHLRRESPFD GGIAATVPAI DRSKLLAQQQ ARVDELRHAK YEGILDGNPA ITVLHGEARF KDDQSLVVRL NEGGEREVTF DRCLVATGAS PVVPPIPGLK ESPYWTSTEA LVSDTIPARL AVIGSSVVAL ELAQAFARLG SQVTILARST LFFREDPAIG EAVTAAFRAE GIEVLEHTQA SQVAHVNGEF VLTTGHGELR ADKLLVATGR APNTRSLALD AAGVTVNAQG AIVIDQGMRT SNPNIYAAGD CTDQPQFVYV AAAAGTRAAI NMTGGDAALN LTAMPAVVFT DPQVATVGYS EAEAHHDGIE TDSRTLTLDN VPRALANFDT RGFIKLVIEE GSGRLIGVQA VAPEAGELIQ TAVLAIRNRM TVQELADQLF PYLTMVEGLK LAAQTFNKDV KQLSCCAG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merF |
MerF |
Tn5053 |
246 |
6934-7179 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | mercuric ion transport protein |
Protein Sequence:
|
MKDPKTLLRV SIIGTTLVAL CCFTPVLVIL LGVVGLSALT GYLDYVLLPA LAIFIGLTIY AIQRKRQADA CCTPKFNGVK K
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merP |
MerP |
Tn5053 |
276 |
7182-7457 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | periplasmic mercuric ion binding protein |
Protein Sequence:
|
MKKLLSALAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVDGV SKVDVTFETR EAVVTFDDAK TSVQKLTKAT EDAGYPSSVK N
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merT |
MerT |
Tn5053 |
351 |
7473-7823 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | mercury ion transport protein |
Protein Sequence:
|
MSEPQNGRGA LFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLTVLEPY RPIFIGVALV ALFFAWRRIY RPSAACKPGE VCAIPQVRAT YKLIFWGVAV LVLVALGFPY VVPFFY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR |
MerR |
Tn5053 |
435 |
7895-8329 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFARTAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGETD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVRERM ADLARMEAVL SDLVCACHAR KGNVSCPLIA SLQGKKEPRS ADAV
|
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
repeat t1 |
Tn5053 |
9-27 |
TCAGAAGACG ACCGCACCA |
repeat t2 |
Tn5053 |
49-67 |
CACACGTATG CCGAGGACT |
repeat t3 |
Tn5053 |
78-96 |
TCAGGAGTCG TCTGCACCA |
repeat t4 |
Tn5053 |
110-128 |
TCAATACTCG TGTGCACCA |
IR Tn21-like |
Tn5053 |
8346-8383 |
GAATCGCACG AAATAAAAGG CAAAAGACTC CGCTGAGG |
repeat i4 |
Tn5053 |
8356-8374 |
AAATAAAAGG CAAAAGACT |
repeat i2 |
Tn5053 |
8398-8416 |
CCGTGCGGTC GACTTTTGA |
IRi |
Tn5053 |
8420-8447 |
AAGCACGACA GCGGAAGACT TTCACTGT |
|
References |
|
|
1. | Kholodii GY, Yurieva OV, Lomovskaya OL, Gorlenko Z, Mindlin SZ, Nikiforov VG. Tn5053, a mercury resistance transposon with integron's ends. J Mol Biol. 1993 Apr 20;230(4):1103-7. doi: 10.1006/jmbi.1993.1228. PubMed ID: 8387603
| | 2. | Kholodii GY, Mindlin SZ, Bass IA, Yurieva OV, Minakhina SV, Nikiforov VG. Four genes, two ends, and a res region are involved in transposition of Tn5053: a paradigm for a novel family of transposons carrying either a mer operon or an integron. Mol Microbiol. 1995 Sep;17(6):1189-200. doi: 10.1111/j.1365-2958.1995.mmi_17061189.x. PubMed ID: 8594337
| |
| | |
|
|