Internal Transposable Elements |
Name: Tn5053.1 (Synonyms: Tn4671) |
|
Family: Tn402 | Group: Tn5053 |
|
Evidence of Transposition: yes |
|
|
Host |
|
|
Host Organism: | Alicycliphilus denitrificans BC | Molecular Source: | plasmid pALIDE02 |
Place of Origin: | U.S.A. | Date of Isolation: | 2011 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCATTC AGCGCGGTTT GAAGCCTCTT GTGCATACGG CTCCTGACAG CCAGATATCA GGACTCCTCT GCACGAAACT 100
CGTCTATGCT CAATACTCGT GTGCACCAAA GCGAGATGAG CATGGCGACC GATACCGCAC CGATTACCGA GCACGGCGTG GCCACCCTGC CAGAACAGGC 200
ATGGGAGCGT GCGCGTCGTC GCGCGGAGAT CATTGGGCCG TTGGCGCAGT CGGAGACGGT TGGGCATGAA GCGGCCGACG CGGCAGCCCA GGCATTGGGG 300
CTGTCCCGGC GGCAGGTCTA CGTCCTGATC CGCCGTGCCC GGCTAGGATC GGGGCTGGTC ACTGACTTGG CTCTTGGGCA GTCGAGCGGT GGCAAAGGTA 400
AAGGCCGCTT GCCGGAGTCG GTCGAACGAA TCATCCGCGA GTTACTGCAA AAGCGCTTCC TGACCAAGCA GAAGCGTAGC TTGGCGGCGT TCCACCGCGA 500
AGTTGTGCGG GCGTGCAAGC TGCAAAAGCT GCGGGTGCCG GCGCGCAACA CGGTGGCTCT GCGGATCGCC GGCCTCGATC CGCGCGAGGT CACTCACCGC 600
CGGGAAGGAC AAGATGCCGC CCGCGACCTG CAAGGTGTTG GTGGTGTTCC GCCACCCGTC TCCGCGCCGC TGGAGCAGGT GCAGATCGAC CACACAGTCA 700
TCGACCTGAT CGTGGTGGAC GAGCGCGACC GGCAACCGAT TGGCCGTCCA TACCTGACCC TCGCCATCGA CGTATTCACC CGCTGCGTGG TTGGCATGGT 800
CGTTACGCTG GAAGCCCCGT CCGCCGTCTC GGTCGGCTTG TGCTTGGTGC ATGCCGCCTG CGACAAGCGC CCTTGGTTGG AAGGGTTGAA CGTAGAGATG 900
GATTGGCCGA TGAGCGGCAA GCCCAGACTG CTCTACTTGG ACAACGCGGC TGAGTTCAAG AGCGAGGCGC TGCGCCGTGG CTGCGAGCAG CATGGCATCC 1000
GGTTGGACTA TCGCCCGCTC GGGCAGCCGC ACTACGGCGG TATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATC CACGACGAAT TGCCGGGGAC 1100
GACCTTCTCC AATCCTGACC AGCGCGGGGA ATACGCCTCC GAGAAGATGG CCGCCCTGAC ACTGCGCGAG CTGGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTATCACG GCTCCGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAA GCTATCACGC GGACCGGCGT GCCAACCGTC ATCACTCGCG 1300
CTACGGCTTT TCTGGTCGAT TTTCTGCCCA TCATCCGCCG CACGCTGACC CGCACCGGCT TCGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATA GCTCGGCGGG ACCGCTTGCC TGCGTTCCTG ATCCGGCGCG ACCCGCGCGA CATCAGCCGC ATATGGGTGC TGGAACCAGA GGGGCAGCAC 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CACCCGGCTG TCACCCTATG GGAACAACGG CAGGCGCTGG CGAAATTACG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC AGCGCTGTTC CGCATGATCG GCCAGATGCG AGAAATCGTG ACCACCGCGC AGAAAGCTAC GCGCAAGGCG CGGCGCGACG CGGATCGACG 1700
CCAGCATCTC AAGGCAACGG CACCGCCTGT CAAAACCACG CCACCACCAG ATGCGGACAT GGCTGACCCA CAGGCCGACA ACCAGCCGCC GGCCAAACCG 1800
TTCGACCAGA TTGAGGAGTG GTAGCCGTGG ACGAATATCC CATCATCGAC TTGTCACACC TGCTGCCAGC GGCACAGGGG CTGGCTCGGC TGCCGGCGGA 1900
CGAGCGCATC CAGCGCCTTC GCGCCGACCG CTGGATCGGC TATCCGCGCG CGGTCGAGGC GCTGAACCGG CTGGAAACCC TGTATGCGTG GCCAAACAAG 2000
CAACGCATGC CCAACCTGCT GCTGGTCGGC CCGACCAACA ACGGCAAGTC GATGATCGTC GAGAAGTTCC GGCGCACGCA TCCGGCCAGC GCCGACGCCG 2100
ACCAGGAGCA CATTCCGGTA CTGGTCGTGC AGATGCCATC CGAACCGTCG GTAATCCGCT TCTACGTCGC GCTGCTCGCG GCGATGGGTG CGCCATTGCG 2200
ACCGCGCCCA CGGCTGCCAG AAATGGAACA ACTGGCGCTG GCACTGCTGC GCAAGGTCGG CGTGCGCATG CTGGTGATCG ACGAGTTGCA CAACGTCCTG 2300
GCCGGTAACA GCGTCAACCG CCGGGAATTT CTCAATCTCC TGCGCTTCCT CGGCAATGAA CTGCGCATCC CACTGGTCGG GGTCGGCACG CGCGACGCCT 2400
ACCTGGTCAT CCGCTCCGAT GACCAGTTGG AAAATCGCTT CGAGCCGATG ATGCTGCCGG TATGGGAGGC CAACGACGAT TGCTGCTCAC TGCTGGCCAG 2500
CTTCGCCGCT TCGCTTCCAC TGCGGCGCCC CTCGTCGATT GCCACGCTGG ACATGGCCCG CTACCTGCTC ACACGCAGCG AAGGCACCAT CGGCGAACTG 2600
ACGCACTTGC TGATGGCGGC AGCCCTCGCC GCCGTGGAGA GCGGCGAGGA AGCGATCAAC CATCGCACGC TGAGCATGGC CGATTACACC GGCCCCAGCG 2700
AGCGGCGTCG GCAATTCGAG CGGGAACTGA TGTGAAGCCA GCGCCACGCT GGCCACTGCA TCCGGCTCCC AGGGAAGGTG AAGCCTTGTC TTCGTGGCTC 2800
AACCGCGTGG CCCTTTGCTA TCACATAGAG GTGTCCGAGC TGCTGGAGCA CGATCTTGGT CACGGTCAGG TTGATGACCT GGACACCGCG CCACCACTGG 2900
CGCTGCTGGC GATGCTTTCC CAGCGGAGCG GCATCGAACT GAACCGGCTG CGTTGCATGA GCTTTGCCGG CTGGGTGCCT TGGCTACTGG ACAGCCTTGA 3000
TGATCAGATT CCAGATGCAT TGGAGACCTA TGCGTTCCAG CTCTCGGTGT TGCTGCCGAC ACACCGCCGT AAGACGCGAT CCATCACGAG CTGGCGTGCC 3100
TGGCTGCCCA GCCAGCCGAT ACACCGCGCC TGTCCGCTAT GCCTGAACGA TCCGGAGAAC CAAGCCGTAC TGCTCGCGTG GAAGCTGCCC CTGATGCTGA 3200
GCTGCCCGCT GCATGGCTGC CGGCTGGAAT CCTATTGGGG CGTGCCAGGG CGGTTTCTAG GCTGGGAGAA CGCCGACGCC GAACCGCGCA CCGCCAGCGA 3300
CGCGATTGCG GCGATGGACC AGCGTACCTG GCAGGCACTG ACGACCGGTC ACCTGGAGCT GCCGCGCCGA CGCATCCATG CCGGATTGTG GTTTCGGCTG 3400
CTACGCACGC TGCTCGATGA GCTGAACACA CCGCTTTCGG CGTGCGGAAC CTACGCGGGG TATCTCCGCC AAGTCTGGGA AGGCTGCGGG CATCCGCTGC 3500
GTGCTGGGCA AAGTCTGTGG CGACCGTATG AAACCCTGAA TCCGGCAGTA CGGTTGCAGA TGCTGGAGGC GGCGGCAACG GCAATCAGCT TGATTGAGGT 3600
GAGGTACATA AGCCCGCCAG GCGAGCAGGC AAAGCTGTTC TGGTCCGAGC CCCAAACAGG GTTCACCAGT GGCCTGCCGA CGAAAGCGCC GAAGCCCGAA 3700
CCCATCAATC ACTGGCAGCG TGCAGTCCAG GCCATCGACG AGGCCATCAT TGAAGCGCGG CACAACCCCG AGACGGCACG CTCGCTGTTC GCGTTGGCTT 3800
CCTATGGTCG GCGCGACCCC GCTTCCCTGG AACAGTTGCG CGCCACCTTC GCGAAGGAAG GCATCGCCAC GGAATTTCTG TCACATTATG AGCCTGACGG 3900
ATCCTTTGCA TGTCTTAGAC AGAATGACGG GTTAAGTGAC AAATTTTGAC GACCTTAACT TTCCGGCGCA CACTGTCACA TAATCGAACG TATATGTGAC 4000
AGGTACGACA TGCTGATAGG CTACATGCGG GTATCGAAGG CGGACGGCTC CCAGGCTACC GATTTGCAGC GCGACGCGCT GATTGCCGCC GGGGTCGATC 4100
CAGTACATCT TTACGAGGAC CAGGCATCCG GCATGCGCGA GGATCGGCCC GGCTTGACGA GCTGCCTGAA GGCGTTGCGA ACTGGCGACA CACTGGTCGT 4200
GTGGAAACTG GATCGGCTCG GACGCGACCT GCGACATCTC ATCAACACCG TGCACGACCT GACTGGGCGC GGCATCGGCT TGAAGGTATT AACCGGGCAC 4300
GGCGCGGCCA TCGACACCAC GACCGCCGCC GGCAAGCTGG TCTTTGGCAT CTTCGCCGCC CTGGCCGAGT TCGAGCGCGA GTTGATCGCG GAGCGCACGA 4400
TTGCCGGCCT AGCCTCGGCC CGCGCGCGCG GGCGGAAAGG CGGCCGGCCG TTCAAGATGA CCGCCGCCAA GCTGCGGCTG GCGATGGCGG CAATGGGTCA 4500
GCCAGAGACC AAGGTCGGCG ACCTGTGCCA GGAACTTGGC GTCACGCGGC AGACCCTGTA TCGGCATGTT TCACCCAAGG GTGAGCTACG TCCAGATGGC 4600
GAGAAGCTAC TCAGCCGAAT TTGATGCCGG CATGAGGCAA CGTAGCGACA GCGTGGTTTG TCTCAATGGG AAGCGCTCAT GATCGATCTT TGAAGGCCCG 4700
CAGCAGTCGT GTCACAGACA GGACGAACAA ACCGGTCAGC GTGAGGGCTG CGATACCCCA GTACTCTCCG ATGAACGCGC CGGCCGTCGT GCCGGCCAGC 4800
ACAATGGCGA GAATCGGCAA ATGGCAGGGA CAGGTGAGCA CGGCCAGCGC GCCCCACAGG TAGCCGGTGA TCGGTTTGTG CGTCTCGGAC GGCAAGCGCT 4900
CGGGGCTGTT CATGGCAGAC TCTCCGCGTG CTGTGCCGGC TCGGTCGGCA TGGTGGCCAA CTGCACCTCC AGATCGGCCA ACGCTTCGCG CCGACGCTCG 5000
ACGAACTGGC GCAGAACGGC AAGCTGCGCG GCCGCTTCAT CGCCGTCCGC AGCATCCAGC GCCCGGCACA GCCGCGCCAG CGCGTCCAGG CCGATGCCCG 5100
CCTCGAAGGC CGCCCGCACG AAGCACAGCC GTTGCAAGGC GGCATCATCG AACAGGCCAT AGCCGCCCGG GGTGCACGCC ACCGGACGCA GCAATCCGCG 5200
CAGCAGGTAG TCGCGCACGA TATGCACGCT CACCCCGGCA TCAAGGGCCA GCCGGGACAC GGTGTAGGCG CTCATTGAAA ACCTCCTTTT TTTATCCAGC 5300
GCAGCAGGAA AGCTGCTTCA CGTCCTTGTT GAAGGTCTGC GCCGCAAGCT TCAACCCCTC GACCATTGTC AGGTAGGGGA ACAACTGGTC GGCCAGTTCC 5400
TGCACCGTCA TGCGGTTGCG GATGGCGAGC ACCGCCGTCT GGATCAGTTC GCCCGCTTCC GGGGCCACCG CCTGCACGCC GATGAGCCGT CCGCTACCTT 5500
CCTCGATGAC CAGCTTGATG AAGCCGCGTG TGTCGAAGTT GGCAAGCGCT CGCGGAACGT TGTCGAGTGT CAGCGTGCGA CTGTCGGTCT CGATGCCATC 5600
GTGGTGCGCT TCCGCCTCGC TGTAGCCCAC GGTGGCGACT TGCGGGTCGG TGAACACCAC TGCCGGCATC GCGGTCAGAT TGAGGGCTGC GTCGCCGCCG 5700
GTCATGTTGA TCGCGGCACG GGTGCCGGCG GCCGCTGCCA CGTAGACGAA CTGCGGCTGG TCGGTGCAGT CGCCGGCCGC GTAGATGTTC GGGTTGCTCG 5800
TGCGCATGCC TTGGTCGATA ACGATGGCCC CTTGCGCATT GACAGTGACC CCCGCCGCGT CCAGCGCGAG GCTGCGCGTA TTCGGTGCCC GACCGGTGGC 5900
AACCAGCAAC TTGTCAGCGC GCAATTCACC GTGTCCGGTG GTCAGCACGA ATTCGCCGTT CACATGGGCG ACCTGGCTGG CTTGCGTGTG CTCCAGCACC 6000
TCGATGCCCT CGGCGCGGAA AGCGGCTGTC ACGGCCTCGC CGATGGCCGG GTCTTCCCGG AAGAACAAGG TGCTGCGTGC CAGGATCGTG ACCTGGCTGC 6100
CGAGCCGGGC AAAGGCTTGC GCCAGTTCCA ACGCCACCAC CGACGAACCG ATCACGGCCA GGCGTGCGGG AATGGTGTCG CTGACAAGCG CTTCGGTGGA 6200
AGTCCAGTAG GGTGACTCTT TCAGGCCCGG AATCGGCGGC ACGGCCGGAC TGGCACCGGT GGCGACCAGG CAGCGGTCGA ACGTTACCTC GCGCTCGCCA 6300
CCCTCGTTCA AACGGACGAC CAGGCTCTGG TCGTCCTTGA AACGCGCTTC ACCGTGCAAA ACGGTGATGG CTGGATTGCC GTCCAGGATG CCTTCGTATT 6400
TGGCGTGCCG CAGTTCATCG ACACGGGCCT GCTGCTGGGC CAGCAGTTTG CTGCGGTCAA TCGCAGGCAC AGTTGCCGCA ATACCGCCGT CGAACGGACT 6500
TTCCCGGCGC AGATGGGCAA TATGGGCAGC GCGGATCATG ATCTTGGACG GCACACAGCC GATATTGACG CAGGTGCCGC CGATGGTGCC GCGTTCGATC 6600
AGCGTGACCG TCGCGCCTTG CTCGACGGCC TTCAGCGCCG CCGCCATCGC GGCCCCGCCG CTGCCAATGA TGGCGATATG CAAACCGGCG CCCTCAAGTG 6700
CATCACGGAT TTTTGGTTCA TCTTTGAAAT CACCAACCCG GATCGAGCCT TGATAACCCA ATGCGGCGAT GGCGGCCAGC AGTTGGTTGT GGCTCACGGC 6800
GGTGTCTGCC ATGACTTGCG CGCGGCTTTC TGGATAGGAC ACCACAGCGG CATTCACGCC GGGAATCTTT TCCAAAGCAT CTTTGACATG GGTGGCGCAG 6900
GATGTGCAGG TCATGCCATT CACGGTGATT TCGGTCATTT TTTTACTCCA TTGAATTTCG GGGTGCAGCA GGCATCGGCT TGGCGTTTTC GTTGGATGGC 7000
GTAGATGGTC AAGCCGATGA AAATCGCCAG CGCAGGCAGC AGCACATAGT CCAGATAGCC GGTCAGCGCG GACAAGCCGA CCACACCGAG CAAAATGACC 7100
AGAACAGGGG TGAAGCAACA CAGCGCCACG AGGGTTGTGC CAATGATGCT GACCCGCAGC AGTGTCTTCG GGTCTTTCAT GATCAGTTCT TGACTGATGA 7200
TGGGTAGCCC GCATCCTCGG TAGCCTTGGT CAGTTTCTGC ACGCTGGTCT TGGCATCATC GAAGGTGACC ACCGCTTCGC GCGTCTCGAA GGTCACGTCA 7300
ACTTTACTGA CGCCATCGAC CTTGGAAATC GCCTTCTTGA CAGTGATCGG ACAGGCCGAG CAGGTCATGC CCGGTACGGA CAGCGTAACG GTCTGGGTGG 7400
CGGCCCACAC GGGGGCAACA ACGGCAGCGA GGGCAAGGGC GGAAAGCAGC TTTTTCATGG TGAACTCCTG TGATCAATAG AAAAATGGCA CGACGTAGGG 7500
AAATCCGAGC GCGACCAAAA CCAGCACGGC CACGCCCCAG AAAATGAGCT TGTAAGTAGC TCGCACTTGG GGAATCGCGC AAACCTCACC CGGTTTGCAG 7600
GCGGCTGACG GCCGGTAGAT GCGCCGCCAG GCGAAGAACA ACGCCACCAG CGCCACGCCG ATAAAGATGG GGCGATAGGG TTCCAACACC GTCAAGTTGC 7700
CGATCCAAGC GCCGCTGAAC CCCAAGGCGA TCAGAACCAG CGGCCCGAGG CAGCAAGCCG AGGCGAGGAT GGCGGCCAGC CCGCCAGTGA AGAGCGCGCC 7800
GCGCCCGTTT TGAGGTTCAG ACATACGTTT GTCCTTTCGA ATCTGAATTG GATAGCTTAA GCTTACTTCC GTAGTTATGT ACGGAGTCAA GCGATATGGA 7900
AAACAATTTG GAGAACCTGA CCATTGGCGT TTTCGCCAGG ACGGCCGGGG TCAATGTGGA GACCATCCGG TTCTATCAGC GCAAGGGCTT GCTCCCGGAA 8000
CCGGACAAGC CTTACGGCAG CATTCGCCGC TATGGCGAGA CGGATGTAAC GCGGGTGCGC TTCGTGAAAT CAGCCCAGCG GTTGGGCTTC AGCCTGGATG 8100
AGATCGCCGA GCTGCTGCGG CTGGAGGATG GCACCCATTG CGAGGAAGCC AGCAGCCTGG CCGAGCACAA GCTCAAGGAC GTGCGCGAGA GGATGGCTGA 8200
CCTGGCGCGC ATGGAGGCCG TGCTGTCTGA TTTGGTGTGC GCCTGCCATG CGCGGAAGGG GAACGTTTCC TGCCCGCTGA TTGCGTCACT GCAAGGGAAG 8300
AAAGAACCGC GCAGTGCGGA CGCGGTGTAG CCCGAGGGAA CTACGCCTTA GCGTGCTTTA TTTTCCGTTT TCTGAGGCGA CTCCAACGTC AGAAAAGACC 8400
GTGCGGTCGA CTTTTGATAT TTCGTGCTGT CGCCTTCTGA AAGTGACA
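The two ends of the sequence above can be compared to illustrate what "terminal inverted repeats" means here. The following is a minimal sketch, not part of the record: the helper names are hypothetical, the 28-nt window is an arbitrary choice for illustration, and the two strings are copied from positions 1-28 and 8421-8448 of the listed sequence.

```python
# Hypothetical helpers for illustration; not part of the database record.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str) -> int:
    """Count positions at which two equal-length strings agree."""
    return sum(x == y for x, y in zip(a, b))


# Top-strand bases copied from the sequence listing above.
LEFT_END = "TGTCATTTTCAGAAGACGACTGCACCAT"   # positions 1-28
RIGHT_END = "TTCGTGCTGTCGCCTTCTGAAAGTGACA"  # positions 8421-8448

# Read the right end inward (from the element tip toward its interior),
# so that it is directly comparable with the left end.
right_read_inward = reverse_complement(RIGHT_END)
matches = count_matches(LEFT_END, right_read_inward)

print(LEFT_END)
print(right_read_inward)
print(f"{matches}/{len(LEFT_END)} positions match")
```

With this 28-nt window the two ends agree at 23 of 28 positions, which is the imperfect inverted-repeat pattern expected at the termini of an element of this kind.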
|
|
|
|
Recombination Sites |
|
|
Name | Coordinates | Length (bp) | Sequence |
r5 | 3873-3886 | 14 | AATTTCTGTC ACAT |
r3 | 3929-3942 | 14 | GGGTTAAGTG ACAA |
res | 3970-4004 | 35 | ACACTGTCAC ATAATCGAAC GTATATGTGA CAGGT |
r2 | 3973-3986 | 14 | CTGTCACATA ATCG |
r1 | 3989-4002 | 14 | CGTATATGTG ACAG |
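The coordinates in this table are 1-based and inclusive, so a site spanning 3970-4004 is 35 bp long. The sketch below shows one way to pull each site out of the sequence by coordinate. It is an illustration under stated assumptions: `WINDOW`, `WINDOW_START` and `extract` are hypothetical names, and the window is positions 3871-4010 copied from the sequence listing.

```python
# Hypothetical names for illustration. WINDOW holds top-strand positions
# 3871-4010 copied from the sequence listing (spaces are removed below).
WINDOW_START = 3871
WINDOW = (
    "GGAATTTCTG TCACATTATG AGCCTGACGG "
    "ATCCTTTGCA TGTCTTAGAC AGAATGACGG GTTAAGTGAC AAATTTTGAC "
    "GACCTTAACT TTCCGGCGCA CACTGTCACA TAATCGAACG TATATGTGAC "
    "AGGTACGACA"
).replace(" ", "")


def extract(start: int, end: int) -> str:
    """Return the bases from `start` to `end` (1-based, inclusive)."""
    lo = start - WINDOW_START
    hi = end - WINDOW_START + 1
    if start > end or lo < 0 or hi > len(WINDOW):
        raise ValueError("coordinates fall outside the copied window")
    return WINDOW[lo:hi]


# Coordinates as given in the Recombination Sites table.
SITES = {
    "r5": (3873, 3886),
    "r3": (3929, 3942),
    "res": (3970, 4004),
    "r2": (3973, 3986),
    "r1": (3989, 4002),
}

for name, (start, end) in SITES.items():
    print(name, end - start + 1, extract(start, end))
```

Note that r2 and r1 both fall inside the res interval, which is consistent with their being subsites of res.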
|
ORFs |
|
|
Gene Name | Associated TE | Coordinates | Class | Sub Class | Orientation |
tniA | Tn5053.1 | 142-1824 | Transposase | | + |
tniB | Tn5053.1 | 1827-2735 | Accessory Gene | | + |
tniQ | Tn5053.1 | 2732-3949 | Accessory Gene | Target Site Selection | + |
tniR | Tn5053.1 | 4010-4624 | Accessory Gene | Resolvase | + |
merE; urf-1 | Tn5053.1 | 4677-4913 | Passenger Gene | Heavy Metal Resistance | - |
merD | Tn5053.1 | 4910-5275 | Passenger Gene | Heavy Metal Resistance | - |
merA | Tn5053.1 | 5292-6938 | Passenger Gene | Heavy Metal Resistance | - |
merF | Tn5053.1 | 6935-7180 | Passenger Gene | Heavy Metal Resistance | - |
merP | Tn5053.1 | 7183-7458 | Passenger Gene | Heavy Metal Resistance | - |
merT | Tn5053.1 | 7474-7824 | Passenger Gene | Heavy Metal Resistance | - |
merR | Tn5053.1 | 7896-8330 | Passenger Gene | Heavy Metal Resistance | + |
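The ORF coordinates above can be sanity-checked arithmetically: each gene length should equal end - start + 1 and be a multiple of three, and neighbouring genes can be screened for overlaps. The sketch below is one way to do this; the list simply restates the table, and the function names are hypothetical.

```python
# ORF coordinates restated from the table above (1-based, inclusive).
ORFS = [
    ("tniA", 142, 1824, "+"),
    ("tniB", 1827, 2735, "+"),
    ("tniQ", 2732, 3949, "+"),
    ("tniR", 4010, 4624, "+"),
    ("merE", 4677, 4913, "-"),
    ("merD", 4910, 5275, "-"),
    ("merA", 5292, 6938, "-"),
    ("merF", 6935, 7180, "-"),
    ("merP", 7183, 7458, "-"),
    ("merT", 7474, 7824, "-"),
    ("merR", 7896, 8330, "+"),
]


def gene_length(start: int, end: int) -> int:
    """Length in bp of an interval with inclusive 1-based coordinates."""
    return end - start + 1


def neighbour_overlaps(orfs):
    """Return (gene_a, gene_b, shared_bp) for adjacent ORFs that overlap."""
    found = []
    for (a, _, a_end, _), (b, b_start, _, _) in zip(orfs, orfs[1:]):
        shared = a_end - b_start + 1
        if shared > 0:
            found.append((a, b, shared))
    return found


for name, start, end, strand in ORFS:
    length = gene_length(start, end)
    codons, remainder = divmod(length, 3)
    print(f"{name:5s} {strand} {length:5d} bp  {codons:4d} codons  remainder {remainder}")

for a, b, shared in neighbour_overlaps(ORFS):
    print(f"{a} and {b} share {shared} bp")
```

The computed lengths agree with the Gene Length values given in the per-gene entries that follow, and three adjacent pairs (tniB/tniQ, merE/merD, merA/merF) share 4 bp.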
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
tniA | TniA | Tn5053.1 | 1683 | 142-1824 | + |
Class: | Transposase |
Transposase Chemistry: | DDE |
Comment: | can be extended upstream by 12 amino acids |
Protein Sequence:
|
MATDTAPITE HGVATLPEQA WERARRRAEI IGPLAQSETV GHEAADAAAQ ALGLSRRQVY VLIRRARLGS GLVTDLALGQ SSGGKGKGRL PESVERIIRE LLQKRFLTKQ KRSLAAFHRE VVRACKLQKL RVPARNTVAL RIAGLDPREV THRREGQDAA RDLQGVGGVP PPVSAPLEQV QIDHTVIDLI VVDERDRQPI GRPYLTLAID VFTRCVVGMV VTLEAPSAVS VGLCLVHAAC DKRPWLEGLN VEMDWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPDQRGE YASEKMAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAITRTGV PTVITRATAF LVDFLPIIRR TLTRTGFVID HIHYYADALK PWIARRDRLP AFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR EIVTTAQKAT RKARRDADRR QHLKATAPPV KTTPPPDADM ADPQADNQPP AKPFDQIEEW
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
tniB | TniB | Tn5053.1 | 909 | 1827-2735 | + |
Class: | Accessory Gene |
Transposase Chemistry: | Serine |
Comment: | homologous to the TnsC protein of Tn7; putative ATP-binding protein |
Protein Sequence:
|
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE TLYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASAD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLVIRSDDQ LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSSIAT LDMARYLLTR SEGTIGELTH LLMAAALAAV ESGEEAINHR TLSMADYTGP SERRRQFERE LM
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
tniQ | TniQ | Tn5053.1 | 1218 | 2732-3949 | + |
Class: | Accessory Gene |
Sub Class: | Target Site Selection |
Protein Sequence:
|
VKPAPRWPLH PAPREGEALS SWLNRVALCY HIEVSELLEH DLGHGQVDDL DTAPPLALLA MLSQRSGIEL NRLRCMSFAG WVPWLLDSLD DQIPDALETY AFQLSVLLPT HRRKTRSITS WRAWLPSQPI HRACPLCLND PENQAVLLAW KLPLMLSCPL HGCRLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW QALTTGHLEL PRRRIHAGLW FRLLRTLLDE LNTPLSACGT YAGYLRQVWE GCGHPLRAGQ SLWRPYETLN PAVRLQMLEA AATAISLIEV RYISPPGEQA KLFWSEPQTG FTSGLPTKAP KPEPINHWQR AVQAIDEAII EARHNPETAR SLFALASYGR RDPASLEQLR ATFAKEGIAT EFLSHYEPDG SFACLRQNDG LSDKF
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
tniR | TniR | Tn5053.1 | 615 | 4010-4624 | + |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transposase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Comment: | also called TniC |
Protein Sequence:
|
MLIGYMRVSK ADGSQATDLQ RDALIAAGVD PVHLYEDQAS GMREDRPGLT SCLKALRTGD TLVVWKLDRL GRDLRHLINT VHDLTGRGIG LKVLTGHGAA IDTTTAAGKL VFGIFAALAE FERELIAERT IAGLASARAR GRKGGRPFKM TAAKLRLAMA AMGQPETKVG DLCQELGVTR QTLYRHVSPK GELRPDGEKL LSRI
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merE; urf-1 | MerE | Tn5053.1 | 237 | 4677-4913 | - |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | Broad-spectrum mercury transporter |
Protein Sequence:
|
MNSPERLPSE THKPITGYLW GALAVLTCPC HLPILAIVLA GTTAGAFIGE YWGIAALTLT GLFVLSVTRL LRAFKDRS
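The Strand column determines how a protein sequence relates to the top-strand listing: a "+" gene is read directly from the top strand, while a "-" gene such as merE is read from the reverse complement, starting at its highest coordinate. The sketch below illustrates this for the first ten codons of tniR (+) and merE (-). It is an illustration only: the function names are hypothetical, the two fragments are copied from the sequence listing, and the codon table uses the standard amino-acid assignments.

```python
from itertools import product

# Standard codon assignments, built in TCAG order. Names are hypothetical.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def coding_strand(top_fragment: str, strand: str) -> str:
    """Orient a top-strand fragment so that it reads 5'->3' for the gene."""
    if strand == "+":
        return top_fragment
    if strand == "-":
        return reverse_complement(top_fragment)
    raise ValueError("strand must be '+' or '-'")


def translate(coding: str) -> str:
    """Translate complete codons; trailing partial codons are ignored."""
    usable = len(coding) - len(coding) % 3
    return "".join(CODON_TABLE[coding[i:i + 3]] for i in range(0, usable, 3))


# Top-strand fragments copied from the sequence listing.
TNIR_TOP = "ATGCTGATAGGCTACATGCGGGTATCGAAG"  # positions 4010-4039, gene on +
MERE_TOP = "CTCGGACGGCAAGCGCTCGGGGCTGTTCAT"  # positions 4884-4913, gene on -

print(translate(coding_strand(TNIR_TOP, "+")))
print(translate(coding_strand(MERE_TOP, "-")))
```

The two translations reproduce the first ten residues of the TniR and MerE protein sequences given in this record.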
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merD | MerD | Tn5053.1 | 366 | 4910-5275 | - |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MSAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLAVLRQ FVERRREALA DLEVQLATMP TEPAQHAESL P
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merA | MerA | Tn5053.1 | 1647 | 5292-6938 | - |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercuric ion reductase |
Target: | Mercury |
Protein Sequence:
|
MTEITVNGMT CTSCATHVKD ALEKIPGVNA AVVSYPESRA QVMADTAVSH NQLLAAIAAL GYQGSIRVGD FKDEPKIRDA LEGAGLHIAI IGSGGAAMAA ALKAVEQGAT VTLIERGTIG GTCVNIGCVP SKIMIRAAHI AHLRRESPFD GGIAATVPAI DRSKLLAQQQ ARVDELRHAK YEGILDGNPA ITVLHGEARF KDDQSLVVRL NEGGEREVTF DRCLVATGAS PAVPPIPGLK ESPYWTSTEA LVSDTIPARL AVIGSSVVAL ELAQAFARLG SQVTILARST LFFREDPAIG EAVTAAFRAE GIEVLEHTQA SQVAHVNGEF VLTTGHGELR ADKLLVATGR APNTRSLALD AAGVTVNAQG AIVIDQGMRT SNPNIYAAGD CTDQPQFVYV AAAAGTRAAI NMTGGDAALN LTAMPAVVFT DPQVATVGYS EAEAHHDGIE TDSRTLTLDN VPRALANFDT RGFIKLVIEE GSGRLIGVQA VAPEAGELIQ TAVLAIRNRM TVQELADQLF PYLTMVEGLK LAAQTFNKDV KQLSCCAG
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merF | MerF | Tn5053.1 | 246 | 6935-7180 | - |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | mercuric ion transport protein |
Protein Sequence:
|
MKDPKTLLRV SIIGTTLVAL CCFTPVLVIL LGVVGLSALT GYLDYVLLPA LAIFIGLTIY AIQRKRQADA CCTPKFNGVK K
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merP | MerP | Tn5053.1 | 276 | 7183-7458 | - |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | periplasmic mercuric ion binding protein |
Protein Sequence:
|
MKKLLSALAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVDGV SKVDVTFETR EAVVTFDDAK TSVQKLTKAT EDAGYPSSVK N
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merT | MerT | Tn5053.1 | 351 | 7474-7824 | - |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | mercury ion transport protein |
Protein Sequence:
|
MSEPQNGRGA LFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLTVLEPY RPIFIGVALV ALFFAWRRIY RPSAACKPGE VCAIPQVRAT YKLIFWGVAV LVLVALGFPY VVPFFY
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand |
merR | MerR | Tn5053.1 | 435 | 7896-8330 | + |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFARTAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGETD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVRERM ADLARMEAVL SDLVCACHAR KGNVSCPLIA SLQGKKEPRS ADAV
|
|
Internal Repeat Elements |
|
|
Name | Associated Mobile Element | Coordinates | Sequence (Top Strand) |
repeat t4 | Tn5053.1 | 110-128 | TCAATACTCG TGTGCACCA |
IR Tn21-like | Tn5053.1 | 8347-8384 | GAATCGCACG AAATAAAAGG CAAAAGACTC CGCTGAGG |
repeat i4 | Tn5053.1 | 8357-8375 | AAATAAAAGG CAAAAGACT |
repeat i2 | Tn5053.1 | 8399-8417 | CCGTGCGGTC GACTTTTGA |
IRi | Tn5053.1 | 8421-8448 | AAGCACGACA GCGGAAGACT TTCACTGT |
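When comparing the repeat sequences in this table with the sequence listing, it helps to check which strand representation each entry uses, because some right-end entries appear to be written as the complement of the top strand rather than as the top strand itself. The sketch below is one way to run that check for the four right-end repeats. It is an illustration under stated assumptions: the names are hypothetical, and `WINDOW` is positions 8341-8448 copied from the sequence listing.

```python
# Hypothetical names for illustration. WINDOW holds top-strand positions
# 8341-8448 copied from the sequence listing (spaces are removed below).
WINDOW_START = 8341
WINDOW = (
    "CTACGCCTTA GCGTGCTTTA TTTTCCGTTT TCTGAGGCGA CTCCAACGTC AGAAAAGACC "
    "GTGCGGTCGA CTTTTGATAT TTCGTGCTGT CGCCTTCTGA AAGTGACA"
).replace(" ", "")
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# (name, start, end, listed sequence) restated from the table above.
REPEATS = [
    ("IR Tn21-like", 8347, 8384, "GAATCGCACGAAATAAAAGGCAAAAGACTCCGCTGAGG"),
    ("repeat i4", 8357, 8375, "AAATAAAAGGCAAAAGACT"),
    ("repeat i2", 8399, 8417, "CCGTGCGGTCGACTTTTGA"),
    ("IRi", 8421, 8448, "AAGCACGACAGCGGAAGACTTTCACTGT"),
]


def top_strand(start: int, end: int) -> str:
    """Return top-strand bases for 1-based inclusive coordinates."""
    return WINDOW[start - WINDOW_START:end - WINDOW_START + 1]


def classify(start: int, end: int, listed: str) -> str:
    """Report how a listed sequence relates to the top strand."""
    top = top_strand(start, end)
    if listed == top:
        return "top strand"
    if listed == top.translate(COMPLEMENT):
        return "complement"
    if listed == top.translate(COMPLEMENT)[::-1]:
        return "reverse complement"
    return "no match"


for name, start, end, listed in REPEATS:
    print(f"{name:13s} {start}-{end}: {classify(start, end, listed)}")
```

For the coordinates as listed, repeat i2 matches the top strand, while IR Tn21-like, repeat i4 and IRi match its complement, so the appropriate conversion should be applied before searching for these strings in the sequence.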
|