Transposon
Name: Tn1696.1
Family: Tn3        Group: Tn21
Evidence of Transposition: yes
 Host     

Host Organism:Citrobacter freundii L75 Molecular Source:plasmid pCf76
Place of Origin:Bejing, China Date of Isolation:2019
Other Geographic Information:Center for Infectious Disease Control, Center for Disease Control and Prevention of Pla

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGTCGTCTCAGAATTCGGAAAATAAAGCACGCTAAG
IRR (Length: 41 bp)GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAGCCG

 Sequence     
DNA SequenceLength  7983 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAATTCGG AAAATAAAGC ACGCTAAGGC GTAGTCACCC CGTGACTCCC CCGCGCCGAT GCAGCGAGCT TCGTTCCGTC TTGCAGTGAC 100
GCAATCAGCG GGCAGGAAAC GTTCCCTTTC CGCGCATGGC AGGCGCACAC CAGTTCAGAC AGCACGGCCT CCATGCGTGC CAAGTCGGCC ATCTTCTCGC 200
GCACATCCTT GAGCTTGTGC TCGGCCAGGC CGCTGGCTTC CTCGCAATGG GTGCCATCCT CCAGCCGCAG TAGCTCGGCG ATTTCGTCCA GGCTAAAGCC 300
CAGCCGCTGG GCCGATTTCA CGAACCGCAC TCGTGTTACA TCCGCCTCGC CATAGCGGCG AATGCTGCCA TAGGGCTTGT CTGGCTCCGG CAGCAGGCCC 400
TTGCGCTGGT AGAACCGGAT GGTCTCCACA TTGACCCCGG CCGCCTTGGC AAAAACGCCA ATGGTCAGAT TCTCAAAATT AATTTGCATA TCGCTTGACT 500
CCGTACATAA CTACGGAAGT AAGCTTAAGC TATCCAAACC AAATTTGAAA GGACAAGCGT ATGTCTGAAC CACAAAAGTC TGAACCACAA AACGGGCGCG 600
GCGCGCTCTT CGCCGGTGGG CTGGCCGCCA TTCTTGCGTC GGCCTGCTGC CTGGGGCCGC TGGTTTTGAT CGCCTTGGGG TTCAGCGGGG CATGGATCGG 700
CAACCTGACG GTGCTGGAAC CCTATCGCCC GATCTTCATC GGCGCAGCGC TGGTCGCGCT GTTTTTCGCC TGGCGGCGCA TCTACCGCCC GGCGCAAGCC 800
TGCAAACCGG GTGAGGTCTG CGCGATTCCC CAAGTGCGAG CTACTTACAA GCTCATTTTC TGGATCGTGG CCGCGCTGGT CCTGGTCTCG CTCGGATTTC 900
CCTACGTCAT GCCATTTTTC TATTAATCAC AGGAGTTCAT CATGAAAAAA CTGTTTGCCG CCCTCGCCCT CGCTGCCGTT GTTGCCCCCG TGTGGGCCGC 1000
CACCCAGACC GTCACGCTGT CCGTGCCTGG CATGACCTGC GCCTCTTGCC CGATCACTGT CAAGCACGCG CTTTCCAAGG TTGAGGGCGT GAGCAAGACC 1100
GACGTAAGTT TCGACAAGCG CCAGGCCGTC GTCACCTTCG ACGATGCCAA GACCAACGTC CAGAAGTTGA CCAAGGCGAC CGAGGACGCG GGCTATCCGT 1200
CCAGCCTCAA ACGCTGATCC GTTAACCGAA CTCGGGAGCG ACACATGGGA CTCATCACGC GCATCGCTGG CAAAACCGGC GCGCTCGGCA GCGTCGTTTC 1300
CGCGATGGGC TGCGCCGCCT GTTTTCCTGC CATCGCCAGC TTTGGCGCGG CCATCGGACT GGGCTTCTTG AGCCAGTACG AGGGGCTATT CATTGGCATC 1400
CTGCTGCCGA TGTTCGCCGG CATCGCGTTA CTCGCCAATG CTATCGCTTG GCTCAATCAT CGACAGTGGC GACGCACGGC GCTCGGCACG ATAGGCCCGA 1500
TCTTGGTGCT GGCAGCGGTG TTTTTAATGC GGGCTTACGG CTGGCAGAGC GGTGGACTGC TCTATGTCGG CCTGGCCTTG ATGGTTGGGG TGTCGGTCTG 1600
GGATTTCATC TCGCCAGCAC ATCGCCGCTG CGGGCCGGAC AGCTGTGAAT TGCCAGAACA ACGTGGCTGA CGGCAACAGC CGTAGCCACC ACAGAAAAGG 1700
AAAAATACAT GACCACCCTG AAAATCACCG GGATGACCTG CGACTCGTGC GCGGCTCACG TCAAGGAAGC CTTGGAGAAA GTGCCCGGCG TGCAATCGGC 1800
GCTGGTGTCC TATCCGAAGG GCACAGCGCA ACTCGCCATT GAGGCGGGCA CGTCATCGGA TGCGCTGACT ACCGCCGTGG CCGGACTGGG CTACGAGGCA 1900
ACGCTTGCCG ATGCGCCACC GACGGACAAC CGCGCCGGCC TGCTCGACAA GATGCGCGGC TGGATAGGGG CCGCTGATAA GCCCAGTGGC AACGAACGCC 2000
CGTTGCAGGT CGTCGTCATT GGTAGCGGTG GAGCCGCGAT GGCGGCAGCA CTGAAGGCCG TCGAGCAAGG CGCGCAGGTC ACGCTGATTG AGCGCGGCAC 2100
CATCGGCGGC ACCTGCGTCA ACGTCGGTTG TGTGCCGTCC AAGATCATGA TCCGCGCCGC CCACATCGCC CATCTGCGCC GGGAAAGCCC ATTCGACGGC 2200
GGCATGCCAC CCACACCGCC GACGATCTTG CGCGAGCGGC TGCTGGCCCA GCAGCAGGCC CGTGTCGAAG AACTCCGTCA TGCCAAGTAC GAAGGCATCC 2300
TGGACGGCAA TTCAGCCATC ACCGTTCTGC ACGGTGAAGC GCGTTTCAAG GACGACCAGA GCCTTATCGT TAGTTTGAAC GAGGGTGGCG AGCGCGTCGT 2400
GATGTTCGAC CGCTGCCTGG TCGCCACGGG TGCCAGCCCG GCGGTCCCGC CGATTCCGGG CTTGAAAGAG TCACCCTACT GGACTTCCAC CGAGGCCCTG 2500
GCGAGCGACA CCATTCCCGA ACGCCTTGCC GTAATCGGCT CGTCGGTGGT GGCGCTGGAG CTGGCGCAAG CCTTTGCCCG GCTGGGCAGC AAGGTCACGG 2600
CCCTGGCGCG CAATACCTTG TTCTTCCGTG AAGACCCGGC CATCGGCGAG GCGGTGACAG CCGCTTTCCG TGCCGAGGGC ATCGAGGTGC TGGAGCACAC 2700
GCAAGCCAGC CAGGTCGCCC ATATGGACGG TGAATTCGTG CTGACCACCA CGCACGGTGA ATTGCGCGCC GACAAGCTGC TGGTCGCCAC CGGCCGGACA 2800
CCGAACACGC GCAGCCTGGC ATTGGAAGCG GCGGGGGTAG CCGTCAATGC GCAGGGGGCC ATCGTCATCG ACAAGGGCAT GCGCACCAGT AGCCCGAACA 2900
TCTACGCGGC CGGCGACTGC ACCGACCAGC CGCAGTTCGT CTATGTGGCG GCAGCGGCCG GCACTCGTGC GGCGATCAAC ATGACTGGCG GCGATGCGGC 3000
CCTGGACCTG ACCGCAATGC CGGCCGTGGT GTTCACCGAC CCGCAGGTCG CCACCGTGGG CTACAGCGAG GCGGAAGCAC ATCACGACGG GATCGAGACC 3100
GACAGTCGCC TGCTAACACT GGATAACGTG CCGCGTGCGC TTGCCAACTT CGACACACGC GGCTTCATCA AGCTGGTCAT CGAGGAAGGT AGCGGACGGC 3200
TCATCGGCGT GCAAGCGGTG GCCCCGGAAG CGGGTGAACT GATCCAGACG GCGGTGCTCG CCATTCGCAA CCGTATGACC GTGCAGGAAC TGGCCGACCA 3300
ATTGTTCCCC TACCTGACCA TGGTCGAAGG GCTGAAGCTC GCGGCGCAGA CCTTCAGCAA GGACGTGAAG CAGCTTTCGT GCTGCGCCGG ATGAGGAAAA 3400
GGAGGTGTTC AATGAGCGCC TACACAGTGT CCCGGCTGGC CCTTGATGCC GGGGTGAGCG TGCATATCGT GCGCGACTAC CTGCTGCGCG GATTGCTACG 3500
GCCGGTCGCG TACACCACGG GCGGCTACGG CTTGTTCGAT GACACCGCGT TGCAACGGCT GCGCTTTGTA CGGGCTGCCT TCGAAGCGGG TATCGGCCTG 3600
GACGCACTGG CGCGGCTGTG CCGGGCGCTG GATGCTGCGG ACGGTGACGG TGCGTCTGCG CAGCTTGCCG TGTTGCGGCA ACTCGTCGAG CGTCGGCGCG 3700
AGGCCCTGGC CAGCCTCGAA ATGCAACTGG CCGCCATGCC AACCGAACCG GCACAGCACG CGGAGAGTCT GCCATGAACA GCCCAGAGCA CTTGCCGTCT 3800
GAGACGCACA AACCGATCAC CGGCTACTTG TGGGGCGCGC TGGCCGTGCT CACCTGTCCC TGCCATTTGC CGATTCTCGC CATTGTGCTA GCCGGCACGA 3900
CGGCCGGCGC GTTCATCGGG GAGCACTGGG GTATTGCAGC CCTCACGCTG ACCGGCTTGT TTGTCCTGTC TGTGACGCGG CTGCTGCGGG CCTTCAAGGG 4000
AAGATCATGA CCGCTTCCCA GCCAGCCGAG AGTGGGCAGC TTTGAGCTTC GCTACCAATC TGGAGGAGTA CCACCATGAA CGCAAACGCC CCGAACACTG 4100
CCAGTTGCAC CACCTGCTGC GTATGCTGCA AAGAAATTCC GCTCGATGCC GCCTTCACCC CGGAAGGCGC GGAATACGTC GAACATTTCT GCGGGCTGGA 4200
TTGCTATGAA CGCTTCCAGG CACGCGCCAA GGCCGCGACA GAATCTGACA TTGCGCCTGT CCCTGGCGGT TCGCAGCCGT CAGATTGAGG CATACCCTAA 4300
CTTGATGTCA GATGCCATGT GCAAACGATG TCAGAATAGA GTTAAATTTC CTATTGATTG ACATATTCCG TCAAAGGTAA TAGATTTCAT CCTGACACTT 4400
TTGCCTTTGG AGGCATCTTG CAAGGTCAAC GCATCGGCTA TGTCCGCGTC AGCAGCTTCG ACCAGAACCC GGAACGGCAA TTGGAGGGTG TTCAGGTGGC 4500
GCGGGTGTTC ACCGACAAGG CTTCTGGCAA GGACACCCAG CGTCCCGAGC TGGAAAGGCT GCTGGCCTTC GTCCGCGAGG GCGACACCGT GGTGGTGCAT 4600
AGCATGGACA GGCTGGCACG CAACCTTGAT GACCTGCGCC GCATCGTCCA AGGGCTGACA CAACGGGGCG TGCGCATGGA GTTCGTCAAA GAAGGGCTGA 4700
AGTTCACCGG CGAGGACTCA CCGATGGCCA ATCTGATGCT GTCGGTCATG GGAGCCTTCG CTGAGTTCGA GCGCGCCCTG ATCCGCGAAC GTCAGCGCGA 4800
GGGAATCGTG CTGGCCAAGC AGCGCGGTGC CTACCGGGGA CGAAAGAAAT CGCTGAACAG CGAACAAATT GCCGAGTTGA AACGGCGAGT TGCGGCAGGC 4900
GACCAAAAAA CCTTGGTGGC CCGTGACTTC GGCATCAGCC GCGAAACCTT GTACCAGTAC CTGCGGGAAG ACTGACCATG CCACGCCGCT CAATCCTGTC 5000
CGCCACCGAG CGCGAAAGCC TGCTGGCACT GCCAGATGCC AAAGACGAAC TGATACGGCA CTACACGTTC AACGAAACCG ACCTGTCGGT GATCCGTCAG 5100
CGTCGCGGCG CCGCGAATCG ATTGGGCTTC GCTGTGCAGC TTTGCTACTT GCGATTCCCT GGCACCTTTT TGGGCGTCGA TGAGCCTCCG TTTCCGCCCC 5200
TGTTGCGCAT GGTGGCCGCG CAACTCAAGA TGCCAGTGGA AAGTTGGAGC GAGTACGGCC AGCGCGAACA GACACGGCGG GAGCACTTGG TCGAGCTGCA 5300
AACGGTTTTT GGGTTCAAGC CCTTCACCAT GAGCCACTAT CGGCAAGCCG TGCATACATT GACCGAGCTG GCCTTGCAGA CCGACAAAGG CATCGTGCTG 5400
GCGAGCGCAC TTGTCGAGAA TCTGCGGCGG CAGAGCATCA TCCTGCCCGC CATGAATGCC ATCGAGCGCG CAAGCGCCGA GGCCATCACC CGTGCCAACC 5500
GACGCATTTA CGCGGCGCTG ACCGATTCTT TGTTATCACC CCACCGTCAG CGCCTGGACG AACTTCTCAA GCGCAAGGAC GGCAGTAAAG TGACGTGGCT 5600
GGCATGGCTG CGCCAGTCGC CTGCCAAACC GAACTCTCGC CACATGCTCG AACATATTGA GCGCCTGAAA TCCTGGCAAG CACTTGATCT GCCCGCAGGC 5700
ATCGAGCGGC AGGTTCACCA GAACCGCCTG CTCAAAATCG CTCGTGAAGG TGGCCAGATG ACGCCTGCTG ATCTGGCAAA GTTCGAGGTG CAACGACGCT 5800
ATGCCACGCT GGTAGCGCTG GCCATCGAAG GCATGGCCAC CGTCACCGAT GAAATCATCG ACCTTCACGA TCGCATCATC GGCAAGCTGT TCAACGCGGC 5900
CAAGAACAAG CATCAGCAGC AGTTCCAGGC TTCCGGCAAG GCGATCAACG ACAAGGTGCG GATGTATGGG CGCATCGGTC AAGCGTTGAT TGAGGCCAAG 6000
CAAAGCGGCA GCGATCCGTT CGCCGCCATC GAGGCCGTTA TGCCCTGGGA CACCTTCGCC GCCAGCGTCA CCGAAGCGCA AACATTGGCG CGGCCTGCCG 6100
ACTTTGATTT CCTGCACCAC ATCGGTGAAA GCTATGCCAC GCTACGCCGC TACGCGCCGC AGTTCCTGGG CGTGCTCAAA TTGCGGGCTG CGCCCGCCGC 6200
CAAGGGTGTG CTCGATGCCA TCGACATGCT GCGCGGCATG AACAGCGACA GCGCGCGCAA GGTGCCCGCC GATGCGCCAA CCGCATTCAT CAAGCCGCGC 6300
TGGGCAAAGC TGGTTCTGAC CGACGACGGC ATCGACCGGC GTTACTACGA GTTATGCGCC CTGTCGGAGC TGAAGAACGC GCTGCGCTCC GGTGATGTCT 6400
GGGTGCAGGG TTCTCGCCAG TTCAAGGACT TCGACGAATA CCTGGTGCCG GTCGAGAAGT TCGCCACTTT GAAGCTGGCC AGCGAATTGC CGCTGGCAGT 6500
GGCCACCGAC TGCGACCAAT ACCTGCATGA CCGGTTGGAA TTGTTGGAGG CGCAACTCGC CACAGTCAAC CGCATGGCTG CGGCCAACGA CTTACCGGAT 6600
GCCATCATCA CCACCGCGTC AGGCCTGAAG ATCACGCCGC TGGACGCGGC AGTACCAGAC GCCGCGCAAG CCATGATCGA CCAGACAGCT ATGCTGCTGC 6700
CGCACCTCAA AATCACCGAG TTGCTGATGG AGGTCGATGA ATGGACGGGC TTCACCCGCC ACTTCACACA CCTGAAGACC AGCGACACGG CCAAGGACAA 6800
AACCTTGCTG TTGACGACGA TCCTGGCCGA CGCGATCAAC CTGGGTCTGA CCAAAATGGC CGAGTCCTGC CCTGGCACCA CCTACGCCAA GCTGTCTTGG 6900
CTGCAAGCCT GGCACATCCG CGATGAAACC TATTCGACGG CGCTGGCCGA GCTGGTGAAT GCGCAGTTTC GGCAACCCTT CGCCGGCAAC TGGGGTGACG 7000
GCACCACGTC ATCGTCGGAC GGCCAGAACT TCAGAACCGG CAGCAAAGCA GAAAGCACTG GTCATATCAA CCCGAAGTAT GGAAGCAGTC CAGGACGGAC 7100
TTTCTACACC CATATCTCCG ACCAGTACGC GCCCTTCAGT GCCAAGGTGG TCAACGTGGG CATTCGTGAT TCAACTTACG TGCTTGATGG CCTGCTGTAC 7200
CACGAGTCGG ACTTGCGCAT CGAGGAACAC TACACCGACA CGGCAGGCTT CACCGATCAC GTGTTTGGCT TGATGCATTT GCTGGGATTT CGCTTCGCGC 7300
CGCGTATCCG TGACTTGGGC GAAACCAAGC TATTCATCCC CAAGGGCGAT GCCGCCTATG ACGCGCTCAA GCCGATGATT AGCAGCGACA GGCTGAACAT 7400
CAAGCAAATA CGCGCCCATT GGGATGAAAT TCTGCGGCTG GCCACCTCCA TCAAGCAAGG CACGGTAACG GCTTCGCTGA TGCTGCGCAA ACTCGGCAGC 7500
TACCCGCGCC AGAACGGCTT GGCCGTGGCG TTGCGCGAGC TGGGGCGCAT CGAGCGCACG CTGTTCATTT TGGATTGGCT GCAAAGCGTG GAGCTGCGCC 7600
GCCGCGTCCA TGCGGGGCTG AATAAGGGCG AGGCGCGCAA CGCGCTGGCC AGGGCGGTCT TCTTCTACCG ATTGGGTGAA ATCCGCGACC GCAGTTTTGA 7700
GCAGCAGCGC TACCGGGCCA GCGGCCTCAA TCTGGTGACG GCGGCCATCG TGTTGTGGAA CACGGTATAT CTGGAGCGTG CCACCAGTGC TTTGCGTGGC 7800
AACGGCACGG CGCTGGACGA CACATTGTTG CAATATCTGT CGCCGCTGGG GTGGGAGCAC ATCAACCTGA CCGGCGATTA CCTATGGCGC AGCAGCGCCA 7900
AGGTCGGTGC GGGGAAGTTT AGGCCATTGC GACCGCTGCC ACCGGCTTAG CGTGCTTTAT TTTCCGTTTT CTGAGACGAC CCC

 Recombination Sites     

Name Coordinates Gene Sequence
res 4278-4397 120 CGTCAGATTG AGGCATACCC TAACTTGATG TCAGATGCCA TGTGCAAACG ATGTCAGAAT
AGAGTTAAAT TTCCTATTGA TTGACATATT CCGTCAAAGG TAATAGATTT CATCCTGACA
res_site_I 4278-4308 31 CGTCAGATTG AGGCATACCC TAACTTGATG T
res_site_II 4329-4363 35 TGTCAGAATA GAGTTAAATT TCCTATTGAT TGACA
res_site_III 4366-4397 32 TTCCGTCAAA GGTAATAGAT TTCATCCTGA CA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
merR Tn1696.1 34-489 Passenger Gene Heavy Metal Resistance -
merT Tn1696.1 561-926 Passenger Gene Heavy Metal Resistance +
merP Tn1696.1 942-1217 Passenger Gene Heavy Metal Resistance +
merC Tn1696.1 1245-1670 Passenger Gene Heavy Metal Resistance +
merA Tn1696.1 1709-3394 Passenger Gene Heavy Metal Resistance +
merD Tn1696.1 3412-3777 Passenger Gene Heavy Metal Resistance +
merE Tn1696.1 3774-4010 Passenger Gene Heavy Metal Resistance +
urf-2Y Tn1696.1 4076-4288 Passenger Gene Heavy Metal Resistance +
tnpR Tn1696.1 4418-4975 Accessory Gene Resolvase +
tnpA Tn1696.1 4978-7950 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn1696.1 456 34-489 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MQINFENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGEAD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASGLAE HKLKDVREKM
ADLARMEAVL SELVCACHAR KGNVSCPLIA SLQDGTKLAA SARGSHGVTT P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn1696.1 366 561-926 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MSEPQKSEPQ NGRGALFAGG LAAILASACC LGPLVLIALG FSGAWIGNLT VLEPYRPIFI GAALVALFFA WRRIYRPAQA CKPGEVCAIP QVRATYKLIF
WIVAALVLVS LGFPYVMPFF Y

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn1696.1 276 942-1217 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MKKLFAALAL AAVVAPVWAA TQTVTLSVPG MTCASCPITV KHALSKVEGV SKTDVSFDKR QAVVTFDDAK TNVQKLTKAT EDAGYPSSLK R

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merC MerC Tn1696.1 426 1245-1670 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MGLITRIAGK TGALGSVVSA MGCAACFPAI ASFGAAIGLG FLSQYEGLFI GILLPMFAGI ALLANAIAWL NHRQWRRTAL GTIGPILVLA AVFLMRAYGW
QSGGLLYVGL ALMVGVSVWD FISPAHRRCG PDSCELPEQR G

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn1696.1 1686 1709-3394 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MTTLKITGMT CDSCAAHVKE ALEKVPGVQS ALVSYPKGTA QLAIEAGTSS DALTTAVAGL GYEATLADAP PTDNRAGLLD KMRGWIGAAD KPSGNERPLQ
VVVIGSGGAA MAAALKAVEQ GAQVTLIERG TIGGTCVNVG CVPSKIMIRA AHIAHLRRES PFDGGMPPTP PTILRERLLA QQQARVEELR HAKYEGILDG
NSAITVLHGE ARFKDDQSLI VSLNEGGERV VMFDRCLVAT GASPAVPPIP GLKESPYWTS TEALASDTIP ERLAVIGSSV VALELAQAFA RLGSKVTALA
RNTLFFREDP AIGEAVTAAF RAEGIEVLEH TQASQVAHMD GEFVLTTTHG ELRADKLLVA TGRTPNTRSL ALEAAGVAVN AQGAIVIDKG MRTSSPNIYA
AGDCTDQPQF VYVAAAAGTR AAINMTGGDA ALDLTAMPAV VFTDPQVATV GYSEAEAHHD GIETDSRLLT LDNVPRALAN FDTRGFIKLV IEEGSGRLIG
VQAVAPEAGE LIQTAVLAIR NRMTVQELAD QLFPYLTMVE GLKLAAQTFS KDVKQLSCCA G

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn1696.1 366 3412-3777 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MSAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVAYTTGGYG LFDDTALQRL RFVRAAFEAG IGLDALARLC RALDAADGDG ASAQLAVLRQ LVERRREALA
SLEMQLAAMP TEPAQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn1696.1 237 3774-4010 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MNSPEHLPSE THKPITGYLW GALAVLTCPC HLPILAIVLA GTTAGAFIGE HWGIAALTLT GLFVLSVTRL LRAFKGRS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urf-2Y Urf-2Y Tn1696.1 213 4076-4288 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   response to mercury ion (GO:0046689)
Target:   Mercury
Comment:   belongs to the mer operon || ProteinID:ACE81802.1
Protein Sequence:  
MNANAPNTAS CTTCCVCCKE IPLDAAFTPE GAEYVEHFCG LDCYERFQAR AKAATESDIA PVPGGSQPSD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn1696.1 558 4418-4975 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MQGQRIGYVR VSSFDQNPER QLEGVQVARV FTDKASGKDT QRPELERLLA FVREGDTVVV HSMDRLARNL DDLRRIVQGL TQRGVRMEFV KEGLKFTGED
SPMANLMLSV MGAFAEFERA LIRERQREGI VLAKQRGAYR GRKKSLNSEQ IAELKRRVAA GDQKTLVARD FGISRETLYQ YLRED

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn1696.1 2973 4978-7950 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPRRSILSAT ERESLLALPD AKDELIRHYT FNETDLSVIR QRRGAANRLG FAVQLCYLRF PGTFLGVDEP PFPPLLRMVA AQLKMPVESW SEYGQREQTR
REHLVELQTV FGFKPFTMSH YRQAVHTLTE LALQTDKGIV LASALVENLR RQSIILPAMN AIERASAEAI TRANRRIYAA LTDSLLSPHR QRLDELLKRK
DGSKVTWLAW LRQSPAKPNS RHMLEHIERL KSWQALDLPA GIERQVHQNR LLKIAREGGQ MTPADLAKFE VQRRYATLVA LAIEGMATVT DEIIDLHDRI
IGKLFNAAKN KHQQQFQASG KAINDKVRMY GRIGQALIEA KQSGSDPFAA IEAVMPWDTF AASVTEAQTL ARPADFDFLH HIGESYATLR RYAPQFLGVL
KLRAAPAAKG VLDAIDMLRG MNSDSARKVP ADAPTAFIKP RWAKLVLTDD GIDRRYYELC ALSELKNALR SGDVWVQGSR QFKDFDEYLV PVEKFATLKL
ASELPLAVAT DCDQYLHDRL ELLEAQLATV NRMAAANDLP DAIITTASGL KITPLDAAVP DAAQAMIDQT AMLLPHLKIT ELLMEVDEWT GFTRHFTHLK
TSDTAKDKTL LLTTILADAI NLGLTKMAES CPGTTYAKLS WLQAWHIRDE TYSTALAELV NAQFRQPFAG NWGDGTTSSS DGQNFRTGSK AESTGHINPK
YGSSPGRTFY THISDQYAPF SAKVVNVGIR DSTYVLDGLL YHESDLRIEE HYTDTAGFTD HVFGLMHLLG FRFAPRIRDL GETKLFIPKG DAAYDALKPM
ISSDRLNIKQ IRAHWDEILR LATSIKQGTV TASLMLRKLG SYPRQNGLAV ALRELGRIER TLFILDWLQS VELRRRVHAG LNKGEARNAL ARAVFFYRLG
EIRDRSFEQQ RYRASGLNLV TAAIVLWNTV YLERATSALR GNGTALDDTL LQYLSPLGWE HINLTGDYLW RSSAKVGAGK FRPLRPLPPA