Transposon
Name: TnLfArs       (Synonyms: Tn7162)
Family: Tn3        Group: Tn21
Evidence of Transposition: no
 Host     

Host Organism:Leptospirillum ferriphilum Fairview Molecular Source:plasmid pLfTnArs
Place of Origin:South Africa Date of Isolation:2006
Other Geographic Information:Commercial Biooxidation Tank Fairview mine

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 39 bp)GGGGTCGTCTCAGAAAACGGAAAACAAAGCACGTTAAGC
IRR (Length: 41 bp)GGGGTCGTCTCAGAAAACGGAGAATAAAGCACGCTAAGCCG

 Sequence     
DNA SequenceLength  8768 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TTCAGGGGTC GTCTCAGAAA ACGGAAAACA AAGCACGTTA AGCTCGAATT TCGCGCAGGT ACTGGTACAA CGTTTCGCGG CTGATGCCAA ATTCACGGGC 100
CAGCTTTGCC TTTTGCTCGC CGGCATTGAC GCGTTGCAGA AGTTCAGCGG CGCGTTCGGG CGAAAGCGCT TTCTTGCGGC CCCTGTAAGC TCCGCGCTGT 200
TTGGCGAGCG CAATACCTTC TTTCTGTCGC TCACGGATCA AGGCCCGCTC GAATTCGGCG AACGCGCCCA TGACCGACAA CATCAAGTTC GCCATCGGTG 300
AATCCTCGCC GGTGAAGGTC AGGCATTCCT TGACGAACTC GATGCGCACA CCGCGCTTGG TCAGCTTTTG CACGAGGCGA CGCAGGTCAT CGAGGTTGCG 400
GGCCAGGCGA TCCATGCTGT GCACCACCAC CGTATCGCCT TCGCGCACGA AGGCCAGTAG TGAGTCAAGT TCGGGCCGCT GGGTGTCCTT GCCCGATGCC 500
TTGTCGGTGA ATACCTTTCC GACTTCGGCC TGCTCCAGTT GCCGCTCCGG GTTTTGGTCG AAGCTGCTGA CCCGGACGTA GCCGATACGT TGACCTTGCA 600
AGATGCCTCC AAATAGAAAG TGTCAGGAAG AAATCTATGA CCTTGCGTAG CTTGTGTCAA TAAATTAAGT ACGCGAGTCT ATCCGGACGT TTCGGGATAG 700
GCTGTCCTGA CGCCCGGTTA GGGTATAGCC TAATCGGACG CGACGGCATG ATTGAATCAT TTTGATAGTT GTAGTACAAT TTGAGCATGG AAAAGCAAGC 800
CGCAACATCG ATCTTCGAAT CCCTCTCCTC TGGCCTGCGC TTGGATGTGT TCAGGCTGTT GGTCAAGAAG GGGCCGGGCG GCATGGTGGC AGGCGAGATT 900
GCTAACGCGC TGGACATACC GCCGGCCAAC CTTTCCTTTC ACCTCAAGGC GCTGTCACAA GCTCATCTGG TGACGGTTGA GCAGGAGGGT CGCTTCCAAC 1000
GTTACCGAGC GGACATCCCG TTAATGTTGG ATTTGATTGC GTACCTCACC GAAGAATGCT GTTCGGGTAG TCCCGATCAA TGCCTTGAAC TACGTACCGC 1100
CTCGAAGTGT TCCGAGGAGT TCTTGCCGCT CCTGTCACCC ACTCCAACGA AAGCGACAAC ATGAATATTC TGTTTCTTTG TACCGGTAAC TCCTGCCGTT 1200
CCATCCTGGC GGAAGCGACC TTCAACCACT TGGCTCCGAC AGGCTGGAAA GCGATGAGTG CGGGAAGTCA GCCGACAGGG ACGGTCCATC CGCGTTCCCT 1300
GTCGCTCCTG ACACATGAAG GTATCGACAC GGGCGGACTG CACAGCAAAT CATGGGACAA CCTGCCGCTG GCCCCGGATG TGGTCATCAC TGTGTGCGCA 1400
AATGCTGCCG GGGAAACCTG CCCGGCTTAT CTTGGTCCGG TACTGCGTGC GCATTGGGGT GTTGACGATC CCGCTCATGC CACAGGGACC GAAGCGGAAA 1500
TCGAAGCCGC GTTCCCGACG GCCTACCGTA TCTTGCGGCG TCGCATCGAA GCCCTCCTCG CGCTGCCGTT GGCGCAACTT GCGCACGACC GGACACGCTT 1600
GAAGGCGGAA CTGGATCGGA TTGGGGCGCT TTCGGCGTAG TTTTTTTGAC TGATATTTGA ATACTTGATG GAGATTTTAA ATGAAGAAGA TCGAAGTGTT 1700
TGACCCATCG CTGTGTTGCA GCACCGGCGT CTGCGGCGTG GATGTTGACC AAGCCTTGGT GACTTTTGCT GCTGATGTCG ATTGGGCGAA GCAGAACGGC 1800
GCGCACATCG AGCGCTACAA CCTGGCGCAG CAGCCACAGA TGTTTGCCGA CAATGCGACC GTAAAGGGAT TTTTGCAGCG CTCCGGTCAG GATGCGCTGC 1900
CACTGATCCT GGTCGATGGT GAAGTGGCCC TCGCGGGGCG TTATCCAAAG CGGGCTGAAT TGGCCCTCTG GATCGGCATT GATCAACCGG CAGACGCCGC 2000
CAAACCGACG ACGGGCGGTT GCTGCTCCGG CCCGCGTGGC TGCTGCTGAA CAACATGGAG AAAATGATGA TGATATTCCT GCAACTTCCT CCCCGCTTTC 2100
TGTTTTTCAC GGGCAAGGGC GGCGTCGGCA AAACCTCGAT TGCCTGTGCC ACGGCTATTC AATTGGCCGA GGCCGGAAAA CGCGTCCTCC TGGTCAGTAC 2200
CGACCCAGCA TCCAACGTCG GGCAGGTATT TGGTGTTGAT ATCGGTAATC GCGTCACACC GATTCCGGCG GTTCCACGTC TTTCTGCCTT GGAAATTGAT 2300
CCCGAGGCAG CGGCCAGTGC CTATCGGGAG CGCCTGGTCG GCCCTGTGCG CGGTGTGCTT CCTGATGACG TGGTGAAGGG CATCGAAGAA TCGTTGTCCG 2400
GCGCGTGTAC CACCGAAATT GCCGCATTTG ACGAGTTCAC CGCGCTACTA ACCAACGTGG CACTCACGGC TGATTACGAG CACATCATCT TTGATACTGC 2500
GCCCACCGGC CACACCATCC GCTTGCTGCA ACTGCCGGGC GCATGGAGCG GTTTCCTGGA AGCTGGCAAG GGCGATGCCT CGTGCCTCGG CCCGCTGGCC 2600
GGTCTGGAAA AGCAGCGTAC TCAGTACAAG GCGGCTGTTG AAGCCTTGGC TGATCCGCTG CAAACCCGTC TGGTGCTGGT CGCTCGCGCC CAGCAGGCGA 2700
CTTTGCGCGA GGTAGCCCGA ACCCACGAAG AACTGGCCGC CATAGGCCTC AAACAGCAAC ATCTCGTCAT CAACGGCATC CTGCCGCACA TCGAAGCCGC 2800
CACCGACCCA CTGGCCGCAG CAATCCACGA ACGGGAACAA ACGGCGTTGA AGAACATCCC GGCTACGTTA ACTGCGCTCC CGTGTGATCA TGTTGAACTC 2900
AAGCCCTTCA ATCTCGTCGG TCTCGACGCG CTGAGGCAGT TGCTGACCGA CCTTCCACCA CAAGCACCTG TCGCCGTTGA CTCCCCGATC GAACTCGACG 3000
AGCCCGGCGT GGCCGACCTG ATCGACGGCA TCGCGGCGGA TGGACACGGG CTGGTCATGT TGATGGGCAA AGGTGGTGTA GGCAAGACGA CCCTGGCCGC 3100
CGCCATCGCG GTCGAACTGG CACATCGCGG CTTGCCCGTG CATCTGACGA CCTCCGATCC TGCTGCCCAC TTGACCGACA CCCTGGATTC CTCGCTTGAC 3200
AATCTGACCG TGAGCCGAAT CGATCCGCAT GCCGAGACCG AGCGCTATCG CCAGCACGTG CTGGAAACCA AGGGCGCTCA ACTCGATGCC GAAGGTCGCG 3300
CGCTGCTGGA AGAGGATTTG CGTTCGCCTT GCACGGAAGA GATTGCGGTC TTCCAGGCGT TCTCCCGAAT CATTCGCGAG GCTGGGAAAA AGTTCGTCGT 3400
CATGGACACG GCCCCGACAG GGCACACCTT GCTTCTGCTC GACGCGACGG GTGCGTATCA CCGCGAAGTG ACGCGGCAAA TGGGCAAGAC CGGCATGCAC 3500
TTCACGACGC CGATGATGCA ATTGCAGGAC CCGAAGCAAA CCAAGGTGCT GATCGTCACG TTGGCAGAGA CGACGCCGGT ACTGGAGGCC GCCAACCTGC 3600
AAGCTGATTT GCGCCGTGCC GGGATCGAGC CATGGGCCTG GATCATCAAC ACCAGCGTAG CGGCGGCCTC GGCCAAGTCG CCATTACTGC GTCAGCGTGC 3700
GGCCAACGAG CTACGCGAAA TCAGCGCCGT GGCCAATCAG CACGCGGACC GTTACGCGGT TGTCCCGCTT CTGAAGGAAG AACCGATCGG TGCAGATCGA 3800
CTGCGTGCGC TCATCCATCC CCAAACATAG GAGATCGACC ATGTTGATCC GGAATTTCAT GACGCCCGAT CCGGTAACCA TTCAACCTGA AACACCGGTT 3900
GAGGACATTG CGAGGCTGCT GCTTGCCCAT CGCATCAACG GCGTTCCGGT GGTCGACGGC GCTGGTCGGC TGATCGGCGT TGTGACTGCG GAGGACTTGA 4000
TTCACCGGGG AGCGGACGAA CGACTTGAAC CCCGTGAATC GATCTGGAAG GAGAACTTCT GGGTTTCCTT TCTTGGCCCA AAGGGGACGC AGCGTGACAA 4100
GGCCGAGGGA CGTACTGCCG CAGAGGTAAT GACCACAGAA GTGCACAGCG TCACGCCAGC CATGCATCCC TCCGTTGCAG CCCGGCTGAT GGTGGATCAT 4200
CACTTGACGG CCCTTCCGGT AGTGGATGAT GGGAAAGTGA TCGGCGTCAT CTCCCGGATT GACCTCTTAA GCCTTCTTAA AGAACTCGAA AACCCGCTGA 4300
AAAGAGAGAA TTGACGTGAT TGCTGCCCTA CCGATTTTTC TTGCCACTAT CGTCCTGGTG ATCTGGCAAC CGCGCGGGCT GGGCATTGGC TGGAGTGCGA 4400
CGCTTGGCGC TGTCGTGGCG TTACTGTCCG GTGTCGTTCA CTTCGGTGAC ATTCCTGTCG TCTGGCAGAT CGTCTGGAAC GCCACAGCCA CATTCATTGC 4500
CATCATCATC ATTAGCCTGT TGCTCGATGA GGCAGGGTTT TTCGAATGGG CAGCCTTGCA CGTCGCACGC TGGGGCGGAG GAAAAGGACG TCTGCTGTTT 4600
GCACTGATCA TCCTGCTAGG TGCAGCCGTC GCTGCGCTTT TTGCCAACGA TGGCGCAGCA CTGATTCTCA CGCCAATCGT CATGGCCATG TTGCTGGCAC 4700
TGGGGTTCAG TCCGGCGGCA ACGCTCGCCT TCGTCATGGC GGCGGGTTTC ATCGCAGATG CCGCCAGCCT GCCGCTTGTC GTATCCAACC TGGTCAACAT 4800
CGTCTCCGCC GACTTTTTCA ATATCGGGTT CAATGACTAC GCTGCGGTGA TGATTCCGGT GAATATTGTC GCCATCATTG CCTCTCTGGC CGTACTCTCG 4900
ATCTATTTCC GCCGCAGCAT CCCCGCACAC TACGACGTGA ATCAGTTGAA ACAGCCGAAT GAGGCCATTC GCGATGTGGC CACGTTCCGT TTCGGCTGGG 5000
TGGTTCTGGC CCTGCTGCTG GTCGGATTCT TCGGCCTTGA GCCGCTAGGC GTACCGGTTA GCGCCGTGGC TGCCGCAGGC GCCCTACTGT TGCTGGCAGT 5100
CGCTGCGCGC GGGCATGTCA TCAGCACCCG CAAAGTGATC CGTGAAGCGC CTTGGCAAAT TGTCGTTTTC TCGCTGGGAA TGTATCTCGT GGTCTATGGG 5200
TTGCGTAACC AAGGCCTGGC CGAGCATATC GCAAGGCTGC TGGATTACTT CGCACAAGGC GGTGTATGGG GGGCCGCCTT TGGAACAGGC TTCCTCACAG 5300
CGCTGCTGTC CTCGGCTATG AACAACATGC CTACGGTACT AGTGGGCGCC CTATCCATTG ATGCCACAAG CGCAACGGGC GTGGTGAAAG ACACGATGAT 5400
CTACGCCAAT GTGATTGGTA GTGATCTAGG ACCGAAGATC ACCCCCATCG GCTCGCTGGC GACGCTGCTT TGGTTGCATG TGCTGGCGCG CAAGGGAATG 5500
ACAATCACAT GGGGGTATTA CTTCAAAGTG GGCGTTGTGC TGACCGTCCC CGTACTCGCG GCGACCCTGG CAGCGCTGGC ACTGCGCCTG AGTCTCACTT 5600
GATGCGAGCG GAGCGCTGTT TGCGGACATG CAGTATTTGA GTAAAGAAGG GTGAGCGGCT TGGCGTATGG CTATTCTCTT GAGTGCGGTC ATCGGGATAC 5700
CGCTCTATAT CCGCGCCTTT CTTTAGTGGA ACTGGATTTA CTCGGCCACA GAAAGAAAAA GTGCATCCAT GCCACGCCGT TCGATCCTGT CCGCCGCCGA 5800
GCGTGACAAC CTGCTGGCAT TGCCGGACGC CAAGGAAGAG CTGATCCGTC ACTACACGTT CAGCGACTCC GATCTGTCCA TCATCAGGCA GAGGCGCGGC 5900
CCGGCCAACC GCCTGGGCTT CGCCGTGCAG CTCTGCTATC TGCGCTTTCC CGGCGTCATC CTTGGCGCCG ATGAACCGCC GTTTCCGCCC TTGCTGAAAC 6000
TGGTCGCTGA CCAGCTCAAG ATTGGCATCG AAAGCTGGGG TGAATATGGG CAGCGGGGGC AGACCCGGCG CGAGCACCTA GTCGAGCTGC AAACGGTGTT 6100
CGGCTTCCAG CCGTTCACCA TGAGCCACTA CCGGCAGGCT GTCCAGTTGC TGACCGAGTT GGCCATGCAG ACCGACAAGG GCATTGTGCT GGCCAGCGCC 6200
TTGATCGAGC ATCTGCGGCG GCAGTCGGTC ATTTTGCCTG CCCTCAATGC CGTCGAGCGG GCGAGCGCCG AAGCAATCAC CCGCGCCAAC TGGCGCATCT 6300
ACGACGCCTT GGCCGAACCA CTGTCGGACG TGCATCGCCG CCGCCTCGAC GATCTGCTCA AGCGCCGGGA CAACGGCAAG ACGACCTGGC TGGCCTGGCT 6400
GCGCCAATCA CCGGCCAAGC CGAACTCGCG GCACATGCTG GAACACATCG AACGCCTCAA GGCGTGGCAG GCGCTCGATC TTCCTTCCGG CGTCGAGCGG 6500
TCGGTGCACC AGAACCGCCT GCTCAAGATC GCCCGCGAGG GCGGCCAGAT GACGCCTGCC GACCTGGCCA AGTTCGAGGC GCAGCGACGC TATGCCACCC 6600
TGGTGGCGCT GGCCATCGAG GGCATGGCGA CCGTCACCGA CGAAATCATC GACTTGCACG ACCGCATCCT GGGCAAGCTG TTCAACGCCG CCAAGAACAA 6700
GCATCAGCAG CAGTTCCAGG CATCCGGCAA GGCGATCAAC GCCAAGGTGC GGCTGTATGG ACGCATCGGC CAGGCGCTGA TCGACGCCAA GCAGTCGGGC 6800
CGTGATCCGT TCGCCGCTAT CGAGGCCGTC ATGTCTTGGG ATGCCTTCGC CGAAAGCGTC ACCGAGGCGC AAAAGCTCGC GCAACCCGAT GACTTCGATT 6900
TTTTGCACCG CATCGGTGAG AGCTACGCCA CCCTGCGGGG CTATGCACCG GAATTCCTTG CCGTGCTCAA GCTGCGGGCC GCGCCCGCCG CCAAAAACGT 7000
GCTGGATGCC ATTGAGGTAC TGCGCGGTAT GAACACCGAC AACGCCCGCA AGGTGCCCGC CGATGCGCCG ACCGGCTTCA TCAAACCGCG CTGGCAGAAG 7100
CTGGTGATGA CCGACACCGG AATCGACCGG CGCTACTACG AACTGTGCGC CCTGTCGGAA CTCAAGAACT CCCTGCGCTC GGGCGACATC TGGGTGCAGG 7200
GTTCGCGCCA GTTCAAGGAC TTCGAGGACT ACCTGGTGCC GCCCGCGAAG TTCGCCAGCC TCAAGCAGTC CAGCGAATTA CCGCTGGCCG TGGCCACCGA 7300
CTGCGACCAG TACCTGAGCG AGCGCTTGGA GCTGCTGGAA GCGCAGCTCG CCACGGTCAA CCGCATGGCG GCGGCGAACG ATCTGCCGGA CGCCATCATC 7400
ACAGAGTCGG GCCTGAAAAT CACGCCGCTC GATGCGGCGG TGCCGGACAC TGCGCAAGCG TTGATCGACC AGACGGCCAT GATCCTGCCG CACGTCAAGA 7500
TCACAGAATT GCTGCTTGAA GTCGATGAAT GGACGGGCTT CACCCGCCAC TTCACGCACC TGAAATCGGG CGATCTGGCC AAGGACAAGA ACCTGTTGCT 7600
GACCACGATC CTGGCCGACG CGATCAACCT GGGGCTGACG AAAATGGCCG AGTCCTGCCC CGGAACGACC TACGCCAAGC TCGCCTGGCT GCAAGCCTGG 7700
AATACCCGAG ACGAAACCTA TTCGACGGCG CTGGCCGAGT TGGTCAACGC CCAATTCCGG CACCCCTTCG CCGGGCATTG GGGCGACGGC ACCACGTCAT 7800
CGTCGGACGG CCAGAACTTC CGCACCGGCA GCAAGGCCGA GAGTACCGGG CACATCAACC CGAAATACGG CAGCAGCCCA GGACGGACTT TCTACACCCA 7900
CATTTCCGAC CAGTACGCGC CATTCCACAC CAAAGTGGTC AATGTCGGCG TGCGCGACTC GACCTATGTG CTCGACGGCC TGCTGTACCA CGAGTCCGAC 8000
TTGCGGATCG AGGAGCACTA CACCGACACG GCGGGCTTCA CCGATCACGT CTTTGCGCTG ATGCACCTGC TGGGCTTTCG TTTCGCGCCG CGCATCCGCG 8100
ACCTGGGCGA CACCAAGCTC TACATTCCCA AGGGCGAAAC CGCCTATGAC GCCCTCAAGC CGATGATCGG CGGCACGCTC AACATCAAGC ATGTACGCGC 8200
CCATTGGGAC GAAATCCTGC GGCTGGCCAC CTCGATCAAG CAGGGCACCG TGACTGCCTC GCTGATGCTG CGCAAGCTCG GCAGCTACCC ACGCCAGAAC 8300
GGCCTGGCCG TGGCCCTGCG CGAGCTGGGC CGCATCGAGC GCACATTGTT CATCCTTGAC TGGCTACAAA GCGTGGAACT GCGCCGCCGC GTGCATGCCG 8400
GGTTGAACAA GGGCGAAGCC CGCAACGCGC TCGCCAGGGC GGTGTTCTTC AACCGGCTGG GCGAAATCCG CGACCGCAGT TTCGAGCAGC AGCGTTACCG 8500
GGCCAGCGGC CTCAACCTGG TGACGGCAGC CATCGTGCTG TGGAACACAG TTTATCTGGA GCGAGCCGCG AATGCCCTGC GTGACCACGG CAAGCCCGTT 8600
GATGACTCTC TGTTGCAGTA TCTGTCGCCG CTGGGCTGGG AGCACATCAA CCTGACCGGC GATTACCTCT GGCGCAGCAG CGCCAAGATC GGCGCGGGCA 8700
AGTTCAGGCC GCTACGGCCG CTGCAACCGG CTTAGCGTGC TTTATTCTCC GTTTTCTGAG ACGACCCCTG AA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpR TnLfArs 38-601 Accessory Gene Resolvase -
arsR TnLfArs 787-1164 Passenger Gene Heavy Metal Resistance +
arsC TnLfArs 1161-1640 Passenger Gene Heavy Metal Resistance +
arsD TnLfArs 1681-2049 Passenger Gene Heavy Metal Resistance +
arsA TnLfArs 2055-3830 Passenger Gene Heavy Metal Resistance +
CBS domain-like protein TnLfArs 3841-4314 Passenger Gene Heavy Metal Resistance +
arsB TnLfArs 4316-5602 Passenger Gene Heavy Metal Resistance +
tnpA TnLfArs 5769-8735 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnLfArs 564 38-601 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MQGQRIGYVR VSSFDQNPER QLEQAEVGKV FTDKASGKDT QRPELDSLLA FVREGDTVVV HSMDRLARNL DDLRRLVQKL TKRGVRIEFV KECLTFTGED
SPMANLMLSV MGAFAEFERA LIRERQKEGI ALAKQRGAYR GRKKALSPER AAELLQRVNA GEQKAKLARE FGISRETLYQ YLREIRA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
arsR ArsR TnLfArs 378 787-1164 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Arsenic
Comment:   arsenic operon regulator
Protein Sequence:  
MEKQAATSIF ESLSSGLRLD VFRLLVKKGP GGMVAGEIAN ALDIPPANLS FHLKALSQAH LVTVEQEGRF QRYRADIPLM LDLIAYLTEE CCSGSPDQCL
ELRTASKCSE EFLPLLSPTP TKATT

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
arsC ArsC TnLfArs 480 1161-1640 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Arsenic
Comment:   cytoplasmic arsenate reductase
Protein Sequence:  
MNILFLCTGN SCRSILAEAT FNHLAPTGWK AMSAGSQPTG TVHPRSLSLL THEGIDTGGL HSKSWDNLPL APDVVITVCA NAAGETCPAY LGPVLRAHWG
VDDPAHATGT EAEIEAAFPT AYRILRRRIE ALLALPLAQL AHDRTRLKAE LDRIGALSA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
arsD ArsD TnLfArs 369 1681-2049 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Arsenic
Comment:   s arsenic operon regulator
Protein Sequence:  
MKKIEVFDPS LCCSTGVCGV DVDQALVTFA ADVDWAKQNG AHIERYNLAQ QPQMFADNAT VKGFLQRSGQ DALPLILVDG EVALAGRYPK RAELALWIGI
DQPADAAKPT TGGCCSGPRG CC

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
arsA ArsA TnLfArs 1776 2055-3830 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Arsenic
Comment:   ATPase inner membrane efflux pump for extrusion of arsenite, antimonite and arsenate
Protein Sequence:  
MEKMMMIFLQ LPPRFLFFTG KGGVGKTSIA CATAIQLAEA GKRVLLVSTD PASNVGQVFG VDIGNRVTPI PAVPRLSALE IDPEAAASAY RERLVGPVRG
VLPDDVVKGI EESLSGACTT EIAAFDEFTA LLTNVALTAD YEHIIFDTAP TGHTIRLLQL PGAWSGFLEA GKGDASCLGP LAGLEKQRTQ YKAAVEALAD
PLQTRLVLVA RAQQATLREV ARTHEELAAI GLKQQHLVIN GILPHIEAAT DPLAAAIHER EQTALKNIPA TLTALPCDHV ELKPFNLVGL DALRQLLTDL
PPQAPVAVDS PIELDEPGVA DLIDGIAADG HGLVMLMGKG GVGKTTLAAA IAVELAHRGL PVHLTTSDPA AHLTDTLDSS LDNLTVSRID PHAETERYRQ
HVLETKGAQL DAEGRALLEE DLRSPCTEEI AVFQAFSRII REAGKKFVVM DTAPTGHTLL LLDATGAYHR EVTRQMGKTG MHFTTPMMQL QDPKQTKVLI
VTLAETTPVL EAANLQADLR RAGIEPWAWI INTSVAAASA KSPLLRQRAA NELREISAVA NQHADRYAVV PLLKEEPIGA DRLRALIHPQ T

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
CBS domain-like protein CBS domain-like protein TnLfArs 474 3841-4314 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Arsenic
Comment:   possible arsenical efflux pump membrane protein sensor
Protein Sequence:  
MLIRNFMTPD PVTIQPETPV EDIARLLLAH RINGVPVVDG AGRLIGVVTA EDLIHRGADE RLEPRESIWK ENFWVSFLGP KGTQRDKAEG RTAAEVMTTE
VHSVTPAMHP SVAARLMVDH HLTALPVVDD GKVIGVISRI DLLSLLKELE NPLKREN

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
arsB ArsB TnLfArs 1287 4316-5602 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Arsenic
Comment:   Na+
Protein Sequence:  
MIAALPIFLA TIVLVIWQPR GLGIGWSATL GAVVALLSGV VHFGDIPVVW QIVWNATATF IAIIIISLLL DEAGFFEWAA LHVARWGGGK GRLLFALIIL
LGAAVAALFA NDGAALILTP IVMAMLLALG FSPAATLAFV MAAGFIADAA SLPLVVSNLV NIVSADFFNI GFNDYAAVMI PVNIVAIIAS LAVLSIYFRR
SIPAHYDVNQ LKQPNEAIRD VATFRFGWVV LALLLVGFFG LEPLGVPVSA VAAAGALLLL AVAARGHVIS TRKVIREAPW QIVVFSLGMY LVVYGLRNQG
LAEHIARLLD YFAQGGVWGA AFGTGFLTAL LSSAMNNMPT VLVGALSIDA TSATGVVKDT MIYANVIGSD LGPKITPIGS LATLLWLHVL ARKGMTITWG
YYFKVGVVLT VPVLAATLAA LALRLSLT

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnLfArs 2967 5769-8735 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPRRSILSAA ERDNLLALPD AKEELIRHYT FSDSDLSIIR QRRGPANRLG FAVQLCYLRF PGVILGADEP PFPPLLKLVA DQLKIGIESW GEYGQRGQTR
REHLVELQTV FGFQPFTMSH YRQAVQLLTE LAMQTDKGIV LASALIEHLR RQSVILPALN AVERASAEAI TRANWRIYDA LAEPLSDVHR RRLDDLLKRR
DNGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPS GVERSVHQNR LLKIAREGGQ MTPADLAKFE AQRRYATLVA LAIEGMATVT DEIIDLHDRI
LGKLFNAAKN KHQQQFQASG KAINAKVRLY GRIGQALIDA KQSGRDPFAA IEAVMSWDAF AESVTEAQKL AQPDDFDFLH RIGESYATLR GYAPEFLAVL
KLRAAPAAKN VLDAIEVLRG MNTDNARKVP ADAPTGFIKP RWQKLVMTDT GIDRRYYELC ALSELKNSLR SGDIWVQGSR QFKDFEDYLV PPAKFASLKQ
SSELPLAVAT DCDQYLSERL ELLEAQLATV NRMAAANDLP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFTHLKS
GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWNTRDET YSTALAELVN AQFRHPFAGH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY
GSSPGRTFYT HISDQYAPFH TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYIPKGE TAYDALKPMI
GGTLNIKHVR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI
RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRDH GKPVDDSLLQ YLSPLGWEHI NLTGDYLWRS SAKIGAGKFR PLRPLQPA

 References     

Tuffin IM, Hector SB, Deane SM, Rawlings DE. Resistance determinants of a highly arsenic-resistant strain of Leptospirillum ferriphilum isolated from a commercial biooxidation tank. Appl Environ Microbiol. 2006 Mar;72(3):2247-53. doi: 10.1128/AEM.72.3.2247-2253.2006. PubMed ID: 16517682