Transposon
Name: TnAs2       (Synonyms: Tn7144)
Family: Tn3        Group: Tn21
Evidence of Transposition: yes
 Host     

Host Organism:Uncultured bacterium Molecular Source:plasmid pAKD34
Date of Isolation:2011

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG
IRR (Length: 41 bp)GGGGTCGCCTCAGAAAACGGAAAATAAAGCACGCTAAGCCG

 Sequence     
DNA SequenceLength  8650 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCCGACC CTGCCAGGCT TGTTCCGCCC TGTAGCGACG CGATCAGCGG GCAGGAAACA 100
TTCCCCTTCC GCGCATGGCA GGCGCACACC AGTTCAGACA GCACGGTTTC CATGCGCGCC AGGTCGGCCA TCTTTTCGCG CACGTTCTGA AGCTTGTGCT 200
CGGCCAGGCT GCTGGCTTCC TCGCAGTGGG TGCCGTCGTC GAGCCGCAGC AGATCGGCTA TTTCATCCAG GCTGAAGCCC AGCCGCTGGG CCGATTTCAC 300
GAATCGCACT CGCGTCACAT CCGCCGCGCC ATAGCGGCGA ATGCTGCCGT AAGGCTTATC CGGTTCCGGC AGCAAGCCCT TACGCTGATA GAAGCGGATC 400
GTTTCCACAT TGACGCCGGC CGCCTTGGCA AAAACGCCAA TGGTCAGGTT CTCTAAATTG TTTTCCATAT CGCTTGACTC CGTACATGAG TACGGAAGTA 500
AGGTTACGCT ATCCAATTTA AATCCAAAAG GACAAGCGCA TGTCTGAACC ACAAACTGGG CGCGGCGCGC TCTTCGCCGG CGGGCTGGCC GCCATCCTCG 600
CCTCCGCTTG CTGCCTCGGG CCGCTGGTTC TGATCGCCTT GGGCTTCAGC GGGGCTTGGA TCGGCAACCT GACCGTGCTG GAGCCGTATC GCCCGATTTT 700
CATCGGCGCA GCGCTGGTGG CGTTGTTCTT CGCCTGGCGG CGCATCTACC GCCCTGCGCA AGCCTGTAGG CCGGGTGAGG TCTGCGCGAT TCCCCAAGTG 800
CGAGCGACCT ACAAGCTCAT TTTCTGGGTC GTGGCCGTAC TGGTTCTGGT CGCGCTCGGA TTTCCCTATG TCATGCCATT TTTCTACTGA TCGGAGTTCA 900
CCATGAAGAA ACTGTTTGCC TCCCTCGCCC TCGCCGCCGT TGTTGCCCCC GTGTGGGCCG CTACCCAGAC CGTCACGCTG TCCGTACCGG GCATGACGTG 1000
CTCGGCCTGT CCGATCACCG TCAAGAAGGC GATTTCCAAG GTCGATGGCG TCAGTAAAGT TGCCGTGACC TTCGAGACCC GCGAAGCGGT GGTCACCTTC 1100
GATGATGCCA AGACCAGTGT GCAGAAGCTG ACCAAGGCAA CCGGGGACGC GGGCTATCCC TCCAGCGTCA AGCAGTGAGC CACTGAACCC GACCCTGGGG 1200
CGATCATGGG ACTGATCACA CGCATTGCCG ACAAGGCCGG CGCGCTTGGC AGTGTGGTTT CCGCGATGGG CTGCGCCGCC TGTTTTCCGG CCATCGCCAG 1300
CTTGGGCGCG GCCATCGGGC TGGGCTTTCT ACAGGAATAC GAAGGCTTGT TCATTTTCAC GCTGCTGCCG CTGTTCGCCG TCGTCGCTTT GCTGGCGAAT 1400
GCACTGGGCT GGCTCAGTCA TCGGCAATGG CACCGCAGCC TGCTCGGCAT GATCGGGCCA GCCATCGTGT TCGCCGGAAC CGTCTGGTTG CTTGGCAACT 1500
GGTGGACAGC GCGCCTCGTA TACACCGGCC TGGCCCTGAT GATCGGCGTG TCGATCTGGG ACTTGGTTTC ACCGGCAAAT CGCCGCTGCG GGCCGGATGG 1600
CTGCGAACTC CCAGCAAAAC ACGGCTGACG GCGTGAGCCA GCGGCCAACA CAGAAAAGGA ACGATGAATG ACCACCCTCA AAATCACCGG CATGACTTGC 1700
GACTCGTGCG CAGTGCATGT CAAGGAAGCC CTGGAGAAAG TGCCAGGCGT GCAGTCGGCG AATGTCTCAT ATACCAAGGG CAGCGCCAAG CTCGCCGTCG 1800
AGACCGGCAC ATCGCCGGAC GCGTTGACTG CCGCCGTGGC CGGACTCGGT TATCGGGCCA CGCTCGCCGA CGCGCTCGTG CCTCCGGTGG GCGGCGGCTT 1900
GCTCGTCAAG ATGCGCGAAT GGCTAGGTAG CGGCGACAAG GCTGGTGATG ATGGTGGCGG ATTGCATATT GCCGTTATCG GCAGCGGCGG CGCGGCGATG 2000
GCGGCGGCGC TGAAGGCCGT CGAGCAAGGC GCGCACGTCA CCCTGATCGA GCGCGGCACT ATTGGCGGCA CCTGCGTCAA TATCGGCTGC GTGCCGTCCA 2100
AGATCATGAT CCGCGCTGCT CACATCGCCC ACTTGCGTCG AGAAAGCCCG TTCGATGGCG GCATGCCGCC CACACCGCCG ACGATCTTGC GCGAGCGGCT 2200
GCTGGCCCAG CAGCAGGCGC GCGTCGATGA ACTGCGCCAC GCCAAATACG AAGGCATCCT GGATGACAAT CCCGCCATCT CCGTGCTGCA CGGCGAAGCC 2300
CGTTTCAAGG ACGGCCACAG CCTGACCGTG CAGCTCAATG GCGGCGGCGA GCGTGTGGTG ACTTTCGACC GTTGCCTGAT CGCCACCGGC GCGAGTCCGG 2400
CCGTGCCGCC GATCCCCGGC CTGAAAGACA CACCCTACTG GACTTCCACC GAAGCGCTGG TCAGCGACAC CATTCCCGAG CGGTTGGCGG TGATCGGCTC 2500
ATCGGTGGTG GCGCTGGAAC TGGCGCAAGC TTTTGCCCGG CTGGGCAGCC AGGTCACGAT CCTGGCGCGC AGCACCTTGT TCTTCCGAGA AGACCCGGCC 2600
ATCGGCGAGG CAATCACGGC GGCGTTCCGC GCCGAAGGGA TCGAGGTGCT GGAGCACACC CAGGCCAGCC AGGTCGCGCA CGAGGGCGGC GAATTCGTGC 2700
TCACCACGGC GCACGGCGAG CTGCGTGCCG ACAAACTGCT GGTCGCCACC GGTCGCTCGC CCAACACGCG CAGCCTGGCG CTGGACGCGG CAGGCGTCGC 2800
GCTCAACCTG CAAGGCGCTA TCGTCATCGA CGTCGGCATG CGGACCAGCA CGCCGGACAT CTACGCGGCC GGCGACTGCA CCGACCAGCC TCAGTTCGTC 2900
TATGTGGCGG CAGCGGCCGG CACCCGTGCG GCGATCAACA TGACGGGCGG TGACGCGGCG CTTAACCTTG CCGCCATGCC TGCCGTGGTG TTCACCGATC 3000
CGCAAGTCGC CACCGTGGGC TACAGCGAGG CAGAGGCCCA GCACGACGGG ATCGAGACCG ACAGTCGCAC CTTGACCTTG GACAACGTGC CGCGTGCGCT 3100
CGCCAACTTC GACACACGCG GCTTCATCAA GCTGGTCATC GAGGAAGGTA GTGGACGGCT CATCGGCGTA CAGGCGGTCG CGCCGGAAGC CGGCGAACTG 3200
ATCCAGACGG CGGTACTCGC CATCCGTAAC CGGATGACCG TGCAGGAACT GGCCGACCAG TTGTTCCCCT ATCTGACGAT GGTCGAGGGA CTGAAGCTCG 3300
CGGCGCAGAC CTTTAACAAG GATGTGAAGC AGCTTTCCTG CTGCGCCGGG TGAGAAAAAG GAGGTGTTCG ATGAACGCCT ACACGGTGTC CCGGCTGGCC 3400
CTTAATGCCG GGGTGAGCGT GCACATCGTG CGCGACTACC TGCTGCGCGG ACTGCTGCGG CCGGTGGCGT GCACCCCTGG CGGCTACGGC CTGTTCGACG 3500
ATGCGGCCTT GCAACGGCTG TGCTTCGTGC GGGCTGCTTT CGAAGCGGGT ATCGGTCTCG ACGCGCTGGC GCGGCTGTGC CGGGCGTTGG ATACCGCGGA 3600
CGGCGACGAA GCGGCCGCGC AGCTTGCCGT GCTGCGCCAG TTCGTCGAAC GTCGGCGCGA AGCATTGGCC GATCTGGAGG TGCAGTTGGC CACCATGCCG 3700
ACCGAGCTGG TACAGCATGC GGAGAGTCTG CCATGAACAG CCCCGAGCGC TTGCCGACCG AGACACACAA GCCGTTCACC GGCTACCTGT GGGGCGCGTT 3800
GGCCGTGCTC ACCTGTCCCT GCCATTTGCC GATTCTCGCT GTGGTGCTCG CGGGGACGAC GGCCGGCGCG TTCATCGGGG AGCATTGGGG TATTGCAGCC 3900
ATCACGCTGA CCGGCTTGTT TGTCCTGTCC GTGACGCGGC TGTTGCGGGT CTTCAAGGAA AGATCGTGAG CACATCACTG CCCGCCAGAT GGACGGCGAC 4000
CGAATTGGCG CAGGCGGTTG TGCGCGGGCA GTTTGAGCTG CACTACCAGC CGATTGTCGA TTTGCGCAGT GACCAGATTG TCGGTGCGGA AGCCCTGTTG 4100
CGCTGGCGTC ATCCGCAGCT CGGACTTTTG CCGCCGGGCC AGTTCCTGCC CGTGGTCGAA TCGTCCGGCT TGATGCCTGA AATCGGCGCT TGGGTGCTGG 4200
GCACCGCCTG CCGCCAGATG CGCGAATGGC GGATGCTGGC ATGGCAACCG TTCCGGCTCG CCGTCAATGT TTCGGCGAGC CAGGTAGGAC CGGATTTCGA 4300
CGGGTGGGTA AAGGGCGTGC TGGCTGACGC CGAGTTGCCT GCCGAGTATC TGGAAATTGA GCTGACCGAA TCGGTCGCGT TCGGCAATCC GGCGATCTTC 4400
CCAGCCCTGG AAGCCTTGCG ACAGATCGGT GTGCGCTTCG CTGCCGATGA CTTCGGGACG GGGTATTCCT GTCTGCAACA CCTGAAGTGC TGCCCGATTA 4500
CTAAGCTCAA GATCGACCAG TCCTTCGTCG CCGGACTCGC CGACGACCAC CGCGACCGGA CCATCGTGCA CACCGTGATT CAGCTTGCGC ACGGGCTGGG 4600
AATGGACGTG GTGGCCGAGG GCGTGGAAAC GCCGACCAGT CTCGCGCTGT TGCGGCAAGC GGAGTGCGAT ACGGGGCAAG GCTTCCTGTT CGCCAAGCCA 4700
GTGCCGGCGG CGGCATTCGC CGCCTTTGTC AGTCAATGGA GGGGTGCCAC CATGAATGCA AATGATCCGA CTGCCACCAG TTGCTGCGTG TGCTGCAAGG 4800
AAATCCCGCT CGACGCCGCC TTCACACCGG AAGGTGCGGA GTACGTCGAG CGCTTCTGCG GGCTGGAGTG TTATCAACGC TTCCAGGCGC GGGCAAGCAA 4900
CGCGACCGAA ACGAGCGCCG AACTGAACGC TTGCGGTTCG CCGCCGTCAG ATTGAGGCAT ACCCTATCCT GATGTCAGGA TAGACCGCCT CACAACGTCA 5000
GAATAGAGTC GATTGTGTTA TTTATTGACA CATGCAGAAA AAGGTCATAG ATTTCTTCCT GACATTTTCG CCCAGGGAGG CATCTTGCAG GGTCAACGCA 5100
TCGGCTACGT CCGGGTCAGC AGCTTCGACC AGAACCCGGA ACGGCAACTT GAACACGTCG AAGTCGGCAA GGTGTTCACC GACAAGGCGT CGGGCAAGGA 5200
CACCCAGCGG CCCGAGCTTG ATTCGCTGCT GGCCTTCGTA CGCGAAGGCG ACACCGTGGT GGTTCATAGC ATGGATCGCC TGGCGCGCAA CCTCGATGAC 5300
TTGCGCCGCC TCGTGCAAAA GCTCACCAAG CGCGGCGTGC GTATCGAGTT CGTCAAGGAA AGCCTGACCT TCACCGGCGA GGATTCGCCG ATGGCGAACC 5400
TAATGCTGTC GGTCATGGGG GCGTTCGCCG AATTCGAGCG GGCCTTGATC CGCGAGCGGC AGAGGGAAGG CATCGCGCTC GCCAAACAGC GCGGAGCCTA 5500
CCGGGGCCGC AAGAAAGCGC TGTCGCCCGA ACAGGTAGCC GATCTGCGGC AGCGGGCCGC CGCCGGCGAA CAAAAAGCGA AGCTGGCCCG CGAGTTTGGT 5600
GTCAGCCGGG AGACCCTGTA TCAATACTTG AGAACGGATC AGTAAATATG CCACGTCGTT CAATCCTGTC CGCCGCCGAG CGGGAAAGCC TGCTGGCGTT 5700
GCCGGACTCC AAGGACGACC TGATCCGACA TTACACATTC AGCGATAGCG ACCTCTCGAT CATCCGACAG CGGCGCGGGC CTGCAAATCG TTTGGGCTTT 5800
GCAGTTCAGC TCTGTTACCT GCGTTTCCCC GGCATCATTC TTGGTGTCGA TCAGCCGCCG TTCCTACCTT TACTGAAACT GGTCGCCGAC CAACTCAAGG 5900
TCGGCATCGA AAACTGGGAC GAGTACGGGC AGCGGGAGCA GACCCGGCGC GAGCACCTGG TCGAGCTGCA AGCCTTGTTC GGCTTTCAGC CGTTCACCAT 6000
GAGCCACTAC CGGCAGGCCG TGCACACGCT GACCGAGCTG GCCATGCAGA CCGACAAAGG CATCGTGCTG GCCAGCGCCT TGATCGAACA TCTGCGGCGG 6100
CAGTCGGTCA TTCTGCCTGC CCTCAACGCC GTCGAGCGGG CGAGCGCCGA GGCAATCACC CGTGCCAACC GGCGCATCTA CGATGCCTTG GCCGAACCGC 6200
TGTCGGACGC GCATCGCCGC CGCCTCGACG ACCTGCTCAA GCGCCGCGAT AACGGCAAGA CGACCTGGCT GGCTTGGCTG CGCCAGTCGC CCGTCAAGCC 6300
AAACTCGCGG CACATGCTGG AACACATCGA ACGCCTCAAA GCGTGGCAGA CGCTCGACCT GCCTTCGGGC ATCGAGCGTC TGGTTCACCA GAACCGGCTG 6400
CTCAAGATCG CCCGCGAAGG CGGTCAGATG ACGCCCGCCG ACTTGGCCAA GTTCGAGCCA CAGCGCCGCT ACGCAACCCT GGTGGCGCTT GCCATTGAAG 6500
GCATGGCTAC CGTCACCGAC GAAATCATCG ACCTACACGA CCGCATCTTG GGCAAGCTGT TCAACGCCGC CAAGAATAAG CATCAGCAGC AGTTCCAGGC 6600
GTCCGGCAAG GCCATCAACG CCAAGGTGCG CCTGTACGGA CGCATCGGCC AGGCGCTGAT CGACGCCAAG CAGTCGGGAC GCGACCCGTT CGCCGCCATC 6700
GAGGCCGTCA TGTCCTGGGA TGCTTTCGCC GAGAGCGTCA CCGAGGCGCA GAAGCTCGCG CAGCCCGATG ACTTCGATTT TCTGCATCGC ATCGGCGAGA 6800
GCTACGCCAC TTTGCGCCGT TACGCGCCGG AATTTCTTGC TGTGCTCAAG CTGCGGGCGG CGCCCGCTGC CAAGAACGTG CTCGATGCCA TCGAGGTGCT 6900
GCGCGGCATG AACACCGACA ACGCCCGCAA GGTGCCCGCC GATGCCCCGA CCGATTTCAT TAAGCCGCGC TGGCAGAAGC TGGTGATGAC CGACGCCGGC 7000
ATCGACCGGC GCTACTACGA GCTGTGCGCG CTATCGGAGC TGAAGAACTC ACTGCGCTCG GGCGACATCT GGGTGCAGGG TTCGCGCCAG TTCAAGGATT 7100
TCGAGGACTA TCTGGTGCCG CCTGCGAAGT TCGCCAGTCT CAAGCAGTCC AGTGAATTGC CGCTGGCCGT GGCCACCGAC TGCGATCAGT ACCTGCACGA 7200
GCGGCTGACG CTGCTGGAAA CCCAGCTCGC CACCGTCAAC CGCATGGCGG CGGCCAACAA CCTGCCGGAC GCCATCATCA CCGAGTCGGG CCTGAAAATC 7300
ACGCCGTTGG ATGCGGCGGT GCCGGACACC GCGCAGGCGT TGATCGACCA AACGGCCATG ATCCTGCCGC ACGTCAAGAT CACCGAGCTG CTGCTTGAAG 7400
TCGATGAGTG GACGGGCTTC ACCCGGCACT TCACACACCT GAAATCGGGC GATCTGGCCA AGGACAAAAA TTTGCTGCTG ACCACCATCT TGGCCGACGC 7500
GATCAACCTG GGCCTGACCA AGATGGCCGA GTCCTGCCCC GGCACGACCT ACGCCAAGTT GGCTTGGCTG CAAGCCTGGC ATACCCGCGA CGAAACCTAT 7600
TCGACGGCAT TGGCCGAATT GGTCAACGCC CAATTCCAGC ACCCGTTCGC CGGGCACTGG GGCGACGGCA CCACGTCATC GTCGGACGGC CAGAACTTCC 7700
GAACCGGCAG CAAGGCCGAG AGTACCGGGC ACATCAACCC AAAATATGGC AGCAGCCCAG GACGGACTTT CTACACCCAT ATCTCCGACC AGTACGCGCC 7800
GTTCCATACC AAGGTGGTCA ATGTCGGCGT GCGCGATTCG ACCTATGTGC TCGACGGCCT GCTGTACCAC GAGTCCGACT TGCGCATCGA GGAACACTAC 7900
ACCGACACGG CGGGCTTCAC CGATCACGTC TTCGCCTTGA TGCACCTGCT GGGCTTCCGC TTCGCGCCGC GCATCCGCGA CTTGGGCGAC ACCAAGCTGT 8000
TCATCCCCAA GGGCGAAACC AATTACGATG CACTCAAGCC GATGATTAGC AGTGACAAGC TGAACATCAA GGCGATTCGT GCCCATTGGG ACGAAATCTT 8100
GCGGCTGGCC ACCTCGATCA AGCAGGGCAC AGTGACGGCC TCGCTGATGC TGCGCAAGCT CGGCAGCTAC CCGCGCCAGA ACGGTCTGGC CGTGGCGTTG 8200
CGCGAGCTGG GCCGCATCGA ACGCACGTTG TTCATCCTGG ATTGGCTACA AAGCGTGGAA CTGCGCCGCC GCGTGCATGC CGGGTTGAAC AAGGGCGAAG 8300
CCCGCAACGC GCTGGCCCGC GCCGTGTTCT TCAACCGCCT TGGCGAAATC CGCGACCGCA GTTTCGAGCA GCAGCGCTAC CGAGCCAGCG GCCTCAACCT 8400
GGTGACGGCG GCCGTCGTGT TGTGGAACAC GGTCTATCTG GAACGGGCTG CACATGCGCT GCGCGGCAAC GGCCATGCCG TCGATGACGC GCTGTTGCAA 8500
TATCTGTCGC CGCTCGGCTG GGAGCACATC AATCTGACCG GCGATTACCT GTGGCGCAGC AGCGCCAAGA TCGGCGCAGG CAAGTTCAGG CCGTTACGAC 8600
CGCTGCAACC GGCTTAGCGT GCTTTATTTT CCGTTTTCTG AGGCGACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res_site_II 4996-5030 35 CGTCAGAATA GAGTCGATTG TGTTATTTAT TGACA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
merR TnAs2 34-468 Passenger Gene Heavy Metal Resistance -
merT TnAs2 540-890 Passenger Gene Heavy Metal Resistance +
merP TnAs2 903-1178 Passenger Gene Heavy Metal Resistance +
merC TnAs2 1206-1628 Passenger Gene Heavy Metal Resistance +
merA TnAs2 1668-3353 Passenger Gene Heavy Metal Resistance +
merD TnAs2 3371-3736 Passenger Gene Heavy Metal Resistance +
urf2 TnAs2 3501-4955 Passenger Gene Other +
tnpR TnAs2 5085-5645 Accessory Gene Resolvase +
tnpA TnAs2 5648-8617 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR TnAs2 435 34-468 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGAAD VTRVRFVKSA QRLGFSLDEI ADLLRLDDGT HCEEASSLAE HKLQNVREKM
ADLARMETVL SELVCACHAR KGNVSCPLIA SLQGGTSLAG SAMP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT TnAs2 351 540-890 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MSEPQTGRGA LFAGGLAAIL ASACCLGPLV LIALGFSGAW IGNLTVLEPY RPIFIGAALV ALFFAWRRIY RPAQACRPGE VCAIPQVRAT YKLIFWVVAV
LVLVALGFPY VMPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP TnAs2 276 903-1178 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MKKLFASLAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVDGV SKVAVTFETR EAVVTFDDAK TSVQKLTKAT GDAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merC MerC TnAs2 423 1206-1628 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MGLITRIADK AGALGSVVSA MGCAACFPAI ASLGAAIGLG FLQEYEGLFI FTLLPLFAVV ALLANALGWL SHRQWHRSLL GMIGPAIVFA GTVWLLGNWW
TARLVYTGLA LMIGVSIWDL VSPANRRCGP DGCELPAKHG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA TnAs2 1686 1668-3353 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MTTLKITGMT CDSCAVHVKE ALEKVPGVQS ANVSYTKGSA KLAVETGTSP DALTAAVAGL GYRATLADAL VPPVGGGLLV KMREWLGSGD KAGDDGGGLH
IAVIGSGGAA MAAALKAVEQ GAHVTLIERG TIGGTCVNIG CVPSKIMIRA AHIAHLRRES PFDGGMPPTP PTILRERLLA QQQARVDELR HAKYEGILDD
NPAISVLHGE ARFKDGHSLT VQLNGGGERV VTFDRCLIAT GASPAVPPIP GLKDTPYWTS TEALVSDTIP ERLAVIGSSV VALELAQAFA RLGSQVTILA
RSTLFFREDP AIGEAITAAF RAEGIEVLEH TQASQVAHEG GEFVLTTAHG ELRADKLLVA TGRSPNTRSL ALDAAGVALN LQGAIVIDVG MRTSTPDIYA
AGDCTDQPQF VYVAAAAGTR AAINMTGGDA ALNLAAMPAV VFTDPQVATV GYSEAEAQHD GIETDSRTLT LDNVPRALAN FDTRGFIKLV IEEGSGRLIG
VQAVAPEAGE LIQTAVLAIR NRMTVQELAD QLFPYLTMVE GLKLAAQTFN KDVKQLSCCA G

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD TnAs2 366 3371-3736 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MNAYTVSRLA LNAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDTADGDE AAAQLAVLRQ FVERRREALA
DLEVQLATMP TELVQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urf2 Urf2 TnAs2 1455 3501-4955 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  EAL (Pfam:PF00563)||DUF3330 (Pfam:PF11809)
Comment:   similar to urfM from E.coli
Protein Sequence:  
MRPCNGCASC GLLSKRVSVS TRWRGCAGRW IPRTATKRPR SLPCCASSSN VGAKHWPIWR CSWPPCRPSW YSMRRVCHEQ PRALADRDTQ AVHRLPVGRV
GRAHLSLPFA DSRCGARGDD GRRVHRGALG YCSHHADRLV CPVRDAAVAG LQGKIVSTSL PARWTATELA QAVVRGQFEL HYQPIVDLRS DQIVGAEALL
RWRHPQLGLL PPGQFLPVVE SSGLMPEIGA WVLGTACRQM REWRMLAWQP FRLAVNVSAS QVGPDFDGWV KGVLADAELP AEYLEIELTE SVAFGNPAIF
PALEALRQIG VRFAADDFGT GYSCLQHLKC CPITKLKIDQ SFVAGLADDH RDRTIVHTVI QLAHGLGMDV VAEGVETPTS LALLRQAECD TGQGFLFAKP
VPAAAFAAFV SQWRGATMNA NDPTATSCCV CCKEIPLDAA FTPEGAEYVE RFCGLECYQR FQARASNATE TSAELNACGS PPSD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnAs2 561 5085-5645 +
Class:   Accessory Gene
Sub Class:   Resolvase
Function:   resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MQGQRIGYVR VSSFDQNPER QLEHVEVGKV FTDKASGKDT QRPELDSLLA FVREGDTVVV HSMDRLARNL DDLRRLVQKL TKRGVRIEFV KESLTFTGED
SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKALSPEQ VADLRQRAAA GEQKAKLARE FGVSRETLYQ YLRTDQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnAs2 2970 5648-8617 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPRRSILSAA ERESLLALPD SKDDLIRHYT FSDSDLSIIR QRRGPANRLG FAVQLCYLRF PGIILGVDQP PFLPLLKLVA DQLKVGIENW DEYGQREQTR
REHLVELQAL FGFQPFTMSH YRQAVHTLTE LAMQTDKGIV LASALIEHLR RQSVILPALN AVERASAEAI TRANRRIYDA LAEPLSDAHR RRLDDLLKRR
DNGKTTWLAW LRQSPVKPNS RHMLEHIERL KAWQTLDLPS GIERLVHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LAIEGMATVT DEIIDLHDRI
LGKLFNAAKN KHQQQFQASG KAINAKVRLY GRIGQALIDA KQSGRDPFAA IEAVMSWDAF AESVTEAQKL AQPDDFDFLH RIGESYATLR RYAPEFLAVL
KLRAAPAAKN VLDAIEVLRG MNTDNARKVP ADAPTDFIKP RWQKLVMTDA GIDRRYYELC ALSELKNSLR SGDIWVQGSR QFKDFEDYLV PPAKFASLKQ
SSELPLAVAT DCDQYLHERL TLLETQLATV NRMAAANNLP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFTHLKS
GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHTRDET YSTALAELVN AQFQHPFAGH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY
GSSPGRTFYT HISDQYAPFH TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLFIPKGE TNYDALKPMI
SSDKLNIKAI RAHWDEILRL ATSIKQGTVT ASLMLRKLGS YPRQNGLAVA LRELGRIERT LFILDWLQSV ELRRRVHAGL NKGEARNALA RAVFFNRLGE
IRDRSFEQQR YRASGLNLVT AAVVLWNTVY LERAAHALRG NGHAVDDALL QYLSPLGWEH INLTGDYLWR SSAKIGAGKF RPLRPLQPA

 References     

Sen D, Van der Auwera GA, Rogers LM, Thomas CM, Brown CJ, Top EM. Broad-host-range plasmids from agricultural soils have IncP-1 backbones with diverse accessory genes. Appl Environ Microbiol. 2011 Nov;77(22):7975-83. doi: 10.1128/AEM.05439-11. Epub 2011 Sep 23. PubMed ID: 21948829