Transposon
Name: Tn511
Family: Tn3        Group: Tn163
Evidence of Transposition: no
 Host     

Host Organism:Proteus mirabilis Molecular Source:plasmid R772

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG
IRR (Length: 38 bp)GGGGTCGCCTCAGAAAACGGAAAATAAAGCACGCTAAG

 Sequence     
DNA SequenceLength  18789 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCCGAAC CTGCCAAGCT TGCTCCACCC TGTAGTGACG CGATCAGCGG GCAGGAAACG 100
TTCCCCCTTC GCGCATGGCA GGCGCACACC AACTCAGACA GCACGGCCTC CATGCGCGCC AGGTCAGCCA TTTTCTCGCG CACGTCCTTG AGCTTGTGCT 200
CGGCCAGACT GCTGGCTTCC TCGCAATGGG TGCCATCCTC CAGCCGCAGC AGCTCGGCGA TCTCATCCAG GCTGAAGCCC AGCCGCTGGG CTGATTTCAC 300
GAAGCGCACC CGCGTTACAT CCGCCTCGCC ATAGCGGCGG ATGCTGCCAT AGGGCTTGTC AGGCTCCAGC AACAAGCCCT TGCGCTGATA GAAACGGATG 400
GTCTCCACAT TGACCCCGGC CGCCTTGGCG AAAACGCCAA TGGTCAGGTT CTCCAAATTG TTTTCCATAT CGCTTGACTC CGTACATGAG TACGGAAGTA 500
AGGTTACGCT ATCCAATTTC AATTCGAAAG GACAAGCGCA TGTCTGAACC AAAAACCGGG CGCGGCGCGC CCTTCACTGG AGGGCTTGCC GCCATCCTCG 600
CCTCGGCTTG CTGCCTCGGG CCGTTGGTTC TGATCGCCTT GGGGTTCAGC GGCGCTTGGA TCGGCAACTT GGCGGTGTTG GAACCCTATC GCCCCATCTT 700
TATCGGCGTG GCGCTGGTGG CGTTGTTCTT CGCCTGGCGG CGCATCTACC GGCAGGCAGC GGCCTGCAAA CCGGGTGAGG TCTGCGCGAT TCCCCAAGTG 800
CGAGCTACTT ACAAGCTCAT TTTCTGGATC GTGGCCGCGC TGGTTCTGGT CGCGCTCGGA TTTCCCTACG TCATGCCATT TTTCTACTGA TCGGAGTTCA 900
CCATGAAGAA ACTGTTTGCC TCCCTCGCCC TCGCCGCCGT TGTTGCCCCC GTCTGGGCCG CCACCCAGAC CGTCACGCTG TCCGTACCGG GCATGACCTG 1000
CTCCGCCTGC CCGATCACTG TCAAGAAGGC GATTTCCAAG GTCGAAGGCG TCAGCAAAGT TGACGTGACT TTCGAGACAC GCCAAGCGGT CGTCACCTTC 1100
GACGATGCCA AGACCAGCGT GCAGAAGCTG ACCAAGGCAA CCGCAGACGC GGGCTATCCG TCCAGCGTCA AGCAGTGAGT CACTGAAAAC GGCACCGCAG 1200
CACAACGGAC GTCATTGTCT GGCGCCACAA ACGATAAAGG ATCTGTTGCA TGACCCATCT AAAAATCACC GGCATGACTT GCGACTCGTG CGCGGCGCAC 1300
GTCAAGGAAG CGCTGGAAAA AGTGCCAGGC GTGCAGTCGG CGCTGGTGTC CTATCCGAAG GGCACAGCGC AACTCGCCAT CGTGCCGGGC ACATCGCCGG 1400
ACGCGCTGAC TGCCGCCGTG GCCGGACTGG GCTACAAGGC AACGCTAGCC GATGCGCCAC TGGCGGACAA CCGCGTCGGA CTGCTCGACA AGGTGCGGGG 1500
ATGGATGGCC GCCGCCGAAA AGCACAGTGG CAACGAGCCC CCGGTGCAGG TAGCGGTCAT TGGCAGCGGT GGAGCCGCGA TGGCGGCGGC GCTGAAAGCC 1600
GTCGAGCAAG GCGCGCAGGT CACGCTGATC GAGCGCGGCA CCATCGGCGG CACCTGCGTC AATGTCGGCT GTGTGCCGTC CAAGATCATG ATCCGCGCCG 1700
CCCACATCGC CCATCTGCGC CGGGAAAGCC CGTTCGATGG CGGTATTGCG GCAACTGTGC CTACGATTGA CCGCAGTAAG CTGCTGGCCC AGCAGCAGGC 1800
CCGCGTCGAC GAACTGCGGC ACGCCAAGTA CGAAGGCATC CTGGGCGGTA ATCCGGCCAT CACCGTTGTG CACGGTGAGG CGCGCTTCAA GGACGACCAG 1900
AGCCTTACCG TCCGTTTGAA CGAGGGTGGC GAGCGCGTCG TGATGTTCGA CCGCTGCCTG GTCGCCACGG GTGCCAGCCC GGCGGTCCCG CCGATTCCGG 2000
GCTTGAAAGA GTCACCCTAC TGGACTTCCA CCGAGGCCCT GGCGAGCGAC ACCATTCCCG AACGCCTTGC CGTAATCGGC TCGTCGGTGG TGGCGCTGGA 2100
GCTGGCGCAA GCCTTTGCCC GGCTGGGCAG CAAGGTCACG GTCCTGGCGC GCAATACCTT GTTCTTCCGT GAAGACCCGG CCATCGGCGA GGCGGTGACA 2200
GCCGCTTTCC GTGCCGAGGG CATCGAGGTG CTGGAGCACA CGCAAGCCAG CCAGGTCGCC CATATGGACG GTGAATTCGT GCTGACCACC ACGCACGGTG 2300
AATTGCGCGC CGACAAACTG CTGGTTGCCA CCGGTCGGAC ACCGAACACG CGCAGCCTCG CGCTGGACGC AGCGGGGGTC ACTGTCAATG CGCAAGGTGC 2400
CATCGCCATC GACCAAGGCA TGCGCACGAG CAACCCGAAC ATCTACGCGG CCGGCGACTG CACCGACCAG CCGCAGTTCG TCTATGTGGC GGCAGCGGCC 2500
GGCACCCGTG CCGCGATCAA CATGACCGGC GGCGATGCGG CGCTCGACCT GACCGCAATG CCGGCCGTGG TGTTCACCGA TCCGCAAGTG GCGACCGTGG 2600
GCTACAGCGA GGCGGAAGCC CACCACGACG GGATCGAGAC CGACAGCCGC ACCTTGACCT TGGACAACGT GCCGCGTGCG CTCGCCAACT TCGACACACG 2700
CGGCTTCATC AAGTTGGTTA TCGAGGAAGG CAGCCATCGG CTGATCGGCG TACAGGCGGT CGCGCCGGAA GCGGGTGAAC TGATCCAGAC GGCGGCTCTG 2800
GCCATTCGCA ACCGCATGAC GGTGCAGGAA CTGGCCGACC AGTTGTTCCC CTACCTGACG ATGGTCGAGG GGTTGAAGCT CGCGGCGCAG ACCTTCAACA 2900
AGGATGTGAT GCAGCTTTCC TGCTGCGCCG GGTGAGAAAA AGGAGGTGTT CAATGAACGC CTACACGGTG TCCCGGCTGG CTCTTGATGC CGGGGTGAGC 3000
GTGCATATCG TGCGCGACTA CCTGCTGCGC GGATTGCTGC GCCCGGTGGC GTGCACACCA GGCGGCTACG GCTTGTTCGA TGACGCCGCC TTGCAACGGC 3100
TGTGCTTCGT GCGGGCGGCC TTCGAGGCGG GCATCGGCCT CGACGCGCTG GCGCGGCTGT GCCGGGCGCT GGATGCGGCG GACGGCGACG AAGCGGCCGC 3200
GCAGCTTGCC CTGCTGCGTC AGTTCGTCGA GCGTCGGCGC GAAGCGTTGG CCGATCTGGA AGTGCAGTTG GCCACCCTGC CGACCGAGCC GGCACAGCAC 3300
GCGGAGAGTC TGCCATGAAC AACCCCGAGC GCTTGCCGTC CGAGACGCAC AAACCGATCA CCGGCTACCT GTGGGGCGGA CTGGCTGTGC TGACTTGCCC 3400
CTGCCACCTG CCCATCCTCG CTGTCGTGCT GGCCGGCACA ACCGCCGGTG CTTTCCTCGG CGAGCATTGG GTCATCGCGG CGCTCGGTTT GACCGGCCTG 3500
TTCCTTCTGT CCCTGTCGCG GGCGTTGCGG GCATTCAGGG AAAGAGAATG AGCGCTTTCC GGCCGGATGG ATGGACGACG CCGGAACTGG CCCAAGCGGT 3600
CGAGCGCGGG CAGCTTGAAC TGCACTACCA GCCCGTCGTC GATCTGCGCA GTGGTGGGAT TGTCGGCGCG GAAGCCCTGT TGCGCTGGCG TCATCCGACG 3700
CTTGGACTAT TGCCACCGGG CCAGTTCCTG CCCGTGGTCG AATCGTCCGG CCTGATGCCT GAAATCGGCG CTTGGGTGCT GGGCGAAGCC TGCCGCCAGA 3800
TGCGTGACTG GCGAATGCTG GCATGGCGAC CGTTCCGGCT GGCCGTCAAT GTTTCGGCGA GCCAAGTGGG ACCGGACTTC GACGGGTGGG TAAAGGGCGT 3900
GCTGGCTGAT GCCGAGTTGC CCGCCGAGTA TCTCGAAATC GAGCTGACCG AATCGGTCGC GTTTGGTGAT CCGGCGATCT TCCCCGCCCT GGACGCCTTG 4000
CGGCAGATCG GTGTGCGCTT CGCCGCCGAT GACTTCGGGA CGGGGTATTC CTGTCTGCAA CATCTGAAGT GCTGCCCAAT CAGCACGCTC AAGATCGACC 4100
AATCGTTTGT CGCCGGGCTC GCCAACGACC GCCGCGACCA AACCATCGTG CACACCGTGA TTCAGCTTGC GCACGGGCTG GGCATGGATG TGGTGGCTGA 4200
AGGCGTGGAA ACATCGGCGA GTCTTGATCT ATTGCGACAA GCGGACTGCG ACACAGGACA AGGCTTCCTG TTCGCGAAGC CAATGCCGGC GGCGGCATTC 4300
GCCGTCTTCG TCAGTCAATG GAGGGGTGCC ACCATGAATG CAAGTGACTC GACCACCACC AGTTGCTGCG TGTGCTGCAA GGAAATCCCG CTCGATGCCG 4400
CCTTCACCCC GGAAGGCGCG GAATACGTCG AGCACTTCTG CGGGTTGGAG TGTTATCAAC GCTTCGAAGC GCGTGCCAAG ACAGGGAACG AAACCGATGC 4500
CGATCCGAAC GCCTGCGACT CGCTACCGTC AGATTGAGGC ATACCCTAAC CGGATGTCAG GGTAGACTGC CTCACAACGT CAGAATAGAG TCGGTTGTGT 4600
TATTTATTGA CACTAGCTGA AAAAGGTCAT AGATTTCTTC CTGACATTTT CGTCCAGGGA GGCATCTTGC AGGGTCAACG CATCGGCTAC GTCCGGGTCA 4700
GCAGCTTCGA CCAGAACCCG GAACGGCAAC TTGAACACGT CGAAGTCGGC AAGGTGTTCA CCGACAAGGC GTCGGGCAAG GACACCCAGC GGCCCGAGCT 4800
TGATTCGCTG CTGGCCTTCG TGCGCGAAGG CGACACCGTG GTGGTTCACA GCATGGATCG CTTGGCGCGC AACCTCGATG ACTTGCGCCG CCTCGTGCAA 4900
AAGCTCACCA AGCGCGGCGT GCGCATCGAG TTCGTCAAGG AGAGCCTGAC CTTCACCGGC GAGGATTCGC CGATGGCGAA CCTGATGCTG TCGGTCATGG 5000
GGGCGTTCGC TGAATTCGAG CGGGCCTTGA TCCGCGAGCG GCAGAGGGAA GGCATCGCGC TCGCCAAGCA ACGCGGAGCC TACCGGGGCC GCAAGAAAGC 5100
GCTGTCGCCC GAACAGGTAG CCGATCTGCG GCAGCGGGCC GCCGCCGGCG AACAAAAAGC GAAGCTGGCC CGCGAGTTTG GTGTCAGCCG GGAGACCCTG 5200
TATCAATACT TGAGAGCGGA TCAGTAAATA TGCCACGTCG TTCCATTCTG TCCGCCGCCG AGCGGGAAAG CCTGTTGGCG TTGCCGGATA CCAAGGACGA 5300
CTTGATCCGA TACTACACGT TCAGCGATAC CGACCTCTCC ATCATCCGGC AACGGCGCGG GCCTGCGAAC CGCTTGGGCT TTGCAGTTCA GCTCTGCTAC 5400
CTGCGCTTTC CCGGCATCCT CCTTGGCGTC GATGAGCCGG ATGAGCCGCC GTTCCGCCCT GCTGAAACTG TCGCCGACCA GCTCAAGGTC AGTGTCGAAA 5500
GCTGGGCGAG TACGGGCAGC GGGAGCAGAC CCGGCGCGAG CATCTGGTCG AGTGCAAACG GTGTTCGGCT TCCAGCCCTT CACCATGAGC CACTACCGGC 5600
AGGCCGTCCA CACGCTGACC GAGCTGGCCA TGCAAACCGA CAAGGGCATT GTGCTGGCCA GCGCCTTGAT CGAGCATCTG CGGCGGCAGT CGGTCATTCT 5700
GCCTGCCCTC AACGCCGTCG AGCGGGCGAG CGCCGAAGCG ATCACCCGCG CCAACCGGCG CATCTACTAC GCCTTGGCCG AACCACTGTC GGACGCGCAT 5800
CGCCGCCGCC TCGACGATCT GCTCAAGCGC CGGGACAACG GCAAGACGAC TTGGCTGGCC TGGCTGCGCC AGTCACCCGT CAAGCCCAAT TCGCGGCATA 5900
TGCTGGAGCA CATCGAACGA CTCAAGGCAT GGCAGGCGCT CGATCTGCCT ACCGGCATCG AGCGGCTGAT CCACCAAAAC CGGCTGCTCA AGATCGCCCG 6000
CGAGGGCGGC CAGATGACAC CCGCCGACCT GGCCAAGTTC GAGGCGCAGC GGCGCTACGC GACCCTGGTG GCGCTCGCCA TTGAAGGCAT GGCCACCGTC 6100
ACCGACGAAA TCATCGACCT GCACGACCGC ATCCTGGGCA AGCTGTTCAA CGCCGCCAAG AACAAGCATC AGCAGCAGTT CCAGGCGTCC GGCAAGGCGA 6200
TCAACGCCAA GGTGCGGCTG TTCGGGCGTA TCGGTCAGGC ACTGATCGAG GCCAAGCAAT CGGGCCGCGA TCCGTTTGCC GCCATCGAGG CCGTCATGTC 6300
CTGGGACGCC TTCGCCGAGA GCGTCACCGA AGCGCAGAAG CTCGCGCAGC CCGAGGACTT CGATTTCCTG CACCGCATCG GCGAGAGCTA CGCCACGCTG 6400
CGTCGCTACG CGCCGGAATT CCTCGCCGTG CTCAAGCTGC GGGCCGCGCC CGCTGCCAAG GATGTGCTGG AGGCCATCGA AGTGCTGCGC AACATGAACA 6500
GCGACAACGC CCGCAAGGTG CCCGCCGACG CGCCAACCGA TTTCATCAAG CCGCGCTGGC AGAAGCTGGT GATGACCGAC ACCGGCATCG ATCGGCGCTA 6600
CTACGAACTG TGCGCGCTGT CGGAGATGAA AAACGCCCTG CGCTCCGGCG ACATCTGGGT GCAGGGATCG CGCCAGTTCA AGGACTTCGA GGACTACCTG 6700
GTGCCACCCG CGAAATTCGC CAGCCTCAAG CAGGCCAGCG AATTGCCGCT GGCCGTGGCC ACCGATTGCG ACCAGTACCT GCATGACCGG CTGACGCTGC 6800
TGGAAACGCA GCTCGCCACC GTCAACCGCA TGGCGCTGGC CAACGAGCTG CCGGACGCCA TCATCACGGA GTCGGGCCTG AAGATCACGC CGCTCGATGC 6900
GGCGGTGCCC GACACCGCGC AGGCGCTGAT CGACCAGACA GCAATGATCC TGCCGCACGT CAAGATCACC GAACTGCTGC TGGAGGTAGA CGAATGGACA 7000
GGCTTCACCC GGCACTTCGC GCACCTGAAA TCGGGCGACC TGGCCAAGGA CAAGAACCTG CTGCTGACCA CGATCCTGGC CGACGCCATC AACCTGGGTC 7100
TGACCAAGAT GGCGGAGTCC TGCCCCGGAA CGACCTACGC CAAGCTCGCC TGGCTCCAAG CCTGGCATAC CCGCGACGAA ACCTATTCGT CGGCGCTGGC 7200
CGAACTGGTC AATGCGCAGT TCCGGCATCC CTTCGCCGAG CACTGGGGCG ACGGCACCAC GTCATCGTCG GACGGCCAGA ATTTCCGAAC CGGCAGCAAG 7300
GCCGAGAGCA CTGGCCACAT CAACCCGAAA TATGGCAGCA GTCCTGGGCG GACTTTCTAC ACCCACATCT CCGACCAGTA CGCGCCATTC CACACCAAGG 7400
TGGTTCATGT CGGCGTGCGC GACTCGACTA TGTGCTCGAC GGCTGCTGTA CACGAGTCCG ACTGCGCATC GAGAGCACTA CACCGATACG GCAGGATTCA 7500
CCGATCATGT ATTTGGCTGA TGCACTGCTG GGCGTCCGCT TGCGCCGCGC ATCCGCGACC TGGGCGACAC CAAGCTGTTC ATCCCCAAGG GCGACACCGT 7600
CTACGACGCG CTCAAGCCGA TGATTAGCAG CGACAGACTG AACATCAAGG CTATTCGCGC CCATTGGGAT GAAATTCTAC GGCTGGCCAC GTCGATCAAG 7700
CAGGGCACGG TGACGGCTTC GCTGATGCTG CGCAAGCTCG GCAGCTATCC GCGCCAGAAC GGCCTGGCCG TGGCCCTGCG CGAGCTGGGG CGTATCGAGC 7800
GCACGCTGTT CATCCTGGAT TGGTTGCAAA GCGTGGAGCT GCGCCGTCGC GTGCACGCTG GGCTGAACAA GGGCGAAGCC CGCAATGCGC TGGCCCGCGC 7900
CGTGTTCTTC AACCGTCTGG GTGAAATCCG CGACCGCAGC TTTGAGCAGC AGCGCTACCG TGCCAGCGGC CTCAACCTGG TGACGGCGGC CGTCGTGCTA 8000
TGGAACACGG TCTATCTGGA ACGGGCTGCG CACGCGCTGC GGGGCAACGG CCACGCCGTC GATGACGCGC TGTTGCAGTA CCTGTCGCCG CTCGGCTGGG 8100
AGCACATCAA CCTGACCGGC GATTACCTCT GGCGCAGCAG CGCCAAGATC GGCGCGGGCA AGTTCAGGCC GCTGCGGCCG TTGCAACCTG CTTAGCGTGC 8200
TTTATTTTCC GTTTTCTGAG ACGACCCCTT TACGAGTTTT GCCTATCCGG GGTCGATTTG CGGCCACACC TTTGCGGTGA ATGAAAAAGG TATCGTACAC 8300
GGTGAATAAC ATCCGTGCAG TGCATCGTCG CAGGGATACC GCGTCAGATC TGGCTCGCGC CTCGCTGAAT GCGAACACCC TCGACGAGAC CATCACTATA 8400
CTTACCGGCC AGCCGCGATC CGGGGCGTTT CACCATACGC TCGGGCAGCT GGGCGATACG CGGCTGTTCA GCGTTGAGGC CACCGGTCAA GGCTGTTCCG 8500
TTCTCCCCCT GACGACGGTA ACCGGGCATG CCAACCATCT GGTGCATGCG GCACTGGCTG GCGTTGAGCA GATCGTTACC GACAGTTCAG CGTCGCGGCA 8600
GTTACGCCTC GTGCAGTGGC GGGAAACGCA ACCTCCGTTT GACGCCGCAG CGGCAAAAGC CATTCTCTCA GATACCCATG ACGCCGAACT GCCAATTTAC 8700
CGGCTGGCTG CCGACGATCC GGATGAGGAG AACACCCTCG CCACGGCAGT TTTCACCCTC GATGCCAACC ACGTCAGGTG GCAAATTTTC GACATTAACC 8800
GCGACGATGC TAAATTTCAG GGAGAAGTGC GTGGGTGAGA TAGCCGGTCG TCAGTCATAA AGGGCAGGGT AGTACCGTTG GCCCGGCGTT CGATAGTACC 8900
GTCCGGATAC TGCCCGCTAT CGGCGTCTTG AGCGATGTCC TGAAGCGCGG TGTCCGGAAT ATTCAGGTTT GTGTCTCTAT ACGATTCGGG CCATCGCCGT 9000
TTCTCATTTT GGTGTTGTTG TTGACAGGGA AACCAAAGAC TAGCTGTTAG AAAAACTCAT CGAGCATCAA ATGAAACTGC AATTTATTCA TATCAGGGTT 9100
ATCAATGCCA TATTTATGAA AAAGCCGTTT TTGTAATGAA GGTGAAAATT CACCGAGGCA GTTCCATAGG ATGGCAAGAT CCTGGTATCG GTCTGCGATT 9200
CCGACTCGTC CAACATCAAT ACAACCTATT AATTTCCCCT CGTCAAAAAT AAGGTTATCA AGTGAGAAAT CACCATGAGT GACGACTGAA TCCGGTGAGA 9300
ATGGCAAAAG CTTATGCATT TCTTTCCAGA CTTGTTCAAC AGGACAGCCA TTACGCTCGT CATCAAAATC ACTCGCATCA ACCAAACCGT TATTCATTCG 9400
TGATTGCGCC TGAGCGAGAC GAAATACGCG ATCGCTGTTA AAAGGACAAT TACAAAGAGG AATCGAATGC AACCGGCGCA GGAACGCTGC CAGAGCATCA 9500
ACAATATTTT CACGTGAATC AGGGTATTCT TCTAATGCCT GGAATGCTGT TTTCCCGGGG ATCGCAGTGG TGAGTAACCA CGCATCATCA GGAGTACGGA 9600
TAAAATGCTT GATGGTCGGA AGAGGCATAA ATGCCGTCAG CCAGTTTAGT CTGACCATCT CATCTGTAAC ATCATTGGCA ACGCTACCTT TGCCATGTTT 9700
CAGAAACAAC TCTGGCGCAT CGGGCTTCCC ATACAATCGA TAGATTGTCG CACCTGATTG CCCGACATTA TCGCGAGCCC ATCTATACCC ATATAGATCA 9800
GCATCCAGAT TTGAATTTAA TCGCGGCCTC GAGCAAGATG TTTCCCGTTG AATATGGCTC ATAACGCTCC TTGTATTACT GTTTATGTAA GCAGACAGTT 9900
TTATTGTTCA TGATGATATA TTTTTATCGT GTGTAATGTA ACATCAGATA TTTTGAGACA CGACGTGGTT CCCGCCATGT GAAAATCAGG CCAGACCAAC 10000
ATCATTTTCA GGTGGCCGAT GCACATCAAT GTCATATTGA TAGTCTAAAC CACTCACTTT GCGTGGTCAA GGTCAGATTT AGTTACGGGA CGAAGATGCA 10100
GTCGGGTGGG GTGAGGTTGC AAAAGCATTT CCTTAATAAA GAGTTCATCA GCATAGTTGG ATAACTGTTT TTATAGTAAG TAAAATCACG GGTTTTACGT 10200
CGCAAGCGCC GTAAAACCCG CTGATGGGGT TAGCTGAATC TTTCCGGGCG GAAGGTCGCC AGCATCGGGT GTTTCACGCC AGTGAGCATC TCCTGCGTAA 10300
GCATCTCGCC AAGGATTAAC GCCAGCGTGG CCCCGGAATG CGTGAAGACC ACAAAACAGC CGGGAACTTT CTGCAACTCG CCGACCACCG GTTCGCCATC 10400
GCCAGGAATA GGCTTCAGGC CAATTTTGCA GCTGTTGGCT GTTAACGGTG CGCCGTCACC TATCAGGTTA CCAGCTTCCG CCATCAACTG GTTAATGACA 10500
GACGTATCAA TCCTGTACTG ACCGTCGCCC TGCGCGACGA TGTGCTCTTC ATACCAGTCA TGATCGACGG CGATAGTATT GCCGGGATTA GGTCGGACAG 10600
CAGCGCGCGG CGTATTCAAT ACCACCTGCG GCGCGAGATC GCAGGGCTGG CTGGTCACCA GCATTGAGAC CGGAGAACCA TTGGGGATCT GCACGCCGAG 10700
TGGGGCAACC ACCTCCGGCG TCCACGGGCC ACACGCCACC AGTACCTGAT CCGCCAGCAA TGCACCGTGT TTTTCGCTGC GAATACCGCA GGCCCGGCCG 10800
TCTTTGGTTA TCACGCTGGC TTTACCGGTG TTCTCAATGA TTTCCCCGCC CAGCGCGCGA AACGCCTGAA CCAGATGATC AACCAGATGA GGCAAGCTGA 10900
CCCAGCCTTC ACCTGGATTG GCAATCGCGA CGTGACCCAG GCTGGCCGGA TTCACTTGCG CATCGACATC ACCGACCGTG GCGCGATTGA TCAGCTTTGA 11000
GTCATAGCCC TGCGCCTTTT CATAGTGATG ACGAGCTTTG GTGCCAGCGT CATCGTCTGC GGCCCAGTAA ATAGCGCCGT CGAAACGCAG CCAGTCGAGT 11100
TGCGGATGGC GGGCAAACAG CGTGCGATAG CGGTCAATGC CCGCTATCCG CAGCGCATGA TAAGGCTGCG AACGTTCGCC AGCGGAGTTG AGCCAGGAGA 11200
GTGAGCGGCC AGTGGCACCG GAACAGAGCG CCGCTTCCGT CACCAGCGTG ACTTGCGCGC CCGCCTGCGC CAGCTGCCAG GCGGTTGAGA CGCCGAGGAT 11300
GCCGCCACCC AGCACAATGA CGGTTTTTGC TACGGTTTTG TCAGACATCA TTAATTCCTT CTGTGAAAAA ATGTTCGCCC TTCAGGCAAT GCGTACACGA 11400
TGCTGCCGAT GGAGTTGCTC GATAAGCTGT AAAACGTTGA GCGCTTCACG TGCATCAACC GGTGGTGACC CGCCGTGTAA CAGCGCGTGG GCTAACAACT 11500
GATAGAAAGT GGGGTAGTGG CCACGTTCAA TGGACACCAT TGTGGTTGCC CCGACCGTGC TGGCCCTGGC GAAATTTTCT ACCGGCGCGA CGCCGTACTG 11600
GCTATCCTCT TGATGTGTCA TCACAGCACC TCCGACAGAA AACGCTGCAG GCGCGCAGAT TGCGGGCGAT CAAAGATTTG TTCCGGTGTG CCAGCTTCCA 11700
CGATCTGGGG TCGGTTCCGG CTGAGGGCGA AATGACACCC TAAGCGTTAG CTCTGTGTCG TTGCACGATG TCAGCGACGG TATTCTTGCT GATACCGAGC 11800
TCGCGTGCGA TCCAGCGATA GCTGCGTCCC TCGGCCCTCA TCGCAACCAC CTTAGGCAAA AGTCGGTCTG ATTTTGGTCG CACTCCGGCC TGACGACCAA 11900
GCCTCTTACC ACGTGCCTTC GCAACAGCAA GGCCTGACTT GACCCGCTCG CTGATGAGAT CCCGCTCAAA CTCCGCAATG CCGGAAAGAA ACGTCGCCAG 12000
CATTCGTCCA TACGGCGACG AAAGATCGAA CGCCATTCCA TTCATGGCTA TCACGGAAAC CTTCCAGTTC TCCAGTTCAC GTAGCGTATT GAGCAGATCG 12100
AGCGTCGAGC GCCCCCACCG GGAAAGCTCA GTGACCAGGA TTGCATCAAT TTGTCTGGAC TGGGCAAGCG CCAGGACTTT CTTTCGCTCG GCCCGGTCGA 12200
GTTTAGTTCC TGAACCTGTT TCCTTAAATA TTCCCACCAC GTCGTAGCCG GCACGGCCGG CGAAGGCTCG CAGATCAAAT TCCTGGCGTT CACAAGACTG 12300
ATCCGCTGTT GAAACCCGGC AGTAAATGGC GGCACGATGT CCCAATTGAA CCCTCCTGGA TTTTTGTATC GGAACGCCCT GATTTATATG GGCTGGCTGT 12400
TGTCCAAAAC AGACTATACT TCAAAAGGGA CGAATTTGTA TGTCACGACG CCATATTTTC ACCGAACGGC AGCGAGCAGC GCTGTTCGAT CTGCCCACGG 12500
ACGAACTGTC GCTACTGAAG TTCTACACGC TGGGCGATGA TGACCTGGAA AACATTAGGC AGCGCCGCAG ACCGGAAAAC AGGATTGGCT TTGCCCTGCA 12600
ACTTTGTGCC TTACGATATC CGGGCCGTGC ACTGGCTCCT GGTGAGATGA TCCCGCGTGA AATCCTTTCC TTCGTCGGTG CTCAGCTTGG AGTTCCGGCT 12700
GATGCGCTTC TCACTTATGC CACACGGCGC CAAACCCGTC AGCAGCACAT GGACACGCTG CGCGAAATTT ACGGCTACAA GACCTTCACG GGCCGTGGTG 12800
CCCGTGATCT GCGGAAGTGG ACTTTCGGTC AGGCCGAAGA TGCCAGATCA AACGAGGATC TTGCTCATCG TTTTATTGTG CGGTGTCGGG AAACTTCCAC 12900
CATTCTGCCC GCAGTATCGA CAATCGAGCG CTTGTGCGCG GATGCTCTGG TCGCCGCTGA GCGGCGGATT GAAACGCGGA TTGTGGAAAA TTTAACAGCG 13000
GATGTTCGCG ATCACCTGGA CAAACTTCTG AGTGAAATGC TCGCCGGCAA TATCAGTCGT TTCATCTGGC TTCGCAACTT CGAGGTTGGT AACAACTCGG 13100
CTGCTGCTAA CCGTTTGCTC GACAGGCTCG AATTTCTGCG TACCCTGAAT ATCAATCATA GTGCTTTGGC CAGCATACCT GCCCATCGCA TTGCCCGGCT 13200
GCGTCGGCAG GGTGAACGCT ACTTCACCGA CGGTTTGCGT GACATCACTT CGGACCGCCG CTGGGCGATC CTTGCCGTCT GTGTTGTGGA GTGGGAAGCG 13300
GCGATTGCTG ATGCCATAGT CGAAACCCAT GACAGGATCG TAGGAAAAAC CTGGCGGGAA GCGAAGCGCC AGCATGACGA AACAATTTCC GGCTCTAAAG 13400
CCACACTCGC GGATACGATC CGTACCTTCA CCGCGCTGGG AGCTTCGTTG CTTGAGGCCC GCAGTGACGG AACCCCGCTG GAGATGGCTG TCGCCAGTTC 13500
GGTTGCATGG GACCGGCTCG CTCAACTGGT AGCGACAGGG ACTCAACTCA GCAACACGCT AGCCGATGAG CCTCTTGCAT ATGTCGGGCA GGGATACCAT 13600
CGCTTTCGTC GTTATGCGCC CCGCATGTTG CGCTGTCTGA AGCTCGAAGC CGCGCCGGTC GCCGGACCAT TGGTAGCAGC AGCTTTGTCG ATCGGAGAGA 13700
TGAAAGGTGT TGCATCGCCA GAAAGGCGTT TCCTGCGGCC CAGCTCCAAA TGGAACCGTC ATTTACGAGC TCAGGAAAAA GGAGATACCC GTCTTTGGGA 13800
AGTGGCGGTA CTCTTTCACC TCCGGGATGC TTTTCGTTCC GGAGATGTCT GGCTCGCTCA TTCGCGCCGC TATGGTGACC TCAAGCAGGT ACTGGTGCCG 13900
ATGATCGCGG CGCAGGAAAA TGCAAAACTG GCCGTGCCTT CCAACCCACA GGATTGGCTG GCAGACAGAA AGGCGCGACT CACGATCGCT CTTAAGCGGC 14000
TGGCCCGGGC TGCCCGTAAC GGCACTATTC CGCACGGTAG CATAGAAGAT GGAACGTTGC GGATCGACAG GTTGACAGCA GACGTGCCGG ATGGTGCCGA 14100
GGCACTCATA CTGGATCTGT ATCGCCGAAT GCCGTCCGTT CGGATTACCG ACATGCTGCT TGAAGTTGAT GCAGCCCTTG GTTTCACAGA TGCGTTTACC 14200
CATCTGAGAA CCGGGGCTCC ATGTCGCGAC CGGATCGGTC TGCTCAACGT CCTGCTCGCT GAAGGGCTCA ATCTGGGCCT GCGTAAGATG GCGGAAGCTA 14300
CAAACACGCA TGATTACTGG CAGCTCTCAC GCCTTGCCCG CTGGCATGTT GAAAGCGAAG CCATGAACCA GGCATTGGCA ATTGTGGTGG CCGCGCAGGG 14400
TAAACTGCCG ATGTCACGCG TCTGGGGGAT GGGCACGTCA GCATCGAGCG ATGGTCAGTT TTTCCCGACA GCGCGGCATG GCGAAGCCAT GAACATGGTC 14500
AATGCCAAAT ATGGTTCTGT TCCCGGCCTC AAAGCGTATA CTCACGTAAG CGACCAGTTC GCGCCATTCG CTTGTCAGTC GATCCCGGCG ACCGTGAGCG 14600
AGGCACCGTA TATTCTCGAT GGACTACTGA TGAACGAGGT CGGTCGCCAT GTTCGCGAAC AGTATGCCGA TACAGCAGGA TTCACCGACC ATTTGTTCGG 14700
AGCCAGTAGC CTGCTCGGCT ACAATCTCGT TCTGCGAATC AGGGATCTGC CATCGAAGCG GTTGTACGTA TTTAATCCCG ATACGACCCC CAGGGAGTTA 14800
CGCAAGTTGG TAGGTGGAAA AGCCCGGGAG GATCTTATCG TTGCGAACTG GCCTGATATT TTCCGTTGTG CCGCGACGAT GACCGCTGGC AAAATCAGGC 14900
CCAGCCAACT CCTGCGCAAG CTCGCTTCTT ACCCACGACA AAACAACCTT GCAGTTGCGC TTCGTGAAGT TGGTCGTATT GAACGGACCC TTTTCATTAT 15000
TGAGTGGATC CTGGATACGG ACATGCAGCG GCGTGCTCAG ATCGGTCTTA ACAAGGGAGA GGCCCACCAT GCGCTCAAAA ATGCGCTCCG TATCGGGAGG 15100
CAGGGGGAAA TTCGCGATCG CACGACAGAG GGGCAGCACT ACCGAATCGC TGGGCTCAAT TTATTGACTG CGGTGATCAT TTACTGGAAT ACCGTCCATC 15200
TTGGTCATGC CGTCACGGAG CGGCGGAACG AAGGGTTGGA TGTTCCCCCT GAATTTCTTC CCCACATATC CCCATTGGCT GGGCGCACAT TCTACTGACT 15300
GGCGAATATC TTTGGCCCAA GGAACCGAAA GCTTAGGGTG TCATTTCGCC CTCAGCCGGA ACCGACCCCC CCTTGCTGAA ACTGGTCGCC GACCAGCTCA 15400
AGGTCAGTGT CGAAAGCTGG GCGAGTACGG GCAGCGGGAG CAGACCCGGC GCGAGCATCT GGTCGAGTGC AAACGGTGTT CGGCTTCCAG CCCTTCACCA 15500
TGAGCCACTA CCGGCAGGCC GTCCACACGC TGACCGAGCT GGCCATGCAA ACCGACAAGG GCATTGTGCT GGCCAGCGCC TTGATCGAGC ATCTGCGGCG 15600
GCAGTCGGTC ATTCTGCCTG CCCTCAACGC CGTCGAGCGG GCGAGCGCCG AAGCGATCAC CCGCGCCAAC CGGCGCATCT ACTACGCCTT GGCCGAACCA 15700
CTGTCGGACG CGCATCGCCG CCGCCTCGAC GATCTGCTCA AGCGCCGGGA CAACGGCAAG ACGACTTGGC TGGCCTGGCT GCGCCAGTCA CCCGTCAAGC 15800
CCAATTCGCG GCATATGCTG GAGCACATCG AACGACTCAA GGCATGGCAG GCGCTCGATC TGCCTACCGG CATCGAGCGG CTGATCCACC AAAACCGGCT 15900
GCTCAAGATC GCCCGCGAGG GCGGCCAGAT GACACCCGCC GACCTGGCCA AGTTCGAGGC GCAGCGGCGC TACGCGACCC TGGTGGCGCT CGCCATTGAA 16000
GGCATGGCCA CCGTCACCGA CGAAATCATC GACCTGCACG ACCGCATCCT GGGCAAGCTG TTCAACGCCG CCAAGAACAA GCATCAGCAG CAGTTCCAGG 16100
CGTCCGGCAA GGCGATCAAC GCCAAGGTGC GGCTGTTCGG GCGTATCGGT CAGGCACTGA TCGAGGCCAA GCAATCGGGC CGCGATCCGT TTGCCGCCAT 16200
CGAGGCCGTC ATGTCCTGGG ACGCCTTCGC CGAGAGCGTC ACCGAAGCGC AGAAGCTCGC GCAGCCCGAG GACTTCGATT TCCTGCACCG CATCGGCGAG 16300
AGCTACGCCA CGCTGCGTCG CTACGCGCCG GAATTCCTCG CCGTGCTCAA GCTGCGGGCC GCGCCCGCTG CCAAGGATGT GCTGGAGGCC ATCGAAGTGC 16400
TGCGCAACAT GAACAGCGAC AACGCCCGCA AGGTGCCCGC CGACGCGCCA ACCGATTTCA TCAAGCCGCG CTGGCAGAAG CTGGTGATGA TCGACACCGG 16500
CATCGATCGG CGCTACTACG AACTGTGCGC GCTGTCGGAA ATGAAAAACG CCCTGCGCTC CGGCGACATC TGGGTGCAGG GATCGCGCCA GTTCAAGGAC 16600
TTCGAGGACT ACCTGGTGCC ACCCGCGAAA TTCGCCAGCC TCAAGCAGGC CAGCGAATTG CCGCTGGCCG TGGCCACCGA TTGCGACCAG TACCTGCATG 16700
ACCGGCTGAC GCTGCTGGAA ACGCAGCTCG CCACCGTCAA CCGCATGGCG CTGGCCAACG AGCTGCCGGA CGCCATCATC ACGGAGTCGG GCCTGAAGAT 16800
CACGCCGCTC GATGCGGCGG TGCCCGACAC CGCGCAGGCG CTGATCGACC AGACAGCAAT GATCCTGCCG CACGTCAAGA TCACCGAACT GCTGCTGGAG 16900
GTAGACGAAT GGACAGGCTT CACCCGGCAC TTCGCGCACC TGAAATCGGG CGACCTGGCC AAGGACAAGA ACCTGCTGCT GACCACGATC CTGGCCGACG 17000
CCATCAACCT GGGTCTGACC AAGATGGCGG AGTCCTGCCC CGGAACGACC TACGCCAAGC TCGCCTGGCT CCAAGCCTGG CATACCCGCG ACGAAACCTA 17100
TTCGTCGGCG CTGGCCGAAC TGGTCAATGC GCAGTTCCGG CATCCCTTCG CCGAGCACTG GGGCGACGGC ACCACGTCAT CGTCGGACGG CCAGAATTTC 17200
CGAACCGGCA GCAAGGCCGA GAGCACTGGC CACATCAACC CGAAATATGG CAGCAGTCCT GGGCGGACTT TCTACACCCA CATCTCCGAC CAGTACGCGC 17300
CATTCCACAC CAAGGTGGTC AATGTCGGCG TGCGCGACTC GACCTATGTG CTCGACGGCT TGCTGTACCA CGAGTCCGAC CTGCGCATCG AGGAGCACTA 17400
CACCGATACG GCAGGATTCA CCGATCATGT ATTTGGCCTG ATGCACCTGC TGGGCTTCCG CTTTGCGCCG CGCATCCGCG ACCTGGGCGA CACCAAGCTG 17500
TTCATCCCCA AGGGCGACAC CGTCTACGAC GCGCTCAAGC CGATGATTAG CAGCGACAGA CTGAACATCA AGGCTATTCG CGCCCATTGG GATGAAATTC 17600
TACGGCTGGC CACGTCGATC AAGCAGGGCA CGGTGACGGC TTCGCTGATG CTGCGCAAGC TCGGCAGCTA TCCGCGCCAG AACGGCCTGG CCGTGGCCCT 17700
GCGCGAGCTG GGGCGTATCG AGCGCACGCT GTTCATCCTG GATTGGTTGC AAAGCGTGGA GCTGCGCCGT CGCGTGCACG CTGGGCTGAA CAAGGGCGAA 17800
GCCCGCAATG CGCTGGCCCG CGCCGTGTTC TTCAACCGTC TGGGTGAAAT CCGCGACCGC AGCTTTGAGC AGCAGCGCTA CCGTGCCAGC GGCCTCAACC 17900
TGGTGACGGC GGCCGTCGTG CTATGGAACA CGGTCTATCT GGAACGGGCT GCGCACGCGC TGCGGGGCAA CGGCCACGCC GTCGATGACG CGCTGTTGCA 18000
GTACCTGTCG CCGCTCGGCT GGGAGCACAT CAACCTGACC GGCGATTACC TCTGGCGCAG CAGCGCCAAG ATCGGCGCGG GCAAGTTCAG GCCGCTGCGG 18100
CCGTTGCAAC CTGCTTAGCG TGCTTTATTT TCCGTTTTCT GAGACGACCC CCAGTTGTTC GGGCAGCACA GATGCCGCAA TCGTTTTGGC TGTTTCTTCT 18200
TCGAGTTGTC CCAGCGCCGC TGTGACCAGG ACACGGATGT CACGGTCGCG CAGGATCAGG ATCTTGCTTT GATACAACCA GCGCCGAGCG AATACGAGCA 18300
ATTGATCCCG ATCGGCGCAG CGGGCCACCT CATCGCGCAG GGTGCGTACC AGGGCGCGAC GTTGGTGCTC CGTCATCCAA TGAAACCCCA GGACATCACA 18400
GGCCAGTTGT TGATGGTCAA AGAGGGTTAC CATCCAGAGG TTCCAAGAGG AGGCGCGATG TCAGCAGATC GAAATCCGAT CCATGGATTA GAGAGAAGCA 18500
AGTGGTACGA GAGGACGCTA GTCAGGGAGC TTCTCGTTGC CGTACCCATG CCGTCGTGTT GTGGAACACG GTCTATCTGG AACGGGCTGC GCACGCGCTG 18600
CGTGGCAACG GCCATGCCGT TGATGACGCG CTGTTGCAGT ACCTGTCGCC GCTCGGTTGG GAGCACATCA ACCTCACCGG CGATTACCTC TGGCGCAGCA 18700
GCGCCAAGAT CGGCGCGGGC AAGTTCAGGC CGCTACGACC GCTGCAACCG GCTTAGCGTG CTTTATTTTC CGTTTTCTGA GGCGACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res_site_I 4527-4557 31 CGTCAGATTG AGGCATACCC TAACCGGATG T
res_site_II 4578-4612 35 CGTCAGAATA GAGTCGGTTG TGTTATTTAT TGACA
res_site_III 4615-4646 32 AGCTGAAAAA GGTCATAGAT TTCTTCCTGA CA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
merR Tn4378.1 34-468 Passenger Gene Heavy Metal Resistance -
merT Tn4378.1 540-890 Passenger Gene Heavy Metal Resistance +
merP Tn4378.1 903-1178 Passenger Gene Heavy Metal Resistance +
merA Tn4378.1 1250-2935 Passenger Gene Heavy Metal Resistance +
merD Tn4378.1 2953-3318 Passenger Gene Heavy Metal Resistance +
merE Tn4378.1 3315-3551 Passenger Gene Heavy Metal Resistance +
urfM Tn4378.1 3548-4537 Passenger Gene Other +
tnpR Tn4378.1 4667-5227 Accessory Gene Resolvase +
tnpA non-functional Tn4378.1 5230-8195 Transposase   +
tnpA 5'-end Tn4378.1 5230-5439 Transposase   +
APH(3')-Ia (ARO:3002641) Tn4378.1 9047-9862 Passenger Gene Antibiotic Resistance -
socD Tn4378.1 10230-11348 Passenger Gene Other -
tnpR Tn5403a 11740-12345 Accessory Gene Resolvase -
tnpA Tn5403a 12440-15298 Transposase   +
tnpA 3'-end Tn4378.1 15370-18119 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn4378.1 435 34-468 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   activator-repressor of mer operon
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLLEPDKPY GSIRRYGEAD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVREKM
ADLARMEAVL SELVCACHAR RGNVSCPLIA SLQGGASLAG SAMP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn4378.1 351 540-890 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   cytosolic mercuric ion transport protein
Target:   Mercury
Protein Sequence:  
MSEPKTGRGA PFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLAVLEPY RPIFIGVALV ALFFAWRRIY RQAAACKPGE VCAIPQVRAT YKLIFWIVAA
LVLVALGFPY VMPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn4378.1 276 903-1178 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Protein Sequence:  
MKKLFASLAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVEGV SKVDVTFETR QAVVTFDDAK TSVQKLTKAT ADAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn4378.1 1686 1250-2935 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercuric ion reductase
Target:   Mercury
Protein Sequence:  
MTHLKITGMT CDSCAAHVKE ALEKVPGVQS ALVSYPKGTA QLAIVPGTSP DALTAAVAGL GYKATLADAP LADNRVGLLD KVRGWMAAAE KHSGNEPPVQ
VAVIGSGGAA MAAALKAVEQ GAQVTLIERG TIGGTCVNVG CVPSKIMIRA AHIAHLRRES PFDGGIAATV PTIDRSKLLA QQQARVDELR HAKYEGILGG
NPAITVVHGE ARFKDDQSLT VRLNEGGERV VMFDRCLVAT GASPAVPPIP GLKESPYWTS TEALASDTIP ERLAVIGSSV VALELAQAFA RLGSKVTVLA
RNTLFFREDP AIGEAVTAAF RAEGIEVLEH TQASQVAHMD GEFVLTTTHG ELRADKLLVA TGRTPNTRSL ALDAAGVTVN AQGAIAIDQG MRTSNPNIYA
AGDCTDQPQF VYVAAAAGTR AAINMTGGDA ALDLTAMPAV VFTDPQVATV GYSEAEAHHD GIETDSRTLT LDNVPRALAN FDTRGFIKLV IEEGSHRLIG
VQAVAPEAGE LIQTAALAIR NRMTVQELAD QLFPYLTMVE GLKLAAQTFN KDVMQLSCCA G

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn4378.1 366 2953-3318 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   secondary regulatory protein
Target:   Mercury
Protein Sequence:  
MNAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLALLRQ FVERRREALA
DLEVQLATLP TEPAQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn4378.1 237 3315-3551 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Protein Sequence:  
MNNPERLPSE THKPITGYLW GGLAVLTCPC HLPILAVVLA GTTAGAFLGE HWVIAALGLT GLFLLSLSRA LRAFRERE

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM UrfM Tn4378.1 990 3548-4537 +
Class:   Passenger Gene
Sub Class:   Other
Function:   possible diguanylate phosphodiesterase
Sequence Family:  EAL (Pfam:PF00563)||DUF3330 (Pfam:PF11809)
Comment:   similar to urfM from E.coli
Protein Sequence:  
MSAFRPDGWT TPELAQAVER GQLELHYQPV VDLRSGGIVG AEALLRWRHP TLGLLPPGQF LPVVESSGLM PEIGAWVLGE ACRQMRDWRM LAWRPFRLAV
NVSASQVGPD FDGWVKGVLA DAELPAEYLE IELTESVAFG DPAIFPALDA LRQIGVRFAA DDFGTGYSCL QHLKCCPIST LKIDQSFVAG LANDRRDQTI
VHTVIQLAHG LGMDVVAEGV ETSASLDLLR QADCDTGQGF LFAKPMPAAA FAVFVSQWRG ATMNASDSTT TSCCVCCKEI PLDAAFTPEG AEYVEHFCGL
ECYQRFEARA KTGNETDADP NACDSLPSD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn4378.1 561 4667-5227 +
Class:   Accessory Gene
Sub Class:   Resolvase
Function:   resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MQGQRIGYVR VSSFDQNPER QLEHVEVGKV FTDKASGKDT QRPELDSLLA FVREGDTVVV HSMDRLARNL DDLRRLVQKL TKRGVRIEFV KESLTFTGED
SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKALSPEQ VADLRQRAAA GEQKAKLARE FGVSRETLYQ YLRADQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA non-functional TnpA non-functional Tn4378.1 2966 5230-8195 +
Class:   Transposase
Comment:   non-functional transposase due to internal stop codon || appears to be a fusion of the first 70 codons of tnpA (Tn4378) and another unknown tnpA
Protein Sequence:  
MPRRSILSAA ERESLLALPD TKDDLIRYYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGILLGVDEP DEPPFRPAET VADQLKVSVE SWASTGSGSR
PGASIWSSAN GVRLPALHHE PLPAGRPHAD RAGHANRQGH CAGQRLDRAS AAAVGHSACP QRRRAGERRS DHPRQPAHLL RLGRTTVGRA SPPPRRSAQA
PGQRQDDLAG LAAPVTRQAQ FAAYAGAHRT TQGMAGARSA YRHRAADPPK PAAQDRPRGR PDDTRRPGQV RGAAALRDPG GARH*RHGHR HRRNHRPARP
HPGQAVQRRQ EQASAAVPGV RQGDQRQGAA VRAYRSGTDR GQAIGPRSVC RHRGRHVLGR LRRERHRSAE ARAARGLRFP APHRRELRHA ASLRAGIPRR
AQAAGRARCQ GCAGGHRSAA QHEQRQRPQG ARRRANRFHQ AALAEAGDDR HRHRSALLRT VRAVGDEKRP ALRRHLGAGI APVQGLRGLP GATREIRQPQ
AGQRIAAGRG HRLRPVPA*P ADAAGNAARH RQPHGAGQRA AGRHHHGVGP EDHAARCGGA RHRAGADRPD SNDPAARQDH RTAAGGRRMD RLHPALRAPE
IGRPGQGQEP AADHDPGRRH QPGSDQDGGV LPRNDLRQAR LAPSLAYPRR NLFVGAGRTG QCAVPASLRR ALGRRHHVIV GRPEFPNRQQ GREHWPHQPE
IWQQSWADFL HPHLRPVRAI PHQGGSCRRA RLDYVLDGCC TRVRLRIEST TPIRQDSPIM YLADALLGVR LRRASATWAT PSCSSPRATP STTRSSR*LA
ATD*TSRLFA PIGMKFYGWP RRSSRAR*RL R*CCASSAAI RARTAWPWPC ASWGVSSARC SSWIGCKAWS CAVACTLG*T RAKPAMRWPA PCSSTVWVKS
ATAALSSSAT VPAASTW*RR PSCYGTRSIW NGLRTRCGAT ATPSMTRCCS TCRRSAGSTS T*PAITSGAA APRSARASSG RCGRCNLL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA 5'-end N Tn4378.1 210 5230-5439 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   tnpA ORF interrupted
Protein Sequence:  
MPRRSILSAA ERESLLALPD TKDDLIRYYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGILLGVDEP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
APH(3')-Ia (ARO:3002641) APH(3')-Ia Tn4378.1 816 9047-9862 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  APH(3') (ARO:3000126)
Comment:   strict match to reference sequence for ARO:3002641 (bitscore: 550)||Synonyms: aphA-1, apha1-1AB, APH(3')-Ic, apha7
Protein Sequence:  
MSHIQRETSC SRPRLNSNLD ADLYGYRWAR DNVGQSGATI YRLYGKPDAP ELFLKHGKGS VANDVTDEMV RLNWLTAFMP LPTIKHFIRT PDDAWLLTTA
IPGKTAFQAL EEYPDSRENI VDALAAFLRR LHSIPLCNCP FNSDRVFRLA QAQSRMNNGL VDASDFDDER NGCPVEQVWK EMHKLLPFSP DSVVTHGDFS
LDNLIFDEGK LIGCIDVGRV GIADRYQDLA ILWNCLGEFS PSLQKRLFHK YGIDNPDMNK LQFHLMLDEF F

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
socD SocD Tn4378.1 1119 10230-11348 -
Class:   Passenger Gene
Sub Class:   Other
Comment:   putative santhopine-degrading protein || FAD-binding oxidoreductase
Protein Sequence:  
MSDKTVAKTV IVLGGGILGV STAWQLAQAG AQVTLVTEAA LCSGATGRSL SWLNSAGERS QPYHALRIAG IDRYRTLFAR HPQLDWLRFD GAIYWAADDD
AGTKARHHYE KAQGYDSKLI NRATVGDVDA QVNPASLGHV AIANPGEGWV SLPHLVDHLV QAFRALGGEI IENTGKASVI TKDGRACGIR SEKHGALLAD
QVLVACGPWT PEVVAPLGVQ IPNGSPVSML VTSQPCDLAP QVVLNTPRAA VRPNPGNTIA VDHDWYEEHI VAQGDGQYRI DTSVINQLMA EAGNLIGDGA
PLTANSCKIG LKPIPGDGEP VVGELQKVPG CFVVFTHSGA TLALILGEML TQEMLTGVKH PMLATFRPER FS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn5403a 606 11740-12345 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MGHRAAIYCR VSTADQSCER QEFDLRAFAG RAGYDVVGIF KETGSGTKLD RAERKKVLAL AQSRQIDAIL VTELSRWGRS TLDLLNTLRE LENWKVSVIA
MNGMAFDLSS PYGRMLATFL SGIAEFERDL ISERVKSGLA VAKARGKRLG RQAGVRPKSD RLLPKVVAMR AEGRSYRWIA RELGISKNTV ADIVQRHRAN
A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn5403a 2859 12440-15298 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MSRRHIFTER QRAALFDLPT DELSLLKFYT LGDDDLENIR QRRRPENRIG FALQLCALRY PGRALAPGEM IPREILSFVG AQLGVPADAL LTYATRRQTR
QQHMDTLREI YGYKTFTGRG ARDLRKWTFG QAEDARSNED LAHRFIVRCR ETSTILPAVS TIERLCADAL VAAERRIETR IVENLTADVR DHLDKLLSEM
LAGNISRFIW LRNFEVGNNS AAANRLLDRL EFLRTLNINH SALASIPAHR IARLRRQGER YFTDGLRDIT SDRRWAILAV CVVEWEAAIA DAIVETHDRI
VGKTWREAKR QHDETISGSK ATLADTIRTF TALGASLLEA RSDGTPLEMA VASSVAWDRL AQLVATGTQL SNTLADEPLA YVGQGYHRFR RYAPRMLRCL
KLEAAPVAGP LVAAALSIGE MKGVASPERR FLRPSSKWNR HLRAQEKGDT RLWEVAVLFH LRDAFRSGDV WLAHSRRYGD LKQVLVPMIA AQENAKLAVP
SNPQDWLADR KARLTIALKR LARAARNGTI PHGSIEDGTL RIDRLTADVP DGAEALILDL YRRMPSVRIT DMLLEVDAAL GFTDAFTHLR TGAPCRDRIG
LLNVLLAEGL NLGLRKMAEA TNTHDYWQLS RLARWHVESE AMNQALAIVV AAQGKLPMSR VWGMGTSASS DGQFFPTARH GEAMNMVNAK YGSVPGLKAY
THVSDQFAPF ACQSIPATVS EAPYILDGLL MNEVGRHVRE QYADTAGFTD HLFGASSLLG YNLVLRIRDL PSKRLYVFNP DTTPRELRKL VGGKAREDLI
VANWPDIFRC AATMTAGKIR PSQLLRKLAS YPRQNNLAVA LREVGRIERT LFIIEWILDT DMQRRAQIGL NKGEAHHALK NALRIGRQGE IRDRTTEGQH
YRIAGLNLLT AVIIYWNTVH LGHAVTERRN EGLDVPPEFL PHISPLAGRT FY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA 3'-end N Tn4378.1 2750 15370-18119 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   tnpA ORF interrupted
Protein Sequence:  
PLLKLVADQL KVSVESWAST GSGSRPGASI WSSANGVRLP ALHHEPLPAG RPHADRAGHA NRQGHCAGQR LDRASAAAVG HSACPQRRRA GERRSDHPRQ
PAHLLRLGRT TVGRASPPPR RSAQAPGQRQ DDLAGLAAPV TRQAQFAAYA GAHRTTQGMA GARSAYRHRA ADPPKPAAQD RPRGRPDDTR RPGQVRGAAA
LRDPGGARH* RHGHRHRRNH RPARPHPGQA VQRRQEQASA AVPGVRQGDQ RQGAAVRAYR SGTDRGQAIG PRSVCRHRGR HVLGRLRRER HRSAEARAAR
GLRFPAPHRR ELRHAASLRA GIPRRAQAAG RARCQGCAGG HRSAAQHEQR QRPQGARRRA NRFHQAALAE AGDDRHRHRS ALLRTVRAVG NEKRPALRRH
LGAGIAPVQG LRGLPGATRE IRQPQAGQRI AAGRGHRLRP VPA*PADAAG NAARHRQPHG AGQRAAGRHH HGVGPEDHAA RCGGARHRAG ADRPDSNDPA
ARQDHRTAAG GRRMDRLHPA LRAPEIGRPG QGQEPAADHD PGRRHQPGSD QDGGVLPRND LRQARLAPSL AYPRRNLFVG AGRTGQCAVP ASLRRALGRR
HHVIVGRPEF PNRQQGREHW PHQPEIWQQS WADFLHPHLR PVRAIPHQGG QCRRARLDLC ARRLAVPRVR PAHRGALHRY GRIHRSCIWP DAPAGLPLCA
AHPRPGRHQA VHPQGRHRLR RAQADD*QRQ TEHQGYSRPL G*NSTAGHVD QAGHGDGFAD AAQARQLSAP ERPGRGPARA GAYRAHAVHP GLVAKRGAAP
SRARWAEQGR SPQCAGPRRV LQPSG*NPRP QL*AAALPCQ RPQPGDGGRR AMEHGLSGTG CARAAGQRPR RR*RAVAVPV AARLGAHQPD RRLPLAQQRQ
DRRGQVQAAA AVATCL

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
Tn4378.1-EU287476.1 Tn4378.1 Transposon 1-18151 18151
Tn5403a-EU287476.1 Tn5403a Transposon 11707-15369 3663

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRL Tn511 1-38 GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAG
IR Tn4378.1 8191-8228 GAATCGCACG AAATAAAAGG CAAAAGACTC TGCTGGGG
IRL Tn5403a 11707-11752 GGGGTCGGTT CCGGCTGAGG GCGAAATGAC ACCCTAAGCG TTAGCT
IRR Tn5403a 15324-15369 TGGCTTTCGA ATCCCACAGT AAAGCGGGAG TCGGCCTTGG CTGGGG
IRR Tn4378.1 18114-18151 GAATCGCACG AAATAAAAGG CAAAAGACTC TGCTGGGG

 References     

1.Petrovski S, Stanisich VA. Embedded elements in the IncPbeta plasmids R772 and R906 can be mobilized and can serve as a source of diverse and novel elements. Microbiology (Reading). 2011 Jun;157(Pt 6):1714-1725. doi: 10.1099/mic.0.047761-0. Epub 2011 Mar 10. PubMed ID: 21393370
2.Coetzee JN. Mobilization of the Proteus mirabilis chromosome by R plasmid R772. J Gen Microbiol. 1978 Sep;108(1):103-9. doi: 10.1099/00221287-108-1-103. PubMed ID: 357678