|
|
|
|
Name: Tn511 |
|
Family: Tn3 Group: Tn163 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Proteus mirabilis | Molecular Source: | plasmid R772 |
| | | |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 38 bp) | | GGGGTCGTCTCAGAAAACGGAAAATAAAGCACGCTAAG |
IRR (Length: 38 bp) | | GGGGTCGCCTCAGAAAACGGAAAATAAAGCACGCTAAG |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCCGAAC CTGCCAAGCT TGCTCCACCC TGTAGTGACG CGATCAGCGG GCAGGAAACG 100
TTCCCCCTTC GCGCATGGCA GGCGCACACC AACTCAGACA GCACGGCCTC CATGCGCGCC AGGTCAGCCA TTTTCTCGCG CACGTCCTTG AGCTTGTGCT 200
CGGCCAGACT GCTGGCTTCC TCGCAATGGG TGCCATCCTC CAGCCGCAGC AGCTCGGCGA TCTCATCCAG GCTGAAGCCC AGCCGCTGGG CTGATTTCAC 300
GAAGCGCACC CGCGTTACAT CCGCCTCGCC ATAGCGGCGG ATGCTGCCAT AGGGCTTGTC AGGCTCCAGC AACAAGCCCT TGCGCTGATA GAAACGGATG 400
GTCTCCACAT TGACCCCGGC CGCCTTGGCG AAAACGCCAA TGGTCAGGTT CTCCAAATTG TTTTCCATAT CGCTTGACTC CGTACATGAG TACGGAAGTA 500
AGGTTACGCT ATCCAATTTC AATTCGAAAG GACAAGCGCA TGTCTGAACC AAAAACCGGG CGCGGCGCGC CCTTCACTGG AGGGCTTGCC GCCATCCTCG 600
CCTCGGCTTG CTGCCTCGGG CCGTTGGTTC TGATCGCCTT GGGGTTCAGC GGCGCTTGGA TCGGCAACTT GGCGGTGTTG GAACCCTATC GCCCCATCTT 700
TATCGGCGTG GCGCTGGTGG CGTTGTTCTT CGCCTGGCGG CGCATCTACC GGCAGGCAGC GGCCTGCAAA CCGGGTGAGG TCTGCGCGAT TCCCCAAGTG 800
CGAGCTACTT ACAAGCTCAT TTTCTGGATC GTGGCCGCGC TGGTTCTGGT CGCGCTCGGA TTTCCCTACG TCATGCCATT TTTCTACTGA TCGGAGTTCA 900
CCATGAAGAA ACTGTTTGCC TCCCTCGCCC TCGCCGCCGT TGTTGCCCCC GTCTGGGCCG CCACCCAGAC CGTCACGCTG TCCGTACCGG GCATGACCTG 1000
CTCCGCCTGC CCGATCACTG TCAAGAAGGC GATTTCCAAG GTCGAAGGCG TCAGCAAAGT TGACGTGACT TTCGAGACAC GCCAAGCGGT CGTCACCTTC 1100
GACGATGCCA AGACCAGCGT GCAGAAGCTG ACCAAGGCAA CCGCAGACGC GGGCTATCCG TCCAGCGTCA AGCAGTGAGT CACTGAAAAC GGCACCGCAG 1200
CACAACGGAC GTCATTGTCT GGCGCCACAA ACGATAAAGG ATCTGTTGCA TGACCCATCT AAAAATCACC GGCATGACTT GCGACTCGTG CGCGGCGCAC 1300
GTCAAGGAAG CGCTGGAAAA AGTGCCAGGC GTGCAGTCGG CGCTGGTGTC CTATCCGAAG GGCACAGCGC AACTCGCCAT CGTGCCGGGC ACATCGCCGG 1400
ACGCGCTGAC TGCCGCCGTG GCCGGACTGG GCTACAAGGC AACGCTAGCC GATGCGCCAC TGGCGGACAA CCGCGTCGGA CTGCTCGACA AGGTGCGGGG 1500
ATGGATGGCC GCCGCCGAAA AGCACAGTGG CAACGAGCCC CCGGTGCAGG TAGCGGTCAT TGGCAGCGGT GGAGCCGCGA TGGCGGCGGC GCTGAAAGCC 1600
GTCGAGCAAG GCGCGCAGGT CACGCTGATC GAGCGCGGCA CCATCGGCGG CACCTGCGTC AATGTCGGCT GTGTGCCGTC CAAGATCATG ATCCGCGCCG 1700
CCCACATCGC CCATCTGCGC CGGGAAAGCC CGTTCGATGG CGGTATTGCG GCAACTGTGC CTACGATTGA CCGCAGTAAG CTGCTGGCCC AGCAGCAGGC 1800
CCGCGTCGAC GAACTGCGGC ACGCCAAGTA CGAAGGCATC CTGGGCGGTA ATCCGGCCAT CACCGTTGTG CACGGTGAGG CGCGCTTCAA GGACGACCAG 1900
AGCCTTACCG TCCGTTTGAA CGAGGGTGGC GAGCGCGTCG TGATGTTCGA CCGCTGCCTG GTCGCCACGG GTGCCAGCCC GGCGGTCCCG CCGATTCCGG 2000
GCTTGAAAGA GTCACCCTAC TGGACTTCCA CCGAGGCCCT GGCGAGCGAC ACCATTCCCG AACGCCTTGC CGTAATCGGC TCGTCGGTGG TGGCGCTGGA 2100
GCTGGCGCAA GCCTTTGCCC GGCTGGGCAG CAAGGTCACG GTCCTGGCGC GCAATACCTT GTTCTTCCGT GAAGACCCGG CCATCGGCGA GGCGGTGACA 2200
GCCGCTTTCC GTGCCGAGGG CATCGAGGTG CTGGAGCACA CGCAAGCCAG CCAGGTCGCC CATATGGACG GTGAATTCGT GCTGACCACC ACGCACGGTG 2300
AATTGCGCGC CGACAAACTG CTGGTTGCCA CCGGTCGGAC ACCGAACACG CGCAGCCTCG CGCTGGACGC AGCGGGGGTC ACTGTCAATG CGCAAGGTGC 2400
CATCGCCATC GACCAAGGCA TGCGCACGAG CAACCCGAAC ATCTACGCGG CCGGCGACTG CACCGACCAG CCGCAGTTCG TCTATGTGGC GGCAGCGGCC 2500
GGCACCCGTG CCGCGATCAA CATGACCGGC GGCGATGCGG CGCTCGACCT GACCGCAATG CCGGCCGTGG TGTTCACCGA TCCGCAAGTG GCGACCGTGG 2600
GCTACAGCGA GGCGGAAGCC CACCACGACG GGATCGAGAC CGACAGCCGC ACCTTGACCT TGGACAACGT GCCGCGTGCG CTCGCCAACT TCGACACACG 2700
CGGCTTCATC AAGTTGGTTA TCGAGGAAGG CAGCCATCGG CTGATCGGCG TACAGGCGGT CGCGCCGGAA GCGGGTGAAC TGATCCAGAC GGCGGCTCTG 2800
GCCATTCGCA ACCGCATGAC GGTGCAGGAA CTGGCCGACC AGTTGTTCCC CTACCTGACG ATGGTCGAGG GGTTGAAGCT CGCGGCGCAG ACCTTCAACA 2900
AGGATGTGAT GCAGCTTTCC TGCTGCGCCG GGTGAGAAAA AGGAGGTGTT CAATGAACGC CTACACGGTG TCCCGGCTGG CTCTTGATGC CGGGGTGAGC 3000
GTGCATATCG TGCGCGACTA CCTGCTGCGC GGATTGCTGC GCCCGGTGGC GTGCACACCA GGCGGCTACG GCTTGTTCGA TGACGCCGCC TTGCAACGGC 3100
TGTGCTTCGT GCGGGCGGCC TTCGAGGCGG GCATCGGCCT CGACGCGCTG GCGCGGCTGT GCCGGGCGCT GGATGCGGCG GACGGCGACG AAGCGGCCGC 3200
GCAGCTTGCC CTGCTGCGTC AGTTCGTCGA GCGTCGGCGC GAAGCGTTGG CCGATCTGGA AGTGCAGTTG GCCACCCTGC CGACCGAGCC GGCACAGCAC 3300
GCGGAGAGTC TGCCATGAAC AACCCCGAGC GCTTGCCGTC CGAGACGCAC AAACCGATCA CCGGCTACCT GTGGGGCGGA CTGGCTGTGC TGACTTGCCC 3400
CTGCCACCTG CCCATCCTCG CTGTCGTGCT GGCCGGCACA ACCGCCGGTG CTTTCCTCGG CGAGCATTGG GTCATCGCGG CGCTCGGTTT GACCGGCCTG 3500
TTCCTTCTGT CCCTGTCGCG GGCGTTGCGG GCATTCAGGG AAAGAGAATG AGCGCTTTCC GGCCGGATGG ATGGACGACG CCGGAACTGG CCCAAGCGGT 3600
CGAGCGCGGG CAGCTTGAAC TGCACTACCA GCCCGTCGTC GATCTGCGCA GTGGTGGGAT TGTCGGCGCG GAAGCCCTGT TGCGCTGGCG TCATCCGACG 3700
CTTGGACTAT TGCCACCGGG CCAGTTCCTG CCCGTGGTCG AATCGTCCGG CCTGATGCCT GAAATCGGCG CTTGGGTGCT GGGCGAAGCC TGCCGCCAGA 3800
TGCGTGACTG GCGAATGCTG GCATGGCGAC CGTTCCGGCT GGCCGTCAAT GTTTCGGCGA GCCAAGTGGG ACCGGACTTC GACGGGTGGG TAAAGGGCGT 3900
GCTGGCTGAT GCCGAGTTGC CCGCCGAGTA TCTCGAAATC GAGCTGACCG AATCGGTCGC GTTTGGTGAT CCGGCGATCT TCCCCGCCCT GGACGCCTTG 4000
CGGCAGATCG GTGTGCGCTT CGCCGCCGAT GACTTCGGGA CGGGGTATTC CTGTCTGCAA CATCTGAAGT GCTGCCCAAT CAGCACGCTC AAGATCGACC 4100
AATCGTTTGT CGCCGGGCTC GCCAACGACC GCCGCGACCA AACCATCGTG CACACCGTGA TTCAGCTTGC GCACGGGCTG GGCATGGATG TGGTGGCTGA 4200
AGGCGTGGAA ACATCGGCGA GTCTTGATCT ATTGCGACAA GCGGACTGCG ACACAGGACA AGGCTTCCTG TTCGCGAAGC CAATGCCGGC GGCGGCATTC 4300
GCCGTCTTCG TCAGTCAATG GAGGGGTGCC ACCATGAATG CAAGTGACTC GACCACCACC AGTTGCTGCG TGTGCTGCAA GGAAATCCCG CTCGATGCCG 4400
CCTTCACCCC GGAAGGCGCG GAATACGTCG AGCACTTCTG CGGGTTGGAG TGTTATCAAC GCTTCGAAGC GCGTGCCAAG ACAGGGAACG AAACCGATGC 4500
CGATCCGAAC GCCTGCGACT CGCTACCGTC AGATTGAGGC ATACCCTAAC CGGATGTCAG GGTAGACTGC CTCACAACGT CAGAATAGAG TCGGTTGTGT 4600
TATTTATTGA CACTAGCTGA AAAAGGTCAT AGATTTCTTC CTGACATTTT CGTCCAGGGA GGCATCTTGC AGGGTCAACG CATCGGCTAC GTCCGGGTCA 4700
GCAGCTTCGA CCAGAACCCG GAACGGCAAC TTGAACACGT CGAAGTCGGC AAGGTGTTCA CCGACAAGGC GTCGGGCAAG GACACCCAGC GGCCCGAGCT 4800
TGATTCGCTG CTGGCCTTCG TGCGCGAAGG CGACACCGTG GTGGTTCACA GCATGGATCG CTTGGCGCGC AACCTCGATG ACTTGCGCCG CCTCGTGCAA 4900
AAGCTCACCA AGCGCGGCGT GCGCATCGAG TTCGTCAAGG AGAGCCTGAC CTTCACCGGC GAGGATTCGC CGATGGCGAA CCTGATGCTG TCGGTCATGG 5000
GGGCGTTCGC TGAATTCGAG CGGGCCTTGA TCCGCGAGCG GCAGAGGGAA GGCATCGCGC TCGCCAAGCA ACGCGGAGCC TACCGGGGCC GCAAGAAAGC 5100
GCTGTCGCCC GAACAGGTAG CCGATCTGCG GCAGCGGGCC GCCGCCGGCG AACAAAAAGC GAAGCTGGCC CGCGAGTTTG GTGTCAGCCG GGAGACCCTG 5200
TATCAATACT TGAGAGCGGA TCAGTAAATA TGCCACGTCG TTCCATTCTG TCCGCCGCCG AGCGGGAAAG CCTGTTGGCG TTGCCGGATA CCAAGGACGA 5300
CTTGATCCGA TACTACACGT TCAGCGATAC CGACCTCTCC ATCATCCGGC AACGGCGCGG GCCTGCGAAC CGCTTGGGCT TTGCAGTTCA GCTCTGCTAC 5400
CTGCGCTTTC CCGGCATCCT CCTTGGCGTC GATGAGCCGG ATGAGCCGCC GTTCCGCCCT GCTGAAACTG TCGCCGACCA GCTCAAGGTC AGTGTCGAAA 5500
GCTGGGCGAG TACGGGCAGC GGGAGCAGAC CCGGCGCGAG CATCTGGTCG AGTGCAAACG GTGTTCGGCT TCCAGCCCTT CACCATGAGC CACTACCGGC 5600
AGGCCGTCCA CACGCTGACC GAGCTGGCCA TGCAAACCGA CAAGGGCATT GTGCTGGCCA GCGCCTTGAT CGAGCATCTG CGGCGGCAGT CGGTCATTCT 5700
GCCTGCCCTC AACGCCGTCG AGCGGGCGAG CGCCGAAGCG ATCACCCGCG CCAACCGGCG CATCTACTAC GCCTTGGCCG AACCACTGTC GGACGCGCAT 5800
CGCCGCCGCC TCGACGATCT GCTCAAGCGC CGGGACAACG GCAAGACGAC TTGGCTGGCC TGGCTGCGCC AGTCACCCGT CAAGCCCAAT TCGCGGCATA 5900
TGCTGGAGCA CATCGAACGA CTCAAGGCAT GGCAGGCGCT CGATCTGCCT ACCGGCATCG AGCGGCTGAT CCACCAAAAC CGGCTGCTCA AGATCGCCCG 6000
CGAGGGCGGC CAGATGACAC CCGCCGACCT GGCCAAGTTC GAGGCGCAGC GGCGCTACGC GACCCTGGTG GCGCTCGCCA TTGAAGGCAT GGCCACCGTC 6100
ACCGACGAAA TCATCGACCT GCACGACCGC ATCCTGGGCA AGCTGTTCAA CGCCGCCAAG AACAAGCATC AGCAGCAGTT CCAGGCGTCC GGCAAGGCGA 6200
TCAACGCCAA GGTGCGGCTG TTCGGGCGTA TCGGTCAGGC ACTGATCGAG GCCAAGCAAT CGGGCCGCGA TCCGTTTGCC GCCATCGAGG CCGTCATGTC 6300
CTGGGACGCC TTCGCCGAGA GCGTCACCGA AGCGCAGAAG CTCGCGCAGC CCGAGGACTT CGATTTCCTG CACCGCATCG GCGAGAGCTA CGCCACGCTG 6400
CGTCGCTACG CGCCGGAATT CCTCGCCGTG CTCAAGCTGC GGGCCGCGCC CGCTGCCAAG GATGTGCTGG AGGCCATCGA AGTGCTGCGC AACATGAACA 6500
GCGACAACGC CCGCAAGGTG CCCGCCGACG CGCCAACCGA TTTCATCAAG CCGCGCTGGC AGAAGCTGGT GATGACCGAC ACCGGCATCG ATCGGCGCTA 6600
CTACGAACTG TGCGCGCTGT CGGAGATGAA AAACGCCCTG CGCTCCGGCG ACATCTGGGT GCAGGGATCG CGCCAGTTCA AGGACTTCGA GGACTACCTG 6700
GTGCCACCCG CGAAATTCGC CAGCCTCAAG CAGGCCAGCG AATTGCCGCT GGCCGTGGCC ACCGATTGCG ACCAGTACCT GCATGACCGG CTGACGCTGC 6800
TGGAAACGCA GCTCGCCACC GTCAACCGCA TGGCGCTGGC CAACGAGCTG CCGGACGCCA TCATCACGGA GTCGGGCCTG AAGATCACGC CGCTCGATGC 6900
GGCGGTGCCC GACACCGCGC AGGCGCTGAT CGACCAGACA GCAATGATCC TGCCGCACGT CAAGATCACC GAACTGCTGC TGGAGGTAGA CGAATGGACA 7000
GGCTTCACCC GGCACTTCGC GCACCTGAAA TCGGGCGACC TGGCCAAGGA CAAGAACCTG CTGCTGACCA CGATCCTGGC CGACGCCATC AACCTGGGTC 7100
TGACCAAGAT GGCGGAGTCC TGCCCCGGAA CGACCTACGC CAAGCTCGCC TGGCTCCAAG CCTGGCATAC CCGCGACGAA ACCTATTCGT CGGCGCTGGC 7200
CGAACTGGTC AATGCGCAGT TCCGGCATCC CTTCGCCGAG CACTGGGGCG ACGGCACCAC GTCATCGTCG GACGGCCAGA ATTTCCGAAC CGGCAGCAAG 7300
GCCGAGAGCA CTGGCCACAT CAACCCGAAA TATGGCAGCA GTCCTGGGCG GACTTTCTAC ACCCACATCT CCGACCAGTA CGCGCCATTC CACACCAAGG 7400
TGGTTCATGT CGGCGTGCGC GACTCGACTA TGTGCTCGAC GGCTGCTGTA CACGAGTCCG ACTGCGCATC GAGAGCACTA CACCGATACG GCAGGATTCA 7500
CCGATCATGT ATTTGGCTGA TGCACTGCTG GGCGTCCGCT TGCGCCGCGC ATCCGCGACC TGGGCGACAC CAAGCTGTTC ATCCCCAAGG GCGACACCGT 7600
CTACGACGCG CTCAAGCCGA TGATTAGCAG CGACAGACTG AACATCAAGG CTATTCGCGC CCATTGGGAT GAAATTCTAC GGCTGGCCAC GTCGATCAAG 7700
CAGGGCACGG TGACGGCTTC GCTGATGCTG CGCAAGCTCG GCAGCTATCC GCGCCAGAAC GGCCTGGCCG TGGCCCTGCG CGAGCTGGGG CGTATCGAGC 7800
GCACGCTGTT CATCCTGGAT TGGTTGCAAA GCGTGGAGCT GCGCCGTCGC GTGCACGCTG GGCTGAACAA GGGCGAAGCC CGCAATGCGC TGGCCCGCGC 7900
CGTGTTCTTC AACCGTCTGG GTGAAATCCG CGACCGCAGC TTTGAGCAGC AGCGCTACCG TGCCAGCGGC CTCAACCTGG TGACGGCGGC CGTCGTGCTA 8000
TGGAACACGG TCTATCTGGA ACGGGCTGCG CACGCGCTGC GGGGCAACGG CCACGCCGTC GATGACGCGC TGTTGCAGTA CCTGTCGCCG CTCGGCTGGG 8100
AGCACATCAA CCTGACCGGC GATTACCTCT GGCGCAGCAG CGCCAAGATC GGCGCGGGCA AGTTCAGGCC GCTGCGGCCG TTGCAACCTG CTTAGCGTGC 8200
TTTATTTTCC GTTTTCTGAG ACGACCCCTT TACGAGTTTT GCCTATCCGG GGTCGATTTG CGGCCACACC TTTGCGGTGA ATGAAAAAGG TATCGTACAC 8300
GGTGAATAAC ATCCGTGCAG TGCATCGTCG CAGGGATACC GCGTCAGATC TGGCTCGCGC CTCGCTGAAT GCGAACACCC TCGACGAGAC CATCACTATA 8400
CTTACCGGCC AGCCGCGATC CGGGGCGTTT CACCATACGC TCGGGCAGCT GGGCGATACG CGGCTGTTCA GCGTTGAGGC CACCGGTCAA GGCTGTTCCG 8500
TTCTCCCCCT GACGACGGTA ACCGGGCATG CCAACCATCT GGTGCATGCG GCACTGGCTG GCGTTGAGCA GATCGTTACC GACAGTTCAG CGTCGCGGCA 8600
GTTACGCCTC GTGCAGTGGC GGGAAACGCA ACCTCCGTTT GACGCCGCAG CGGCAAAAGC CATTCTCTCA GATACCCATG ACGCCGAACT GCCAATTTAC 8700
CGGCTGGCTG CCGACGATCC GGATGAGGAG AACACCCTCG CCACGGCAGT TTTCACCCTC GATGCCAACC ACGTCAGGTG GCAAATTTTC GACATTAACC 8800
GCGACGATGC TAAATTTCAG GGAGAAGTGC GTGGGTGAGA TAGCCGGTCG TCAGTCATAA AGGGCAGGGT AGTACCGTTG GCCCGGCGTT CGATAGTACC 8900
GTCCGGATAC TGCCCGCTAT CGGCGTCTTG AGCGATGTCC TGAAGCGCGG TGTCCGGAAT ATTCAGGTTT GTGTCTCTAT ACGATTCGGG CCATCGCCGT 9000
TTCTCATTTT GGTGTTGTTG TTGACAGGGA AACCAAAGAC TAGCTGTTAG AAAAACTCAT CGAGCATCAA ATGAAACTGC AATTTATTCA TATCAGGGTT 9100
ATCAATGCCA TATTTATGAA AAAGCCGTTT TTGTAATGAA GGTGAAAATT CACCGAGGCA GTTCCATAGG ATGGCAAGAT CCTGGTATCG GTCTGCGATT 9200
CCGACTCGTC CAACATCAAT ACAACCTATT AATTTCCCCT CGTCAAAAAT AAGGTTATCA AGTGAGAAAT CACCATGAGT GACGACTGAA TCCGGTGAGA 9300
ATGGCAAAAG CTTATGCATT TCTTTCCAGA CTTGTTCAAC AGGACAGCCA TTACGCTCGT CATCAAAATC ACTCGCATCA ACCAAACCGT TATTCATTCG 9400
TGATTGCGCC TGAGCGAGAC GAAATACGCG ATCGCTGTTA AAAGGACAAT TACAAAGAGG AATCGAATGC AACCGGCGCA GGAACGCTGC CAGAGCATCA 9500
ACAATATTTT CACGTGAATC AGGGTATTCT TCTAATGCCT GGAATGCTGT TTTCCCGGGG ATCGCAGTGG TGAGTAACCA CGCATCATCA GGAGTACGGA 9600
TAAAATGCTT GATGGTCGGA AGAGGCATAA ATGCCGTCAG CCAGTTTAGT CTGACCATCT CATCTGTAAC ATCATTGGCA ACGCTACCTT TGCCATGTTT 9700
CAGAAACAAC TCTGGCGCAT CGGGCTTCCC ATACAATCGA TAGATTGTCG CACCTGATTG CCCGACATTA TCGCGAGCCC ATCTATACCC ATATAGATCA 9800
GCATCCAGAT TTGAATTTAA TCGCGGCCTC GAGCAAGATG TTTCCCGTTG AATATGGCTC ATAACGCTCC TTGTATTACT GTTTATGTAA GCAGACAGTT 9900
TTATTGTTCA TGATGATATA TTTTTATCGT GTGTAATGTA ACATCAGATA TTTTGAGACA CGACGTGGTT CCCGCCATGT GAAAATCAGG CCAGACCAAC 10000
ATCATTTTCA GGTGGCCGAT GCACATCAAT GTCATATTGA TAGTCTAAAC CACTCACTTT GCGTGGTCAA GGTCAGATTT AGTTACGGGA CGAAGATGCA 10100
GTCGGGTGGG GTGAGGTTGC AAAAGCATTT CCTTAATAAA GAGTTCATCA GCATAGTTGG ATAACTGTTT TTATAGTAAG TAAAATCACG GGTTTTACGT 10200
CGCAAGCGCC GTAAAACCCG CTGATGGGGT TAGCTGAATC TTTCCGGGCG GAAGGTCGCC AGCATCGGGT GTTTCACGCC AGTGAGCATC TCCTGCGTAA 10300
GCATCTCGCC AAGGATTAAC GCCAGCGTGG CCCCGGAATG CGTGAAGACC ACAAAACAGC CGGGAACTTT CTGCAACTCG CCGACCACCG GTTCGCCATC 10400
GCCAGGAATA GGCTTCAGGC CAATTTTGCA GCTGTTGGCT GTTAACGGTG CGCCGTCACC TATCAGGTTA CCAGCTTCCG CCATCAACTG GTTAATGACA 10500
GACGTATCAA TCCTGTACTG ACCGTCGCCC TGCGCGACGA TGTGCTCTTC ATACCAGTCA TGATCGACGG CGATAGTATT GCCGGGATTA GGTCGGACAG 10600
CAGCGCGCGG CGTATTCAAT ACCACCTGCG GCGCGAGATC GCAGGGCTGG CTGGTCACCA GCATTGAGAC CGGAGAACCA TTGGGGATCT GCACGCCGAG 10700
TGGGGCAACC ACCTCCGGCG TCCACGGGCC ACACGCCACC AGTACCTGAT CCGCCAGCAA TGCACCGTGT TTTTCGCTGC GAATACCGCA GGCCCGGCCG 10800
TCTTTGGTTA TCACGCTGGC TTTACCGGTG TTCTCAATGA TTTCCCCGCC CAGCGCGCGA AACGCCTGAA CCAGATGATC AACCAGATGA GGCAAGCTGA 10900
CCCAGCCTTC ACCTGGATTG GCAATCGCGA CGTGACCCAG GCTGGCCGGA TTCACTTGCG CATCGACATC ACCGACCGTG GCGCGATTGA TCAGCTTTGA 11000
GTCATAGCCC TGCGCCTTTT CATAGTGATG ACGAGCTTTG GTGCCAGCGT CATCGTCTGC GGCCCAGTAA ATAGCGCCGT CGAAACGCAG CCAGTCGAGT 11100
TGCGGATGGC GGGCAAACAG CGTGCGATAG CGGTCAATGC CCGCTATCCG CAGCGCATGA TAAGGCTGCG AACGTTCGCC AGCGGAGTTG AGCCAGGAGA 11200
GTGAGCGGCC AGTGGCACCG GAACAGAGCG CCGCTTCCGT CACCAGCGTG ACTTGCGCGC CCGCCTGCGC CAGCTGCCAG GCGGTTGAGA CGCCGAGGAT 11300
GCCGCCACCC AGCACAATGA CGGTTTTTGC TACGGTTTTG TCAGACATCA TTAATTCCTT CTGTGAAAAA ATGTTCGCCC TTCAGGCAAT GCGTACACGA 11400
TGCTGCCGAT GGAGTTGCTC GATAAGCTGT AAAACGTTGA GCGCTTCACG TGCATCAACC GGTGGTGACC CGCCGTGTAA CAGCGCGTGG GCTAACAACT 11500
GATAGAAAGT GGGGTAGTGG CCACGTTCAA TGGACACCAT TGTGGTTGCC CCGACCGTGC TGGCCCTGGC GAAATTTTCT ACCGGCGCGA CGCCGTACTG 11600
GCTATCCTCT TGATGTGTCA TCACAGCACC TCCGACAGAA AACGCTGCAG GCGCGCAGAT TGCGGGCGAT CAAAGATTTG TTCCGGTGTG CCAGCTTCCA 11700
CGATCTGGGG TCGGTTCCGG CTGAGGGCGA AATGACACCC TAAGCGTTAG CTCTGTGTCG TTGCACGATG TCAGCGACGG TATTCTTGCT GATACCGAGC 11800
TCGCGTGCGA TCCAGCGATA GCTGCGTCCC TCGGCCCTCA TCGCAACCAC CTTAGGCAAA AGTCGGTCTG ATTTTGGTCG CACTCCGGCC TGACGACCAA 11900
GCCTCTTACC ACGTGCCTTC GCAACAGCAA GGCCTGACTT GACCCGCTCG CTGATGAGAT CCCGCTCAAA CTCCGCAATG CCGGAAAGAA ACGTCGCCAG 12000
CATTCGTCCA TACGGCGACG AAAGATCGAA CGCCATTCCA TTCATGGCTA TCACGGAAAC CTTCCAGTTC TCCAGTTCAC GTAGCGTATT GAGCAGATCG 12100
AGCGTCGAGC GCCCCCACCG GGAAAGCTCA GTGACCAGGA TTGCATCAAT TTGTCTGGAC TGGGCAAGCG CCAGGACTTT CTTTCGCTCG GCCCGGTCGA 12200
GTTTAGTTCC TGAACCTGTT TCCTTAAATA TTCCCACCAC GTCGTAGCCG GCACGGCCGG CGAAGGCTCG CAGATCAAAT TCCTGGCGTT CACAAGACTG 12300
ATCCGCTGTT GAAACCCGGC AGTAAATGGC GGCACGATGT CCCAATTGAA CCCTCCTGGA TTTTTGTATC GGAACGCCCT GATTTATATG GGCTGGCTGT 12400
TGTCCAAAAC AGACTATACT TCAAAAGGGA CGAATTTGTA TGTCACGACG CCATATTTTC ACCGAACGGC AGCGAGCAGC GCTGTTCGAT CTGCCCACGG 12500
ACGAACTGTC GCTACTGAAG TTCTACACGC TGGGCGATGA TGACCTGGAA AACATTAGGC AGCGCCGCAG ACCGGAAAAC AGGATTGGCT TTGCCCTGCA 12600
ACTTTGTGCC TTACGATATC CGGGCCGTGC ACTGGCTCCT GGTGAGATGA TCCCGCGTGA AATCCTTTCC TTCGTCGGTG CTCAGCTTGG AGTTCCGGCT 12700
GATGCGCTTC TCACTTATGC CACACGGCGC CAAACCCGTC AGCAGCACAT GGACACGCTG CGCGAAATTT ACGGCTACAA GACCTTCACG GGCCGTGGTG 12800
CCCGTGATCT GCGGAAGTGG ACTTTCGGTC AGGCCGAAGA TGCCAGATCA AACGAGGATC TTGCTCATCG TTTTATTGTG CGGTGTCGGG AAACTTCCAC 12900
CATTCTGCCC GCAGTATCGA CAATCGAGCG CTTGTGCGCG GATGCTCTGG TCGCCGCTGA GCGGCGGATT GAAACGCGGA TTGTGGAAAA TTTAACAGCG 13000
GATGTTCGCG ATCACCTGGA CAAACTTCTG AGTGAAATGC TCGCCGGCAA TATCAGTCGT TTCATCTGGC TTCGCAACTT CGAGGTTGGT AACAACTCGG 13100
CTGCTGCTAA CCGTTTGCTC GACAGGCTCG AATTTCTGCG TACCCTGAAT ATCAATCATA GTGCTTTGGC CAGCATACCT GCCCATCGCA TTGCCCGGCT 13200
GCGTCGGCAG GGTGAACGCT ACTTCACCGA CGGTTTGCGT GACATCACTT CGGACCGCCG CTGGGCGATC CTTGCCGTCT GTGTTGTGGA GTGGGAAGCG 13300
GCGATTGCTG ATGCCATAGT CGAAACCCAT GACAGGATCG TAGGAAAAAC CTGGCGGGAA GCGAAGCGCC AGCATGACGA AACAATTTCC GGCTCTAAAG 13400
CCACACTCGC GGATACGATC CGTACCTTCA CCGCGCTGGG AGCTTCGTTG CTTGAGGCCC GCAGTGACGG AACCCCGCTG GAGATGGCTG TCGCCAGTTC 13500
GGTTGCATGG GACCGGCTCG CTCAACTGGT AGCGACAGGG ACTCAACTCA GCAACACGCT AGCCGATGAG CCTCTTGCAT ATGTCGGGCA GGGATACCAT 13600
CGCTTTCGTC GTTATGCGCC CCGCATGTTG CGCTGTCTGA AGCTCGAAGC CGCGCCGGTC GCCGGACCAT TGGTAGCAGC AGCTTTGTCG ATCGGAGAGA 13700
TGAAAGGTGT TGCATCGCCA GAAAGGCGTT TCCTGCGGCC CAGCTCCAAA TGGAACCGTC ATTTACGAGC TCAGGAAAAA GGAGATACCC GTCTTTGGGA 13800
AGTGGCGGTA CTCTTTCACC TCCGGGATGC TTTTCGTTCC GGAGATGTCT GGCTCGCTCA TTCGCGCCGC TATGGTGACC TCAAGCAGGT ACTGGTGCCG 13900
ATGATCGCGG CGCAGGAAAA TGCAAAACTG GCCGTGCCTT CCAACCCACA GGATTGGCTG GCAGACAGAA AGGCGCGACT CACGATCGCT CTTAAGCGGC 14000
TGGCCCGGGC TGCCCGTAAC GGCACTATTC CGCACGGTAG CATAGAAGAT GGAACGTTGC GGATCGACAG GTTGACAGCA GACGTGCCGG ATGGTGCCGA 14100
GGCACTCATA CTGGATCTGT ATCGCCGAAT GCCGTCCGTT CGGATTACCG ACATGCTGCT TGAAGTTGAT GCAGCCCTTG GTTTCACAGA TGCGTTTACC 14200
CATCTGAGAA CCGGGGCTCC ATGTCGCGAC CGGATCGGTC TGCTCAACGT CCTGCTCGCT GAAGGGCTCA ATCTGGGCCT GCGTAAGATG GCGGAAGCTA 14300
CAAACACGCA TGATTACTGG CAGCTCTCAC GCCTTGCCCG CTGGCATGTT GAAAGCGAAG CCATGAACCA GGCATTGGCA ATTGTGGTGG CCGCGCAGGG 14400
TAAACTGCCG ATGTCACGCG TCTGGGGGAT GGGCACGTCA GCATCGAGCG ATGGTCAGTT TTTCCCGACA GCGCGGCATG GCGAAGCCAT GAACATGGTC 14500
AATGCCAAAT ATGGTTCTGT TCCCGGCCTC AAAGCGTATA CTCACGTAAG CGACCAGTTC GCGCCATTCG CTTGTCAGTC GATCCCGGCG ACCGTGAGCG 14600
AGGCACCGTA TATTCTCGAT GGACTACTGA TGAACGAGGT CGGTCGCCAT GTTCGCGAAC AGTATGCCGA TACAGCAGGA TTCACCGACC ATTTGTTCGG 14700
AGCCAGTAGC CTGCTCGGCT ACAATCTCGT TCTGCGAATC AGGGATCTGC CATCGAAGCG GTTGTACGTA TTTAATCCCG ATACGACCCC CAGGGAGTTA 14800
CGCAAGTTGG TAGGTGGAAA AGCCCGGGAG GATCTTATCG TTGCGAACTG GCCTGATATT TTCCGTTGTG CCGCGACGAT GACCGCTGGC AAAATCAGGC 14900
CCAGCCAACT CCTGCGCAAG CTCGCTTCTT ACCCACGACA AAACAACCTT GCAGTTGCGC TTCGTGAAGT TGGTCGTATT GAACGGACCC TTTTCATTAT 15000
TGAGTGGATC CTGGATACGG ACATGCAGCG GCGTGCTCAG ATCGGTCTTA ACAAGGGAGA GGCCCACCAT GCGCTCAAAA ATGCGCTCCG TATCGGGAGG 15100
CAGGGGGAAA TTCGCGATCG CACGACAGAG GGGCAGCACT ACCGAATCGC TGGGCTCAAT TTATTGACTG CGGTGATCAT TTACTGGAAT ACCGTCCATC 15200
TTGGTCATGC CGTCACGGAG CGGCGGAACG AAGGGTTGGA TGTTCCCCCT GAATTTCTTC CCCACATATC CCCATTGGCT GGGCGCACAT TCTACTGACT 15300
GGCGAATATC TTTGGCCCAA GGAACCGAAA GCTTAGGGTG TCATTTCGCC CTCAGCCGGA ACCGACCCCC CCTTGCTGAA ACTGGTCGCC GACCAGCTCA 15400
AGGTCAGTGT CGAAAGCTGG GCGAGTACGG GCAGCGGGAG CAGACCCGGC GCGAGCATCT GGTCGAGTGC AAACGGTGTT CGGCTTCCAG CCCTTCACCA 15500
TGAGCCACTA CCGGCAGGCC GTCCACACGC TGACCGAGCT GGCCATGCAA ACCGACAAGG GCATTGTGCT GGCCAGCGCC TTGATCGAGC ATCTGCGGCG 15600
GCAGTCGGTC ATTCTGCCTG CCCTCAACGC CGTCGAGCGG GCGAGCGCCG AAGCGATCAC CCGCGCCAAC CGGCGCATCT ACTACGCCTT GGCCGAACCA 15700
CTGTCGGACG CGCATCGCCG CCGCCTCGAC GATCTGCTCA AGCGCCGGGA CAACGGCAAG ACGACTTGGC TGGCCTGGCT GCGCCAGTCA CCCGTCAAGC 15800
CCAATTCGCG GCATATGCTG GAGCACATCG AACGACTCAA GGCATGGCAG GCGCTCGATC TGCCTACCGG CATCGAGCGG CTGATCCACC AAAACCGGCT 15900
GCTCAAGATC GCCCGCGAGG GCGGCCAGAT GACACCCGCC GACCTGGCCA AGTTCGAGGC GCAGCGGCGC TACGCGACCC TGGTGGCGCT CGCCATTGAA 16000
GGCATGGCCA CCGTCACCGA CGAAATCATC GACCTGCACG ACCGCATCCT GGGCAAGCTG TTCAACGCCG CCAAGAACAA GCATCAGCAG CAGTTCCAGG 16100
CGTCCGGCAA GGCGATCAAC GCCAAGGTGC GGCTGTTCGG GCGTATCGGT CAGGCACTGA TCGAGGCCAA GCAATCGGGC CGCGATCCGT TTGCCGCCAT 16200
CGAGGCCGTC ATGTCCTGGG ACGCCTTCGC CGAGAGCGTC ACCGAAGCGC AGAAGCTCGC GCAGCCCGAG GACTTCGATT TCCTGCACCG CATCGGCGAG 16300
AGCTACGCCA CGCTGCGTCG CTACGCGCCG GAATTCCTCG CCGTGCTCAA GCTGCGGGCC GCGCCCGCTG CCAAGGATGT GCTGGAGGCC ATCGAAGTGC 16400
TGCGCAACAT GAACAGCGAC AACGCCCGCA AGGTGCCCGC CGACGCGCCA ACCGATTTCA TCAAGCCGCG CTGGCAGAAG CTGGTGATGA TCGACACCGG 16500
CATCGATCGG CGCTACTACG AACTGTGCGC GCTGTCGGAA ATGAAAAACG CCCTGCGCTC CGGCGACATC TGGGTGCAGG GATCGCGCCA GTTCAAGGAC 16600
TTCGAGGACT ACCTGGTGCC ACCCGCGAAA TTCGCCAGCC TCAAGCAGGC CAGCGAATTG CCGCTGGCCG TGGCCACCGA TTGCGACCAG TACCTGCATG 16700
ACCGGCTGAC GCTGCTGGAA ACGCAGCTCG CCACCGTCAA CCGCATGGCG CTGGCCAACG AGCTGCCGGA CGCCATCATC ACGGAGTCGG GCCTGAAGAT 16800
CACGCCGCTC GATGCGGCGG TGCCCGACAC CGCGCAGGCG CTGATCGACC AGACAGCAAT GATCCTGCCG CACGTCAAGA TCACCGAACT GCTGCTGGAG 16900
GTAGACGAAT GGACAGGCTT CACCCGGCAC TTCGCGCACC TGAAATCGGG CGACCTGGCC AAGGACAAGA ACCTGCTGCT GACCACGATC CTGGCCGACG 17000
CCATCAACCT GGGTCTGACC AAGATGGCGG AGTCCTGCCC CGGAACGACC TACGCCAAGC TCGCCTGGCT CCAAGCCTGG CATACCCGCG ACGAAACCTA 17100
TTCGTCGGCG CTGGCCGAAC TGGTCAATGC GCAGTTCCGG CATCCCTTCG CCGAGCACTG GGGCGACGGC ACCACGTCAT CGTCGGACGG CCAGAATTTC 17200
CGAACCGGCA GCAAGGCCGA GAGCACTGGC CACATCAACC CGAAATATGG CAGCAGTCCT GGGCGGACTT TCTACACCCA CATCTCCGAC CAGTACGCGC 17300
CATTCCACAC CAAGGTGGTC AATGTCGGCG TGCGCGACTC GACCTATGTG CTCGACGGCT TGCTGTACCA CGAGTCCGAC CTGCGCATCG AGGAGCACTA 17400
CACCGATACG GCAGGATTCA CCGATCATGT ATTTGGCCTG ATGCACCTGC TGGGCTTCCG CTTTGCGCCG CGCATCCGCG ACCTGGGCGA CACCAAGCTG 17500
TTCATCCCCA AGGGCGACAC CGTCTACGAC GCGCTCAAGC CGATGATTAG CAGCGACAGA CTGAACATCA AGGCTATTCG CGCCCATTGG GATGAAATTC 17600
TACGGCTGGC CACGTCGATC AAGCAGGGCA CGGTGACGGC TTCGCTGATG CTGCGCAAGC TCGGCAGCTA TCCGCGCCAG AACGGCCTGG CCGTGGCCCT 17700
GCGCGAGCTG GGGCGTATCG AGCGCACGCT GTTCATCCTG GATTGGTTGC AAAGCGTGGA GCTGCGCCGT CGCGTGCACG CTGGGCTGAA CAAGGGCGAA 17800
GCCCGCAATG CGCTGGCCCG CGCCGTGTTC TTCAACCGTC TGGGTGAAAT CCGCGACCGC AGCTTTGAGC AGCAGCGCTA CCGTGCCAGC GGCCTCAACC 17900
TGGTGACGGC GGCCGTCGTG CTATGGAACA CGGTCTATCT GGAACGGGCT GCGCACGCGC TGCGGGGCAA CGGCCACGCC GTCGATGACG CGCTGTTGCA 18000
GTACCTGTCG CCGCTCGGCT GGGAGCACAT CAACCTGACC GGCGATTACC TCTGGCGCAG CAGCGCCAAG ATCGGCGCGG GCAAGTTCAG GCCGCTGCGG 18100
CCGTTGCAAC CTGCTTAGCG TGCTTTATTT TCCGTTTTCT GAGACGACCC CCAGTTGTTC GGGCAGCACA GATGCCGCAA TCGTTTTGGC TGTTTCTTCT 18200
TCGAGTTGTC CCAGCGCCGC TGTGACCAGG ACACGGATGT CACGGTCGCG CAGGATCAGG ATCTTGCTTT GATACAACCA GCGCCGAGCG AATACGAGCA 18300
ATTGATCCCG ATCGGCGCAG CGGGCCACCT CATCGCGCAG GGTGCGTACC AGGGCGCGAC GTTGGTGCTC CGTCATCCAA TGAAACCCCA GGACATCACA 18400
GGCCAGTTGT TGATGGTCAA AGAGGGTTAC CATCCAGAGG TTCCAAGAGG AGGCGCGATG TCAGCAGATC GAAATCCGAT CCATGGATTA GAGAGAAGCA 18500
AGTGGTACGA GAGGACGCTA GTCAGGGAGC TTCTCGTTGC CGTACCCATG CCGTCGTGTT GTGGAACACG GTCTATCTGG AACGGGCTGC GCACGCGCTG 18600
CGTGGCAACG GCCATGCCGT TGATGACGCG CTGTTGCAGT ACCTGTCGCC GCTCGGTTGG GAGCACATCA ACCTCACCGG CGATTACCTC TGGCGCAGCA 18700
GCGCCAAGAT CGGCGCGGGC AAGTTCAGGC CGCTACGACC GCTGCAACCG GCTTAGCGTG CTTTATTTTC CGTTTTCTGA GGCGACCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res_site_I |
4527-4557 |
31 |
CGTCAGATTG AGGCATACCC TAACCGGATG T |
res_site_II |
4578-4612 |
35 |
CGTCAGAATA GAGTCGGTTG TGTTATTTAT TGACA |
res_site_III |
4615-4646 |
32 |
AGCTGAAAAA GGTCATAGAT TTCTTCCTGA CA |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
merR |
Tn4378.1 |
34-468 |
Passenger Gene |
Heavy Metal Resistance |
- |
merT |
Tn4378.1 |
540-890 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merP |
Tn4378.1 |
903-1178 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merA |
Tn4378.1 |
1250-2935 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merD |
Tn4378.1 |
2953-3318 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merE |
Tn4378.1 |
3315-3551 |
Passenger Gene |
Heavy Metal Resistance |
+ |
urfM |
Tn4378.1 |
3548-4537 |
Passenger Gene |
Other |
+ |
tnpR |
Tn4378.1 |
4667-5227 |
Accessory Gene |
Resolvase |
+ |
tnpA non-functional |
Tn4378.1 |
5230-8195 |
Transposase |
|
+ |
tnpA 5'-end |
Tn4378.1 |
5230-5439 |
Transposase |
|
+ |
APH(3')-Ia (ARO:3002641) |
Tn4378.1 |
9047-9862 |
Passenger Gene |
Antibiotic Resistance |
- |
socD |
Tn4378.1 |
10230-11348 |
Passenger Gene |
Other |
- |
tnpR |
Tn5403a |
11740-12345 |
Accessory Gene |
Resolvase |
- |
tnpA |
Tn5403a |
12440-15298 |
Transposase |
|
+ |
tnpA 3'-end |
Tn4378.1 |
15370-18119 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR |
MerR |
Tn4378.1 |
435 |
34-468 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | activator-repressor of mer operon |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLLEPDKPY GSIRRYGEAD VTRVRFVKSA QRLGFSLDEI AELLRLEDGT HCEEASSLAE HKLKDVREKM ADLARMEAVL SELVCACHAR RGNVSCPLIA SLQGGASLAG SAMP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merT |
MerT |
Tn4378.1 |
351 |
540-890 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | cytosolic mercuric ion transport protein |
Target: | Mercury |
Protein Sequence:
|
MSEPKTGRGA PFTGGLAAIL ASACCLGPLV LIALGFSGAW IGNLAVLEPY RPIFIGVALV ALFFAWRRIY RQAAACKPGE VCAIPQVRAT YKLIFWIVAA LVLVALGFPY VMPFFY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merP |
MerP |
Tn4378.1 |
276 |
903-1178 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercury transport |
Target: | Mercury |
Protein Sequence:
|
MKKLFASLAL AAVVAPVWAA TQTVTLSVPG MTCSACPITV KKAISKVEGV SKVDVTFETR QAVVTFDDAK TSVQKLTKAT ADAGYPSSVK Q
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merA |
MerA |
Tn4378.1 |
1686 |
1250-2935 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercuric ion reductase |
Target: | Mercury |
Protein Sequence:
|
MTHLKITGMT CDSCAAHVKE ALEKVPGVQS ALVSYPKGTA QLAIVPGTSP DALTAAVAGL GYKATLADAP LADNRVGLLD KVRGWMAAAE KHSGNEPPVQ VAVIGSGGAA MAAALKAVEQ GAQVTLIERG TIGGTCVNVG CVPSKIMIRA AHIAHLRRES PFDGGIAATV PTIDRSKLLA QQQARVDELR HAKYEGILGG NPAITVVHGE ARFKDDQSLT VRLNEGGERV VMFDRCLVAT GASPAVPPIP GLKESPYWTS TEALASDTIP ERLAVIGSSV VALELAQAFA RLGSKVTVLA RNTLFFREDP AIGEAVTAAF RAEGIEVLEH TQASQVAHMD GEFVLTTTHG ELRADKLLVA TGRTPNTRSL ALDAAGVTVN AQGAIAIDQG MRTSNPNIYA AGDCTDQPQF VYVAAAAGTR AAINMTGGDA ALDLTAMPAV VFTDPQVATV GYSEAEAHHD GIETDSRTLT LDNVPRALAN FDTRGFIKLV IEEGSHRLIG VQAVAPEAGE LIQTAALAIR NRMTVQELAD QLFPYLTMVE GLKLAAQTFN KDVMQLSCCA G
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD |
MerD |
Tn4378.1 |
366 |
2953-3318 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | secondary regulatory protein |
Target: | Mercury |
Protein Sequence:
|
MNAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTPGGYG LFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGDE AAAQLALLRQ FVERRREALA DLEVQLATLP TEPAQHAESL P
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merE |
MerE |
Tn4378.1 |
237 |
3315-3551 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercury transport |
Target: | Mercury |
Protein Sequence:
|
MNNPERLPSE THKPITGYLW GGLAVLTCPC HLPILAVVLA GTTAGAFLGE HWVIAALGLT GLFLLSLSRA LRAFRERE
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
urfM |
UrfM |
Tn4378.1 |
990 |
3548-4537 |
+ |
Class: | Passenger Gene |
Sub Class: | Other |
Function: | possible diguanylate phosphodiesterase |
Sequence Family: | EAL (Pfam:PF00563)||DUF3330 (Pfam:PF11809) |
Comment: | similar to urfM from E.coli |
Protein Sequence:
|
MSAFRPDGWT TPELAQAVER GQLELHYQPV VDLRSGGIVG AEALLRWRHP TLGLLPPGQF LPVVESSGLM PEIGAWVLGE ACRQMRDWRM LAWRPFRLAV NVSASQVGPD FDGWVKGVLA DAELPAEYLE IELTESVAFG DPAIFPALDA LRQIGVRFAA DDFGTGYSCL QHLKCCPIST LKIDQSFVAG LANDRRDQTI VHTVIQLAHG LGMDVVAEGV ETSASLDLLR QADCDTGQGF LFAKPMPAAA FAVFVSQWRG ATMNASDSTT TSCCVCCKEI PLDAAFTPEG AEYVEHFCGL ECYQRFEARA KTGNETDADP NACDSLPSD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn4378.1 |
561 |
4667-5227 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Function: | resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MQGQRIGYVR VSSFDQNPER QLEHVEVGKV FTDKASGKDT QRPELDSLLA FVREGDTVVV HSMDRLARNL DDLRRLVQKL TKRGVRIEFV KESLTFTGED SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKALSPEQ VADLRQRAAA GEQKAKLARE FGVSRETLYQ YLRADQ
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA non-functional |
TnpA non-functional |
Tn4378.1 |
2966 |
5230-8195 |
+ |
Class: | Transposase |
Comment: | non-functional transposase due to internal stop codon || appears to be a fusion of the first 70 codons of tnpA (Tn4378) and another unknown tnpA |
Protein Sequence:
|
MPRRSILSAA ERESLLALPD TKDDLIRYYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGILLGVDEP DEPPFRPAET VADQLKVSVE SWASTGSGSR PGASIWSSAN GVRLPALHHE PLPAGRPHAD RAGHANRQGH CAGQRLDRAS AAAVGHSACP QRRRAGERRS DHPRQPAHLL RLGRTTVGRA SPPPRRSAQA PGQRQDDLAG LAAPVTRQAQ FAAYAGAHRT TQGMAGARSA YRHRAADPPK PAAQDRPRGR PDDTRRPGQV RGAAALRDPG GARH*RHGHR HRRNHRPARP HPGQAVQRRQ EQASAAVPGV RQGDQRQGAA VRAYRSGTDR GQAIGPRSVC RHRGRHVLGR LRRERHRSAE ARAARGLRFP APHRRELRHA ASLRAGIPRR AQAAGRARCQ GCAGGHRSAA QHEQRQRPQG ARRRANRFHQ AALAEAGDDR HRHRSALLRT VRAVGDEKRP ALRRHLGAGI APVQGLRGLP GATREIRQPQ AGQRIAAGRG HRLRPVPA*P ADAAGNAARH RQPHGAGQRA AGRHHHGVGP EDHAARCGGA RHRAGADRPD SNDPAARQDH RTAAGGRRMD RLHPALRAPE IGRPGQGQEP AADHDPGRRH QPGSDQDGGV LPRNDLRQAR LAPSLAYPRR NLFVGAGRTG QCAVPASLRR ALGRRHHVIV GRPEFPNRQQ GREHWPHQPE IWQQSWADFL HPHLRPVRAI PHQGGSCRRA RLDYVLDGCC TRVRLRIEST TPIRQDSPIM YLADALLGVR LRRASATWAT PSCSSPRATP STTRSSR*LA ATD*TSRLFA PIGMKFYGWP RRSSRAR*RL R*CCASSAAI RARTAWPWPC ASWGVSSARC SSWIGCKAWS CAVACTLG*T RAKPAMRWPA PCSSTVWVKS ATAALSSSAT VPAASTW*RR PSCYGTRSIW NGLRTRCGAT ATPSMTRCCS TCRRSAGSTS T*PAITSGAA APRSARASSG RCGRCNLL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA 5'-end |
N |
Tn4378.1 |
210 |
5230-5439 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | tnpA ORF interrupted |
Protein Sequence:
|
MPRRSILSAA ERESLLALPD TKDDLIRYYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGILLGVDEP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
APH(3')-Ia (ARO:3002641) |
APH(3')-Ia |
Tn4378.1 |
816 |
9047-9862 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | APH(3') (ARO:3000126) |
Comment: | strict match to reference sequence for ARO:3002641 (bitscore: 550)||Synonyms: aphA-1, apha1-1AB, APH(3')-Ic, apha7 |
Protein Sequence:
|
MSHIQRETSC SRPRLNSNLD ADLYGYRWAR DNVGQSGATI YRLYGKPDAP ELFLKHGKGS VANDVTDEMV RLNWLTAFMP LPTIKHFIRT PDDAWLLTTA IPGKTAFQAL EEYPDSRENI VDALAAFLRR LHSIPLCNCP FNSDRVFRLA QAQSRMNNGL VDASDFDDER NGCPVEQVWK EMHKLLPFSP DSVVTHGDFS LDNLIFDEGK LIGCIDVGRV GIADRYQDLA ILWNCLGEFS PSLQKRLFHK YGIDNPDMNK LQFHLMLDEF F
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
socD |
SocD |
Tn4378.1 |
1119 |
10230-11348 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Comment: | putative santhopine-degrading protein || FAD-binding oxidoreductase |
Protein Sequence:
|
MSDKTVAKTV IVLGGGILGV STAWQLAQAG AQVTLVTEAA LCSGATGRSL SWLNSAGERS QPYHALRIAG IDRYRTLFAR HPQLDWLRFD GAIYWAADDD AGTKARHHYE KAQGYDSKLI NRATVGDVDA QVNPASLGHV AIANPGEGWV SLPHLVDHLV QAFRALGGEI IENTGKASVI TKDGRACGIR SEKHGALLAD QVLVACGPWT PEVVAPLGVQ IPNGSPVSML VTSQPCDLAP QVVLNTPRAA VRPNPGNTIA VDHDWYEEHI VAQGDGQYRI DTSVINQLMA EAGNLIGDGA PLTANSCKIG LKPIPGDGEP VVGELQKVPG CFVVFTHSGA TLALILGEML TQEMLTGVKH PMLATFRPER FS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn5403a |
606 |
11740-12345 |
- |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MGHRAAIYCR VSTADQSCER QEFDLRAFAG RAGYDVVGIF KETGSGTKLD RAERKKVLAL AQSRQIDAIL VTELSRWGRS TLDLLNTLRE LENWKVSVIA MNGMAFDLSS PYGRMLATFL SGIAEFERDL ISERVKSGLA VAKARGKRLG RQAGVRPKSD RLLPKVVAMR AEGRSYRWIA RELGISKNTV ADIVQRHRAN A
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5403a |
2859 |
12440-15298 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MSRRHIFTER QRAALFDLPT DELSLLKFYT LGDDDLENIR QRRRPENRIG FALQLCALRY PGRALAPGEM IPREILSFVG AQLGVPADAL LTYATRRQTR QQHMDTLREI YGYKTFTGRG ARDLRKWTFG QAEDARSNED LAHRFIVRCR ETSTILPAVS TIERLCADAL VAAERRIETR IVENLTADVR DHLDKLLSEM LAGNISRFIW LRNFEVGNNS AAANRLLDRL EFLRTLNINH SALASIPAHR IARLRRQGER YFTDGLRDIT SDRRWAILAV CVVEWEAAIA DAIVETHDRI VGKTWREAKR QHDETISGSK ATLADTIRTF TALGASLLEA RSDGTPLEMA VASSVAWDRL AQLVATGTQL SNTLADEPLA YVGQGYHRFR RYAPRMLRCL KLEAAPVAGP LVAAALSIGE MKGVASPERR FLRPSSKWNR HLRAQEKGDT RLWEVAVLFH LRDAFRSGDV WLAHSRRYGD LKQVLVPMIA AQENAKLAVP SNPQDWLADR KARLTIALKR LARAARNGTI PHGSIEDGTL RIDRLTADVP DGAEALILDL YRRMPSVRIT DMLLEVDAAL GFTDAFTHLR TGAPCRDRIG LLNVLLAEGL NLGLRKMAEA TNTHDYWQLS RLARWHVESE AMNQALAIVV AAQGKLPMSR VWGMGTSASS DGQFFPTARH GEAMNMVNAK YGSVPGLKAY THVSDQFAPF ACQSIPATVS EAPYILDGLL MNEVGRHVRE QYADTAGFTD HLFGASSLLG YNLVLRIRDL PSKRLYVFNP DTTPRELRKL VGGKAREDLI VANWPDIFRC AATMTAGKIR PSQLLRKLAS YPRQNNLAVA LREVGRIERT LFIIEWILDT DMQRRAQIGL NKGEAHHALK NALRIGRQGE IRDRTTEGQH YRIAGLNLLT AVIIYWNTVH LGHAVTERRN EGLDVPPEFL PHISPLAGRT FY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA 3'-end |
N |
Tn4378.1 |
2750 |
15370-18119 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | tnpA ORF interrupted |
Protein Sequence:
|
PLLKLVADQL KVSVESWAST GSGSRPGASI WSSANGVRLP ALHHEPLPAG RPHADRAGHA NRQGHCAGQR LDRASAAAVG HSACPQRRRA GERRSDHPRQ PAHLLRLGRT TVGRASPPPR RSAQAPGQRQ DDLAGLAAPV TRQAQFAAYA GAHRTTQGMA GARSAYRHRA ADPPKPAAQD RPRGRPDDTR RPGQVRGAAA LRDPGGARH* RHGHRHRRNH RPARPHPGQA VQRRQEQASA AVPGVRQGDQ RQGAAVRAYR SGTDRGQAIG PRSVCRHRGR HVLGRLRRER HRSAEARAAR GLRFPAPHRR ELRHAASLRA GIPRRAQAAG RARCQGCAGG HRSAAQHEQR QRPQGARRRA NRFHQAALAE AGDDRHRHRS ALLRTVRAVG NEKRPALRRH LGAGIAPVQG LRGLPGATRE IRQPQAGQRI AAGRGHRLRP VPA*PADAAG NAARHRQPHG AGQRAAGRHH HGVGPEDHAA RCGGARHRAG ADRPDSNDPA ARQDHRTAAG GRRMDRLHPA LRAPEIGRPG QGQEPAADHD PGRRHQPGSD QDGGVLPRND LRQARLAPSL AYPRRNLFVG AGRTGQCAVP ASLRRALGRR HHVIVGRPEF PNRQQGREHW PHQPEIWQQS WADFLHPHLR PVRAIPHQGG QCRRARLDLC ARRLAVPRVR PAHRGALHRY GRIHRSCIWP DAPAGLPLCA AHPRPGRHQA VHPQGRHRLR RAQADD*QRQ TEHQGYSRPL G*NSTAGHVD QAGHGDGFAD AAQARQLSAP ERPGRGPARA GAYRAHAVHP GLVAKRGAAP SRARWAEQGR SPQCAGPRRV LQPSG*NPRP QL*AAALPCQ RPQPGDGGRR AMEHGLSGTG CARAAGQRPR RR*RAVAVPV AARLGAHQPD RRLPLAQQRQ DRRGQVQAAA AVATCL
|
|
Internal Transposable Elements (TE) |
|
|
TnCentral Accession |
TE Name |
Type |
Coordinates |
Length |
Tn4378.1-EU287476.1 |
Tn4378.1 |
Transposon |
1-18151 |
18151 |
Tn5403a-EU287476.1 |
Tn5403a |
Transposon |
11707-15369 |
3663 |
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
IRL |
Tn511 |
1-38 |
GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAG |
IR |
Tn4378.1 |
8191-8228 |
GAATCGCACG AAATAAAAGG CAAAAGACTC TGCTGGGG |
IRL |
Tn5403a |
11707-11752 |
GGGGTCGGTT CCGGCTGAGG GCGAAATGAC ACCCTAAGCG TTAGCT |
IRR |
Tn5403a |
15324-15369 |
TGGCTTTCGA ATCCCACAGT AAAGCGGGAG TCGGCCTTGG CTGGGG |
IRR |
Tn4378.1 |
18114-18151 |
GAATCGCACG AAATAAAAGG CAAAAGACTC TGCTGGGG |
|
References |
|
|
1. | Petrovski S, Stanisich VA. Embedded elements in the IncPbeta plasmids R772 and R906 can be mobilized and can serve as a source of diverse and novel elements. Microbiology (Reading). 2011 Jun;157(Pt 6):1714-1725. doi: 10.1099/mic.0.047761-0. Epub 2011 Mar 10. PubMed ID: 21393370
| | 2. | Coetzee JN. Mobilization of the Proteus mirabilis chromosome by R plasmid R772. J Gen Microbiol. 1978 Sep;108(1):103-9. doi: 10.1099/00221287-108-1-103. PubMed ID: 357678
| |
| | |
|
|