|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
|
|
|
|
|
|
|
|
|
Name: Tn5044 |
|
Family: Tn3 Group: Tn3 |
|
Evidence of Transposition: yes |
|
|
Host |
|
|
Host Organism: | Xanthomonas campestris TAP44-3 | | |
Place of Origin: | Kamchatka peninsula, Russia | Date of Isolation: | 2000 |
| | Other Geographic Information: | t |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTTGGG GAGCAATGGA ACCAAAAACC AACGTAAGCC CTACCACCTG CGCTCGGTTC CTTCCTCCCA GGCATCTGCT TCGATAAAGC AGTTGCGCAT 100
ATCGGCTTCT TGGCTGATCT CGGTCAGTAG GTCATGCACG GTCTTGTCCA GCTCGTCGTC GCTCCGATGC GGAATGGTCA GCTCGTAATG GCCGGTCTCC 200
AGCCGCTTCA TGCCATAGGG CTCCAGGCAG TAGCGCTCGA TGTTCTCCGT GGCTCGCTTC CGGCCGCGCA CGAACTTGCT GTTGTTCACC ACCGCCAGGC 300
GCAGTGTGAC GGTGGCCACC CGCTCGACGG CGGGCTCTAC CGGTGACGCC ACATTGAGCT GCTGGCCTCT CGGCTGGGCA CTCTTTTGGT ACGTCCCGAT 400
ATCGACGCCA CGGTGGCGCA GGTAGCTGTA CAGCGTGCTC TTTGAAATAT GCAGCTTCTC GCCGATGGCA CTGACGCTCA GCCGCCCCTC GCGGTACAGC 500
GTTTCCGCCG CCATGGCGGT GGCCTCGGCC TTGGCGGGCA AGCCCTTGGG GCGGCCACCG ATCCGGCCCC GCGCCCGTGC GGCCGACAGG CCCGCATGAG 600
TCCGCTCGCG GATCAGCTCG CGCTCGAACT CGGCCAGCGA GGCGAACAGG TTGAACACCA GGCGGCCTTG GGCGTGGGTG GTGTCGATGG GGTCGTTCAG 700
GCTCTGCAAG CCGACCTTGC GCTCTGCCAG CTCGCCGACC AGCTCGACCA GGTGCTTGAG CGAGCGCCCG AGGCGATCCA GCTTCCAGAT CACCACGGCA 800
TCACCCGGCC GCACGTTGGC CAAGAGTTTG TCGAGTTCCG GCCGGGCGCT TTTCGCGCCG CTGGCGATGT CTTGGTAGAT GCGTTCGCAT CCGGCCTGTT 900
TCAGGGCATC GACCTGGAGG TCTGCTTTCT GATCCCGAGT GCTCACTCGC GCATAACCGA TCTTCATCAA AAGTACCGTT TACTCGACTA CGTTAGTAAT 1000
AGTTGAACTT TGATTAAGCG TACCAGTTAT TTGAACCGTA GCGCGGGCAG GTTAACGAAC CGAGCCATTC CTCGATAGAG TTCGGCAAAA CCTTCGTTTT 1100
GTCGAACCAT TGCATCGCCC TTTGCGGTTG GCGTATAAAC ACACAAACAC CTATTAGCGG AGAACGCCAT GAATACCATT CGCTGGAATG TCGCCGTCTC 1200
GGCCGACACC GACCAGTCGC TTCGGATGTT TCTGGCCAGC CAGGGCGGTG GCCGGAAAGG CGACCTGTCG CGCTTCATCG AAGAGGCAGT ACGCGCCCAC 1300
ATCCTGGAAC TGAGCGCAGA GCAGGCCAAG GCCGTCAACG CCCATCTGAG TGAGGCAGAA TTGACCGACG CGGTTGACGA AGCACTCGCC TGGGCAAGTA 1400
AGCGCTGATG CGGGTCGTGT TGGATACCAA CATCCTGTTC AGCGCCCTGA TCTCGCCGCA TGGCGCACCC GATGCAATCT ACCGTGCTTG GCGCGCCTCG 1500
CGTTTCGAGG TGGTGACCTC GCGGATGCAG CTCGATGAAA TTCGTCGGGC CAGCCGCTAT CCCAAGCTTC AGGCCATCCT ACAGCCCGCC AAGGTGGGCG 1600
CCATGATCAA TAACCTTCAA CGGGCGGTGG TTCTGGAGCG TCTGACCATC GAGGTCGAAG CCGACGATCC AGATGATTCG TTTCTGCTGG CCATGGCCTT 1700
GGCGGGCGAT GCGGACTACC TGGTCACCGG TGATCGCCGC GCTGGCCTAC TGCAACGAGG GCACATCGAA CGCACGCGAA TCGTCACGCC CGCCGTGTTC 1800
TGCGCCGAGG TGCTGTGATC GATGCCGGTC GGTTTTTTGA CGCAGGAACA GCGCGACGCC TTTGGTCGCT ATGTCGATGC ACCCAGCCGT GAGGAGCTGG 1900
AACGTTACTT CCACCTGAGT GATGAAGACC GTGAAGCCAT CCAGGTGCTG CGGGGTAACC ATAACCGTCT GGGCTACGCC GTTATGCTGA CCACCGTCCG 2000
CTTCGTGGGG GTTCTGCCGG ACAAGCCCGC CGCTGTGCCG GTGGAAGTCC GGCAGGTGCT TTGCCGGCAA CTGGCAATCG CCGATCCTGA CTGCCTTCAG 2100
CGCTATAGTG ATCATCGCCG CTGGATACAT GCGGCCGATA TTCAAGAACG CTATGGCTAT CGTCACTTTA CTGACCCTGG CGTCGGCTTC CGCTTGAGCC 2200
GGTGGCTGTA TGCCCTCTGC TGGACGGGCA CTGATAGACC GGGTGTGCTG TTTGAGCGAG CCACCTCATG GTTGTTCACA CAGAAAGTCC TCCTGCCCGG 2300
CGTTTCTCAA CTGGAACGCT TCATCGCCCA ACTGCGTAGC CGGGTCGAAG AGCGCCTCTG GTACACGCTG GGCCGCAGCG TGACTGAGGA ACAGAGACAG 2400
CATCTGCAAG ACTTGCTCCT GGTGGCCGAA GGCAACCGCA GTTCCCGGCT GGATCAACTG CGCTCCGGCC CGGTGATGAT CAGTGGCCCT GCACTGGTCC 2500
GGGCGCTGCG TCGGCTCGAT GATGTGCGTG GGTTGGGCAT TACTTTGCCG GCAGCGGCGC ATATCCCGCC CAGTCGTATC GCCGCCCTGG CCCGCTTCGC 2600
CAATACCGCC AAGGTCACGG CCATCAATCG ACTGCCGGCG TCGCGCCGGC TGGCCACATT GGTGGCGTTC GCGGTCTGTC TGGAAGCCAG TGCGCACGAC 2700
GATGCCCTGG AAGTGCTGGA GGCGCTGTTG CGCGACCTCT TCAACAACGC GGAGAAGGCC GACAAGAAAG CCCGGCTGCG TAGCTTGAAA GACCTGGATC 2800
GTTCGGCGGC GACACTCGCC ACTGCCTGCA AGGTTGTGCT GGACGCCTCG ATCAGCGATG ACAATGTGCG TGCCCGGCTG TTCAACGACT TGCCGAGGGC 2900
CACCTTGGAG AAGGCCCTGG AAGAGGTCAA CGCGCTGATT CGCCCAGCCG ACGATGTGTA CTTCTTGGCC CTGGCGGCGC GCTACCGCAG CGTTCGCCGT 3000
TTCCTACCGA ATCTACTCAG CCACATCCGC TTTGGCTTCA GCCCGGCCGG CAAGGGCGTG GCGGCTAGCC TGGATTGGTT GCAACTGAAC CTGCCGCGCA 3100
GGAAACCAGA GGATGACGCA CCACAGGAGA TCGTGGCCAA GGCTTGGCAG AATCACATCA CCCGCGAAGA TGGGTCGCTC GACATGGGTG CCTATGTGTT 3200
CTGTACGCTC GACGCTCTGC GTACCGCGCT ACGCCGCCGC GATGTCTTCG TCTCGCCCAG TTGGCGCTAT GCCGATCCCC GCATCGGCCT GCTCGGCGGT 3300
GCCGAATGGC TGGCGGCGCG ACCGATCATT TGCCGCTCGC TGGGCCTGAC CATTGACGCC GGCACCACCT TGGATGCGCT GAGCGCTGAG CTGGATGCGA 3400
CCTGGCAGAC GGTGGCCGCA CGCCTGCCCG ACAACCCCGC GATCCAACTG AGCGAGAACA CCGAGGGCAA GACCGAACTG TCGCTCGGGG CGCTGGACAA 3500
GCTGGACGAG CCCAGTTCGT TGTTGCAACT GCGGGCGGCG GTGGCCGACT TGATGCCGCG TGTCGATCTG CCGGAAATCC TCCTGGAGAT CGCCGCCCGT 3600
ACAGGCTTCG CCGAGGCGTT CACTCATGTG TCCGAGCGCA ACGCACGGGC CGACAACCTA GTCACCAGCC TCTGCGCGGT GCTGCTGGGT GGGGCCTGCA 3700
ACACCGGCCT GGAACCCTTG ATCCGTGCCG ACAACCTGGC GCTGCGCCGT GACCGACTGT CCTGGGTCAG CCAGAACTAT ATCCGCGACG ACACGTTGTC 3800
AGCGGCCAAC GCCATTCTGG TGGGTGCGCA AAGCCAACTG GAGCTGGCCC AGGTCTGGGG TGGTGGTGAG GTCGCTTCCG CCGACGGCAT GCGCTTCGTC 3900
GTACCGGTGC GCACCGTGCA TGCCGGCCCC AACCCGAAAT ATTTCGGTAC CGGCCGGGGC GTCACCTGGT ACAACCTGAT TTCCGACCAG TTCTCCGGCC 4000
TCAATGCCAT CACCGTGCCC GGCACGCTGC GCGACAGCCT GGTACTGCTG GCTGTTGTGC TGGAACAGCA GACCGAGTTG CAGCCGACGC AGATCATGAC 4100
CGACACCGGA GCCTACAGCG ATGTGGTGTT CGGGCTGTTC CGCCTGCTTG GCTACCACTT CAGTCCGCGG CTGGCCGATG TCGGCGGTAC CCGCTTCTGG 4200
CGCACGCGCC CGGACGCGGA CTACGGCAAG CTCAACGGGC TCGCCCGGCA GTCGGTCAAG CTCGACCTGA TCGCCGAGCA CTGGGATGAT CTGCTGCGCC 4300
TGGCCGGTTC ACTCAAGCTC GGCCGAGTGC CGGCGACCGG CATCATGCGC ACCCTGCAAA CGGGAGATAG ACCCACCCGG CTGGCCCAGG CGCTGGCCGA 4400
ATTCGGACGG ATCGAGAAAA CCCTGCACAC GCTGACCTAC ATCGATAACG AGTCCAAGCG CCGCGCCACC CTGACCCAGT TGAATCGAGG CGAAGGCCGG 4500
CACAGCCTTG CCCGCGCGGT GTTTCACGGC AAGCGCGGCG AGCTTCGCCA GCGCTACCGC GAAGGCCAGG AAGACCAGCT CGGTGCTCTG GGCCTGGTGG 4600
TGAACATTAT CGTGCTGTGG AACACCCTCT ACATGACAGC TGCCGTGGAA CGGCTAAGAC AGCACGGCTA CCCGGTGTTG GAAGAGGATT TGGCCCGGCT 4700
GTCACCGCTG ATCTACGAGC ACATCAACAT GCTCGGGCGG TATTCCTTCG CGGTACCGGA TGAGGTTGCA CGCGGCGAGT TGCGGCCGTT GCGTAATCCA 4800
GAGGATGACC TGTAGGTGGA TAGACAGGGC GCTTCTTTAA TCGGCCGAGC CAGGACTGTT GTCGGCAGGC GCGGTGTCGG ATACGGCCCT GGCCTGCGCA 4900
CGAGCCTGGA AACGCTGATA GCACTCCAGT CCGCAGAAGT GCAGGACGTA CTCGCTACCT TCCGCGGTTA AGGCCGCGTC GAGCGGGATT TCCTTGCAAC 5000
ACTCGCAGCA GGTTGTGCAG TCACTGACAG TTGGATCGGA TGTGGCGTTC ATGACGGTAC TCCTTCAGGG AGTGGCGATC GAGGAGGCAC ACGGTTCGAG 5100
CGTAGATTTC GGAATGCCGC AAGTGTTTCT CTGTATCCCA CTTCGATCAA TGCAGCGGTT TGATTAAAGT CGAGCAACCC GCCTGGTTGC TGAAAGTGCG 5200
GGCGTACCAG CTGTATGCCG ATTCCCCGTG CCCTGTGCCG TTGCAAACTG CACAGGTATT TGGTGTCTAG GGCAATCATG AAGCTGCGTA GCAGTATCTT 5300
CAGCGGCGAA GCCAGCGCAG TCGAATCCCG TGGACAACAA GTGCATGAGA TCATCAGTAT CTCGTGAGCA CCGCGATCCA GGGCCTTATC GAACGGCACG 5400
TTATCGGTTA CACCGCCATC CACATGCTGA TCCCCCGCAA TCCAGACCGG CGGGAAAACA CCGGGCAAGC TCATGCTGGC AATCATCGGT TCGATCAGGA 5500
CGCCGTGCCG TTCGTGATAG CAAGGCTGAC CGCTCTGCAG ATTGGTGGTC GTGATGACCA GCGGATGCTT GAGCTCCTCG AAACGCTTGA CGGGCAGCGC 5600
CTCATGGAGA AGGCGTCGTA GCGGGGCGAA CGAATAAAAA CCGCCTGTTT CGCCCAGTAA TCCCCGCCAG TTCCAGGAAA CGACATCAGG TTTTCGCAGG 5700
CCTTGCCATA GCTCAGACAG CTCAGCCGGC GACATGCCGG CTGCTATCAG GGTGCCGTTC AAGGCCCCCA CCGAACTCCC AACCACCAGA TCGAACGACA 5800
ATCCCAGTTC GATGATCGCT TGATAGAAAC CGACTTGCAT CGCACCTCGT GCGCCACCCC CACACAAGAC CAGTGCGGTA TGCGCCGAAG AAGCCTCGAC 5900
GTTCATGAGT CCTCCGAATG TACAGCACGC CAGGTCAACA CCCCCGAAAC GCTGAATACG GTGATCATCG TCACTGCGGC GATGCCCCAA TGTGCGCTGA 6000
GGAAAGCGCC GGCGGTCGTG CCAGCCAACA CAATGACGAG CACCGGTAGA TGGCAGGGAC AAGTGAGGGC GGCTAGCACC CCCCATGCGT AAGCCCGCCA 6100
GCGACTCGCT GGGGGCGAAC TGGCTGGGCT ACGCATGGCG GCTTTCCGGT TCAGCGTGAG GAAGCCCGGC CGCCAGTTGC CGGTCCAGTG CCACCAAGGT 6200
TTCGCGTCGC GCCGCGATCA ACGAACGCAG ATGCGCCCGA CACTCATCGA TATCGCCGCC CCCACCGTCT AGCGACTGAC ACCAGCGCGC CAACTCTCTC 6300
AGATCGATGC CCGCCTCAAA GGCCGTGCGC ACGAAGCGCA GCCGATTCAG CGACTGCGCA TCGAAAATGC CATAACCGCT GTCGGTGCGC CGCACAGGGT 6400
GCAGCAGTCC GCGCAACAGG TAGTCGCGCA CCACGTGCAC GCTGACGCCG GCCTGTTCGG CTAACTTGGA AATCTTGTAA CTGCTCATGC TTGTACTCCT 6500
TCAGACAGCC ACCCCCTCAA TGGATCGATG AGTGGGGGCT GCCTGGATAT ACAGCGGCAC TCCCCTTGTC CCTGATAGGC GTTCAACACA CGCCGCAACG 6600
CGGAGCGGGC CCGGTGCAGC AGCACGCGCA CATTGCCGTT GGAGGTCGCC AGGTGCACGG CAATGTTGTC GAGTTCCATG CCTTCCATGT CCCGCAGCCA 6700
ATAGGCGCGC TGCTGCGCGC CTGGCAGCGC GGCGATGGCA CGACCCAGAC ACTGGGCCGA CTCGGCGGAG GACAACAGCG CTTCCGGCCC CTCCTGATGC 6800
CAAGGGCCTG GAGTCGTCGG CCCACGCTCA GCCGCATCGA AGCGCAGGCT CTTGCCGCCG TCGACGCCTG GGAATACCGC CTCCCAAGGC ACGTAGCGCC 6900
GCTCCCGGCC CAACCGGGCC AGTGCCTGGT TGCGCACGAT TGCCCAGAGC CAGGTCTTGA GGCTGGCCCG GCCCTCGAAC CGGGGCAGCG CCCGATAGGC 7000
GGCGAGCCAG GCCTCTTGCA CGGCGTCGTC GGCCCAGGCT TCGCCGACGA TACAGACGGC CAGCGCTCGT AGCGCCCGAT GATGCTCACG CACCAAGGCC 7100
GTGAATGCCT CCCGGTCGCC GGCCCGCAAG CGCGACAGCA ACAGGGCCGC TGCGCTGAGC CCATCGGGTT GTACCGCCGC GCCGTAAGTG GTGACAGTGG 7200
CCCGGAAGGC CATACGAGGG GCCACTGTGC ACCTCCTCGG TCGTTGCTGC CTCGCAGAAG AAATAACCCT GCCGCCTGTC GCCACGAGAG CAGCCGATGC 7300
GTTCAGGCGA ACTTCGGGTT TCCTGTTGCG CAGCCTAGCC GGCGCAGCAG GAAAGCTGTT TGACGTCCTT GGTGAAGGTT TGCGCGGCGA GCTTCAGGCC 7400
CTCGACCATA GTCAGGTAGG GGAACAACTG GTCGGCCAGT TCCTGTACCG TCATCCGGGC GCGGATGGCG ATGACGGCGG ACTGGATCAG TTCACCCGCT 7500
TCGGGAGTGA CCGCCTGCAC GCCCAGCAAC CGGCCTGAGC CTGCTTCGGC GCACCAGCTT GATGAACCTC GTGTGTCGAA GTTGGCCAGT GCACGCGGCA 7600
CGTTGTCCAG AGTGAGTGTT CGGCTGTCCG TTTCGAGGCC GGCGTTATGC GCTTCGGCTT CGCTGTAGCC GACGGTGGCC ACTTGGGGAT CGGTGAACAC 7700
CACGGACGGC ATGACGTCGA GATTGAGTGT CGCCTCGCCG CCGGTCATGT TGATCGCCGC ACGGGTGCCG GCCGCCGCCG CCACATAGAC GAACTGCGGC 7800
TGGTCGGTGC AGTCGCCAGC CGCATAGATA TCCCGTGCGC TGGTACGCAT GCCCCGGTCG ATCTGGATGC CACCGCGCTC GTCCAGCGTC ACGCCGGCGC 7900
CGTCCAGGTT CATGCCTTGG GTGTTCGGCA AGCGCCCGGT GGCAATGAGC AGTTGGTCGG CACGCAGCTC ACCATGGTTG GTGCTCAGTA CGAACTCGCC 8000
GTTGGGCTGG GACACCTGGC TGGCTTGCGT TTGCTCCAGC ACCTCGATGC CCTCCATACG GAAGACCTCC GTCACCGCAG CACCTATGGC CGGGTCTTCG 8100
CGAAAGAACA TCGAACTGCG TGCCAGGATC GTGACGCGGC TACCTAGCCG GGCGAAGGCC TGCGCCAGTT CCACCGCCAC TACGGAGGCG CCGATCACGG 8200
CGAGCCGTTG CGGAATGCTG TCGCTCACCA GCGCCTCGTC TGAGGTCCAG TACGGCGTGT CTTTCAGCCC TGGAATAGAC GGAACGGCCG CGCTTGCGCC 8300
GGTGGCAACC AGGCAGCGGT CGAAGGCTAC GATGCGCTCG CCACCCTCGG CCAGTTCCAC GCTGAGCGTG TGGCCATCCT GGAAACGGGC GGTGCCACGC 8400
AGCACGCTGA TGGCTGGGGT GCTCTCCAGA ATGCTTTCGT ATTTGGCGTG GCGCAATTCG TCGACACGGC CTTGCTGCTG CGCGAGCAGT CGCTCGCGCA 8500
GGACTGTCGG CGTCGTGGCG GACAATCCGC CGTCGAACGG ACTCTCGCGG CGCAGATGGG CCACATGCGC GGCACGAATC ATGATCTTCG ACGGCACGCA 8600
GCCCACGTTG ACGCAGGTTC CGCCGATGAT GCCGCGCTCG ATCAGGGTGA CGCGGGCACC GCCTTCCACC GCCTTGAGCG CCGCTGCCAT GGCGGCGCCA 8700
CCGCTGCCGA TGACCGCAAC GTGGAGCCTG ACGTCCTCTT TCCCGGCGCC AATGCCACCG CTCAACCAGC CCGGCGCCTG GTCGAGCAGG CCCGGACGGG 8800
GTTGTAGCGT GGCGTCCTCG AACGCCGCTC GATACCCCAG AGCCTCGACG GCGGCCTGCA TTTGTTCATG GTTTACACGC TTGTCGACCG TCAATTCGGC 8900
CTTCCCGCTG GCGTAGGAAA CATCCGCCTG GTGCACGCCG GGAATCTTCT CCAAGGCATC TTTTACATGG TCGGCACAGG ATGCGCAGGT CATGCCGCTG 9000
ATTTGCAGAG TGGTCATAGC GTTCAATTCC TCAAATAGGG CTCTTGTCGG CATCAGTGTT TCTCAGGTGG CGGACAGGAA TTCGATGCGC AACGGCGGTG 9100
GGCGGGCGAC AGCAGATCCC AGATGGCCTC CCCAAGCATG AGGGCCAGCC CCGTGTAAAA CAGGCCGGCC GTCCACCAGC GGCCAAAGAA CAAGTACAGA 9200
GCGGCCAACA CGATGGTCGG CCCGGTTATA CCGAGCAGGC TGCGCTGCCA TTGGCGATGG CTGAACCAGC CCAGGGCATT TGCCAGCAAG GCGACGACGG 9300
CAAACAGCGG CAGCAGCGTG GTGATGAACA GGCCCTCGTA CTGACCTAGA AAGCCCAGTC CAATGGCCGC CCCCAGGCTG GCCATGGCAG GGAAGCACAC 9400
GGCGCAGCCC ATGGCGGAAA CAACGCTGCC CACAACACCG GTCTTGTCAG CGATTCGTGT GATAAGCCCC ATGATTCGAC CCCTGCCTGA TTACGCTCGG 9500
GCTACTGTTT GACGCTGGAC GGATAGCCGG CGTTCTCGGT TGCTTTGGTC AGGGCCTGGG CATTGGTCTT GGCATCGTCG AAGGTCACTG AGGCTTCGCG 9600
CGTATCGAAG CTCACGACGA TATTGCTCAC GCCCTCGACC TTGGAAAGCG CCTTCTTGAC CGTGATCGGG CAGGCGGCGC AGGTCATGCC AGGCACCGAC 9700
AAGGTGACGG TCTGACTGGC AGCCCACACA GGGGAAACAA AGGCAATGAG AGCGAGAGAG GCAAACAGCT TTTTCATGAT GAACTCCAGA TCAGTAGAAC 9800
AAGGGGAGGA TGTAGGGGAA TCCGAACGCG ACCAGCACCA ATGCCGCCAC GACCCAGAAA ATCAGCTTGT AGGTAGTGCG CACCTGCGGA ATCGCGCAGG 9900
CCTCGCCGGG TTTGCAGGCT GGTAGTGGGC GGAAAATGCG ACGGTAGGCG AAGAACAGCG CCACCAGCGC CGCGCCGATG AAGAGCGGGC GGTAAGGTTC 10000
CAGCACGGTC AGGTTGCCGA TCCAGGCCCC GCTGAACCCC AGTGCGATCA GAACCAACGG CCCAAGACAG CAGGTCGAGG CGAGAATCGC CGCCAGCCCC 10100
CCGGCGACAA GCGGGGCGCG CCCGTTCGAT GGTTCAGACA TGCATTTCTC CTTTTGAGCA TTTGATCGAT GGGTTAAGGT TACTTCCGTA GTCATGTACG 10200
GGATCAAGCG CTATGGAGAA CATTTTGGAA AGCCTGACCA TCGGCACCTT CGCTAAGGCG GCTGGGGTCA ACGTGGAAAC CATCCGGTTC TATCAGCGCA 10300
AGGGGCTGTT GCCCGAACCG GACAAGCCCT ACGGCAGCAT TCGCCGTTAC GGCGAGGCGG ATGTCGCACG GGTGAAATTC GTCAAATCTG CTCAGCGGCT 10400
GGGCTTTAGC CTCGATGAAG TGGCCGGACT GTTGAGGCTG GATGACGGCG CTCACTGCGA CGAAGCGCGC GTGCTCGCGG AACAGAAGCT CGAGGATGTG 10500
CGCGGGAAAC TTGCGGATCT GCAGCGGATC GAGTCGGTCT TGGCACGGCT GGTCCATGAC TGTTGTGCGA GCCAACCCAC TATTACCTGC CCGCTGATCG 10600
TTTCGCTGCA TGGGGAATAG CGAGCCGGCA TGAAGCGGCC GGAGACCGGT CATTACGAGC TGACTATTCC GTATCGGAGC GACGACGAGC TGGACAAGAT 10700
TGTGCATGAC CTGCTGACCG AGATCAGCCA GGAAGCCGAC ATGCGCAACT GTTTTGTCGA GATGGACGCC TGGGAAGAAG GCACGGAACG GCGCTGGTAG 10800
GGCTTACGTT GGTTTTTGGT TCCATTGCTC CCCAAACCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res |
972-1107 |
136 |
AGTACCGTTT ACTCGACTAC GTTAGTAATA GTTGAACTTT GATTAAGCGT ACCAGTTATT TGAACCGTAG CGCGGGCAGG TTAACGAACC GAGCCATTCC TCGATAGAGT TCGGCAAAAC CTTCGTTTTG TCGAAC |
res_site_III |
982-1023 |
42 |
ACTCGACTAC GTTAGTAATA GTTGAACTTT GATTAAGCGT AC |
res_site_II |
1025-1060 |
36 |
AGTTATTTGA ACCGTAGCGC GGGCAGGTTA ACGAAC |
res_site_I |
1080-1107 |
28 |
GTTCGGCAAA ACCTTCGTTT TGTCGAAC |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tnpR |
Tn5044 |
41-967 |
Accessory Gene |
Resolvase |
- |
RHH_6 |
Tn5044 |
1169-1408 |
Passenger Gene |
Antitoxin |
+ |
PIN_3 |
Tn5044 |
1408-1818 |
Passenger Gene |
Toxin |
+ |
tnpA |
Tn5044 |
1822-4815 |
Transposase |
|
+ |
phospholipase |
Tn5044 |
5049-5906 |
Passenger Gene |
Other |
- |
merD |
Tn5044 |
6129-6488 |
Passenger Gene |
Heavy Metal Resistance |
- |
sigY |
Tn5044 |
6485-7225 |
Passenger Gene |
Heavy Metal Resistance |
- |
merA |
Tn5044 |
7335-9017 |
Passenger Gene |
Heavy Metal Resistance |
- |
merC |
Tn5044 |
9053-9472 |
Passenger Gene |
Heavy Metal Resistance |
- |
merP |
Tn5044 |
9502-9777 |
Passenger Gene |
Heavy Metal Resistance |
- |
merT |
Tn5044 |
9791-10141 |
Passenger Gene |
Heavy Metal Resistance |
- |
merR |
Tn5044 |
10213-10620 |
Passenger Gene |
Heavy Metal Resistance |
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn5044 |
927 |
41-967 |
- |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MKIGYARVST RDQKADLQVD ALKQAGCERI YQDIASGAKS ARPELDKLLA NVRPGDAVVI WKLDRLGRSL KHLVELVGEL AERKVGLQSL NDPIDTTHAQ GRLVFNLFAS LAEFERELIR ERTHAGLSAA RARGRIGGRP KGLPAKAEAT AMAAETLYRE GRLSVSAIGE KLHISKSTLY SYLRHRGVDI GTYQKSAQPR GQQLNVASPV EPAVERVATV TLRLAVVNNS KFVRGRKRAT ENIERYCLEP YGMKRLETGH YELTIPHRSD DELDKTVHDL LTEISQEADM RNCFIEADAW EEGTERRW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
RHH_6 |
RHH_6 |
Tn5044 |
240 |
1169-1408 |
+ |
Class: | Passenger Gene |
Sub Class: | Antitoxin |
Function: | Antitoxin |
Sequence Family: | RHH_6 (Pfam:PF16762) |
Protein Sequence:
|
MNTIRWNVAV SADTDQSLRM FLASQGGGRK GDLSRFIEEA VRAHILELSA EQAKAVNAHL SEAELTDAVD EALAWASKR
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
PIN_3 |
PIN_3 |
Tn5044 |
411 |
1408-1818 |
+ |
Class: | Passenger Gene |
Sub Class: | Toxin |
Function: | Toxin |
Target: | single stranded RNA |
Sequence Family: | PIN_3 (Pfam:PF13470) |
Comment: | Pfam PF13470.5 |
Protein Sequence:
|
MRVVLDTNIL FSALISPHGA PDAIYRAWRA SRFEVVTSRM QLDEIRRASR YPKLQAILQP AKVGAMINNL QRAVVLERLT IEVEADDPDD SFLLAMALAG DADYLVTGDR RAGLLQRGHI ERTRIVTPAV FCAEVL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5044 |
2994 |
1822-4815 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPVGFLTQEQ RDAFGRYVDA PSREELERYF HLSDEDREAI QVLRGNHNRL GYAVMLTTVR FVGVLPDKPA AVPVEVRQVL CRQLAIADPD CLQRYSDHRR WIHAADIQER YGYRHFTDPG VGFRLSRWLY ALCWTGTDRP GVLFERATSW LFTQKVLLPG VSQLERFIAQ LRSRVEERLW YTLGRSVTEE QRQHLQDLLL VAEGNRSSRL DQLRSGPVMI SGPALVRALR RLDDVRGLGI TLPAAAHIPP SRIAALARFA NTAKVTAINR LPASRRLATL VAFAVCLEAS AHDDALEVLE ALLRDLFNNA EKADKKARLR SLKDLDRSAA TLATACKVVL DASISDDNVR ARLFNDLPRA TLEKALEEVN ALIRPADDVY FLALAARYRS VRRFLPNLLS HIRFGFSPAG KGVAASLDWL QLNLPRRKPE DDAPQEIVAK AWQNHITRED GSLDMGAYVF CTLDALRTAL RRRDVFVSPS WRYADPRIGL LGGAEWLAAR PIICRSLGLT IDAGTTLDAL SAELDATWQT VAARLPDNPA IQLSENTEGK TELSLGALDK LDEPSSLLQL RAAVADLMPR VDLPEILLEI AARTGFAEAF THVSERNARA DNLVTSLCAV LLGGACNTGL EPLIRADNLA LRRDRLSWVS QNYIRDDTLS AANAILVGAQ SQLELAQVWG GGEVASADGM RFVVPVRTVH AGPNPKYFGT GRGVTWYNLI SDQFSGLNAI TVPGTLRDSL VLLAVVLEQQ TELQPTQIMT DTGAYSDVVF GLFRLLGYHF SPRLADVGGT RFWRTRPDAD YGKLNGLARQ SVKLDLIAEH WDDLLRLAGS LKLGRVPATG IMRTLQTGDR PTRLAQALAE FGRIEKTLHT LTYIDNESKR RATLTQLNRG EGRHSLARAV FHGKRGELRQ RYREGQEDQL GALGLVVNII VLWNTLYMTA AVERLRQHGY PVLEEDLARL SPLIYEHINM LGRYSFAVPD EVARGELRPL RNPEDDL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
phospholipase |
Phospholipase |
Tn5044 |
858 |
5049-5906 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
MNVEASSAHT ALVLCGGGAR GAMQVGFYQA IIELGLSFDL VVGSSVGALN GTLIAAGMSP AELSELWQGL RKPDVVSWNW RGLLGETGGF YSFAPLRRLL HEALPVKRFE ELKHPLVITT TNLQSGQPCY HERHGVLIEP MIASMSLPGV FPPVWIAGDQ HVDGGVTDNV PFDKALDRGA HEILMISCTC CPRDSTALAS PLKILLRSFM IALDTKYLCS LQRHRARGIG IQLVRPHFQQ PGGLLDFNQT AALIEVGYRE TLAAFRNLRS NRVPPRSPLP EGVPS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD |
MerD |
Tn5044 |
360 |
6129-6488 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | second regulatory element |
Protein Sequence:
|
MSSYKISKLA EQAGVSVHVV RDYLLRGLLH PVRRTDSGYG IFDAQSLNRL RFVRTAFEAG IDLRELARWC QSLDGGGGDI DECRAHLRSL IAARRETLVA LDRQLAAGLP HAEPESRHA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
sigY |
SigY |
Tn5044 |
741 |
6485-7225 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | RNA polymerase sigma factor-like protein |
Protein Sequence:
|
MAPRMAFRAT VTTYGAAVQP DGLSAAALLL SRLRAGDREA FTALVREHHR ALRALAVCIV GEAWADDAVQ EAWLAAYRAL PRFEGRASLK TWLWAIVRNQ ALARLGRERR YVPWEAVFPG VDGGKSLRFD AAERGPTTPG PWHQEGPEAL LSSAESAQCL GRAIAALPGA QQRAYWLRDM EGMELDNIAV HLATSNGNVR VLLHRARSAL RRVLNAYQGQ GECRCISRQP PLIDPLRGWL SEGVQA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merA |
MerA |
Tn5044 |
1683 |
7335-9017 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MTTLQISGMT CASCADHVKD ALEKIPGVHQ ADVSYASGKA ELTVDKRVNH EQMQAAVEAL GYRAAFEDAT LQPRPGLLDQ APGWLSGGIG AGKEDVRLHV AVIGSGGAAM AAALKAVEGG ARVTLIERGI IGGTCVNVGC VPSKIMIRAA HVAHLRRESP FDGGLSATTP TVLRERLLAQ QQGRVDELRH AKYESILEST PAISVLRGTA RFQDGHTLSV ELAEGGERIV AFDRCLVATG ASAAVPSIPG LKDTPYWTSD EALVSDSIPQ RLAVIGASVV AVELAQAFAR LGSRVTILAR SSMFFREDPA IGAAVTEVFR MEGIEVLEQT QASQVSQPNG EFVLSTNHGE LRADQLLIAT GRLPNTQGMN LDGAGVTLDE RGGIQIDRGM RTSARDIYAA GDCTDQPQFV YVAAAAGTRA AINMTGGEAT LNLDVMPSVV FTDPQVATVG YSEAEAHNAG LETDSRTLTL DNVPRALANF DTRGSSSWCA EAGSGRLLGV QAVTPEAGEL IQSAVIAIRA RMTVQELADQ LFPYLTMVEG LKLAAQTFTK DVKQLSCCAG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merC |
MerC |
Tn5044 |
420 |
9053-9472 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | mercury transporter |
Protein Sequence:
|
MGLITRIADK TGVVGSVVSA MGCAVCFPAM ASLGAAIGLG FLGQYEGLFI TTLLPLFAVV ALLANALGWF SHRQWQRSLL GITGPTIVLA ALYLFFGRWW TAGLFYTGLA LMLGEAIWDL LSPAHRRCAS NSCPPPEKH
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merP |
MerP |
Tn5044 |
276 |
9502-9777 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | Periplasmic mercury ion-binding protein |
Protein Sequence:
|
MKKLFASLAL IAFVSPVWAA SQTVTLSVPG MTCAACPITV KKALSKVEGV SNIVVSFDTR EASVTFDDAK TNAQALTKAT ENAGYPSSVK Q
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merT |
MerT |
Tn5044 |
351 |
9791-10141 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | mercury transporter |
Protein Sequence:
|
MSEPSNGRAP LVAGGLAAIL ASTCCLGPLV LIALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAYRRIF RPLPACKPGE ACAIPQVRTT YKLIFWVVAA LVLVAFGFPY ILPLFY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR |
MerR |
Tn5044 |
408 |
10213-10620 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MENILESLTI GTFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGEAD VARVKFVKSA QRLGFSLDEV AGLLRLDDGA HCDEARVLAE QKLEDVRGKL ADLQRIESVL ARLVHDCCAS QPTITCPLIV SLHGE
|
|
References |
|
|
1. | Minakhina S, Kholodii G, Mindlin S, Yurieva O, Nikiforov V. Tn5053 family transposons are res site hunters sensing plasmidal res sites occupied by cognate resolvases. Mol Microbiol. 1999 Sep;33(5):1059-68. doi: 10.1046/j.1365-2958.1999.01548.x. PubMed ID: 10476039
| | 2. | Kholodii G, Yurieva O, Mindlin S, Gorlenko Z, Rybochkin V, Nikiforov V. Tn5044, a novel Tn3 family transposon coding for temperature-sensitive mercury resistance. Res Microbiol. 2000 May;151(4):291-302. doi: 10.1016/s0923-2508(00)00149-2. PubMed ID: 10875286
| | 3. | Kholodii G, Bogdanova E. Tn5044-conferred mercury resistance depends on temperature: the complexity of the character of thermosensitivity. Genetica. 2002 Jun;115(2):233-41. doi: 10.1023/a:1020185206563. PubMed ID: 12403178
| |
| | |
|
|