Transposon
Name: Tn5044
Family: Tn3        Group: Tn3
Evidence of Transposition: yes
 Host     

Host Organism:Xanthomonas campestris TAP44-3
Place of Origin:Kamchatka peninsula, Russia Date of Isolation:2000
Other Geographic Information:t

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  10840 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTTGGG GAGCAATGGA ACCAAAAACC AACGTAAGCC CTACCACCTG CGCTCGGTTC CTTCCTCCCA GGCATCTGCT TCGATAAAGC AGTTGCGCAT 100
ATCGGCTTCT TGGCTGATCT CGGTCAGTAG GTCATGCACG GTCTTGTCCA GCTCGTCGTC GCTCCGATGC GGAATGGTCA GCTCGTAATG GCCGGTCTCC 200
AGCCGCTTCA TGCCATAGGG CTCCAGGCAG TAGCGCTCGA TGTTCTCCGT GGCTCGCTTC CGGCCGCGCA CGAACTTGCT GTTGTTCACC ACCGCCAGGC 300
GCAGTGTGAC GGTGGCCACC CGCTCGACGG CGGGCTCTAC CGGTGACGCC ACATTGAGCT GCTGGCCTCT CGGCTGGGCA CTCTTTTGGT ACGTCCCGAT 400
ATCGACGCCA CGGTGGCGCA GGTAGCTGTA CAGCGTGCTC TTTGAAATAT GCAGCTTCTC GCCGATGGCA CTGACGCTCA GCCGCCCCTC GCGGTACAGC 500
GTTTCCGCCG CCATGGCGGT GGCCTCGGCC TTGGCGGGCA AGCCCTTGGG GCGGCCACCG ATCCGGCCCC GCGCCCGTGC GGCCGACAGG CCCGCATGAG 600
TCCGCTCGCG GATCAGCTCG CGCTCGAACT CGGCCAGCGA GGCGAACAGG TTGAACACCA GGCGGCCTTG GGCGTGGGTG GTGTCGATGG GGTCGTTCAG 700
GCTCTGCAAG CCGACCTTGC GCTCTGCCAG CTCGCCGACC AGCTCGACCA GGTGCTTGAG CGAGCGCCCG AGGCGATCCA GCTTCCAGAT CACCACGGCA 800
TCACCCGGCC GCACGTTGGC CAAGAGTTTG TCGAGTTCCG GCCGGGCGCT TTTCGCGCCG CTGGCGATGT CTTGGTAGAT GCGTTCGCAT CCGGCCTGTT 900
TCAGGGCATC GACCTGGAGG TCTGCTTTCT GATCCCGAGT GCTCACTCGC GCATAACCGA TCTTCATCAA AAGTACCGTT TACTCGACTA CGTTAGTAAT 1000
AGTTGAACTT TGATTAAGCG TACCAGTTAT TTGAACCGTA GCGCGGGCAG GTTAACGAAC CGAGCCATTC CTCGATAGAG TTCGGCAAAA CCTTCGTTTT 1100
GTCGAACCAT TGCATCGCCC TTTGCGGTTG GCGTATAAAC ACACAAACAC CTATTAGCGG AGAACGCCAT GAATACCATT CGCTGGAATG TCGCCGTCTC 1200
GGCCGACACC GACCAGTCGC TTCGGATGTT TCTGGCCAGC CAGGGCGGTG GCCGGAAAGG CGACCTGTCG CGCTTCATCG AAGAGGCAGT ACGCGCCCAC 1300
ATCCTGGAAC TGAGCGCAGA GCAGGCCAAG GCCGTCAACG CCCATCTGAG TGAGGCAGAA TTGACCGACG CGGTTGACGA AGCACTCGCC TGGGCAAGTA 1400
AGCGCTGATG CGGGTCGTGT TGGATACCAA CATCCTGTTC AGCGCCCTGA TCTCGCCGCA TGGCGCACCC GATGCAATCT ACCGTGCTTG GCGCGCCTCG 1500
CGTTTCGAGG TGGTGACCTC GCGGATGCAG CTCGATGAAA TTCGTCGGGC CAGCCGCTAT CCCAAGCTTC AGGCCATCCT ACAGCCCGCC AAGGTGGGCG 1600
CCATGATCAA TAACCTTCAA CGGGCGGTGG TTCTGGAGCG TCTGACCATC GAGGTCGAAG CCGACGATCC AGATGATTCG TTTCTGCTGG CCATGGCCTT 1700
GGCGGGCGAT GCGGACTACC TGGTCACCGG TGATCGCCGC GCTGGCCTAC TGCAACGAGG GCACATCGAA CGCACGCGAA TCGTCACGCC CGCCGTGTTC 1800
TGCGCCGAGG TGCTGTGATC GATGCCGGTC GGTTTTTTGA CGCAGGAACA GCGCGACGCC TTTGGTCGCT ATGTCGATGC ACCCAGCCGT GAGGAGCTGG 1900
AACGTTACTT CCACCTGAGT GATGAAGACC GTGAAGCCAT CCAGGTGCTG CGGGGTAACC ATAACCGTCT GGGCTACGCC GTTATGCTGA CCACCGTCCG 2000
CTTCGTGGGG GTTCTGCCGG ACAAGCCCGC CGCTGTGCCG GTGGAAGTCC GGCAGGTGCT TTGCCGGCAA CTGGCAATCG CCGATCCTGA CTGCCTTCAG 2100
CGCTATAGTG ATCATCGCCG CTGGATACAT GCGGCCGATA TTCAAGAACG CTATGGCTAT CGTCACTTTA CTGACCCTGG CGTCGGCTTC CGCTTGAGCC 2200
GGTGGCTGTA TGCCCTCTGC TGGACGGGCA CTGATAGACC GGGTGTGCTG TTTGAGCGAG CCACCTCATG GTTGTTCACA CAGAAAGTCC TCCTGCCCGG 2300
CGTTTCTCAA CTGGAACGCT TCATCGCCCA ACTGCGTAGC CGGGTCGAAG AGCGCCTCTG GTACACGCTG GGCCGCAGCG TGACTGAGGA ACAGAGACAG 2400
CATCTGCAAG ACTTGCTCCT GGTGGCCGAA GGCAACCGCA GTTCCCGGCT GGATCAACTG CGCTCCGGCC CGGTGATGAT CAGTGGCCCT GCACTGGTCC 2500
GGGCGCTGCG TCGGCTCGAT GATGTGCGTG GGTTGGGCAT TACTTTGCCG GCAGCGGCGC ATATCCCGCC CAGTCGTATC GCCGCCCTGG CCCGCTTCGC 2600
CAATACCGCC AAGGTCACGG CCATCAATCG ACTGCCGGCG TCGCGCCGGC TGGCCACATT GGTGGCGTTC GCGGTCTGTC TGGAAGCCAG TGCGCACGAC 2700
GATGCCCTGG AAGTGCTGGA GGCGCTGTTG CGCGACCTCT TCAACAACGC GGAGAAGGCC GACAAGAAAG CCCGGCTGCG TAGCTTGAAA GACCTGGATC 2800
GTTCGGCGGC GACACTCGCC ACTGCCTGCA AGGTTGTGCT GGACGCCTCG ATCAGCGATG ACAATGTGCG TGCCCGGCTG TTCAACGACT TGCCGAGGGC 2900
CACCTTGGAG AAGGCCCTGG AAGAGGTCAA CGCGCTGATT CGCCCAGCCG ACGATGTGTA CTTCTTGGCC CTGGCGGCGC GCTACCGCAG CGTTCGCCGT 3000
TTCCTACCGA ATCTACTCAG CCACATCCGC TTTGGCTTCA GCCCGGCCGG CAAGGGCGTG GCGGCTAGCC TGGATTGGTT GCAACTGAAC CTGCCGCGCA 3100
GGAAACCAGA GGATGACGCA CCACAGGAGA TCGTGGCCAA GGCTTGGCAG AATCACATCA CCCGCGAAGA TGGGTCGCTC GACATGGGTG CCTATGTGTT 3200
CTGTACGCTC GACGCTCTGC GTACCGCGCT ACGCCGCCGC GATGTCTTCG TCTCGCCCAG TTGGCGCTAT GCCGATCCCC GCATCGGCCT GCTCGGCGGT 3300
GCCGAATGGC TGGCGGCGCG ACCGATCATT TGCCGCTCGC TGGGCCTGAC CATTGACGCC GGCACCACCT TGGATGCGCT GAGCGCTGAG CTGGATGCGA 3400
CCTGGCAGAC GGTGGCCGCA CGCCTGCCCG ACAACCCCGC GATCCAACTG AGCGAGAACA CCGAGGGCAA GACCGAACTG TCGCTCGGGG CGCTGGACAA 3500
GCTGGACGAG CCCAGTTCGT TGTTGCAACT GCGGGCGGCG GTGGCCGACT TGATGCCGCG TGTCGATCTG CCGGAAATCC TCCTGGAGAT CGCCGCCCGT 3600
ACAGGCTTCG CCGAGGCGTT CACTCATGTG TCCGAGCGCA ACGCACGGGC CGACAACCTA GTCACCAGCC TCTGCGCGGT GCTGCTGGGT GGGGCCTGCA 3700
ACACCGGCCT GGAACCCTTG ATCCGTGCCG ACAACCTGGC GCTGCGCCGT GACCGACTGT CCTGGGTCAG CCAGAACTAT ATCCGCGACG ACACGTTGTC 3800
AGCGGCCAAC GCCATTCTGG TGGGTGCGCA AAGCCAACTG GAGCTGGCCC AGGTCTGGGG TGGTGGTGAG GTCGCTTCCG CCGACGGCAT GCGCTTCGTC 3900
GTACCGGTGC GCACCGTGCA TGCCGGCCCC AACCCGAAAT ATTTCGGTAC CGGCCGGGGC GTCACCTGGT ACAACCTGAT TTCCGACCAG TTCTCCGGCC 4000
TCAATGCCAT CACCGTGCCC GGCACGCTGC GCGACAGCCT GGTACTGCTG GCTGTTGTGC TGGAACAGCA GACCGAGTTG CAGCCGACGC AGATCATGAC 4100
CGACACCGGA GCCTACAGCG ATGTGGTGTT CGGGCTGTTC CGCCTGCTTG GCTACCACTT CAGTCCGCGG CTGGCCGATG TCGGCGGTAC CCGCTTCTGG 4200
CGCACGCGCC CGGACGCGGA CTACGGCAAG CTCAACGGGC TCGCCCGGCA GTCGGTCAAG CTCGACCTGA TCGCCGAGCA CTGGGATGAT CTGCTGCGCC 4300
TGGCCGGTTC ACTCAAGCTC GGCCGAGTGC CGGCGACCGG CATCATGCGC ACCCTGCAAA CGGGAGATAG ACCCACCCGG CTGGCCCAGG CGCTGGCCGA 4400
ATTCGGACGG ATCGAGAAAA CCCTGCACAC GCTGACCTAC ATCGATAACG AGTCCAAGCG CCGCGCCACC CTGACCCAGT TGAATCGAGG CGAAGGCCGG 4500
CACAGCCTTG CCCGCGCGGT GTTTCACGGC AAGCGCGGCG AGCTTCGCCA GCGCTACCGC GAAGGCCAGG AAGACCAGCT CGGTGCTCTG GGCCTGGTGG 4600
TGAACATTAT CGTGCTGTGG AACACCCTCT ACATGACAGC TGCCGTGGAA CGGCTAAGAC AGCACGGCTA CCCGGTGTTG GAAGAGGATT TGGCCCGGCT 4700
GTCACCGCTG ATCTACGAGC ACATCAACAT GCTCGGGCGG TATTCCTTCG CGGTACCGGA TGAGGTTGCA CGCGGCGAGT TGCGGCCGTT GCGTAATCCA 4800
GAGGATGACC TGTAGGTGGA TAGACAGGGC GCTTCTTTAA TCGGCCGAGC CAGGACTGTT GTCGGCAGGC GCGGTGTCGG ATACGGCCCT GGCCTGCGCA 4900
CGAGCCTGGA AACGCTGATA GCACTCCAGT CCGCAGAAGT GCAGGACGTA CTCGCTACCT TCCGCGGTTA AGGCCGCGTC GAGCGGGATT TCCTTGCAAC 5000
ACTCGCAGCA GGTTGTGCAG TCACTGACAG TTGGATCGGA TGTGGCGTTC ATGACGGTAC TCCTTCAGGG AGTGGCGATC GAGGAGGCAC ACGGTTCGAG 5100
CGTAGATTTC GGAATGCCGC AAGTGTTTCT CTGTATCCCA CTTCGATCAA TGCAGCGGTT TGATTAAAGT CGAGCAACCC GCCTGGTTGC TGAAAGTGCG 5200
GGCGTACCAG CTGTATGCCG ATTCCCCGTG CCCTGTGCCG TTGCAAACTG CACAGGTATT TGGTGTCTAG GGCAATCATG AAGCTGCGTA GCAGTATCTT 5300
CAGCGGCGAA GCCAGCGCAG TCGAATCCCG TGGACAACAA GTGCATGAGA TCATCAGTAT CTCGTGAGCA CCGCGATCCA GGGCCTTATC GAACGGCACG 5400
TTATCGGTTA CACCGCCATC CACATGCTGA TCCCCCGCAA TCCAGACCGG CGGGAAAACA CCGGGCAAGC TCATGCTGGC AATCATCGGT TCGATCAGGA 5500
CGCCGTGCCG TTCGTGATAG CAAGGCTGAC CGCTCTGCAG ATTGGTGGTC GTGATGACCA GCGGATGCTT GAGCTCCTCG AAACGCTTGA CGGGCAGCGC 5600
CTCATGGAGA AGGCGTCGTA GCGGGGCGAA CGAATAAAAA CCGCCTGTTT CGCCCAGTAA TCCCCGCCAG TTCCAGGAAA CGACATCAGG TTTTCGCAGG 5700
CCTTGCCATA GCTCAGACAG CTCAGCCGGC GACATGCCGG CTGCTATCAG GGTGCCGTTC AAGGCCCCCA CCGAACTCCC AACCACCAGA TCGAACGACA 5800
ATCCCAGTTC GATGATCGCT TGATAGAAAC CGACTTGCAT CGCACCTCGT GCGCCACCCC CACACAAGAC CAGTGCGGTA TGCGCCGAAG AAGCCTCGAC 5900
GTTCATGAGT CCTCCGAATG TACAGCACGC CAGGTCAACA CCCCCGAAAC GCTGAATACG GTGATCATCG TCACTGCGGC GATGCCCCAA TGTGCGCTGA 6000
GGAAAGCGCC GGCGGTCGTG CCAGCCAACA CAATGACGAG CACCGGTAGA TGGCAGGGAC AAGTGAGGGC GGCTAGCACC CCCCATGCGT AAGCCCGCCA 6100
GCGACTCGCT GGGGGCGAAC TGGCTGGGCT ACGCATGGCG GCTTTCCGGT TCAGCGTGAG GAAGCCCGGC CGCCAGTTGC CGGTCCAGTG CCACCAAGGT 6200
TTCGCGTCGC GCCGCGATCA ACGAACGCAG ATGCGCCCGA CACTCATCGA TATCGCCGCC CCCACCGTCT AGCGACTGAC ACCAGCGCGC CAACTCTCTC 6300
AGATCGATGC CCGCCTCAAA GGCCGTGCGC ACGAAGCGCA GCCGATTCAG CGACTGCGCA TCGAAAATGC CATAACCGCT GTCGGTGCGC CGCACAGGGT 6400
GCAGCAGTCC GCGCAACAGG TAGTCGCGCA CCACGTGCAC GCTGACGCCG GCCTGTTCGG CTAACTTGGA AATCTTGTAA CTGCTCATGC TTGTACTCCT 6500
TCAGACAGCC ACCCCCTCAA TGGATCGATG AGTGGGGGCT GCCTGGATAT ACAGCGGCAC TCCCCTTGTC CCTGATAGGC GTTCAACACA CGCCGCAACG 6600
CGGAGCGGGC CCGGTGCAGC AGCACGCGCA CATTGCCGTT GGAGGTCGCC AGGTGCACGG CAATGTTGTC GAGTTCCATG CCTTCCATGT CCCGCAGCCA 6700
ATAGGCGCGC TGCTGCGCGC CTGGCAGCGC GGCGATGGCA CGACCCAGAC ACTGGGCCGA CTCGGCGGAG GACAACAGCG CTTCCGGCCC CTCCTGATGC 6800
CAAGGGCCTG GAGTCGTCGG CCCACGCTCA GCCGCATCGA AGCGCAGGCT CTTGCCGCCG TCGACGCCTG GGAATACCGC CTCCCAAGGC ACGTAGCGCC 6900
GCTCCCGGCC CAACCGGGCC AGTGCCTGGT TGCGCACGAT TGCCCAGAGC CAGGTCTTGA GGCTGGCCCG GCCCTCGAAC CGGGGCAGCG CCCGATAGGC 7000
GGCGAGCCAG GCCTCTTGCA CGGCGTCGTC GGCCCAGGCT TCGCCGACGA TACAGACGGC CAGCGCTCGT AGCGCCCGAT GATGCTCACG CACCAAGGCC 7100
GTGAATGCCT CCCGGTCGCC GGCCCGCAAG CGCGACAGCA ACAGGGCCGC TGCGCTGAGC CCATCGGGTT GTACCGCCGC GCCGTAAGTG GTGACAGTGG 7200
CCCGGAAGGC CATACGAGGG GCCACTGTGC ACCTCCTCGG TCGTTGCTGC CTCGCAGAAG AAATAACCCT GCCGCCTGTC GCCACGAGAG CAGCCGATGC 7300
GTTCAGGCGA ACTTCGGGTT TCCTGTTGCG CAGCCTAGCC GGCGCAGCAG GAAAGCTGTT TGACGTCCTT GGTGAAGGTT TGCGCGGCGA GCTTCAGGCC 7400
CTCGACCATA GTCAGGTAGG GGAACAACTG GTCGGCCAGT TCCTGTACCG TCATCCGGGC GCGGATGGCG ATGACGGCGG ACTGGATCAG TTCACCCGCT 7500
TCGGGAGTGA CCGCCTGCAC GCCCAGCAAC CGGCCTGAGC CTGCTTCGGC GCACCAGCTT GATGAACCTC GTGTGTCGAA GTTGGCCAGT GCACGCGGCA 7600
CGTTGTCCAG AGTGAGTGTT CGGCTGTCCG TTTCGAGGCC GGCGTTATGC GCTTCGGCTT CGCTGTAGCC GACGGTGGCC ACTTGGGGAT CGGTGAACAC 7700
CACGGACGGC ATGACGTCGA GATTGAGTGT CGCCTCGCCG CCGGTCATGT TGATCGCCGC ACGGGTGCCG GCCGCCGCCG CCACATAGAC GAACTGCGGC 7800
TGGTCGGTGC AGTCGCCAGC CGCATAGATA TCCCGTGCGC TGGTACGCAT GCCCCGGTCG ATCTGGATGC CACCGCGCTC GTCCAGCGTC ACGCCGGCGC 7900
CGTCCAGGTT CATGCCTTGG GTGTTCGGCA AGCGCCCGGT GGCAATGAGC AGTTGGTCGG CACGCAGCTC ACCATGGTTG GTGCTCAGTA CGAACTCGCC 8000
GTTGGGCTGG GACACCTGGC TGGCTTGCGT TTGCTCCAGC ACCTCGATGC CCTCCATACG GAAGACCTCC GTCACCGCAG CACCTATGGC CGGGTCTTCG 8100
CGAAAGAACA TCGAACTGCG TGCCAGGATC GTGACGCGGC TACCTAGCCG GGCGAAGGCC TGCGCCAGTT CCACCGCCAC TACGGAGGCG CCGATCACGG 8200
CGAGCCGTTG CGGAATGCTG TCGCTCACCA GCGCCTCGTC TGAGGTCCAG TACGGCGTGT CTTTCAGCCC TGGAATAGAC GGAACGGCCG CGCTTGCGCC 8300
GGTGGCAACC AGGCAGCGGT CGAAGGCTAC GATGCGCTCG CCACCCTCGG CCAGTTCCAC GCTGAGCGTG TGGCCATCCT GGAAACGGGC GGTGCCACGC 8400
AGCACGCTGA TGGCTGGGGT GCTCTCCAGA ATGCTTTCGT ATTTGGCGTG GCGCAATTCG TCGACACGGC CTTGCTGCTG CGCGAGCAGT CGCTCGCGCA 8500
GGACTGTCGG CGTCGTGGCG GACAATCCGC CGTCGAACGG ACTCTCGCGG CGCAGATGGG CCACATGCGC GGCACGAATC ATGATCTTCG ACGGCACGCA 8600
GCCCACGTTG ACGCAGGTTC CGCCGATGAT GCCGCGCTCG ATCAGGGTGA CGCGGGCACC GCCTTCCACC GCCTTGAGCG CCGCTGCCAT GGCGGCGCCA 8700
CCGCTGCCGA TGACCGCAAC GTGGAGCCTG ACGTCCTCTT TCCCGGCGCC AATGCCACCG CTCAACCAGC CCGGCGCCTG GTCGAGCAGG CCCGGACGGG 8800
GTTGTAGCGT GGCGTCCTCG AACGCCGCTC GATACCCCAG AGCCTCGACG GCGGCCTGCA TTTGTTCATG GTTTACACGC TTGTCGACCG TCAATTCGGC 8900
CTTCCCGCTG GCGTAGGAAA CATCCGCCTG GTGCACGCCG GGAATCTTCT CCAAGGCATC TTTTACATGG TCGGCACAGG ATGCGCAGGT CATGCCGCTG 9000
ATTTGCAGAG TGGTCATAGC GTTCAATTCC TCAAATAGGG CTCTTGTCGG CATCAGTGTT TCTCAGGTGG CGGACAGGAA TTCGATGCGC AACGGCGGTG 9100
GGCGGGCGAC AGCAGATCCC AGATGGCCTC CCCAAGCATG AGGGCCAGCC CCGTGTAAAA CAGGCCGGCC GTCCACCAGC GGCCAAAGAA CAAGTACAGA 9200
GCGGCCAACA CGATGGTCGG CCCGGTTATA CCGAGCAGGC TGCGCTGCCA TTGGCGATGG CTGAACCAGC CCAGGGCATT TGCCAGCAAG GCGACGACGG 9300
CAAACAGCGG CAGCAGCGTG GTGATGAACA GGCCCTCGTA CTGACCTAGA AAGCCCAGTC CAATGGCCGC CCCCAGGCTG GCCATGGCAG GGAAGCACAC 9400
GGCGCAGCCC ATGGCGGAAA CAACGCTGCC CACAACACCG GTCTTGTCAG CGATTCGTGT GATAAGCCCC ATGATTCGAC CCCTGCCTGA TTACGCTCGG 9500
GCTACTGTTT GACGCTGGAC GGATAGCCGG CGTTCTCGGT TGCTTTGGTC AGGGCCTGGG CATTGGTCTT GGCATCGTCG AAGGTCACTG AGGCTTCGCG 9600
CGTATCGAAG CTCACGACGA TATTGCTCAC GCCCTCGACC TTGGAAAGCG CCTTCTTGAC CGTGATCGGG CAGGCGGCGC AGGTCATGCC AGGCACCGAC 9700
AAGGTGACGG TCTGACTGGC AGCCCACACA GGGGAAACAA AGGCAATGAG AGCGAGAGAG GCAAACAGCT TTTTCATGAT GAACTCCAGA TCAGTAGAAC 9800
AAGGGGAGGA TGTAGGGGAA TCCGAACGCG ACCAGCACCA ATGCCGCCAC GACCCAGAAA ATCAGCTTGT AGGTAGTGCG CACCTGCGGA ATCGCGCAGG 9900
CCTCGCCGGG TTTGCAGGCT GGTAGTGGGC GGAAAATGCG ACGGTAGGCG AAGAACAGCG CCACCAGCGC CGCGCCGATG AAGAGCGGGC GGTAAGGTTC 10000
CAGCACGGTC AGGTTGCCGA TCCAGGCCCC GCTGAACCCC AGTGCGATCA GAACCAACGG CCCAAGACAG CAGGTCGAGG CGAGAATCGC CGCCAGCCCC 10100
CCGGCGACAA GCGGGGCGCG CCCGTTCGAT GGTTCAGACA TGCATTTCTC CTTTTGAGCA TTTGATCGAT GGGTTAAGGT TACTTCCGTA GTCATGTACG 10200
GGATCAAGCG CTATGGAGAA CATTTTGGAA AGCCTGACCA TCGGCACCTT CGCTAAGGCG GCTGGGGTCA ACGTGGAAAC CATCCGGTTC TATCAGCGCA 10300
AGGGGCTGTT GCCCGAACCG GACAAGCCCT ACGGCAGCAT TCGCCGTTAC GGCGAGGCGG ATGTCGCACG GGTGAAATTC GTCAAATCTG CTCAGCGGCT 10400
GGGCTTTAGC CTCGATGAAG TGGCCGGACT GTTGAGGCTG GATGACGGCG CTCACTGCGA CGAAGCGCGC GTGCTCGCGG AACAGAAGCT CGAGGATGTG 10500
CGCGGGAAAC TTGCGGATCT GCAGCGGATC GAGTCGGTCT TGGCACGGCT GGTCCATGAC TGTTGTGCGA GCCAACCCAC TATTACCTGC CCGCTGATCG 10600
TTTCGCTGCA TGGGGAATAG CGAGCCGGCA TGAAGCGGCC GGAGACCGGT CATTACGAGC TGACTATTCC GTATCGGAGC GACGACGAGC TGGACAAGAT 10700
TGTGCATGAC CTGCTGACCG AGATCAGCCA GGAAGCCGAC ATGCGCAACT GTTTTGTCGA GATGGACGCC TGGGAAGAAG GCACGGAACG GCGCTGGTAG 10800
GGCTTACGTT GGTTTTTGGT TCCATTGCTC CCCAAACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res 972-1107 136 AGTACCGTTT ACTCGACTAC GTTAGTAATA GTTGAACTTT GATTAAGCGT ACCAGTTATT
TGAACCGTAG CGCGGGCAGG TTAACGAACC GAGCCATTCC TCGATAGAGT TCGGCAAAAC
CTTCGTTTTG TCGAAC
res_site_III 982-1023 42 ACTCGACTAC GTTAGTAATA GTTGAACTTT GATTAAGCGT AC
res_site_II 1025-1060 36 AGTTATTTGA ACCGTAGCGC GGGCAGGTTA ACGAAC
res_site_I 1080-1107 28 GTTCGGCAAA ACCTTCGTTT TGTCGAAC

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpR Tn5044 41-967 Accessory Gene Resolvase -
RHH_6 Tn5044 1169-1408 Passenger Gene Antitoxin +
PIN_3 Tn5044 1408-1818 Passenger Gene Toxin +
tnpA Tn5044 1822-4815 Transposase   +
phospholipase Tn5044 5049-5906 Passenger Gene Other -
merD Tn5044 6129-6488 Passenger Gene Heavy Metal Resistance -
sigY Tn5044 6485-7225 Passenger Gene Heavy Metal Resistance -
merA Tn5044 7335-9017 Passenger Gene Heavy Metal Resistance -
merC Tn5044 9053-9472 Passenger Gene Heavy Metal Resistance -
merP Tn5044 9502-9777 Passenger Gene Heavy Metal Resistance -
merT Tn5044 9791-10141 Passenger Gene Heavy Metal Resistance -
merR Tn5044 10213-10620 Passenger Gene Heavy Metal Resistance +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn5044 927 41-967 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MKIGYARVST RDQKADLQVD ALKQAGCERI YQDIASGAKS ARPELDKLLA NVRPGDAVVI WKLDRLGRSL KHLVELVGEL AERKVGLQSL NDPIDTTHAQ
GRLVFNLFAS LAEFERELIR ERTHAGLSAA RARGRIGGRP KGLPAKAEAT AMAAETLYRE GRLSVSAIGE KLHISKSTLY SYLRHRGVDI GTYQKSAQPR
GQQLNVASPV EPAVERVATV TLRLAVVNNS KFVRGRKRAT ENIERYCLEP YGMKRLETGH YELTIPHRSD DELDKTVHDL LTEISQEADM RNCFIEADAW
EEGTERRW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
RHH_6 RHH_6 Tn5044 240 1169-1408 +
Class:   Passenger Gene
Sub Class:   Antitoxin
Function:   Antitoxin
Sequence Family:  RHH_6 (Pfam:PF16762)
Protein Sequence:  
MNTIRWNVAV SADTDQSLRM FLASQGGGRK GDLSRFIEEA VRAHILELSA EQAKAVNAHL SEAELTDAVD EALAWASKR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
PIN_3 PIN_3 Tn5044 411 1408-1818 +
Class:   Passenger Gene
Sub Class:   Toxin
Function:   Toxin
Target:   single stranded RNA
Sequence Family:  PIN_3 (Pfam:PF13470)
Comment:   Pfam PF13470.5
Protein Sequence:  
MRVVLDTNIL FSALISPHGA PDAIYRAWRA SRFEVVTSRM QLDEIRRASR YPKLQAILQP AKVGAMINNL QRAVVLERLT IEVEADDPDD SFLLAMALAG
DADYLVTGDR RAGLLQRGHI ERTRIVTPAV FCAEVL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn5044 2994 1822-4815 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPVGFLTQEQ RDAFGRYVDA PSREELERYF HLSDEDREAI QVLRGNHNRL GYAVMLTTVR FVGVLPDKPA AVPVEVRQVL CRQLAIADPD CLQRYSDHRR
WIHAADIQER YGYRHFTDPG VGFRLSRWLY ALCWTGTDRP GVLFERATSW LFTQKVLLPG VSQLERFIAQ LRSRVEERLW YTLGRSVTEE QRQHLQDLLL
VAEGNRSSRL DQLRSGPVMI SGPALVRALR RLDDVRGLGI TLPAAAHIPP SRIAALARFA NTAKVTAINR LPASRRLATL VAFAVCLEAS AHDDALEVLE
ALLRDLFNNA EKADKKARLR SLKDLDRSAA TLATACKVVL DASISDDNVR ARLFNDLPRA TLEKALEEVN ALIRPADDVY FLALAARYRS VRRFLPNLLS
HIRFGFSPAG KGVAASLDWL QLNLPRRKPE DDAPQEIVAK AWQNHITRED GSLDMGAYVF CTLDALRTAL RRRDVFVSPS WRYADPRIGL LGGAEWLAAR
PIICRSLGLT IDAGTTLDAL SAELDATWQT VAARLPDNPA IQLSENTEGK TELSLGALDK LDEPSSLLQL RAAVADLMPR VDLPEILLEI AARTGFAEAF
THVSERNARA DNLVTSLCAV LLGGACNTGL EPLIRADNLA LRRDRLSWVS QNYIRDDTLS AANAILVGAQ SQLELAQVWG GGEVASADGM RFVVPVRTVH
AGPNPKYFGT GRGVTWYNLI SDQFSGLNAI TVPGTLRDSL VLLAVVLEQQ TELQPTQIMT DTGAYSDVVF GLFRLLGYHF SPRLADVGGT RFWRTRPDAD
YGKLNGLARQ SVKLDLIAEH WDDLLRLAGS LKLGRVPATG IMRTLQTGDR PTRLAQALAE FGRIEKTLHT LTYIDNESKR RATLTQLNRG EGRHSLARAV
FHGKRGELRQ RYREGQEDQL GALGLVVNII VLWNTLYMTA AVERLRQHGY PVLEEDLARL SPLIYEHINM LGRYSFAVPD EVARGELRPL RNPEDDL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
phospholipase Phospholipase Tn5044 858 5049-5906 -
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MNVEASSAHT ALVLCGGGAR GAMQVGFYQA IIELGLSFDL VVGSSVGALN GTLIAAGMSP AELSELWQGL RKPDVVSWNW RGLLGETGGF YSFAPLRRLL
HEALPVKRFE ELKHPLVITT TNLQSGQPCY HERHGVLIEP MIASMSLPGV FPPVWIAGDQ HVDGGVTDNV PFDKALDRGA HEILMISCTC CPRDSTALAS
PLKILLRSFM IALDTKYLCS LQRHRARGIG IQLVRPHFQQ PGGLLDFNQT AALIEVGYRE TLAAFRNLRS NRVPPRSPLP EGVPS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn5044 360 6129-6488 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   second regulatory element
Protein Sequence:  
MSSYKISKLA EQAGVSVHVV RDYLLRGLLH PVRRTDSGYG IFDAQSLNRL RFVRTAFEAG IDLRELARWC QSLDGGGGDI DECRAHLRSL IAARRETLVA
LDRQLAAGLP HAEPESRHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sigY SigY Tn5044 741 6485-7225 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   RNA polymerase sigma factor-like protein
Protein Sequence:  
MAPRMAFRAT VTTYGAAVQP DGLSAAALLL SRLRAGDREA FTALVREHHR ALRALAVCIV GEAWADDAVQ EAWLAAYRAL PRFEGRASLK TWLWAIVRNQ
ALARLGRERR YVPWEAVFPG VDGGKSLRFD AAERGPTTPG PWHQEGPEAL LSSAESAQCL GRAIAALPGA QQRAYWLRDM EGMELDNIAV HLATSNGNVR
VLLHRARSAL RRVLNAYQGQ GECRCISRQP PLIDPLRGWL SEGVQA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn5044 1683 7335-9017 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MTTLQISGMT CASCADHVKD ALEKIPGVHQ ADVSYASGKA ELTVDKRVNH EQMQAAVEAL GYRAAFEDAT LQPRPGLLDQ APGWLSGGIG AGKEDVRLHV
AVIGSGGAAM AAALKAVEGG ARVTLIERGI IGGTCVNVGC VPSKIMIRAA HVAHLRRESP FDGGLSATTP TVLRERLLAQ QQGRVDELRH AKYESILEST
PAISVLRGTA RFQDGHTLSV ELAEGGERIV AFDRCLVATG ASAAVPSIPG LKDTPYWTSD EALVSDSIPQ RLAVIGASVV AVELAQAFAR LGSRVTILAR
SSMFFREDPA IGAAVTEVFR MEGIEVLEQT QASQVSQPNG EFVLSTNHGE LRADQLLIAT GRLPNTQGMN LDGAGVTLDE RGGIQIDRGM RTSARDIYAA
GDCTDQPQFV YVAAAAGTRA AINMTGGEAT LNLDVMPSVV FTDPQVATVG YSEAEAHNAG LETDSRTLTL DNVPRALANF DTRGSSSWCA EAGSGRLLGV
QAVTPEAGEL IQSAVIAIRA RMTVQELADQ LFPYLTMVEG LKLAAQTFTK DVKQLSCCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merC MerC Tn5044 420 9053-9472 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercury transporter
Protein Sequence:  
MGLITRIADK TGVVGSVVSA MGCAVCFPAM ASLGAAIGLG FLGQYEGLFI TTLLPLFAVV ALLANALGWF SHRQWQRSLL GITGPTIVLA ALYLFFGRWW
TAGLFYTGLA LMLGEAIWDL LSPAHRRCAS NSCPPPEKH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn5044 276 9502-9777 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   Periplasmic mercury ion-binding protein
Protein Sequence:  
MKKLFASLAL IAFVSPVWAA SQTVTLSVPG MTCAACPITV KKALSKVEGV SNIVVSFDTR EASVTFDDAK TNAQALTKAT ENAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn5044 351 9791-10141 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercury transporter
Protein Sequence:  
MSEPSNGRAP LVAGGLAAIL ASTCCLGPLV LIALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAYRRIF RPLPACKPGE ACAIPQVRTT YKLIFWVVAA
LVLVAFGFPY ILPLFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn5044 408 10213-10620 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MENILESLTI GTFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGEAD VARVKFVKSA QRLGFSLDEV AGLLRLDDGA HCDEARVLAE QKLEDVRGKL
ADLQRIESVL ARLVHDCCAS QPTITCPLIV SLHGE

 References     

1.Minakhina S, Kholodii G, Mindlin S, Yurieva O, Nikiforov V. Tn5053 family transposons are res site hunters sensing plasmidal res sites occupied by cognate resolvases. Mol Microbiol. 1999 Sep;33(5):1059-68. doi: 10.1046/j.1365-2958.1999.01548.x. PubMed ID: 10476039