Transposon
Name: Tn5058
Family: Tn402
Group: Tn5053
Evidence of Transposition: Yes
 Host     

Host Organism: Pseudomonas sp. ED23-33
Place of Origin: Edoma suite sediments near the river Khomus Yurakh, Russia
Date of Isolation: 2005
Other Geographic Information: Permafrost

 Terminal Inverted Repeats (IR)     


 Sequence     
DNA Sequence Length: 12373 bp
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAATT GACGGGGCGT AACGCCAGAT GTGCAGGCGA CTCCTGACAA CGCAATATCA GAAGTCATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGTG AGCATGGCGT CAGACACATC ATTGATTGCC GAGCAAGGCG TGGCCACCCT GCCCGATGCG 200
GCTTGGGCGC AGGCCCGGCA ACGGGCGGAA ATCATCGGGC CGCTGGCAGC GCTTGATGTG GTCGGGCATG AAGCCGCCGA TGCCGCTGCT CACGCGCTTG 300
GCCTGTCCAG GCGGCAGGTG TATGTCCTAA TCCGCCGTGC CCGGCAAGGT GCTGGGCTTG TGACGGACCT GGCTCGCAGC CGATCCGGCG GCGGAAAAGG 400
CAAGGGACGC TTGCCGGAAT CAGTTGAGCG CATCATCCGC GAGTTGCTGC AAAAGCGCTT CCTGACCAAG CAGAAGCGTA GCCTGGCAGC GTTCCACCGC 500
GAGGTCGCGC AGGCTTGCAA AGCGCAAAAG CTGCGGGCGC CGGCGCGCAA CACCGTGGCT CTGCGGATCG CCGGCCTCGA TCCGCTCAAG GCCACTCGCC 600
GCCGGGAAGG TCAGGATGCG TCCCGCAGCC TGCAAGGTGT CGGTGGTGAG CCTCCCGCCG TGACCGCGCC ACTGGAACAA GTGCAGATTG ATCACACGGT 700
CATCGACCTG ATCGTGGTGG ACGAGCGCGA CCGGCAACCG ATTGGCCGTC CGTATCTGAC CATCGCCATC GACGTGTTTA CCCGCTGCGT GCTCGGCATG 800
GTCGTCACGC TGGAAGCGCC GTCATCTGTT TCGGTCGGCC TGTGCCTTGT GCATGTCGCC TGCGACAAGC GTCCCTGGCT GGAGGGTCTG AAGATAGAAA 900
TGGAGTGGCC GATGAGCGGC AAGCCCAGGC TGCTCTACCT GGACAACGCG GCCGAGTTCA AGAGCGAAGC GCTACGCCGA GGCTGCGAGC AGCATGGCAT 1000
CCGGCTTGAC TATCGCCCGC TCGGGCAGCC GCACTACGGC GGCATCGTGG AACGGATCAT CGGCACGGCG ATGCAGATGA TCCACGACGA ATTGCCAGGG 1100
ACGACCTTCT CCAACCCTGA CCAGCGCGGC GACTACGATT CCGAAAACAA GGCCGCCCTG ACGCTGCGTG AGCTGGAGCG CTGGCTCACA TTGGCGGTCG 1200
GCACCTACCA CGGCTCCGTG CACAACGGCC TGCTCCAGCC GCCGGCAGCG CGCTGGGCCG AAGCTATCGC GCGGACCGGC GTGCCAACCG TCATCACTCG 1300
CACCACGGCT TTTCTGGTCG ATTTTCTGCC CATCATCCGC CGCACGCTGA CCCGCACCGG CTTCGTCATC GACCACATCC ATTACTACGC CGATGCGCTC 1400
AAGCCGTGGA TAGCTCGGCG CGACCGCTTG CCTGCGTTCC TGATCCGGCG CGACCCGCGC GACATCAGCC GCATTTGGGT GCTGGAGCCG GAGGGGCAGC 1500
ACTATCTGGA AATTCCATAC CGCACCTTGT CGCACCCGGC TGTCACCCTC TGGGAACAAC GACAGGCGCT GGCGAAATTG CGGCAGCAAG GGCGCGAACA 1600
GGTGGATGAG TCGGCGCTGT TTCGCATGAT CGGCCAGATG CGCGAAATCG TGTCCACCGC GCAGAAAGCT ACGCGCAAGG CGCGGCGCGA CGCGGATCGA 1700
CGCCAGCATC TCAAGGCAAC GGCAGTTCTT TTCAAAACCA CGCCACCACC GGACGCGGAC ATGGCTGACC CGCAGGCAGA CAACCAGCCA CCTGCCAAAC 1800
CGTTCGACCA GATTGAGGAG TGGTAGCCGT GGAAGAATAT CCCATCATCG ACTTGTCCCA CCTGATGCCG GTGGCCCAGG GCTTGGCCCG TCTTCCGGCG 1900
GACGAACGCA TCCATCGCCT TCGCGCTGAC CGCTGGATCG GCTATCCGCG AGCAGTCGAG GCGCTGAATC GGCTGGAAGC CCTGTATGCG TGGCCGAACA 2000
AACAACGCAT GCCCAACCTG CTGTTGGTCG GTCCAACCAA CAACGGCAAG TCGATGATCG TCGAGAAATT CCGCCGCACC CACCCGGCCA GCTCCGACGC 2100
CGACCAGGAG CACATTCCGG TACTGGTCGT GCAGATGCCA TCCGAACCGT CGGTAATCCG CTTCTACGTC GCGCTACTTG CCGCGATGGG CGCGCCATTG 2200
CGCCCGCGCC CACGGCTGCC GGAAATGGAG CAATTGGCGC TGGCACTGCT ACGCAAGGTC GGCGTGCGCA TGCTGGTGAT CGACGAATTG CACAACGTCC 2300
TGGCCGGCAA CAGCGTCAAC CGCCGGGAAT TCCTCAACCT GCTGCGTTTC CTCGGCAACG AGCTGCGCAT CCCGCTGGTC GGGGTCGGCA CACGCGATGC 2400
CTACTTGGCG ATCCGCTCGG ACGACCAGTT GGAAAACCGC TTCGAGCCGA TGATGCTGCC GGTGTGGGAG GCCAACGACG ATTGCTGCTC ACTGCTGGCC 2500
AGCTTCGCGG CTTCGCTCCC GCTGCGCCGA CCCTCGTCGA TTGCCACGCT GGATATGGCC CGCTACCTGC TCACGCGCAG CGAGGGCACC ATCGGCGAGC 2600
TGGCGCACCT GTTGATGGCG GCGGCCGTCG CTGCCGTGGA GAGCGGTGAG GAAGCGATCA ACCATCGCAC GCTCAGCATG GCCGATTACA CCGGTCCCAG 2700
CGAGCGGCGG CGGCAATTCG AGCGGGAACT GATGTGAAGC CAGCGCCACA CTGGCCACTG CATCCGGCTC CCAGGGAAGG CGAAGCCTTG TCTTCGTGGC 2800
TCAACCGCGT GGCCCTTTGC TATCACATGG AGGTGTCCGA GCTGCTGGAG CACGATCTTG GTCACGGCCA GGTTGATGAC CTGGACACCG CGCCACCACT 2900
GGCGCTGCTG GCGATGCTCT CCCAGCGGAG CGGCATCGAG CCGGACCGGC TGCGTTGCAT GAGTTTCGCC GGCTGGGTGC CTTGGCTACT GGACAGCCTT 3000
GATGATCAGA TTCCAGATGC ATTGGAAACC TATGCGTTCC AGCTCTCGGT GCTGCTGCCG AAACTCCGCC GTAGGACGCG ATCCATCACG AGCTGGCGTG 3100
CCTGGCTGCC CAGCCAGCCG ATACATCGCG CCTGCCCGCT CTGTCTGAAC GATCCGGAGA ACCAAGCCGT ACTGCTCGCG TGGAAGCTGC CCCTGATGCT 3200
GAGCTGCCCA CTGCATGGCT GCTGGCTGGA ATCCTATTGG GGCGTGCCAG GGCGGTTTCT CGGCTGGGAG AACGCCGACG CCGAACCGCG CACTGCCAGC 3300
GACGCGATTG CGGCGATGGA CCAGCGTACC TGGCAGGCAC TGACGACCGG CCACGTGGAG TTGCCGCGCC GACGCATCCA CGCCGCATTG TGGTTTAGGC 3400
TGCTACGCAC GCTGCTCGAT GAGCTGAACA CCCCGCTTTC GACGTGCGGC ACCTACGCGG GGTATCTCCG CCAAATCTGG GAATGCTGCG GGCATCCGCT 3500
GCGTGCTGGG CAAAGTCTGT GGCGACCGTA TGAAACCCTG AACCCGGCAG TACGGTTGCA GATGCTGGAG GCGGCGGCAA CGGCAATCAG CTTGATTGAG 3600
GTGAGGGATA TAAGCCCGCC AGGCGAGCAC GCAAAGCTAT TCTGGTCCGA GCCCCAAACC GGGTTCACCA GTGGCCTGCC GGCGAAAGCG CCGAAGCCCG 3700
AACCCGTCGA TCACTGGCAG CGTGCAATCC AGGCCATTGA TGAGGCCATC ATTGAAGCAC GACACAACCC CGAGACGGCA CGCTCGCTGT TCGCGTTGGC 3800
TTCCTATGGT CGGCGCGACC CCGCTTCCTT GGAACAGTTG CGCGCCACCT TCGCGAAGGA AGGCATCCCC CCGGAATTTC TGTCACATTA TGAGCCTAGC 3900
CTACCCTTTG CATGCCTTAG ACAGAATGAC GGGTTAAGTG ACAAATTTTG ACGTGCAGAG CTTCCCGATG CAAACTGTCA CATAATCGAA CGTATATGTG 4000
ACAGGTACAA GATGCTGATT GGCTACATGC GAGTATCGAA GGCGGATGGC TCCCAGGCGA CGGACTTGCA GCGCGATGCG CTGGTCGCGG CCGGTGTTGA 4100
TCCGGCGCAT CTCTATGAAG ACCAAGCGTC CGGAAAACGC GAGGATCGTC CCGGTCTGGC GAGCTGTCTG AAGGCACTAC GGCCAGGCGA CACGTTGGTC 4200
GTTTGGAAAC TGGATCGGCT CGGGCGCGAC CTGCGCAATC TGATCAATAC CGTCCATGAC CTGACCGAAC GCGGCATCGG CCTCAAGGTG CTGACCGGGC 4300
ACGTCGCGGC CATTGACACC ACGACCGCCG CCGGCAAGCT GGTCTTCGGT ATCTTCGCCG CGCTGGCCGA GTTCGAGCGC GAGTTGATCG CCGAGCGCAC 4400
CGTGGCGGGC CTGGCCTCAG CACAAACACG CGGCCGGAAA GGCGGCCGGC CGTTCAAGAT GACCGCCGCC AAGCTGCGGC TGGCGATGGC GGCAATGGGG 4500
CAGTCACAGA CCAAGGTCGG CGACCTGTGC CAGGAACTTG GCATCACGCG GCAGACCCTG TATCGGCATA TTTCACCCAA GGGCGAGCTA CGTCCAGATG 4600
GCGAGAAGCT ACTCAGCCGA ATTTGATGCC GACATGAGGC GACGTAGCGA AAGCGTGGTT TGTCTCAATT TGACGGCGGC GAACCGCAAG CGTTCGGTTC 4700
GCCGCTCGTT TCGGTCGCAG TGCTGGCCCG CGCCTGGAAA CGTTGATAAC ACTCCAGCCC GCAGAAGTGC TCGACGTACT CCGCGCCTTC CGGTGTGAAG 4800
GCGGCATCGA GCGGAATTTC CTTTGACCAG CACACGCAGC AACTGGTGGC GGTCGGATCA TTTGCATTCA TGGTGGCACC CCTCCATTGA CTGACGAAGA 4900
CGGCGAATGC CGCGCCGGCA TCGGCTTTGC GAACAGAAAG CCTTGCCCGC GCTCGAACGC TTGGGCCCTC CGCCGTAGTC CATCCATCCG GCCGGGAAGC 5000
GCTCATGATC TTTCCCTGAA TGCCCGCAAC GCCCGCGCCA GTGACAGAAG AAACAGGCCG GTCAAACCGA GCGCCGTGAT GACCCAATAC TCGACAAGGA 5100
ATGCACCGGC GGTTGTGCCG GCCAGCACGG CAGCGAGGAT GGACAGGTGG CAGGGGCAAG TCAGCACGGC CAGCGTGCCC CACAGGTAGC CGGTGAACGG 5200
TTTGTGCGTC TCGGCCGGGA TGCGCTCGGG GCTGTTCATG GCAAACTCTC CGCATGCTGT GCCGGCGCGG TCGGCATGGC GGCCAACTGC ACTTCCAGAT 5300
TGGCCAACGC TTCGCGCCGG CGTTCGACGA ACTGACGCAG CACAGCAAGC TGCGCGGCAG TTTCATCGCA GTTCGCCGCA TCCAGCGCCC GGCACAGCCG 5400
CGCCAATGCG CCGAGGCCGA TGCCCGCCTC GAAGGCGGCC CGCACGAAGC ACAGTCGCTG CAAGGCGGCG TCATCGAACA GGCCGTAGCC ACCCGTGGTG 5500
CAGGCGACTG GCCGCAGCAA TCCGCGCAGC AGGTAGTCGC GCACGATATG CACGCTCACC CCGGCATCAA GGGCCAGCCG GGACACCGTG TAGGCGTTCA 5600
TCGAACACCT CCTTTTGGTC GGTTCACGGC AATGCATATA CCGTTTCGCC GAGTTCAATC CGGCGCTGCG AATCCGATGC CGGTGCCGTA GAACATTGGG 5700
TCGATATCCG ATCACGGTGT CCTAGATGAC ATGGTCTGCA ACAGATGTCG ATTAAACTCC TGGCCCAAGC CGAAAGCCTC GTGGACACTG ACGATCGCCA 5800
ATCCTTCCAA TCCTTGATGC TTGGAGGCCC AGTCTTCCGC CGTCGGGACA GATGCAAAGA AATGTACATG GCAACAGAAG GACTGACGAA CGTCGGCTGC 5900
TTCCTGCGGC AATACCAAGG ACACCGCCAT GCCGGCAGGT TCGACAGCCT GTATCTCGCT GGGTGAAACC GTAAGCGAAA CGGGTGCTCC AGTTGCAGCG 6000
CAATGCGACG AGACGCGAGC TGTACGGCCG ATTAACGCCG GGAGCATCAG GGTGTCCAGC GCGCACCAGG CATACAGCCG GCGGTTGTCA ATTTCCAAGA 6100
CATGCGACGT CTCGCGCAAG GTGAGGACAT AGCCGATGAT GTTCCCATCG TTGTCGTATT CGGTACTGGT GGCTTGTTCG AGCACAGCCG CCACTTGCTC 6200
AGCGGGCCAG CCAAGAGTCC TGGCAAGCGT AGTTCGCGAA ATCGGGCGTC CCTTGGCAAG TTCACCCAGT AGCGCGACCA AGAACTCCGC AAAACCTTTG 6300
GGCCGGTTGG TCGGGGTGAG ACGTTCTAAA ATATATGGGG CGAGCTTCAT GGTTCCATCT CCTATTTGAA TTGGATAGCG TAACCTTACC TTCGTACTCA 6400
TGTACGGAGT CAAGCGGTAT GGAAAATAAT TTGGAGAACC TGACCATTGG CGTTTTTGCC AAGGCAGCCG GCGTCAACGT GGAGACCATC CGGTTCTATC 6500
AGCGCAAGGG CTTGTTGCCC GAGCCGGACA AGCCCTATGG CAGCATTCGC CGCTATGGCG CGGCGGACGT GACACGGGTG CGATTCGTGA AATCGGCCCA 6600
GCGGTTGGGC TTCAGCCTGG ATGAAATCGC CGAGTTGCTG CGGCTCGACG ATGGCACCCA CTGCGAGGAG GCCAGCAGCC TGGCCGAGCA CAAGCTCCAA 6700
GACGTGCGCG AGAAGATGGC CGATTTGGCG CGCATGGAGG CTGTGCTGTC TGACCTGGTG TGCGCCTGCC ATTCGCGGCA GGGGAATGTT TCGTGTCCGC 6800
TGATTGCGTC GTTGCAAGGT GGAACGAGCT TGGCAGGGGC TTCCACAGCT TAGCGTGCTT TATTTTCCGT TTTCTGAGGT GCCCCCTACC AGCAACGCAC 6900
CGCCGAAGAC CGGCAGCAGG AGCGCGAGCA ATTCCAGATG GTTAACCGGG GCCTGAAAGA CCAGATTCAA GACCTGCAAA ACCGGCTCAT CCAAGCCGAG 7000
TCAAGAGCGT CCGAGCTGAT AGACACTAAT GCGCAATTAC AACGACAAAC GCATGAGCTC GAACAGGCCA ATGCGATGCA CAAAAGCACC TTGGATAGGC 7100
AAGTGGCCGA TATCCAGAAC TTGGAGCATC AACTTAACGA CGCCTCGGCA AAAAACCGGG AGCGGAAGCA TGAAAGCAAA CAATTGGCGG ATAATATTGC 7200
CGCACTCGCA AGCCAGAAAG CAGATGCCGA TAAGCAGGTG GCCGTACTAT CCCAAGCATT AGAGGTCACC AAAACGGATC TGAAAGCCTC TCAGAATAGA 7300
GCGGAAGTTC TGTCCGATGA GAACAAGTTG ATTCTTCAGA AAAAGCGGTG ATTCAGGGGC AGTTTAAGCA GCTTCAGGAG TCACTGTCGC GATAAAGGCT 7400
TTAGTCAAAC ATTCATCGAA AAAGCTATAT TAATCAATAT ATTATATACG AAGTGGCTAA TTCCAACGCT TTTACCGGCC GGACCTCAAG TAGCTGCACC 7500
GGCCGCCATT TCATTCAAAG AGGCCTCGAC GACAGTCAAA ACTGTTCGAC GCACGTCGAT CTTCCATCGC ACGCGCTCGA CGCATCGGTC CAGGCTTTCG 7600
CTATCGTCTG CGTCCAATGC TTTGCAAAGT CGCGCCAGTT CGTCGAGCCC AATTCCGGAC TCGAAGGCTG CGCGGAGAAA CCGCAGCCTT CCGAGTGTTT 7700
TGTCATCGAA AATGTTCTAG CCACTTTCGG TTCGCCGAGC AGGGTGCAAC AAGCCGCGTA TCATGTAGTC GCGCACGACA TGCACTGGAC GCAAGCATCA 7800
TCAGCCAGCT TCGATATCGA GTACGCATTC ATACTGGGCT TTCCTCGCAG TCCTCTAGCA GCTTAAGCGC GACATCATGA CCAAGGTCGA ACGCACTCTC 7900
GACAGGCACG ACCTCAATCC CTTGGTGGGT CGAAGCCCAC GAATTTGCAA TAGACGGAGA CGCAAAGAAA TGCACATGAC AGCAAAACGA ACAACGAATG 8000
TCGGGCGAAG TATCCGGAGT TCGCAGTGAA ACCATTGCTT CTGCCGGTTC GACATGAAGC ACTGCTTCTG GTGCAACGGT GAGAGTGATC GGCCTACCGG 8100
TGGCGGCGCA CGGGGAAGTG ACACGGGCAA TCTTGCCGGT CAAGGCGGGA AACATCAAGG TGTCCAGCGC ACACCATGTG TACAAATGAC GGCCATCCAC 8200
CTCAAAAACG TGAGAGGTTT CACGTAGTGT CAGCCCCAGT CCAATGATAT TGGCCTCGTC GTCATATTCC GTACCAGGTG CCTGCTCAAG TACCGTTGCA 8300
ACTCTCGCTC CAGACCATCC GAGTGCGCCG GCGAGCTTTT CGCGTGATAC AGGTCCCCCC ATCGCTAGCT GTCGGAGCAG CGCGACAAAG AGCGTTGCGA 8400
ACTCTTTAGG ATGATTGCCA GATGAAAGGC TTTCGGCAAT CTTTTTGGAA TAAATAGTCT TGTCCATTTT CCACCTGTTA TATCGGGCAG CGATCAACCG 8500
GGATAAACCT TACTCGGCGG TTACGTCCTG CGCTTAACAA AGACGAATTA CTTTGTCCTT GCCGGCGTTG CCATTCCTCG GCCAGCAGCT GGGCTTGCTG 8600
TAACTGTTCG GGCTTCATTT GAGCTGCATA GTCTGCAACG ACTTTTGCTG CTGTCGGTTC TCCCGACGCT GCGGCCAGAC TCATCCACTG GTATGCCTGC 8700
ACCTGGTCGG GCGCGATGTC GGTAAGACCC TTTGCAAACA GCTGGGCCAG CCTGACTTGA GCCTGCGTGT GTCCCTGCTC GGCAGCGAGG CGGTACCACT 8800
TGAAAGCCTC TGCATCGTTG GCAGGAGCCG CACGCCCATT CTTATACATA CCGTCGACTT CACTTCCGTA GAGGCGCCCG AGGTTGAACT GTGCATCGGC 8900
TTCACCCTGC TCGGCCGCCC TGCGAACCAC TGGATCGATG CCGCGTGGTT TTGCTTTACG CCCATACCGC ATTTGTACGC AAGACCCATG ATGTTCTGAG 9000
CCATGGCATC GCCAGCCCTG GCCTTCTTTT CCCAATCACT GACGGTCTCT GGCGACAATG TTTCAATCGC GGCGCCGAGC TTAGAAAAAT CATAAGCCGC 9100
CAACGCTTGG TTTGGCATGC CTAACGTTGC GGTACTTGCC AGTACGATCG TCAAGCACGC GCGTACACGG CGTAATTTAA GTCTGATGTC ACAAAACATA 9200
AAAACTCCTC TGTCCCTGTT TTGGTGCCCA AGGTTTAGCC GGCACAGCAC GATAGCTGTG TCACATCCTT CGAAAAAGTC TGTGCCACCA GTTTGAGACC 9300
TTCAACCATG GTGAGGTAAG GGAAAAGCTG GTTAGCCAGA TCCTGGACGG TCATGCGGGC CCGGATGGCA ATCGCGGCAG TCTGGATGAT TTCGCCTGCT 9400
TCCGGTGTCA CAGCCTGGAC TACGAGAATG CGTCCGGACC CTGCCTCAGC AACAAGCTTG ATAAAGCCGC GGGTATCGAA GTTGGCGAGT GCACGCGGCA 9500
CATTGTCGAG CGTGAGCGTA CGGCTTTCGG TTTGCAGACC AATGCGCTGG GCCTCTGCTT CACTGTAGCC TACCGTGGCA ACTTGCGGAT CGGTGAACAC 9600
GACTGCCGGC ATGGCGTCGA GATCGAGCTT TACGTCGCCG CCGGTCATAT TAATGCCAGC CCGGGTACCA GCCGCGGCTG CAACGTAGAC GTACTGAGGC 9700
TGGTCAGTGC AGTCGCCAGC TGCGTAAATG TTCGAAGCAC TCGTGCGCAT GCCTTGATCG ATCAGGATGC GATGTTGCGA GTCAAACGTC ACACCTGCGA 9800
GCTCGAGATT CAGGCTTGCG GTATTTGGCG TGCGGCCGGT GGCAACAAGC AATTGATCCA CCTTTAACTC GCCCTGCGCG GTCGTCAGGA TGAATTGGCG 9900
GTCGCAATAG GTCACGGCAC TCGCCTGCGT CTGTGTCAGC ACCGTGATGC CTTCCGCACG GAAAGCTGCA GTCACGGCCT CGCCAATGGC AGGGTCGTCA 10000
CTGGCAAACA AGGTGCGCCG GGCCAGGACC GTCACCTCGC TGCCCAAGCG CGCAAACGCT TGTGCCAGTT CCAGTGCTAC GACCGAGGCG CCAATTACTG 10100
CCAGCCGTTT AGGAATGGCT TCGCTAACCA GGGCTTCGGT GGACGTCCAA TATGGCGTTC CTGCAAGACC AGGCAACGGC GGGATGGCAG CGCTTGCACC 10200
AGTGGCAATG AGACAGCGAT CGAAGGCCAC TTCGCGAGTG TCACCTTCTG CCGCTTCCAC GATAAGCGTT TGGCCATCCT TGAATCGGGC GCTGCCTCGC 10300
ACCAGATTGA TCGACGGATT GCTTTCGAGG ATGCTTTCAT ATTTAGCATG ACGCAGCTCG TCCACACGGG CCTGCTGTTG CGCCAGCAGA CGGTCACGCA 10400
GGACAACTGG GGGCGTTGCC GACAGTCCCA CGTCAAAAGG ACTTTCGCGT CGCAGATGAG CGATATGCGC TGCCCGAATC ATGATCTTAG AAGGGACGCA 10500
GCCCACGTTG ACGCAGGTGC CGCCGATAGT TCCCCGCTCG ATCAGGGTGA CACGGGCACC GTTTTCAGCT GCCTTCAGCG CTGCTGCCAC AGCGGCGCCG 10600
CCAGTACCGA TGATCGCCAC ATGCAGCTGA CCTGCCGGTT TCTCCACCGC GCGCTCGCCG CTCAGCCACT GGGCGGCTTT GCCAAGGAGT CCTGGCTGTT 10700
CAACACTGCG CTCTTTGCTT TCAAATTCGT CTGTAGGCGT AGCACCGTAA CCAAGGGCGC CCACTGCTGC TACGAGCGCG GACACGTCAA GCCTATTGCC 10800
TGTGACTTGC GCCTTGTTCG ATGGATAATC GACAGCCGCT CGCGTTACGC CGGGGACCGC CAGCAGCGCG TCCTCAACGT GCCTTGTACA ACTTGGGCAA 10900
GTCATTCCCG CCACGCTCAA ACCAACTTCG TTTAACTCTT GCATTCCGCT CTCTCTTTTA TCCGTTGTGC CCAAGCCAAC TAGTACGACG CCTCGGTCAC 11000
TGAATCGGTT GTGCGCTTCA GGGAACAGGC TCCGTGGGTC GTTTACAGGC TGCTCACGCG ATGACGACCG GCAGAATCGC CTGAGCGGCA TCTCTGCCGT 11100
GGTTCTAGCG ATGCTGCCGT TATTTGTTGC CTTACTTCTT CAGCTCAGAT GGGTAGCCTA CATCCGTCGT CGCCTTGCTA AGCGCCTGGA CGTTGGTCTT 11200
CGCATCGTCA AACGTTACGA CGGCAAGCTT GGTTGAGAAG TCGACCTCGG TCTTGCTAAC ACCGTCGACC TTGCTCAAGG CCTTTTTAAT GGTGATAGGA 11300
CAAGTCGCGC AGGTCATGCC AGGAACGGCC AAGGTGACCG TCTGCGTGGC GGCCCACACC GGGGCAACAG CGGCGGCGAG GGCAAGGGAG GCAAACAGTT 11400
TCTTCATGAT GAACTCCTGG TTAATAGAAA AATGGAACGA CATAGGGAAA TCCAAGCGCG ACCAGGACCA GCACGGCCAC GATCCAGAAA ATCAGCTTGT 11500
AGGTGGCGCG CACCTGCGGA ATCGCGCAGA CCTCACCTGG CTTGCATGCC TGCACGGGCC GGTAAATCCG CTTCCAGGCG AAGAACAGCG CCACTAGCGC 11600
CGCGCCGATG AACAACGGTC GATAGGGTTC CAGCACCGTC AGGTTGCCGA TCCAAGCACC GGAGAAGCCC AGGGCGACCA GTACTAGCGG CCCCAGGCAG 11700
CAGGTCGATG CAAGAATGGC GGCCAGCCCG CCGGCGAAGA GCGCACCGCG CCCGTTTTGT GGTTCAGACA TACGTTGGCC CTTTTGAATT TGGATTGGAT 11800
AGCGTAACCT TACTTCCGTA CTCATGTACG GAGTCAAGCG ATATGGAAAA TAATTTGGAA AACCTGACCA TTGGCGTTTT TGCCAAGGCG GCCGGGGTCA 11900
ACGTGGAGAC AATCCGCTTC TATCAGCGCA AGGGCCTGTT GCCGGAACCG GACAAGCCTT ACGGCAGCAT CCGCCGCTAT GGGGAGGCGG ACGTGGTTCG 12000
GGTGAAATTC GTGAAATCGG CACAGCGGCT GGGGTTCAGT CTGGACGAGA TTGCCGAGCT GTTGCGGCTC GACGATGGCA CCCACTGCGA GGAGGCCAGC 12100
AGCCTGGCCG AGCACAAGCT CCAAGACGTG CGCGAGAAGA TGGCCGATTT GGCGCGCATG GAGGCTGTGC TGTCTGACCT GGTGTGCGCC TGCCATTCGC 12200
GGCAGGGGAA TGTTTCGTGT CCGCTGATTG CGTCGTTGCA AGGTGGAACG AGCTTGGCAG GGGCTTCCAC AGCTTAGCGT GCTTTATTTT CCGTTTTCTG 12300
AGACGACCAC AGATCAGAAA AGACTGTGCA GCCGACTCTT GATATTCCGT GCAGTCACCT TCTGAAAATG ACA
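
The listing above prints the sequence in 10-base blocks, with a running coordinate at the end of each full row. The following is a minimal Python sketch of how such rows might be joined into one contiguous string and how the two ends of the element can be compared as inverted repeats. The helper names (`join_rows`, `revcomp`) and the 25-bp comparison window are assumptions made for illustration; the record itself does not state the length of the terminal repeats.

```python
import re

def join_rows(rows):
    """Join listing rows, dropping spaces and trailing running coordinates."""
    return "".join(re.sub(r"[^ACGT]", "", row.upper()) for row in rows)

def revcomp(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# First and last rows copied from the listing above.
first_row = ("TGTCATTTTC AGAAGACGAC TGCACCAATT GACGGGGCGT AACGCCAGAT "
             "GTGCAGGCGA CTCCTGACAA CGCAATATCA GAAGTCATCT GCACCAATCT 100")
last_row = ("AGACGACCAC AGATCAGAAA AGACTGTGCA GCCGACTCTT GATATTCCGT "
            "GCAGTCACCT TCTGAAAATG ACA")

WINDOW = 25  # assumed window size, chosen only for this illustration

left_end = join_rows([first_row])[:WINDOW]
right_end = revcomp(join_rows([last_row])[-WINDOW:])

# Count positions where the left end equals the reverse complement
# of the right end.
matches = sum(a == b for a, b in zip(left_end, right_end))
print(left_end)
print(right_end)
print(f"{matches}/{WINDOW} positions match")
```

The last row starts at coordinate 12301 and contributes 73 bases, which agrees with the stated length of 12373.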

 Recombination Sites     

Name Coordinates Length Sequence
repeat t3 78-97 20 TCAGAAGTCA TCTGCACCAA
r5 3875-3888 14 AATTTCTGTC ACAT
r3 3931-3944 14 GGGTTAAGTG ACAA
res 3975-4004 30 CTGTCACATA ATCGAACGTA TATGTGACAG
r2 3975-3988 14 CTGTCACATA ATCG
r1 3991-4004 14 CGTATATGTG ACAG
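
The coordinates in this table appear to be 1-based and inclusive, so a site's length is end - start + 1. The short Python sketch below checks that reading against the entries listed above, using only sequence copied from this record. The function names are hypothetical helpers, not part of any database tooling.

```python
def site_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def extract(seq, start, end):
    """Slice a 1-based inclusive interval out of a 0-based Python string."""
    return seq[start - 1:end]

def revcomp(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Bases 1-100 of the element, copied from the sequence listing.
first_100 = ("TGTCATTTTC AGAAGACGAC TGCACCAATT GACGGGGCGT AACGCCAGAT "
             "GTGCAGGCGA CTCCTGACAA CGCAATATCA GAAGTCATCT GCACCAATCT").replace(" ", "")

# Site sequences copied from the table above.
t3 = "TCAGAAGTCATCTGCACCAA"
res = "CTGTCACATAATCGAACGTATATGTGACAG"
r2 = "CTGTCACATAATCG"
r1 = "CGTATATGTGACAG"

print(site_length(78, 97), extract(first_100, 78, 97) == t3)
print(site_length(3975, 4004), res.startswith(r2), res.endswith(r1))

# r1 and r2 read as imperfect inverted repeats of each other.
identical = sum(a == b for a, b in zip(r2, revcomp(r1)))
print(f"r2 vs revcomp(r1): {identical}/{len(r2)} identical")
```

Under this reading, res (3975-4004) consists of r2, a 2-bp spacer, and r1.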

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA Tn5058 144-1826 Transposase   +
tniB Tn5058 1829-2737 Accessory Gene   +
tniQ Tn5058 2734-3951 Accessory Gene Target Site Selection +
tniR Tn5058 4012-4626 Accessory Gene Resolvase +
merE Tn5058 5003-5239 Passenger Gene Heavy Metal Resistance -
merD2 Tn5058 5236-5601 Passenger Gene Heavy Metal Resistance -
merB2 Tn5058 5712-6350 Passenger Gene Heavy Metal Resistance -
merR Tn5058 6419-6853 Passenger Gene Heavy Metal Resistance +
merD1 Tn5058 7486-7832 Passenger Gene Heavy Metal Resistance -
merB1 Tn5058 7829-8467 Passenger Gene Heavy Metal Resistance -
merG Tn5058 8681-9199 Passenger Gene Heavy Metal Resistance -
merA Tn5058 9235-10944 Passenger Gene Heavy Metal Resistance -
merP Tn5058 11132-11407 Passenger Gene Heavy Metal Resistance -
merT Tn5058 11421-11771 Passenger Gene Heavy Metal Resistance -
merR1 Tn5058 11843-12277 Passenger Gene Heavy Metal Resistance +
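
As a quick consistency check on the summary table, the Python sketch below parses a few rows, derives each gene length from its coordinates, and flags any ORF whose length is not a multiple of three. The regular expression and helper names are assumptions about this flattened layout, not a documented export format.

```python
import re

# name, associated TE, start-end, free-text class columns, strand
ROW = re.compile(r"^(\S+)\s+\S+\s+(\d+)-(\d+)\s.*([+-])\s*$")

rows = [
    "tniA Tn5058 144-1826 Transposase   +",
    "tniR Tn5058 4012-4626 Accessory Gene Resolvase +",
    "merD1 Tn5058 7486-7832 Passenger Gene Heavy Metal Resistance -",
    "merT Tn5058 11421-11771 Passenger Gene Heavy Metal Resistance -",
]

def parse_row(row):
    """Return (name, start, end, strand) for one summary row."""
    m = ROW.match(row)
    if m is None:
        raise ValueError(f"unrecognised row: {row!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

orfs = [parse_row(r) for r in rows]
lengths = {name: end - start + 1 for name, start, end, _ in orfs}
out_of_frame = [name for name, n in lengths.items() if n % 3 != 0]

print(lengths)
print("length not a multiple of 3:", out_of_frame)
```

Of the rows shown, only merD1 (347 bp) is flagged.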

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA Tn5058 1683 144-1826 +
Class:   Transposase
Transposase Chemistry:   DDE
Comment:   homologous to TnsB of Tn7
Protein Sequence:  
MASDTSLIAE QGVATLPDAA WAQARQRAEI IGPLAALDVV GHEAADAAAH ALGLSRRQVY VLIRRARQGA GLVTDLARSR SGGGKGKGRL PESVERIIRE
LLQKRFLTKQ KRSLAAFHRE VAQACKAQKL RAPARNTVAL RIAGLDPLKA TRRREGQDAS RSLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDERDRQPI
GRPYLTIAID VFTRCVLGMV VTLEAPSSVS VGLCLVHVAC DKRPWLEGLK IEMEWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAIARTGV PTVITRTTAF LVDFLPIIRR
TLTRTGFVID HIHYYADALK PWIARRDRLP AFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVSTAQKAT RKARRDADRR QHLKATAVLF KTTPPPDADM ADPQADNQPP AKPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB TniB Tn5058 909 1829-2737 +
Class:   Accessory Gene
Transposase Chemistry:   Serine
Comment:   homologous to TnsC protein of Tn7; putative ATP-binding protein
Protein Sequence:  
VEEYPIIDLS HLMPVAQGLA RLPADERIHR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSSIAT LDMARYLLTR SEGTIGELAH LLMAAAVAAV ESGEEAINHR TLSMADYTGP SERRRQFERE
LM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniQ TniQ Tn5058 1218 2734-3951 +
Class:   Accessory Gene
Sub Class:   Target Site Selection
Function:   putative transposition gene
Comment:   function possibly similar to that of Tn7 tnsD
Protein Sequence:  
MKPAPHWPLH PAPREGEALS SWLNRVALCY HMEVSELLEH DLGHGQVDDL DTAPPLALLA MLSQRSGIEP DRLRCMSFAG WVPWLLDSLD DQIPDALETY
AFQLSVLLPK LRRRTRSITS WRAWLPSQPI HRACPLCLND PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW
QALTTGHVEL PRRRIHAALW FRLLRTLLDE LNTPLSTCGT YAGYLRQIWE CCGHPLRAGQ SLWRPYETLN PAVRLQMLEA AATAISLIEV RDISPPGEHA
KLFWSEPQTG FTSGLPAKAP KPEPVDHWQR AIQAIDEAII EARHNPETAR SLFALASYGR RDPASLEQLR ATFAKEGIPP EFLSHYEPSL PFACLRQNDG
LSDKF

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniR TniR Tn5058 615 4012-4626 +
Class:   Accessory Gene
Sub Class:   Resolvase
Function:   putative resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   resolution of cointegrates; protein accession: ACE81792.1
Protein Sequence:  
MLIGYMRVSK ADGSQATDLQ RDALVAAGVD PAHLYEDQAS GKREDRPGLA SCLKALRPGD TLVVWKLDRL GRDLRNLINT VHDLTERGIG LKVLTGHVAA
IDTTTAAGKL VFGIFAALAE FERELIAERT VAGLASAQTR GRKGGRPFKM TAAKLRLAMA AMGQSQTKVG DLCQELGITR QTLYRHISPK GELRPDGEKL
LSRI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn5058 237 5003-5239 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Comment:   similar to urf-1 in pKLH2 (GenBank AF213017), pKLH272 (GenBank Y08992), pMER610 (GenBank Y08993), pKLH210 (GenBank Y10102), Tn5036 (GenBank Y09025), orf1 in Tn501 (GenBank Z00027), and urf-1 in Tn5041 (GenBank X98999)
Protein Sequence:  
MNSPERIPAE THKPFTGYLW GTLAVLTCPC HLSILAAVLA GTTAGAFLVE YWVITALGLT GLFLLSLARA LRAFRERS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD2 MerD2 Tn5058 366 5236-5601 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MNAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTTGGYG LFDDAALQRL CFVRAAFEAG IGLGALARLC RALDAANCDE TAAQLAVLRQ FVERRREALA
NLEVQLAAMP TAPAQHAESL P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merB2 MerB2 Tn5058 639 5712-6350 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Transposase Chemistry:   merB1 (Tn5058) Chemistry
Target:   Mercury
Comment:   organomercurial lyase
Protein Sequence:  
MKLAPYILER LTPTNRPKGF AEFLVALLGE LAKGRPISRT TLARTLGWPA EQVAAVLEQA TSTEYDNDGN IIGYVLTLRE TSHVLEIDNR RLYAWCALDT
LMLPALIGRT ARVSSHCAAT GAPVSLTVSP SEIQAVEPAG MAVSLVLPQE AADVRQSFCC HVHFFASVPT AEDWASKHQG LEGLAIVSVH EAFGLGQEFN
RHLLQTMSSR TP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn5058 435 6419-6853 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   putative regulatory gene for mercury resistance
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGAAD VTRVRFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLQDVREKM
ADLARMEAVL SDLVCACHSR QGNVSCPLIA SLQGGTSLAG ASTA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD1 MerD1 Tn5058 347 7486-7832 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MNAYSISKLA DDACVQCMSC ATT*YAACCT LLGEPKVART FSMTKHSEGC GFSAQPSSPE LGSTNWRDFA KHWTQTIAKA WTDASSACDG RSTCVEQF*L
SSRPL*MKWR PVQLL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merB1 MerB1 Tn5058 639 7829-8467 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Transposase Chemistry:   merB1 (Tn5058) Chemistry
Target:   Mercury
Comment:   organomercurial lyase
Protein Sequence:  
MDKTIYSKKI AESLSSGNHP KEFATLFVAL LRQLAMGGPV SREKLAGALG WSGARVATVL EQAPGTEYDD EANIIGLGLT LRETSHVFEV DGRHLYTWCA
LDTLMFPALT GKIARVTSPC AATGRPITLT VAPEAVLHVE PAEAMVSLRT PDTSPDIRCS FCCHVHFFAS PSIANSWAST HQGIEVVPVE SAFDLGHDVA
LKLLEDCEES PV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merG MerG Tn5058 519 8681-9199 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MFCDIRLKLR RVRACLTIVL ASTATLGMPN QALAAYDFSK LGAAIETLSP ETVSDWEKKA RAGDAMAQNI MGLAYKCGMG VKQNHAASIQ WFAGRPSRVK
PMHSSTSGAS TEVKSTVCIR MGVRLLPTMQ RLSSGTASLP SRDTRRLKSG WPSCLQRVLP TSRPTRCRHT SG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn5058 1710 9235-10944 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MQELNEVGLS VAGMTCPSCT RHVEDALLAV PGVTRAAVDY PSNKAQVTGN RLDVSALVAA VGALGYGATP TDEFESKERS VEQPGLLGKA AQWLSGERAV
EKPAGQLHVA IIGTGGAAVA AALKAAENGA RVTLIERGTI GGTCVNVGCV PSKIMIRAAH IAHLRRESPF DVGLSATPPV VLRDRLLAQQ QARVDELRHA
KYESILESNP SINLVRGSAR FKDGQTLIVE AAEGDTREVA FDRCLIATGA SAAIPPLPGL AGTPYWTSTE ALVSEAIPKR LAVIGASVVA LELAQAFARL
GSEVTVLARR TLFASDDPAI GEAVTAAFRA EGITVLTQTQ ASAVTYCDRQ FILTTAQGEL KVDQLLVATG RTPNTASLNL ELAGVTFDSQ HRILIDQGMR
TSASNIYAAG DCTDQPQYVY VAAAAGTRAG INMTGGDVKL DLDAMPAVVF TDPQVATVGY SEAEAQRIGL QTESRTLTLD NVPRALANFD TRGFIKLVAE
AGSGRILVVQ AVTPEAGEII QTAAIAIRAR MTVQDLANQL FPYLTMVEGL KLVAQTFSKD VTQLSCCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn5058 276 11132-11407 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   periplasmic mercuric ion binding protein
Protein Sequence:  
MKKLFASLAL AAAVAPVWAA TQTVTLAVPG MTCATCPITI KKALSKVDGV SKTEVDFSTK LAVVTFDDAK TNVQALSKAT TDVGYPSELK K

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn5058 351 11421-11771 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   cytosolic mercuric ion transport protein
Target:   Mercury
Protein Sequence:  
MSEPQNGRGA LFAGGLAAIL ASTCCLGPLV LVALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAWKRIY RPVQACKPGE VCAIPQVRAT YKLIFWIVAV
LVLVALGFPY VVPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR1 MerR1 Tn5058 435 11843-12277 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGEAD VVRVKFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLQDVREKM
ADLARMEAVL SDLVCACHSR QGNVSCPLIA SLQGGTSLAG ASTA
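
The protein sequences above can be related back to the DNA coordinates by translating the top strand for "+" genes and the reverse complement for "-" genes. The following is a minimal Python sketch using the standard genetic code and two short fragments copied from the sequence listing. The helper names are hypothetical, and the choice of the standard codon table is an assumption, since the record does not name the translation table used.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def revcomp(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# tniR is on the "+" strand: bases 4012-4041 of the listing.
tniR_start = "ATGCTGATTGGCTACATGCGAGTATCGAAG"

# merT is on the "-" strand: top-strand bases 11754-11771, i.e. the
# last 18 bases of its interval, which encode the protein's N-terminus.
merT_top_strand_end = "GTTTTGTGGTTCAGACAT"

print(translate(tniR_start))                    # compare with the start of TniR
print(translate(revcomp(merT_top_strand_end)))  # compare with the start of MerT
```

For a "-" gene the start codon sits at the high-coordinate end of the interval, which is why the merT fragment is taken from the end of 11421-11771.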

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t2 Tn5058 49-67 TACACGTCCG CTGAGGACT
repeat t4 Tn5058 110-128 TCAATACTCG TGTGCACCA
IRL Tn21 6849-6886 GAATCGCACG AAATAAAAGG CAAAAGACTC CACGGGGG
repeat i4 Tn5053 6859-6877 AAATAAAAGG CAAAAGACT
IR Tn21-like Tn5058 12273-12310 GAATCGCACG AAATAAAAGG CAAAAGACTC TGCTGGTG
repeat i4 Tn5058 12283-12301 AAATAAAAGG CAAAAGACT
repeat i1 Tn5058 12347-12365 GGCACGTCAG TGGAAGACT

 References     

Mindlin S, Minakhin L, Petrova M, Kholodii G, Minakhina S, Gorlenko Z, Nikiforov V. Present-day mercury resistance transposons are common in bacteria preserved in permafrost grounds since the Upper Pleistocene. Res Microbiol. 2005 Dec;156(10):994-1004. doi: 10.1016/j.resmic.2005.05.011. Epub 2005 Jul 18. PubMed ID: 16084067