|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
|
|
|
|
|
|
|
|
|
|
|
Name: Tn5058 |
|
Family: Tn402 Group: Tn5053 |
|
Evidence of Transposition: Yes |
|
|
Host |
|
|
Host Organism: | Pseudomonas sp. ED23-33 | | |
Place of Origin: | Edoma suite sediments near the river Khomus Yurakh, Russia | Date of Isolation: | 2005 |
| | Other Geographic Information: | Permafrost |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAATT GACGGGGCGT AACGCCAGAT GTGCAGGCGA CTCCTGACAA CGCAATATCA GAAGTCATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGTG AGCATGGCGT CAGACACATC ATTGATTGCC GAGCAAGGCG TGGCCACCCT GCCCGATGCG 200
GCTTGGGCGC AGGCCCGGCA ACGGGCGGAA ATCATCGGGC CGCTGGCAGC GCTTGATGTG GTCGGGCATG AAGCCGCCGA TGCCGCTGCT CACGCGCTTG 300
GCCTGTCCAG GCGGCAGGTG TATGTCCTAA TCCGCCGTGC CCGGCAAGGT GCTGGGCTTG TGACGGACCT GGCTCGCAGC CGATCCGGCG GCGGAAAAGG 400
CAAGGGACGC TTGCCGGAAT CAGTTGAGCG CATCATCCGC GAGTTGCTGC AAAAGCGCTT CCTGACCAAG CAGAAGCGTA GCCTGGCAGC GTTCCACCGC 500
GAGGTCGCGC AGGCTTGCAA AGCGCAAAAG CTGCGGGCGC CGGCGCGCAA CACCGTGGCT CTGCGGATCG CCGGCCTCGA TCCGCTCAAG GCCACTCGCC 600
GCCGGGAAGG TCAGGATGCG TCCCGCAGCC TGCAAGGTGT CGGTGGTGAG CCTCCCGCCG TGACCGCGCC ACTGGAACAA GTGCAGATTG ATCACACGGT 700
CATCGACCTG ATCGTGGTGG ACGAGCGCGA CCGGCAACCG ATTGGCCGTC CGTATCTGAC CATCGCCATC GACGTGTTTA CCCGCTGCGT GCTCGGCATG 800
GTCGTCACGC TGGAAGCGCC GTCATCTGTT TCGGTCGGCC TGTGCCTTGT GCATGTCGCC TGCGACAAGC GTCCCTGGCT GGAGGGTCTG AAGATAGAAA 900
TGGAGTGGCC GATGAGCGGC AAGCCCAGGC TGCTCTACCT GGACAACGCG GCCGAGTTCA AGAGCGAAGC GCTACGCCGA GGCTGCGAGC AGCATGGCAT 1000
CCGGCTTGAC TATCGCCCGC TCGGGCAGCC GCACTACGGC GGCATCGTGG AACGGATCAT CGGCACGGCG ATGCAGATGA TCCACGACGA ATTGCCAGGG 1100
ACGACCTTCT CCAACCCTGA CCAGCGCGGC GACTACGATT CCGAAAACAA GGCCGCCCTG ACGCTGCGTG AGCTGGAGCG CTGGCTCACA TTGGCGGTCG 1200
GCACCTACCA CGGCTCCGTG CACAACGGCC TGCTCCAGCC GCCGGCAGCG CGCTGGGCCG AAGCTATCGC GCGGACCGGC GTGCCAACCG TCATCACTCG 1300
CACCACGGCT TTTCTGGTCG ATTTTCTGCC CATCATCCGC CGCACGCTGA CCCGCACCGG CTTCGTCATC GACCACATCC ATTACTACGC CGATGCGCTC 1400
AAGCCGTGGA TAGCTCGGCG CGACCGCTTG CCTGCGTTCC TGATCCGGCG CGACCCGCGC GACATCAGCC GCATTTGGGT GCTGGAGCCG GAGGGGCAGC 1500
ACTATCTGGA AATTCCATAC CGCACCTTGT CGCACCCGGC TGTCACCCTC TGGGAACAAC GACAGGCGCT GGCGAAATTG CGGCAGCAAG GGCGCGAACA 1600
GGTGGATGAG TCGGCGCTGT TTCGCATGAT CGGCCAGATG CGCGAAATCG TGTCCACCGC GCAGAAAGCT ACGCGCAAGG CGCGGCGCGA CGCGGATCGA 1700
CGCCAGCATC TCAAGGCAAC GGCAGTTCTT TTCAAAACCA CGCCACCACC GGACGCGGAC ATGGCTGACC CGCAGGCAGA CAACCAGCCA CCTGCCAAAC 1800
CGTTCGACCA GATTGAGGAG TGGTAGCCGT GGAAGAATAT CCCATCATCG ACTTGTCCCA CCTGATGCCG GTGGCCCAGG GCTTGGCCCG TCTTCCGGCG 1900
GACGAACGCA TCCATCGCCT TCGCGCTGAC CGCTGGATCG GCTATCCGCG AGCAGTCGAG GCGCTGAATC GGCTGGAAGC CCTGTATGCG TGGCCGAACA 2000
AACAACGCAT GCCCAACCTG CTGTTGGTCG GTCCAACCAA CAACGGCAAG TCGATGATCG TCGAGAAATT CCGCCGCACC CACCCGGCCA GCTCCGACGC 2100
CGACCAGGAG CACATTCCGG TACTGGTCGT GCAGATGCCA TCCGAACCGT CGGTAATCCG CTTCTACGTC GCGCTACTTG CCGCGATGGG CGCGCCATTG 2200
CGCCCGCGCC CACGGCTGCC GGAAATGGAG CAATTGGCGC TGGCACTGCT ACGCAAGGTC GGCGTGCGCA TGCTGGTGAT CGACGAATTG CACAACGTCC 2300
TGGCCGGCAA CAGCGTCAAC CGCCGGGAAT TCCTCAACCT GCTGCGTTTC CTCGGCAACG AGCTGCGCAT CCCGCTGGTC GGGGTCGGCA CACGCGATGC 2400
CTACTTGGCG ATCCGCTCGG ACGACCAGTT GGAAAACCGC TTCGAGCCGA TGATGCTGCC GGTGTGGGAG GCCAACGACG ATTGCTGCTC ACTGCTGGCC 2500
AGCTTCGCGG CTTCGCTCCC GCTGCGCCGA CCCTCGTCGA TTGCCACGCT GGATATGGCC CGCTACCTGC TCACGCGCAG CGAGGGCACC ATCGGCGAGC 2600
TGGCGCACCT GTTGATGGCG GCGGCCGTCG CTGCCGTGGA GAGCGGTGAG GAAGCGATCA ACCATCGCAC GCTCAGCATG GCCGATTACA CCGGTCCCAG 2700
CGAGCGGCGG CGGCAATTCG AGCGGGAACT GATGTGAAGC CAGCGCCACA CTGGCCACTG CATCCGGCTC CCAGGGAAGG CGAAGCCTTG TCTTCGTGGC 2800
TCAACCGCGT GGCCCTTTGC TATCACATGG AGGTGTCCGA GCTGCTGGAG CACGATCTTG GTCACGGCCA GGTTGATGAC CTGGACACCG CGCCACCACT 2900
GGCGCTGCTG GCGATGCTCT CCCAGCGGAG CGGCATCGAG CCGGACCGGC TGCGTTGCAT GAGTTTCGCC GGCTGGGTGC CTTGGCTACT GGACAGCCTT 3000
GATGATCAGA TTCCAGATGC ATTGGAAACC TATGCGTTCC AGCTCTCGGT GCTGCTGCCG AAACTCCGCC GTAGGACGCG ATCCATCACG AGCTGGCGTG 3100
CCTGGCTGCC CAGCCAGCCG ATACATCGCG CCTGCCCGCT CTGTCTGAAC GATCCGGAGA ACCAAGCCGT ACTGCTCGCG TGGAAGCTGC CCCTGATGCT 3200
GAGCTGCCCA CTGCATGGCT GCTGGCTGGA ATCCTATTGG GGCGTGCCAG GGCGGTTTCT CGGCTGGGAG AACGCCGACG CCGAACCGCG CACTGCCAGC 3300
GACGCGATTG CGGCGATGGA CCAGCGTACC TGGCAGGCAC TGACGACCGG CCACGTGGAG TTGCCGCGCC GACGCATCCA CGCCGCATTG TGGTTTAGGC 3400
TGCTACGCAC GCTGCTCGAT GAGCTGAACA CCCCGCTTTC GACGTGCGGC ACCTACGCGG GGTATCTCCG CCAAATCTGG GAATGCTGCG GGCATCCGCT 3500
GCGTGCTGGG CAAAGTCTGT GGCGACCGTA TGAAACCCTG AACCCGGCAG TACGGTTGCA GATGCTGGAG GCGGCGGCAA CGGCAATCAG CTTGATTGAG 3600
GTGAGGGATA TAAGCCCGCC AGGCGAGCAC GCAAAGCTAT TCTGGTCCGA GCCCCAAACC GGGTTCACCA GTGGCCTGCC GGCGAAAGCG CCGAAGCCCG 3700
AACCCGTCGA TCACTGGCAG CGTGCAATCC AGGCCATTGA TGAGGCCATC ATTGAAGCAC GACACAACCC CGAGACGGCA CGCTCGCTGT TCGCGTTGGC 3800
TTCCTATGGT CGGCGCGACC CCGCTTCCTT GGAACAGTTG CGCGCCACCT TCGCGAAGGA AGGCATCCCC CCGGAATTTC TGTCACATTA TGAGCCTAGC 3900
CTACCCTTTG CATGCCTTAG ACAGAATGAC GGGTTAAGTG ACAAATTTTG ACGTGCAGAG CTTCCCGATG CAAACTGTCA CATAATCGAA CGTATATGTG 4000
ACAGGTACAA GATGCTGATT GGCTACATGC GAGTATCGAA GGCGGATGGC TCCCAGGCGA CGGACTTGCA GCGCGATGCG CTGGTCGCGG CCGGTGTTGA 4100
TCCGGCGCAT CTCTATGAAG ACCAAGCGTC CGGAAAACGC GAGGATCGTC CCGGTCTGGC GAGCTGTCTG AAGGCACTAC GGCCAGGCGA CACGTTGGTC 4200
GTTTGGAAAC TGGATCGGCT CGGGCGCGAC CTGCGCAATC TGATCAATAC CGTCCATGAC CTGACCGAAC GCGGCATCGG CCTCAAGGTG CTGACCGGGC 4300
ACGTCGCGGC CATTGACACC ACGACCGCCG CCGGCAAGCT GGTCTTCGGT ATCTTCGCCG CGCTGGCCGA GTTCGAGCGC GAGTTGATCG CCGAGCGCAC 4400
CGTGGCGGGC CTGGCCTCAG CACAAACACG CGGCCGGAAA GGCGGCCGGC CGTTCAAGAT GACCGCCGCC AAGCTGCGGC TGGCGATGGC GGCAATGGGG 4500
CAGTCACAGA CCAAGGTCGG CGACCTGTGC CAGGAACTTG GCATCACGCG GCAGACCCTG TATCGGCATA TTTCACCCAA GGGCGAGCTA CGTCCAGATG 4600
GCGAGAAGCT ACTCAGCCGA ATTTGATGCC GACATGAGGC GACGTAGCGA AAGCGTGGTT TGTCTCAATT TGACGGCGGC GAACCGCAAG CGTTCGGTTC 4700
GCCGCTCGTT TCGGTCGCAG TGCTGGCCCG CGCCTGGAAA CGTTGATAAC ACTCCAGCCC GCAGAAGTGC TCGACGTACT CCGCGCCTTC CGGTGTGAAG 4800
GCGGCATCGA GCGGAATTTC CTTTGACCAG CACACGCAGC AACTGGTGGC GGTCGGATCA TTTGCATTCA TGGTGGCACC CCTCCATTGA CTGACGAAGA 4900
CGGCGAATGC CGCGCCGGCA TCGGCTTTGC GAACAGAAAG CCTTGCCCGC GCTCGAACGC TTGGGCCCTC CGCCGTAGTC CATCCATCCG GCCGGGAAGC 5000
GCTCATGATC TTTCCCTGAA TGCCCGCAAC GCCCGCGCCA GTGACAGAAG AAACAGGCCG GTCAAACCGA GCGCCGTGAT GACCCAATAC TCGACAAGGA 5100
ATGCACCGGC GGTTGTGCCG GCCAGCACGG CAGCGAGGAT GGACAGGTGG CAGGGGCAAG TCAGCACGGC CAGCGTGCCC CACAGGTAGC CGGTGAACGG 5200
TTTGTGCGTC TCGGCCGGGA TGCGCTCGGG GCTGTTCATG GCAAACTCTC CGCATGCTGT GCCGGCGCGG TCGGCATGGC GGCCAACTGC ACTTCCAGAT 5300
TGGCCAACGC TTCGCGCCGG CGTTCGACGA ACTGACGCAG CACAGCAAGC TGCGCGGCAG TTTCATCGCA GTTCGCCGCA TCCAGCGCCC GGCACAGCCG 5400
CGCCAATGCG CCGAGGCCGA TGCCCGCCTC GAAGGCGGCC CGCACGAAGC ACAGTCGCTG CAAGGCGGCG TCATCGAACA GGCCGTAGCC ACCCGTGGTG 5500
CAGGCGACTG GCCGCAGCAA TCCGCGCAGC AGGTAGTCGC GCACGATATG CACGCTCACC CCGGCATCAA GGGCCAGCCG GGACACCGTG TAGGCGTTCA 5600
TCGAACACCT CCTTTTGGTC GGTTCACGGC AATGCATATA CCGTTTCGCC GAGTTCAATC CGGCGCTGCG AATCCGATGC CGGTGCCGTA GAACATTGGG 5700
TCGATATCCG ATCACGGTGT CCTAGATGAC ATGGTCTGCA ACAGATGTCG ATTAAACTCC TGGCCCAAGC CGAAAGCCTC GTGGACACTG ACGATCGCCA 5800
ATCCTTCCAA TCCTTGATGC TTGGAGGCCC AGTCTTCCGC CGTCGGGACA GATGCAAAGA AATGTACATG GCAACAGAAG GACTGACGAA CGTCGGCTGC 5900
TTCCTGCGGC AATACCAAGG ACACCGCCAT GCCGGCAGGT TCGACAGCCT GTATCTCGCT GGGTGAAACC GTAAGCGAAA CGGGTGCTCC AGTTGCAGCG 6000
CAATGCGACG AGACGCGAGC TGTACGGCCG ATTAACGCCG GGAGCATCAG GGTGTCCAGC GCGCACCAGG CATACAGCCG GCGGTTGTCA ATTTCCAAGA 6100
CATGCGACGT CTCGCGCAAG GTGAGGACAT AGCCGATGAT GTTCCCATCG TTGTCGTATT CGGTACTGGT GGCTTGTTCG AGCACAGCCG CCACTTGCTC 6200
AGCGGGCCAG CCAAGAGTCC TGGCAAGCGT AGTTCGCGAA ATCGGGCGTC CCTTGGCAAG TTCACCCAGT AGCGCGACCA AGAACTCCGC AAAACCTTTG 6300
GGCCGGTTGG TCGGGGTGAG ACGTTCTAAA ATATATGGGG CGAGCTTCAT GGTTCCATCT CCTATTTGAA TTGGATAGCG TAACCTTACC TTCGTACTCA 6400
TGTACGGAGT CAAGCGGTAT GGAAAATAAT TTGGAGAACC TGACCATTGG CGTTTTTGCC AAGGCAGCCG GCGTCAACGT GGAGACCATC CGGTTCTATC 6500
AGCGCAAGGG CTTGTTGCCC GAGCCGGACA AGCCCTATGG CAGCATTCGC CGCTATGGCG CGGCGGACGT GACACGGGTG CGATTCGTGA AATCGGCCCA 6600
GCGGTTGGGC TTCAGCCTGG ATGAAATCGC CGAGTTGCTG CGGCTCGACG ATGGCACCCA CTGCGAGGAG GCCAGCAGCC TGGCCGAGCA CAAGCTCCAA 6700
GACGTGCGCG AGAAGATGGC CGATTTGGCG CGCATGGAGG CTGTGCTGTC TGACCTGGTG TGCGCCTGCC ATTCGCGGCA GGGGAATGTT TCGTGTCCGC 6800
TGATTGCGTC GTTGCAAGGT GGAACGAGCT TGGCAGGGGC TTCCACAGCT TAGCGTGCTT TATTTTCCGT TTTCTGAGGT GCCCCCTACC AGCAACGCAC 6900
CGCCGAAGAC CGGCAGCAGG AGCGCGAGCA ATTCCAGATG GTTAACCGGG GCCTGAAAGA CCAGATTCAA GACCTGCAAA ACCGGCTCAT CCAAGCCGAG 7000
TCAAGAGCGT CCGAGCTGAT AGACACTAAT GCGCAATTAC AACGACAAAC GCATGAGCTC GAACAGGCCA ATGCGATGCA CAAAAGCACC TTGGATAGGC 7100
AAGTGGCCGA TATCCAGAAC TTGGAGCATC AACTTAACGA CGCCTCGGCA AAAAACCGGG AGCGGAAGCA TGAAAGCAAA CAATTGGCGG ATAATATTGC 7200
CGCACTCGCA AGCCAGAAAG CAGATGCCGA TAAGCAGGTG GCCGTACTAT CCCAAGCATT AGAGGTCACC AAAACGGATC TGAAAGCCTC TCAGAATAGA 7300
GCGGAAGTTC TGTCCGATGA GAACAAGTTG ATTCTTCAGA AAAAGCGGTG ATTCAGGGGC AGTTTAAGCA GCTTCAGGAG TCACTGTCGC GATAAAGGCT 7400
TTAGTCAAAC ATTCATCGAA AAAGCTATAT TAATCAATAT ATTATATACG AAGTGGCTAA TTCCAACGCT TTTACCGGCC GGACCTCAAG TAGCTGCACC 7500
GGCCGCCATT TCATTCAAAG AGGCCTCGAC GACAGTCAAA ACTGTTCGAC GCACGTCGAT CTTCCATCGC ACGCGCTCGA CGCATCGGTC CAGGCTTTCG 7600
CTATCGTCTG CGTCCAATGC TTTGCAAAGT CGCGCCAGTT CGTCGAGCCC AATTCCGGAC TCGAAGGCTG CGCGGAGAAA CCGCAGCCTT CCGAGTGTTT 7700
TGTCATCGAA AATGTTCTAG CCACTTTCGG TTCGCCGAGC AGGGTGCAAC AAGCCGCGTA TCATGTAGTC GCGCACGACA TGCACTGGAC GCAAGCATCA 7800
TCAGCCAGCT TCGATATCGA GTACGCATTC ATACTGGGCT TTCCTCGCAG TCCTCTAGCA GCTTAAGCGC GACATCATGA CCAAGGTCGA ACGCACTCTC 7900
GACAGGCACG ACCTCAATCC CTTGGTGGGT CGAAGCCCAC GAATTTGCAA TAGACGGAGA CGCAAAGAAA TGCACATGAC AGCAAAACGA ACAACGAATG 8000
TCGGGCGAAG TATCCGGAGT TCGCAGTGAA ACCATTGCTT CTGCCGGTTC GACATGAAGC ACTGCTTCTG GTGCAACGGT GAGAGTGATC GGCCTACCGG 8100
TGGCGGCGCA CGGGGAAGTG ACACGGGCAA TCTTGCCGGT CAAGGCGGGA AACATCAAGG TGTCCAGCGC ACACCATGTG TACAAATGAC GGCCATCCAC 8200
CTCAAAAACG TGAGAGGTTT CACGTAGTGT CAGCCCCAGT CCAATGATAT TGGCCTCGTC GTCATATTCC GTACCAGGTG CCTGCTCAAG TACCGTTGCA 8300
ACTCTCGCTC CAGACCATCC GAGTGCGCCG GCGAGCTTTT CGCGTGATAC AGGTCCCCCC ATCGCTAGCT GTCGGAGCAG CGCGACAAAG AGCGTTGCGA 8400
ACTCTTTAGG ATGATTGCCA GATGAAAGGC TTTCGGCAAT CTTTTTGGAA TAAATAGTCT TGTCCATTTT CCACCTGTTA TATCGGGCAG CGATCAACCG 8500
GGATAAACCT TACTCGGCGG TTACGTCCTG CGCTTAACAA AGACGAATTA CTTTGTCCTT GCCGGCGTTG CCATTCCTCG GCCAGCAGCT GGGCTTGCTG 8600
TAACTGTTCG GGCTTCATTT GAGCTGCATA GTCTGCAACG ACTTTTGCTG CTGTCGGTTC TCCCGACGCT GCGGCCAGAC TCATCCACTG GTATGCCTGC 8700
ACCTGGTCGG GCGCGATGTC GGTAAGACCC TTTGCAAACA GCTGGGCCAG CCTGACTTGA GCCTGCGTGT GTCCCTGCTC GGCAGCGAGG CGGTACCACT 8800
TGAAAGCCTC TGCATCGTTG GCAGGAGCCG CACGCCCATT CTTATACATA CCGTCGACTT CACTTCCGTA GAGGCGCCCG AGGTTGAACT GTGCATCGGC 8900
TTCACCCTGC TCGGCCGCCC TGCGAACCAC TGGATCGATG CCGCGTGGTT TTGCTTTACG CCCATACCGC ATTTGTACGC AAGACCCATG ATGTTCTGAG 9000
CCATGGCATC GCCAGCCCTG GCCTTCTTTT CCCAATCACT GACGGTCTCT GGCGACAATG TTTCAATCGC GGCGCCGAGC TTAGAAAAAT CATAAGCCGC 9100
CAACGCTTGG TTTGGCATGC CTAACGTTGC GGTACTTGCC AGTACGATCG TCAAGCACGC GCGTACACGG CGTAATTTAA GTCTGATGTC ACAAAACATA 9200
AAAACTCCTC TGTCCCTGTT TTGGTGCCCA AGGTTTAGCC GGCACAGCAC GATAGCTGTG TCACATCCTT CGAAAAAGTC TGTGCCACCA GTTTGAGACC 9300
TTCAACCATG GTGAGGTAAG GGAAAAGCTG GTTAGCCAGA TCCTGGACGG TCATGCGGGC CCGGATGGCA ATCGCGGCAG TCTGGATGAT TTCGCCTGCT 9400
TCCGGTGTCA CAGCCTGGAC TACGAGAATG CGTCCGGACC CTGCCTCAGC AACAAGCTTG ATAAAGCCGC GGGTATCGAA GTTGGCGAGT GCACGCGGCA 9500
CATTGTCGAG CGTGAGCGTA CGGCTTTCGG TTTGCAGACC AATGCGCTGG GCCTCTGCTT CACTGTAGCC TACCGTGGCA ACTTGCGGAT CGGTGAACAC 9600
GACTGCCGGC ATGGCGTCGA GATCGAGCTT TACGTCGCCG CCGGTCATAT TAATGCCAGC CCGGGTACCA GCCGCGGCTG CAACGTAGAC GTACTGAGGC 9700
TGGTCAGTGC AGTCGCCAGC TGCGTAAATG TTCGAAGCAC TCGTGCGCAT GCCTTGATCG ATCAGGATGC GATGTTGCGA GTCAAACGTC ACACCTGCGA 9800
GCTCGAGATT CAGGCTTGCG GTATTTGGCG TGCGGCCGGT GGCAACAAGC AATTGATCCA CCTTTAACTC GCCCTGCGCG GTCGTCAGGA TGAATTGGCG 9900
GTCGCAATAG GTCACGGCAC TCGCCTGCGT CTGTGTCAGC ACCGTGATGC CTTCCGCACG GAAAGCTGCA GTCACGGCCT CGCCAATGGC AGGGTCGTCA 10000
CTGGCAAACA AGGTGCGCCG GGCCAGGACC GTCACCTCGC TGCCCAAGCG CGCAAACGCT TGTGCCAGTT CCAGTGCTAC GACCGAGGCG CCAATTACTG 10100
CCAGCCGTTT AGGAATGGCT TCGCTAACCA GGGCTTCGGT GGACGTCCAA TATGGCGTTC CTGCAAGACC AGGCAACGGC GGGATGGCAG CGCTTGCACC 10200
AGTGGCAATG AGACAGCGAT CGAAGGCCAC TTCGCGAGTG TCACCTTCTG CCGCTTCCAC GATAAGCGTT TGGCCATCCT TGAATCGGGC GCTGCCTCGC 10300
ACCAGATTGA TCGACGGATT GCTTTCGAGG ATGCTTTCAT ATTTAGCATG ACGCAGCTCG TCCACACGGG CCTGCTGTTG CGCCAGCAGA CGGTCACGCA 10400
GGACAACTGG GGGCGTTGCC GACAGTCCCA CGTCAAAAGG ACTTTCGCGT CGCAGATGAG CGATATGCGC TGCCCGAATC ATGATCTTAG AAGGGACGCA 10500
GCCCACGTTG ACGCAGGTGC CGCCGATAGT TCCCCGCTCG ATCAGGGTGA CACGGGCACC GTTTTCAGCT GCCTTCAGCG CTGCTGCCAC AGCGGCGCCG 10600
CCAGTACCGA TGATCGCCAC ATGCAGCTGA CCTGCCGGTT TCTCCACCGC GCGCTCGCCG CTCAGCCACT GGGCGGCTTT GCCAAGGAGT CCTGGCTGTT 10700
CAACACTGCG CTCTTTGCTT TCAAATTCGT CTGTAGGCGT AGCACCGTAA CCAAGGGCGC CCACTGCTGC TACGAGCGCG GACACGTCAA GCCTATTGCC 10800
TGTGACTTGC GCCTTGTTCG ATGGATAATC GACAGCCGCT CGCGTTACGC CGGGGACCGC CAGCAGCGCG TCCTCAACGT GCCTTGTACA ACTTGGGCAA 10900
GTCATTCCCG CCACGCTCAA ACCAACTTCG TTTAACTCTT GCATTCCGCT CTCTCTTTTA TCCGTTGTGC CCAAGCCAAC TAGTACGACG CCTCGGTCAC 11000
TGAATCGGTT GTGCGCTTCA GGGAACAGGC TCCGTGGGTC GTTTACAGGC TGCTCACGCG ATGACGACCG GCAGAATCGC CTGAGCGGCA TCTCTGCCGT 11100
GGTTCTAGCG ATGCTGCCGT TATTTGTTGC CTTACTTCTT CAGCTCAGAT GGGTAGCCTA CATCCGTCGT CGCCTTGCTA AGCGCCTGGA CGTTGGTCTT 11200
CGCATCGTCA AACGTTACGA CGGCAAGCTT GGTTGAGAAG TCGACCTCGG TCTTGCTAAC ACCGTCGACC TTGCTCAAGG CCTTTTTAAT GGTGATAGGA 11300
CAAGTCGCGC AGGTCATGCC AGGAACGGCC AAGGTGACCG TCTGCGTGGC GGCCCACACC GGGGCAACAG CGGCGGCGAG GGCAAGGGAG GCAAACAGTT 11400
TCTTCATGAT GAACTCCTGG TTAATAGAAA AATGGAACGA CATAGGGAAA TCCAAGCGCG ACCAGGACCA GCACGGCCAC GATCCAGAAA ATCAGCTTGT 11500
AGGTGGCGCG CACCTGCGGA ATCGCGCAGA CCTCACCTGG CTTGCATGCC TGCACGGGCC GGTAAATCCG CTTCCAGGCG AAGAACAGCG CCACTAGCGC 11600
CGCGCCGATG AACAACGGTC GATAGGGTTC CAGCACCGTC AGGTTGCCGA TCCAAGCACC GGAGAAGCCC AGGGCGACCA GTACTAGCGG CCCCAGGCAG 11700
CAGGTCGATG CAAGAATGGC GGCCAGCCCG CCGGCGAAGA GCGCACCGCG CCCGTTTTGT GGTTCAGACA TACGTTGGCC CTTTTGAATT TGGATTGGAT 11800
AGCGTAACCT TACTTCCGTA CTCATGTACG GAGTCAAGCG ATATGGAAAA TAATTTGGAA AACCTGACCA TTGGCGTTTT TGCCAAGGCG GCCGGGGTCA 11900
ACGTGGAGAC AATCCGCTTC TATCAGCGCA AGGGCCTGTT GCCGGAACCG GACAAGCCTT ACGGCAGCAT CCGCCGCTAT GGGGAGGCGG ACGTGGTTCG 12000
GGTGAAATTC GTGAAATCGG CACAGCGGCT GGGGTTCAGT CTGGACGAGA TTGCCGAGCT GTTGCGGCTC GACGATGGCA CCCACTGCGA GGAGGCCAGC 12100
AGCCTGGCCG AGCACAAGCT CCAAGACGTG CGCGAGAAGA TGGCCGATTT GGCGCGCATG GAGGCTGTGC TGTCTGACCT GGTGTGCGCC TGCCATTCGC 12200
GGCAGGGGAA TGTTTCGTGT CCGCTGATTG CGTCGTTGCA AGGTGGAACG AGCTTGGCAG GGGCTTCCAC AGCTTAGCGT GCTTTATTTT CCGTTTTCTG 12300
AGACGACCAC AGATCAGAAA AGACTGTGCA GCCGACTCTT GATATTCCGT GCAGTCACCT TCTGAAAATG ACA
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
repeat t3 |
78-97 |
20 |
TCAGAAGTCA TCTGCACCAA |
r5 |
3875-3888 |
14 |
AATTTCTGTC ACAT |
r3 |
3931-3944 |
14 |
GGGTTAAGTG ACAA |
res |
3975-4004 |
30 |
CTGTCACATA ATCGAACGTA TATGTGACAG |
r2 |
3975-3988 |
14 |
CTGTCACATA ATCG |
r1 |
3991-4004 |
14 |
CGTATATGTG ACAG |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tniA |
Tn5058 |
144-1826 |
Transposase |
|
+ |
tniB |
Tn5058 |
1829-2737 |
Accessory Gene |
|
+ |
tniQ |
Tn5058 |
2734-3951 |
Accessory Gene |
Target Site Selection |
+ |
tniR |
Tn5058 |
4012-4626 |
Accessory Gene |
Resolvase |
+ |
merE |
Tn5058 |
5003-5239 |
Passenger Gene |
Heavy Metal Resistance |
- |
merD2 |
Tn5058 |
5236-5601 |
Passenger Gene |
Heavy Metal Resistance |
- |
merB2 |
Tn5058 |
5712-6350 |
Passenger Gene |
Heavy Metal Resistance |
- |
merR |
Tn5058 |
6419-6853 |
Passenger Gene |
Heavy Metal Resistance |
+ |
merD1 |
Tn5058 |
7486-7832 |
Passenger Gene |
Heavy Metal Resistance |
- |
merB1 |
Tn5058 |
7829-8467 |
Passenger Gene |
Heavy Metal Resistance |
- |
merG |
Tn5058 |
8681-9199 |
Passenger Gene |
Heavy Metal Resistance |
- |
merA |
Tn5058 |
9235-10944 |
Passenger Gene |
Heavy Metal Resistance |
- |
merP |
Tn5058 |
11132-11407 |
Passenger Gene |
Heavy Metal Resistance |
- |
merT |
Tn5058 |
11421-11771 |
Passenger Gene |
Heavy Metal Resistance |
- |
merR1 |
Tn5058 |
11843-12277 |
Passenger Gene |
Heavy Metal Resistance |
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniA |
TniA |
Tn5058 |
1683 |
144-1826 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | homologous to TnsB of Tn7 |
Protein Sequence:
|
MASDTSLIAE QGVATLPDAA WAQARQRAEI IGPLAALDVV GHEAADAAAH ALGLSRRQVY VLIRRARQGA GLVTDLARSR SGGGKGKGRL PESVERIIRE LLQKRFLTKQ KRSLAAFHRE VAQACKAQKL RAPARNTVAL RIAGLDPLKA TRRREGQDAS RSLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDERDRQPI GRPYLTIAID VFTRCVLGMV VTLEAPSSVS VGLCLVHVAC DKRPWLEGLK IEMEWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAIARTGV PTVITRTTAF LVDFLPIIRR TLTRTGFVID HIHYYADALK PWIARRDRLP AFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR EIVSTAQKAT RKARRDADRR QHLKATAVLF KTTPPPDADM ADPQADNQPP AKPFDQIEEW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB |
TniB |
Tn5058 |
909 |
1829-2737 |
+ |
Class: | Accessory Gene |
Transpoase Chemistry: | Serine |
Comment: | homologous to TnsC protein of Tn7 putative ATP-binding protein |
Protein Sequence:
|
VEEYPIIDLS HLMPVAQGLA RLPADERIHR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSSIAT LDMARYLLTR SEGTIGELAH LLMAAAVAAV ESGEEAINHR TLSMADYTGP SERRRQFERE LM
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniQ |
TniQ |
Tn5058 |
1218 |
2734-3951 |
+ |
Class: | Accessory Gene |
Sub Class: | Target Site Selection |
Function: | putative transposition gene |
Comment: | similar function to Tn7 tnsD? |
Protein Sequence:
|
MKPAPHWPLH PAPREGEALS SWLNRVALCY HMEVSELLEH DLGHGQVDDL DTAPPLALLA MLSQRSGIEP DRLRCMSFAG WVPWLLDSLD DQIPDALETY AFQLSVLLPK LRRRTRSITS WRAWLPSQPI HRACPLCLND PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW QALTTGHVEL PRRRIHAALW FRLLRTLLDE LNTPLSTCGT YAGYLRQIWE CCGHPLRAGQ SLWRPYETLN PAVRLQMLEA AATAISLIEV RDISPPGEHA KLFWSEPQTG FTSGLPAKAP KPEPVDHWQR AIQAIDEAII EARHNPETAR SLFALASYGR RDPASLEQLR ATFAKEGIPP EFLSHYEPSL PFACLRQNDG LSDKF
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniR |
TniR |
Tn5058 |
615 |
4012-4626 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Function: | putative resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Comment: | resolution of cointegrates || Protein: ACE81792.1 |
Protein Sequence:
|
MLIGYMRVSK ADGSQATDLQ RDALVAAGVD PAHLYEDQAS GKREDRPGLA SCLKALRPGD TLVVWKLDRL GRDLRNLINT VHDLTERGIG LKVLTGHVAA IDTTTAAGKL VFGIFAALAE FERELIAERT VAGLASAQTR GRKGGRPFKM TAAKLRLAMA AMGQSQTKVG DLCQELGITR QTLYRHISPK GELRPDGEKL LSRI
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merE |
MerE |
Tn5058 |
237 |
5003-5239 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | mercury transport |
Target: | Mercury |
Comment: | similar to urf-1 in pKLH2 (GenBank AF213017), pKLH272 (Genbank Y08992), pMER610 (GenBank Y08993), pKLH210 (GenBank Y10102), Tn5036 (Genbank Y09025), orf1 in Tn501 (GenBank Z00027), and urf-1 in Tn5041 (GenBank X98999) |
Protein Sequence:
|
MNSPERIPAE THKPFTGYLW GTLAVLTCPC HLSILAAVLA GTTAGAFLVE YWVITALGLT GLFLLSLARA LRAFRERS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD2 |
MerD2 |
Tn5058 |
366 |
5236-5601 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MNAYTVSRLA LDAGVSVHIV RDYLLRGLLR PVACTTGGYG LFDDAALQRL CFVRAAFEAG IGLGALARLC RALDAANCDE TAAQLAVLRQ FVERRREALA NLEVQLAAMP TAPAQHAESL P
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merB2 |
MerB2 |
Tn5058 |
639 |
5712-6350 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Transpoase Chemistry: | merB1 (Tn5058) Chemistry |
Target: | Mercury |
Comment: | organomercurial lyase |
Protein Sequence:
|
MKLAPYILER LTPTNRPKGF AEFLVALLGE LAKGRPISRT TLARTLGWPA EQVAAVLEQA TSTEYDNDGN IIGYVLTLRE TSHVLEIDNR RLYAWCALDT LMLPALIGRT ARVSSHCAAT GAPVSLTVSP SEIQAVEPAG MAVSLVLPQE AADVRQSFCC HVHFFASVPT AEDWASKHQG LEGLAIVSVH EAFGLGQEFN RHLLQTMSSR TP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR |
MerR |
Tn5058 |
435 |
6419-6853 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | putative regulatory gene mercury resistance |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGAAD VTRVRFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLQDVREKM ADLARMEAVL SDLVCACHSR QGNVSCPLIA SLQGGTSLAG ASTA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD1 |
MerD1 |
Tn5058 |
347 |
7486-7832 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MNAYSISKLA DDACVQCMSC ATT*YAACCT LLGEPKVART FSMTKHSEGC GFSAQPSSPE LGSTNWRDFA KHWTQTIAKA WTDASSACDG RSTCVEQF*L SSRPL*MKWR PVQLL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merB1 |
MerB1 |
Tn5058 |
639 |
7829-8467 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Transpoase Chemistry: | merB1 (Tn5058) Chemistry |
Target: | Mercury |
Comment: | organomercurial lyase |
Protein Sequence:
|
MDKTIYSKKI AESLSSGNHP KEFATLFVAL LRQLAMGGPV SREKLAGALG WSGARVATVL EQAPGTEYDD EANIIGLGLT LRETSHVFEV DGRHLYTWCA LDTLMFPALT GKIARVTSPC AATGRPITLT VAPEAVLHVE PAEAMVSLRT PDTSPDIRCS FCCHVHFFAS PSIANSWAST HQGIEVVPVE SAFDLGHDVA LKLLEDCEES PV
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merG |
MerG |
Tn5058 |
519 |
8681-9199 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MFCDIRLKLR RVRACLTIVL ASTATLGMPN QALAAYDFSK LGAAIETLSP ETVSDWEKKA RAGDAMAQNI MGLAYKCGMG VKQNHAASIQ WFAGRPSRVK PMHSSTSGAS TEVKSTVCIR MGVRLLPTMQ RLSSGTASLP SRDTRRLKSG WPSCLQRVLP TSRPTRCRHT SG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merA |
MerA |
Tn5058 |
1710 |
9235-10944 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MQELNEVGLS VAGMTCPSCT RHVEDALLAV PGVTRAAVDY PSNKAQVTGN RLDVSALVAA VGALGYGATP TDEFESKERS VEQPGLLGKA AQWLSGERAV EKPAGQLHVA IIGTGGAAVA AALKAAENGA RVTLIERGTI GGTCVNVGCV PSKIMIRAAH IAHLRRESPF DVGLSATPPV VLRDRLLAQQ QARVDELRHA KYESILESNP SINLVRGSAR FKDGQTLIVE AAEGDTREVA FDRCLIATGA SAAIPPLPGL AGTPYWTSTE ALVSEAIPKR LAVIGASVVA LELAQAFARL GSEVTVLARR TLFASDDPAI GEAVTAAFRA EGITVLTQTQ ASAVTYCDRQ FILTTAQGEL KVDQLLVATG RTPNTASLNL ELAGVTFDSQ HRILIDQGMR TSASNIYAAG DCTDQPQYVY VAAAAGTRAG INMTGGDVKL DLDAMPAVVF TDPQVATVGY SEAEAQRIGL QTESRTLTLD NVPRALANFD TRGFIKLVAE AGSGRILVVQ AVTPEAGEII QTAAIAIRAR MTVQDLANQL FPYLTMVEGL KLVAQTFSKD VTQLSCCAG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merP |
MerP |
Tn5058 |
276 |
11132-11407 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | periplasmic mercuric ion binding protein |
Protein Sequence:
|
MKKLFASLAL AAAVAPVWAA TQTVTLAVPG MTCATCPITI KKALSKVDGV SKTEVDFSTK LAVVTFDDAK TNVQALSKAT TDVGYPSELK K
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merT |
MerT |
Tn5058 |
351 |
11421-11771 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | cytosolic mercuric ion transport protein |
Target: | Mercury |
Protein Sequence:
|
MSEPQNGRGA LFAGGLAAIL ASTCCLGPLV LVALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAWKRIY RPVQACKPGE VCAIPQVRAT YKLIFWIVAV LVLVALGFPY VVPFFY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR1 |
MerR1 |
Tn5058 |
435 |
11843-12277 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGEAD VVRVKFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLQDVREKM ADLARMEAVL SDLVCACHSR QGNVSCPLIA SLQGGTSLAG ASTA
|
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
repeat t2 |
Tn5058 |
49-67 |
TACACGTCCG CTGAGGACT |
repeat t4 |
Tn5058 |
110-128 |
TCAATACTCG TGTGCACCA |
IRL |
Tn21 |
6849-6886 |
GAATCGCACG AAATAAAAGG CAAAAGACTC CACGGGGG |
repeat i4 |
Tn5053 |
6859-6877 |
AAATAAAAGG CAAAAGACT |
IR Tn21-like |
Tn5058 |
12273-12310 |
GAATCGCACG AAATAAAAGG CAAAAGACTC TGCTGGTG |
repeat i4 |
Tn5058 |
12283-12301 |
AAATAAAAGG CAAAAGACT |
repeat i1 |
Tn5058 |
12347-12365 |
GGCACGTCAG TGGAAGACT |
|
References |
|
|
Mindlin S, Minakhin L, Petrova M, Kholodii G, Minakhina S, Gorlenko Z, Nikiforov V. Present-day mercury resistance transposons are common in bacteria preserved in permafrost grounds since the Upper Pleistocene. Res Microbiol. 2005 Dec;156(10):994-1004. doi: 10.1016/j.resmic.2005.05.011. Epub 2005 Jul 18. PubMed ID: 16084067
| |
| | |
|
|