|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
|
|
|
|
|
|
|
|
|
|
|
Name: Tn50580 |
|
Family: Tn402 Group: Tn5053 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Uncultured bacterium | Molecular Source: | plasmid pTP6 |
Place of Origin: | Termirtau County, Kazakhstan | Date of Isolation: | 2006 |
| | Other Geographic Information: | Nura River sediment 1999-2000 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCGTTTTC AGAAGACGAC CGCACCATCT GACTGGATGT AACGCCTGGT GTGCATACGG CTCCTGACAG CCCAATATCA GGAGTCGTCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGTG AGCATGGCGT CAGACACATC ATTGATTGCC GAGCAAGGCG TGGCCACCCT GCCCGATGCG 200
GCTTGGGCGC AGGCCCGGCA ACGGGCGGAA ATCATCGGGC CGCTGGCAGC GCTTGATGTG GTCGGGCATG AAGCCGCCGA TGCCGCTGCT CACGCGCTTG 300
GCCTGTCCAG GCGGCAGGTG TATGTCCTAA TCCGCCGTGC CCGGCAAGGT GCTGGGCTTG TGACGGACCT GGCTCGCAGC CGATCCGGCG GCGGAAAAGG 400
CAAGGGACGC TTGCCGGAAT CAGTTGAGCG CATCATCCGC GAGTTGCTGC AAAAGCGCTT CCTGACCAAG CAGAAGCGTA GCCTGGCAGC GTTCCACCGC 500
GAGGTCGCGC AGGCTTGCAA AGCGCAAAAG CTGCGGGCGC CGGCGCGCAA CACCGTGGCT CTGCGGATCG CCGGCCTCGA TCCGCTCAAG GCCACTCGCC 600
GCCGGGAAGG TCAGGATGCG TCCCGCAGCC TGCAAGGTGT CGGTGGTGAG CCTCCCGCCG TGACCGCGCC ACTGGAACAA GTGCAGATTG ATCACACGGT 700
CATCGACCTG ATCGTGGTGG ACGAGCGCGA CCGGCAACCG ATTGGCCGTC CGTATCTGAC CATCGCCATC GACGTGTTTA CCCGCTGCGT GCTCGGCATG 800
GTCGTCACGC TGGAAGCGCC GTCATCTGTT TCGGTCGGCC TGTGCCTTGT GCATGTCGCC TGCGACAAGC GTCCCTGGCT GGAGGGTCTG AACATAGAAA 900
TGGAGTGGCC GATGAGCGGC AAGCCCAGGC TGCTCTACCT GGACAACGCG GCCGAGTTCA AGAGCGAAGC GCTACGCCGA GGCTGCGAGC AGCATGGCAT 1000
CCGGCTTGAC TATCGCCCGC TCGGGCAGCC GCACTACGGC GGCATCGTGG AACGGATCAT CGGCACGGCG ATGCAGATGA TCCACGACGA ATTGCCAGGG 1100
ACGACCTTCT CCAACCCTGA CCAGCGCGGC GACTACGATT CCGAAAACAA GGCCGCCCTG ACGCTGCGTG AGCTGGAGCG CTGGCTCACA TTGGCGGTCG 1200
GCACCTACCA CGGCTCCGTG CACAACGGCC TGCTCCAGCC GCCGGCAGCG CGCTGGGCCG AAGCTATCGC GCGGACCGGC GTGCCAACCG TCATCACTCG 1300
CACCACGGCT TTTCTGGTCG ATTTTCTGCC CATCATCCGC CGCACGCTGA CCCGCACCGG CTTCGTCATC GACCACATCC ATTACTACGC CGATGCGCTC 1400
AAGCCGTGGA TAGCTCGGCG CGACCGCTTG CCTGCGTTCC TGATCCGGCG CGACCCGCGC GACATCAGCC GCATTTGGGT GCTGGAGCCG GAGGGGCAGC 1500
ACTATCTGGA AATTCCATAC CGCACCTTGT CGCACCCGGC TGTCACCCTC TGGGAACAAC GACAGGCGCT GGCGAAATTG CGGCAGCAAG GGCGCGAACA 1600
GGTGGATGAG TCGGCGCTGT TTCGCATGAT CGGCCAGATG CGCGAAATCG TGTCCACCGC GCAGAAAGCT ACGCGCAAGG CGCGGCGCGA CGCGGATCGA 1700
CGCCAGCATC TCAAGGCAAC GGCAGTTCTT TTCAAAACCA CGCCACCACC GGACGCGGAC ATGGCTGACC CGCAGGCAGA CAACCAGCCA CCTGCCAAAC 1800
CGTTCGACCA GATTGAGGAG TGGTAGCCGT GGAAGAATAT CCCATCATCG ACTTGTCCCA CCTGATGCCG GTGGCCCAGG GCTTGGCCCG TCTTCCGGCG 1900
GACGAACGCA TCCATCGCCT TCGCGCTGAC CGCTGGATCG GCTATCCGCG AGCAGTCGAG GCGCTGAATC GGCTGGAAGC CCTGTATGCG TGGCCGAACA 2000
AACAACGCAT GCCCAACCTG CTGTTGGTCG GTCCAACCAA CAACGGCAAG TCGATGATCG TCGAGAAATT CCGCCGCACC CACCCGGCCA GCTCCGACGC 2100
CGACCAGGAG CACATTCCGG TACTGGTCGT GCAGATGCCA TCCGAACCGT CGGTAATCCG CTTCTACGTC GCGCTACTTG CCGCGATGGG CGCGCCATTG 2200
CGCCCGCGCC CACGGCTGCC GGAAATGGAG CAATTGGCGC TGGCACTGCT ACGCAAGGTC GGCGTGCGCA TGCTGGTGAT CGACGAATTG CACAACGTCC 2300
TGGCCGGCAA CAGCGTCAAC CGCCGGGAAT TCCTCAACCT GCTGCGTTTC CTCGGCAACG AGCTGCGCAT CCCGCTGGTC GGGGTCGGCA CACGCGATGC 2400
CTACTTGGCG ATCCGCTCGG ACGACCAGTT GGAAAACCGC TTCGAGCCGA TGATGCTGCC GGTGTGGGAG GCCAACGACG ATTGCTGCTC ACTGCTGGCC 2500
AGCTTCGCGG CTTCGCTCCC GCTGCGGCGA CCCTCGTCGA TTGCCACGCT GGATATGGCC CGCTACCTGC TCACGCGCAG CGAGGGCACC ATCGGCGAGC 2600
TGGCGCACCT GTTGATGGCG GCGGCCGTCG CTGCCGTGGA GAGCGGTGAG GAAGCGATCA ACCATCGCAC GCTCAGCATG GCCGATTACA CCGGTCCCAG 2700
CGAGCGGCGG CGGCAATTCG AGCGGGAACT GATGTGAAGC CAGCGCCACA CTGGCCACTG CATCCGGCTC CCAGGGAAGG CGAAGCCTTG TCTTCGTGGC 2800
TCAACCGCGT GGCCCTTTGC TATCACATGG AGGTGTCCGA GCTGCTGGAG CACGATCTTG GTCACGGCCA GGTTGATGAC CTGGACACCG CGCCACCACT 2900
GGCGCTGCTG GCGATGCTCT CCCAGCGGAG CGGCATCGAG CCGGACCGGC TGCGTTGCAT GAGTTTCGCC GGCTGGGTGC CTTGGCTACT GGACAGCCTT 3000
GATGATCAGA TTCCAGATGC ATTGGAAACC TATGCGTTCC AGCTCTCGGT GCTGCTGCCG AAACTCCGCC GTAGGACGCG ATCCATCACG AGCTGGCGTG 3100
CCTGGCTGCC CAGCCAGCCG ATACATCGCG CCTGCCCGCT CTGTCTGAAC GATCCGGAGA ACCAAGCCGT ACTGCTCGCG TGGAAGCTGC CCCTGATGCT 3200
GAGCTGCCCA CTGCATGGCT GCTGGCTGGA ATCCTATTGG GGCGTGCCAG GGCGGTTTCT CGGCTGGGAG AACGCCGACG CCGAACCGCG CACTGCCAGC 3300
GACGCGATTG CGGCGATGGA CCAGCGTACC TGGCAGGCAC TGACGACCGG CCACGTGGAG TTGCCGCGCC GACGCATCCA CGCCGGATTG TGGTTTAGGC 3400
TGCTACGCAC GCTGCTCGAT GAGCTGAACA CCCCGCTTTC GACGTGCGGC ACCTACGCGG GGTATCTCCG CCAAATCTGG GAATGCTGCG GGCATCCGCT 3500
GCGTGCTGGG CAAAGTCTGT GGCGACCGTA TGAAACCCTG AACCCGGCAG TACGGTTGCA GATGCTGGAG GCGGCGGCAA CGGCAATCAG CTTGATTGAG 3600
GTGAGGGATA TAAGCCCGCC AGGCGAGCAC GCAAAGCTAT TCTGGTCCGA GCCCCAAACC GGGTTCACCA GTGGCCTGCC GGCGAAAGCG CCGAAGCCCG 3700
AACCCGTCGA TCACTGGCAG CGTGCAATCC AGGCCATTGA TGAGGCCATC ATTGAAGCAC GACACAACCC CGAGACGGCA CGCTCGCTGT TCGCGTTGGC 3800
TTCCTATGGT CGGCGCGACC CCGCTTCCTT GGAACAGTTG CGCGCCACCT TCGCGAAGGA AGGCATCCCC CCGGAATTTC TGTCACATTA TGAGCCTAGC 3900
CTACCCTTTG CATGCCTTAG ACAGAATGAC GGGTTAAGTG ACAAATTTTG ACGTGCAGAG CTTCCCGATG CAAACTGTCA CATAATCGAA CGTATATGTG 4000
ACAGGTACAA GATGCTGATT GGCTACATGC GAGTATCGAA GGCGGATGGC TCCCAGGCGA CGGACTTGCA GCGCGATGCG CTGGTCGCGG CCGGTGTTGA 4100
TCCGGCGCAT CTCTATGAAG ACCAAGCGTC CGGAAAACGC GAGGATCGTC CCGGTCTGGC GAGCTGTCTG AAGGCACTAC GGCCAGGCGA CACGTTGGTC 4200
GTTTGGAAAC TGGATCGGCT CGGGCGCGAC CTGCGCCATC TGATCAATAC CGTCCATGAC CTGACCGGAC GCGGCATCGG CCTCAAGGTG CTGACCGGGC 4300
ACGGCGCGGC CATTGACACC ACGACCGCCG CCGGCAAGCT GGTCTTCGGT ATCTTCGCCG CGCTGGCCGA GTTCGAGCGC GAGTTGATCG CCGAGCGCAC 4400
CGTGGCGGGC CTGGCCTCAG CACGGGCACG CGGCCGGAAA GGCGGCCGGC CGTTCAAGAT GACCGCCGCC AAGCTGCGGC TGGCGATGGC GGCAATGGGT 4500
CAGCCAGAGA CCAAGGTCGG CGACCTGTGC CAGGAACTTG GCGTCACGCG GCAGACCCTG TATCGGCATG TTTCACCCAA GGGTGAGCTA CGTCCAGATG 4600
GCGAGAAGCT ACTCAGCCGA ATTTGATGCC GGCATGAGGC AACGTAGCGA CAGCGTGGTT TGTCTCAATG GGAAGCGCTC ATGATCGATC TTTGAAGGCC 4700
CGCAGCAGTC GTGTCACAGA CAGGACGAAC AAACCGGTCA GCGTGAGGGC TGCGATACCC CAGTACTCTC CGATGAACGC GCCGGCCGTC GTGCCGGCCA 4800
GCACAATGGC GAGAATCGGC AAATGGCAGG GACAGGTGAG CACGGCCAGC GCGCCCCACA GGTAGCCGGT GATCGGTTTG TGCGTCTCGG CCGGCATGCG 4900
CTCGGGGCTG TTCATGGCAA ACTCTCCGCA TGCTGTGCCG GCGCGGTCGG CATGGCGGCC AACTGCACTT CCAGATTGGC CAACGCTTCG CGCCGGCGTT 5000
CGACGAACTG ACGCAGCACA GCAAGCTGCG CGGCAGTTTC ATCGCAGTTC GCCGCATCCA GCGCCCGGCA CAGCCGCGCC AATGCGCCGA GGCCGATGCC 5100
CGCCTCGAAG GCGGCCCGCA CGAAGCACAG TCGCTGCAAG GCGGCGTCAT CGAACAGGCC GTAGCCACCC GTGGTGCAGG CGACTGGCCG CAGCAATCCG 5200
CGCAGCAGGT AGTCGCGCAC GATATGCACG CTCACCCCGG CATCAAGGGC CAGCCGGGAC ACCGTGTAGG CGTTCATCGA ACACCTCCTT TTGGTCGGTT 5300
CACGGCAATG CATATACCGT TTCGCCGAGT TCAATCCGCG CGCTGCGAAT CCGATGCCGG TGCCGTAGAA CATTGGGTCG ATATCCGATC ACGGTGTCCT 5400
AGATGACATG GTCTGCAACA GATGTCGATT AAACTCCTGG CCCAAGCCGA AAGCCTCGTG GACACTGACG ATCGCCAATC CTTCCAATCC TTGATGCTTG 5500
GAGGCCCAGT CTTCCGCCGT CGGGACAGAT GCAAAGAAAT GTACATGGCA ACAGAAGGAC TGACGAACGT CGGCTGCTTC CTGCGGCAAT ACCAAGGACA 5600
CCGCCATGCC GGCAGGTTCG ACAGCCTGTA TCTCGCTGGG TGAAACCGTA AGCGAAACGG GTGCTCCAGT TGCAGCGCAA TGCGACGAGA CGCGAGCTGT 5700
ACGGCCGATT AACGCCGGGA GCATCAGGGT GTCCAGCGCG CACCAGGCAT ACAGCCGGCG GTTGTCAATT TCCAAGACAT GCGACGTCTC GCGCAAGGTG 5800
AGGACATAGC CGATGATGTT CCCATCGTTG TCGTATTCGG TACTGGTGGC TTGTTCGAGC ACAGCCGCCA CTTGCTCAGC GGGCCAGCCA AGAGTCCTGG 5900
CAAGCGTAGT TCGCGAAATC GGGCGTCCCT TGGCAAGTTC ACCCAGTAGC GCGACCAAGA ACTCCGCAAA ACCTTTGGGC CGGTTGGTCG GGGTGAGACG 6000
TTCTAAAATA TATGGGGCGA GCTTCATGGT TCCATCTCCT ATTTGAATTG GATAGCGTAA CCTTACTTCC GTACTCATGT ACGGAGTCAA GCGGTATGGA 6100
AAATAATTTG GAGAACCTGA CCATTGGCGT TTTTGCCAAG GCAGCCGGCG TCAACGTGGA GACCATCCGG TTCTATCAGC GCAAGGGCTT GTTGCCCGAG 6200
CCGGACAAGC CCTATGGCAG CATTCGCCGC TATGGCGCGG CGGACGTGAC ACGGGTGCGA TTCGTGAAAT CGGCCCAGCG GTTGGGCTTC AGCCTGGATG 6300
AAATCGCCGA GTTGCTGCGG CTCGACGATG GCACCCACTG CGAGGAGGCC AGCAGCCTGG CCGAGCACAA GCTCCAAGAC GTGCGCGAGA AGATGGCCGA 6400
TTTGGCGCGC ATGGAGGCTG TGCTGTCTGA CCTGGTGTGC GCCTGCCATT CGCGGCAGGG GAATGTTTCG TGTCCGCTGA TTGCGTCGTT GCAAGGTGGA 6500
ACGAGCTTGG CAGGGGCTTC CACAGCTTAG CGTGCTTTAT TTTCCGTTTT CTGAGGTGCC CCCTACCAGC AACGCACCGC CGAAGACCGG CAGCAGGAGC 6600
GCGAGCAATT CCAGATGGTT AACCGGGGCC TGAAAGACCA GATTCAAGAC CTGCAAAACC GGCTCATCCA AGCCGAGTCA AGAGCGTCCG AGCTGATAGA 6700
CACTAATGCG CAATTACAAC GACAAACGCA TGAGCTCGAA CAGGCCAATG CGATGCACAA AAGCACCTTG GATAGGCAAG TGGCCGATAT CCAGAACTTG 6800
GAGCATCAAC TTAACGACGC CTCGGCAAAA AACCGGGAGC GGAAGCATGA AAGCAAACAA TTGGCGGATA ATATTGCCGC ACTCGCAAGC CAGAAAGCAG 6900
ATGCCGATAA GCAGGTGGCC GTACTATCCC AAGCATTAGA GGTCACCAAA ACGGATCTGA AAGCCTCTCA GAATAGAGCG GAAGTTCTGT CCGATGAGAA 7000
CAAGTTGATT CTTCAGGAAA AAGCGGTGAT TCAGGGGCAG TTTAAGCAGC TTCAGGAGTC ACTGTCGCGA TAAAGGCTTT AGTCAAACAT TCATCGAAAA 7100
AGCTATATTA ATCAATATAT TATATACGAA GTGGCTAATT CCAACGCTTT TACCGGCCGG ACCTCAAGTA GCTGCACCGG CCGCCATTTC ATTCAAAGAG 7200
GCCTCGACGA CAGTCAAAAC TGTTCGACGC ACGTCGATCT TCCATCGCAC GCGCTCGACG CATCGGTCCA GGCTTTCGCT ATCGTCTGCG TCCAATGCTT 7300
TGCAAAGTCG CGCCAGTTCG TCGAGCCCAA TTCCGGACTC GAAGGCTGCG CGGAGAAACC GCAGCCTTCC GAGTGTTTTG TCATCGAAAA TGTTCTAGCC 7400
ACTTTCGGTT CGCCGAGCAG GGTGCAACAA GCCGCGTATC ATGTAGTCGC GCACGACATG CACGCTGACG CAAGCATCAT CAGCCAGCTT CGATATCGAG 7500
TACGCATTCA TACTGGGCTT TCCTCGCAGT CCTCTAGCAG CTTAAGCGCG ACATCATGAC CAAGGTCGAA CGCACTCTCG ACAGGCACGA CCTCAATCCC 7600
TTGGTGGGTC GAAGCCCACG AATTTGCAAT AGACGGAGAC GCAAAGAAAT GCACATGACA GCAAAACGAA CAACGAATGT CGGGCGAAGT ATCCGGAGTT 7700
CGCAGTGAAA CCATTGCTTC TGCCGGTTCG ACATGAAGCA CTGCTTCTGG TGCAACGGTG AGAGTGATCG GCCTACCGGT GGCGGCGCAG CGGGAAGTGA 7800
CACGGGCAAT CTTGCCGGTC AAGGCGGGAA ACATCAAGGT GTCCAGCGCA CACCATGTGT ACAAATGACG GCCATCCACC TCAAAAACGT GAGAGGTTTC 7900
ACGTAGTGTC AGCCCCAGTC CAATGATATT GGCCTCGTCG TCATATTCCG TACCAGGTGC CTGCTCAAGT ACCGTTGCAA CTCTCGCTCC AGACCATCCG 8000
AGTGCGCCGG CGAGCTTTTC GCGTGATACA GGTCCCCCCA TCGCTAGCTG TCGGAGCAGC GCGACAAAGA GCGTTGCGAA CTCTTTAGGA TGATTGCCAG 8100
ATGAAAGGCT TTCGGCAATC TTTTTGGAAT AAATAGTCTT GTCCATTTTC CACCTGTTAT ATCGGGCAGC GATCAACCGG GATAAACCTT ACTCGGCGGT 8200
TACGTCCTGC GCTTAACAAA GACGAATTAC TTTGTGCCTT GCCGGCGTTG CCATTCCTCG GCCAGCAGCT GGGCTTGCTG TAACTGTTCG GGCTTCATTT 8300
GAGCTGCATA GTCTGCAACG ACTTTTGCTG CTGTCGGTTC TCCCGACGCT GCGGCCAGAC TCATCCACTG GTATGCCTGC ACCTGGTCGG GCGCGATGTC 8400
GGTAAGACCC TTTGCAAACA GCTGGGCCAG CCTGACTTGA GCCTGCGTGT GTCCCTGCTC GGCAGCGAGG CGGTACCACT TGAAAGCCTC TGCATCGTTG 8500
GCAGGAGCCG CACGCCCATT CTTATACATA CCGTCGACTT CACTTCCGTA GAGGCGCCCG AGGTTGAACT GTGCATCGGC TTCACCCTGC TCGGCCGCCC 8600
TGCGGAACCA CTGGATCGAT GCCGCGTGGT TTTGCTTTAC GCCCATACCG CATTTGTACG CAAGACCCAT GATGTTCTGA GCCATGGCAT CGCCAGCCCT 8700
GGCCTTCTTT TCCCAATCAC TGACGGTCTC TGGCGACAAT GTTTCAATCG CGGCGCCGAG CTTAGAAAAA TCATAAGCCG CCAACGCTTG GTTTGGCATG 8800
CCTAACGTTG CGGTACTTGC CAGTACGATC GTCAAGCACG CGCGTACACG GCGTAATTTA AGTCTGATGT CACAAAACAT AAAAACTCCT CTGTCCCTGT 8900
TTTGGTGCCC AAGGTTTAGC CGGCACAGCA CGATAGCTGT GTCACATCCT TCGAAAAAGT CTGTGCCACC AGTTTGAGAC CTTCAACCAT GGTGAGGTAA 9000
GGGAAAAGCT GGTTAGCCAG ATCCTGGACG GTCATGCGGG CCCGGATGGC AATCGCGGCA GTCTGGATGA TTTCGCCTGC TTCCGGTGTC ACAGCCTGGA 9100
CTCCGAGAAT GCGTCCGGAC CCTGCCTCAG CAACAAGCTT GATAAAGCCG CGGGTATCGA AGTTGGCGAG TGCACGCGGC ACATTGTCGA GCGTGAGCGT 9200
ACGGCTTTCG GTTTGCAGAC CAATGCGCTG GGCCTCTGCT TCACTGTAGC CTACCGTGGC AACTTGCGGA TCGGTGAACA CGACTGCCGG CATGGCGTCG 9300
AGATCGAGCT TTACGTCGCC GCCGGTCATA TTAATGCCAG CCCGGGTACC AGCCGCGGCT GCAACGTAGA CGTACTGAGG CTGGTCAGTG CAGTCGCCAG 9400
CTGCGTAAAT GTTCGAAGCA CTCGTGCGCA TGCCTTGATC GATCAGGATG CGATGTTGCG AGTCAAACGT CACACCTGCG CGCTCGAGAT TCAGGCTTGC 9500
GGTATTTGGC GTGCGGCCGG TGGCAACAAG CAATTGATCC ACCTTTAACT CGCCCTGCGC GGTCGTCAGG ATGAATTGGC GGTCGCAATA GGTCACGGCA 9600
CTCGCCTGCG TCTGTGTCAG CACCGTGATG CCTTCCGCAC GGAAAGCTGC AGTCACGGCC TCGCCAATGG CAGGGTCGTC ACTGGCAAAC AAGGTGCGCC 9700
GGGCCAGGAC CGTCACCTCG CTGCCCAAGC GCGCAAACGC TTGTGCCAGT TCCAGTGCTA CGACCGAGGC GCCAATTACT GCCAGCCGTT TAGGAATGGC 9800
TTCGCTAACC AGGGCTTCGG TGGACGTCCA ATATGGCGTT CCTGCAAGAC CAGGCAACGG CGGGATGGCA GCGCTTGCAC CAGTGGCAAT GAGACAGCGA 9900
TCGAAGGCCA CTTCGCGAGT GTCACCTTCT GCCGCTTCCA CGATAAGCGT TTGGCCATCC TTGAATCGGG CGCTGCCTCG CACCAGATTG ATCGACGGAT 10000
TGCTTTCGAG GATGCTTTCA TATTTAGCAT GACGCAGCTC GTCCACACGG GCCTGCTGTT GCGCCAGCAG ACGGTCACGC AGGACAACTG GGGGCGTTGC 10100
CGACAGTCCC ACGTCAAAAG GACTTTCGCG TCGCAGATGA GCGATATGCG CTGCCCGAAT CATGATCTTA GAAGGGACGC AGCCCACGTT GACGCAGGTG 10200
CCGCCGATAG TTCCCCGCTC GATCAGGGTG ACACGGGCAC CGTTTTCAGC TGCCTTCAGC GCTGCTGCCA CAGCGGCGCC GCCAGTACCG ATGATCGCCA 10300
CATGCAGCTG ACCTGCCGGT TTCTCCACCG CGCGCTCGCC GCTCAGCCAC TGGGCGGCTT TGCCAAGGAG TCCTGGCTGT TCAACACTGC GCTCTTTGCT 10400
TTCAAATTCG TCTGTAGGCG TAGCACCGTA ACCAAGGGCG CCCACTGCTG CTACGAGCGC GGACACGTCA AGCCTATTGC CTGTGACTTG CGCCTTGTTC 10500
GATGGATAAT CGACAGCCGC TCGCGTTACG CCGGGGACCG CCAGCAGCGC GTCCTCAACG TGCCTTGTAC AACTTGGGCA AGTCATTCCC GCCACGCTCA 10600
AACCAACTTC GTTTAACTCT TGCATTCCGC TCTCTCTTTT ATCCGTTGTG CCCAAGCCAA CTAGTACGAC GCCTCGGTCA CTGAATCGGT TGTGCGCTTC 10700
AGGGAACAGG CTCCGTGGGT CGTTTACAGG CTGCTCACGC GATGACGACC GGCAGAATCG CCTGAGCGGC ATCTCTGCCG TGGTTCTAGC GATGCTGCCG 10800
TTATTTGTTG CCTTACTTCT TCAGCTCAGA TGGGTAGCCT ACATCCGTCG TCGCCTTGCT AAGCGCCTGG ACGTTGGTCT TCGCATCGTC AAACGTTACG 10900
ACGGCAAGCT TGGTTGAGAA GTCGACCTCG GTCTTGCTAA CACCGTCGAC CTTGCTCAAG GCCTTTTTAA TGGTGATAGG ACAAGTCGCG CAGGTCATGC 11000
CAGGAACGGC CAAGGTGACC GTCTGCGTGG CGGCCCACAC CGGGGCAACA GCGGCGGCGA GGGCAAGGGA GGCAAACAGT TTCTTCATGA TGAACTCCTG 11100
GTTAATAGAA AAATGGAACG ACATAGGGAA ATCCAAGCGC GACCAGGACC AGCACGGCCA CGATCCAGAA AATCAGCTTG TAGGTGGCGC GCACCTGCGG 11200
AATCGCGCAG ACCTCACCTG GCTTGCATGC CTGCACGGGC CGGTAAATCC GCTTCCAGGC GAAGAACAGC GCCACTAGCG CCGCGCCGAT GAACAACGGT 11300
CGATAGGGTT CCAGCACCGT CAGGTTGCCG ATCCAAGCAC CGGAGAAGCC CAGGGCGACC AGTACTAGCG GCCCCAGGCA GCAGGTCGAT GCAAGAATGG 11400
CGGCCAGCCC GCCGGCGAAG AGCGCACCGC GCCCGTTTTG TGGTTCAGAC ATACGTTGGC CCTTTTGAAT TTGGATTGGA TAGCGTAACC TTACTTCCGT 11500
ACTCATGTAC GGAGTCAAGC GATATGGAAA ATAATTTGGA AAACCTGACC ATTGGCGTTT TTGCCAAGGC GGCCGGGGTC AACGTGGAGA CAATCCGCTT 11600
CTATCAGCGC AAGGGCCTGT TGCCGGAACC GGACAAGCCT TACGGCAGCA TCCGCCGCTA TGGGGAGGCG GACGTGGTTC GGGTGAAATT CGTGAAATCG 11700
GCACAGCGGC TGGGGTTCAG TCTGGACGAG ATTGCCGAGC TGTTGCGGCT CGACGATGGC ACCCACTGCG AGGAGGCCAG CAGCCTGGCC GAGCACAAGC 11800
TCAAGGACGT GCGCGAGAGG ATGGCTGACC TGGCGCGCAT GGAGGCCGTG CTGTCTGATT TGGTGTGCGC CTGCCATGCG CGGAAGGGGA ACGTTTCCTG 11900
CCCGCTGATT GCGTCACTGC AAGGGAAGAA AGAACCGCGC AGTGCGGACG CGGTGTAGCC CGAGGGAACT ACGCCTTAGC GTGCTTTATT TTCCGTTTTC 12000
TGAGGCGACT CCAACGTCAG AAAAGACCGT GCGGTCGACT TTTGATATTT CGTGCTGTCG CCTTCTGAAA GTGACA
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
r5 |
3875-3888 |
14 |
AATTTCTGTC ACAT |
r3 |
3931-3944 |
14 |
GGGTTAAGTG ACAA |
res |
3975-4004 |
30 |
CTGTCACATA ATCGAACGTA TATGTGACAG |
r2 |
3975-3988 |
14 |
CTGTCACATA ATCG |
r1 |
3991-4004 |
14 |
CGTATATGTG ACAG |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tniA |
Tn50580 |
144-1826 |
Transposase |
|
+ |
tniB |
Tn50580 |
1829-2737 |
Accessory Gene |
|
+ |
tniQ |
Tn50580 |
2734-3951 |
Accessory Gene |
Target Site Selection |
+ |
tniR |
Tn50580 |
4012-4626 |
Accessory Gene |
Resolvase |
+ |
merE; urf-1 |
Tn50580 |
4679-4915 |
Passenger Gene |
Heavy Metal Resistance |
- |
merD1 |
Tn50580 |
4912-5373 |
Passenger Gene |
Heavy Metal Resistance |
- |
merB2 |
Tn50580 |
5389-6027 |
Passenger Gene |
Heavy Metal Resistance |
- |
merR |
Tn50580 |
6096-6530 |
Passenger Gene |
Heavy Metal Resistance |
+ |
WP_048976769.1 |
Tn50580 |
6630-7073 |
Passenger Gene |
Hypothetical |
+ |
merD2 |
Tn50580 |
7164-7511 |
Passenger Gene |
Heavy Metal Resistance |
- |
merB1 |
Tn50580 |
7508-8146 |
Passenger Gene |
Heavy Metal Resistance |
- |
merG |
Tn50580 |
8227-8880 |
Passenger Gene |
Heavy Metal Resistance |
- |
merA |
Tn50580 |
8916-10625 |
Passenger Gene |
Heavy Metal Resistance |
- |
merP |
Tn50580 |
10813-11088 |
Passenger Gene |
Heavy Metal Resistance |
- |
merT |
Tn50580 |
11102-11452 |
Passenger Gene |
Heavy Metal Resistance |
- |
merR1 |
Tn50580 |
11524-11958 |
Passenger Gene |
Heavy Metal Resistance |
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniA |
TniA |
Tn50580 |
1683 |
144-1826 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | homologous to TnsB of Tn7 |
Protein Sequence:
|
MASDTSLIAE QGVATLPDAA WAQARQRAEI IGPLAALDVV GHEAADAAAH ALGLSRRQVY VLIRRARQGA GLVTDLARSR SGGGKGKGRL PESVERIIRE LLQKRFLTKQ KRSLAAFHRE VAQACKAQKL RAPARNTVAL RIAGLDPLKA TRRREGQDAS RSLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDERDRQPI GRPYLTIAID VFTRCVLGMV VTLEAPSSVS VGLCLVHVAC DKRPWLEGLN IEMEWPMSGK PRLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAIARTGV PTVITRTTAF LVDFLPIIRR TLTRTGFVID HIHYYADALK PWIARRDRLP AFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR EIVSTAQKAT RKARRDADRR QHLKATAVLF KTTPPPDADM ADPQADNQPP AKPFDQIEEW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB |
TniB |
Tn50580 |
909 |
1829-2737 |
+ |
Class: | Accessory Gene |
Transpoase Chemistry: | Serine |
Comment: | homologous to TnsC protein of Tn7 putative ATP-binding protein |
Protein Sequence:
|
MEEYPIIDLS HLMPVAQGLA RLPADERIHR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSSIAT LDMARYLLTR SEGTIGELAH LLMAAAVAAV ESGEEAINHR TLSMADYTGP SERRRQFERE LM
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniQ |
TniQ |
Tn50580 |
1218 |
2734-3951 |
+ |
Class: | Accessory Gene |
Sub Class: | Target Site Selection |
Function: | putative transposition gene |
Comment: | similar function to Tn7 tnsD? |
Protein Sequence:
|
MKPAPHWPLH PAPREGEALS SWLNRVALCY HMEVSELLEH DLGHGQVDDL DTAPPLALLA MLSQRSGIEP DRLRCMSFAG WVPWLLDSLD DQIPDALETY AFQLSVLLPK LRRRTRSITS WRAWLPSQPI HRACPLCLND PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW QALTTGHVEL PRRRIHAGLW FRLLRTLLDE LNTPLSTCGT YAGYLRQIWE CCGHPLRAGQ SLWRPYETLN PAVRLQMLEA AATAISLIEV RDISPPGEHA KLFWSEPQTG FTSGLPAKAP KPEPVDHWQR AIQAIDEAII EARHNPETAR SLFALASYGR RDPASLEQLR ATFAKEGIPP EFLSHYEPSL PFACLRQNDG LSDKF
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniR |
TniR |
Tn50580 |
615 |
4012-4626 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Function: | putative resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Comment: | resolution of cointegrates || Protein: ACE81792.1 |
Protein Sequence:
|
MLIGYMRVSK ADGSQATDLQ RDALVAAGVD PAHLYEDQAS GKREDRPGLA SCLKALRPGD TLVVWKLDRL GRDLRHLINT VHDLTGRGIG LKVLTGHGAA IDTTTAAGKL VFGIFAALAE FERELIAERT VAGLASARAR GRKGGRPFKM TAAKLRLAMA AMGQPETKVG DLCQELGVTR QTLYRHVSPK GELRPDGEKL LSRI
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merE; urf-1 |
MerE |
Tn50580 |
237 |
4679-4915 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | Broad-spectrum mercury transporter |
Protein Sequence:
|
MNSPERMPAE THKPITGYLW GALAVLTCPC HLPILAIVLA GTTAGAFIGE YWGIAALTLT GLFVLSVTRL LRAFKDRS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD1 |
MerD1 |
Tn50580 |
462 |
4912-5373 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MFYGTGIGFA ARGLNSAKRY MHCREPTKRR CSMNAYTVSR LALDAGVSVH IVRDYLLRGL LRPVACTTGG YGLFDDAALQ RLCFVRAAFE AGIGLGALAR LCRALDAANC DETAAQLAVL RQFVERRREA LANLEVQLAA MPTAPAQHAE SLP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merB2 |
MerB2 |
Tn50580 |
639 |
5389-6027 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Transpoase Chemistry: | merB1 (Tn5058) Chemistry |
Target: | Mercury |
Comment: | organomercurial lyase |
Protein Sequence:
|
MKLAPYILER LTPTNRPKGF AEFLVALLGE LAKGRPISRT TLARTLGWPA EQVAAVLEQA TSTEYDNDGN IIGYVLTLRE TSHVLEIDNR RLYAWCALDT LMLPALIGRT ARVSSHCAAT GAPVSLTVSP SEIQAVEPAG MAVSLVLPQE AADVRQSFCC HVHFFASVPT AEDWASKHQG LEGLAIVSVH EAFGLGQEFN RHLLQTMSSR TP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR |
MerR |
Tn50580 |
435 |
6096-6530 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | putative regulatory gene mercury resistance |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGAAD VTRVRFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLQDVREKM ADLARMEAVL SDLVCACHSR QGNVSCPLIA SLQGGTSLAG ASTA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
WP_048976769.1 |
WP_048976769.1 |
Tn50580 |
444 |
6630-7073 |
+ |
Class: | Passenger Gene |
Sub Class: | Hypothetical |
Protein Sequence:
|
MKDQIQDLQN RLIQAESRAS ELIDTNAQLQ RQTHELEQAN AMHKSTLDRQ VADIQNLEHQ LNDASAKNRE RKHESKQLAD NIAALASQKA DADKQVAVLS QALEVTKTDL KASQNRAEVL SDENKLILQE KAVIQGQFKQ LQESLSR
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merD2 |
MerD2 |
Tn50580 |
348 |
7164-7511 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MNAYSISKLA DDACVSVHVV RDYMIRGLLH PARRTESG*N IFDDKTLGRL RFLRAAFESG IGLDELARLC KALDADDSES LDRCVERVRW KIDVRRTVLT VVEASLNEMA AGAAT
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merB1 |
MerB1 |
Tn50580 |
639 |
7508-8146 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Transpoase Chemistry: | merB1 (Tn5058) Chemistry |
Target: | Mercury |
Comment: | organomercurial lyase |
Protein Sequence:
|
MDKTIYSKKI AESLSSGNHP KEFATLFVAL LRQLAMGGPV SREKLAGALG WSGARVATVL EQAPGTEYDD EANIIGLGLT LRETSHVFEV DGRHLYTWCA LDTLMFPALT GKIARVTSRC AATGRPITLT VAPEAVLHVE PAEAMVSLRT PDTSPDIRCS FCCHVHFFAS PSIANSWAST HQGIEVVPVE SAFDLGHDVA LKLLEDCEES PV
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merG |
MerG |
Tn50580 |
654 |
8227-8880 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MFCDIRLKLR RVRACLTIVL ASTATLGMPN QALAAYDFSK LGAAIETLSP ETVSDWEKKA RAGDAMAQNI MGLAYKCGMG VKQNHAASIQ WFRRAAEQGE ADAQFNLGRL YGSEVDGMYK NGRAAPANDA EAFKWYRLAA EQGHTQAQVR LAQLFAKGLT DIAPDQVQAY QWMSLAAASG EPTAAKVVAD YAAQMKPEQL QQAQLLAEEW QRRQGTK
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merA |
MerA |
Tn50580 |
1710 |
8916-10625 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Protein Sequence:
|
MQELNEVGLS VAGMTCPSCT RHVEDALLAV PGVTRAAVDY PSNKAQVTGN RLDVSALVAA VGALGYGATP TDEFESKERS VEQPGLLGKA AQWLSGERAV EKPAGQLHVA IIGTGGAAVA AALKAAENGA RVTLIERGTI GGTCVNVGCV PSKIMIRAAH IAHLRRESPF DVGLSATPPV VLRDRLLAQQ QARVDELRHA KYESILESNP SINLVRGSAR FKDGQTLIVE AAEGDTREVA FDRCLIATGA SAAIPPLPGL AGTPYWTSTE ALVSEAIPKR LAVIGASVVA LELAQAFARL GSEVTVLARR TLFASDDPAI GEAVTAAFRA EGITVLTQTQ ASAVTYCDRQ FILTTAQGEL KVDQLLVATG RTPNTASLNL ERAGVTFDSQ HRILIDQGMR TSASNIYAAG DCTDQPQYVY VAAAAGTRAG INMTGGDVKL DLDAMPAVVF TDPQVATVGY SEAEAQRIGL QTESRTLTLD NVPRALANFD TRGFIKLVAE AGSGRILGVQ AVTPEAGEII QTAAIAIRAR MTVQDLANQL FPYLTMVEGL KLVAQTFSKD VTQLSCCAG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merP |
MerP |
Tn50580 |
276 |
10813-11088 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Mercury |
Comment: | periplasmic mercuric ion binding protein |
Protein Sequence:
|
MKKLFASLAL AAAVAPVWAA TQTVTLAVPG MTCATCPITI KKALSKVDGV SKTEVDFSTK LAVVTFDDAK TNVQALSKAT TDVGYPSELK K
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merT |
MerT |
Tn50580 |
351 |
11102-11452 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | cytosolic mercuric ion transport protein |
Target: | Mercury |
Protein Sequence:
|
MSEPQNGRGA LFAGGLAAIL ASTCCLGPLV LVALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAWKRIY RPVQACKPGE VCAIPQVRAT YKLIFWIVAV LVLVALGFPY VVPFFY
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
merR1 |
MerR1 |
Tn50580 |
435 |
11524-11958 |
+ |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Function: | activator |
Target: | Mercury |
Protein Sequence:
|
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLPEPDKPY GSIRRYGEAD VVRVKFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLKDVRERM ADLARMEAVL SDLVCACHAR KGNVSCPLIA SLQGKKEPRS ADAV
|
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
repeat t1 |
Tn50580 |
9-27 |
TCAGAAGACG ACCGCACCA |
repeat t2 |
Tn50580 |
49-67 |
CACACGTATG CCGAGGACT |
repeat t3 |
Tn50580 |
78-96 |
TCAGGAGTCG TCTGCACCA |
repeat t4 |
Tn50580 |
110-128 |
TCAATACTCG TGTGCACCA |
IRL |
Tn21 |
6526-6563 |
GAATCGCACG AAATAAAAGG CAAAAGACTC CACGGGGG |
repeat i4 |
Tn50580 |
6536-6554 |
AAATAAAAGG CAAAAGACT |
IR Tn21-like |
Tn5053 |
11975-12012 |
GAATCGCACG AAATAAAAGG CAAAAGACTC CGCTGAGG |
repeat i4 |
Tn50580 |
11985-12003 |
AAATAAAAGG CAAAAGACT |
repeat i2 |
Tn50580 |
12027-12045 |
CCGTGCGGTC GACTTTTGA |
IRi |
Tn50580 |
12049-12076 |
AAGCACGACA GCGGAAGACT TTCACTGT |
|
References |
|
|
Smalla K, Haines AS, Jones K, Krögerrecklenfort E, Heuer H, Schloter M, Thomas CM. Increased abundance of IncP-1beta plasmids and mercury resistance genes in mercury-polluted river sediments: first discovery of IncP-1beta plasmids with a complex mer transposon as the sole accessory element. Appl Environ Microbiol. 2006 Nov;72(11):7253-9. doi: 10.1128/AEM.00922-06. Epub 2006 Sep 15. PubMed ID: 16980416
| |
| | |
|
|