|
|
|
|
Name: Tn5045 |
|
Family: Tn3 Group: Tn21 |
|
Evidence of Transposition: yes |
|
|
Host |
|
|
Host Organism: | Pseudomonas sp. Tik3 | Molecular Source: | plasmid |
Place of Origin: | Siberia, Russia | Date of Isolation: | 2011 |
| | Other Geographic Information: | 15,000-40,000-year-old permafrost |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 37 bp) | | GGGGAGCCCGCAGAATTCGGAAAAAATCGTACGCTAA |
IRR (Length: 37 bp) | | GGGGAGCCCGCAGAATTCGGAAAAAATCGTACGCTAA |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGAGCCCG CAGAATTCGG AAAAAATCGT ACGCTAACAG CTCAAGTGTC CCGACCACGC CCGACAGTAG CTTGTTGGCT GGTCAATCTG GTTCACGGGT 100
AAAGCGATAG AACAGGTTTG GCTCACTGAC CACGTACAAA TACCCTTCAT CATCGATGGT TACGCCTTCG GCCTGGGGTA TGCCTTTCAG CAAACCAGCA 200
AAACCCCTCG CCAAGGAACG GAAACTCACC ACCTTGCCCT CATCGGTCAT TTCAATCAGC AGTTTCGACT CGTCGCTGAG CAGTATGAGA TGGCCACTCT 300
GTTGGTCGAA GACGACCGAA GACAAGTCAG TGGCAAATAC CTTGTCCTTT ACCAAGTTGG ACAGGTCGCG CACGTGCAGG GAAAAACCTC CTGCCAGGCT 400
GGCACGAAGG CCGCCCACTT CCAGCAATTG GCGAGGGTCA CGCTCCTTGG TCACAAACAA GCGATCACCT TTCAGGTCGT AGGCGAGCCC TTCAAGGCCT 500
TTGTTGTCCT CCTTGCCAAG CGCCAGGGTC AGAGCTGGAT ACTGGTCCCG GCTCAATGAA CGATCGGGAG AAAGCTTACC GTCTTCGGCG ATGGGGACAT 600
CCACAATAAC CAGGCTCTGC CGGCGCTCCT CGGCGATTAC CAGCTGGCCG TTGCCGGCAT AGGAGACCGC CTCCACATCG TGGAAGCCAT CCAGGTTGTA 700
ACGCCTTTCC ACATCACCGT CACGACTAAG GGCCAATAGT TCGTTTGGGC CGTTGGTGAC TGCCCACAGC AGGTTTAGGT CAGGGTCAAA GGTCAGACCC 800
GAAAGATTGT TGTCCACACC GGGGACCGCC TTAGCATCCA GTTCAACCCG ATAGCCAGGC AGCCACACAG AGCGCTCCTG CCAATCGTCG GTGTGCCAGC 900
TGGTCTTTAT CCAGAAGTAC AAACGATCAT CAAGGTGATG GGTGCGGACT TGGAATACGG TGAGCAACAC AAGGCACAGC AGTGCCCACA TCCAAGCGCT 1000
TGCTTTCCGT GTTCGGCTCA GCCAGTTTTT AGCCATCAAA TACATTAGGA GCCTCGTTAT TTGGGCTGAC GCCACGAGTC GTGACGATTT CAATCCGACT 1100
CGGTGCGACT CGCACAGGTG ATGCACAATG GCGTGGCCGG ATCGAACTCC AGACGGCCCG AGGCGATCAA CTCGCCGCAT TGGTTACACC AGCCATCATT 1200
GCTTGCTGCT GGAGTGCATC AATTCTCGAC AGACGCCCTA CCTTGCTTTG ATCCAGCTCT ACCGATTGCG AGCGAGACTC AGCGTCTTCC AGCAATTGAT 1300
CCAGCTCGGC AGCCCGCTGT TCCAGCAGGG TCTTGAAATG GGCAAGATCC AGGGCGTCGT CCATGGACAT TAACGCAGCA GTAACAACGG ACTGGTGGTG 1400
GTGCGGAGCA TGCTGGTCGT GGTGCTACCG ACCAGGAACT GCCGGATACG TGAATGGCCA TAGGCCCCCA TCACCAGCAG ATCAATGCCA TGCTCTTTCT 1500
GATAGGCGTG GAGGGTAGGC TCTATCTCGC CGTTCAGGGC CTCGGCGCGA ACAGTGAATC CGGCGTTGAG CAGCACTTTC TGCGCCCAGT CCAGCTGCGC 1600
CGACGATTCG TCGCTCACGG GCCCAACCAT GACCAGGTGG ATCGGCAGCC CCTTCAGCAG GGGGCTGGCC GCCAGCATCT CCACACCCTT GCGGGTAGTA 1700
GCGCCGCCAT CGAAGGCGAG CATCGCGCTC TCAGGCTTTT GGAAGTTGGC CGGGGTGACC AGAATCGGTC GGTGCATGAT ACGGATCACG CTCTCCAGCT 1800
GGCTTCCGAC ATGCTGACTC AGACCACCGC TAGATTCGCC CTGGCGACCG ATGACCAGCA GGCGCGTTTC GGTTTGCAGC TCTTGCAGGC TTTCCAGCAG 1900
ATCGCCATGA CGTTGCTTGG ACTCCGGCGC GGCCACGCCA TCCTTAATGG CCCGCTCTTT TGCGGCCGCA AGCATGATCC GCCCTTGTTC AAGGGCCAAC 2000
TTGCCACGCT GTTCATCCAG GGAAGCAAGC TCATCGAGCA GATGCTCGCG GCTGCCAAGG CCGATATTGC CACTCAGATC GGCCGTAACC GGGTACTGGC 2100
GCTGATCCAG CACATGCAGG AAGGTCAGCG GGGCTTCCAG GCTCAGACTG GCCCAGGCCG CGTAGTCGCA CACGGCTGGA GCCGAGGCGG AAGCGTCTAT 2200
ACAGGCAATT ACTTGGGTCA TTGTTGTTCT CCTTCTTAGT GGCCCATGAG TTGATCAATG GCGTCGGGTT TATCGTGAAC ACCGAAGCGA TCCACGATAG 2300
TGGCGCTCGC TTCGTTGAGG CCCAGCACTT CAACTTCGGT GCCTTCGCGG CGGAATTTGA TGACCACTTT GTCCAAAGCG GCAACGGCGG TGATATCCCA 2400
GAAGTGAGCA CGGTTCAGGT CGATGGTTAC CTTGTTTAGG GCTTCTTTGA AGTCGAAGGC CGCGACGAAC TTGTCTGCCG AGCTGAAGAA CACCTGGCCG 2500
GTGACGTTAT AGCTACGATG CTCGCCGGCT TCGTCCAGCA AAGAGCTGAT CGCCATGTAA TGGCCAACCT TGTTGGCGAA GAACATCGCG GCCAGCAGTA 2600
CGCCGGCCAA CACGCCGAAG GCAAGGTTGT GGGTGGCGAC CACGACCACC ACGGTGACGA CCATGACAAT GTTGGTCGAC AACGGGTGCT TCTTCAGGTT 2700
GCGCAGCGAA TCCCAACTGA AGGTGCCGAT GGACACCATG ATCATCACTG CCACCAGCGC AGCCATCGGG ATCTGCTTCA GCCAGTCGCC GAGGAATACC 2800
ACCATCAGTA GCAGGAATAC GCCTGCGGCC AGGGAGGACA GACGAGAACG ACCGCCGGAT TTAACGTTGA TGATCGACTG ACCAATCATC GCGCAACCTG 2900
CCATACCGCC GATTAGGCCC GAAGCAATGT TGGCCACGCC TTGACCCTTG CACTCGCGGT TCTTGTCACT GGAGGTGTCG GTCAGGTCGT CGACAATGGT 3000
CGCGGTCATC ATCGATTCCA ACAGACCGAC CACAGCCAGT GCTGCCGAAT AAGGGAAGAT GATGGCCAGC GTCTCGAATG TCAGCGGCAC GTCAGGCCAG 3100
AGGAAGATCG GTAGCGTATC CGGCAGTTCA CCCATATCAC CGACCGTGCG GATATCCAGC CCAACCGACA TGGCGACGGC GGTCAGCACG ATGATGCACA 3200
CCAGCGGCGA TGGGATGAGC TTGCCGATCT TGGGGACATA GGGGAACAGA TAGATGATGC CGAGGCCTGC GGCTGTCATG GCGTAGACGT GCCAGGTGAC 3300
ATTGGTCAGC TCGGGCAGCT GAGCCATGAA AATCAGGATC GCCAGTGCAT TGACGAAACC GGTCACCACC GAGCGCGAGA CGAAGCGCAT CAGCGATCCG 3400
AGCTTCAGGT AGCCAGCAGC GATTTGTAGC ACGCCACATA GCAGCGTGGC GGCCAGCAGA TATTCAAGAC CATGGTTCTT GACCAGGGTC ACCATCAGCA 3500
GTGCCATTGC ACCGGTTGCG GCCGAGATCA TTCCGGGGCG ACCACCGACA AAGGCGATAA CCACGGCGAT ACAGAAAGAG GCGTACAGGC CGACCTTGGG 3600
GTCGACGCCG GCAATGATCG AGAAGGCGAT GGCTTCAGGG ATTAGGGCCA GTGCGACCAC AATACCGGCG AGGATGTCGC CACGGATGTT GGATAACCAG 3700
GTTTGTTTTA ACGAGTGGAG CATCAGAATT CCCAAGGCAA TGGATCGCGC AGAACAGCAT GGCGCAGGCC ATGTGCAGCT GGTCGATACG AGGTCAAATT 3800
GTCGGATGTA GAGGAGCTGT GGCGGGTGTT AAAACCTAAA GCACAGCAGA ACGCGACCGC TGGCAGAGCA AGCAGTCACA GATGATGCGG GGGGCGAAGC 3900
GCTATGGCGG CGTAGGCAGC CAGATCATGT TTTGCAGGGT AAGCATGAGG TCTTATCGAG TGTCTTTAGT AACTCAATGG CCGCAACAGT CTACAGGAAG 4000
AGGGCAATTT CTGCGAGTCA CCCCCGGACT AGGGCGTCCT GTTTGGATGT CAGCCTGAGC TATACCCTAA CTGGATGTCA GGCAAGGCCG CACCGCGCCT 4100
GTCATTTTCA GAAGACGACT GCACCAGTTG ATTGGGCGTA ATGGCTGTTG TGCAGCCAGC TCCTGACAGT TCAATATCAG AAGTGATCTG CACCAATCTC 4200
GACTATGCTC AATACTCGTG TGCACCAAAG CGAGGTGAGC ATGGCGACGG ACACCCCACG GATTCCAGAA CAAGGCGTGG CCACTCTGCC TGATGAGGCT 4300
TGGGAGCGTG CGCGCCGTCG TGCGGAGATC ATCAGTCCGT TGGCGCAGTC GGAGACGGTC GGGCACGAAG CGGCCGATAT GGCGGCTCAG GCGCTGGGCT 4400
TGTCTCGGCG CCAGGTATAC GTTCTGATCC GGCGTGCCCG GCAAGGCAGC GGCCTCGTGA CGGATCTGGT GCCCGGCCAG TCCGGTGGAG GTAAAGGTAA 4500
GGGGCGCTTG CCGGAACCGG TCGAGCGCGT CATCCACGAG CTACTGCAAA AGCGGTTCCT GACCAAGCAG AAGCGCAGCC TAGCGGCCTT TCACCGCGAA 4600
GTCACTCAGG TGTGCAAGGC TCAAAAACTG CGAGTGCCGG CGCGCAATAC CGTGGCCTTA CGGATCGCTA GCCTTGACCC GCGCAAGGTC ATCCGCCGGC 4700
GGGAAGGCCA GGATGCCGCT CGTGACCTAC AAGGTGTGGG CGGCGAGCCT CCTGCCGTGA CCGCGCCGCT GGAGCAGGTG CAGATAGACC ATACGGTCAT 4800
CGACCTGATC GTGGTCGATG ACCGCGACCG GCAACCTATT GGCCGCCCGT ACCTGACCCT CGCCATCGAC GTGTTCACCC GCTGCGTGCT CGGCATGGTC 4900
GTCACGCTGG AAGCGCCGTC TGCCGTTTCG GTTGGCCTGT GCCTCGTGCA TGTCGCCTGC GACAAGCGCC CTTGGCTGGA AGGACTGAAC GTGGAAATGG 5000
ATTGGCAGAT GAGCGGCAAG CCCTTGCTGC TCTACCTAGA CAACGCGGCC GAGTTCAAGA GCGAGGCCCT GCGCCGGGGT TGCGAGCAGC ATGGCATCCG 5100
GCTGGACTAT CGCCCGCTGG GACAGCCGCA CTATGGCGGC ATCGTGGAAC GGATCATCGG CACGGCGATG CAGATGATTC ACGACGAACT GCCGGGAACG 5200
ACCTTCTCCA ACCCTGACCA GCGCGGCGAC TACGATTCCG AAAACAAGGC CGCCCTGACG CTGCGCGAGC TAGAGCGCTG GCTCACATTG GCGGTCGGCA 5300
CCTACCACGG TTCGGTGCAC AACGGCCTGC TCCAACCGCC GGCCGCGCGC TGGGCCGAGG CCGTGGCGCG TGTCGGCGTA CCGGCCGTCG TCACACGCGC 5400
TACTTCGTTC CTGGTCGATT TTCTGCCGAT CCTCCGGCGC ACGCTGACCC GCACCGGCTT TGTCATCGAC CACATCCACT ACTACGCCGA TGCGCTCAAG 5500
CCGTGGATTG CGCGGCGTGA ACGCTGGCCG TCCTTTCTGA TCCGGCGCGA TCCGCGCGAC ATCAGCCGTA TCTGGGTCCT GGAACCGGAG GGACAGCATT 5600
ACCTGGAAAT TCCCTACCGT ACCTTGTCGC ATCCGGCTGT CACCCTCTGG GAACAACGGC AGGCGCTGGC GAAACTGCGG CAGCAAGGGC GCGAACAGGT 5700
GGATGAGTCG GCGCTGTTCC GCATGATCGG CCAGATGCGT GAGATTGTGA CCAGCGCGCA GAAGGCCACA CGCAAGGCGC GGCGTGACGC GGATCGCCGC 5800
CAGCACCTCA AGACATCAGC TCGGCCGGAC AAGCCCGTTC CGCCGGATAC GGATATTGCC GACCCGCAGG CAGACAACTT GCCACCCGCC AAACCGTTCG 5900
ACCAGATTGA GGAGTGGTAG CCGTGGACGA ATATCCCATC ATCGACCTGT CCCACCTGCT GCCGGCGGCC CAGGGCTTGG CCCGTCTTCC GGGGGTCGTC 6000
TCAGAAAACG GACCGCAAAG TACGCTAAGG CGGCTGCAGC GGTCGTAGGG GCCTGAATTT GCCAGCTCCG ATCTTGGCGC TGCTGCGCCA GAGGTAATCG 6100
CCGGTCAGGT TGATGTGCTC CCAGCCCAGC GGCGACAGGT ACTGCAACAG CGCATCATCG ATGCCGAGAC CGTTTCCACG CAGCGCATGC GCTGCGCGCT 6200
CCAGGTAGAC CGTGTTCCAT AACACGACAG CCGCCGTCAC CAAGTTTAGG CCGCTGGCCC GGTAGCGCTG CTGCTCGAAA CTGCGGTCGC GAATTTCACC 6300
CAGCCGGTTG AAGAACACGG CGCGGGCCAA CGCGTTCCGC GCTTCCCCCT TGTTGAGTCC GGCGTGGACA CGGCGGCGCA GCTCCACGCT TTGCAGCCAG 6400
TCGAGGATGA ACAGCGTGCG CTCGATGCGT CCCAGCTCAC GCAGGGCGAC GGCCAGGCCG TTCTGGCGTG GATAGCTGCC GAGCTTGCGC AGCATTAGTG 6500
AGGCCGTCAC CGTGCCATGT TTGATCGAGG TAGCCAGCCG CAGGATTTCA TCCCAATGGG CACGGACATG CTTGATGTTG AGCGTGCCGC CGACCATCGG 6600
CTTGAGCGCC TCATAGGCGG TATCGCCCTT AGGGAGGTAG AGCTTAGTGT CGCCCAGGTC GCGGATGCGC GGCGCGAAGC GGAAGCCCAG CAGGTGCATG 6700
AGCGCGAAGA CGTGATCGGT AAAGCCCGCC GTGTCGGTGT AGTGCTCCTC GATCCGCAGG TCGGATTCGT GGTAAAGGAG GCCGTCGAGC ACGTAGGTCG 6800
AGTCGCGCAC GCCGACGTTG ACCACCTTGG CGTGGAACGG CGCGTACTGG TCAGAGATGT GGGTGTAGAA AGTCCGTCCT GGGCTGCTGC CATACTTCGG 6900
GTTGATATGG CCGGTGCTTT TGGCCTTGCT GCCTGTTCGG AAGTTCTGAC CGTCCGACGA TGACGTGGTG CCGTCGCCCC AGTGACAGGC GAAGGGATGC 7000
CGGAACTGGG TGTTGACCAG TTCTGCCAGC GCCGTTGAAT AGGTCTCGTC GCGGATGTGC CAGGCTTGCA GCCAGGCCAG CTTGGCGTAG GTCGTGCCGG 7100
GGCAGGACTC GGCCATCTTG CTCAAGCCCA GGTTGATCGC ATCGGCCAGG ATCGTGGTCA GCAGCAGGTT CTTGTCCTTG GCCAGGTCGC CGGATTTCAA 7200
GTGCGTGAAG TACCGGGTGA AGCCCGTCCA CTCATCGACT TCGAGCAGTA GTTCGGTGAT CTTGACGTGC GGCAGGATCA TTGCTGTCTG GTCGATCAGC 7300
GCCTGCGCGG TGTCGGGCAC CGCCGCGTCC AGCGGCGTGA TCTTCAGGCC CGACTCGGTG ATGATGGCAT CCGGCAAGTC GTTGGCCGCC GCCATGCGGT 7400
TGACGGTGGC AAGCTGCGCT TCTAGCAGTG TCAGCCGCTC GTACAGGTAC TGGTCGCAGT CGGTGGCCAC GTCCAGCGGC AATTCGCTGC CCTGCTTGAG 7500
GCTGGCGAAC TTCTCGTGCG GCACCAGGTA ATCCTCGAAA TCCTTGAACT GACGCGAGCC CTGCACCCAG ATGTCGCCCG AGCGCAGCGA GTTCTTGAGT 7600
TCCGACAGCG CGCACAGTTC ATAGTAGCGC CGGTCGATGC CGGCGTCGGT CATCACCAGC TTCTGCCAAC GAGGCTTGAT GAAGTCGGTC GGGGCATCAG 7700
TGGGCACCTT GCGGACGTTG TCAGTGTTCA TGCCGCGCAG TACGTCGACG GCGTCGAGCA CGCCCTTGGC GGCGGGCGCG GCCCGCAGCT TGAGCACGGC 7800
CAGGAATTCC GGCGCGTAGC GGCGCAAGGT AGCGTAGCTC TCGCCGATGC GGTGCAGGAA ATCGAAGTCA TCGGGCTGCG CGAGCTTCTG CGCCTCGGTG 7900
ACGCTCTGGG CGAAGGCATC CCAGGACATG ACGGCCTCGA TGGCGGCGAA CGGGTTGCCG CCCGACTGCT TGGCGTCGAT CAGCGCCTGG CCGATGCGTC 8000
CGTACAGTCG CACCTTGGCG TTGATCGCTT TGCCGGACGC CTGGAACTGT TGCTGATGCT TGTGTTTGGC GGCGTTGAAC AGCTTGCCCA GAATGCGGTC 8100
GTGCAGGTCG ATGATTTCGT CGATGACGGT GGCCATACCC TCGATGGCCA GCGCGACGAG GGTGGCGTAG CGCCGTTGCG GCTCGAACTT GGCCAGATCA 8200
GCGGGCGTCA TCTGGCCACC CTCGCGGGCG ATCTTGAGCA GCCGGTTCTG GTGAAGCAGC CGTTCGATGC CAGTAGGCAG GTCGAGCGCT TGCCAGGCTT 8300
TCAGGCGCTC GATGTGTTCG AGCATGTGGC GCGAGTTCGG CTTGACGGGC GACTGGCGCA GCCAGGCCAG CCAGGTCGTC TTGCCGAACT CGCGGCGCTT 8400
GAGCAGATCA TCGAGGCGAT GCCGGTGGGG GTCTGTCAGC GGCTCGGACA GCGCTTCGTA GATGCGCCGG TTGGCGCGAG TAATGGCCTC GGCGCAGATG 8500
CGTTCAATGA CGCCCGGTGA CGGGAGCAGC ACGCTCTTTT GTCGCAGCCC TTCGATCAAC TCCGTCGCCA ACACGATGCC TTTGTCGGTC TGCCAAGCCA 8600
GTTCATCCAG GCTGTGCACG CTGGGGCGAT AGTGCCGCGT TGCGAAGGCC TGGAAACCAA AGATCGATTG CAGTTCCAGC AGGTGCTCGC GGCGCGTCTC 8700
GGCGCGCTGG CCGTAGTCCG CCCAGGCTTC GGGTGGCACC TTCAGTTGCG TGGCTACCAG ACGCAGCAAC GGCGGAAACG GCTCCGTATC TACCGCGAGC 8800
ATCATGCCGG GATAGCGCAT GTAGCAGAGC TGCACGGCGA AGCCCAGGAG GTTGGCTGGA CCGCGCCGTT GCCGGATGAT CGAGAGGTCG GATTCGCTGA 8900
ACGTGTAGTA GCGGATCAAG TCCTCCATGG TGTCCGGCAA CGCCAGCAGG CTCTCGCGCT CGGCGGCGGA CAGGATCGAA CGGCGCGGCA TGTCCGGCGA 9000
CTCCCTCTCG AATTCGCTTG GTTACATCTT CGGCGGCCAG TTATGCGTCT CACTCTGGCA GTGCTTGCAC CATGCATAGA ACGCGTCGTA CATCACCATG 9100
CCGTGCTTGA GCATTTCATG GTCGTCGCTG AAGTTCTGCG ACAAGCCAAG AGATAGCGCG TACAACCCTG TCGACTGCGG CGTCAGGTCA AGTCGGGATG 9200
TATCGGCACC GCGAACAATC ACGGCGAGCT GTTTGAGGGC AGGGTCATCC ATCGCGTACT TGGCCAGGAA GGCATCGAAG CTGCAAAGCT CACCGTCATG 9300
GGAAAGTTCC ACTCCAGGAA TGTCATACGG GACCGCGCCG GTCTCCTGAG CGATGCGCAG CACGTCGCCT GCCGGGACGT AGAGGAACTC GGCATGCGTG 9400
TCGATGAACC GCGTCACCAT CCACGGGCAG GCAATGCGAT CGATCTTCGG GCGTTCGCGG GTAATCCATT TCATGAGAGT CCTCCTGAGG GTTGGTGACT 9500
AGCCCAGCGA ATAGCCGCTC GTTGCCAGTC GATGTTGGCC ATGAATGCAT CTACATAGGC ACCGGCCTTG GCGCCATAGT CGATGTGATA CGCGTGTTCA 9600
TACATATCAA GCGCCAGGAT TGGCAAAGCG CCAGCCAAGG TGTGGCAGTG ATCGGCGGCC CACTGGTTGC TCAACTTGCC GTCGCGCGCT GATTGGACCA 9700
GCAGCACCCA GCCGGAACCG CCGCCGAGCG CCTTGCCCAT CGCCGAGAAT TCGGCCTGCC AGCGCTCGAC GCTGCCAAAG TCGCGCTCGA TCGCCTCGCA 9800
CAGCGGGCCT ACGGGCAACA CGCCGTTGCC GCCCAGCGAG TCGAAGTAGA TCTCATGCAG CAGCATCGAG TTGTAGGCGA TCAGTTCTTC GCGTTTGAGA 9900
CCGTTCAGCA GGAAGCCGGC TTCCTGCGTG AAGTCAAGGG TGCCGAGCTT GGTGTGGATG GCGTTCAGGC GTTTGACGGC CCCGGAATAG TTGTTTTCGT 10000
GATGGCTGGC GATCAGCTTT TCGGACAGAC CATTTAGGGT ACTTGCGGGG AAAGGCAGCG GTTTAATGTC GAAGGACATG GTGTCACCTC ACAAGATCAA 10100
GGTCTTCAAT ATCAGGCCCA GCACCGCACA TGCCGCGATG ACGTGAATCA CGTTTCGCTT GAAGCGGAAC AAGGCCACTG CCGCACCGAT GGTGATCAGT 10200
GCCGACACCC AGTCGAAGAG CCCGGCGAGG CCTTTGGGCC AGAGCACGTG ATAGGCGAAA AACACGGCCA GGTTCAGGAT CACACCGACC ACCGCTGCCG 10300
TTATCGCAGT CAGCGGCGCG GTAAACCGGA GATCTCCGCG CGTCGACTCT ACGAACGGCC CTCCGGCGAG GATGAAAATG AAGGACGGCA GGAAAGTGAA 10400
CCAGGTCACC AGCGCGGCTG CGACCGCACC GGCAACGAAC AACATGTCGG GGCCGAATAG TGCCTGTACA TAGGCCCCCA CAAAGCCGAC GAAGGCCACC 10500
ACCATAATGA GCGGCCCCGG ATTGGCCTCG CCGAGCGCCA ACCCATCGAC CATCTGCGTG GGGGTGAGCC AACTGTAATG GCCGACCGCA CCCTGATAGA 10600
TGTACGGCAA TACTGCATAG GCCCCCCCAA AGGTCAGTAA TGCGGCTTTC GTAAAGAACC AGCCCATCTG GGTCAGGGTG TGGTCCCAGC CATAGGTGCC 10700
GAGAAGGAAT GCCATGGGGA TCAGCCACAA CAGGCAACCC ATCGCGGCAA CCCGCAGCGA GCCAGACCAG CGGAACAGGG CGTGCGACGG TATGGGGGTG 10800
TTGTCATCGA TCAGCGCGGG GCCATAGGAT TTATCGGCCG CACGATGCGC GCCGCCGGCC CTGAATTTGT CCGGGGTAAC GCGACCGCCG ATATACCCTA 10900
TCAGCGCGGC CCCGGCGACG ATGGCCGGAA ACGGCACATT CATCGCGAAG ATGGCGACGA ATGCCCCAGC GGCAATCGCC CACATCAGGC CGTTCTTGAG 11000
AGCACGCGAG CCGATCCGAT GCACGGCCTG CACGACAACC GCGGTGATGG CAGGTTTGAT GCCGTAGAAC AGGCCGGAAA CCAGTGGCGT GTCACCGAAC 11100
GCAATGTAAA TCCACGACAG CGCGATCAGA AAGAAGAGCG AAGGCAGCAC AAACAGGACA CCTGCAACTA TGCCTCCCCA AGTTCGGTGC ATTAGCCAGC 11200
CCATGTATGT GGCGAGTTGC TGCCCCTCCG GGCCCGGAAG CAGCATGCAG AAGTTGAGCG CATGCAGAAA GCGACGCTCG CTGATCCAGC GCCGCTGTTC 11300
GACCAATTCG CGGTGCATGA TCGCGATCTG TCCGGCGGGG CCGCCGAAGC CTATGAACCC GATCTTTAAC CACAGGCGAA ACGCTTGCCA AAAGCTGACA 11400
ACTTGTGGGG AGGGAGGCAC TGCCACAGCC TCCTTGGTTA CGACGGTTTG AGTCATACGG TGAGGGTCCC TTTTTCAAAG CTGGCCAGTA AGCCATCGAA 11500
CACAGTGGAG GCGATGGCCA ATAGTTGATC GTCGTGGTCA ACCGTTTCCC GCAAACCGGC CAGTACGCTT TCGATACCAG TGGCCTCTGG CGGCTGGATG 11600
CCGCCCACGT CGAGGTAATG CACCACAAGG CCAATCCTTG TGATGGCGGG CTGTTCCAGC CCAAAGCTCG CCGCCAGGAC CTCGAACGTG ACACGGCTGC 11700
CGACATGGCT GAACGTCGCG CCATCGAAGT CGAAGCCCAA CGCATCCGGC GGGCAGTCCG CAGGGGTTGC CAACCAAAGG ATGCGCGCCT GCGGGTCGAT 11800
GAAGCGCCGG ATCAGCCATG CGCTGGCGAG CCGATCAACC CAAGGACGTG CGCGAGTGGC CCAGGTACGG GCCTGGTAAT CCAAACGATC CAAGCGGGTA 11900
ATAGTCCCCT CTACAGCGTG CGGCTCGTCC GGCGACAGCG TGCGGGCGCA AGCTTGTTCC AGTTCACACA ACGCACTGTC CGCTTGACGC TGAGCCTCGC 12000
CGGGGTAGAA GTCGATTTCA ACCAGCGTAG TAAAGGATTT GCGAAGCTTA CGCACCTGCC GCAGCACGTC TTGCACGGTA TCCAATGTCA GCGTCTGCCT 12100
GAGATGATGG ACATCGACTA GCAAGGCAGC AAAATCGTTG CTGCGGTCGA ATAATGCGAC GAAGTTGACC CCTTCAGGGT CTTCCATACG CAGCACATGA 12200
GCGACCCCAC CGCCTTCACG CACATCGGAG GCCAAGTTAT CCAGCACCGC GCGGCATTCG TCGCGGTCGG GCATCAGGTA CACGCCGTCG CGCAGCACCG 12300
CAGCGCCGGA GGCTTTGAGG GCACGCCACG TCCGTTGCCG GACGGTCGCG TTTTCAGTAG GCAAGGAAAG GATAAGCGAA AGCAAATTCA TTTCGTAGAT 12400
GTTACTACAA TAATGAGATA AGATCTACAA AATAAAGTCT TGAGTAGATT GCAAAAACGG TCATTTGCGC AATCACAATG GAGGCAGCAC ACCCATAAGG 12500
TGTTGTGCGT TGCTAAAACG TATTGCAAAA TAAAAACAAA TGGCAATAGG AATAAGCAAT CATGCGCATC GGTTACGCCC GCGTTTCGAC CCAAGAGCAA 12600
GACAACCAAG CTCAAATCTC TGCCTTGCAA TCGGCAGGGT GTGAGTTGAT CTTCCAGGAG AAGGCCTCTG GTGGGCGCTG GGATCGTCCG GAGCTGCATC 12700
GGTTGCTTGG CCACCTGCGC AAGGCCGATG TGGTGGTGGT ATGGAAACTG GATCGCTTGT CCCGGTCCTT GAAAGACCTG CTGCTGACCT TGGAAAAGAT 12800
TGAAGAAGCG GGCGCGGGCT TTCAGAGTCT CACTGAATCT ATCGACACCA CGACACCGGC TGGACGAATG ATGATGCAGA TCGTCGGTTC ATTCGCGGAG 12900
TTCGAGCGGG CGATGCTACG CGAGCGCACG CGGCACGGTC TGGAAGCTGC GCGCAAGGAT GGGCGCGTCG GCGGCCGACG TCCGAAGCTG ACGCAGCAAC 13000
AGCAAAAAGA GATCGTCGCC CTGATCACGT CGGGGCAGAA GACGGGGGCT GATGCTGCCC GCTTGTTTCG AGTCCATCCT TCCACCGTTG TGCGGCTGCT 13100
GGCCAAGCAT CGGCAGGGGC CGGGGTAGTC CGGCTTAGCG TACTTTGCGG TCCGTTTTCT GAGACGACCC CTCCGGCGGA CGAGCGCATC CAGCGCCTTC 13200
GCGCCGACCG CTGGATCGGC TATCCGCGCG CAGTCGAGGC GCTGAACCGG CTGGAAGCCC TTTATGCGTG GCCAAACAAG CAACGCATGC CCAACCTGCT 13300
GCTGGTTGGC CCGACCAACA ATGGCAAGTC GATGATCGTC GAGAAGTTCC GCCGCACCCA CCCGGCCAGC TCCGACGCCG ACCAGGAGCA CATCCCGGTG 13400
TTGGTCGTGC AGATGCCGTC CGAGCCGTCC GTGATCCGCT TCTACGTCGC GCTGCTCGCC GCGATGGGCG CGCCGCTGCG CCCACGCCCA CGGTTGCCGG 13500
AAATGGAGCA ACTGGCTCTG GCACTGCTGC GCAAGGTCGG CGTGCGCATG CTGGTGATCG ACGAGCTGCA CAACGTGCTG GCCGGCAACA GCGTCAACCG 13600
CCGGGAATTC CTCAACCTGC TGCGCTTCCT CGGCAACGAA CTGCGCATCC CGTTGGTTGG GGTAGGCACG CGCGACGCCT ACCTAGCCAT CCGCTCCGAT 13700
GACCAGTTGG AAAATCGCTT CGAGCCGATG ATGCTGCCGG TATGGGAGGC CAACGACGAT TGCTGCTCAC TGCTGGCCAG CTTCGCCGCT TCGCTCCCGC 13800
TGCGCCGGCC TTCCCCAATT GCCACGCTGG ACATGGCTCG CTACCTGCTC ACACGCAGCG AGGGCACCAT AGGGGAACTG GCGCACTTGC TGATGGCGGC 13900
GGCCATCGTC GCCGTGGAGA GCGGCGAGGA AGCGATCAAC CATCGCACAC TCAGCATGGC CTGTCGACAA CCTCTCGCGC AACCAAGACA TCGCGGTCGG 14000
ACTGCAAGTG ATCTTGAAGC CACGGGCCCG TCCCACCCCG ACATGGACCT CGATGCCCGA ACGGACGTTA GATTTCGAGT TCTAGGCGTT CTGCGATGAA 14100
GGTTGGATCC CAGCCGGGAT TGAAAGTGTC GACGTGGGTG AATCCGAGCC GCTCGTATAG GCCACGCAGG TTCGGGTGGC AGTCGAGCCG CAGCTTGGCG 14200
CACCCCTGCG TTCGCGCGGC ATGGCGGCAA GCCTCGATCA GCGCGGAGCT GACACCCCGG CCCGCATGTG TCCGTCGCAC CGCGAGCTTG TGCAGATATG 14300
CGGCCTCCCC CTTGAGGGCG TCGGGCCAGA ACTCGGGATC CTCGGCCGAC AAGGTGCAAC AGCCGACGAT GCCGTCGCTG CAACTCGCGA CTAGGAGCTC 14400
GGATCTCAGG ACGAAGGTCT CCGCGAATGT CCGGTCGATC CGCGCGACGT CCCAGGCGGG CGTTCCCTTG GCGGACATCC ACGCCGCAGC GTCGTGCATC 14500
AGCCGCACAA CCTCGTCGAT ATCACCCGAG CAGGCGACCC GAACGTTCGG AGGCTCCTCG CTGTCCATTC GCTCCCCTGG CGCGGTATGA ACCGCCGCCT 14600
CATAGTGCAG TTTGATCCTG ACGAGCCCAG CATGTCTGCG CCCACCTTCG CGGAACCTGA CCAGGGTCCG CTAGCGGGCG GCCGGAAGGT GAATGCTAGG 14700
CATGATCTAA CCCTCGGTCT CTGGCGTCGC GACTGCGAAA TTTCGCGAGG GTTTCCGAGA AGGTGATTGC GCTTCGCAGA TCTCCAGGCG CGTGGGTGCG 14800
GACGTAGTCA GCGCCATTGC CGATCGCGTG AAGTTCCGCC GCAAGGCTCG CTGGACCCAG ATCCTTTACA GGAAGGCCAA CGGTGGCGCC CAAGAAGGAT 14900
TTCCGCGACA CCGAGACCAA TAGCGGAAGC CCCAACGCCG ACTTCAGCTT TTGAAGGTTC GACAGCACGT GCAGCGATGT TTCCGGTGCG GGGCTCAAGA 15000
AAAATCCCAT CCCCGGATCG AGGATGAGCC GGTCGGCAGC GACCCCGCTC CGTCGCAAGG CGGAAACCCG CGCCTCGAAG AACCGCACAA TCTCGTCGAG 15100
CGCGTCTTCG GGTCGAAGGT GACCGGTGCG GGTGGCGATG CCATCCCGCT GCGCTGAGTG CATAACCACC AGCCTGCAGT CCGCCTCAGC AATATCGGGA 15200
TAGAGCGCAG GGTCAGGAAA TCCTTGGATA TCGTTCAGGT AGCCCACGCC GCGCTTGAGC GCATAGCGCT GGGTTTCCGG TTGGAAGCTG TCGATTGAAA 15300
CACGGTGCAT CTGATCGGAC AGGGCGTCTA AGAGCGGCGC AATACGTCTG ATCTCATCGG CCGGCGATAC AGGCCTCGCG TCCGGATGGC TGGCGGCCGG 15400
TCCGACATCC ACGACGTCTG ATCCGACTCG CAGCATTTCG ATCGCCGCGG TGACAGCGCC GGCGGGGTCT AGCCGCCGGC TCTCATCGAA GAAGGAGTCC 15500
TCGGTGAGAT TCAGAATGCC GAACACCGTC ACCATGGCGT CGGCCTCCGC AGCGACTTCC ACGATGGGGA TCGGGCGAGC AAAAAGGCAG CAATTATGAG 15600
CCCCATACCT ACAAAGCCCC ACGCATCAAG CTTTTGCCCA TGAAGCAACC AGGCAATGGC TGTAATTATG ACGACGCCGA GTCCCGACCA GACTGCATAA 15700
GCAACACCGA CAGGGATGGA TTTCAGAACC AGAGAAAGAA AATAAAATGC GATGCCATAA CCGATTATGA CAACGGCGGA AGGGGCAAGC TTAGTAAAGC 15800
CCTCGCTAGA TTTTAATGCG GATGTTGCGA TTACTTCGCC AACTATTGCG ATAACAAGAA AAAGCCAGCC TTTCATGATA TATCTCCCAA TTTGTGTAGG 15900
GCTTATTATG CACGCTTAAA AATAATAAAA GCAGACTTGA CCTGATAGTT TGGCTGTGAG CAATTATGTG CTTAGTGCAT CTAACGCCGG AGTTAAGCCG 16000
CCGCGCGTAG CGCGGTCGGC TTGAACGAAT TGTTAGACAT CATTTACCAA CTGACTTGAT GATCTCGCCT TTCACAAAGC GAATAAATTC TTCCAAGTGA 16100
TCTGCGCGTG AGGCCAAGTG ATCTTCTTTT TGTCCCAGAT AAGCTTGCTT AGCTTCAAGT AAGACGGGCT GATACTGGGC AGGTAGGCGT TTTATTGCCC 16200
AGTCGGCAGC GACATCCTTC GGCGCGATTT TGCCGGTTAT TGCGCTGTAC CAAATGCGGG ACAACGTAAG CACTACATTT CGCTCATCGC CGGCCCAGTC 16300
GGGCTGCGAG TTCCATAGCT TCAAGGTTTC CCTCAGCGCC TCGAATAGAT CCTGTTCAGG AACCGGGTCA AAGAATTCCT CCGCTGCCGG ACCTACCAAG 16400
GCAACGCTAT GTTCTCTTGC TTTTGTAAGC AGGATAGCTA GATCAATGTC GATCATGGCT GGCTCGAAGA TACCCGCAAG AATGTCATTG CGCTGCCATT 16500
CTCCAAATTG CAGCTCGCGC TTAGCCGGAT AACGCCACGG GATGATGTCG TCATGCACGA CAAGGGTGAC TTCTATAGCG CGGAGCGTCT CGCTCTCGCC 16600
AGGGAAAGCC GAAGCCTCCA TAAGATCATT GAGCAATGCT CGCCGCGTCG TTTCATCAAG CTTTACGGCC ACAGTAACCA ACAAATCAAT ATCGCTGTAT 16700
GGCTTCAGGC CGCCATCCAC TGCGGAGCCG TACAAATGCA CGGCCAGCAA CGTTGATTCC AGATGGCGCT CAATGACGCT TAGCACCTCT GATAGTTGGT 16800
TCGAAATTTC GATGGTCACC GCTTCCCTCA TGATGTCTAA CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA 16900
TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA AAACCGCCAC TGCGCCGTTA 17000
CCACCGCTGC GTTCGGTCAA GGTTCTGGAC CAGTTGCGTG AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA GGCTTATGTC CACTGGGTTC 17100
GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC 17200
GGTCTCCACG CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG CCCTGGCTTC AGGAGATCGG AAGACCTCGG 17300
CCGTCGCGGC GCTTGCCGGT GGTGCTGACC CCGGATGAAG TGGTTCGCAT CCTCGGTTTT CTGGAAGGCG AGCATCGTTT GTTCGCCCAG CTTCTGTATG 17400
GAACGGGCAT GCGGATCAGT GAGGGTTTGC AACTGCGGGT CAAGGATCTG GATTTCGATC ACGGCACGAT CATCGTGCGG GAGGGCAAGG GCTCCAAGGA 17500
TCGGGCCTTG ATGTTACCCG AGAGCTTGGC ACCCAGCCTG CGCGAGCAGC TGTCGCGTGC ACGGGCATGG TGGCTGAAGG ACCAGGCCGA GGGCCGCAGC 17600
GGCGTTGCGC TTCCCGACGC CCTTGAGCGG AAGTATCCGC GCGCCGGGCA TTCCTGGCCG TGGTTCTGGG TTTTTGCGCA GCACACGCAT TCGACCGATC 17700
CACGGAGCGG TGTCGTGCGT CGCCATCACA TGTATGACCA GACCTTTCAG CGCGCCTTCA AACGTGCCGT AGAACAAGCA GGCATCACGA AGCCCGCCAC 17800
ACCGCACACC CTCCGCCACT CGTTCGCGAC GGCCTTGCTC CGCAGCGGTT ACGACATTCG AACCGTGCAG GATCTGCTCG GCCATTCCGA CGTCTCTACG 17900
ACGATGATTT ACACGCATGT GCTGAAAGTT GGCGGTGCCG GAGTGCGCTC ACCGCTTGAT GCGCTGCCGC CCCTCACTAG TGAGAGGTAG GGCAGCGCAA 18000
GTCAATCCTG GCGGATTCAC TACCCCTGCG CGAAGGCCAT CGGTGCCGCA TCGAACGGCC GGTTGCGGAA AGTCCTCCCT GCGTCCGCTG ATGGCCGGCA 18100
GCAGCCCGTC GTTGCCTGAT GGATCCAACC CCTCCGCTGC TATAGTGCAG TCGGCTTCTG ACGTTCAGTG CAGCCGTCTT CTGAAAACGA CAGCGCCGTC 18200
AGAATAGAAT CCGCTTTCAC ATTCTTTGAC ACATGCTTGC CAAGGTCATA GATTTCAGCC TGACAAATTC AAGGCTTCGG GCGCAATGGA ACCAAAAACC 18300
AACGTAAGCC CTACAGCCCA TGGAGGCATC TTGCAGGGAC AACGCATCGG TTATGTCCGG GTCAGCAGTT ACGATCAGAA TCCGGAACGA CAACTTGAGC 18400
AAGTTGAGGT CGGCAAGCTG TTCACCGACA AAGCCTCGGG CAGGGACACC CAGCGTCCCC AGCTGGAGGC CATGCTCGGC TTCGTCCGCG AGGGCGACAC 18500
CGTTGTGGTG CACAGCATGG ATCGCCTGGC CCGTAACCTC GATGACTTGC GACGCCTGGT GCAGAAGCTG ACCCAGCGCG GCGTGCGTAT CGAGTTCCTG 18600
AAAGAGGGCC TGGTGTTCAC CGGCGATGAC TCGCCGATGG CCAACCTGAT GCTGTCGGTG ATGGGGGCCT TCGCCGAGTT CGAGCGCGCC CTGATCCGTG 18700
AGCGGCAACG GGAGGGCATC GCCCTGGCCA AGCAGCGCGG CGCGTACCGG GGCCGCAAGA AGGCCCTGTC CGACGAGCAG GCTGCTACCC TGCGACAGCG 18800
GGCGTCGGCC GGCGAGCCCA AAGCGCAGCT TGCCCGCGAG TTCAACATCA GCCGGGAAAC TCTCTACCAG TACCTACGCA CGGACGATTG ATACATGCCG 18900
CGTCGCTTGA TCCTCTCGGC TACGGAGCGG GATACCCTGC TCGCGTTGCC GGAAAGCCAG GATGACCTGA TCCGCTACTA CACCTTCAAC GACTCCGACC 19000
TGTCGCTGAT CCGCCAGCGG CGCGGCGACG CCAACCGCCT GGGCTTCGCG GTGCAGCTCA GCCTGCTGCG ATATCCAGGC TATGCGCTGG GCAGCGACAG 19100
CGAGTTGCCC GAGCCGGTCA TCCAGTGGGT GGCCAAGCAA GTTCAGGCCG ACCCAACGAG TTGGGCGAAA TACGGCGAAC GCGACGTGAC TCGCCGCGAG 19200
CACGCCCAGG AACTGCGCAC CTACCTACAA CTGGCCCCGT TCGGCCTGTC CGACTTCCGC GCCCTGGTGC GCGAGCTGAC CGAGTTGGCC CAGCAGACCG 19300
ACAAGGGTTT GCTGCTGGCC GGCCAGGCGC TGGAGAGTCT GCGGCAGAAG CGGCGCATCC TGCCGGCGCT GAGCGTGATT GACCGGGCCT GCTCGGAAGC 19400
CATTGCGCGG GCCAATCGCC GGGTCTACCG CGCCCTGGTC GAACCACTCA CGGACTCGCA TCGGGCCAAA CTGGACGAGC TGTTGAAGCT CAAGGCCGGC 19500
AGCAGCATCA CCTGGTTGAC CTGGTTGCGG CAGGCCCCAC TAAAACCGAA CTCCCGGCAC ATGCTCGAAC ACATCGAGCG GCTGAAGACA TTTCAGCTGG 19600
TGGATTTGCC CGAAGCTCTG GGCCGGCACA TCCACCAGAA CCGCCTGCTC AAGCTGGCCC GCGAGGGTGG GCAGATGACG CCCAAAGACC TCTGTAAGTT 19700
CGAGCCGCAG CGGCGCTACG CGACCCTGGC CGCCGTGGTG CTGGAGAGTA CGGCGACCGT GATTGATGAG CTGGTCGATC TGCACGACCG CATCCTGGTC 19800
AAGCTGTTCA GCGGCGCGAA GCACAAGCAT CAGCAGCAGT TCCAGAAGCA AGGCAAGGCG ATCAACGACA AGGTGCGCCT GTACTCCAAG ATCGGCCAGG 19900
CCCTGCTGGA GGCCAAGGAA AGCGGCAGCG ATCCCTACGC CGCCATCGAG GCGGTGATTC CCTGGGACGA GTTCACCGAG AGCGTCAGCG AGGCCGAGCT 20000
GCTGGCCCGG CCGGAGGGCT TCGACCATCT GCACCTGGTT GGAGAGAACT TCGCCACCCT GCGCCGCTAT ACGCCAGCCT TGTTGGAGGT GCTGGAACTG 20100
CGCGCCGCCC CGGCTGCGCA AGGCGTGCTG GCGGCCGTGC AGACCCTGCG CGAGATGAAC GCCGACAACC TGCGCAAGGT GCCGGCCGAT GCGCCCACCG 20200
CCTTCATCAA GCAGCGCTGG AGGCCGCTAG TGATAACCCC GGAAGGCCTC GACCGGCGCT TCTACGAAAT CTGCGCCCTG TCAGAGCTGA AGAACGCGCT 20300
GCGCTCCGGC GACATCTGGG TCAAGGGCTC GCGGCAGTTC CGCGACTTCG ACGACTACCT GCTGCCGGCA GAGAGGTTCG CCGCGCTCAA GCATGCGCAG 20400
GCTCTGCCCC TGGCGATCAA CCCGAACAGG AACCAGTACC TGGAAGAGCG CTTGCAGCTG CTGGACGAGC AGCTGGCCAC CGTCACCCGC CTGGCCAAGG 20500
ACAACGAGCT GCCCGATGCC ATCCTCACCG AGTCGGGGCT GAAGATCACC CCACTGGATT CCGCGGTGCC CAATACCGCG CAGGCGCTGA TCGACCAGAC 20600
CAGCCAGTTG CTGCCGCGCA TCAAGATCAC CGAACTGCTG ATGGACGTGG ACGACTGGAC GGGCTTCAGC CGCCACTTCA CCCACCTGAA GGACGGTGCC 20700
GAGGCCAAAG ACCGGACATT GCTGCTGGCA GCGATCCTGG GCGATGCGAT CAACCTCGGG CTGACCAAGA TGGCCGAGTC GAGCCCCGGC CTGACCTACG 20800
CCAAGCTGTC CTGGCTGCAA GCCTGGCACA TCCGAGACGA AACCTACTCG GCGGCCCTAG CCGAGCTGGT CAACCACCAG TACCGTCATA CCTTCGCCGC 20900
TCACTGGGGC GACGGCACGA CCTCTTCTTC CGATGGCCAG CGCTTCCGGG CGGGCGGTCG GGGCGAAAGC ACCGGGCACG TCAACCCGAA GTACGGCAGC 21000
GAGCCGGGGC GGCTGTTCTA CACCCATATC TCCGACCAGT ACGCACCGTT CAGCACCCGC GTGGTGAATG TCGGCGTGCG CGATTCCACC TATGTGCTCG 21100
ACGGCTTGCT GTACCACGAG TCCGACCTAC GGATTGAGGA GCACTACACC GACACGGCTG GCTTCACCGA TCACGTCTTC GCCCTGATGC ACCTGCTGGG 21200
CTTCCGCTTC GCACCGCGCA TCCGCGACCT CGGCGAAACC AAGCTGTATG TTCCGAATAG CGTCCAGGAC TACCCGACAT TGCGCCCAAT GGTTGGCGGC 21300
ACCCTGAACA TCAAGCATGT CCGCGCCCAC TGGGACGACA TCCTGCGCCT GGCCAGCTCG ATCAAGCAGG GCACCGTCAC TGCCTCGCTG ATGCTGCGCA 21400
AGCTCGGCAG CTACCCGCGC CAGAACGGTC TGGCCGTGGC CTTGCGCGAA CTGGGCCGGA TTGAGCGCAC ACTGTTCATC CTCGACTGGC TGCAAAGCGT 21500
AGAGCTACGT CGCCGCGTGC ATGCCGGACT GAACAAGGGC GAGGCGCGCA ACTCCCTGGC CAGGGCGGTG TTCTTCAACC GCCTCGGCGA AATCAGGGAT 21600
CGGAGCTTCG AGCAGCAGCG CTACCGGGCC AGCGGTCTCA ACCTGGTGAC GGCCGCCATC GTGCTGTGGA ACACGGTGTA CTTGGAACGC GCCACCCAGG 21700
CGATGGGCGA AGCGGGGAAG TCGGTGGATG GCGAGCTGCT GCAGTACCTG TCGCCGCTGG GGTGGGAGCA CATCAACCTG ACCGGCGATT ATGTCTGGCG 21800
GCAGAGCCGC AGGCTGGAGG ACGGGAAGTT TCGGCCGCTA AGGCTGCCCG GAAAACCTTA GCGTACGATT TTTTCCGAAT TCTGCGGGCT CCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
attC-aadA3 5'-end |
15985-16038 |
54 |
CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC |
attI |
16841-16896 |
56 |
CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA |
res_site_II |
18205-18231 |
27 |
TAGAATCCGC TTTCACATTC TTTGACA |
res_site_III |
18234-18265 |
32 |
TGCTTGCCAA GGTCATAGAT TTCAGCCTGA CA |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
taoD |
Tn5045 |
83-1045 |
Passenger Gene |
Other |
- |
taoC' |
Tn5045 |
1167-1370 |
Passenger Gene |
Other |
- |
taoB |
Tn5045 |
1370-2221 |
Passenger Gene |
Other |
- |
taoA |
Tn5045 |
2236-3723 |
Passenger Gene |
Other |
- |
tniA |
In_Tn5045 |
4205-5920 |
Transposase |
|
+ |
tniB N-ter |
In_Tn5045 |
5923-5991 |
Transposase |
|
+ |
tnpA |
TnOtChr.1 |
6025-8991 |
Transposase |
|
- |
chrF |
TnOtChr.1 |
9022-9474 |
Passenger Gene |
Heavy Metal Resistance |
- |
chrC |
TnOtChr.1 |
9471-10079 |
Passenger Gene |
Heavy Metal Resistance |
- |
chrA |
TnOtChr.1 |
10089-11456 |
Passenger Gene |
Heavy Metal Resistance |
- |
chrB |
TnOtChr.1 |
11453-12391 |
Passenger Gene |
Heavy Metal Resistance |
- |
tnpR |
TnOtChr.1 |
12562-13128 |
Accessory Gene |
Resolvase |
+ |
tniB C-ter |
In_Tn5045 |
13172-14099 |
Transposase |
|
+ |
GNAT_fam |
In_Tn5045 |
14068-14568 |
Passenger Gene |
Antibiotic Resistance |
- |
sul1 (ARO:3000410) |
In_Tn5045 |
14696-15535 |
Passenger Gene |
Antibiotic Resistance |
- |
qacEdelta1 (ARO:3005010) |
In_Tn5045 |
15529-15876 |
Passenger Gene |
Antibiotic Resistance |
- |
aadA3 (ARO:3002603) |
In_Tn5045 |
16040-16831 |
Passenger Gene |
Antibiotic Resistance |
- |
intI1 |
In_Tn5045 |
16977-17990 |
Integron Integrase |
Class 1 |
+ |
tnpR |
Tn5045 |
18331-18891 |
Accessory Gene |
Resolvase |
+ |
tnpA |
Tn5045 |
18895-21861 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
taoD |
TaoD |
Tn5045 |
963 |
83-1045 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Sequence Family: | SdiA-Regulated Motif Containing Protein |
Comment: | DNA binding protein |
Protein Sequence:
|
MYLMAKNWLS RTRKASAWMW ALLCLVLLTV FQVRTHHLDD RLYFWIKTSW HTDDWQERSV WLPGYRVELD AKAVPGVDNN LSGLTFDPDL NLLWAVTNGP NELLALSRDG DVERRYNLDG FHDVEAVSYA GNGQLVIAEE RRQSLVIVDV PIAEDGKLSP DRSLSRDQYP ALTLALGKED NKGLEGLAYD LKGDRLFVTK ERDPRQLLEV GGLRASLAGG FSLHVRDLSN LVKDKVFATD LSSVVFDQQS GHLILLSDES KLLIEMTDEG KVVSFRSLAR GFAGLLKGIP QAEGVTIDDE GYLYVVSEPN LFYRFTREPD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
taoC' |
TaoC' |
Tn5045 |
204 |
1167-1370 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Comment: | truncated compared to Tn1404 due to frameshift-causing deletion |
Protein Sequence:
|
MSMDDALDLA HFKTLLEQRA AELDQLLEDA ESRSQSVELD QSKVGRLSRI DALQQQAMMA GVTNAAS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
taoB |
TaoB |
Tn5045 |
852 |
1370-2221 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Comment: | also known as uspA1 |
Protein Sequence:
|
MTQVIACIDA SASAPAVCDY AAWASLSLEA PLTFLHVLDQ RQYPVTADLS GNIGLGSREH LLDELASLDE QRGKLALEQG RIMLAAAKER AIKDGVAAPE SKQRHGDLLE SLQELQTETR LLVIGRQGES SGGLSQHVGS QLESVIRIMH RPILVTPANF QKPESAMLAF DGGATTRKGV EMLAASPLLK GLPIHLVMVG PVSDESSAQL DWAQKVLLNA GFTVRAEALN GEIEPTLHAY QKEHGIDLLV MGAYGHSRIR QFLVGSTTTS MLRTTTSPLL LLR
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
taoA |
TaoA |
Tn5045 |
1488 |
2236-3723 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Sequence Family: | SulP Family Inorganic Anion Transporter |
Comment: | also known as sulP sulphate permease |
Protein Sequence:
|
MLHSLKQTWL SNIRGDILAG IVVALALIPE AIAFSIIAGV DPKVGLYASF CIAVVIAFVG GRPGMISAAT GAMALLMVTL VKNHGLEYLL AATLLCGVLQ IAAGYLKLGS LMRFVSRSVV TGFVNALAIL IFMAQLPELT NVTWHVYAMT AAGLGIIYLF PYVPKIGKLI PSPLVCIIVL TAVAMSVGLD IRTVGDMGEL PDTLPIFLWP DVPLTFETLA IIFPYSAALA VVGLLESMMT ATIVDDLTDT SSDKNRECKG QGVANIASGL IGGMAGCAMI GQSIINVKSG GRSRLSSLAA GVFLLLMVVF LGDWLKQIPM AALVAVMIMV SIGTFSWDSL RNLKKHPLST NIVMVVTVVV VVATHNLAFG VLAGVLLAAM FFANKVGHYM AISSLLDEAG EHRSYNVTGQ VFFSSADKFV AAFDFKEALN KVTIDLNRAH FWDITAVAAL DKVVIKFRRE GTEVEVLGLN EASATIVDRF GVHDKPDAID QLMGH
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniA |
TniA |
In_Tn5045 |
1716 |
4205-5920 |
+ |
Class: | Transposase |
Function: | transposase |
Transpoase Chemistry: | DDE |
Comment: | identical to tniA (Tn1721) |
Protein Sequence:
|
MLNTRVHQSE VSMATDTPRI PEQGVATLPD EAWERARRRA EIISPLAQSE TVGHEAADMA AQALGLSRRQ VYVLIRRARQ GSGLVTDLVP GQSGGGKGKG RLPEPVERVI HELLQKRFLT KQKRSLAAFH REVTQVCKAQ KLRVPARNTV ALRIASLDPR KVIRRREGQD AARDLQGVGG EPPAVTAPLE QVQIDHTVID LIVVDDRDRQ PIGRPYLTLA IDVFTRCVLG MVVTLEAPSA VSVGLCLVHV ACDKRPWLEG LNVEMDWQMS GKPLLLYLDN AAEFKSEALR RGCEQHGIRL DYRPLGQPHY GGIVERIIGT AMQMIHDELP GTTFSNPDQR GDYDSENKAA LTLRELERWL TLAVGTYHGS VHNGLLQPPA ARWAEAVARV GVPAVVTRAT SFLVDFLPIL RRTLTRTGFV IDHIHYYADA LKPWIARRER WPSFLIRRDP RDISRIWVLE PEGQHYLEIP YRTLSHPAVT LWEQRQALAK LRQQGREQVD ESALFRMIGQ MREIVTSAQK ATRKARRDAD RRQHLKTSAR PDKPVPPDTD IADPQADNLP PAKPFDQIEE W
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB N-ter |
TniB N-ter |
In_Tn5045 |
69 |
5923-5991 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | disrupted by TnOtChr.1 |
Protein Sequence:
|
VDEYPIIDLS HLLPAAQGLA RLP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
TnOtChr.1 |
2967 |
6025-8991 |
- |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPRRSILSAA ERESLLALPD TMEDLIRYYT FSESDLSIIR QRRGPANLLG FAVQLCYMRY PGMMLAVDTE PFPPLLRLVA TQLKVPPEAW ADYGQRAETR REHLLELQSI FGFQAFATRH YRPSVHSLDE LAWQTDKGIV LATELIEGLR QKSVLLPSPG VIERICAEAI TRANRRIYEA LSEPLTDPHR HRLDDLLKRR EFGKTTWLAW LRQSPVKPNS RHMLEHIERL KAWQALDLPT GIERLLHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LAIEGMATVI DEIIDLHDRI LGKLFNAAKH KHQQQFQASG KAINAKVRLY GRIGQALIDA KQSGGNPFAA IEAVMSWDAF AQSVTEAQKL AQPDDFDFLH RIGESYATLR RYAPEFLAVL KLRAAPAAKG VLDAVDVLRG MNTDNVRKVP TDAPTDFIKP RWQKLVMTDA GIDRRYYELC ALSELKNSLR SGDIWVQGSR QFKDFEDYLV PHEKFASLKQ GSELPLDVAT DCDQYLYERL TLLEAQLATV NRMAAANDLP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRYFTHLKS GDLAKDKNLL LTTILADAIN LGLSKMAESC PGTTYAKLAW LQAWHIRDET YSTALAELVN TQFRHPFACH WGDGTTSSSD GQNFRTGSKA KSTGHINPKY GSSPGRTFYT HISDQYAPFH AKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYLPKGD TAYEALKPMV GGTLNIKHVR AHWDEILRLA TSIKHGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AVVLWNTVYL ERAAHALRGN GLGIDDALLQ YLSPLGWEHI NLTGDYLWRS SAKIGAGKFR PLRPLQPP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
chrF |
ChrF |
TnOtChr.1 |
453 |
9022-9474 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Chromate |
Protein Sequence:
|
MKWITRERPK IDRIACPWMV TRFIDTHAEF LYVPAGDVLR IAQETGAVPY DIPGVELSHD GELCSFDAFL AKYAMDDPAL KQLAVIVRGA DTSRLDLTPQ STGLYALSLG LSQNFSDDHE MLKHGMVMYD AFYAWCKHCQ SETHNWPPKM
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
chrC |
ChrC |
TnOtChr.1 |
609 |
9471-10079 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Chromate |
Comment: | superoxide dismutase unkown function |
Protein Sequence:
|
MSFDIKPLPF PASTLNGLSE KLIASHHENN YSGAVKRLNA IHTKLGTLDF TQEAGFLLNG LKREELIAYN SMLLHEIYFD SLGGNGVLPV GPLCEAIERD FGSVERWQAE FSAMGKALGG GSGWVLLVQS ARDGKLSNQW AADHCHTLAG ALPILALDMY EHAYHIDYGA KAGAYVDAFM ANIDWQRAAI RWASHQPSGG LS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
chrA |
ChrA |
TnOtChr.1 |
1368 |
10089-11456 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Chromate |
Comment: | chromate efflux pump |
Protein Sequence:
|
MTQTVVTKEA VAVPPSPQVV SFWQAFRLWL KIGFIGFGGP AGQIAIMHRE LVEQRRWISE RRFLHALNFC MLLPGPEGQQ LATYMGWLMH RTWGGIVAGV LFVLPSLFFL IALSWIYIAF GDTPLVSGLF YGIKPAITAV VVQAVHRIGS RALKNGLMWA IAAGAFVAIF AMNVPFPAIV AGAALIGYIG GRVTPDKFRA GGAHRAADKS YGPALIDDNT PIPSHALFRW SGSLRVAAMG CLLWLIPMAF LLGTYGWDHT LTQMGWFFTK AALLTFGGAY AVLPYIYQGA VGHYSWLTPT QMVDGLALGE ANPGPLIMVV AFVGFVGAYV QALFGPDMLF VAGAVAAALV TWFTFLPSFI FILAGGPFVE STRGDLRFTA PLTAITAAVV GVILNLAVFF AYHVLWPKGL AGLFDWVSAL ITIGAAVALF RFKRNVIHVI AACAVLGLIL KTLIL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
chrB |
ChrB |
TnOtChr.1 |
939 |
11453-12391 |
- |
Class: | Passenger Gene |
Sub Class: | Heavy Metal Resistance |
Target: | Chromate |
Comment: | chromate-sensitive regulator of chrBACF operon exclusively activated by Cr(VI) |
Protein Sequence:
|
MNLLSLILSL PTENATVRQR TWRALKASGA AVLRDGVYLM PDRDECRAVL DNLASDVREG GGVAHVLRME DPEGVNFVAL FDRSNDFAAL LVDVHHLRQT LTLDTVQDVL RQVRKLRKSF TTLVEIDFYP GEAQRQADSA LCELEQACAR TLSPDEPHAV EGTITRLDRL DYQARTWATR ARPWVDRLAS AWLIRRFIDP QARILWLATP ADCPPDALGF DFDGATFSHV GSRVTFEVLA ASFGLEQPAI TRIGLVVHYL DVGGIQPPEA TGIESVLAGL RETVDHDDQL LAIASTVFDG LLASFEKGTL TV
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
TnOtChr.1 |
567 |
12562-13128 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MRIGYARVST QEQDNQAQIS ALQSAGCELI FQEKASGGRW DRPELHRLLG HLRKADVVVV WKLDRLSRSL KDLLLTLEKI EEAGAGFQSL TESIDTTTPA GRMMMQIVGS FAEFERAMLR ERTRHGLEAA RKDGRVGGRR PKLTQQQQKE IVALITSGQK TGADAARLFR VHPSTVVRLL AKHRQGPG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB C-ter |
TniB C-ter |
In_Tn5045 |
928 |
13172-14099 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | disrupted by TnOtChr.1 |
Protein Sequence:
|
PADERIQRLR ADRWIGYPRA VEALNRLEAL YAWPNKQRMP NLLLVGPTNN GKSMIVEKFR RTHPASSDAD QEHIPVLVVQ MPSEPSVIRF YVALLAAMGA PLRPRPRLPE MEQLALALLR KVGVRMLVID ELHNVLAGNS VNRREFLNLL RFLGNELRIP LVGVGTRDAY LAIRSDDQLE NRFEPMMLPV WEANDDCCSL LASFAASLPL RRPSPIATLD MARYLLTRSE GTIGELAHLL MAAAIVAVES GEEAINHRTL SMACRQPLAQ PRHRGRTASD LEATGPSHPD MDLDARTDVR FRVLGVLR
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
GNAT_fam |
GNAT_fam |
In_Tn5045 |
501 |
14068-14568 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | Acetyltransf_1 (Pfam:PF00583) |
Comment: | putative acetyltransferase ADU64769.1 |
Protein Sequence:
|
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
sul1 (ARO:3000410) |
Sul1 |
In_Tn5045 |
840 |
14696-15535 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target replacement (ARO:0001002) |
Transpoase Chemistry: | dihydropteroate synthase |
Target: | sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401) |
Sequence Family: | sulfonamide resistant sul (ARO:3004238) |
Comment: | perfect match to reference sequence for ARO:3000410 |
Protein Sequence:
|
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
qacEdelta1 (ARO:3005010) |
QacEdelta1 |
In_Tn5045 |
348 |
15529-15876 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic efflux (ARO:0010000) |
Target: | disinfecting agents and antiseptics (ARO:3005386) |
Sequence Family: | major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002) |
Comment: | subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219) |
Protein Sequence:
|
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL ARSPSWKSLR RPTPW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
aadA3 (ARO:3002603) |
AadA3 |
In_Tn5045 |
792 |
16040-16831 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | ANT(3'') (ARO:3004275) |
Comment: | strict match to reference sequence for ARO:3002603 (bitscore: 522) |
Protein Sequence:
|
MREAVTIEIS NQLSEVLSVI ERHLESTLLA VHLYGSAVDG GLKPYSDIDL LVTVAVKLDE TTRRALLNDL MEASAFPGES ETLRAIEVTL VVHDDIIPWR YPAKRELQFG EWQRNDILAG IFEPAMIDID LAILLTKARE HSVALVGPAA EEFFDPVPEQ DLFEALRETL KLWNSQPDWA GDERNVVLTL SRIWYSAITG KIAPKDVAAD WAIKRLPAQY QPVLLEAKQA YLGQKEDHLA SRADHLEEFI RFVKGEIIKS VGK
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
intI1 |
IntI1 |
In_Tn5045 |
1014 |
16977-17990 |
+ |
Class: | Integron Integrase |
Sub Class: | Class 1 |
Transpoase Chemistry: | Tyrosine |
Sequence Family: | Class 1 Integron Tyrosine Integrase |
Protein Sequence:
|
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn5045 |
561 |
18331-18891 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MQGQRIGYVR VSSYDQNPER QLEQVEVGKL FTDKASGRDT QRPQLEAMLG FVREGDTVVV HSMDRLARNL DDLRRLVQKL TQRGVRIEFL KEGLVFTGDD SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKALSDEQ AATLRQRASA GEPKAQLARE FNISRETLYQ YLRTDD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5045 |
2967 |
18895-21861 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | putative transposase of Tn5045 identical to tnpR of Tn1013 in pBS228 |
Protein Sequence:
|
MPRRLILSAT ERDTLLALPE SQDDLIRYYT FNDSDLSLIR QRRGDANRLG FAVQLSLLRY PGYALGSDSE LPEPVIQWVA KQVQADPTSW AKYGERDVTR REHAQELRTY LQLAPFGLSD FRALVRELTE LAQQTDKGLL LAGQALESLR QKRRILPALS VIDRACSEAI ARANRRVYRA LVEPLTDSHR AKLDELLKLK AGSSITWLTW LRQAPLKPNS RHMLEHIERL KTFQLVDLPE ALGRHIHQNR LLKLAREGGQ MTPKDLCKFE PQRRYATLAA VVLESTATVI DELVDLHDRI LVKLFSGAKH KHQQQFQKQG KAINDKVRLY SKIGQALLEA KESGSDPYAA IEAVIPWDEF TESVSEAELL ARPEGFDHLH LVGENFATLR RYTPALLEVL ELRAAPAAQG VLAAVQTLRE MNADNLRKVP ADAPTAFIKQ RWRPLVITPE GLDRRFYEIC ALSELKNALR SGDIWVKGSR QFRDFDDYLL PAERFAALKH AQALPLAINP NRNQYLEERL QLLDEQLATV TRLAKDNELP DAILTESGLK ITPLDSAVPN TAQALIDQTS QLLPRIKITE LLMDVDDWTG FSRHFTHLKD GAEAKDRTLL LAAILGDAIN LGLTKMAESS PGLTYAKLSW LQAWHIRDET YSAALAELVN HQYRHTFAAH WGDGTTSSSD GQRFRAGGRG ESTGHVNPKY GSEPGRLFYT HISDQYAPFS TRVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG ETKLYVPNSV QDYPTLRPMV GGTLNIKHVR AHWDDILRLA SSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNSLAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERATQAMGEA GKSVDGELLQ YLSPLGWEHI NLTGDYVWRQ SRRLEDGKFR PLRLPGKP
|
|
Internal Transposable Elements (TE) |
|
|
TnCentral Accession |
TE Name |
Type |
Coordinates |
Length |
TnOtChr.1-FN821089.1 |
TnOtChr.1 |
Transposon |
4095-4099 |
5 |
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
IRt |
In_Tn5045 |
4100-4132 |
TGTCATTTTC AGAAGACGAC TGCACCAGTT GAT |
repeat t1 |
In_Tn5045 |
4108-4126 |
TCAGAAGACG ACTGCACCA |
repeat t2 |
In_Tn5045 |
4148-4166 |
AACACGTCGG TCGAGGACT |
repeat t3 |
In_Tn5045 |
4177-4196 |
TCAGAAGTGA TCTGCACCAA |
repeat t4 |
In_Tn5045 |
4209-4227 |
TCAATACTCG TGTGCACCA |
IRR |
TnOtChr.1 |
5992-6029 |
GGGGTCGTCT CAGAAAACGG ACCGCAAAGT ACGCTAAG |
IRL |
TnOtChr.1 |
13134-13171 |
GAATCGCATG AAACGCCAGG CAAAAGACTC TGCTGGGG |
repeat i4 |
In_Tn5045 |
18073-18091 |
AGGAGGGACG CAGGCGACT |
repeat i3 |
In_Tn5045 |
18101-18119 |
CGTCGGGCAG CAACGGACT |
repeat i2 |
In_Tn5045 |
18143-18161 |
ATCACGTCAG CCGAAGACT |
IRi |
In_Tn5045 |
18160-18192 |
CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT |
repeat i1 |
In_Tn5045 |
18166-18184 |
GTCACGTCGG CAGAAGACT |
|
References |
|
|
Petrova M, Gorlenko Z, Mindlin S. Tn5045, a novel integron-containing antibiotic and chromate resistance transposon isolated from a permafrost bacterium. Res Microbiol. 2011 Apr;162(3):337-45. doi: 10.1016/j.resmic.2011.01.003. Epub 2011 Jan 22. PubMed ID: 21262357
| |
| | |
|
|