Transposon
Name: Tn21.2
Family: Tn3        Group: Tn21
Evidence of Transposition: no
 Host     

Host Organism:Salmonella enterica subsp. enterica serovar Typhimurium Molecular Source:plasmid pST1007-1B

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGGCACCTCAGAAAACGGAAAATAAAGCACGCTAAG

 Sequence     
DNA SequenceLength  35400 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGGCACCT CAGAAAACGG AAAATAAAGC ACGCTAAGGC ATAGCTGACC TTGCCAGGCC TGCTTCGCCC TGTAGTGACG CGATCAACGG GCAGGAAACA 100
TTCCCCTTTC GTGCATGGCA GGCGCACACG AGTTCAGACA GCACGGTTTC CATGCGCGCC AAGTCGGCCA TCTTCTCGCG CACGTCCTTG AGCTTGTGTT 200
CGGCCAGGCT GCTGGCCTCC TCGCAGTGGG TGCCATCGTC GAGCCGCAAC AGCTCGGCAA TCTCGTCCAG ACTGAACCCC AGCCGCTGTG CCGATTTCAC 300
GAATTTCACC CGAACCACGT CCGCCTCCCC ATAGCGGCGG ATGCTGCCGT AAGGCTTGTC CGGTTCCCGC AACAGGCCCT TGCGCTGATA GAAGCGGATT 400
GTCTCCACGT TGACCCCGGC CGCCTTGGCA AAAACGCCAA TGGTCAGGTT TTCCAAATTA TTTTCCATAT CGCTTGACTC CGTACATGAG TACGGAAGTA 500
AGGTTACGCT ATCCAATCCA AATTCAAAAG GGCCAACGTA TGTCTGAACC ACAAAACGGG CGCGGTGCGC TCTTCGCCGG CGGGCTGGCC GCCATTCTTG 600
CATCGACCTG CTGCCTGGGG CCGCTAGTAC TGGTCGCCCT GGGCTTCTCC GGTGCTTGGA TCGGCAACCT GACGGTGCTG GAACCCTATC GACCGTTGTT 700
CATCGGCGCG GCGCTAGTGG CGCTGTTCTT CGCCTGGAAG CGGATTTACC GGCCCGTGCA GGCATGCAAG CCAGGTGAGG TCTGCGCGAT TCCGCAGGTG 800
CGCGCCACCT ACAAGCTGAT TTTCTGGATC GTGGCCGTGC TGGTCCTGGT CGCGCTTGGA TTTCCCTATG TCGTTCCATT TTTCTATTAA CCAGGAGTTC 900
ATCATGAAGA AACTGTTTGC CTCCCTTGCC CTCGCCGCCG CTGTTGCCCC GGTGTGGGCC GCTACCCAGA CCGTCACGCT AGCGGTTCCC GGCATGACTT 1000
GCGCCGCCTG CCCGATCACA GTCAAGAAAG CGCTCTCCAA GGTCGAAGGC GTGAGCAAGG TCGATGTGGG CTTCGAGAAG CGCGAGGCCG TCGTCACTTT 1100
TGACGACACC AAGGCCAGCG TACAGAAGCT GACCAAGGCC ACCGCAGACG CCGGCTATCC GTCCAGCGTC AAGCAGTGAG CCAGCAAGCC AACGACAACA 1200
GCGAGAGCCG CTTCATGGGA CTGATGACAC GCATTGCCGA TAAAACCGGC GCGCTCGGCA GCGTCGTTTC CGCGATGGGC TGCGCCGCCT GCTTTCCAGC 1300
CCTCGCCAGC TTCGGCGCGG CCATCGGGCT GGGCTTCTTG AGCCAGTACG AGGGACTGTT CATCAGCCGC CTGCTGCCGC TGTTTGCCGC GCTGGCCTTC 1400
CTGGCGAACG CGCTGGGTTG GTTCAGTCAT CGGCAATGGC TGCGCAGTCT GCTCGGCATG ATCGGCCCGG CCATCGTGTT TGCGGCCACG GTCTGGCTGC 1500
TCGGCAACTG GTGGACGGCG AACCTGATGT ACGTCGGCCT GGCCTTGATG ATTGGGGTGT CGATCTGGGA CTTCGTGTCG CCGGCGCATC GCCGTTGCGG 1600
ACCGGACGGC TGCGAACTCC CCGCCAAGCG CTTGTGAAAG ACGGCTGACC GTGCGACACG GCGGCCCACA CGAATAAGGA ACGATGGTAT GAGCACTCTC 1700
AAAATCACCG GCATGACTTG CGACTCGTGC GCAGTGCATG TCAAGGACGC CCTGGAGAAA GTGCCCGGCG TGCAATCAGC GGATGTCTCC TACGCCAAGG 1800
GCAGCGCCAA GCTCGCCATT GAGGTCGGCA CGTCACCCGA CGCGCTGACG GCCGCTGTAG CTGGACTCGG TTATCGGGCC ACGCTGGCCG ATGCCCCCTC 1900
AGTTTCGACG CCGGGCGGAT TGCTCGACAA GATGCGCGAT CTGCTGGGCA GAAACGACAA GACGGGTAGC AGCGGCGCAT TGCATATCGC CGTCATCGGC 2000
AGCGGCGGGG CCGCGATGGC AGCGGCGCTG AAGGCCGTCG AGCAAGGCGC ACGTGTCACG CTGATCGAGC GCGGCACCAT CGGCGGCACC TGCGTCAATG 2100
TCGGTTGTGT GCCGTCCAAG ATCATGATCC GCGCCGCCCA TATCGCCCAT CTGCGCCGGG AAAGCCCGTT CGATGGCGGC ATCGCCGCTA CCACGCCGAC 2200
CATCCAGCGC ACGGCGCTGC TGGCCCAGCA GCAGGCCCGC GTCGATGAAC TGCGCCACGC CAAGTACGAA GGCATCTTGG AGGGCAATCC GGCGATCACT 2300
GTGCTGCACG GCTCCGCCCG CTTTAAGGAC AATCGCAACC TGATCGTGCA ACTCAACGAC GGCGGCGAGC GCGTGGTGGC ATTCGACCGC TGCCTGATCG 2400
CCACCGGCGC GAGCCCGGCC GTGCCGCCGA TTCCCGGCCT GAAAGACACT CCGTACTGGA CTTCCACTGA AGCGCTGGTC AGCGAGACGA TTCCTAAGCG 2500
CCTGGCCGTG ATTGGCTCAT CAGTGGTGGC GCTGGAGCTG GCGCAGGCGT TCGCCCGACT CGGAGCGAAG GTGACGATCC TGGCTCGCAG CACGCTGTTC 2600
TTCCGCGAAG ACCCAGCTAT AGGCGAAGCC GTCACGGCCG CATTCCGCAT GGAGGGCATC GAGGTGAGGG AACACACCCA GGCCAGCCAG GTCGCGTATA 2700
TCAATGGTGA AGGGGACGGC GAATTCGTGC TCACCACGGC GCACGGCGAA CTGCGCGCCG ACAAGCTGCT GGTCGCCACC GGCCGCGCGC CCAACACACG 2800
CAAGCTGGCA CTGGATGCGA CGGGCGTCAC GCTCACCCCG CAAGGCGCTA TCGTCATCGA CCCCGGCATG CGTACAAGCG TGGAACACAT CTACGCCGCA 2900
GGCGACTGCA CCGACCAGCC GCAGTTCGTC TATGTGGCGG CAGCGGCCGG CACTCGCGCC GCGATCAACA TGACCGGCGG TGACGCGGCC CTGAACCTGA 3000
CCGCGATGCC GGCCGTGGTG TTCACCGACC CGCAAGTGGC GACCGTAGGC TACAGCGAGG CGGAAGCGCA CCATGACGGC ATCAAAACTG ATAGTCGCAC 3100
GCTAACGCTG GACAACGTGC CGCGCGCGCT CGCCAACTTC GACACGCGCG GCTTCATCAA ACTGGTGGTT GAAGAAGGCA GCGGACGACT GATCGGCGTG 3200
CAGGCAGTGG CCCCGGAAGC GGGCGAACTG ATCCAGACGG CCGCACTGGC GATTCGCAAC CGGATGACGG TGCAGGAACT GGCCGACCAG TTGTTCCCCT 3300
ACCTGACGAT GGTCGAAGGG TTGAAGCTCG CGGCGCAGAC CTTCAACAAG GATGTGAAGC AGCTTTCCTG CTGCGCCGGG TGAGGACAAG GAGGTGTGCG 3400
ATGAGCGCCT ACACGGTATC GCAACTGGCC CATAACGCTG GGGTGAGCGT ACATATCGTG CGCGACTACC TGGTGCGCGG CTTGTTACGG CCGGTGGCCT 3500
GCACCACGGG CGGCTACGGC GTGTTCGACG ATGCGGCCTT GCAACGGCTG TGCTTCGTGC GCGCGGCCTT CGAGGCGGGT ATCGGCCTGG ATGCCCTGGC 3600
GCGGCTGTGC CGTGCGCTCG ACGCAGCGGA CGGCGCACAA GCCGCAGCGC AGCTTGCCGT GCTGCGCCAG TTGGTCGAGC GGCGGCGCGC GGCGTTGGCC 3700
CATCTGGACG CGCAACTGGC CTCCATGCCA GCCGAGCGGG CGCACGAGGA GGCATTGCCG TGAACGCCCC TGACAAACTG CCGCCCGAGA CGCGCCAACC 3800
CGTTTCCGGC TACCTGTGGG GTGCGCTGGC CGTGTTGACC TGCCCCTGCC ATCTGCCGAT TCTCGCCGCC GTGCTGGCCG GGACGACCGC CGGTGCCTTC 3900
CTTGGCGAGC ATTGGGGTGT TGCCGCGCTC GCGCTGACCG GCTTGTTCGT TCTGGCCGTA ACGCGGCTGC TGCGCGCCTT CCGGGGCGGA TCATGACGAG 4000
TTCGCAGCCC GCCGGATGGA CGGCGGCCGA GTTGGCGCAG GCGGCGGCGC GCGGACAGCT TGACCTGCAT TACCAGCCGC TGGTCGATCT GCGCGATCAC 4100
CGGATCGCTG GCGCGGAAGC GTTGATGCGC TGGCGGCATC CGAGGCTTGG CCTGTTGCCG CCCGGCCAGT TCCTGCCGCT GGCCGAGTCG TTCGGCCTGA 4200
TGCCGGAAAT AGGCGCGTGG GTGCTGGGCG AGGCCTGTCG CCAGATGCAC AAGTGGCAAG GACCGGCATG GCAACCGTTC CGTCTTGCCA TCAATGTGTC 4300
CGCCAGCCAG GTTGGGCCAA CGTTCGACGA CGAGGTAAAG CGGGTGCTGG CCGATATGGC CCTGCCCGCC GAGCTTCTGG AGATCGAACT GACCGAATCG 4400
GTCGCATTCG GCAATCCAGC CCTGTTCGCC AGTTTCGACG CCTTGCGCGC CATCGGCGTG CGCTTCGCCG CCGACGACTT CGGCACCGGC TATTCCTGCC 4500
TGCAACATCT GAAATGCTGC CCCATCACCA CATTGAAAAT CGACCAATCC TTTGTCGCCA GGCTCCCGGA TGATGCCCGT GACCAAACTA TCGTGCGGGC 4600
GGTGATCCAG CTCGCGCACG GGCTGGGCAT GGATGTCATT TTCAGAAGAC GACTGCACCA GTTGATTGGG CGTAATGGCT GTTGTGCAGC CAGCTCCTGA 4700
CAGTTCAATA TCAGAAGTGA TCTGCACCAA TCTCGACTAT GCTCAATACT CGTGTGCACC AAAGCGAGGT GAGCATGGCG ACGGACACCC CACGGATTCC 4800
AGAACAAGGC GTGGCCACTC TGCCTGATGA GGCTTGGGAG CGTGCGCGCC GTCGTGCGGA GATCATCAGT CCGTTGGCGC AGTCGGAGAC GGTCGGGCAC 4900
GAAGCGGCCG ATATGGCGGC TCAGGCGCTG GGCTTGTCTC GGCGCCAGGT ATACGTTCTG ATCCGGCGTG CCCGGCAAGG CAGCGGCCTC GTGACGGATC 5000
TGGTGCCCGG CCAGTCCGGT GGAGGTAAAG GTAAGGGGCG CTTGCCGGAA CCGGTCGAGC GCGTCATCCA CGAGCTACTG CAAAAGCGGT TCCTGACCAA 5100
GCAGAAGCGC AGCCTAGCGG CCTTTCACCG CGAAGTCACT CAGGTGTGCA AGGCTCAAAA ACTGCGAGTG CCGGCGCGCA ATACCGTGGC CTTACGGATC 5200
GCTAGCCTTG ACCCGCGCAA GGTCATCCGC CGGCGGGAAG GCCAGGATGC CGCTCGTGAC CTACAAGGTG TGGGCGGCGA GCCTCCTGCC GTGACCGCGC 5300
CGCTGGAGCA GGTGCAGATA GACCATACGG TCATCGACCT GATCGTGGTC GATGACCGCG ACCGGCAACC TATTGGCCGC CCGTACCTGA CCCTCGCCAT 5400
CGACGTGTTC ACCCGCTGCG TGCTCGGCAT GGTCGTCACG CTGGAAGCGC CGTCTGCCGT TTCGGTTGGC CTGTGCCTCG TGCATGTCGC CTGCGACAAG 5500
CGCCCTTGGC TGGAAGGACT GAACGTGGAA ATGGATTGGC AGATGAGCGG CAAGCCCTTG CTGCTCTACC TAGACAACGC GGCCGAGTTC AAGAGCGAGG 5600
CCCTGCGCCG GGGTTGCGAG CAGCATGGCA TCCGGCTGGA CTATCGCCCG CTGGGACAGC CGCACTATGG CGGCATCGTG GAACGGATCA TCGGCACGGC 5700
GATGCAGATG ATTCACGACG AACTGCCGGG AACGACCTTC TCCAACCCTG ACCAGCGCGG CGACTACGAT TCCGAAAACA AGGCCGCCCT GACGCTGCGC 5800
GAGCTAGAGC GCTGGCTCAC ATTGGCGGTC GGCACCTACC ACGGTTCGGT GCACAACGGC CTGCTCCAAC CGCCGGCCGC GCGCTGGGCC GAGGCCGTGG 5900
CGCGTGTCGG CGTACCGGCC GTCGTCACAC GCGCTACTTC GTTCCTGGTC GATTTTCTGC CGATCCTCCG GCGCACGCTG ACCCGCACCG GCTTTGTCAT 6000
CGACCACATC CACTACTACG CCGATGGGCA CTGTTGCAAA TAGTCGGTGG TGATAAACTT ATCATCCCCT TTTGCTGATG GAGCTGCACA TGAACCCATT 6100
CAAAGGCCGG CATTTTCAGC GTGACATCAT TCTGTGGGCC GTACGCTGGT ACTGCAAATA CGGCATCAGT TACCGTGAGC TGCAGGAGAT GCTGGCTGAA 6200
CGCGGAGTGA ATGTCGATCA CTCCACGATT TACCGCTGGG TTCAGCGTTA TGCGCCTGAA ATGGAAAAAC GGCTGCGCTG GTACTGGCGT AACCCTTCCG 6300
ATCTTTGCCC GTGGCACATG GATGAAACCT ACGTGAAGGT CAATGGCCGC TGGGCGTATC TGTACCGGGC CGTCGACAGC CGGGGCCGCA CTGTCGATTT 6400
TTATCTCTCC TCCCGTCGTA ACAGCAAAGC TGCATACCGG TTTCTGGGTA AAATCCTCAA CAACGTGAAG AAGTGGCAGA TCCCGCGATT CATCAACACG 6500
GATAAAGCGC CCGCCTATGG TCGCGCGCTT GCTCTGCTCA AACGCGAAGG CCGGTGCCCG TCTGACGTTG AACACCGACA GATTAAGTAC CGGAACAACG 6600
TGATTGAATG CGATCATGGC AAACTGAAAC GGATAATCGG CGCCACGCTG GGATTTAAAT CCATGAAGAC GGCTTACGCC ACCATCAAAG GTATTGAGGT 6700
GATGCGTGCA CTACGCAAAG GCCAGGCCTC AGCATTTTAT TATGGTGATC CCCTGGGCGA AATGCGCCTG GTAAGCAGAG TTTTTGAAAT GTAAGGCCTT 6800
TGAATAAGAC AAAAGGCTGC CTCATCGCTA ACTTTGCAAC AGTGCCGATT GAAGAAACCT TCCGAGACTT GAAAAGTCCT GCCTACGGAC TAGGCCTACG 6900
CCATAGCCGA ACGAGCAGCT CAGAGCGTTT TGATATCATG CTGCTAATCG CCCTGATGCT TCAACTAACA TGTTGGCTTG CGGGCGTTCA TGCTCAGAAA 7000
CAAGGTTGGG ACAAGCACTT CCAGGCTAAC ACAGTCAGAA ATCGAAACGT ACTCTCAACA GTTCGCTTAG GCATGGAAGT TTTGCGGCAT TCTGGCTACA 7100
CAATAACAAG GGAAGACTCA CTCGTGGCTG CAACCCTGCT TACTCAAAAT CTATTCACAC ATGGTTACGT TTTGGGGAAA TTATGAGGGG ATCTCTCAGT 7200
GCTCAGCATG TTTGAGTGCT TTACTCGCCG CGGGTTGAGA AATATGTAAA ACTTCAGCCG CACGCGTTAA GGAACCACAA GTCATTACAG CATAAAAGAT 7300
ATCGAGATGT CTTAATTTCA TTCCATGTAG CCTACTGATT ATTTATTTCT ACGGACTATT CTAACGAATA AAGTATAAAT AATCAGATAT ACTCGTCATA 7400
ATTCAAATTT TGGTTTAGTT GTCACGAGAA TTAATTCGTA AACGTGCAAG CTAAATAGAT TGACTAGGGG AATATACGGG GTAAGAAAGC GGCTCGGCAT 7500
ACTGCCCTAT AATAAATTGT TAGGAGAAGT ACGCCGAGTA GTGTGTAGCT ATTAGGTAAA GTATGTACCA ATTTCAATAA ATATTTTAAT TAAGATACTG 7600
TTTGAAATAT CAACAATAAA GGCACCGACC ATAGGAACGA CAAGGAAAGC TTTATGTGAT GGTCCAAATG CTTTTGTGAC TGTTTGCATA TTCGCAATTG 7700
CTGTTGGTGT TGCTCCCATA CCAAAGCCAC AGTGACCCGC GCTGATCACG ACAGCATCAT AATCTTTGCC CATCATTTTG AAGGTGACAA AGCAGGCAAA 7800
TAGCACCATG ACAACAGTTT GTACAGCAAT GATAATTAAT ACTGGCCCTG CCATGCTTGC CAATTGACCA AATTTTAATG ACATTAACGC CATTGCCAAG 7900
AAAAGCGATA AAGCAACGCT ACCTAATACA TCGACGGTCG GCTCAAACAC TTCGTGTTTA AATACATGAG TCAGTGTATT ACGGATAATA ATACCGACAA 8000
ATAAACACCA GACAAAAGTA GGCAGTTGCA GAAAAGTATC TTTAAACAAT GCACTGATAT AGCCACCAAC AACAATACAG ATAATCAGCA TTGAAATGGT 8100
TTCAATAACG TTATTTGCAT TGATTTTTCT TTTGACGCTT GGTTGCTCAA AAGCTTCAAC GATAGTGTCG CGCTCTTGCT CGGTTGTTTT AGGAATAGAG 8200
ACCTTTTTCA AAAGATGACG GGCAACAGGG CCGCCAACTA AACCACCCAA CACTAATCCA AGTGTTGCAC AAGCCATCGC TAATTCAACG GCGCCTGTTA 8300
CACCATATTT ATCAGCGAGA ATAGGGCCCC ATGCTCCGGC ATTACCATGA CCACCTGTTA GAGTAATTGA ACCTGCAATT AAGCCAATAA ATGGACTTTC 8400
ATTCATCATG ACAGCCATAC TCATGCCGAC AGTATTTTGA ATGGCGATTA GGATCGTTAC TGCAATAGTT AATAGAACTA ACGGCTTTCC TCCTTTAATC 8500
AGTCGAGAAA AGTCAGAACT TAGCCCGATA GAGGAAAAGA ATGTGAGCAT TAATAAACTT TGCAATGAAG CATCAAAAGT AAACGAATAA CCTGATGTTT 8600
TATCAATAAT TAACAGAACA ATTGCAACAA TAAAGCCACC AACGACGGCT TCTGGTATGT GGTTTTTTTG TAGGAACGGG GTAAATTTTA CGACAAACAT 8700
TCCTATGAGT AGCGCGATAC ATGCGACTAA TAATGTATAA CTGGCATCTA GGATCATTTT TTTACCTCTA TTAGTTTACC AGAATTTTCA CTATAACAAT 8800
TTAACAATAG TGAAAAATGA AGTTACTTTG GTTAACTAAT CTTTAACAAT GAGTTTTATT AGGGTATCCA TAACTTTTAG TTAATAGGGT CATAATTTTA 8900
GTGTGATCTT AATCATGAAA TATTAAAAAG ATTAAAATAA CAACAATGGT ATTTAATGAT GGGTAAACCT AAATACCATG CGGGATCCGC ACTATTTTCG 9000
GTGATAGCAA AAAATAAAAA AGTATAAATA ATAAACAAAA TTATAAGAGA AGGAGAACAT GATAAGCGCA GGCCGAGCAA TTTAAAGAAG TGTGATATAC 9100
AACATAAAAC AGCAGCATGA CCTTTCTATA TTTCGTTGCT TATCGAAGTG TTAATTTTCG GGTGGATTAC TCTGTTTGTT GATTAATAAC GGAGAAAAAT 9200
ATGATTGCTG TAATATTTGA GGTGCAAATA CAACCCGACC AACAAACTCG CTATTTGACT TTAGCTGAGG AGTTAAGACC ACTATTAAGT CATGTAGCTG 9300
GTTTTATTTC AATTGAACGT TTTCAAAGTC TAGCTACAGA AGGAAAAATG TTATCGCTAT CTTGGTGGGA AAACGAATAC GCAGTTCTGC AATGGAAAAA 9400
TCATGTTTTA CATGCGAAAG CTCAACAAGA AGGGCGAGAG TCAATATTTG ATTTTTACAA AATTAGTATT GCTCATATTA CTCGCGAATA TTCATTTAAA 9500
AAGGACAAGG ATAATGTTTG ATGTTCACGT TGTTTTAGAT AATCAAATAG GACAATTAGC ATTACTAGGA AAAACATTAG GTAATAAAGG TATTGGATTG 9600
GAAGGGGGAG GGATATTTAC GGTTGGTGAT GAATGCCATG CTCATTTTCT TGTTGAACAA GGAAAGGAAG CTAAAATAGC GCTAGAGCAA GCTGGACTGT 9700
TAGTACTTGC GATCCGGACA CCATTAATTC GTAAGTTAAA ACAGGAAAAA CCGGGGGAAC TTGGCGAAAT AGCACGAGTA TTGGCGGAGA ATAACATTAA 9800
TATTTTAGTG CAATACAGTG ACCATGCTAA CCAACTGATA TTAATAACGG ACAATGATAG TATGGCTGCA TCTGTTACGC TCCCTTGGGC AATAAAGTGA 9900
ACTTGCGATG GCTAATTTAA TACGAAAAGA GGTTACCTTT GAGTCCTCAA TAGCCGCGAT AGGGGCGGCT ATGTCTGACA TTTCACGAGT TAAAATACTC 10000
AGTGCTTTGA TGGATGGGCG AGCTTGGACG GCCACTGAGC TAAGTTCTGT GGCGAATATA TCAGCTTCAA CGGCGAGCAG TCATTTATCT AAATTATTAG 10100
ATTGCCAGCT AATCACAGTA GTAGCTCAAG GCAAGCATCG TTATTTTCGG CTAGCAGGAA AAGATATTGC TGAATTGATG GAAAGTATGA TGGGGATCTC 10200
CTTAAACCAT GGCGTACATG CCAGAGTTTC CACGCCAGTG CATTTACGAA AAGCACGTAC TTGCTATGAT CATTTAGCTG GCGAAGTTGC CGTTAAGATC 10300
TATGATTCCC TTTGTCAACA GCAATGGATC ACTGAAAATG GTTCAATGAT CACATTAAGT GGTATTCAAT ATTTTCATGA AATGGGAATT GACGTTCCTT 10400
CCAAACATTC ACGTAAAATC TGTTGTGCGT GTTTAGATTG GAGTGAACGC CGTTTCCATT TAGGTGGGTA CGTTGGAGCC GCATTATTTT CGCTTTATGA 10500
ATCTAAAGGG TGGTTAACTC GACATCTTGG TTACCGTGAA GTTACCATCA CGGAAAAAGG TTATGCTGCT TTTAAGACCC ACTTTCACAT TTAAGTTGTT 10600
TTTCTAATCC GCATATGATC AATTCAAGGC CGAATAAGAA GGCTGGCTCT GCACCTTGGT GATCAAATAA TTCGATAGCT TGTCGTAATA ATGGCGGCAT 10700
ACTATCAGTA GTAGGTGTTT CCCTTTCTTC TTTAGCGACT TGATGCTCTT GATCTTCCAA TACGCAACCT AAAGTAAAAT GCCCCACAGC GCTGAGTGCA 10800
TATAATGCAT TCTCTAGTGA AAAACCTTGT TGGCATAAAA AGGCTAATTG ATTTTCGAGA GTTTCATACT GTTTTTCTGT AGGCCGTGTA CCTAAATGTA 10900
CTTTTGCTCC ATCGCGATGA CTTAGTAAAG CACATCTAAA ACTTTTAGCG TTATTACGTA AAAAATCTTG CCAGCTTTCC CCTTCTAAAG GGCAAAAGTG 11000
AGTATGGTGC CTATCTAACA TCTCAATGGC TAAGGCGTCG AGCAAAGCCC GCTTATTTTT TACATGCCAA TACAATGTAG GCTGCTCTAC ACCTAGCTTC 11100
TGGGCGAGTT TACGGGTTGT TAAACCTTCG ATTCCGACCT CATTAAGCAG CTCTAATGCG CTGTTAATCA CTTTACTTTT ATCTAATCTA GACATCATTA 11200
ATTCCTAATT TTTGTTGACA CTCTATCATT GATAGAGTTA TTTTACCACT CCCTATCAGT GATAGAGAAA AGTGAAATGA ATAGTTCGAC AAAGATCGCA 11300
TTGGTAATTA CGTTACTCGA TGCCATGGGG ATTGGCCTTA TCATGCCAGT CTTGCCAACG TTATTACGTG AATTTATTGC TTCGGAAGAT ATCGCTAACC 11400
ACTTTGGCGT ATTGCTTGCA CTTTATGCGT TAATGCAGGT TATCTTTGCT CCTTGGCTTG GAAAAATGTC TGACCGATTT GGTCGGCGCC CAGTGCTGTT 11500
GTTGTCATTA ATAGGCGCAT CGCTGGATTA CTTATTGCTG GCTTTTTCAA GTGCGCTTTG GATGCTGTAT TTAGGCCGTT TGCTTTCAGG GATCACAGGA 11600
GCTACTGGGG CTGTCGCGGC ATCGGTCATT GCCGATACCA CCTCAGCTTC TCAACGCGTG AAGTGGTTCG GTTGGTTAGG GGCAAGTTTT GGGCTTGGTT 11700
TAATAGCGGG GCCTATTATT GGTGGTTTTG CAGGAGAGAT TTCACCGCAT AGTCCCTTTT TTATCGCTGC GTTGCTAAAT ATTGTCGCTT TCCTTGTGGT 11800
TATGTTTTGG TTCCGTGAAA CCAAAAATAC ACGTGATAAT ACAGATACCG AAGTAGGGGT TGAGACGCAA TCGAATTCGG TATACATCAC TTTATTTAAA 11900
ACGATGCCCA TTTTGTTGAT TATTTATTTT TCAGCGCAAT TGATAGGCCA AATTCCCGCA ACGGTGTGGG TGCTATTTAC CGAAAATCGT TTTGGATGGA 12000
ATAGCATGAT GGTTGGCTTT TCATTAGCGG GTCTTGGTCT TTTACACTCA GTATTCCAAG CCTTTGTGGC AGGAAGAATA GCCACTAAAT GGGGCGAAAA 12100
AACGGCAGTA CTGCTCGGAT TTATTGCAGA TAGTAGTGCA TTTGCCTTTT TAGCGTTTAT ATCTGAAGGT TGGTTAGTTT TCCCTGTTTT AATTTTATTG 12200
GCTGGTGGTG GGATCGCTTT ACCTGCATTA CAGGGAGTGA TGTCTATCCA AACAAAGAGT CATCAGCAAG GTGCTTTACA GGGATTATTG GTGAGCCTTA 12300
CCAATGCAAC CGGTGTTATT GGCCCATTAC TGTTTGCTGT TATTTATAAT CATTCACTAC CAATTTGGGA TGGCTGGATT TGGATTATTG GTTTAGCGTT 12400
TTACTGTATT ATTATCCTGC TATCGATGAC CTTCATGTTA ACCCCTCAAG CTCAGGGGAG TAAACAGGAG ACAAGTGCTT AGTTATTTCG TCACCAAATG 12500
ATGTTATTCC GCGAAATATA ATGACCCTCT TGATAACCCA AGAGGGCATT TTTTACGATA AAGAAGATTT AGCTTCAAAT AAAACCTATC TATTTTATTT 12600
ATCTTTCAAG CTCAATAAAA AGCCGCGGTA AATAGCAATA AATTGGCCTT TTTTATCGGC AAGCTCTTTT AGGTTTTTCG CATGTATTGC GATATGCATA 12700
AACCAGCCAT TGAGTAAGTT TTTAAGCACA TCATCATCAT AAGCTTTAAG TTGGTTCTCT TGGATCAATT TGCTGACAAT GGCGTTTACC TTACCAGTAA 12800
TGTATTCAAG GCTAATTTTT TCAAGTTCAT TCCAACCAAT GATAGGCATC ACTTCTTGGA TAGGGATAAG GTTTTTATTA TTATCAATAA TATAATCAAG 12900
ATAATGGGTG ATGCTGCCAA CTTACTGATT TAGTGTATGA TGGTGTTTTT GAGGTGCTCC AGTGGCTTCT GTTTCTATCA GCTGTCCCTC CTGTTCAGCT 13000
ACTGACGGGG TGGTGCGTAA CGGCAAAAGC ACCGCCGGAC ATCAGCGCTA TCTCTGCTCT CACTGCCGTA AAACATGGCA ACTGCAGTTC ACTTACACCG 13100
CTTCTCAACC CGGTACGCAC CAGAAAATCA TTGATATGGC CATGAATGGC GTTGGATGCC GGGCAACCGC CCGCATTATG GGCGTTGGCC TCAACACGAT 13200
TTTCCGCCAT TTAAAAAACT CAGGCCGCAG TCGGTAACCT CGCGCATACA GCCGGGCAGT GACGTCATCG TCTGCGCGGA AATGGACGAA CAGTGGGGAT 13300
ACGTCGGGGC TAAATCGCGC CAGCGCTGGC TGTTTTACGC GTATGACAGG CTCCGGAAGA CGGTTGTTGC GCACGTATTC GGTGAACGCA CTATGGCGAC 13400
GCTGGGGCGT CTTATGAGCC TGCTGTCACC CTTTGACGTG GTGATATGGA TGACGGATGG CTGGCCGCTG TATGAATCCC GCCTGAAGGG AAAGCTGCAC 13500
GTAATCAGCA AGCGATATAC GCAGCGAATT GAGCGGCATA ACCTGAATCT GAGGCAGCAC CTGGCACGGC TGGGACGGAA GTCGCTGTCG TTCTCAAAAT 13600
CGGTGGAGCT GCATGACAAA GTCATCGGGC ATTATCTGAA CATAAAACAC TATCAATAAG TTGGAGTCAT TACCCCCCGG ACGAGTGGGA ATCCATGGTA 13700
GGGATTTTTA CCGAAATGGA AGAACACTAT TAGGGGGCAC CTCAGAAAAC GGAAAATAAA GCACGCTAAG GCATAGCTGA CCTTGCCAGG CCTGCTTCGC 13800
CCTGTAGTGA CGCGATCAAC GGGCAGGAAA CATTCCCCTT TCGTGCATGG CAGGCGCACA CGAGTTCAGA CAGCACGGTT TCCATGCGCG CCAAGTCGGC 13900
CATCTTCTCG CGCACGTCCT TGAGCTTGTG TTCGGCCAGG CTGCTGGCCT CCTCGCAGTG GGTGCCATCG TCGAGCCGCA ACAGCTCGGC AATCTCGTCC 14000
AGACTGAACC CCAGCCGCTG TGCCGATTTC ACGAATTTCA CCCGAACCAC GTCCGCCTCC CCATAGCGGC GGATGCTGCC GTAAGGCTTG TCCGGTTCCC 14100
GCAACAGGCC CTTGCGCTGA TAGAAGCGGA TTGTCTCCAC GTTGACCCCG GCCGCCTTGG CAAAAACGCC AATGGTCAGG TTTTCCAAAT TATTTTCCAT 14200
ATCGCTTGAC TCCGTACATG AGTACGGAAG TAAGGTTACG CTATCCAATC CAAATTCAAA AGGGCCAACG TATGTCTGAA CCACAAAACG GGCGCGGTGC 14300
GCTCTTCGCC GGCGGGCTGG CCGCCATTCT TGCATCGACC TGCTGCCTGG GGCCGCTAGT ACTGGTCGCC CTGGGCTTCT CCGGTGCTTG GATCGGCAAC 14400
CTGACGGTGC TGGAACCCTA TCGACCGTTG TTCATCGGCG CGGCGCTAGT GGCGCTGTTC TTCGCCTGGA AGCGGATTTA CCGGCCCGTG CAGGCATGCA 14500
AGCCAGGTGA GGTCTGCGCG ATTCCGCAGG TGCGCGCCAC CTACAAGCTG ATTTTCTGGA TCGTGGCCGT GCTGGTCCTG GTCGCGCTTG GATTTCCCTA 14600
TGTCGTTCCA TTTTTCTATT AACCAGGAGT TCATCATGAA GAAACTGTTT GCCTCCCTTG CCCTCGCCGC CGCTGTTGCC CCGGTGTGGG CCGCTACCCA 14700
GACCGTCACG CTAGCGGTTC CCGGCATGAC TTGCGCCGCC TGCCCGATCA CAGTCAAGAA AGCGCTCTCC AAGGTCGAAG GCGTGAGCAA GGTCGATGTG 14800
GGCTTCGAGA AGCGCGAGGC CGTCGTCACT TTTGACGACA CCAAGGCCAG CGTACAGAAG CTGACCAAGG CCACCGCAGA CGCCGGCTAT CCGTCCAGCG 14900
TCAAGCAGTG AGCCAGCAAG CCAACGACAA CAGCGAGAGC CGCTTCATGG GACTGATGAC ACGCATTGCC GATAAAACCG GCGCGCTCGG CAGCGTCGTT 15000
TCCGCGATGG GCTGCGCCGC CTGCTTTCCA GCCCTCGCCA GCTTCGGCGC GGCCATCGGG CTGGGCTTCT TGAGCCAGTA CGAGGGACTG TTCATCAGCC 15100
GCCTGCTGCC GCTGTTTGCC GCGCTGGCCT TCCTGGCGAA CGCGCTGGGT TGGTTCAGTC ATCGGCAATG GCTGCGCAGT CTGCTCGGCA TGATCGGCCC 15200
GGCCATCGTG TTTGCGGCCA CGGTCTGGCT GCTCGGCAAC TGGTGGACGG CGAACCTGAT GTACGTCGGC CTGGCCTTGA TGATTGGGGT GTCGATCTGG 15300
GACTTCGTGT CGCCGGCGCA TCGCCGTTGC GGACCGGACG GCTGCGAACT CCCCGCCAAG CGCTTGTGAA AGACGGCTGA CCGTGCGACA CGGCGGCCCA 15400
CACGAATAAG GAACGATGGT ATGAGCACTC TCAAAATCAC CGGCATGACT TGCGACTCGT GCGCAGTGCA TGTCAAGGAC GCCCTGGAGA AAGTGCCCGG 15500
CGTGCAATCA GCGGATGTCT CCTACGCCAA GGGCAGCGCC AAGCTCGCCA TTGAGGTCGG CACGTCACCC GACGCGCTGA CGGCCGCTGT AGCTGGACTC 15600
GGTTATCGGG CCACGCTGGC CGATGCCCCC TCAGTTTCGA CGCCGGGCGG ATTGCTCGAC AAGATGCGCG ATCTGCTGGG CAGAAACGAC AAGACGGGTA 15700
GCAGCGGCGC ATTGCATATC GCCGTCATCG GCAGCGGCGG GGCCGCGATG GCAGCGGCGC TGAAGGCCGT CGAGCAAGGC GCACGTGTCA CGCTGATCGA 15800
GCGCGGCACC ATCGGCGGCA CCTGCGTCAA TGTCGGTTGT GTGCCGTCCA AGATCATGAT CCGCGCCGCC CATATCGCCC ATCTGCGCCG GGAAAGCCCG 15900
TTCGATGGCG GCATCGCCGC TACCACGCCG ACCATCCAGC GCACGGCGCT GCTGGCCCAG CAGCAGGCCC GCGTCGATGA ACTGCGCCAC GCCAAGTACG 16000
AAGGCATCTT GGAGGGCAAT CCGGCGATCA CTGTGCTGCA CGGCTCCGCC CGCTTTAAGG ACAATCGCAA CCTGATCGTG CAACTCAACG ACGGCGGCGA 16100
GCGCGTGGTG GCATTCGACC GCTGCCTGAT CGCCACCGGC GCGAGCCCGG CCGTGCCGCC GATTCCCGGC CTGAAAGACA CTCCGTACTG GACTTCCACT 16200
GAAGCGCTGG TCAGCGAGAC GATTCCTAAG CGCCTGGCCG TGATTGGCTC ATCAGTGGTG GCGCTGGAGC TGGCGCAGGC GTTCGCCCGA CTCGGAGCGA 16300
AGGTGACGAT CCTGGCTCGC AGCACGCTGT TCTTCCGCGA AGACCCAGCT ATAGGCGAAG CCGTCACGGC CGCATTCCGC ATGGAGGGCA TCGAGGTGAG 16400
GGAACACACC CAGGCCAGCC AGGTCGCGTA TATCAATGGT GAAGGGGACG GCGAATTCGT GCTCACCACG GCGCACGGCG AACTGCGCGC CGACAAGCTG 16500
CTGGTCGCCA CCGGCCGCGC GCCCAACACA CGCAAGCTGG CACTGGATGC GACGGGCGTC ACGCTCACCC CGCAAGGCGC TATCGTCATC GACCCCGGCA 16600
TGCGTACAAG CGTGGAACAC ATCTACGCCG CAGGCGACTG CACCGACCAG CCGCAGTTCG TCTATGTGGC GGCAGCGGCC GGCACTCGCG CCGCGATCAA 16700
CATGACCGGC GGTGACGCGG CCCTGAACCT GACCGCGATG CCGGCCGTGG TGTTCACCGA CCCGCAAGTG GCGACCGTAG GCTACAGCGA GGCGGAAGCG 16800
CACCATGACG GCATCAAAAC TGATAGTCGC ACGCTAACGC TGGACAACGT GCCGCGCGCG CTCGCCAACT TCGACACGCG CGGCTTCATC AAACTGGTGG 16900
TTGAAGAAGG CAGCGGACGA CTGATCGGCG TGCAGGCAGT GGCCCCGGAA GCGGGCGAAC TGATCCAGAC GGCCGCACTG GCGATTCGCA ACCGGATGAC 17000
GGTGCAGGAA CTGGCCGACC AGTTGTTCCC CTACCTGACG ATGGTCGAAG GGTTGAAGCT CGCGGCGCAG ACCTTCAACA AGGATGTGAA GCAGCTTTCC 17100
TGCTGCGCCG GGTGAGGACA AGGAGGTGTG CGATGAGCGC CTACACGGTA TCGCAACTGG CCCATAACGC TGGGGTGAGC GTACATATCG TGCGCGACTA 17200
CCTGGTGCGC GGCTTGTTAC GGCCGGTGGC CTGCACCACG GGCGGCTACG GCGTGTTCGA CGATGCGGCC TTGCAACGGC TGTGCTTCGT GCGCGCGGCC 17300
TTCGAGGCGG GTATCGGCCT GGATGCCCTG GCGCGGCTGT GCCGTGCGCT CGACGCAGCG GACGGCGCAC AAGCCGCAGC GCAGCTTGCC GTGCTGCGCC 17400
AGTTGGTCGA GCGGCGGCGC GCGGCGTTGG CCCATCTGGA CGCGCAACTG GCCTCCATGC CAGCCGAGCG GGCGCACGAG GAGGCATTGC CGTGAACGCC 17500
CCTGACAAAC TGCCGCCCGA GACGCGCCAA CCCGTTTCCG GCTACCTGTG GGGTGCGCTG GCCGTGTTGA CCTGCCCCTG CCATCTGCCG ATTCTCGCCG 17600
CCGTGCTGGC CGGGACGACC GCCGGTGCCT TCCTTGGCGA GCATTGGGGT GTTGCCGCGC TCGCGCTGAC CGGCTTGTTC GTTCTGGCCG TAACGCGGCT 17700
GCTGCGCGCC TTCCGGGGCG GATCATGACG AGTTCGCAGC CCGCCGGATG GACGGCGGCC GAGTTGGCGC AGGCGGCGGC GCGCGGACAG CTTGACCTGC 17800
ATTACCAGCC GCTGGTCGAT CTGCGCGATC ACCGGATCGC TGGCGCGGAA GCGTTGATGC GCTGGCGGCA TCCGAGGCTT GGCCTGTTGC CGCCCGGCCA 17900
GTTCCTGCCG CTGGCCGAGT CGTTCGGCCT GATGCCGGAA ATAGGCGCGT GGGTGCTGGG CGAGGCCTGT CGCCAGATGC ACAAGTGGCA AGGACCGGCA 18000
TGGCAACCGT TCCGTCTTGC CATCAATGTG TCCGCCAGCC AGGTTGGGCC AACGTTCGAC GACGAGGTAA AGCGGGTGCT GGCCGATATG GCCCTGCCCG 18100
CCGAGCTTCT GGAGATCGAA CTGACCGAAT CGGTCGCATT CGGCAATCCA GCCCTGTTCG CCAGTTTCGA CGCCTTGCGC GCCATCGGCG TGCGCTTCGC 18200
CGCCGACGAC TTCGGCACCG GCTATTCCTG CCTGCAACAT CTGAAATGCT GCCCCATCAC CACATTGAAA ATCGACCAAT CCTTTGTCGC CAGGCTCCCG 18300
GATGATGCCC GTGACCAAAC TATCGTGCGG GCGGTGATCC AGCTCGCGCA CGGGCTGGGC ATGGATGTCA TTTTCAGAAG ACGACTGCAC CAGTTGATTG 18400
GGCGTAATGG CTGTTGTGCA GCCAGCTCCT GACAGTTCAA TATCAGAAGT GATCTGCACC AATCTCGACT ATGCTCAATA CTCGTGTGCA CCAAAGCGAG 18500
GTGAGCATGG CGACGGACAC CCCACGGATT CCAGAACAAG GCGTGGCCAC TCTGCCTGAT GAGGCTTGGG AGCGTGCGCG CCGTCGTGCG GAGATCATCA 18600
GTCCGTTGGC GCAGTCGGAG ACGGTCGGGC ACGAAGCGGC CGATATGGCG GCTCAGGCGC TGGGCTTGTC TCGGCGCCAG GTATACGTTC TGATCCGGCG 18700
TGCCCGGCAA GGCAGCGGCC TCGTGACGGA TCTGGTGCCC GGCCAGTCCG GTGGAGGTAA AGGTAAGGGG CGCTTGCCGG AACCGGTCGA GCGCGTCATC 18800
CACGAGCTAC TGCAAAAGCG GTTCCTGACC AAGCAGAAGC GCAGCCTAGC GGCCTTTCAC CGCGAAGTCA CTCAGGTGTG CAAGGCTCAA AAACTGCGAG 18900
TGCCGGCGCG CAATACCGTG GCCTTACGGA TCGCTAGCCT TGACCCGCGC AAGGTCATCC GCCGGCGGGA AGGCCAGGAT GCCGCTCGTG ACCTACAAGG 19000
TGTGGGCGGC GAGCCTCCTG CCGTGACCGC GCCGCTGGAG CAGGTGCAGA TAGACCATAC GGTCATCGAC CTGATCGTGG TCGATGACCG CGACCGGCAA 19100
CCTATTGGCC GCCCGTACCT GACCCTCGCC ATCGACGTGT TCACCCGCTG CGTGCTCGGC ATGGTCGTCA CGCTGGAAGC GCCGTCTGCC GTTTCGGTTG 19200
GCCTGTGCCT CGTGCATGTC GCCTGCGACA AGCGCCCTTG GCTGGAAGGA CTGAACGTGG AAATGGATTG GCAGATGAGC GGCAAGCCCT TGCTGCTCTA 19300
CCTAGACAAC GCGGCCGAGT TCAAGAGCGA GGCCCTGCGC CGGGGTTGCG AGCAGCATGG CATCCGGCTG GACTATCGCC CGCTGGGACA GCCGCACTAT 19400
GGCGGCATCG TGGAACGGAT CATCGGCACG GCGATGCAGA TGATTCACGA CGAACTGCCG GGAACGACCT TCTCCAACCC TGACCAGCGC GGCGACTACG 19500
ATTCCGAAAA CAAGGCCGCC CTGACGCTGC GCGAGCTAGA GCGCTGGCTC ACATTGGCGG TCGGCACCTA CCACGGTTCG GTGCACAACG GCCTGCTCCA 19600
ACCGCCGGCC GCGCGCTGGG CCGAGGCCGT GGCGCGTGTC GGCGTACCGG CCGTCGTCAC ACGCGCTACT TCGTTCCTGG TCGATTTTCT GCCGATCCTC 19700
CGGCGCACGC TGACCCGCAC CGGCTTTGTC ATCGACCACA TCCACTACTA CGCCGATGGG CACTGTTGCA AATAGTCGGT GGTGATAAAC TTATCATCCC 19800
CTTTTGCTGA TGGAGCTGCA CATGAACCCA TTCAAAGGCC GGCATTTTCA GCGTGACATC ATTCTGTGGG CCGTACGCTG GTACTGCAAA TACGGCATCA 19900
GTTACCGTGA GCTGCAGGAG ATGCTGGCTG AACGCGGAGT GAATGTCGAT CACTCCACGA TTTACCGCTG GGTTCAGCGT TATGCGCCTG AAATGGAAAA 20000
ACGGCTGCGC TGGTACTGGC GTAACCCTTC CGATCTTTGC CCGTGGCACA TGGATGAAAC CTACGTGAAG GTCAATGGCC GCTGGGCGTA TCTGTACCGG 20100
GCCGTCGACA GCCGGGGCCG CACTGTCGAT TTTTATCTCT CCTCCCGTCG TAACAGCAAA GCTGCATACC GGTTTCTGGG TAAAATCCTC AACAACGTGA 20200
AGAAGTGGCA GATCCCGCGA TTCATCAACA CGGATAAAGC GCCCGCCTAT GGTCGCGCGC TTGCTCTGCT CAAACGCGAA GGCCGGTGCC CGTCTGACGT 20300
TGAACACCGA CAGATTAAGT ACCGGAACAA CGTGATTGAA TGCGATCATG GCAAACTGAA ACGGATAATC GGCGCCACGC TGGGATTTAA ATCCATGAAG 20400
ACGGCTTACG CCACCATCAA AGGTATTGAG GTGATGCGTG CACTACGCAA AGGCCAGGCC TCAGCATTTT ATTATGGTGA TCCCCTGGGC GAAATGCGCC 20500
TGGTAAGCAG AGTTTTTGAA ATGTAAGGCC TTTGAATAAG ACAAAAGGCT GCCTCATCGC TAACTTTGCA ACAGTGCCCA TTTATGAATG TTCCTGTTAT 20600
GGCTTATGTT CAAGAAAGCA TTGCCCCTGA AATGATGGGC AAGGTGTTTT CCCTTTTGAT GACCGCCATG ACTCTTTCTA TGCCGATAGG CTTACTTGTT 20700
GCAGGTCCGG TTGTTGAGGT TATAGGTGTT AATACATGGT TTTTCTGGTC TGGTGTTGCG TTGATAGTAA ACGCTGTTCT CTGCCGCATT CTGACACGAC 20800
GCTATGACAA AGTAACAATG AAACCGCAAG TGGACTGAAA AAAGGACCGG GTTGATGATA ATTTGTAGTG GTGAGCTTCT GGGAGTACAA AACAAAGTGC 20900
TCAAAATTGT CGGGCTCATG GCGTTTAACG GTATTAATTT CGCTTATAAT AATCTTTCTA TAATAGCCTA AAGGAGAATA TCTATGATAC CTAATAGCGA 21000
AAATAAAAGA GTATGGTTTA TTACCGGAGC AAGCAAGGGG CTTGGCTATG CTTTTACATG CGCCGCCTTG AAAGCCGGGG ATAAAGTTGT TGCAGTTGCA 21100
AGGACTATCG ATAATTTGGC GAAGCTAGAA GAAACATATC AAGAGAGCTT ACTGCCATTA AACCTCGATG TTACAGATAG GGAGGCTGTT TTTTCTACGG 21200
TTGAAACAGC AGTTAAACAT TTCGGTAGGC TTGATATTGT TGTTAATAAT GCGGGTATCA TGACTATGGG TATGATTGAA GAACTAAACG AATCCGATGC 21300
TCGGAAACTA ATGGACACAA ACTTTTTTGG AGCTCTTTGG GTTTGTCAGG CAGTGATGCC CTATTTGAGG TCGCAGCGTT CTGGACATAT CATACAGATT 21400
ACAAGCATTG GCGCTATTAT CTCGGGTCCG ATGTCTGGTA TTTACAGTGC AAGTAAATTT GCGCTGGAAG GAATGAGTGA AGCCTTAGCA AAAGAAGCGG 21500
AACATTTCGG AGTGAAACTC ACTATGGTGG AACCAGGAGG ATATTGGACA GATTTATACA CTTCGATGAG TTACAGCAAT CCATTAGATT CATACGGTAC 21600
ACTGCGCGAT GAGCTGGCGA AACAGTATTC CGAAGATTCG GTCGATAGTG ATCCTTCCTT GGCAGCGGAA GCCCTTATGA AATTAGTAGC CAGCAATAAT 21700
CCGCCTTTGA GGCTTATCCT TGGCAGCATG GTATACGATT TGGCGATGGA TACCTTAAAA GCACGGATGG CTACCTGGGA AGAATGGGAA GCTGTTAGCC 21800
GTGCATCAGA AAAGGCTATT CCTGCACCGG AGAGATATGG AGTATAATAC CAAACATGGT TTTTGAGGCA AGGACGAAAC TATCATGAAA ATTGAAATTA 21900
TGGAATATAA CCCGGATTGG ACAAAGAATT TTGAAGAAGA GAAAATAAAA TTGCTCCATT TTTTCGGTTC TCATGCCGTA GCCATTGAAC ATATCGGAAG 22000
CACGGCGATA CCCAATCAAC GGGCGAAACC GGTTATAGAT ATATTTATTG GCGTTTCGCC TTTTGCTGAA CTACCTTTTA TCAGCGCATT TTTAATGCAA 22100
AGGAGTATCA CCACACTCCG ACAGATATGA CAGGCAGATA TTTGTTTGCA AAATATACAA ATGAAGTTTG GACCCATAAT CTTCTTGCTT TGGTGTAGGT 22200
ATAATACCCA AAAACGAAGT TTATGCGGAA CAGGATAACA CCTCGAAATC GGAGTTTATA TGGCTTCGTG GGGGAGTTAT GTGTTTGAGC TTTGTCGCAT 22300
TAACGCGAAT CAGGGGTTGA CGGCAGAAAT AGACTGGAAT TACACTTCTT GGAGTCGGCG TTGCCGGAAA ATTCTGATTG GATTAGTTGT TCGGGGTGCG 22400
AAAACAGTCG TAGTTCGGGG AAAAAACCGA ATTTTGACCC AAAACCGCGA AAAACTCGAA ATGACAGTTC CTATCAGTTT TCATAATGAA TTAAAACAAA 22500
AAAATTAGGC TGCAAAGATA GGGAAAATGT GTCAATTTGC GTAAGTGATA ATTAGTCAAT TAAGATAAAT GCTACTTTTG TAATCAAAGC AAAATGATAT 22600
GAGCAAGATT TTTGGAATCG TAAATATAAC CACCGATAGT TTTTCCGATG GAGGACTTTA TTTAGATACA GATAAGGCAA TTGAGCATGC TCTGCATTTG 22700
GTTGAAGATG GAGCAGATGT GATTGATTTG GGAGCCGCTT CCAGTAATCC TGATACAACT GAAGTGGGCG TTGTGGAAGA AATCAAAAGA CTCAAACCTG 22800
TCATTAAGGC TTTAAAAGAA AAAGGCATTT CTATTTCTGT TGATACATTT AAACCTGAGG TTCAGAGTTT TTGCATAGAA CAAAAGGTTG ATTTTATTAA 22900
TGATATTCAA GGTTTTCCTT ATCCTGAGAT TTATTCAGGC TTGGCAAAGT CAGATTGCAA ACTTGTGTTG ATGCACTCCG TTCAGCGAAT TGGTGCAGCT 23000
ACTAAAGTTG AAACGAATCC GGAAGAGGTT TTTACTTCCA TGATGGAATT TTTTAAAGAA AGAATTGCTG CTTTAGTTGA GGCTGGTGTA AAGCGTGAAC 23100
GAATTATTCT TGATCCGGGT ATGGGCTTCT TTTTAGGCTC TAATCCAGAA ACATCTATTC TTGTTTTGAA GCGTTTCCCT GAAATTCAAG AAGCTTTTAA 23200
TTTGCAAGTA ATGATTGCAG TGTCACGGAA ATCATTCTTA GGTAAAATAA CTGGAACCGA TGTGAAATCT CGTTTAGCAC CAACTCTTGC AGCAGAAATG 23300
TATGCATACA AAAAAGGTGC AGATTATCTC CGCACCCATG ATGTTAAGTC TTTATCAGAT GCCTTGAAAA TATCCAAAGC CCTAGGTTAG TTTTATTTTT 23400
TTAGCTTTCT AAATGAATAT GCTATACTTG CTCTGAAGTT GTGTTGGCGT ACTTGACTTA GGTCTGGATC TTTCGAACTG TTACTAAGAG GGTAATCATA 23500
GCCAAACATA AACTGGAAAT GTCCTAATTC TAAACCTGTT CCAATACCTA CACTATAATC AAGAGGACTT AAACTTCCAT CACTTCCAAA AGGTTTTGTG 23600
TACTTCTCTT GAGTTTTCGT AGCTAAATAG TAGTGTTTGT TGGTAAATCC AAATAATCCA ATATTCAATC TTGCGACCGG GAAAGTCCTC TTGAAAGTAC 23700
CACAAGTTCG CGGGATGGAG TTTTATCCCA GCTGCATCGA GAAAGGCATG CGCAGTGAGC GTGCTCTCAA GCTCGCCATC GCCGAAATGT ATGTCAAAGG 23800
AGTAAGTACC CGCAGGGTCT CGGATATCGT CGAAATTCTT TGTGGCACCG AAGTCAGCTC GTCCCAGGTC AGCAGGCTGG CAAAGGAGCT CGATGAAGAG 23900
ATTACGTCTT GGAAGGCGCA GCCTGTCGGA CAGATTCAAT ACTTGGTACT TGATGCGACC TATGAATCGG TTCGCGTCGG TTCCCATGTG GTCAAGCAGG 24000
CGCTTCTAGT GGCTATTGGC GTTGATTACA GCGGGAATCG GCATATTCTT GACGCCGAAG TCGCGAACAG TGAGGCAGAG GTAAACTGGC GTTCCTTTCT 24100
CGAGGGTCTC GTACGACGAG GGATGCACGG CCTGCGAATG ATCACCAGTG ATGACCACTC AGGACTGCGC GCGGCAATCG ATGCTGTCTT CCCTGGAATT 24200
CTGTGGCAAC GCTGCCAGTT TCATCTGCAG CAGAATGCCC ACTCCTACGT CACGAAAAAA GATGAGATCC CGCTGATAGC CGCTGATATT CGGAAGGTGT 24300
TCAATCGGAA TATGTCAAGA TAGTTTATGC CAGATTTTTG AGAGGTTTGC ATCAAGCAGC CTTCGAAAGA GGACGGCATG ACATTGAGAC CGGCAAGGCG 24400
TCGTAGATAC GCACTTTCCC AGTACGCCAC CGTAAGGGGT TTTTCTCTTT GGCAGCAAGA ATGGTTTGGT TTCTTTTTTC CTCGGCGCCG TAGGCGTCCG 24500
ACTAACACCT GAATTAAGCC GCGCCGCGAA GCGGCGTCGG CTTGAATGAA CTGTTAGATG CCAGCCCGAT CAATGTGCGC TGACCTTGGA TAGCAGATTT 24600
AGAACGGCGA CGCCACTAAC GATAAGTCCC ATGCCAACGA ACGCCCACAA GTCTAGTTTC TGGCCATGGA AGATCCAAGC GATAGCTGCC ACAAGTACGA 24700
TGCCGAGGCC AGCCCAAACA GCATAAGCAA TGCCGACCGG GATGGACTTG AGTGCGAGAG AGAGGAAATA GAACGCAAGC CCGTAGCCAG CCACAACTAC 24800
AACAGAAGGA ACTAACTTGG TGAATCCATG GCTGGACTTC AGTGCGGAAG TTGCGACGAC CTCACCAAAT ATTGCAATAG CCAGAAAGAG CCAGTTCTTC 24900
ACGTGCAATC TCCTCTACGG TATGAAGGAT AAATAGTGGT GGCTATGAGT TGCCAAAAAC AGTCTTGCGG CTGTCGATTT TCTGTGAGCA TACGCAACGC 25000
CAAATCTGGC ATCTAACGCT TGAGTTAAGC CGCGCCGCGA AGCGGCGTCG GCTTGAACGA ATTGTTAGAC ATTATTTGCC GACTACCTTG GTGATCTCGC 25100
CTTTCACGTA GTGAACAAAT TCTTCCAACT GATCTGCGCG GGAGGCCAAG CGATCTTCTT CTTGTCCAAG ATAAGCCTGT CTAGCTTCAA GTATGACGGG 25200
CTGATACTGG GCCGGCAGGC GCTCCATTGC CCAGTCGGCA GCGACATCCT TCGGCGCGAT TTTGCCGGTT ACTGCGCTGT ACCAAATGCG GGACAACGTA 25300
AGCACTACAT TTCGCTCATC GCCAGCCCAG TCGGGCGGCG AGTTCCATAG CGTTAAGGTT TCATTTAGCG CCTCAAATAG ATCCTGTTCA GGAACCGGAT 25400
CAAAGAGTTC CTCCGCCGCT GGACCTACCA AGGCAACGCT ATGTTCTCTT GCTTTTGTCA GCAAGATAGC CAGATCAATG TCGATCGTGG CTGGCTCGAA 25500
GATACCTGCA AGAATGTCAT TGCGCTGCCA TTCTCCAAAT TGCAGTTCGC GCTTAGCTGG ATAACGCCAC GGAATGATGT CGTCGTGCAC AACAATGGTG 25600
ACTTCTACAG CGCGGAGAAT CTCGCTCTCT CCAGGGGAAG CCGAAGTTTC CAAAAGGTCG TTGATCAAAG CTCGCCGCGT TGTTTCATCA AGCCTTACGG 25700
TCACCGTAAC CAGCAAATCA ATATCACTGT GTGGCTTCAG GCCGCCATCC ACTGCGGAGC CGTACAAATG TACGGCCAGC AACGTCGGTT CGAGATGGCG 25800
CTCGATGACG CCAACTACCT CTGATAGTTG AGTCGATACT TCGGCGATCA CCGCTTCCCT CATGATGTTT AACGCCTGAG CTCAGCCGAC CGAAACCGCG 25900
TAGCGGTTTT GGGTCGGCTG CAGCGATTTG TTGGGCGATA GCTTGCCACA TTCTCTCAAC GATTGGGATT TGATGTACTT TCCGCACTTT GTAGCGCGAC 26000
CACATCATGC TCCCCCTGGC CGCGAGAGCC CTTCACTCGG GAAACACAAG ACAGACCGAG CACGACTGTT GCAAGGGTCA AACAGTACAC GACAACCGGC 26100
CAAGCCGTGT TGCGCGGCAA CAGCGAAATG ATCAACGTTC CGATGCTTCC TAGCAGTACA CCGCCCAAGC AGAAGTAGAC TGCCGTGACC GTTCCAGCAA 26200
CATGGTCGAA TCCTCGAAGA GCGCCATTGG GCGCCACAGA TACCGCTGTG GCGACACCAA TACCCACTAG CCACATTGGA GCAATAAAGC CTAACACGGA 26300
CTGCAAAGCC CATATTTCGG TGATGGCAAG CAATACTGCT CCAGCTATCA GGCATCCCAT TCCCATTCGC AAGACACTTG GGCTGCCCCA CTTGGGTATC 26400
ACACGCCCCA TAAAACGAGC CGTAAACACC ATGGCAATTG CCACTGTGGC GAACAGCAGG CTGAAGCCAA GCTGAGACAC ACCTTGCCTG CCCATCATTA 26500
GTCCGGGCGC AATGGAGAAA AAGACGAAGA AGCTACCCAT TCCAGCGGCG TAACACAACG TGTACAACCA GAAGTTCAGG CACTTAACGG GGAGTAGCAG 26600
CTGCGACCAT TGCAAGCCCG CAACTCGTTG CACCCGGGTT TCAGGCCAGA ATCGCCACGC TGCTGCAGAT GCAGCGATCA TGCCCAAACC TAGAAACGCA 26700
AAGATAGCCC GCCACCCAAG CCACATGTCG ACGAGCGCTC CGAGCAATGG GCCTACCGCC GGGACCATGG CCAGCATGGA TCCGAGTATG CCGTAAATGA 26800
CATTACTTTC CTCGCGACCT GCGTAAATGT CACGTACTGT TGCAAATGTG GAAACAAGGC ACGCCGAGGC ACCACAAGCC TGAAGAATCC GAAGCCCCAG 26900
AAAGACTTCA GCCGATGACG TAAGAGCGAG GCCCATTGAC GCCACAACGT AGGCGAGGCC ACCTCCCAGT AGAACGGGGC GGCGCCCCAG TCGGTCCGAT 27000
AGCGGTCCAA ACAAGAGCTG ACCGGCACCA ATCATGACCA AGTACGTTGT CAGCGTAAGC TGAATTGTGC TCGCTGTCGT ACCAAGCGCG TTTGGCATAA 27100
ACGGCACTGC TGGCAAGTAC ATGTCCATGC CGAGTGATGC CAATAAATCG AACGGTGATA ACAACAACAC CGTGGCGGCA AGGGAGTACC GCCAACTAAA 27200
GTTTTTTGAG CTCACGAAAC CATCTCCTTG AACAAAGGAT TTGGCGGCGC TCCCGTAGAG CAAAAGGATT CATGAGAACG CCGCAACAAC CGAAAAATGA 27300
AGGTTGCTGC GGCTTACTTG TCTGCGTTCT TGGAAGTGCT CATCTGCTGA CTATCTCATG ATTGAATTTG AATCGTAACA ACTTTCAAGC AACTCTGCAA 27400
GGAGCCTTAT TGTGCGCCCA ACGCCGGAGT TAAGCCGCCG CGCGTAGCGC GGTCGGCTTG AACGAATTGT TAGACATCAT TTACCAACTG ACTTGATGAT 27500
CTCGCCTTTC ACAAAGCGAA TAAATTCTTC CAAGTGATCT GCGCGTGAGG CCAAGTGATC TTCTTTTTGT CCCAGATAAG CTTGCTTAGC TTCAAGTAAG 27600
ACGGGCTGAT ACTGGGCAGG TAGGCGTTTT ATTGCCCAGT CGGCAGCGAC ATCCTTCGGC GCGATTTTGC CGGTTATTGC GCTGTACCAA ATGCGGGACA 27700
ACGTAAGCAC TACATTTCGC TCATCGCCGG CCCAGTCGGG CTGCGAGTTC CATAGCTTCA AGGTTTCCCT CAGCGCCTCG AATAGATCCT GTTCAGGAAC 27800
CGGGTCAAAG AATTCCTCCG CTGCCGGACC TACCAAGGCA ACGCTATGTT CTCTTGCTTT TGTAAGCAGG ATAGCTAGAT CAATGTCGAT CATGGCTGGC 27900
TCGAAGATAC CCGCAAGAAT GTCATTGCGC TGCCATTCTC CAAATTGCAG CTCGCGCTTA GCCGGATAAC GCCACGGGAT GATGTCGTCA TGCACGACAA 28000
GGGTGACTTC TATAGCGCGG AGCGTCTCGC TCTCGCCAGG GAAAGCCGAA GCCTCCATAA GATCATTGAG CAATGCTCGC CGCGTCGTTT CATCAAGCTT 28100
TACGGCCACA GTAACCAACA AATCAATATC GCTGTATGGC TTCAGGCCGC CATCCACTGC GGAGCCGTAC AAATGCACGG CCAGCAACGT TGATTCCAGA 28200
TGGCGCTCAA TGACGCTTAG CACCTCTGAT AGTTGGTTCG AAATTTCGAT GGTCACCGCT TCCCTCATGA TGTCTAACGG GCGAGGTAAG CCGACCGCAG 28300
AATGCGGGTC GGCTTGACCG AAATGTTAGA ACCAGAAGCC AAAACGGATA ACTTGAATTT GGCGACGGGC GCTAACCGTG AAAAAACGCT GCGCCACCGA 28400
GGCGGCACAG CACTGCAAAA ACGATAGCTG CTTGCGCTTG CTACGCAAGG GCTAGAGGCC AAAAAGACTG AAAACCTGCG CAGCCCATGC AGGCGAAGCC 28500
CGGAAAAAAG GCAAAACAGG CACTGAATAA CGCCTGAAAA GCTAAATGCC GTTTGAATAA ACATGAGCTA AATAAAGCTG GGTTTCAGTG GTGCTAACGT 28600
TGGACGTAAC GAGAGCCGGA GCGCAGCGGA GGGAACCAAA ATGCGCAGCA TTTTGGCGTC CCGTTGACGG AATGGTTAGC CGTTTCGACG CGCATAAACG 28700
GAGTGGGTGT ACGGAATTAC AGCTTGAATG GTTTCGGTTG AGACAAGCTC GAATTCTGTT TCGTTGAGCA TTGGGAAGAA GGCGTCACCC TCGAAGGTTT 28800
GATGTACCTC AGATAGAAAC ACGCCGTGGG CGTGAGGTAG TGCCAGAGTG TATATCTCAG CTCCGCCCGC GACGTAGAGT TCATTGCCGA GTTCGGATGC 28900
CAAAGCGATA GCGTGCGACA GCGTTGAAAC AACTACGCAG CCAGTGGCGC GGTAGTTAGC TTGGCGTGAG ATTACCAATG TGTGACGGTT CGGTAGAGGC 29000
TTGCCGATAG ACTCAAAGGT CTTTCGCCCC ATGACAACGA CTTTTCCCTC AGTGAGTCTG CGAAAAATCT TCTGCTCACC CGGAATTTTC CAGGGGATAT 29100
TAGGACCATT GCCAATAACC CGATTGGCTC CCATCGCAGC AACGAGATAA ATGCGTACTG ATTCCGAGTT CATATGGCTA ACTTTGTTTT AGGGCGACTG 29200
CCCTGCTGCG TAACATCGTT GCTGCTCCAT AACATCAAAC ATCGACCCAC GGCGTAACGC GCTTGCTGCT TGGATGCCCG AGGCATAGAC TGTACAAAAA 29300
AACAGTCATA ACAAGCCATG AAAACCGCCA CTGCGCCGTT ACCACCGCTG CGTTCGGTCA AGGTTCTGGA CCAGTTGCGT GAGCGCATAC GCTACTTGCA 29400
TTACAGCTTA CCAACCGAAC AGGCTTATGT CCACTGGGTT CGTGCCTTCA TCCGTTTCCA CGGTGTGCGT CACCCGGCAA CCTTGGGCAG CAGCGAAGTC 29500
GAGGCATTTC TGTCCTGGCT GGCGAACGAG CGCAAGGTTT CGGTCTCCAC GCATCGTCAG GCATTGGCGG CCTTGCTGTT CTTCTACGGC AAGGTGCTGT 29600
GCACGGATCT GCCCTGGCTT CAGGAGATCG GAAGACCTCG GCCGTCGCGG CGCTTGCCGG TGGTGCTGAC CCCGGATGAA GTGGTTCGCA TCCTCGGTTT 29700
TCTGGAAGGC GAGCATCGTT TGTTCGCCCA GCTTCTGTAT GGAACGGGCA TGCGGATCAG TGAGGGTTTG CAACTGCGGG TCAAGGATCT GGATTTCGAT 29800
CACGGCACGA TCATCGTGCG GGAGGGCAAG GGCTCCAAGG ATCGGGCCTT GATGTTACCC GAGAGCTTGG CACCCAGCCT GCGCGAGCAG CTGTCGCGTG 29900
CACGGGCATG GTGGCTGAAG GACCAGGCCG AGGGCCGCAG CGGCGTTGCG CTTCCCGACG CCCTTGAGCG GAAGTATCCG CGCGCCGGGC ATTCCTGGCC 30000
GTGGTTCTGG GTTTTTGCGC AGCACACGCA TTCGACCGAT CCACGGAGCG GTGTCGTGCG TCGCCATCAC ATGTATGACC AGACCTTTCA GCGCGCCTTC 30100
AAACGTGCCG TAGAACAAGC AGGCATCACG AAGCCCGCCA CACCGCACAC CCTCCGCCAC TCGTTCGCGA CGGCCTTGCT CCGCAGCGGT TACGACATTC 30200
GAACCGTGCA GGATCTGCTC GGCCATTCCG ACGTCTCTAC GACGATGATT TACACGCATG TGCTGAAAGT TGGCGGTGCC GGAGTGCGCT CACCGCTTGA 30300
TGCGCTGCCG CCCCTCACTA GTGAGAGGTA GGGCAGCGCA AGTCAATCCT GGCGGATTCA CTACCCCTGC GCGAAGGCCA TCGGTGCCGC ATCGAACGGC 30400
CGGTTGCGGA AAGTCCTCCC TGCGTCCGCT GATGGCCGGC AGCAGCCCGT CGTTGCCTGA TGGATCCAAC CCCTCCGCTG CTATAGTGCA GTCGGCTTCT 30500
GACGTTCAGT GCAGCCGTCT TCTGAAAACG ACAATGGAGG TGGTAGCCGA GGGTGTGGAA ACACCCGACT GCCTTGCGTG GTTGCGGCAG GCGGGTTGCG 30600
ACACGGTGCA GGGTTTCCTG TTCGCCAGGC CGATGCCGGC GGCGGCCTTC GTCGGCTTCG TCAACCAATG GAGGAACACC ACCATGAACG CCAATGAACC 30700
GAGCACCAGT TGCTGCGTGT GCTGCAAGGA AATCCCGCTC GATGCCGCCT TCACGCCGGA AGGGGCCGAG TACGTGGAGC ATTTCTGCGG GCTGGAGTGC 30800
TATCAGCGCT TCCAGGCGCG GGCCAGCACT GCGACCGAAA CCAGCGTCAA ACCGGACGCT TGTGATTCGC CGCCGTCAGG TTGAGGCATA CCCTAACCTG 30900
ATGTCAGATG CCATGTGTAA ATTGCGTCAG GATAGGATTG AATTTTGAAT TTATTGACAT ATCTCGTTGA AGGTCATAGA GTCTTCCCTG ACATTTTGCA 31000
GGGAATTCCA TGACTGGACA GCGCATTGGG TATATCAGGG GGCACTGTTG CAAATAGTCG GTGGTGATAA ACTTATCATC CCCTTTTGCT GATGGAGCTG 31100
CACATGAACC CATTCAAAGG CCGGCATTTT CAGCGTGACA TCATTCTGTG GGCCGTACGC TGGTACTGCA AATACGGCAT CAGTTACCGT GAGCTGCAGG 31200
AGATGCTGGC TGAACGCGGA GTGAATGTCG ATCACTCCAC GATTTACCGC TGGGTTCAGC GTTATGCGCC TGAAATGGAA AAACGGCTGC GCTGGTACTG 31300
GCGTAACCCT TCCGATCTTT GCCCGTGGCA CATGGATGAA ACCTACGTGA AGGTCAATGG CCGCTGGGCG TATCTGTACC GGGCCGTCGA CAGCCGGGGC 31400
CGCACTGTCG ATTTTTATCT CTCCTCCCGT CGTAACAGCA AAGCTGCATA CCGGTTTCTG GGTAAAATCC TCAACAACGT GAAGAAGTGG CAGATCCCGC 31500
GATTCATCAA CACGGATAAA GCGCCCGCCT ATGGTCGCGC GCTTGCTCTG CTCAAACGCG AAGGCCGGTG CCCGTCTGAC GTTGAACACC GACAGATTAA 31600
GTACCGGAAC AACGTGATTG AATGCGATCA TGGCAAACTG AAACGGATAA TCGGCGCCAC GCTGGGATTT AAATCCATGA AGACGGCTTA CGCCACCATC 31700
AAAGGTATTG AGGTGATGCG TGCACTACGC AAAGGCCAGG CCTCAGCATT TTATTATGGT GATCCCCTGG GCGAAATGCG CCTGGTAAGC AGAGTTTTTG 31800
AAATGTAAGG CCTTTGAATA AGACAAAAGG CTGCCTCATC GCTAACTTTG CAACAGTGCC TATCAGGGTC AGCACCTTCG ACCAGAACCC GGAACGGCAA 31900
CTGGAAGGCG TCAAGGTTGA TCGCGCTTTT AGCGACAAGG CATCCGGCAA GGATGTCAAG CGTCCGCAAC TGGAAGCGCT GATAAGCTTC GCCCGCACCG 32000
GCGACACCGT GGTGGTGCAT AGCATGGATC GCCTGGCGCG CAATCTCGAT GATTTGCGCC GGATCGTGCA AACGCTGACA CAACGCGGCG TGCATATCGA 32100
ATTCGTCAAG GAACACCTCA GTTTTACTGG CGAAGACTCT CCGATGGCGA ACCTGATGCT CTCGGTGATG GGCGCGTTCG CCGAGTTCGA GCGCGCCCTG 32200
ATCCGCGAGC GTCAGCGCGA GGGTATTGCG CTCGCCAAGC AACGCGGGGC TTACCGTGGC AGGAAGAAAT CCCTGTCGTC TGAGCGTATT GCCGAACTGC 32300
GCCAACGTGT CGAGGCTGGC GAGCAAAAGA CCAAGCTTGC TCGTGAATTC GGAATCAGTC GCGAAACCCT GTATCAATAC TTGAGAACGG ATCAGTAAAT 32400
ATGCCACGTC GTTCCATCCT GTCCGCCGCC GAGCGGGAAA GCCTGCTGGC GTTGCCGGAC TCCAAGGACG ACCTGATCCG ACATTACACA TTCAACGATA 32500
CCGACCTCTC GATCATCCGA CAGCGGCGCG GGCCAGCCAA TCGGCTGGGC TTCGCGGTGC AGCTCTGTTA CCTGCGCTTT CCCGGCGTCA TCCTGGGCGT 32600
CGATGAACTA CCGTTCCCGC CCTTGTTGAA GCTGGTCGCC GACCAGCTCA AGGTCGGCGT CGAAAGCTGG AACGAGTACG GCCAGCGGGA GCAGACCCGG 32700
CGCGAGCACC TGAGCGAGCT GCAAACCGTG TTCGGTTTCC GGCCCTTCAC CATGAGCCAT TACCGGCAGG CCGTCCAGAT GCTGACCGAG CTGGCGATGC 32800
AAACCGACAA AGGCATCGTG CTGGCCAGCG CCTTGATCGG GCACCTGCGG CGGCAGTCGG TCATTCTGCC CGCCCTCAAC GCCGTCGAGC GGGCGAGTGC 32900
CGAGGCGATC ACCCGTGCTA ACCGGCGCAT CTACGACGCC TTGGCCGAAC CACTGGCGGA CGCGCATCGC CGCCGCCTCG ACGATCTGCT CAAGCGCCGG 33000
GACAACGGCA AGACGACCTG GTTGGCTTGG TTGCGCCAGT CTCCGGCCAA GCCAAATTCG CGGCATATGC TGGAACACAT CGAACGCCTC AAGGCATGGC 33100
AGGCACTCGA TCTGCCTACC GGCATCGAGC GGCTGGTTCA CCAGAACCGC CTGCTCAAGA TTGCCCGCGA GGGCGGCCAG ATGACACCCG CCGACCTGGC 33200
CAAATTCGAG CCGCAACGGC GCTACGCCAC TCTCGTGGCG CTGGCCACCG AGGGCATGGC CACCGTCACC GACGAAATCA TCGACCTGCA CGACCGCATC 33300
CTGGGTAAGC TGTTTAACGC TGCCAAGAAT AAGCATCAGC AGCAGTTCCA GGCGTCAGGC AAGGCCATCA ACGCCAAGGT ACGTCTGTAC GGGCGCATCG 33400
GTCAGGCGCT GATCGACGCC AAGCAATCAG GCCGCGATGC GTTTGCCGCC ATCGAGGCCG TCATGTCCTG GGATTCCTTT GCCGAGAGCG TCACCGAGGC 33500
GCAGAAGCTC GCGCAACCCG ATGACTTCGA TTTCCTGCAT CGCATCGGCG AGAGCTACGC CACCCTGCGC CGCTATGCAC CGGAATTCCT TGCCGTGCTC 33600
AAGCTGCGGG CCGCGCCCGC CGCCAAAAAC GTGCTTGATG CCATTGAGGT GCTGCGCGGC ATGAACACCG ACAACGCCCG CAAGCTGCCA GCCGATGCAC 33700
CGACCGGCTT CATCAAGCCG CGCTGGCAGA AACTGGTGAT GACCGACGCC GGCATCGACC GGCGCTACTA CGAACTGTGC GCGCTGTCCG AGTTGAAGAA 33800
CTCCCTGCGC TCGGGCGACA TCTGGGTGCA GGGTTCACGC CAGTTCAAGG ACTTCGAGGA CTACCTGGTA CCGCCCGAGA AGTTCACCAG CCTCAAGCAG 33900
TCCAGCGAAT TGCCGCTGGC CGTGGCCACC GACTGCGAAC AATATCTGCA TGAGCGGCTG ACGCTGCTGG AAGCACAACT TGCCACCGTC AACCGCATGG 34000
CGGCAGCCAA CGACCTGCCG GATGCCATCA TCACCGAGTC GGGCTTGAAG ATCACGCCGC TGGATGCGGC GGTGCCCGAC ACCGCGCAGG CGCTGATAGA 34100
CCAGACAGCC ATGGTCCTGC CGCACGTCAA GATCACCGAA CTGCTGCTCG AAGTCGATGA GTGGACGGGC TTCACCCGGC ACTTCACGCA CTTGAAATCG 34200
GGCGATCTGG CCAAGGACAA GAACCTGTTG TTGACCACGA TCCTGGCCGA CGCGATCAAC CTGGGCCTGA CCAAGATGGC CGAGTCCTGC CCCGGCACGA 34300
CCTACGCGAA GCTCGCTTGG CTGCAAGCCT GGCATACCCG CGACGAAACG TACTCGACAG CGTTGGCTGA ACTGGTCAAC GCTCAGTTTC GGCATCCCTT 34400
TGCCGGGCAC TGGGGCGATG GCACCACATC ATCATCGGAC GGACAGAATT TCCGAACCGC TAGCAAGGCA AAGAGCACGG GGCACATCAA CCCAAAATAT 34500
GGCAGCAGCC CAGGACGGAC TTTCTACACC CACATCTCCG ACCAATACGC GCCATTCCAC ACCAAGGTGG TCAATGTCGG CCTGCGCGAC TCAACCTACG 34600
TGCTCGACGG CCTGCTGTAC CACGAATCCG ACCTGCGGAT CGAGGAGCAC TACACCGACA CGGCGGGCTT CACCGATCAC GTCTTCGCCC TGATGCACCT 34700
CTTGGGCTTC CGCTTCGCGC CGCGCATCCG CGACCTGGGC GACACCAAGC TCTACATCCC GAAGGGCGAT GCCGCCTATG ACGCGCTCAA GCCGATGATC 34800
GGCGGCACGC TCAACATCAA GCACGTCCGC GCCCATTGGG ACGAAATCCT GCGGCTGGCC ACCTCGATCA AGCAGGGCAC GGTGACGGCC TCGCTGATGC 34900
TCAGGAAACT CGGCAGCTAC CCGCGCCAGA ACGGCTTGGC CGTCGCGCTG CGCGAGTTGG GCCGCATCGA GCGCACGCTG TTCATCCTCG ACTGGCTGCA 35000
AAGCGTCGAG CTACGCCGCC GCGTGCATGC CGGGCTGAAC AAGGGCGAGG CGCGCAATGC GCTGGCCCGT GCCGTGTTCT TCAACCGCCT TGGTGAAATC 35100
CGTGACCGCA GTTTCGAGCA GCAGCGCTAC CGGGCCAGCG GCCTCAACCT GGTGACGGCG GCCATCGTGC TGTGGAACAC GGTCTACCTG GAGCGTGCGG 35200
CGCATGCGTT GCGCGGCAAT GGTCATGCCG TCGATGACTC GCTATTGCAG TACCTGTCGC CACTCGGCTG GGAGCACATC AACCTGACCG GTGATTACCT 35300
ATGGCGCAGC AGCGCCAAGA TCGGCGCGGG GAAGTTCAGG CCGCTACGGC CTCTGCAACC GGCTTAGCGT GCTTTATTTT CCGTTTTCTG AGACGACCCC 35400

 Recombination Sites     

Name Coordinates Gene Sequence
attC cmlA6 3'-end 2789-2794 6 GCCCAA
attC JK007 3'-end 21842-21847 6 GTATAA
attC DUF1010 3'-end 22552-22557 6 TTAGTC
attC AAC(3)-Ia 3'-end 23458-23463 6 TTAGGT
attC DUF1010 3'-end 24500-24505 6 GACTAA
attC aadA1a core 25017-25070 54 CGCTTGAGTT AAGCCGCGCC GCGAAGCGGC GTCGGCTTGA ACGAATTGTT AGAC
attC-cmlA6 5'-end 25873-25936 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT
GGGC
attC cmlA6 core 25873-25936 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT
GGGC
attC-cmlA6 3'-end 27416-27421 6 GCCCAA
attC-aadA2 5'-end 27422-27475 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC aadA3 core 27422-27475 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC-aadA2 3'-end 28272-28277 6 GTCTAA
attC orfD core 28283-28330 48 GAGGTAAGCC GACCGCAGAA TGCGGGTCGG CTTGACCGAA ATGTTAGA
attI 29182-29237 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA
attC cmlA6 3'-end 29483-29488 6 TTGGGC
res 30869-30999 131 GCCGCCGTCA GGTTGAGGCA TACCCTAACC TGATGTCAGA TGCCATGTGT AAATTGCGTC
AGGATAGGAT TGAATTTTGA ATTTATTGAC ATATCTCGTT GAAGGTCATA GAGTCTTCCC
TGACATTTTG C
res_site_I 30869-30907 39 GCCGCCGTCA GGTTGAGGCA TACCCTAACC TGATGTCAG
res_site_II 30921-30964 44 ATTGCGTCAG GATAGGATTG AATTTTGAAT TTATTGACAT ATCT
res_site_III 30968-30999 32 TGAAGGTCAT AGAGTCTTCC CTGACATTTT GC

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
merR Tn21.2 34-468 Passenger Gene Heavy Metal Resistance -
merT Tn21.2 540-890 Passenger Gene Heavy Metal Resistance +
merP Tn21.2 904-1179 Passenger Gene Heavy Metal Resistance +
merC Tn21.2 1215-1637 Passenger Gene Heavy Metal Resistance +
merA Tn21.2 1689-3383 Passenger Gene Heavy Metal Resistance +
merD Tn21.2 3401-3763 Passenger Gene Heavy Metal Resistance +
merE Tn21.2 3760-3996 Passenger Gene Heavy Metal Resistance +
urfM 5'-end-EEN5315556.1 Tn21.2 3993-4663 Passenger Gene Other +
tniA_p Tn21.2 4775-6026 Transposase   +
tnpA IS26 6090-6794 Transposase   +
tnp_p IS10_p 6848-7186 Transposase   +
jemA Tn10_p 7552-8757 Passenger Gene Other -
jemB Tn10_p 9201-9521 Passenger Gene Other +
CPT07_26605 Tn10_p 9514-9900 Passenger Gene Hypothetical +
jemC Tn10_p 9908-10594 Passenger Gene Other +
tetR (ARO:3003479) Tn10_p 10572-11198 Passenger Gene Antibiotic Resistance -
tet(B) (ARO:3000166) Tn10_p 11277-12482 Passenger Gene Antibiotic Resistance +
tetC_p Tn10_p 12595-12849 Passenger Gene Antibiotic Resistance -
insAB IS1R 12962-13659 Transposase   +
insA IS1R 12962-13237 Accessory Gene   +
insB IS1R 13156-13659 Transposase   +
merR Tn21.1 13766-14200 Passenger Gene Heavy Metal Resistance -
merT Tn21.1 14272-14622 Passenger Gene Heavy Metal Resistance +
merP Tn21.1 14636-14911 Passenger Gene Heavy Metal Resistance +
merC Tn21.1 14947-15369 Passenger Gene Heavy Metal Resistance +
merA Tn21.1 15421-17115 Passenger Gene Heavy Metal Resistance +
merD Tn21.1 17133-17495 Passenger Gene Heavy Metal Resistance +
merE Tn21.1 17492-17728 Passenger Gene Heavy Metal Resistance +
urfM 5'-end Tn21.2 17725-18395 Passenger Gene Other +
tniA_p In_Tn21.1 18471-19775 Transposase   +
tnpA IS26 19822-20526 Transposase   +
SDR family oxidoreductase In_Tn21.1 20984-21847 Passenger Gene Other +
GrpB domain protein In_Tn21.1 21885-22130 Passenger Gene Other +
sul3 (ARO:3000413) In_Tn21.1 22599-23390 Passenger Gene Antibiotic Resistance +
tnp IS256 family In_Tn21.1 23715-24323 Transposase   +
qacL (ARO:3005098) In_Tn21.1 24570-24902 Passenger Gene Antibiotic Resistance -
aadA (ARO:3002601) In_Tn21.1 25072-25863 Passenger Gene Antibiotic Resistance -
cmlA6 (ARO:3002696) In_Tn21.1 25956-27215 Passenger Gene Antibiotic Resistance -
aadA2 (ARO:3002602) In_Tn21.1 27477-28256 Passenger Gene Antibiotic Resistance -
DUF1010 family protein In_Tn21.1 28274-28564 Passenger Gene Other -
dfrA12 (ARO:3002858) In_Tn21.1 28676-29173 Passenger Gene Antibiotic Resistance -
intI1 In_Tn21.1 29318-30331 Integron Integrase Class 1 +
tnpM Tn21.1 30534-30884 Accessory Gene Inhibitor +
tnpR 5'-end Tn21.1 31010-31032 Accessory Gene Resolvase +
tnpA IS26 31104-31808 Transposase   +
tnpR 3'-end Tn21.1 31861-32398 Accessory Gene Resolvase +
tnpA Tn21.1 32401-35367 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn21.2 435 34-468 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   activator-repressor of mer operon
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLREPDKPY GSIRRYGEAD VVRVKFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLKDVREKM
ADLARMETVL SELVCACHAR KGNVSCPLIA SLQGEAGLAR SAMP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn21.2 351 540-890 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   cytosolic mercuric ion transport protein
Target:   Mercury
Protein Sequence:  
MSEPQNGRGA LFAGGLAAIL ASTCCLGPLV LVALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAWKRIY RPVQACKPGE VCAIPQVRAT YKLIFWIVAV
LVLVALGFPY VVPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn21.2 276 904-1179 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Protein Sequence:  
MKKLFASLAL AAAVAPVWAA TQTVTLAVPG MTCAACPITV KKALSKVEGV SKVDVGFEKR EAVVTFDDTK ASVQKLTKAT ADAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merC MerC Tn21.2 423 1215-1637 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   transmembrane protein mercury transport
Target:   Mercury
Protein Sequence:  
MGLMTRIADK TGALGSVVSA MGCAACFPAL ASFGAAIGLG FLSQYEGLFI SRLLPLFAAL AFLANALGWF SHRQWLRSLL GMIGPAIVFA ATVWLLGNWW
TANLMYVGLA LMIGVSIWDF VSPAHRRCGP DGCELPAKRL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn21.2 1695 1689-3383 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercuric ion reductase
Target:   Mercury
Protein Sequence:  
MSTLKITGMT CDSCAVHVKD ALEKVPGVQS ADVSYAKGSA KLAIEVGTSP DALTAAVAGL GYRATLADAP SVSTPGGLLD KMRDLLGRND KTGSSGALHI
AVIGSGGAAM AAALKAVEQG ARVTLIERGT IGGTCVNVGC VPSKIMIRAA HIAHLRRESP FDGGIAATTP TIQRTALLAQ QQARVDELRH AKYEGILEGN
PAITVLHGSA RFKDNRNLIV QLNDGGERVV AFDRCLIATG ASPAVPPIPG LKDTPYWTST EALVSETIPK RLAVIGSSVV ALELAQAFAR LGAKVTILAR
STLFFREDPA IGEAVTAAFR MEGIEVREHT QASQVAYING EGDGEFVLTT AHGELRADKL LVATGRAPNT RKLALDATGV TLTPQGAIVI DPGMRTSVEH
IYAAGDCTDQ PQFVYVAAAA GTRAAINMTG GDAALNLTAM PAVVFTDPQV ATVGYSEAEA HHDGIKTDSR TLTLDNVPRA LANFDTRGFI KLVVEEGSGR
LIGVQAVAPE AGELIQTAAL AIRNRMTVQE LADQLFPYLT MVEGLKLAAQ TFNKDVKQLS CCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn21.2 363 3401-3763 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   secondary regulatory protein
Target:   Mercury
Protein Sequence:  
MSAYTVSQLA HNAGVSVHIV RDYLVRGLLR PVACTTGGYG VFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGAQ AAAQLAVLRQ LVERRRAALA
HLDAQLASMP AERAHEEALP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn21.2 237 3760-3996 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Comment:   similar to urf-1 in pKLH2 (GenBank AF213017), pKLH272 (Genbank Y08992), pMER610 (GenBank Y08993), pKLH210 (GenBank Y10102), Tn5036 (Genbank Y09025), orf1 in Tn501 (GenBank Z00027), and urf-1 in Tn5041 (GenBank X98999)
Protein Sequence:  
MNAPDKLPPE TRQPVSGYLW GALAVLTCPC HLPILAAVLA GTTAGAFLGE HWGVAALALT GLFVLAVTRL LRAFRGGS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM 5'-end-EEN5315556.1 UrfM 5'-end-EEN5315556.1 Tn21.2 671 3993-4663 +
Class:   Passenger Gene
Sub Class:   Other
Comment:   urfM ORF interrupted by insertion of In2
Protein Sequence:  
MTSSQPAGWT AAELAQAAAR GQLDLHYQPL VDLRDHRIAG AEALMRWRHP RLGLLPPGQF LPLAESFGLM PEIGAWVLGE ACRQMHKWQG PAWQPFRLAI
NVSASQVGPT FDDEVKRVLA DMALPAELLE IELTESVAFG NPALFASFDA LRAIGVRFAA DDFGTGYSCL QHLKCCPITT LKIDQSFVAR LPDDARDQTI
VRAVIQLAHG LGMDVIFRRR LHQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA_p TniA_p Tn21.2 1252 4775-6026 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   Contains the first 417 amino acids of tniA (In2)|| can be extended upstream by 12 amino acids|| truncated by insertion of IS26
Protein Sequence:  
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE
LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR
TLTRTGFVID HIHYYAD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA IS26 705 6090-6794 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MNPFKGRHFQ RDIILWAVRW YCKYGISYRE LQEMLAERGV NVDHSTIYRW VQRYAPEMEK RLRWYWRNPS DLCPWHMDET YVKVNGRWAY LYRAVDSRGR
TVDFYLSSRR NSKAAYRFLG KILNNVKKWQ IPRFINTDKA PAYGRALALL KREGRCPSDV EHRQIKYRNN VIECDHGKLK RIIGATLGFK SMKTAYATIK
GIEVMRALRK GQASAFYYGD PLGEMRLVSR VFEM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp_p N IS10_p 339 6848-7186 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   highly similar to C terminal 112 amino acids of tnp (IS10R)
Protein Sequence:  
IEETFRDLKS PAYGLGLRHS RTSSSERFDI MLLIALMLQL TCWLAGVHAQ KQGWDKHFQA NTVRNRNVLS TVRLGMEVLR HSGYTITRED SLVAATLLTQ
NLFTHGYVLG KL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
jemA JemA Tn10_p 1206 7552-8757 -
Class:   Passenger Gene
Sub Class:   Other
Function:   glutamate:sodium symporter activity (GO:0015501)
Comment:   sodium-dependent glutamate permease
Protein Sequence:  
MILDASYTLL VACIALLIGM FVVKFTPFLQ KNHIPEAVVG GFIVAIVLLI IDKTSGYSFT FDASLQSLLM LTFFSSIGLS SDFSRLIKGG KPLVLLTIAV
TILIAIQNTV GMSMAVMMNE SPFIGLIAGS ITLTGGHGNA GAWGPILADK YGVTGAVELA MACATLGLVL GGLVGGPVAR HLLKKVSIPK TTEQERDTIV
EAFEQPSVKR KINANNVIET ISMLIICIVV GGYISALFKD TFLQLPTFVW CLFVGIIIRN TLTHVFKHEV FEPTVDVLGS VALSLFLAMA LMSLKFGQLA
SMAGPVLIII AVQTVVMVLF ACFVTFKMMG KDYDAVVISA GHCGFGMGAT PTAIANMQTV TKAFGPSHKA FLVVPMVGAF IVDISNSILI KIFIEIGTYF
T

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
jemB JemB Tn10_p 321 9201-9521 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  Antibiotic biosynthesis monooxygenase (Pfam:PF03992)
Protein Sequence:  
MIAVIFEVQI QPDQQTRYLT LAEELRPLLS HVAGFISIER FQSLATEGKM LSLSWWENEY AVLQWKNHVL HAKAQQEGRE SIFDFYKISI AHITREYSFK
KDKDNV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
CPT07_26605 CPT07_26605 Tn10_p 387 9514-9900 +
Class:   Passenger Gene
Sub Class:   Hypothetical
Comment:   Amino acid-binding protein
Protein Sequence:  
MFDVHVVLDN QIGQLALLGK TLGNKGIGLE GGGIFTVGDE CHAHFLVEQG KEAKIALEQA GLLVLAIRTP LIRKLKQEKP GELGEIARVL AENNINILVQ
YSDHANQLIL ITDNDSMAAS VTLPWAIK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
jemC JemC Tn10_p 687 9908-10594 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  HTH ArsR-type DNA-binding domain (InterPro:IPR001845)
Comment:   N-terminal 100 amino acids similar to bacterial transcriptional repressors and metal-binding proteins (PMID:10781570)
Protein Sequence:  
MANLIRKEVT FESSIAAIGA AMSDISRVKI LSALMDGRAW TATELSSVAN ISASTASSHL SKLLDCQLIT VVAQGKHRYF RLAGKDIAEL MESMMGISLN
HGVHARVSTP VHLRKARTCY DHLAGEVAVK IYDSLCQQQW ITENGSMITL SGIQYFHEMG IDVPSKHSRK ICCACLDWSE RRFHLGGYVG AALFSLYESK
GWLTRHLGYR EVTITEKGYA AFKTHFHI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tetR (ARO:3003479) TetR Tn10_p 627 10572-11198 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target alteration (ARO:0001001); antibiotic efflux (ARO:0010000)
Transpoase Chemistry:   repressor of the tetracycline resistance element
Target:   tetracycline antibiotic (ARO:3000050)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   perfect match to reference sequence for ARO:3003479
Protein Sequence:  
MMSRLDKSKV INSALELLNE VGIEGLTTRK LAQKLGVEQP TLYWHVKNKR ALLDALAIEM LDRHHTHFCP LEGESWQDFL RNNAKSFRCA LLSHRDGAKV
HLGTRPTEKQ YETLENQLAF LCQQGFSLEN ALYALSAVGH FTLGCVLEDQ EHQVAKEERE TPTTDSMPPL LRQAIELFDH QGAEPAFLFG LELIICGLEK
QLKCESGS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tet(B) (ARO:3000166) Tet(B) Tn10_p 1206 11277-12482 +
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Transpoase Chemistry:   antibiotic efflux (ARO:0010000)
Target:   tetracycline antibiotic (ARO:3000050)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   strict match to reference sequence for ARO:3000166 (bitscore:786)||99.9% identical to TnCentral reference sequence for tet(B) (Tn10)||Synonym: tetB
Protein Sequence:  
MNSSTKIALV ITLLDAMGIG LIMPVLPTLL REFIASEDIA NHFGVLLALY ALMQVIFAPW LGKMSDRFGR RPVLLLSLIG ASLDYLLLAF SSALWMLYLG
RLLSGITGAT GAVAASVIAD TTSASQRVKW FGWLGASFGL GLIAGPIIGG FAGEISPHSP FFIAALLNIV AFLVVMFWFR ETKNTRDNTD TEVGVETQSN
SVYITLFKTM PILLIIYFSA QLIGQIPATV WVLFTENRFG WNSMMVGFSL AGLGLLHSVF QAFVAGRIAT KWGEKTAVLL GFIADSSAFA FLAFISEGWL
VFPVLILLAG GGIALPALQG VMSIQTKSHQ QGALQGLLVS LTNATGVIGP LLFAVIYNHS LPIWDGWIWI IGLAFYCIII LLSMTFMLTP QAQGSKQETS
A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tetC_p N Tn10_p 255 12595-12849 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   tetracycline antibiotic (ARO:3000050)
Sequence Family:  Bacterial regulatory proteins, tetR family (Pfam:PF00440)
Comment:   no match in CARD||identical to C-terminal 84 amino acids of tetC (Tn10)
Protein Sequence:  
MPIIGWNELE KISLEYITGK VNAIVSKLIQ ENQLKAYDDD VLKNLLNGWF MHIAIHAKNL KELADKKGQF IAIYRGFLLS LKDK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
insAB InsAB IS1R 698 12962-13659 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   fusion protein from -1 programmed frameshifting between insA and insB on aaaaaa sequence
Protein Sequence:  
MASVSISCPS CSATDGVVRN GKSTAGHQRY LCSHCRKTWQ LQFTYTASQP GTHQKIIDMA MNGVGCRATA RIMGVGLNTI FRHLKKLRPQ SVTSRIQPGS
DVIVCAEMDE QWGYVGAKSR QRWLFYAYDR LRKTVVAHVF GERTMATLGR LMSLLSPFDV VIWMTDGWPL YESRLKGKLH VISKRYTQRI ERHNLNLRQH
LARLGRKSLS FSKSVELHDK VIGHYLNIKH YQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
insA InsA IS1R 276 12962-13237 +
Class:   Accessory Gene
Comment:   regulatory protein
Protein Sequence:  
VASVSISCPS CSATDGVVRN GKSTAGHQRY LCSHCRKTWQ LQFTYTASQP GTHQKIIDMA MNGVGCRATA RIMGVGLNTI FRHLKNSGRS R

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
insB InsB IS1R 504 13156-13659 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   downstream catalytic domain
Protein Sequence:  
MPGNRPHYGR WPQHDFPPFK KLRPQSVTSR IQPGSDVIVC AEMDEQWGYV GAKSRQRWLF YAYDRLRKTV VAHVFGERTM ATLGRLMSLL SPFDVVIWMT
DGWPLYESRL KGKLHVISKR YTQRIERHNL NLRQHLARLG RKSLSFSKSV ELHDKVIGHY LNIKHYQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn21.1 435 13766-14200 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   activator-repressor of mer operon
Target:   Mercury
Protein Sequence:  
MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLREPDKPY GSIRRYGEAD VVRVKFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLKDVREKM
ADLARMETVL SELVCACHAR KGNVSCPLIA SLQGEAGLAR SAMP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn21.1 351 14272-14622 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   cytosolic mercuric ion transport protein
Target:   Mercury
Protein Sequence:  
MSEPQNGRGA LFAGGLAAIL ASTCCLGPLV LVALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAWKRIY RPVQACKPGE VCAIPQVRAT YKLIFWIVAV
LVLVALGFPY VVPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn21.1 276 14636-14911 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Protein Sequence:  
MKKLFASLAL AAAVAPVWAA TQTVTLAVPG MTCAACPITV KKALSKVEGV SKVDVGFEKR EAVVTFDDTK ASVQKLTKAT ADAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merC MerC Tn21.1 423 14947-15369 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   transmembrane protein mercury transport
Target:   Mercury
Protein Sequence:  
MGLMTRIADK TGALGSVVSA MGCAACFPAL ASFGAAIGLG FLSQYEGLFI SRLLPLFAAL AFLANALGWF SHRQWLRSLL GMIGPAIVFA ATVWLLGNWW
TANLMYVGLA LMIGVSIWDF VSPAHRRCGP DGCELPAKRL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn21.1 1695 15421-17115 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercuric ion reductase
Target:   Mercury
Protein Sequence:  
MSTLKITGMT CDSCAVHVKD ALEKVPGVQS ADVSYAKGSA KLAIEVGTSP DALTAAVAGL GYRATLADAP SVSTPGGLLD KMRDLLGRND KTGSSGALHI
AVIGSGGAAM AAALKAVEQG ARVTLIERGT IGGTCVNVGC VPSKIMIRAA HIAHLRRESP FDGGIAATTP TIQRTALLAQ QQARVDELRH AKYEGILEGN
PAITVLHGSA RFKDNRNLIV QLNDGGERVV AFDRCLIATG ASPAVPPIPG LKDTPYWTST EALVSETIPK RLAVIGSSVV ALELAQAFAR LGAKVTILAR
STLFFREDPA IGEAVTAAFR MEGIEVREHT QASQVAYING EGDGEFVLTT AHGELRADKL LVATGRAPNT RKLALDATGV TLTPQGAIVI DPGMRTSVEH
IYAAGDCTDQ PQFVYVAAAA GTRAAINMTG GDAALNLTAM PAVVFTDPQV ATVGYSEAEA HHDGIKTDSR TLTLDNVPRA LANFDTRGFI KLVVEEGSGR
LIGVQAVAPE AGELIQTAAL AIRNRMTVQE LADQLFPYLT MVEGLKLAAQ TFNKDVKQLS CCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn21.1 363 17133-17495 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   secondary regulatory protein
Target:   Mercury
Protein Sequence:  
MSAYTVSQLA HNAGVSVHIV RDYLVRGLLR PVACTTGGYG VFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGAQ AAAQLAVLRQ LVERRRAALA
HLDAQLASMP AERAHEEALP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn21.1 237 17492-17728 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Function:   mercury transport
Target:   Mercury
Comment:   similar to urf-1 in pKLH2 (GenBank AF213017), pKLH272 (Genbank Y08992), pMER610 (GenBank Y08993), pKLH210 (GenBank Y10102), Tn5036 (Genbank Y09025), orf1 in Tn501 (GenBank Z00027), and urf-1 in Tn5041 (GenBank X98999)
Protein Sequence:  
MNAPDKLPPE TRQPVSGYLW GALAVLTCPC HLPILAAVLA GTTAGAFLGE HWGVAALALT GLFVLAVTRL LRAFRGGS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM 5'-end N Tn21.2 671 17725-18395 +
Class:   Passenger Gene
Sub Class:   Other
Comment:   urfM ORF interrupted by insertion of In2
Protein Sequence:  
MTSSQPAGWT AAELAQAAAR GQLDLHYQPL VDLRDHRIAG AEALMRWRHP RLGLLPPGQF LPLAESFGLM PEIGAWVLGE ACRQMHKWQG PAWQPFRLAI
NVSASQVGPT FDDEVKRVLA DMALPAELLE IELTESVAFG NPALFASFDA LRAIGVRFAA DDFGTGYSCL QHLKCCPITT LKIDQSFVAR LPDDARDQTI
VRAVIQLAHG LGMDVIFRRR LHQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA_p N In_Tn21.1 1305 18471-19775 +
Class:   Transposase
Function:   integrase
Transpoase Chemistry:   DDE
Comment:   Contains the first 429 amino acids of tniA (In2)||probably truncated by insertion of IS26
Protein Sequence:  
MLNTRVHQSE VSMATDTPRI PEQGVATLPD EAWERARRRA EIISPLAQSE TVGHEAADMA AQALGLSRRQ VYVLIRRARQ GSGLVTDLVP GQSGGGKGKG
RLPEPVERVI HELLQKRFLT KQKRSLAAFH REVTQVCKAQ KLRVPARNTV ALRIASLDPR KVIRRREGQD AARDLQGVGG EPPAVTAPLE QVQIDHTVID
LIVVDDRDRQ PIGRPYLTLA IDVFTRCVLG MVVTLEAPSA VSVGLCLVHV ACDKRPWLEG LNVEMDWQMS GKPLLLYLDN AAEFKSEALR RGCEQHGIRL
DYRPLGQPHY GGIVERIIGT AMQMIHDELP GTTFSNPDQR GDYDSENKAA LTLRELERWL TLAVGTYHGS VHNGLLQPPA ARWAEAVARV GVPAVVTRAT
SFLVDFLPIL RRTLTRTGFV IDHIHYYADG HCCK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA IS26 705 19822-20526 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MNPFKGRHFQ RDIILWAVRW YCKYGISYRE LQEMLAERGV NVDHSTIYRW VQRYAPEMEK RLRWYWRNPS DLCPWHMDET YVKVNGRWAY LYRAVDSRGR
TVDFYLSSRR NSKAAYRFLG KILNNVKKWQ IPRFINTDKA PAYGRALALL KREGRCPSDV EHRQIKYRNN VIECDHGKLK RIIGATLGFK SMKTAYATIK
GIEVMRALRK GQASAFYYGD PLGEMRLVSR VFEM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
SDR family oxidoreductase SDR family oxidoreductase In_Tn21.1 864 20984-21847 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  WP_000612791.1
Protein Sequence:  
MIPNSENKRV WFITGASKGL GYAFTCAALK AGDKVVAVAR TIDNLAKLEE TYQESLLPLN LDVTDREAVF STVETAVKHF GRLDIVVNNA GIMTMGMIEE
LNESDARKLM DTNFFGALWV CQAVMPYLRS QRSGHIIQIT SIGAIISGPM SGIYSASKFA LEGMSEALAK EAEHFGVKLT MVEPGGYWTD LYTSMSYSNP
LDSYGTLRDE LAKQYSEDSV DSDPSLAAEA LMKLVASNNP PLRLILGSMV YDLAMDTLKA RMATWEEWEA VSRASEKAIP APERYGV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GrpB domain protein GrpB domain protein In_Tn21.1 246 21885-22130 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  GrpB (Pfam:PF04229)
Protein Sequence:  
MKIEIMEYNP DWTKNFEEEK IKLLHFFGSH AVAIEHIGST AIPNQRAKPV IDIFIGVSPF AELPFISAFL MQRSITTLRQ I

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul3 (ARO:3000413) Sul3 In_Tn21.1 792 22599-23390 +
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Target:   sulfone antibiotic (ARO:3003401)||sulfonamide antibiotic (ARO:3000282)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000413
Protein Sequence:  
MSKIFGIVNI TTDSFSDGGL YLDTDKAIEH ALHLVEDGAD VIDLGAASSN PDTTEVGVVE EIKRLKPVIK ALKEKGISIS VDTFKPEVQS FCIEQKVDFI
NDIQGFPYPE IYSGLAKSDC KLVLMHSVQR IGAATKVETN PEEVFTSMME FFKERIAALV EAGVKRERII LDPGMGFFLG SNPETSILVL KRFPEIQEAF
NLQVMIAVSR KSFLGKITGT DVKSRLAPTL AAEMYAYKKG ADYLRTHDVK SLSDALKISK ALG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp IS256 family Tnp IS256 family In_Tn21.1 609 23715-24323 +
Class:   Transposase
Function:   tranposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MEFYPSCIEK GMRSERALKL AIAEMYVKGV STRRVSDIVE ILCGTEVSSS QVSRLAKELD EEITSWKAQP VGQIQYLVLD ATYESVRVGS HVVKQALLVA
IGVDYSGNRH ILDAEVANSE AEVNWRSFLE GLVRRGMHGL RMITSDDHSG LRAAIDAVFP GILWQRCQFH LQQNAHSYVT KKDEIPLIAA DIRKVFNRNM
SR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacL (ARO:3005098) QacL In_Tn21.1 333 24570-24902 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   quaternary ammonium salts
Sequence Family:  small multidrug resistance (SMR) antibiotic efflux pump (ARO:0010003)
Comment:   subunit of the qac multidrug efflux pump||strict match to reference sequence for ARO:3005098 (bitscore: 202)
Protein Sequence:  
MKNWLFLAIA IFGEVVATSA LKSSHGFTKL VPSVVVVAGY GLAFYFLSLA LKSIPVGIAY AVWAGLGIVL VAAIAWIFHG QKLDLWAFVG MGLIVSGVAV
LNLLSKVSAH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA (ARO:3002601) AadA In_Tn21.1 792 25072-25863 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   aminoglycoside nucleotidyltransferase
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   perfect match to reference sequence for ARO:3002601||Synonyms: aadA1-pm aadA, aadA1, aad(3'')(9)
Protein Sequence:  
MREAVIAEVS TQLSEVVGVI ERHLEPTLLA VHLYGSAVDG GLKPHSDIDL LVTVTVRLDE TTRRALINDL LETSASPGES EILRAVEVTI VVHDDIIPWR
YPAKRELQFG EWQRNDILAG IFEPATIDID LAILLTKARE HSVALVGPAA EELFDPVPEQ DLFEALNETL TLWNSPPDWA GDERNVVLTL SRIWYSAVTG
KIAPKDVAAD WAMERLPAQY QPVILEARQA YLGQEEDRLA SRADQLEEFV HYVKGEITKV VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
cmlA6 (ARO:3002696) CmlA6 In_Tn21.1 1260 25956-27215 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   phenicol antibiotic (ARO:3000387)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   strict match to reference sequence for ARO:3002696 (bitscore: 819)
Protein Sequence:  
MSSKNFSWRY SLAATVLLLS PFDLLASLGM DMYLPAVPFM PNALGTTAST IQLTLTTYLV MIGAGQLLFG PLSDRLGRRP VLLGGGLAYV VASMGLALTS
SAEVFLGLRI LQACGASACL VSTFATVRDI YAGREESNVI YGILGSMLAM VPAVGPLLGA LVDMWLGWRA IFAFLGLGMI AASAAAWRFW PETRVQRVAG
LQWSQLLLPV KCLNFWLYTL CYAAGMGSFF VFFSIAPGLM MGRQGVSQLG FSLLFATVAI AMVFTARFMG RVIPKWGSPS VLRMGMGCLI AGAVLLAITE
IWALQSVLGF IAPMWLVGIG VATAVSVAPN GALRGFDHVA GTVTAVYFCL GGVLLGSIGT LIISLLPRNT AWPVVVYCLT LATVVLGLSC VSRVKGSRGQ
GEHDVVALQS AESTSNPNR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA2 (ARO:3002602) AadA2 In_Tn21.1 780 27477-28256 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   strict match to reference sequence for ARO:3002602 (bitscore: 520)
Protein Sequence:  
VTIEISNQLS EVLSVIERHL ESTLLAVHLY GSAVDGGLKP YSDIDLLVTV AVKLDETTRR ALLNDLMEAS AFPGESETLR AIEVTLVVHD DIIPWRYPAK
RELQFGEWQR NDILAGIFEP AMIDIDLAIL LTKAREHSVA LVGPAAEEFF DPVPEQDLFE ALRETLKLWN SQPDWAGDER NVVLTLSRIW YSAITGKIAP
KDVAADWAIK RLPAQYQPVL LEAKQAYLGQ KEDHLASRAD HLEEFIRFVK GEIIKSVGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
DUF1010 family protein DUF1010 family protein In_Tn21.1 291 28274-28564 -
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  DUF1010 (Pfam:PF06231)
Protein Sequence:  
MFIQTAFSFS GVIQCLFCLF SGLRLHGLRR FSVFLASSPC VASASSYRFC SAVPPRWRSV FSRLAPVAKF KLSVLASGSN ISVKPTRILR SAYLAR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
dfrA12 (ARO:3002858) DfrA12 In_Tn21.1 498 28676-29173 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Target:   diaminopyrimidine antibiotic (ARO:3000171)
Sequence Family:  trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218)
Comment:   100% identity with reference sequence for ARO:3002858 (bitscore: 339)||Synonyms:
Protein Sequence:  
MNSESVRIYL VAAMGANRVI GNGPNIPWKI PGEQKIFRRL TEGKVVVMGR KTFESIGKPL PNRHTLVISR QANYRATGCV VVSTLSHAIA LASELGNELY
VAGGAEIYTL ALPHAHGVFL SEVHQTFEGD AFFPMLNETE FELVSTETIQ AVIPYTHSVY ARRNG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In_Tn21.1 1014 29318-30331 +
Class:   Integron Integrase
Sub Class:   Class 1
Function:   Integrase
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LPTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpM TnpM Tn21.1 351 30534-30884 +
Class:   Accessory Gene
Sub Class:   Inhibitor
Function:   transposition regulator; reported to enhance Tn21 transposition and suppress resolution of cointegrate replicons in vivo
Comment:   3'-end of urfM ORF, which is interrupted by insertion of In2||inhibits tranposition probably by inhibiting resolution
Protein Sequence:  
MEVVAEGVET PDCLAWLRQA GCDTVQGFLF ARPMPAAAFV GFVNQWRNTT MNANEPSTSC CVCCKEIPLD AAFTPEGAEY VEHFCGLECY QRFQARASTA
TETSVKPDAC DSPPSG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR 5'-end N Tn21.1 23 31010-31032 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   tnpR ORF interrupted by IS26 insertion
Protein Sequence:  
MTGQRIG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA IS26 705 31104-31808 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MNPFKGRHFQ RDIILWAVRW YCKYGISYRE LQEMLAERGV NVDHSTIYRW VQRYAPEMEK RLRWYWRNPS DLCPWHMDET YVKVNGRWAY LYRAVDSRGR
TVDFYLSSRR NSKAAYRFLG KILNNVKKWQ IPRFINTDKA PAYGRALALL KREGRCPSDV EHRQIKYRNN VIECDHGKLK RIIGATLGFK SMKTAYATIK
GIEVMRALRK GQASAFYYGD PLGEMRLVSR VFEM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR 3'-end N Tn21.1 538 31861-32398 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   tnpR ORF interrupted by IS26 insertion
Protein Sequence:  
YQGQHLRPEP GTATGRRQG* SRF*RQGIRQ GCQASATGSA DKLRPHRRHR GGA*HGSPGA QSR*FAPDRA NADTTRRAYR IRQGTPQFYW RRLSDGEPDA
LGDGRVRRVR ARPDPRASAR GYCARQATRG LPWQEEIPVV *AYCRTAPTC RGWRAKDQAC S*IRNQSRNP VSILENGSV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn21.1 2967 32401-35367 +
Class:   Transposase
Function:   transposition, DNA-mediated (GO:0006313)
Transpoase Chemistry:   DDE
Comment:   identical to TnAs3 tnpA
Protein Sequence:  
MPRRSILSAA ERESLLALPD SKDDLIRHYT FNDTDLSIIR QRRGPANRLG FAVQLCYLRF PGVILGVDEL PFPPLLKLVA DQLKVGVESW NEYGQREQTR
REHLSELQTV FGFRPFTMSH YRQAVQMLTE LAMQTDKGIV LASALIGHLR RQSVILPALN AVERASAEAI TRANRRIYDA LAEPLADAHR RRLDDLLKRR
DNGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPT GIERLVHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LATEGMATVT DEIIDLHDRI
LGKLFNAAKN KHQQQFQASG KAINAKVRLY GRIGQALIDA KQSGRDAFAA IEAVMSWDSF AESVTEAQKL AQPDDFDFLH RIGESYATLR RYAPEFLAVL
KLRAAPAAKN VLDAIEVLRG MNTDNARKLP ADAPTGFIKP RWQKLVMTDA GIDRRYYELC ALSELKNSLR SGDIWVQGSR QFKDFEDYLV PPEKFTSLKQ
SSELPLAVAT DCEQYLHERL TLLEAQLATV NRMAAANDLP DAIITESGLK ITPLDAAVPD TAQALIDQTA MVLPHVKITE LLLEVDEWTG FTRHFTHLKS
GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHTRDET YSTALAELVN AQFRHPFAGH WGDGTTSSSD GQNFRTASKA KSTGHINPKY
GSSPGRTFYT HISDQYAPFH TKVVNVGLRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYIPKGD AAYDALKPMI
GGTLNIKHVR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI
RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAAHALRGN GHAVDDSLLQ YLSPLGWEHI NLTGDYLWRS SAKIGAGKFR PLRPLQPA

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS26-MH257753 IS26 Insertion Sequence 6027-6846 820
Tn10_p-MH626558 Tn10 Transposon 6842-12907 6066
IS10_p-MH626558 IS10 Insertion Sequence 6847-7199 353
IS1R-J01730 IS1R Insertion Sequence 12907-13674 768
Tn21.1-MH257753 Tn21.1 Transposon 13733-35400 21668
In_Tn21.1-MH257753 In Integron 18366-30533 12168
IS26-MH257753 IS26 Insertion Sequence 19759-20578 820
IS26-MH257753 IS26 Insertion Sequence 31041-31860 820

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat i4 Tn5053 10-28 TCAGAAAACG GAAAATAAA
IRt In_Tn21.2 4634-4666 TGTCATTTTC AGAAGACGAC TGCACCAGTT GAT
repeat t1 In_Tn21.2 4642-4660 TCAGAAGACG ACTGCACCA
repeat t2 In_Tn21.2 4682-4700 AACACGTCGG TCGAGGACT
repeat t3 In_Tn21.2 4711-4730 TCAGAAGTGA TCTGCACCAA
repeat t4 In_Tn21.2 4743-4761 TCAATACTCG TGTGCACCA
IRL IS26 6027-6040 GGCACTGTTG CAAA
IRR IS26 6833-6846 AAACGTTGTC ACGG
IRR IS10_p 7178-7199 TTTAATACTC CCCTAGAGAG TC
IRL IS1R 12907-12929 GGTGATGCTG CCAACTTACT GAT
IRR IS1R 13652-13674 TAGTTATTCA ACCTCAGTAA TGG
IRL Tn21.1 13733-13770 GGGGGCACCT CAGAAAACGG AAAATAAAGC ACGCTAAG
repeat i4 Tn21.2 13742-13760 TCAGAAAACG GAAAATAAA
IRt In_Tn21.1 18366-18398 TGTCATTTTC AGAAGACGAC TGCACCAGTT GAT
repeat t1 In_Tn21.1 18374-18392 TCAGAAGACG ACTGCACCA
repeat t2 In_Tn21.1 18414-18432 AACACGTCGG TCGAGGACT
repeat t3 In_Tn21.1 18443-18462 TCAGAAGTGA TCTGCACCAA
repeat t4 In_Tn21.1 18475-18493 TCAATACTCG TGTGCACCA
IRL IS26 19759-19772 GGCACTGTTG CAAA
IRR IS26 20565-20578 AAACGTTGTC ACGG
repeat i4 In_Tn21.1 30414-30432 AGGAGGGACG CAGGCGACT
repeat i3 In_Tn21.1 30442-30460 CGTCGGGCAG CAACGGACT
repeat i2 In_Tn21.1 30484-30502 ATCACGTCAG CCGAAGACT
IRi In_Tn21.1 30501-30533 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT
repeat i1 In_Tn21.1 30507-30525 GTCACGTCGG CAGAAGACT
IRL IS26 31041-31054 GGCACTGTTG CAAA
IRR IS26 31847-31860 AAACGTTGTC ACGG
IRR Tn21.2 35360-35400 GCCGAATCGC ACGAAATAAA AGGCAAAAGA CTCTGCTGGG G