Transposon
Name: Tn5501.8
Family: Tn3
Group: Tn3000
Evidence of Transposition: no
 Host     

Host Organism: Comamonas sp. 7D-2
Molecular Source: plasmid pBHB
Place of Origin: Jintan, Jiangsu, China
Date of Isolation: 2013
Other Geographic Information: bromoxynil octanoate-contaminated soil sample from Repont Pesticide Factory

 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp): GGGGTTCTAAGCCGGAACCGCCGAAAATTCCGTCAGCC
IRR (Length: 38 bp): GGGGTTCTAAGCCAGAACCGCCGAAATTTCCGTCATCC
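
The two inverted repeats are imperfect copies of each other. The sketch below compares them base by base and derives the bottom-strand form of IRR, which is how the right end appears in the top-strand sequence listing. The helper names (`reverse_complement`, `mismatch_positions`) are illustrative and not part of the record.

```python
# Inverted repeats copied from this record (both 38 bp).
IRL = "GGGGTTCTAAGCCGGAACCGCCGAAAATTCCGTCAGCC"
IRR = "GGGGTTCTAAGCCAGAACCGCCGAAATTTCCGTCATCC"

# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


def mismatch_positions(a: str, b: str) -> list[int]:
    """Return 1-based positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


mismatches = mismatch_positions(IRL, IRR)
identical = len(IRL) - len(mismatches)

# IRR is listed outside-in; on the top strand the right end reads as its
# reverse complement.
right_end = reverse_complement(IRR)

print(f"mismatch positions: {mismatches}")
print(f"identical bases: {identical}/{len(IRL)}")
print(f"right end on top strand: {right_end}")
```

The IRL string should match the first 38 bases of the sequence listing, and `right_end` should match its last 38 bases.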

 Sequence     
DNA Sequence Length: 27892 bp
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTCTAA GCCGGAACCG CCGAAAATTC CGTCAGCCGA TCAACGTGGC TTGTCCCGCG CCCGGTCGAT GGGGTAGACC CAACAGTCGT GTCACTAGCC 100
GCCATTTCGA TCACGGCAAT GCCAGCCGGA CGTCACGTCC AGATTGTTCC GGTCTGGATG AGGCCGACTG ACGTCTCGGA TGACGGGTGG CATACAACTG 200
CTGTGAGTCC TGCAGGGGGG CAGCTGCCTG ACCGGACGGC GAGCATCAGC CCATCTCATG TATTAGTCAT GTCAGCTTTG ACACTGCGCA CGCGACGGCA 300
CCCGACCCGA AGGAGACAAC GGAGTTTTTC AAGCCGAAGG ACAGCTCCAC CCTTCAAGCT GACGCTCAGT TAGTCCATAG AGGGATTGGC GCTCCACAAC 400
AGCGGTCGCT CTGACCTTGC CCTTGAATTG TGAATCCCAT AACCGAGCTC ACAGATTCAC TGTTTTCGGT TGATCCGCGA GCCAGTGTTG CTCGTATTGC 500
ATCGGGCTCA TGTAGCCCAG CGTCGAATGC AGCCTGCGGC TGTTGTACCA GACCAGCCAG TCGATGACCT CGTCCTTGGC CTGCCGGCGG GTCTCGAGCC 600
GAACTGCGTT TCGCTGCACG CATTGTCCCA GCAATTGCCC TTTCGGCTCA TCGAACTGCG CATGCCCCAG CCCTTGAGCA GGTCCTGGAA TTCCCCGCTG 700
GCGTACGGCG CAGTACCCAT CTATTCGATG GCCTTGCCCA TGTCCTCGAC CACCTTCTTG GCGTCGCCGA ACACCATCAT GGTCTTGTCC ATGTAGAACA 800
GCTCGTTGTC CAGCCCCGCA TAGCCCGCGG CCATGCTGCG CTTGTTGACG ATCACGGTCT TGGCCTTGTA GGCCTCCAGG ATAGGCATGC CGTAGATCGG 900
GCTGCCCTTT TGCAGCGCCA CGGGGTTCAC CACGTCGTTG GCGCCCAGGA TGATGGCCAC GTCGGCCTGG CCGAACTCGC CGTTGATGTC CTCCATCTCG 1000
AACACCTGGT CGTAGGGCAC CTCGGCCTCG GCCAGCAGCA CGTTCATGTG GCCCGGCATG CGCCCAGCCA CCGGGTGGAT CGCGTACTTG ACCGTCACGC 1100
CCTTGTGGGT GAGCTTCTCG GCCAGCTCGT TGACCGCATG CTGCGCGCGC GCCACCGCCA GGCCGTAGCC GGGCACGATG ATCACGGTCT CGGCGTTCCC 1200
GAGGATGTAG GCCGCGTCGT CGGCGCTGCC GCTCTTGACC GGGCGCTGCT CGCCGCTGCC GGCCGCCGCC GCGCTGGTCT CGCCGCCGAA CCCGCCCAGG 1300
ATCACGCTGA AGAACGAGCG GTTCATGGCC TTGCACATGA TGTAGGACAG GATCGCGCCG GAGCTGCCGA CCAGGCTGCC GGCGATGATC AGCATGGAGT 1400
TGTTCAGGCT GAAGCCGATG CCGGCGGCCG CCCAGCCCGA GTAGCTGTTG AGCATGGACA CCACCACCGG CATGTCGGCG CCGCCGATCG GGATGATGAT 1500
GGTCACGCCC AGCACGAAGG ACAGGGCCAG CATCAGGAAG AAGGCGCTCC AGTTCTCGGT GAAGGTGAAG ATCAGGCCCA GGGCGATGGT GGCCAGGCCC 1600
AAGGCCAGGT TCAGCATGTG CTGGCCCGCG AACTGCACCG GCGCGCCGCG GAACAGCCGG AACTTGTACT TGCCCGAGAG CTTGCCGAAG GCGATCACCG 1700
AGCCGCTGAA GGTGATGGCG CCGATGGCCG CGCCCAGGAA CAGCTCGAGC CGGTTGCCTG CGGGGATCGG GCTGCCCTTG GCCGTGATGC CGAAGGCCCA 1800
GGGCTCGGCC ACGGCGGCGA TCGCGATGAA CACCGCGGCC AGGCCGATCA TGCTGTGCAT GAAGGCCACC AGCTCGGGCA TCTTGGTCAT CTCCACGCGC 1900
TGGGCCATGA CGGCGCCCAG GCCGCCGCCG AGCAGCAGAC CGACCAGGAC CCAGACCATG CCCTGCGCGG CGCCGCCGGC GATCTCCACG ATCAGCGCGG 2000
CGGTGGTCAG GATGGCGATC GCCATGCCGC TCATGCCGAA CAGGTTGCCG CGGATCGAGC TGGTGGGGTG GGACAGGCCC TTCAAGGCCT GGATGAAGCA 2100
GATGCTGGCG ATGAGGTACA GCAGGGTGAC GAGGTTCATG CTCATTTCAC TTGCCCTCCG CGTTCGACTT CTTGTCCTTC TTCTTGAACA TCTCGAGCAT 2200
GCGGCGCGTG ACCAGGAAGC CACCGAACAC GTTGACGGCG GCCAGGGCCA CGGCGAGCAC GCCCATGGCC TTGCCAAGAC CGGTTTCGGT CAGCGCGGCG 2300
GCCAGCATGG CGCCGACGAT GACGATGGCC GAGATGGCGT TGGTGACGGC CATCAGCGGG GTGTGCAGCG CGGGCGTGAC GTTCCAGACC ACGTGGTAGC 2400
CGACGTAGAT GGCGAGCACG AAGATGATCA GGTTGACGAG GGTGGGGGAG ACGATGTCCA TGCTTATTTC CTCTTGACTT CGCCGCCCTG GGTCATCAGG 2500
CAGGCGGCGA CGATGTCGTC GCTCGGGTCG ATGTTGAGCG GGCCTTCCTT CGGTAAAACG AGCTTGAGGA AGTCGAGCAC GTTGCGCGCG TAGAACGCGC 2600
TGGCGTCGGC GGCCACGAGC GCGGGCAGGT TGGTCTCGCC GACCAGGGTG ACGCCATGCT TGATCACGGT CTTGCCCGCC TCGGTGAGCG GGCAGTTGCC 2700
GCCCTGGGGA GCCGCCAGGT CGACGATGAC CGAGCCGGGC TTCATGGACT TGACCATCTC CTCGGTGACC AGCACCGGGG CGGCGCGGCC GGGGATCAGG 2800
GCGGTGGTGA TGACGATGTC GGCCTGGGCG ACGCGCTTGG CGACCTCGGC CTTCTGGCGC TCGAGCCAGC TGGCGGGCAT GGGGCGGGCG TAGCCGCCCA 2900
CGCCCTCGGC GGCGTCCTTC TCCTCCTGGG TCTCGTAGGG CACGTCGATG AACTTGGCGC CCAGGCTCTC GACCTGTTCC TTGACGCTGG GGCGCACGTC 3000
GGAGGCCTCG ATGACGGCGC CCAGGCGCCG GGCGGTGGCA ATCGCCTGCA GGCCGGCCAC GCCCACGCCC AGGATGACCA CGCGCGCGGC CTTGACGGTG 3100
CCGGCTGCCG TCATGAGCAT GGGAAAGAAG CGCTGGTAGC GGTCGGCGGC GATCATGACG GCCTTGTAGC CGGCGATGTT GGCCTGGGAG GACAGCACGT 3200
CCATGCTTTG CGCGCGGGTG GTGCGCGGGG CGGCTTCGAG CGCGAAGCTG GTCAGGCCGG CGGCGGCCAG GCGATGCAGG CCCTCGGCGT CGAAGGGGTT 3300
GAGCATGCCG ACCACGGCGG CGCCGGGTTT CATGAGCGCC AGTTCCTCGG CCGAGGGCGC GCGCACCTTG AGCACCAGCT CGCAGCCGAG GGCGCCGGCC 3400
GCGTCGCAGA TCTCGGCGCC CACCGCCTCG TAGGCCGCGT CGGGCGCGGC AGCGGCCACG CCGGCGCCCG TCTGCACGCG CACCACATGG CCCTGGGCCT 3500
TGAGTTTCTT GGCCGTCTCG GGCGTGACGG CAACACGGGT TTCACCCGCC TCGGTCTCGG CGGGCACGCC AATCTGCATA CGTGTCTCCT TGCGTTTAGA 3600
TAGAAACTTC AGCATCCTCG ACTGCGGTGG TCACCGATCG AGCTTGGGCA AGGATTAGAG GTTGAGAACG ATCTCGCCTT CGGCTGTGGC AGCTCTGGAG 3700
CAGCACGTAA TCATCTTCAC CTTGCGCTGC TCCTTGCTTA GAACAAAATC CCGATGATCC ACTTCGCCGG AGAGATATGG CACAGCGCAG ACGCCGCAGA 3800
GACCGTCACT GCACTTCACG TCTACTTGAA GACCCGCGCT CGCAAGCGCC TGCACGGCGC TTTGCGTAGC ACTCACCGGG ATGACTTGAC CCGTGCTGTT 3900
CAGACGAAGC GTAAACGGGA GTTTCTCCCG CTCGTCCATC TCAGGTACGG AAAAGTACTC TCGATGCAAG GAACTCTCGT CCCACCCGAA GGCGGCGGCC 4000
GTGTCAAACA CAGCGTCCAT GAAAGCTGCA GGACCACAGG TGTAGACGTG ATTACCAGGA ACATAGTCAC GCAGCACGTC GGCAATATCT AACCTTCGTT 4100
CATCGCTAAA GTGGAAGTGG ACGCGGTTTG ACCACGGTAC TGACTCGAGT TCCTGGATGA AGGCGGCCTG GGCCCGCGTC TTTGCTTTGT AGTACAAAAC 4200
AAAGTCTTTC CCGGCGGAAT GGAGTTCATG TCCCATCGCA ATGAGCGGAG TGACTCCGAT GCCCCCGGCC AGGAGAAGGT GCCGGCGCGC ATCCTGCATC 4300
ACAGGGAAGT GGTTTCTTGG CGCGGAGACG ACGACTGGAA CTCCAGGACG AAGCATTTGG TGGATCTTCA ATGAGCCACC CCTGCCTTGA TCCTCCCTCA 4400
GAATGCCAAG TACATACTTG CTGCGATCAG CAGGATCTCC CGCCAGGCTG AACTGCCGGA TGAACTCGGG CGTGATCGTG ATGTCGATGT GTGCACCAGC 4500
GTCGAACTTC GGCAAGGGCG ATCCGTCGCT GGAGACGATT TCGAAGATGT CAATTTTTCC GTCCTTCGAG CTTGCCCAAC GGCGGCTGAT TTTTGCGACG 4600
ATGACAGGCG GCTCTTCTGT CAGGCGCCGC AAGCCTGGTA CCAGGTTGTC CGTGTCGCCG GAGGAAATGC GCGTTTTGTA CGCGTCTGGG GTCAGAAGAT 4700
TCTTGTATGC ATCTATCCCT GCTTCGCGAT CAACTCTGCT CGGCATGGGC AAGGGCAAAG GTGCGAGGGG CGCTGGGTAA GCGGCGAGCG TTTGATCCTC 4800
ATATTTCAAC CGAATATGCT TGTTGAGTCC CCGGACATTT ACCTCCTTCG CGGCGACCAC GCTTCCGTTG CGCTCGGTCG TGATGTCCCA CCACCACTTC 4900
TTCCGAGGGT TAATCTCTCC CCGATTAAGG TAGTCGTCAA GCTTCGCCAG CCACTGTGCA GCGATAGGAA GATTCATCGC GGCCCACCTG AAGGGCTTTT 5000
CCTGAACAAG TCCTGCAAGA TTCCAGGGAC ACGCTTTCAT GCACCGGCCG CACATTGCGC CTGCCAAATT TGTCATCCGG TACCGCGTGC ATTTCTCGAC 5100
GTCGGCCTTC CAGATCTCGT AGCCGTTGAA CATCACTTTC GGTCCGGCCG AGATCGCGCC TGAAGGACAC TCGCGGGCGC ACTTGTTGCA AGATTCGCAG 5200
AAGTTCTGAA GTCCAAAGTC AATGGCCTTG TCCGGCTCGA TGGGGAAGTC TGTGGTCATT ACCCCCGTCT TGCTTCTCGG ACCGAGGAAC GGATTGAGAA 5300
TGGTCTCGCC AATCCGGCTC ACCTCTCCAA GACCTGCAAG CAGAACGAGC GGTGTTTGGA TCACGTCACT ATCCGCCGCC GAGTGAGCCG TGGCCCCGTA 5400
TCCAAGGCGT CGGAGATGAT CTGCAAGCAC GCCACAGACC AAGCCCGCAC GCATGTAGGT CCGCATGGAT TGCGCGCTAG AGATCCAGTC GTCGCCGGAC 5500
GCACCTTCCA TGGTCTCAAA GCCTTGGTCA ATCATCACAG TGGCTGCAAA CGGATGCGCC ACGGGCATCT CTGTGCCATC CTGACGGTGC GAATACCATA 5600
CCCAATCAGG TGCCCGCGAA ACACCCGCAA GGTCCGCGCC CAGAAAGTGC AGGGTCGCTT TGACCAGGTC CGCATTGTTC TTGCGGTCTG GGAACTCAAC 5700
CTTCAGTTTG GCGGGCTCGC CTCGCTGGAG GACTGAATAG ACATTAAGTG CAAGTCGCAA CGCCGCAGCC AAGGGGGCTT TGGCCACGGA ATATCCGTCT 5800
TTTGAGGCTT CTTGGGGAGC TTTGCCGAGA TCGCCGAACG CCGCCCGCAG GAAAAATTCG CTGCTTTTGG GGACACGCGG AATGCGCGCC TCGTCGATAA 5900
AGGTCGTAGT CTCTTCGACC CGCTTGAGCG ACTCGACGGG AAAGCGGCTA TACCGATACT CTCGTGCGCC GAACTCACGG GCGTTCCATT TATTCTTCGG 6000
GCTCGACCAG CCCAGCCACC AACTTGGACC CTTGGCTCTG AAGGATTCCA GCAAGGACGG CTGCGCCAGC GGAAGGTCTG GGTCGAGTGC CATATCCGTT 6100
GTGACGGCTG CGAGCACGAA TCTGTCGCCC ACAAACGGGT TTAAGACACG TCCGTGGTCA ACACGTGCCA GCCCGGACGA CACAGCAAGC CTGAGCAGGT 6200
TCACGTCAGT CGTTGTCGCA GTGTGGCTCC TGGCGCTATA ACCAAGAATT CGAAGGTAGT TGGCCAGTAC CGCGGCGGTC TCTGCTGCCC TCAGCGCGGC 6300
GCGCCACTGC TGCATTCCCA CAATCCAGTC TGCCCCTGCC TCCTCGGGCT GGGGATCGCG CGGAAACTCG GCCATCAAAA CAAGCGCGTG GGAGTGATGC 6400
TCCACGCCTT GCTCGCAGAG GTCCATTGAC CGCTTCATCT GTCGCAATAC TGCGTTCGGG TTGAAGCGAA GCCTCAGCTT GGTCTCGGAT TGCTCGTAGG 6500
TTGCGGTGCG AAAGGTCGGA TGCTCGATAC GGAGATCGAG CAAATGCTCG CGATTCAGTT CGCTGACACC GGCAATTGAC GTATCGAGAA AGTACGCGGC 6600
CGACTTGAGA TGGCGTGAGC GTTCCATCGG GTCCGACGGA ATTTCACCTC GCTGCTTGTT GTGGTCTCCC TCCCTTACCT GGTCGAGAGC GCACATGAAT 6700
CTGCTGATGG AATGTGCCAG CGTGGATTCC GAGTCGTTTG TCGGAGGCGG TTGAAGGGCA CCCGTCTCGT GGATCGACTG GTGGCTACTC GAGCTCGCTT 6800
TGCGGGAAAG CCGTTCACTG GGGTATGCGC CGTAATGCAT GGCGCGCTGG GTGTGAGGAA AAATTCGCAT CTGAAGCCTC TGACTTTCCT GGCTATTCCC 6900
TAATCCCACG AAGCTTCACG ATCGCGGCGT AGCGGGCGCT ATCACGGCGC AGCTGATCTG CGAGGTCAGC GGGGCTAAGA GGAGTCGGCT CGGCGCCAAG 7000
GTTTCGAATG GTCTGGATGA CCTGAGGGCT TGCCAACTGC TTGTTGATTT CAGCGTTCAT GCGCTCCACG ATCGCGGCCG GCGTTTTTGC CGGAGCGTAG 7100
ATGGCATGAG TGGTGCCTGA GTCGAAGCCT TTCAGTCCGA GTTCCTGCAA TGTGGGCGTG TCAGGAAACA GCGGCGATCG CTTCGTGCTG CCAACAGCCA 7200
ACAAACGGAG CTTTCCCGCC TTGACGTGCT CGAGGCCAAT TCCGGGGTCA AACGCAAACT GGACGACTCC TCCGAGCAGA TCCTGCAGCG CGGGCGATGA 7300
GCCGCGATAG GGTATATGCA CAGCGGTAAT GCCTGTTTCC GCCATCATCA TCTCCGCTGC CACGTGAGGC GCACTTCCAT TGCCAGGGCT GGCATACGAG 7400
AGACGGCCGG GCCGCGCCTT CGCTGCCGCC ACGAGCTCAG AAAAGGACTT GAAGTCCTGA CGTGCATCAC TCACCAGGAA GAGCTCGATT CGAGCAGTGC 7500
TCGCAACAGG CACGAGATCC TTCATCGGAT CGATTGGCAC CTTCGGAAGC GTGTGCACCG AAATCGTCAC GGAGCTACCC GACGTCATTA GAAAGGTGTA 7600
TCCGTCCGCA GGTGATCGGG CGACAGCTTC AGCGGCAATC ATTCCCGTCA CGCCGGCGCG GTTCTCCACC ACTACGGGCT GCTTCAGGGC TTCCGCAAGT 7700
CCAGGGGCGA GAGCACGCCC GATGACATCA GGCGAACTCC CGGGCGGGAA TGCGACGATG AGCTTGATTG GCTTGGACGG CCAGTCACCG GAGGCCCAGG 7800
CCGTGGCTTG CAACGCCAGC GTTGACAACG GGATCAGGCA TGTGACCAAA GCGCGACGGC CGATGCTGAG ATAGGAAAAG CTCATCTATG TCTCCAGTGT 7900
TCGTGTGGTA CGCGTGTGTT GCATTAGCTT GCTGTTGACG CCGGATTTTT GACGCCTTCC AGAACGAGAA GCCCGTAGGC AGTGTTGGAC GCAGGAACGT 8000
GATAGAACCG ATGACGGATG TCCACATCGT CGGTCAGTGC TCCGCGCATG ACGAGCCACA TCACGCACTC CAGTCCCTCT GGGGCCGCTT CCCTCAACAC 8100
CTGGAGATGG GGCATGGCCG CGGGCGTAGC TGGACGGTCG ACAAACGCAT CCAGCCATGC GTTGTCGAAT TGCGCATTGA TCAGGCCCGC TCGCTCGCCC 8200
TGCAATTGAC GCGACAAGCC CGTTCGCACG CAGCACGTCC ACCGGGGTTT GCTCGAGGAC ACCAGCGCGG ATCCGGGATT TGACATAGTC CCGGCTTTGT 8300
CTCTCGAGGA CGACACTGTC AATCTTTCGT CGCGCGAGGA TTTCGGAGGG GAGCACGCCC GCCGGCCCCG CGCCGATGAT GGTCACTTGT GTTTTCATTG 8400
GAGGCAATTA CTTTGTTGCA GATGCTCGTC ATCCTAGGGA GGACAACGAC CTTTGTGAAG GCAGCGATCA GCTATCACGA TCGCTGTTCT CTATCAGTCG 8500
TCGGCGCGCC GCCCCAGCAT AATTTGCCTA TGAATCGATC TGATATCAGG GCACTAAAGC GGCAGGTTCT CGGCGAGGAC TCACCCGAGG GCATCGTTGC 8600
GGCGCGCAGT GCGAGCATTG CGTTGCTGGA GCGCAGTATC CGGTTTGGAC ATAAGAAGCT GGAACTTAGA CGTCTAAGCG GCGCCGTCGC GCTCGGCGCG 8700
AACGTCACTG CTGAACACTT CCAATATTGC AGGGAAGTCG TTAGGTCTGG GACCGCCGCA CGCCCGAGCA ATCAGCGCGG CCTTGATGAG ACCCACCATC 8800
CGTAGTACGT ATGCGCATCT TTCGATTCTG CGGATTCTTC CGACTGATCA AACTCTGGGT TGCAGGTCGG GGGGTGATCC CATTTAGCGC GCTCGGATGT 8900
TTGTTCACCT GCTTGGGAAG CTCACTGGCG AACCAACTTG TCCTCTCGGG CCGAGTGCAG TCAATACAGG AGTTCGTAGT GAGGTTCGGT GTGGGCTACG 9000
CTGGGGCTGT ACTTGTCGGC TGTGCAGCGC TGCCGATTCT GAACGCGCTG CTGCGCAATC TGCGACAAGA CGATGCGTAG GCCAACGCGT CATTTTGATC 9100
TGAACCTGCT GCGCTAGTAT TACTGTGTTA TAAATCAATG AGGTATCCAC ACTTCTTCAA GTGAAGGATG AGCCTTGATG GACGAGCTGC AAGAGTTTGA 9200
TCGATACATG GCGCATCTAA GAGCTATCGG CATAGGGGCC CCCAGCAATA GGAGGCCCCC CTGCCACACC ACCGGGCATG CGGGCCCGCA CCCGGCGGTT 9300
CGAGAGTTTG AGGTCATGAG AGCCGGGGTA TCCCCATCCG GTCGAAGCAG GCAACGGTCA GCACACTGTT GAGCAACATG GCGCTGTTGC GCCACCAGCG 9400
GCGGCCATAC AGCGCCACCC GTTTGGCAAC ATCATCATCC GCACCAAGCG CTTTGAGTTC CCGGTAGATC GTCCGTGGCC GCTTCCAATG TTTGAGCTGA 9500
ATCGCTCGTA GACGGTGAGC GGCTGGCCGT CCCACCTTGA GATGCGTGGC TGAGACGTGG ACGATGCTGT CCATCTAACT TAGATGGACA GCTATGCACG 9600
TCAATCAGAA GCCGCGCCGG CGGCACAGCG AGCAGTTCAA GGCGCAGGTT CTGGCCGCGT GCGCCGAGCC CGGCGCGTCG GTATCGGCGG TGGCGTTGTC 9700
GTTCGGGGTG AACGCCAACC TTGTGCACCA ATGGCGCCGG GGCCGCGGCT TCAAGGCGGC CCGAACAGTG CCGCCGTGCC CGGTGATCGA GCCGGCGCCG 9800
CGGTTCGTTG CGCTGTCGTT GCCGGCGCCG ACGCCCGCGC CATCTCCTGC GGCGGGCGCA CCCGCTCCAG CGATCCGCGT GGAGCTCAAA CGTGGCGCTC 9900
TGGGCGTGAA CGTCATCTGG CCGATCGCGG CGGCCGGCGA CTGCACCGCC TGGCTGCGCG AGTTGACGAC GGACCTGCTG AAGTGATCCG CATCGATCAG 10000
CTGTGGTTGT GCACCGCGCC GATGGACATG CGCGCCGGCG CCGAGCGACT TCTGAGCTGC GTTGTGCAGA CCACCGGCGC TGCCCACGCG CACCATGGCT 10100
ACCTCTTCGC CAACGCGCGC GCCACGCGGA TCAAGCTGTT GGTGCACGAC GGGTTCGGCG TGTGGTGCGC CGCGCGGCGC CTGAACGCGG GCCACTTCGC 10200
GTGGCCGCGC GAGGCGGCGG CCACGCCGCT GTCGTTGACG CAGGCGCAGT TCGATGCATT GGTCGTGGGC CTGCCATGGC AGCGCCTGCC CGAGATGAGC 10300
GTGATCACGC GGCTGTGAGC GTCATGGCGG TCTGCTCGCG ACGCTCGTTC GCGCGTTGAC AGGAGTTGTG CCAATGGCGA GAGTCGGGTC GCATGGGCAT 10400
GATGCGAGGC ATGCTCAGCA TGCGCGATCT CAAAGCTCAG GACCTGCAAG GTCTGTCGCC CGAGACCGTC ACGGCGCTGG CCGCGCAGAT GCTCGAGCGC 10500
ATCGAGCAGC AGGCGCAGGA GATTGATCTG CAGCGGCGCG ACCTCGAGGT CAAGCAGAAG CTGATCGAGC GCAAGGACCG CGACATCGCC TGGCGCGACG 10600
CAAAGCTGGA GAAGGTCAAC TTCGAGCTGG CGCGCCTGAA GCGCTGGAAG TTCGGCGCCA AGAGCGAGGC GATGACGGCC GAGCAGCGCC AAATGTTCCA 10700
GGACACGCTG CTGGAGGACG AGGCCGACCT CGAGGCGCAA CTCGCCGCGC TGCAAGCAGC CCTGCCCAAG ACGCTGTCCG CACCCAAGAC ACCGCGCAGG 10800
CGCCCGCGCC GCCAGGCACT GCCCGATCAC CTGCGTCGCG TCGAGCACCG TCACGAGCCC GAGGACACCA ACTGCACGAC GCCGCAGTGC GGGCAGCCGA 10900
TGACCCGCGT CGGCGAGGAC ATCAGCGAAC GGCTGGACAT CGTGCCGGCG GAGTTCTTCG TGCACCGCCA TATCTACGGC AAGTGGGCGT GCCGCTGCTG 11000
CCAACGCCAA GGGATCGAAC GCCTGGTACA GGAGCCTGCC GATGCGCAGA TCATCGATGG GGGCATCGCC GCCAGCGGGC TGGTGGCACA TACGCTGATC 11100
AGCCGCTTCG TCGATCACTT GCCGTACTAC CGCCAGGAGG CCATCAACGC CAGGTCCGGC GTTCACACGC CGCGCTCGAC ACTGGCGTCG CAGTCCGGCC 11200
GCGCCGGCGC AGCGATGGAG CCGCTGTACG AGGCGCACAA GCGCTTCGTG CTGAGCTGCC CGGTGGTGCA CGCCGACGAG ACGCCGGTGG CGATGCTGGA 11300
CCCCGGAGCT GGCAAGACCA AGCGGGCTTA CATCTGGGCC TATGCGCGAG GCGAGCTGGA TGGCCAGCGC GGGGTGATCT ATGAGTTCTG CCTGGGCCGC 11400
GGTTCGCAGT ACCCGGTGGC CTTCCTGGGG GGGGCCCAGG GTCCGCCGGG GTCGCCGATC GACGAACAAG CGGCGTGGAG CGGCACGCTG GTGTGCGATC 11500
AATACGCCGG ATACGACCGC GTGCTGGATC GGCGCGTGTA CCCGCAGCGC ATCGCCGCCA ATTGCGTCGC TCATGCCCGC CGGAAGTTCG ACGAACTGGT 11600
CGGCACCAGC GAGGTGGCCA AGGAGACGAT CAAACGCATT GGCTGGATCT ACCACGTCGA GGGTCAGTTC GAGGGGATGG ACGCGCAGCA GCGCCTGGTG 11700
GCGCGGGACC AGCTCACGCG GCCGCTGTGG AAGGAGCTGC ACGTCTGGCT GAAGCTGGAA CGCGGCCGCG TGCCGGACGG CGGCTCGATT GCCGGGGCGA 11800
TCGACTACAG CCTGAACAGC TGGACCGCAC TGACGCGGCA CCTCGAAGAC GGGGCGGTGC CGATCGATAA CAACTTCATC GAGCGCCAAA TCAAGCCCTG 11900
GGCGATGGGC AGAAAGGCAT GGCTCTTCTG TGGCAGCGAG TTGGCCGGCC AGCGCGCGGC GATCGTAATG AGCCTGGTGC AGTCGGCCAA ACTCAACGGA 12000
CACGATCCGT GGGCCTACCT GCGCGACGTG CTCGAGCGGC TCCCCAGCCA CCCGAACAGC CGCATCGACG AGCTGCTGCC GCACCGCTGG AAGAAGCCCG 12100
ACGCCTGATC GTCGGCCGGC GGTAGCGTCG TCGGGTCGAC GGCGTCAAGA GGGAGGGCAC GGCTAGACGC TCGCCGTAGA CGGTGGCGGC GCAACCATTT 12200
GTCCAGCCTG CGCCAGACTT GTGGTGTTTG CGCCAGTCCG AAGTACGCCT TCCAACCCAG CAGGTAAGGC CTGAGCTGTT CCACCACTTG CTCCATGCTG 12300
CCCCCGCCCG AGCGCCCCGT GAGTTGCCGG ACCCGGGCTT TGAAGTTCGC CAGTGCCTTG TCGGCCACCT TGCATTTCAC CTCGCGGCCC TGGGCTACCC 12400
ACAGCGCGTA GATGTCCGAG CTTGACGACC GTAGTAAGCG GTCATTCAAC CACGTCCCTT GCGCGGCGGA TCGCTTGGTG GCAGGGAAGC AGCGGGTGCG 12500
GGAAGGCCGC GGGGTTCGTT CCGATCAGTC CGCCAGCTTG ATTCGCCAGG GCAGTCACCA ACTACGTCAA CCACGTCGCC GGGACGGAGA TTGACTCCCC 12600
GGTCGTTCAG GCCTGAGCGG CTGACGGCGG GCCGTGTCAT CGGCCGGGCC CGCTGGTCCC GGCTGACCGG CGCCGCGGGA TCAGCAAACG CCCTAACGGT 12700
GATCGCTGTC TACTATCAGG ATCGCTTCCG GCTGCATTCC CGAGATGGAC CAACGTCCTT ATCATCCTGT CCACAACTTC GACGCATCAG GACCATCATG 12800
ACCCAGGATC TTCTTACCAA CAGCGGCAAG CCGCTCTTCT TCCGTGTCGA CAACGCCGCC GAACTCGGCT TGCACCCGCC GGCAGAGCGG CGGGGGCAGA 12900
GTCTGCGAAC GAGAATCCGC TCGCTCACCG TGATGCAAAA AGAAGCCTTG GTCGTGAACA GCCACACGGG GCTCGCGTGG CGCCTGTGCA GCGACGAAGG 13000
CGCTTACCTG ATGGGGCACG ACGTCGCACC ACCGCCCCTG GCGTTCCTCA CCACCGGAAT GGTGAGCTCC TACATGAACG AGATCCTGGC GCTGGCCCGC 13100
CAGCGCAACC TGAAGCTGCG CGACATCCGC CTGGTGCAGG ACAACTTCTA CACGATGGAG GGGTCGGCCG TGCAAGGGAC CATGGTGGGC GGAGCCTTGC 13200
CGGTGGAGCT CACGGCGCAC ATCGACAGCG ACGTGGAGCC CGGCGAGCTC ACCAAGCTGC TCCAGGACGC CACGGCGGCG TCGCCTCTCA ACGGGCTGAT 13300
GCGTGGCCGG CTCGACAGCC GCTTCTCCCT CGTCCACAAC GGGCGTGAGA TCGAGCCCGG CAAGCTGCAT CGCCTTAGCG CCTTGCCGGC GAAAGAGGGC 13400
GCGACGGCTT TCGATGCCGC ACGGCCCGCG GCGGGCGACT GGGAGACGCT GGTTCAACGC GGTGCTCCCA GCCCCAAACT GTCGCAGACG ACCAGTGGTG 13500
CCGGATCCGG TCTGGCGGAA AGCCAGAGCC GGCGTCTGCA CGTTCGCGGG ATCTGCACGC TGCGCGAAGA CGGCGTGAAG CAAATCGAAC AGCACCTGTA 13600
CCAGCCGCAT GGGACCGTCT TCAAGTTCCT CAGCGATGAA GGCACGCGCA ACGGCGGCGG CGGTCGCGCC CCCGATGCAG CGAGCTACAT TTCGGCCGGC 13700
ATCGGCTTCT GCTTCATGAC GCAGCTCGGC CGCTACGCCA AGATCGTGAA GAAGGACCTG CGTGACTACC GCATCGTGCA GGACACCTTC TTCTCTTTCG 13800
GCGGTGCCAG TGATGGCACA GGGCGGGCCG GCGAAGCGGA TCCGGTGGAG ACGCACGTAT ATCTGACGAC CGGTGAGGAT GACGCGTTCG CCCGGACGGC 13900
CTTGGACATG GCGGAGCAGA CCTGCTTCCT GCACGCGTTC TGCCGCACGG ACATGAAGGT GCGATTCCGC ATCGCAACCC TGGTAGACGG CGGCGCTCCT 14000
GCATGAGAGT GCATGTGCGG CATCTGCATG CGCAGCTGCG GCCCGAGCTT CGGCTGATCT TCTTCTGGGC CAGGCTCAGC CTGCTCGTCG GCACGGTGCT 14100
CAACGTGATC AACCAGGGAG ATCATCGGCG TTCGCGTGGC AGATGTCGTG CTCGGCCCGA CATCGAGCAT CCGGTTGCAC GGAAAGGGAC GTAAGCAGCG 14200
TTCGTTGCCG CTGTGGAAGA CCACCGCCAA GGCGGTACGG GATTGGCTTC ACCTGAATCC CCAGCTTCAG GCCGAGTCGC CGCTGCTGCC GCGCCGAGAC 14300
GGCAAACTCA TGACGCGGGC CAACGTGGCT CAGCGTCTGA AGCTCGCCGT GCAGATCGCC TTCCAGAAAT ACCGCGACTT GGCCAACATT TCTGTGTCTC 14400
CGCACATGGT TCGGCACGCC ACGGCAATGA GCTTGCTTCA GTCGGGAGCC GACCCGTGCG AGATTGCCCT GTGGCTTGGA CACGAGAGTC CGGCGACAAC 14500
ACACATGTAT GTCGAGGCGG ACTTGGCGAT GAAGGAGCGG GCATTGTGAG CATGAGCACC CATCGTGGTG GGCTGATGAC GACGCCTTCT GGCGCGACCA 14600
TGAACGGCGC CGACTGGAGC AGGGCATGAG CGTACCGCAG TACTGCGCGG CCAACGCCCT GGCACTTTCG ACCTACCGGC ATCGCGTGAA TGGTAAGACG 14700
CGCTCGAGCG CGAGGCCAGC GGCCGCGAAG TCGACACCTT CGCGATCGGC GGCATTCGTG CCAGTGTCGA CTCCACGGCC TGAGGTTGCC GCGCTCGTGG 14800
AGATCGCGCT GGAGGGCATG ACGCTGCGCC TGAACGGCGA GGCCGCCGAG CGCGTTCTCG CCGGCGTGAT GGCACGTCTG GCATGATCCT GTCGTCGGCG 14900
ATCCGGGCCT ACGTGTACAG CGAAGCCGTA GACATGAGGA AGTCCATCGA CGGCCTGTCG CAGATCGTGG CGGCTGCGAT GGGCATGAAC CCGCTGTCGG 15000
GCCAGGTGTT CGTCTTCATC GGCCGGCGCC GCGACCGTGC GAAGCTCTTG GTGTGGGATC GGCACGGCTT CTGGGTTCTG TACAAGCGTC TGGAGCGAGG 15100
CCGCTTCACC GACCCCGCGC GGCTGGCCGC AGGAGGCATC GCGATGAGCG AGCTCGTTGC GTGGCTGGAG GGGATCGATC TGAGTCGCAC GAGGCGGCGA 15200
CAGACAACAA CATCAGCGAA CGCGCAATGA AGCCAGTTGC CCTATCGAGG AAGAACTGGC TCTTCGCGGG CTCCGAGCGC GGCGGGCGCG CCGCGGCCGT 15300
CGCCTTCAGC CTCATCGAGA CGGCGCGGCT CAACGGGGTG GAGCCCTACG CGTACCTGCG CGACGTGCTC CAGCGCATCA ATGGTCATCG CCAGGATCGG 15400
CTGGAGGAGT TGCTGCCGAT GAATTGGAGG CCGGCATGAA CGCGCGCATG CCGCCGATGC AGGACGCGGT GCTGATCGAC GCGCTGAACC AGGCGACGAG 15500
CCTTGAGCTG TACCAACTCA GCGCGCTCGT CGAGCGACTG ATCACGGACC CGCGACGCAT CGTGGCCGTG CGCAAGGATC TGCACCTTGG ACAGGTCGTA 15600
CGCTTCTACG ATGGCCGCCG CGACACGATG CGCGAAGGGC GCATCGTCGA GAGCGCGATG CGCAGGTGAC GCTGCACGAC ATGCAGCAGC GGGTGCAGTG 15700
GAAGTTGCCC TACGCCGCGA TCGAGCCGCC GGAGTCCGGC GCGCAACCGC GGCCGCAAAC GCCGCCAGCG CCGGCTGCTT CCTCGACACT CACGCGCGCT 15800
GACTTCGCGC GGGGCGACAA GGTCTGCAGC ACCAGCGCCA GCAGCGAATC AATGCCGCCG TGCAGGTCGC TCGCGCCGAG CGCGAGCCAC AAAGGACGTC 15900
AGCTCCTCGC GCTCGACTTC GCAGACAACG TGTGCAAGGC CCACCGATCT AAAAGGAGCA AATCAACGGT GAGCTTGGGC GCTCGATTGG TTGCCTTCCA 16000
CTACGGGCAG CCCGCCACCG CGATCATGAG CTTGGTTGTT GATTCACCGT ACAAACTGAG CCAGTGTTCA CGCCGAATAT TGAGCCACTA GTTTTAAGGA 16100
TGTTTTGTCA GTTGGTTGTG GATAAGTCTA ACGGCTGGGT CGTTTGATCG GCTCCTTTCT GGGCTTTGGC CCGTGCGGTT TTGGTGATCC CCGCAGCGGC 16200
ACTGGAATGC TTGAAGCGCC AGCTCTCGTT GCCGCTCTCC ACGATGTGGC AGTGGTGGGT CAGCCGGTCC AGCAACGCCG TTGTCATCTT GGCGTCGCCA 16300
AAGACGGTGC TCCACTCCGA GAAGTTCAGG TTGGTGGTGA TGACCACGCT AGTTCGCTCG TAGAGTTTGG ACAGCAGGTG GAACAGCAAG GCGCCACCCG 16400
ACTGCGTGAA GGGCAGATAG CCCATCTCGT CCAGGATCAC CAGATCCACG TACATCAGCC GGTGGGCCAA CTGCCCAGCC TTGTTCTGGG CCTTCTCCAA 16500
CTCCAGCGCG TTGACCAGCT CCACCGTCGA GAAGAAGCGC ACCCGCTTGC CATGGACCCG AATCGCCTCG ATGCCCAGGC TGGTGGCCAG GTGCGTCTTG 16600
CCCGTTCCGG GACCTCCGAC GAAGACCACG TTGTGCGCGG ATTCGGTAAA GCGCAGGGTG TGCAGCTCTC GCACCTGTGC CTCCTGCACA TGCGCCTGGG 16700
CAAAGTCAAA CCCCGCCAGG TCCCGGTGCG AGGGGAAGCG GGCCACGCGC ATCTGGTAGG CCATGGAGCG CACCTCGCGC TGCGCCATCT CGGCCTTGAT 16800
GAGCTGGCGC AGCACCGCCT CGTGGTCCAT CGCCTTGATG CGTGCCGTGC CCAGCACCTC CGGCCAGGCA CTGGCCATGC CGTGCAGACC GAGTGCCTTG 16900
AAGTCGGTGA CCAGTTCAAG CATGGCGCAC CTCCTTGGGC TGGCGCAGCC GGTCGTAGCG CTTCAGATCG GCCTGTGAGG GTGTCTTCAG GTCAACGGAC 17000
GTGGTCGCCG GCTCCACCCG CAGCTCGGGT TCTTTCAAGC GCGACAGCAC ATTGAGTACG TGCTCGGCGC TGACCCGCCC GGACTGCAGG GCCAGCTCCA 17100
CCGCCACCAG CACCGCCTCC AGCCCGTGGG CGGTCACCGC GCTGAGCACC TGGGCCATCA CCCGATCACC GCCCGCATGA CGCAGCAACT GGTCTTGCAG 17200
CAGTCGCAGC GGCTCGGGCA TGGTCTTGAA CGGTGCGCCA TTACGCAGCG CCCCGGGCTT GCGCTCGATC AGGGCAATGT AGTGGGTCCA GTCGTAGATC 17300
GTCTGGTCAC GATCGAAGCA GCGCTGCAGC GTCACACTGT GGCCCTTGGG GCCCACCACC AGCAGCCGGT CGTGGTACGC CCGCAGGCTG ACCACCGCAT 17400
GCACCCATTC GCACGGCACG CTGTAGCGGT TGCGCTGGAA GTGGATCAGC GAGGTCGAGG TCACCCGCAC CGGGTCCTCC ACATAGCCAT CGAAGGGACG 17500
CGGGTTCGGC GTCAGGCGCA GCCTCTCGTC CTGCCACACC TCGGCCACCG TGAGCTCGTG CCACTCGGGG TGGCGCAACT CGCTCCAGGC GTCCACGCAC 17600
GCCTGCTGCA GCCAGGCGTT GAGCTCGCTC AGGCTGGCCC AGCGTCGCTC GCTGGCTTCA CGCCAGATGC CACGCCGGCG GTCCTGCACG TTCTTCTCCA 17700
CGATGCCCTT CTCCCAGCCC GCTGCGCGGT TGCAGAACTC GGGATCAAAG AGGTAGTGAC CGGTCATCGC CTCGAAGCGG GCGTTGACGC TGCGCTGCTT 17800
GCCCGCGCCC ACCTTGTCCA CTGCCGTCTT CATGTTGTCG TAGATACCCC GCTTGGCCAC CCCGCCCAGC ACCGCAAAGG CCCGGGCATG GGCCTCAAAC 17900
AGCATCTCGT GCGCCTGGCT GTGGAACGCC GACAGCATAA ACGCCCGGCT GGCCGCCAGC TTGGTGTGCG CCACCTCCAG GCGCCGGCGC AGTCCGCCAA 18000
TGAACGCGTA CTCGGTGCTC CAGTCAAACT GGAACGCCTC GCCCAATTCA AAGGACAGGG GCACAAAGCC TGCCCCTCGG GCGCTGTGGT CCTGCGCCAG 18100
ACGCCACTGC TTGGCAAAGC TGTAGACCGG CCCACGGCTG CCGCCGTAGC CCATGGCGCG CAAGGCCTCG AACATGGCCT TGATGCCTCG GCGCTCGCGC 18200
TTGTTACGGT GCTGGTCGGC CTTGAGCCAG CGGGCCAAGG TGTCCTCATA AGGCGTCAGT ACGCCCTGCG CCTTCACCCG CACGGGGTAC TTCGGCTCCC 18300
GCATTTCGGG CTCAGACAGC CACTTGCTGG CCGTGTTGCG CGAGATCCCC AGCCGCCGGC TGGCCTCCCG CACCGATACG CCATCCCTCA GCACGAGGCG 18400
GCGCAATTTA CTCAATGTGC TCACGTTGAT CACTCCTGAC CACCTTGCTG AAAAATTCAG CAGCGTAGCC GTCAACGTGG CTCAATTTTC AGTGTGATCC 18500
ACCCCAAAAA TGGCCCAGTT TTCGTTGTGA TTCAACAGCC TACGATCTCC TCGGGCGAGC AGCGGCGAAC GATGCTGGCA GCGAGCTTGC GCAATCGCTC 18600
GACAACGGCC GGATCGTCGG CGCGGCCGCA GTTGTAGACC ACGCGCGTCT CGGAGCGGCG CCGCTCGGGA ACCCACACGT TCTCCGCAAG CTGCAGATAC 18700
GACACCGTGC TGCCGTCGGC CCGTCGCTGT TTGGACTCGC GCAGGTACAT GTGCCCAAGC GTAGCCGCGC ATGCCCAGGC AAATCAAGGA GGCGAGGTCA 18800
ATCCATGTAT CTAGGCATTT TGCGACGAGC GACCTTGCCG GCTCTCGAAA AATCCATCAC TTACGCGAGC TCGACGCGTC AGAACCGGGT CTATGTGCCC 18900
CGGAACTGTC AAAGTCGGGT CAGAGGCTGC GCCGCCCCAA TGAAGTCGGT GGTAAAAAAA CGCCGCACTT TGCCAGGAAC CTGCCTTACC CACGCACTAC 19000
AAGCCTCGCT ACACCAGCGG CATCAGCCGG TGCCGGCGCT TCGTATCTGA ACAGTATTGA GGCTAAACTT GTAACATAGT AAGACTAGGT GGCTGGCGGT 19100
GCGACCCGCG CCACGATGCG CGCGCATGAC TCGCACATCG CCGCGAATAC GGCGGCCGCC GGCGTCAGTG GTGGTTCCGT GCGGTACACC ACGCAGTTGT 19200
CGAAGCGGTG CGCTCGCTCC TTCAGCCGGA GCTGCACGAC CTGCGTGCCG AGGAGGCCGC GTTCCACCAT CACGGCCGGC AACAGCGCCA GCCAGTCGGT 19300
GCCGCTGATC AGTGCCGCGA GCTGCGTGAA TGACTCACAG GTTGCTGCCA CGCGCGGCGC GGGCAGTCCC TGGTCGGCAT GGAAGCGCGT GATCGTGCCG 19400
CCCGGTCCAC CAGGCGGCCC TAGCAGGATC CATTCGCAGT CGGCCAGTTC GCGAAGCGAG CGCGCCTGCG CATGCGGGTG ACCGGGCCGT CCCACCACCA 19500
CGAGGTCGCT CTTGAACAGC ACCTTGCTCT TGAGCCCAGG CCCTGGGCCC TCGTCGGGGA CGGCAGTGAC CGCGAGCTCA ATCAATCCCT GCAGCAACGC 19600
CGGCCGCAGC TGCTCATATA ACCCGCTCAC CAGCGTCAAC TTTACCGTCG GGAACCGGGC GTGAAAGTCG GGTACTACCA AGTGCAGAAG CACGGCGGTC 19700
GGCGTAGGAC CCAATCCAAC ATGCAGTGTG CCGCCCGCGG CCCCACTCGC TTGCTGTGCT TCATCCATGG CACGCCGAGC CTCGCGGTCG ATCAGCTGCG 19800
CGCGTGCGAA TAGGCGCTTG CCGTCAGCAG TCAATGCCAC GCCCTGCGCC GAGCGTTGCA GCAACGATAC CCCCAACCCC GCCTCGAGCG CTTGCAGACT 19900
GTTGGTCAGG CCGGCCTGCG ACAGGCCAAG CTCACGCGCT GCGCGGCGGA AGCTGCCGAA TTCGACAATG GCCAGAAAGG CGCGCAATTG CTGCAGGGTC 20000
ATCGAGACGA TCTCTCTTGG GCCGATCGGT TAAGTCTATC AGAATGGAAT CAGGTTCACT TCCCGCGATC TACTCCGCTT CCTACCATAG CAATGTGGCG 20100
GGGTCGTCCC GTCCAACAAC TGAAAGTAGG TGATCGATGA AAACACAGGT GGCCATCATT GGCGGCGGGC CGGCAGGCAT GCTGCTATCG GAGATCCTGC 20200
ACCGGGCAGG TATACACAGC GTTGTGCTCG AGCAGCGCAG CCGCGAGTAC GTGCTGTCGC GCATCCGTGC CGGCGTGCTC GAGCAGGGCA CGGTCGACGT 20300
GCTGCGGGCC AACGGCCTGG GCGAGCGCAT GGACCGCGAG GGCCATGCCC ACGACGGCAT GAAGATTGTC TGGGCCGGCC GCGACAGCTT TTTCATTGAT 20400
GTGAACAAGC ACCTCGGCAA GCGCTTCATG GCTTACGGCC AGACCAACAT CCAGGAAGAC CTGTTCGCCG CCGCCGACCG GCGCAATGCG GCCGTGCTCA 20500
CTGAGGCCGA GGATGTGCAG CCGATGGATG TGACCACTGA TCGTCCCTAC GTCACGTTCA GGTACCGCGG AGAGGCCATG CGCATGGACT GCGACTTCAT 20600
CGCTGGTTGC GACGGCTTCC ATGGTGTGTC GCGCAAGGCG ATTCCCGCCA ACGTGCTGCG CGAGTTCGAG AAGGTGTATC CGTTCGGCTG GCTGGGCATC 20700
CTGTCGAAGA CGCCGCCGCT GCCGGACATC GTCTACGCTA ACCACCCGCG CGGCTTCGCC CTGGCGTCGA TGCGCAACCC GACGCTCAGC CGCTACTACA 20800
TCCAGGTGCC TCTGGACACG AAGATCGAAG ACTGGTCCGA CGACCGCTTC TGGTCCGAAC TCAAGATGCG CTACCCGCGT GAACTGGCGG ACGCCATCGT 20900
CACCGGGCCG TCGATCGAGA AGTCGATTGC GCCGCTGCGC AGTTTCGTGG CGGAGCCGAT GCGCCACGGC CGCCTGTTCC TGGCCGGCGA CGCTGCGCAC 21000
ATCGTGCCGC CCACCGGCGC CAAGGGGCTC AACCTCGCGG TGTCGGACGT GTTCTACCTC TCGCGGGCGC TGGCAGCCTT TTACGGCCAG GGCACGACGG 21100
CCCTCCTAGA CAACTACAGC GACATGGCGC TCCGGCGCGT CTGGAGCTCG GTGCGCACCT CCTGGTACCT GACCAACCTG CTGCATCGCT TCCCGGGCGC 21200
GAGCGACTTC GACCAGCGCG CACAGGAGTA TGAGCTGGAA TACCTCAAGT CATCGCACCA TGCGCAGGCC GCCCTGGCCG AGCAGTATGC GGGCCTGCCG 21300
TTCGAAGAAG CGGCATGACC ATGGAGGCAA CGATGAACAC GCCACGCGAC TACCACGACA TACCCGGAAC CTATGTGTTC GACGGCGAGC ACAACCGCAA 21400
GGGCTATCAG CTCAACTTGC TCTGCAAGTC GCTGGACGTC GCAGCCAACC GCGACGCGTT CCGCGCCGGC CCCGAGGCCT ACCTCGACCG CTTCCCGATG 21500
ACCGCCGAGC AGCGCCAGGC CGTGCTCGAA CGCGACTGGC TTGGCATGCT GCGCCTGGGA GGCAATATCT ATTACACCTT CAAGCTCGCC ATCTTCGACG 21600
GCATGACGAT GCAACACGTC GGCGCAGCGA TGTCGGGCAC GGGCATGACG GTGGAGCAGT TTCGACAAAT GATGCTGGAC GGCGGCCGCC CAATCAAAGG 21700
CAACCGGAGC AAGAAGGAGC GACGCAATGG CTAAGCTGGT GGGTGGTGTC GGCACGTCGC ACGTGCCCGC AATCGGCGCC GCGGTGGACC ACGGCAAAAC 21800
GCAGGAGCCC TACTGGAAGC CGGTGTTCGA TGGCATGCGG GCGGCCAAGG ACTGGATCGC GGAGGCGAGG CCTGATGTCT GCATCATCGT TTACAACGAC 21900
CACGCGTCGC GCTTCGCGCT CGACTTGATC CCGACCTTTG CACTGGGGGT CGGCGCTGAG TTCCATCCGC ATGATGAAGG CTACGGCCCG CGTCCGGTGC 22000
CTGTGGTCAA GGGCGATCCG GAGTTCGGCT GGCACCTGGC CGAGTCCTTG ATCCTTGAAG AGTTCGACAT CACCATCGTC GGCGACATGA CGGTGGACCA 22100
TGGTCTGACC GTGCCGCTTT CGGTGATGTT CGGGCAGCCT CACGAATGGC CATGCAAGGT CATTCCGCTG TGCGTGAACG TGATCCAGTA TCCGCAACCC 22200
ACCGGCAATC GGTGCTTCAA GCTGGGGCAA GCCATCCGCC GCGCGATCGA CTCCTACGAC AAGGACATCA AGGTGGCAAT TTTCGGCACA GGTGGCCTGT 22300
CGCATCAACT GCAGGGTGAG CGCGCCGGCG TGATCAACGC CGGGTACGAC AACCAGTGGC TCGACCGCTT CGTTGCCGAA CCCGCCGAGG TGGCCAAGAT 22400
CGGGCACACG GAACTCCTGC GCGAGACGGG CTCGGAGGGC ATGGAGTGCG TCATGTGGCT GATCATGCGC GGCGCGCTTG ATGAGCGGGT GAAGGTCGTT 22500
CACCGCTACT ACCACGTTCC CGTCTCGAAC ACTGCTTACG GGTTGCTCGT ACTCGAGAGC GCCTGACCAC CGGCGCCACC GCAAGGCAAT CGTGACGGAT 22600
GCCGGAATCG CGGGGCGCCA CTTACTGACC TGCGCACCAA ATAGTGCGGC AAAGCGTTAA GGATTGATGA AAATCGATGG ACGCAAACTT GACCGGGGGG 22700
CGCAAGCTCA CTTGCGCAGA CAAGTGGTGT GACCTTGCCC CCGAATGGTG TAGTTCTTAG CCCATCGTTT ACGCGGCCTT TTTGCGCTGT GCCTCGTACT 22800
AGCGTTGCTC GTACTGCATC GGGCTGAGGT AACCCAGCGA GGAATGCAGA CGTCGGTGAT TGTAGAAGGC CATCCAGTCC AGGACGGCAT CCATGGCCTC 22900
ACGTCGGGTG GCGAATTTCC TGCCGTACAG GCTCGCGGTC TTCAGGCGGC CCCAGAAGCT CTCGGTCGGC GCGTTATCCC AGCAGTTGCC TTTCCTGCTC 23000
ATCGATGAGC GCATCCCCCA GCCCTTCAAT ACATCCTGGA ACTCGTGGCG TATCCGTCCG GCCAGCCGTA CTCGCCCTTG ACCTCGGCAT GAATGGCACG 23100
GATGTGCGCC AGCAGGGCCT CGTCGCTGTA GCGCCCGCCA GGCCCTGGCC GAGCCGAATC GCGGCGGCGG CGCAGCCAGC TGAAATAGCC ACTGGCACTC 23200
AAATCCAGCA CGTTCGGCCC TGTCTTTGAT GGCAGCTGGC GTCGCATGGC GATGAGGGAC CGCTTCGGAG GTGGAGCCAC CACGGTCGGT CACAAGGCTA 23300
CGTCTGTGTG CCTTGGGACC CGCCGTTCGC AACGTGATCT TCCAGATAGT TGAGCGACCG GAAGTGGCCG CCTGCCGCCG ACCGCGGTCT GGCTCGAAAG 23400
CAGTCTGTCA ACGCGCCGTT TCATTGTTGC CATGATCATA TCTTAATCGC GCACAGGCGG CCACAGGCGG GCAGTATGAA CCAGCGTCAG TATCCACACC 23500
GTTTCGCCGT CGATCTGATA CACCAGGCGA TAGCTTTCGT GCGGGATCAA CTCGCGGGTC CCGGGAATCT TTCCCGGCTT GCCCAGCATG GGGTGCTGGA 23600
TCAAGCGGGC GGCCGCGTCG CTGAAAATCT CATCCATCCG GGCCGCCGCG CGCGGATTGT CGGCTGCGAT GTAGTCCCAC ACATCGGCAC GGTCTTGCTG 23700
CGCTTCGGGC GTCCAAACAA CCCTCACGCC TGGCTCGCCA CACTGGCACG CCGTGCGGCG AATTCGGCCT CAACTTCATC GTTCGACCGC CCCAATCCAG 23800
CGCGCATCGA AGCCCGGCCG GCTTCGACCT TGCGGCGCAG GAACTCGTCG TACTCGCGCG ACTCGCGCTG GCGCTGAACG AACTCGCGCA TCAGCTCGCG 23900
CAGCACTTGC GACGCCGGGC GATGGGCCGC CTCGGCTTCG GCCATAAACT CGGCGCGCAA CTCAGGCTCC AGCTTCATCG TGAAAACGGC TTGTTTTGAC 24000
ATGATCGGGG CCTCCTGCCA CTTGATACTA ACAAAGTATA TACGCCGTCA TTACTAAGCG CTATTCACAG AACGCTGCAA GGCGGGCGTG CGCTAGGCCA 24100
AGGCCTGTCG GAAAACATTT GTTTTTCGAC AGGCCTTCAA CGGTCCTCTG CACCAACCTC CGAGTGGCCG CAAAATTGTG CGGAAAACTC TGTCGCCAGA 24200
CGCTACCATA CGGAAACCTC GTCTTAATGG TTTTCCGCTT ATGTTGGTAG GTTACATGCG CGTGTCGTCG GACTCCGACC GCCAGAGCAC GAACTTGCAG 24300
CGCGATGCGC TGCTCGCCGT CGGCGTCGAT GCGCGGCATC TGTTCGAGGA TCATGCTTCC GGCGCGAAGG ACGACCGCGC GGGCCTGGCG CGGGCGCTCG 24400
AATTCGTTCG CCCTGGCGAC GTGTTGGTCG TGTGGAAGCT CGACCGGCTC GGCCGTTCGT TGTCGCACTT GCTCGCCATC GTGACCTCGC TCAAGAAAAA 24500
GCAGGTGGCG TTCCGCTCGC TGACGGAGAA CCTGGATACC ACGACGCCCT CGGGCGAGTT TCTGTTCCAG GTGTTCGGCG CGCTCGCGCA GTACGAACGC 24600
GCCTTGATCC AGGAACGTGT CGTCGCCGGT CTGGCTGCCG CCCGCAAACG CGGCCGGATC GGCGGCCGGC CGCAGGCGAT CACCGGCGAG AAGCTGGAGG 24700
CCATCGTCGC TGCGCTCGAT GGCGGCATGT CCAAGGCGGC GGTGTGCCGC AACTTCGGCG TCAAGCGAAC CACGCTGATC GAGACCCTGG CACGGGTTGG 24800
TTGGACGGGC TCTCGTGGAG CGTCATCGCG ATGACGACCA AGAGCGAACG ATTGACCGTC CTGTCGGACG CCGAGCAGGA AGCCCTGTAC GGCCTGCCGG 24900
ACTTCGACGA CGCCCAGCGG CTGGAATACT TGGCGTTGAC TGAAACCGAA CTGGCGCTCG CCAGCAGCCG GCCTGGTCTC CATGCCCAGG TCTATTGCAT 25000
CTTGCAGATC GGTTACTTCA AGGCCAAGCA TGCCTTCTTC CGCTTCGACT GGAGTGAGGT CGAGCACGAT TGCGCCTTCG TGCTGAGCCG CTACTTCCAC 25100
GGCGAGTCCT TCGAGCACAA GCCAATCTCC AAGCACGAGC ACTACACCCA GCGCGAGTGG ATTGCCGATC TGTTCGGCTA CCGGCCGTGG GCGGCCGAGT 25200
TCCTGGCGCA GCTCGCGCAG CAGGCCGCGC AGACCGTGCG GCGCGACGTG ATGCCGGGGT TCATCGCCGC CGAGCTGATC GTCTGGCTAA ACGAGCACAA 25300
GATCATCCGG CCCGGCTATA CCACCCTGCA AGAGCTGGTG AGCGAAGCCC TGTCCGCCGA GCGTCGGCGG CTGGCTGGCC TGCTGTCGGA AGTGTTGGAC 25400
GAATCGGCCA AGGCCGCGCT GGGTCGGCTT CTAGTGCGTG ACGACACCCT GTCGCAATTG GCGGCGCTCA AGCAGGACGC CAAGGACTTT GGCTGGCGTC 25500
AGATGGCCCG CGAACGCGAA AAGCGCGCCA CGCTGGAGCC GCTGCACCGG ATCGCCAAGG CGCTGCTGCC CAAGCTCGGC GTCTCGCAGC AGAATCTGCT 25600
GTACTACGCC AGCCTGGCGA ACTTCTACAC CGTCCACGAT CTACGCAACC TGAAGGCCGA TCAGACCTAC CTCTACCTGC TTTGCTATGC CTGGGTGCGC 25700
TACCGGCAGC TTTCCGACAA CCTGGTCGAT GCGATGGCCT ACCACATGAA GCAGTTGGAG GACGAAAGCA GTGCGGGCGC AAAGCAATCC TTTGTCGCCG 25800
AGCAGGTGCG CCGTCAGCAA GACACACCGC AGGTCGGCCG CCTGCTGTCG CTTTACATCG ACGACAGCGT GCCCGATCCC ACGCCGTTCG GCGATGTGCG 25900
CCAGCGCGCC TACAAAATCA TGCCCCGCGA TACGCTGCAA ACCACCGCGC AGCGCATGAG CGTGAAGCCG GTGAGCAAGC TGGCTTTGCA CTGGCAGGCG 26000
GTGGACGGCC TGGCTGAGCG CATCCGCCGC CATCTTCGGC CGCTGTATGT CGCGCTCGAC CTCGCTGGCA CTGATCCGGG CAGCCCGTGG CTCGTGGCGC 26100
TGGCCTGGGC CAAGGACGTG TTCGCCAAAC AGCAGCGCCT ATCGCAACGG CCGCTCGCCG AATGTCCAGC GGCCACGCTG CCGAAACGCT TGCGACCGTA 26200
CCTGCTGACC TTCGATGCCG ATGGCAAGCC GACGGACCTG CATGCCGACC GCTACGAGTT CTGGCTGTAC CGCCAGGTCA GGAAGCGCTT CCAGTCGGGT 26300
GAACTCTACC TCGACGACAG CTTGCAGCAC CGGCATTTTT CCGACGAGCT GGTTTCGCTG GATGAGAAGG CCGCCGTGCT GGCGCAGATC GACATCCCGT 26400
TCCTGCGGCA GCCACTCGAT GCCCAGCTCG ATGCGCTCGC GACCGAGCTG CGCGCTCAGT GGCTGGCCTT CAACCGCGAG CTGAAGCAGG GCAAGCTGAC 26500
GCACCTAGAA TACGACAAGG ACACGCAGAA GCTGACATGG CGCAAGCCCA AGGGCGAGAA CCAGAAGGCG CGCGAGAAGG CGTTCTACGA GCAACTGCCG 26600
TTCTGCGACG TGGCCGACGT GTTCCGCTTC GTCAACGGCC AGTGCCAGTT CCTGTCGGCG CTGACGCCTT TGCAGCCGCG CTATGCGAAG AAGGTCGCCG 26700
ACGCCGACAG CCTGATGGCG GTCATCATCG CGCAGGCGAT GAACCACGGC AACCAGGTCA TGGCACGCAC CAGCGACATC CCGTACCACG TGCTGGAGAG 26800
CGCCTACCAA CAGTACCTGC GCCACGCAAC GCTGCACGCG GCCAACGACT GCATCAGCAA CGCCATCGCC GCGCTGCCGA TCTTCCCGTA CTACTCGTTC 26900
GACCTCGATG CACTGTACGG TGCCGTCGAT GGTCAGAAAT TCGGCGTCGA GCGGCCGACC GTGAAAGCGC GCCACTCGCG CAAATACTTT GGGCGCGGCA 27000
AGGGCGTGGT CGCCTACACG CTGCTGTGCA ACCACGTGCC GCTCAACGGC TACCTGATCG GCGCGCACGA TTACGAGGCC CATCACGTGT TCGACATCTG 27100
GTATCGCAAC ACGTCGGACA TCGTGCCGAC CGCGATCACC GGCGACATGC ACAGCGTCAA CAAGGCCAAC TTCGCTATCC TGCACTGGTT CGGCCTGCGT 27200
TTCGAGCCGC GCTTCACCGA CCTTGGCGAT CAGTTGAAGG AACTCTACAG TGCCGACGAT CCGGCGCTGT ACGATCAGTG CCTGATCCGG CCGGCCGGGA 27300
GAATCGACCG CGATCTCATA GTCAGCGAGA AGCCGAACCT CGACCAGATT GTCGCCACGC TCGGACTGAA GGAGATGACG CAGGGCACGC TGATCCGCAA 27400
GCTATGCACC TACACCGCGC CGAACCCCAC GCGGCGCGCG GTGTTCGAGT TCGACAAGCT CATCCGCAGC ATCTACACGC TGCGCTACCT GCGCGATCCG 27500
CAACTGGAGC GCAACGTTCA CCGCTCACAG AACCGCATCG AGTCCTATCA CCAGCTACGC TCAACCATCG CCCAGGTCGG CGGCAAGAAG GAATTGACCG 27600
GGCGCACCGA CATCGAAATT GAGATCAGCA ACCAGTGCGC CAGGCTGATC GCCAACGCGG TCATCTTCTA CAACTCGGCC ATCCTCTCGC GGCTGCTGAT 27700
GAAGTACGAG GCGAGCGGCA ACGCCAAGGC GCACGCTCTC CTGACCCAGA TATCGCCGGC GGCCTGGCGG CACATCCTGC TGAACGGGCA TTACACCTTC 27800
CAGAGCGACG GCAAGATGAT CGACCTGGAT GCGCTCGTGG CGGGGCTGGA GCTGGGATGA CGGAAATTTC GGCGGTTCTG GCTTAGAACC CC
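
Because IRR is listed reading inward from the right end, the last 38 bases of the top strand should be its reverse complement. The following is a minimal Python sketch of that check; the helper name `reverse_complement` and the variable names are illustrative and are not part of any TnCentral tooling.

```python
# IRR as listed under Terminal Inverted Repeats (38 bp).
IRR = "GGGGTTCTAAGCCAGAACCGCCGAAATTTCCGTCATCC"

# Final displayed line of the DNA sequence (positions 27801-27892),
# copied with its space-separated 10-base blocks.
LAST_LINE = (
    "CAGAGCGACG GCAAGATGAT CGACCTGGAT GCGCTCGTGG CGGGGCTGGA "
    "GCTGGGATGA CGGAAATTTC GGCGGTTCTG GCTTAGAACC CC"
)

# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Strip the display spacing and take as many bases as IRR is long.
right_end = LAST_LINE.replace(" ", "")[-len(IRR):]

# True when the right terminus is the reverse complement of IRR.
irr_matches = right_end == reverse_complement(IRR)
```

For this record `irr_matches` evaluates to `True`, so the listed IRR agrees with the right terminus of the sequence.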

 Recombination Sites     

Name Coordinates Length Sequence
res 24103-24233 131 GCCTGTCGGA AAACATTTGT TTTTCGACAG GCCTTCAACG GTCCTCTGCA CCAACCTCCG
AGTGGCCGCA AAATTGTGCG GAAAACTCTG TCGCCAGACG CTACCATACG GAAACCTCGT
CTTAATGGTT T
res_site_I 24103-24131 29 GCCTGTCGGA AAACATTTGT TTTTCGACA
res_site_II 24165-24208 44 TGGCCGCAAA ATTGTGCGGA AAACTCTGTC GCCAGACGCT ACCA
res_site_III 24209-24233 25 TACGGAAACC TCGTCTTAAT GGTTT
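
The coordinates and lengths in the table above can be cross-checked with a short script. This is a sketch only: it assumes 1-based inclusive coordinates, and the names `RES_SITES`, `parse_coords` and `span_length` are illustrative.

```python
# Rows copied from the Recombination Sites table:
# (name, coordinates, listed length in bp).
RES_SITES = [
    ("res", "24103-24233", 131),
    ("res_site_I", "24103-24131", 29),
    ("res_site_II", "24165-24208", 44),
    ("res_site_III", "24209-24233", 25),
]


def parse_coords(coords: str) -> tuple[int, int]:
    """Split 'start-end' into two integers."""
    start, end = coords.split("-")
    return int(start), int(end)


def span_length(coords: str) -> int:
    """Length of a 1-based inclusive interval (assumed convention)."""
    start, end = parse_coords(coords)
    return end - start + 1


# Names of rows whose listed length disagrees with their coordinates.
mismatches = [n for n, c, listed in RES_SITES if span_length(c) != listed]

# Subsites that fall outside the full res interval.
res_start, res_end = parse_coords(RES_SITES[0][1])
subsites = RES_SITES[1:]
outside = [
    n for n, c, _ in subsites
    if parse_coords(c)[0] < res_start or parse_coords(c)[1] > res_end
]

# Unassigned bases between consecutive subsites.
gaps = {
    (n1, n2): parse_coords(c2)[0] - parse_coords(c1)[1] - 1
    for (n1, c1, _), (n2, c2, _) in zip(subsites, subsites[1:])
}
```

With the values in this record, `mismatches` and `outside` are both empty. `gaps` shows 33 bp between res_site_I and res_site_II that the table does not assign to a subsite, and none between res_site_II and res_site_III.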

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
Re Tn5501.8 721-2145 Passenger Gene Other -
Re Tn5501.8 2464-3615 Passenger Gene Other -
4Fe-4S dicluster Tn5501.8 3655-6870 Passenger Gene Other -
tripartite tricarboxylate transporter Tn5501.8 6893-7885 Passenger Gene Other -
tnpA Tn5501.8 9594-9986 Accessory Gene Helper +
tnpB Tn5501.8 9983-10318 Accessory Gene Helper +
tnpC Tn5501.8 10393-12108 Transposase   +
osmC Tn5501.8 12798-14006 Passenger Gene Other +
tyrosine-type recombinase Tn5501.8 14205-14549 Passenger Gene Other +
WP_015586010.1 Tn5501.8 14626-14886 Passenger Gene Hypothetical +
istB ISCsp4 16108-16923 Accessory Gene ATPase Transposition Helper -
istA ISCsp4 16916-18433 Transposase   -
lysR family Tn5501.8 19085-20002 Passenger Gene Other -
bhbF2 Tn5501.8 20137-21318 Passenger Gene Other +
bhbD2 Tn5501.8 21315-21734 Passenger Gene Other +
bhbE2 Tn5501.8 21727-22566 Passenger Gene Other +
parE Tn5501.8 23443-23727 Passenger Gene Toxin -
parD Tn5501.8 23724-24002 Passenger Gene Antitoxin -
tnpR Tn5501.8 24256-24834 Accessory Gene Resolvase +
tnpA Tn5501.8 24831-27860 Transposase   +
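
Adjacent ORFs in the summary share a few bases in some cases, for example parE/parD and tnpR/tnpA. The sketch below computes ORF lengths, checks that each is a multiple of three, and measures overlaps between neighbours for four rows of the table. It assumes 1-based inclusive coordinates; the `Orf` type and function names are illustrative.

```python
from typing import NamedTuple


class Orf(NamedTuple):
    name: str
    start: int
    end: int
    strand: str


# Four rows copied from the ORF Summary table (last four entries).
ORFS = [
    Orf("parE", 23443, 23727, "-"),
    Orf("parD", 23724, 24002, "-"),
    Orf("tnpR", 24256, 24834, "+"),
    Orf("tnpA", 24831, 27860, "+"),
]


def length(orf: Orf) -> int:
    """Length in bp of a 1-based inclusive interval (assumed convention)."""
    return orf.end - orf.start + 1


def overlap(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs, or 0 if they are disjoint."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


lengths = {o.name: length(o) for o in ORFS}

# True when every ORF length is a whole number of codons.
in_frame = all(n % 3 == 0 for n in lengths.values())

# Overlap in bp between each ORF and the next one in the list.
overlaps = {(a.name, b.name): overlap(a, b) for a, b in zip(ORFS, ORFS[1:])}
```

For these rows the computed lengths match the Gene Length values under ORF Details (285, 279, 579 and 3030 bp). parE/parD and tnpR/tnpA each overlap by 4 bp, while parD and tnpR do not overlap.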

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
Re Re Tn5501.8 1425 721-2145 -
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MSMNLVTLLY LIASICFIQA LKGLSHPTSS IRGNLFGMSG MAIAILTTAA LIVEIAGGAA QGMVWVLVGL LLGGGLGAVM AQRVEMTKMP ELVAFMHSMI
GLAAVFIAIA AVAEPWAFGI TAKGSPIPAG NRLELFLGAA IGAITFSGSV IAFGKLSGKY KFRLFRGAPV QFAGQHMLNL ALGLATIALG LIFTFTENWS
AFFLMLALSF VLGVTIIIPI GGADMPVVVS MLNSYSGWAA AGIGFSLNNS MLIIAGSLVG SSGAILSYIM CKAMNRSFFS VILGGFGGET SAAAAGSGEQ
RPVKSGSADD AAYILGNAET VIIVPGYGLA VARAQHAVNE LAEKLTHKGV TVKYAIHPVA GRMPGHMNVL LAEAEVPYDQ VFEMEDINGE FGQADVAIIL
GANDVVNPVA LQKGSPIYGM PILEAYKAKT VIVNKRSMAA GYAGLDNELF YMDKTMMVFG DAKKVVEDMG KAIE

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
Re Re Tn5501.8 1152 2464-3615 -
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MLKFLSKRKE TRMQIGVPAE TEAGETRVAV TPETAKKLKA QGHVVRVQTG AGVAAAAPDA AYEAVGAEIC DAAGALGCEL VLKVRAPSAE ELALMKPGAA
VVGMLNPFDA EGLHRLAAAG LTSFALEAAP RTTRAQSMDV LSSQANIAGY KAVMIAADRY QRFFPMLMTA AGTVKAARVV ILGVGVAGLQ AIATARRLGA
VIEASDVRPS VKEQVESLGA KFIDVPYETQ EEKDAAEGVG GYARPMPASW LERQKAEVAK RVAQADIVIT TALIPGRAAP VLVTEEMVKS MKPGSVIVDL
AAPQGGNCPL TEAGKTVIKH GVTLVGETNL PALVAADASA FYARNVLDFL KLVLPKEGPL NIDPSDDIVA ACLMTQGGEV KRK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
4Fe-4S dicluster 4Fe-4S dicluster Tn5501.8 3216 3655-6870 -
Class:   Passenger Gene
Sub Class:   Other
Comment:   4Fe-4S dicluster domain-containing protein
Protein Sequence:  
MRIFPHTQRA MHYGAYPSER LSRKASSSSH QSIHETGALQ PPPTNDSEST LAHSISRFMC ALDQVREGDH NKQRGEIPSD PMERSRHLKS AAYFLDTSIA
GVSELNREHL LDLRIEHPTF RTATYEQSET KLRLRFNPNA VLRQMKRSMD LCEQGVEHHS HALVLMAEFP RDPQPEEAGA DWIVGMQQWR AALRAAETAA
VLANYLRILG YSARSHTATT TDVNLLRLAV SSGLARVDHG RVLNPFVGDR FVLAAVTTDM ALDPDLPLAQ PSLLESFRAK GPSWWLGWSS PKNKWNAREF
GAREYRYSRF PVESLKRVEE TTTFIDEARI PRVPKSSEFF LRAAFGDLGK APQEASKDGY SVAKAPLAAA LRLALNVYSV LQRGEPAKLK VEFPDRKNNA
DLVKATLHFL GADLAGVSRA PDWVWYSHRQ DGTEMPVAHP FAATVMIDQG FETMEGASGD DWISSAQSMR TYMRAGLVCG VLADHLRRLG YGATAHSAAD
SDVIQTPLVL LAGLGEVSRI GETILNPFLG PRSKTGVMTT DFPIEPDKAI DFGLQNFCES CNKCARECPS GAISAGPKVM FNGYEIWKAD VEKCTRYRMT
NLAGAMCGRC MKACPWNLAG LVQEKPFRWA AMNLPIAAQW LAKLDDYLNR GEINPRKKWW WDITTERNGS VVAAKEVNVR GLNKHIRLKY EDQTLAAYPA
PLAPLPLPMP SRVDREAGID AYKNLLTPDA YKTRISSGDT DNLVPGLRRL TEEPPVIVAK ISRRWASSKD GKIDIFEIVS SDGSPLPKFD AGAHIDITIT
PEFIRQFSLA GDPADRSKYV LGILREDQGR GGSLKIHQML RPGVPVVVSA PRNHFPVMQD ARRHLLLAGG IGVTPLIAMG HELHSAGKDF VLYYKAKTRA
QAAFIQELES VPWSNRVHFH FSDERRLDIA DVLRDYVPGN HVYTCGPAAF MDAVFDTAAA FGWDESSLHR EYFSVPEMDE REKLPFTLRL NSTGQVIPVS
ATQSAVQALA SAGLQVDVKC SDGLCGVCAV PYLSGEVDHR DFVLSKEQRK VKMITCCSRA ATAEGEIVLN L

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tripartite tricarboxylate transporter Tripartite tricarboxylate transporter Tn5501.8 993 6893-7885 -
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MSFSYLSIGR RALVTCLIPL STLALQATAW ASGDWPSKPI KLIVAFPPGS SPDVIGRALA PGLAEALKQP VVVENRAGVT GMIAAEAVAR SPADGYTFLM
TSGSSVTISV HTLPKVPIDP MKDLVPVAST ARIELFLVSD ARQDFKSFSE LVAAAKARPG RLSYASPGNG SAPHVAAEMM MAETGITAVH IPYRGSSPAL
QDLLGGVVQF AFDPGIGLEH VKAGKLRLLA VGSTKRSPLF PDTPTLQELG LKGFDSGTTH AIYAPAKTPA AIVERMNAEI NKQLASPQVI QTIRNLGAEP
TPLSPADLAD QLRRDSARYA AIVKLRGIRE

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn5501.8 393 9594-9986 +
Class:   Accessory Gene
Sub Class:   Helper
Sequence Family:  IS66_family
Comment:   mutation of tnpA in IS66 family members reduces transposition by at least two orders of magnitude
Protein Sequence:  
MHVNQKPRRR HSEQFKAQVL AACAEPGASV SAVALSFGVN ANLVHQWRRG RGFKAARTVP PCPVIEPAPR FVALSLPAPT PAPSPAAGAP APAIRVELKR
GALGVNVIWP IAAAGDCTAW LRELTTDLLK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpB TnpB Tn5501.8 336 9983-10318 +
Class:   Accessory Gene
Sub Class:   Helper
Sequence Family:  IS66_family
Comment:   mutation of tnpB in IS66 family members reduces transposition by at least two orders of magnitude
Protein Sequence:  
MIRIDQLWLC TAPMDMRAGA ERLLSCVVQT TGAAHAHHGY LFANARATRI KLLVHDGFGV WCAARRLNAG HFAWPREAAA TPLSLTQAQF DALVVGLPWQ
RLPEMSVITR L

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpC TnpC Tn5501.8 1716 10393-12108 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MGMMRGMLSM RDLKAQDLQG LSPETVTALA AQMLERIEQQ AQEIDLQRRD LEVKQKLIER KDRDIAWRDA KLEKVNFELA RLKRWKFGAK SEAMTAEQRQ
MFQDTLLEDE ADLEAQLAAL QAALPKTLSA PKTPRRRPRR QALPDHLRRV EHRHEPEDTN CTTPQCGQPM TRVGEDISER LDIVPAEFFV HRHIYGKWAC
RCCQRQGIER LVQEPADAQI IDGGIAASGL VAHTLISRFV DHLPYYRQEA INARSGVHTP RSTLASQSGR AGAAMEPLYE AHKRFVLSCP VVHADETPVA
MLDPGAGKTK RAYIWAYARG ELDGQRGVIY EFCLGRGSQY PVAFLGGAQG PPGSPIDEQA AWSGTLVCDQ YAGYDRVLDR RVYPQRIAAN CVAHARRKFD
ELVGTSEVAK ETIKRIGWIY HVEGQFEGMD AQQRLVARDQ LTRPLWKELH VWLKLERGRV PDGGSIAGAI DYSLNSWTAL TRHLEDGAVP IDNNFIERQI
KPWAMGRKAW LFCGSELAGQ RAAIVMSLVQ SAKLNGHDPW AYLRDVLERL PSHPNSRIDE LLPHRWKKPD A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
osmC OsmC Tn5501.8 1209 12798-14006 +
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MTQDLLTNSG KPLFFRVDNA AELGLHPPAE RRGQSLRTRI RSLTVMQKEA LVVNSHTGLA WRLCSDEGAY LMGHDVAPPP LAFLTTGMVS SYMNEILALA
RQRNLKLRDI RLVQDNFYTM EGSAVQGTMV GGALPVELTA HIDSDVEPGE LTKLLQDATA ASPLNGLMRG RLDSRFSLVH NGREIEPGKL HRLSALPAKE
GATAFDAARP AAGDWETLVQ RGAPSPKLSQ TTSGAGSGLA ESQSRRLHVR GICTLREDGV KQIEQHLYQP HGTVFKFLSD EGTRNGGGGR APDAASYISA
GIGFCFMTQL GRYAKIVKKD LRDYRIVQDT FFSFGGASDG TGRAGEADPV ETHVYLTTGE DDAFARTALD MAEQTCFLHA FCRTDMKVRF RIATLVDGGA
PA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tyrosine-type recombinase Tyrosine-type recombinase Tn5501.8 345 14205-14549 +
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MPLWKTTAKA VRDWLHLNPQ LQAESPLLPR RDGKLMTRAN VAQRLKLAVQ IAFQKYRDLA NISVSPHMVR HATAMSLLQS GADPCEIALW LGHESPATTH
MYVEADLAMK ERAL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_015586010.1 WP_015586010.1 Tn5501.8 261 14626-14886 +
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MSVPQYCAAN ALALSTYRHR VNGKTRSSAR PAAAKSTPSR SAAFVPVSTP RPEVAALVEI ALEGMTLRLN GEAAERVLAG VMARLA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istB IstB ISCsp4 816 16108-16923 -
Class:   Accessory Gene
Sub Class:   ATPase Transposition Helper
Comment:   ATPase
Protein Sequence:  
MLELVTDFKA LGLHGMASAW PEVLGTARIK AMDHEAVLRQ LIKAEMAQRE VRSMAYQMRV ARFPSHRDLA GFDFAQAHVQ EAQVRELHTL RFTESAHNVV
FVGGPGTGKT HLATSLGIEA IRVHGKRVRF FSTVELVNAL ELEKAQNKAG QLAHRLMYVD LVILDEMGYL PFTQSGGALL FHLLSKLYER TSVVITTNLN
FSEWSTVFGD AKMTTALLDR LTHHCHIVES GNESWRFKHS SAAAGITKTA RAKAQKGADQ TTQPLDLSTT N

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istA IstA ISCsp4 1518 16916-18433 -
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
VINVSTLSKL RRLVLRDGVS VREASRRLGI SRNTASKWLS EPEMREPKYP VRVKAQGVLT PYEDTLARWL KADQHRNKRE RRGIKAMFEA LRAMGYGGSR
GPVYSFAKQW RLAQDHSARG AGFVPLSFEL GEAFQFDWST EYAFIGGLRR RLEVAHTKLA ASRAFMLSAF HSQAHEMLFE AHARAFAVLG GVAKRGIYDN
MKTAVDKVGA GKQRSVNARF EAMTGHYLFD PEFCNRAAGW EKGIVEKNVQ DRRRGIWREA SERRWASLSE LNAWLQQACV DAWSELRHPE WHELTVAEVW
QDERLRLTPN PRPFDGYVED PVRVTSTSLI HFQRNRYSVP CEWVHAVVSL RAYHDRLLVV GPKGHSVTLQ RCFDRDQTIY DWTHYIALIE RKPGALRNGA
PFKTMPEPLR LLQDQLLRHA GGDRVMAQVL SAVTAHGLEA VLVAVELALQ SGRVSAEHVL NVLSRLKEPE LRVEPATTSV DLKTPSQADL KRYDRLRQPK
EVRHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
lysR family LysR family Tn5501.8 918 19085-20002 -
Class:   Passenger Gene
Sub Class:   Other
Comment:   lysR family transcriptional regulator
Protein Sequence:  
MTLQQLRAFL AIVEFGSFRR AARELGLSQA GLTNSLQALE AGLGVSLLQR SAQGVALTAD GKRLFARAQL IDREARRAMD EAQQASGAAG GTLHVGLGPT
PTAVLLHLVV PDFHARFPTV KLTLVSGLYE QLRPALLQGL IELAVTAVPD EGPGPGLKSK VLFKSDLVVV GRPGHPHAQA RSLRELADCE WILLGPPGGP
GGTITRFHAD QGLPAPRVAA TCESFTQLAA LISGTDWLAL LPAVMVERGL LGTQVVQLRL KERAHRFDNC VVYRTEPPLT PAAAVFAAMC ESCARIVARV
APPAT

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bhbF2 BhbF2 Tn5501.8 1182 20137-21318 +
Class:   Passenger Gene
Sub Class:   Other
Comment:   4-hydroxybenzoate 3-monooxygenase
Protein Sequence:  
MKTQVAIIGG GPAGMLLSEI LHRAGIHSVV LEQRSREYVL SRIRAGVLEQ GTVDVLRANG LGERMDREGH AHDGMKIVWA GRDSFFIDVN KHLGKRFMAY
GQTNIQEDLF AAADRRNAAV LTEAEDVQPM DVTTDRPYVT FRYRGEAMRM DCDFIAGCDG FHGVSRKAIP ANVLREFEKV YPFGWLGILS KTPPLPDIVY
ANHPRGFALA SMRNPTLSRY YIQVPLDTKI EDWSDDRFWS ELKMRYPREL ADAIVTGPSI EKSIAPLRSF VAEPMRHGRL FLAGDAAHIV PPTGAKGLNL
AVSDVFYLSR ALAAFYGQGT TALLDNYSDM ALRRVWSSVR TSWYLTNLLH RFPGASDFDQ RAQEYELEYL KSSHHAQAAL AEQYAGLPFE EAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bhbD2 BhbD2 Tn5501.8 420 21315-21734 +
Class:   Passenger Gene
Sub Class:   Other
Comment:   protocatechuate 4,5-dioxygenase alpha subunit
Protein Sequence:  
MTMEATMNTP RDYHDIPGTY VFDGEHNRKG YQLNLLCKSL DVAANRDAFR AGPEAYLDRF PMTAEQRQAV LERDWLGMLR LGGNIYYTFK LAIFDGMTMQ
HVGAAMSGTG MTVEQFRQMM LDGGRPIKGN RSKKERRNG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bhbE2 BhbE2 Tn5501.8 840 21727-22566 +
Class:   Passenger Gene
Sub Class:   Other
Comment:   Protocatechuate 4,5-dioxygenase
Protein Sequence:  
MAKLVGGVGT SHVPAIGAAV DHGKTQEPYW KPVFDGMRAA KDWIAEARPD VCIIVYNDHA SRFALDLIPT FALGVGAEFH PHDEGYGPRP VPVVKGDPEF
GWHLAESLIL EEFDITIVGD MTVDHGLTVP LSVMFGQPHE WPCKVIPLCV NVIQYPQPTG NRCFKLGQAI RRAIDSYDKD IKVAIFGTGG LSHQLQGERA
GVINAGYDNQ WLDRFVAEPA EVAKIGHTEL LRETGSEGME CVMWLIMRGA LDERVKVVHR YYHVPVSNTA YGLLVLESA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parE ParE Tn5501.8 285 23443-23727 -
Class:   Passenger Gene
Sub Class:   Toxin
Target:   DNA gyrase
Sequence Family:  ParE_toxin (Pfam:PF05016)
Protein Sequence:  
VRVVWTPEAQ QDRADVWDYI AADNPRAAAR MDEIFSDAAA RLIQHPMLGK PGKIPGTREL IPHESYRLVY QIDGETVWIL TLVHTARLWP PVRD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parD ParD Tn5501.8 279 23724-24002 -
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  parD (PDB:4Q2U)
Comment:   RelB
Protein Sequence:  
MSKQAVFTMK LEPELRAEFM AEAEAAHRPA SQVLRELMRE FVQRQRESRE YDEFLRRKVE AGRASMRAGL GRSNDEVEAE FAARRASVAS QA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn5501.8 579 24256-24834 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MRVSSDSDRQ STNLQRDALL AVGVDARHLF EDHASGAKDD RAGLARALEF VRPGDVLVVW KLDRLGRSLS HLLAIVTSLK KKQVAFRSLT ENLDTTTPSG
EFLFQVFGAL AQYERALIQE RVVAGLAAAR KRGRIGGRPQ AITGEKLEAI VAALDGGMSK AAVCRNFGVK RTTLIETLAR VGWTGSRGAS SR
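
The listed gene length and the protein sequence can be checked against each other. The sketch below does this for TnpR, on the assumption (not stated in the record) that the gene coordinates include the stop codon, so that gene length equals 3 x (residues + 1). Variable names are illustrative.

```python
# TnpR protein sequence as displayed above (space-separated 10-residue blocks).
TNPR_PROTEIN = (
    "MRVSSDSDRQ STNLQRDALL AVGVDARHLF EDHASGAKDD RAGLARALEF "
    "VRPGDVLVVW KLDRLGRSLS HLLAIVTSLK KKQVAFRSLT ENLDTTTPSG "
    "EFLFQVFGAL AQYERALIQE RVVAGLAAAR KRGRIGGRPQ AITGEKLEAI "
    "VAALDGGMSK AAVCRNFGVK RTTLIETLAR VGWTGSRGAS SR"
)

# Gene length in bp as listed for tnpR (coordinates 24256-24834).
TNPR_GENE_LENGTH = 579

# Remove display spacing before counting residues.
residues = len(TNPR_PROTEIN.replace(" ", ""))

# Assumption: the annotated gene span includes one terminal stop codon.
expected_gene_length = 3 * (residues + 1)

consistent = expected_gene_length == TNPR_GENE_LENGTH
```

Under that assumption the 192 residues shown account for the full 579 bp.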

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn5501.8 3030 24831-27860 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MTTKSERLTV LSDAEQEALY GLPDFDDAQR LEYLALTETE LALASSRPGL HAQVYCILQI GYFKAKHAFF RFDWSEVEHD CAFVLSRYFH GESFEHKPIS
KHEHYTQREW IADLFGYRPW AAEFLAQLAQ QAAQTVRRDV MPGFIAAELI VWLNEHKIIR PGYTTLQELV SEALSAERRR LAGLLSEVLD ESAKAALGRL
LVRDDTLSQL AALKQDAKDF GWRQMARERE KRATLEPLHR IAKALLPKLG VSQQNLLYYA SLANFYTVHD LRNLKADQTY LYLLCYAWVR YRQLSDNLVD
AMAYHMKQLE DESSAGAKQS FVAEQVRRQQ DTPQVGRLLS LYIDDSVPDP TPFGDVRQRA YKIMPRDTLQ TTAQRMSVKP VSKLALHWQA VDGLAERIRR
HLRPLYVALD LAGTDPGSPW LVALAWAKDV FAKQQRLSQR PLAECPAATL PKRLRPYLLT FDADGKPTDL HADRYEFWLY RQVRKRFQSG ELYLDDSLQH
RHFSDELVSL DEKAAVLAQI DIPFLRQPLD AQLDALATEL RAQWLAFNRE LKQGKLTHLE YDKDTQKLTW RKPKGENQKA REKAFYEQLP FCDVADVFRF
VNGQCQFLSA LTPLQPRYAK KVADADSLMA VIIAQAMNHG NQVMARTSDI PYHVLESAYQ QYLRHATLHA ANDCISNAIA ALPIFPYYSF DLDALYGAVD
GQKFGVERPT VKARHSRKYF GRGKGVVAYT LLCNHVPLNG YLIGAHDYEA HHVFDIWYRN TSDIVPTAIT GDMHSVNKAN FAILHWFGLR FEPRFTDLGD
QLKELYSADD PALYDQCLIR PAGRIDRDLI VSEKPNLDQI VATLGLKEMT QGTLIRKLCT YTAPNPTRRA VFEFDKLIRS IYTLRYLRDP QLERNVHRSQ
NRIESYHQLR STIAQVGGKK ELTGRTDIEI EISNQCARLI ANAVIFYNSA ILSRLLMKYE ASGNAKAHAL LTQISPAAWR HILLNGHYTF QSDGKMIDLD
ALVAGLELG

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
ISCsp4-KC771559 ISCsp4 Insertion Sequence 16037-18537 2501

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRR ISCsp4 16037-16068 TGTTGATTCA CCGTACAAAC TGAGCCAGTG TT
IRL ISCsp4 18506-18537 TTTTTACCGG GTCAAAAGCA ACACTAAGTT GT

 References     

Chen K, Huang L, Xu C, Liu X, He J, Zinder SH, Li S, Jiang J. Molecular characterization of the enzymes involved in the degradation of a brominated aromatic herbicide. Mol Microbiol. 2013 Sep;89(6):1121-39. doi: 10.1111/mmi.12332. Epub 2013 Jul 31. PubMed ID: 23859214