Transposon
Name: TnXax1.1       (Synonyms: Tn7207)
Family: Tn3        Group: Tn4651
Evidence of Transposition: yes
 Host     

Host Organism: Xanthomonas arboricola pv. pruni CFBP 55306
Molecular Source: plasmid pXap41
Date of Isolation: 2011

 Terminal Inverted Repeats (IR)     

IRL (Length: 46 bp) GAGGGTCGGCAGGGATTCGTGTAAAACACAGCCAAAAGTGAGCTAA
IRR (Length: 48 bp) GAGGGTCGGCAGGGATTCGTGTAAAAAACAGCCAAAAATGAGCTAACT
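
The two terminal inverted repeats can be compared position by position. The following is a minimal Python sketch, assuming an ungapped alignment that starts at the outer end of each IR; the helper names are illustrative and not part of the record.

```python
# Compare the terminal inverted repeats of TnXax1.1.
# Assumption: ungapped alignment starting at the outer end of each IR.

IRL = "GAGGGTCGGCAGGGATTCGTGTAAAACACAGCCAAAAGTGAGCTAA"
IRR = "GAGGGTCGGCAGGGATTCGTGTAAAAAACAGCCAAAAATGAGCTAACT"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def mismatch_positions(a: str, b: str) -> list[int]:
    """1-based positions where a and b differ over their shared length."""
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


mismatches = mismatch_positions(IRL, IRR)
irr_on_top_strand = reverse_complement(IRR)

print(len(IRL), len(IRR), mismatches)
print(irr_on_top_strand)
```

IRR is listed in the same orientation as IRL, so the right end of the element, read on the top strand, should match `reverse_complement(IRR)`. That string can be checked against the final bases of the DNA sequence.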

 Sequence     
DNA Sequence (Length: 23552 bp)
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GAGGGTCGGC AGGGATTCGT GTAAAACACA GCCAAAAGTG AGCTAACTCC CTGTCAGCTA AGGATTCCTC ACGACGCCCT GCTGACCCTC ATGCTGGGCT 100
ACTGCTGCAC GTGCGCCACT AGGTCCAACA CGGAGAGCTG CAATTCGGCG TCCACCTCTC GCCACACGTT GGCGACGTGT CAGGCCCGGC CTATAGCACC 200
GCGCATGTCC TGGCCGCGCT TGGGCGCATA GCGCGCCCAA GCGCGCTACG GTGGCATCAT GGACGTGCCG ACGCTCATCC GGCTTGATCC ATCGTCACCA 300
ACTCAAGGGG GGGCGACCCA AGGCATCCAG GCGCTCTGCA TCTTCAAGGA TCGCAGCAGT TGTGCGCGGC GAGATGGGCT GTGCGTTATC CAGTCCATAG 400
GCCTGCCGCG CAGCATCGGC GGCTAGAGCT TGACGCGTCC GCGGTGAGTA CTCTTGCAGC CTTTGACGCG TGCTCTGGGC CAGTTCGGGC GCGATGACCG 500
GCATCGCGGA AAAGATATCG CCGGGTGCGG GGTTGGCTAG AAATGCTGTC TCCAGGTCGC GCGCATTCGC ATGGAAATCA GTTTGCGGAT CCTCGATCTC 600
GGCTCTGAAT GCGTTTGTCC TGGCAAGCGC GTCGATTGCA TCGCGCTTGT CGAAGCGTTC AATTACGTTG GTGGATGTTC CGTACGTTTC GGCCCAGTGC 700
GAATCCCTTA GTGGGGTTGT AAGCCGGAAC CGCCGAAATT TCCGCCACCC GTCCCGAGCG ATCACTCGGC AAGTTGCTTG ATACGTCCGG CAAGGCGTGC 800
GAACGTTTCG TTCACATACT CCAACGCTTT GGGGTCAGTG CCCGTCGCGC GTAAAAAGTC TTTAAGCTGT CCAGATCGGA TAGCCATTGG CGGTGCAGGC 900
TGCCCAGCTC GTTGTCGTGT GGCTGGAAGA CTTGCAGGTT GCCGCTGTAA CCAGCAGTTG AGCTACCCGT GGCTTGTGCT CTTCGGAGAA CTCGGCCAAG 1000
CCGTCATATG GGACGGGATG GGGCAGTACG CCGGAATTTT GTCACGACCC TCTAGTGTGC TCGATGAGTG CAGTGGGACA TTGCGGGGAC AACGCGGCCT 1100
GTGAAGGCTT CTTCGGCCTG CTCAAACGCG AGCGGATCAG GCGACAGATC TATCCCACCA AGGACGCCGC ACGCGCCGAG GTATTCGACT ACATCGAGAT 1200
GTTCTACAAC CCCAACCGCC GCCACGGTTC AACTGGTGAC CTGTTCCCTG TAGAGTTTGA ACGGCGCTAC GCGCAACGAG GGTCTTGAGT GTCTAAGGAA 1300
CTCTGGGCGT ATCAAGAAGC AGTGGTTCAC TACCTACCAA CGAGGGAGCA CTTTAATGGG TGACAAGACC GACAAGAACA CAATCGCCTG GCTGGCTCAA 1400
CCGGAGGAAC ATGATTATCC GGCTGCGCAG TCCTATCTCA ATCTGCTGTA TGACGACGCC CACTGCGCCA AGCTGGTGCG TAAGCTGCAC GCGGCACCCA 1500
TGTCGGCCTT CAAGGCCAAG GACATCCTGC GCGCCAGCGG CCTCTCGCCG CTAGGAATGA GCAATGCACA CGTGGAAAGG GACCTCAAAA AAATCCAGTC 1600
CGGCACCGCT CTGTCACCGC TGCTGCTGGT GCGCCAGGAA GGCCAGCGCA CCGTAGTTGC CGACGGCTAC CACCGTTTGT GCGCCGTCTA TAGCTTCGAC 1700
GAGGACGCTT CTATCCCTTG CAAGATCGTG TGAGCCGACC CTCTACCTAG CCCAGCTGCA CGGTTGTCAG GCACGTACCC CATCATGTGA TTGATGATGG 1800
AATGGGCCTC ACTGACACCT TGGGTGGCAA GCTGTTTCAT ATGCTGGTCG ATCTTCGTGG CCAGTCGGCG AAATTCGTTC ATTCTCATGT CCGTCACAAA 1900
AGACGCGCTG ATTGCCCTTG GACAGAGAGA AGTGTCACCG CCTTCTTATC GAAGGAAACG AAGGTTTCCC CACCAAGCCA ATTACCTTCG TAGGCAATGA 2000
CGCCATCGGC AAAGTCACCG CCTGCGTCGA GCACCAGCAA ACCGGCCTCC ACGGCGGGCC GGTTCACTTC CACGTTGGCG GCGTCCAGTA GCGCCCGGAT 2100
CGCGTCGGCC GCGTCAGATT GCTGGAAGCC ATAGACGCGC ATCAGCACCC AAACAAATTC GCATAGGCAC GGCAGCGCGA CCGCGATCAA CTCAGCATCG 2200
GTCAAGACTG CGGCGGCAAC GTCCGCTTGT GCGGGATCGT CACGCACAAC AGCACGCACA AGGACGTTGG TATCGACTGC GACCTTCATT GCTTGCCTGC 2300
CCATCCTTGT GCAGCCGCCT CGTTGATTTC TTCGATGGTG GCAACCTTCT GCGTCTTACC CGCGAGCAGG CCGACAAAGC TGGCTATCGT CCCTGCGGGC 2400
CGTGCCGCCT TGAGTACTCC CCGACCATCT GGCAGCAAGT CAAGCTCGAT CTTGTCGCCT GGCCTAATGC CGAGGTGTTG CAGTACGTCC TTCCGAAACG 2500
TCACTTGTCC ACGTGCGGTA ACGGTCAATG TAGTCATGGT GATGTGCCTC GCAATCTCAT GGGTTGACGC AATAGTAATG CAAAAAACCC TTACTTGTAC 2600
TGTCCGTCAA ACGATCGTTA GTCGGCCGTG CCGGGCGCTG TCCTGAAAAC TGCTTGTGCA ACGGGTTGTC GGACAATAAG ATAGACGGAC GGAATAGCGG 2700
ACAAGATGGG TGGCAAATGG CACTGATCGG CTATGCGCGG GTATCAACGG CGGAACAGGA CACCGCTTTG CAGACGGATG CGCTGCGCAA AGCAGGCTGC 2800
GAGCGCGTTT TCGAGGACAC GGCTTCCGGG GCCAAGGCTG ACCGCCCCGG TTTGGCTGAT GCGCTGGCCT ACCTGCGCAA CGGCGACGTG CTGGCCGTCT 2900
GGCGGCTGGA CCGGCTTGGG CGCTCCATGC CGCACCTGAT CGAAACGATA GGCGCGCTGG AAGCGCGAGG CGTCGGCTTT CGTTCGCTGA CGGAAGCCAT 3000
CGACACCACC ACGCCGGGCG GGCGGCTCAT CTTCCACGTG TTCGGCGCGC TGGGCCAGTT CGAGCGCGAC CTGATTCGCG AGCGCACCAA GGCCGGGTTG 3100
AGTGCCGCCG CCGCTCGCGG GCGCAAGGGC GGACGCAAAC CAGTCATCAC CGCCGACAAG TTGCAGCAGG CACGTAAACA CATCGCCAAC GGGCTAAATG 3200
TCCGGGAGGC CGCTACACGG CTCAAGGTGA GTAAAACAGC CTTGTATGCA GCTCTGCAAT CCACCAGCGC ACGAGATCTC CAAAAATGAC AACCGCCTCA 3300
CGCGCAGCGG CACTGTAATG GCGGCCAAGA GCGAACGATT GACCATTCTG TCGGACGCCG AGCAGGAAGC CTTGTACGGC CTGCCGGATT TCGACGACAC 3400
CCAGCGGCTG GAATATCTGG CCTTGACCGA TGCTGAACTG GCGCTCGCCA GCAGCCGGCC CGGCATGCCT GCCCAAGTCT ATTGCGTCTT GCAGATCGGC 3500
TACTTCAAGG CCAAGCACGC GTTCTTCCGC TTCGACTGGA ACGAGGTCGA GGACGATTGC GCCTTCGTGC TGAGACGCTA TTTCACTGAC GAGCCGTTCG 3600
AGTGCAAGGC GGTCACCAAG CACGAGCACT ACACCCAACG CGAACGGATC GCCGAGCTGT TTTGCTATCT TCCGTGGGTA GCCAGCTTCC TGCCACAGCT 3700
CGCGCAGCAG GCCGCGCTGA TCGTGCGCCG CGATGTAATT CCGGGGTTCA TCGCCACCGA GCTGATCGTC TGGCTCAACG AGCACAAGAT CATTCGGCCA 3800
GGCTACACCA CCTTGCAAGA GCTGGTCAGC GAAGCCCTGT CCGCCGAACG CCGGCGGCTG GGCGGGCTGC TCGCGGAAGT ATTGGACGAT CCGGCCAGGG 3900
CCGCGATGGC CCAGCTCCTG GTGCGCGATG ACACCCTGTC GCAACTGGCG GCGCTCAAGC AGGACGCCAA GGATTTCGGC TGGCGTCAGA TGGCCCGCGA 4000
ACGGGAAAAA CGCGCCACGC TGGAACCGCT GCATAGCATT GCCAAAGCGC TGCTGCCCAA GCTCGGCGTC TCGCAGCAAA ACCTGCTGTA CTACGCGAGC 4100
CTGGCGAACT TCTACACCGT CCACGACCTT CGCAATCTGA AGGCCGATCA GAGCCACCTC TATCTGCTGT GCTATGCCTG GGTGCGCTAC CGACAGCTCT 4200
CCGACAACCT GGTCGATGCG ATGGCCTACC ACATGAAGCA GTTGGAGGAC CAGAGCAGTG CGGGCGCAAA GCAGTCCTTC GTTGCCGAGC AGGTGCGCCG 4300
TCAGCAAGAC ACGCCGCAGG TCGGCCGCCT GCTGTCGCTT TACATCGACG ACAGCGTGCC CGATCCCACG CCGTTCGGCG ATGTGCGCCA GCGCGCCTAC 4400
AAGATCATGC CTCGCGATGC GCTGCTAACC ACGGCACAGC GCATGAGCGC CAAGCCGGTG AGCAAACTGG CTTTGCACTG GCAGGCGGTG GACGGCCTGG 4500
CCGAGCGTAT CCGTCGGCAT CTTCGACCGC TGTACGTCGC GCTCGACCTC GCCGCCACCA ACCCGGACAG CCCGTGGCTG GTGGCGCTGG CCTGGGCCAA 4600
GGGCGTGTTC GCCAAACAGC AGCGCCTATC GCAACGGCCG CTCGCTGAAT GTCCAGCGGC CACGCTACCG AAACGCTTGC GGCCGTACCT GCTGACCTTC 4700
GATGCCGACG GCAAGTCAAC GGGCCTGCAC GCTGATCGCT ACGAATTCTG GCTGTACCGC CAGGTCAGGA AACGCTTCCA GTCGGGCGAA CTCTACCTCG 4800
ACGACAGTTT GCAACACCGT CATTTCTCCG ACGAGCTGGT TTCGGTGGAG GAAAAGGCCG ACGCGCTTGC GAAGATGGAC ATCCCCTTCC TGCGGCAGCC 4900
GGTCGATGCC CAACTTGATG CGCTGGCGGC CGAACTTCAC TTGCAGTGGC TGGCCCTCAA CCGCGCGCTT AAACAAGGCA AGCTGACGCA CCTGGAATAC 5000
GACAAAGCCA CGCATAAACT GATCTGGCGC AAACCCAAGG GCGAAAACCA GAAGGCCCGC GAGAAAGCGT TCTACGAACA ACTGCCGTTC TGCGACGTGG 5100
CCGACGTTTT CCGCTTCGTC GACGGCCAGT GCCAGTTCCT GTCGGCGCTG ACGCCCTTGC AGCCGCGCTA TGCGAAGAAG GTCGCCGACG CCGACAGCCT 5200
GATGGCGGTC ATCATCGCGC AGGCGATGAA CCACGGTAAC CAGGTCATGG CGCGTACCAG CGACATCCCG TACCACGTGC TGGAGAGCGC CTACCAACAG 5300
TACCTGCGCC ACGCAACGCT GCACGCGGCC AACGACTGCA TCAGCAACGC CATCGCGCAA TTACCGATTT TTCCGTATTA CTCGTTCGAT CTCAATACGC 5400
TGTACGGTGC TGTCGATGGC CAGAAATTCG GCGTCGAGCG GCCGACAGTG AAAGCGCGCT ACTCGCGTAA ATACTTCGGG CGCGGGAAAG GCGTGGTCGC 5500
CTACACGCTG CTGTGCAACC ACGTGCCGCT CAACGGCTAC CTGATCGGCG CGCACGACTA CGAGGCCCAT CACGTATTCG ACATCTGGTA TCGCAACACG 5600
TCGGATATCG TACCAACCGC GATCACCGGC GACATGCATA GCGTCAACAA GGCCAACTTC GCCATCCTGC ACTGGTTCGG TCCGCGCTTC GAGCCACGCT 5700
TCGCCAACCT CGACGACCAG TTGAAGGAAC TGTACTGCGC CGACGATCTT GCACAATACG AGAACTGCCT GATCCGACCG ATCGGGAAAA TCGACCGCGA 5800
TCTCATCATC GACGAAAAGC CAAACATCGA CCGGATCGTC GCCACGCTCG GGCTGAAGGA GATGACGCAG GGCACGCTGA TCCGCAAGCT GTGCACCTAT 5900
ACCGCGCAAA ACCCGACGCG GCGCGCGATA TTCGAGTTCG ACAAGCTCGT CCGCAGCATC TACACGCTGC GCTACCTGCG CGACCCTCAA CTGGAGCGCA 6000
ACGTGCACCG CTCCCAGAAC CGCATCGAGT CTTACCACCA GCTACGCTCA ACCGTTGCCC AGGTCGGTGG CAAGAAGGAA CTGACCGGAC GCACCGACAT 6100
CGAGATCGAA ATCAGCAACC AGTGTGCGCG CCTGATCGCC AACGCGGTCA TTTACTACAA CTCGGCCATC CTGTCGCGGC TGCTGACGAA GTATGAGGCG 6200
AGCGGCAACG CCAACGCGCT GGCGCTCATC ACCCAGATGT CACCGGCAGC CTGGCGGCAC ATCCTGCTGA ACGGGCATTA CACCTTCCAG AGCGACGGCA 6300
AGATGCTCGA CCTAGACGCG CTCGTGGCCG AACTGGAGCT GGGATGACGG AAATTTCGGC GGTTCCGGCT TACAACCCCA CCACGCCGTT ACCGAGCGAA 6400
ACGTACAAGC GCCTGGTGGC CTTGTCCTTC AACGGCTGAA AATCTGGATC TTGCTTCCAA TAGAGCCCAA GAAGATCAAG CGCCGCAGGC ACAGGCATGA 6500
TCTGCAGCTC GGCCAATAGT GCCGGCGGAA ACGACTTGCG CGGTCTCACG GAAGAATCCT TCTCGAAGCA GAAGTTCGGT TCCCTTCCCC CAGACCCCCA 6600
ACCCTCTCCG GCCGCTGTGC GGCCTGCGCG TCCCGTGGAC AACGCTGCGC GTTGACACAC CGGACTTGCG GGCTACTCGT GATATTTAAA TCACGGGGGC 6700
TACTGCGGCC GCGGCGGCCG GGAGCGGAAC AAACAAACCG AATGCACCAA AAATCATGCG CGCAAATCCC GGCCGCGAGG CTCCGCCATT AGTAATACGG 6800
ACGGACTTTG AACATCATGC TGAGCGCTAT CAGCAGTGCC GGCGATACCG GCGCCGATGC TGGATTGGTC AGCTGTAATA CGGACGGACC TTGAACGCTC 6900
TTGGGGTGCT TTCAGATAGC CCGCAACGGA GCCGACAGAT ACGCCCATAC GCTGAGCAAT AACGCGAACA GATAAGCCTT CCGCCTTCAG CGCCTGAGCC 7000
TGCTTCCGCT TGGCTTGGGC CGCATCCAGG TAGGTTTCGC GGTCTACCGC GCCAGCTGCG CGCCGCTTGG CCTCATGGCG CTTCCGATCG CGTTCCCTGG 7100
CCATGTCTGC ACCAATGATG GTGCGCAGCT GCGCCTGTTC GTCAGAGGTG ATCTGAAAGA GGTTGATGAG GGTGTCGTTT TTCGGGGTGT AGAGCGGGGC 7200
GAATTCTTTA TCGCCCAATG TGACCTTTTC ACCGGCTTCG TAGGCTTTGG CCTTGCTGTA GAGCGTCATC AGCTCTTTGC TGCGGTAGTT CCATCCCGCG 7300
TCCAGCTCGC CGGCGAGCGC GGCGGCTTCG TGATACATCT GGCCACTGTG TGTTGCGCCC GATAGCAGCA GAAAATTCAG CCGCCAGAAC AAGTGCTGCA 7400
TCCGCTCACC TTCGCGCACC CCGCCGCGCA GCGTGGCCAG CGTGCGCAGA TCCTCAAGGC GGTCCCAAGC AAGCTGCCGC CCGGAGAAGC CGCGCAGATT 7500
GTCCGACTTG CCGCCTGGCA GCAGCCTGAG CTGCTGGCGT TCGCGGCGAT CGGCGCGGTC CTGGCGTTGT TGCTCGATGG TCCATCTGGC TTTCGGCAGC 7600
AGTGCTTCTG CTAGGTACTC GAAGTTGTAG CGGATCGGAT GGCCATCTTG GCCAGATTCG ACATGGACAA CACGGCAGAT GTTGCCGCTC TTGGTATTTA 7700
CGGTGCCGAC CAGGCGCAGC ACACGCGAAG CATCCTTGGC TTGCGGATCT GCGCCAAGCG CGGCCAGTCG GTCGATTAGG TAACGCTGAC AAGCGTTCCA 7800
GCGCGGCAGG GCCTGTCGTG GAATCGTCCC GTCCATCAGC CATTTCGCCT GCAGGCCTCG GCCGCTGTAA ACGATAAGTG AGGGGGTTGG CAGGCCTTCA 7900
TCTGCGCAAT GAAAGAGGAC GGCCGCGGCC AGTTGCTCGG GCGTGCGTTC CACTGCCCAC GGCTGACGGT AAGTGTCGAT ATCGGCGAAC AGGAGGCCGA 8000
TGCGCAGCAG ATTGACCACG CGCCGGTTGG GCCGCATGAA CTCTGCCTGC GTCATCCAGG TGTCGCGGTC CTTGTCGATC AAGCCGAGCA CTGCCGGCAT 8100
GTCGGTCAGC TTATGGGACG ACTGGCGCTT TTCACCGCGC TGGTCGACCA GCAGCGAGAA GAAGCCGGTT CGGCCAGCGT CGTGATAGGT CTGCGCCTCG 8200
TCATCGATGC TGAATAGCGC GAGTTGCGCT TGTGTTTTCG TGCCCATACC TCTCCGATCG GTGCACGAAT TTCGCGCACA CGCCCTTGCT TTCCGGAGAG 8300
GGGAGCCTTA CAATACGAGG TACCAAACGC TCGTATCGGT TAGGGTTCTT CCTTCCGAAA TGATTTGGCC AAAGCCCCGC GCCAACGGGG CTTTCGCTTT 8400
TTTAGCGTTC TCGATCCGAC GTCAATCAGA TCGAGGCAGC TGCGATTAAA CCTTCACAGT TATGCCCTCT GTGCTTCATC CACTTCGGTT GAGTGCGCGG 8500
TGTCCAGCCC ACTGTGTTCG CAGCGAATTT CGAGATACCG CGCTCCAGCA ACAACGCAGC AGCGCCGTCG AAGCTTTGGA AGTACCTGAC GCCTCGATTG 8600
CGGGCCAGGC CAACCACACG CCTGTCCTCG CCAAGATCGA CGATCACCAC CAGACCTTCT CCGTCCTCGG CCGCACGAAT TTCCGCGCCG CGCACTAGGT 8700
GGGCGTCGGC GGCTTCCTGC AGGGCTTTAG GCTCGATAGC GATCACGGGC GAATCTCCAC ATTGCTGAAT CTGAGCCGGA TCATGCCCCA TGCGACGGGG 8800
CCGTGCAAGT TATTCAATTT CTCGCAAGGC AGAAGGTGGC GCTGCAGCCG GTGTCTTTCC CCTCACGGTA TCTGCTGCGC GGCATGCTTC ACGCGCAGGA 8900
TTTCAACGAC GCCGGCGCTG GCCAACACTC GGTAGAGAAC GAGGTAGGGG TTCAGCCGAC ATACGTGTAA AAATGACTCA CAAGCTTGGA TTTCACCCAT 9000
TGCATCAGTG AGTTAGCGGA TATTTCTGGG TAACAAAACT ACACTGCCAA CAGCGTTAGC TCACTTTTGG CTGTTTTTAC ACGAATCCCT GCCGACCCTC 9100
AGAACGATGT AATTCTGAGG CACGACCAAC TCGCGCACGC CTGCAGGCAG GCCAGGTCGC CCAGAGGGTC GGTAGGGATT GAGGATTCGC ACGGTGGGTG 9200
CGCTCCCCTG TCAGCTGAAA ATCACAAGGT ATAGAGCGGC GGCGACAATC AGCGCCAGCA ATGCAAGTGA TCCGCCGATA GCGACAAGTA GGATTGCCGC 9300
ACCAGTGAGG CCGACGCCGA CCACGCCCAT TAGTTGCAGG CGCAGGAAAC GTGCCACGGG CGGAGCTGGT GGCCGCGTGT CGTCGGCTCC CGCCCTCGGG 9400
GCAGACCGAC ACGCGGCAAT GCCGCCGTAA AACATCACGC CGGCAATGGG AAGCCAGCCT GTCAGGAAAA ACGGCGTCTC CCAAAGCTGC GGGCGGGCGA 9500
AGACCGCAGC GAATATCAGG CCGCCAAGGC CGAAGCCCGC CACAGCGATC GATGCGCGTC TAAGGTTGCG CGAAATATCA TTCATCGGGG TTCAAGGTGT 9600
GTTGTGCAGC AGATCGGTAA TCGCCGCATC GTTCGCGTTG AACTGTCTCG TGCAAAGGCA ATGCGTCGCG CTGCCTCAGC ACCTAGCTCG CCGCCATCCT 9700
CGATCCGGGC GCACGCGTGC AGCACAGCTG TCATAGCAGC CGGATCACCG GCCATGCGAT TGGCAAAACG GTCAGCGCGC AGTTCCTCGG CGTAGCTGTC 9800
CCGGCGGGCG CGTCGATCGG TGCGCTCGGC AAGCTGCCCA GCGACCACGG CTGCAACAAT GCCCAGCACA AGCGCACACA AGGTCTGCGG GGCGGTGATG 9900
GCCAGCGGCT TTGCGGCGAA GCTGCCGACA GCGGCAGCAG TCATCGCGCC AACCACCAGC GCGAGCGCTG GCGGCCAGAA GTAGCTGCCG ACCCGCTTGA 10000
GCATCGTCGC GCGCCGCTGG GCATGGCCCA CTTCGTGGGC CAAGGTGTAA CGCTGGTTGG CATCGAGTGT GAGCGCGAGT GTTTGGGACA CCACGATGGT 10100
GTGCGTGAGC GCGTTATAGC GTGCATTGCG ATGCCGCCAG AGCCTCACGC GCGGTGGCGC GATGTAGGCG TGGGCAGCGA TCGCATCGAC CGTGGCCTGC 10200
AGCTGCAGGC GCGTTGCAGT GGTCATGGGG ATGCGCTGGT GGCACGGGCA GCTTTCCGGG TGCGCGGCAG GCCGGGCAAT TTCAGCTGCG CCCGATCGCT 10300
GGGCAGGATC TTGGCGGCGG TGGCCACCGT CGCCGCGCCG GCCGGCTTCC TGGCCGGCTT GGACTTGGCG GGTTTGCCCT CGCGCTGGAT CTTGGCCAAC 10400
GCCTTCAACG CAGCCGGGCG GCCGCGTCGG TAGACCTTGG GCGCTCGCAC GTAGTTGGCC GGCAGCACCT CGAACACGGT CACCCCGAGC GCCAGCAGCT 10500
GGGCGGCAGC GGTGTCCAGG GAAGCAAACA CACGTGGCTC GCCGCGTTGG CCGGTCAGTT GCTTTTCGGT TTGGCCCACC CGCACCATCA CGGTGTAGCC 10600
GCCGACCTGG CCGACCAGCT GCACGGTGCG CAGCATCCCG GATCTGGCGA GGGCGGCGAG GGTGGTGATG GAAATGGGCG ATGTCATGAT CGACCTATCA 10700
AACATGGATG TTTGATCCTC GATCATTTGA AGCCGCTCCG CACATGAACC CAAAGTAGTC CAGATGTTGC ATCGCCCGGG CGCATCCGTT ACAGTCTACG 10800
CACCACTCCT CCCTTGTCGC CCCCATTAAG TTAGGGTAAC GGCTTGGGAC GAAAAAACCG CCGCAAGGCG GTTTTTTCAT GTCTGCAGGA AGCTCGTTGT 10900
GACAAGCGCC AAAGCAACAG CCGTCTACAT CGATGGCTAC AACCTGTACT ACGGACGCAT CCGTGGGACT GCATTCAAGT GGCTTGATGT GGTCGCACTG 11000
TTCGATCGCC TTCTGCATGA TCAAGACCCG ACCACCGAGC TGCTGCACGT CCGGTATTTC ACAGCGCCGG CGCTGGGTCG GTTTGCGACC CACACGCAGG 11100
CCCCGGAAGC GCAAACGACG TATCTGCGCG CGTTAACACA CAGCCACCCG CAACGATTCA CCACCACATT AGGCAAACAC AGCTGGGACA AAGACGGAGC 11200
CCTGCTGCCG GAGTTTGTCA GCGGCCAGCC GTATGATCGG ACACGCCGGG TGCGGGTCTG GAAACTCGAA GAGAAACAGA CCGATGTGAA CCTGGCGCTG 11300
GCCATGTATC GCGATGCTGC GTCGGGGCGC TATCAGCAGT TGGTGGTGTG CTCCAACGAT AGCGACATCG AGCCGGTCCT GGCGGCCATC CGCGAGGACT 11400
TCCCCACTAT CGTGCTAGGC ATTGTCACGC CACGCAGGCC GCCGGTCGAG GGTGAAGCGG ACAGGCGAGT CAGCGTCTCG CTGTCGAGCC GCGCTGACTG 11500
GACACGTCAC TACATCTTGG ATAGTGAGCT GGCTGCCGCT CAGCTGCCCG AGCGCGTTCG TAAGCCCGGC AAGCCGATCG ACAAGCCTGG CCACTGGTGA 11600
GCCCCACTCA CCAGTGATTA CCCGCTTTTT GGCATTAATC TGTAGCTAAC TAGGCGGCAA GGATACGGCG AAGCATTTTT TCTGCCTCTG CATCATCTCC 11700
ACAGCGATAG ACCTCTTTCG CCATCTGACG CATCGAAACG CCAGGGGCAA CCTCGGCATC TCCATCGGCC AGATCAACCC CGGAGAGATC TTGATCCGAG 11800
GGGAGACCGT TAGGAAATTC CACAATGACC CACACATCCA CAGCACGGAA TACAGCGCCG GATCGCGTTT CAATTCGGTC AAATAGCTCG GTCATTTTTT 11900
GCGCCCCAGC GCCATACATC AGGGCAGCGT TTAGGGCAGG ACTTAGACGC ACCCCGAACA AGGGATCGCG GTTGTGAAAC GTAATATCGC AGTAGCTCTC 12000
ATGGCGCGAT AGATGCACCA GAACGCGCAT GTTGTGTTGC AGTGGCGTGA TCTGTTCCTC TTGCATTATC TGTTCTCCGA TATCGGGCCT GCCGACCCTC 12100
ATAGTTGGAT GAGGAACATT CGCATGACAA CCATATGCAA AAGGCCAATC CTCGTTACGC GTCAGAGGAC TAGGCTTGTT CTTGTTGCTC CTGCGCCCAA 12200
TTCAGCACCG CACGCATTTC TTCTTGATCG TAATAGGAAG GCGCGGAGCC CGGCTCAAAT ACCCCGTTAG CAATCGCGGC CTCAAAGTAA TGGCGGGCGG 12300
CTTCATAGTT CGGCACATGC AAACGTTTAA TTAGTTCCAG AACCTCGCTC TCTTCCGAGA AGCGCTTGTC ATATAGGAAC GACAGAAGCG TAATCAGGCC 12400
ATCGGAACCT ACGAAACTGG AAAGCGGCAG ATCGGCAACG GTGTATCGCT TGGTGGCAAA CCACCGGTCC TCATCGTGAT AGCACGCGTT CTTCCTGTCT 12500
GCGTTTGGGG ATGCGTGCCG ATGATGGACA AGGCGAAGAT GGTGCGCACT CCGCTGAATG TTATCGGGTT CGCTTCGTCC ATGGAGCCAC TCTACCCAGC 12600
CGTCATCAAC GCTGAGAATC GGCTTGCCGC ACTCGTCACA TGTCCACGGT GGATTGACTT CCATACTGCA TCCTTGTCGG TAGTTGAGCG TCTCCTTGTA 12700
TCGTAATCGA AGAGCCGCAG CAAAAATGGC GTCTGCGGCG CACGCGCGTG GGGCGCATGC CGCTATGCAA TTCCACGCTT CTCCCTACAT GAGGGTCCAA 12800
TATAGTCAAT TCTCATGCCC TTGAGCGCAG TGAGCGGCCC ACACACGCTT GACGGTGGTG ACGCTCGCGC CTGCCAGGCG CGCAGTGTTG GCGATTGTCT 12900
CTCCACGGCC GCGCAACGCG ACGATGCGAG CATGTACAGC TGCGTCTGCC GGGCGGCCAC GGTACTTGTC TCTGGCCTTC GCCAGCTCAA TGCCCTGACG 13000
CTGACGCTCA CGGCGGTCCT CGTAGTCGTC GCGAGCGATC TGCAGGGCGA TGCGCAGCAG CATGTCCTGC ATGGATTCGA GCACGACCTT TGCCACACCT 13100
TCCGCCTCGG CCGCCACTTC GGAGAAGTCG ACGACGCCGG GAACGGCGAG ACGCGCGCCC TTGGCGCGGA TAGACGCCAC CAGGCGCTCG GCTTCGACCA 13200
GCGGCAAGCG GCTTATCCGA GGGTCGGCAG GGATTGATAC AAAAAACAGC CAAATCTTGC CCAACCGCCT GATAATTGAA GAATTATCAC TCCGGCTTCC 13300
GGCACTGCGC ATGCCGTCTG GTTCGATGCG CCCATCGCCA GCGCCGCGCG ACGCCTGACC GAGCTGGCCC GCGCGCATGC TGGACCGCGA CCTCCACGTC 13400
TGGCGGTTTA GCGCTGGTCT CGCGCCATTT TATGAATTGT GATCGAGGAC GAAGTAACGC GTTGCCGCAG AAGTAGCACT TCTCTAACGC GGCGATTACT 13500
TCCGATTTTG TGCGCCGCAT ATTTACTAAA ATTTTACTGT AATTTAACTA TCAAACCAGA AGAAAGACCA TGAGAAAATT TAAATCGAAG TACATGCTTG 13600
GCGCCTGCAT CGTTGGAAGC ATCGTTAGCG CGGGTGCAGC GGCGCAGGTC GCTGCAATAA ACAATACTGG CCGTGCCGAG GCTATGTCTG CATGGGAAAC 13700
TACGCTCAAG CACGAGGCTC CGGCTATGGA GGGCTGCTTT AGCTCGACAT TTCCGGACAT GGGGTGGCAG GCGGTACGCT GTGGCGCACC GCCGAAGTTG 13800
GTCATGAAGC CACGACACGT TGCCAGCGGC AACGTTCGTA CCACCACCGA CAACCGTGTG CTCATCACCG GCAATGGCGA TGACTACGCT GCACGCACGG 13900
GCAGGCTCAC CCACTCAGCC GTCGGTTCGT TCCCGAGCGT AAGCGGCGTG ACCACTGGGG TAGTCCAATA CTCGCTGCAA ATCAATACCG ACAACGACAG 14000
CAACCCCGCC GCTTGCGCGC AGTTCGGTTT TTCTTCGTGC AAGACCTGGC AACAGTACGT CTACTCCAGC GACGCCGACG AAGACTCAAG TAATGGCTTT 14100
GACCCGGTCA TCTTCATTGA AAGCTGGGTA TATGCGGACA GTACCTCGGA ATACGATGCG GCGGGTTGCC CGTCCGGCTG GGATGCTTAC GAAGGCAACG 14200
CTTGCGTGAT TAACAGCGAA TCGGTGCGAG TGCCTCTGGT GCCAGTTTCG GGCATCGCTG GTGTCAAGCT GACCGGTTCT GCGACCTCGG GGGGCGTGGA 14300
TACCGTGTCG TTTTCGGTGA ACGGAAGGGC CTACAGCGTG AGTCAGCGCG CCTCTACGGT GGATATCAAT AAGATCTGGC GCCTGACCGA ATTCAACATC 14400
TTCGGCAACG GCGCCAGAAC TCAGACGGTG TCGTTCAATC GCGGCTCGCA CGTCACGGTA AACGTAGCCG TAAACGATGG TACGACAAAT GCCCCGACTT 14500
GCCTGGGCAA TGCCGGCAAG ACGTTCGAGC AGAACAACCT GACGCGCGGT AGTTGCTCCG TCTTTGGTGG CGCTTCGCCC GGCATTAGCT TTCCTCAAAG 14600
CAACTGACTA TTTGCGTCTG GGGAGGTTTC GTCCTTCCCA GACGTTACTT GTTCGCCAAC GGGGCGCGGT GGCCGCAGCT GCGCCAGGAG CTGTTCCAGC 14700
GTGCTCAGAC GTACCGTGGC CCGCAAGGTG TCCGCCTCGG CATGTTCCCG GCGCTGGCGC TCTTCGACCA GCTCGGTCCG CGCCGCAGCC AGTTCGGCGC 14800
GCACACCTTC CAGCGCCTGG ACATCTTTGG TCCAGCGCGC CTGCAGGCCT TGGTGCTCGG CGGCCGCCAG GCGCAGCGCG TCGCGCTCGC GCTGCTGGGC 14900
GTCGGCCCGC TGGCGTGTCT GCGTCAGTTC CCGCTCTAGC CGGGTATGGC GTTCCAGCCA CTGGCCGTTT TCCCGGTTGA GCTGCATCAG GTCGTGGTTC 15000
TTGGCGGTCA GCGCCTCGTT GGCCTGCCGT AGGGCAACCT GCAGTTCCTG TACCTGGTGC TCGTGCCGGC GTTGTTCCTG CTCACGCTGG TCCTTGACCG 15100
AGGTGCGGTA GTGCTCCAGG GCTTCCCTGG CGTGGTCGTG CTTCTGCTCC AGGGAGCGGG CGTGGCTTTG GTGCTCCGCC AGTCGCGCCG TGAGGCCCGC 15200
GATCCGCTCC TCCAACTGAG CCAGTTCGGT AGTGCGGCTG GCCACCTCGC TTCTGGCGGT GGCGCTCGCC TCGCATTCGG CCTGCAGTAC GGTCTCGCAG 15300
CGCTGCAGCT GCGTCGTGAG TGAGTCAGCC TCTTGCCGGG CCTGCTCCAG CGTTTGGGTG CGCTCCTGGA GCTGCGCCTG GAAGCGCGCC TGGGCCTGCG 15400
CCACGACCGT GTCGGCCTCG CCGTGCAACC GCTCGGCGAG GCGGGCGACG AGGTCTTGGA GTGCGTCGCT CACGGCCATC TTCGCGCCAA CGCCTTGCCC 15500
TTCCTCCTCT TCCAACTCCC TCAGATAACG GTGAATGGTG GTCTTGGAAC CGGTATTGCC GAGCGCCACC CGGACCGCAT CGACAGACGG ATTCTTGCCC 15600
GTTGCGCGCA GGCTATCGCG GGCACGCTGC ACATCGCTTT TGTACAGCCC AGACCGTGCC ATCCGTCACT CCACGTATTT CGTAACGTAT TACATACCAT 15700
GTAATTACGT ACCATTCAAC ACGTTAGATA AACCGGTGGC GGGTGTCGAG CTCACGCGGG ATAATCTAGA ATTACCCCGC CTTATGCGGA ATTTTCGGCT 15800
ACAGTCGCCC TGCGGCCGTA CGAAACGGCG GTTCGGAGCG ACGCGGTGAG CTCGGTCAAG CAGTATCTGG AAGCGGCAAC TCGCGCCAAC ACCGAGCGCG 15900
CGTATGCAGG GGCGATCCGT CACTTCGAGG TCGAATGGGG TGGTCATCTG CCGGCGACTG CGGAGCAGGT GGCGCGCTAT TTGGCCGCCT ACGCCGGGCA 16000
ACTCGCGCTA AACACGCTCA GGCACCGGCT CGCAGCACTG GCGCAGTGGC ACCAGGTGCA CGGCTTTGTC GACCCGACCC GGGCGCCGGT GGTGCGCCAA 16100
GTGCTCAAAG GCATCCAAAC CCTGCACCCG AGCGTCGAAA AACGCGCCAC GCCCCTGCAA CTCACCCAGC TGGGCCAGGT CACGACATGG CTGGAAGACG 16200
CTGCGGCTGC GGCCCAGTCT CGCGGCGATC GCGCCGCAGA ACTGCGGCAC CTGCGTGATC GCGCCTTGCT GCTGCTCGGC TTCTGGCGCG GCTTCCGCGG 16300
CGATGAACTC ACCCGCCTTC AGGTGAACCA TCTGCGCCTG GTGCCGGGCG AAGGCATGAC CTGCTTCCTG CCGCACAGCA AGAGCGATCG CCAGCACGCC 16400
GGCGCCACCT ACAAGGTCCC GGCGCTGTCG CGCTGGTGCC CGGTGGCTGC CACCATGACC TGGGTTGCGG CAGCCGCCCT CCACGAGGGG CCACTGTTTC 16500
GCGCAGTCAA CCAGTGGGGC GGGATTGCCG CCGCGCCACT CCACACCAAC AGCCTGGTGC CGCTGCTACG GCGGATCTTC CGCGAGGCGG GGCTGAGTTC 16600
TCCCAACGAC TACAGCGGCC ACTCGCTGCG GCGCGGCTTT GCCAGCTGGG CCAACGCCAA CGGCTGGGAC GTCAAGGCGC TGATGGAGTA CGTCGGCTGG 16700
CGCGACGTGC ACTCGGCGAT GCGTTACCTC GACGGGGCCG ATCCGTTCGC CCGCCAGCGC ATTGAGGCGA GCCTGTCGCC AGCCACACCA CCGCTGCTGG 16800
CCCTGGCGGC GCCGGCTCTC GATCCAGTGC CGACCACGGC GGTCGAAGCG ACCGTCACCC TGACGCGCTT CAACTCGCGT GTGCGCGGAC TGGCCAAAGC 16900
GCATCGGCTG ATCGAGCAAA TCTGCCTGCA ACCGCATCAG GCGCAGCGAC TCAACGCCGA CGGCACGCGC TACCGGTTGG CGATCGCCGC CGTTGACGAG 17000
GCGGCCTTTG AAGAAACCAT CGCAATGCTG CTCGACGAGA TGCACCGCAT CGCCGACAAC CACCAGTGCT TCCTCGCCGT CGCCTTGCGC GATGAAGCGG 17100
GCGGACGCCA TTGGGACTAA GCTGAGCATG ACGACGCTGC ACGAAACCGC CTACCCGCGC CTCAAGCCCG ATCCCACCGC CAAAGAACTC CAGGATATCT 17200
ACACGCCTAC TGCGGCGGAG CTGCAGTGCG TCCGGAACAT CGCTACCGGC CCGGCCACAC GGCTGGCCCT GCTACTGCAC CTGAAGCTGT TCCAGCGCCT 17300
GGGTTATTTC ACGCCCCTGA TCGAGGTGCC CGAACGTATC GTGCAGCATG TCGCTCAGAC GCTGAGAATG CGCCGCGTGC CCGCCGACCG GTTGGCGAGC 17400
TACGACACCT CCGGCGCCAA ACGCGGGCAT CTAGCCCAAC TGCGGGCGTT CCTCAACGTC TGTCCGCTCG ATGCGGCTGG GCGAGACTGG CTCGGCACCG 17500
TGGCTGAAAC GGCTGCGCAG ACCAAGCACA TCGTGCCCGA TATCGTGAAC GTGATGCTTG AAGAGCTGGT GCATCACCGC TTTGAGTTGC CGGCCTTCAG 17600
CACGCTCGAA CGCATCGCGA TCGCCGCCCG CGAACGTGTC CACGATGCGC ACTACCGCCA GATCGCCGAT GCGTTGTCAC CGACCATGCG GACGCTGATC 17700
GATAACCTGC TGCTGACACC ACCAGGCAGC CACCACAGTG ATTGGCACAC GCTCAAGCGC GAGCCCAAAC GCCCAACCAA CAAGGAAGTG CGCCATTACC 17800
TGCGGCACAT CCAGCGCCTG CGCATCCTGG CCGAGCAGCT GCCGCCGATC GATGTCTCGG TGCCCAAGCT CAAGCAATTC CGGGCGATGG CCCGCGCACT 17900
TGATGCCGCC GAGTTGGCCG AACTGGTCCC GATCAAGCGC TATGCGCTGG CCGCGATCTT CATCCGCTCC CAGTATCGCA AGACGCTCGA CGATGCGGCC 18000
GACCTGTTCA TCCGCCTGAT CCAGAACCTG GAGAACACCG CGCAGCAAAA GCTGATCGCT TATCAGCTCG AACACAGCAA ACGGGCCGAT GCCCTGATCG 18100
GCCAGCTCCG GGAGATCCTG CAGGCCTATC AGGTCGAGGG CACAGACACC GAGCGTGTGG GCGCGATCGC GGGCGTGCTG GTCGCCGACA TCGCTCTGCT 18200
GACGGCTGAG TGCGACGAGC ACATGGCCTA CGCCGGCCGC AATTATCTGC CATTCCTGCT GGCACCCTAT GGGACGTTGC GTCCGCTGCT GTTCAACTGC 18300
CTGGAAATCA TGGGCCTGCG TGCCGCCAGC CAGGACCCCA GCATGGAGCG CATGATCGGC GCTGTCCTGG CGCTGCGCAG TCAGCGCCGC GAGACGATCG 18400
ACGCCGCTAG CCTGGGCGTG GCCACAACCG ACCTGACGTG GTTATCGTCA GCCTGGCGCA AGCATGTGAT GCCCAAAGCC TTGGCCGCCG CCAGCCCCGG 18500
CTGGATCCAT CGCAAGTACT TCGAACTGGC GGTGCTCGCG CAGATCAAGG ACGAACTCAA ATCCGGCGAC CTGTACATCC CGCACGGCGA GCGTTACGAC 18600
GACTACCGCG AGCAACTGGT CGACGAGGCG ACACTCGCGC AGGAACTGGA CGCCTACGGC GAGGTCTCGG GCGTCGCCAC CGACGCGGCT GACTTCGTGC 18700
AGGGTCTGCG CACCGAACTG ACGACATTGG CCGATGCCGT GGACGCTCGG TTCTCGGACA ACCTGCACGC CAGCATGGTC GACGGGCGGC TGGTGCTCAA 18800
GCGCCTGCAG GGCGCACAGG TGACCCAGGC GATCGCCACA GTGGACAGCG CGATCACCGA CCGGCTGCCG CCGACCAGCA TCGTCGATGT CCTGGTCGAC 18900
ACCACCCGCT GGCTGGACCT GCACGTGCAC TTCCGCCCGA TCGCCGGCAC CGACGCGCGA GTCGACGATC TGCTACGGCG CGTGATCACC ACCCTGTTCT 19000
GCTACGGCTG CAATCTGGGG CCGACCCAGA CCGCGCGTTC GGTCAAGGGC TTCAGCCGGC GCCAGATCTC GTGGTTGAAC TTGAAGTACG TCACCGATGA 19100
AACGCTCGAC AAGGCGATCG TCCAGGTGAT CAACATGTAC AACAAATTCG AGTTACCCGG CTACTGGGGC AGTGGCAAGA GCGCCTCGGC CGACGGCACG 19200
AAGTGGAGCG TGTACGAGCA GAACCTACTG TCGGAATACC ACATTCGCTA CGGCGGCTAT GGTGGCATCG GTTACTATCA TGTGTCCGAC AAGTACATCG 19300
CGTTGTTCAG CCACTTCATC CCGTGTGGCA TACACGAAGC GGTCTACATC CTCGACGGGA TGCTGGCCAA CCGGTCCGAC ATCCAGCCCG ACACCGTACA 19400
CGGCGACACC CAGGCGCAGA GCTTCCCGGT ATTCGGGCTG GCGCATCTGC TGGGCATCAA CCTCATGCCG CGCATCCGCA ACATCAAGGA CCTGGTGTTC 19500
TCGCGGCCGG AGCCAGGTCG GACCTACGAG AATATCCAGG CGCTGTTTGG GGACAGCCTC GACTGGACAC TCATCGAGAC CCACGTGCAC GACATGCTGC 19600
GGGTCGCCAT CTCGATCAAA CTGGGCAAGA TCACCGCTTC CACCATCCTG CGCCGGCTCG GCACCTACAG CCGCAAGAAC AAGCTGTACT GGGCATTTCG 19700
CGAACTGGGC AAGGCGGTAC GCACACTGTT CCTGTTGCGC TACATCGACG ATGTCGAAGT GCGCAAGACC ATCCACGCCG CCACCAACAA GAGCGAGGAG 19800
TTCAACGGCT TCGTCAAATG GGCCTTCTTC GGCGGTGAAG GGATCATTGC CGAGAACGTC CAGCACGAAC AGCGCAAGAT CGTGCGTTAC AACCAGTTGG 19900
TGGCCAACCT GGTCATCCTG CACAACGTGG AGCAGATGAC CCGCGTGCTG GCCGAGCTGC GGGACGAGGG CTCAAACATC AGCCCGGAAG TGCTGGCCGG 20000
CCTGTCGCCG TATCGGACCA GCCATATCAA CCGGTTCGGT GACTACACCT TGGACCTCAA GCGGCAGGTC GAGCCAATCG ATTTTTCGCG GAGAATTCTT 20100
GCGGCGACAA CGCGTTAGGA CGCAATCAGC TTTTTTTTAC CGGGCGAATT CCTGCTTGCC CATCGCGGTC AAAAACGCAT AGCTGCAATA ATTACATCCT 20200
GGGAACAGGA GGACTAGAGT GGCCGCTCAA CCACCACCTC GGGTATCTCA TGATCATGGC TTCGCGCCTT TTGCGTTTAA CGCTCGGTGT CAGCGTCTGT 20300
GTGGCCGCAA CCAGCGTAGC AGCGCAAGCC ATCGCTCCCG AAACAACCGC ATCCTCCGCA ACCGACGCCG GTGGGCAACA GCAGGATCCC GCACCCACTG 20400
CGAGCTTCGA ACAGTGGCTG GCAGACTTCC GCCAACGTGC GCTTGCCGCC GGCATTGGAA CCACCACGCT GGATAACGCA TTGGCTGGTG TCACACCCGA 20500
TCCATCGGTG CATGAACTGG ATCAGCGGCA ACCCGAATTT ACCCAGTATC TGTGGGACTA TCTCAACGCG CGTGTCACAC CTTCGGCCAT CCAGGAAGGA 20600
CAGCAGCTGC TGATCAGCCA GCACGCCCTC TTCGAGAAGT TGCGGCAGCA CTACGGTGTG GATCCGGGCA TTCTGACCGC CATCTGGAGC ATGGAAAGCG 20700
GGTACGGGAA GCAGATCGGG GACTTTTATG TAATTCGCTC CCTGGCGACA CTGGCCCATG AGGGACGGCG CACCACGTAT GGCAACACGC AGCTGCTTGC 20800
GGCCCTGCAG ATCCTGCAGA CGGAAAAAAG CATTGATCGC TCGCAACTGG TGGGTTCGTG GGCCGGCGCC ATGGGACAGA CGCAATTCAT TCCGAGCACC 20900
TATCGCGACT ACGCCGTCGA CGAGGATGGC GACCAGAAGC GGGATGTCTG GAATTCCAAG GCGGACGCTT TGGGCTCTGC GGCGAATTAT CTCAAGCAGA 21000
ACAATTGGAC CAGCGCTGTT CCCTGGGGCC AGGAAGTCCA GCTGTCAGCC GGCTTCGACT ACGCGCAAGC CGACCTGACG ATCAAGAAGA CCGTCACCGA 21100
ATGGCAGCGC CTGGGCGTGG CGCCCCGCAA GCCGATTGCG CCCGCACTGG CTCAGCAGTT GGCATCGGTG TTGTTGCCCA CCGGATACCG CGGCCCGGCC 21200
TTCCTAGTGT TCGATAACTT CCGCAGCATT CTGCGCTACA ACAATTCCAC CGCCTATGCC CTGGCGGTCG GCTTGCTGGC AGACGGCTAT GCGGGCAGGG 21300
CCGGGGTCGA GCAACCCTGG CCCAAAGACG ATCCGCCGCT CAATAGCACG GCCCAGATCA CCGAGCTGCA GCAGAGGCTG ACCGACAAGG GATTCGACGT 21400
GGGCGGCATC GACGGTGTGC TGGGGGCGCA AACTCGCCAG GGAATCCGTG CATTCCAGCG TATCCAGCAA TTGCCGCAGG ACGGTTACGC CAGTACGTCG 21500
CTGCTGGCCC GTCTGCGGGC CGCCTGATCG GGGATGTACG TCGGAGGCTG CAATTCTTCA CCCACTTACA AGTACTGCAA GGCCATCAAT AATGGTGCGG 21600
GAATCTCAAA AATGGGTTGC ACTATCTCAA CGACAAACAA CGCCCCGCAC TCGCCGCGCC AAGAGGACGC GCCTCCCCTT CCGCCGCAGA CGCGGCAATC 21700
CTTTGTGGGG GTTGTAAATG GGCTGTTGAG CGATTTGCCC AAGCGTCGGC GTCGTGGTGG GTCGCTCTCC GATCCTGATA TCTCCCTCGC CGGCTACTTG 21800
CTGAGCAAAG CCGTCATCGG GGATCCGGTC GAGCCCCAGG ACATCCCTCG GCTGCACAAG GCGAACAACA CGGTGCAGAA GACACGGGCG CGCTTCCCAT 21900
ACGGACGCGG CAACGTCGCC ACCGACATTG CCGTCAGCGA TCACGCATCC AGTCAACATG CGCAGGCCGC ACATGATGTG TTTGTCGATC TGGTGCGCGG 22000
CGCGGCGCCG GCGTCAATGC TCACGAATCC GACGCTGGGG CATGCGGTCG TATCTGAATT CGTCCAGGGC GGTCATTGCG CCGGATATGC TGCCGTAGCA 22100
ACCATGCGAC ATGTACAGAA GCTTCAGCCA GAAGAAAGCG TCCACTATGT CCAGCACAAC CATCAGGGCC ATGACTGGGC TGAGTCGCGC GTGCCGGACG 22200
GACATCACAA GACCATCGTC CTAGACCCTT GGGCCCAAGG ACCGGCGGTG TTTGCGTCCG ACAGCAGATT TGCGGCAAAT GCTCAACACA CCCAAGAACG 22300
CTTGGCCTTG AATGCCAAGG ACGGAGATGA TATTGCCGCC AAGACGGCCG CAGGTGCGCA GTATCTTCTG GAAAACTGCC TCCCCCTGAC TGAAACACAC 22400
CTCAAGAGAC TGAAAGCCCA GGAGTTCCAT TGCGCGCCTG AGGAAGTCTG GCAACCGCAA CCTGTGGTAA GCGATGCGTT TCTTAGAAGG GTTCGGCAAA 22500
GTCTGGCTAC ATTGACCAAC TCTTCAGAGC TTGGCTTGTC GCGGGCAGAG CAGAGCAGCA AGGCTTTGAA GAAGATGTCC ATCAAATCGG CATGCGCACT 22600
GGGTTTCGGC AAGAAAGCAG CAAGCGCAGC TGCCGAAGGC ATTGCCGCTG CCGCATATCA GCTGGGCGAG CAGAGCCAGT AACCGGTGAT GGACCCAACC 22700
ATGCGGGTGG GTCACATCAG TCCAAGACTC TAAACCCAGC CAGTACTCAG GGAGGGGGAC GTCGAGCTAA GCATTTTTCA CAAACTCCTG CCGGCACTTT 22800
TGAGCGCAAT CTATTCATTA CCGGGCTTGG GCGCGATCAG TTCCTTCTGC TCAGGTAGCG GACCGGGACG GAAGCGCCCT CCCGCTCGCG CTTGCGACAC 22900
GGCAAACTGC ACATCATCTT CAGTATCGGA CTGACTCCTA ATCGAGCTTT GCCTCTTCAA TCCTTGCAAT ACACCCTGTG TCTCCCGATG AGTCGCCGAC 23000
TCTTGCGTCG CTTGCTCATT CGTGGTCCGA CCAGCGTGTT GCTCGACAAC ATATTCGCTC GCCTTCCCAC TGGCACCTGT GACGTTGAAG CATCCCATAT 23100
AAGAACTCCT ACTTGAGAGT GCGGGCTGCT AACTTCTCAC GAAATGCGTA AACCTGCTTT CGCCAAGCGA GAAACTGTTT GGACTAGCGA AACTGTGCCT 23200
AGTGATCGGG CGCAGCTGAA AACTAAACAA CGTGCCTGGA CCTCGTGGGT AAATGTGCTG CCGTGTGTAA ACACCACGTT ATTCAGTGGA ATGTGGATCC 23300
GGGAGATGTG GGGAGGTCGG TGTGCGAGGC TGCTATGGGG TCGATGCCGC CGCAGGCGTC ATGATCGGCG GGGCCACGTC CACGACCAGT ACGGCCACGC 23400
GCTAAAGGAG GCTACCCACC GCGTCGAGTC GGTCAAATTT CAATTCAGCG CGAGGCCTAT GCAGAGGGGC AGTAGAACTT CTCGAAAACC CTTTTCCGAC 23500
AAAAAGTTAG CTCATTTTTG GCTGTTTTTT ACACGAATCC CTGCCGACCC TC
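
The sequence above is laid out in 10-base blocks, with a ruler line and a running coordinate at the end of each full line. The sketch below shows one way to rebuild a contiguous string from that layout in Python. It assumes every token made up only of nucleotide letters is sequence data; `parse_sequence_lines` is a hypothetical helper name.

```python
# Rebuild a contiguous sequence from ruler-formatted lines.
# Assumption: data lines hold blocks of bases separated by spaces,
# optionally followed by the running coordinate of the last base.

NUCLEOTIDES = set("ACGTN")


def parse_sequence_lines(lines):
    """Concatenate base blocks, skipping ruler tokens and coordinates."""
    chunks = []
    for line in lines:
        for token in line.split():
            # Keep only tokens made purely of nucleotide letters.
            if set(token.upper()) <= NUCLEOTIDES:
                chunks.append(token.upper())
    return "".join(chunks)


# Ruler line, first data line, and last data line of the record.
# The two data lines are not adjacent, so this is a parsing demo only.
example = [
    "--------10 --------20 --------30 --------40 --------50",
    "GAGGGTCGGC AGGGATTCGT GTAAAACACA GCCAAAAGTG AGCTAACTCC "
    "CTGTCAGCTA AGGATTCCTC ACGACGCCCT GCTGACCCTC ATGCTGGGCT 100",
    "AAAAAGTTAG CTCATTTTTG GCTGTTTTTT ACACGAATCC CTGCCGACCC TC",
]

seq = parse_sequence_lines(example)
print(len(seq))
```

Applied to every line of the sequence block, the same function should give a string whose length can be compared with the stated sequence length.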

 Recombination Sites     

Name Coordinates Length Sequence
res_core_IRL 7183-7190 8 CGGGGTGT
res_acc_IR2 15695-15703 9 TACCATGTA
res_acc_IR2 15752-15765 14 TCACGCGGGA TAAT
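
The coordinates in this table are 1-based and inclusive, so the number after each range equals end - start + 1. The Python sketch below extracts the first site from the sequence line that covers positions 7101-7200. The fragment offset and function names are illustrative assumptions.

```python
# Slice a site out of a sequence fragment using 1-based, inclusive coordinates.
# Assumption: FRAGMENT is the record's sequence line covering 7101-7200.

FRAGMENT_START = 7101
FRAGMENT = (
    "CCATGTCTGCACCAATGATGGTGCGCAGCTGCGCCTGTTCGTCAGAGGTG"
    "ATCTGAAAGAGGTTGATGAGGGTGTCGTTTTTCGGGGTGTAGAGCGGGGC"
)


def site_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive range."""
    return end - start + 1


def extract(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return fragment bases for absolute coordinates start..end (inclusive)."""
    lo = start - fragment_start          # convert to a 0-based offset
    hi = end - fragment_start + 1        # Python slices exclude the end index
    if lo < 0 or hi > len(fragment):
        raise ValueError("range falls outside the fragment")
    return fragment[lo:hi]


core = extract(FRAGMENT, FRAGMENT_START, 7183, 7190)
print(core, site_length(7183, 7190))
```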

 ORFs     
ORF Summary
Gene Name                   Associated TE  Coordinates  Class           Sub Class            Orientation
tnp                         TnPsy42        1073-1288    Transposase                          +
WP_004666662.1              TnPsy42        1356-1733    Passenger Gene  Hypothetical         +
ARO44905.1                  TnPsy42        1688-1882    Passenger Gene  Hypothetical         -
PIN                         TnPsy42        1894-2304    Passenger Gene  Toxin                -
abrB                        TnPsy42        2286-2537    Passenger Gene  Antitoxin            -
tnpR                        TnPsy42        2717-3289    Accessory Gene  Resolvase            +
tnpA                        TnPsy42        3246-6347    Transposase                          +
repA2                       TnXax1.1       6754-8247    Passenger Gene  Other                -
parC                        TnXax1.1       8426-8791    Passenger Gene  Other                -
WP_014125906.1              TnXax1.1       9211-9585    Passenger Gene  Hypothetical         -
M48 family metalloprotease  TnXax1.1       9582-10226   Passenger Gene  Other                -
WP_014125904.1              TnXax1.1       10223-10726  Passenger Gene  Hypothetical         -
WP_014125902.1              TnXax1.1       11650-12066  Passenger Gene  Hypothetical         -
WP_014125901.1              TnXax1.1       12170-12664  Passenger Gene  Hypothetical         -
parA                        TnXax1.1       12806-13378  Passenger Gene  Other                -
WP_014125899.1              TnXax1.1       13570-14607  Passenger Gene  Hypothetical         +
tnpT                        TnXax1.1       14595-15662  Accessory Gene  Resolvase            -
tnpS                        TnXax1.1       15846-17120  Accessory Gene  Resolvase            +
tnpA                        TnXax1.1       17092-20118  Transposase                          +
mltB                        TnXax1.1       20250-21527  Passenger Gene  Plant Pathogenicity  +
avr                         TnXax1.1       21534-22682  Passenger Gene  Plant Pathogenicity  +
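
Several neighbouring ORFs in the summary share coordinates, for example tnpR with the first tnpA, and tnpS with the second tnpA. The Python sketch below computes such overlaps for a hand-picked subset of rows. The tuple layout and function name are assumptions for illustration.

```python
# Compute overlaps between consecutive ORFs (1-based inclusive coordinates).
# Assumption: ORFS is a hand-copied subset of the summary rows, in order.

ORFS = [
    ("tnpR", 2717, 3289, "+"),
    ("tnpA", 3246, 6347, "+"),
    ("tnpS", 15846, 17120, "+"),
    ("tnpA", 17092, 20118, "+"),
    ("mltB", 20250, 21527, "+"),
]


def overlap(a, b) -> int:
    """Number of positions shared by two (name, start, end, strand) rows."""
    shared = min(a[2], b[2]) - max(a[1], b[1]) + 1
    return max(0, shared)


# Pair each row with the next one in the list.
overlaps = [(a[0], b[0], overlap(a, b)) for a, b in zip(ORFS, ORFS[1:])]

for left, right, n in overlaps:
    print(f"{left} / {right}: {n} bp")
```

Only rows that are consecutive in `ORFS` are compared here. A full check would sort all rows by start coordinate first.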

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp TnPsy42 216 1073-1288 +
Class:   Transposase
Transposase Chemistry:   DDE
Sequence Family:  IS3_family
Comment:   99.5% identical to reference sequence TnCentral: tnp(TnPsy42); similar to TnXac3_aa3 rve_3 superfamily; BLAST ID: NAS61029.1
Protein Sequence:  
MGHCGDNAAC EGFFGLLKRE RIRRQIYPTK DAARAEVFDY IEMFYNPNRR HGSTGDLFPV EFERRYAQRG S
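
The gene length and the protein sequence in an entry can be cross-checked. The Python sketch below assumes the listed gene length includes the stop codon, so a complete ORF of n nucleotides encodes n/3 - 1 residues. That assumption holds for the tnp entry above but is not stated by the record itself.

```python
# Check that a listed protein matches its gene coordinates.
# Assumption: the coordinate range includes the stop codon.


def expected_residues(start: int, end: int) -> int:
    """Residue count implied by 1-based inclusive ORF coordinates."""
    nt = end - start + 1
    if nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return nt // 3 - 1  # drop the stop codon


# Protein sequence of tnp as printed in the record (10-residue blocks).
TNP_PROTEIN = (
    "MGHCGDNAAC EGFFGLLKRE RIRRQIYPTK DAARAEVFDY "
    "IEMFYNPNRR HGSTGDLFPV EFERRYAQRG S"
)

residues = TNP_PROTEIN.replace(" ", "")
print(len(residues), expected_residues(1073, 1288))
```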

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_004666662.1 WP_004666662.1 TnPsy42 378 1356-1733 +
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MGDKTDKNTI AWLAQPEEHD YPAAQSYLNL LYDDAHCAKL VRKLHAAPMS AFKAKDILRA SGLSPLGMSN AHVERDLKKI QSGTALSPLL LVRQEGQRTV
VADGYHRLCA VYSFDEDASI PCKIV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
ARO44905.1 ARO44905.1 TnPsy42 195 1688-1882 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MNEFRRLATK IDQHMKQLAT QGVSEAHSII NHMMGYVPDN RAAGLGRGSA HTILQGIEAS SSKL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
PIN PIN TnPsy42 411 1894-2304 -
Class:   Passenger Gene
Sub Class:   Toxin
Target:   single stranded RNA
Sequence Family:  PIN (Pfam:PF01850)
Comment:   tRNA(fMet)-specific endonuclease
Protein Sequence:  
MGRQAMKVAV DTNVLVRAVV RDDPAQADVA AAVLTDAELI AVALPCLCEF VWVLMRVYGF QQSDAADAIR ALLDAANVEV NRPAVEAGLL VLDAGGDFAD
GVIAYEGNWL GGETFVSFDK KAVTLLSVQG QSARLL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
abrB AbrB TnPsy42 252 2286-2537 -
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  abrB
Protein Sequence:  
MTTLTVTARG QVTFRKDVLQ HLGIRPGDKI ELDLLPDGRG VLKAARPAGT IASFVGLLAG KTQKVATIEE INEAAAQGWA GKQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnPsy42 573 2717-3289 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MALIGYARVS TAEQDTALQT DALRKAGCER VFEDTASGAK ADRPGLADAL AYLRNGDVLA VWRLDRLGRS MPHLIETIGA LEARGVGFRS LTEAIDTTTP
GGRLIFHVFG ALGQFERDLI RERTKAGLSA AAARGRKGGR KPVITADKLQ QARKHIANGL NVREAATRLK VSKTALYAAL QSTSARDLQK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnPsy42 3102 3246-6347 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MQLCNPPAHE ISKNDNRLTR SGTVMAAKSE RLTILSDAEQ EALYGLPDFD DTQRLEYLAL TDAELALASS RPGMPAQVYC VLQIGYFKAK HAFFRFDWNE
VEDDCAFVLR RYFTDEPFEC KAVTKHEHYT QRERIAELFC YLPWVASFLP QLAQQAALIV RRDVIPGFIA TELIVWLNEH KIIRPGYTTL QELVSEALSA
ERRRLGGLLA EVLDDPARAA MAQLLVRDDT LSQLAALKQD AKDFGWRQMA REREKRATLE PLHSIAKALL PKLGVSQQNL LYYASLANFY TVHDLRNLKA
DQSHLYLLCY AWVRYRQLSD NLVDAMAYHM KQLEDQSSAG AKQSFVAEQV RRQQDTPQVG RLLSLYIDDS VPDPTPFGDV RQRAYKIMPR DALLTTAQRM
SAKPVSKLAL HWQAVDGLAE RIRRHLRPLY VALDLAATNP DSPWLVALAW AKGVFAKQQR LSQRPLAECP AATLPKRLRP YLLTFDADGK STGLHADRYE
FWLYRQVRKR FQSGELYLDD SLQHRHFSDE LVSVEEKADA LAKMDIPFLR QPVDAQLDAL AAELHLQWLA LNRALKQGKL THLEYDKATH KLIWRKPKGE
NQKAREKAFY EQLPFCDVAD VFRFVDGQCQ FLSALTPLQP RYAKKVADAD SLMAVIIAQA MNHGNQVMAR TSDIPYHVLE SAYQQYLRHA TLHAANDCIS
NAIAQLPIFP YYSFDLNTLY GAVDGQKFGV ERPTVKARYS RKYFGRGKGV VAYTLLCNHV PLNGYLIGAH DYEAHHVFDI WYRNTSDIVP TAITGDMHSV
NKANFAILHW FGPRFEPRFA NLDDQLKELY CADDLAQYEN CLIRPIGKID RDLIIDEKPN IDRIVATLGL KEMTQGTLIR KLCTYTAQNP TRRAIFEFDK
LVRSIYTLRY LRDPQLERNV HRSQNRIESY HQLRSTVAQV GGKKELTGRT DIEIEISNQC ARLIANAVIY YNSAILSRLL TKYEASGNAN ALALITQMSP
AAWRHILLNG HYTFQSDGKM LDLDALVAEL ELG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
repA2 RepA2 TnXax1.1 1494 6754-8247 -
Class:   Passenger Gene
Sub Class:   Other
Function:   plasmid replication initiation protein
Protein Sequence:  
MGTKTQAQLA LFSIDDEAQT YHDAGRTGFF SLLVDQRGEK RQSSHKLTDM PAVLGLIDKD RDTWMTQAEF MRPNRRVVNL LRIGLLFADI DTYRQPWAVE
RTPEQLAAAV LFHCADEGLP TPSLIVYSGR GLQAKWLMDG TIPRQALPRW NACQRYLIDR LAALGADPQA KDASRVLRLV GTVNTKSGNI CRVVHVESGQ
DGHPIRYNFE YLAEALLPKA RWTIEQQRQD RADRRERQQL RLLPGGKSDN LRGFSGRQLA WDRLEDLRTL ATLRGGVREG ERMQHLFWRL NFLLLSGATH
SGQMYHEAAA LAGELDAGWN YRSKELMTLY SKAKAYEAGE KVTLGDKEFA PLYTPKNDTL INLFQITSDE QAQLRTIIGA DMARERDRKR HEAKRRAAGA
VDRETYLDAA QAKRKQAQAL KAEGLSVRVI AQRMGVSVGS VAGYLKAPQE RSRSVRITAD QSSIGAGIAG TADSAQHDVQ SPSVLLMAEP RGRDLRA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parC ParC TnXax1.1 366 8426-8791 -
Class:   Passenger Gene
Sub Class:   Other
Comment:   putative partitioning-associated protein ParC
Protein Sequence:  
MGHDPAQIQQ CGDSPVIAIE PKALQEAADA HLVRGAEIRA AEDGEGLVVI VDLGEDRRVV GLARNRGVRY FQSFDGAAAL LLERGISKFA ANTVGWTPRT
QPKWMKHRGH NCEGLIAAAS I

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_014125906.1 WP_014125906.1 TnXax1.1 375 9211-9585 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MNDISRNLRR ASIAVAGFGL GGLIFAAVFA RPQLWETPFF LTGWLPIAGV MFYGGIAACR SAPRAGADDT RPPAPPVARF LRLQLMGVVG VGLTGAAILL
VAIGGSLALL ALIVAAALYL VIFS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
M48 family metalloprotease M48 family metalloprotease TnXax1.1 645 9582-10226 -
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MTTATRLQLQ ATVDAIAAHA YIAPPRVRLW RHRNARYNAL THTIVVSQTL ALTLDANQRY TLAHEVGHAQ RRATMLKRVG SYFWPPALAL VVGAMTAAAV
GSFAAKPLAI TAPQTLCALV LGIVAAVVAG QLAERTDRRA RRDSYAEELR ADRFANRMAG DPAAMTAVLH ACARIEDGGE LGAEAARRIA FARDSSTRTM
RRLPICCTTH LEPR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_014125904.1 WP_014125904.1 TnXax1.1 504 10223-10726 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MIEDQTSMFD RSIMTSPISI TTLAALARSG MLRTVQLVGQ VGGYTVMVRV GQTEKQLTGQ RGEPRVFASL DTAAAQLLAL GVTVFEVLPA NYVRAPKVYR
RGRPAALKAL AKIQREGKPA KSKPARKPAG AATVATAAKI LPSDRAQLKL PGLPRTRKAA RATSASP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_014125902.1 WP_014125902.1 TnXax1.1 417 11650-12066 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MQEEQITPLQ HNMRVLVHLS RHESYCDITF HNRDPLFGVR LSPALNAALM YGAGAQKMTE LFDRIETRSG AVFRAVDVWV IVEFPNGLPS DQDLSGVDLA
DGDAEVAPGV SMRQMAKEVY RCGDDAEAEK MLRRILAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_014125901.1 WP_014125901.1 TnXax1.1 495 12170-12664 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MEVNPPWTCD ECGKPILSVD DGWVEWLHGR SEPDNIQRSA HHLRLVHHRH ASPNADRKNA CYHDEDRWFA TKRYTVADLP LSSFVGSDGL ITLLSFLYDK
RFSEESEVLE LIKRLHVPNY EAARHYFEAA IANGVFEPGS APSYYDQEEM RAVLNWAQEQ QEQA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parA ParA TnXax1.1 573 12806-13378 -
Class:   Passenger Gene
Sub Class:   Other
Function:   site-specific recombinases, DNA invertase Pin homologs
Comment:   might be parB
Protein Sequence:  
MRAGQLGQAS RGAGDGRIEP DGMRSAGSRS DNSSIIRRLG KIWLFFVSIP ADPRISRLPL VEAERLVASI RAKGARLAVP GVVDFSEVAA EAEGVAKVVL
ESMQDMLLRI ALQIARDDYE DRRERQRQGI ELAKARDKYR GRPADAAVHA RIVALRGRGE TIANTARLAG ASVTTVKRVW AAHCAQGHEN

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_014125899.1 WP_014125899.1 TnXax1.1 1038 13570-14607 +
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MRKFKSKYML GACIVGSIVS AGAAAQVAAI NNTGRAEAMS AWETTLKHEA PAMEGCFSST FPDMGWQAVR CGAPPKLVMK PRHVASGNVR TTTDNRVLIT
GNGDDYAART GRLTHSAVGS FPSVSGVTTG VVQYSLQINT DNDSNPAACA QFGFSSCKTW QQYVYSSDAD EDSSNGFDPV IFIESWVYAD STSEYDAAGC
PSGWDAYEGN ACVINSESVR VPLVPVSGIA GVKLTGSATS GGVDTVSFSV NGRAYSVSQR ASTVDINKIW RLTEFNIFGN GARTQTVSFN RGSHVTVNVA
VNDGTTNAPT CLGNAGKTFE QNNLTRGSCS VFGGASPGIS FPQSN

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpT TnpT TnXax1.1 1068 14595-15662 -
Class:   Accessory Gene
Sub Class:   Resolvase
Protein Sequence:  
MARSGLYKSD VQRARDSLRA TGKNPSVDAV RVALGNTGSK TTIHRYLREL EEEEGQGVGA KMAVSDALQD LVARLAERLH GEADTVVAQA QARFQAQLQE
RTQTLEQARQ EADSLTTQLQ RCETVLQAEC EASATARSEV ASRTTELAQL EERIAGLTAR LAEHQSHARS LEQKHDHARE ALEHYRTSVK DQREQEQRRH
EHQVQELQVA LRQANEALTA KNHDLMQLNR ENGQWLERHT RLERELTQTR QRADAQQRER DALRLAAAEH QGLQARWTKD VQALEGVRAE LAAARTELVE
ERQRREHAEA DTLRATVRLS TLEQLLAQLR PPRPVGEQVT SGKDETSPDA NSQLL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpS TnpS TnXax1.1 1275 15846-17120 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transposase Chemistry:   Tyrosine
Sequence Family:  Tyrosine Site-Specific Recombinase
Protein Sequence:  
VSSVKQYLEA ATRANTERAY AGAIRHFEVE WGGHLPATAE QVARYLAAYA GQLALNTLRH RLAALAQWHQ VHGFVDPTRA PVVRQVLKGI QTLHPSVEKR
ATPLQLTQLG QVTTWLEDAA AAAQSRGDRA AELRHLRDRA LLLLGFWRGF RGDELTRLQV NHLRLVPGEG MTCFLPHSKS DRQHAGATYK VPALSRWCPV
AATMTWVAAA ALHEGPLFRA VNQWGGIAAA PLHTNSLVPL LRRIFREAGL SSPNDYSGHS LRRGFASWAN ANGWDVKALM EYVGWRDVHS AMRYLDGADP
FARQRIEASL SPATPPLLAL AAPALDPVPT TAVEATVTLT RFNSRVRGLA KAHRLIEQIC LQPHQAQRLN ADGTRYRLAI AAVDEAAFEE TIAMLLDEMH
RIADNHQCFL AVALRDEAGG RHWD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnXax1.1 3027 17092-20118 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MKRADAIGTK LSMTTLHETA YPRLKPDPTA KELQDIYTPT AAELQCVRNI ATGPATRLAL LLHLKLFQRL GYFTPLIEVP ERIVQHVAQT LRMRRVPADR
LASYDTSGAK RGHLAQLRAF LNVCPLDAAG RDWLGTVAET AAQTKHIVPD IVNVMLEELV HHRFELPAFS TLERIAIAAR ERVHDAHYRQ IADALSPTMR
TLIDNLLLTP PGSHHSDWHT LKREPKRPTN KEVRHYLRHI QRLRILAEQL PPIDVSVPKL KQFRAMARAL DAAELAELVP IKRYALAAIF IRSQYRKTLD
DAADLFIRLI QNLENTAQQK LIAYQLEHSK RADALIGQLR EILQAYQVEG TDTERVGAIA GVLVADIALL TAECDEHMAY AGRNYLPFLL APYGTLRPLL
FNCLEIMGLR AASQDPSMER MIGAVLALRS QRRETIDAAS LGVATTDLTW LSSAWRKHVM PKALAAASPG WIHRKYFELA VLAQIKDELK SGDLYIPHGE
RYDDYREQLV DEATLAQELD AYGEVSGVAT DAADFVQGLR TELTTLADAV DARFSDNLHA SMVDGRLVLK RLQGAQVTQA IATVDSAITD RLPPTSIVDV
LVDTTRWLDL HVHFRPIAGT DARVDDLLRR VITTLFCYGC NLGPTQTARS VKGFSRRQIS WLNLKYVTDE TLDKAIVQVI NMYNKFELPG YWGSGKSASA
DGTKWSVYEQ NLLSEYHIRY GGYGGIGYYH VSDKYIALFS HFIPCGIHEA VYILDGMLAN RSDIQPDTVH GDTQAQSFPV FGLAHLLGIN LMPRIRNIKD
LVFSRPEPGR TYENIQALFG DSLDWTLIET HVHDMLRVAI SIKLGKITAS TILRRLGTYS RKNKLYWAFR ELGKAVRTLF LLRYIDDVEV RKTIHAATNK
SEEFNGFVKW AFFGGEGIIA ENVQHEQRKI VRYNQLVANL VILHNVEQMT RVLAELRDEG SNISPEVLAG LSPYRTSHIN RFGDYTLDLK RQVEPIDFSR
RILAATTR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
mltB MltB TnXax1.1 1278 20250-21527 +
Class:   Passenger Gene
Sub Class:   Plant Pathogenicity
Target:   cell wall
Comment:   Membrane-bound lytic murein transglycosylase B precursor
Protein Sequence:  
MIMASRLLRL TLGVSVCVAA TSVAAQAIAP ETTASSATDA GGQQQDPAPT ASFEQWLADF RQRALAAGIG TTTLDNALAG VTPDPSVHEL DQRQPEFTQY
LWDYLNARVT PSAIQEGQQL LISQHALFEK LRQHYGVDPG ILTAIWSMES GYGKQIGDFY VIRSLATLAH EGRRTTYGNT QLLAALQILQ TEKSIDRSQL
VGSWAGAMGQ TQFIPSTYRD YAVDEDGDQK RDVWNSKADA LGSAANYLKQ NNWTSAVPWG QEVQLSAGFD YAQADLTIKK TVTEWQRLGV APRKPIAPAL
AQQLASVLLP TGYRGPAFLV FDNFRSILRY NNSTAYALAV GLLADGYAGR AGVEQPWPKD DPPLNSTAQI TELQQRLTDK GFDVGGIDGV LGAQTRQGIR
AFQRIQQLPQ DGYASTSLLA RLRAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
avr Avr TnXax1.1 1149 21534-22682 +
Class:   Passenger Gene
Sub Class:   Plant Pathogenicity
Target:   cell wall
Comment:   avirulence protein
Protein Sequence:  
MYVGGCNSSP TYKYCKAINN GAGISKMGCT ISTTNNAPHS PRQEDAPPLP PQTRQSFVGV VNGLLSDLPK RRRRGGSLSD PDISLAGYLL SKAVIGDPVE
PQDIPRLHKA NNTVQKTRAR FPYGRGNVAT DIAVSDHASS QHAQAAHDVF VDLVRGAAPA SMLTNPTLGH AVVSEFVQGG HCAGYAAVAT MRHVQKLQPE
ESVHYVQHNH QGHDWAESRV PDGHHKTIVL DPWAQGPAVF ASDSRFAANA QHTQERLALN AKDGDDIAAK TAAGAQYLLE NCLPLTETHL KRLKAQEFHC
APEEVWQPQP VVSDAFLRRV RQSLATLTNS SELGLSRAEQ SSKALKKMSI KSACALGFGK KAASAAAEGI AAAAYQLGEQ SQ

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
TnPsy42-KX009060.1 TnPsy42 Transposon 713-6379 5667
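
Nesting check (illustrative): the Associated TE column of the gene table seems to follow from whether a gene's coordinates fall inside the internal TnPsy42 interval listed above. A minimal sketch of that containment test is shown below. The `Interval` type and variable names are hypothetical, and only three genes from the record are used as examples.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    def length(self) -> int:
        """Inclusive length of the interval in bp."""
        return self.end - self.start + 1

    def contains(self, other: "Interval") -> bool:
        """True if `other` lies entirely within this interval."""
        return self.start <= other.start and other.end <= self.end


# Internal TE coordinates as listed in the table above.
tnpsy42 = Interval(713, 6379)

# A few gene coordinates copied from the gene table.
genes = {
    "tnpR": Interval(2717, 3289),   # listed with TnPsy42
    "tnpA": Interval(3246, 6347),   # listed with TnPsy42
    "repA2": Interval(6754, 8247),  # listed with TnXax1.1
}

nested = {name: tnpsy42.contains(iv) for name, iv in genes.items()}
print(tnpsy42.length(), nested)
```

Under this check, genes inside 713-6379 are the ones the record associates with TnPsy42, while genes outside belong only to the enclosing TnXax1.1.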

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRL TnPsy42 713-750 GGGGTTGTAA GCCGGAACCG CCGAAATTTC CGCCACCC
IRR TnPsy42 6342-6379 CCTACTGCCT TTAAAGCCGC CAAGGCCGAA TGTTGGGG
IRL TnXax1 9056-9100 AATCGAGTGA AAACCGACAA AAATGTGCTT AGGGACGGCT GGGAG
IRL TnXca1 13219-13247 GAGGGTCGGC AGGGATTGAT ACAAAAAAC

 References