Transposon
Name: IS1133::Tn10
Family: IS3        Group: IS3
Evidence of Transposition: no
 Host     

Host Organism:Erwinia amylovora

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 27 bp)TGGTCTTGCCTCGGTTTCGTTCCGGTC
IRR (Length: 27 bp)TGGTCTTGCACCGGTTGTGTTCCGGTC

 Sequence     
DNA SequenceLength  11454 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGGTCTTGCC TCGGTTTCGT TCCGGTCTTT TTGAGAGCAT TATTTGTTAT CAAGGAGACC ATATATGTCA TTAAAGCATA GTGATGAATT TAAGCGTGAT 100
GCAGTTCGCA TAGCACTCAC TAGTGGCTTA ACACGCCGTC AAGTTGCGTC AGATTTAAGT ATTGGGCTTT CCACGCTTGG GAAATGGATC GCATCAATTT 200
CCGATGAAAC TAAAATTCCT ACCCAAGACA CTGATCTTCT GCGTGAGAAT GAACGTTTAC GCAAAGAGAA CGGCTTTGTT GAATAAATCA GATTTCGGGT 300
AAGTCTCCCC CGTAGCGGGT TGTGTTTTCA GGCAATACGC ACGCTTTCAG GCATACCTGC TTTCGTCATT TTGTTCAGCG CTCGTACCAG GGCCATAGCC 400
TCCGCAACCT GACCATCGTA GTCACGCAGC GTCAGTGAAC CCCCGAACAG CTGTTTTACC CGGTACATCG CCGTTTCCGC TATCGAGCGA CGGTTGTAAT 500
CTGTTGTCCA TTTCCACCGC GCATTACTCC CGGTCATTCG CTGATTAGCC ACTGCACGGT TACGGTCTGC ATATTCACCG GGCCAGTAAC CCGCACCTTT 600
TCGGGGAGGG ATAAGCGCGC TGATTTTCTT ACGCCGCAGT TCATCGTGAC ATAGCCGGGT ATCGTAAGCG CCATCGGCGG CGGCTGACCT GATTTTCCGG 700
TGGGTTTGCC GGATTAACCC GGGGAAGGCC TCTGAGTCCG TAACGTTGTT CAGCGACAGG TCAGCGCAGA TGATTTCATG TGTTTTACTG TCAACGGCGA 800
GATGCAGCTT ACGCCAGATA CGGCGGCGTT CCTGGCCATG CTTTTTGACT TTCCACTCGC CTTCACCGAA GACCTTCAGC CCGGTGGAAT CAATTACCAG 900
GTGTGCGATT TCACCCCGGG TGGGCGTTTT GAAACTGACA TTAACCGACT TTGCCCGCCT GCTGACACAG CTGTAATCCG GGCAGCGTAG CGGAACGTTC 1000
ATCAGAGAAA AAATGGAATC AATAAAGCCC TGCGCAGCGC GCAGGGTCAG CCTGAATACG CGTTTAATGA CCAGCACAGT CGTGATGGCA AGGTCAGAAT 1100
AGCGCTGAGG TCTGCCTCGT GAAGAAGGTG TTGCTGACTC ATACCAGGCC TGAATAGCTT CATCATCCAG CCAGAAAGTT ATGGAGCCAC GGTTGATGAG 1200
GGCTTTATTG TAGGTGGGCC AGTTGGTGAT TTTGAACTTT TGCTTTGCCA CGGAACGGTC TGCGTTGTCG GGAAGATGCG TGATCTGATC CTTCAACTCA 1300
GCAAAAGTTC GATTTATTCA ACAAAGCCAA AGAGAACCGT ATCCTTCGGG AGGAGAGGGA GATATTAAAA AAGGCAGCAA TATTTTTCGC AGTACAAAAG 1400
CTGTGAGATT TCAGTTTATT ACGGATTACC GTGGCTCTCT CTCACGTTCA CGCATATGTC GTTTGATGGG CGTAACAGAT CGTGGTTTAC GCGCATGGAA 1500
ACGCCGTCCT CCATCACTGC GCCAGCGTCG TGATCTTATA CTTCTAGCGC ATATACGTGA GCAGCATCGG TTGTGTTTGG GGAGCTATGG TAGGCCGCGT 1600
ATGACAGAAG AGTTGAAAGC GCTGGGCCTG CAGGTTGGGC AGCGTCGGGT TGGACGTTTG ATGCGCCAGA ATAACATTAC AGTTGTTCGA ACGCGTAAAT 1700
TCAAACGGAC AACGGATAGT CATCATACCT TCAACATTGC ACCGAACCTA TTAAAACAAG ACTTTAGCGC AAGCGCACCC AACCAGAAAT GGGCAGGCGA 1800
TATCACTTAT GTTTGGACCA GAGAAGGATG GGTCTATCTT GCTGTTATCC TTGACCTGTA TTCCCGTCGC GTGATTGGCT GGGCAACAGG TGATCGATTA 1900
AAGCAGGATC TTGCATTAAG GGCACTGAAT ATGGCGTTGG CTTTACGCAA ACCACCACCG GGTTGTATTC AACACACAGA CCGTGGGAGC CAATATTGCG 2000
CTCATGAATA TCAAAAGCTA CTGCTCAAAC ATCAATTGCT GCCGTCCATG AGCGGGAAAG GCAATTGTTT TGATAACTCC GCAGTAGAAA GCTTCTTTAA 2100
ATCATTAAAG GCTGAGTTGA TTTGGCGCAG ACACTGGCAA ACAAGGCGAG ATATTGAGAT TGCAATCTTC GAATATATAA ATGGCTTTTA TAATCCACGC 2200
CGAAGACATT CAACACTCGG CTCTGATGAA TCCCCTAATG ATTTTGGTAA AAATCATTAA GTTAAGGTGG ATACACATCT TGTCATATGA TCAAATGGTT 2300
TCGCGAAAAA TCAATAATCA GACAACAAGA TGTGCGAACT CGATATTTTA CACGACTCTC TTTACCAATT CTGCCCCGAA TTACACTTAA AACGACTCAA 2400
CAGCTTAACG TTGGCTTGCC ACGCATTACT TGACTGTAAA ACTCTCACTC TTACCGAACT TGGCCGTAAC CTGCCAACCA AAGCGAGAAC AAAACATAAC 2500
ATCAAACGAA TCGACCGATT GTTAGGTAAT CGTCACCTCC ACAAAGAGCG ACTCGCTGTA TACCGTTGGC ATGCTAGCTT TATCTGTTCG GGCAATACGA 2600
TGCCCATTGT ACTTGTTGAC TGGTCTGATA TTCGTGAGCA AAAACGACTT ATGGTATTGC GAGCTTCAGT CGCACTACAC GGTCGTTCTG TTACTCTTTA 2700
TGAGAAAGCG TTCCCGCTTT CAGAGCAATG TTCAAAGAAA GCTCATGACC AATTTCTAGC CGACCTTGCG AGCATTCTAC CGAGTAACAC CACACCGCTC 2800
ATTGTCAGTG ATGCTGGCTT TAAAGTGCCA TGGTATAAAT CCGTTGAGAA GCTGGGTTGG TACTGGTTAA GTCGAGTAAG AGGAAAAGTA CAATATGCAG 2900
ACCTAGGAGC GGAAAACTGG AAACCTATCA GCAACTTACA TGATATGTCA TCTAGTCACT CAAAGACTTT AGGCTATAAG AGGCTGACTA AAAGCAATCC 3000
AATCTCATGC CAAATTCTAT TGTATAAATC TCGCTCTAAA GGCCGAAAAA ATCAGCGCTC GACACGGACT CATTGTCACC ACCCGTCACC TAAAATCTAC 3100
TCAGCGTCGG CAAAGGAGCC ATGGGTTCTA GCAACTAACT TACCTGTTGA AATTCGAACA CCCAAACAAC TTGTTAATAT CTATTCGAAG CGAATGCAGA 3200
TTGAAGAAAC CTTCCGAGAC TTGAAAAGTC CTGCCTACGG ACTAGGCCTA CGCCATAGCC GAACGAGCAG CTCAGAGCGT TTTGATATCA TGCTGCTAAT 3300
CGCCCTGATG CTTCAACTAA CATGTTGGCT TGCGGGCGTT CATGCTCAGA AACAAGGTTG GGACAAGCAC TTCCAGGCTA ACACAGTCAG AAATCGAAAC 3400
GTACTCTCAA CAGTTCGCTT AGGCATGGAA GTTTTGCGGC ATTCTGGCTA CACAATAACA AGGGAAGACT TACTCGTGGC TGCAACCCTA CTAGCTCAAA 3500
ATTTATTCAC ACATGGTTAC GCTTTGGGGA AATTATGAGG GGATCTCTCA GTGCATTGCC TCCAATTCCC ATAATTTATT ACGCCGATAA TAACTTGGTG 3600
TAACCTTAAA AATGTACTTA AATCGACGTG TAAAAGATTG TTGGGAATCA AATTGATATT TTAATGCGAT CTCAAGGATA GTTTTTTTCG TCAACCTCAA 3700
CTCAACAGCC GCTTTCGTCA AACGACGAGC ACGAATATAG CTAGCCAGTG TGACCCCTGT TACTTTTTTG AACAGCCGCT GAAAATACCA CTTGGTATAA 3800
CCCGCTTTAT TCGCCACATC ATCAAGCAGT AAAGACTGAT CTAAATTATG TTCAATCCAT AATAGAACAT CTTTGATAAC CGTTGTATGA AACTGCTTAT 3900
CATCATATCT TAATTGGATG TTATTAGCTT TATTTTGATA GCGAGAATGC TGTTCAATAT ACATAAAATA ACCTAAATGT TCTTAAGATT GTCACGACCA 4000
CATCATCATG ATACCATAAA CATACTGACG GTATGTTATT TTAAATCTAT CATGGAAAAT AAAAATCATC AACAAGAAAA TTTTAAGAGT ACCTATCAAT 4100
CACTGGTTAA CTCAGCACGA ATATTGTTTG TTGAAAAAGG CTATCAAGCT GTTTCAATAG ATGAGATCTC GGGAAAAGCG TTGGTGACCA AAGGTGCCTT 4200
TTATCATCAC TTTAAAAATA AAAAACAATT ACTCAGTGCC TGTTATAAGC AGCAATTAAT TATGATTGAT GCCTACATCA CAACAAAAAC TGATTTAACA 4300
AATGGTTGGT CTGCCTTAGA AAGTATATTT GAACATTATC TTGATTATAT TATTGATAAT AATAAAAACC TTATCCCTAT CCAAGAAGTG ATGCCTATCA 4400
TTGGTTGGAA TGAACTTGAA AAAATTAGCC TTGAATACAT TACTGGTAAG GTAAACGCCA TTGTCAGCAA ATTGATCCAA GAGAACCAAC TTAAAGCTTA 4500
TGATGATGAT GTGCTTAAAA ACTTACTCAA TGGCTGGTTT ATGCATATCG CAATACATGC GAAAAACCTA AAAGAGCTTG CCGATAAAAA AGGCCAATTT 4600
ATTGCTATTT ACCGCGGCTT TTTATTGAGC TTGAAAGATA AATAAAATAG ATAGGTTTTA TTTGAAGCTA AATCTTCTTT ATCGTAAAAA ATGCCCTCTT 4700
GGGTTATCAA GAGGGTCATT ATATTTCGCG GAATAACATC ATTTGGTGAC GAAATAACTA AGCACTTGTC TCCTGTTTAC TCCCCTGAGC TTGAGGGGTT 4800
AACATGAAGG TCATCGATAG CAGGATAATA ATACAGTAAA ACGCTAAACC AATAATCCAA ATCCAGCCAT CCCAAATTGG TAGTGAATGA TTATAAATAA 4900
CAGCAAACAG TAATGGGCCA ATAACACCGG TTGCATTGGT AAGGCTCACC AATAATCCCT GTAAAGCACC TTGCTGATGA CTCTTTGTTT GGATAGACAT 5000
CACTCCCTGT AATGCAGGTA AAGCGATCCC ACCACCAGCC AATAAAATTA AAACAGGGAA AACTAACCAA CCTTCAGATA TAAACGCTAA AAAGGCAAAT 5100
GCACTACTAT CTGCAATAAA TCCGAGCAGT ACTGCCGTTT TTTCGCCCCA TTTAGTGGCT ATTCTTCCTG CCACAAAGGC TTGGAATACT GAGTGTAAAA 5200
GACCAAGACC CGCTAATGAA AAGCCAACCA TCATGCTATT CCATCCAAAA CGATTTTCGG TAAATAGCAC CCACACCGTT GCGGGAATTT GGCCTATCAA 5300
TTGCGCTGAA AAATAAATAA TCAACAAAAT GGGCATCGTT TTAAATAAAG TGATGTATAC CGAATTCGAT TGCGTCTCAA CCCCTACTTC GGTATCTGTA 5400
TTATCACGTG TATTTTTGGT TTCACGGAAC CAAAACATAA CCACAAGGAA AGTGACAATA TTTAGCAACG CAGCGATAAA AAAGGGACTA TGCGGTGAAA 5500
TCTCTCCTGC AAAACCACCA ATAATAGGCC CCGCTATTAA ACCAAGCCCA AAACTTGCCC CTAACCAACC GAACCACTTC ACGCGTTGAG AAGCTGAGGT 5600
GGTATCGGCA ATGACCGATG CCGCGACAGC CCCAGTAGCT CCTGTGATCC CTGAAAGCAA ACGGCCTAAA TACAGCATCC AAAGCGCACT TGAAAAAGCC 5700
AGCAATAAGT AATCCAGCGA TGCGCCTATT AATGACAACA ACAGCACTGG GCGCCGACCA AATCGGTCAG ACATTTTTCC AAGCCAAGGA GCAAAGATAA 5800
CCTGCATTAA CGCATAAAGT GCAAGCAATA CGCCAAAGTG GTTAGCGATA TCTTCCGAAG CAATAAATTC ACGTAATAAC GTTGGCAAGA CTGGCATGAT 5900
AAGGCCAATC CCCATGGCAT CGAGTAACGT AATTACCAAT GCGATCTTTG TCGAACTATT CATTTCACTT TTCTCTATCA CTGATAGGGA GTGGTAAAAT 6000
AACTCTATCA ATGATAGAGT GTCAACAAAA ATTAGGAATT AATGATGTCT AGATTAGATA AAAGTAAAGT GATTAACAGC GCATTAGAGC TGCTTAATGA 6100
GGTCGGAATC GAAGGTTTAA CAACCCGTAA ACTCGCCCAG AAGCTAGGTG TAGAGCAGCC TACATTGTAT TGGCATGTAA AAAATAAGCG GGCTTTGCTC 6200
GACGCCTTAG CCATTGAGAT GTTAGATAGG CACCATACTC ACTTTTGCCC TTTAGAAGGG GAAAGCTGGC AAGATTTTTT ACGTAATAAC GCTAAAAGTT 6300
TTAGATGTGC TTTACTAAGT CATCGCGATG GAGCAAAAGT ACATTTAGGT ACACGGCCTA CAGAAAAACA GTATGAAACT CTCGAAAATC AATTAGCCTT 6400
TTTATGCCAA CAAGGTTTTT CACTAGAGAA TGCATTATAT GCACTCAGCG CTGTGGGGCA TTTTACTTTA GGTTGCGTAT TGGAAGATCA AGAGCATCAA 6500
GTCGCTAAAG AAGAAAGGGA AACACCTACT ACTGATAGTA TGCCGCCATT ATTACGACAA GCTATCGAAT TATTTGATCA CCAAGGTGCA GAGCCAGCCT 6600
TCTTATTCGG CCTTGAATTG ATCATATGCG GATTAGAAAA ACAACTTAAA TGTGAAAGTG GGTCTTAAAA GCAGCATAAC CTTTTTCCGT GATGGTAACT 6700
TCACGGTAAC CAAGATGTCG AGTTAACCAC CCTTTAGATT CATAAAGCGA AAATAATGCG GCTCCAACGT ACCCACCTAA ATGGAAACGG CGTTCACTCC 6800
AATCTAAACA CGCACAACAG ATTTTACGTG AATGTTTGGA AGGAACGTCA ATTCCCATTT CATGAAAATA TTGAATACCA CTTAATGTGA TCATTGAACC 6900
ATTTTCAGTG ATCCATTGCT GTTGACAAAG GGAATCATAG ATCTTAACGG CAACTTCGCC AGCTAAATGA TCATAGCAAG TACGTGCTTT TCGTAAATGC 7000
ACTGGCGTGG AAACTTTGGC ATGTACGCCA TGGTTTAAGG AGATCCCCAT CATACTTTCC ATCAATTCAG CAATATCTTT TCCTGCTAGC CGAAAATAAC 7100
GATGCTTGCC TTGAGCTACT ACTGTGATTA GCTGGCAATC TAATAATTTA GATAAATGAC TGCTCGCCGT TGAAGCTGAT ATATTCGCCA CAGAACTTAG 7200
CTCAGTGGCC GTCCAAGCTC GCCCATCCAT CAAAGCACTG AGTATTTTAA CTCGTGAAAT GTCAGACATA GCCGCCCCTA TCGCGGCTAT TGAGGACTCA 7300
AAGGTAACCT CTTTTCGTAT TAAATTAGCC ATCGCAAGTT CACTTTATTG CCCAAGGGAG CGTAACAGAT GCAGCCATAC TATCATTGTC CGTTATTAAT 7400
ATCAGTTGGT TAGCATGGTC ACTGTATTGC ACTAAAATAT TAATGTTATT CTCCGCCAAT ACTCGTGCTA TTTCGCCAAG TTCCCCCGGT TTTTCCTGTT 7500
TTAACTTACG AATTAATGGT GTCCGGATCG CAAGTACTAA CAGTCCAGCT TGCTCTAGCG CTATTTTAGC TTCCTTTCCT TGTTCAACAA GAAAATGAGC 7600
ATGGCATTCA TCACCAACCG TAAATATCCC TCCCCCTTCC AATCCAATAC CTTTATTACC TAATGTTTTT CCTAGTAATG CTAATTGTCC TATTTGATTA 7700
TCTAAAACAA CGTGAACATC AAACATTATC CTTGTCCTTT TTAAATGAAT ATTCGCGAGT AATATGAGCA ATACTAATTT TGTAAAAATC AAATATTGAC 7800
TCTCGCCCTT CTTGTTGAGC TTTCGCATGT AAAACATGAT TTTTCCATTG CAGAACTGCG TATTCGTTTT CCCACCAAGA TAGCGATAAC ATTTTTCCTT 7900
CTGTAGCTAG ACTTTGAAAA CGTTCAATTG AAATAAAACC AGCTACATGA CTTAATAGTG GTCTTAACTC CTCAGCTAAA GTCAAATAGC GAGTTTGTTG 8000
GTCGGGTTGT ATTTGCACCT CAAATATTAC AGCAATCATA TTTTTCTCCG TTATTAATCA ACAAACAGAG TAATCCACCC GAAAATTAAC ACTTCGATAA 8100
GCAACGAAAT ATAGAAAGGT CATGCTGCTG TTTTATGTTG TATATCACAC TTCTTTAAAT TGCTCGGCCT GCGCTTATCA TGTTCTCCTT CTCTTATAAT 8200
TTTGTTTATT ATTTATACTT TTTTATTTTT TGCTATCACC GAAAATAGTG CGGATCCCGC ATGGTATTTA GGTTTACCCA TCATTAAATA CCATTGTTGT 8300
TATTTTAATC TTTTTAATAT TTCATGATTA AGATCACACT AAAATTATGA CCCTATTAAC TAAAAGTTAT GGATACCCTA ATAAAACTCA TTGTTAAAGA 8400
TTAGTTAACC AAAGTAACTT CATTTTTCAC TATTGTTAAA TTGTTATAGT GAAAATTCTG GTAAACTAAT AGAGGTAAAA AAATGATCCT AGATGCCAGT 8500
TATACATTAT TAGTCGCATG TATCGCGCTA CTCATAGGAA TGTTTGTCGT AAAATTTACC CCGTTCCTAC AAAAAAACCA CATACCAGAA GCCGTCGTTG 8600
GTGGCTTTAT TGTTGCAATT GTTCTGTTAA TTATTGATAA AACATCAGGT TATTCGTTTA CTTTTGATGC TTCATTGCAA AGTTTATTAA TGCTCACATT 8700
CTTTTCCTCT ATCGGGCTAA GTTCTGACTT TTCTCGACTG ATTAAAGGAG GAAAGCCGTT AGTTCTATTA ACTATTGCAG TAACGATCCT AATCGCCATT 8800
CAAAATACTG TCGGCATGAG TATGGCTGTC ATGATGAATG AAAGTCCATT TATTGGCTTA ATTGCAGGTT CAATTACTCT AACAGGTGGT CATGGTAATG 8900
CCGGAGCATG GGGCCCTATT CTCGCTGATA AATATGGTGT AACAGGCGCC GTTGAATTAG CGATGGCTTG TGCAACACTT GGATTAGTGT TGGGTGGTTT 9000
AGTTGGCGGC CCTGTTGCCC GTCATCTTTT GAAAAAGGTC TCTATTCCTA AAACAACCGA GCAAGAGCGC GACACTATCG TTGAAGCTTT TGAGCAACCA 9100
AGCGTCAAAA GAAAAATCAA TGCAAATAAC GTTATTGAAA CCATTTCAAT GCTGATTATC TGTATTGTTG TTGGTGGCTA TATCAGTGCA TTGTTTAAAG 9200
ATACTTTTCT GCAACTGCCT ACTTTTGTCT GGTGTTTATT TGTCGGTATT ATTATCCGTA ATACACTGAC TCATGTATTT AAACACGAAG TGTTTGAGCC 9300
GACCGTCGAT GTATTAGGTA GCGTTGCTTT ATCGCTTTTC TTGGCAATGG CGTTAATGTC ATTAAAATTT GGTCAATTGG CAAGCATGGC AGGGCCAGTA 9400
TTAATTATCA TTGCTGTACA AACTGTTGTC ATGGTGCTAT TTGCCTGCTT TGTCACCTTC AAAATGATGG GCAAAGATTA TGATGCTGTC GTGATCAGCG 9500
CGGGTCACTG TGGCTTTGGT ATGGGAGCAA CACCAACAGC AATTGCGAAT ATGCAAACAG TCACAAAAGC ATTTGGACCA TCACATAAAG CTTTCCTTGT 9600
CGTTCCTATG GTCGGTGCCT TTATTGTTGA TATTTCAAAC AGTATCTTAA TTAAAATATT TATTGAAATT GGTACATACT TTACCTAATA GCTACACACT 9700
ACTCGGCGTA CTTCTCCTAA CAATTTATTA TAGGGCAGTA TGCCGAGCCG CTTTCTTACC CCGTATATTC CCCTAGTCAA TCTATTTAGC TTGCACGTTT 9800
ACGAATTAAT TCTCGTGACA ACTAAACCAA AATTTGAATT ATGACGAGTA TATCTGATTA TTTATACTTT ATTCGTTAGA ATAGTCCGTA GAAATAAATA 9900
ATCAGTAGGC TACATGGAAT GAAATTAAGA CATCTCGATA TCTTTTATGC TGTAATGACT TGTGGTTCCT TAACGCGTGC GGCTGAAGTT TTACATATTT 10000
CTCAACCCGC GGCGAGTAAA GCACTCAAAC ATGCTGAGCA CTGAGAGATC CCCTCATAAT TTCCCCAAAA CGTAACCATG TGTGAATAGA TTTTGAGTAA 10100
GCAGGGTTGC AGCCACGAGT GAGTCTTCCC TTGTTATTGT GTAGCCAGAA TGCCGCAAAA CTTCCATGCC TAAGCGAACT GTTGAGAGTA CGTTTCGATT 10200
TCTGACTGTG TTAGCCTGGA AGTGCTTGTC CCAACCTTGT TTCTGAGCAT GAACGCCCGC AAGCCAACAT GTTAGTTGAA GCATCAGGGC GATTAGCAGC 10300
ATGATATCAA AACGCTCTGA GCTGCTCGTT CGGCTATGGC GTAGGCCTAG TCCGTAGGCA GGACTTTTCA AGTCTCGGAA GGTTTCTTCA ATCTGCATTC 10400
GCTTCGAATA GATATTAACA AGTTGTTTGG GTGTTCGAAT TTCAACAGGT AAGTTAGTTG CTAGAATCCA TGGCTCCTTT GCCGACGCTG AGTAGATTTT 10500
AGGTGACGGG TGGTGACAAT GAGTCCGTGT CGAGCGCTGA TTTTTTCGGC CTTTAGAGCG AGATTTATAC AATAGAATTT GGCATGAGAT TGGATTGCTT 10600
TTAGTCAGCC TCTTATAGCC TAAAGTCTTT GAGTGACTAG ATGACATATC ATGTAAGTTG CTGATAGGTT TCCAGTTTTC CGCTCCTAGG TCTGCATATT 10700
GTACTTTTCC TCTTACTCGA CTTAACCAGT ACCAACCCAG CTTCTCAACG GATTTATACC ATGGCACTTT AAAGCCAGCA TCACTGACAA TGAGCGGTGT 10800
GGTGTTACTC GGTAGAATGC TCGCAAGGTC GGCTAGAAAT TGGTCATGAG CTTTCTTTGA ACATTGCTCT GAAAGCGGGA ACGCTTTCTC ATAAAGAGTA 10900
ACAGAACGAC CGTGTAGTGC GACTGAAGCT CGCAATACCA TAAGCCGTTT TTGCTCACGG ATATCAGACC AGTCAACAAG TACAATGGGC ATCGTATTGC 11000
CCGAACAGAT AAAGCTAGCA TGCCAACGGT ATACAGCGAG TCGCTCTTTG TGGAGGTGAC GATTACCTAA CAATCGGTCG ATTCGTTTGA TGTTATGTTT 11100
TGTTCTCGCT TTGGTTGGCA GGTTACGGCC AAGTTCGGTA AGAGTGAGAG TTTTACAGTC AAGTAAGGCG TGGCAAGCCA ACGTTAAGCT GTTGAGTCGT 11200
TTTAAGTGTA ATTCGGGGCA GAATTGGTAA AGAGAGTCGT GTAAAATATC GAGTTCGCAC ATTTTGTTGT CTGATTATTG ATTTTTGGCG AAACCATTTG 11300
ATCATATGAC AAGATGTGTA TCTACCTTAA CTTAATGATT TTGATAAAAA TCATTAGGGG ATTCATCAGC ACTCGGCTGG AAATCGCCGG TGGCATTTGA 11400
GAAAAAAGCC GCTTAAAATG AGAGATAGAC CGGAACACAA CCGGTGCAAG ACCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
orfA_p IS1133::Tn10::IS903B 65-286 Accessory Gene Regulator +
tnp IS903B 328-1251 Transposase   -
orfB_p IS1133::Tn10::IS903B 1466-2260 Transposase   +
tnp IS10R 2330-3538 Transposase   +
tetD Tn10 3548-3964 Passenger Gene Antibiotic Resistance -
tetC Tn10 4052-4645 Passenger Gene Antibiotic Resistance +
tet(B) (ARO:3000166) Tn10 4758-5963 Passenger Gene Antibiotic Resistance -
tetR (ARO:3003479) Tn10 6042-6668 Passenger Gene Antibiotic Resistance +
jemC Tn10 6646-7332 Passenger Gene Other -
CPT07_26605 Tn10 7340-7726 Passenger Gene Hypothetical -
jemB Tn10 7719-8039 Passenger Gene Other -
jemA Tn10 8483-9688 Passenger Gene Other +
tnp IS10L 10054-11262 Transposase   -

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
orfA_p N IS1133::Tn10::IS903B 222 65-286 +
Class:   Accessory Gene
Sub Class:   Regulator
Function:   orfB-5'
Comment:   orfA, together with orfB, encodes a fusion protein formed by translational frame-shifting that is the transposase for IS1133
Protein Sequence:  
MSLKHSDEFK RDAVRIALTS GLTRRQVASD LSIGLSTLGK WIASISDETK IPTQDTDLLR ENERLRKENG FVE

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS903B 924 328-1251 -
Class:   Transposase
Function:   transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
VAKQKFKITN WPTYNKALIN RGSITFWLDD EAIQAWYESA TPSSRGRPQR YSDLAITTVL VIKRVFRLTL RAAQGFIDSI FSLMNVPLRC PDYSCVSRRA
KSVNVSFKTP TRGEIAHLVI DSTGLKVFGE GEWKVKKHGQ ERRRIWRKLH LAVDSKTHEI ICADLSLNNV TDSEAFPGLI RQTHRKIRSA AADGAYDTRL
CHDELRRKKI SALIPPRKGA GYWPGEYADR NRAVANQRMT GSNARWKWTT DYNRRSIAET AMYRVKQLFG GSLTLRDYDG QVAEAMALVR ALNKMTKAGM
PESVRIA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
orfB_p N IS1133::Tn10::IS903B 795 1466-2260 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   Intact orfB, together with orfA, encodes a fusion protein formed by translational frame-shifting that is the transposase for IS1133
Protein Sequence:  
MGVTDRGLRA WKRRPPSLRQ RRDLILLAHI REQHRLCLGS YGRPRMTEEL KALGLQVGQR RVGRLMRQNN ITVVRTRKFK RTTDSHHTFN IAPNLLKQDF
SASAPNQKWA GDITYVWTRE GWVYLAVILD LYSRRVIGWA TGDRLKQDLA LRALNMALAL RKPPPGCIQH TDRGSQYCAH EYQKLLLKHQ LLPSMSGKGN
CFDNSAVESF FKSLKAELIW RRHWQTRRDI EIAIFEYING FYNPRRRHST LGSDESPNDF GKNH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS10R 1209 2330-3538 +
Class:   Transposase
Function:   transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MCELDILHDS LYQFCPELHL KRLNSLTLAC HALLDCKTLT LTELGRNLPT KARTKHNIKR IDRLLGNRHL HKERLAVYRW HASFICSGNT MPIVLVDWSD
IREQKRLMVL RASVALHGRS VTLYEKAFPL SEQCSKKAHD QFLADLASIL PSNTTPLIVS DAGFKVPWYK SVEKLGWYWL SRVRGKVQYA DLGAENWKPI
SNLHDMSSSH SKTLGYKRLT KSNPISCQIL LYKSRSKGRK NQRSTRTHCH HPSPKIYSAS AKEPWVLATN LPVEIRTPKQ LVNIYSKRMQ IEETFRDLKS
PAYGLGLRHS RTSSSERFDI MLLIALMLQL TCWLAGVHAQ KQGWDKHFQA NTVRNRNVLS TVRLGMEVLR HSGYTITRED LLVAATLLAQ NLFTHGYALG
KL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tetD TetD Tn10 417 3548-3964 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   tetracycline antibiotic (ARO:3000050)
Comment:   no match in CARD
Protein Sequence:  
MYIEQHSRYQ NKANNIQLRY DDKQFHTTVI KDVLLWIEHN LDQSLLLDDV ANKAGYTKWY FQRLFKKVTG VTLASYIRAR RLTKAAVELR LTKKTILEIA
LKYQFDSQQS FTRRFKYIFK VTPSYYRRNK LWELEAMH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tetC TetC Tn10 594 4052-4645 +
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   DNA binding (GO:0003677)
Target:   tetracycline antibiotic (ARO:3000050)
Sequence Family:  Bacterial regulatory proteins, tetR family (Pfam:PF00440)
Comment:   no match in CARD
Protein Sequence:  
MENKNHQQEN FKSTYQSLVN SARILFVEKG YQAVSIDEIS GKALVTKGAF YHHFKNKKQL LSACYKQQLI MIDAYITTKT DLTNGWSALE SIFEHYLDYI
IDNNKNLIPI QEVMPIIGWN ELEKISLEYI TGKVNAIVSK LIQENQLKAY DDDVLKNLLN GWFMHIAIHA KNLKELADKK GQFIAIYRGF LLSLKDK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tet(B) (ARO:3000166) Tet(B) Tn10 1206 4758-5963 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Transpoase Chemistry:   antibiotic efflux (ARO:0010000)
Target:   tetracycline antibiotic (ARO:3000050)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   strict match to reference sequence for ARO:3000166 (bitscore:786)||Synonym: tetB
Protein Sequence:  
MNSSTKIALV ITLLDAMGIG LIMPVLPTLL REFIASEDIA NHFGVLLALY ALMQVIFAPW LGKMSDRFGR RPVLLLSLIG ASLDYLLLAF SSALWMLYLG
RLLSGITGAT GAVAASVIAD TTSASQRVKW FGWLGASFGL GLIAGPIIGG FAGEISPHSP FFIAALLNIV TFLVVMFWFR ETKNTRDNTD TEVGVETQSN
SVYITLFKTM PILLIIYFSA QLIGQIPATV WVLFTENRFG WNSMMVGFSL AGLGLLHSVF QAFVAGRIAT KWGEKTAVLL GFIADSSAFA FLAFISEGWL
VFPVLILLAG GGIALPALQG VMSIQTKSHQ QGALQGLLVS LTNATGVIGP LLFAVIYNHS LPIWDGWIWI IGLAFYCIII LLSMTFMLTP QAQGSKQETS
A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tetR (ARO:3003479) TetR Tn10 627 6042-6668 +
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target alteration (ARO:0001001); antibiotic efflux (ARO:0010000)
Transpoase Chemistry:   repressor of the tetracycline resistance element
Target:   tetracycline antibiotic (ARO:3000050)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   perfect match to reference sequence for ARO:3003479
Protein Sequence:  
MMSRLDKSKV INSALELLNE VGIEGLTTRK LAQKLGVEQP TLYWHVKNKR ALLDALAIEM LDRHHTHFCP LEGESWQDFL RNNAKSFRCA LLSHRDGAKV
HLGTRPTEKQ YETLENQLAF LCQQGFSLEN ALYALSAVGH FTLGCVLEDQ EHQVAKEERE TPTTDSMPPL LRQAIELFDH QGAEPAFLFG LELIICGLEK
QLKCESGS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
jemC JemC Tn10 687 6646-7332 -
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  HTH ArsR-type DNA-binding domain (InterPro:IPR001845)
Comment:   N-terminal 100 amino acids similar to bacterial transcriptional repressors and metal-binding proteins (PMID:10781570)
Protein Sequence:  
MANLIRKEVT FESSIAAIGA AMSDISRVKI LSALMDGRAW TATELSSVAN ISASTASSHL SKLLDCQLIT VVAQGKHRYF RLAGKDIAEL MESMMGISLN
HGVHAKVSTP VHLRKARTCY DHLAGEVAVK IYDSLCQQQW ITENGSMITL SGIQYFHEMG IDVPSKHSRK ICCACLDWSE RRFHLGGYVG AALFSLYESK
GWLTRHLGYR EVTITEKGYA AFKTHFHI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
CPT07_26605 CPT07_26605 Tn10 387 7340-7726 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Comment:   Amino acid-binding protein
Protein Sequence:  
MFDVHVVLDN QIGQLALLGK TLGNKGIGLE GGGIFTVGDE CHAHFLVEQG KEAKIALEQA GLLVLAIRTP LIRKLKQEKP GELGEIARVL AENNINILVQ
YSDHANQLIL ITDNDSMAAS VTLPWAIK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
jemB JemB Tn10 321 7719-8039 -
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  Antibiotic biosynthesis monooxygenase (Pfam:PF03992)
Protein Sequence:  
MIAVIFEVQI QPDQQTRYLT LAEELRPLLS HVAGFISIER FQSLATEGKM LSLSWWENEY AVLQWKNHVL HAKAQQEGRE SIFDFYKISI AHITREYSFK
KDKDNV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
jemA JemA Tn10 1206 8483-9688 +
Class:   Passenger Gene
Sub Class:   Other
Function:   glutamate:sodium symporter activity (GO:0015501)
Comment:   sodium-dependent glutamate permease
Protein Sequence:  
MILDASYTLL VACIALLIGM FVVKFTPFLQ KNHIPEAVVG GFIVAIVLLI IDKTSGYSFT FDASLQSLLM LTFFSSIGLS SDFSRLIKGG KPLVLLTIAV
TILIAIQNTV GMSMAVMMNE SPFIGLIAGS ITLTGGHGNA GAWGPILADK YGVTGAVELA MACATLGLVL GGLVGGPVAR HLLKKVSIPK TTEQERDTIV
EAFEQPSVKR KINANNVIET ISMLIICIVV GGYISALFKD TFLQLPTFVW CLFVGIIIRN TLTHVFKHEV FEPTVDVLGS VALSLFLAMA LMSLKFGQLA
SMAGPVLIII AVQTVVMVLF ACFVTFKMMG KDYDAVVISA GHCGFGMGAT PTAIANMQTV TKAFGPSHKA FLVVPMVGAF IVDISNSILI KIFIEIGTYF
T

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS10L 1209 10054-11262 -
Class:   Transposase
Function:   transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MCELDILHDS LYQFCPELHL KRLNSLTLAC HALLDCKTLT LTELGRNLPT KARTKHNIKR IDRLLGNRHL HKERLAVYRW HASFICSGNT MPIVLVDWSD
IREQKRLMVL RASVALHGRS VTLYEKAFPL SEQCSKKAHD QFLADLASIL PSNTTPLIVS DAGFKVPWYK SVEKLGWYWL SRVRGKVQYA DLGAENWKPI
SNLHDMSSSH SKTLGYKRLT KSNPISCQIL LYKSRSKGRK NQRSTRTHCH HPSPKIYSAS AKEPWILATN LPVEIRTPKQ LVNIYSKRMQ IEETFRDLKS
PAYGLGLRHS RTSSSERFDI MLLIALMLQL TCWLAGVHAQ KQGWDKHFQA NTVRNRNVLS TVRLGMEVLR HSGYTITRED SLVAATLLTQ NLFTHGYVLG
KL

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS903B-CP000602.1 IS903B Insertion Sequence 272-1328 1057
Tn10-AF162223 Tn10 Transposon 2223-11369 9147
IS10R-AF162223 IS10R Insertion Sequence 2223-3551 1329
IS10L-AF162223 IS10L Insertion Sequence 10041-11369 1329

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRR IS903B 272-289 GGCTTTGTTG AATAAATC
IRL IS903B 1311-1328 CTAAATAAGT TGTTTCGG
IRL IS10R 2223-2239 CTGATGAATC CCCTAAT
IRR IS10R 3530-3551 TTTAATACTC CCCTAGAGAG TC
IRR IS10L 10041-10062 CTGAGAGATC CCCTCATAAT TT
IRL IS10L 11353-11369 TAATCCCCTA AGTAGTC

 References     

Welch TJ, Fricke WF, McDermott PF, White DG, Rosso ML, Rasko DA, Mammel MK, Eppinger M, Rosovitz MJ, Wagner D, Rahalison L, Leclerc JE, Hinshaw JM, Lindler LE, Cebula TA, Carniel E, Ravel J. Multiple antimicrobial resistance in plague: an emerging public health risk. PLoS One. 2007 Mar 21;2(3):e309. doi: 10.1371/journal.pone.0000309. PubMed ID: 17375195