|
|
|
|
Name: In36 |
|
Family: Tn402 Group: Class 1 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Escherichia coli | Molecular Source: | plasmid pHSH1 |
Place of Origin: | Shanghai, China | Date of Isolation: | 2003 |
| | Other Geographic Information: | teaching hospital March 2000 and March 2001 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GAGGCTCTGT TGCAAAGATT GGCGGCAGTC AGAGGTAGGC TGTCGCTCTG 200
CGCCGATCAG GCGGCTGCTG CGAAATGGTG GTTGAGCATG CCCATGGCCT CCGTCAGCGC CGAGGGCCCA ATGCCAAAAG CTCTCTCCAC AAGGCGCACC 300
TCGCCCCTGA TGCCGGGCTG CAGGCACCAG GGGCGAGCCT GTCCTTTGCG CAGGGCTCGC ATGACTTCGA ATCCCTTGAT CGTGGCATAG GCCGTGGGGA 400
TCGATTTGAA ACCGCGCACC GGCTTGATCA GTATCTTGAG CTTTCCGTGA TCGGCCTCGA TCACGTTATT GAGATACTTC ACCTGCCGGT GGGCCGTCTC 500
CCGGTCCAGC TTTCCTTCGC GCTTCAATTC GGTGATCGCT GCACCATAGC TCGGCGCTTT GTCGGTATTG AGCGTGGCAG GCTTTTCCCA GTGCTTCAGG 600
CCTCGCAGGG CCTTGCCCAG GAACCGCTTC GCTGCCTTGG CGCTGCGGGT CGGCGACAGG TAGAAATCGA TCGTGTCGCC CCGCTTGTCG ACTGCCCGGT 700
ACAGGTAGGT CCACTTGCCC CGCACCTTGA CGTAGGTTTC ATCCAGGCGC CAGCTCGGAT CAAAGCCACG CCGCCAGAAC CAGCGCAGCC GCTTCTCCAT 800
CTCCGGGGCG TAGCACTGGA CCCAGCGATA GATCGTCGTA TGGTCGACCG AAATGCCGCG TTCCGCCAGC ATTTCCTCAA GGTCGCGATA GCTGATCGGA 900
TAGCGACAAT ACCAGCGCAC CGCCCACAGG ATCACATCAC CCTGGAAATG GCGCCACTTG AAATCCGTCA TCGTTCCGTC CGTCCAATCT CCGCCAAGCA 1000
TGCTCAAGCT TCACGATTTT TGCAACAGAG CCCACACGAG TATTGAGCAT AGTCGAGATT GGTGCAGATC ACTTCTGATA TTGAACTGTC AGGAGCTGGC 1100
TGCACAACAG CCATTACGCC CAATCAACTG GTGCAGTCGT CTTCTGAAAA TGACATTTGG TATCTCTCAT AAACGGATGT TTTTGAGAGA ACTATCTTCG 1200
GCCTTCACAC GCACGAAAGG CGGCGAAGCT CCGCCGTTAA TCCGTCCGCC GGAGATCTCG CCCAGGCAGG CTGAAGGCCG AGCAAGCCTG ACAGGCCCGA 1300
AAAGCCCGGC ACGGGCGTCG GCGGCGATGA CGGCGGCGGC ATTATCCAGG GTTGATGATG GAAGTGGAGG ATATCGACAA CCTCTCGCGC AACCAAGACA 1400
TCGCGGTCGG ACTGCAAGTG ATCTTGAAGC CACGGGCCCG TCCCACCCCG ACATGGACCT CGATGCCCGA ACGGACGTTA GATTTCGAGT TCTAGGCGTT 1500
CTGCGATGAA GGTTGGATCC CAGCCGGGAT TGAAAGTGTC GACGTGGGTG AATCCGAGCC GCTCGTATAG GCCACGCAGG TTCGGGTGGC AGTCGAGCCG 1600
CAGCTTGGCG CACCCCTGCG TTCGCGCGGC ATGGCGGCAA GCCTCGATCA GCGCGGAGCT GACACCCCGG CCCGCATGTG TCCGTCGCAC CGCGAGCTTG 1700
TGCAGATATG CGGCCTCCCC CTTGAGGGCG TCGGGCCAGA ACTCGGGATC CTCGGCCGAC AAGGTGCAAC AGCCGACGAT GCCGTCGCTG CAACTCGCGA 1800
CTAGGAGCTC GGATCTCAGG ACGAAGGTCT CCGCGAATGT CCGGTCGATC CGCGCGACGT CCCAGGCGGG CGTTCCCTTG GCGGACATCC ACGCCGCAGC 1900
GTCGTGCATC AGCCGCACAA CCTCGTCGAT ATCACCCGAG CAGGCGACCC GAACGTTCGG AGGCTCCTCG CTGTCCATTC GCTCCCCTGG CGCGGTATGA 2000
ACCGCCGCCT CATAGTGCAG TTTGATCCTG ACGAGCCCAG CATGTCTGCG CCCACCTTCG CGCAACCCAA AAGGAGATAT TTAGCGGGCG GCCGGAAGGT 2100
GAATGCTAGG CATGATCTAA CCCTCGGTCT CTGGCGTCGC GACTGCGAAA TTTCGCGAGG GTTTCCGAGA AGGTGATTGC GCTTCGCAGA TCTCCAGGCG 2200
CGTGGGTGCG GACGTAGTCA GCGCCATTGC CGATCGCGTG AAGTTCCGCC GCAAGGCTCG CTGGACCCAG ATCCTTTACA GGAAGGCCAA CGGTGGCGCC 2300
CAAGAAGGAT TTCCGCGACA CCGAGACCAA TAGCGGAAGC CCCAACGCCG ACTTCAGCTT TTGAAGGTTC GACAGCACGT GCAGCGATGT TTCCGGTGCG 2400
GGGCTCAAGA AAAATCCCAT CCCCGGATCG AGGATGAGCC GGTCGGCAGC GACCCCGCTC CGTCGCAAGG CGGAAACCCG CGCCTCGAAG AACCGCACAA 2500
TCTCGTCGAG CGCGTCTTCG GGTCGAAGGT GACCGGTGCG GGTGGCGATG CCATCCCGCT GCGCTGAGTG CATAACCACC AGCCTGCAGT CCGCCTCAGC 2600
AATATCGGGA TAGAGCGCAG GGTCAGGAAA TCCTTGGATA TCGTTCAGGT AGCCCACGCC GCGCTTGAGC GCATAGCGCT GGGTTTCCGG TTGGAAGCTG 2700
TCGATTGAAA CACGGTGCAT CTGATCGGAC AGGGCGTCTA AGAGCGGCGC AATACGTCTG ATCTCATCGG CCGGCGATAC AGGCCTCGCG TCCGGATGGC 2800
TGGCGGCCGG TCCGACATCC ACGACGTCTG ATCCGACTCG CAGCATTTCG ATCGCCGCGG TGACAGCGCC GGCGGGGTCT AGCCGCCGGC TCTCATCGAA 2900
GAAGGAGTCC TCGGTGAGAT TCAGAATGCC GAACACCGTC ACCATGGCGT CGGCCTCCGC AGCGACTTCC ACGATGGGGA TCGGGCGAGC AAAAAGGCAG 3000
CAATTATGAG CCCCATACCT ACAAAGCCCC ACGCATCAAG CTTTTGCCCA TGAAGCAACC AGGCAATGGC TGTAATTATG ACGACGCCGA GTCCCGACCA 3100
GACTGCATAA GCAACACCGA CAGGGATGGA TTTCAGAACC AGAGAAAGAA AATAAAATGC GATGCCATAA CCGATTATGA CAACGGCGGA AGGGGCAAGC 3200
TTAGTAAAGC CCTCGGTGCG GCGGACATTA TCCGCGAACA GGCTGAACAG CACGGCATTG CGCGGGTAAC GGATGTCTGG CTGGAAGTCG GCGCACTGGC 3300
GGATGTTGAG GAGAGTGCAC TGCATTTCTG TTTTGATATC GCCTGCCGTG ATACCGTGGC GCAGGGCTGC ACACTGCATA TTGATGTTAT CCCGGCACAG 3400
GCATGGTGCT GGGATTGCAG CCGTGAGGCC GAAATCATGC AGCACGCCGG ATGCTGTCCG CACTGCGGCA GTGAACGGCT GCGCATCAGT GAAGGTGATG 3500
ATTTGCGGGT AAAAAGCCTG GAAGGTGAGT GAGTTTTACG CCGCCGCCGT ATTCAGCAGC CAGCGGGCGA ATTGCTGCAT GGCCGGGGTT TCCGTACGGG 3600
ACTGTAACCG CGTCAGCCAG TAGCCGCCGA GGGTGATTTC TGCGGCAAAC GGCTGTACCA GTGCGCCTGA CTGTAACAGG CGGCTGAACA TACATACCGG 3700
TGCGATCGCT ACCCCGGCAC CCAGTTGTGC CGCCTCGGCC ATGGCCAGTG AGGTATCGAA CACCATTACC GGCTGTGACG GGGAAGGCGG TGTGCCGCCC 3800
GCACAATCCA GCCAGCGGCT CCATTCATCC CGGCGGAATG AGCGCAGCAG GGTAAAGCGG TGAACATCAT CCGGCTGCTG TAACTGTTCT GCAATGGCCG 3900
GTGAGCACAG CGGAGCGTGT GGTGCACTGA AAATCAGTTC CGCATCTGAC TCATGCCACG CGCCGTTACC GAAACGGATC GTATAATCAT GCCCTTCCGC 4000
CGCCGGGTCC ACATGATTGT TATGGGTGGA GATATGCAGA TCAATATGCG GATGGCTGTC ATAGAATCCG GCCAGACGCG GCAGCAGCCA GCCTGCGGCA 4100
AATGTTCCCA CCGCACCGAC TTTCACCCGC TCACGGAACT GCCCGTGAGA AAAACACTCC AGAGTATCCG CAATCCGGTC AAACGCCTCA TTGAGCACCG 4200
GCAGTAATCC CTCACCTTCA TGGGTCAGCA CCAGCCCGCG CGAGACGCGG GTAAACAGCA CACAGCCGAG TTGTTCTTCC AGCGCCCTGA CCTGCTGGCT 4300
GACGGCGGCA TGGGTGACAT TCAGCTCAAT CGCCGCGCGG GTAAAACTGA GATGACGGGC GGCGGCCTCA AAGGCGCGCA GCGGGTTAAG GGGGAGATAA 4400
CGTCTGACCA TAATCCACCT GTAAGTTTTT CTTTAGGGCA GTTGAGTCTT CATAATGCCT CCTTTGAAGG GAAGGTATAA CGTCGGCTGC GGCAGCAGAG 4500
AAACGGCATT CCGGCGGTAA TACTCTGCTA TAGCAGAATC TGCCTTGTTG TAGCGCGGAA CTCAAGTGAT ATTTGCCTCT TGTGTTTGCA TTCGAGCTAA 4600
TCCGGCAGCA CTATTACTCC CAAGGGTTCC AGCAGTTGCT CCTGTTGCCA GGCACAGATC TTGACTCCTT CGAGGTTGAC CCGTCTGGGG TCGAGCCCAT 4700
CCAGATCGGC AAAGGTCAGG TCACAGCCCC GCAGATTGAC CTGTTGCCAA CAGTCGCGGG AGAAGGTGCC GCGGCTGAGA TCTGAGCCCA TCAAGGAAGC 4800
GCCGCTGAGA TTGGCATTGC TCCAGTTGTT TTCAAACAGC TCGCATTTTT CCAGGCATTG GCCACTCAAG TTGGTATAGG CCAGGTTGCA ACCTGAGATA 4900
TAAGCCGAGC AGAAGTACAT CTTATGGCTG ACTTGATTGT AGAAGCGGGC CCGGGAAAAG TTGGCGCCCT TGAGATCGCA CTCCCTGAAC TCTATGCCAA 5000
AGCAGTTGGC ACCGCTGAAG TTGGCCAAAG ACAGACGGCA GGCCTTGAAA CTGGCATCGC GCAGATCGGC ATAGCTGAAG TGACACCCTT CAACGGCGCC 5100
GCTTTCAATG AAACTGCAAT CCTCGAAACT GGCATCCTGC AGCTGACAGT GGCTGAAGTC ACACTGATAA AAGCGGCAGC GGCGAAAACG GCTGTCACTC 5200
AAATCCTGGC GTGAGAAATC CTCTTGCTGA AAAACTTTAT CAATAATATC CATACGGCTT CCTTTAATCA GGGAGGGTGC TAACGGGTAT AGGAAGTATA 5300
AACCACCTTT TTGCTCCTCA TCCGAAGTAT CTTACCTGAA ATTCCCTCAC TCGTTTACCG CTCAAGCCCC AATTTTAACT GCCGGTCCAG CCTAAACCGC 5400
TCTAATAAGG TTCGATTTGG CGGTAAAATC TCTAGCCTGA TAGCTCGAGA GATACAAACT GCCCCACCGC CCCGTTTAAA AGTTGGCAGT GTTGAGCAGT 5500
GTTGGATTTG GGGTCGTCAG TCAAAGAGAC GACTCTGTGA TGGATCGAAC AGGCTGGGAG TCAGTGGCGG CGCTCGTTCT GGTGGCAGCT CACGCTGCTT 5600
GGCGGCATTC GCCTTGGCTG TTTTCTGTTT CAGATGCTTG AGAATCTGCT CAATGACCTT CGGATCTTCG ATGCTGGCAA TCACTTTGAC GTGACCGCCG 5700
CAGTGTTCGC AGACTTCAAT ATCAATATTG AAGACTCGCT TGAGGCGTTG CATCCAGGTC ATGGCGCGGT GGCGCTCTGC AGGACTCTTG TCACGCCAGT 5800
TAGTATCGAG ACCTTCCGAT TTGTCGGGCT TCTTGCCCCG CTTGGCGGGT GTTACTTGAA CTCGGTGTTT GCTGTTCGGT GCAAAGACGC CGTGGAAGCG 5900
TGTGAGGTTG ACTCGCGGCT TAGGTACCAA CGCAGCGAGT TTGGCGATGA AGTCCAGCGG CTCGAAGATC ACATGGGTGG TGCCATTGCG GTACGGAGTT 6000
TTGAGCTCGT AACGCACCTG CCCATTGGCG GTTAATGCCA GACGTTTTTC TGAAACCGCT GGCCGACTAA TGTAGCGACA CAAGCGCTCA AGCTTATCCC 6100
GCTGATGCGC TTCGGCCATC ACACCGGCGT GTAGCGAGAA ACCAGCATGG TTGGCTACTC GACTGCTTGA GTCGGCTTTA TCCTCACGCC CTGGCAAGGT 6200
TTGCAGGGTG AAGACTTTGC GCCCTTGCTG GGGGCCGACG GCAATGCGAT ACGTAACCGA AGCACCATGT AATTGAGTCA GCGTATCGTC TTCGCCCTCT 6300
TCCAGTGTCA ACCACGTATT CTCGGCATCA CGCTCCAAAA TCCCACGCTT TTCCATGCAG CGAGCGATGC GATGGCTGAG GGTGTGAGCG AGCGTATTCA 6400
GCTCATCGTA AGTGGGTGCC TTGACACGAT GGAAGCGTTG CTTGCCATAG TCATCTTCGG CATAGACACC ATCGAGAAAC AGCATGTGGT AGTGGACATT 6500
GAGATTTAGC GCGGAGCCAA AGCGTTGGAT AAGAGTCACT GAGCCAGTTT GTGCAGAGGC TTTGGTGTAA CCGGCTTTTT TGATCAGATG AGTTGAGAGT 6600
GTACGATAGA CGATACTCAA GACCTGGCCC ATCAGCTGGG GATGGCGAGC CAGCAAAAAG CGTAGCTGGA AAGGAAAGCT GAGCACCCAC TGGCGAATGG 6700
GCTCCTTGGG GAAGACTTCG TCTATCAGCA GCGCCGCACT CTCGGCCATC CGGCGGGCAC CGCAGCTAGG GCAAAAGCCG CGTCGTTTAC AGCTGAAGGC 6800
GACCAGACGC TCGTGATGAC AATCCTCGCA GCGAACCCGC ATGAAACCAT ACTCCAGACG GCCACATTGG AGGAGGTCGT TGAATTCTTG TTGGATGTAG 6900
CGAGGCAGGT GTTGACCTTG GGCTTCGAGT GAGGCTTTGA AGGCTGGGTA GTGCTGCTCA ACCAGCTGGT AGAGCAGCGT CTGGTCGGGT TGGTGGCGTT 7000
CGTAACCGTT TGTTTGAGTG GGCGATTGAC TCGCCGTGGC GTTCCTTGCC AGCGACATGG GTATCCTCCG CTGATACTGT GGTTATGTAC AGTATCAGCG 7100
GCTTGCGTTC AGACGTCCAG TCTGGCCCTA GACATCGCTA AATGCTTAAC CCGCAATAGC CCTCACGAGT TGTTATCAGC CACTACCGGT TGAGCGAGAA 7200
GGTTTTGGGT TCAGGGTGCT ATTGCTCCAC CAATCACAAT ACTGAAGCCC CAACTGTTAT CAGTTGGGGC TTTTTCTTGT CTGTTTGCGG CGGTTGCGTT 7300
TTATCGGTAG TCGTCGAGCT CTGCACCATC CCACATAAGA GCTTAACGGT GCGATCTTCA ACGCCATCAC ACAAAACTTT CTTTTTCACG CACAGTCAAC 7400
TTATTGGATG TTTTATTAAC AACCCAAAAG GAGATATTTA GCGGGCGGCC GGAAGGTGAA TGCTAGGCAT GATCTAACCC TCGGTCTCTG GCGTCGCGAC 7500
TGCGAAATTT CGCGAGGGTT TCCGAGAAGG TGATTGCGCT TCGCAGATCT CCAGGCGCGT GGGTGCGGAC GTAGTCAGCG CCATTGCCGA TCGCGTGAAG 7600
TTCCGCCGCA AGGCTCGCTG GACCCAGATC CTTTACAGGA AGGCCAACGG TGGCGCCCAA GAAGGATTTC CGCGACACCG AGACCAATAG CGGAAGCCCC 7700
AACGCCGACT TCAGCTTTTG AAGGTTCGAC AGCACGTGCA GCGATGTTTC CGGTGCGGGG CTCAAGAAAA ATCCCATCCC CGGATCGAGG ATGAGCCGGT 7800
CGGCAGCGAC CCCGCTCCGT CGCAAGGCGG AAACCCGCGC CTCGAAGAAC CGCACAATCT CGTCGAGCGC GTCTTCGGGT CGAAGGTGAC CGGTGCGGGT 7900
GGCGATGCCA TCCCGCTGCG CTGAGTGCAT AACCACCAGC CTGCAGTCCG CCTCAGCAAT ATCGGGATAG AGCGCAGGGT CAGGAAATCC TTGGATATCG 8000
TTCAGGTAGC CCACGCCGCG CTTGAGCGCA TAGCGCTGGG TTTCCGGTTG GAAGCTGTCG ATTGAAACAC GGTGCATCTG ATCGGACAGG GCGTCTAAGA 8100
GCGGCGCAAT ACGTCTGATC TCATCGGCCG GCGATACAGG CCTCGCGTCC GGATGGCTGG CGGCCGGTCC GACATCCACG ACGTCTGATC CGACTCGCAG 8200
CATTTCGATC GCCGCGGTGA CAGCGCCGGC GGGGTCTAGC CGCCGGCTCT CATCGAAGAA GGAGTCCTCG GTGAGATTCA GAATGCCGAA CACCGTCACC 8300
ATGGCGTCGG CCTCCGCAGC GACTTCCACG ATGGGGATCG GGCGAGCAAA AAGGCAGCAA TTATGAGCCC CATACCTACA AAGCCCCACG CATCAAGCTT 8400
TTGCCCATGA AGCAACCAGG CAATGGCTGT AATTATGACG ACGCCGAGTC CCGACCAGAC TGCATAAGCA ACACCGACAG GGATGGATTT CAGAACCAGA 8500
GAAAGAAAAT AAAATGCGAT GCCATAACCG ATTATGACAA CGGCGGAAGG GGCAAGCTTA GTAAAGCCCT CGCTAGATTT TAATGCGGAT GTTGCGATTA 8600
CTTCGCCAAC TATTGCGATA ACAAGAAAAA GCCAGCCTTT CATGATATAT CTCCCAATTT GTGTAGGGCT TATTATGCAC GCTTAAAAAT AATAAAAGCA 8700
GACTTGACCT GATAGTTTGG CTGTGAGCAA TTATGTGCTT AGTGCATCTA ACGCCGGAGT TAAGCCGCCG CGCGTAGCGC GGTCGGCTTG AACGAATTGT 8800
TAGACATCAT TTACCAACTG ACTTGATGAT CTCGCCTTTC ACAAAGCGAA TAAATTCTTC CAAGTGATCT GCGCGTGAGG CCAAGTGATC TTCTTTTTGT 8900
CCCAGATAAG CTTGCTTAGC TTCAAGTAAG ACGGGCTGAT ACTGGGCAGG TAGGCGTTTT ATTGCCCAGT CGGCAGCGAC ATCCTTCGGC GCGATTTTGC 9000
CGGTTATTGC GCTGTACCAA ATGCGGGACA ACGTAAGCAC TACATTTCGC TCATCGCCGG CCCAGTCGGG CTGCGAGTTC CATAGCTTCA AGGTTTCCCT 9100
CAGCGCCTCG AATAGATCCT GTTCAGGAAC CGGGTCAAAG AATTCCTCCG CTGCCGGACC TACCAAGGCA ACGCTATGTT CTCTTGCTTT TGTAAGCAGG 9200
ATAGCTAGAT CAATGTCGAT CATGGCTGGC TCGAAGATAC CCGCAAGAAT GTCATTGCGC TGCCATTCTC CAAATTGCAG CTCGCGCTTA GCCGGATAAC 9300
GCCACGGGAT GATGTCGTCA TGCACGACAA GGGTGACTTC TATAGCGCGG AGCGTCTCGC TCTCGCCAGG GAAAGCCGAA GCCTCCATAA GGTCATTGAG 9400
CAATGCTCGC CGCGTCGTTT CATCAAGCTT TACGGCCACA GTAACCAACA AATCAATATC GCTGTATGGC TTCAGGCCGC CATCCACTGC GGAGCCGTAC 9500
AAATGCACGG CCAGCAACGT TGATTCCAGA TGGCGCTCAA TGACGCTTAG CACCTCTGAT AGTTGGTTCG AAATTTCGAT GGTCACCGCT ACCCTCATGA 9600
TGTCTAACGC CCGCCTCAAC TGCCGGAGGT TAATGGCGCC TTTTTTGCAG CTCCACCGCA AAAAAAGGTG ACAGTAACCG GAGGTCAGAT TGCAGGCGCT 9700
TGTTAACTCT TTTGCCAGAT TTGATAACAA TAGTTAATGT TAGACTCGAA ATCTTGCTCA AATACCAACT TGAATGTATC TGGGATTTCA GGAAAAACTA 9800
TGTCTCCATC TCGCTCGATG TCGACTGTTG AAATATGCAA AGTATCTACT TTGGAAATCA AGCTTTTGTA TATTTCACCA CCACCAGAAA CGATAACATG 9900
ATCCGTGATT TCCTCTAAAT TTATTAAGGC ATCCTGAATT GAGGAGAAAA CCATTACACC CTCATCATTC GTAGAAAAAT TAGAGCGAGT TACAACTGCA 10000
TACTTTCGAT TTGGGAGAGC GCCCATTGAC TCAAAAGTTT TGCGTCCAAC TAAAAGCCAT TGATTATATG TAATTGCCTT AAATAGAAGT TGCTCGCCTT 10100
TGGCGCTCCA TGGAATATCT GGTCCATTAC CGATAATACC GTTCTTCGAC TTGGCAGCCA TTAGTGATAA CTTCACGATT TCTCCCTCGA GTTAACTTTG 10200
TTTTAGGGCG ACTGCCCTGC TGCGTAACAT CGTTGCTGCT CCATAACATC AAACATCGAC CCACGGCGTA ACGCGCTTGC TGCTTGGATG CCCGAGGCAT 10300
AGACTGTACA AAAAAACAGT CATAACAAGC CATGAAAACC GCCACTGCGC CGTTACCACC GCTGCGTTCG GTCAAGGTTC TGGACCAGTT GCGTGAGCGC 10400
ATACGCTACT TGCATTACAG TTTACGAACC GAACAGGCTT ATGTCCACTG GGTTCGTGCC TTCATCCGTT TCCACGGTGT GCGTCACCCG GCAACCTTGG 10500
GCAGCAGCGA AGTCGAGGCA TTTCTGTCCT GGCTGGCGAA CGAGCGCAAG GTTTCGGTCT CCACGCATCG TCAGGCATTG GCGGCCTTGC TGTTCTTCTA 10600
CGGCAAGGTG CTGTGCACGG ATCTGCCCTG GCTTCAGGAG ATCGGAAGAC CTCGGCCGTC GCGGCGCTTG CCGGTGGTGC TGACCCCGGA TGAAGTGGTT 10700
CGCATCCTCG GTTTTCTGGA AGGCGAGCAT CGTTTGTTCG CCCAGCTTCT GTATGGAACG GGCATGCGGA TCAGTGAGGG TTTGCAACTG CGGGTCAAGG 10800
ATCTGGATTT CGATCACGGC ACGATCATCG TGCGGGAGGG CAAGGGCTCC AAGGATCGGG CCTTGATGTT ACCCGAGAGC TTGGCACCCA GCCTGCGCGA 10900
GCAGCTGTCG CGTGCACGGG CATGGTGGCT GAAGGACCAG GCCGAGGGCC GCAGCGGCGT TGCGCTTCCC GACGCCCTTG AGCGGAAGTA TCCGCGCGCC 11000
GGGCATTCCT GGCCGTGGTT CTGGGTTTTT GCGCAGCACA CGCATTCGAC CGATCCACGG AGCGGTGTCG TGCGTCGCCA TCACATGTAT GACCAGACCT 11100
TTCAGCGCGC CTTCAAACGT GCCGTAGAAC AAGCAGGCAT CACGAAGCCC GCCACACCGC ACACCCTCCG CCACTCGTTC GCGACGGCCT TGCTCCGCAG 11200
CGGTTACGAC ATTCGAACCG TGCAGGATCT GCTCGGCCAT TCCGACGTCT CTACGACGAT GATTTACACG CATGTGCTGA AAGTTGGCGG TGCCGGAGTG 11300
CGCTCACCGC TTGATGCGCT GCCGCCCCTC ACTAGTGAGA GGTAGGGCAG CGCAAGTCAA TCCTGGCGGA TTCACTACCC CTGCGCGAAG GCCATCGGTG 11400
CCGCATCGAA CGGCCGGTTG CGGAAAGTCC TCCCTGCGTC CGCTGATGGC CGGCAGCAGC CCGTCGTTGC CTGATGGATC CAACCCCTCC GCTGCTATAG 11500
TGCAGTCGGC TTCTGACGTT CAGTGCAGCC GTCTTCTGAA AACGACA
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
attC aadA3 core |
8752-8805 |
54 |
CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC |
attC dfr16 core |
9608-9706 |
99 |
CGCCCGCCTC AACTGCCGGA GGTTAATGGC GCCTTTTTTG CAGCTCCACC GCAAAAAAAG GTGACAGTAA CCGGAGGTCA GATTGCAGGC GCTTGTTAA |
attI |
10196-10251 |
56 |
CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA |
|
ORFs |
|
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnp |
Tnp |
IS6100 |
765 |
207-971 |
- |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MTDFKWRHFQ GDVILWAVRW YCRYPISYRD LEEMLAERGI SVDHTTIYRW VQCYAPEMEK RLRWFWRRGF DPSWRLDETY VKVRGKWTYL YRAVDKRGDT IDFYLSPTRS AKAAKRFLGK ALRGLKHWEK PATLNTDKAP SYGAAITELK REGKLDRETA HRQVKYLNNV IEADHGKLKI LIKPVRGFKS IPTAYATIKG FEVMRALRKG QARPWCLQPG IRGEVRLVER AFGIGPSALT EAMGMLNHHF AAAA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
WP_001424634.1 |
WP_001424634.1 |
In36 |
288 |
1167-1454 |
- |
Class: | Passenger Gene |
Sub Class: | Hypothetical |
Protein Sequence:
|
MSGWDGPVAS RSLAVRPRCL GCARGCRYPP LPSSTLDNAA AVIAADARAG LFGPVRLARP SACLGEISGG RINGGASPPF VRVKAEDSSL KNIRL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
GNAT_fam |
GNAT_fam |
In36 |
501 |
1478-1978 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | Acetyltransf_1 (Pfam:PF00583) |
Comment: | putative acetyltransferase ADU64769.1 |
Protein Sequence:
|
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
sul1 (ARO:3000410) |
Sul1 |
In36 |
840 |
2106-2945 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target replacement (ARO:0001002) |
Transpoase Chemistry: | dihydropteroate synthase |
Target: | sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401) |
Sequence Family: | sulfonamide resistant sul (ARO:3004238) |
Comment: | perfect match to reference sequence for ARO:3000410 |
Protein Sequence:
|
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
qacEdelta1_p (ARO:3005010) |
N |
In36 |
208 |
2939-3146 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic efflux (ARO:0010000) |
Transpoase Chemistry: | antibiotic efflux (ARO:0010000) |
Target: | acridine dye (ARO:3000054)||quaternary ammonium salts |
Sequence Family: | major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002) |
Comment: | perfect match C-temrinal 68 amino acis of reference sequence for ARO:3005010 (bitscore: 133) |
Protein Sequence:
|
SLVLKSIPVG VAYAVWSGLG VVIITAIAWL LHGQKLDAWG FVGMGLIIAA FLLARSPSWK SLRRPTPW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
bla NmcR (ARO:3003665) |
Bla NmcR |
In36 |
876 |
3536-4411 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Transpoase Chemistry: | antibiotic inactivation (ARO:0001004) |
Target: | penam (ARO:3000008)||cephamycin (ARO:0000044)||carbapenem (ARO:0000020)||cephalosporin (ARO:0000032) |
Sequence Family: | NmcA beta-lactamase (ARO:3004195) |
Comment: | poor match 46% to reference sequence for ARO:3003665 (bitscore: 275)||Synonyms: |
Protein Sequence:
|
MVRRYLPLNP LRAFEAAARH LSFTRAAIEL NVTHAAVSQQ VRALEEQLGC VLFTRVSRGL VLTHEGEGLL PVLNEAFDRI ADTLECFSHG QFRERVKVGA VGTFAAGWLL PRLAGFYDSH PHIDLHISTH NNHVDPAAEG HDYTIRFGNG AWHESDAELI FSAPHAPLCS PAIAEQLQQP DDVHRFTLLR SFRRDEWSRW LDCAGGTPPS PSQPVMVFDT SLAMAEAAQL GAGVAIAPVC MFSRLLQSGA LVQPFAAEIT LGGYWLTRLQ SRTETPAMQQ FARWLLNTAA A
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
QnrA1 (ARO:3002707) |
QnrA1 |
In36 |
657 |
4597-5253 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target protection (ARO:0001003) |
Transpoase Chemistry: | antibiotic target protection (ARO:0001003) |
Target: | fluoroquinolone antibiotic (ARO:0000001) |
Sequence Family: | quinolone resistance protein (qnr) (ARO:3000419) |
Comment: | strict match to reference sequence for ARO:3002707 (bitscore: 446)||Synonyms: |
Protein Sequence:
|
MDIIDKVFQQ EDFSRQDLSD SRFRRCRFYQ CDFSHCQLQD ASFEDCSFIE SGAVEGCHFS YADLRDASFK ACRLSLANFS GANCFGIEFR ECDLKGANFS RARFYNQVSH KMYFCSAYIS GCNLAYTNLS GQCLEKCELF ENNWSNANLS GASLMGSDLS RGTFSRDCWQ QVNLRGCDLT FADLDGLDPR RVNLEGVKIC AWQQEQLLEP LGVIVLPD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnp |
Tnp |
ISCR1 |
1542 |
5517-7058 |
- |
Class: | Transposase |
Transpoase Chemistry: | HUH |
Sequence Family: | IS91 tnp |
Comment: | CR of ISCR1 |
Protein Sequence:
|
MSLARNATAS QSPTQTNGYE RHQPDQTLLY QLVEQHYPAF KASLEAQGQH LPRYIQQEFN DLLQCGRLEY GFMRVRCEDC HHERLVAFSC KRRGFCPSCG ARRMAESAAL LIDEVFPKEP IRQWVLSFPF QLRFLLARHP QLMGQVLSIV YRTLSTHLIK KAGYTKASAQ TGSVTLIQRF GSALNLNVHY HMLFLDGVYA EDDYGKQRFH RVKAPTYDEL NTLAHTLSHR IARCMEKRGI LERDAENTWL TLEEGEDDTL TQLHGASVTY RIAVGPQQGR KVFTLQTLPG REDKADSSSR VANHAGFSLH AGVMAEAHQR DKLERLCRYI SRPAVSEKRL ALTANGQVRY ELKTPYRNGT THVIFEPLDF IAKLAALVPK PRVNLTRFHG VFAPNSKHRV QVTPAKRGKK PDKSEGLDTN WRDKSPAERH RAMTWMQRLK RVFNIDIEVC EHCGGHVKVI ASIEDPKVIE QILKHLKQKT AKANAAKQRE LPPERAPPLT PSLFDPSQSR LFD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
sul1 (ARO:3000410) |
Sul1 |
In36 |
840 |
7463-8302 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target replacement (ARO:0001002) |
Transpoase Chemistry: | dihydropteroate synthase |
Target: | sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401) |
Sequence Family: | sulfonamide resistant sul (ARO:3004238) |
Comment: | perfect match to reference sequence for ARO:3000410 |
Protein Sequence:
|
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
qacEdelta1 (ARO:3005010) |
QacEdelta1 |
In36 |
348 |
8296-8643 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic efflux (ARO:0010000) |
Target: | disinfecting agents and antiseptics (ARO:3005386) |
Sequence Family: | major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002) |
Comment: | subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219) |
Protein Sequence:
|
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL ARSPSWKSLR RPTPW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
aadA3 (ARO:3002603) |
AadA3 |
In36 |
792 |
8807-9598 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | ANT(3'') (ARO:3004275) |
Comment: | strict match to reference sequence for ARO:3002603 (bitscore: 522) |
Protein Sequence:
|
MRVAVTIEIS NQLSEVLSVI ERHLESTLLA VHLYGSAVDG GLKPYSDIDL LVTVAVKLDE TTRRALLNDL MEASAFPGES ETLRAIEVTL VVHDDIIPWR YPAKRELQFG EWQRNDILAG IFEPAMIDID LAILLTKARE HSVALVGPAA EEFFDPVPEQ DLFEALRETL KLWNSQPDWA GDERNVVLTL SRIWYSAITG KIAPKDVAAD WAIKRLPAQY QPVLLEAKQA YLGQKEDHLA SRADHLEEFI RFVKGEIIKS VGK
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
dfrA16 (ARO:3003014) |
DfrA16 |
In36 |
474 |
9703-10176 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target replacement (ARO:0001002) |
Transpoase Chemistry: | antibiotic target replacement (ARO:0001002) |
Target: | diaminopyrimidine antibiotic (ARO:3000171) |
Sequence Family: | trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218) |
Comment: | strict match to reference sequence for ARO:3003014 (bitscore: 325) |
Protein Sequence:
|
MKLSLMAAKS KNGIIGNGPD IPWSAKGEQL LFKAITYNQW LLVGRKTFES MGALPNRKYA VVTRSNFSTN DEGVMVFSSI QDALINLEEI TDHVIVSGGG EIYKSLISKV DTLHISTVDI ERDGDIVFPE IPDTFKLVFE QDFESNINYC YQIWQKS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
intI1 |
IntI1 |
In36 |
1014 |
10332-11345 |
+ |
Class: | Integron Integrase |
Sub Class: | Class 1 |
Transpoase Chemistry: | Tyrosine |
Sequence Family: | Class 1 Integron Tyrosine Integrase |
Protein Sequence:
|
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
|
|
Internal Transposable Elements (TE) |
|
|
TnCentral Accession |
TE Name |
Type |
Coordinates |
Length |
IS6100-U12338.3 |
IS6100 |
Insertion Sequence |
153-1032 |
880 |
ISCR1-L06822 |
ISCR1 |
Insertion Sequence |
5285-7438 |
2154 |
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
repeat t1 |
In36 |
9-27 |
TCAGAAGACG ACTGCACCA |
repeat t2 |
In36 |
49-67 |
AACACGTCGG TCGAGGACT |
repeat t3 |
In36 |
78-97 |
TCAGAAGTGA TCTGCACCAA |
repeat t4 |
In36 |
110-128 |
TCAATACTCG TGTGCACCA |
IR |
IS6100 |
153-166 |
GGCTCTGTTG CAAA |
IR |
IS6100 |
1019-1032 |
AAACGTTGTC TCGG |
repeat t3 |
In36 |
1059-1078 |
AACCACGTCT AGTGAAGACT |
repeat t2 |
In36 |
1089-1107 |
TCAGGAGCTG GCTGCACAA |
IRt |
In36 |
1123-1155 |
TAGTTGACCA CGTCAGCAGA AGACTTTTAC TGT |
repeat t1 |
In36 |
1129-1147 |
ACCACGTCAG CAGAAGACT |
repeat i4 |
In36 |
11428-11446 |
AGGAGGGACG CAGGCGACT |
repeat i3 |
In36 |
11456-11474 |
CGTCGGGCAG CAACGGACT |
repeat i2 |
In36 |
11498-11516 |
ATCACGTCAG CCGAAGACT |
IRi |
In36 |
11515-11547 |
CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT |
|
References |
|
|
Wang M, Tran JH, Jacoby GA, Zhang Y, Wang F, Hooper DC. Plasmid-mediated quinolone resistance in clinical isolates of Escherichia coli from Shanghai, China. Antimicrob Agents Chemother. 2003 Jul;47(7):2242-8. doi: 10.1128/aac.47.7.2242-2248.2003. PubMed ID: 12821475
| |
| | |
|
|