Transposon
Name: In36
Family: Tn402        Group: Class 1
Evidence of Transposition: no
 Host     

Host Organism:Escherichia coli Molecular Source:plasmid pHSH1
Place of Origin:Shanghai, China Date of Isolation:2003
Other Geographic Information:teaching hospital March 2000 and March 2001

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  11547 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GAGGCTCTGT TGCAAAGATT GGCGGCAGTC AGAGGTAGGC TGTCGCTCTG 200
CGCCGATCAG GCGGCTGCTG CGAAATGGTG GTTGAGCATG CCCATGGCCT CCGTCAGCGC CGAGGGCCCA ATGCCAAAAG CTCTCTCCAC AAGGCGCACC 300
TCGCCCCTGA TGCCGGGCTG CAGGCACCAG GGGCGAGCCT GTCCTTTGCG CAGGGCTCGC ATGACTTCGA ATCCCTTGAT CGTGGCATAG GCCGTGGGGA 400
TCGATTTGAA ACCGCGCACC GGCTTGATCA GTATCTTGAG CTTTCCGTGA TCGGCCTCGA TCACGTTATT GAGATACTTC ACCTGCCGGT GGGCCGTCTC 500
CCGGTCCAGC TTTCCTTCGC GCTTCAATTC GGTGATCGCT GCACCATAGC TCGGCGCTTT GTCGGTATTG AGCGTGGCAG GCTTTTCCCA GTGCTTCAGG 600
CCTCGCAGGG CCTTGCCCAG GAACCGCTTC GCTGCCTTGG CGCTGCGGGT CGGCGACAGG TAGAAATCGA TCGTGTCGCC CCGCTTGTCG ACTGCCCGGT 700
ACAGGTAGGT CCACTTGCCC CGCACCTTGA CGTAGGTTTC ATCCAGGCGC CAGCTCGGAT CAAAGCCACG CCGCCAGAAC CAGCGCAGCC GCTTCTCCAT 800
CTCCGGGGCG TAGCACTGGA CCCAGCGATA GATCGTCGTA TGGTCGACCG AAATGCCGCG TTCCGCCAGC ATTTCCTCAA GGTCGCGATA GCTGATCGGA 900
TAGCGACAAT ACCAGCGCAC CGCCCACAGG ATCACATCAC CCTGGAAATG GCGCCACTTG AAATCCGTCA TCGTTCCGTC CGTCCAATCT CCGCCAAGCA 1000
TGCTCAAGCT TCACGATTTT TGCAACAGAG CCCACACGAG TATTGAGCAT AGTCGAGATT GGTGCAGATC ACTTCTGATA TTGAACTGTC AGGAGCTGGC 1100
TGCACAACAG CCATTACGCC CAATCAACTG GTGCAGTCGT CTTCTGAAAA TGACATTTGG TATCTCTCAT AAACGGATGT TTTTGAGAGA ACTATCTTCG 1200
GCCTTCACAC GCACGAAAGG CGGCGAAGCT CCGCCGTTAA TCCGTCCGCC GGAGATCTCG CCCAGGCAGG CTGAAGGCCG AGCAAGCCTG ACAGGCCCGA 1300
AAAGCCCGGC ACGGGCGTCG GCGGCGATGA CGGCGGCGGC ATTATCCAGG GTTGATGATG GAAGTGGAGG ATATCGACAA CCTCTCGCGC AACCAAGACA 1400
TCGCGGTCGG ACTGCAAGTG ATCTTGAAGC CACGGGCCCG TCCCACCCCG ACATGGACCT CGATGCCCGA ACGGACGTTA GATTTCGAGT TCTAGGCGTT 1500
CTGCGATGAA GGTTGGATCC CAGCCGGGAT TGAAAGTGTC GACGTGGGTG AATCCGAGCC GCTCGTATAG GCCACGCAGG TTCGGGTGGC AGTCGAGCCG 1600
CAGCTTGGCG CACCCCTGCG TTCGCGCGGC ATGGCGGCAA GCCTCGATCA GCGCGGAGCT GACACCCCGG CCCGCATGTG TCCGTCGCAC CGCGAGCTTG 1700
TGCAGATATG CGGCCTCCCC CTTGAGGGCG TCGGGCCAGA ACTCGGGATC CTCGGCCGAC AAGGTGCAAC AGCCGACGAT GCCGTCGCTG CAACTCGCGA 1800
CTAGGAGCTC GGATCTCAGG ACGAAGGTCT CCGCGAATGT CCGGTCGATC CGCGCGACGT CCCAGGCGGG CGTTCCCTTG GCGGACATCC ACGCCGCAGC 1900
GTCGTGCATC AGCCGCACAA CCTCGTCGAT ATCACCCGAG CAGGCGACCC GAACGTTCGG AGGCTCCTCG CTGTCCATTC GCTCCCCTGG CGCGGTATGA 2000
ACCGCCGCCT CATAGTGCAG TTTGATCCTG ACGAGCCCAG CATGTCTGCG CCCACCTTCG CGCAACCCAA AAGGAGATAT TTAGCGGGCG GCCGGAAGGT 2100
GAATGCTAGG CATGATCTAA CCCTCGGTCT CTGGCGTCGC GACTGCGAAA TTTCGCGAGG GTTTCCGAGA AGGTGATTGC GCTTCGCAGA TCTCCAGGCG 2200
CGTGGGTGCG GACGTAGTCA GCGCCATTGC CGATCGCGTG AAGTTCCGCC GCAAGGCTCG CTGGACCCAG ATCCTTTACA GGAAGGCCAA CGGTGGCGCC 2300
CAAGAAGGAT TTCCGCGACA CCGAGACCAA TAGCGGAAGC CCCAACGCCG ACTTCAGCTT TTGAAGGTTC GACAGCACGT GCAGCGATGT TTCCGGTGCG 2400
GGGCTCAAGA AAAATCCCAT CCCCGGATCG AGGATGAGCC GGTCGGCAGC GACCCCGCTC CGTCGCAAGG CGGAAACCCG CGCCTCGAAG AACCGCACAA 2500
TCTCGTCGAG CGCGTCTTCG GGTCGAAGGT GACCGGTGCG GGTGGCGATG CCATCCCGCT GCGCTGAGTG CATAACCACC AGCCTGCAGT CCGCCTCAGC 2600
AATATCGGGA TAGAGCGCAG GGTCAGGAAA TCCTTGGATA TCGTTCAGGT AGCCCACGCC GCGCTTGAGC GCATAGCGCT GGGTTTCCGG TTGGAAGCTG 2700
TCGATTGAAA CACGGTGCAT CTGATCGGAC AGGGCGTCTA AGAGCGGCGC AATACGTCTG ATCTCATCGG CCGGCGATAC AGGCCTCGCG TCCGGATGGC 2800
TGGCGGCCGG TCCGACATCC ACGACGTCTG ATCCGACTCG CAGCATTTCG ATCGCCGCGG TGACAGCGCC GGCGGGGTCT AGCCGCCGGC TCTCATCGAA 2900
GAAGGAGTCC TCGGTGAGAT TCAGAATGCC GAACACCGTC ACCATGGCGT CGGCCTCCGC AGCGACTTCC ACGATGGGGA TCGGGCGAGC AAAAAGGCAG 3000
CAATTATGAG CCCCATACCT ACAAAGCCCC ACGCATCAAG CTTTTGCCCA TGAAGCAACC AGGCAATGGC TGTAATTATG ACGACGCCGA GTCCCGACCA 3100
GACTGCATAA GCAACACCGA CAGGGATGGA TTTCAGAACC AGAGAAAGAA AATAAAATGC GATGCCATAA CCGATTATGA CAACGGCGGA AGGGGCAAGC 3200
TTAGTAAAGC CCTCGGTGCG GCGGACATTA TCCGCGAACA GGCTGAACAG CACGGCATTG CGCGGGTAAC GGATGTCTGG CTGGAAGTCG GCGCACTGGC 3300
GGATGTTGAG GAGAGTGCAC TGCATTTCTG TTTTGATATC GCCTGCCGTG ATACCGTGGC GCAGGGCTGC ACACTGCATA TTGATGTTAT CCCGGCACAG 3400
GCATGGTGCT GGGATTGCAG CCGTGAGGCC GAAATCATGC AGCACGCCGG ATGCTGTCCG CACTGCGGCA GTGAACGGCT GCGCATCAGT GAAGGTGATG 3500
ATTTGCGGGT AAAAAGCCTG GAAGGTGAGT GAGTTTTACG CCGCCGCCGT ATTCAGCAGC CAGCGGGCGA ATTGCTGCAT GGCCGGGGTT TCCGTACGGG 3600
ACTGTAACCG CGTCAGCCAG TAGCCGCCGA GGGTGATTTC TGCGGCAAAC GGCTGTACCA GTGCGCCTGA CTGTAACAGG CGGCTGAACA TACATACCGG 3700
TGCGATCGCT ACCCCGGCAC CCAGTTGTGC CGCCTCGGCC ATGGCCAGTG AGGTATCGAA CACCATTACC GGCTGTGACG GGGAAGGCGG TGTGCCGCCC 3800
GCACAATCCA GCCAGCGGCT CCATTCATCC CGGCGGAATG AGCGCAGCAG GGTAAAGCGG TGAACATCAT CCGGCTGCTG TAACTGTTCT GCAATGGCCG 3900
GTGAGCACAG CGGAGCGTGT GGTGCACTGA AAATCAGTTC CGCATCTGAC TCATGCCACG CGCCGTTACC GAAACGGATC GTATAATCAT GCCCTTCCGC 4000
CGCCGGGTCC ACATGATTGT TATGGGTGGA GATATGCAGA TCAATATGCG GATGGCTGTC ATAGAATCCG GCCAGACGCG GCAGCAGCCA GCCTGCGGCA 4100
AATGTTCCCA CCGCACCGAC TTTCACCCGC TCACGGAACT GCCCGTGAGA AAAACACTCC AGAGTATCCG CAATCCGGTC AAACGCCTCA TTGAGCACCG 4200
GCAGTAATCC CTCACCTTCA TGGGTCAGCA CCAGCCCGCG CGAGACGCGG GTAAACAGCA CACAGCCGAG TTGTTCTTCC AGCGCCCTGA CCTGCTGGCT 4300
GACGGCGGCA TGGGTGACAT TCAGCTCAAT CGCCGCGCGG GTAAAACTGA GATGACGGGC GGCGGCCTCA AAGGCGCGCA GCGGGTTAAG GGGGAGATAA 4400
CGTCTGACCA TAATCCACCT GTAAGTTTTT CTTTAGGGCA GTTGAGTCTT CATAATGCCT CCTTTGAAGG GAAGGTATAA CGTCGGCTGC GGCAGCAGAG 4500
AAACGGCATT CCGGCGGTAA TACTCTGCTA TAGCAGAATC TGCCTTGTTG TAGCGCGGAA CTCAAGTGAT ATTTGCCTCT TGTGTTTGCA TTCGAGCTAA 4600
TCCGGCAGCA CTATTACTCC CAAGGGTTCC AGCAGTTGCT CCTGTTGCCA GGCACAGATC TTGACTCCTT CGAGGTTGAC CCGTCTGGGG TCGAGCCCAT 4700
CCAGATCGGC AAAGGTCAGG TCACAGCCCC GCAGATTGAC CTGTTGCCAA CAGTCGCGGG AGAAGGTGCC GCGGCTGAGA TCTGAGCCCA TCAAGGAAGC 4800
GCCGCTGAGA TTGGCATTGC TCCAGTTGTT TTCAAACAGC TCGCATTTTT CCAGGCATTG GCCACTCAAG TTGGTATAGG CCAGGTTGCA ACCTGAGATA 4900
TAAGCCGAGC AGAAGTACAT CTTATGGCTG ACTTGATTGT AGAAGCGGGC CCGGGAAAAG TTGGCGCCCT TGAGATCGCA CTCCCTGAAC TCTATGCCAA 5000
AGCAGTTGGC ACCGCTGAAG TTGGCCAAAG ACAGACGGCA GGCCTTGAAA CTGGCATCGC GCAGATCGGC ATAGCTGAAG TGACACCCTT CAACGGCGCC 5100
GCTTTCAATG AAACTGCAAT CCTCGAAACT GGCATCCTGC AGCTGACAGT GGCTGAAGTC ACACTGATAA AAGCGGCAGC GGCGAAAACG GCTGTCACTC 5200
AAATCCTGGC GTGAGAAATC CTCTTGCTGA AAAACTTTAT CAATAATATC CATACGGCTT CCTTTAATCA GGGAGGGTGC TAACGGGTAT AGGAAGTATA 5300
AACCACCTTT TTGCTCCTCA TCCGAAGTAT CTTACCTGAA ATTCCCTCAC TCGTTTACCG CTCAAGCCCC AATTTTAACT GCCGGTCCAG CCTAAACCGC 5400
TCTAATAAGG TTCGATTTGG CGGTAAAATC TCTAGCCTGA TAGCTCGAGA GATACAAACT GCCCCACCGC CCCGTTTAAA AGTTGGCAGT GTTGAGCAGT 5500
GTTGGATTTG GGGTCGTCAG TCAAAGAGAC GACTCTGTGA TGGATCGAAC AGGCTGGGAG TCAGTGGCGG CGCTCGTTCT GGTGGCAGCT CACGCTGCTT 5600
GGCGGCATTC GCCTTGGCTG TTTTCTGTTT CAGATGCTTG AGAATCTGCT CAATGACCTT CGGATCTTCG ATGCTGGCAA TCACTTTGAC GTGACCGCCG 5700
CAGTGTTCGC AGACTTCAAT ATCAATATTG AAGACTCGCT TGAGGCGTTG CATCCAGGTC ATGGCGCGGT GGCGCTCTGC AGGACTCTTG TCACGCCAGT 5800
TAGTATCGAG ACCTTCCGAT TTGTCGGGCT TCTTGCCCCG CTTGGCGGGT GTTACTTGAA CTCGGTGTTT GCTGTTCGGT GCAAAGACGC CGTGGAAGCG 5900
TGTGAGGTTG ACTCGCGGCT TAGGTACCAA CGCAGCGAGT TTGGCGATGA AGTCCAGCGG CTCGAAGATC ACATGGGTGG TGCCATTGCG GTACGGAGTT 6000
TTGAGCTCGT AACGCACCTG CCCATTGGCG GTTAATGCCA GACGTTTTTC TGAAACCGCT GGCCGACTAA TGTAGCGACA CAAGCGCTCA AGCTTATCCC 6100
GCTGATGCGC TTCGGCCATC ACACCGGCGT GTAGCGAGAA ACCAGCATGG TTGGCTACTC GACTGCTTGA GTCGGCTTTA TCCTCACGCC CTGGCAAGGT 6200
TTGCAGGGTG AAGACTTTGC GCCCTTGCTG GGGGCCGACG GCAATGCGAT ACGTAACCGA AGCACCATGT AATTGAGTCA GCGTATCGTC TTCGCCCTCT 6300
TCCAGTGTCA ACCACGTATT CTCGGCATCA CGCTCCAAAA TCCCACGCTT TTCCATGCAG CGAGCGATGC GATGGCTGAG GGTGTGAGCG AGCGTATTCA 6400
GCTCATCGTA AGTGGGTGCC TTGACACGAT GGAAGCGTTG CTTGCCATAG TCATCTTCGG CATAGACACC ATCGAGAAAC AGCATGTGGT AGTGGACATT 6500
GAGATTTAGC GCGGAGCCAA AGCGTTGGAT AAGAGTCACT GAGCCAGTTT GTGCAGAGGC TTTGGTGTAA CCGGCTTTTT TGATCAGATG AGTTGAGAGT 6600
GTACGATAGA CGATACTCAA GACCTGGCCC ATCAGCTGGG GATGGCGAGC CAGCAAAAAG CGTAGCTGGA AAGGAAAGCT GAGCACCCAC TGGCGAATGG 6700
GCTCCTTGGG GAAGACTTCG TCTATCAGCA GCGCCGCACT CTCGGCCATC CGGCGGGCAC CGCAGCTAGG GCAAAAGCCG CGTCGTTTAC AGCTGAAGGC 6800
GACCAGACGC TCGTGATGAC AATCCTCGCA GCGAACCCGC ATGAAACCAT ACTCCAGACG GCCACATTGG AGGAGGTCGT TGAATTCTTG TTGGATGTAG 6900
CGAGGCAGGT GTTGACCTTG GGCTTCGAGT GAGGCTTTGA AGGCTGGGTA GTGCTGCTCA ACCAGCTGGT AGAGCAGCGT CTGGTCGGGT TGGTGGCGTT 7000
CGTAACCGTT TGTTTGAGTG GGCGATTGAC TCGCCGTGGC GTTCCTTGCC AGCGACATGG GTATCCTCCG CTGATACTGT GGTTATGTAC AGTATCAGCG 7100
GCTTGCGTTC AGACGTCCAG TCTGGCCCTA GACATCGCTA AATGCTTAAC CCGCAATAGC CCTCACGAGT TGTTATCAGC CACTACCGGT TGAGCGAGAA 7200
GGTTTTGGGT TCAGGGTGCT ATTGCTCCAC CAATCACAAT ACTGAAGCCC CAACTGTTAT CAGTTGGGGC TTTTTCTTGT CTGTTTGCGG CGGTTGCGTT 7300
TTATCGGTAG TCGTCGAGCT CTGCACCATC CCACATAAGA GCTTAACGGT GCGATCTTCA ACGCCATCAC ACAAAACTTT CTTTTTCACG CACAGTCAAC 7400
TTATTGGATG TTTTATTAAC AACCCAAAAG GAGATATTTA GCGGGCGGCC GGAAGGTGAA TGCTAGGCAT GATCTAACCC TCGGTCTCTG GCGTCGCGAC 7500
TGCGAAATTT CGCGAGGGTT TCCGAGAAGG TGATTGCGCT TCGCAGATCT CCAGGCGCGT GGGTGCGGAC GTAGTCAGCG CCATTGCCGA TCGCGTGAAG 7600
TTCCGCCGCA AGGCTCGCTG GACCCAGATC CTTTACAGGA AGGCCAACGG TGGCGCCCAA GAAGGATTTC CGCGACACCG AGACCAATAG CGGAAGCCCC 7700
AACGCCGACT TCAGCTTTTG AAGGTTCGAC AGCACGTGCA GCGATGTTTC CGGTGCGGGG CTCAAGAAAA ATCCCATCCC CGGATCGAGG ATGAGCCGGT 7800
CGGCAGCGAC CCCGCTCCGT CGCAAGGCGG AAACCCGCGC CTCGAAGAAC CGCACAATCT CGTCGAGCGC GTCTTCGGGT CGAAGGTGAC CGGTGCGGGT 7900
GGCGATGCCA TCCCGCTGCG CTGAGTGCAT AACCACCAGC CTGCAGTCCG CCTCAGCAAT ATCGGGATAG AGCGCAGGGT CAGGAAATCC TTGGATATCG 8000
TTCAGGTAGC CCACGCCGCG CTTGAGCGCA TAGCGCTGGG TTTCCGGTTG GAAGCTGTCG ATTGAAACAC GGTGCATCTG ATCGGACAGG GCGTCTAAGA 8100
GCGGCGCAAT ACGTCTGATC TCATCGGCCG GCGATACAGG CCTCGCGTCC GGATGGCTGG CGGCCGGTCC GACATCCACG ACGTCTGATC CGACTCGCAG 8200
CATTTCGATC GCCGCGGTGA CAGCGCCGGC GGGGTCTAGC CGCCGGCTCT CATCGAAGAA GGAGTCCTCG GTGAGATTCA GAATGCCGAA CACCGTCACC 8300
ATGGCGTCGG CCTCCGCAGC GACTTCCACG ATGGGGATCG GGCGAGCAAA AAGGCAGCAA TTATGAGCCC CATACCTACA AAGCCCCACG CATCAAGCTT 8400
TTGCCCATGA AGCAACCAGG CAATGGCTGT AATTATGACG ACGCCGAGTC CCGACCAGAC TGCATAAGCA ACACCGACAG GGATGGATTT CAGAACCAGA 8500
GAAAGAAAAT AAAATGCGAT GCCATAACCG ATTATGACAA CGGCGGAAGG GGCAAGCTTA GTAAAGCCCT CGCTAGATTT TAATGCGGAT GTTGCGATTA 8600
CTTCGCCAAC TATTGCGATA ACAAGAAAAA GCCAGCCTTT CATGATATAT CTCCCAATTT GTGTAGGGCT TATTATGCAC GCTTAAAAAT AATAAAAGCA 8700
GACTTGACCT GATAGTTTGG CTGTGAGCAA TTATGTGCTT AGTGCATCTA ACGCCGGAGT TAAGCCGCCG CGCGTAGCGC GGTCGGCTTG AACGAATTGT 8800
TAGACATCAT TTACCAACTG ACTTGATGAT CTCGCCTTTC ACAAAGCGAA TAAATTCTTC CAAGTGATCT GCGCGTGAGG CCAAGTGATC TTCTTTTTGT 8900
CCCAGATAAG CTTGCTTAGC TTCAAGTAAG ACGGGCTGAT ACTGGGCAGG TAGGCGTTTT ATTGCCCAGT CGGCAGCGAC ATCCTTCGGC GCGATTTTGC 9000
CGGTTATTGC GCTGTACCAA ATGCGGGACA ACGTAAGCAC TACATTTCGC TCATCGCCGG CCCAGTCGGG CTGCGAGTTC CATAGCTTCA AGGTTTCCCT 9100
CAGCGCCTCG AATAGATCCT GTTCAGGAAC CGGGTCAAAG AATTCCTCCG CTGCCGGACC TACCAAGGCA ACGCTATGTT CTCTTGCTTT TGTAAGCAGG 9200
ATAGCTAGAT CAATGTCGAT CATGGCTGGC TCGAAGATAC CCGCAAGAAT GTCATTGCGC TGCCATTCTC CAAATTGCAG CTCGCGCTTA GCCGGATAAC 9300
GCCACGGGAT GATGTCGTCA TGCACGACAA GGGTGACTTC TATAGCGCGG AGCGTCTCGC TCTCGCCAGG GAAAGCCGAA GCCTCCATAA GGTCATTGAG 9400
CAATGCTCGC CGCGTCGTTT CATCAAGCTT TACGGCCACA GTAACCAACA AATCAATATC GCTGTATGGC TTCAGGCCGC CATCCACTGC GGAGCCGTAC 9500
AAATGCACGG CCAGCAACGT TGATTCCAGA TGGCGCTCAA TGACGCTTAG CACCTCTGAT AGTTGGTTCG AAATTTCGAT GGTCACCGCT ACCCTCATGA 9600
TGTCTAACGC CCGCCTCAAC TGCCGGAGGT TAATGGCGCC TTTTTTGCAG CTCCACCGCA AAAAAAGGTG ACAGTAACCG GAGGTCAGAT TGCAGGCGCT 9700
TGTTAACTCT TTTGCCAGAT TTGATAACAA TAGTTAATGT TAGACTCGAA ATCTTGCTCA AATACCAACT TGAATGTATC TGGGATTTCA GGAAAAACTA 9800
TGTCTCCATC TCGCTCGATG TCGACTGTTG AAATATGCAA AGTATCTACT TTGGAAATCA AGCTTTTGTA TATTTCACCA CCACCAGAAA CGATAACATG 9900
ATCCGTGATT TCCTCTAAAT TTATTAAGGC ATCCTGAATT GAGGAGAAAA CCATTACACC CTCATCATTC GTAGAAAAAT TAGAGCGAGT TACAACTGCA 10000
TACTTTCGAT TTGGGAGAGC GCCCATTGAC TCAAAAGTTT TGCGTCCAAC TAAAAGCCAT TGATTATATG TAATTGCCTT AAATAGAAGT TGCTCGCCTT 10100
TGGCGCTCCA TGGAATATCT GGTCCATTAC CGATAATACC GTTCTTCGAC TTGGCAGCCA TTAGTGATAA CTTCACGATT TCTCCCTCGA GTTAACTTTG 10200
TTTTAGGGCG ACTGCCCTGC TGCGTAACAT CGTTGCTGCT CCATAACATC AAACATCGAC CCACGGCGTA ACGCGCTTGC TGCTTGGATG CCCGAGGCAT 10300
AGACTGTACA AAAAAACAGT CATAACAAGC CATGAAAACC GCCACTGCGC CGTTACCACC GCTGCGTTCG GTCAAGGTTC TGGACCAGTT GCGTGAGCGC 10400
ATACGCTACT TGCATTACAG TTTACGAACC GAACAGGCTT ATGTCCACTG GGTTCGTGCC TTCATCCGTT TCCACGGTGT GCGTCACCCG GCAACCTTGG 10500
GCAGCAGCGA AGTCGAGGCA TTTCTGTCCT GGCTGGCGAA CGAGCGCAAG GTTTCGGTCT CCACGCATCG TCAGGCATTG GCGGCCTTGC TGTTCTTCTA 10600
CGGCAAGGTG CTGTGCACGG ATCTGCCCTG GCTTCAGGAG ATCGGAAGAC CTCGGCCGTC GCGGCGCTTG CCGGTGGTGC TGACCCCGGA TGAAGTGGTT 10700
CGCATCCTCG GTTTTCTGGA AGGCGAGCAT CGTTTGTTCG CCCAGCTTCT GTATGGAACG GGCATGCGGA TCAGTGAGGG TTTGCAACTG CGGGTCAAGG 10800
ATCTGGATTT CGATCACGGC ACGATCATCG TGCGGGAGGG CAAGGGCTCC AAGGATCGGG CCTTGATGTT ACCCGAGAGC TTGGCACCCA GCCTGCGCGA 10900
GCAGCTGTCG CGTGCACGGG CATGGTGGCT GAAGGACCAG GCCGAGGGCC GCAGCGGCGT TGCGCTTCCC GACGCCCTTG AGCGGAAGTA TCCGCGCGCC 11000
GGGCATTCCT GGCCGTGGTT CTGGGTTTTT GCGCAGCACA CGCATTCGAC CGATCCACGG AGCGGTGTCG TGCGTCGCCA TCACATGTAT GACCAGACCT 11100
TTCAGCGCGC CTTCAAACGT GCCGTAGAAC AAGCAGGCAT CACGAAGCCC GCCACACCGC ACACCCTCCG CCACTCGTTC GCGACGGCCT TGCTCCGCAG 11200
CGGTTACGAC ATTCGAACCG TGCAGGATCT GCTCGGCCAT TCCGACGTCT CTACGACGAT GATTTACACG CATGTGCTGA AAGTTGGCGG TGCCGGAGTG 11300
CGCTCACCGC TTGATGCGCT GCCGCCCCTC ACTAGTGAGA GGTAGGGCAG CGCAAGTCAA TCCTGGCGGA TTCACTACCC CTGCGCGAAG GCCATCGGTG 11400
CCGCATCGAA CGGCCGGTTG CGGAAAGTCC TCCCTGCGTC CGCTGATGGC CGGCAGCAGC CCGTCGTTGC CTGATGGATC CAACCCCTCC GCTGCTATAG 11500
TGCAGTCGGC TTCTGACGTT CAGTGCAGCC GTCTTCTGAA AACGACA

 Recombination Sites     

Name Coordinates Gene Sequence
attC aadA3 core 8752-8805 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC dfr16 core 9608-9706 99 CGCCCGCCTC AACTGCCGGA GGTTAATGGC GCCTTTTTTG CAGCTCCACC GCAAAAAAAG
GTGACAGTAA CCGGAGGTCA GATTGCAGGC GCTTGTTAA
attI 10196-10251 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnp IS6100 207-971 Transposase   -
WP_001424634.1 In36 1167-1454 Passenger Gene Hypothetical -
GNAT_fam In36 1478-1978 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In36 2106-2945 Passenger Gene Antibiotic Resistance -
qacEdelta1_p (ARO:3005010) In36 2939-3146 Passenger Gene Antibiotic Resistance -
bla NmcR (ARO:3003665) In36 3536-4411 Passenger Gene Antibiotic Resistance -
QnrA1 (ARO:3002707) In36 4597-5253 Passenger Gene Antibiotic Resistance -
tnp ISCR1 5517-7058 Transposase   -
sul1 (ARO:3000410) In36 7463-8302 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In36 8296-8643 Passenger Gene Antibiotic Resistance -
aadA3 (ARO:3002603) In36 8807-9598 Passenger Gene Antibiotic Resistance -
dfrA16 (ARO:3003014) In36 9703-10176 Passenger Gene Antibiotic Resistance -
intI1 In36 10332-11345 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS6100 765 207-971 -
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MTDFKWRHFQ GDVILWAVRW YCRYPISYRD LEEMLAERGI SVDHTTIYRW VQCYAPEMEK RLRWFWRRGF DPSWRLDETY VKVRGKWTYL YRAVDKRGDT
IDFYLSPTRS AKAAKRFLGK ALRGLKHWEK PATLNTDKAP SYGAAITELK REGKLDRETA HRQVKYLNNV IEADHGKLKI LIKPVRGFKS IPTAYATIKG
FEVMRALRKG QARPWCLQPG IRGEVRLVER AFGIGPSALT EAMGMLNHHF AAAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_001424634.1 WP_001424634.1 In36 288 1167-1454 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MSGWDGPVAS RSLAVRPRCL GCARGCRYPP LPSSTLDNAA AVIAADARAG LFGPVRLARP SACLGEISGG RINGGASPPF VRVKAEDSSL KNIRL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In36 501 1478-1978 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  Acetyltransf_1 (Pfam:PF00583)
Comment:   putative acetyltransferase ADU64769.1
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In36 840 2106-2945 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1_p (ARO:3005010) N In36 208 2939-3146 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Transpoase Chemistry:   antibiotic efflux (ARO:0010000)
Target:   acridine dye (ARO:3000054)||quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   perfect match C-temrinal 68 amino acis of reference sequence for ARO:3005010 (bitscore: 133)
Protein Sequence:  
SLVLKSIPVG VAYAVWSGLG VVIITAIAWL LHGQKLDAWG FVGMGLIIAA FLLARSPSWK SLRRPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bla NmcR (ARO:3003665) Bla NmcR In36 876 3536-4411 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   antibiotic inactivation (ARO:0001004)
Target:   penam (ARO:3000008)||cephamycin (ARO:0000044)||carbapenem (ARO:0000020)||cephalosporin (ARO:0000032)
Sequence Family:  NmcA beta-lactamase (ARO:3004195)
Comment:   poor match 46% to reference sequence for ARO:3003665 (bitscore: 275)||Synonyms:
Protein Sequence:  
MVRRYLPLNP LRAFEAAARH LSFTRAAIEL NVTHAAVSQQ VRALEEQLGC VLFTRVSRGL VLTHEGEGLL PVLNEAFDRI ADTLECFSHG QFRERVKVGA
VGTFAAGWLL PRLAGFYDSH PHIDLHISTH NNHVDPAAEG HDYTIRFGNG AWHESDAELI FSAPHAPLCS PAIAEQLQQP DDVHRFTLLR SFRRDEWSRW
LDCAGGTPPS PSQPVMVFDT SLAMAEAAQL GAGVAIAPVC MFSRLLQSGA LVQPFAAEIT LGGYWLTRLQ SRTETPAMQQ FARWLLNTAA A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
QnrA1 (ARO:3002707) QnrA1 In36 657 4597-5253 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target protection (ARO:0001003)
Transpoase Chemistry:   antibiotic target protection (ARO:0001003)
Target:   fluoroquinolone antibiotic (ARO:0000001)
Sequence Family:  quinolone resistance protein (qnr) (ARO:3000419)
Comment:   strict match to reference sequence for ARO:3002707 (bitscore: 446)||Synonyms:
Protein Sequence:  
MDIIDKVFQQ EDFSRQDLSD SRFRRCRFYQ CDFSHCQLQD ASFEDCSFIE SGAVEGCHFS YADLRDASFK ACRLSLANFS GANCFGIEFR ECDLKGANFS
RARFYNQVSH KMYFCSAYIS GCNLAYTNLS GQCLEKCELF ENNWSNANLS GASLMGSDLS RGTFSRDCWQ QVNLRGCDLT FADLDGLDPR RVNLEGVKIC
AWQQEQLLEP LGVIVLPD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp ISCR1 1542 5517-7058 -
Class:   Transposase
Transpoase Chemistry:   HUH
Sequence Family:  IS91 tnp
Comment:   CR of ISCR1
Protein Sequence:  
MSLARNATAS QSPTQTNGYE RHQPDQTLLY QLVEQHYPAF KASLEAQGQH LPRYIQQEFN DLLQCGRLEY GFMRVRCEDC HHERLVAFSC KRRGFCPSCG
ARRMAESAAL LIDEVFPKEP IRQWVLSFPF QLRFLLARHP QLMGQVLSIV YRTLSTHLIK KAGYTKASAQ TGSVTLIQRF GSALNLNVHY HMLFLDGVYA
EDDYGKQRFH RVKAPTYDEL NTLAHTLSHR IARCMEKRGI LERDAENTWL TLEEGEDDTL TQLHGASVTY RIAVGPQQGR KVFTLQTLPG REDKADSSSR
VANHAGFSLH AGVMAEAHQR DKLERLCRYI SRPAVSEKRL ALTANGQVRY ELKTPYRNGT THVIFEPLDF IAKLAALVPK PRVNLTRFHG VFAPNSKHRV
QVTPAKRGKK PDKSEGLDTN WRDKSPAERH RAMTWMQRLK RVFNIDIEVC EHCGGHVKVI ASIEDPKVIE QILKHLKQKT AKANAAKQRE LPPERAPPLT
PSLFDPSQSR LFD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In36 840 7463-8302 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In36 348 8296-8643 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   disinfecting agents and antiseptics (ARO:3005386)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA3 (ARO:3002603) AadA3 In36 792 8807-9598 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   strict match to reference sequence for ARO:3002603 (bitscore: 522)
Protein Sequence:  
MRVAVTIEIS NQLSEVLSVI ERHLESTLLA VHLYGSAVDG GLKPYSDIDL LVTVAVKLDE TTRRALLNDL MEASAFPGES ETLRAIEVTL VVHDDIIPWR
YPAKRELQFG EWQRNDILAG IFEPAMIDID LAILLTKARE HSVALVGPAA EEFFDPVPEQ DLFEALRETL KLWNSQPDWA GDERNVVLTL SRIWYSAITG
KIAPKDVAAD WAIKRLPAQY QPVLLEAKQA YLGQKEDHLA SRADHLEEFI RFVKGEIIKS VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
dfrA16 (ARO:3003014) DfrA16 In36 474 9703-10176 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   antibiotic target replacement (ARO:0001002)
Target:   diaminopyrimidine antibiotic (ARO:3000171)
Sequence Family:  trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218)
Comment:   strict match to reference sequence for ARO:3003014 (bitscore: 325)
Protein Sequence:  
MKLSLMAAKS KNGIIGNGPD IPWSAKGEQL LFKAITYNQW LLVGRKTFES MGALPNRKYA VVTRSNFSTN DEGVMVFSSI QDALINLEEI TDHVIVSGGG
EIYKSLISKV DTLHISTVDI ERDGDIVFPE IPDTFKLVFE QDFESNINYC YQIWQKS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In36 1014 10332-11345 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS6100-U12338.3 IS6100 Insertion Sequence 153-1032 880
ISCR1-L06822 ISCR1 Insertion Sequence 5285-7438 2154

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In36 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In36 49-67 AACACGTCGG TCGAGGACT
repeat t3 In36 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 In36 110-128 TCAATACTCG TGTGCACCA
IR IS6100 153-166 GGCTCTGTTG CAAA
IR IS6100 1019-1032 AAACGTTGTC TCGG
repeat t3 In36 1059-1078 AACCACGTCT AGTGAAGACT
repeat t2 In36 1089-1107 TCAGGAGCTG GCTGCACAA
IRt In36 1123-1155 TAGTTGACCA CGTCAGCAGA AGACTTTTAC TGT
repeat t1 In36 1129-1147 ACCACGTCAG CAGAAGACT
repeat i4 In36 11428-11446 AGGAGGGACG CAGGCGACT
repeat i3 In36 11456-11474 CGTCGGGCAG CAACGGACT
repeat i2 In36 11498-11516 ATCACGTCAG CCGAAGACT
IRi In36 11515-11547 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT

 References     

Wang M, Tran JH, Jacoby GA, Zhang Y, Wang F, Hooper DC. Plasmid-mediated quinolone resistance in clinical isolates of Escherichia coli from Shanghai, China. Antimicrob Agents Chemother. 2003 Jul;47(7):2242-8. doi: 10.1128/aac.47.7.2242-2248.2003. PubMed ID: 12821475