Transposon
Name: In37
Family: Tn402        Group: Class 1
Evidence of Transposition: no
 Host     

Host Organism:Escherichia coli Molecular Source:plasmid pHSH2
Place of Origin:Shanghai, China Date of Isolation:2003
Other Geographic Information:teaching hospital March 2000 and March 2001

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  13188 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GAGGCTCTGT TGCAAAGATT GGCGGCAGTC AGAGGTAGGC TGTCGCTCTG 200
CGCCGATCAG GCGGCTGCTG CGAAATGGTG GTTGAGCATG CCCATGGCCT CCGTCAGCGC CGAGGGCCCA ATGCCAAAAG CTCTCTCCAC AAGGCGCACC 300
TCGCCCCTGA TGCCGGGCTG CAGGCACCAG GGGCGAGCCT GTCCTTTGCG CAGGGCTCGC ATGACTTCGA ATCCCTTGAT CGTGGCATAG GCCGTGGGGA 400
TCGATTTGAA ACCGCGCACC GGCTTGATCA GTATCTTGAG CTTTCCGTGA TCGGCCTCGA TCACGTTATT GAGATACTTC ACCTGCCGGT GGGCCGTCTC 500
CCGGTCCAGC TTTCCTTCGC GCTTCAATTC GGTGATCGCT GCACCATAGC TCGGCGCTTT GTCGGTATTG AGCGTGGCAG GCTTTTCCCA GTGCTTCAGG 600
CCTCGCAGGG CCTTGCCCAG GAACCGCTTC GCTGCCTTGG CGCTGCGGGT CGGCGACAGG TAGAAATCGA TCGTGTCGCC CCGCTTGTCG ACTGCCCGGT 700
ACAGGTAGGT CCACTTGCCC CGCACCTTGA CGTAGGTTTC ATCCAGGCGC CAGCTCGGAT CAAAGCCACG CCGCCAGAAC CAGCGCAGCC GCTTCTCCAT 800
CTCCGGGGCG TAGCACTGGA CCCAGCGATA GATCGTCGTA TGGTCGACCG AAATGCCGCG TTCCGCCAGC ATTTCCTCAA GGTCGCGATA GCTGATCGGA 900
TAGCGACAAT ACCAGCGCAC CGCCCACAGG ATCACATCAC CCTGGAAATG GCGCCACTTG AAATCCGTCA TCGTTCCGTC CGTCCAATCT CCGCCAAGCA 1000
TGCTCAAGCT TCACGATTTT TGCAACAGAG CCCACACGAG TATTGAGCAT AGTCGAGATT GGTGCAGATC ACTTCTGATA TTGAACTGTC AGGAGCTGGC 1100
TGCACAACAG CCATTACGCC CAATCAACTG GTGCAGTCGT CTTCTGAAAA TGACATTTGG TATCTCTCAT AAACGGATGT TTTTGAGAGA ACTATCTTCG 1200
GCCTTCACAC GCACGAAAGG CGGCGAAGCT CCGCCGTTAA TCCGTCCGCC GGAGATCTCG CCCAGGCAGG CTGAAGGCCG AGCAAGCCTG ACAGGCCCGA 1300
AAAGCCCGGC ACGGGCGTCG GCGGCGATGA CGGCGGCGGC ATTATCCAGG GTTGATGATG GAAGTGGAGG ATATCGACAA CCTCTCGCGC AACCAAGACA 1400
TCGCGGTCGG ACTGCAAGTG ATCTTGAAGC CACGGGCCCG TCCCACCCCG ACATGGACCT CGATGCCCGA ACGGACGTTA GATTTCGAGT TCTAGGCGTT 1500
CTGCGATGAA GGTTGGATCC CAGCCGGGAT TGAAAGTGTC GACGTGGGTG AATCCGAGCC GCTCGTATAG GCCACGCAGG TTCGGGTGGC AGTCGAGCCG 1600
CAGCTTGGCG CACCCCTGCG TTCGCGCGGC ATGGCGGCAA GCCTCGATCA GCGCGGAGCT GACACCCCGG CCCGCATGTG TCCGTCGCAC CGCGAGCTTG 1700
TGCAGATATG CGGCCTCCCC CTTGAGGGCG TCGGGCCAGA ACTCGGGATC CTCGGCCGAC AAGGTGCAAC AGCCGACGAT GCCGTCGCTG CAACTCGCGA 1800
CTAGGAGCTC GGATCTCAGG ACGAAGGTCT CCGCGAATGT CCGGTCGATC CGCGCGACGT CCCAGGCGGG CGTTCCCTTG GCGGACATCC ACGCCGCAGC 1900
GTCGTGCATC AGCCGCACAA CCTCGTCGAT ATCACCCGAG CAGGCGACCC GAACGTTCGG AGGCTCCTCG CTGTCCATTC GCTCCCCTGG CGCGGTATGA 2000
ACCGCCGCCT CATAGTGCAG TTTGATCCTG ACGAGCCCAG CATGTCTGCG CCCACCTTCG CGCAACCCAA AAGGAGATAT TTAGCGGGCG GCCGGAAGGT 2100
GAATGCTAGG CATGATCTAA CCCTCGGTCT CTGGCGTCGC GACTGCGAAA TTTCGCGAGG GTTTCCGAGA AGGTGATTGC GCTTCGCAGA TCTCCAGGCG 2200
CGTGGGTGCG GACGTAGTCA GCGCCATTGC CGATCGCGTG AAGTTCCGCC GCAAGGCTCG CTGGACCCAG ATCCTTTACA GGAAGGCCAA CGGTGGCGCC 2300
CAAGAAGGAT TTCCGCGACA CCGAGACCAA TAGCGGAAGC CCCAACGCCG ACTTCAGCTT TTGAAGGTTC GACAGCACGT GCAGCGATGT TTCCGGTGCG 2400
GGGCTCAAGA AAAATCCCAT CCCCGGATCG AGGATGAGCC GGTCGGCAGC GACCCCGCTC CGTCGCAAGG CGGAAACCCG CGCCTCGAAG AACCGCACAA 2500
TCTCGTCGAG CGCGTCTTCG GGTCGAAGGT GACCGGTGCG GGTGGCGATG CCATCCCGCT GCGCTGAGTG CATAACCACC AGCCTGCAGT CCGCCTCAGC 2600
AATATCGGGA TAGAGCGCAG GGTCAGGAAA TCCTTGGATA TCGTTCAGGT AGCCCACGCC GCGCTTGAGC GCATAGCGCT GGGTTTCCGG TTGGAAGCTG 2700
TCGATTGAAA CACGGTGCAT CTGATCGGAC AGGGCGTCTA AGAGCGGCGC AATACGTCTG ATCTCATCGG CCGGCGATAC AGGCCTCGCG TCCGGATGGC 2800
TGGCGGCCGG TCCGACATCC ACGACGTCTG ATCCGACTCG CAGCATTTCG ATCGCCGCGG TGACAGCGCC GGCGGGGTCT AGCCGCCGGC TCTCATCGAA 2900
GAAGGAGTCC TCGGTGAGAT TCAGAATGCC GAACACCGTC ACCATGGCGT CGGCCTCCGC AGCGACTTCC ACGATGGGGA TCGGGCGAGC AAAAAGGCAG 3000
CAATTATGAG CCCCATACCT ACAAAGCCCC ACGCATCAAG CTTTTGCCCA TGAAGCAACC AGGCAATGGC TGTAATTATG ACGACGCCGA GTCCCGACCA 3100
GACTGCATAA GCAACACCGA CAGGGATGGA TTTCAGAACC AGAGAAAGAA AATAAAATGC GATGCCATAA CCGATTATGA CAACGGCGGA AGGGGCAAGC 3200
TTAGTAAAGC CCTCGGTGCG GCGGACATTA TCCGCGAACA GGCTGAACAG CACGGCATTG CGCGGGTAAC GGATGTCTGG CTGGAAGTCG GCGCACTGGC 3300
GGATGTTGAG GAGAGTGCAC TGCATTTCTG TTTTGATATC GCCTGCCGTG ATACCGTGGC GCAGGGCTGC ACACTGCATA TTGATGTTAT CCCGGCACAG 3400
GCATGGTGCT GGGATTGCAG CCGTGAGGCC GAAATCATGC AGCACGCCGG ATGCTGTCCG CACTGCGGCA GTGAACGGCT GCGCATCAGT GAAGGTGATG 3500
ATTTGCGGGT AAAAAGCCTG GAAGGTGAGT GAGTTTTACG CCGCCGCCGT ATTCAGCAGC CAGCGGGCGA ATTGCTGCAT GGCCGGGGTT TCCGTACGGG 3600
ACTGTAACCG CGTCAGCCAG TAGCCGCCGA GGGTGATTTC TGCGGCAAAC GGCTGTACCA GTGCGCCTGA CTGTAACAGG CGGCTGAACA TACATACCGG 3700
TGCGATCGCT ACCCCGGCAC CCAGTTGTGC CGCCTCGGCC ATGGCCAGTG AGGTATCGAA CACCATTACC GGCTGTGACG GGGAAGGCGG TGTGCCGCCC 3800
GCACAATCCA GCCAGCGGCT CCATTCATCC CGGCGGAATG AGCGCAGCAG GGTAAAGCGG TGAACATCAT CCGGCTGCTG TAACTGTTCT GCAATGGCCG 3900
GTGAGCACAG CGGAGCGTGT GGTGCACTGA AAATCAGTTC CGCATCTGAC TCATGCCACG CGCCGTTACC GAAACGGATC GTATAATCAT GCCCTTCCGC 4000
CGCCGGGTCC ACATGATTGT TATGGGTGGA GATATGCAGA TCAATATGCG GATGGCTGTC ATAGAATCCG GCCAGACGCG GCAGCAGCCA GCCTGCGGCA 4100
AATGTTCCCA CCGCACCGAC TTTCACCCGC TCACGGAACT GCCCGTGAGA AAAACACTCC AGAGTATCCG CAATCCGGTC AAACGCCTCA TTGAGCACCG 4200
GCAGTAATCC CTCACCTTCA TGGGTCAGCA CCAGCCCGCG CGAGACGCGG GTAAACAGCA CACAGCCGAG TTGTTCTTCC AGCGCCCTGA CCTGCTGGCT 4300
GACGGCGGCA TGGGTGACAT TCAGCTCAAT CGCCGCGCGG GTAAAACTGA GATGACGGGC GGCGGCCTCA AAGGCGCGCA GCGGGTTAAG GGGGAGATAA 4400
CGTCTGACCA TAATCCACCT GTAAGTTTTT CTTTAGGGCA GTTGAGTCTT CATAATGCCT CCTTTGAAGG GAAGGTATAA CGTCGGCTGC GGCAGCAGAG 4500
AAACGGCATT CCGGCGGTAA TACTCTGCTA TAGCAGAATC TGCCTTGTTG TAGCGCGGAA CTCAAGTGAT ATTTGCCTCT TGTGTTTGCA TTCGAGCTAA 4600
TCCGGCAGCA CTATTACTCC CAAGGGTTCC AGCAGTTGCT CCTGTTGCCA GGCACAGATC TTGACTCCTT CGAGGTTGAC CCGTCTGGGG TCGAGCCCAT 4700
CCAGATCGGC AAAGGTCAGG TCACAGCCCC GCAGATTGAC CTGTTGCCAA CAGTCGCGGG AGAAGGTGCC GCGGCTGAGA TCTGAGCCCA TCAAGGAAGC 4800
GCCGCTGAGA TTGGCATTGC TCCAGTTGTT TTCAAACAGC TCGCATTTTT CCAGGCATTG GCCACTCAAG TTGGTATAGG CCAGGTTGCA ACCTGAGATA 4900
TAAGCCGAGC AGAAGTACAT CTTATGGCTG ACTTGATTGT AGAAGCGGGC CCGGGAAAAG TTGGCGCCCT TGAGATCGCA CTCCCTGAAC TCTATGCCAA 5000
AGCAGTTGGC ACCGCTGAAG TTGGCCAAAG ACAGACGGCA GGCCTTGAAA CTGGCATCGC GCAGATCGGC ATAGCTGAAG TGACACCCTT CAACGGCGCC 5100
GCTTTCAATG AAACTGCAAT CCTCGAAACT GGCATCCTGC AGCTGACAGT GGCTGAAGTC ACACTGATAA AAGCGGCAGC GGCGAAAACG GCTGTCACTC 5200
AAATCCTGGC GTGAGAAATC CTCTTGCTGA AAAACTTTAT CAATAATATC CATACGGCTT CCTTTAATCA GGGAGGGGGG TATAGGAAGT ATAAACCACC 5300
TTTTTGCTCC TCATCCGAAG TATCTTACCT GAAATTCCCT CACTCGTTTA CCGCTCAAGC CCCAATTTTA ACTGCCGGTC CAGCCTAAAC CGCTCTAATA 5400
AGGTTCGATT TGGCGGTAAA ATCTCTAGCC TGATAGCTCG AGAGATACAA ACTGCCCCAC CGCCCCGTTT AAAAGTTGGC AGTGTTGAGC AGTGTTGGAT 5500
TTGGGGTCGT CAGTCAAAGA GACGACTCTG TGATGGATCG AACAGGCTGG GAGTCAGTGG CGGCGCTCGT TCTGGTGGCA GCTCACGCTG CTTGGCGGCA 5600
TTCGCCTTGG CTGTTTTCTG TTTCAGATGC TTGAGAATCT GCTCAATGAC CTTCGGATCT TCGATGCTGG CAATCACTTT GACGTGACCG CCGCAGTGTT 5700
CGCAGACTTC AATATCAATA TTGAAGACTC GCTTGAGGCG TTGCATCCAG GTCATGGCGC GGTGGCGCTC TGCAGGACTC TTGTCACGCC AGTTAGTATC 5800
GAGACCTTCC GATTTGTCGG GCTTCTTGCC CCGCTTGGCG GGTGTTACTT GAACTCGGTG TTTGCTGTTC GGTGCAAAGA CGCCGTGGAA GCGTGTGAGG 5900
TTGACTCGCG GCTTAGGTAC CAACGCAGCG AGTTTGGCGA TGAAGTCCAG CGGCTCGAAG ATCACATGGG TGGTGCCATT GCGGTACGGA GTTTTGAGCT 6000
CGTAACGCAC CTGCCCATTG GCGGTTAATG CCAGACGTTT TTCTGAAACC GCTGGCCGAC TAATGTAGCG ACACAAGCGC TCAAGCTTAT CCCGCTGATG 6100
CGCTTCGGCC ATCACACCGG CGTGTAGCGA GAAACCAGCA TGGTTGGCTA CTCGACTGCT TGAGTCGGCT TTATCCTCAC GCCCTGGCAA GGTTTGCAGG 6200
GTGAAGACTT TGCGCCCTTG CTGGGGGCCG ACGGCAATGC GATACGTAAC CGAAGCACCA TGTAATTGAG TCAGCGTATC GTCTTCGCCC TCTTCCAGTG 6300
TCAACCACGT ATTCTCGGCA TCACGCTCCA AAATCCCACG CTTTTCCATG CAGCGAGCGA TGCGATGGCT GAGGGTGTGA GCGAGCGTAT TCAGCTCATC 6400
GTAAGTGGGT GCCTTGACAC GATGGAAGCG TTGCTTGCCA TAGTCATCTT CGGCATAGAC ACCATCGAGA AACAGCATGT GGTAGTGGAC ATTGAGATTT 6500
AGCGCGGAGC CAAAGCGTTG GATAAGAGTC ACTGAGCCAG TTTGTGCAGA GGCTTTGGTG TAACCGGCTT TTTTGATCAG ATGAGTTGAG AGTGTACGAT 6600
AGACGATACT CAAGACCTGG CCCATCAGCT GGGGATGGCG AGCCAGCAAA AAGCGTAGCT GGAAAGGAAA GCTGAGCACC CACTGGCGAA TGGGCTCCTT 6700
GGGGAAGACT TCGTCTATCA GCAGCGCCGC ACTCTCGGCC ATCCGGCGGG CACCGCAGCT AGGGCAAAAG CCGCGTCGTT TACAGCTGAA GGCGACCAGA 6800
CGCTCGTGAT GACAATCCTC GCAGCGAACC CGCATGAAAC CATACTCCAG ACGGCCACAT TGGAGGAGGT CGTTGAATTC TTGTTGGATG TAGCGAGGCA 6900
GGTGTTGACC TTGGGCTTCG AGTGAGGCTT TGAAGGCTGG GTAGTGCTGC TCAACCAGCT GGTAGAGCAG CGTCTGGTCG GGTTGGTGGC GTTCGTAACC 7000
GTTTGTTTGA GTGGGCGATT GACTCGCCGT GGCGTTCCTT GCCAGCGACA TGGGTATCCT CCGCTGATAC TGTGGTTATG TACAGTATCA GCGGCTTGCG 7100
TTCAGACGTC CAGTCTGGCC CTAGACATCG CTAAATGCTT AACCCGCAAT AGCCCTCACG AGTTGTTATC AGCCACTACC GGTTGAGCGA GAAGGTTTTG 7200
GGTTCAGGGT GCTATTGCTC CACCAATCAC AATACTGAAG CCCCAACTGT TATCAGTTGG GGCTTTTTCT TGTCTGTTTG CGGCGGTTGC GTTTTATCGG 7300
TAGTCGTCGA GCTCTGCACC ATCCCACATA AGAGCTTAAC GGTGCGATCT TCAACGCCAT CACACAAAAC TTTCTTTTTC ACGCACAGTC AACTTATTGG 7400
ATGTTTTATT AACAACCCAA AAGGAGATAT TTAGCGGGCG GCCGGAAGGT GAATGCTAGG CATGATCTAA CCCTCGGTCT CTGGCGTCGC GACTGCGAAA 7500
TTTCGCGAGG GTTTCCGAGA AGGTGATTGC GCTTCGCAGA TCTCCAGGCG CGTGGGTGCG GACGTAGTCA GCGCCATTGC CGATCGCGTG AAGTTCCGCC 7600
GCAAGGCTCG CTGGACCCAG ATCCTTTACA GGAAGGCCAA CGGTGGCGCC CAAGAAGGAT TTCCGCGACA CCGAGACCAA TAGCGGAAGC CCCAACGCCG 7700
ACTTCAGCTT TTGAAGGTTC GACAGCACGT GCAGCGATGT TTCCGGTGCG GGGCTCAAGA AAAATCCCAT CCCCGGATCG AGGATGAGCC GGTCGGCAGC 7800
GACCCCGCTC CGTCGCAAGG CGGAAACCCG CGCCTCGAAG AACCGCACAA TCTCGTCGAG CGCGTCTTCG GGTCGAAGGT GACCGGTGCG GGTGGCGATG 7900
CCATCCCGCT GCGCTGAGTG CATAACCACC AGCCTGCAGT CCGCCTCAGC AATATCGGGA TAGAGCGCAG GGTCAGGAAA TCCTTGGATA TCGTTCAGGT 8000
AGCCCACGCC GCGCTTGAGC GCATAGCGCT GGGTTTCCGG TTGGAAGCTG TCGATTGAAA CACGGTGCAT CTGATCGGAC AGGGCGTCTA AGAGCGGCGC 8100
AATACGTCTG ATCTCATCGG CCGGCGATAC AGGCCTCGCG TCCGGATGGC TGGCGGCCGG TCCGACATCC ACGACGTCTG ATCCGACTCG CAGCATTTCG 8200
ATCGCCGCGG TGACAGCGCC GGCGGGGTCT AGCCGCCGGC TCTCATCGAA GAAGGAGTCC TCGGTGAGAT TCAGAATGCC GAACACCGTC ACCATGGCGT 8300
CGGCCTCCGC AGCGACTTCC ACGATGGGGA TCGGGCGAGC AAAAAGGCAG CAATTATGAG CCCCATACCT ACAAAGCCCC ACGCATCAAG CTTTTGCCCA 8400
TGAAGCAACC AGGCAATGGC TGTAATTATG ACGACGCCGA GTCCCGACCA GACTGCATAA GCAACACCGA CAGGGATGGA TTTCAGAACC AGAGAAAGAA 8500
AATAAAATGC GATGCCATAA CCGATTATGA CAACGGCGGA AGGGGCAAGC TTAGTAAAGC CCTCGCTAGA TTTTAATGCG GATGTTGCGA TTACTTCGCC 8600
AACTATTGCG ATAACAAGAA AAAGCCAGCC TTTCATGATA TATCTCCCAA TTTGTGTAGG GCTTATTATG CACGCTTAAA AATAATAAAA GCAGACTTGA 8700
CCTGATAGTT TGGCTGTGAG CAATTATGTG CTTAGTGCAT CTAACGCCGC CATAAACGGC GACAGGGTGG CGCGCCTATT GCGCATAAAA TGGCGAAGCC 8800
ATGCGCAACA GGCGCGGAAT CTCTGGCGTC CGGTTTGATG GCTTTGTTAT GCAAAGGACT AGTCTTCAAT GACGTGTAAA CCACGGCGCT TTAAGTCCTC 8900
CAACGAATCC AACATTCCCC TTATTAATTC AACAGGATGC CCCTCCCAGT CTTCAACAAC GCCAACAATT CTCAAGGGTT CGCAGGTTCT ATAGGACTGT 9000
GTTGGATTAC CGGGAAATCT TTTGTTCGTA AGATTCGGAT CGTCTTCGAA CGGTCCTGTT GGCTCAACTA TGTATATGTA GCCGCGACCC TCGAGGCCAG 9100
ACAGTGACAT AGCAAGTTCA GCTCCCCAAA CTGCTGGCTC CATCAAGGCT GAAAAGTAGA TGTGCTTAAG AATACGACCG TCCTCGAAAT GAGAGATGAA 9200
CCCTGTGGTT AGCAAGTCAC CAATCGCCAA ATTGGCTTTG GTTCCATGAT AGAACGGTCC TTGCACCTGC TTGTAATTAT CATGAGAGAT GGGAATCCAA 9300
TCTTTTACCA TTTTAAGACC CTTAATTGTT GGGATTTGGC TGCATAACGC CTGAAATAAG CCGTGCCGCG AAGCGGCATC GGCTTGATTG AATTGTTAGA 9400
CGGCAAACTC GAGCCAATAC TTGTGCAGGC CAACAATATT AGACGAGCAC AGCATGGGCA TTGCCGCTTT GATCTTCTCC AGTGACCAAT TCCACCACTC 9500
CATCTCCAGA AGCAATGAAA TTTCCTCATC GGTGAAGCGT TTCTTAATCT TCTTAGCGGG ATTGCCGCCA ACGATAGCGT AAGGCTCCAC ATCTTTTGTC 9600
ACCAACGAGC GGCTGCCTAT CACCGCACCG TGCCCGATCT TGATTCCGGG CATGACCATT GCCTCAGAGC CGATCCAAAC GTCATTGCCA ATGACAGTAT 9700
TACCTGCTTT TTGGAAGGCA TCGAGTGCGC TTGAGAATGC AGGTTCTTCC TGCATATAAA AGAACGGGAA AGATGATGCC CAGTCGTACC GATGCCCCTG 9800
ATTGCCAGCC ATGATAAAGG AAGCCCCACT CCCGATAGAG CAGAAACTAC CGATGATCAA CTTATCAACG TCATCACGGT CCGGAAACAG ATACCGTGCG 9900
CAGTCATCGA ATGAGTGCCC ATGATAGTAG CCAGAGTAAT AGCTGTACCG CCCAACTTTG ATATTGGGGT TCTTCACTTG CTCAGAAAGC AGCTTGCCTT 10000
TGAAGGGGCT ATCAAAGTAG TTGGTCATAA GAGATCCCGC GGTCTGTGAC TTTGCCGTCT AACGTTTGAA ATAAGGGGCG CCGAGCGCCA GCGAGGGGAG 10100
CCAAAAGCTT GCTTTTGGCC GTCCCGACTT GATTGAAGGG TTGGGCGATT TTGCCATTAG ATTTTTTATA AATTTAGTGT GTTTAGAATG GTGATCGCAT 10200
TTTTCTTGGC TTTTATGCTT GATGTTAAAT TCGACCCCAA GTTTCCTGTA AGTGCGGACA CAAAAACATA TTTATGTCCT GATTTGCTTA TAATAAACCC 10300
TTCAAACCAT CCGTTTTGTA AGGTTCTATT TGCTGTGAAT CCTGCACCAG TTTTCCCATA CAGTTTTGTA CTATTATCCA GATCTTGTAG ATACATGTTC 10400
TCTATGGTGT TTTCTATGGC TGAGTTTTTA ACTGGGAGAT TGTGATTAAT AATTTTACGC AGGAATTGAA TTTGTTCTTC TGGTGAAATT TTTAAGCTAC 10500
TTTCGAGCCA TGCTTCTGTT AATCCGTTGT TTCTTTCTTT ATCTCCAGAG AAGTCTTGAT TTCCATAATC AAAATCTTTG AGATAATTCT TGATTTTATT 10600
TAATCCAATT TTTTGGGTTA TTTCTTGCGA AACCCAAACA ACAGAAAATT GCATCCACGT CTTTGGTGTA TGATTGCTGT TCCAGATCTC CATTCCTTTG 10700
GGGGTTTTAT CCCATTTGAA TATGGTTTTC TGATCTATTA TTTCCGCATC AAATGCCATA AGTGATAATG CGATCTTGAA AGTTGAATCT GGTGCCATTT 10800
GCGTTGCACA CTTTGCTTTA TTGAATTGAG CAATTTCAGC GTTTGTGGAT GCATCGTAAA GTAAAAAACA ACCTTCAGTT CCTTCAAATA ATGGAGATGC 10900
AACAGTAGAG ATATCTGTTG ATGCACTGGC GCTGCTGTAG ATAATATTTG CAATTATTAA AAAAATAGCG AAGTTGATAT GTATTGTGTT TTTCATAATA 11000
AGTATTGGTT TGGTAAAGGG CTTAATTTTA ACGGCTAACA ATTAATGAGG CTCCGGGTTC GCCCAACGTT TGACATGAGG GGCGGCCAAG GGCGCCAGCC 11100
CTTGGACGTC CCCCTCGATG GAAGGGTTAG GCATCACTGC GTGTTCGCTC GAATGCCTGG CGTGTTTGAA CCATGTACAC GGCTGGACCA TATGGGGTGG 11200
TTACGGTACC TTGCCTCTCA AACCCCGCTT TCTCGTAGCA TCGGATCGCT CGCAAGTTGC TCGGCGACGG GTCCGTTTGG ATCTTGGTGA CCTCGGGATC 11300
ATTGAACAGC AACTCAACCA GAGCTCGAAC CAGCTTGGTT CCCAAGCCTT TGCCCAGTTG TGATGCATTC GCCAGTAACT GGTCTATTCC GCGTACTCCT 11400
GGATCGGTTT CTTCTTCCCA CCGTCCGTCC CCGCTTCCAA GAGCAACGTA CGACTGGGCA TACCCAATCG GCTCTCCATT CAGCATTGCA ATGTATGGAG 11500
TGACGGACTC TTGCGCTAAA ACGCTTGGCA AGTACTGTTC CTGTACGTCA GCAAGTGTCG GGCGTGCTTC TTCTCCGCCC CACCACTCGA CGATATGAGA 11600
TCGATTTAGC CACTCATAGA GCATCGCAAG GTCATGCTCA GTCATGAGGC GCAGTGTGAC GGAATCGTTG CTGTTGGTCA CGATGCTGTA CTTTGTGATG 11700
CCTAACTTTG TTTTTGCGTT GCTCATGATG TCTAACTCCC AATTTGTGTA GGGCTTATTA TGCACGCTTA AAAATAATAA AAGCAGACTT GACCTGATAG 11800
TTTGGCTGTG AGCAATTATG TGCTTAGTGC ATCTAACTTT GTTTTAGGGC GACTGCCCTG CTGCGTAACA TCGTTGCTGC TCCATAACAT CAAACATCGA 11900
CCCACGGCGT AACGCGCTTG CTGCTTGGAT GCCCGAGGCA TAGACTGTAC AAAAAAACAG TCATAACAAG CCATGAAAAC CGCCACTGCG CCGTTACCAC 12000
CGCTGCGTTC GGTCAAGGTT CTGGACCAGT TGCGTGAGCG CATACGCTAC TTGCATTACA GTTTACGAAC CGAACAGGCT TATGTCCACT GGGTTCGTGC 12100
CTTCATCCGT TTCCACGGTG TGCGTCACCC GGCAACCTTG GGCAGCAGCG AAGTCGAGGC ATTTCTGTCC TGGCTGGCGA ACGAGCGCAA GGTTTCGGTC 12200
TCCACGCATC GTCAGGCATT GGCGGCCTTG CTGTTCTTCT ACGGCAAGGT GCTGTGCACG GATCTGCCCT GGCTTCAGGA GATCGGAAGA CCTCGGCCGT 12300
CGCGGCGCTT GCCGGTGGTG CTGACCCCGG ATGAAGTGGT TCGCATCCTC GGTTTTCTGG AAGGCGAGCA TCGTTTGTTC GCCCAGCTTC TGTATGGAAC 12400
GGGCATGCGG ATCAGTGAGG GTTTGCAACT GCGGGTCAAG GATCTGGATT TCGATCACGG CACGATCATC GTGCGGGAGG GCAAGGGCTC CAAGGATCGG 12500
GCCTTGATGT TACCCGAGAG CTTGGCACCC AGCCTGCGCG AGCAGCTGTC GCGTGCACGG GCATGGTGGC TGAAGGACCA GGCCGAGGGC CGCAGCGGCG 12600
TTGCGCTTCC CGACGCCCTT GAGCGGAAGT ATCCGCGCGC CGGGCATTCC TGGCCGTGGT TCTGGGTTTT TGCGCAGCAC ACGCATTCGA CCGATCCACG 12700
GAGCGGTGTC GTGCGTCGCC ATCACATGTA TGACCAGACC TTTCAGCGCG CCTTCAAACG TGCCGTAGAA CAAGCAGGCA TCACGAAGCC CGCCACACCG 12800
CACACCCTCC GCCACTCGTT CGCGACGGCC TTGCTCCGCA GCGGTTACGA CATTCGAACC GTGCAGGATC TGCTCGGCCA TTCCGACGTC TCTACGACGA 12900
TGATTTACAC GCATGTGCTG AAAGTTGGCG GTGCCGGAGT GCGCTCACCG CTTGATGCGC TGCCGCCCCT CACTAGTGAG AGGTAGGGCA GCGCAAGTCA 13000
ATCCTGGCGG ATTCACTACC CCTGCGCGAA GGCCATCGGT GCCGCATCGA ACGGCCGGTT GCGGAAAGTC CTCCCTGCGT CCGCTGATGG CCGGCAGCAG 13100
CCCGTCGTTG CCTGATGGAT CCAACCCCTC CGCTGCTATA GTGCAGTCGG CTTCTGACGT TCAGTGCAGC CGTCTTCTGA AAACGACA

 Recombination Sites     

Name Coordinates Gene Sequence
attC arr-3 core 8746-8852 107 GCCGCCATAA ACGGCGACAG GGTGGCGCGC CTATTGCGCA TAAAATGGCG AAGCCATGCG
CAACAGGCGC GGAATCTCTG GCGTCCGGTT TGATGGCTTT GTTATGC
attC catB3 core 9342-9401 60 GCATAACGCC TGAAATAAGC CGTGCCGCGA AGCGGCATCG GCTTGATTGA ATTGTTAGAC
attC bla OXA-1 core 10057-10146 90 GTCTAACGTT TGAAATAAGG GGCGCCGAGC GCCAGCGAGG GGAGCCAAAA GCTTGCTTTT
GGCCGTCCCG ACTTGATTGA AGGGTTGGGC
attC AAC(6')-Ib core 11067-11132 66 CGTTTGACAT GAGGGGCGGC CAAGGGCGCC AGCCCTTGGA CGTCCCCCTC GATGGAAGGG
TTAGGC
attI 11837-11892 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnp IS6100 207-971 Transposase   -
WP_001424634.1 In37 1167-1454 Passenger Gene Hypothetical -
GNAT_fam In37 1478-1978 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In37 2106-2945 Passenger Gene Antibiotic Resistance -
qacEdelta1_p (ARO:3005010) In37 2939-3146 Passenger Gene Antibiotic Resistance -
bla NmcR (ARO:3003665) In37 3536-4411 Passenger Gene Antibiotic Resistance -
QnrA1 (ARO:3002707) In37 4597-5253 Passenger Gene Antibiotic Resistance -
tnp ISCR1 5510-7051 Transposase   -
sul1 (ARO:3000410) In37 7456-8295 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In37 8289-8636 Passenger Gene Antibiotic Resistance -
arr-3 (ARO:3002848) In37 8859-9347 Passenger Gene Antibiotic Resistance -
catB3 (ARO:3002676) In37 9396-10028 Passenger Gene Antibiotic Resistance -
bla OXA-1 (ARO:3001396) In37 10166-11041 Passenger Gene Antibiotic Resistance -
AAC(6')-Ib10 (ARO:3002581) In37 11127-11681 Passenger Gene Antibiotic Resistance -
intI1 In37 11973-12986 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS6100 765 207-971 -
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MTDFKWRHFQ GDVILWAVRW YCRYPISYRD LEEMLAERGI SVDHTTIYRW VQCYAPEMEK RLRWFWRRGF DPSWRLDETY VKVRGKWTYL YRAVDKRGDT
IDFYLSPTRS AKAAKRFLGK ALRGLKHWEK PATLNTDKAP SYGAAITELK REGKLDRETA HRQVKYLNNV IEADHGKLKI LIKPVRGFKS IPTAYATIKG
FEVMRALRKG QARPWCLQPG IRGEVRLVER AFGIGPSALT EAMGMLNHHF AAAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_001424634.1 WP_001424634.1 In37 288 1167-1454 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MSGWDGPVAS RSLAVRPRCL GCARGCRYPP LPSSTLDNAA AVIAADARAG LFGPVRLARP SACLGEISGG RINGGASPPF VRVKAEDSSL KNIRL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In37 501 1478-1978 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  Acetyltransf_1 (Pfam:PF00583)
Comment:   putative acetyltransferase ADU64769.1
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In37 840 2106-2945 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1_p (ARO:3005010) N In37 208 2939-3146 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Transpoase Chemistry:   antibiotic efflux (ARO:0010000)
Target:   acridine dye (ARO:3000054)||quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   perfect match C-temrinal 68 amino acis of reference sequence for ARO:3005010 (bitscore: 133)
Protein Sequence:  
SLVLKSIPVG VAYAVWSGLG VVIITAIAWL LHGQKLDAWG FVGMGLIIAA FLLARSPSWK SLRRPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bla NmcR (ARO:3003665) Bla NmcR In37 876 3536-4411 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   antibiotic inactivation (ARO:0001004)
Target:   penam (ARO:3000008)||cephamycin (ARO:0000044)||carbapenem (ARO:0000020)||cephalosporin (ARO:0000032)
Sequence Family:  NmcA beta-lactamase (ARO:3004195)
Comment:   poor match 46% to reference sequence for ARO:3003665 (bitscore: 275)||Synonyms:
Protein Sequence:  
MVRRYLPLNP LRAFEAAARH LSFTRAAIEL NVTHAAVSQQ VRALEEQLGC VLFTRVSRGL VLTHEGEGLL PVLNEAFDRI ADTLECFSHG QFRERVKVGA
VGTFAAGWLL PRLAGFYDSH PHIDLHISTH NNHVDPAAEG HDYTIRFGNG AWHESDAELI FSAPHAPLCS PAIAEQLQQP DDVHRFTLLR SFRRDEWSRW
LDCAGGTPPS PSQPVMVFDT SLAMAEAAQL GAGVAIAPVC MFSRLLQSGA LVQPFAAEIT LGGYWLTRLQ SRTETPAMQQ FARWLLNTAA A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
QnrA1 (ARO:3002707) QnrA1 In37 657 4597-5253 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target protection (ARO:0001003)
Transpoase Chemistry:   antibiotic target protection (ARO:0001003)
Target:   fluoroquinolone antibiotic (ARO:0000001)
Sequence Family:  quinolone resistance protein (qnr) (ARO:3000419)
Comment:   strict match to reference sequence for ARO:3002707 (bitscore: 446)||Synonyms:
Protein Sequence:  
MDIIDKVFQQ EDFSRQDLSD SRFRRCRFYQ CDFSHCQLQD ASFEDCSFIE SGAVEGCHFS YADLRDASFK ACRLSLANFS GANCFGIEFR ECDLKGANFS
RARFYNQVSH KMYFCSAYIS GCNLAYTNLS GQCLEKCELF ENNWSNANLS GASLMGSDLS RGTFSRDCWQ QVNLRGCDLT FADLDGLDPR RVNLEGVKIC
AWQQEQLLEP LGVIVLPD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp ISCR1 1542 5510-7051 -
Class:   Transposase
Transpoase Chemistry:   HUH
Sequence Family:  IS91 tnp
Comment:   CR of ISCR1
Protein Sequence:  
MSLARNATAS QSPTQTNGYE RHQPDQTLLY QLVEQHYPAF KASLEAQGQH LPRYIQQEFN DLLQCGRLEY GFMRVRCEDC HHERLVAFSC KRRGFCPSCG
ARRMAESAAL LIDEVFPKEP IRQWVLSFPF QLRFLLARHP QLMGQVLSIV YRTLSTHLIK KAGYTKASAQ TGSVTLIQRF GSALNLNVHY HMLFLDGVYA
EDDYGKQRFH RVKAPTYDEL NTLAHTLSHR IARCMEKRGI LERDAENTWL TLEEGEDDTL TQLHGASVTY RIAVGPQQGR KVFTLQTLPG REDKADSSSR
VANHAGFSLH AGVMAEAHQR DKLERLCRYI SRPAVSEKRL ALTANGQVRY ELKTPYRNGT THVIFEPLDF IAKLAALVPK PRVNLTRFHG VFAPNSKHRV
QVTPAKRGKK PDKSEGLDTN WRDKSPAERH RAMTWMQRLK RVFNIDIEVC EHCGGHVKVI ASIEDPKVIE QILKHLKQKT AKANAAKQRE LPPERAPPLT
PSLFDPSQSR LFD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In37 840 7456-8295 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In37 348 8289-8636 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   disinfecting agents and antiseptics (ARO:3005386)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
arr-3 (ARO:3002848) Arr-3 In37 489 8859-9347 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   rifamycin antibiotic (ARO:3000157)
Sequence Family:  rifampin ADP-ribosyltransferase (Arr) (ARO:3000390)
Comment:   perfect match to reference sequence for ARO:3002848 from Escherichia coli||Synonyms: arr-6
Protein Sequence:  
LCSQIPTIKG LKMVKDWIPI SHDNYKQVQG PFYHGTKANL AIGDLLTTGF ISHFEDGRIL KHIYFSALME PAVWGAELAM SLSGLEGRGY IYIVEPTGPF
EDDPNLTNKR FPGNPTQSYR TCEPLRIVGV VEDWEGHPVE LIRGMLDSLE DLKRRGLHVI ED

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
catB3 (ARO:3002676) CatB3 In37 633 9396-10028 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   antibiotic inactivation (ARO:0001004)
Target:   phenicol antibiotic (ARO:3000387)
Sequence Family:  chloramphenicol acetyltransferase (CAT) (ARO:3000122)
Comment:   strict match to reference sequence for ARO:3002676 (bitscore: 437)||Synonyms:
Protein Sequence:  
MTNYFDSPFK GKLLSEQVKN PNIKVGRYSY YSGYYHGHSF DDCARYLFPD RDDVDKLIIG SFCSIGSGAS FIMAGNQGHR YDWASSFPFF YMQEEPAFSS
ALDAFQKAGN TVIGNDVWIG SEAMVMPGIK IGHGAVIGSR SLVTKDVEPY AIVGGNPAKK IKKRFTDEEI SLLLEMEWWN WSLEKIKAAM PMLCSSNIVG
LHKYWLEFAV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bla OXA-1 (ARO:3001396) Bla OXA-1 In37 876 10166-11041 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   penam (ARO:3000008)||carbapenem (ARO:0000020)||cephalosporin (ARO:0000032)
Sequence Family:  OXA beta-lactamase (ARO:3000017)
Comment:   strict match to reference sequence for ARO:3001396 (bitscore: 564 )||Synonyms:
Protein Sequence:  
MLAVKIKPFT KPILIMKNTI HINFAIFLII ANIIYSSASA STDISTVASP LFEGTEGCFL LYDASTNAEI AQFNKAKCAT QMAPDSTFKI ALSLMAFDAE
IIDQKTIFKW DKTPKGMEIW NSNHTPKTWM QFSVVWVSQE ITQKIGLNKI KNYLKDFDYG NQDFSGDKER NNGLTEAWLE SSLKISPEEQ IQFLRKIINH
NLPVKNSAIE NTIENMYLQD LDNSTKLYGK TGAGFTANRT LQNGWFEGFI ISKSGHKYVF VSALTGNLGS NLTSSIKAKK NAITILNTLN L

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
AAC(6')-Ib10 (ARO:3002581) AAC(6')-Ib10 In37 555 11127-11681 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   aminoglycoside acetyltransferase
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  AAC(6') (ARO:3000345)
Comment:   strict match to reference sequence for ARO:3002581 (bitscore: 377)
Protein Sequence:  
MTNSNDSVTL RLMTEHDLAM LYEWLNRSHI VEWWGGEEAR PTLADVQEQY LPSVLAQESV TPYIAMLNGE PIGYAQSYVA LGSGDGRWEE ETDPGVRGID
QLLANASQLG KGLGTKLVRA LVELLFNDPE VTKIQTDPSP SNLRAIRCYE KAGFERQGTV TTPYGPAVYM VQTRQAFERT RSDA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In37 1014 11973-12986 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS6100-U12338.3 IS6100 Insertion Sequence 153-1032 880
ISCR1-L06822 ISCR1 Insertion Sequence 5278-7431 2154

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In37 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In37 49-67 AACACGTCGG TCGAGGACT
repeat t3 In37 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 Tn5053 110-128 TCAATACTCG TGTGCACCA
IR IS6100 153-166 GGCTCTGTTG CAAA
IR IS6100 1019-1032 AAACGTTGTC TCGG
repeat t3 In37 1059-1078 AACCACGTCT AGTGAAGACT
repeat t2 In37 1089-1107 TCAGGAGCTG GCTGCACAA
IRt In37 1123-1155 TAGTTGACCA CGTCAGCAGA AGACTTTTAC TGT
repeat t1 In37 1129-1147 ACCACGTCAG CAGAAGACT
repeat i4 In37 13069-13087 AGGAGGGACG CAGGCGACT
repeat i3 In37 13097-13115 CGTCGGGCAG CAACGGACT
repeat i2 In37 13139-13157 ATCACGTCAG CCGAAGACT
IRi In37 13156-13188 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT

 References     

Wang M, Tran JH, Jacoby GA, Zhang Y, Wang F, Hooper DC. Plasmid-mediated quinolone resistance in clinical isolates of Escherichia coli from Shanghai, China. Antimicrob Agents Chemother. 2003 Jul;47(7):2242-8. doi: 10.1128/aac.47.7.2242-2248.2003. PubMed ID: 12821475