Transposon
Name: Tn5393.7
Family: Tn3        Group: Tn163
Evidence of Transposition: no
 Host     

Host Organism: Escherichia coli        Molecular Source: plasmid R6K

 Terminal Inverted Repeats (IR)     

IRL (Length: 81 bp): GGGGTCGTTTGCGGGAGAGGGCGAAATCCTACGCTAAGGCTTTGGCCAACGATATTCTCCGGTAAGATTGATGTGTTCCCA
IRR (Length: 40 bp): GGGGTCGTTTGCGGGAGGGGGCGGAATCCTACGCTAAGGC
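The two terminal repeats can be compared directly. The sketch below is illustrative only and assumes that both repeats are written 5'->3' reading inward from their own end of the element, so that IRR appears on the top strand as its reverse complement. The helper names (`reverse_complement`, `mismatches`) and the constant `TOP_STRAND_3P_END` (the last 40 nt of the DNA sequence in this record) are introduced here for the example and are not part of the record.

```python
# Terminal repeats as listed in this record, split into 10-mers for readability.
IRL = ("GGGGTCGTTT" "GCGGGAGAGG" "GCGAAATCCT" "ACGCTAAGGC"
       "TTTGGCCAAC" "GATATTCTCC" "GGTAAGATTG" "ATGTGTTCCC" "A")
IRR = "GGGGTCGTTT" "GCGGGAGGGG" "GCGGAATCCT" "ACGCTAAGGC"

# Last 40 nt of the top strand, copied from the final row of the sequence.
TOP_STRAND_3P_END = "GCCTTAGCGT" "AGGATTCCGC" "CCCCTCCCGC" "AAACGACCCC"


def reverse_complement(seq):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def mismatches(a, b):
    """1-based positions at which a and b differ over their shared length."""
    shared = min(len(a), len(b))
    return [i + 1 for i in range(shared) if a[i] != b[i]]


print(len(IRL), len(IRR))                            # 81 40
print(reverse_complement(IRR) == TOP_STRAND_3P_END)  # True
print(mismatches(IRL, IRR))                          # [18, 24]
```

Under these assumptions the first 40 bp of IRL and the 40 bp IRR differ at two positions (38/40 identical), and the listed IRR is the reverse complement of the 3' end of the sequence.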

 Sequence     
DNA Sequence Length: 11658 bp
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTTT GCGGGAGAGG GCGAAATCCT ACGCTAAGGC TTTGGCCAAC GATATTCTCC GGTAAGATTG ATGTGTTCCC AGGGGATAGG AGAAGTCGCT 100
TGATATCTAG TATGACGTCT GTCGCACCTG CTTGATCGCG GCCGCGATAG CTAGATCGCG TTGCTCCTCT TCTCCATCCG CGTTCCAAGC TGCGGAAAGG 200
CACCCATAAG CGTACGCCTG GTCGAGCAGG CGACGCGGAT CGACGTCCAG CGCACGAGAG AATGCGTCCG CCATCTGTGC AATGCGTCTA GGATCGAGAC 300
AAAGGTCGTC TCTGTCAGCC GGATCGTAGA ACATATTGGC GGCGCCAAAG CCCACTTCAC CGACCAGACC GACGGGATCT ATCACCAGCC AGCCGCGACT 400
GGAGAACATG ATGTTTTCAT GATGCAGATC GCCATGTAGC CCACGCAGTT CCGAGGCATT GCTCATCATT TGATCGGCTA TAATCGCCGC GTGGACGTAG 500
TCAGTTTGAC AACCTGCGTT TTGATCATCG CGCGCCCGCT GAAACAAAGC TGCAAAGCGA TCCCGGATCG GGAGAAGGGC AGAAGGCAGG GGTTCCTCAG 600
ATGCGGCATA CAGCTTCGCC ATTAGTTCCG CTGCAATTTC GGTCGCCTGG TAGTCGCCGT GCTCGGCAAC GATGTGAGAG AGCATTCGCT CCCCGGCATA 700
TTCGAGCAAC ATCAGATTGT TCTCACGACC GAGCAACCGG ACTGCTCCCC TCCCATTGCG CCATACCAGA TAGTCGGCCC CGCGCAGTTC ATCAGCAATG 800
TCTTCTATAG GTTTCAATCC CTTGACGATT GCAGGAGTCC CGTCTGGCAA TGAAACTTTC CAAACGAGGC TGGAAAAGGT GTCCGCAATG AGAACAGGTT 900
GCGAAACGTG CCAATGAGCA GGAAAAACAG GCGGCATGAA CATCAACCCC AAGTCAGAGG GTCCAATCGC AGATAGAAGG CAAGGCGTTC GCGGTCGGGG 1000
GCTTCGATCC CCAATACATT GAATAGGACA GCGAAGGCGC GCTCTGCTTC ATCTGGCGCT GCCCAGTTCT CTTCGGCGTT AGCAATCATG AGTGCCAAAT 1100
CGGCATAGCG ATCTGCTGTT CCGAGCCGCC CAAGGTCGAT CAGACCCGTG CATTGAAGAG TTTTAGGGTC CACCATGAAG TTCGGCATGC AGGGATCACC 1200
ATGGCAAACA ACCATATCGG TGCGCTCTTG GTCGAGCCGC ACCGGTAGCT CTCGTTCGAC ACGAGCCAAA AGATCGAGCT GCGGCGTACT CTTGTCCTCG 1300
TCCGGTAAGA AGTCGGGATT GACGGCATTG CGGGACACCA CATCAACGGC GCGTCCGAAC ATTCGCGACA GCCTGCGCTC AAACGGACAT TGATCAACCG 1400
ATAGGCTGTG AACAGCGCCA AGTTGCTGCC CCATTGACGG CCACGCTTTG AGCAAATCCG CTCCAGACAG ATCAGCCGCC GGTACTCCCG GAATTGCCGT 1500
TATCACCAAG CATGCACCCT CCTGTTCCTC CTGCCAGTTG ATCACCTCGG GGCAAGCCAC ACCTCGACCT TTGAGCCAAA TGAGGCGGTC ACGCTCTCCA 1600
GCGAGCTCAC CGCGGCGGGA AGCAGGTGCG ATTTTCGCGA AGGCATGCCC GTCACCACGT CGAAAAACAA AATCACCAGA TTCTCCGCCT CTGACAGGCA 1700
ACCAGTCAGA ATGCGATTCA CCAAAAAAAA TATTAGTTCG ATTCAATGGA GGTTCCTTCA GTTTTCTGAT GAAGCGCGGA GGTGGCTCAA CCTGCGAAAA 1800
GAAACGAGTT GCTATGGACT TGCACCGGTT GTGTTCCGGT CTATCTCTCA TTTTAAGCGG CTTTTTTCTC AAATGCCACC GGCGATTTCC AGCCGAGTGT 1900
TGAATGTCTT CGGCGTGGAT TATAAAAGCC ATTTATATAT TCGAAGATTG CAATCTCAAT ATCTCGCCTT GTTTGCCAGT GTCTGCGCCA AATCAACTCA 2000
GCCTTTAATG ATTTAAAGAA GCTTTCTACT GCGGAGTTAT CAAAACAATT GCCTTTCCCG CTCATGGACG GCAGCAATTG ATGTTTGAGC AGTAGCTTTT 2100
GATATTCATG AGCGCAATAT TGGCTCCCAC GGTCTGTGTG TTGAATACAA CCCGGTGGTG GTTTGCGTAA AGCCAACGCC ATATTCAGTG CCCTTAATGC 2200
AAGATCCTGC TTTAATCGAT CACCTGTTGC CCAGCCAATC ACGCGACGGG AATACAGGTC AAGGATAACA GCAAGATAGA CCCATCCTTC TCTGGTCCAA 2300
ACATAAGTGA TATCGCCTGC CCATTTCTGG TTGGGTGCGC TTGCGCTAAA GTCTTGTTTT AATAGGTTCG GTGCAATGTT GAAGGTATGA TGACTATCCG 2400
TTGTCCGTTT GAATTTACGC GTTCGAACAA CTGTAATGTT ATTCTGGCGC ATCAAACGTC CAACCCGACG CTGCCCAACC TGCAGGCCCA GCGCTTTCAA 2500
CTCTTCTGTC ATACGCGGCC TACCATAGCT CCCCAAACAC AACCGATGCT GCTCACGTAT ATGCGCTAGA AGTATAAGAT CACGACGCTG GCGCAGTGAT 2600
GGAGGACGGC GTTTCCATGC ACGTAAACCA CGATCTGTTA CGCCCATCAA ACGACATATG CGTGAACGTG AGAGAGAGCC ACGGTAATCC GTAATAAACT 2700
GAAATCTCAC AGCTTTTGTA CTGCGAAAAA TATTGCTGCC TTTTTTAATA TCTCCCTCTC CTCCCGAAGG ATACGGTTCT CTTTGCGTAA ACGTTCATTC 2800
TCACGCAGAA GATCAGTGTC TTGGGTAGGA ATTTTAGTTT CATCGGAAAT TGATGCGATC CATTTCCCAA GCGTGGAAAG CCCAATACTT AAATCTGACG 2900
CAACTTGACG GCGTGTTAAG CCACTAGTGA GTGCTATGCG AACTGCATCA CGCTTAAATT CATCACTATG CTTTAATGAC ATATATGGTC TCCTTGATAA 3000
CAAATAATGC TCTCAAAAAG ACCGGAACGA AACCGAGGCA AGTCCACTAC GTAAGTCCGA GAACATGCTT TCCATGGTCT CTGAGCTCGC CTTTGGGACC 3100
GACATATCGG TAGAGAGTGA CGCGCTCGAT GCCGAGTTCC TTGCAGAGAT CGGAAACTGA AGTATCGCGC TGGGCCATGG CGGCTTGCGC GAGACGCACC 3200
TGAGCTTTGG TGAGCGCGAA TTTTCGTCCG CCCTTGCGAC CGCGCGCTCT CGCGGAGGCG AGACCCGCCA TGGTGCGCTC TCGGATCAGA TCCCGCTCGA 3300
ACTCGGCCAA GGTGGCGAAG ATTCCGAACA CCATGCGACC GGACGCAGTC GTGGTGTCGA TCTGAGCGCC CTTTCCAGTC AGAACCCGCA GGCCGATCTT 3400
GCGGTCTGAC AGCTCCTTCA CCGTGTTGAC CAGATGGGCA AGCGATCGTC CGAGGCGATC GAGCTTCCAG ACCACCAGCA CATCGCCGTC ACGCAATGAC 3500
TTGAGGCAGG CAGTCAAGCC AGGGCGATCA TCACGACCGC CGGAAGCAAG ATCATCATAG ATATTGTCCC GTTCGACACC TGCGGCGCGC AAGGCGTCGT 3600
GCTGCAGGTC GAGAGACTGC GAGCCATCGG CTTTGGAGAC GCGGGCATAT CCGATCAGCA TGTATCACAA ACGTTGGTTT GAGGCGGCGC TTCGGCCACG 3700
ATTGCATTGA CCTCTGGAAA TGTATCTCAA CCAGCTTCAT AAACAAAGCG TCTTGAACGC TATCAGATTT TGAAAAAGGA ACATGTATGC CGCGTCGCGT 3800
CACTCTAACC GATCGGCAGA AAGACGCGCT GTTGCGCTTG CCGACTTCAC AGACGGATTT GCTCAAGCAC TATACGCTGA GTGATGAAGA CCTTGGGCAT 3900
ATCAGGCTGC GTCGGCGCGC TCACAACAGG TTCGGCTTCG CCCTGCAATT GTGTGTCCTG CGCTATCCCG GCCGGGTGCT GGCTCCAGGC GAACTGATCC 4000
CTGCAGAGGT CATCGAATTT ATCGGAGCGC AGCTTGGCCT GGGTGCCGAC GATCTCGTAG ACTATGCTGC CCGCGAGGAA ACACGGCACG AGCATCTTGC 4100
CGAGTTACGG GGGCTCTACG GCTTCCGCAC CTTCTCCGGA CGTGGTGCGA GCGAGCTGAA GGAATGGTTG TTCCGAGAAG CCGAGATGGC GGTGTCGAAC 4200
GAGGATATCG CCCGTCGCTT CGTAGCCGAG TGCCGACGCA CCCGCACTGT CCTTCCCGCG ACATCCACGA TCGAGCGGCT TTGTGCCGCG GCTCTCGTCG 4300
ATGCCGAGCG ACGCATCGAG ACGAGGATCG CCAGTCGGCT GCCTATGTCG ATCCGAGAAC AGTTGCTGGC ATTGCTCGAG GAGACGGCTG ATGATCGGGT 4400
GACCCGTTTT GTGTGGCTGC GCCAGTTCGA GCCTGGCTCG AACTCTTCGT CGGCCAACCG GCTGCTCGAC CGGCTCGAAT ATCTGCAACG CATCGATCTC 4500
CCCGAGGATC TGCTTGCCGG CGTTCCTGCC CATCGGGTGA CTCGTCTGCG CAGGCAGGGT GAACGGTATT ATGCCGACGG CATGCGCGAT CTCCCGGAGG 4600
ACAGGCGGCT TGCGATCTTG GCTGTTTGCG TCTCGGAATG GCAGGCGATG TTGGCCGACG CAGTGGTCGA AACCCACGAC CGGATCGTCG GCCGTCTCTA 4700
CCGTGCTTCG GAGCGTATTT GCCATGCAAA GGTCGCAGAC GAAGCGGGGG TGGTGCGTGA CACCCTGAAA TCCTTCGCCG AGATCGGGGG CGCCCTGGTC 4800
GATGCACAGG ATGATGGCCA GCCGCTGGGC GATGTCATCG CGAGTGGGTC AGGGTGGGAC GGCTTAAAAA CCCTTGTTGC AATGGCAACC AGGCTGACCG 4900
CCACCATGGC CGACGATCCG CTCAATCATG TGCTCGACGG TTATCACCGC TTCCGCCGAT ACGCTCCACG CATGTTGCGC CTGCTCGATC TGCGAGCTGC 5000
GCCCGTTGCA CTGCCGCTTC TGGAAGCGGT GACGGCCCTT CGTACCGGTT TGAACGATGC CGCGATGACC AGCTTCTTGC GGCCCAGCTC GAAATGGCAT 5100
CGCCACCTTC GGGCCCAGAG GGCTGGCGAC GCTCGCCTAT GGGAGATCGC GGTGCTGTTC CATCTGCGCG ATGCGTTCCG CTCCGGAGAT GTCTGGCTTA 5200
CTAGGTCCCG GCGCTATGGC GATCTGAAAC ACGCACTCGT TCCGGCACAA TCCATCGCGG AAGGCGGTCG TCTCGCTGTG CCATTGCGGC CGGAGGAATG 5300
GCTGGCAGAC CGGCAAGCTC GCCTCGACAT GCGGTTGCGC GAGCTTGGCC GTGCCGCTCG CGCAGGCACG ATCCCGGGCG GGTCGATTGA AAACGGCGTT 5400
CTGCATATCG AGAAACTCGA AGCCGCCGCG CCGACAGGCG CCGAAGATCT GGTGCTCGAT CTCTACAAGC AGATCCCGCC CACGCGCATC ACCGATCTCC 5500
TGCTGGAGGT GGATGCGGCG ACCGGCTTCA CCGAAGCGTT CACCCATCTG CGCACAGGAG CACCCTGCGC TGACCGGATC GGGCTAATGA ACGTTATCTT 5600
GGCGGAAGGG ATCAACCTCG GCTTGCGCAA AATGGCGGAT GCGACAAACA CCCACACCTT CTGGGAATTG ATCCGCATTG GACGGTGGCA TGTCGAGGGC 5700
GAAGCCTATG ACCGGGCGCT GGCCATGGTG GTCGAGGCAC AGGCAGCGTT ACCCATGGCC CGGTTCTGGG GCATGGGCAC GTCGGCTTCG AGCGACGGAC 5800
AGTTCTTCGT CGCTACAGAG CAAGGTGAGG CCATGAACCT AGTCAACGCG AAATATGGCA ATACCCCGGG CCTGAAAGCC TATAGCCACG TCTCGGGGTC 5900
TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 6000
AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA 6100
TAGTTGCCTG ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC 6200
TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 6300
GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTGCA GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT 6400
TCAGCTCCGG TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA 6500
GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 6600
ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAACA CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC 6700
TCATCATTGG AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC 6800
AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 6900
ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG 7000
TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG 7100
TCTTCAAGAA TTTTATAAAC CGTGGAGCGG GCAATACTGA GCTGATGAGC AATTTCCGTT GCACCAGTGC CCTTCTGATG AAGCGTCAGC ACGACGTTCC 7200
TGTCCACGGT ACGCCTGCGG CCAAATTTGA TTCCTTTCAG CTTTGCTTCC TGTCGGCCCT CATTCGTGCG CTCTAGGATC CTCCGGCGTT CAGCCTGTGC 7300
CACAGCCGAC AGGATGGTGA CCACCATTTG CCCCATATCA CCGTCGGTAC TGATCCCGTC GTCAATAAAC CGAACCGCTA CACCCTGAGC ATCAAACTCT 7400
TTTATCAGTT GGATCATGTC GGCGGTGTCG CGGCCAAGAC GGTCGAGCTT CTTCACCAGA ATGACATCAC CTTCCTCCAC CTTCATCCTC AGCAAATCCA 7500
GCCCTTCCCG ATCTGTTGAA CTGCCGGATG CCTTGTCGGT AAAGATGCGG TTAGCTTTTA CCCCTGCATC TTTGAGCGCT CTGATCTGAA TATCGAGGGA 7600
CTGCTGGCTG GTTGAGACCC GCGCATAACC AAAAATTCGC ATAAAAATGT ACCTTAAATC GAATATCAGA CACGATGTGT CTATTATGCC AAAATGACGA 7700
TTTAATGGAC ACTCAAACGA AGCCGTTTTA CTATGTCTGA TAATTTATAA TATTTCGAAC GGTTGCAGTT GTGTTAAAAA AGCCGTCAGG CAGGGAGGCC 7800
GATATGCCCG TTGATTTTTT GACCACTGAG CAGGTTGAGA GTTATGGCAG GTTCACTGGC GAACCCGATG AACTTCAGCT GGCGCGTTAT TTTCATCTTG 7900
ATGAAGCGGA TAAAGAATTT ATCGGGAAAA GCCGGGGTGA TCACAATCGA CTTGGTATTG CCCTGCAAAT CGGGTGTGTG CGTTTTCTGG GCACTTTTCT 8000
TACTGACATG AATCATATTC CTTCCGGCGT CCGGCATTTT ACCGCCAGAC AGCTCGGGAT TCGTGATATC ACCGTTCTTG CAGAATACGG TCAGAGGGAA 8100
AATACCCGCC GTGAGCATGC AGCGCTGATA CGTCAGCACT ATCAGTATCG TGAATTTGCC TGGCCCTGGA CATTTCGCCT TACCCGTCTT TTATATACCC 8200
GGAGCTGGAT AAGCAACGAA CGTCCTGGCC TGCTTTTCGA CCTGGCGACA GGGTGGCTTA TGCAACATCG TATTATTCTC CCCGGAGCCA CCACGCTGAC 8300
CCGGTTGATT TCAGAGGTAA GGGAAAAGGC GACGTTGCGC CTGTGGAACA AACTGGCACT GATACCGTCA GCCGAACAGC GTTCACAGCT GGAGATGCTG 8400
CTGGGGCCAA CTGATTGCAG CCGCCTGTCT TTACTGGAAT CACTGAAAAA AGGCCCTGTG ACCATCAGTG GTCCGGCGTT TAATGAAGCA ATTGAACGCT 8500
GGAAAACTCT GAACGATTTT GGCCTGCATG CTGAAAACCT GAGTACACTC CCGGCTGTGC GCCTGAAAAA TCTCGCACGT TATGCTGGTA TGACTTCGGT 8600
GTTCAATATT GCCAGGATGT CACCGCAGAA AAGGATGGCG GTTCTGGTTG CCTTTGTCCT TGCATGGGAA ACGCTGGCGC TGGATGATGC ACTGGACGTT 8700
CTGGACGCCA TGCTGGCCGT TATCATCCGT GACGCCAGAA AGATTGGGCA GAAAAAACGG CTCCGCTCGC TGAAGGATCT GGATAAATCT GCATTGGCGC 8800
TCGCCAGCGC ATGTTCGTAC TTGCTGAAAG AAGAAACACC GGACGAATCG ATTCGTGCTG AGGTGTTCAG CTACATCCCT AGGCAAAAGC TGGCTGAAAT 8900
CATCACGCTT GTCCGTGAAA TTGCCCGGCC CTCAGACGAT AATTTTCATG ACGAAATGGT GGAGCAGTAC GGGCGCGTTC GTCGTTTCCT GCCCCATCTG 9000
CTGAATACCG TTAAATTTTC ATCCGCACCT GCCGGGGTTA CCACTCTGAA TGCCTGTGAC TACCTCAGCC GGGAGTTCAG CTCACGGCGG CAGTTTTTTG 9100
ACGACGCACC AACGGAAATC ATCAGTCAGT CATGGAAACG GCTGGTGATT AACAAGGAAA AACATATCAC CCGAAGGGGA TACACGCTCT GCTTTCTCAG 9200
TAAACTGCAG GATAGTCTGA GACGGAGGGA TGTCTACGTT ACCGGCAGTA ACCGGTGGGG AGATCCTCGT GCAAGATTAC TACAGGGTGC TGACTGGCAG 9300
GCAAATCGGA TTAAGGTTTA TCGTTCTTTG GGGCACCCGA CAGACCCGCA GGAAGCAATA AAATCTCTGG GCCATCAGCT TGATAGTCGT TACAGACAGG 9400
TTGCTGCACG TCTTGGCGAA AATGAGGCTG TCGAACTCGA TGTTTCTGGC CCGAAGCCCC GGTTGACAAT TTCTCCCCTC GCCAGTCTTG ATGAGCCGGA 9500
CAGTCTGAAA CGACTGAGCA AAATGATCAG TGATCTGCTC CCTCCGGTGG ATTTAACGGA GTTGCTGCTC GAAATTAACG CCCATACCGG ATTTGCTGAT 9600
GAGTTTTTCC ATGCCAGTGA AGCCAGTGCC AGAGTTGATG ATCTGCCCGT CAGCATCAGC GCCGTGCTGA TGGCTGAAGC CTGCAATATC GGTCTGGAAC 9700
CACTGATCAG ATCAAATGTT CCTGCACTGA CCCGACACCG GCTGAACTGG ACAAAAGCGA ACTATCTGCG GGCTGAAACT ATCACCAGCG CTAATGCCAG 9800
ACTGGTTGAT TTTCAGGCAA CGCTGCCACT GGCACAGATA TGGGGTGGAG GAGAAGTGGC ATCTGCAGAT GGAATGCGCT TTGTTACGCC AGTCAGAACA 9900
ATCAATGCCG GACCGAACCG CAAATACTTT GGTAATAACA GAGGGATCAC CTGGTACAAC TTTGTGTCCG ATCAGTATTC CGGCTTTCAT GGCATCGTTA 10000
TACCGGGGAC GCTGAGGGAC TCTATCTTTG TGCTGGAAGG CCTTCTGGAA CAGGAGACCG GGCTGAATCC AACCGAAATT ATGACCGATA CGGCAGGTGC 10100
CAGCGATCTT GTCTTTGGCC TTTTCTGGCT GCTGGGATAC CAGTTTTCTC CACGCCTGGC TGATGCCGGT GCTTCGGTTT TCTGGCGAAT GGACCATGAT 10200
GCCGACTATG GCGTGCTGAA TGATATTGCC AGAGGGCAAT CAGATCCCCG AAAAATAGTC CTTCAGTGGG ACGAAATGAT CCGGACCGCA GGCTCCCTGA 10300
AGCTGGGCAA AGTACAGGCC TCAGTGCTGG TCCGTTCATT GCTGAAAAGT GAACGTCCCT CCGGACTGAC TCAGGCAATC ATTGAAGTGG GGCGCATCAA 10400
CAAAACGCTG TATCTGCTTA ATTATATTGA TGATGAAGAT TACCGCCGGC GCATTCTGAC CCAGCTTAAT CGGGGAGAAA GCCGTCATGC AGTTGCCAGA 10500
GCCATCTGTC ACGGTCAAAA AGGTGAGATA AGAAAACGAT ATACCGACGG TCAGGAAGAT CAGTTGGGAG CTCTGGGGCT GGTCACTAAC GCCGTCGTGT 10600
TATGGAACAC TATTTATATG CAGGCAGCTC TGGATCATCT CCGGGCGCAG GGTGAAACAC TGAATGATGA AGATATCGCA CGCCTCTCCC CGCTTTGCCA 10700
CGGACATATC AATATGCTCG GCCATTATTC CTTCACGCTG GCAGAACTGG TGACCAAAGG TCATCTGAGA CCATTAAAAG AGGCGTCAGA GGCAGAAAAC 10800
GTTGCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTCTCCGA CCAATATGCG CCGTTCGCAA CCCAGGTGAT TCCTGCAACG GCAAGCGAAG 10900
CGCCTTACAT CCTCGATGGC CTGCTGATGA ACGATGCTGG ACGCCATATC CGCGAGCAGT TCACCGACAC GGGCGGCTTC ACCGATCACG TCTTTGCCGC 11000
ATGTGCCATT CTCGGCTACC GGTTCGCTCC GCGCATCCGC GACCTGCCAT CCAAACGGCT CTACGCGTTC AATCCGTCGG CCGCCCCGGC GCACCTGCGA 11100
GCGTTGATCG GCGGAAAGGT CAACCAAGCC ATGATCGAGC GCAATTGGCC CGACATCCTG CGCATCGCCG CCACCATTGC TGCCGGGACC GTCGCGCCAA 11200
GCCAGATTCT GCGGAAACTC GCCTCCTATC CGCGGCAGAA CGAGCTCGCG ACAGCCCTGC GGGAAGTCGG TCGCGTCGAG CGCACCCTGT TCATGATCGA 11300
CTGGATTCTG GATGCCGAAC TCCAACGGCG TGCCCAGATC GGGCTCAACA AAGGCGAAGC TCATCATGCG CTGAAGCGGG CAATCAGCTT CCACCGCCGC 11400
GGTGAAATCC GCGACCGTTC CGCCGAAGGC CAGCATTACC GCATCGCCGG CATGAATCTG CTCGCCGCCA TCATCATCTT CTGGAACACC ATGAAGCTCG 11500
GCGAGGTCGT TGCAAACCAG AAACGCGATG GAAAGCTGCT ATCGCCCGAT CTCTTGGCCC ATGTTTCGCC GCTCGGATGG GAACACATCA ATCTCACCGG 11600
AGAATATCGC TGGCCAAAGC CTTAGCGTAG GATTCCGCCC CCTCCCGCAA ACGACCCC
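To reuse the sequence above, the ruler row and the running coordinate at the end of each full row have to be stripped. The following is a minimal sketch of one way to do that; `parse_sequence_block` is a hypothetical helper written for this layout, not a tool provided by the database. The sample uses only the ruler, the first row, and the last row of the block.

```python
import re

# Tokens made only of nucleotide letters are kept; ruler marks such as
# "--------10" and row-end coordinates such as "100" are dropped.
BASES = re.compile(r"[ACGTNacgtn]+")


def parse_sequence_block(lines):
    """Join the 10-mer groups of a ruler-formatted sequence block."""
    groups = []
    for line in lines:
        groups.extend(tok for tok in line.split() if BASES.fullmatch(tok))
    return "".join(groups).upper()


sample = [
    "--------10 --------20 --------30 --------40 --------50 "
    "--------60 --------70 --------80 --------90 -------100",
    "GGGGTCGTTT GCGGGAGAGG GCGAAATCCT ACGCTAAGGC TTTGGCCAAC "
    "GATATTCTCC GGTAAGATTG ATGTGTTCCC AGGGGATAGG AGAAGTCGCT 100",
    "AGAATATCGC TGGCCAAAGC CTTAGCGTAG GATTCCGCCC CCTCCCGCAA ACGACCCC",
]

seq = parse_sequence_block(sample)
print(len(seq))  # 158 (100 from the first row + 58 from the last row)
```

Applied to the whole block, the same logic should give 116 full rows of 100 nt plus a final row of 58 nt, which agrees with the stated length of 11658.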

 Recombination Sites     

Name Coordinates Length Sequence
res 3565-3786 222 TGTCCCGTTC GACACCTGCG GCGCGCAAGG CGTCGTGCTG CAGGTCGAGA GACTGCGAGC
CATCGGCTTT GGAGACGCGG GCATATCCGA TCAGCATGTA TCACAAACGT TGGTTTGAGG
CGGCGCTTCG GCCACGATTG CATTGACCTC TGGAAATGTA TCTCAACCAG CTTCATAAAC
AAAGCGTCTT GAACGCTATC AGATTTTGAA AAAGGAACAT GT
res 7645-7765 121 AAATGTACCT TAAATCGAAT ATCAGACACG ATGTGTCTAT TATGCCAAAA TGACGATTTA
ATGGACACTC AAACGAAGCC GTTTTACTAT GTCTGATAAT TTATAATATT TCGAACGGTT
G
res_site_III_a 7648-7670 23 TGTACCTTAA ATCGAATATC AGA
res_site_II_a 7676-7711 36 TGTGTCTATT ATGCCAAAAT GACGATTTAA TGGACA
res_site_I 7734-7762 29 TGTCTGATAA TTTATAATAT TTCGAACGG
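The coordinates in this table are 1-based and inclusive, so the third column should equal end - start + 1. The sketch below checks that, and also checks that the three annotated sub-sites fall inside the second res region (7645-7765). The tuple layout and the function names are assumptions made for this example.

```python
# (name, start, end, listed_length) copied from the table above.
RES_SITES = [
    ("res", 3565, 3786, 222),
    ("res", 7645, 7765, 121),
    ("res_site_III_a", 7648, 7670, 23),
    ("res_site_II_a", 7676, 7711, 36),
    ("res_site_I", 7734, 7762, 29),
]


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def is_within(inner, outer):
    """True if interval inner = (start, end) lies inside outer."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


lengths_ok = all(span_length(s, e) == n for _, s, e, n in RES_SITES)
subsites_ok = all(
    is_within((s, e), (7645, 7765))
    for name, s, e, _ in RES_SITES
    if name.startswith("res_site")
)
print(lengths_ok, subsites_ok)  # True True
```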

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
APH(6)-Id (ARO:3002660) Tn5393c 107-952 Passenger Gene Antibiotic Resistance -
APH(3'')-Ib (ARO:3002639) Tn5393c 943-1746 Passenger Gene Antibiotic Resistance -
orfAB IS1133 1853-2982 Transposase   -
orfB IS1133 1853-2710 Transposase   -
orfA IS1133 2707-2982 Accessory Gene Regulator -
tnpR Tn5393.7 3050-3661 Accessory Gene Resolvase -
tnpA N-ter Tn5393.7 3787-5895 Transposase   +
TEM-1 (ARO:3000873) Tn3.1 6042-6902 Passenger Gene Antibiotic Resistance -
tnpR Tn3.1 7085-7642 Accessory Gene Resolvase -
tnpA Tn3.1 7771-10809 Transposase   +
tnpA C-ter Tn5393.7 10843-11625 Transposase   +
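A quick consistency check on the ORF coordinates above: a complete reading frame should span a multiple of three nucleotides, and neighbouring genes may overlap. The sketch below is illustrative; the list simply restates the summary table, and the labels in parentheses are added here only to tell the two tnpR and tnpA entries apart.

```python
# (label, start, end) restated from the ORF summary; 1-based, inclusive.
ORFS = [
    ("APH(6)-Id", 107, 952),
    ("APH(3'')-Ib", 943, 1746),
    ("orfAB", 1853, 2982),
    ("orfB", 1853, 2710),
    ("orfA", 2707, 2982),
    ("tnpR (Tn5393.7)", 3050, 3661),
    ("tnpA N-ter", 3787, 5895),
    ("TEM-1", 6042, 6902),
    ("tnpR (Tn3.1)", 7085, 7642),
    ("tnpA (Tn3.1)", 7771, 10809),
    ("tnpA C-ter", 10843, 11625),
]


def length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by intervals a and b (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# ORFs whose span is not a whole number of codons.
not_whole_codons = [name for name, s, e in ORFS if length(s, e) % 3]
print(not_whole_codons)                         # ['orfAB']
print(overlap((107, 952), (943, 1746)))         # 10
print(overlap((1853, 2710), (2707, 2982)))      # 4
```

Only orfAB (1130 bp) is not a multiple of three, which is what one would expect for a product described in this record as arising from a -1 programmed frameshift between orfA and orfB.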

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
APH(6)-Id (ARO:3002660) APH(6)-Id Tn5393c 846 107-952 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  APH(6) (ARO:3000151)
Comment:   strB, orfI || strict match to reference sequence for ARO:3002660 (bitscore: 568)
Protein Sequence:  
MGLMFMPPVF PAHWHVSQPV LIADTFSSLV WKVSLPDGTP AIVKGLKPIE DIADELRGAD YLVWRNGRGA VRLLGRENNL MLLEYAGERM LSHIVAEHGD
YQATEIAAEL MAKLYAASEE PLPSALLPIR DRFAALFQRA RDDQNAGCQT DYVHAAIIAD QMMSNASELR GLHGDLHHEN IMFSSRGWLV IDPVGLVGEV
GFGAANMFYD PADRDDLCLD PRRIAQMADA FSRALDVDPR RLLDQAYAYG CLSAAWNADG EEEQRDLAIA AAIKQVRQTS Y

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
APH(3'')-Ib (ARO:3002639) APH(3'')-Ib Tn5393c 804 943-1746 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  APH(3'') (ARO:3000127)
Comment:   strA, orfH || perfect match to reference sequence for ARO:3002639
Protein Sequence:  
MNRTNIFFGE SHSDWLPVRG GESGDFVFRR GDGHAFAKIA PASRRGELAG ERDRLIWLKG RGVACPEVIN WQEEQEGACL VITAIPGVPA ADLSGADLLK
AWPSMGQQLG AVHSLSVDQC PFERRLSRMF GRAVDVVSRN AVNPDFLPDE DKSTPQLDLL ARVERELPVR LDQERTDMVV CHGDPCMPNF MVDPKTLQCT
GLIDLGRLGT ADRYADLALM IANAEENWAA PDEAERAFAV LFNVLGIEAP DRERLAFYLR LDPLTWG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
orfAB OrfAB IS1133 1130 1853-2982 -
Class:   Transposase
Transposase Chemistry:   DDE
Comment:   fusion protein from -1 programmed frameshifting between orfA and orfB
Protein Sequence:  
MSLKHSDEFK RDAVRIALTS GLTRRQVASD LSIGLSTLGK WIASISDETK IPTQDTDLLR ENERLRKENR ILREEREILK KAAIFFAVQK L*DFSLLRIT
VALSHVHAYV V*WA*QIVVY VHGNAVLHHC ASVVILYF*R IYVSSIGCVW GAMVGRV*QK S*KRWACRLG SVGLDV*CAR ITLQLFERVN SNGQRIVIIP
STLHRTY*NK TLAQAHPTRN GQAISLMFGP EKDGSILLLS LTCIPVA*LA GQQVID*SRI LH*GH*IWRW LYANHHRVVF NTQTVGANIA LMNIKSYCSN
INCCRP*AGK AIVLITPQ*K ASLNH*RLS* FGADTGKQGE ILRLQSSNI* MAFIIHAEDI QHSAGNRRWH LRKKPL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
orfB OrfB IS1133 858 1853-2710 -
Class:   Transposase
Function:   transposase activity (GO:0004803)
Transposase Chemistry:   DDE
Protein Sequence:  
MRFQFITDYR GSLSRSRICR LMGVTDRGLR AWKRRPPSLR QRRDLILLAH IREQHRLCLG SYGRPRMTEE LKALGLQVGQ RRVGRLMRQN NITVVRTRKF
KRTTDSHHTF NIAPNLLKQD FSASAPNQKW AGDITYVWTR EGWVYLAVIL DLYSRRVIGW ATGDRLKQDL ALRALNMALA LRKPPPGCIQ HTDRGSQYCA
HEYQKLLLKH QLLPSMSGKG NCFDNSAVES FFKSLKAELI WRRHWQTRRD IEIAIFEYIN GFYNPRRRHS TLGWKSPVAF EKKAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
orfA OrfA IS1133 276 2707-2982 -
Class:   Accessory Gene
Sub Class:   Regulator
Function:   transposase activity (GO:0004803)
Comment:   orfA, together with orfB, encodes a fusion protein formed by translational frame-shifting that is the transposase for IS1133
Protein Sequence:  
MSLKHSDEFK RDAVRIALTS GLTRRQVASD LSIGLSTLGK WIASISDETK IPTQDTDLLR ENERLRKENR ILREEREILK KAAIFFAVQK L

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn5393.7 612 3050-3661 -
Class:   Accessory Gene
Sub Class:   Resolvase
Function:   recombinase activity (GO:0000150)
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MLIGYARVSK ADGSQSLDLQ HDALRAAGVE RDNIYDDLAS GGRDDRPGLT ACLKSLRDGD VLVVWKLDRL GRSLAHLVNT VKELSDRKIG LRVLTGKGAQ
IDTTTASGRM VFGIFATLAE FERDLIRERT MAGLASARAR GRKGGRKFAL TKAQVRLAQA AMAQRDTSVS DLCKELGIER VTLYRYVGPK GELRDHGKHV
LGLT

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA N-ter TnpA N-ter Tn5393.7 2109 3787-5895 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MPRRVTLTDR QKDALLRLPT SQTDLLKHYT LSDEDLGHIR LRRRAHNRFG FALQLCVLRY PGRVLAPGEL IPAEVIEFIG AQLGLGADDL VDYAAREETR
HEHLAELRGL YGFRTFSGRG ASELKEWLFR EAEMAVSNED IARRFVAECR RTRTVLPATS TIERLCAAAL VDAERRIETR IASRLPMSIR EQLLALLEET
ADDRVTRFVW LRQFEPGSNS SSANRLLDRL EYLQRIDLPE DLLAGVPAHR VTRLRRQGER YYADGMRDLP EDRRLAILAV CVSEWQAMLA DAVVETHDRI
VGRLYRASER ICHAKVADEA GVVRDTLKSF AEIGGALVDA QDDGQPLGDV IASGSGWDGL KTLVAMATRL TATMADDPLN HVLDGYHRFR RYAPRMLRLL
DLRAAPVALP LLEAVTALRT GLNDAAMTSF LRPSSKWHRH LRAQRAGDAR LWEIAVLFHL RDAFRSGDVW LTRSRRYGDL KHALVPAQSI AEGGRLAVPL
RPEEWLADRQ ARLDMRLREL GRAARAGTIP GGSIENGVLH IEKLEAAAPT GAEDLVLDLY KQIPPTRITD LLLEVDAATG FTEAFTHLRT GAPCADRIGL
MNVILAEGIN LGLRKMADAT NTHTFWELIR IGRWHVEGEA YDRALAMVVE AQAALPMARF WGMGTSASSD GQFFVATEQG EAMNLVNAKY GNTPGLKAYS
HVS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
TEM-1 (ARO:3000873) TEM-1 Tn3.1 861 6042-6902 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   penem (ARO:3003706)||cephalosporin (ARO:0000032)||monobactam (ARO:0000004)||penam (ARO:3000008)
Sequence Family:  TEM beta-lactamase (ARO:3000014)
Comment:   perfect match to reference sequence for ARO:3000873||Synonyms: TEM-98, RTEM-1
Protein Sequence:  
MSIQHFRVAL IPFFAAFCLP VFAHPETLVK VKDAEDQLGA RVGYIELDLN SGKILESFRP EERFPMMSTF KVLLCGAVLS RVDAGQEQLG RRIHYSQNDL
VEYSPVTEKH LTDGMTVREL CSAAITMSDN TAANLLLTTI GGPKELTAFL HNMGDHVTRL DRWEPELNEA IPNDERDTTM PAAMATTLRK LLTGELLTLA
SRQQLIDWME ADKVAGPLLR SALPAGWFIA DKSGAGERGS RGIIAALGPD GKPSRIVVIY TTGSQATMDE RNRQIAEIGA SLIKHW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn3.1 558 7085-7642 -
Class:   Accessory Gene
Sub Class:   Resolvase
Function:   resolvase; serine site-specific recombinase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   first defined as a repressor
Protein Sequence:  
MRIFGYARVS TSQQSLDIQI RALKDAGVKA NRIFTDKASG SSTDREGLDL LRMKVEEGDV ILVKKLDRLG RDTADMIQLI KEFDAQGVAV RFIDDGISTD
GDMGQMVVTI LSAVAQAERR RILERTNEGR QEAKLKGIKF GRRRTVDRNV VLTLHQKGTG ATEIAHQLSI ARSTVYKILE DERAS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn3.1 3039 7771-10809 +
Class:   Transposase
Function:   transposase activity (GO:0004803)
Transposase Chemistry:   DDE
Comment:   In frame three amino acid deletion relative to tnpA (Tn3)
Protein Sequence:  
VLKKPSGREA DMPVDFLTTE QVESYGRFTG EPDELQLARY FHLDEADKEF IGKSRGDHNR LGIALQIGCV RFLGTFLTDM NHIPSGVRHF TARQLGIRDI
TVLAEYGQRE NTRREHAALI RQHYQYREFA WPWTFRLTRL LYTRSWISNE RPGLLFDLAT GWLMQHRIIL PGATTLTRLI SEVREKATLR LWNKLALIPS
AEQRSQLEML LGPTDCSRLS LLESLKKGPV TISGPAFNEA IERWKTLNDF GLHAENLSTL PAVRLKNLAR YAGMTSVFNI ARMSPQKRMA VLVAFVLAWE
TLALDDALDV LDAMLAVIIR DARKIGQKKR LRSLKDLDKS ALALASACSY LLKEETPDES IRAEVFSYIP RQKLAEIITL VREIARPSDD NFHDEMVEQY
GRVRRFLPHL LNTVKFSSAP AGVTTLNACD YLSREFSSRR QFFDDAPTEI ISQSWKRLVI NKEKHITRRG YTLCFLSKLQ DSLRRRDVYV TGSNRWGDPR
ARLLQGADWQ ANRIKVYRSL GHPTDPQEAI KSLGHQLDSR YRQVAARLGE NEAVELDVSG PKPRLTISPL ASLDEPDSLK RLSKMISDLL PPVDLTELLL
EINAHTGFAD EFFHASEASA RVDDLPVSIS AVLMAEACNI GLEPLIRSNV PALTRHRLNW TKANYLRAET ITSANARLVD FQATLPLAQI WGGGEVASAD
GMRFVTPVRT INAGPNRKYF GNNRGITWYN FVSDQYSGFH GIVIPGTLRD SIFVLEGLLE QETGLNPTEI MTDTAGASDL VFGLFWLLGY QFSPRLADAG
ASVFWRMDHD ADYGVLNDIA RGQSDPRKIV LQWDEMIRTA GSLKLGKVQA SVLVRSLLKS ERPSGLTQAI IEVGRINKTL YLLNYIDDED YRRRILTQLN
RGESRHAVAR AICHGQKGEI RKRYTDGQED QLGALGLVTN AVVLWNTIYM QAALDHLRAQ GETLNDEDIA RLSPLCHGHI NMLGHYSFTL AELVTKGHLR
PLKEASEAEN VA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA C-ter TnpA C-ter Tn5393.7 783 10843-11625 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
VSDQYAPFAT QVIPATASEA PYILDGLLMN DAGRHIREQF TDTGGFTDHV FAACAILGYR FAPRIRDLPS KRLYAFNPSA APAHLRALIG GKVNQAMIER
NWPDILRIAA TIAAGTVAPS QILRKLASYP RQNELATALR EVGRVERTLF MIDWILDAEL QRRAQIGLNK GEAHHALKRA ISFHRRGEIR DRSAEGQHYR
IAGMNLLAAI IIFWNTMKLG EVVANQKRDG KLLSPDLLAH VSPLGWEHIN LTGEYRWPKP

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS1133-LT827129.1 IS1133 Insertion Sequence 1815-3046 1232
Tn3.1-KY749247.1 Tn3.1 Transposon 5895-10842 4948

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRL Tn5393.7 1-40 GGGGTCGTTT GCGGGAGAGG GCGAAATCCT ACGCTAAGGC
IRR IS1133 1815-1841 TGGACTTGCA CCGGTTGTGT TCCGGTC
IRL IS1133 3020-3046 CTGGCCTTGC TTTGGCTCCG TTCAGGT
IRL Tn3.1 5895-5932 GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAG
IRR Tn3.1 10805-10842 GAATTGCACT CAAAAGCAAG GTGACTCGCA GTCTGGGG
IRR Tn5393.7 11578-11658 ACCCTTGTGT AGTTAGAGTG GCCTCTTATA GCGACCGGTT TCGGAATCGC
ATCCTAAGGC GGGGGAGGGC GTTTGCTGGG G
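The extent of each element can be cross-checked from its outermost repeats in the table above. The sketch below takes the smallest start and largest end listed for each element. It assumes that an element runs from the first base of its leftmost repeat to the last base of its rightmost repeat; `element_spans` is a hypothetical helper for this example.

```python
# (element, start, end) for each repeat row in the table above.
REPEATS = [
    ("Tn5393.7", 1, 40),
    ("IS1133", 1815, 1841),
    ("IS1133", 3020, 3046),
    ("Tn3.1", 5895, 5932),
    ("Tn3.1", 10805, 10842),
    ("Tn5393.7", 11578, 11658),
]


def element_spans(rows):
    """Map each element to (min start, max end) over its repeat rows."""
    spans = {}
    for element, start, end in rows:
        lo, hi = spans.get(element, (start, end))
        spans[element] = (min(lo, start), max(hi, end))
    return spans


spans = element_spans(REPEATS)
sizes = {name: end - start + 1 for name, (start, end) in spans.items()}
print(spans["IS1133"], sizes["IS1133"])      # (1815, 3046) 1232
print(spans["Tn3.1"], sizes["Tn3.1"])        # (5895, 10842) 4948
print(spans["Tn5393.7"], sizes["Tn5393.7"])  # (1, 11658) 11658
```

On this reading, the Tn3.1 copy occupies 5895-10842, which lies between the annotated tnpA N-ter (3787-5895) and tnpA C-ter (10843-11625) fragments of Tn5393.7.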