Transposon
Name: In_Tn21.1
Family: Tn402        Group: Class 1
Evidence of Transposition: yes
 Host     

Host Organism: Salmonella enterica subsp. enterica serovar Typhimurium
Molecular Source: plasmid pST1007-1A

 Terminal Inverted Repeats (IR)     


 Sequence     
DNA Sequence Length: 12168
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGGGCACTG 1400
TTGCAAATAG TCGGTGGTGA TAAACTTATC ATCCCCTTTT GCTGATGGAG CTGCACATGA ACCCATTCAA AGGCCGGCAT TTTCAGCGTG ACATCATTCT 1500
GTGGGCCGTA CGCTGGTACT GCAAATACGG CATCAGTTAC CGTGAGCTGC AGGAGATGCT GGCTGAACGC GGAGTGAATG TCGATCACTC CACGATTTAC 1600
CGCTGGGTTC AGCGTTATGC GCCTGAAATG GAAAAACGGC TGCGCTGGTA CTGGCGTAAC CCTTCCGATC TTTGCCCGTG GCACATGGAT GAAACCTACG 1700
TGAAGGTCAA TGGCCGCTGG GCGTATCTGT ACCGGGCCGT CGACAGCCGG GGCCGCACTG TCGATTTTTA TCTCTCCTCC CGTCGTAACA GCAAAGCTGC 1800
ATACCGGTTT CTGGGTAAAA TCCTCAACAA CGTGAAGAAG TGGCAGATCC CGCGATTCAT CAACACGGAT AAAGCGCCCG CCTATGGTCG CGCGCTTGCT 1900
CTGCTCAAAC GCGAAGGCCG GTGCCCGTCT GACGTTGAAC ACCGACAGAT TAAGTACCGG AACAACGTGA TTGAATGCGA TCATGGCAAA CTGAAACGGA 2000
TAATCGGCGC CACGCTGGGA TTTAAATCCA TGAAGACGGC TTACGCCACC ATCAAAGGTA TTGAGGTGAT GCGTGCACTA CGCAAAGGCC AGGCCTCAGC 2100
ATTTTATTAT GGTGATCCCC TGGGCGAAAT GCGCCTGGTA AGCAGAGTTT TTGAAATGTA AGGCCTTTGA ATAAGACAAA AGGCTGCCTC ATCGCTAACT 2200
TTGCAACAGT GCCCATTTAT GAATGTTCCT GTTATGGCTT ATGTTCAAGA AAGCATTGCC CCTGAAATGA TGGGCAAGGT GTTTTCCCTT TTGATGACCG 2300
CCATGACTCT TTCTATGCCG ATAGGCTTAC TTGTTGCAGG TCCGGTTGTT GAGGTTATAG GTGTTAATAC ATGGTTTTTC TGGTCTGGTG TTGCGTTGAT 2400
AGTAAACGCT GTTCTCTGCC GCATTCTGAC ACGACGCTAT GACAAAGTAA CAATGAAACC GCAAGTGGAC TGAAAAAAGG ACCGGGTTGA TGATAATTTG 2500
TAGTGGTGAG CTTCTGGGAG TACAAAACAA AGTGCTCAAA ATTGTCGGGC TCATGGCGTT TAACGGTATT AATTTCGCTT ATAATAATCT TTCTATAATA 2600
GCCTAAAGGA GAATATCTAT GATACCTAAT AGCGAAAATA AAAGAGTATG GTTTATTACC GGAGCAAGCA AGGGGCTTGG CTATGCTTTT ACATGCGCCG 2700
CCTTGAAAGC CGGGGATAAA GTTGTTGCAG TTGCAAGGAC TATCGATAAT TTGGCGAAGC TAGAAGAAAC ATATCAAGAG AGCTTACTGC CATTAAACCT 2800
CGATGTTACA GATAGGGAGG CTGTTTTTTC TACGGTTGAA ACAGCAGTTA AACATTTCGG TAGGCTTGAT ATTGTTGTTA ATAATGCGGG TATCATGACT 2900
ATGGGTATGA TTGAAGAACT AAACGAATCC GATGCTCGGA AACTAATGGA CACAAACTTT TTTGGAGCTC TTTGGGTTTG TCAGGCAGTG ATGCCCTATT 3000
TGAGGTCGCA GCGTTCTGGA CATATCATAC AGATTACAAG CATTGGCGCT ATTATCTCGG GTCCGATGTC TGGTATTTAC AGTGCAAGTA AATTTGCGCT 3100
GGAAGGAATG AGTGAAGCCT TAGCAAAAGA AGCGGAACAT TTCGGAGTGA AACTCACTAT GGTGGAACCA GGAGGATATT GGACAGATTT ATACACTTCG 3200
ATGAGTTACA GCAATCCATT AGATTCATAC GGTACACTGC GCGATGAGCT GGCGAAACAG TATTCCGAAG ATTCGGTCGA TAGTGATCCT TCCTTGGCAG 3300
CGGAAGCCCT TATGAAATTA GTAGCCAGCA ATAATCCGCC TTTGAGGCTT ATCCTTGGCA GCATGGTATA CGATTTGGCG ATGGATACCT TAAAAGCACG 3400
GATGGCTACC TGGGAAGAAT GGGAAGCTGT TAGCCGTGCA TCAGAAAAGG CTATTCCTGC ACCGGAGAGA TATGGAGTAT AATACCAAAC ATGGTTTTTG 3500
AGGCAAGGAC GAAACTATCA TGAAAATTGA AATTATGGAA TATAACCCGG ATTGGACAAA GAATTTTGAA GAAGAGAAAA TAAAATTGCT CCATTTTTTC 3600
GGTTCTCATG CCGTAGCCAT TGAACATATC GGAAGCACGG CGATACCCAA TCAACGGGCG AAACCGGTTA TAGATATATT TATTGGCGTT TCGCCTTTTG 3700
CTGAACTACC TTTTATCAGC GCATTTTTAA TGCAAAGGAG TATCACCACA CTCCGACAGA TATGACAGGC AGATATTTGT TTGCAAAATA TACAAATGAA 3800
GTTTGGACCC ATAATCTTCT TGCTTTGGTG TAGGTATAAT ACCCAAAAAC GAAGTTTATG CGGAACAGGA TAACACCTCG AAATCGGAGT TTATATGGCT 3900
TCGTGGGGGA GTTATGTGTT TGAGCTTTGT CGCATTAACG CGAATCAGGG GTTGACGGCA GAAATAGACT GGAATTACAC TTCTTGGAGT CGGCGTTGCC 4000
GGAAAATTCT GATTGGATTA GTTGTTCGGG GTGCGAAAAC AGTCGTAGTT CGGGGAAAAA ACCGAATTTT GACCCAAAAC CGCGAAAAAC TCGAAATGAC 4100
AGTTCCTATC AGTTTTCATA ATGAATTAAA ACAAAAAAAT TAGGCTGCAA AGATAGGGAA AATGTGTCAA TTTGCGTAAG TGATAATTAG TCAATTAAGA 4200
TAAATGCTAC TTTTGTAATC AAAGCAAAAT GATATGAGCA AGATTTTTGG AATCGTAAAT ATAACCACCG ATAGTTTTTC CGATGGAGGA CTTTATTTAG 4300
ATACAGATAA GGCAATTGAG CATGCTCTGC ATTTGGTTGA AGATGGAGCA GATGTGATTG ATTTGGGAGC CGCTTCCAGT AATCCTGATA CAACTGAAGT 4400
GGGCGTTGTG GAAGAAATCA AAAGACTCAA ACCTGTCATT AAGGCTTTAA AAGAAAAAGG CATTTCTATT TCTGTTGATA CATTTAAACC TGAGGTTCAG 4500
AGTTTTTGCA TAGAACAAAA GGTTGATTTT ATTAATGATA TTCAAGGTTT TCCTTATCCT GAGATTTATT CAGGCTTGGC AAAGTCAGAT TGCAAACTTG 4600
TGTTGATGCA CTCCGTTCAG CGAATTGGTG CAGCTACTAA AGTTGAAACG AATCCGGAAG AGGTTTTTAC TTCCATGATG GAATTTTTTA AAGAAAGAAT 4700
TGCTGCTTTA GTTGAGGCTG GTGTAAAGCG TGAACGAATT ATTCTTGATC CGGGTATGGG CTTCTTTTTA GGCTCTAATC CAGAAACATC TATTCTTGTT 4800
TTGAAGCGTT TCCCTGAAAT TCAAGAAGCT TTTAATTTGC AAGTAATGAT TGCAGTGTCA CGGAAATCAT TCTTAGGTAA AATAACTGGA ACCGATGTGA 4900
AATCTCGTTT AGCACCAACT CTTGCAGCAG AAATGTATGC ATACAAAAAA GGTGCAGATT ATCTCCGCAC CCATGATGTT AAGTCTTTAT CAGATGCCTT 5000
GAAAATATCC AAAGCCCTAG GTTAGTTTTA TTTTTTTAGC TTTCTAAATG AATATGCTAT ACTTGCTCTG AAGTTGTGTT GGCGTACTTG ACTTAGGTCT 5100
GGATCTTTCG AACTGTTACT AAGAGGGTAA TCATAGCCAA ACATAAACTG GAAATGTCCT AATTCTAAAC CTGTTCCAAT ACCTACACTA TAATCAAGAG 5200
GACTTAAACT TCCATCACTT CCAAAAGGTT TTGTGTACTT CTCTTGAGTT TTCGTAGCTA AATAGTAGTG TTTGTTGGTA AATCCAAATA ATCCAATATT 5300
CAATCTTGCG ACCGGGAAAG TCCTCTTGAA AGTACCACAA GTTCGCGGGA TGGAGTTTTA TCCCAGCTGC ATCGAGAAAG GCATGCGCAG TGAGCGTGCT 5400
CTCAAGCTCG CCATCGCCGA AATGTATGTC AAAGGAGTAA GTACCCGCAG GGTCTCGGAT ATCGTCGAAA TTCTTTGTGG CACCGAAGTC AGCTCGTCCC 5500
AGGTCAGCAG GCTGGCAAAG GAGCTCGATG AAGAGATTAC GTCTTGGAAG GCGCAGCCTG TCGGACAGAT TCAATACTTG GTACTTGATG CGACCTATGA 5600
ATCGGTTCGC GTCGGTTCCC ATGTGGTCAA GCAGGCGCTT CTAGTGGCTA TTGGCGTTGA TTACAGCGGG AATCGGCATA TTCTTGACGC CGAAGTCGCG 5700
AACAGTGAGG CAGAGGTAAA CTGGCGTTCC TTTCTCGAGG GTCTCGTACG ACGAGGGATG CACGGCCTGC GAATGATCAC CAGTGATGAC CACTCAGGAC 5800
TGCGCGCGGC AATCGATGCT GTCTTCCCTG GAATTCTGTG GCAACGCTGC CAGTTTCATC TGCAGCAGAA TGCCCACTCC TACGTCACGA AAAAAGATGA 5900
GATCCCGCTG ATAGCCGCTG ATATTCGGAA GGTGTTCAAT CGGAATATGT CAAGATAGTT TATGCCAGAT TTTTGAGAGG TTTGCATCAA GCAGCCTTCG 6000
AAAGAGGACG GCATGACATT GAGACCGGCA AGGCGTCGTA GATACGCACT TTCCCAGTAC GCCACCGTAA GGGGTTTTTC TCTTTGGCAG CAAGAATGGT 6100
TTGGTTTCTT TTTTCCTCGG CGCCGTAGGC GTCCGACTAA CACCTGAATT AAGCCGCGCC GCGAAGCGGC GTCGGCTTGA ATGAACTGTT AGATGCCAGC 6200
CCGATCAATG TGCGCTGACC TTGGATAGCA GATTTAGAAC GGCGACGCCA CTAACGATAA GTCCCATGCC AACGAACGCC CACAAGTCTA GTTTCTGGCC 6300
ATGGAAGATC CAAGCGATAG CTGCCACAAG TACGATGCCG AGGCCAGCCC AAACAGCATA AGCAATGCCG ACCGGGATGG ACTTGAGTGC GAGAGAGAGG 6400
AAATAGAACG CAAGCCCGTA GCCAGCCACA ACTACAACAG AAGGAACTAA CTTGGTGAAT CCATGGCTGG ACTTCAGTGC GGAAGTTGCG ACGACCTCAC 6500
CAAATATTGC AATAGCCAGA AAGAGCCAGT TCTTCACGTG CAATCTCCTC TACGGTATGA AGGATAAATA GTGGTGGCTA TGAGTTGCCA AAAACAGTCT 6600
TGCGGCTGTC GATTTTCTGT GAGCATACGC AACGCCAAAT CTGGCATCTA ACGCTTGAGT TAAGCCGCGC CGCGAAGCGG CGTCGGCTTG AACGAATTGT 6700
TAGACATTAT TTGCCGACTA CCTTGGTGAT CTCGCCTTTC ACGTAGTGAA CAAATTCTTC CAACTGATCT GCGCGGGAGG CCAAGCGATC TTCTTCTTGT 6800
CCAAGATAAG CCTGTCTAGC TTCAAGTATG ACGGGCTGAT ACTGGGCCGG CAGGCGCTCC ATTGCCCAGT CGGCAGCGAC ATCCTTCGGC GCGATTTTGC 6900
CGGTTACTGC GCTGTACCAA ATGCGGGACA ACGTAAGCAC TACATTTCGC TCATCGCCAG CCCAGTCGGG CGGCGAGTTC CATAGCGTTA AGGTTTCATT 7000
TAGCGCCTCA AATAGATCCT GTTCAGGAAC CGGATCAAAG AGTTCCTCCG CCGCTGGACC TACCAAGGCA ACGCTATGTT CTCTTGCTTT TGTCAGCAAG 7100
ATAGCCAGAT CAATGTCGAT CGTGGCTGGC TCGAAGATAC CTGCAAGAAT GTCATTGCGC TGCCATTCTC CAAATTGCAG TTCGCGCTTA GCTGGATAAC 7200
GCCACGGAAT GATGTCGTCG TGCACAACAA TGGTGACTTC TACAGCGCGG AGAATCTCGC TCTCTCCAGG GGAAGCCGAA GTTTCCAAAA GGTCGTTGAT 7300
CAAAGCTCGC CGCGTTGTTT CATCAAGCCT TACGGTCACC GTAACCAGCA AATCAATATC ACTGTGTGGC TTCAGGCCGC CATCCACTGC GGAGCCGTAC 7400
AAATGTACGG CCAGCAACGT CGGTTCGAGA TGGCGCTCGA TGACGCCAAC TACCTCTGAT AGTTGAGTCG ATACTTCGGC GATCACCGCT TCCCTCATGA 7500
TGTTTAACGC CTGAGCTCAG CCGACCGAAA CCGCGTAGCG GTTTTGGGTC GGCTGCAGCG ATTTGTTGGG CGATAGCTTG CCACATTCTC TCAACGATTG 7600
GGATTTGATG TACTTTCCGC ACTTTGTAGC GCGACCACAT CATGCTCCCC CTGGCCGCGA GAGCCCTTCA CTCGGGAAAC ACAAGACAGA CCGAGCACGA 7700
CTGTTGCAAG GGTCAAACAG TACACGACAA CCGGCCAAGC CGTGTTGCGC GGCAACAGCG AAATGATCAA CGTTCCGATG CTTCCTAGCA GTACACCGCC 7800
CAAGCAGAAG TAGACTGCCG TGACCGTTCC AGCAACATGG TCGAATCCTC GAAGAGCGCC ATTGGGCGCC ACAGATACCG CTGTGGCGAC ACCAATACCC 7900
ACTAGCCACA TTGGAGCAAT AAAGCCTAAC ACGGACTGCA AAGCCCATAT TTCGGTGATG GCAAGCAATA CTGCTCCAGC TATCAGGCAT CCCATTCCCA 8000
TTCGCAAGAC ACTTGGGCTG CCCCACTTGG GTATCACACG CCCCATAAAA CGAGCCGTAA ACACCATGGC AATTGCCACT GTGGCGAACA GCAGGCTGAA 8100
GCCAAGCTGA GACACACCTT GCCTGCCCAT CATTAGTCCG GGCGCAATGG AGAAAAAGAC GAAGAAGCTA CCCATTCCAG CGGCGTAACA CAACGTGTAC 8200
AACCAGAAGT TCAGGCACTT AACGGGGAGT AGCAGCTGCG ACCATTGCAA GCCCGCAACT CGTTGCACCC GGGTTTCAGG CCAGAATCGC CACGCTGCTG 8300
CAGATGCAGC GATCATGCCC AAACCTAGAA ACGCAAAGAT AGCCCGCCAC CCAAGCCACA TGTCGACGAG CGCTCCGAGC AATGGGCCTA CCGCCGGGAC 8400
CATGGCCAGC ATGGATCCGA GTATGCCGTA AATGACATTA CTTTCCTCGC GACCTGCGTA AATGTCACGT ACTGTTGCAA ATGTGGAAAC AAGGCACGCC 8500
GAGGCACCAC AAGCCTGAAG AATCCGAAGC CCCAGAAAGA CTTCAGCCGA TGACGTAAGA GCGAGGCCCA TTGACGCCAC AACGTAGGCG AGGCCACCTC 8600
CCAGTAGAAC GGGGCGGCGC CCCAGTCGGT CCGATAGCGG TCCAAACAAG AGCTGACCGG CACCAATCAT GACCAAGTAC GTTGTCAGCG TAAGCTGAAT 8700
TGTGCTCGCT GTCGTACCAA GCGCGTTTGG CATAAACGGC ACTGCTGGCA AGTACATGTC CATGCCGAGT GATGCCAATA AATCGAACGG TGATAACAAC 8800
AACACCGTGG CGGCAAGGGA GTACCGCCAA CTAAAGTTTT TTGAGCTCAC GAAACCATCT CCTTGAACAA AGGATTTGGC GGCGCTCCCG TAGAGCAAAA 8900
GGATTCATGA GAACGCCGCA ACAACCGAAA AATGAAGGTT GCTGCGGCTT ACTTGTCTGC GTTCTTGGAA GTGCTCATCT GCTGACTATC TCATGATTGA 9000
ATTTGAATCG TAACAACTTT CAAGCAACTC TGCAAGGAGC CTTATTGTGC GCCCAACGCC GGAGTTAAGC CGCCGCGCGT AGCGCGGTCG GCTTGAACGA 9100
ATTGTTAGAC ATCATTTACC AACTGACTTG ATGATCTCGC CTTTCACAAA GCGAATAAAT TCTTCCAAGT GATCTGCGCG TGAGGCCAAG TGATCTTCTT 9200
TTTGTCCCAG ATAAGCTTGC TTAGCTTCAA GTAAGACGGG CTGATACTGG GCAGGTAGGC GTTTTATTGC CCAGTCGGCA GCGACATCCT TCGGCGCGAT 9300
TTTGCCGGTT ATTGCGCTGT ACCAAATGCG GGACAACGTA AGCACTACAT TTCGCTCATC GCCGGCCCAG TCGGGCTGCG AGTTCCATAG CTTCAAGGTT 9400
TCCCTCAGCG CCTCGAATAG ATCCTGTTCA GGAACCGGGT CAAAGAATTC CTCCGCTGCC GGACCTACCA AGGCAACGCT ATGTTCTCTT GCTTTTGTAA 9500
GCAGGATAGC TAGATCAATG TCGATCATGG CTGGCTCGAA GATACCCGCA AGAATGTCAT TGCGCTGCCA TTCTCCAAAT TGCAGCTCGC GCTTAGCCGG 9600
ATAACGCCAC GGGATGATGT CGTCATGCAC GACAAGGGTG ACTTCTATAG CGCGGAGCGT CTCGCTCTCG CCAGGGAAAG CCGAAGCCTC CATAAGATCA 9700
TTGAGCAATG CTCGCCGCGT CGTTTCATCA AGCTTTACGG CCACAGTAAC CAACAAATCA ATATCGCTGT ATGGCTTCAG GCCGCCATCC ACTGCGGAGC 9800
CGTACAAATG CACGGCCAGC AACGTTGATT CCAGATGGCG CTCAATGACG CTTAGCACCT CTGATAGTTG GTTCGAAATT TCGATGGTCA CCGCTTCCCT 9900
CATGATGTCT AACGGGCGAG GTAAGCCGAC CGCAGAATGC GGGTCGGCTT GACCGAAATG TTAGAACCAG AAGCCAAAAC GGATAACTTG AATTTGGCGA 10000
CGGGCGCTAA CCGTGAAAAA ACGCTGCGCC ACCGAGGCGG CACAGCACTG CAAAAACGAT AGCTGCTTGC GCTTGCTACG CAAGGGCTAG AGGCCAAAAA 10100
GACTGAAAAC CTGCGCAGCC CATGCAGGCG AAGCCCGGAA AAAAGGCAAA ACAGGCACTG AATAACGCCT GAAAAGCTAA ATGCCGTTTG AATAAACATG 10200
AGCTAAATAA AGCTGGGTTT CAGTGGTGCT AACGTTGGAC GTAACGAGAG CCGGAGCGCA GCGGAGGGAA CCAAAATGCG CAGCATTTTG GCGTCCCGTT 10300
GACGGAATGG TTAGCCGTTT CGACGCGCAT AAACGGAGTG GGTGTACGGA ATTACAGCTT GAATGGTTTC GGTTGAGACA AGCTCGAATT CTGTTTCGTT 10400
GAGCATTGGG AAGAAGGCGT CACCCTCGAA GGTTTGATGT ACCTCAGATA GAAACACGCC GTGGGCGTGA GGTAGTGCCA GAGTGTATAT CTCAGCTCCG 10500
CCCGCGACGT AGAGTTCATT GCCGAGTTCG GATGCCAAAG CGATAGCGTG CGACAGCGTT GAAACAACTA CGCAGCCAGT GGCGCGGTAG TTAGCTTGGC 10600
GTGAGATTAC CAATGTGTGA CGGTTCGGTA GAGGCTTGCC GATAGACTCA AAGGTCTTTC GCCCCATGAC AACGACTTTT CCCTCAGTGA GTCTGCGAAA 10700
AATCTTCTGC TCACCCGGAA TTTTCCAGGG GATATTAGGA CCATTGCCAA TAACCCGATT GGCTCCCATC GCAGCAACGA GATAAATGCG TACTGATTCC 10800
GAGTTCATAT GGCTAACTTT GTTTTAGGGC GACTGCCCTG CTGCGTAACA TCGTTGCTGC TCCATAACAT CAAACATCGA CCCACGGCGT AACGCGCTTG 10900
CTGCTTGGAT GCCCGAGGCA TAGACTGTAC AAAAAAACAG TCATAACAAG CCATGAAAAC CGCCACTGCG CCGTTACCAC CGCTGCGTTC GGTCAAGGTT 11000
CTGGACCAGT TGCGTGAGCG CATACGCTAC TTGCATTACA GCTTACCAAC CGAACAGGCT TATGTCCACT GGGTTCGTGC CTTCATCCGT TTCCACGGTG 11100
TGCGTCACCC GGCAACCTTG GGCAGCAGCG AAGTCGAGGC ATTTCTGTCC TGGCTGGCGA ACGAGCGCAA GGTTTCGGTC TCCACGCATC GTCAGGCATT 11200
GGCGGCCTTG CTGTTCTTCT ACGGCAAGGT GCTGTGCACG GATCTGCCCT GGCTTCAGGA GATCGGAAGA CCTCGGCCGT CGCGGCGCTT GCCGGTGGTG 11300
CTGACCCCGG ATGAAGTGGT TCGCATCCTC GGTTTTCTGG AAGGCGAGCA TCGTTTGTTC GCCCAGCTTC TGTATGGAAC GGGCATGCGG ATCAGTGAGG 11400
GTTTGCAACT GCGGGTCAAG GATCTGGATT TCGATCACGG CACGATCATC GTGCGGGAGG GCAAGGGCTC CAAGGATCGG GCCTTGATGT TACCCGAGAG 11500
CTTGGCACCC AGCCTGCGCG AGCAGCTGTC GCGTGCACGG GCATGGTGGC TGAAGGACCA GGCCGAGGGC CGCAGCGGCG TTGCGCTTCC CGACGCCCTT 11600
GAGCGGAAGT ATCCGCGCGC CGGGCATTCC TGGCCGTGGT TCTGGGTTTT TGCGCAGCAC ACGCATTCGA CCGATCCACG GAGCGGTGTC GTGCGTCGCC 11700
ATCACATGTA TGACCAGACC TTTCAGCGCG CCTTCAAACG TGCCGTAGAA CAAGCAGGCA TCACGAAGCC CGCCACACCG CACACCCTCC GCCACTCGTT 11800
CGCGACGGCC TTGCTCCGCA GCGGTTACGA CATTCGAACC GTGCAGGATC TGCTCGGCCA TTCCGACGTC TCTACGACGA TGATTTACAC GCATGTGCTG 11900
AAAGTTGGCG GTGCCGGAGT GCGCTCACCG CTTGATGCGC TGCCGCCCCT CACTAGTGAG AGGTAGGGCA GCGCAAGTCA ATCCTGGCGG ATTCACTACC 12000
CCTGCGCGAA GGCCATCGGT GCCGCATCGA ACGGCCGGTT GCGGAAAGTC CTCCCTGCGT CCGCTGATGG CCGGCAGCAG CCCGTCGTTG CCTGATGGAT 12100
CCAACCCCTC CGCTGCTATA GTGCAGTCGG CTTCTGACGT TCAGTGCAGC CGTCTTCTGA AAACGACA
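The sequence above is laid out in 10-base groups, with a column ruler on top and a running position count at the end of each full line. A minimal sketch of turning that layout back into one contiguous string is shown here; the function name `parse_sequence_block` is hypothetical and not part of any TnCentral tooling, and the example input is only the ruler plus the first sequence line of this record.

```python
import re

def parse_sequence_block(lines):
    """Join 10-base groups into one string.

    Skips blank lines and the dashed column ruler, and ignores the
    running position counter (a purely numeric token) at line ends.
    """
    bases = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("-"):
            continue  # blank line or ruler such as "--------10 --------20"
        for token in line.split():
            if re.fullmatch(r"[ACGTN]+", token):
                bases.append(token)
    return "".join(bases)

example = [
    "--------10 --------20 --------30 --------40 --------50 "
    "--------60 --------70 --------80 --------90 -------100",
    "TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT "
    "GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100",
]
seq = parse_sequence_block(example)
print(len(seq), seq[:20])
```

Applied to the full block, the joined string should be 12168 bases long if the stated length is correct.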

 Recombination Sites     

Name Coordinates Length Sequence
attC aadA1a core 6652-6705 54 CGCTTGAGTT AAGCCGCGCC GCGAAGCGGC GTCGGCTTGA ACGAATTGTT AGAC
attC cmlA6 core 7508-7571 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT GGGC
attC-cmlA6 3'-end 9051-9056 6 GCCCAA
attC aadA3 core 9057-9110 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC orfD core 9918-9965 48 GAGGTAAGCC GACCGCAGAA TGCGGGTCGG CTTGACCGAA ATGTTAGA
attC dfrA12 core 10233-10316 84 CGTTGGACGT AACGAGAGCC GGAGCGCAGC GGAGGGAACC AAAATGCGCA GCATTTTGGC GTCCCGTTGA CGGAATGGTT AGCC
attI 10817-10872 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA 5'-end In_Tn21.1 106-1410 Transposase   +
tnpA IS26 1457-2161 Transposase   +
SDR family oxidoreductase In_Tn21.1 2619-3482 Passenger Gene Other +
GrpB domain protein In_Tn21.1 3520-3765 Passenger Gene Other +
sul3 (ARO:3000413) In_Tn21.1 4234-5025 Passenger Gene Antibiotic Resistance +
tnp IS256 family In_Tn21.1 5350-5958 Transposase   +
qacL (ARO:3005098) In_Tn21.1 6205-6537 Passenger Gene Antibiotic Resistance -
aadA (ARO:3002601) In_Tn21.1 6707-7498 Passenger Gene Antibiotic Resistance -
cmlA6 (ARO:3002696) In_Tn21.1 7591-8850 Passenger Gene Antibiotic Resistance -
aadA2 (ARO:3002602) In_Tn21.1 9112-9891 Passenger Gene Antibiotic Resistance -
DUF1010 family protein In_Tn21.1 9909-10199 Passenger Gene Other -
dfrA12 (ARO:3002858) In_Tn21.1 10311-10808 Passenger Gene Antibiotic Resistance -
intI1 In_Tn21.1 10953-11966 Integron Integrase Class 1 +
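The coordinates in the summary are 1-based and inclusive, so each span should equal the Gene Length reported for the same ORF in ORF Details, and a complete or frame-preserving coding region should be a multiple of three. The sketch below runs that consistency check; `ORFS`, `span_length`, and `check_orfs` are hypothetical names introduced for illustration, and the numbers are copied from this record.

```python
# (name, start, end, gene length listed in ORF Details), copied from this record.
ORFS = [
    ("tniA 5'-end", 106, 1410, 1305),
    ("tnpA (IS26)", 1457, 2161, 705),
    ("SDR family oxidoreductase", 2619, 3482, 864),
    ("GrpB domain protein", 3520, 3765, 246),
    ("sul3", 4234, 5025, 792),
    ("tnp IS256 family", 5350, 5958, 609),
    ("qacL", 6205, 6537, 333),
    ("aadA", 6707, 7498, 792),
    ("cmlA6", 7591, 8850, 1260),
    ("aadA2", 9112, 9891, 780),
    ("DUF1010 family protein", 9909, 10199, 291),
    ("dfrA12", 10311, 10808, 498),
    ("intI1", 10953, 11966, 1014),
]

def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1

def check_orfs(orfs):
    """Return a list of (name, problem, computed, listed) tuples."""
    problems = []
    for name, start, end, listed in orfs:
        length = span_length(start, end)
        if length != listed:
            problems.append((name, "length mismatch", length, listed))
        if length % 3 != 0:
            problems.append((name, "not a multiple of 3", length, listed))
    return problems

print(check_orfs(ORFS))
```

An empty result means every coordinate pair agrees with its listed gene length and is divisible by three; it says nothing about whether the annotated start and stop codons are correct.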

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA 5'-end TniA 5'-end In_Tn21.1 1305 106-1410 +
Class:   Transposase
Function:   integrase
Transposase Chemistry:   DDE
Comment:   Contains the first 429 amino acids of tniA (In2); probably truncated by insertion of IS26
Protein Sequence:  
MLNTRVHQSE VSMATDTPRI PEQGVATLPD EAWERARRRA EIISPLAQSE TVGHEAADMA AQALGLSRRQ VYVLIRRARQ GSGLVTDLVP GQSGGGKGKG
RLPEPVERVI HELLQKRFLT KQKRSLAAFH REVTQVCKAQ KLRVPARNTV ALRIASLDPR KVIRRREGQD AARDLQGVGG EPPAVTAPLE QVQIDHTVID
LIVVDDRDRQ PIGRPYLTLA IDVFTRCVLG MVVTLEAPSA VSVGLCLVHV ACDKRPWLEG LNVEMDWQMS GKPLLLYLDN AAEFKSEALR RGCEQHGIRL
DYRPLGQPHY GGIVERIIGT AMQMIHDELP GTTFSNPDQR GDYDSENKAA LTLRELERWL TLAVGTYHGS VHNGLLQPPA ARWAEAVARV GVPAVVTRAT
SFLVDFLPIL RRTLTRTGFV IDHIHYYADG HCCK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA IS26 705 1457-2161 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MNPFKGRHFQ RDIILWAVRW YCKYGISYRE LQEMLAERGV NVDHSTIYRW VQRYAPEMEK RLRWYWRNPS DLCPWHMDET YVKVNGRWAY LYRAVDSRGR
TVDFYLSSRR NSKAAYRFLG KILNNVKKWQ IPRFINTDKA PAYGRALALL KREGRCPSDV EHRQIKYRNN VIECDHGKLK RIIGATLGFK SMKTAYATIK
GIEVMRALRK GQASAFYYGD PLGEMRLVSR VFEM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
SDR family oxidoreductase SDR family oxidoreductase In_Tn21.1 864 2619-3482 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  WP_000612791.1
Protein Sequence:  
MIPNSENKRV WFITGASKGL GYAFTCAALK AGDKVVAVAR TIDNLAKLEE TYQESLLPLN LDVTDREAVF STVETAVKHF GRLDIVVNNA GIMTMGMIEE
LNESDARKLM DTNFFGALWV CQAVMPYLRS QRSGHIIQIT SIGAIISGPM SGIYSASKFA LEGMSEALAK EAEHFGVKLT MVEPGGYWTD LYTSMSYSNP
LDSYGTLRDE LAKQYSEDSV DSDPSLAAEA LMKLVASNNP PLRLILGSMV YDLAMDTLKA RMATWEEWEA VSRASEKAIP APERYGV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GrpB domain protein GrpB domain protein In_Tn21.1 246 3520-3765 +
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  GrpB (Pfam:PF04229)
Protein Sequence:  
MKIEIMEYNP DWTKNFEEEK IKLLHFFGSH AVAIEHIGST AIPNQRAKPV IDIFIGVSPF AELPFISAFL MQRSITTLRQ I

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul3 (ARO:3000413) Sul3 In_Tn21.1 792 4234-5025 +
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Target:   sulfone antibiotic (ARO:3003401)||sulfonamide antibiotic (ARO:3000282)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000413
Protein Sequence:  
MSKIFGIVNI TTDSFSDGGL YLDTDKAIEH ALHLVEDGAD VIDLGAASSN PDTTEVGVVE EIKRLKPVIK ALKEKGISIS VDTFKPEVQS FCIEQKVDFI
NDIQGFPYPE IYSGLAKSDC KLVLMHSVQR IGAATKVETN PEEVFTSMME FFKERIAALV EAGVKRERII LDPGMGFFLG SNPETSILVL KRFPEIQEAF
NLQVMIAVSR KSFLGKITGT DVKSRLAPTL AAEMYAYKKG ADYLRTHDVK SLSDALKISK ALG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp IS256 family Tnp IS256 family In_Tn21.1 609 5350-5958 +
Class:   Transposase
Function:   transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MEFYPSCIEK GMRSERALKL AIAEMYVKGV STRRVSDIVE ILCGTEVSSS QVSRLAKELD EEITSWKAQP VGQIQYLVLD ATYESVRVGS HVVKQALLVA
IGVDYSGNRH ILDAEVANSE AEVNWRSFLE GLVRRGMHGL RMITSDDHSG LRAAIDAVFP GILWQRCQFH LQQNAHSYVT KKDEIPLIAA DIRKVFNRNM
SR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacL (ARO:3005098) QacL In_Tn21.1 333 6205-6537 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   disinfecting agents and antiseptics (ARO:3005386)
Sequence Family:  small multidrug resistance (SMR) antibiotic efflux pump (ARO:0010003)
Comment:   subunit of the qac multidrug efflux pump; strict match to reference sequence for ARO:3005098 (bitscore: 202)
Protein Sequence:  
MKNWLFLAIA IFGEVVATSA LKSSHGFTKL VPSVVVVAGY GLAFYFLSLA LKSIPVGIAY AVWAGLGIVL VAAIAWIFHG QKLDLWAFVG MGLIVSGVAV
LNLLSKVSAH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA (ARO:3002601) AadA In_Tn21.1 792 6707-7498 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transposase Chemistry:   aminoglycoside nucleotidyltransferase
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   perfect match to reference sequence for ARO:3002601; Synonyms: aadA1-pm aadA, aadA1, aad(3'')(9)
Protein Sequence:  
MREAVIAEVS TQLSEVVGVI ERHLEPTLLA VHLYGSAVDG GLKPHSDIDL LVTVTVRLDE TTRRALINDL LETSASPGES EILRAVEVTI VVHDDIIPWR
YPAKRELQFG EWQRNDILAG IFEPATIDID LAILLTKARE HSVALVGPAA EELFDPVPEQ DLFEALNETL TLWNSPPDWA GDERNVVLTL SRIWYSAVTG
KIAPKDVAAD WAMERLPAQY QPVILEARQA YLGQEEDRLA SRADQLEEFV HYVKGEITKV VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
cmlA6 (ARO:3002696) CmlA6 In_Tn21.1 1260 7591-8850 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   phenicol antibiotic (ARO:3000387)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   strict match to reference sequence for ARO:3002696 (bitscore: 819)
Protein Sequence:  
MSSKNFSWRY SLAATVLLLS PFDLLASLGM DMYLPAVPFM PNALGTTAST IQLTLTTYLV MIGAGQLLFG PLSDRLGRRP VLLGGGLAYV VASMGLALTS
SAEVFLGLRI LQACGASACL VSTFATVRDI YAGREESNVI YGILGSMLAM VPAVGPLLGA LVDMWLGWRA IFAFLGLGMI AASAAAWRFW PETRVQRVAG
LQWSQLLLPV KCLNFWLYTL CYAAGMGSFF VFFSIAPGLM MGRQGVSQLG FSLLFATVAI AMVFTARFMG RVIPKWGSPS VLRMGMGCLI AGAVLLAITE
IWALQSVLGF IAPMWLVGIG VATAVSVAPN GALRGFDHVA GTVTAVYFCL GGVLLGSIGT LIISLLPRNT AWPVVVYCLT LATVVLGLSC VSRVKGSRGQ
GEHDVVALQS AESTSNPNR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA2 (ARO:3002602) AadA2 In_Tn21.1 780 9112-9891 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   strict match to reference sequence for ARO:3002602 (bitscore: 520)
Protein Sequence:  
VTIEISNQLS EVLSVIERHL ESTLLAVHLY GSAVDGGLKP YSDIDLLVTV AVKLDETTRR ALLNDLMEAS AFPGESETLR AIEVTLVVHD DIIPWRYPAK
RELQFGEWQR NDILAGIFEP AMIDIDLAIL LTKAREHSVA LVGPAAEEFF DPVPEQDLFE ALRETLKLWN SQPDWAGDER NVVLTLSRIW YSAITGKIAP
KDVAADWAIK RLPAQYQPVL LEAKQAYLGQ KEDHLASRAD HLEEFIRFVK GEIIKSVGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
DUF1010 family protein DUF1010 family protein In_Tn21.1 291 9909-10199 -
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  DUF1010 (Pfam:PF06231)
Protein Sequence:  
MFIQTAFSFS GVIQCLFCLF SGLRLHGLRR FSVFLASSPC VASASSYRFC SAVPPRWRSV FSRLAPVAKF KLSVLASGSN ISVKPTRILR SAYLAR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
dfrA12 (ARO:3002858) DfrA12 In_Tn21.1 498 10311-10808 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Target:   diaminopyrimidine antibiotic (ARO:3000171)
Sequence Family:  trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218)
Comment:   100% identity with reference sequence for ARO:3002858 (bitscore: 339)
Protein Sequence:  
MNSESVRIYL VAAMGANRVI GNGPNIPWKI PGEQKIFRRL TEGKVVVMGR KTFESIGKPL PNRHTLVISR QANYRATGCV VVSTLSHAIA LASELGNELY
VAGGAEIYTL ALPHAHGVFL SEVHQTFEGD AFFPMLNETE FELVSTETIQ AVIPYTHSVY ARRNG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In_Tn21.1 1014 10953-11966 +
Class:   Integron Integrase
Sub Class:   Class 1
Function:   Integrase
Transposase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LPTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS26-MH257753 IS26 Insertion Sequence 1394-2213 820

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In_Tn21.1 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In_Tn21.1 49-67 AACACGTCGG TCGAGGACT
repeat t3 In_Tn21.1 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 In_Tn21.1 110-128 TCAATACTCG TGTGCACCA
IRL IS26 1394-1407 GGCACTGTTG CAAA
IRR IS26 2200-2213 AAACGTTGTC ACGG
repeat i4 In_Tn21.1 12049-12067 AGGAGGGACG CAGGCGACT
repeat i3 In_Tn21.1 12077-12095 CGTCGGGCAG CAACGGACT
repeat i2 In_Tn21.1 12119-12137 ATCACGTCAG CCGAAGACT
IRi In_Tn21.1 12136-12168 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT
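Some repeats in this table are printed exactly as they occur at their coordinates, while others appear to be printed as the base-by-base complement of that region. The sketch below shows one way to tell these cases apart for a given entry. It is an illustration under stated assumptions: `classify` and `region` are hypothetical helper names, and `SEQ_1_130` holds only positions 1-130 of the DNA sequence in this record, so only repeats t1 to t4 can be checked with it.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Positions 1-130 of the record's DNA sequence, spaces removed.
SEQ_1_130 = (
    "TGTCATTTTCAGAAGACGACTGCACCAGTTGATTGGGCGTAATGGCTGTT"
    "GTGCAGCCAGCTCCTGACAGTTCAATATCAGAAGTGATCTGCACCAATCT"
    "CGACTATGCTCAATACTCGTGTGCACCAAA"
)

def region(seq, start, end):
    """Slice a 1-based, inclusive coordinate range."""
    return seq[start - 1:end]

def classify(seq, start, end, listed):
    """Report how a listed repeat relates to the sequence at its coordinates."""
    top = region(seq, start, end)
    listed = listed.replace(" ", "")
    comp = top.translate(COMPLEMENT)
    if listed == top:
        return "top"
    if listed == comp:
        return "complement"
    if listed == comp[::-1]:
        return "reverse-complement"
    return "mismatch"

REPEATS = [
    ("repeat t1", 9, 27, "TCAGAAGACG ACTGCACCA"),
    ("repeat t2", 49, 67, "AACACGTCGG TCGAGGACT"),
    ("repeat t3", 78, 97, "TCAGAAGTGA TCTGCACCAA"),
    ("repeat t4", 110, 128, "TCAATACTCG TGTGCACCA"),
]
for name, start, end, listed in REPEATS:
    print(name, classify(SEQ_1_130, start, end, listed))
```

With the values above, t1, t3, and t4 match the sequence as written, and t2 matches its complement, which is consistent with t2 lying in the opposite orientation.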