Transposon
Name: In2
Family: Integron        Group: Class 1
Evidence of Transposition: no
 Host     

Host Organism:Shigella flexneri Molecular Source:plasmid NR1 (R100)
Place of Origin:Japan Date of Isolation:1950s

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  11000 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCTG TTGAGTTGCA TCTAAAATTG 2700
ACCCACTGGG GGTGCGGACG ATTTCTTGGA CGGTTTATAC GGACATCAAT CCGACCGCAT GACGATACTC GATGGGACTA CGCCCGCCAA GCGACACTTT 2800
GATGCGGCGC TCGTTGTACC AGTGGATATA GGCATCGATT CGCGTCATGA GGTCTTTCAG CGTCACGTGC TGCCAATTCC TCGGGTAGAT TAGTTCGGTC 2900
TTCAATCGTC CGAAAAAGCC CTCGCATGCA GCATTGTCTG GCGAGCAGCC CTTTTTGGAC ATCGACCGCG TTAATTGGGC ATTTTCAGTG CGGCGGATCC 3000
ACGCAGGCCA GCGATAATGC GAGCCCCTGT CCGAATGGAT AACCGGATGC TCACCGGGTC GCAGTGTCCG TACCGCGTGA TCCAGCATGG TATTGACCAG 3100
GTTCGCATCC GGGCTGGTGC CGATATTCCA GGCCACCACC AGCCCATCGA AGCAATCGAC GATCGGCGAG ACGTAGACCT TCCCTGCCGG AATGTGTATT 3200
TCCGTCAGAT CGGTCAACCA TTTCGTATTC GGCGCCGACG CGTGAAAGTC GCGATTCAGC AGATTCGGGA CCGCTGGTGT CGGGTCGCCA GCATACGCCG 3300
AGAAGCGCCG GCGGCGCGGT GTTCTCACGA CCAGACGCTC TTGCGCCATC AAGCGACGCA CGACCTTCTC GGACACACGC ATGCCACCAA GGCGCAAGGC 3400
ACTATCAATG CGTCGATAGC CATAGCAGCG GTAGTTGTCC TCGAAGATAG TCCGAATGAC CTCACGCACC TGCGTGTACT TGTCGGGCCG CGTCTGCCGC 3500
AGGCGTTGAT AGAAGTATGT GCTGCGCGCC AGCTTCAGGC CGCACAACAG ATTGGCTAAT GGAAACGTGA CTCTGAGGGC ATCAACCACC TTCGTTTTTT 3600
CTCGGCTTGT CAGTTCGAGG GGGTTGATGC CCATGTCTTT TTTTATCAAT TCACTCGCCT TCTCCAGAAT TGCATTCTCC ATGCGAAGCC GCTGGTTCTG 3700
GCTCTCCAGT TCGGCCAGTT CCCTGAGTAG TGCCTCATGC CGCTGCTCGA GCGAGGTGTC ACCTTTCTTC TTTGTCATGG GTTTTAGGGG CACTTTGCCA 3800
AGTAATCGAT GCTGCCAGTT ATACAACGTT GGTCGCGATA CACCGACAGT GTCGGCCACA TCCTTTGCCG AACCTACGCG CAGGTTCAGT GCAATGACGG 3900
CTTGCTGCTT CTCGAGGCGA GAGCGGGCGA CTGTGGGAGC GCTGCTGCCG ACGACCGTCC TAGCGAATTC AGGGCGTAAA TCACGGATCC AGGCACGCAA 4000
GGCCTCGCGG CTTGGGTAGC CCAGGCTTCG GATTGTGTGA CTCAGGCAGT AGCCTTGTTC GATATAGTGA TCTACTGCCC GTTGCTTTTG CTCATCGGTG 4100
TACTGCCGTT TTATCCGTTG ATAGCCTCGG CGAAGATCCT GATTCCGTTC GAATTCTGCC AACCAGGCCT TCAGCGAGTT CTTGGTGGGG TATCCCAGCT 4200
GCCGTAGTGT GGCGCTCATC CGGCGCCCAA GCTTCAGGTA CAACCTCACG GCTCGAAGGC GATCTTCATA CGAATACATG AACTACTCCT AAAGTAGTCC 4300
AAGATTTTGT CCGCACCCCA ACTTAGGGTA AAGATTTGCG TCGAAATTTG ACCCACGTAT GACACTGTTT CCCGTCTGGA TATGGCGGGA GAAATCAAGG 4400
AGTGATAAAC GTGGCGATAT TGAGCGCAAT TCGACGCTGG CATTTTCGCG ATGGTGCGTC GATTCGGGAA ATAGCCCGAC GAAGCGGCCT GTCCAGGAAC 4500
ACCGTTCGCA AGTATTTGCA AAGCAAGGTG GTTGAACCGC AGTACCCAGC GCGAGACAGC GTTGGCAAGT TAAGTCCTTT TGAGCCCAAG TTAAGGCAGT 4600
GGCTCTCCAC CGAGCACAAA AAGACAAAGA AGCTGCGCAG AAACCTGCGC AGCATGTACC GGGATTTGGT CGCTTTGGGC TTTACCGGGT CTTATGACCG 4700
AGTGTGTGCC TTTGCCCGAC AGTGGAAAGA TTCCGAACAG TTCAAGGCGC AAACCTCGGG CAAGGGTTGT TTCATCCCCT TGCGCTTTGC TTGTGGCGAA 4800
GCCTTCCAAT TCGATTGGAG TGAGGACTTT GCCCGCATAG CGGGCAAACA GGTCAAACTT CAGATTGCCC AGTTTAAGTT GGCCCACAGC CGGGCCTTTG 4900
TGCTTCGGGC TTACTACCAG CAAAAACATG AAATGCTGTT TGATGCCCAC TGGCATGCCT TTCAAATCTT CGGTGGCATT CCCAAGCGCG GCATCTACGA 5000
CAACATGAAG ACCGCTGTGG ATTCGGTGGG GCGTGGCAAA GAGCGCAGGG TCAATCAGCG GTTCACTGCC ATGGTCAGCC ACTACCTGTT TGATGCGCAG 5100
TTCTGTAATC CAGCATCGGG TTGGGAGAAA GGCCAGATTG AGAAGAACGT GCAGGATTCC CGCCAACGCC TGTGGCAAGG GGCACCAGAC TTTCAAAGCC 5200
TTGCTGATTT GAATGTGTGG CTTGAGCATC GCTGCAAAGC GCTGTGGTCT GAGCTGCGCC ACCCCGAATT GGACCAAACC GTGCAAGAGG CCTTTGCCGA 5300
TGAACAAGGC GAGTTGATGG CGCTACCCAA TGCCTTTGAT GCATTCGTGG AGCAAACCAA GCGAGTCACT TCAACCTGCC TTGTTCACCA CGAGGGCAAT 5400
CGCTACAGCG TTCCTGCCAG TTACGCCAAC AGGGCCATCA GCCTTCGGAT TTATGCAGAC AAGCTGGTGA TGGCTGCCGA AGGCCAACAC ATTGCCGAGC 5500
ATCCAAGATT GTTTGGCAGT GGCCACGCTC GGCGTGGCCA CACACAATAC GACTGGCACC ATTACTTGTC TGTGCTTCAG AAGAAACCTG GGGCGTTGCG 5600
CAATGGTGCG CCATTTGCTG AATTGCCACC CGCGTTCAAG AAGCTTCAAT CCATCTTGCT GCAACGCCCC GGCGGTGACC GTGACATGGT GGAAATTCTG 5700
GCCCTTGTAT TGCACCACGA TGAAGGTGCG GTACTCAGTG CTGTGGAATT GGCATTGGAG TGTGGCAAGC CATCGAAGGA GCATGTGCTT AATCTGTTGG 5800
GACGTTTGAC CGAAGAACCT CCACCCAAAC CGATTCCAAT TCCCAAGGGG TTAAGGCTGA CATTGGAACC ACAGGCCAAC GTGAACCGCT ATGACAGTTT 5900
AAGGAGAGCC CATGATGCAG CATGAAGGCC ATGTGAGAAT CCTCAAATCC TTGAAACTCT TTGGCATGGC ACACGCCATT GAGGAGTTGG GCAATCAGAA 6000
TTCACCAGCA TTTAATCAAG CCTTGCCCAT GCTGGACAGC TTGATTAAAG CTGAAGTGGC AGAGCGTGAA GTACGTTCGG TGAACTATCA ATTGCGGGTG 6100
GCCAAGTTCC CCGTGTATCG GGACTTGGTG GGCTTTGACT TCAGTCAAAG CCTGGTTAAT GAGGCCACGG TCAAACAATT GCACCGGTGC GACTTCATGG 6200
AACAAGCCCA GAACGTGGTG CTGATTGGTG GGCCAGGCAC AGGCAAGACT CACCTGGCCA CAGCCATTGG TACACAAGCA GTGATGCACT TGAACCGACG 6300
GGTGCGTTTC TTCTCCACCG TGGATTTGGT CAATGCACTG GAGCAAGAGA AATCATCTGG GCGTCAGGGA CAAATCGCAA ACCGTCTGTT GTATGCCGAT 6400
TTGGTGATTC TGGATGAGCT GGGATATTTG CCTTTTAGCC AAACCGGTGG GGCACTGCTG TTTCACCTGC TCTCAAAGCT GTACGAAAAA ACCAGCGTGA 6500
TACTGACCAC CAACTTGAGC TTCTCGGAAT GGAGCCGAGT GTTTGGCGAT GAAAAGATGA CAACAGCGTT GTTGGACCGA CTAACCCACC ACTGCCACAT 6600
CCTGGAAACC GGCAATGAAA GTTACCGCTT CAAACACAGT TCAACTCAGA ATAAGCAGGA GGAAAAACAG ACCCGCAAAC TGAAAATCGA GACATAATTC 6700
TGACAACAAG GGGTGGGTCA AAATTCAATG CAAATCCCGG GTCAAATTTG GGTGCAAATC AACAGATATC GACAACCTCT CGCGCAACCA AGACATCGCG 6800
GTCGGACTGC AAGTGATCTT GAAGCCACGG GCCCGTCCCA CCCCGACATG GACCTCGATG CCCGAACGGA CGTTAGATTT CGAGTTCTAG GCGTTCTGCG 6900
ATGAAGGTTG GATCCCAGCC GGGATTGAAA GTGTCGACGT GGGTGAATCC GAGCCGCTCG TATAGGCCAC GCAGGTTCGG GTGGCAGTCG AGCCGCAGCT 7000
TGGCGCACCC CTGCGTTCGC GCGGCATGGC GGCAAGCCTC GATCAGCGCG GAGCTGACAC CCCGGCCCGC ATGTGTCCGT CGCACCGCGA GCTTGTGCAG 7100
ATATGCGGCC TCCCCCTTGA GGGCGTCGGG CCAGAACTCG GGATCCTCGG CCGACAAGGT GCAACAGCCG ACGATGCCGT CGCTGCAACT CGCGACTAGG 7200
AGCTCGGATC TCAGGACGAA GGTCTCCGCG AATGTCCGGT CGATCCGCGC GACGTCCCAG GCGGGCGTTC CCTTGGCGGA CATCCACGCC GCAGCGTCGT 7300
GCATCAGCCG CACAACCTCG TCGATATCAC CCGAGCAGGC GACCCGAACG TTCGGAGGCT CCTCGCTGTC CATTCGCTCC CCTGGCGCGG TATGAACCGC 7400
CGCCTCATAG TGCAGTTTGA TCCTGACGAG CCCAGCATGT CTGCGCCCAC CTTCGCGGAA CCTGACCAGG GTCCGCTAGC GGGCGGCCGG AAGGTGAATG 7500
CTAGGCATGA TCTAACCCTC GGTCTCTGGC GTCGCGACTG CGAAATTTCG CGAGGGTTTC CGAGAAGGTG ATTGCGCTTC GCAGATCTCC AGGCGCGTGG 7600
GTGCGGACGT AGTCAGCGCC ATTGCCGATC GCGTGAAGTT CCGCCGCAAG GCTCGCTGGA CCCAGATCCT TTACAGGAAG GCCAACGGTG GCGCCCAAGA 7700
AGGATTTCCG CGACACCGAG ACCAATAGCG GAAGCCCCAA CGCCGACTTC AGCTTTTGAA GGTTCGACAG CACGTGCAGC GATGTTTCCG GTGCGGGGCT 7800
CAAGAAAAAT CCCATCCCCG GATCGAGGAT GAGCCGGTCG GCAGCGACCC CGCTCCGTCG CAAGGCGGAA ACCCGCGCCT CGAAGAACCG CACAATCTCG 7900
TCGAGCGCGT CTTCGGGTCG AAGGTGACCG GTGCGGGTGG CGATGCCATC CCGCTGCGCT GAGTGCATAA CCACCAGCCT GCAGTCCGCC TCAGCAATAT 8000
CGGGATAGAG CGCAGGGTCA GGAAATCCTT GGATATCGTT CAGGTAGCCC ACGCCGCGCT TGAGCGCATA GCGCTGGGTT TCCGGTTGGA AGCTGTCGAT 8100
TGAAACACGG TGCATCTGAT CGGACAGGGC GTCTAAGAGC GGCGCAATAC GTCTGATCTC ATCGGCCGGC GATACAGGCC TCGCGTCCGG ATGGCTGGCG 8200
GCCGGTCCGA CATCCACGAC GTCTGATCCG ACTCGCAGCA TTTCGATCGC CGCGGTGACA GCGCCGGCGG GGTCTAGCCG CCGGCTCTCA TCGAAGAAGG 8300
AGTCCTCGGT GAGATTCAGA ATGCCGAACA CCGTCACCAT GGCGTCGGCC TCCGCAGCGA CTTCCACGAT GGGGATCGGG CGAGCAAAAA GGCAGCAATT 8400
ATGAGCCCCA TACCTACAAA GCCCCACGCA TCAAGCTTTT GCCCATGAAG CAACCAGGCA ATGGCTGTAA TTATGACGAC GCCGAGTCCC GACCAGACTG 8500
CATAAGCAAC ACCGACAGGG ATGGATTTCA GAACCAGAGA AAGAAAATAA AATGCGATGC CATAACCGAT TATGACAACG GCGGAAGGGG CAAGCTTAGT 8600
AAAGCCCTCG CTAGATTTTA ATGCGGATGT TGCGATTACT TCGCCAACTA TTGCGATAAC AAGAAAAAGC CAGCCTTTCA TGATATATCT CCCAATTTGT 8700
GTAGGGCTTA TTATGCACGC TTAAAAATAA TAAAAGCAGA CTTGACCTGA TAGTTTGGCT GTGAGCAATT ATGTGCTTAG TGCATCTAAC GCTTGAGTTA 8800
AGCCGCGCCG CGAAGCGGCG TCGGCTTGAA CGAATTGTTA GACATTATTT GCCGACTACC TTGGTGATCT CGCCTTTCAC GTAGTGGACA AATTCTTCCA 8900
ACTGATCTGC GCGCGAGGCC AAGCGATCTT CTTCTTGTCC AAGATAAGCC TGTCTAGCTT CAAGTATGAC GGGCTGATAC TGGGCCGGCA GGCGCTCCAT 9000
TGCCCAGTCG GCAGCGACAT CCTTCGGCGC GATTTTGCCG GTTACTGCGC TGTACCAAAT GCGGGACAAC GTAAGCACTA CATTTCGCTC ATCGCCAGCC 9100
CAGTCGGGCG GCGAGTTCCA TAGCGTTAAG GTTTCATTTA GCGCCTCAAA TAGATCCTGT TCAGGAACCG GATCAAAGAG TTCCTCCGCC GCTGGACCTA 9200
CCAAGGCAAC GCTATGTTCT CTTGCTTTTG TCAGCAAGAT AGCCAGATCA ATGTCGATCG TGGCTGGCTC GAAGATACCT GCAAGAATGT CATTGCGCTG 9300
CCATTCTCCA AATTGCAGTT CGCGCTTAGC TGGATAACGC CACGGAATGA TGTCGTCGTG CACAACAATG GTGACTTCTA CAGCGCGGAG AATCTCGCTC 9400
TCTCCAGGGG AAGCCGAAGT TTCCAAAAGG TCGTTGATCA AAGCTCGCCG CGTTGTTTCA TCAAGCCTTA CGGTCACCGT AACCAGCAAA TCAATATCAC 9500
TGTGTGGCTT CAGGCCGCCA TCCACTGCGG AGCCGTACAA ATGTACGGCC AGCAACGTCG GTTCGAGATG GCGCTCGATG ACGCCAACTA CCTCTGATAG 9600
TTGAGTCGAT ACTTCGGCGA TCACCGCTTC CCTCATGATG TTTAACTTTG TTTTAGGGCG ACTGCCCTGC TGCGTAACAT CGTTGCTGCT CCATAACATC 9700
AAACATCGAC CCACGGCGTA ACGCGCTTGC TGCTTGGATG CCCGAGGCAT AGACTGTACC CCAAAAAAAC AGTCATAACA AGCCATGAAA ACCGCCACTG 9800
CGCCGTTACC ACCGCTGCGT TCGGTCAAGG TTCTGGACCA GTTGCGTGAG CGCATACGCT ACTTGCATTA CAGCTTACGA ACCGAACAGG CTTATGTCCA 9900
CTGGGTTCGT GCCTTCATCC GTTTCCACGG TGTGCGTCAC CCGGCAACCT TGGGCAGCAG CGAAGTCGAG GCATTTCTGT CCTGGCTGGC GAACGAGCGC 10000
AAGGTTTCGG TCTCCACGCA TCGTCAGGCA TTGGCGGCCT TGCTGTTCTT CTACGGCAAG GTGCTGTGCA CGGATCTGCC CTGGCTTCAG GAGATCGGAA 10100
GACCTCGGCC GTCGCGGCGC TTGCCGGTGG TGCTGACCCC GGATGAAGTG GTTCGCATCC TCGGTTTTCT GGAAGGCGAG CATCGTTTGT TCGCCCAGCT 10200
TCTGTATGGA ACGGGCATGC GGATCAGTGA GGGTTTGCAA CTGCGGGTCA AGGATCTGGA TTTCGATCAC GGCACGATCA TCGTGCGGGA GGGCAAGGGC 10300
TCCAAGGATC GGGCCTTGAT GTTACCCGAG AGCTTGGCAC CCAGCCTGCG CGAGCAGCTG TCGCGTGCAC GGGCATGGTG GCTGAAGGAC CAGGCCGAGG 10400
GCCGCAGCGG CGTTGCGCTT CCCGACGCCC TTGAGCGGAA GTATCCGCGC GCCGGGCATT CCTGGCCGTG GTTCTGGGTT TTTGCGCAGC ACACGCATTC 10500
GACCGATCCA CGGAGCGGTG TCGTGCGTCG CCATCACATG TATGACCAGA CCTTTCAGCG CGCCTTCAAA CGTGCCGTAG AACAAGCAGG CATCACGAAG 10600
CCCGCCACAC CGCACACCCT CCGCCACTCG TTCGCGACGG CCTTGCTCCG CAGCGGTTAC GACATTCGAA CCGTGCAGGA TCTGCTCGGC CATTCCGACG 10700
TCTCTACGAC GATGATTTAC ACGCATGTGC TGAAAGTTGG CGGTGCCGGA GTGCGCTCAC CGCTTGATGC GCTGCCGCCC CTCACTAGTG AGAGGTAGGG 10800
CAGCGCAAGT CAATCCTGGC GGATTCACTA CCCCTGCGCG AAGGCCATCG GTGCCGCATC GAACGGCCGG TTGCGGAAAG TCCTCCCTGC GTCCGCTGAT 10900
GGCCGGCAGC AGCCCGTCGT TGCCTGATGG ATCCAACCCC TCCGCTGCTA TAGTGCAGTC GGCTTCTGAC GTTCAGTGCA GCCGTCTTCT GAAAACGACA 11000

 Recombination Sites     

Name Coordinates Gene Sequence
attC qacH_sul1 core 7473-7506 34 CCGCTAGCGG GCGGCCGGAA GGTGAATGCT AGGC
attC aadA1a core 8790-8843 54 CGCTTGAGTT AAGCCGCGCC GCGAAGCGGC GTCGGCTTGA ACGAATTGTT AGAC
attI 9646-9701 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA In2 142-1821 Transposase   +
tniB delta1 In2 1824-2684 Accessory Gene   +
tnp IS1353 2735-4279 Transposase   -
istA IS1326::IS1353 4402-5925 Transposase   +
istB IS1326::IS1353 5912-6697 Accessory Gene ATPase Transposition Helper +
GNAT_fam In2 6873-7373 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In2 7501-8340 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In2 8334-8681 Passenger Gene Antibiotic Resistance -
aadA (ARO:3002601) In2 8845-9636 Passenger Gene Antibiotic Resistance -
intI1 In2 9785-10798 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA In2 1680 142-1821 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   can be extended upstream by 12 amino acids| identical to tniA (Tn1721 and In2)| 25% amino acid sequence identity to TnsB from Tn7
Protein Sequence:  
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE
LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR
TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB delta1 TniB delta1 In2 861 1824-2684 +
Class:   Accessory Gene
Function:   probable ATP-binding protein.
Comment:   probably truncated by insertion of IS1326::IS1353
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAC

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS1353 1545 2735-4279 -
Class:   Transposase
Function:   transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MYSYEDRLRA VRLYLKLGRR MSATLRQLGY PTKNSLKAWL AEFERNQDLR RGYQRIKRQY TDEQKQRAVD HYIEQGYCLS HTIRSLGYPS REALRAWIRD
LRPEFARTVV GSSAPTVARS RLEKQQAVIA LNLRVGSAKD VADTVGVSRP TLYNWQHRLL GKVPLKPMTK KKGDTSLEQR HEALLRELAE LESQNQRLRM
ENAILEKASE LIKKDMGINP LELTSREKTK VVDALRVTFP LANLLCGLKL ARSTYFYQRL RQTRPDKYTQ VREVIRTIFE DNYRCYGYRR IDSALRLGGM
RVSEKVVRRL MAQERLVVRT PRRRRFSAYA GDPTPAVPNL LNRDFHASAP NTKWLTDLTE IHIPAGKVYV SPIVDCFDGL VVAWNIGTSP DANLVNTMLD
HAVRTLRPGE HPVIHSDRGS HYRWPAWIRR TENAQLTRSM SKKGCSPDNA ACEGFFGRLK TELIYPRNWQ HVTLKDLMTR IDAYIHWYNE RRIKVSLGGR
SPIEYRHAVG LMSV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istA IstA IS1326::IS1353 1524 4402-5925 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MINVAILSAI RRWHFRDGAS IREIARRSGL SRNTVRKYLQ SKVVEPQYPA RDSVGKLSPF EPKLRQWLST EHKKTKKLRR NLRSMYRDLV ALGFTGSYDR
VCAFARQWKD SEQFKAQTSG KGCFIPLRFA CGEAFQFDWS EDFARIAGKQ VKLQIAQFKL AHSRAFVLRA YYQQKHEMLF DAHWHAFQIF GGIPKRGIYD
NMKTAVDSVG RGKERRVNQR FTAMVSHYLF DAQFCNPASG WEKGQIEKNV QDSRQRLWQG APDFQSLADL NVWLEHRCKA LWSELRHPEL DQTVQEAFAD
EQGELMALPN AFDAFVEQTK RVTSTCLVHH EGNRYSVPAS YANRAISLRI YADKLVMAAE GQHIAEHPRL FGSGHARRGH TQYDWHHYLS VLQKKPGALR
NGAPFAELPP AFKKLQSILL QRPGGDRDMV EILALVLHHD EGAVLSAVEL ALECGKPSKE HVLNLLGRLT EEPPPKPIPI PKGLRLTLEP QANVNRYDSL
RRAHDAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istB IstB IS1326::IS1353 786 5912-6697 +
Class:   Accessory Gene
Sub Class:   ATPase Transposition Helper
Function:   stimulates transposition
Protein Sequence:  
MMQHEGHVRI LKSLKLFGMA HAIEELGNQN SPAFNQALPM LDSLIKAEVA EREVRSVNYQ LRVAKFPVYR DLVGFDFSQS LVNEATVKQL HRCDFMEQAQ
NVVLIGGPGT GKTHLATAIG TQAVMHLNRR VRFFSTVDLV NALEQEKSSG RQGQIANRLL YADLVILDEL GYLPFSQTGG ALLFHLLSKL YEKTSVILTT
NLSFSEWSRV FGDEKMTTAL LDRLTHHCHI LETGNESYRF KHSSTQNKQE EKQTRKLKIE T

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In2 501 6873-7373 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  GNAT
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In2 840 7501-8340 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In2 348 8334-8681 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   acridine dye (ARO:3000054)||quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA (ARO:3002601) AadA In2 792 8845-9636 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   aminoglycoside nucleotidyltransferase
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   perfect match to reference sequence for ARO:3002601||Synonyms: aadA1-pm, aadA, aadA1, aad(3'')(9)
Protein Sequence:  
MREAVIAEVS TQLSEVVGVI ERHLEPTLLA VHLYGSAVDG GLKPHSDIDL LVTVTVRLDE TTRRALINDL LETSASPGES EILRAVEVTI VVHDDIIPWR
YPAKRELQFG EWQRNDILAG IFEPATIDID LAILLTKARE HSVALVGPAA EELFDPVPEQ DLFEALNETL TLWNSPPDWA GDERNVVLTL SRIWYSAVTG
KIAPKDVAAD WAMERLPAQY QPVILEARQA YLGQEEDRLA SRADQLEEFV HYVKGEITKV VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In2 1014 9785-10798 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS1326_IS1353-AF071413 IS1326 Insertion Sequence 2679-6764 4086
IS1353-AF071413 IS1353 Insertion Sequence 2707-4320 1614

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In2 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In2 49-67 AACACGTCGG TCGAGGACT
repeat t3 In2 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 In2 110-128 TCAATACTCG TGTGCACCA
IRL IS1326::IS1353 2679-2704 TGTTGAGTTG CATCTAAAAT TGACCC
IRR IS1353 2707-2719 TGGGGGTGCG GAC
IRL IS1353 4309-4320 CAGGCGTGGG GT
IRR IS1326::IS1353 6739-6764 CCCAGTTTAA ACCCACGTTT AGTTGT
repeat i4 In2 10881-10899 AGGAGGGACG CAGGCGACT
repeat i3 In2 10909-10927 CGTCGGGCAG CAACGGACT
repeat i2 In2 10951-10969 ATCACGTCAG CCGAAGACT
IRi In2 10968-11000 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT

 References     

1.NAKAYA R, NAKAMURA A, MURATA Y. Resistance transfer agents in Shigella. Biochem Biophys Res Commun. 1960 Dec;3:654-9. doi: 10.1016/0006-291x(60)90081-4. PubMed ID: 13727669
2.Kopecko DJ, Brevet J, Cohen SN. Involvement of multiple translocating DNA segments and recombinational hotspots in the structural evolution of bacterial plasmids. J Mol Biol. 1976 Dec;108(2):333-60. PubMed ID: 796463
3.Diver WP, Grinsted J, Fritzinger DC, Brown NL, Altenbuchner J, Rogowsky P, Schmitt R. DNA sequences of and complementation by the tnpR genes of Tn21, Tn501 and Tn1721. Mol Gen Genet. 1983;191(2):189-93. PubMed ID: 6312271
4.Zheng ZX, Chandler M, Hipskind R, Clerget M, Caro L. Dissection of the r-determinant of the plasmid R100.1: the sequence at the extremities of Tn21. Nucleic Acids Res. 1981 Dec 11;9(23):6265-78. doi: 10.1093/nar/9.23.6265. PubMed ID: 6275355
5.Barrineau P, Gilbert P, Jackson WJ, Jones CS, Summers AO, Wisdom S. The DNA sequence of the mercury resistance operon of the IncFII plasmid NR1. J Mol Appl Genet. 1984;2(6):601-19. PubMed ID: 6530603
6.Misra TK, Brown NL, Fritzinger DC, Pridmore RD, Barnes WM, Haberstroh L, Silver S. Mercuric ion-resistance operons of plasmid R100 and transposon Tn501: the beginning of the operon including the regulatory region and the first two structural genes. Proc Natl Acad Sci U S A. 1984 Oct;81(19):5975-9. doi: 10.1073/pnas.81.19.5975. PubMed ID: 6091128
7.Misra TK, Brown NL, Haberstroh L, Schmidt A, Goddette D, Silver S. Mercuric reductase structural genes from plasmid R100 and transposon Tn501: functional domains of the enzyme. Gene. 1985;34(2-3):253-62. PubMed ID: 2989109
8.Rogowsky P, Halford SE, Schmitt R. Definition of three resolvase binding sites at the res loci of Tn21 and Tn1721. EMBO J. 1985 Aug;4(8):2135-41. PubMed ID: 2998784
9.Hyde DR, Tu CP. tnpM: a novel regulatory gene that enhances Tn21 transposition and suppresses cointegrate resolution. Cell. 1985 Sep;42(2):629-38. PubMed ID: 2992807
10.Brown NL, Misra TK, Winnie JN, Schmidt A, Seiff M, Silver S. The nucleotide sequence of the mercuric resistance operons of plasmid R100 and transposon Tn501: further evidence for mer genes which enhance the activity of the mercuric ion detoxification system. Mol Gen Genet. 1986 Jan;202(1):143-51. doi: 10.1007/BF00330531. PubMed ID: 3007931
11.Ward E, Grinsted J. The nucleotide sequence of the tnpA gene of Tn21. Nucleic Acids Res. 1987 Feb 25;15(4):1799-806. doi: 10.1093/nar/15.4.1799. PubMed ID: 3029727
12.Sundström L, Rådström P, Swedberg G, Sköld O. Site-specific recombination promotes linkage between trimethoprim. Mol Gen Genet. 1988 Aug;213(2-3):191-201. doi: 10.1007/BF00339581. PubMed ID: 3054482
13.Schmidt FR, Nucken EJ, Henschke RB. Structure and function of hot spots providing signals for site-directed specific recombination and gene expression in Tn21 transposons. Mol Microbiol. 1989 Nov;3(11):1545-55. PubMed ID: 2559298
14.Stokes HW, Hall RM. A novel family of potentially mobile DNA elements encoding site-specific gene-integration functions: integrons. Mol Microbiol. 1989 Dec;3(12):1669-83. doi: 10.1111/j.1365-2958.1989.tb00153.x. PubMed ID: 2560119
15.Guerineau F, Brooks L, Mullineaux P. Expression of the sulfonamide resistance gene from plasmid R46. Plasmid. 1990 Jan;23(1):35-41. PubMed ID: 2190244
16.Mercier J, Lachapelle J, Couture F, Lafond M, Vézina G, Boissinot M, Levesque RC. Structural and functional characterization of tnpI, a recombinase locus in Tn21 and related beta-lactamase transposons. J Bacteriol. 1990 Jul;172(7):3745-57. doi: 10.1128/jb.172.7.3745-3757.1990. PubMed ID: 2163386
17.Allmeier H, Cresnar B, Greck M, Schmitt R. Complete nucleotide sequence of Tn1721: gene organization and a novel gene product with features of a chemotaxis protein. Gene. 1992 Feb 1;111(1):11-20. PubMed ID: 1312499
18.Summers AO. Untwist and shout: a heavy metal-responsive transcriptional regulator. J Bacteriol. 1992 May;174(10):3097-101. doi: 10.1128/jb.174.10.3097-3101.1992. PubMed ID: 1577681
19.Lévesque C, Brassard S, Lapointe J, Roy PH. Diversity and relative strength of tandem promoters for the antibiotic-resistance genes of several integrons. Gene. 1994 May 3;142(1):49-54. doi: 10.1016/0378-1119(94)90353-0. PubMed ID: 8181756
20.Rådström P, Sköld O, Swedberg G, Flensburg J, Roy PH, Sundström L. Transposon Tn5090 of plasmid R751, which carries an integron, is related to Tn7, Mu, and the retroelements. J Bacteriol. 1994 Jun;176(11):3257-68. doi: 10.1128/jb.176.11.3257-3268.1994. PubMed ID: 8195081
21.Recchia GD, Stokes HW, Hall RM. Characterisation of specific and secondary recombination sites recognised by the integron DNA integrase. Nucleic Acids Res. 1994 Jun 11;22(11):2071-8. doi: 10.1093/nar/22.11.2071. PubMed ID: 8029014
22.Hall RM, Brown HJ, Brookes DE, Stokes HW. Integrons found in different locations have identical 5' ends but variable 3' ends. J Bacteriol. 1994 Oct;176(20):6286-94. doi: 10.1128/jb.176.20.6286-6294.1994. PubMed ID: 7929000
23.Recchia GD, Hall RM. Gene cassettes: a new class of mobile element. Microbiology. 1995 Dec;141 ( Pt 12):3015-27. doi: 10.1099/13500872-141-12-3015. PubMed ID: 8574395
24.Brown HJ, Stokes HW, Hall RM. The integrons In0, In2, and In5 are defective transposon derivatives. J Bacteriol. 1996 Aug;178(15):4429-37. doi: 10.1128/jb.178.15.4429-4437.1996. PubMed ID: 8755869
25.Collis CM, Kim MJ, Stokes HW, Hall RM. Binding of the purified integron DNA integrase Intl1 to integron. Mol Microbiol. 1998 Jul;29(2):477-90. doi: 10.1046/j.1365-2958.1998.00936.x. PubMed ID: 9720866
26.Gravel A, Fournier B, Roy PH. DNA complexes obtained with the integron integrase IntI1 at the attI1 site. Nucleic Acids Res. 1998 Oct 1;26(19):4347-55. doi: 10.1093/nar/26.19.4347. PubMed ID: 9742234
27.Liebert CA, Hall RM, Summers AO. Transposon Tn21, flagship of the floating genome. Microbiol Mol Biol Rev. 1999 Sep;63(3):507-22. PubMed ID: 10477306
28.Partridge SR, Hall RM. In34, a complex In5 family class 1 integron containing orf513 and dfrA10. Antimicrob Agents Chemother. 2003 Jan;47(1):342-9. doi: 10.1128/aac.47.1.342-349.2003. PubMed ID: 12499211
29.Kholodii G, Mindlin S, Petrova M, Minakhina S. Tn5060 from the Siberian permafrost is most closely related to the ancestor of Tn21 prior to integron acquisition. FEMS Microbiol Lett. 2003 Sep 26;226(2):251-5. doi: 10.1016/S0378-1097(03)00559-7. PubMed ID: 14553919