Transposon
Name: In4
Family: Integron        Group: Class 1
Evidence of Transposition: no
 Host     

Host Organism:Pseudomonas aeuroginosa Molecular Source:plasmid R1033
Place of Origin:Spain
Other Geographic Information:clinical strain

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  8330 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GAGGCTCTGT TGCAAAGATT GGCGGCAGTC AGAGGTAGGC TGTCGCTCTG 200
CGCCGATCAG GCGGCTGCTG CGAAATGGTG GTTGAGCATG CCCATGGCCT CCGTCAGCGC CGAGGGCCCA ATGCCAAAAG CTCTCTCCAC AAGGCGCACC 300
TCGCCCCTGA TGCCGGGCTG CAGGCACCAG GGGCGAGCCT GTCCTTTGCG CAGGGCTCGC ATGACTTCGA ATCCCTTGAT CGTGGCATAG GCCGTGGGGA 400
TCGATTTGAA ACCGCGCACC GGCTTGATCA GTATCTTGAG CTTTCCGTGA TCGGCCTCGA TCACGTTATT GAGGGCTCTG TTGCAAAGAT TGGCGGCAGT 500
CAGAGGTAGG CTGTCGCTCT GCGCCGATCA GGCGGCTGCT GCGAAATGGT GGTTGAGCAT GCCCATGGCC TCCGTCAGCG CCGAGGGCCC AATGCCAAAA 600
GCTCTCTCCA CAAGGCGCAC CTCGCCCCTG ATGCCGGGCT GCAGGCACCA GGGGCGAGCC TGTCCTTTGC GCAGGGCTCG CATGACTTCG AATCCCTTGA 700
TCGTGGCATA GGCCGTGGGG ATCGATTTGA AACCGCGCAC CGGCTTGATC AGTATCTTGA GCTTTCCGTG ATCGGCCTCG ATCACGTTAT TGAGATACTT 800
CACCTGCCGG TGGGCCGTCT CCCGGTCCAG CTTTCCTTCG CGCTTCAATT CGGTGATCGC TGCACCATAG CTCGGCGCTT TGTCGGTATT GAGCGTGGCA 900
GGCTTTTCCC AGTGCTTCAG GCCTCGCAGG GCCTTGCCCA GGAACCGCTT CGCTGCCTTG GCGCTGCGGG TCGGCGACAG GTAGAAATCG ATCGTGTCGC 1000
CCCGCTTGTC GACTGCCCGG TACAGGTAGG TCCACTTGCC CCGCACCTTG ACGTAGGTTT CATCCAGGCG CCAGCTCGGA TCAAAGCCAC GCCGCCAGAA 1100
CCAGCGCAGC CGCTTCTCCA TCTCCGGGGC GTAGCACTGG ACCCAGCGAT AGATCGTCGT ATGGTCGACC GAAATGCCGC GTTCCGCCAG CATTTCCTCA 1200
AGGTCGCGAT AGCTGATCGG ATAGCGACAA TACCAGCGCA CCGCCCACAG GATCACATCA CCCTGGAAAT GGCGCCACTT GAAATCCGTC ATCGTTCCGT 1300
CCGTCCAATC TCCGCCAAGC ATGCTCAAGC TTCACGATTT TTGCAACAGA GCCCACACGA GTATTGAGCA TAGTCGAGAT TGGTGCAGAT CACTTCTGAT 1400
ATTGAACTGT CAGGAGCTGG CTGCACAACA GCCATTACGC CCAATCAACT GGTGCAGTCG TCTTCTGAAA ATGACATTTG GTATCTCTCA TAAACGGATG 1500
TTTTTGAGAG AACTATCTTC GGCCTTCACA CGCACGAAAG GCGGCGAAGC TCCGCCGTTA ATCCGTCCGC CGGAGATCTC GCCCAGGCAG GCTGAAGGCC 1600
GAGCAAGCCT GACAGGCCCG AAAAGCCCGG CACGGGCGTC GGCGGCGATG ACGGCGGCGG CATTATCCAG GGTTGATGAT GGAAGTGGAG GATATCGACA 1700
ACCTCTCGCG CAACCAAGAC ATCGCGGTCG GACTGCAAGT GATCTTGAAG CCACGGGCCC GTCCCACCCC GACATGGACC TCGATGCCCG AACGGACGTT 1800
AGATTTCGAG TTCTAGGCGT TCTGCGATGA AGGTTGGATC CCAGCCGGGA TTGAAAGTGT CGACGTGGGT GAATCCGAGC CGCTCGTATA GGCCACGCAG 1900
GTTCGGGTGG CAGTCGAGCC GCAGCTTGGC GCACCCCTGC GTTCGCGCGG CATGGCGGCA AGCCTCGATC AGCGCGGAGC TGACACCCCG GCCCGCATGT 2000
GTCCGTCGCA CCGCGAGCTT GTGCAGATAT GCGGCCTCCC CCTTGAGGGC GTCGGGCCAG AACTCGGGAT CCTCGGCCGA CAAGGTGCAA CAGCCGACGA 2100
TGCCGTCGCT GCAACTCGCG ACTAGGAGCT CGGATCTCAG GACGAAGGTC TCCGCGAATG TCCGGTCGAT CCGCGCGACG TCCCAGGCGG GCGTTCCCTT 2200
GGCGGACATC CACGCCGCAG CGTCGTGCAT CAGCCGCACA ACCTCGTCGA TATCACCCGA GCAGGCGACC CGAACGTTCG GAGGCTCCTC GCTGTCCATT 2300
CGCTCCCCTG GCGCGGTATG AACCGCCGCC TCATAGTGCA GTTTGATCCT GACGAGCCCA GCATGTCTGC GCCCACCTTC GCGGAACCTG ACCAGGGTCC 2400
GCTAGCGGGC GGCCGGAAGG TGAATGCTAG GCATGATCTA ACCCTCGGTC TCTGGCGTCG CGACTGCGAA ATTTCGCGAG GGTTTCCGAG AAGGTGATTG 2500
CGCTTCGCAG ATCTCCAGGC GCGTGGGTGC GGACGTAGTC AGCGCCATTG CCGATCGCGT GAAGTTCCGC CGCAAGGCTC GCTGGACCCA GATCCTTTAC 2600
AGGAAGGCCA ACGGTGGCGC CCAAGAAGGA TTTCCGCGAC ACCGAGACCA ATAGCGGAAG CCCCAACGCC GACTTCAGCT TTTGAAGGTT CGACAGCACG 2700
TGCAGCGATG TTTCCGGTGC GGGGCTCAAG AAAAATCCCA TCCCCGGATC GAGGATGAGC CGGTCGGCAG CGACCCCGCT CCGTCGCAAG GCGGAAACCC 2800
GCGCCTCGAA GAACCGCACA ATCTCGTCGA GCGCGTCTTC GGGTCGAAGG TGACCGGTGC GGGTGGCGAT GCCATCCCGC TGCGCTGAGT GCATAACCAC 2900
CAGCCTGCAG TCCGCCTCAG CAATATCGGG ATAGAGCGCA GGGTCAGGAA ATCCTTGGAT ATCGTTCAGG TAGCCCACGC CGCGCTTGAG CGCATAGCGC 3000
TGGGTTTCCG GTTGGAAGCT GTCGATTGAA ACACGGTGCA TCTGATCGGA CAGGGCGTCT AAGAGCGGCG CAATACGTCT GATCTCATCG GCCGGCGATA 3100
CAGGCCTCGC GTCCGGATGG CTGGCGGCCG GTCCGACATC CACGACGTCT GATCCGACTC GCAGCATTTC GATCGCCGCG GTGACAGCGC CGGCGGGGTC 3200
TAGCCGCCGG CTCTCATCGA AGAAGGAGTC CTCGGTGAGA TTCAGAATGC CGAACACCGT CACCATGGCG TCGGCCTCCG CAGCGACTTC CACGATGGGG 3300
ATCGGGCGAG CAAAAAGGCA GCAATTATGA GCCCCATACC TACAAAGCCC CACGCATCAA GCTTTTGCCC ATGAAGCAAC CAGGCAATGG CTGTAATTAT 3400
GACGACGCCG AGTCCCGACC AGACTGCATA AGCAACACCG ACAGGGATGG ATTTCAGAAC CAGAGAAAGA AAATAAAATG CGATGCCATA ACCGATTATG 3500
ACAACGGCGG AAGGGGCAAG CTTAGTAAAG CCCTCGCTAG ATTTTAATGC GGATGTTGCG ATTACTTCGC CAACTATTGC GATAACAAGA AAAAGCCAGC 3600
CTTTCATGAT ATATCTCCCA ATTTGTGTAG GGCTTATTAT GCACGCTTAA AAATAATAAA AGCAGACTTG ACCTGATAGT TTGGCTGTGA GCAATTATGT 3700
GCTTAGTGCA TCTAACGCCT GAGCTCAGCC GACCGAAACC GCGTAGCGGT TTTGGGTCGG CTGCAGCGAT TTGTTGGGCG ATAGCTTGCC ACATTCTCTC 3800
AACGATTGGG ATTTGATGTA CTTTCCGCAC TTTGTAGCGC GACCACATCA TGCTCCCCCT GGCCGCGAGA GCCCTTCACT CGGGAAACAC AAGACAGACC 3900
GAGCACGACT GTTGCAAGGG TCAAACAGTA CACGACAACC GGCCAAGCCG TGTTGCGCGG CAACAGCGAA ATGATCAACG TTCCGATGCT TCCTAGCAGT 4000
ACACCGCCCA AGCAGAAGTA GACTGCCGTG ACCGTTCCAG CAACATGGTC GAATCCTCGA AGAGCGCCAT TGGGCGCCAC AGATACCGCT GTGGCGACAC 4100
CAATACCCAC TAGCCACATT GGAGCAATAA AGCCTAACAC GGACTGCAAA GCCCATATTT CGGTGATGGC AAGCAATACT GCTCCAGCTA TCAGGCATCC 4200
CATTCCCATT CGCAAGACAC TTGGGCTGCC CCACTTGGGT ATCACACGCC CCATAAAACG AGCCGTAAAC ACCATGGCAA TTGCCACTGT GGCGAACAGC 4300
AGGCTGAAGC CAAGCTGAGA CACACCTTGC CTGCCCATCA TTAGTCCGGG CGCAATGGAG AAAAAGACGA AGAAGCTACC CATTCCAGCG GCGTAACACA 4400
ACGTGTACAA CCAGAAGTTC AGGCACTTAA CGGGGAGTAG CAGCTGCGAC CATTGCAAGC CCGCAACTCG TTGCACCCGG GTTTCAGGCC AGAATCGCCA 4500
CGCTGCTGCA GATGCAGCGA TCATGCCCAA ACCTAGAAAC GCAAAGATAG CCCGCCACCC AAGCCACATG TCGACGAGCG CTCCGAGCAA TGGGCCTACC 4600
GCCGGGACCA TGGCCAGCAT GGATCCGAGT ATGCCGTAAA TGACATTACT TTCCTCGCGA CCTGCGTAAA TGTCACGTAC TGTTGCAAAT GTGGAAACAA 4700
GGCACGCCGA GGCACCACAA GCCTGAAGAA TCCGAAGCCC CAGAAAGACT TCAGCCGATG ACGTAAGAGC GAGGCCCATT GACGCCACAA CGTAGGCGAG 4800
GCCACCTCCC AGTAGAACGG GGCGGCGCCC CAGTCGGTCC GATAGCGGTC CAAACAAGAG CTGACCGGCA CCAATCATGA CCAAGTACGT TGTCAGCGTA 4900
AGCTGAATTG TGCTCGCTGT CGTACCAAGC GCGTTTGGCA TAAACGGCAC TGCTGGCAAG TACATGTCCA TGCCGAGTGA TGCCAATAAA TCGAACGGTG 5000
ATAACAACAA CACCGTGGCG GCAAGGGAGT ACCGCCAACT AAAGTTTTTT GAGCTCACGA AACCATCTCC TTGAACAAAG GATTTGGCGG CGCTCCCGTA 5100
GAGCAAAAGG ATTCATGAGA ACGCCGCAAC AACCGAAAAA TGAAGGTTGC TGCGGCTTAC TTGTCTGCGT TCTTGGAAGT GCTCATCTGC TGACTATCTC 5200
ATGATTGAAT TTGAATCGTA ACAACTTTCA AGCAACTCTG CAAGGAGCCT TATTGTGCGC CCAACGCCGG AGTTAAGCCG CCGCGCGTAG CGCGGTCGGC 5300
TTGAACGAAT TGTTAGACAT CATTTACCAA CTGACTTGAT GATCTCGCCT TTCACAAAGC GAATAAATTC TTCCAAGTGA TCTGCGCGTG AGGCCAAGTG 5400
ATCTTCTTTT TGTCCCAGAT AAGCTTGCTT AGCTTCAAGT AAGACGGGCT GATACTGGGC AGGTAGGCGT TTTATTGCCC AGTCGGCAGC GACATCCTTC 5500
GGCGCGATTT TGCCGGTTAT TGCGCTGTAC CAAATGCGGG ACAACGTAAG CACTACATTT CGCTCATCGC CGGCCCAGTC GGGCTGCGAG TTCCATAGCT 5600
TCAAGGTTTC CCTCAGCGCC TCGAATAGAT CCTGTTCAGG AACCGGGTCA AAGAATTCCT CCGCTGCCGG ACCTACCAAG GCAACGCTAT GTTCTCTTGC 5700
TTTTGTAAGC AGGATAGCTA GATCAATGTC GATCATGGCT GGCTCGAAGA TACCCGCAAG AATGTCATTG CGCTGCCATT CTCCAAATTG CAGCTCGCGC 5800
TTAGCCGGAT AACGCCACGG GATGATGTCG TCATGCACGA CAAGGGTGAC TTCTATAGCG CGGAGCGTCT CGCTCTCGCC AGGGAAAGCC GAAGCCTCCA 5900
TAAGGTCATT GAGCAATGCT CGCCGCGTCG TTTCATCAAG CTTTACGGCC ACAGTAACCA ACAAATCAAT ATCGCTGTAT GGCTTCAGGC CGCCATCCAC 6000
TGCGGAGCCG TACAAATGCA CGGCCAGCAA CGTTGATTCC AGATGGCGCT CAATGACGCT TAGCACCTCT GATAGTTGGT TCGAAATTTC GATGGTCACC 6100
GCTACCCTCA TGATGTCTAA CGGCCAAGGT AAGCGGCCCG CAGAATGCGG GTCCGCTTGA CCGCAGAGTT AGACCCGAAG GCCAAAAGGG AGCGCCCGGC 6200
CTTGAAGAAG GGCGCAGCCC AAGAAAAACC GCTGGACCAC GAAAGCGGCG CAGCGCTGCA AGAAGGATAG CTGGTGGCGC TTGCTGCACA AGCGCTGGAG 6300
GCCCAAAAGG CTAGAAAGCC TTTGGAACAG GGCGGAGCGC TGAAATAAGT GGCTGCGCGC ATAAAGCGGG TGATGGGACT AACGCCTGAG TTAAGCCGGA 6400
GCGCTTTGCG GCCGCGGCGT TGTGACAATT TACCGAACAA CTCCGCGGCC GGGAAGCCGA TCTCGGCTTG AACGAATTGT TAGGTGGCGG TACTTGGGTC 6500
GATATCAAAG TGCATCACTT CTTCCCGTAT GCCCAACTTT GTATAGAGAG CCACTGCGGG ATCGTCACCG TAATCTGCTT GCACGTAGAT CACATAAGCA 6600
CCAAGCGCGT TGGCCTCATG CTTGAGGAGA TTGATGAGCG CGGTGGCAAT GCCCTGCCTC CGGTGCTCGC CGGAGACTGC GAGATCATAG ATATAGATCT 6700
CACTACGCGG CTGCTCAAAC CTGGGCAGAA CGTAAGCCGC GAGAGCGCCA ACAACCGCTT CTTGGTCGAA GGCAGCAAGC GCGATGAATG TCTTACTACG 6800
GAGCAAGTTC CCGAGGTAAT CGGAGTCCGG CTGATGTTGG GAGTAGGTGG CTACGTCTCC GAACTCACGA CCGAAAAGAT CAAGAGCAGC CCGCATGGAT 6900
TTGACTTGGT CAGGGCCGAG CCTACATGTG CGAATGATGC CCATACTTGA GCCACCTAAC TTTGTTTTAG GGCGACTGCC CTGCTGCGTA ACATCGTTGC 7000
TGCTGCGTAA CATCGTTGCT GCTCCATAAC ATCAAACATC GACCCACGGC GTAACGCGCT TGCTGCTTGG ATGCCCGAGG CATAGACTGT ACAAAAAAAC 7100
AGTCATAACA AGCCATGAAA ACCGCCACTG CGCCGTTACC ACCGCTGCGT TCGGTCAAGG TTCTGGACCA GTTGCGTGAG CGCATACGCT ACTTGCATTA 7200
CAGTTTACGA ACCGAACAGG CTTATGTCAA CTGGGTTCGT GCCTTCATCC GTTTCCACGG TGTGCGTCAC CCGGCAACCT TGGGCAGCAG CGAAGTCGAG 7300
GCATTTCTGT CCTGGCTGGC GAACGAGCGC AAGGTTTCGG TCTCCACGCA TCGTCAGGCA TTGGCGGCCT TGCTGTTCTT CTACGGCAAG GTGCTGTGCA 7400
CGGATCTGCC CTGGCTTCAG GAGATCGGAA GACCTCGGCC GTCGCGGCGC TTGCCGGTGG TGCTGACCCC GGATGAAGTG GTTCGCATCC TCGGTTTTCT 7500
GGAAGGCGAG CATCGTTTGT TCGCCCAGCT TCTGTATGGA ACGGGCATGC GGATCAGTGA GGGTTTGCAA CTGCGGGTCA AGGATCTGGA TTTCGATCAC 7600
GGCACGATCA TCGTGCGGGA GGGCAAGGGC TCCAAGGATC GGGCCTTGAT GTTACCCGAG AGCTTGGCAC CCACGCTGCG CGAGCAGCTG TCGCGTGCAC 7700
GGGCATGGTG GCTGAAGGAC CAGGCCGAGG GCCGCAGCGG CGTTGCGCTT CCCGACGCCC TTGAGCGGAA GTATCCGCGC GCCGGGCATT CCTGGCCGTG 7800
GTTCTGGGTT TTTGCGCAGC ACACGCATTC GACCGATCCA CGGAGCGGTG TCGTGCGTCG CCATCACATG TATGACCAGA CCTTTCAGCG CGCCTTCAAA 7900
CGTGCCGTAG AACAAGCAGG CATCACGAAG CCCGCCACAC CGCACACCCT CCGCCACTCG TTCGCGACGG CCTTGCTCCG CAGCGGTTAC GACATTCGAA 8000
CCGTGCAGGA TCTGCTCGGC CATTCCGACG TCTCTACGAC GATGATTTAC ACGCATGTGC TGAAAGTTGG CGGTGCCGGA GTGCGCTCAC CGCTTGATGC 8100
GCTGCCGCCC CTCACTAGTG AGAGGTAGGG CAGCGCAAGT CAATCCTGGC GGATTCACTA CCCCTGCGCG AAGGCCATCG GTGCCGCATC GAACGGCCGG 8200
TTGCGGAAAG TCCTCCCTGC GTCCGCTGAT GGCCGGCAGC AGCCCGTCGT TGCCTGATGG ATCCAACCCC TCCGCTGCTA TAGTGCAGTC GGCTTCTGAC 8300
GTTCAGTGCA GCCGTCTTCT GAAAACGACA

 Recombination Sites     

Name Coordinates Gene Sequence
attC qacEdelta1_sul1 core 2399-2432 34 CCGCTAGCGG GCGGCCGGAA GGTGAATGCT AGGC
attC-cmlA6 5'-end 3716-3779 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT
GGGC
attC cmlA6 core 3716-3779 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT
GGGC
attC-cmlA6 3'-end 5259-5264 6 GCCCAA
attC-aadA3 5'-end 5265-5318 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC aadA3 core 5265-5318 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC-aadA3 3'-end 6115-6120 6 GTCTAA
attC-DUF1010 5'-end 6121-6174 54 CGGCCAAGGT AAGCGGCCCG CAGAATGCGG GTCCGCTTGA CCGCAGAGTT AGAC
attC DUF1010 core 6121-6174 54 CGGCCAAGGT AAGCGGCCCG CAGAATGCGG GTCCGCTTGA CCGCAGAGTT AGAC
attC-DUF1010 3'-end 6377-6382 6 GACTAA
attC-AAC(3)-Ia 5'-end 6383-6485 103 CGCCTGAGTT AAGCCGGAGC GCTTTGCGGC CGCGGCGTTG TGACAATTTA CCGAACAACT
CCGCGGCCGG GAAGCCGATC TCGGCTTGAA CGAATTGTTA GGT
attC AAC(3)-Ia core 6383-6485 103 CGCCTGAGTT AAGCCGGAGC GCTTTGCGGC CGCGGCGTTG TGACAATTTA CCGAACAACT
CCGCGGCCGG GAAGCCGATC TCGGCTTGAA CGAATTGTTA GGT
attC-AAC(3)-Ia 3'-end 6954-6959 6 ACCTAA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnp IS6100 528-1292 Transposase   -
WP_001424634.1 In4 1488-1775 Passenger Gene Hypothetical -
GNAT_fam In4 1799-2299 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In4 2427-3266 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In4 3260-3607 Passenger Gene Antibiotic Resistance -
cmlA6 (ARO:3002696) In4 3799-5058 Passenger Gene Antibiotic Resistance -
aadA3 (ARO:3002603) In4 5320-6111 Passenger Gene Antibiotic Resistance -
DUF1010 family protein In4 6117-6362 Passenger Gene Other -
AAC(3)-Ia (ARO:3002528) In4 6480-6944 Passenger Gene Antibiotic Resistance -
intI1 In4 7115-8128 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS6100 765 528-1292 -
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MTDFKWRHFQ GDVILWAVRW YCRYPISYRD LEEMLAERGI SVDHTTIYRW VQCYAPEMEK RLRWFWRRGF DPSWRLDETY VKVRGKWTYL YRAVDKRGDT
IDFYLSPTRS AKAAKRFLGK ALRGLKHWEK PATLNTDKAP SYGAAITELK REGKLDRETA HRQVKYLNNV IEADHGKLKI LIKPVRGFKS IPTAYATIKG
FEVMRALRKG QARPWCLQPG IRGEVRLVER AFGIGPSALT EAMGMLNHHF AAAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_001424634.1 WP_001424634.1 In4 288 1488-1775 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MSGWDGPVAS RSLAVRPRCL GCARGCRYPP LPSSTLDNAA AVIAADARAG LFGPVRLARP SACLGEISGG RINGGASPPF VRVKAEDSSL KNIRL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In4 501 1799-2299 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  GNAT
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In4 840 2427-3266 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In4 348 3260-3607 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   acridine dye (ARO:3000054)||quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
cmlA6 (ARO:3002696) CmlA6 In4 1260 3799-5058 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   phenicol antibiotic (ARO:3000387)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   strict match to reference sequence for ARO:3002696 (bitscore: 819)
Protein Sequence:  
MSSKNFSWRY SLAATVLLLS PFDLLASLGM DMYLPAVPFM PNALGTTAST IQLTLTTYLV MIGAGQLLFG PLSDRLGRRP VLLGGGLAYV VASMGLALTS
SAEVFLGLRI LQACGASACL VSTFATVRDI YAGREESNVI YGILGSMLAM VPAVGPLLGA LVDMWLGWRA IFAFLGLGMI AASAAAWRFW PETRVQRVAG
LQWSQLLLPV KCLNFWLYTL CYAAGMGSFF VFFSIAPGLM MGRQGVSQLG FSLLFATVAI AMVFTARFMG RVIPKWGSPS VLRMGMGCLI AGAVLLAITE
IWALQSVLGF IAPMWLVGIG VATAVSVAPN GALRGFDHVA GTVTAVYFCL GGVLLGSIGT LIISLLPRNT AWPVVVYCLT LATVVLGLSC VSRVKGSRGQ
GEHDVVALQS AESTSNPNR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA3 (ARO:3002603) AadA3 In4 792 5320-6111 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   strict match to reference sequence for ARO:3002603 (bitscore: 522)
Protein Sequence:  
MRVAVTIEIS NQLSEVLSVI ERHLESTLLA VHLYGSAVDG GLKPYSDIDL LVTVAVKLDE TTRRALLNDL MEASAFPGES ETLRAIEVTL VVHDDIIPWR
YPAKRELQFG EWQRNDILAG IFEPAMIDID LAILLTKARE HSVALVGPAA EEFFDPVPEQ DLFEALRETL KLWNSQPDWA GDERNVVLTL SRIWYSAITG
KIAPKDVAAD WAIKRLPAQY QPVLLEAKQA YLGQKEDHLA SRADHLEEFI RFVKGEIIKS VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
DUF1010 family protein DUF1010 family protein In4 246 6117-6362 -
Class:   Passenger Gene
Sub Class:   Other
Sequence Family:  DUF1010 (Pfam:PF06231)
Comment:   BLAST: WP_032492246.1
Protein Sequence:  
MRAATYFSAP PCSKGFLAFW ASSACAASAT SYPSCSAAPL SWSSGFSWAA PFFKAGRSLL AFGSNSAVKR TRILRAAYLG R

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
AAC(3)-Ia (ARO:3002528) AAC(3)-Ia In4 465 6480-6944 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  AAC(3) (ARO:3000322)
Comment:   perfect match to reference sequence for ARO:3002528||Synonyms: AAC(3)-Ia C, aacC1, aacC-A1
Protein Sequence:  
MGIIRTCRLG PDQVKSMRAA LDLFGREFGD VATYSQHQPD SDYLGNLLRS KTFIALAAFD QEAVVGALAA YVLPRFEQPR SEIYIYDLAV SGEHRRQGIA
TALINLLKHE ANALGAYVIY VQADYGDDPA VALYTKLGIR EEVMHFDIDP STAT

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In4 1014 7115-8128 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVNW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPTLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS6100-U12338.3 IS6100 Insertion Sequence 474-1353 880

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In4 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In4 49-67 AACACGTCGG TCGAGGACT
repeat t3 In4 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 Tn5053 110-128 TCAATACTCG TGTGCACCA
IR IS6100 153-166 GGCTCTGTTG CAAA
IRR IS6100 474-487 GGCTCTGTTG CAAA
IRL IS6100 1340-1353 AAACGTTGTC TCGG
repeat t3 In4 1380-1399 AACCACGTCT AGTGAAGACT
repeat t2 In4 1410-1428 TCAGGAGCTG GCTGCACAA
IRt In4 1444-1476 TAGTTGACCA CGTCAGCAGA AGACTTTTAC TGT
repeat t1 In4 1450-1468 ACCACGTCAG CAGAAGACT
repeat i4 In4 8211-8229 AGGAGGGACG CAGGCGACT
repeat i3 In4 8239-8257 CGTCGGGCAG CAACGGACT
repeat i2 In4 8281-8299 ATCACGTCAG CCGAAGACT
IRi In4 8298-8330 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT

 References     

1.Stokes HW, Hall RM. Sequence analysis of the inducible chloramphenicol resistance determinant in the Tn1696 integron suggests regulation by translational attenuation. Plasmid. 1991 Jul;26(1):10-9. doi: 10.1016/0147-619x(91)90032-r. PubMed ID: 1658833
2.Hall RM, Brown HJ, Brookes DE, Stokes HW. Integrons found in different locations have identical 5' ends but variable 3' ends. J Bacteriol. 1994 Oct;176(20):6286-94. doi: 10.1128/jb.176.20.6286-6294.1994. PubMed ID: 7929000
3.Smith DI, Lus RG, Rubio Calvo MC, Datta N, Jacob AE, Hedges RW. Third type of plasmid conferring gentamicin resistance in Pseudomonas aeruginosa. Antimicrob Agents Chemother. 1975 Sep;8(3):227-30. doi: 10.1128/aac.8.3.227. PubMed ID: 810080