Transposon
Name: In0
Family: Tn402        Group: Class 1
Evidence of Transposition: yes
 Host     

Host Organism: Pseudomonas aeruginosa PAT        Molecular Source: plasmid pVS1
Date of Isolation: 1992

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA Sequence Length: 8525
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCTG TTGAGTTGCA TCTAAAATTG 2700
ACCCACTTAG GGTAAAGATT TGCGTCGAAA TTTGACCCAC GTATGACACT GTTTCCCGTC TGGATATGGC GGGAGAAATC AAGGAGTGAT AAACGTGGCG 2800
ATATTGAGCG CAATTCGACG CTGGCATTTT CGCGATGGTG CGTCGATTCG GGAAATAGCC CGACGAAGCG GCCTGTCCAG GAACACCGTT CGCAAGTATT 2900
TGCAAAGCAA GGTGGTTGAA CCGCAGTACC CAGCGCGAGA CAGCGTTGGC AAGTTAAGTC CTTTTGAGCC CAAGTTAAGG CAGTGGCTCT CCACCGAGCA 3000
CAAAAAGACA AAGAAGCTGC GCAGAAACCT GCGCAGCATG TACCGGGATT TGGTCGCTTT GGGCTTTACC GGGTCTTATG ACCGAGTGTG TGCCTTTGCC 3100
CGACAGTGGA AAGATTCCGA ACAGTTCAAG GCGCAAACCT CGGGCAAGGG TTGTTTCATC CCCTTGCGCT TTGCTTGTGG CGAAGCCTTC CAATTCGATT 3200
GGAGTGAGGA CTTTGCCCGC ATAGCGGGCA AACAGGTCAA ACTTCAGATT GCCCAGTTTA AGTTGGCCCA CAGCCGGGCC TTTGTGCTTC GGGCTTACTA 3300
CCAGCAAAAA CATGAAATGC TGTTTGATGC CCACTGGCAT GCCTTTCAAA TCTTCGGTGG CATTCCCAAG CGCGGCATCT ACGACAACAT GAAGACCGCT 3400
GTGGATTCGG TGGGGCGTGG CAAAGAGCGC AGGGTCAATC AGCGGTTCAC TGCCATGGTC AGCCACTACC TGTTTGATGC GCAGTTCTGT AATCCAGCAT 3500
CGGGTTGGGA GAAAGGCCAG ATTGAGAAGA ACGTGCAGGA TTCCCGCCAA CGCCTGTGGC AAGGGGCACC AGACTTTCAA AGCCTTGCTG ATTTGAATGT 3600
GTGGCTTGAG CATCGCTGCA AAGCGCTGTG GTCTGAGCTG CGCCACCCCG AATTGGACCA AACCGTGCAA GAGGCCTTTG CCGATGAACA AGGCGAGTTG 3700
ATGGCGCTAC CCAATGCCTT TGATGCATTC GTGGAGCAAA CCAAGCGAGT CACTTCAACC TGCCTTGTTC ACCACGAGGG CAATCGCTAC AGCGTTCCTG 3800
CCAGTTACGC CAACAGGGCC ATCAGCCTTC GGATTTATGC AGACAAGCTG GTGATGGCTG CCGAAGGCCA ACACATTGCC GAGCATCCAA GATTGTTTGG 3900
CAGTGGCCAC GCTCGGCGTG GCCACACACA ATACGACTGG CACCATTACT TGTCTGTGCT TCAGAAGAAA CCTGGGGCGT TGCGCAATGG TGCGCCATTT 4000
GCTGAATTGC CACCCGCGTT CAAGAAGCTT CAATCCATCT TGCTGCAACG CCCCGGCGGT GACCGTGACA TGGTGGAAAT TCTGGCCCTT GTATTGCACC 4100
ACGATGAAGG TGCGGTACTC AGTGCTGTGG AATTGGCATT GGAGTGTGGC AAGCCATCGA AGGAGCATGT GCTTAATCTG TTGGGACGTT TGACCGAAGA 4200
ACCTCCACCC AAACCGATTC CAATTCCCAA GGGGTTAAGG CTGACATTGG AACCACAGGC CAACGTGAAC CGCTATGACA GTTTAAGGAG AGCCCATGAT 4300
GCAGCATGAA GGCCATGTGA GAATCCTCAA ATCCTTGAAA CTCTTTGGCA TGGCACACGC CATTGAGGAG TTGGGCAATC AGAATTCACC AGCATTTAAT 4400
CAAGCCTTGC CCATGCTGGA CAGCTTGATT AAAGCTGAAG TGGCAGAGCG TGAAGTACGT TCGGTGAACT ATCAATTGCG GGTGGCCAAG TTCCCCGTGT 4500
ATCGGGACTT GGTGGGCTTT GACTTCAGTC AAAGCCTGGT TAATGAGGCC ACGGTCAAAC AATTGCACCG GTGCGACTTC ATGGAACAAG CCCAGAACGT 4600
GGTGCTGATT GGTGGGCCAG GCACAGGCAA GACTCACCTG GCCACAGCCA TTGGTACACA AGCAGTGATG CACTTGAACC GACGGGTGCG TTTCTTCTCC 4700
ACCGTGGATT TGGTCAATGC ACTGGAGCAA GAGAAATCAT CTGGGCGTCA GGGACAAATC GCAAACCGTC TGTTGTATGC CGATTTGGTG ATTCTGGATG 4800
AGCTGGGATA TTTGCCTTTT AGCCAAACCG GTGGGGCACT GCTGTTTCAC CTGCTCTCAA AGCTGTACGA AAAAACCAGC GTGATACTGA CCACCAACTT 4900
GAGCTTCTCG GAATGGAGCC GAGTGTTTGG CGATGAAAAG ATGACAACAG CGTTGTTGGA CCGACTAACC CACCACTGCC ACATCCTGGA AACCGGCAAT 5000
GAAAGTTACC GCTTCAAACA CAGTTCAACT CAGAATAAGC AGGAGGAAAA ACAGACCCGC AAACTGAAAA TCGAGACATA ATTCTGACAA CAAGGGGTGG 5100
GTCAAAATTC AATGCAAATC CCGGGTCAAA TTTGGGTGCA AATCAACAGA TATCGACAAC CTCTCGCGCA ACCAAGACAT CGCGGTCGGA CTGCAAGTGA 5200
TCTTGAAGCC ACGGGCCCGT CCCACCCCGA CATGGACCTC GATGCCCGAA CGGACGTTAG ATTTCGAGTT CTAGGCGTTC TGCGATGAAG GTTGGATCCC 5300
AGCCGGGATT GAAAGTGTCG ACGTGGGTGA ATCCGAGCCG CTCGTATAGG CCACGCAGGT TCGGGTGGCA GTCGAGCCGC AGCTTGGCGC ACCCCTGCGT 5400
TCGCGCGGCA TGGCGGCAAG CCTCGATCAG CGCGGAGCTG ACACCCCGGC CCGCATGTGT CCGTCGCACC GCGAGCTTGT GCAGATATGC GGCCTCCCCC 5500
TTGAGGGCGT CGGGCCAGAA CTCGGGATCC TCGGCCGACA AGGTGCAACA GCCGACGATG CCGTCGCTGC AACTCGCGAC TAGGAGCTCG GATCTCAGGA 5600
CGAAGGTCTC CGCGAATGTC CGGTCGATCC GCGCGACGTC CCAGGCGGGC GTTCCCTTGG CGGACATCCA CGCCGCAGCG TCGTGCATCA GCCGCACAAC 5700
CTCGTCGATA TCACCCGAGC AGGCGACCCG AACGTTCGGA GGCTCCTCGC TGTCCATTCG CTCCCCTGGC GCGGTATGAA CCGCCGCCTC ATAGTGCAGT 5800
TTGATCCTGA CGAGCCCAGC ATGTCTGCGC CCACCTTCGC GGAACCTGAC CAGGGTCCGC TAGCGGGCGG CCGGAAGGTG AATGCTAGGC ATGATCTAAC 5900
CCTCGGTCTC TGGCGTCGCG ACTGCGAAAT TTCGCGAGGG TTTCCGAGAA GGTGATTGCG CTTCGCAGAT CTCCAGGCGC GTGGGTGCGG ACGTAGTCAG 6000
CGCCATTGCC GATCGCGTGA AGTTCCGCCG CAAGGCTCGC TGGACCCAGA TCCTTTACAG GAAGGCCAAC GGTGGCGCCC AAGAAGGATT TCCGCGACAC 6100
CGAGACCAAT AGCGGAAGCC CCAACGCCGA CTTCAGCTTT TGAAGGTTCG ACAGCACGTG CAGCGATGTT TCCGGTGCGG GGCTCAAGAA AAATCCCATC 6200
CCCGGATCGA GGATGAGCCG GTCGGCAGCG ACCCCGCTCC GTCGCAAGGC GGAAACCCGC GCCTCGAAGA ACCGCACAAT CTCGTCGAGC GCGTCTTCGG 6300
GTCGAAGGTG ACCGGTGCGG GTGGCGATGC CATCCCGCTG CGCTGAGTGC ATAACCACCA GCCTGCAGTC CGCCTCAGCA ATATCGGGAT AGAGCGCAGG 6400
GTCAGGAAAT CCTTGGATAT CGTTCAGGTA GCCCACGCCG CGCTTGAGCG CATAGCGCTG GGTTTCCGGT TGGAAGCTGT CGATTGAAAC ACGGTGCATC 6500
TGATCGGACA GGGCGTCTAA GAGCGGCGCA ATACGTCTGA TCTCATCGGC CGGCGATACA GGCCTCGCGT CCGGATGGCT GGCGGCCGGT CCGACATCCA 6600
CGACGTCTGA TCCGACTCGC AGCATTTCGA TCGCCGCGGT GACAGCGCCG GCGGGGTCTA GCCGCCGGCT CTCATCGAAG AAGGAGTCCT CGGTGAGATT 6700
CAGAATGCCG AACACCGTCA CCATGGCGTC GGCCTCCGCA GCGACTTCCA CGATGGGGAT CGGGCGAGCA AAAAGGCAGC AATTATGAGC CCCATACCTA 6800
CAAAGCCCCA CGCATCAAGC TTTTGCCCAT GAAGCAACCA GGCAATGGCT GTAATTATGA CGACGCCGAG TCCCGACCAG ACTGCATAAG CAACACCGAC 6900
AGGGATGGAT TTCAGAACCA GAGAAAGAAA ATAAAATGCG ATGCCATAAC CGATTATGAC AACGGCGGAA GGGGCAAGCT TAGTAAAGCC CTCGCTAGAT 7000
TTTAATGCGG ATGTTGCGAT TACTTCGCCA ACTATTGCGA TAACAAGAAA AAGCCAGCCT TTCATGATAT ATCTCCCAAT TTGTGTAGGG CTTATTATGC 7100
ACGCTTAAAA ATAATAAAAG CAGACTTGAC CTGATAGTTT GGCTGTGAGC AATTATGTGC TTAGTGCATC TAACTTTGTT TTAGGGCGAC TGCCCTGCTG 7200
CGTAACATCG TTGCTGCTCC ATAACATCAA ACATCGACCC ACGGCGTAAC GCGCTTGCTG CTTGGATGCC CGAGGCATAG ACTGTACAAA AAAACAGTCA 7300
TAACAAGCCA TGAAAACCGC CACTGCGCCG TTACCACCGC TGCGTTCGGT CAAGGTTCTG GACCAGTTGC GTGAGCGCAT ACGCTACTTG CATTACAGCT 7400
TACGAACCGA ACAGGCTTAT GTCCACTGGG TTCGTGCCTT CATCCGTTTC CACGGTGTGC GTCACCCGGC AACCTTGGGC AGCAGCGAAG TCGAGGCATT 7500
TCTGTCCTGG CTGGCGAACG AGCGCAAGGT TTCGGTCTCC ACGCATCGTC AGGCATTGGC GGCCTTGCTG TTCTTCTACG GCAAGGTGCT GTGCACGGAT 7600
CTGCCCTGGC TTCAGGAGAT CGGAAGACCT CGGCCGTCGC GGCGCTTGCC GGTGGTGCTG ACCCCGGATG AAGTGGTTCG CATCCTCGGT TTTCTGGAAG 7700
GCGAGCATCG TTTGTTCGCC CAGCTTCTGT ATGGAACGGG CATGCGGATC AGTGAGGGTT TGCAACTGCG GGTCAAGGAT CTGGATTTCG ATCACGGCAC 7800
GATCATCGTG CGGGAGGGCA AGGGCTCCAA GGATCGGGCC TTGATGTTAC CCGAGAGCTT GGCACCCAGC CTGCGCGAGC AGCTGTCGCG TGCACGGGCA 7900
TGGTGGCTGA AGGACCAGGC CGAGGGCCGC AGCGGCGTTG CGCTTCCCGA CGCCCTTGAG CGGAAGTATC CGCGCGCCGG GCATTCCTGG CCGTGGTTCT 8000
GGGTTTTTGC GCAGCACACG CATTCGACCG ATCCACGGAG CGGTGTCGTG CGTCGCCATC ACATGTATGA CCAGACCTTT CAGCGCGCCT TCAAACGTGC 8100
CGTAGAACAA GCAGGCATCA CGAAGCCCGC CACACCGCAC ACCCTCCGCC ACTCGTTCGC GACGGCCTTG CTCCGCAGCG GTTACGACAT TCGAACCGTG 8200
CAGGATCTGC TCGGCCATTC CGACGTCTCT ACGACGATGA TTTACACGCA TGTGCTGAAA GTTGGCGGTG CCGGAGTGCG CTCACCGCTT GATGCGCTGC 8300
CGCCCCTCAC TAGTGAGAGG TAGGGCAGCG CAAGTCAATC CTGGCGGATT CACTACCCCT GCGCGAAGGC CATCGGTGCC GCATCGAACG GCCGGTTGCG 8400
GAAAGTCCTC CCTGCGTCCG CTGATGGCCG GCAGCAGCCC GTCGTTGCCT GATGGATCCA ACCCCTCCGC TGCTATAGTG CAGTCGGCTT CTGACGTTCA 8500
GTGCAGCCGT CTTCTGAAAA CGACA

 Recombination Sites     

Name Coordinates Gene Sequence
attC qacH_sul1 core 5857-5890 34 CCGCTAGCGG GCGGCCGGAA GGTGAATGCT AGGC
attI 7174-7229 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA
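The coordinates in this record are 1-based and inclusive, and each sequence row carries ten-base blocks followed by the coordinate of its last base. The sketch below shows one way to pull a feature out of that layout, using the row that ends at 5900 and the attC coordinates listed above. The helper names `parse_row` and `slice_feature` are hypothetical and exist only for this illustration.

```python
def parse_row(line: str) -> tuple[str, int]:
    """Split one formatted sequence row into (bases, coordinate of last base)."""
    *blocks, end = line.split()
    return "".join(blocks), int(end)


def slice_feature(bases: str, row_end: int, start: int, stop: int) -> str:
    """Return bases start..stop (1-based, inclusive) from a single row."""
    row_start = row_end - len(bases) + 1  # coordinate of the first base in the row
    return bases[start - row_start : stop - row_start + 1]


# Sequence row ending at coordinate 5900, copied from the record.
row = (
    "TTGATCCTGA CGAGCCCAGC ATGTCTGCGC CCACCTTCGC GGAACCTGAC "
    "CAGGGTCCGC TAGCGGGCGG CCGGAAGGTG AATGCTAGGC ATGATCTAAC 5900"
)

bases, row_end = parse_row(row)
attc = slice_feature(bases, row_end, 5857, 5890)
print(attc, len(attc))
```

The extracted string agrees with the attC sequence in the table once the spaces between blocks are removed. A feature that spans several rows would need the rows concatenated first; this sketch handles a single row only.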

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA In0 142-1821 Transposase   +
tniB delta1 In0 1824-2684 Accessory Gene   +
istA IS1326 2786-4309 Transposase   +
istB IS1326 4296-5081 Accessory Gene ATPase Transposition Helper +
GNAT_fam In0 5257-5757 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In0 5885-6724 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In0 6718-7065 Passenger Gene Antibiotic Resistance -
intI1 In0 7310-8323 Integron Integrase Class 1 +
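As a consistency check on the table above, the gene lengths reported under ORF Details should equal end - start + 1 for each coordinate pair, and each should be a multiple of three. The following is a minimal sketch with the values typed in by hand from this record; the dictionary layout and the name `span_length` are assumptions made for the example.

```python
# (start, end, reported gene length) copied from the ORF tables in this record.
orfs = {
    "tniA": (142, 1821, 1680),
    "tniB delta1": (1824, 2684, 861),
    "istA": (2786, 4309, 1524),
    "istB": (4296, 5081, 786),
    "GNAT_fam": (5257, 5757, 501),
    "sul1": (5885, 6724, 840),
    "qacEdelta1": (6718, 7065, 348),
    "intI1": (7310, 8323, 1014),
}


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Names whose reported length disagrees with their coordinates.
mismatches = [
    name for name, (start, end, length) in orfs.items()
    if span_length(start, end) != length
]

# Names whose length is not a whole number of codons.
not_codon_multiple = [
    name for name, (_, _, length) in orfs.items() if length % 3 != 0
]

print("length mismatches:", mismatches)
print("not a multiple of 3:", not_codon_multiple)
```

Both lists come out empty for the values in this record. The check also shows that istA and istB overlap by 14 bases (4296-4309), and sul1 and qacEdelta1 by 7 bases (6718-6724).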

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA In0 1680 142-1821 +
Class:   Transposase
Transposase Chemistry:   DDE
Comment:   can be extended upstream by 12 amino acids; identical to tniA (Tn1721 and In2); 25% amino acid sequence identity to TnsB from Tn7
Protein Sequence:  
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE
LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR
TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB delta1 TniB delta1 In0 861 1824-2684 +
Class:   Accessory Gene
Function:   probable ATP-binding protein.
Comment:   probably truncated by insertion of IS1326::IS1353
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAC

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istA IstA IS1326 1524 2786-4309 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MINVAILSAI RRWHFRDGAS IREIARRSGL SRNTVRKYLQ SKVVEPQYPA RDSVGKLSPF EPKLRQWLST EHKKTKKLRR NLRSMYRDLV ALGFTGSYDR
VCAFARQWKD SEQFKAQTSG KGCFIPLRFA CGEAFQFDWS EDFARIAGKQ VKLQIAQFKL AHSRAFVLRA YYQQKHEMLF DAHWHAFQIF GGIPKRGIYD
NMKTAVDSVG RGKERRVNQR FTAMVSHYLF DAQFCNPASG WEKGQIEKNV QDSRQRLWQG APDFQSLADL NVWLEHRCKA LWSELRHPEL DQTVQEAFAD
EQGELMALPN AFDAFVEQTK RVTSTCLVHH EGNRYSVPAS YANRAISLRI YADKLVMAAE GQHIAEHPRL FGSGHARRGH TQYDWHHYLS VLQKKPGALR
NGAPFAELPP AFKKLQSILL QRPGGDRDMV EILALVLHHD EGAVLSAVEL ALECGKPSKE HVLNLLGRLT EEPPPKPIPI PKGLRLTLEP QANVNRYDSL
RRAHDAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istB IstB IS1326 786 4296-5081 +
Class:   Accessory Gene
Sub Class:   ATPase Transposition Helper
Function:   stimulates transposition
Protein Sequence:  
MMQHEGHVRI LKSLKLFGMA HAIEELGNQN SPAFNQALPM LDSLIKAEVA EREVRSVNYQ LRVAKFPVYR DLVGFDFSQS LVNEATVKQL HRCDFMEQAQ
NVVLIGGPGT GKTHLATAIG TQAVMHLNRR VRFFSTVDLV NALEQEKSSG RQGQIANRLL YADLVILDEL GYLPFSQTGG ALLFHLLSKL YEKTSVILTT
NLSFSEWSRV FGDEKMTTAL LDRLTHHCHI LETGNESYRF KHSSTQNKQE EKQTRKLKIE T

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In0 501 5257-5757 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  Acetyltransf_1 (Pfam:PF00583)
Comment:   putative acetyltransferase ADU64769.1
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In0 840 5885-6724 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transposase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282); sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In0 348 6718-7065 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   disinfecting agents and antiseptics (ARO:3005386)
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump; perfect match to reference sequence for ARO:3005010 (bitscore: 219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In0 1014 7310-8323 +
Class:   Integron Integrase
Sub Class:   Class 1
Transposase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS1326-KY749247.1 IS1326 Insertion Sequence 2679-5148 2470

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In0 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In0 49-67 AACACGTCGG TCGAGGACT
repeat t3 In0 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 In0 110-128 TCAATACTCG TGTGCACCA
IRL IS1326 2679-2704 TGTTGAGTTG CATCTAAAAT TGACCC
IRR IS1326 5123-5148 CCCAGTTTAA ACCCACGTTT AGTTGT
repeat i4 In0 8406-8424 AGGAGGGACG CAGGCGACT
repeat i3 In0 8434-8452 CGTCGGGCAG CAACGGACT
repeat i2 In0 8476-8494 ATCACGTCAG CCGAAGACT
IRi In0 8493-8525 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT
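The outermost bases of the element can be compared directly to see how closely the two ends mirror each other. The sketch below takes the first 25 and last 25 bases of the DNA sequence in this record, reverse-complements the right end, and counts matching positions. The 25-base window is an assumption chosen for illustration, not a boundary stated in the record. It also checks how the IRi row relates to the top strand at 8493-8525.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, same left-to-right order."""
    return seq.translate(COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Complement read in the opposite direction."""
    return complement(seq)[::-1]


# Copied from the DNA sequence in this record.
left_end = "TGTCATTTTCAGAAGACGACTGCAC"           # bases 1-25
right_end = "GTGCAGCCGTCTTCTGAAAACGACA"          # bases 8501-8525
top_8493_8525 = "GACGTTCA" + right_end           # bases 8493-8525, top strand

# IRi as printed in the table, with block spaces removed.
iri_listed = "CTGCAAGTCACGTCGGCAGAAGACTTTTGCTGT"

right_rc = reverse_complement(right_end)
matches = sum(a == b for a, b in zip(left_end, right_rc))

print(left_end)
print(right_rc)
print(f"{matches}/{len(left_end)} positions match")
print("IRi row equals complement of top strand:",
      complement(top_8493_8525) == iri_listed)
```

With these inputs the two ends agree at 23 of 25 positions. The IRi row matches the base-by-base complement of the top strand at 8493-8525, which suggests that the inverted-orientation rows in this table show the complementary strand rather than the top strand.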

 References     

1. Recchia GD, Stokes HW, Hall RM. Characterisation of specific and secondary recombination sites recognised by the integron DNA integrase. Nucleic Acids Res. 1994 Jun 11;22(11):2071-8. doi: 10.1093/nar/22.11.2071. PubMed ID: 8029014
2. Hall RM, Brown HJ, Brookes DE, Stokes HW. Integrons found in different locations have identical 5' ends but variable 3' ends. J Bacteriol. 1994 Oct;176(20):6286-94. doi: 10.1128/jb.176.20.6286-6294.1994. PubMed ID: 7929000
3. Brown HJ, Stokes HW, Hall RM. The integrons In0, In2, and In5 are defective transposon derivatives. J Bacteriol. 1996 Aug;178(15):4429-37. doi: 10.1128/jb.178.15.4429-4437.1996. PubMed ID: 8755869