Transposon
Name: In_Tn4
Family: Integron        Group: Class 1
Evidence of Transposition: no
 Host     

Host Organism:Salmonella enterica subsp. enterica serovar Paratyphi B Molecular Source:plasmid R1

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  9384 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCTG TTGAGTTGCA TCTAAAATTG 2700
ACCCACTTAG GGTAAAGATT TGCGTCGAAA TTTGACCCAC GTATGACACT GTTTCCCGTC TGGATATGGC GGGAGAAATC AAGGAGTGAT AAACGTGGCG 2800
ATATTGAGCG CAATTCGACG CTGGCATTTT CGCGATGGTG CGTCGATTCG GGAAATAGCC CGACGAAGCG GCCTGTCCAG GAACACCGTT CGCAAGTATT 2900
TGCAAAGCAA GGTGGTTGAA CCGCAGTACC CAGCGCGAGA CAGCGTTGGC AAGTTAAGTC CTTTTGAGCC CAAGTTAAGG CAGTGGCTCT CCACCGAGCA 3000
CAAAAAGACA AAGAAGCTGC GCAGAAACCT GCGCAGCATG TACCGGGATT TGGTCGCTTT GGGCTTTACC GGGTCTTATG ACCGAGTGTG TGCCTTTGCC 3100
CGACAGTGGA AAGATTCCGA ACAGTTCAAG GCGCAAACCT CGGGCAAGGG TTGTTTCATC CCCTTGCGCT TTGCTTGTGG CGAAGCCTTC CAATTCGATT 3200
GGAGTGAGGA CTTTGCCCGC ATAGCGGGCA AACAGGTCAA ACTTCAGATT GCCCAGTTTA AGTTGGCCCA CAGCCGGGCC TTTGTGCTTC GGGCTTACTA 3300
CCAGCAAAAA CATGAAATGC TGTTTGATGC CCACTGGCAT GCCTTTCAAA TCTTCGGTGG CATTCCCAAG CGCGGCATCT ACGACAACAT GAAGACCGCT 3400
GTGGATTCGG TGGGGCGTGG CAAAGAGCGC AGGGTCAATC AGCGGTTCAC TGCCATGGTC AGCCACTACC TGTTTGATGC GCAGTTCTGT AATCCAGCAT 3500
CGGGTTGGGA GAAAGGCCAG ATTGAGAAGA ACGTGCAGGA TTCCCGCCAA CGCCTGTGGC AAGGGGCACC AGACTTTCAA AGCCTTGCTG ATTTGAATGT 3600
GTGGCTTGAG CATCGCTGCA AAGCGCTGTG GTCTGAGCTG CGCCACCCCG AATTGGACCA AACCGTGCAA GAGGCCTTTG CCGATGAACA AGGCGAGTTG 3700
ATGGCGCTAC CCAATGCCTT TGATGCATTC GTGGAGCAAA CCAAGCGAGT CACTTCAACC TGCCTTGTTC ACCACGAGGG CAATCGCTAC AGCGTTCCTG 3800
CCAGTTACGC CAACAGGGCC ATCAGCCTTC GGATTTATGC AGACAAGCTG GTGATGGCTG CCGAAGGCCA ACACATTGCC GAGCATCCAA GATTGTTTGG 3900
CAGTGGCCAC GCTCGGCGTG GCCACACACA ATACGACTGG CACCATTACT TGTCTGTGCT TCAGAAGAAA CCTGGGGCGT TGCGCAATGG TGCGCCATTT 4000
GCTGAATTGC CACCCGCGTT CAAGAAGCTT CAATCCATCT TGCTGCAACG CCCCGGCGGT GACCGTGACA TGGTGGAAAT TCTGGCCCTT GTATTGCACC 4100
ACGATGAAGG TGCGGTACTC AGTGCTGTGG AATTGGCATT GGAGTGTGGC AAGCCATCGA AGGAGCATGT GCTTAATCTG TTGGGACGTT TGACCGAAGA 4200
ACCTCCACCC AAACCGATTC CAATTCCCAA GGGGTTAAGG CTGACATTGG AACCACAGGC CAACGTGAAC CGCTATGACA GTTTAAGGAG AGCCCATGAT 4300
GCAGCATGAA GGCCATGTGA GAATCCTCAA ATCCTTGAAA CTCTTTGGCA TGGCACACGC CATTGAGGAG TTGGGCAATC AGAATTCACC AGCATTTAAT 4400
CAAGCCTTGC CCATGCTGGA CAGCTTGATT AAAGCTGAAG TGGCAGAGCG TGAAGTACGT TCGGTGAACT ATCAATTGCG GGTGGCCAAG TTCCCCGTGT 4500
ATCGGGACTT GGTGGGCTTT GACTTCAGTC AAAGCCTGGT TAATGAGGCC ACGGTCAAAC AATTGCACCG GTGCGACTTC ATGGAACAAG CCCAGAACGT 4600
GGTGCTGATT GGTGGGCCAG GCACAGGCAA GACTCACCTG GCCACAGCCA TTGGTACACA AGCAGTGATG CACTTGAACC GACGGGTGCG TTTCTTCTCC 4700
ACCGTGGATT TGGTCAATGC ACTGGAGCAA GAGAAATCAT CTGGGCGTCA GGGACAAATC GCAAACCGTC TGTTGTATGC CGATTTGGTG ATTCTGGATG 4800
AGCTGGGATA TTTGCCTTTT AGCCAAACCG GTGGGGCACT GCTGTTTCAC CTGCTCTCAA AGCTGTACGA AAAAACCAGC GTGATACTGA CCACCAACTT 4900
GAGCTTCTCG GAATGGAGCC GAGTGTTTGG CGATGAAAAG ATGACAACAG CGTTGTTGGA CCGACTAACC CACCACTGCC ACATCCTGGA AACCGGCAAT 5000
GAAAGTTACC GCTTCAAACA CAGTTCAACT CAGAATAAGC AGGAGGAAAA ACAGACCCGC AAACTGAAAA TCGAGACATA ATTCTGACAA CAAGGGGTGG 5100
GTCAAAATTC AATGCAAATC CCGGGTCAAA TTTGGGTGCA AATCAACAGA TATCGACAAC CTCTCGCGCA ACCAAGACAT CGCGGTCGGA CTGCAAGTGA 5200
TCTTGAAGCC ACGGGCCCGT CCCACCCCGA CATGGACCTC GATGCCCGAA CGGACGTTAG ATTTCGAGTT CTAGGCGTTC TGCGATGAAG GTTGGATCCC 5300
AGCCGGGATT GAAAGTGTCG ACGTGGGTGA ATCCGAGCCG CTCGTATAGG CCACGCAGGT TCGGGTGGCA GTCGAGCCGC AGCTTGGCGC ACCCCTGCGT 5400
TCGCGCGGCA TGGCGGCAAG CCTCGATCAG CGCGGAGCTG ACACCCCGGC CCGCATGTGT CCGTCGCACC GCGAGCTTGT GCAGATATGC GGCCTCCCCC 5500
TTGAGGGCGT CGGGCCAGAA CTCGGGATCC TCGGCCGACA AGGTGCAACA GCCGACGATG CCGTCGCTGC AACTCGCGAC TAGGAGCTCG GATCTCAGGA 5600
CGAAGGTCTC CGCGAATGTC CGGTCGATCC GCGCGACGTC CCAGGCGGGC GTTCCCTTGG CGGACATCCA CGCCGCAGCG TCGTGCATCA GCCGCACAAC 5700
CTCGTCGATA TCACCCGAGC AGGCGACCCG AACGTTCGGA GGCTCCTCGC TGTCCATTCG CTCCCCTGGC GCGGTATGAA CCGCCGCCTC ATAGTGCAGT 5800
TTGATCCTGA CGAGCCCAGC ATGTCTGCGC CCACCTTCGC GGAACCTGAC CAGGGTCCGC TAGCGGGCGG CCGGAAGGTG AATGCTAGGC ATGATCTAAC 5900
CCTCGGTCTC TGGCGTCGCG ACTGCGAAAT TTCGCGAGGG TTTCCGAGAA GGTGATTGCG CTTCGCAGAT CTCCAGGCGC GTGGGTGCGG ACGTAGTCAG 6000
CGCCATTGCC GATCGCGTGA AGTTCCGCCG CAAGGCTCGC TGGACCCAGA TCCTTTACAG GAAGGCCAAC GGTGGCGCCC AAGAAGGATT TCCGCGACAC 6100
CGAGACCAAT AGCGGAAGCC CCAACGCCGA CTTCAGCTTT TGAAGGTTCG ACAGCACGTG CAGCGATGTT TCCGGTGCGG GGCTCAAGAA AAATCCCATC 6200
CCCGGATCGA GGATGAGCCG GTCGGCAGCG ACCCCGCTCC GTCGCAAGGC GGAAACCCGC GCCTCGAAGA ACCGCACAAT CTCGTCGAGC GCGTCTTCGG 6300
GTCGAAGGTG ACCGGTGCGG GTGGCGATGC CATCCCGCTG CGCTGAGTGC ATAACCACCA GCCTGCAGTC CGCCTCAGCA ATATCGGGAT AGAGCGCAGG 6400
GTCAGGAAAT CCTTGGATAT CGTTCAGGTA GCCCACGCCG CGCTTGAGCG CATAGCGCTG GGTTTCCGGT TGGAAGCTGT CGATTGAAAC ACGGTGCATC 6500
TGATCGGACA GGGCGTCTAA GAGCGGCGCA ATACGTCTGA TCTCATCGGC CGGCGATACA GGCCTCGCGT CCGGATGGCT GGCGGCCGGT CCGACATCCA 6600
CGACGTCTGA TCCGACTCGC AGCATTTCGA TCGCCGCGGT GACAGCGCCG GCGGGGTCTA GCCGCCGGCT CTCATCGAAG AAGGAGTCCT CGGTGAGATT 6700
CAGAATGCCG AACACCGTCA CCATGGCGTC GGCCTCCGCA GCGACTTCCA CGATGGGGAT CGGGCGAGCA AAAAGGCAGC AATTATGAGC CCCATACCTA 6800
CAAAGCCCCA CGCATCAAGC TTTTGCCCAT GAAGCAACCA GGCAATGGCT GTAATTATGA CGACGCCGAG TCCCGACCAG ACTGCATAAG CAACACCGAC 6900
AGGGATGGAT TTCAGAACCA GAGAAAGAAA ATAAAATGCG ATGCCATAAC CGATTATGAC AACGGCGGAA GGGGCAAGCT TAGTAAAGCC CTCGCTAGAT 7000
TTTAATGCGG ATGTTGCGAT TACTTCGCCA ACTATTGCGA TAACAAGAAA AAGCCAGCCT TTCATGATAT ATCTCCCAAT TTGTGTAGGG CTTATTATGC 7100
ACGCTTAAAA ATAATAAAAG CAGACTTGAC CTGATAGTTT GGCTGTGAGC AATTATGTGC TTAGTGCATC TAACGCTTGA GTTAAGCCGC GCCGCGAAGC 7200
GGCGTCGGCT TGAACGAATT GTTAGACATT ATTTGCCGAC TACCTTGGTG ATCTCGCCTT TCACGTAGTG GACAAATTCT TCCAACTGAT CTGCGCGCGA 7300
GGCCAAGCGA TCTTCTTCTT GTCCAAGATA AGCCTGTCTA GCTTCAAGTA TGACGGGCTG ATACTGGGCC GGCAGGCGCT CCATTGCCCA GTCGGCAGCG 7400
ACATCCTTCG GCGCGATTTT GCCGGTTACT GCGCTGTACC AAATGCGGGA CAACGTAAGC ACTACATTTC GCTCATCGCC AGCCCAGTCG GGCGGCGAGT 7500
TCCATAGCGT TAAGGTTTCA TTTAGCGCCT CAAATAGATC CTGTTCAGGA ACCGGATCAA AGAGTTCCTC CGCCGCTGGA CCTACCAAGG CAACGCTATG 7600
TTCTCTTGCT TTTGTCAGCA AGATAGCCAG ATCAATGTCG ATCGTGGCTG GCTCGAAGAT ACCTGCAAGA ATGTCATTGC GCTGCCATTC TCCAAATTGC 7700
AGTTCGCGCT TAGCTGGATA ACGCCACGGA ATGATGTCGT CGTGCACAAC AATGGTGACT TCTACAGCGC GGAGAATCTC GCTCTCTCCA GGGGAAGCCG 7800
AAGTTTCCAA AAGGTCGTTG ATCAAAGCTC GCCGCGTTGT TTCATCAAGC CTTACGGTCA CCGTAACCAG CAAATCAATA TCACTGTGTG GCTTCAGGCC 7900
GCCATCCACT GCGGAGCCGT ACAAATGTAC GGCCAGCAAC GTCGGTTCGA GATGGCGCTC GATGACGCCA ACTACCTCTG ATAGTTGAGT CGATACTTCG 8000
GCGATCACCG CTTCCCTCAT GATGTTTAAC TTTGTTTTAG GGCGACTGCC CTGCTGCGTA ACATCGTTGC TGCTCCATAA CATCAAACAT CGACCCACGG 8100
CGTAACGCGC TTGCTGCTTG GATGCCCGAG GCATAGACTG TACCCCAAAA AAACAGTCAT AACAAGCCAT GAAAACCGCC ACTGCGCCGT TACCACCGCT 8200
GCGTTCGGTC AAGGTTCTGG ACCAGTTGCG TGAGCGCATA CGCTACTTGC ATTACAGCTT ACGAACCGAA CAGGCTTATG TCCACTGGGT TCGTGCCTTC 8300
ATCCGTTTCC ACGGTGTGCG TCACCCGGCA ACCTTGGGCA GCAGCGAAGT CGAGGCATTT CTGTCCTGGC TGGCGAACGA GCGCAAGGTT TCGGTCTCCA 8400
CGCATCGTCA GGCATTGGCG GCCTTGCTGT TCTTCTACGG CAAGGTGCTG TGCACGGATC TGCCCTGGCT TCAGGAGATC GGAAGACCTC GGCCGTCGCG 8500
GCGCTTGCCG GTGGTGCTGA CCCCGGATGA AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG CGAGCATCGT TTGTTCGCCC AGCTTCTGTA TGGAACGGGC 8600
ATGCGGATCA GTGAGGGTTT GCAACTGCGG GTCAAGGATC TGGATTTCGA TCACGGCACG ATCATCGTGC GGGAGGGCAA GGGCTCCAAG GATCGGGCCT 8700
TGATGTTACC CGAGAGCTTG GCACCCAGCC TGCGCGAGCA GCTGTCGCGT GCACGGGCAT GGTGGCTGAA GGACCAGGCC GAGGGCCGCA GCGGCGTTGC 8800
GCTTCCCGAC GCCCTTGAGC GGAAGTATCC GCGCGCCGGG CATTCCTGGC CGTGGTTCTG GGTTTTTGCG CAGCACACGC ATTCGACCGA TCCACGGAGC 8900
GGTGTCGTGC GTCGCCATCA CATGTATGAC CAGACCTTTC AGCGCGCCTT CAAACGTGCC GTAGAACAAG CAGGCATCAC GAAGCCCGCC ACACCGCACA 9000
CCCTCCGCCA CTCGTTCGCG ACGGCCTTGC TCCGCAGCGG TTACGACATT CGAACCGTGC AGGATCTGCT CGGCCATTCC GACGTCTCTA CGACGATGAT 9100
TTACACGCAT GTGCTGAAAG TTGGCGGTGC CGGAGTGCGC TCACCGCTTG ATGCGCTGCC GCCCCTCACT AGTGAGAGGT AGGGCAGCGC AAGTCAATCC 9200
TGGCGGATTC ACTACCCCTG CGCGAAGGCC ATCGGTGCCG CATCGAACGG CCGGTTGCGG AAAGTCCTCC CTGCGTCCGC TGATGGCCGG CAGCAGCCCG 9300
TCGTTGCCTG ATGGATCCAA CCCCTCCGCT GCTATAGTGC AGTCGGCTTC TGACGTTCAG TGCAGCCGTC TTCTGAAAAC GACA

 Recombination Sites     

Name Coordinates Gene Sequence
attC qacH_sul1 core 5857-5890 34 CCGCTAGCGG GCGGCCGGAA GGTGAATGCT AGGC
attC aadA1a core 7174-7227 54 CGCTTGAGTT AAGCCGCGCC GCGAAGCGGC GTCGGCTTGA ACGAATTGTT AGAC

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA In_Tn4 106-1821 Transposase   +
tniB delta1 In_Tn4 1824-2684 Accessory Gene   +
istA IS1326 2786-4309 Transposase   +
istB IS1326 4296-5081 Accessory Gene ATPase Transposition Helper +
GNAT_fam In_Tn4 5257-5757 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In_Tn4 5885-6724 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In_Tn4 6718-7065 Passenger Gene Antibiotic Resistance -
aadA (ARO:3002601) In_Tn4 7229-8020 Passenger Gene Antibiotic Resistance -
intI1 In_Tn4 8169-9182 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA In_Tn4 1716 106-1821 +
Class:   Transposase
Function:   transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MLNTRVHQSE VSMATDTPRI PEQGVATLPD EAWERARRRA EIISPLAQSE TVGHEAADMA AQALGLSRRQ VYVLIRRARQ GSGLVTDLVP GQSGGGKGKG
RLPEPVERVI HELLQKRFLT KQKRSLAAFH REVTQVCKAQ KLRVPARNTV ALRIASLDPR KVIRRREGQD AARDLQGVGG EPPAVTAPLE QVQIDHTVID
LIVVDDRDRQ PIGRPYLTLA IDVFTRCVLG MVVTLEAPSA VSVGLCLVHV ACDKRPWLEG LNVEMDWQMS GKPLLLYLDN AAEFKSEALR RGCEQHGIRL
DYRPLGQPHY GGIVERIIGT AMQMIHDELP GTTFSNPDQR GDYDSENKAA LTLRELERWL TLAVGTYHGS VHNGLLQPPA ARWAEAVARV GVPAVVTRAT
SFLVDFLPIL RRTLTRTGFV IDHIHYYADA LKPWIARRER WPSFLIRRDP RDISRIWVLE PEGQHYLEIP YRTLSHPAVT LWEQRQALAK LRQQGREQVD
ESALFRMIGQ MREIVTSAQK ATRKARRDAD RRQHLKTSAR PDKPVPPDTD IADPQADNLP PAKPFDQIEE W

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB delta1 TniB delta1 In_Tn4 861 1824-2684 +
Class:   Accessory Gene
Function:   probable ATP-binding protein.
Comment:   probably truncated by insertion of IS1326::IS1353
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAC

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istA IstA IS1326 1524 2786-4309 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MINVAILSAI RRWHFRDGAS IREIARRSGL SRNTVRKYLQ SKVVEPQYPA RDSVGKLSPF EPKLRQWLST EHKKTKKLRR NLRSMYRDLV ALGFTGSYDR
VCAFARQWKD SEQFKAQTSG KGCFIPLRFA CGEAFQFDWS EDFARIAGKQ VKLQIAQFKL AHSRAFVLRA YYQQKHEMLF DAHWHAFQIF GGIPKRGIYD
NMKTAVDSVG RGKERRVNQR FTAMVSHYLF DAQFCNPASG WEKGQIEKNV QDSRQRLWQG APDFQSLADL NVWLEHRCKA LWSELRHPEL DQTVQEAFAD
EQGELMALPN AFDAFVEQTK RVTSTCLVHH EGNRYSVPAS YANRAISLRI YADKLVMAAE GQHIAEHPRL FGSGHARRGH TQYDWHHYLS VLQKKPGALR
NGAPFAELPP AFKKLQSILL QRPGGDRDMV EILALVLHHD EGAVLSAVEL ALECGKPSKE HVLNLLGRLT EEPPPKPIPI PKGLRLTLEP QANVNRYDSL
RRAHDAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istB IstB IS1326 786 4296-5081 +
Class:   Accessory Gene
Sub Class:   ATPase Transposition Helper
Function:   stimulates transposition
Protein Sequence:  
MMQHEGHVRI LKSLKLFGMA HAIEELGNQN SPAFNQALPM LDSLIKAEVA EREVRSVNYQ LRVAKFPVYR DLVGFDFSQS LVNEATVKQL HRCDFMEQAQ
NVVLIGGPGT GKTHLATAIG TQAVMHLNRR VRFFSTVDLV NALEQEKSSG RQGQIANRLL YADLVILDEL GYLPFSQTGG ALLFHLLSKL YEKTSVILTT
NLSFSEWSRV FGDEKMTTAL LDRLTHHCHI LETGNESYRF KHSSTQNKQE EKQTRKLKIE T

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In_Tn4 501 5257-5757 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  Acetyltransf_1 (Pfam:PF00583)
Comment:   putative acetyltransferase ADU64769.1
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In_Tn4 840 5885-6724 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transpoase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In_Tn4 348 6718-7065 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   acridine dye (ARO:3000054)||quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA (ARO:3002601) AadA In_Tn4 792 7229-8020 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Transpoase Chemistry:   aminoglycoside nucleotidyltransferase
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  ANT(3'') (ARO:3004275)
Comment:   perfect match to reference sequence for ARO:3002601||Synonyms: aadA1-pm, aadA, aadA1, aad(3'')(9)
Protein Sequence:  
MREAVIAEVS TQLSEVVGVI ERHLEPTLLA VHLYGSAVDG GLKPHSDIDL LVTVTVRLDE TTRRALINDL LETSASPGES EILRAVEVTI VVHDDIIPWR
YPAKRELQFG EWQRNDILAG IFEPATIDID LAILLTKARE HSVALVGPAA EELFDPVPEQ DLFEALNETL TLWNSPPDWA GDERNVVLTL SRIWYSAVTG
KIAPKDVAAD WAMERLPAQY QPVILEARQA YLGQEEDRLA SRADQLEEFV HYVKGEITKV VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In_Tn4 1014 8169-9182 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS1326-KY749247.1 IS1326 Insertion Sequence 2679-5148 2470

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In_Tn4 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In_Tn4 49-67 AACACGTCGG TCGAGGACT
repeat t3 In_Tn4 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 In_Tn4 110-128 TCAATACTCG TGTGCACCA
IRL IS1326 2679-2704 TGTTGAGTTG CATCTAAAAT TGACCC
IRR IS1326 5123-5148 CCCAGTTTAA ACCCACGTTT AGTTGT
repeat i4 In_Tn4 9265-9283 AGGAGGGACG CAGGCGACT
repeat i3 In_Tn4 9293-9311 CGTCGGGCAG CAACGGACT
repeat i2 In_Tn4 9335-9353 ATCACGTCAG CCGAAGACT
IRi In_Tn4 9352-9384 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT

 References