Mobile element type:  Transposon
Name:  Tn21.1
Synonyms: 
Accession:  Tn21.1-MH257753
Family:  Tn3
Group:  Tn21
First isolate:  ND
Partial:  ND
Evidence of transposition:  Yes

Host Organism:  Salmonella enterica subsp. enterica serovar Typhimurium
Date of Isolation: 
Country: 
Molecular Source:  plasmid pST1007-1A

Name Coordinates Direction Length Sequence
IRL 1-38 + 38 GGGGGCACCT CAGAAAACGG AAAATAAAGC ACGCTAAG
IRR 21628-21668 - 41 GGGGTCGTCT CAGAAAACGG AAAATAAAGC ACGCTAAGCC G
repeat i4 21641-21659 - 19 TCAGAAAACG GAAAATAAA

Name Associated TE Coordinates Orientation Length Sequence
attC cmlA6 In_Tn21.1 12141-12204 , 13684-13689 + 70 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT GGGCGCCCAA
attC-cmlA6 5'-end In_Tn21.1 12141-12204 + 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT GGGC
attC cmlA6 core In_Tn21.1 12141-12204 + 64 CGCCTGAGCT CAGCCGACCG AAACCGCGTA GCGGTTTTGG GTCGGCTGCA GCGATTTGTT GGGC
attC-cmlA6 3'-end In_Tn21.1 13684-13689 + 6 GCCCAA
attC-aadA2 In_Tn21.1 13690-13743 , 14540-14545 + 60 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGACGTCTAA
attC-aadA2 5'-end In_Tn21.1 13690-13743 + 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC aadA3 core In_Tn21.1 13690-13743 + 54 CGCCGGAGTT AAGCCGCCGC GCGTAGCGCG GTCGGCTTGA ACGAATTGTT AGAC
attC-aadA2 3'-end In_Tn21.1 14540-14545 + 6 GTCTAA
attC orfD In_Tn21.1 14551-14598 + 48 GAGGTAAGCC GACCGCAGAA TGCGGGTCGG CTTGACCGAA ATGTTAGA
attC dfrA12 core In_Tn21.1 14866-14949 + 84 CGTTGGACGT AACGAGAGCC GGAGCGCAGC GGAGGGAACC AAAATGCGCA GCATTTTGGC GTCCCGTTGA CGGAATGGTT AGCC
attI In_Tn21.1 15450-15505 + 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA
res Tn21.1 17137-17267 + 131 GCCGCCGTCA GGTTGAGGCA TACCCTAACC TGATGTCAGA TGCCATGTGT AAATTGCGTC AGGATAGGAT TGAATTTTGA ATTTATTGAC ATATCTCGTT GAAGGTCATA GAGTCTTCCC TGACATTTTG C
res_site_I Tn21.1 17137-17175 + 39 GCCGCCGTCA GGTTGAGGCA TACCCTAACC TGATGTCAG
res_site_II Tn21.1 17189-17232 + 44 ATTGCGTCAG GATAGGATTG AATTTTGAAT TTATTGACAT ATCT
res_site_III Tn21.1 17236-17267 + 32 TGAAGGTCAT AGAGTCTTCC CTGACATTTT GC

ORF Summary
Name Associated TE Coordinates Orientation Class Subclass
merR Tn21.1 34-468 - Passenger Gene Heavy Metal Resistance
merT Tn21.1 540-890 + Passenger Gene Heavy Metal Resistance
merP Tn21.1 904-1179 + Passenger Gene Heavy Metal Resistance
merC Tn21.1 1215-1637 + Passenger Gene Heavy Metal Resistance
merA Tn21.1 1689-3383 + Passenger Gene Heavy Metal Resistance
merD Tn21.1 3401-3763 + Passenger Gene Heavy Metal Resistance
merE Tn21.1 3760-3996 + Passenger Gene Heavy Metal Resistance
urfM 5'-end Tn21.1 3993-4663 + Passenger Gene Other
urfM 5'-end Tn21.1 3993-4663 + Passenger Gene Other
tniA 5'-end In_Tn21.1 4739-6043 + Transposase
tnpA IS26 6090-6794 + Transposase
SDR family oxidoreductase In_Tn21.1 7252-8115 + Passenger Gene Other
GrpB domain protein In_Tn21.1 8153-8398 + Passenger Gene Other
sul3 (ARO:3000413) In_Tn21.1 8867-9658 + Passenger Gene Antibiotic Resistance
tnp IS256 family In_Tn21.1 9983-10591 + Transposase
qacL (ARO:3005098) In_Tn21.1 10838-11170 - Passenger Gene Antibiotic Resistance
aadA (ARO:3002601) In_Tn21.1 11340-12131 - Passenger Gene Antibiotic Resistance
cmlA6 (ARO:3002696) In_Tn21.1 12224-13483 - Passenger Gene Antibiotic Resistance
aadA2 (ARO:3002602) In_Tn21.1 13745-14524 - Passenger Gene Antibiotic Resistance
DUF1010 family protein In_Tn21.1 14542-14832 - Passenger Gene Other
dfrA12 (ARO:3002858) In_Tn21.1 14944-15441 - Passenger Gene Antibiotic Resistance
intI1 In_Tn21.1 15586-16599 + Integron Integrase Class 1
tnpM Tn21.1 16802-17152 + Accessory Gene Inhibitor
tnpR Tn21.1 17278-17300 , 18129-18666 + Accessory Gene Resolvase
tnpR 5'-end Tn21.1 17278-17300 + Accessory Gene Resolvase
tnpA IS26 17372-18076 + Transposase
tnpR 3'-end Tn21.1 18129-18666 + Accessory Gene Resolvase
tnpA Tn21.1 18669-21635 + Transposase
ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR Tn21.1 144 34-468 -
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: activator-repressor of mer operon
Target: Mercury
Protein Sequence: MENNLENLTI GVFAKAAGVN VETIRFYQRK GLLREPDKPY GSIRRYGEAD VVRVKFVKSA QRLGFSLDEI AELLRLDDGT HCEEASSLAE HKLKDVREKM ADLARMETVL SELVCACHAR KGNVSCPLIA SLQGEAGLAR SAMP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT Tn21.1 116 540-890 +
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: cytosolic mercuric ion transport protein
Target: Mercury
Protein Sequence: MSEPQNGRGA LFAGGLAAIL ASTCCLGPLV LVALGFSGAW IGNLTVLEPY RPLFIGAALV ALFFAWKRIY RPVQACKPGE VCAIPQVRAT YKLIFWIVAV LVLVALGFPY VVPFFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP Tn21.1 91 904-1179 +
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: mercury transport
Target: Mercury
Protein Sequence: MKKLFASLAL AAAVAPVWAA TQTVTLAVPG MTCAACPITV KKALSKVEGV SKVDVGFEKR EAVVTFDDTK ASVQKLTKAT ADAGYPSSVK Q

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merC MerC Tn21.1 140 1215-1637 +
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: transmembrane protein mercury transport
Target: Mercury
Protein Sequence: MGLMTRIADK TGALGSVVSA MGCAACFPAL ASFGAAIGLG FLSQYEGLFI SRLLPLFAAL AFLANALGWF SHRQWLRSLL GMIGPAIVFA ATVWLLGNWW TANLMYVGLA LMIGVSIWDF VSPAHRRCGP DGCELPAKRL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merA MerA Tn21.1 564 1689-3383 +
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: mercuric ion reductase
Target: Mercury
Protein Sequence: MSTLKITGMT CDSCAVHVKD ALEKVPGVQS ADVSYAKGSA KLAIEVGTSP DALTAAVAGL GYRATLADAP SVSTPGGLLD KMRDLLGRND KTGSSGALHI AVIGSGGAAM AAALKAVEQG ARVTLIERGT IGGTCVNVGC VPSKIMIRAA HIAHLRRESP FDGGIAATTP TIQRTALLAQ QQARVDELRH AKYEGILEGN PAITVLHGSA RFKDNRNLIV QLNDGGERVV AFDRCLIATG ASPAVPPIPG LKDTPYWTST EALVSETIPK RLAVIGSSVV ALELAQAFAR LGAKVTILAR STLFFREDPA IGEAVTAAFR MEGIEVREHT QASQVAYING EGDGEFVLTT AHGELRADKL LVATGRAPNT RKLALDATGV TLTPQGAIVI DPGMRTSVEH IYAAGDCTDQ PQFVYVAAAA GTRAAINMTG GDAALNLTAM PAVVFTDPQV ATVGYSEAEA HHDGIKTDSR TLTLDNVPRA LANFDTRGFI KLVVEEGSGR LIGVQAVAPE AGELIQTAAL AIRNRMTVQE LADQLFPYLT MVEGLKLAAQ TFNKDVKQLS CCAG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merD MerD Tn21.1 120 3401-3763 +
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: secondary regulatory protein
Target: Mercury
Protein Sequence: MSAYTVSQLA HNAGVSVHIV RDYLVRGLLR PVACTTGGYG VFDDAALQRL CFVRAAFEAG IGLDALARLC RALDAADGAQ AAAQLAVLRQ LVERRRAALA HLDAQLASMP AERAHEEALP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merE MerE Tn21.1 78 3760-3996 +
Class: Passenger Gene
Subclass: Heavy Metal Resistance
Function: mercury transport
Target: Mercury
Comment: similar to urf-1 in pKLH2 (GenBank AF213017), pKLH272 (Genbank Y08992), pMER610 (GenBank Y08993), pKLH210 (GenBank Y10102), Tn5036 (Genbank Y09025), orf1 in Tn501 (GenBank Z00027), and urf-1 in Tn5041 (GenBank X98999)
Protein Sequence: MNAPDKLPPE TRQPVSGYLW GALAVLTCPC HLPILAAVLA GTTAGAFLGE HWGVAALALT GLFVLAVTRL LRAFRGGS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM 5'-end N/A Tn21.1 223 3993-4663 +
Class: Passenger Gene
Subclass: Other
Comment: urfMORF interrupted by insertion of In2
Protein Sequence: MTSSQPAGWT AAELAQAAAR GQLDLHYQPL VDLRDHRIAG AEALMRWRHP RLGLLPPGQF LPLAESFGLM PEIGAWVLGE ACRQMHKWQG PAWQPFRLAI NVSASQVGPT FDDEVKRVLA DMALPAELLE IELTESVAFG NPALFASFDA LRAIGVRFAA DDFGTGYSCL QHLKCCPITT LKIDQSFVAR LPDDARDQTI VRAVIQLAHG LGMDVIFRRR LHQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM 5'-end N/A Tn21.1 223 3993-4663 +
Class: Passenger Gene
Subclass: Other
Comment: urfMORF interrupted by insertion of In2
Protein Sequence: MTSSQPAGWT AAELAQAAAR GQLDLHYQPL VDLRDHRIAG AEALMRWRHP RLGLLPPGQF LPLAESFGLM PEIGAWVLGE ACRQMHKWQG PAWQPFRLAI NVSASQVGPT FDDEVKRVLA DMALPAELLE IELTESVAFG NPALFASFDA LRAIGVRFAA DDFGTGYSCL QHLKCCPITT LKIDQSFVAR LPDDARDQTI VRAVIQLAHG LGMDVIFRRR LHQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA 5'-end TniA 5'-end In_Tn21.1 434 4739-6043 +
Class: Transposase
Function: integrase
Transposase Chemistry: DDE
Comment: Contains the first 429 amino acids of tniA (In2)||probably truncated by insertion of IS26
Protein Sequence: MLNTRVHQSE VSMATDTPRI PEQGVATLPD EAWERARRRA EIISPLAQSE TVGHEAADMA AQALGLSRRQ VYVLIRRARQ GSGLVTDLVP GQSGGGKGKG RLPEPVERVI HELLQKRFLT KQKRSLAAFH REVTQVCKAQ KLRVPARNTV ALRIASLDPR KVIRRREGQD AARDLQGVGG EPPAVTAPLE QVQIDHTVID LIVVDDRDRQ PIGRPYLTLA IDVFTRCVLG MVVTLEAPSA VSVGLCLVHV ACDKRPWLEG LNVEMDWQMS GKPLLLYLDN AAEFKSEALR RGCEQHGIRL DYRPLGQPHY GGIVERIIGT AMQMIHDELP GTTFSNPDQR GDYDSENKAA LTLRELERWL TLAVGTYHGS VHNGLLQPPA ARWAEAVARV GVPAVVTRAT SFLVDFLPIL RRTLTRTGFV IDHIHYYADG HCCK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA IS26 234 6090-6794 +
Class: Transposase
Transposase Chemistry: DDE
Protein Sequence: MNPFKGRHFQ RDIILWAVRW YCKYGISYRE LQEMLAERGV NVDHSTIYRW VQRYAPEMEK RLRWYWRNPS DLCPWHMDET YVKVNGRWAY LYRAVDSRGR TVDFYLSSRR NSKAAYRFLG KILNNVKKWQ IPRFINTDKA PAYGRALALL KREGRCPSDV EHRQIKYRNN VIECDHGKLK RIIGATLGFK SMKTAYATIK GIEVMRALRK GQASAFYYGD PLGEMRLVSR VFEM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
SDR family oxidoreductase SDR family oxidoreductase In_Tn21.1 287 7252-8115 +
Class: Passenger Gene
Subclass: Other
Sequence Family: WP_000612791.1
Protein Sequence: MIPNSENKRV WFITGASKGL GYAFTCAALK AGDKVVAVAR TIDNLAKLEE TYQESLLPLN LDVTDREAVF STVETAVKHF GRLDIVVNNA GIMTMGMIEE LNESDARKLM DTNFFGALWV CQAVMPYLRS QRSGHIIQIT SIGAIISGPM SGIYSASKFA LEGMSEALAK EAEHFGVKLT MVEPGGYWTD LYTSMSYSNP LDSYGTLRDE LAKQYSEDSV DSDPSLAAEA LMKLVASNNP PLRLILGSMV YDLAMDTLKA RMATWEEWEA VSRASEKAIP APERYGV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GrpB domain protein GrpB domain protein In_Tn21.1 81 8153-8398 +
Class: Passenger Gene
Subclass: Other
Sequence Family: GrpB (Pfam:PF04229)
Protein Sequence: MKIEIMEYNP DWTKNFEEEK IKLLHFFGSH AVAIEHIGST AIPNQRAKPV IDIFIGVSPF AELPFISAFL MQRSITTLRQ I

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul3 (ARO:3000413) Sul3 In_Tn21.1 263 8867-9658 +
Class: Passenger Gene
Subclass: Antibiotic Resistance
Function: antibiotic target replacement (ARO:0001002)
Sequence Family: sulfonamide resistant sul (ARO:3004238)
Target: sulfone antibiotic (ARO:3003401)||sulfonamide antibiotic (ARO:3000282)
Comment: perfect match to reference sequence for ARO:3000413
Protein Sequence: MSKIFGIVNI TTDSFSDGGL YLDTDKAIEH ALHLVEDGAD VIDLGAASSN PDTTEVGVVE EIKRLKPVIK ALKEKGISIS VDTFKPEVQS FCIEQKVDFI NDIQGFPYPE IYSGLAKSDC KLVLMHSVQR IGAATKVETN PEEVFTSMME FFKERIAALV EAGVKRERII LDPGMGFFLG SNPETSILVL KRFPEIQEAF NLQVMIAVSR KSFLGKITGT DVKSRLAPTL AAEMYAYKKG ADYLRTHDVK SLSDALKISK ALG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp IS256 family Tnp IS256 family In_Tn21.1 202 9983-10591 +
Class: Transposase
Function: tranposase
Transposase Chemistry: DDE
Protein Sequence: MEFYPSCIEK GMRSERALKL AIAEMYVKGV STRRVSDIVE ILCGTEVSSS QVSRLAKELD EEITSWKAQP VGQIQYLVLD ATYESVRVGS HVVKQALLVA IGVDYSGNRH ILDAEVANSE AEVNWRSFLE GLVRRGMHGL RMITSDDHSG LRAAIDAVFP GILWQRCQFH LQQNAHSYVT KKDEIPLIAA DIRKVFNRNM SR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacL (ARO:3005098) QacL In_Tn21.1 110 10838-11170 -
Class: Passenger Gene
Subclass: Antibiotic Resistance
Function: antibiotic efflux (ARO:0010000)
Sequence Family: small multidrug resistance (SMR) antibiotic efflux pump (ARO:0010003)
Target: disinfecting agents and antiseptics(ARO:3005386)
Comment: subunit of the qac multidrug efflux pump||strict match to reference sequence for ARO:3005098 (bitscore: 202)
Protein Sequence: MKNWLFLAIA IFGEVVATSA LKSSHGFTKL VPSVVVVAGY GLAFYFLSLA LKSIPVGIAY AVWAGLGIVL VAAIAWIFHG QKLDLWAFVG MGLIVSGVAV LNLLSKVSAH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA (ARO:3002601) AadA In_Tn21.1 263 11340-12131 -
Class: Passenger Gene
Subclass: Antibiotic Resistance
Function: antibiotic inactivation (ARO:0001004)
Sequence Family: ANT(3'') (ARO:3004275)
Transposase Chemistry: aminoglycoside nucleotidyltransferase
Target: aminoglycoside antibiotic (ARO:0000016)
Comment: perfect match to reference sequence for ARO:3002601||Synonyms: aadA1-pm aadA, aadA1, aad(3'')(9)
Protein Sequence: MREAVIAEVS TQLSEVVGVI ERHLEPTLLA VHLYGSAVDG GLKPHSDIDL LVTVTVRLDE TTRRALINDL LETSASPGES EILRAVEVTI VVHDDIIPWR YPAKRELQFG EWQRNDILAG IFEPATIDID LAILLTKARE HSVALVGPAA EELFDPVPEQ DLFEALNETL TLWNSPPDWA GDERNVVLTL SRIWYSAVTG KIAPKDVAAD WAMERLPAQY QPVILEARQA YLGQEEDRLA SRADQLEEFV HYVKGEITKV VGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
cmlA6 (ARO:3002696) CmlA6 In_Tn21.1 419 12224-13483 -
Class: Passenger Gene
Subclass: Antibiotic Resistance
Function: antibiotic efflux (ARO:0010000)
Sequence Family: major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Target: phenicol antibiotic (ARO:3000387)
Comment: strict match to reference sequence for ARO:3002696 (bitscore: 819)
Protein Sequence: MSSKNFSWRY SLAATVLLLS PFDLLASLGM DMYLPAVPFM PNALGTTAST IQLTLTTYLV MIGAGQLLFG PLSDRLGRRP VLLGGGLAYV VASMGLALTS SAEVFLGLRI LQACGASACL VSTFATVRDI YAGREESNVI YGILGSMLAM VPAVGPLLGA LVDMWLGWRA IFAFLGLGMI AASAAAWRFW PETRVQRVAG LQWSQLLLPV KCLNFWLYTL CYAAGMGSFF VFFSIAPGLM MGRQGVSQLG FSLLFATVAI AMVFTARFMG RVIPKWGSPS VLRMGMGCLI AGAVLLAITE IWALQSVLGF IAPMWLVGIG VATAVSVAPN GALRGFDHVA GTVTAVYFCL GGVLLGSIGT LIISLLPRNT AWPVVVYCLT LATVVLGLSC VSRVKGSRGQ GEHDVVALQS AESTSNPNR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
aadA2 (ARO:3002602) AadA2 In_Tn21.1 259 13745-14524 -
Class: Passenger Gene
Subclass: Antibiotic Resistance
Function: antibiotic inactivation (ARO:0001004)
Sequence Family: ANT(3'') (ARO:3004275)
Target: aminoglycoside antibiotic (ARO:0000016)
Comment: strict match to reference sequence for ARO:3002602 (bitscore: 520)
Protein Sequence: VTIEISNQLS EVLSVIERHL ESTLLAVHLY GSAVDGGLKP YSDIDLLVTV AVKLDETTRR ALLNDLMEAS AFPGESETLR AIEVTLVVHD DIIPWRYPAK RELQFGEWQR NDILAGIFEP AMIDIDLAIL LTKAREHSVA LVGPAAEEFF DPVPEQDLFE ALRETLKLWN SQPDWAGDER NVVLTLSRIW YSAITGKIAP KDVAADWAIK RLPAQYQPVL LEAKQAYLGQ KEDHLASRAD HLEEFIRFVK GEIIKSVGK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
DUF1010 family protein DUF1010 family protein In_Tn21.1 96 14542-14832 -
Class: Passenger Gene
Subclass: Other
Sequence Family: DUF1010 (Pfam:PF06231)
Protein Sequence: MFIQTAFSFS GVIQCLFCLF SGLRLHGLRR FSVFLASSPC VASASSYRFC SAVPPRWRSV FSRLAPVAKF KLSVLASGSN ISVKPTRILR SAYLAR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
dfrA12 (ARO:3002858) DfrA12 In_Tn21.1 165 14944-15441 -
Class: Passenger Gene
Subclass: Antibiotic Resistance
Function: antibiotic target replacement (ARO:0001002)
Sequence Family: trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218)
Target: diaminopyrimidine antibiotic (ARO:3000171)
Comment: 100% identity with reference sequence for ARO:3002858 (bitscore: 339)||Synonyms:
Protein Sequence: MNSESVRIYL VAAMGANRVI GNGPNIPWKI PGEQKIFRRL TEGKVVVMGR KTFESIGKPL PNRHTLVISR QANYRATGCV VVSTLSHAIA LASELGNELY VAGGAEIYTL ALPHAHGVFL SEVHQTFEGD AFFPMLNETE FELVSTETIQ AVIPYTHSVY ARRNG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In_Tn21.1 337 15586-16599 +
Class: Integron Integrase
Subclass: Class 1
Function: Integrase
Sequence Family: Class 1 Integron Tyrosine Integrase
Transposase Chemistry: Tyrosine
Protein Sequence: MKTATAPLPP LRSVKVLDQL RERIRYLHYS LPTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpM TnpM Tn21.1 116 16802-17152 +
Class: Accessory Gene
Subclass: Inhibitor
Function: transposition regulator||reported to enhance Tn21 transposition and suppress resolution of cointegrate replicons in vivo
Comment: 3'-end of urfM ORF, which is interrupted by insertion of In2||inhibits tranposition probably by inhibiting resolution
Protein Sequence: MEVVAEGVET PDCLAWLRQA GCDTVQGFLF ARPMPAAAFV GFVNQWRNTT MNANEPSTSC CVCCKEIPLD AAFTPEGAEY VEHFCGLECY QRFQARASTA TETSVKPDAC DSPPSG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn21.1 186 17278-17300 , 18129-18666 +
Class: Accessory Gene
Subclass: Resolvase
Function: resolvase
Sequence Family: Serine Site-Specific Recombinase
Transposase Chemistry: Serine
Comment: Transposon Tn21 resolvase Cterminal part, truncated by IS26
Protein Sequence: MTGQRIGYIR VSTFDQNPER QLEGVKVDRA FSDKASGKDV KRPQLEALIS FARTGDTVVV HSMDRLARNL DDLRRIVQTL TQRGVHIEFV KEHLSFTGED SPMANLMLSV MGAFAEFERA LIRERQREGI ALAKQRGAYR GRKKSLSSER IAELRQRVEA GEQKTKLARE FGISRETLYQ YLRTDQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR 5'-end N/A Tn21.1 7 17278-17300 +
Class: Accessory Gene
Subclass: Resolvase
Sequence Family: Serine Site-Specific Recombinase
Transposase Chemistry: Serine
Comment: tnpR ORF interrupted by IS26 insertion
Protein Sequence: MTGQRIG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA IS26 234 17372-18076 +
Class: Transposase
Transposase Chemistry: DDE
Protein Sequence: MNPFKGRHFQ RDIILWAVRW YCKYGISYRE LQEMLAERGV NVDHSTIYRW VQRYAPEMEK RLRWYWRNPS DLCPWHMDET YVKVNGRWAY LYRAVDSRGR TVDFYLSSRR NSKAAYRFLG KILNNVKKWQ IPRFINTDKA PAYGRALALL KREGRCPSDV EHRQIKYRNN VIECDHGKLK RIIGATLGFK SMKTAYATIK GIEVMRALRK GQASAFYYGD PLGEMRLVSR VFEM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR 3'-end N/A Tn21.1 179 18129-18666 +
Class: Accessory Gene
Subclass: Resolvase
Sequence Family: Serine Site-Specific Recombinase
Transposase Chemistry: Serine
Comment: tnpR ORF interrupted by IS26 insertion
Protein Sequence: YQGQHLRPEP GTATGRRQG* SRF*RQGIRQ GCQASATGSA DKLRPHRRHR GGA*HGSPGA QSR*FAPDRA NADTTRRAYR IRQGTPQFYW RRLSDGEPDA LGDGRVRRVR ARPDPRASAR GYCARQATRG LPWQEEIPVV *AYCRTAPTC RGWRAKDQAC S*IRNQSRNP VSILENGSV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn21.1 988 18669-21635 +
Class: Transposase
Function: transposition, DNA-mediated (GO:0006313)
Transposase Chemistry: DDE
Protein Sequence: MPRRSILSAA ERESLLALPD SKDDLIRHYT FNDTDLSIIR QRRGPANRLG FAVQLCYLRF PGVILGVDEL PFPPLLKLVA DQLKVGVESW NEYGQREQTR REHLSELQTV FGFRPFTMSH YRQAVQMLTE LAMQTDKGIV LASALIGHLR RQSVILPALN AVERASAEAI TRANRRIYDA LAEPLADAHR RRLDDLLKRR DNGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPT GIERLVHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LATEGMATVT DEIIDLHDRI LGKLFNAAKN KHQQQFQASG KAINAKVRLY GRIGQALIDA KQSGRDAFAA IEAVMSWDSF AESVTEAQKL AQPDDFDFLH RIGESYATLR RYAPEFLAVL KLRAAPAAKN VLDAIEVLRG MNTDNARKLP ADAPTGFIKP RWQKLVMTDA GIDRRYYELC ALSELKNSLR SGDIWVQGSR QFKDFEDYLV PPEKFTSLKQ SSELPLAVAT DCEQYLHERL TLLEAQLATV NRMAAANDLP DAIITESGLK ITPLDAAVPD TAQALIDQTA MVLPHVKITE LLLEVDEWTG FTRHFTHLKS GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHTRDET YSTALAELVN AQFRHPFAGH WGDGTTSSSD GQNFRTASKA KSTGHINPKY GSSPGRTFYT HISDQYAPFH TKVVNVGLRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYIPKGD AAYDALKPMI GGTLNIKHVR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAAHALRGN GHAVDDSLLQ YLSPLGWEHI NLTGDYLWRS SAKIGAGKFR PLRPLQPA

TnCentral Accession TE Name Type Coordinates Strand Length
In_Tn21.1-MH257753 In_Tn21.1 integron 4634-16801 + 12168
IS26-MH257753 IS26 insertion sequence 6027-6846 + 820
IS26-MH257753 IS26 insertion sequence 17309-18128 + 820

Name Associated TE Coordinates Direction Length Sequence
IRt In_Tn21.1 4634-4666 + 33 TGTCATTTTC AGAAGACGAC TGCACCAGTT GAT
repeat t1 In_Tn21.1 4642-4660 + 19 TCAGAAGACG ACTGCACCA
repeat t2 In_Tn21.1 4682-4700 - 19 TCAGGAGCTG GCTGCACAA
repeat t3 In_Tn21.1 4711-4730 + 20 TCAGAAGTGA TCTGCACCAA
repeat t4 In_Tn21.1 4743-4761 + 19 TCAATACTCG TGTGCACCA
IRL IS26 6027-6040 + 14 GGCACTGTTG CAAA
IRR IS26 6833-6846 - 14 GGCACTGTTG CAAA
repeat i4 In_Tn21.1 16682-16700 - 19 TCAGCGGACG CAGGGAGGA
repeat i3 In_Tn21.1 16710-16728 - 19 TCAGGCAACG ACGGGCTGC
repeat i2 In_Tn21.1 16752-16770 - 19 TCAGAAGCCG ACTGCACTA
IRi In_Tn21.1 16769-16801 - 33 TGTCGTTTTC AGAAGACGGC TGCACTGAAC GTC
repeat i1 In_Tn21.1 16775-16793 - 19 TCAGAAGACG GCTGCACTG
IRL IS26 17309-17322 + 14 GGCACTGTTG CAAA
IRR IS26 18115-18128 - 14 GGCACTGTTG CAAA