Transposon
Name: TnThsp9       (Synonyms: Tn7202)
Family: Tn3        Group: Tn3
Evidence of Transposition: no
 Host     

Host Organism:Thiomonas sp. str. 3As Molecular Source:plasmid pTHI
Date of Isolation:2010

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 47 bp)GGGGTTCGGGGAGCAATGGAACAGCCAACCGACTTAAGCGCTCGGCA
IRR (Length: 45 bp)GGGGTTCGGGGAGCAATGGAACAGCCAACCGACTTAAGCCCTGGC

 Sequence     
DNA SequenceLength  6215 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTCGGG GAGCAATGGA ACAGCCAACC GACTTAAGCG CTCGGCATCA CCAGTAGCGG TCAGTCCCAT CCATGCGGGC ATCGCTTTCC GAGAAGCAGT 100
TGCGGTCGTC CGCCTCGCTT GAGATGTCAC CCAGCAGCTC GTGCATGCGC TTGTCGAGGT CGTCATCACT GCTGTAGGGC ACCTTCAGTT CGTACTCGCC 200
ACTGGGTAGC CGCTGGCCGT CGTACTCCGA CAAGCAGTAG CGCTCGATGT CCTCACGGGC GCGCTTCTTG CCGCGCACGA ACTTGCTGTT GTTTTCGATG 300
CGCAGCGTCA GCCGGATCGT GGCGACCTTC GGCGGATCAG CCTCGCCAGC ACTCGTCGAT GGGCCTGGCT GGCCCCGCGC CGGTTTCTTG TAGGGGCCGA 400
TCTCGACGCC GCGATGCCGT AGGTAGCTGT ACAGCGTGCT CTTGGACACA TGCAGCTTCT GCGCGATGGC AGCCACCGAC AGCTTGCGCT CGCGGTACAG 500
CGTGTCGGCG GCCAGCGCCG TCGCCTCGGC CTGCGGCGTC AAACCTTTGG GCCGCCCACC GACCCGTCCG CGCGCCCGTG CCGCCGTCAG TCCGGCCTGC 600
GTGCGCTCGC GGATCAGCTC GCGCTCGAAC TCCGCGAGCG AGGCGAACAG GTTGAACACG AGCCGCCCCT GGGCGCTGGT GGTGTCGATT GGGTCGTTCA 700
GGCTCAGCAG CCCGACCTTG CGTTCCATCA GGCTGCCGAC CAGTTCAACC AGGTGCTTGA GCGAACGCCC CATGCGGTCG AGCTTCCAGA TCACCAGCAC 800
GTCGCCGGCT CGCAACTGAC CCAGCAACTC GTCGAGCGCC GGCCGCGCGG TCTTCGCGCC GCTCGCCACG TCCTGGTAGA CGCGCTCGCA ACCGGCCGCC 900
TTGAGGGCGT CCACCTGCAA GTCTGATTTC TGGTCGCGCG TGGAAACGCG GGCGTAGCCG ATCTTCATGC TTAGAGTCTG CTTTACTCGT TGAAAGAGAA 1000
TAATATCGAA CTTTGATTCA TCAAACCACA AAGTTAGACG AATTGCGGTG CTTCACCGTT TGCCCGTCAG GTCGCGCGGC AGAGTCTTGT AAAACGAAGG 1100
TTTTATCGAA CCGTTGCCAG TGACGGCTAT AGGGTGTATA AACACCTATG CACCCAAACA CCGTACTGGA GGGCGTTATG AACACCGTTC GCTGGAACAT 1200
CGCCGTGTCG CCGGACGTAG ATCAGTCCGT GCGCATGTTC ATCGCCGCGC AGGGCGGCGG CCGCAAGGGC GACCTGTCGC GCTTCATCGA GGAAGCTGTG 1300
CGCGCCTACC TGCTGGAGCG CGCCGTCGAT CAGGCCAAGA CGGCTGCGGC CAGCATGAGC GAAACCGAAC TGACCGACCT CATCGACGAG GCTGTGCAAT 1400
GGGCGCGTGA GCACTGATGC GCGTCATCCT TGACACCAAC GTGCTGCTCG GTGCGCTGAT TTCGCCCCAT GGGCCACCCG ATGCGATCTA TCGCGCCTGG 1500
CGCGCGGCGC GTTTCGAGCT GGTGACATCG GCGGCACAGC TCGACGAGTT GCGGCGAGTG AGCCGCTACC CCAAACTCAA GACCATCTTG CCCGCGCACC 1600
GCGTCGGCAC GATGGTCAAC AACATGCAGC GCGCCATCGT TTTGGCGCAA CTGCCGCCAC TGCCGGATGG CATCGAGGCG AATGATCCGA ACGATGCGTT 1700
CCTGCTGGCG ATGGCCTTGG GCGGCGAGGC CGACTACCTG GTGACAGGCG ACCGCCGCGC CGGGCTGCTG CAACGCGGCA GCATCGGGCG CACGCGCATC 1800
GTCACGCCGG CCACCTTCTG CGCCGAGGCG CTTTGAGCCA TGCCGGTCAG CTTCCTGTCC ACCACGCAAC GGGAACGCTA CGGCCGCTAT CCAGAGACGC 1900
TTTCCAGCGA AGAACTGGCG CGCTACTTCC ACCTGGACGA CGATGACCGC GAGTGGATCG CCACCAAGCG GCGCGACAGC CACCGACTGG GCTATGCGCT 2000
GCAACTGACC ACGGCTCGCT TTCTCGGCAC CTTCCTGGAA GACCCGGCCG CCGTGCCCGG TGCGGTACTG CATACGCTGT CGTCCCAGCT CGGCATCGCC 2100
GACCCCGACT GCGTGCTGGC CTACCGCGAG AGCGAGCAAC GCTGGCGGCA CACGACCGAG ATTCGTGCCC GCTACGGCTA TCGTGAATTT GCCGACGGCG 2200
GCGTGCAGTT CCGCCTTGGC CGGTGGCTAT GCGCGCTGTG CTGGACGGGC ACCGACCGGC CGAGCGCGCT GTTCGATTAC GCCAACGGCT GGCTGGTCGG 2300
TCACAAGGTG CTGCTGCCCG GCGTCACGGT GCTGGAACGC TTCATCGCTG AAGTGCGCTC GCGCATGGAG TCGCGCCTGT GGCGTCTGCT CGTGCGCGGC 2400
GTGACGGCTG CACAGCGGCA GCGACTCGAT GATCTGCTCA AGCCCGCCGA AGGCAGCCGC CAGTCCTGGC TGGATCGATT GCGCAAGGGG CCAGTGCGCG 2500
TCAGCGCTCC GGCGCTCGTG TTGGCCTTGC TGCGCATCGA GACCGTGCGC GGCCTGGGCA TCAGGCTGCC TGGCACCCAT GTGCCGCCGA GCCGCATCGC 2600
GGCACTGGCC CGCTTCGCCA GCACCGTCAA GGTGTCCGCC GTGGCCCGGC TGCCGGAAGC GCGGCGCATC GCCACGCTGG TGGCCTTCGT GCATTGCCTG 2700
GAAGCCAGCG CGCAGGACGA CGCCCTCGAT GTGCTCGACC TGCTGCTACG CGAACTGTTC ACCAAGGCCG AGAAGGAAGA CCGCAAGGTC AGGCAGCGCT 2800
CGCTCAAGGA TCTGGATCGG GCCGCCTCGA CGCTGGCCGA GGCTTGCCGG ATGCTGCTCG ATCCGGCCTT GCCGGACGGC GAACTGCGCG AGCGCGTCTA 2900
TGCCGCCATC GGCCGCGATG AACTGGCCCA GGCGCTCAAT GAGGTTCGCG GCCTGGTGCG TCCGCCAAAC GATGTGTTCT ATACCGAGCT GGAGGCCCGC 3000
AAGGCCACCG TTTCGCGTTT CCTGCCGACG TTGCTGCGCG TCATCCGCTT CGACGCCAAT CCGGCCGCGC AGCCTTTGGC GCAGGCGTTG CAATGGCTGC 3100
ATGAGAAGCC CGACCATGAT CCGCCTACGG CCATCGTCGG CAAGGCGTGG CAGCGCCATG TCGTGCAGGA TGACGGCCGC ATCAATGCCA CGGCCTATTC 3200
GTTCTGCGCG CTCGACAAGC TGCGCAGCGC GATTCGCCGC CGCGACGTGT TCATCAGCCC GAGTTGGCGC TACGCCGATC CACGCGCCGG GCTGCTGGCC 3300
GGGGCCGAGT GGGAGGCCGC GCGGCCCATC GTCTGCCGCT CGCTGGGCCT GACGGCGCAG CCCGAGGCTA CGTTGTCCGC GCTGACGCGC GAACTGGACG 3400
AGACCTACCG GCGCGTCGCC GCACGCCTGC CCGAGAACGA CGCAGTGCGC TTCGAGACAG TCGGCGACAA GACCGAACTG GTGCTCAGCC CCTTGGAGGC 3500
GCTGGAAGAA CCGGCTTCGC TGATCGCGCT GCGCAACGAG ATCAAGGCGC GCATGCCGCG TGTCGATCTG CCGGAAATCC TATTGGAAGT CGCCGCGCGC 3600
ACCGGCTGCA TGGAGGCGTT CACGCACCTG ACCGAGCGCA CCGCACGCGC GGCCGATCTG ACCACCAGCC TGTGCGCGGT GCTGATGGCC GAGGCCTGCA 3700
ACACCGGCCC CGAACCGCTC GTGCGGCAGG ACTCCCCGGC GCTCAGGCGC GACCGGCTGA TGTGGGTCGA TCAGAACTAC GTGCGCGACG ACACGTTGAT 3800
CGCCTGCAAC GCCGTGCTGG TGGCGGCGCA GAACCGCATC GCGCTGGCCC GCGCCTGGGG CGGCGGCGAC GTGGCCTCGG CGGACGGCAT GCGCTTCGTG 3900
GTGCCGGTGC GCACCATCCA CGCCGCGCCG AACCCGAAAT ACTTCAATCG CGGGCGCGGC GTCACCTGGT ACAACCTGCT GTCCGATCAA TGCACCGGGC 4000
TGAACGCCAT CACCGTTCCC GGCACGCTGC GCGACAGCCT GGTCTTGCTG GCGGTTGTGT TGGAGCAGCA GACCGAATTG CAGCCGACGC AGATCATGAC 4100
CGACACCGGC GCGTACAGCG ACGTGGTGTT TGGTTTGTTC CGTCTCTCCG GCTACCGTTT CTGCCCACGC CTGGCCGACG TTGGCGGAAC GCGCTTCTGG 4200
CGCGTGGACG CCGAGGCCGA CTATGGCGAC CTCAATACAC TGGCACGGCA GCGCGTGAAC CTCGACCGCA TCACGCCGCA TTGGGATGAC GTGCTGCGCC 4300
TGGTCGGCTC GCTCAAACTC GGCATGGTGC CGGCGATGGG CATCATGCGC ACCTTGCAGG TCGATGAGCG GCCCACCAGC CTGGCGCAGG CCATCGCCGA 4400
AATCGGCCGC ATCGACAAGA CCATCCATAC GCTGAACTTC ATCGACGACG AGGCCCGCCG CCGCGCCACG TTGCTGCAAT TGAATCTCGG CGAAGGCCGC 4500
CACAGCCTGG CGCGCGAAGT CTTCCACGGC AAGCGCGGCG AGCTGTTCCA GCGTTACCGC GAAGGGCAGG AGGACCAGTT GAGCGCGCTC GGCCTGGTCG 4600
TCAACATGAT CGTGCTGTGG AACACGCTGT ACATGGACGC GGTGCTCGCG CAGTTGCGCA GCGAGGGCTA CCGGGTGAAG CCCGAGGACG AGGCTCGGCT 4700
GTCGCCGTTC GGCCATGAGC ACATCAACAT GCTTGGACGC TACTCGTTCT CGGTGCCGGA GGCGGTCGCC CGTGGCGAGC TGCGACCCCT GAGCAGGAAG 4800
GACGACGTTT GAGCCGCTTC AAGCCTATCC GCAGCATCCT TGCTGCTGCT GGATAGGCGG GCACGGCACC GAGCCGAACG AACAGAACAC GCAGCAGTCG 4900
CCGGGCTTGG GGCGCAGCAG CGCGTGGCAC GCCGGGCACT CGTCGTAGAA CTGGCAGGCA TCCGTCGGCA TGGTCTTCTG CCAAGCGTGC CCGCAGTGCG 5000
GGCAGGTCAG CACAGATTCG AGAATGATGG TCATCTCGGC ATCACTTCTG CACGGCGGAC GGGTAGCCCG CGTTGGTCGT GGCCTTGGTC AGCGCTTCCG 5100
GCTGGGCCTT GTCCGGGTCG TAGGTGACGG TGGCTGTTTT TTTGTCGAAA TCGACCTGTA CTGCGCTGAC GCCAGCTACC TTCTCCAGCG ATTTCTTGAC 5200
CGTGATCGGG CACAGCGGAC AGGTCATGTT CTGGACGGCC AGTGTGACCG TTTTCGACGG CGCGGCCAGC ACCGTGAACG GCAAGGCAAC CAGTAGAGCG 5300
ATGAGTAATT TGCGCATGGC GGACTCCTTT CAGTAGAACA ACGGGGCCAG CCACGGCACG GCCAGCAGGG CGAGCAACAG GATGGCAACG ATCCAGAAGG 5400
TGAGCCGTTG CCGGCGGCGC GTGCGCGGAT CGGCGCAGGG CGTGCCTGGG GTGCAGACCT GCTGCACCAG GTAAAGCTTG CGGAAGGCCA GGCCGAGAAA 5500
CAGCAGCGTC AAGCCAATGA AGAACGGCCG GTACGGCTCC ATTGCCGTCA GGCTGCCGAC CCAGGTGCCG CCGATGCCCA GGGCCAGCAG CACGAGCGGT 5600
CCGACGCAGC ACACCGATGC GCCGATGGCG GCGAGGATGC CTGCGACCAA TGAGCTTTTG CCGGTGAGTG ATGCCATGCC GATTCCTTGA ACTTGTGCGC 5700
CCAAGTACTA CTCTACATTC CGTACCTAAG TACGGAATCA AGGGGAATTT GCATGGGCGC AGACCTGACC ATCGGCAAGC TGGCGGACGC TGCCGGAGTG 5800
AACATCGAGA CGATCCGCTA TTACCAGCGG CGCGGGCTGC TGGATGAACC GGCCAAGCCG CTGGGCGGCC ATCGGCGCTA TCCGGCAGGG GAGGCCAAGC 5900
GGGTGCGCTT CATCAAGCGG GCGCAGGCGC TGGGCTTCAC GCTGGATGAA GTCGGCATGC TGCTGACGCT GGACTCGGCG TGTGGCTGCT CGGACACGCG 6000
GGCGCTGGCC GCCCGAAAGC AGGGACTGAT CGAGCGAAAG ATGGCCGACC TGACGGCCAT GTACCAAGCG CTGGGCGATC TGATCCAGCA ATGCGATACC 6100
GGTGGCAGCA CAAGGCCGTG CCCGATCATC GACGTGCTGG AGCGTGACTG ACCACCTCCG GCTTTCCCTT GCCAGGGCTT AAGTCGGTTG GCTGTTCCAT 6200
TGCTCCCCGA ACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res 975-1111 137 AGTCTGCTTT ACTCGTTGAA AGAGAATAAT ATCGAACTTT GATTCATCAA ACCACAAAGT
TAGACGAATT GCGGTGCTTC ACCGTTTGCC CGTCAGGTCG CGCGGCAGAG TCTTGTAAAA
CGAAGGTTTT ATCGAAC
res_site_III 975-1004 30 AGTCTGCTTT ACTCGTTGAA AGAGAATAAT
res_site_II 1009-1048 40 AACTTTGATT CATCAAACCA CAAAGTTAGA CGAATTGCGG
res_site_I 1084-1111 28 GTCTTGTAAA ACGAAGGTTT TATCGAAC

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpR TnThsp9 48-968 Accessory Gene Resolvase -
RHH_6 TnThsp9 1148-1417 Passenger Gene Antitoxin +
PIN_3 TnThsp9 1417-1836 Passenger Gene Toxin +
tnpA TnThsp9 1840-4812 Transposase   +
THI_p0011 TnThsp9 4825-5034 Passenger Gene Hypothetical -
merP TnThsp9 5042-5317 Passenger Gene Heavy Metal Resistance -
merT TnThsp9 5330-5677 Passenger Gene Heavy Metal Resistance -
merR TnThsp9 5753-6151 Passenger Gene Heavy Metal Resistance +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnThsp9 921 48-968 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MKIGYARVST RDQKSDLQVD ALKAAGCERV YQDVASGAKT ARPALDELLG QLRAGDVLVI WKLDRMGRSL KHLVELVGSL MERKVGLLSL NDPIDTTSAQ
GRLVFNLFAS LAEFERELIR ERTQAGLTAA RARGRVGGRP KGLTPQAEAT ALAADTLYRE RKLSVAAIAQ KLHVSKSTLY SYLRHRGVEI GPYKKPARGQ
PGPSTSAGEA DPPKVATIRL TLRIENNSKF VRGKKRARED IERYCLSEYD GQRLPSGEYE LKVPYSSDDD LDKRMHELLG DISSEADDRN CFSESDARMD
GTDRYW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
RHH_6 RHH_6 TnThsp9 270 1148-1417 +
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  RHH_6 (Pfam:PF16762)
Protein Sequence:  
MHPNTVLEGV MNTVRWNIAV SPDVDQSVRM FIAAQGGGRK GDLSRFIEEA VRAYLLERAV DQAKTAAASM SETELTDLID EAVQWAREH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
PIN_3 PIN_3 TnThsp9 420 1417-1836 +
Class:   Passenger Gene
Sub Class:   Toxin
Function:   18 : Unknown function
Target:   single stranded RNA
Sequence Family:  PIN_3 (Pfam:PF13470)
Comment:   tRNA(fMet)-specific endonuclease
Protein Sequence:  
MRVILDTNVL LGALISPHGP PDAIYRAWRA ARFELVTSAA QLDELRRVSR YPKLKTILPA HRVGTMVNNM QRAIVLAQLP PLPDGIEAND PNDAFLLAMA
LGGEADYLVT GDRRAGLLQR GSIGRTRIVT PATFCAEAL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnThsp9 2973 1840-4812 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPVSFLSTTQ RERYGRYPET LSSEELARYF HLDDDDREWI ATKRRDSHRL GYALQLTTAR FLGTFLEDPA AVPGAVLHTL SSQLGIADPD CVLAYRESEQ
RWRHTTEIRA RYGYREFADG GVQFRLGRWL CALCWTGTDR PSALFDYANG WLVGHKVLLP GVTVLERFIA EVRSRMESRL WRLLVRGVTA AQRQRLDDLL
KPAEGSRQSW LDRLRKGPVR VSAPALVLAL LRIETVRGLG IRLPGTHVPP SRIAALARFA STVKVSAVAR LPEARRIATL VAFVHCLEAS AQDDALDVLD
LLLRELFTKA EKEDRKVRQR SLKDLDRAAS TLAEACRMLL DPALPDGELR ERVYAAIGRD ELAQALNEVR GLVRPPNDVF YTELEARKAT VSRFLPTLLR
VIRFDANPAA QPLAQALQWL HEKPDHDPPT AIVGKAWQRH VVQDDGRINA TAYSFCALDK LRSAIRRRDV FISPSWRYAD PRAGLLAGAE WEAARPIVCR
SLGLTAQPEA TLSALTRELD ETYRRVAARL PENDAVRFET VGDKTELVLS PLEALEEPAS LIALRNEIKA RMPRVDLPEI LLEVAARTGC MEAFTHLTER
TARAADLTTS LCAVLMAEAC NTGPEPLVRQ DSPALRRDRL MWVDQNYVRD DTLIACNAVL VAAQNRIALA RAWGGGDVAS ADGMRFVVPV RTIHAAPNPK
YFNRGRGVTW YNLLSDQCTG LNAITVPGTL RDSLVLLAVV LEQQTELQPT QIMTDTGAYS DVVFGLFRLS GYRFCPRLAD VGGTRFWRVD AEADYGDLNT
LARQRVNLDR ITPHWDDVLR LVGSLKLGMV PAMGIMRTLQ VDERPTSLAQ AIAEIGRIDK TIHTLNFIDD EARRRATLLQ LNLGEGRHSL AREVFHGKRG
ELFQRYREGQ EDQLSALGLV VNMIVLWNTL YMDAVLAQLR SEGYRVKPED EARLSPFGHE HINMLGRYSF SVPEAVARGE LRPLSRKDDV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
THI_p0011 THI_p0011 TnThsp9 210 4825-5034 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Function:   18 : Unknown function
Comment:   part of mercury operon
Protein Sequence:  
MTIILESVLT CPHCGHAWQK TMPTDACQFY DECPACHALL RPKPGDCCVF CSFGSVPCPP IQQQQGCCG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merP MerP TnThsp9 276 5042-5317 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   Periplasmic mercury ion-binding protein
Protein Sequence:  
MRKLLIALLV ALPFTVLAAP SKTVTLAVQN MTCPLCPITV KKSLEKVAGV SAVQVDFDKK TATVTYDPDK AQPEALTKAT TNAGYPSAVQ K

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merT MerT TnThsp9 348 5330-5677 -
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Comment:   mercury transporter
Protein Sequence:  
MASLTGKSSL VAGILAAIGA SVCCVGPLVL LALGIGGTWV GSLTAMEPYR PFFIGLTLLF LGLAFRKLYL VQQVCTPGTP CADPRTRRRQ RLTFWIVAIL
LLALLAVPWL APLFY

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
merR MerR TnThsp9 399 5753-6151 +
Class:   Passenger Gene
Sub Class:   Heavy Metal Resistance
Target:   Mercury
Protein Sequence:  
MGADLTIGKL ADAAGVNIET IRYYQRRGLL DEPAKPLGGH RRYPAGEAKR VRFIKRAQAL GFTLDEVGML LTLDSACGCS DTRALAARKQ GLIERKMADL
TAMYQALGDL IQQCDTGGST RPCPIIDVLE RD