Transposon
Name: TnTsp1       (Synonyms: Tn7204)
Family: Tn3        Group: Tn4651
Evidence of Transposition: no
 Host     

Host Organism:Thiomonas intermedia K12 Molecular Source:plasmid pTINT01
Place of Origin:Hamburg, Germany Date of Isolation:2010
Other Geographic Information:sewage pipe

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGCTGGTCTGTCGTAGCTTTCCAAAGGACAGCAGGT
IRR (Length: 38 bp)GGGGTAGAGCAAGCAGTCGTGTCACCAGCCGCCATTTC

 Sequence     
DNA SequenceLength  4575 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGCTGGTC TGTCGTAGCT TTCCAAAGGA CAGCAGGTAA TTGCTCCAAA GCTGACGGCT AAACGGCTGG ATTCGTGGGT TATTGATTGA CCGCTCCTGG 100
CCGCCTGCCG CCGACCGCGG TCTGGCTCGA AAGCAGTCTG TCAACGCGCC GTTTCATTGT TGCCATGATC ATCTCTTAAT CGCGCACAGG CGGCCACAGG 200
CGGGCAGTAT GAACCAGCGT CAGTATCCAC ACCGTTTCGC CGTCGATCTG ATACACCAGG CGATAGCTTT CGTGCGGGAT CAACTCGCGG GTCCCGGGAA 300
TCTTTCCCGG CTTGCCCAGC ATGGGGTGCT GGATCAAGCG GGCGGCCGCG TCGCTGAAAA TCTCATCCAT CCGGGCCGCC GCGCGCGGAT TGTCGGCTGC 400
GATGTAGTCC CACACATCGG CACGGTCTTG CTGCGCTTCG GGCGTCCAGA CAACTCTCAC GACTGCCTCG CCACACTGGC ACGCCGTGCG GCGAATTCGG 500
CCTCAACTTC ATCGTTCGAC CGCCCCAATC CAGCGCGCAT CGAAGCCCGG CCGGCTTCGA CCTTGCGGCG CAGGAACTCG TCGTACTCGC GCGACTCGCG 600
CTGGCGCTGA ACGAACTCGC GCATCAGCTC GCGCAGCACT TGTGACGCCG GGCGATGGGC CGCCTCGGCT TCGGCCATAA ACTCGGCGCG CAACTCAGGC 700
TCCAGCTTCA TCGTGAAAAC GGCTTGTTTT GACATGATCG GGGCCTCCTG CCACTTGATA CTAACAAAGT ATATACGCCG TCATTACTAA GCGCTATTCA 800
CAGAACGCTG CAAGGCGGGC GTGCGCTAGG CCAAGGCCTG TCGGAAAACA TTTGATTGTC GACAGATCTT CAACAGCCCT CTGCACCAAG CTCCGAGTGG 900
CCGCAAAATT GTGCGGAAAA CTGTGTCGCC AGACGCTACC ATGCGGAAAC CGCGTCTTGA TGGTTTTCCG CTTATGTTAG TAGGTTACAT GCGCGTGTCG 1000
TCAGATTCCG ACCGGCAGAG CACGGACTTG CAGCGCGACG CGCTGCTCGC CGCCGGCGTC GATCCGCGTC ACCTGTTCGA GGATTACGCC TCCGGCGCGA 1100
AGGACGACCG CGCCGGCTTG GTGCAGGCGC TCGAATTCGT CAGGCCCGGC GACGTACTGG TCGTGTGGAA GCTCGACCGG CTCGGCCGAT CGCTGTCGCA 1200
CCTACTCACC ATCGTGACCT CGCTCAAGGA CAAGCGGGTG GCGTTCCGCT CGCTGACGGA GAATCTGGAC ACCACCACGC CATCGGGCGA GTTTCTATTC 1300
CAGGTGTTCG GCGCGCTCGC GCAGTACGAG CGCGCCTTGA TCCAGGAGCG TGTCGTTGCC GGCCTGGCTG CTGCCCGCAA ACGTGGCCGG ATCGGTGGGC 1400
GGCCGCAGGC AATCACCGGC GAGAAGCTGG ATGCCATTAT CGCCGCGCTC GAAGGTGGCA TGTCCAAGGC AGCCGTGTGC CGCAACTTCG GCGTGAAGCG 1500
GACCACCTTG ATCGAAACTC TGACGCGGGC CGGCTGGCCC GCGTCAGTGC CCCCAGGAGC GTAGCGGTAT GGCGGACGGC GGCGGCAGTA AGGAAGATCA 1600
GTGGGTATTG GCGCCAGCCG AACGCGAACT GGTGATGACC AAAAACCGGG CGAACCGGTT GGGCTTTGCC ATCCTGCTGA CCTTCTTCCG CGATCGCGGC 1700
CGTTTTCCGC GCGACGAAAC CGAAGTCGAG GTACAGGGCA TAGCCGCGCT CGCCAAACAA CTCGACGCAC CCGCGCCCAT CGATGGCGAA GCCTTCCTCA 1800
CGGGCCGCAC TGCCGAGCGG CTGCGCGGCG AAATCCGCGT GCGTTTCGGC TTCCGCGAAG CGACGGTAGC CGATGCCGAG ATGCTGACGG AGTGGCTGCG 1900
TGATCATGTT GCCGGAGAAG TTGGCGGTGA CATTGAGCCG ATGATCGAGC GGCTGGAAGG ACGTTGCCGC GAACTCGCCA TCGAGCCGCC GAAACCAGAC 2000
CGGATGGAGC GCATCGCGCG CAGTGCGTTG CGCTCCCACG AAGACCGCTT CCATAGCTGC GTGTATGGGC GGCTGCCGCC CGCGACTCGC GAACGCCTGG 2100
ATGCCTTGCT GCGCCCAGAA GAATCGGGCC ACGGGGAGAG CGCCGTTGAA GATGCTCAAG GCGAAGCCGC AGGCAACGCG CCGGCCGTCT TGCTGAAACT 2200
GCGCGGCAGT CCCGGCCGCC CAAGCCTTGC CAGCATGCAG GATGAGTTGG CGAAGCTCGA ACTGATCCGG GGGATCGAGC TGTCTGCCGA TCTGTTCGAC 2300
CGGACTTCGC CGCGTGACCT GGAGCGCTGC CGCCAGCGTG TGTCGGTCGA GGTTCCCCGC GACCTGCGCC GACATCCCGA TGCAGCGCGC CTCACCTGGC 2400
TGGCCGCTTT CGTCCACCTG CGCGCCCGCA GCCTGACCGA CGACCTGGTG GACTTGCTGA TCGAGACCAT CCACCAGATC GGCGCGCGTG CCGAACGCAA 2500
GGTCGAACGC GAACTGCTGG AGGACCTCAA GCGCGTGTCC GGCAAGCAGA ACCTGCTGTT CAATCTGGCC GACGCCACCT TGGCCCAGCC GGACGGCGTG 2600
GTGCGCGACG TGGTGTTTCC AGTGGTCGGC GAGCAGACGC TGCGCGATCT GGTCAAGGAG TGGAAGGCCA CCGGCCCGAC CTACCGCATC ACGCTGCGCA 2700
CTGTGATCCG CAATTCGTAC CAGGGCCACT ACCGGCGCAT AGTACCGACC TTGCTGGCCG CGCTGGTATT CCGCTCCAAC AACGACCGCC ACCGCCCGGT 2800
GATGGACGCG CTCGCACTGG TGAAGCGCTT TGCCGACACC AAGGTGCATA CCTTCCCAGC CGACATCGAG GTGCCGCTCG ATGGCGTGGT ACGTGGCCTG 2900
TGGCGAGAAG CCGTCATGGA GACGGACGCC GCCGGCCGGG ATCGGGTCAA CCGCGTCACC TATGAAATCG CCGTGCTGGA AGCCCTGCGC GAGCGGCTGC 3000
GCTGCAAGGA AATCTGGGTG GTCGGCGCGA ACCGCTACCG CAACCCCGAC GACGATCTGC CGGCTGACTT CGAGCAAAAC CGCGAGGACT ACTACCGGGC 3100
GCTGAACCTG CCTCTCGATG TGGAGCGCTT CATCGCCGAC TTGCAGGCCG AAATGCGCGC GGCGCTGTCC ACCTTCGACG CTGGCTTGAA GAAGAATCCA 3200
TCCGTCCGGC TGAGCAGCAA GGGCGGTGGC TGGATCACGC TGACGCCGCT CGATGCGCAA CCCGATCCCC CCAATCTGAC CGCGCTAAAG GCCGAACTCA 3300
ATGTCCTCTG GCCGATGACC AGCCTGCTCG ATATGGTCAA GGAAACCGAT CTGCGGTTGA GCTTTACCGA TGCCCTGAAA AGCCCGACCT CCTACGAGTC 3400
GATGGATCGC TCGGTGTTGC AGCCGCGCCT GCTCCTGTGT CTGCACGGCC TGGGCACCAA TGCTGGCTTG CAGCGCATGG CCGGGCTGGA TTCCGGCACC 3500
ACGGCGCGCG ACCTGGCCTA TGTGCGCCGC CGTTACATCA GCGTGGACGC GATGCGCCGC GCGATTGCCA TCGTCGCAGA CGGCACGCTG CAAGCCCGCA 3600
ACCCGGCGAT CTGGGGTAGC GGCACCACCG CTTGCGCGTC GGACTCGAAA CACTTCGGCG CGTGGGATCA GAACCTCACC ACGCAATGGC ACGTCCGCTA 3700
CGGCGGGCGC GGCGTGATGA TCTACTGGCA TGTCGAGCGC AGCTCGCTGT GCATCCATTC GCAGCTCAAG TCGCCGTCGT CGTCGGAGGT GGCGTCGATG 3800
ATCGAGGGCG TGATCCACCA TTGCACCGAG ATGGAGGTGG ATCGGCAGTA TGTCGATTCG CACGGCCAGA GCACGGTGGC GTTCGCCTTC TGCCGCCTGC 3900
TGGGCTTCCA GTTGCTGCCA CGGCTGAAGG CCATCCACTC ACAGAAGCTG TACCGGCCAG AGACCGGCAA GGCCGACGTC TACGCGAACC TGCAACAGAT 4000
TCTGACCAAG CCCATCGACT GGGACTCGGT GCGGCAACAG TACGACCAGA TGGTCAAGTA CGCTACCGCG CTGCGCCTGG GGACAGCGGA CACCGAAGCC 4100
ATCCTGCGCC GCTTCACCAA GAAGAACGTG CAGCACCCGA CCTACAAGGC ATTCGCTGAG TTGGGCAAGG CGATCAAGAC TATCTTCCTG TGCCGCTACC 4200
TGCACGACGA GGCGTTGCGC CGGGAAATCA ACGAAGGGCT GAACGTAGTC GAGCAGTGGA ACGGCGCGAC CGACTTCGTG TTCTTCGCCC GCCGGGGCGA 4300
GATGGCGAGC AACCGCCGCG AGGATCACGA GGTCAGCATG CTCGCGCTGC ACTTGATCCA GAACTGTATG GTCTACATCA ACACGCTGAT GATCCAGAAG 4400
GTCTTGGCCC TGCCGCATTG GCAGGGCAGG TTCACACCAC GCGACTACGC CGCCCTGACG CCGCTGATCT GGGAACACGT CAACCCGTAT GGTCGGTTCG 4500
ATCTCGATAT GAACACCCGG CTCGACCTAC CGTGATCGAA ATGGCGGCTG GTGACACGAC TGCTTGCTCT ACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res 836-966 131 GCCTGTCGGA AAACATTTGA TTGTCGACAG ATCTTCAACA GCCCTCTGCA CCAAGCTCCG
AGTGGCCGCA AAATTGTGCG GAAAACTGTG TCGCCAGACG CTACCATGCG GAAACCGCGT
CTTGATGGTT T
res_site_I 836-864 29 GCCTGTCGGA AAACATTTGA TTGTCGACA
res_site_II 898-941 44 TGGCCGCAAA ATTGTGCGGA AAACTGTGTC GCCAGACGCT ACCA
res_site_III 942-966 25 TGCGGAAACC GCGTCTTGAT GGTTT

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
parE TnTsp1 176-415 Passenger Gene Toxin -
parD TnTsp1 457-735 Passenger Gene Antitoxin -
tnpR TnTsp1 989-1564 Accessory Gene Resolvase +
tnpA TnTsp1 1569-4535 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parE ParE TnTsp1 240 176-415 -
Class:   Passenger Gene
Sub Class:   Toxin
Target:   DNA gyrase
Sequence Family:  ParE_toxin (Pfam:PF05016)
Protein Sequence:  
MWDYIAADNP RAAARMDEIF SDAAARLIQH PMLGKPGKIP GTRELIPHES YRLVYQIDGE TVWILTLVHT ARLWPPVRD

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parD ParD TnTsp1 279 457-735 -
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  parD (PDB:4Q2U)
Comment:   RelB
Protein Sequence:  
MSKQAVFTMK LEPELRAEFM AEAEAAHRPA SQVLRELMRE FVQRQRESRE YDEFLRRKVE AGRASMRAGL GRSNDEVEAE FAARRASVAR QS

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnTsp1 576 989-1564 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MRVSSDSDRQ STDLQRDALL AAGVDPRHLF EDYASGAKDD RAGLVQALEF VRPGDVLVVW KLDRLGRSLS HLLTIVTSLK DKRVAFRSLT ENLDTTTPSG
EFLFQVFGAL AQYERALIQE RVVAGLAAAR KRGRIGGRPQ AITGEKLDAI IAALEGGMSK AAVCRNFGVK RTTLIETLTR AGWPASVPPG A

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnTsp1 2967 1569-4535 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MADGGGSKED QWVLAPAERE LVMTKNRANR LGFAILLTFF RDRGRFPRDE TEVEVQGIAA LAKQLDAPAP IDGEAFLTGR TAERLRGEIR VRFGFREATV
ADAEMLTEWL RDHVAGEVGG DIEPMIERLE GRCRELAIEP PKPDRMERIA RSALRSHEDR FHSCVYGRLP PATRERLDAL LRPEESGHGE SAVEDAQGEA
AGNAPAVLLK LRGSPGRPSL ASMQDELAKL ELIRGIELSA DLFDRTSPRD LERCRQRVSV EVPRDLRRHP DAARLTWLAA FVHLRARSLT DDLVDLLIET
IHQIGARAER KVERELLEDL KRVSGKQNLL FNLADATLAQ PDGVVRDVVF PVVGEQTLRD LVKEWKATGP TYRITLRTVI RNSYQGHYRR IVPTLLAALV
FRSNNDRHRP VMDALALVKR FADTKVHTFP ADIEVPLDGV VRGLWREAVM ETDAAGRDRV NRVTYEIAVL EALRERLRCK EIWVVGANRY RNPDDDLPAD
FEQNREDYYR ALNLPLDVER FIADLQAEMR AALSTFDAGL KKNPSVRLSS KGGGWITLTP LDAQPDPPNL TALKAELNVL WPMTSLLDMV KETDLRLSFT
DALKSPTSYE SMDRSVLQPR LLLCLHGLGT NAGLQRMAGL DSGTTARDLA YVRRRYISVD AMRRAIAIVA DGTLQARNPA IWGSGTTACA SDSKHFGAWD
QNLTTQWHVR YGGRGVMIYW HVERSSLCIH SQLKSPSSSE VASMIEGVIH HCTEMEVDRQ YVDSHGQSTV AFAFCRLLGF QLLPRLKAIH SQKLYRPETG
KADVYANLQQ ILTKPIDWDS VRQQYDQMVK YATALRLGTA DTEAILRRFT KKNVQHPTYK AFAELGKAIK TIFLCRYLHD EALRREINEG LNVVEQWNGA
TDFVFFARRG EMASNRREDH EVSMLALHLI QNCMVYINTL MIQKVLALPH WQGRFTPRDY AALTPLIWEH VNPYGRFDLD MNTRLDLP