Transposon
Name: TnTin1       (Synonyms: Tn7203)
Family: Tn3        Group: Tn4651
Evidence of Transposition: no
 Host     

Host Organism: Thiomonas intermedia K12
Date of Isolation: 2010

 Terminal Inverted Repeats (IR)     

IRL (Length: 32 bp) GAGGGTCGGCAGGGATTCATGTAAAAAACCGC
IRR (Length: 32 bp) GAGGGTCGGCAGGGATTCATGTAAAAAACCGC
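
The two terminal repeats are listed with identical sequences, which suggests that IRR is written in the same 5'-to-3' reading convention as IRL, i.e. as the reverse complement of the top strand at the right end. The following is a minimal Python sketch of that check under this assumption. The helper `reverse_complement` and the variable names are illustrative and not part of the record; `left_end` and `right_end` are copied from the first 32 and last 34 bases of the Sequence section.

```python
# Illustrative helper (an assumption, not part of the record).
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

IRL = "GAGGGTCGGCAGGGATTCATGTAAAAAACCGC"
IRR = "GAGGGTCGGCAGGGATTCATGTAAAAAACCGC"

# First 32 and last 34 bases of the element, copied from the Sequence section.
left_end = "GAGGGTCGGCAGGGATTCATGTAAAAAACCGC"
right_end = "TGCGGTTTTTTACATGAATCCCTGCCGACCCTCC"

# Top-strand form of the right repeat under the assumed convention.
irr_top_strand = reverse_complement(IRR)

print(IRL == left_end)              # left repeat sits at the very start
print(irr_top_strand)               # right repeat as it reads on the top strand
print(irr_top_strand in right_end)  # and it is found at the right end
```

If the convention holds, all three checks agree with the record: the element starts with IRL and ends with the inverted copy.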

 Sequence     
DNA Sequence (Length: 6124 bp)
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GAGGGTCGGC AGGGATTCAT GTAAAAAACC GCAGAAACAA GCTTAAGTCC TTGTCGCATC TGGTTTTTTT TCATTCCTCT TCGCCGATTT TTCGGCCCCT 100
TCAATGGGGT CCTTCGCTGA CGCATCTGGA CTGGGCACCG TAGCGCTGAA ACGCTCCAGC ATGCTGGCCA CGACCTGCTC TTGCGCCGCG ACGGATGACT 200
TGGCTGCCGC CAGCGCCAGT TCCAGCTGTT GCTTTGCCGT GATCAGATCG TTGACCTTGG CCTGCAACTG CTCGTTTGAA GCGTTGAGCT GTTGGACGGC 300
CGCCTCCTGT TCCACCAGAC GCCGGCCCAA CTCTTCGGTG CGCCTCTGAG CAAACCCCAG TTCATCCTGG AGCGGCCGTA GGCCTCGCAC TTCTTCCTGG 400
ACTTGGTGCA GATCGCCTTG GGCACGCGAC AAATCGCCTA ACAAGCGCGC GTTGTCCTGA AGGGTGTGGA CGGCTTGCTG CTGCTTGGTT GCCAGGGTCT 500
CGTTCACGGA GCGCAACTCG GCCTGCAGGT ACTGAACCTG CTGCTCATGC TTGCGCTGGT CCTGATCGCG CTGCTCTTTT GTCGATTCTC GGAAATGCTC 600
CAGTGCCTGG CGGGCATGCT GGTGCTTTTC TTCCAGGGAT TGGCGATGGC GTTCTTCGGC CGCGAGGCGC TCCTGCAGGT CGGCCACCTG CTGGGCGACC 700
TGCGTGCATT CCAGGGTCTT GCGGCTCAGC GCTTCGCTGG TTTTGCCGTG AGCGGTTTTT TCCTCGGCCA AGATGCGCTG GGTGTGCTCC AGTTGCTGGC 800
GGGTAGATTG CGCCTCCGTT TTCAGGGCCG CGGCGGCTTG TTGCTGCTGG CTCAGTTGTT CGGCGTGCCT GGCCTGCGCC GCGGTGACCC GTTCCTCGGC 900
TTCCTCGTTG ACGCGGGCGG CCAGGCGGCC CACCAGGTCC TGGATCGCCT CGCTGACGGC AATGCGGGTG CCGGTGGCCC CGCCTTCCTC TTCCTCGATC 1000
TCCTTCAGGT AACGGTGGAT CGTGCTCTTG GAGCCGGTGC CCAGCTCTTC ACGCACCGCA TCAATGGACG GATTGCGCCC CATGACCAGC AATTTATCGC 1100
GGGCGCGTAG CACTTCCGAC TTGTAGATCC CGGCCCTGGC CATGGCTCAC CTCTTTATTT CGTACCGTAG TATGTACTAT GATATTACAT ACTAGTATGG 1200
CGGTCAATAA ATTCATGATT TATCATAATC TCACATTGGG TAATAGTAGA TTATCCAATG TGAAGGGGAA TTTGGGCCGA CATCCCGCCT GCGGGCCAGG 1300
GGAGCCCAGC CTATGCTGAT CGATGGCAAT ACCCAGCTGC CGGTCTGTAG GCATTTCTGG CCGTTGCCGA TCATGTTTCA CCAGTAGCCC TCCACAAAAG 1400
GCCAGTCACT CTTGTTCGTG CCGAACGAAA ATTGACCCAC TTTTGAAGGT TATGCCGACG CAAAATTGAC CCGGGTGTTC CACCTACCCT GCTGCTTTTT 1500
TAGGCACGGG GAGCAGGAGT GATCACCATG GACATGATTG GCAAGATCCG GCGGATGCAT TTCCGCCAGA ACAAATCCGT TCGAGAGATT GCCCGCAGTA 1600
CGGGGCTGTC GCGCAACACG GTGCGTACCT GGCTACGCCA GCCGGGCGAT GTCGTACCGC GGTACGCGGC GCGGCGAACA CAACCCTCGC GCAAGCTCGC 1700
GCCCTATGTG GAGTGGCTCC GGCAGGCCGT GGCCATTGAT GCGCAGCGTC CTAAAGCGCA GCGGCGCACG GCCAGGGCAT TGCATGCCGA GTTGAAGCGA 1800
CAAGGCTATG ACGGCGCCTA CAGCCGCGTC ACCGATTTAC TGCGCGCCTG GCCACAGGCC GATGGCACGA GCGCTGTGCA CGCCTTCGTG CCCTTGACGT 1900
TTGCGCTGGG CGAAGCCTTC CAGTTCGACT GGAGTGAAGA GAGCATGGTG GTGGGCGGCG TGCCGTACCG CGTGCAGGTT GCCCACGTCA AGCTGTGCGC 2000
CAGCCGTGCC TTCTGGCTGG GGGCCTATCC CAGCCAAGGT CACGAGATGC TGTTTGACGC CCATACCCAG GCGTTTGCCG CCTTCGGCGG AGTGCCGCAT 2100
CGCGGCATCT ATGACAACAT GAAGACAGCC GTCGACAAAG TGCACCCGCG CAAGAAGCGC GACGTCAACG CGCGCTTTGC GGCGATGTGC AGCCACTACC 2200
TGTTCGACCC CGACTTCTGC AACGTCGCTT CCGGTTGGGA GAAGGGGGTG GTGGAGAAGA ACGTCCAGGA CAGCAGGCGA CGTGTGTGGA TTGAGGCCCT 2300
GTCGACGCCC TGGCGCTCCT TTGCCGAACT CAATGCCTGG CTGGCCATGC GCTGCCGTGG CTTGTGGCAG GAGATCTTCC ATCCCGAGTT CAGCCAGTTC 2400
ACCGTGGCCG AGATGCTCGA GCACGAGCAG CCGCACCTCA TGCCCATGCC CACAGCGTTC GACGGCTATG TGGACAACAC GGTCAAGGTC AGCAGCACCT 2500
GTCTGGTGGC CATGGCTGGC AATCGTTACT CCGTGCCCTG TGAACTCGCC GGGCGGCGGG TGAGCACCCG GGTGTATCCC ACCGAGGTGG TCGTGGTGCA 2600
TGACGGCGTG TGTGTGGCCC GCCATGCCCG GCTGGCCAAC CGGGGACAGA CCATCTACGA CTGGCAGCAC TATGTGCCGC TGATTCAACG CAAACCCGGG 2700
GCGTTGCGCA ACGGCGCGCC GTTTGCCGAT CTGCCCCAGC CCTTGCAGCA ACTGCGCCAG GCGCTGCTGC GGCAAGACGG GGGCGACCGG CTCATGGCCC 2800
AGGTGCTGGC GCTGGTGCCC GCATCCGGGC TGGAGGCGGT GCTGGTGGCC GCCGCGCTGG TGCTCGAAGC CACGGCCCCG TCCGGGCGCA TCAGCGTGGA 2900
GCATGTGGTC AACGTGATGG GGCGCTTGCA GACCGGCCCC CAGCCCGCCC AAGTGACCAC GGCCTTGACG ATGGCCGATC CACCCCGCGC CGACACGGCC 3000
CGCTATGACC GCCTGCGGGT GACTGACGAG GAGCGAGATC ATGCGTGAGG TCATTGCGGA ACTCAAGGCG TTGCGGCTGC ACGGCATGGC CGGCGCCTGG 3100
GCCGATTTGC AAGGTCTGGG GACGAACGCC AGGCTGGATG CCGCGCAGTG GCTCGTCGAG CATTTGCTGC AGGCCGAGCA GGAAGATCGC GCCGTGCGCT 3200
CGGTACGCCA TCAAATCCTC TCGGCCCGCT TCCCGGTGCA CCGCGATCTA GCGGGGTTTG ACTTTGATGC CTCAAGGGTT GATCGCACGC TGGTGGGGCA 3300
ACTCGCCAGC ATGGCCTTTA CCGAAGCCAC GCACAACGTC GTGCTGGTGG GCGGGCCCGG CACGGGCAAG AGCCATCTGG GCACGGCCAT CGGCGTGGCG 3400
GGCATCACGC AGCACGGCAA GCGGGTGCGG TTTTACTCCA CCGTCGATCT GGTCAACGCG CTGGAGCAGG AGAAGGCGCA GGGCAAAGCC GGGCGGATTG 3500
CGGCAAGCCT CTTGCGCATG GACTGGGTCA TTCTCTATGA GCTGGGGTAT CTGCCCTTCA GTCAGGCGGG CGGGGCTTTG TTGTTCCATC TGCTGTCCAA 3600
GCTCTACGAG CACACCAGCG TGCTGATCAC GACCAATCTG GCCTTCGGCG AGTGGTCCAG CGTGTTCATC GATGCGAAGA TGACCACCGC CTTGCTCGAC 3700
CGGCTCACAC ACCACTGCCA CATTCTGGAA ACCGGGAACG AGAGCTACCG CTTCCGTCAC AGCACGGAAA CCGCCAAGAC CCGCATCAAG GCGCGGGAGC 3800
AAAGGCGTTC GGGCACAGCA CAGGCACCCC AGTCCGCTGA GCCCGAGACA CCGCTTTGAC AGACAAACCC TTGGGGCAAG CCGCTTTGGG CTACGCCCGA 3900
CGCAGCTTGC CCCAAGGATC CAAACCCACC AACCAGGAGC CAACCCCACG CAAAAACCGT AGACTTATCA CACCCGACTG CGACGCAGTC GGCCTTAGAC 4000
CCCGGGTCAA AATTGAATCG GCACGGTGGG TCAAAATTCC ATCGGCACGA ACAGTCGGGC AGCTCAAGGA CATCCTGAGC GCCTACCAGT TGGATGGGAC 4100
CGACACCCAG CGCGTCGAAG CCATCAGCAC GACCCTGGTG GCCGAGGTGG ACGAGCTGCT GAACGAATGC GAACAGCACC TGGCCTATGC CGGGCGCAAC 4200
CACCTGCCGT TCCTGCTGCA GCCGTACAAG ATGGTGCGAG CGCAGCTGCT CAACTGCATC GACATCGCGT CGCCCAAGGC CAGCAGCGAG GACCTGGTGG 4300
TGGAGCGGCT GATGGAAGCC CTCTCCAAAC TGCGCGACAA CCGCGCCGAC ACCGTGCCAC TGGACATGCT GGGCCTGAGT GAAGACACGG ATTTGCGCTG 4400
GATGTCGGCC CAGTGGAGAA AGCTTGTGCT GGTCAAACCG GCAGGCAAGG GCCGCGCGGA GTCCGTCCGC CGTCGGTATT TTGAGCTGGC CGTCATGCAC 4500
GCCGTCAAGG ACGATCTGAA GTCCGGAGAC CTCTTCATCA AGTTCGGGGA GCGCTACGAC GACTACCGCG AGCAGTTGGT CGATGACGAA ACCTTTGAGC 4600
GCGAGCTGGG CGACTATGGG CAGGTCACGG GCATCGAGAC GGAGCCCGGC GCCTTCGTGT CGAAGCTGAA ATCTTTCATG GCGCTGCGAG CCATGGAAAT 4700
CGATGCCGGT TTTCCGGAAA ACGCCCATGC TGAAATCGTC GATGGACGCC TGATCCTCAG AAAACCGCCT CGCTCCGACA TCGTCGAGGC CGCCGCGCAC 4800
ATCGATGGCA TGATCACCGA GCGGATGGAG GCGGCCAGCA TCGTGGACGT GATGATCGAC ACCGAGCGGT GGCTGGACCT TCACAAGCTG TTTCGTCCGC 4900
TGGCGGGGAC CGACAGCCGC CTTGAGGACT TGCGCATGCG CGTGATCACG ACCTTGTTCT GCTATGGCTG CAACCTGGGA CCCGTTCAGA CCGCCAAATC 5000
GATCAAAGGG TTGAGCCGGC GTCAGATTTC CTGGCTGAAC CTGAAGTACG TCAGCGAAGA CCTTCTGGAC AAGGCCATCG TCAAGGTGAT CAACGCCTAC 5100
AACAAGTTCG AACTGCCCGG TTACTGGGGC ACCGGCAAAC ACGCGTCGGC CGATGGCACC AAGTGGAACC TGTATGAGCA GAACCTGCTG TCCGAGCACC 5200
ACATCCGCTA CGGTGGTTAC GGCGGCATCG GCTACTACCA TGTCTCCGAC AAGTTCATCG CGCTGTTCAG CCACTTCATT TCATGCGGCA CCTATGAGGG 5300
CATCCACATC CTCGATGGGC TGATGAGCAA CGAGTCCGAC ATTCGGCCCG ACACGATCCA CGGGGACACG CAGGCGCAGA GCTACCCGGT TTTTGCGCTG 5400
GCTCACCTGC TGGGGATCCA GCTCATGCCG CGCATCCGGG GCATTCAGGA CCTGAAATTC CATCGCCCGC AGGCGGGCAC CGTGTACCAG AACATCAATG 5500
CGCTGTTCAG CGACGTGATC GACTGGCAGT TGATTGAACT TCATCTGCCG GCGATGCTGC GGGTGGCGGT GTCGATCAAG ACGGGCAAGA TCACGCCATC 5600
GGCCATCCTG CGCCGACTTG GCACTTACAG CCGCAAGAAC AAGCTGTACT TCGCGTTCGT CGAACTCGGC AAAGTGATCC GGACCATGTT TCTGCTGAGT 5700
TACATCGGGG ACGTCGGACT GCGCAAGGTG ATCCACGCCG AAACCAACAA GAGCGAACAA TTCAACGGCT TTGCCGGCTG GTCGTTTTTC GGGGGAGAGG 5800
GGATCATTGC CGAGAACATC CGGCACGAGC AGCGCAAGGT GATCAAGTAC AACCACCTCG TGGCCAACAT GATCATCTTG CACAACGTGG TGGGCATGAC 5900
ACGTGTACTG CGAGAGCTTC GCGACGAAGG CACCGAGATC ACGCCGGAGA TCCTGGGAGG CTTGGCACCG TTTCGCACGG CCCACATCAA CCGGTTTGGG 6000
GACTACACGC TCGACTTCCG ACGGAAGATA GGACCGCTGG ATTTTGAAGC TACCATTATT CCAATGGAAT CATAGGGTTA GGTTCGTTTC TGCGGTTTTT 6100
TACATGAATC CCTGCCGACC CTCC
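
The sequence above is printed in blocks of 10 bases with a running coordinate at the end of each full line (61 full lines of 100 bases plus a final line of 24 bases gives the stated 6124 bp). A short Python sketch for flattening this layout and slicing it with the 1-based, inclusive coordinates used elsewhere in the record is shown below. The function names `flatten_sequence` and `subseq` are illustrative, and only the first and last sequence lines are included to keep the example small.

```python
def flatten_sequence(lines):
    """Join 10-base blocks into one string, skipping rulers and coordinates."""
    bases = []
    for line in lines:
        for token in line.split():
            if token.isalpha():  # drops "100", "--------10", and similar tokens
                bases.append(token.upper())
    return "".join(bases)

def subseq(seq, start, end):
    """Slice with 1-based, inclusive coordinates as used in the record."""
    return seq[start - 1:end]

# Only the first and last sequence lines of the record, for illustration.
sample = [
    "--------10 --------20 --------30 --------40 --------50",
    "GAGGGTCGGC AGGGATTCAT GTAAAAAACC GCAGAAACAA GCTTAAGTCC "
    "TTGTCGCATC TGGTTTTTTT TCATTCCTCT TCGCCGATTT TTCGGCCCCT 100",
    "TACATGAATC CCTGCCGACC CTCC",
]

seq = flatten_sequence(sample)
print(len(seq))            # 100 + 24 bases from the two sample lines
print(subseq(seq, 1, 32))  # the left terminal repeat
```

Applied to all sequence lines, the same function should return a string whose length matches the length stated in the section header.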

 Recombination Sites     

Name Coordinates Length Sequence
res_acc_IR1 1168-1176 9 TAGTATGTA
res_acc_IR1 1186-1194 9 TACATACTA
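
The two res_acc_IR1 entries appear to form an inverted repeat: each 9 bp arm is the reverse complement of the other, separated by a short spacer. A minimal Python sketch of that check follows; the helper `reverse_complement` and the tuple layout are illustrative choices, not part of the record.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

# (name, start, end, sequence) copied from the table above.
sites = [
    ("res_acc_IR1", 1168, 1176, "TAGTATGTA"),
    ("res_acc_IR1", 1186, 1194, "TACATACTA"),
]

(_, start1, end1, left_arm), (_, start2, end2, right_arm) = sites

is_inverted_pair = reverse_complement(left_arm) == right_arm
spacer_length = start2 - end1 - 1  # bases between the two arms

print(is_inverted_pair)
print(spacer_length)
```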

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpT TnTin1 43-1143 Accessory Gene Resolvase -
istA ISThsp2 1519-3048 Transposase   +
istB ISThsp2 3041-3859 Accessory Gene ATPase Transposition Helper +
tnpA C-ter TnTin1 4054-6075 Transposase   +
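
The coordinates in the summary are 1-based and inclusive, so a feature's length is end minus start plus one, and neighbouring ORFs can be checked for overlap directly. A small Python sketch is given below, using the coordinates from the table; the function names are illustrative.

```python
# name: (start, end, orientation), copied from the ORF summary.
orfs = {
    "tnpT": (43, 1143, "-"),
    "istA": (1519, 3048, "+"),
    "istB": (3041, 3859, "+"),
    "tnpA C-ter": (4054, 6075, "+"),
}

def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)

for name, (start, end, strand) in orfs.items():
    print(name, strand, span_length(start, end))

print(overlap(orfs["istA"], orfs["istB"]))  # istA and istB share a few bases
```

The computed lengths can be compared against the Gene Length values in the ORF Details entries.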

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpT TnpT TnTin1 1101 43-1143 -
Class:   Accessory Gene
Sub Class:   Resolvase
Comment:   enhances recombination (resolution); integrase-like protein
Protein Sequence:  
MARAGIYKSE VLRARDKLLV MGRNPSIDAV REELGTGSKS TIHRYLKEIE EEEGGATGTR IAVSEAIQDL VGRLAARVNE EAEERVTAAQ ARHAEQLSQQ
QQAAAALKTE AQSTRQQLEH TQRILAEEKT AHGKTSEALS RKTLECTQVA QQVADLQERL AAEERHRQSL EEKHQHARQA LEHFRESTKE QRDQDQRKHE
QQVQYLQAEL RSVNETLATK QQQAVHTLQD NARLLGDLSR AQGDLHQVQE EVRGLRPLQD ELGFAQRRTE ELGRRLVEQE AAVQQLNASN EQLQAKVNDL
ITAKQQLELA LAAAKSSVAA QEQVVASMLE RFSATVPSPD ASAKDPIEGA EKSAKRNEKK PDATRT

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istA IstA ISThsp2 1530 1519-3048 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
VITMDMIGKI RRMHFRQNKS VREIARSTGL SRNTVRTWLR QPGDVVPRYA ARRTQPSRKL APYVEWLRQA VAIDAQRPKA QRRTARALHA ELKRQGYDGA
YSRVTDLLRA WPQADGTSAV HAFVPLTFAL GEAFQFDWSE ESMVVGGVPY RVQVAHVKLC ASRAFWLGAY PSQGHEMLFD AHTQAFAAFG GVPHRGIYDN
MKTAVDKVHP RKKRDVNARF AAMCSHYLFD PDFCNVASGW EKGVVEKNVQ DSRRRVWIEA LSTPWRSFAE LNAWLAMRCR GLWQEIFHPE FSQFTVAEML
EHEQPHLMPM PTAFDGYVDN TVKVSSTCLV AMAGNRYSVP CELAGRRVST RVYPTEVVVV HDGVCVARHA RLANRGQTIY DWQHYVPLIQ RKPGALRNGA
PFADLPQPLQ QLRQALLRQD GGDRLMAQVL ALVPASGLEA VLVAAALVLE ATAPSGRISV EHVVNVMGRL QTGPQPAQVT TALTMADPPR ADTARYDRLR
VTDEERDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istB IstB ISThsp2 819 3041-3859 +
Class:   Accessory Gene
Sub Class:   ATPase Transposition Helper
Comment:   ATPase
Protein Sequence:  
MREVIAELKA LRLHGMAGAW ADLQGLGTNA RLDAAQWLVE HLLQAEQEDR AVRSVRHQIL SARFPVHRDL AGFDFDASRV DRTLVGQLAS MAFTEATHNV
VLVGGPGTGK SHLGTAIGVA GITQHGKRVR FYSTVDLVNA LEQEKAQGKA GRIAASLLRM DWVILYELGY LPFSQAGGAL LFHLLSKLYE HTSVLITTNL
AFGEWSSVFI DAKMTTALLD RLTHHCHILE TGNESYRFRH STETAKTRIK AREQRRSGTA QAPQSAEPET PL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA C-ter TnpA C-ter TnTin1 2022 4054-6075 +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
VGQLKDILSA YQLDGTDTQR VEAISTTLVA EVDELLNECE QHLAYAGRNH LPFLLQPYKM VRAQLLNCID IASPKASSED LVVERLMEAL SKLRDNRADT
VPLDMLGLSE DTDLRWMSAQ WRKLVLVKPA GKGRAESVRR RYFELAVMHA VKDDLKSGDL FIKFGERYDD YREQLVDDET FERELGDYGQ VTGIETEPGA
FVSKLKSFMA LRAMEIDAGF PENAHAEIVD GRLILRKPPR SDIVEAAAHI DGMITERMEA ASIVDVMIDT ERWLDLHKLF RPLAGTDSRL EDLRMRVITT
LFCYGCNLGP VQTAKSIKGL SRRQISWLNL KYVSEDLLDK AIVKVINAYN KFELPGYWGT GKHASADGTK WNLYEQNLLS EHHIRYGGYG GIGYYHVSDK
FIALFSHFIS CGTYEGIHIL DGLMSNESDI RPDTIHGDTQ AQSYPVFALA HLLGIQLMPR IRGIQDLKFH RPQAGTVYQN INALFSDVID WQLIELHLPA
MLRVAVSIKT GKITPSAILR RLGTYSRKNK LYFAFVELGK VIRTMFLLSY IGDVGLRKVI HAETNKSEQF NGFAGWSFFG GEGIIAENIR HEQRKVIKYN
HLVANMIILH NVVGMTRVLR ELRDEGTEIT PEILGGLAPF RTAHINRFGD YTLDFRRKIG PLDFEATIIP MES
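
The gene lengths and protein sequences can be cross-checked: if the ORF coordinates include the stop codon (an assumption here, not stated in the record), the number of residues should equal the gene length divided by three, minus one. A minimal Python sketch is shown below, using IstB because it is the shortest protein; the function names are illustrative.

```python
def residue_count(lines):
    """Count amino-acid letters, ignoring spaces between 10-residue blocks."""
    return sum(len(block) for line in lines for block in line.split())

def expected_residues(gene_length):
    """Codons minus one, assuming the coordinates include the stop codon."""
    if gene_length % 3:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1

# IstB protein sequence, copied from the ORF Details entry.
istb_lines = [
    "MREVIAELKA LRLHGMAGAW ADLQGLGTNA RLDAAQWLVE HLLQAEQEDR "
    "AVRSVRHQIL SARFPVHRDL AGFDFDASRV DRTLVGQLAS MAFTEATHNV",
    "VLVGGPGTGK SHLGTAIGVA GITQHGKRVR FYSTVDLVNA LEQEKAQGKA "
    "GRIAASLLRM DWVILYELGY LPFSQAGGAL LFHLLSKLYE HTSVLITTNL",
    "AFGEWSSVFI DAKMTTALLD RLTHHCHILE TGNESYRFRH STETAKTRIK "
    "AREQRRSGTA QAPQSAEPET PL",
]

# Gene lengths copied from the ORF Details entries.
gene_lengths = {"tnpT": 1101, "istA": 1530, "istB": 819, "tnpA C-ter": 2022}

for name, length in gene_lengths.items():
    print(name, expected_residues(length))

print(residue_count(istb_lines) == expected_residues(gene_lengths["istB"]))
```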

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
ISThsp2-MH257753 ISThsp2 Insertion Sequence 1413-4053 2641
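
The internal element's coordinates can be used to tell which ORFs belong to the inserted ISThsp2 and which belong to TnTin1 itself. The Python sketch below is a simple interval-containment check using the coordinates from this record; the function name `contains` and the data layout are illustrative.

```python
# Internal element and ORF coordinates, copied from the record.
internal_te = ("ISThsp2", 1413, 4053)
orfs = {
    "tnpT": (43, 1143),
    "istA": (1519, 3048),
    "istB": (3041, 3859),
    "tnpA C-ter": (4054, 6075),
}

def contains(outer, inner):
    """True if the inclusive interval `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

te_name, te_start, te_end = internal_te
te_length = te_end - te_start + 1

inside = [name for name, span in orfs.items()
          if contains((te_start, te_end), span)]
outside = [name for name in orfs if name not in inside]

print(te_name, te_length)
print("inside:", inside)
print("outside:", outside)
# Bases between the end of the insertion and the start of tnpA C-ter.
print(orfs["tnpA C-ter"][0] - te_end - 1)
```

With these coordinates, the ORFs annotated as ISThsp2-associated fall inside the insertion, and the tnpA C-ter ORF begins immediately after it.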

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRL ISThsp2 1413-1440 TGTTCGTGCC GAACGAAAAT TGACCCAC
IRR ISThsp2 4026-4053 CACCCAGTTT TAAGGTAGCC GTGCTTGT
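
Read from right to left, the IRR entry lines up with the IRL entry, which suggests the two ends of ISThsp2 are imperfect inverted copies of each other. The Python sketch below counts matching positions under that assumed reading; the orientation convention is an assumption and the variable names are illustrative.

```python
# Sequences copied from the table above, with the block spacing removed.
IRL = "TGTTCGTGCC GAACGAAAAT TGACCCAC".replace(" ", "")
IRR = "CACCCAGTTT TAAGGTAGCC GTGCTTGT".replace(" ", "")

# Assumption: the IRR entry is compared to IRL after reading it right to left.
irr_reversed = IRR[::-1]

matches = sum(a == b for a, b in zip(IRL, irr_reversed))
mismatch_positions = [i + 1 for i, (a, b) in enumerate(zip(IRL, irr_reversed))
                      if a != b]

print(len(IRL), len(IRR))
print(f"{matches} of {len(IRL)} positions match")
print("mismatches at repeat positions:", mismatch_positions)
```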