|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
References | |
|
|
|
|
|
|
|
|
|
Name: TnTin1 (Synonyms: Tn7203) |
|
Family: Tn3 Group: Tn4651 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Thiomonas intermedia K12 | | |
| | Date of Isolation: | 2010 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 32 bp) | | GAGGGTCGGCAGGGATTCATGTAAAAAACCGC |
IRR (Length: 32 bp) | | GAGGGTCGGCAGGGATTCATGTAAAAAACCGC |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GAGGGTCGGC AGGGATTCAT GTAAAAAACC GCAGAAACAA GCTTAAGTCC TTGTCGCATC TGGTTTTTTT TCATTCCTCT TCGCCGATTT TTCGGCCCCT 100
TCAATGGGGT CCTTCGCTGA CGCATCTGGA CTGGGCACCG TAGCGCTGAA ACGCTCCAGC ATGCTGGCCA CGACCTGCTC TTGCGCCGCG ACGGATGACT 200
TGGCTGCCGC CAGCGCCAGT TCCAGCTGTT GCTTTGCCGT GATCAGATCG TTGACCTTGG CCTGCAACTG CTCGTTTGAA GCGTTGAGCT GTTGGACGGC 300
CGCCTCCTGT TCCACCAGAC GCCGGCCCAA CTCTTCGGTG CGCCTCTGAG CAAACCCCAG TTCATCCTGG AGCGGCCGTA GGCCTCGCAC TTCTTCCTGG 400
ACTTGGTGCA GATCGCCTTG GGCACGCGAC AAATCGCCTA ACAAGCGCGC GTTGTCCTGA AGGGTGTGGA CGGCTTGCTG CTGCTTGGTT GCCAGGGTCT 500
CGTTCACGGA GCGCAACTCG GCCTGCAGGT ACTGAACCTG CTGCTCATGC TTGCGCTGGT CCTGATCGCG CTGCTCTTTT GTCGATTCTC GGAAATGCTC 600
CAGTGCCTGG CGGGCATGCT GGTGCTTTTC TTCCAGGGAT TGGCGATGGC GTTCTTCGGC CGCGAGGCGC TCCTGCAGGT CGGCCACCTG CTGGGCGACC 700
TGCGTGCATT CCAGGGTCTT GCGGCTCAGC GCTTCGCTGG TTTTGCCGTG AGCGGTTTTT TCCTCGGCCA AGATGCGCTG GGTGTGCTCC AGTTGCTGGC 800
GGGTAGATTG CGCCTCCGTT TTCAGGGCCG CGGCGGCTTG TTGCTGCTGG CTCAGTTGTT CGGCGTGCCT GGCCTGCGCC GCGGTGACCC GTTCCTCGGC 900
TTCCTCGTTG ACGCGGGCGG CCAGGCGGCC CACCAGGTCC TGGATCGCCT CGCTGACGGC AATGCGGGTG CCGGTGGCCC CGCCTTCCTC TTCCTCGATC 1000
TCCTTCAGGT AACGGTGGAT CGTGCTCTTG GAGCCGGTGC CCAGCTCTTC ACGCACCGCA TCAATGGACG GATTGCGCCC CATGACCAGC AATTTATCGC 1100
GGGCGCGTAG CACTTCCGAC TTGTAGATCC CGGCCCTGGC CATGGCTCAC CTCTTTATTT CGTACCGTAG TATGTACTAT GATATTACAT ACTAGTATGG 1200
CGGTCAATAA ATTCATGATT TATCATAATC TCACATTGGG TAATAGTAGA TTATCCAATG TGAAGGGGAA TTTGGGCCGA CATCCCGCCT GCGGGCCAGG 1300
GGAGCCCAGC CTATGCTGAT CGATGGCAAT ACCCAGCTGC CGGTCTGTAG GCATTTCTGG CCGTTGCCGA TCATGTTTCA CCAGTAGCCC TCCACAAAAG 1400
GCCAGTCACT CTTGTTCGTG CCGAACGAAA ATTGACCCAC TTTTGAAGGT TATGCCGACG CAAAATTGAC CCGGGTGTTC CACCTACCCT GCTGCTTTTT 1500
TAGGCACGGG GAGCAGGAGT GATCACCATG GACATGATTG GCAAGATCCG GCGGATGCAT TTCCGCCAGA ACAAATCCGT TCGAGAGATT GCCCGCAGTA 1600
CGGGGCTGTC GCGCAACACG GTGCGTACCT GGCTACGCCA GCCGGGCGAT GTCGTACCGC GGTACGCGGC GCGGCGAACA CAACCCTCGC GCAAGCTCGC 1700
GCCCTATGTG GAGTGGCTCC GGCAGGCCGT GGCCATTGAT GCGCAGCGTC CTAAAGCGCA GCGGCGCACG GCCAGGGCAT TGCATGCCGA GTTGAAGCGA 1800
CAAGGCTATG ACGGCGCCTA CAGCCGCGTC ACCGATTTAC TGCGCGCCTG GCCACAGGCC GATGGCACGA GCGCTGTGCA CGCCTTCGTG CCCTTGACGT 1900
TTGCGCTGGG CGAAGCCTTC CAGTTCGACT GGAGTGAAGA GAGCATGGTG GTGGGCGGCG TGCCGTACCG CGTGCAGGTT GCCCACGTCA AGCTGTGCGC 2000
CAGCCGTGCC TTCTGGCTGG GGGCCTATCC CAGCCAAGGT CACGAGATGC TGTTTGACGC CCATACCCAG GCGTTTGCCG CCTTCGGCGG AGTGCCGCAT 2100
CGCGGCATCT ATGACAACAT GAAGACAGCC GTCGACAAAG TGCACCCGCG CAAGAAGCGC GACGTCAACG CGCGCTTTGC GGCGATGTGC AGCCACTACC 2200
TGTTCGACCC CGACTTCTGC AACGTCGCTT CCGGTTGGGA GAAGGGGGTG GTGGAGAAGA ACGTCCAGGA CAGCAGGCGA CGTGTGTGGA TTGAGGCCCT 2300
GTCGACGCCC TGGCGCTCCT TTGCCGAACT CAATGCCTGG CTGGCCATGC GCTGCCGTGG CTTGTGGCAG GAGATCTTCC ATCCCGAGTT CAGCCAGTTC 2400
ACCGTGGCCG AGATGCTCGA GCACGAGCAG CCGCACCTCA TGCCCATGCC CACAGCGTTC GACGGCTATG TGGACAACAC GGTCAAGGTC AGCAGCACCT 2500
GTCTGGTGGC CATGGCTGGC AATCGTTACT CCGTGCCCTG TGAACTCGCC GGGCGGCGGG TGAGCACCCG GGTGTATCCC ACCGAGGTGG TCGTGGTGCA 2600
TGACGGCGTG TGTGTGGCCC GCCATGCCCG GCTGGCCAAC CGGGGACAGA CCATCTACGA CTGGCAGCAC TATGTGCCGC TGATTCAACG CAAACCCGGG 2700
GCGTTGCGCA ACGGCGCGCC GTTTGCCGAT CTGCCCCAGC CCTTGCAGCA ACTGCGCCAG GCGCTGCTGC GGCAAGACGG GGGCGACCGG CTCATGGCCC 2800
AGGTGCTGGC GCTGGTGCCC GCATCCGGGC TGGAGGCGGT GCTGGTGGCC GCCGCGCTGG TGCTCGAAGC CACGGCCCCG TCCGGGCGCA TCAGCGTGGA 2900
GCATGTGGTC AACGTGATGG GGCGCTTGCA GACCGGCCCC CAGCCCGCCC AAGTGACCAC GGCCTTGACG ATGGCCGATC CACCCCGCGC CGACACGGCC 3000
CGCTATGACC GCCTGCGGGT GACTGACGAG GAGCGAGATC ATGCGTGAGG TCATTGCGGA ACTCAAGGCG TTGCGGCTGC ACGGCATGGC CGGCGCCTGG 3100
GCCGATTTGC AAGGTCTGGG GACGAACGCC AGGCTGGATG CCGCGCAGTG GCTCGTCGAG CATTTGCTGC AGGCCGAGCA GGAAGATCGC GCCGTGCGCT 3200
CGGTACGCCA TCAAATCCTC TCGGCCCGCT TCCCGGTGCA CCGCGATCTA GCGGGGTTTG ACTTTGATGC CTCAAGGGTT GATCGCACGC TGGTGGGGCA 3300
ACTCGCCAGC ATGGCCTTTA CCGAAGCCAC GCACAACGTC GTGCTGGTGG GCGGGCCCGG CACGGGCAAG AGCCATCTGG GCACGGCCAT CGGCGTGGCG 3400
GGCATCACGC AGCACGGCAA GCGGGTGCGG TTTTACTCCA CCGTCGATCT GGTCAACGCG CTGGAGCAGG AGAAGGCGCA GGGCAAAGCC GGGCGGATTG 3500
CGGCAAGCCT CTTGCGCATG GACTGGGTCA TTCTCTATGA GCTGGGGTAT CTGCCCTTCA GTCAGGCGGG CGGGGCTTTG TTGTTCCATC TGCTGTCCAA 3600
GCTCTACGAG CACACCAGCG TGCTGATCAC GACCAATCTG GCCTTCGGCG AGTGGTCCAG CGTGTTCATC GATGCGAAGA TGACCACCGC CTTGCTCGAC 3700
CGGCTCACAC ACCACTGCCA CATTCTGGAA ACCGGGAACG AGAGCTACCG CTTCCGTCAC AGCACGGAAA CCGCCAAGAC CCGCATCAAG GCGCGGGAGC 3800
AAAGGCGTTC GGGCACAGCA CAGGCACCCC AGTCCGCTGA GCCCGAGACA CCGCTTTGAC AGACAAACCC TTGGGGCAAG CCGCTTTGGG CTACGCCCGA 3900
CGCAGCTTGC CCCAAGGATC CAAACCCACC AACCAGGAGC CAACCCCACG CAAAAACCGT AGACTTATCA CACCCGACTG CGACGCAGTC GGCCTTAGAC 4000
CCCGGGTCAA AATTGAATCG GCACGGTGGG TCAAAATTCC ATCGGCACGA ACAGTCGGGC AGCTCAAGGA CATCCTGAGC GCCTACCAGT TGGATGGGAC 4100
CGACACCCAG CGCGTCGAAG CCATCAGCAC GACCCTGGTG GCCGAGGTGG ACGAGCTGCT GAACGAATGC GAACAGCACC TGGCCTATGC CGGGCGCAAC 4200
CACCTGCCGT TCCTGCTGCA GCCGTACAAG ATGGTGCGAG CGCAGCTGCT CAACTGCATC GACATCGCGT CGCCCAAGGC CAGCAGCGAG GACCTGGTGG 4300
TGGAGCGGCT GATGGAAGCC CTCTCCAAAC TGCGCGACAA CCGCGCCGAC ACCGTGCCAC TGGACATGCT GGGCCTGAGT GAAGACACGG ATTTGCGCTG 4400
GATGTCGGCC CAGTGGAGAA AGCTTGTGCT GGTCAAACCG GCAGGCAAGG GCCGCGCGGA GTCCGTCCGC CGTCGGTATT TTGAGCTGGC CGTCATGCAC 4500
GCCGTCAAGG ACGATCTGAA GTCCGGAGAC CTCTTCATCA AGTTCGGGGA GCGCTACGAC GACTACCGCG AGCAGTTGGT CGATGACGAA ACCTTTGAGC 4600
GCGAGCTGGG CGACTATGGG CAGGTCACGG GCATCGAGAC GGAGCCCGGC GCCTTCGTGT CGAAGCTGAA ATCTTTCATG GCGCTGCGAG CCATGGAAAT 4700
CGATGCCGGT TTTCCGGAAA ACGCCCATGC TGAAATCGTC GATGGACGCC TGATCCTCAG AAAACCGCCT CGCTCCGACA TCGTCGAGGC CGCCGCGCAC 4800
ATCGATGGCA TGATCACCGA GCGGATGGAG GCGGCCAGCA TCGTGGACGT GATGATCGAC ACCGAGCGGT GGCTGGACCT TCACAAGCTG TTTCGTCCGC 4900
TGGCGGGGAC CGACAGCCGC CTTGAGGACT TGCGCATGCG CGTGATCACG ACCTTGTTCT GCTATGGCTG CAACCTGGGA CCCGTTCAGA CCGCCAAATC 5000
GATCAAAGGG TTGAGCCGGC GTCAGATTTC CTGGCTGAAC CTGAAGTACG TCAGCGAAGA CCTTCTGGAC AAGGCCATCG TCAAGGTGAT CAACGCCTAC 5100
AACAAGTTCG AACTGCCCGG TTACTGGGGC ACCGGCAAAC ACGCGTCGGC CGATGGCACC AAGTGGAACC TGTATGAGCA GAACCTGCTG TCCGAGCACC 5200
ACATCCGCTA CGGTGGTTAC GGCGGCATCG GCTACTACCA TGTCTCCGAC AAGTTCATCG CGCTGTTCAG CCACTTCATT TCATGCGGCA CCTATGAGGG 5300
CATCCACATC CTCGATGGGC TGATGAGCAA CGAGTCCGAC ATTCGGCCCG ACACGATCCA CGGGGACACG CAGGCGCAGA GCTACCCGGT TTTTGCGCTG 5400
GCTCACCTGC TGGGGATCCA GCTCATGCCG CGCATCCGGG GCATTCAGGA CCTGAAATTC CATCGCCCGC AGGCGGGCAC CGTGTACCAG AACATCAATG 5500
CGCTGTTCAG CGACGTGATC GACTGGCAGT TGATTGAACT TCATCTGCCG GCGATGCTGC GGGTGGCGGT GTCGATCAAG ACGGGCAAGA TCACGCCATC 5600
GGCCATCCTG CGCCGACTTG GCACTTACAG CCGCAAGAAC AAGCTGTACT TCGCGTTCGT CGAACTCGGC AAAGTGATCC GGACCATGTT TCTGCTGAGT 5700
TACATCGGGG ACGTCGGACT GCGCAAGGTG ATCCACGCCG AAACCAACAA GAGCGAACAA TTCAACGGCT TTGCCGGCTG GTCGTTTTTC GGGGGAGAGG 5800
GGATCATTGC CGAGAACATC CGGCACGAGC AGCGCAAGGT GATCAAGTAC AACCACCTCG TGGCCAACAT GATCATCTTG CACAACGTGG TGGGCATGAC 5900
ACGTGTACTG CGAGAGCTTC GCGACGAAGG CACCGAGATC ACGCCGGAGA TCCTGGGAGG CTTGGCACCG TTTCGCACGG CCCACATCAA CCGGTTTGGG 6000
GACTACACGC TCGACTTCCG ACGGAAGATA GGACCGCTGG ATTTTGAAGC TACCATTATT CCAATGGAAT CATAGGGTTA GGTTCGTTTC TGCGGTTTTT 6100
TACATGAATC CCTGCCGACC CTCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res_acc_IR1 |
1168-1176 |
9 |
TAGTATGTA |
res_acc_IR1 |
1186-1194 |
9 |
TACATACTA |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tnpT |
TnTin1 |
43-1143 |
Accessory Gene |
Resolvase |
- |
istA |
ISThsp2 |
1519-3048 |
Transposase |
|
+ |
istB |
ISThsp2 |
3041-3859 |
Accessory Gene |
ATPase Transposition Helper |
+ |
tnpA C-ter |
TnTin1 |
4054-6075 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpT |
TnpT |
TnTin1 |
1101 |
43-1143 |
- |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Comment: | enhances recombination (resolution)||integrase-like protein |
Protein Sequence:
|
MARAGIYKSE VLRARDKLLV MGRNPSIDAV REELGTGSKS TIHRYLKEIE EEEGGATGTR IAVSEAIQDL VGRLAARVNE EAEERVTAAQ ARHAEQLSQQ QQAAAALKTE AQSTRQQLEH TQRILAEEKT AHGKTSEALS RKTLECTQVA QQVADLQERL AAEERHRQSL EEKHQHARQA LEHFRESTKE QRDQDQRKHE QQVQYLQAEL RSVNETLATK QQQAVHTLQD NARLLGDLSR AQGDLHQVQE EVRGLRPLQD ELGFAQRRTE ELGRRLVEQE AAVQQLNASN EQLQAKVNDL ITAKQQLELA LAAAKSSVAA QEQVVASMLE RFSATVPSPD ASAKDPIEGA EKSAKRNEKK PDATRT
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
istA |
IstA |
ISThsp2 |
1530 |
1519-3048 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
VITMDMIGKI RRMHFRQNKS VREIARSTGL SRNTVRTWLR QPGDVVPRYA ARRTQPSRKL APYVEWLRQA VAIDAQRPKA QRRTARALHA ELKRQGYDGA YSRVTDLLRA WPQADGTSAV HAFVPLTFAL GEAFQFDWSE ESMVVGGVPY RVQVAHVKLC ASRAFWLGAY PSQGHEMLFD AHTQAFAAFG GVPHRGIYDN MKTAVDKVHP RKKRDVNARF AAMCSHYLFD PDFCNVASGW EKGVVEKNVQ DSRRRVWIEA LSTPWRSFAE LNAWLAMRCR GLWQEIFHPE FSQFTVAEML EHEQPHLMPM PTAFDGYVDN TVKVSSTCLV AMAGNRYSVP CELAGRRVST RVYPTEVVVV HDGVCVARHA RLANRGQTIY DWQHYVPLIQ RKPGALRNGA PFADLPQPLQ QLRQALLRQD GGDRLMAQVL ALVPASGLEA VLVAAALVLE ATAPSGRISV EHVVNVMGRL QTGPQPAQVT TALTMADPPR ADTARYDRLR VTDEERDHA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
istB |
IstB |
ISThsp2 |
819 |
3041-3859 |
+ |
Class: | Accessory Gene |
Sub Class: | ATPase Transposition Helper |
Comment: | ATPase |
Protein Sequence:
|
MREVIAELKA LRLHGMAGAW ADLQGLGTNA RLDAAQWLVE HLLQAEQEDR AVRSVRHQIL SARFPVHRDL AGFDFDASRV DRTLVGQLAS MAFTEATHNV VLVGGPGTGK SHLGTAIGVA GITQHGKRVR FYSTVDLVNA LEQEKAQGKA GRIAASLLRM DWVILYELGY LPFSQAGGAL LFHLLSKLYE HTSVLITTNL AFGEWSSVFI DAKMTTALLD RLTHHCHILE TGNESYRFRH STETAKTRIK AREQRRSGTA QAPQSAEPET PL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA C-ter |
TnpA C-ter |
TnTin1 |
2022 |
4054-6075 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
VGQLKDILSA YQLDGTDTQR VEAISTTLVA EVDELLNECE QHLAYAGRNH LPFLLQPYKM VRAQLLNCID IASPKASSED LVVERLMEAL SKLRDNRADT VPLDMLGLSE DTDLRWMSAQ WRKLVLVKPA GKGRAESVRR RYFELAVMHA VKDDLKSGDL FIKFGERYDD YREQLVDDET FERELGDYGQ VTGIETEPGA FVSKLKSFMA LRAMEIDAGF PENAHAEIVD GRLILRKPPR SDIVEAAAHI DGMITERMEA ASIVDVMIDT ERWLDLHKLF RPLAGTDSRL EDLRMRVITT LFCYGCNLGP VQTAKSIKGL SRRQISWLNL KYVSEDLLDK AIVKVINAYN KFELPGYWGT GKHASADGTK WNLYEQNLLS EHHIRYGGYG GIGYYHVSDK FIALFSHFIS CGTYEGIHIL DGLMSNESDI RPDTIHGDTQ AQSYPVFALA HLLGIQLMPR IRGIQDLKFH RPQAGTVYQN INALFSDVID WQLIELHLPA MLRVAVSIKT GKITPSAILR RLGTYSRKNK LYFAFVELGK VIRTMFLLSY IGDVGLRKVI HAETNKSEQF NGFAGWSFFG GEGIIAENIR HEQRKVIKYN HLVANMIILH NVVGMTRVLR ELRDEGTEIT PEILGGLAPF RTAHINRFGD YTLDFRRKIG PLDFEATIIP MES
|
|
Internal Transposable Elements (TE) |
|
|
TnCentral Accession |
TE Name |
Type |
Coordinates |
Length |
ISThsp2-MH257753 |
ISThsp2 |
Insertion Sequence |
1413-4053 |
2641 |
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
IRL |
ISThsp2 |
1413-1440 |
TGTTCGTGCC GAACGAAAAT TGACCCAC |
IRR |
ISThsp2 |
4026-4053 |
CACCCAGTTT TAAGGTAGCC GTGCTTGT |
|
|