Transposon
Name: TnSku1       (Synonyms: Tn7197)
Family: Tn3        Group: Tn4430
Evidence of Transposition: no
 Host     

Host Organism: Sulfuricurvum kujiense DSM 16994        Molecular Source: plasmid pSULKU03
Place of Origin: Kuji in Iwate, Japan        Date of Isolation: 2006
Other Geographic Information: drain water from an underground crude-oil storage cavity 1999

 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp): GGGGTCACGTGCCGAAACGGATAGAATCGTACGCTAAG
IRR (Length: 38 bp): GGGGTCACGTGCCGAAACGGATAGAATCGTACGCTAAG
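
IRL and IRR are listed in the same reading direction, so on the top strand the right end of the element should read as the reverse complement of the repeat. The following is a minimal Python sketch of that check. The helper name `reverse_complement` is illustrative and not part of any database tooling, and `RIGHT_END` is copied by hand from the last 38 bases (positions 4447-4484) of the DNA sequence in this record.

```python
# Illustrative check only; the function and constant names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

IRL = "GGGGTCACGTGCCGAAACGGATAGAATCGTACGCTAAG"
IRR = "GGGGTCACGTGCCGAAACGGATAGAATCGTACGCTAAG"
# Last 38 bases of the top-strand sequence as printed in this record.
RIGHT_END = "CTTAGCGTACGATTCTATCCGTTTCGGCACGTGACCCC"

# Both repeats are 38 bp and identical as listed.
print(len(IRL), len(IRR), IRL == IRR)
# The top-strand right end is the reverse complement of the listed repeat.
print(reverse_complement(IRR) == RIGHT_END)
```

If both comparisons hold, the two ends form a perfect 38 bp inverted repeat with no mismatches.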

 Sequence     
DNA Sequence (Length: 4484 bp)
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCACGT GCCGAAACGG ATAGAATCGT ACGCTAAGCA AATATTACAA TAGATTTTCC CGATCATTTT CTAAAATATA CCCTTCTTTC TTGACACCAT 100
GTATCTAACT GGATACAATT TGTGTATGTA TAACATTCAA CAAACCCAAA AGTTTTCACA ATGGCTGATA AAGCTAAAAG ATATGCGAGC CCGTATCGCA 200
ATCGCACGTC GTATTGAACG TGCTCAATGT GGTAATCTTG GGGATGCTAA AAGTGTTGGT GAAAGTGTTT ACGAGATGCG TATCGATATG GGTCCGGGTT 300
ATCGGCTCTA TTATACGATG CGTGGAGACG AAATAATTGT CTTGCTTGTT GGTGGCGATA AGTCGACTCA ACAACGCGAT ATTGAAAAAG CCATAGAGAT 400
GGCAAAGGAG ATATAACGAT GGAACTGACA TTAACCCCAT TTGATATGAC ACAGTATTTG GATAGCGAAG AGGCGATTGC CGAATATCTA ACCCAAGTCC 500
TCGAAGAAGG TGATACCGAT GAGCTGATCC GTGCTATCGG GCATATCGCA AAAGCCCGTG GGATGAGTCA AATCGCACAA GAGAGTGGTC TTGGTCGTGA 600
AAGCCTCTAT AAAACATTTG CACCCGGCTC AAAACCTCGT TTCGATACCA TTGCAAAAGT AGTGCATTCA CTTGGATTGC ATTTTCAGAT TACACCGGTG 700
CGGGCATCGT AATCAGGGAG TCTCTTCAAT TCCCAATACT TTCTTTCCAA ATTCTCGTAA CTCTCCAGTT GGTCCGACAT AACGATAGAG TGTCGCACGT 800
GTAACACCGA TCTCTTTACA CAACTCAGAT ACCGACGTAT CGCGATTAGA CATTGCTGCT TGTGCTAAAC GGATTTGCGC TTTGGTGAGC TGAAATTTTC 900
TTCCCCCTTT TCGTCCACGT GCGCGAGCAG CATTAAGTCC AGCAATAGTG CGTTCACGAA TAAGCTCACG TTCAAACTCA GCCAATGCAG CAAATATTCC 1000
AAATACCATT TTTCCCGCAG GTGTTGTCGT ATCGATATCG GCACCTTGAC CACTGAGAAC TTTGAATCCA ATATTACGCC TGGACAGATC ATCGACAGTT 1100
GAGACCAAAT GTTTGAGATT TCGTCCCAAC CGATCCAGTT TCCAAACGAC CAACGTATCC CCTTCGCGCA ACGCTTTAAG GCAGTTATCC AAACCGGGAC 1200
GATCGTCTTT GGCTCCGGAA GCTTTATCGA CATATATTTG TCCGTACGTG ATTCCAATAC CTTGAAGTGC ATCGATCTGC AAATCCAATA CCTGATTTCC 1300
ATCCGCCTTG GAGACACGTG CATAACCAAT CAGCATTTTT GTCCTTTTAG TGCCGTATCA TAAACGAAGG TTTGTGATAC ATATGAATTT CTTTCTGAAC 1400
AGATAAGAAT AATTGCTTCA TAAACAGTAT ATTATTCATA GTATTATAAA CAACATAATA TTATATTACT AAACACTGTG AAGGAACCTA TATGCCACGT 1500
AAAAAAATTC TTTCATCATT ACAGCGCGAA ATGCTAGAAA AGTTGCCAGA GCAATTTGAG GAATTAGCAA AACACTACCT CCTAACTTCA GATGATCTTG 1600
AATTTATATC TCAGCGACGA CGCTCTTCTA ATAAATTAGG GTTTGCACTT TCTTTGTGCA TGATGCGCTA TCCGGGGCGC ACGCTGAAAA TCGATGAAAT 1700
TCCACCCAAA GCTTTGGTTG AATTTATTGC GGAACAACTT GGATTGAACT ATGAAGATTT TTTGGGATAC GCATTGCGTG CAGAGACCAG ACGTGAGCAC 1800
CAATACTTGT TGATGCAAAA GTATGGATAC AAAAATTTCA ACGATCGATA TTTGAAAGAG ATGGTTCGCT GGTTAACCCC AATCGCAATT GAAAACCACA 1900
AAGGCATTTT TTTATTACAG ACTTTGATGC ATGAAATGCG GTATCGAAAA ATTCTTTTGC CTTCGTTGCT GCAAATAGAA TCAATGGTAT CACTGTCTCT 2000
GAGTCGTGCT GAAAAAATTA TTCAACAAAC CGTGACATCA TTTTTAAATA AGAAGAACAT GCAATTACTC GATGACTTGC TTGCGTTTGA CGTGTCTGTC 2100
GGCCAAAGTA TTTTTTCATG GCTTAGGCAA AGTGAGTCAC GTCCAACATC CGCTAATATG CTCGTTATTA TGGAACGGCT GCAAACAATC AAGAGGCTTG 2200
AAATTTCCGA AACTATTTCT AGTAGAGTAT CGCAAATCCG AATGCAGCAA TACAGTCGTG AAGGGTACCG AATCTCCATT AGCCATCTTC GTCAGTATGA 2300
ACCAAAGCGC CGTTATACCC TCATAGCCAT TACTCTTCTT GAAATTCAAA AAGCTCTTAT TGATGATGCG ATCACTATGC ATGATCGTAT TATTGGTTCA 2400
CTGATGCGGC GAAGTCAGAA CCGATTTCAT CAAACAATCC AAGATGGGGA CAAAAAGGCA AAAGAGCTTA TACGGTTTTT CTCCCGCACC TCACAACTTT 2500
TGATTGAATC TAGAAACAAC GGAGCAAATC CATGGGAATC TATCGATTTG CACATCGGAT GGGATTCGTT TTTGCAGTCT GTACAAGAGG CTACTAGCCT 2600
CTCAGAACCT ATCAAACAAG ACCATTTGGC TTTCATAGAA ACCCACTACG TACAGATGCG CAAATATACA TCCCGTTTTT TAGAAGAGTT TGATTTTAAA 2700
GCCTCCGCGG GCGGAGGGGA TGTCATGGAT GCGATCAGAT TGCTTCAAGA ACTCAATTCA AAAAATAAAC GAAAACTCCC CGAGGATGCG CCGGTTACTT 2800
TTATCCCCTC TCGATGGAAA CCTTTTATCT ATACTGAGGA AGGGATGGAT CGGCGATATT ATGAACTGTG TGCACTCAGC GAGCTAAAAA ATAGACTCCG 2900
TTCAGGAGAC ATTTGGATTG AGGGGAGTAA ACAATATCTT GATTTCGACA GCTATTTGAT GACACGTCAT AGCTACAGCG AGATGAAATC TCAAAACAAT 3000
CTACCTATCG CCGTGGAAAT CGACTTTTCA AGGTATTTGA TAAATCGCCA AAAGCAACTG GGTGAGAGTA TCGATGAGCT TTCAGATATG ATTAAAGACG 3100
GTACAATTGC CGATGTTGAG TTTACACAAA ATGGTTATTC GATTAAACCT TATCGCACAG ACGGCGCTCC AAAAGAAGCG ACTAATATCA TACGCTATGT 3200
AGCCAATTTA CTCCCACGAA TTAAAATAAC CGACTTGCTC ATCGAGGTAG ATACTTGGAC AAAATTCTCC GATCAGTTTA CCCATCTGCG CTCTGGAATG 3300
ACATCTACAG ATAAAAAAGC ACTATTTGCG GCGATACTAT CTGATGGTAT CAATCTAGGG CATCAAAAGA TGGCGGATGC TTGCGCCGAT ATATCCGTCG 3400
GAAGACTTAG AAGTGCATCT GATTGGTATA TTCGGGATGA AAATTATTCA AGTGCATTGG CGCAAATAAC CAACCATCAT CACCAAACTT CGCTGGTCAA 3500
GAATTGGGGA GACGGTACAA CCTCTTCATC AGATGGTCAG CGATTTCCCA CCGGAGGGAC AGGGGACGCG TTAGGTGGGT ACAACGCAAA GTACGGCACT 3600
CAGCCCGGAG TGGTATTTTA CACTCATATC AGTGATCAGT ATTCTCCATT TCATACCCAG ATTATCCACG CAAATGCCAG AGATGCTACA TATGTTCTTG 3700
ACGGACTGCT TTATCACGAA TCAGATTTAA AGATTGAAGA ACATTATACC GATACAGCTG GTTTCACTGA TCATGTGTTT GCGTTGTGTC ATCTGCTTGG 3800
ATTTAAATTT GCTCCCAGAA TTCGAGATAT AGCCGATACG AAGCTCTTTG CATTAGAAGG AGTTGAAGTT CCAGAGAAAT TTAACAATAT CATCACCGGT 3900
TCCCTCAAAA TAAAAGATAT GCAGCAAAAC TGGGATGATG TCCTTAGATT AGCCGCTTCT ATCAAAATAG GTACAGCTTC AGCGTCACTG TTGATCCGCA 4000
AGCTTGCGTC TTATCCAAGG CAAAATCGTC TTGCACTTGC TATTCGCGAT GTTGGTCGGA TTGAACGAAC ACTTTTTATA CTTGAATGGC TCAAGAACCC 4100
ATCACTCCGG AGTAGAGTCC AAATTGGTCT CAACAAAGGT GAAGCGAGAA ATTCCTTGGC GCGTACCGTA TTTTTTAATC GATTAGGGGA AGTTCGTGAT 4200
AGAACTTTCG AAAATCAACT ATACCGTGCA TCTGGTCTGA ACCTGATCGT GGCTGCGATT ATTCTTTGGA ATACTGTCTA TTTGGAGAAA GCGATTGAAC 4300
AGGCTAAAAA AGAAATTGAA ATTCCGGATG AGTTTTTGAA GTATTTTTCA CCTCTAGGAT GGGAGCATAT TAATCTGACA GGCGATTATG TCTGGCGTAT 4400
GGAGCCTTAG CGTACGAAAA AATGATAAAA TTTACAGTTT TTAGCTCTTA GCGTACGATT CTATCCGTTT CGGCACGTGA CCCC
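
The sequence above is printed in blocks of 10 bases with a running coordinate at the end of each full line. As a rough sketch, assuming the text is available as a plain string, the blocks can be joined into one contiguous sequence by keeping only the base letters. The function name `join_sequence_blocks` is hypothetical, and the sample holds only the first and last lines of the record for brevity.

```python
import re

def join_sequence_blocks(text: str) -> str:
    """Keep only runs of A/C/G/T, dropping spaces, newlines and coordinates."""
    return "".join(re.findall(r"[ACGT]+", text.upper()))

# First and last sequence lines of this record, copied verbatim.
sample = """
GGGGTCACGT GCCGAAACGG ATAGAATCGT ACGCTAAGCA AATATTACAA TAGATTTTCC CGATCATTTT CTAAAATATA CCCTTCTTTC TTGACACCAT 100
GGAGCCTTAG CGTACGAAAA AATGATAAAA TTTACAGTTT TTAGCTCTTA GCGTACGATT CTATCCGTTT CGGCACGTGA CCCC
"""

seq = join_sequence_blocks(sample)
print(len(seq))    # number of bases recovered from the two sample lines
print(seq[:38])    # left terminal inverted repeat
print(seq[-38:])   # right end on the top strand
```

Applied to all sequence lines of the record, the same function should return a string whose length matches the stated 4484 bp; a mismatch would point to a copy or extraction error.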

 ORFs     
ORF Summary
Gene Name  Associated TE  Coordinates  Class           Sub Class  Orientation
HTH        TnSku1         126-416      Passenger Gene  Antitoxin  +
Gp49       TnSku1         419-712      Passenger Gene  Toxin      +
tnpR       TnSku1         713-1336     Accessory Gene  Resolvase  -
tnpA       TnSku1         1492-4410    Transposase                +
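
The coordinates in the summary can be cross-checked against the Gene Length values given under ORF Details. The sketch below assumes the coordinates are 1-based and inclusive, and that each span ends with a stop codon, so the protein has one residue fewer than the number of codons. Both assumptions are consistent with the numbers in this record but are not stated by it, and the names `ORFS` and `gene_length` are illustrative.

```python
# (gene, start, end) copied from the ORF Summary.
# Assumption: coordinates are 1-based and inclusive.
ORFS = [
    ("HTH", 126, 416),
    ("Gp49", 419, 712),
    ("tnpR", 713, 1336),
    ("tnpA", 1492, 4410),
]

def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive 1-based interval."""
    return end - start + 1

for name, start, end in ORFS:
    n = gene_length(start, end)
    in_frame = n % 3 == 0
    # Assumption: the span includes the stop codon, so residues = codons - 1.
    residues = n // 3 - 1
    print(name, n, in_frame, residues)
```

The printed lengths can be compared with the Gene Length column, and the residue counts with the protein sequences listed for each ORF.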

ORF Details
Gene Name  Protein Name  Associated TE  Gene Length (bp)  Coordinates  Strand
HTH        HTH           TnSku1         291               126-416      +
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  HTH (PDB:5TN0)
Protein Sequence:  
MYNIQQTQKF SQWLIKLKDM RARIAIARRI ERAQCGNLGD AKSVGESVYE MRIDMGPGYR LYYTMRGDEI IVLLVGGDKS TQQRDIEKAI EMAKEI

Gene Name  Protein Name  Associated TE  Gene Length (bp)  Coordinates  Strand
Gp49       Gp49          TnSku1         294               419-712      +
Class:   Passenger Gene
Sub Class:   Toxin
Target:   ribosome-associated mRNA
Sequence Family:  Gp49 (Pfam:PF05973)
Protein Sequence:  
MELTLTPFDM TQYLDSEEAI AEYLTQVLEE GDTDELIRAI GHIAKARGMS QIAQESGLGR ESLYKTFAPG SKPRFDTIAK VVHSLGLHFQ ITPVRAS

Gene Name  Protein Name  Associated TE  Gene Length (bp)  Coordinates  Strand
tnpR       TnpR          TnSku1         624               713-1336     -
Class:   Accessory Gene
Sub Class:   Resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MLIGYARVSK ADGNQVLDLQ IDALQGIGIT YGQIYVDKAS GAKDDRPGLD NCLKALREGD TLVVWKLDRL GRNLKHLVST VDDLSRRNIG FKVLSGQGAD
IDTTTPAGKM VFGIFAALAE FERELIRERT IAGLNAARAR GRKGGRKFQL TKAQIRLAQA AMSNRDTSVS ELCKEIGVTR ATLYRYVGPT GELREFGKKV
LGIEETP

Gene Name  Protein Name  Associated TE  Gene Length (bp)  Coordinates  Strand
tnpA       TnpA          TnSku1         2919              1492-4410    +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MPRKKILSSL QREMLEKLPE QFEELAKHYL LTSDDLEFIS QRRRSSNKLG FALSLCMMRY PGRTLKIDEI PPKALVEFIA EQLGLNYEDF LGYALRAETR
REHQYLLMQK YGYKNFNDRY LKEMVRWLTP IAIENHKGIF LLQTLMHEMR YRKILLPSLL QIESMVSLSL SRAEKIIQQT VTSFLNKKNM QLLDDLLAFD
VSVGQSIFSW LRQSESRPTS ANMLVIMERL QTIKRLEISE TISSRVSQIR MQQYSREGYR ISISHLRQYE PKRRYTLIAI TLLEIQKALI DDAITMHDRI
IGSLMRRSQN RFHQTIQDGD KKAKELIRFF SRTSQLLIES RNNGANPWES IDLHIGWDSF LQSVQEATSL SEPIKQDHLA FIETHYVQMR KYTSRFLEEF
DFKASAGGGD VMDAIRLLQE LNSKNKRKLP EDAPVTFIPS RWKPFIYTEE GMDRRYYELC ALSELKNRLR SGDIWIEGSK QYLDFDSYLM TRHSYSEMKS
QNNLPIAVEI DFSRYLINRQ KQLGESIDEL SDMIKDGTIA DVEFTQNGYS IKPYRTDGAP KEATNIIRYV ANLLPRIKIT DLLIEVDTWT KFSDQFTHLR
SGMTSTDKKA LFAAILSDGI NLGHQKMADA CADISVGRLR SASDWYIRDE NYSSALAQIT NHHHQTSLVK NWGDGTTSSS DGQRFPTGGT GDALGGYNAK
YGTQPGVVFY THISDQYSPF HTQIIHANAR DATYVLDGLL YHESDLKIEE HYTDTAGFTD HVFALCHLLG FKFAPRIRDI ADTKLFALEG VEVPEKFNNI
ITGSLKIKDM QQNWDDVLRL AASIKIGTAS ASLLIRKLAS YPRQNRLALA IRDVGRIERT LFILEWLKNP SLRSRVQIGL NKGEARNSLA RTVFFNRLGE
VRDRTFENQL YRASGLNLIV AAIILWNTVY LEKAIEQAKK EIEIPDEFLK YFSPLGWEHI NLTGDYVWRM EP

 References