Transposon
Name: TnBth4       (Synonyms: Tn7153)
Family: Tn3        Group: Tn3
Evidence of Transposition: yes
 Host     

Host Organism:Bacillus thuringiensis serovar galleriae HD-29 Molecular Source:plasmid pBMB126
Place of Origin:Czechoslovakia Date of Isolation:2014
Other Geographic Information:from Dendrolimus sibericus gives toxicity to Lepidoptera insect larva

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGTCTGTGTAGCAATGGAACCAGATCACGCAATAAG
IRR (Length: 38 bp)GGGGTCTGTGTAGCAATGGAACCAAATCACGCAATAAG

 Sequence     
DNA SequenceLength  4848 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCTGTG TAGCAATGGA ACCAGATCAC GCAATAAGCA TTAGCGGACA TTATCAGCGC AAAAAAAGGA AGTTTCTCTA ATTTCAAGAA CCTTCCTTTT 100
TAAAAATTCA TGTTAGCATT ATTTATAAAT GTCACCACGA TTTCCGATAG CTTGTATGTA TATGACTTTC TCATCATGAT TTATTTCAAA TAAGATTCGA 200
AAGGTTCCAA CCCGCAATCG ATATAAATCT GTGTAACCCT TCATACTTTT AATATCTCCT TCAGGAGGAA TTTCAAGAAG CCCCTTCAAC CCTTCAGCAA 300
TTCTTTTTTG TATGCCTTTT TCTTGTTTTG CAATAAATTT CACCGCGGAC TTATGGTAAA TCAATTTGTA GTCCGAATTC ACGTTTTGCG TCCTCCCCTG 400
ATACATATCC TTCTTCACTA TTTAACTGTT CTAATTCTTG TTTAGACAAG GGTTCATCAT CTGAATCCGC TATATCAATT TTTTCCCAAT CGTTAGGTTT 500
CTTTTTTGAG CGTTGAACAA GGAATTCTAA AAAATCAAAT GCTGCTTTTT CATCTTGGTG ACCAAGACAA TCAATTAATC GATATAACTC TTCTTTACGA 600
ACAGCCATAT TCTACACCTA CCTTCGGTAT TTTTAATGTC CGCTAATTAA TATTAGCGGA CATATGATGC TTTGAAATAA ACAATTGATG TCCGCTAATG 700
TAACTGATAA GATTAGATTA GCATGTCCAC TAATATAAGT CAATATCGGA GGAGGAATAT ATGTCATCCA TCAAGATTAT TTCACTACAA GGAACATCTT 800
TTATCTCAGA TTTTATTTCT AGCTTATCTC TTGAAGGAGA TTTGCATACA AAAACACTTA CAGAATATAA AAGTGATCTT AAAGATTTTG TATACTGGTT 900
TGAACATCTA TGGGGAAATC TTTCGGAAGA AACTTTCTTC CATCCAACTG AAGTTACTGC TCGAACAATT GCAAGGTATC GAGAACATAT GCAAATTACC 1000
AGATCTCTAA AGCCGGCTAC AATCAATAGA AGAATTAACT CTATTAAGCG TTATTTTGAC TGGGCGAAAC AACAAGGAAT CATTCAAACT GATTATTCCA 1100
AGTCAATTAA ATTTGTTCCA GCAGTAAAAA CGAGTCCGAA GCAAATGACG GATAAAGAAG AAGCAGCCTT AATGAATGCT GTTGAGAAAT ATGGCACGCT 1200
ACGTGACAAA GCAATGATTA TTTTTATGCT TCATACTGGT TTTCGTTCAA TGGAGGTTTG CGATGTTCAA ATAGAAGATA TTGTTATGAG AAAAAGGGGA 1300
GGCAATGTCA TTGTTCGATC TGGAAAGCGA AATAAACAGA GGGAAGTTCC TCTGAATAGT ACAGTTCGTT TTGCACTAGA AGAATATATT GGATCGAATA 1400
ATATTACACA TAGCTATTTG TTTCCTTCTT CTAAAACAGG AAAACGGCTA CAAGAAAGAG CTATCCGCCA TATTCTCCAG AAGTACATGC GTCTTGCTAA 1500
TTTAGATGGA TTTAGTGCCC ATGATTTAAG GCATCGTTTT GGTTATGTAA TGGCTGAACG TACACCCTTA CATCGTCTGG CACAAATTAT GGGCCACGAT 1600
AACTTGAATA CCACGATGAT TTATGTGAGA GCTACTCAAG AAGATTTACA GGGAGAAGTG GAGAAGATTG CCTGGAATTA AGGAATAAAT ATCATCATAC 1700
TAATTTTGTC ATTTGATACA AACTAATAAT TGTAACAGGA GGAACAAGGA TTATGCCTGT AGATTTTTTA ACACCTGAAC AAGAGGAGAA ATATGGTTGT 1800
TTTTGTGACA CTCCAACATC AGAGCAATTA GCAAAATATT TTTGGTTAGA TGATAAAGAT AAAGAGCTTA TATGGAATCG TCGTGGAGAG CATAATCAAC 1900
TTGGTTTTGC TGTTCAGCTA GGAACCGTGA GGTTCCTAGG TACATTTTTA TCTGACCCTA CAGATGTACC ACATGCTGTG GTGACATATA TAGCAGATCA 2000
ACTTGGGTTA GACGCCAAAT GCTTTGCTGA TTACCGAAGT AAACGAAATC ATTGGCAACA CATGAATGAA ATACGCTCTA CTTATGAATA CAAAAATTTT 2100
ACAGATCAGC CCGGACATTG GCGTCTGATC AGATGGTTAT ATACACGTGC TTGGCTACAC AATGACCGAC CAAGTATATT GTTTGATCTA GCCACAGCAC 2200
GATGTATAGA ACAAAAAATT TTATTACCTG GTGTATCAGT ATTAACAAAG TTAGTCGCGA AAGTTCGTGA TCGTGCGTCA GAAAATCTAT GGGAAAAACT 2300
TGCGGAACTT CCTTGTACTG AACAGCGCAA ACAATTGGAG AATCTTCTTC AATCAGGTCC TAAAAAAAAG AAAACACATT TAGAGCGTCT GAGTAATCCT 2400
CCGTTTACTA TCAGTATTAC AGGTATTAAA CATTCTCTTC ACCGACTACA AGAGCTTCGG CAATTAGAAG CTGAATATTG GGATACATCT GGAATCCCTA 2500
CCAAAAGACT GCAACAACTT GCTCGGCATG CTGTAGCTGT GAGATCACAA GCCATTGCCA GAATGAATGA TGAGCGTCGC ATAGCTGTGT TAGTAGCATT 2600
TGCTAAAATT TATACACAAA ATGCCCAAGA TGATGTGATT GATATACTGG ATCGATACTT AACAGATTTA TTTGCTAAGA CTTATCGAAA AGAACAAAAA 2700
GAACGTCTTC GTACAATTAA AGATTTAGAT AAAGCGGCAC GTCAACTCCG GGAAGCTTGT ATAACATTAT TAGAGCATAC GGATCCTTCT ATCCATCCAA 2800
AGGTTGCAGT GTTCAAGAAA GTCCCAGAAA AAGATTTGAT ACAGGCTGTT CAAATTGTTG ATTCACTTAC CTGTCCGCCA GATCAAACGC TAGCATATTC 2900
AGAGTTATTG CAATATTACG GTACAATTCG AAAATTCCTT CCGCTACTCA TGGAAGAGAT TGAATTACAA GCAACACCCG CTGGACTTCC TATTTTACAA 3000
GCATGGAATT TTGTAAAAGA ACATGGGGAT TCTAGCAAGA AAAGATGGAG AAATGCTCCT CTTGTTGGTT TAAATACAAA TTGGTCTAAA ATTGTAGTTG 3100
ATAAGAAAAC ACGAACTGTA AATCATCGTG CTTATACATT TTGGATGCTT GAACAGGTAG TAGATGCTCT ACGCCGTCAT GACCTTTATA TCGTTGGAAG 3200
TGTAAAATAT GGAGATCTCC GCGCACAATT GCTACAAGGA GAAGAATGGA AAGCAATCCG TCCTAATGTT CTTCGCTCAC TAGACTGGTC TTTAGATTCT 3300
TATGAATCTT TAGCACCTCT TAAGGAAGAA TTAGACTTGG CTTATCATCA AACCGTTGAA AATTGGGACA ATAATCCTGC AGTTCAAATA GAGACGTTTG 3400
CTGGTAAACA AAGAATTACT CTTACACCTC TACAGAAGCT TCAAGAATCA GAGACACTAG AGATATTAAA AAAACGCATA CAGGATATGT TACCAAACAT 3500
AGATATTCCT CAACTATTAT TAGAAGTAAA TCGTTGGACT GGGTTTATGA ATGATTTCCG ACATATTAGT GAAGCTAAAT CAAGGATTAA TGAGTTACCC 3600
ATAAGTATTT GTGCATTACT TATATCTCAA GCTTGTAATA TAGGATTACG ACCTCTTGTT CAAGATGGTG TTCCTGCATT GGCACGTGAT CGTCTTACAT 3700
GGATTGAACA AAATTATTTT CGTGCAGAAA CTCTTACAGA AGCTAATACA AGACTTGTAG ATTTTCATAG TCAGTTAGAC CTAGCAAATA TGTGGGGAGG 3800
CGGTGAAATC GCCTCAGCAG ACGGGCTACG TTTCTTTACT CCAGTAAAAT CTGTACATTC TGGACCGAAC CCTAAATATT TTGGCACAGG TCGTGGAGTT 3900
ACTTTTTACA ACTTTACTAG TGATCAATTT ACAGGACTTC ACGGCCTTGT GATTCCAGGA ACCATTCATG ATTCTTTATA TTTACTTCAA TGTGTGTTAG 4000
AACAAGATAC GAGTTTACAG CCAAAAGAAA TCATGACAGA TACAGCTGGT TATAGTGATA TTATTTTTGG GCTATTTGGA TTATTAGGTT ATCAATTTAG 4100
TCCACGATTA GCCGATGTTG GAAAATCACG TCTCTGGCGT TTTGATGCCA CATCAGATTA TGGAATTCTA AATCCGTTAT CTAAAGGGCG CATTCGTGAA 4200
GATTTGATAC ATCGTCATTG GGAAGACATG CTTCGAGTTG CCGGCTCTCT CTCATTAAAT AAAGTCAATG CAACTCATCT TATCCAAGCA TTGCAACAGA 4300
ATGGAAAACC AACCATGTTA GGGCGAGCGA TTGGAGAATT TGGACGGATT TTTAAAACAC GTTATTTGCT TTTATACTTA AATGATGAGA ATTATCGTCG 4400
GAAAATTCTA ACGCAATTAA ACAGAGGAGA AGCAAGACAC AGTTTAGCTA GAGCTGTATT TTATGGAAAG CGAGGAGAGC TTCATCAAGC TTATCGAGCG 4500
GGCCAAGAAG ATCAATTGGG TGCACTGGGG TTAGTCGTAA ATGCGATTGT AGTGTGGAAT ACTCGTTATA TGGAATCAGC CTTACAAGTT CTCCGAAATC 4600
GTGGTCATAC TCTGGACGAT AATAATATTG CTAGGCTGTC TCCACTTGGT CATGAACATA TCAATATAGT AGGGCGCTAT TCATTTATAC TTCCTGAAGA 4700
AATAAAAGAT GGCCAATTGC GTAATTTAAC ATATAAAGAA GATCGTTTGA TGGAATAGAA TAGGAATCCT ATATTTCTAC ATGATATAGG ATTCCTATTG 4800
TCCGCTAATG CTTATTGCGT GATTTGGTTC CATTGCTACA CAGACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res_site_d 40-51 12 ATTAGCGGAC AT
res 636-734 99 ATGTCCGCTA ATTAATATTA GCGGACATAT GATGCTTTGA AATAAACAAT TGATGTCCGC
TAATGTAACT GATAAGATTA GATTAGCATG TCCACTAAT
res_site_a 636-647 12 ATGTCCGCTA AT
res_site_b 652-663 12 ATTAGCGGAC AT
res_site_c 688-699 12 ATGTCCGCTA AT
res_site_a 723-734 12 ATGTCCACTA AT

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
parE TnBth4 119-406 Passenger Gene Toxin -
parD TnBth4 351-608 Passenger Gene Antitoxin -
tnpI TnBth4 761-1681 Accessory Gene Resolvase +
tnpA TnBth4 1753-4758 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parE ParE TnBth4 288 119-406 -
Class:   Passenger Gene
Sub Class:   Toxin
Target:   DNA gyrase
Sequence Family:  ParE_toxin (Pfam:PF05016)
Protein Sequence:  
MYQGRTQNVN SDYKLIYHKS AVKFIAKQEK GIQKRIAEGL KGLLEIPPEG DIKSMKGYTD LYRLRVGTFR ILFEINHDEK VIYIQAIGNR GDIYK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parD ParD TnBth4 258 351-608 -
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  parD (PDB:4Q2U)
Comment:   no PBD
Protein Sequence:  
MAVRKEELYR LIDCLGHQDE KAAFDFLEFL VQRSKKKPND WEKIDIADSD DEPLSKQELE QLNSEEGYVS GEDAKREFGL QIDLP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpI TnpI TnBth4 921 761-1681 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Tyrosine
Sequence Family:  Tyrosine Site-Specific Recombinase
Comment:   belongs to the phage integrase family of site-specific recombinases
Protein Sequence:  
MSSIKIISLQ GTSFISDFIS SLSLEGDLHT KTLTEYKSDL KDFVYWFEHL WGNLSEETFF HPTEVTARTI ARYREHMQIT RSLKPATINR RINSIKRYFD
WAKQQGIIQT DYSKSIKFVP AVKTSPKQMT DKEEAALMNA VEKYGTLRDK AMIIFMLHTG FRSMEVCDVQ IEDIVMRKRG GNVIVRSGKR NKQREVPLNS
TVRFALEEYI GSNNITHSYL FPSSKTGKRL QERAIRHILQ KYMRLANLDG FSAHDLRHRF GYVMAERTPL HRLAQIMGHD NLNTTMIYVR ATQEDLQGEV
EKIAWN

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnBth4 3006 1753-4758 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPVDFLTPEQ EEKYGCFCDT PTSEQLAKYF WLDDKDKELI WNRRGEHNQL GFAVQLGTVR FLGTFLSDPT DVPHAVVTYI ADQLGLDAKC FADYRSKRNH
WQHMNEIRST YEYKNFTDQP GHWRLIRWLY TRAWLHNDRP SILFDLATAR CIEQKILLPG VSVLTKLVAK VRDRASENLW EKLAELPCTE QRKQLENLLQ
SGPKKKKTHL ERLSNPPFTI SITGIKHSLH RLQELRQLEA EYWDTSGIPT KRLQQLARHA VAVRSQAIAR MNDERRIAVL VAFAKIYTQN AQDDVIDILD
RYLTDLFAKT YRKEQKERLR TIKDLDKAAR QLREACITLL EHTDPSIHPK VAVFKKVPEK DLIQAVQIVD SLTCPPDQTL AYSELLQYYG TIRKFLPLLM
EEIELQATPA GLPILQAWNF VKEHGDSSKK RWRNAPLVGL NTNWSKIVVD KKTRTVNHRA YTFWMLEQVV DALRRHDLYI VGSVKYGDLR AQLLQGEEWK
AIRPNVLRSL DWSLDSYESL APLKEELDLA YHQTVENWDN NPAVQIETFA GKQRITLTPL QKLQESETLE ILKKRIQDML PNIDIPQLLL EVNRWTGFMN
DFRHISEAKS RINELPISIC ALLISQACNI GLRPLVQDGV PALARDRLTW IEQNYFRAET LTEANTRLVD FHSQLDLANM WGGGEIASAD GLRFFTPVKS
VHSGPNPKYF GTGRGVTFYN FTSDQFTGLH GLVIPGTIHD SLYLLQCVLE QDTSLQPKEI MTDTAGYSDI IFGLFGLLGY QFSPRLADVG KSRLWRFDAT
SDYGILNPLS KGRIREDLIH RHWEDMLRVA GSLSLNKVNA THLIQALQQN GKPTMLGRAI GEFGRIFKTR YLLLYLNDEN YRRKILTQLN RGEARHSLAR
AVFYGKRGEL HQAYRAGQED QLGALGLVVN AIVVWNTRYM ESALQVLRNR GHTLDDNNIA RLSPLGHEHI NIVGRYSFIL PEEIKDGQLR NLTYKEDRLM
E