Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
References | |
Name: TnBth4 (Synonyms: Tn7153) |
|
Family: Tn3 Group: Tn3 |
|
Evidence of Transposition: yes |
|
|
Host |
|
|
Host Organism: | Bacillus thuringiensis serovar galleriae HD-29 | Molecular Source: | plasmid pBMB126 |
Place of Origin: | Czechoslovakia | Date of Isolation: | 2014 |
| | Other Geographic Information: | Isolated from Dendrolimus sibiricus; toxic to lepidopteran insect larvae |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 38 bp) | | GGGGTCTGTGTAGCAATGGAACCAGATCACGCAATAAG |
IRR (Length: 38 bp) | | GGGGTCTGTGTAGCAATGGAACCAAATCACGCAATAAG |
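
The two terminal repeats above can be compared directly. The following is a minimal sketch and not part of the database record; the helper names `reverse_complement` and `mismatches` are illustrative. It reports where IRL and IRR differ and checks that IRR, as listed, is the reverse complement of the last 38 bp of the element sequence (copied here from the final line of the sequence listing).

```python
# Terminal inverted repeats as listed in this record.
IRL = "GGGGTCTGTGTAGCAATGGAACCAGATCACGCAATAAG"
IRR = "GGGGTCTGTGTAGCAATGGAACCAAATCACGCAATAAG"

# Last 38 bp of the element, top strand, copied from the sequence listing.
RIGHT_END = "CTTATTGCGTGATTTGGTTCCATTGCTACACAGACCCC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def mismatches(a: str, b: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, base in a, base in b) for every difference."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    # The repeats differ at a single position (G in IRL, A in IRR).
    print(mismatches(IRL, IRR))                  # [(25, 'G', 'A')]
    # IRR is written in the inward-reading orientation of the right end.
    print(reverse_complement(RIGHT_END) == IRR)  # True
```

The repeats are therefore imperfect: 37 of 38 positions match.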
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCTGTG TAGCAATGGA ACCAGATCAC GCAATAAGCA TTAGCGGACA TTATCAGCGC AAAAAAAGGA AGTTTCTCTA ATTTCAAGAA CCTTCCTTTT 100
TAAAAATTCA TGTTAGCATT ATTTATAAAT GTCACCACGA TTTCCGATAG CTTGTATGTA TATGACTTTC TCATCATGAT TTATTTCAAA TAAGATTCGA 200
AAGGTTCCAA CCCGCAATCG ATATAAATCT GTGTAACCCT TCATACTTTT AATATCTCCT TCAGGAGGAA TTTCAAGAAG CCCCTTCAAC CCTTCAGCAA 300
TTCTTTTTTG TATGCCTTTT TCTTGTTTTG CAATAAATTT CACCGCGGAC TTATGGTAAA TCAATTTGTA GTCCGAATTC ACGTTTTGCG TCCTCCCCTG 400
ATACATATCC TTCTTCACTA TTTAACTGTT CTAATTCTTG TTTAGACAAG GGTTCATCAT CTGAATCCGC TATATCAATT TTTTCCCAAT CGTTAGGTTT 500
CTTTTTTGAG CGTTGAACAA GGAATTCTAA AAAATCAAAT GCTGCTTTTT CATCTTGGTG ACCAAGACAA TCAATTAATC GATATAACTC TTCTTTACGA 600
ACAGCCATAT TCTACACCTA CCTTCGGTAT TTTTAATGTC CGCTAATTAA TATTAGCGGA CATATGATGC TTTGAAATAA ACAATTGATG TCCGCTAATG 700
TAACTGATAA GATTAGATTA GCATGTCCAC TAATATAAGT CAATATCGGA GGAGGAATAT ATGTCATCCA TCAAGATTAT TTCACTACAA GGAACATCTT 800
TTATCTCAGA TTTTATTTCT AGCTTATCTC TTGAAGGAGA TTTGCATACA AAAACACTTA CAGAATATAA AAGTGATCTT AAAGATTTTG TATACTGGTT 900
TGAACATCTA TGGGGAAATC TTTCGGAAGA AACTTTCTTC CATCCAACTG AAGTTACTGC TCGAACAATT GCAAGGTATC GAGAACATAT GCAAATTACC 1000
AGATCTCTAA AGCCGGCTAC AATCAATAGA AGAATTAACT CTATTAAGCG TTATTTTGAC TGGGCGAAAC AACAAGGAAT CATTCAAACT GATTATTCCA 1100
AGTCAATTAA ATTTGTTCCA GCAGTAAAAA CGAGTCCGAA GCAAATGACG GATAAAGAAG AAGCAGCCTT AATGAATGCT GTTGAGAAAT ATGGCACGCT 1200
ACGTGACAAA GCAATGATTA TTTTTATGCT TCATACTGGT TTTCGTTCAA TGGAGGTTTG CGATGTTCAA ATAGAAGATA TTGTTATGAG AAAAAGGGGA 1300
GGCAATGTCA TTGTTCGATC TGGAAAGCGA AATAAACAGA GGGAAGTTCC TCTGAATAGT ACAGTTCGTT TTGCACTAGA AGAATATATT GGATCGAATA 1400
ATATTACACA TAGCTATTTG TTTCCTTCTT CTAAAACAGG AAAACGGCTA CAAGAAAGAG CTATCCGCCA TATTCTCCAG AAGTACATGC GTCTTGCTAA 1500
TTTAGATGGA TTTAGTGCCC ATGATTTAAG GCATCGTTTT GGTTATGTAA TGGCTGAACG TACACCCTTA CATCGTCTGG CACAAATTAT GGGCCACGAT 1600
AACTTGAATA CCACGATGAT TTATGTGAGA GCTACTCAAG AAGATTTACA GGGAGAAGTG GAGAAGATTG CCTGGAATTA AGGAATAAAT ATCATCATAC 1700
TAATTTTGTC ATTTGATACA AACTAATAAT TGTAACAGGA GGAACAAGGA TTATGCCTGT AGATTTTTTA ACACCTGAAC AAGAGGAGAA ATATGGTTGT 1800
TTTTGTGACA CTCCAACATC AGAGCAATTA GCAAAATATT TTTGGTTAGA TGATAAAGAT AAAGAGCTTA TATGGAATCG TCGTGGAGAG CATAATCAAC 1900
TTGGTTTTGC TGTTCAGCTA GGAACCGTGA GGTTCCTAGG TACATTTTTA TCTGACCCTA CAGATGTACC ACATGCTGTG GTGACATATA TAGCAGATCA 2000
ACTTGGGTTA GACGCCAAAT GCTTTGCTGA TTACCGAAGT AAACGAAATC ATTGGCAACA CATGAATGAA ATACGCTCTA CTTATGAATA CAAAAATTTT 2100
ACAGATCAGC CCGGACATTG GCGTCTGATC AGATGGTTAT ATACACGTGC TTGGCTACAC AATGACCGAC CAAGTATATT GTTTGATCTA GCCACAGCAC 2200
GATGTATAGA ACAAAAAATT TTATTACCTG GTGTATCAGT ATTAACAAAG TTAGTCGCGA AAGTTCGTGA TCGTGCGTCA GAAAATCTAT GGGAAAAACT 2300
TGCGGAACTT CCTTGTACTG AACAGCGCAA ACAATTGGAG AATCTTCTTC AATCAGGTCC TAAAAAAAAG AAAACACATT TAGAGCGTCT GAGTAATCCT 2400
CCGTTTACTA TCAGTATTAC AGGTATTAAA CATTCTCTTC ACCGACTACA AGAGCTTCGG CAATTAGAAG CTGAATATTG GGATACATCT GGAATCCCTA 2500
CCAAAAGACT GCAACAACTT GCTCGGCATG CTGTAGCTGT GAGATCACAA GCCATTGCCA GAATGAATGA TGAGCGTCGC ATAGCTGTGT TAGTAGCATT 2600
TGCTAAAATT TATACACAAA ATGCCCAAGA TGATGTGATT GATATACTGG ATCGATACTT AACAGATTTA TTTGCTAAGA CTTATCGAAA AGAACAAAAA 2700
GAACGTCTTC GTACAATTAA AGATTTAGAT AAAGCGGCAC GTCAACTCCG GGAAGCTTGT ATAACATTAT TAGAGCATAC GGATCCTTCT ATCCATCCAA 2800
AGGTTGCAGT GTTCAAGAAA GTCCCAGAAA AAGATTTGAT ACAGGCTGTT CAAATTGTTG ATTCACTTAC CTGTCCGCCA GATCAAACGC TAGCATATTC 2900
AGAGTTATTG CAATATTACG GTACAATTCG AAAATTCCTT CCGCTACTCA TGGAAGAGAT TGAATTACAA GCAACACCCG CTGGACTTCC TATTTTACAA 3000
GCATGGAATT TTGTAAAAGA ACATGGGGAT TCTAGCAAGA AAAGATGGAG AAATGCTCCT CTTGTTGGTT TAAATACAAA TTGGTCTAAA ATTGTAGTTG 3100
ATAAGAAAAC ACGAACTGTA AATCATCGTG CTTATACATT TTGGATGCTT GAACAGGTAG TAGATGCTCT ACGCCGTCAT GACCTTTATA TCGTTGGAAG 3200
TGTAAAATAT GGAGATCTCC GCGCACAATT GCTACAAGGA GAAGAATGGA AAGCAATCCG TCCTAATGTT CTTCGCTCAC TAGACTGGTC TTTAGATTCT 3300
TATGAATCTT TAGCACCTCT TAAGGAAGAA TTAGACTTGG CTTATCATCA AACCGTTGAA AATTGGGACA ATAATCCTGC AGTTCAAATA GAGACGTTTG 3400
CTGGTAAACA AAGAATTACT CTTACACCTC TACAGAAGCT TCAAGAATCA GAGACACTAG AGATATTAAA AAAACGCATA CAGGATATGT TACCAAACAT 3500
AGATATTCCT CAACTATTAT TAGAAGTAAA TCGTTGGACT GGGTTTATGA ATGATTTCCG ACATATTAGT GAAGCTAAAT CAAGGATTAA TGAGTTACCC 3600
ATAAGTATTT GTGCATTACT TATATCTCAA GCTTGTAATA TAGGATTACG ACCTCTTGTT CAAGATGGTG TTCCTGCATT GGCACGTGAT CGTCTTACAT 3700
GGATTGAACA AAATTATTTT CGTGCAGAAA CTCTTACAGA AGCTAATACA AGACTTGTAG ATTTTCATAG TCAGTTAGAC CTAGCAAATA TGTGGGGAGG 3800
CGGTGAAATC GCCTCAGCAG ACGGGCTACG TTTCTTTACT CCAGTAAAAT CTGTACATTC TGGACCGAAC CCTAAATATT TTGGCACAGG TCGTGGAGTT 3900
ACTTTTTACA ACTTTACTAG TGATCAATTT ACAGGACTTC ACGGCCTTGT GATTCCAGGA ACCATTCATG ATTCTTTATA TTTACTTCAA TGTGTGTTAG 4000
AACAAGATAC GAGTTTACAG CCAAAAGAAA TCATGACAGA TACAGCTGGT TATAGTGATA TTATTTTTGG GCTATTTGGA TTATTAGGTT ATCAATTTAG 4100
TCCACGATTA GCCGATGTTG GAAAATCACG TCTCTGGCGT TTTGATGCCA CATCAGATTA TGGAATTCTA AATCCGTTAT CTAAAGGGCG CATTCGTGAA 4200
GATTTGATAC ATCGTCATTG GGAAGACATG CTTCGAGTTG CCGGCTCTCT CTCATTAAAT AAAGTCAATG CAACTCATCT TATCCAAGCA TTGCAACAGA 4300
ATGGAAAACC AACCATGTTA GGGCGAGCGA TTGGAGAATT TGGACGGATT TTTAAAACAC GTTATTTGCT TTTATACTTA AATGATGAGA ATTATCGTCG 4400
GAAAATTCTA ACGCAATTAA ACAGAGGAGA AGCAAGACAC AGTTTAGCTA GAGCTGTATT TTATGGAAAG CGAGGAGAGC TTCATCAAGC TTATCGAGCG 4500
GGCCAAGAAG ATCAATTGGG TGCACTGGGG TTAGTCGTAA ATGCGATTGT AGTGTGGAAT ACTCGTTATA TGGAATCAGC CTTACAAGTT CTCCGAAATC 4600
GTGGTCATAC TCTGGACGAT AATAATATTG CTAGGCTGTC TCCACTTGGT CATGAACATA TCAATATAGT AGGGCGCTAT TCATTTATAC TTCCTGAAGA 4700
AATAAAAGAT GGCCAATTGC GTAATTTAAC ATATAAAGAA GATCGTTTGA TGGAATAGAA TAGGAATCCT ATATTTCTAC ATGATATAGG ATTCCTATTG 4800
TCCGCTAATG CTTATTGCGT GATTTGGTTC CATTGCTACA CAGACCCC
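
The listing above is grouped in blocks of 10 bases with a running position counter at the end of each full line. To use the coordinates given elsewhere in this record, the listing first has to be joined into one contiguous string. The following is a minimal sketch under that assumption about the layout; the function name `parse_sequence_block` is hypothetical.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a ruler-formatted sequence listing into one uppercase string."""
    bases = []
    for line in text.splitlines():
        # Skip ruler lines such as "--------10 --------20".
        if line.lstrip().startswith("-"):
            continue
        # Keep nucleotide letters only; this drops the spaces between
        # blocks, the trailing position counter, and stray "|" separators.
        bases.append(re.sub(r"[^ACGTNacgtn]", "", line))
    return "".join(bases).upper()


def region(seq: str, start: int, end: int) -> str:
    """Extract a region using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


if __name__ == "__main__":
    sample = (
        "--------10 --------20\n"
        "GGGGTCTGTG TAGCAATGGA 20\n"
        "ACCAGATCAC GCAATAAG\n"
    )
    seq = parse_sequence_block(sample)
    print(len(seq))             # 38
    print(region(seq, 1, 10))   # GGGGTCTGTG
```

Applied to the full listing, the result should be 4848 bp long (48 full lines of 100 bp plus a final line of 48 bp).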
Recombination Sites |
|
|
Name | Coordinates | Length (bp) | Sequence
res_site_d | 40-51 | 12 | ATTAGCGGAC AT
res | 636-734 | 99 | ATGTCCGCTA ATTAATATTA GCGGACATAT GATGCTTTGA AATAAACAAT TGATGTCCGC TAATGTAACT GATAAGATTA GATTAGCATG TCCACTAAT
res_site_a | 636-647 | 12 | ATGTCCGCTA AT
res_site_b | 652-663 | 12 | ATTAGCGGAC AT
res_site_c | 688-699 | 12 | ATGTCCGCTA AT
res_site_a | 723-734 | 12 | ATGTCCACTA AT
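
The four 12-bp sub-sites inside the res region are closely related to one another. The following is a minimal sketch, not a description of how the database annotated them: it slices each sub-site out of the 99-bp res sequence using the coordinates listed above and reports whether it matches the first sub-site directly or as a reverse complement, and with how many mismatches. The names `subsite` and `classify` are illustrative.

```python
RES_START = 636  # 1-based coordinate of the first base of the res region
RES = (
    "ATGTCCGCTAATTAATATTAGCGGACATATGATGCTTTGAAATAAACAAT"
    "TGATGTCCGCTAATGTAACTGATAAGATTAGATTAGCATGTCCACTAAT"
)

# Sub-site coordinates (1-based, inclusive) as listed in the record.
SUBSITES = [(636, 647), (652, 663), (688, 699), (723, 734)]

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(_COMPLEMENT)[::-1]


def subsite(start: int, end: int) -> str:
    """Slice a sub-site out of RES using element coordinates."""
    return RES[start - RES_START:end - RES_START + 1]


def hamming(a: str, b: str) -> int:
    return sum(x != y for x, y in zip(a, b))


def classify(site: str, reference: str) -> tuple[str, int]:
    """Return the better-matching orientation and its mismatch count."""
    direct = hamming(site, reference)
    inverted = hamming(reverse_complement(site), reference)
    return ("direct", direct) if direct <= inverted else ("inverted", inverted)


if __name__ == "__main__":
    reference = subsite(*SUBSITES[0])
    for start, end in SUBSITES:
        site = subsite(start, end)
        print(start, end, site, *classify(site, reference))
```

With the sequences as listed, the sub-site at 652-663 is the exact reverse complement of the one at 636-647, the sub-site at 688-699 is identical to it, and the sub-site at 723-734 differs from it at one position.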
|
ORFs |
|
|
Gene Name | Associated TE | Coordinates | Class | Sub Class | Orientation
parE | TnBth4 | 119-406 | Passenger Gene | Toxin | -
parD | TnBth4 | 351-608 | Passenger Gene | Antitoxin | -
tnpI | TnBth4 | 761-1681 | Accessory Gene | Resolvase | +
tnpA | TnBth4 | 1753-4758 | Transposase | | +
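
The coordinates in the ORF table are 1-based and inclusive, so gene length is `end - start + 1`. The following is a minimal sketch that recomputes the gene lengths, the implied protein lengths, and the overlap between adjacent genes. It assumes each coordinate range includes the stop codon; that assumption agrees with the 95- and 85-residue ParE and ParD proteins listed in this record. The function names are illustrative.

```python
# (start, end, strand) for each ORF, copied from the table above.
ORFS = {
    "parE": (119, 406, "-"),
    "parD": (351, 608, "-"),
    "tnpI": (761, 1681, "+"),
    "tnpA": (1753, 4758, "+"),
}


def gene_length(start: int, end: int) -> int:
    """Length in bp for 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Residue count, assuming the range includes one stop codon."""
    length = gene_length(start, end)
    if length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length // 3 - 1


def overlap(a: tuple, b: tuple) -> int:
    """Number of bp shared by two coordinate ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


if __name__ == "__main__":
    for name, (start, end, strand) in ORFS.items():
        print(name, strand, gene_length(start, end), protein_length(start, end))
    # parE and parD are on the same strand and overlap by 56 bp.
    print(overlap(ORFS["parE"], ORFS["parD"]))  # 56
```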
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand
parE | ParE | TnBth4 | 288 | 119-406 | -
Class: | Passenger Gene |
Sub Class: | Toxin |
Target: | DNA gyrase |
Sequence Family: | ParE_toxin (Pfam:PF05016) |
Protein Sequence:
|
MYQGRTQNVN SDYKLIYHKS AVKFIAKQEK GIQKRIAEGL KGLLEIPPEG DIKSMKGYTD LYRLRVGTFR ILFEINHDEK VIYIQAIGNR GDIYK
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand
parD | ParD | TnBth4 | 258 | 351-608 | -
Class: | Passenger Gene |
Sub Class: | Antitoxin |
Sequence Family: | parD (PDB:4Q2U) |
Comment: | no PDB
Protein Sequence:
|
MAVRKEELYR LIDCLGHQDE KAAFDFLEFL VQRSKKKPND WEKIDIADSD DEPLSKQELE QLNSEEGYVS GEDAKREFGL QIDLP
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand
tnpI | TnpI | TnBth4 | 921 | 761-1681 | +
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transposase Chemistry: | Tyrosine
Sequence Family: | Tyrosine Site-Specific Recombinase |
Comment: | belongs to the phage integrase family of site-specific recombinases |
Protein Sequence:
|
MSSIKIISLQ GTSFISDFIS SLSLEGDLHT KTLTEYKSDL KDFVYWFEHL WGNLSEETFF HPTEVTARTI ARYREHMQIT RSLKPATINR RINSIKRYFD WAKQQGIIQT DYSKSIKFVP AVKTSPKQMT DKEEAALMNA VEKYGTLRDK AMIIFMLHTG FRSMEVCDVQ IEDIVMRKRG GNVIVRSGKR NKQREVPLNS TVRFALEEYI GSNNITHSYL FPSSKTGKRL QERAIRHILQ KYMRLANLDG FSAHDLRHRF GYVMAERTPL HRLAQIMGHD NLNTTMIYVR ATQEDLQGEV EKIAWN
|
Gene Name | Protein Name | Associated TE | Gene Length | Coordinates | Strand
tnpA | TnpA | TnBth4 | 3006 | 1753-4758 | +
Class: | Transposase |
Transposase Chemistry: | DDE
Protein Sequence:
|
MPVDFLTPEQ EEKYGCFCDT PTSEQLAKYF WLDDKDKELI WNRRGEHNQL GFAVQLGTVR FLGTFLSDPT DVPHAVVTYI ADQLGLDAKC FADYRSKRNH WQHMNEIRST YEYKNFTDQP GHWRLIRWLY TRAWLHNDRP SILFDLATAR CIEQKILLPG VSVLTKLVAK VRDRASENLW EKLAELPCTE QRKQLENLLQ SGPKKKKTHL ERLSNPPFTI SITGIKHSLH RLQELRQLEA EYWDTSGIPT KRLQQLARHA VAVRSQAIAR MNDERRIAVL VAFAKIYTQN AQDDVIDILD RYLTDLFAKT YRKEQKERLR TIKDLDKAAR QLREACITLL EHTDPSIHPK VAVFKKVPEK DLIQAVQIVD SLTCPPDQTL AYSELLQYYG TIRKFLPLLM EEIELQATPA GLPILQAWNF VKEHGDSSKK RWRNAPLVGL NTNWSKIVVD KKTRTVNHRA YTFWMLEQVV DALRRHDLYI VGSVKYGDLR AQLLQGEEWK AIRPNVLRSL DWSLDSYESL APLKEELDLA YHQTVENWDN NPAVQIETFA GKQRITLTPL QKLQESETLE ILKKRIQDML PNIDIPQLLL EVNRWTGFMN DFRHISEAKS RINELPISIC ALLISQACNI GLRPLVQDGV PALARDRLTW IEQNYFRAET LTEANTRLVD FHSQLDLANM WGGGEIASAD GLRFFTPVKS VHSGPNPKYF GTGRGVTFYN FTSDQFTGLH GLVIPGTIHD SLYLLQCVLE QDTSLQPKEI MTDTAGYSDI IFGLFGLLGY QFSPRLADVG KSRLWRFDAT SDYGILNPLS KGRIREDLIH RHWEDMLRVA GSLSLNKVNA THLIQALQQN GKPTMLGRAI GEFGRIFKTR YLLLYLNDEN YRRKILTQLN RGEARHSLAR AVFYGKRGEL HQAYRAGQED QLGALGLVVN AIVVWNTRYM ESALQVLRNR GHTLDDNNIA RLSPLGHEHI NIVGRYSFIL PEEIKDGQLR NLTYKEDRLM E