|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
|
|
|
|
|
|
|
|
|
Name: Tn5401 |
|
Family: Tn3 Group: Tn3 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Bacillus thuringiensis serovar morrisoni EG2158 | | |
Place of Origin: | Kansas, U.S.A. | Date of Isolation: | 1994 |
| | Other Geographic Information: | soybean grain dust |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 53 bp) | | GGGGTATGTGTAGCAATGGAACAGAATCACGCAACAAGCATTAGCGGACATTA |
IRR (Length: 53 bp) | | GGGGTATGTGTAGCAATGGAACAAAATCACGCAACAAGCATTAGCGGACATTA |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTATGTG TAGCAATGGA ACAGAATCAC GCAACAAGCA TTAGCGGACA TTATTCGCAC ACAAAAAAGG AAGGTTCTTC GATTCAGAAG ACCTTTCTTT 100
TAAAAATGCA TGTTTGCCTT ATTTATAGAT GTCACCACGA TTTCCAATTG CTTGTATGTA TATGACTTTC TCATCATGAT TTATTTCAAA TAAAATTCGA 200
AAGGTTCCAA TCCGTAATCG ATATAGTTCT GTGTAACCTT TCATACTTTT AATATCTCCT TCAGGAGGAA TCTTAAGAAG TCCCTTCAAT CCTTCTGCAA 300
TTCTTTTTTG AATCCCTTTT TCTTGCTTTG CAATAAATTT CACCGCGGAC TTATGGTAAA TCAATTTGTA GTCCGAATTC ACGTTTTGCG TCCTCCCCTG 400
ATACATATCC TTCTTCACTG TTTAACTGTT CTAACTCTTG TGTAGACAGC GGTTCATGAT CAGGATCTGC CATATCAATT TTTTCCCATT CTTTAGGTTT 500
TCTTCTTGAC CGTTGAACAA GAAATTCTAA AAAGTCAAAT GCTGCTTTTT CATCTTGTTG ATCCAGGTGA TCAATTAACC GATACAATTC ATCTTTACGA 600
ATAGCCATGT GTTACACCTA CTTTCGAGAT AGTTTTAAAT GTCCACTAAT TAATATTAGT GGACATGAAG TGTGGGAAAA TAAATGTTTG ATGTCCGCTA 700
ACATAATTGA TAAGATTAAA ATATCATGTC CGCTAATGTA AGTCAATAAA AGAGGAGGTA TTTATGCATT CCACTAAAAC AATTTCTATA CAAGCAACAT 800
CTTTGATTTC CGATTTTATT TCTAGCTTAT CTCAAGAAGG AGATTTGCAT ACAAAAACAC TAAAAGAATA TACGAGTGAT TTAAAAGATT TTGTATTTTG 900
GTTTGAAAAT GTGTGGGGAA AACATGCTGA GGATACTCTT TTTCATCCAA TAGAAGTTAC CGCTCGCACT ATTGCTCGAT ATCGAGGGCA TATGCAAGTT 1000
ACAAGATTAC TAAAACCTTC TACGATTAAC CGGCGCATTA ATTCAATCAA ACGTTATTTT GACTGGGCTA AGCAAAAAGG ACTGGTACAA ACAAATTATT 1100
CAAAATCAAT TAAGTTTGTA CCAACAGAAA AAACGAGTCC CAAACGCATG TCAGATAAAG AAGAAGCCGC TTTAATGCAT GCCGTTGAAA AATACGGCAC 1200
ACTACGTGAC AGGGCAATGA TTATTTTTAT GCTTCATACT GGCCTTCGTT CAATGGAAGT GTGTGATGTT CAAATAGAGG ATGTTATCAT GAGAAAAAGA 1300
GGCGGCTATG TTGTTGTTCG ATCTGGAAAA CGAAATAAAC AGAGGGAAGT GCCTTTGAAT AGTACAGCTC GTTGTGCACT AGAAGAACAT ATCAGATTAA 1400
GTGAGATTTC ACAGAGTTAT TTGTTTCCTT CTTCTAAAAC AGGAAAACGC CTACAAGAAA GAGCGATCCG CCATATTCTT CAGAAGTATA TTAGACTTGC 1500
AAAGTTAGAA GGATTTAGTG CCCATGATTT AAGGCATCGC TTTGGTTATG TGATGGCTGA ACGCACACCA TTACATCGTC TTGCACAAAT TATGGGACAC 1600
GATAACTTGA ATACCACGAT GATTTATGTA AGAGCTACAC AAGAAGATTT ACAGGGAGAA GTAGAAAAGA TTGCCTGGAA CTAAAGAATG CACATTATCC 1700
TACTCATTTG GTCATGTGAT ACAAAATAAG AATTGTAACA GGAGGAACAA GGGTTATGCC TGTAGATTTT TTAACACCTG AACAAGAAGA AAAATATGGT 1800
TGTTTTTGTG ACACTCCAAC ATCAGAGCAG TTAGCAAAAT ATTTTTGGTT AGATGATACA GACAAAGAAC TGATATGGAA TCGTCGTGGA GAGCATAATC 1900
AACTTGGTTT CGCTGTTCAA TTAGGAACCG TTAGGTTCTT AGGAACATTT TTATCTGATC CTACAAATGT ACCACAATCG GTTATTACAT ATATGGCAAA 2000
TCAACTTCAT CTAGATGCTC AAAGCTTTTC TCGTTATCGA AATAAACGAA GTCAGTGGGA TCAAATGCAA GAGATACGTT CTGTTTATGG ATATAAAAAC 2100
TTTACAGATA AATCAACACA TTGGCGATTC ATCAGATGGC TATATGCACG TGCTTGGCTA TATAATGAAC GGCCAAGTGT CTTATTTGAT TTAGCAACAG 2200
CACGATGTAT CGAACAAAAA ATTTTACTAC CTGGTGTATC TGTATTAACA AGGCTAGTAT CAACGGTTCG TGATCGTTCA GCAGAAAATA TATGGAAAAA 2300
GCTCTCTAGT CTTCCGGATA ATGTTCAGAA AAAACAATTA GAAAACCTTC TTCAGATAGA TCAAAAAACA AAGAAAACGT ATTTAGAGCG TCTAAGTAAT 2400
CCCCCTGTTC CGATTAGTGT TACGGGCATT AAGAATACGC TGATTCGTTT ACAAGAGCTT CGTCAATTGA ACACTGAAAA TTGGGATATG TCTAGAATTC 2500
CTTCGAAAAG ATTACAACAA TTCGCGCGTC ACACAGTCGC TGTTAGATCA CAAGCAATTG CTAGAATGCC CGATCAACGA CGTATGGCTA TGTTAGTTGC 2600
ATTTGCTAAA ATGTATACAC AAAGTGCTCA GGATGATGTC ATTGATATTT TTGATAGATA TTTAACAGAT TTATTTGCTA AGACATATCG AAAGGAACAA 2700
AAAGAACGTC TTCGTACAAT TAAGGATTTA GATAAGGCAG CGCGCCAATT ACGGGAAGCT TGTGTAATAT TATTAGAACA TACGGATCCT TCTGTCCATC 2800
CAAAAACGGC AGTGTTTGAA AAAATTTCAG AAAAGGATTT AATACAAGCT GTCCAAATTG TTGATTCACT CACCTATTCA CCAAATCAAA CACTAGCCTA 2900
TTCAGGATTG TTACAACATT ATGGCATAAT CCGAAAATTT CTTCCTTTAC TCATGGAAGA AATTGAATTA CAAGCAACGC CTGCTGGATT ACCCATCTTG 3000
CAAGCATGGA ATTTTGTAAA AGAGCATGGG AAATCCAATA AGAAAAGATG GAAAAATGCT CCTCTTGCCG GTTTGAATGC AAATTGGTCT AAGGTTGTAA 3100
TTGATAAAGA TTCCGGAACT GTAAATCATC GAGCATATAC GTTTTGGATG CTCGAACAAG TATTAGAAGC TTTGCACCGA CATGATCTAT ATATAGTAGG 3200
AAGTGAAAAA TATGGGGACC TTCGCGCACA ATTATTACAA GACGAAGAAT GGAAAAGTAT TCGTCCTAGT ATTCTTCGCT CATTAGACTG GTCAATAGAT 3300
TCTTATGAAT CATTGACACC GTTAAAAGAA GAGTTAGACA AAACTTATCA TCAAGTCATT GAGAATTGGG AGAATAATCC TGCGGTGCAA ATAGACACAT 3400
TTGCAGGTAA AGAGAGAATT GTTTTGACAC CTTTAGACAA ACAACCAGAA CCTGAATCAC TACAAAAACT AAAACAACAA ATACATACGA TGTTGCCAAA 3500
TATAGATATT CCTCAATTAT TACTCGAAGT AAATCGTTGG ACGGGATTTA TGGATGGTTT TCGACATATT AGTGAGGCTA AATCTAGAAT TAACGAGTTA 3600
CCTATAAGTA TCTGTGCATT GCTTATATCT CAAGCATGCA ATATTGGGTT AAGACCTTTA GTTCAAGATG GGGTTCCTTC ATTAGAACGT GATCGTCTTA 3700
CATGGATTGA ACAAAATTAT TTTCGTGCAG AAACACTTTC AGAATCAAAC GCGAAACTTG TAGATTTTCA TAGCCAATTA CAGCTGGCTA AAATGTGGGG 3800
TGGTGGAGAA ATTGCTTCAG CTGATGGATT ACGTTTCATC ACACCAGTAA AATCCGTACA CACTGGTCCA AATCCTAAAT ATTTCGGTTC TGGTCGTGGT 3900
GTTACGTATT ACAACTATAC GAGCGATCAA TTTACCGGAC TCCACGGTTT GGTGATTCCA GGCACAATTC GTGATTCATT ATACTTACTT CAATGTGTGT 4000
TAGAACAAAA TACGAACTTA CAGCCAAAAG AAATTATGAC AGATACAGCT GGGTATAGTG ATATTATTTT TGGGCTCTTT GGATTATTAG GATATCAATT 4100
TAGTCCTCGT TTAGCTGATA TCAGTGAATC ACGTCTTTGG CGTTTTGATG CGAACTCAGA TTATAGCATG TTAAATAATT TGTCTAAAAG TCGCATTCGT 4200
GAAGAACTCA TACATCGTCA TTGGGAAGAC ATGCTTCGTG TTGCGGGATC TTTGAAACTA AATAAAATAA ATGCAACACA TCTTATCCAA GCACTTCAGT 4300
ATAATGGGAA ACCAACTATG TTAGGGCGAG CAATTGGAGA ATTGGGGAGA CTCTTTAAAA CACGTTATTT ACTCTTATAT TTACATGATG AAAATTATCG 4400
TCGTAAAATT TTAAATCAAC TCAATAGAGG GGAAGCAAGG CATAGTTTAG CGAGGGCTGT ATTTTACGGC AAACGTGGAG AACTTCATCA ATCCTATCGA 4500
GAAGGACAAG AAGAGCAATT AGGTGCATTA GGTTTAGTAG TAAATGCAAT TATTGTATGG AATACACGAT ATATAGAATC TGCGTTACAA GTACTCCGAA 4600
ATCGCGGTCA TACAATTGAT AATGATGATA TATCTAGACT TTCACCATTA GGCCATAAAC ACATTAACAT AGTAGGTCGG TATTCATTTG TTCTCCCAGA 4700
AGAAGTAAAA GATGGGCAAT TACGTACACT AACATATGAA GAAACAAACA AAAAGGAACC TGATTCTTTA TAAGAATAGG TTCCTAATGT CCGCTAATGC 4800
TTGTTGCGTG ATTTTGTTCC ATTGCTACAC ATACCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res_site_d |
40-51 |
12 |
ATTAGCGGAC AT |
res_site_a |
639-650 |
12 |
ATGTCCACTA AT |
res_site_b |
655-666 |
12 |
ATTAGTGGAC AT |
res_site_c |
691-702 |
12 |
ATGTCCGCTA AC |
res_site_d |
726-737 |
12 |
ATGTCCGCTA AT |
res_site_d |
4787-4798 |
12 |
ATGTCCGCTA AT |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
parE |
Tn5401 |
119-343 |
Passenger Gene |
Toxin |
- |
parD-noDBD |
Tn5401 |
351-608 |
Passenger Gene |
Antitoxin |
- |
tnpI |
Tn5401 |
764-1684 |
Accessory Gene |
Resolvase |
+ |
tnpA |
Tn5401 |
1756-4773 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
parE |
ParE |
Tn5401 |
225 |
119-343 |
- |
Class: | Passenger Gene |
Sub Class: | Toxin |
Function: | unknown |
Target: | DNA gyrase |
Sequence Family: | ParE_toxin (Pfam:PF05016) |
Protein Sequence:
|
MKFIAKQEKG IQKRIAEGLK GLLKIPPEGD IKSMKGYTEL YRLRIGTFRI LFEINHDEKV IYIQAIGNRG DIYK
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
parD-noDBD |
ParD-noDBD |
Tn5401 |
258 |
351-608 |
- |
Class: | Passenger Gene |
Sub Class: | Antitoxin |
Function: | unknown |
Sequence Family: | parD (PDB:4Q2U) |
Comment: | no PBD |
Protein Sequence:
|
MAIRKDELYR LIDHLDQQDE KAAFDFLEFL VQRSRRKPKE WEKIDMADPD HEPLSTQELE QLNSEEGYVS GEDAKREFGL QIDLP
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpI |
TnpI |
Tn5401 |
921 |
764-1684 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Function: | cointegrate resolution, transcriptional regulation |
Transpoase Chemistry: | Tyrosine |
Sequence Family: | Tyrosine Site-Specific Recombinase |
Comment: | belongs to the phage integrase family of site-specific recombinases |
Protein Sequence:
|
MHSTKTISIQ ATSLISDFIS SLSQEGDLHT KTLKEYTSDL KDFVFWFENV WGKHAEDTLF HPIEVTARTI ARYRGHMQVT RLLKPSTINR RINSIKRYFD WAKQKGLVQT NYSKSIKFVP TEKTSPKRMS DKEEAALMHA VEKYGTLRDR AMIIFMLHTG LRSMEVCDVQ IEDVIMRKRG GYVVVRSGKR NKQREVPLNS TARCALEEHI RLSEISQSYL FPSSKTGKRL QERAIRHILQ KYIRLAKLEG FSAHDLRHRF GYVMAERTPL HRLAQIMGHD NLNTTMIYVR ATQEDLQGEV EKIAWN
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5401 |
3018 |
1756-4773 |
+ |
Class: | Transposase |
Function: | cointegrate formation during transposition |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPVDFLTPEQ EEKYGCFCDT PTSEQLAKYF WLDDTDKELI WNRRGEHNQL GFAVQLGTVR FLGTFLSDPT NVPQSVITYM ANQLHLDAQS FSRYRNKRSQ WDQMQEIRSV YGYKNFTDKS THWRFIRWLY ARAWLYNERP SVLFDLATAR CIEQKILLPG VSVLTRLVST VRDRSAENIW KKLSSLPDNV QKKQLENLLQ IDQKTKKTYL ERLSNPPVPI SVTGIKNTLI RLQELRQLNT ENWDMSRIPS KRLQQFARHT VAVRSQAIAR MPDQRRMAML VAFAKMYTQS AQDDVIDIFD RYLTDLFAKT YRKEQKERLR TIKDLDKAAR QLREACVILL EHTDPSVHPK TAVFEKISEK DLIQAVQIVD SLTYSPNQTL AYSGLLQHYG IIRKFLPLLM EEIELQATPA GLPILQAWNF VKEHGKSNKK RWKNAPLAGL NANWSKVVID KDSGTVNHRA YTFWMLEQVL EALHRHDLYI VGSEKYGDLR AQLLQDEEWK SIRPSILRSL DWSIDSYESL TPLKEELDKT YHQVIENWEN NPAVQIDTFA GKERIVLTPL DKQPEPESLQ KLKQQIHTML PNIDIPQLLL EVNRWTGFMD GFRHISEAKS RINELPISIC ALLISQACNI GLRPLVQDGV PSLERDRLTW IEQNYFRAET LSESNAKLVD FHSQLQLAKM WGGGEIASAD GLRFITPVKS VHTGPNPKYF GSGRGVTYYN YTSDQFTGLH GLVIPGTIRD SLYLLQCVLE QNTNLQPKEI MTDTAGYSDI IFGLFGLLGY QFSPRLADIS ESRLWRFDAN SDYSMLNNLS KSRIREELIH RHWEDMLRVA GSLKLNKINA THLIQALQYN GKPTMLGRAI GELGRLFKTR YLLLYLHDEN YRRKILNQLN RGEARHSLAR AVFYGKRGEL HQSYREGQEE QLGALGLVVN AIIVWNTRYI ESALQVLRNR GHTIDNDDIS RLSPLGHKHI NIVGRYSFVL PEEVKDGQLR TLTYEETNKK EPDSL
|
|
References |
|
|
Baum JA. Tn5401, a new class II transposable element from Bacillus thuringiensis. J Bacteriol. 1994 May;176(10):2835-45. doi: 10.1128/jb.176.10.2835-2845.1994. PubMed ID: 7514590
| |
| | |
|
|