|
|
|
|
|
|
|
|
|
|
|
|
Recombination Sites | |
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
|
|
|
|
|
|
|
|
|
Name: Tn1071 |
|
Family: Tn3 Group: Tn1071 |
|
Evidence of Transposition: Yes |
|
|
Host |
|
|
Host Organism: | Alcaligenes sp. BR60 | | |
Place of Origin: | Canada | Date of Isolation: | 1991 |
| | Other Geographic Information: | Hyde Park industrial landfill site, Niagara watershed |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 106 bp) | | GGGGTCTCCTCGTTTTCAGTGCAATAAGTGACGGTACGCAAAGCTAGCACTGGCGCGGGGGTGGTCTGGGTAGACCGTTGATTTCATTGACTTTCCTGTTCGCTTT |
IRR (Length: 110 bp) | | GGGGTCTCCTCGTTTTCAGTGCAATAAGTGACGGTACGAAAAGCTAGCACTGGCGCGGAGGTGGTGTTGGTAGATCGTTGATTTCATTGACTTTCCTGTTCACTTTCAAA |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCTCCT CGTTTTCAGT GCAATAAGTG ACGGTACGCA AAGCTAGCAC TGGCGCGGGG GTGGTCTGGG TAGACCGTTG ATTTCATTGA CTTTCCTGTT 100
CGCTTTGTAA ACGGGTATGG TGGCCTCCCA CTTTTGAGGT TCACGATGCA GGGTTGGCAC ACAACGTTTT TGGGGATGCG TGGGCTCCCC CGCGATATCA 200
GCGACTTCGA GATGAAGGCA TTTTTCACCT TCGATGGTGC CGAGCGCGAC GCAATCAATG CACGCCGAGG TGATTCCCAC AAGCTTGGTC TGGCGCTCCA 300
TATTGGTTTC CTGCGCATGA GTGGGCGTTT GCTCGGTGCC TTTCGGGTAA TTCCAGTAGC CTTGTGGCGC CACCTTGGCA ACGAGCTTGG CATTGCAGCA 400
CCAGAAGTCG CCTCGCTGAG AGCCATGTAT GAACGCGGGC GCACGCTATT CGATCACCAA CAAGTAGCCT GCACGGTCCT TGGATTCCAG TGGATGAGCG 500
AGCACCAGCG CCGCTCACTG GTACGTGAAC TGCGCGACGA AGTGGCCGGC TGCGACCGCG ATCAGCTACT CGTGCGGGCG CGTCAATGGC TGTACAAGAA 600
CAAGCTGGTG ATCGTGCACG AGCGGGCAAT TCGGACACTG ATTGCGGCGG CACTTGCCCA GCTTGAAGTT GAAACAGGCA CCGCCATCGC CGCCAGCGTT 700
GATCCAGCAA CACTTGATCG CTGGCGAGCC TCAGTTTCAG AGCTGCGCCC AGATGGACAA ACCCAGCAGA GTTGGCTATG GGCTGCACCG GCGAAACACT 800
CAACCCGCCA AATCAGCGAG GTACTGGAGC GCATCGACCT GCTTTACACG CTGGACGTTC ATAAGCACCT GGCAGACATC CCCGATCTCA TCTTGCGCCG 900
CTACGCGCGC CGACTTGTCT CCAGGCCGCC CTCAGCCGGA GCCAAGATCA AAGAGCCAGC GCGCACCGTG GAGGTCGCAT GCTTTCTTCG GTATTGCCTG 1000
TTCACCACCA CAGACCAGTT GATCCTTATG GTGCAGCGCC GGATCGCCGA TCTGTGGCGT CAGGCTGCCG CCGATGTCCC CGCTACCGTC AATTGGGCCG 1100
CAATGTACAA AACGCTGCTC GGCGAACTTG TTGCCTTGAG CGCGCAAGGT GCGGTGCCAG ATGCTGAGTT GCGTGCCCGT CTTGAAGCCT TGATCACCGA 1200
AACCCAGAAA CGCAAACCAC CGAGCAGGGC CTCCCTGGTC CGCGAGGGAT TGATTGATGG AATTCGCCCC GTGCGGTCGT TGCTCGTCGC CATTGCAAAG 1300
CTGCCCTGGC AGGCCACCGG CGAGCATCCT GCCATCGAGT ACCTTGCCAA GCTGCAAGCT TTATATCTCA AAGGATCCAG AAAGCTGCCA GTTGAAGTGG 1400
TGGCACCAAG TCTGGGAATG ATCTGGCAGG TTTCGATCTC CAGCCCAGAC CGGGAACGGG CGTTTCAGGC GTTGGAGGTG GCCACCCTGT TTGCCCTGCG 1500
CCGCGCGGTG CGCAATGGCT CGGTCTGGAT TGAGCACAGC CTGAGCTTTC GGGGTCGTGC GCGCTTGTTC TTCACGGACG AGCGTTGGCA GGCAGAGTCC 1600
AAGAAACACT ATGCCCGTCT ATCGTTACCC AGCAAGGCTG CCACTTTCTT GAAGCCTTTG CTGGCCAGAG TAACTGCCGG TGTCGATGCG GTGGCCGCTG 1700
CAGCCCGCAG TGGCGTACTG CGCGTGGATG ATGAACTCCA TTTGTCGCCA TTGCCCGCAG AGGACGAAGA CCCAGAAGTG ACCAAGCTGC GCGCGGCTTT 1800
GGATCACCGC ATCGGTGAGG TTCAATTGCC GGAAGTGATT CTGGCCGTTG ACGCCCAGGT GCGCTTTAGC TGGATCATGC TCGGACGTGA GCCGCGCTCT 1900
ACCGACGAGC TGCTGATGGT CTATGCCGGC ATCATGGCCC ACGGCACCAG TCTGACTGCG GTCGAATGCG CGCGCATGAT TCCGCAATTG TCTGCCACCA 2000
GCATTCGCCA GGCCATGCGC TGGGCGCGGG ACGAACGGCG TCTGAGCCAG GCCTGCCAGG CTGTGCTGGA ATTCATGCAG CGACACCCGA TTGCCGCCAC 2100
CTGGGGGCGG TCCGATTTGG CATCTTCTGA CATGATGAGC ATGGAGACCA CCAAACGGGT GTGGCAAGCC CGGCTTGATC CTCGGCGCAA CACACCTTCC 2200
ATTGGAATCT ACTCCCATGT AAAAGACCGG TGGGGCATCT TCCATGCGCA GCCCTTTGTG CTCAATGAGC GCCAGGCGGG CGTGGCCATT GAAGGTGTCA 2300
TCCGCCAAGA AAAGCTGGAG ACCAGCCAGC TTGCTGTGGA TACCCATGGC TACACCGACT TTGCCATGTC ACATGCCCGT TTGCTTGGTT TTGATCTTTG 2400
CCCGCGGTTG AAGGAACTCA AACAGCGCCA CCTCTTTGTG CCACGCGGCA CCAAAGTGCC CGCAGAAATC GCTGCGGTGT GCGAAGCCAA TGTCGACGTC 2500
GCTTTGATCG AAAAGCATTG GGATAGTCTG GTGCACCTGG CAGCCTCGGT CATGAGCGGA CATGCCAGTG CGGTGGCAGC TCTTGCGCGG TTCGGTTCTG 2600
CCGCCCAGGG CGATCCAATC TATGAGGCTG GCGTGCAATT GGGGCGGTTG CTGCGTACGG CGTTTTTGGC TGACTACTTT GTCAAGGACG CTTTCAGGAA 2700
CGAGTTGCGC CGGGTGCTCA ATCGGGGCGA GGCTGTTAAC GCCCTCAAGC GCGCCATTTA TACCGGCCGG ATCAGCCCGG CGCAGGCCAA ACGTGTCGAT 2800
GAAATGCAGG CTGTGGCCGA TGCGTTGAGC CTGATGGCCA ACATCGTGAT GGCGTGGAAT ACCTCACAGA TGCAGGCGGT CCTGGATCGC TGGTCGAACC 2900
GCCGCCAGGT CATTCCACCG GAACTGATCG GGAAGATTGC GCCCACCAGG CTGGAGAGCA TCAACTTGCG GGGTGTGTTT CGCTTCCCGG TTGACCGCTA 3000
TGCTGACCAA ATCCTGCCTT CGCGGCCAAA TGCATCGATA ACTGGCACCA ATGGATGAAA CCGACCACGG TTTGACGCCA CGAATCGCAG ATTTGAAAGT 3100
GAACAGGAAA GTCAATGAAA TCAACGATCT ACCAACACCA CCTCCGCGCC AGTGCTAGCT TTTCGTACCG TCACTTATTG CACTGAAAAC GAGGAGACCC 3200
C
|
|
|
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tnpA |
Tn1071 |
146-3058 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn1071 |
2913 |
146-3058 |
+ |
Class: | Transposase |
Function: | transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MQGWHTTFLG MRGLPRDISD FEMKAFFTFD GAERDAINAR RGDSHKLGLA LHIGFLRMSG RLLGAFRVIP VALWRHLGNE LGIAAPEVAS LRAMYERGRT LFDHQQVACT VLGFQWMSEH QRRSLVRELR DEVAGCDRDQ LLVRARQWLY KNKLVIVHER AIRTLIAAAL AQLEVETGTA IAASVDPATL DRWRASVSEL RPDGQTQQSW LWAAPAKHST RQISEVLERI DLLYTLDVHK HLADIPDLIL RRYARRLVSR PPSAGAKIKE PARTVEVACF LRYCLFTTTD QLILMVQRRI ADLWRQAAAD VPATVNWAAM YKTLLGELVA LSAQGAVPDA ELRARLEALI TETQKRKPPS RASLVREGLI DGIRPVRSLL VAIAKLPWQA TGEHPAIEYL AKLQALYLKG SRKLPVEVVA PSLGMIWQVS ISSPDRERAF QALEVATLFA LRRAVRNGSV WIEHSLSFRG RARLFFTDER WQAESKKHYA RLSLPSKAAT FLKPLLARVT AGVDAVAAAA RSGVLRVDDE LHLSPLPAED EDPEVTKLRA ALDHRIGEVQ LPEVILAVDA QVRFSWIMLG REPRSTDELL MVYAGIMAHG TSLTAVECAR MIPQLSATSI RQAMRWARDE RRLSQACQAV LEFMQRHPIA ATWGRSDLAS SDMMSMETTK RVWQARLDPR RNTPSIGIYS HVKDRWGIFH AQPFVLNERQ AGVAIEGVIR QEKLETSQLA VDTHGYTDFA MSHARLLGFD LCPRLKELKQ RHLFVPRGTK VPAEIAAVCE ANVDVALIEK HWDSLVHLAA SVMSGHASAV AALARFGSAA QGDPIYEAGV QLGRLLRTAF LADYFVKDAF RNELRRVLNR GEAVNALKRA IYTGRISPAQ AKRVDEMQAV ADALSLMANI VMAWNTSQMQ AVLDRWSNRR QVIPPELIGK IAPTRLESIN LRGVFRFPVD RYADQILPSR PNASITGTNG
|
|
References |
|
|
1. | Nakatsu C, Ng J, Singh R, Straus N, Wyndham C. Chlorobenzoate catabolic transposon Tn5271 is a composite class I element with flanking class II insertion sequences. Proc Natl Acad Sci U S A. 1991 Oct 1;88(19):8312-6. doi: 10.1073/pnas.88.19.8312. PubMed ID: 1656436
| | 2. | Nakatsu CH, Straus NA, Wyndham RC. The nucleotide sequence of the Tn5271 3-chlorobenzoate 3,4-dioxygenase genes (cbaAB) unites the class IA oxygenases in a single lineage. Microbiology (Reading). 1995 Feb;141 ( Pt 2):485-95. doi: 10.1099/13500872-141-2-485. PubMed ID: 7704279
| |
| | |
|
|