|
|
|
|
|
|
|
|
|
|
|
|
Recombination Sites | |
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
|
|
|
|
|
|
|
|
|
Name: TnMex22 (Synonyms: Tn7165) |
|
Family: Tn3 Group: Tn163 |
|
Evidence of Transposition: yes |
|
|
Host |
|
|
Host Organism: | Methylobacterium extorquens AM1 | Molecular Source: | megaplasmid |
Place of Origin: | Oxford, England | Date of Isolation: | 2009 |
| | Other Geographic Information: | 1960 airborne contaminant growing on methylamine |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 36 bp) | | GGCCCCTGAACATTAAAGGGGCACGGATATACGGTA |
IRR (Length: 36 bp) | | GGCCCCTGAACATTAAAGGGGCACGGATATACGGTA |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGCCCCTGAA CATTAAAGGG GCACGGATAT ACGGTAGGGG CTTGCGAGCG GCCGTCGGCG CCGCGGGCTA GGCTCGCATG CGTCTCTGGG CAAGAAGCTT 100
TTCGCCGTCA GGGCGGAGTT CCCCGCGCGG CCCGACGAAC CGGTAGAGGG TTTGTCGGGT GATGCCGAGT TCGACGCAGA GGTCGGCGAC ACAGGTTTCA 200
GGGTTTCCCA TCGCGGCCTG AGCCAAGCGC AGCTTGGCGG GCGTCATCTT GAAGGGGCGG CCGCCCTTGC GGCCTCGTGC CCGTGCGGCA GCCAGCCCGG 300
CCTTTGTGCG CTCGGAGATC AGCTCGCGCT CGAACTCGGC CAAGCCGGCG AAGATCGCGA ACACCAAGCG GCCGTTGGCC GTGGTGGTGT CGATCGAGGC 400
GCCTTCGCCC GACAGCACCT TCAGGCCGAC GTGGCGCCGA GTGAGGTCTC CAACCAGGTT GACCAAGTGG CGCAGGTCGC GGCCGAGCCG GTCCAGCTTC 500
CACACCGCGA GCGTATCGCC GGCGCGCAGC GCCTTGAGGC AGGCCTCGAG CCCGGGACGG TCGTCGCTCC GTCCGGACGC GGCATCCTCG TAGATGTGCT 600
CGGGATCGAC CCCGGCCTTC CTCAGCGCAT CGCGCTGCAG GTCGTGCACC TGGCTGCCGT CGGCCTTCGA CACCCGCGCG TAGCCGATCA GCACCGTCAT 700
TGATACGTCC GTTGGCGTGA CACTGCCGAC CGTGCCCTGT TGAGGCCGCT CGTTTGTCAC ATAATCCGTC TTGCAACGTA AGCTCTGCCA ACGGCGTCTG 800
CGCGGGATTT CGTGACACCC TGGAAGAGCC GATGCCGCGC CGCCTGATCC TGACCGATGC CGAGCGGCGG ACGATCCTCG CCCTGCCGAC CGATGAGGCG 900
ACTTTGATCC GTCACTGGAG CCTCGATGAC CAGGATCTCG CGCTCCTCGA CACGCGGCGG CGCGACGACA CCCGGCTCGG CCTGGCGCTC CAGCTCTGCG 1000
CCCTGCGCTA CCCCGGCCGC CTGATCCGCC CCGGTGAGAC GATCCCCGAG GCCGCGGCGG TGTTCCTGGC CGACCAGCTC GGGGGCGATC CGGACGCGCT 1100
CGCCAGCTTC GCGCGCCGCG CCCCCACCCG CTACGAGCAA CTGACGATCC TGCGCCGACG CTTCGGCTTC ACCGACCTGT GCCGCCCGTT GCGCGGCGAT 1200
CTCGTCGCCT TCGCACGGGG CATCGCTCTG GCGGTTGCCA AGGATCGCCT CGTCGTCACG GCCCTGGCCG AGGAGATGCG GCGGCGGCGC ATCGTCATCC 1300
CCGGCATCAC GGTGCTGGAG CGCCTCGCGG CTCAGGCCTG CACCGAGGCC GAGGACGCTC TTCTGGCCGA CGTCGCGGGG CGGCTGACGC CCGACCTCGT 1400
CATCCGCATG GAGGCGCTGC TCACTGTGGG ACCGCTCGCC ATGGGACCAC GACACGCCCG GCAGAGCGGG ATCTCCTGGC TGCGCGAGCC GCCGGGATCC 1500
GCCGGCACGG CGGCCATGCG CGGCCTCGTC GACCGCCTCG AAGCCGTGCG TCACGTCGGC GTTCCCGCAA CCGTGCTCGG GGGCGTTCCG GCCCACCGCA 1600
TCCGCCGCAT GGCGCAGGAA GGCCGTCGCC TCACGGCCCA GAACTTCGCG CAAATGCGCC CCAGCCGCCG GCACGCGACC TTGGCCGCCT TCCTGCACGA 1700
CACGCAGACG GCGCTGACCG ATGCGGCGAT CGGCATGTTC GAGATCCTGG TCGGCCGCGC GTTCCGGCAG GCCGAGGCCG ATCGTGAGGC ACATCTCACC 1800
GCCAGCGTCG TCGCGGCGGC CGAGGCGCTC GACTTCTTCG CAGGGTTCGG CGACGCCCTT GTGGCCCACA AAGGCGTCGG CCTGTCGCTC GATGCGGCGA 1900
TCACGACCGT CGCGACTTGG GAGGCGCTCG CCCGAGCCAC CGCGGCGGCC CAGGCCAACA GGCAGGCCCG GCACGGTGAC GACACGATCG CCTTCCTGCG 2000
TCGGCATCAT GGCCGCATCC GCGCCTTCGC GGCCCCCTTC CTGACGCGCT TCACGTTCGA GGCCGCCCGG CCCGGCATGG CCCTCGTCAC CGCCGTCTCC 2100
CAACTCGGGG AGGCCTGGAA GGCCGGGCGC CGCTCACCGG GCCAGGCCTG GATCGACGCC GCCTTGTCGT TGCTCGACCG GCGCTGGTCC AGGCACGTCC 2200
GTGCCCCGGA CGGTACCATC GACCGCAAGA TGCTGGAGAT CTTCCTCGTC GTCGAGCTGA AGAACCGGAT CACCGCCGGC GAGGTCTGGG TGGCGGGGTC 2300
ACGGACCTAC CGGGCGCTCG AGGAGAAGCT GATCCCGCCG CAGACCTTCG CGATCATCAA GGCGGAGGCC CGCGTACCCG TCGCTATCCC GGTCGATGTG 2400
GAGATCTACC TGGCCGAGAA GGCCGCCGCG CTCGAAGGGA AGCTGCAGGC GGCGGCGCGC CGCCTGAAGA CGGGACGCGG CGAGACGCGC ATCGGCGCCA 2500
AGGGTCTACG GGTGCCGGCC GTCAGGACAG CGGAGACCGA GGCGGCCGTC GCCCTGGCCC GGCAGGTGGC CGCGACCATG CCGCCGATCC GGCTCACCGA 2600
CCTCATGGCC GACGTCGACC GGATGACCGG CTTCAGTGCC CTGTTCGAAC ATCTGCAGAC CGGACGGCCG CCGGCCGATC GGCGCGTCTT CCTCGCCGCC 2700
CTGATCGCCG AGGCGACCAA CCTCGGCTTC GGCAAGATGG CCTTGGCCTG CCCCGGCCTC ACGCGGCGCC AGCTGCAGCA GGTGGCGATC TGGCACTTCC 2800
GGGAAGACAC CTTCGCCCTG GCTCTGGCCC GGCTGGTCGA GGCCCAACAC GCCGCCCCGT TCTCCGCCAC CTTCGGATCG CACGCCATCG CGTCGTCCGA 2900
CGGCCAGCAC ATCTACCTGG GCGACGGCGG CGAGATCGCC GGCGGCGTCA ACGGCCACTA CGGCTCCGAC CCGATCACCA AGCTCTACAC CACGATCTCG 3000
GGCCGCTATG CGCCCTTCCA CGTCAAGATC ATCGCCGCCA CGGCGAGCGA GGCCGTGCAC GTGCTCGACG CGTTGCTTGA GACTGAGGCC GGCGCGGCCG 3100
TCACCCGGCA CCATGTCGAT GGCGGCGGCG TCAGCGACCT GGTGTTCGCG CTCTGCCATG GGCTCGGCTT CGCCTTCGTG CCGCGCATCC CCGATCTCGA 3200
CGGCCGCTGC CTCTACGGCT TTGCACCAGC CCGGCACTAC GGCGTGCTGC AATCGGTCAT GGGCGAGCGC CTCGACGCCG GCCTGATCCG CCGCCATTGG 3300
GATGACATCC TGCGCCTTCT GACCTCGCTC AGGACCCGCA CCGTCAGCGC CTCGCTGGTG CTGCGACAGC TGTCGGCGAC GCCGCGCCAG AGTGGCCTCG 3400
TCCAGGCGCT GCGGCAGATG GGGCGCGTCG AGCGCACCCT CTTCACCCTC GACTGGATCG GTGACGAACA GCTCCGCAAA GGTACCACGG CCGAACTCAA 3500
CAAGGGCGAG CGCCGCAACG GCCTCGTGCG CGCCGTCAAC CTGCATCGGC TCGGCCGCTT CCGCGACCGC AGCCAGGACA GCCTGGCGAT CCGGGCCTCC 3600
GCCCTCAACC TGGTGGTCAC CGCCATCATC TACTGGAACA CGATCTACAC GGGCCGCGTC GTCGACGCCT TGCGAGCCAG GGGTGCACTC CTTCCCGACC 3700
ACCTCCTCAC CGGCCTGTCG CCCCTCGGCT GGGAGCATAT CGGCCTCACC GGCGACTATC TCTGGGAGGA AACGCCCGGC ATCGATCAGA CCGGGTTCCG 3800
GGCTATCCCG ATCACACCCT GAGCGCATTA CCGTATATCC GTGCCCCTTT AATGTTCAGG GGCC
|
|
|
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tnpR |
TnMex22 |
68-700 |
Accessory Gene |
Resolvase |
- |
tnpA |
TnMex22 |
832-3822 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
TnMex22 |
633 |
68-700 |
- |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MTVLIGYARV SKADGSQVHD LQRDALRKAG VDPEHIYEDA ASGRSDDRPG LEACLKALRA GDTLAVWKLD RLGRDLRHLV NLVGDLTRRH VGLKVLSGEG ASIDTTTANG RLVFAIFAGL AEFERELISE RTKAGLAAAR ARGRKGGRPF KMTPAKLRLA QAAMGNPETC VADLCVELGI TRQTLYRFVG PRGELRPDGE KLLAQRRMRA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
TnMex22 |
2991 |
832-3822 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPRRLILTDA ERRTILALPT DEATLIRHWS LDDQDLALLD TRRRDDTRLG LALQLCALRY PGRLIRPGET IPEAAAVFLA DQLGGDPDAL ASFARRAPTR YEQLTILRRR FGFTDLCRPL RGDLVAFARG IALAVAKDRL VVTALAEEMR RRRIVIPGIT VLERLAAQAC TEAEDALLAD VAGRLTPDLV IRMEALLTVG PLAMGPRHAR QSGISWLREP PGSAGTAAMR GLVDRLEAVR HVGVPATVLG GVPAHRIRRM AQEGRRLTAQ NFAQMRPSRR HATLAAFLHD TQTALTDAAI GMFEILVGRA FRQAEADREA HLTASVVAAA EALDFFAGFG DALVAHKGVG LSLDAAITTV ATWEALARAT AAAQANRQAR HGDDTIAFLR RHHGRIRAFA APFLTRFTFE AARPGMALVT AVSQLGEAWK AGRRSPGQAW IDAALSLLDR RWSRHVRAPD GTIDRKMLEI FLVVELKNRI TAGEVWVAGS RTYRALEEKL IPPQTFAIIK AEARVPVAIP VDVEIYLAEK AAALEGKLQA AARRLKTGRG ETRIGAKGLR VPAVRTAETE AAVALARQVA ATMPPIRLTD LMADVDRMTG FSALFEHLQT GRPPADRRVF LAALIAEATN LGFGKMALAC PGLTRRQLQQ VAIWHFREDT FALALARLVE AQHAAPFSAT FGSHAIASSD GQHIYLGDGG EIAGGVNGHY GSDPITKLYT TISGRYAPFH VKIIAATASE AVHVLDALLE TEAGAAVTRH HVDGGGVSDL VFALCHGLGF AFVPRIPDLD GRCLYGFAPA RHYGVLQSVM GERLDAGLIR RHWDDILRLL TSLRTRTVSA SLVLRQLSAT PRQSGLVQAL RQMGRVERTL FTLDWIGDEQ LRKGTTAELN KGERRNGLVR AVNLHRLGRF RDRSQDSLAI RASALNLVVT AIIYWNTIYT GRVVDALRAR GALLPDHLLT GLSPLGWEHI GLTGDYLWEE TPGIDQTGFR AIPITP
|
|
References |
|
|
Vuilleumier S, Chistoserdova L, Lee MC, Bringel F, Lajus A, Zhou Y, Gourion B, Barbe V, Chang J, Cruveiller S, Dossat C, Gillett W, Gruffaz C, Haugen E, Hourcade E, Levy R, Mangenot S, Muller E, Nadalig T, Pagni M, Penny C, Peyraud R, Robinson DG, Roche D, Rouy Z, Saenampechek C, Salvignol G, Vallenet D, Wu Z, Marx CJ, Vorholt JA, Olson MV, Kaul R, Weissenbach J, Médigue C, Lidstrom ME. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS One. 2009;4(5):e5584. doi: 10.1371/journal.pone.0005584. Epub 2009 May 18. PubMed ID: 19440302
| |
| | |
|
|