Transposon
Name: TnMex22       (Synonyms: Tn7165)
Family: Tn3        Group: Tn163
Evidence of Transposition: yes
 Host     

Host Organism:Methylobacterium extorquens AM1 Molecular Source:megaplasmid
Place of Origin:Oxford, England Date of Isolation:2009
Other Geographic Information:1960 airborne contaminant growing on methylamine

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 36 bp)GGCCCCTGAACATTAAAGGGGCACGGATATACGGTA
IRR (Length: 36 bp)GGCCCCTGAACATTAAAGGGGCACGGATATACGGTA

 Sequence     
DNA SequenceLength  3864 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGCCCCTGAA CATTAAAGGG GCACGGATAT ACGGTAGGGG CTTGCGAGCG GCCGTCGGCG CCGCGGGCTA GGCTCGCATG CGTCTCTGGG CAAGAAGCTT 100
TTCGCCGTCA GGGCGGAGTT CCCCGCGCGG CCCGACGAAC CGGTAGAGGG TTTGTCGGGT GATGCCGAGT TCGACGCAGA GGTCGGCGAC ACAGGTTTCA 200
GGGTTTCCCA TCGCGGCCTG AGCCAAGCGC AGCTTGGCGG GCGTCATCTT GAAGGGGCGG CCGCCCTTGC GGCCTCGTGC CCGTGCGGCA GCCAGCCCGG 300
CCTTTGTGCG CTCGGAGATC AGCTCGCGCT CGAACTCGGC CAAGCCGGCG AAGATCGCGA ACACCAAGCG GCCGTTGGCC GTGGTGGTGT CGATCGAGGC 400
GCCTTCGCCC GACAGCACCT TCAGGCCGAC GTGGCGCCGA GTGAGGTCTC CAACCAGGTT GACCAAGTGG CGCAGGTCGC GGCCGAGCCG GTCCAGCTTC 500
CACACCGCGA GCGTATCGCC GGCGCGCAGC GCCTTGAGGC AGGCCTCGAG CCCGGGACGG TCGTCGCTCC GTCCGGACGC GGCATCCTCG TAGATGTGCT 600
CGGGATCGAC CCCGGCCTTC CTCAGCGCAT CGCGCTGCAG GTCGTGCACC TGGCTGCCGT CGGCCTTCGA CACCCGCGCG TAGCCGATCA GCACCGTCAT 700
TGATACGTCC GTTGGCGTGA CACTGCCGAC CGTGCCCTGT TGAGGCCGCT CGTTTGTCAC ATAATCCGTC TTGCAACGTA AGCTCTGCCA ACGGCGTCTG 800
CGCGGGATTT CGTGACACCC TGGAAGAGCC GATGCCGCGC CGCCTGATCC TGACCGATGC CGAGCGGCGG ACGATCCTCG CCCTGCCGAC CGATGAGGCG 900
ACTTTGATCC GTCACTGGAG CCTCGATGAC CAGGATCTCG CGCTCCTCGA CACGCGGCGG CGCGACGACA CCCGGCTCGG CCTGGCGCTC CAGCTCTGCG 1000
CCCTGCGCTA CCCCGGCCGC CTGATCCGCC CCGGTGAGAC GATCCCCGAG GCCGCGGCGG TGTTCCTGGC CGACCAGCTC GGGGGCGATC CGGACGCGCT 1100
CGCCAGCTTC GCGCGCCGCG CCCCCACCCG CTACGAGCAA CTGACGATCC TGCGCCGACG CTTCGGCTTC ACCGACCTGT GCCGCCCGTT GCGCGGCGAT 1200
CTCGTCGCCT TCGCACGGGG CATCGCTCTG GCGGTTGCCA AGGATCGCCT CGTCGTCACG GCCCTGGCCG AGGAGATGCG GCGGCGGCGC ATCGTCATCC 1300
CCGGCATCAC GGTGCTGGAG CGCCTCGCGG CTCAGGCCTG CACCGAGGCC GAGGACGCTC TTCTGGCCGA CGTCGCGGGG CGGCTGACGC CCGACCTCGT 1400
CATCCGCATG GAGGCGCTGC TCACTGTGGG ACCGCTCGCC ATGGGACCAC GACACGCCCG GCAGAGCGGG ATCTCCTGGC TGCGCGAGCC GCCGGGATCC 1500
GCCGGCACGG CGGCCATGCG CGGCCTCGTC GACCGCCTCG AAGCCGTGCG TCACGTCGGC GTTCCCGCAA CCGTGCTCGG GGGCGTTCCG GCCCACCGCA 1600
TCCGCCGCAT GGCGCAGGAA GGCCGTCGCC TCACGGCCCA GAACTTCGCG CAAATGCGCC CCAGCCGCCG GCACGCGACC TTGGCCGCCT TCCTGCACGA 1700
CACGCAGACG GCGCTGACCG ATGCGGCGAT CGGCATGTTC GAGATCCTGG TCGGCCGCGC GTTCCGGCAG GCCGAGGCCG ATCGTGAGGC ACATCTCACC 1800
GCCAGCGTCG TCGCGGCGGC CGAGGCGCTC GACTTCTTCG CAGGGTTCGG CGACGCCCTT GTGGCCCACA AAGGCGTCGG CCTGTCGCTC GATGCGGCGA 1900
TCACGACCGT CGCGACTTGG GAGGCGCTCG CCCGAGCCAC CGCGGCGGCC CAGGCCAACA GGCAGGCCCG GCACGGTGAC GACACGATCG CCTTCCTGCG 2000
TCGGCATCAT GGCCGCATCC GCGCCTTCGC GGCCCCCTTC CTGACGCGCT TCACGTTCGA GGCCGCCCGG CCCGGCATGG CCCTCGTCAC CGCCGTCTCC 2100
CAACTCGGGG AGGCCTGGAA GGCCGGGCGC CGCTCACCGG GCCAGGCCTG GATCGACGCC GCCTTGTCGT TGCTCGACCG GCGCTGGTCC AGGCACGTCC 2200
GTGCCCCGGA CGGTACCATC GACCGCAAGA TGCTGGAGAT CTTCCTCGTC GTCGAGCTGA AGAACCGGAT CACCGCCGGC GAGGTCTGGG TGGCGGGGTC 2300
ACGGACCTAC CGGGCGCTCG AGGAGAAGCT GATCCCGCCG CAGACCTTCG CGATCATCAA GGCGGAGGCC CGCGTACCCG TCGCTATCCC GGTCGATGTG 2400
GAGATCTACC TGGCCGAGAA GGCCGCCGCG CTCGAAGGGA AGCTGCAGGC GGCGGCGCGC CGCCTGAAGA CGGGACGCGG CGAGACGCGC ATCGGCGCCA 2500
AGGGTCTACG GGTGCCGGCC GTCAGGACAG CGGAGACCGA GGCGGCCGTC GCCCTGGCCC GGCAGGTGGC CGCGACCATG CCGCCGATCC GGCTCACCGA 2600
CCTCATGGCC GACGTCGACC GGATGACCGG CTTCAGTGCC CTGTTCGAAC ATCTGCAGAC CGGACGGCCG CCGGCCGATC GGCGCGTCTT CCTCGCCGCC 2700
CTGATCGCCG AGGCGACCAA CCTCGGCTTC GGCAAGATGG CCTTGGCCTG CCCCGGCCTC ACGCGGCGCC AGCTGCAGCA GGTGGCGATC TGGCACTTCC 2800
GGGAAGACAC CTTCGCCCTG GCTCTGGCCC GGCTGGTCGA GGCCCAACAC GCCGCCCCGT TCTCCGCCAC CTTCGGATCG CACGCCATCG CGTCGTCCGA 2900
CGGCCAGCAC ATCTACCTGG GCGACGGCGG CGAGATCGCC GGCGGCGTCA ACGGCCACTA CGGCTCCGAC CCGATCACCA AGCTCTACAC CACGATCTCG 3000
GGCCGCTATG CGCCCTTCCA CGTCAAGATC ATCGCCGCCA CGGCGAGCGA GGCCGTGCAC GTGCTCGACG CGTTGCTTGA GACTGAGGCC GGCGCGGCCG 3100
TCACCCGGCA CCATGTCGAT GGCGGCGGCG TCAGCGACCT GGTGTTCGCG CTCTGCCATG GGCTCGGCTT CGCCTTCGTG CCGCGCATCC CCGATCTCGA 3200
CGGCCGCTGC CTCTACGGCT TTGCACCAGC CCGGCACTAC GGCGTGCTGC AATCGGTCAT GGGCGAGCGC CTCGACGCCG GCCTGATCCG CCGCCATTGG 3300
GATGACATCC TGCGCCTTCT GACCTCGCTC AGGACCCGCA CCGTCAGCGC CTCGCTGGTG CTGCGACAGC TGTCGGCGAC GCCGCGCCAG AGTGGCCTCG 3400
TCCAGGCGCT GCGGCAGATG GGGCGCGTCG AGCGCACCCT CTTCACCCTC GACTGGATCG GTGACGAACA GCTCCGCAAA GGTACCACGG CCGAACTCAA 3500
CAAGGGCGAG CGCCGCAACG GCCTCGTGCG CGCCGTCAAC CTGCATCGGC TCGGCCGCTT CCGCGACCGC AGCCAGGACA GCCTGGCGAT CCGGGCCTCC 3600
GCCCTCAACC TGGTGGTCAC CGCCATCATC TACTGGAACA CGATCTACAC GGGCCGCGTC GTCGACGCCT TGCGAGCCAG GGGTGCACTC CTTCCCGACC 3700
ACCTCCTCAC CGGCCTGTCG CCCCTCGGCT GGGAGCATAT CGGCCTCACC GGCGACTATC TCTGGGAGGA AACGCCCGGC ATCGATCAGA CCGGGTTCCG 3800
GGCTATCCCG ATCACACCCT GAGCGCATTA CCGTATATCC GTGCCCCTTT AATGTTCAGG GGCC

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpR TnMex22 68-700 Accessory Gene Resolvase -
tnpA TnMex22 832-3822 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnMex22 633 68-700 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MTVLIGYARV SKADGSQVHD LQRDALRKAG VDPEHIYEDA ASGRSDDRPG LEACLKALRA GDTLAVWKLD RLGRDLRHLV NLVGDLTRRH VGLKVLSGEG
ASIDTTTANG RLVFAIFAGL AEFERELISE RTKAGLAAAR ARGRKGGRPF KMTPAKLRLA QAAMGNPETC VADLCVELGI TRQTLYRFVG PRGELRPDGE
KLLAQRRMRA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnMex22 2991 832-3822 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPRRLILTDA ERRTILALPT DEATLIRHWS LDDQDLALLD TRRRDDTRLG LALQLCALRY PGRLIRPGET IPEAAAVFLA DQLGGDPDAL ASFARRAPTR
YEQLTILRRR FGFTDLCRPL RGDLVAFARG IALAVAKDRL VVTALAEEMR RRRIVIPGIT VLERLAAQAC TEAEDALLAD VAGRLTPDLV IRMEALLTVG
PLAMGPRHAR QSGISWLREP PGSAGTAAMR GLVDRLEAVR HVGVPATVLG GVPAHRIRRM AQEGRRLTAQ NFAQMRPSRR HATLAAFLHD TQTALTDAAI
GMFEILVGRA FRQAEADREA HLTASVVAAA EALDFFAGFG DALVAHKGVG LSLDAAITTV ATWEALARAT AAAQANRQAR HGDDTIAFLR RHHGRIRAFA
APFLTRFTFE AARPGMALVT AVSQLGEAWK AGRRSPGQAW IDAALSLLDR RWSRHVRAPD GTIDRKMLEI FLVVELKNRI TAGEVWVAGS RTYRALEEKL
IPPQTFAIIK AEARVPVAIP VDVEIYLAEK AAALEGKLQA AARRLKTGRG ETRIGAKGLR VPAVRTAETE AAVALARQVA ATMPPIRLTD LMADVDRMTG
FSALFEHLQT GRPPADRRVF LAALIAEATN LGFGKMALAC PGLTRRQLQQ VAIWHFREDT FALALARLVE AQHAAPFSAT FGSHAIASSD GQHIYLGDGG
EIAGGVNGHY GSDPITKLYT TISGRYAPFH VKIIAATASE AVHVLDALLE TEAGAAVTRH HVDGGGVSDL VFALCHGLGF AFVPRIPDLD GRCLYGFAPA
RHYGVLQSVM GERLDAGLIR RHWDDILRLL TSLRTRTVSA SLVLRQLSAT PRQSGLVQAL RQMGRVERTL FTLDWIGDEQ LRKGTTAELN KGERRNGLVR
AVNLHRLGRF RDRSQDSLAI RASALNLVVT AIIYWNTIYT GRVVDALRAR GALLPDHLLT GLSPLGWEHI GLTGDYLWEE TPGIDQTGFR AIPITP

 References     

Vuilleumier S, Chistoserdova L, Lee MC, Bringel F, Lajus A, Zhou Y, Gourion B, Barbe V, Chang J, Cruveiller S, Dossat C, Gillett W, Gruffaz C, Haugen E, Hourcade E, Levy R, Mangenot S, Muller E, Nadalig T, Pagni M, Penny C, Peyraud R, Robinson DG, Roche D, Rouy Z, Saenampechek C, Salvignol G, Vallenet D, Wu Z, Marx CJ, Vorholt JA, Olson MV, Kaul R, Weissenbach J, Médigue C, Lidstrom ME. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS One. 2009;4(5):e5584. doi: 10.1371/journal.pone.0005584. Epub 2009 May 18. PubMed ID: 19440302