Transposon
Name: TnMex38       (Synonyms: Tn7166)
Family: Tn3        Group: Tn163
Evidence of Transposition: yes
 Host     

Host Organism:Methylobacterium extorquens AM1 Molecular Source:plasmid p2META1
Place of Origin:Oxford Date of Isolation:2009
Other Geographic Information:1960 airborne contaminant growing on methylamine

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 36 bp)GAGAGCATCGTGTTGGTTGTTCGCCCATTTACGGTT
IRR (Length: 36 bp)GAGGGCATCGTGTTGACTGTTCGCCCAGTTACGCTT

 Sequence     
DNA SequenceLength  3841 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GAGAGCATCG TGTTGGTTGT TCGCCCATTT ACGGTTGGCA GGACCGGGCC GTGAACGGGA CATGCTTTTC TGATGGGCGC AACGGTCGGC TACGCCGGTC 100
GGGCGACGCC CGCCAAGGCC ATTTCGCGGT AGACGGTCGC GCGTCCGAGC CCGACCTGGC GAGCCGCCTC GGTCGGTGAG AGGCCGGCCT GGACGAGCTT 200
CAGGGCGGCG CCGACCTTGT CGGGATCGAT CGGCTGCCGA CCCGGGCGCC GACCCTTGGC CCGGGCCGCC GCGATGCCGT CCTTGGTGCG CTCGGCGATG 300
AGCCGCCGCT CGAAGTGCGC GATGGCGCCG AACACGTGGA AGACCAGCTC GCCGGCGGCC GAGCCGGTGT CGATCTTCTC CTCGAGGCTG AGGAGTGCGA 400
TGCCGCGCTC CTTGAGCAGG GTCACGCTCG ACAGCAGCTC GGCGAGGGAT CGGCCCAGAC GATCGAGGCG CACAACAGCC AGGGTGTCCC CTTTTCGAGC 500
ATAGGCCAGG AGTTCGCTCA GACCGGGTCG ATCCATGGAT TTGCCGGACC GCACGTCCGT GAACACCCGG ATGGCGCCGG CGTGTTCCAG CCGCAGCCTC 600
TGTCCGGCGA CGTCCTGGTC CCCCGTCGAC ACCCGGGCAT AGCCCAGCAC ATCGCCCATC GACGCCCTCT GCCCGTCCCG CAACCGGCCG TTCTGGGGAC 700
GGGTGCGCCG ACGATCCGCG ACGCGCCTCG ATCCCGTCCA CCAACTATGT CCCGTTAAGC CGCACTGTCT AGCTTCGACC GCGACCTTTC CTGGACGGTT 800
CGGGCATAGC GGGACGATGG CAAGACGGTC GCTTCTGAGC ACGGCGGAGC GCGCGCGCCT GTTCGGCATC CCGGTCGATC GCGATGGGCT GGCGCGGCAC 900
TACACCTTCG ATCGCCAGGA CCTCGCCCTC ATCGCCACGC GCCGCGGCGA CGCCAACCGG ATCGGCTTCG CGGTGCAGCT CGCGCTGCTG CGTCATCCTG 1000
GCTTCGGCTT GTCCCCGGCG ATCACGGTCG AGCCGGCACT GGTCACCCGC ATCGCCGAAC AGCTCGCGAT CGACCCGAGG GCCTTCACCG CCTATGCGGG 1100
GCGCAGTCCC ACGATATCCG ATCACGCCCG CGCCCTCGAA CGGGTGCTGG GACTTCGCCC CTGCGCGAGG GCCGACCTGC CGTCGATGAT CACGGCCGCG 1200
GCCCGGGCGG CATGGCCGAC CGACCAGGGG GAGCCGATCG CGGTCGCGGT GATGGCCGCG TTGCGCGACA GCGGCATCGT CCTGCCGGCA CCCGACACGA 1300
TCGAGCGCGC GGGGCTCGCC GGTCGCGCGC GTGCGAGGAA ACAGGTCGCC GCCGCGCTGC TCGCGGGCAT GACCGACGCG CTCGCGGCGC AGCTCGACGC 1400
CCTTCTGGCG ATCGATCCGA AGATCGGGCG GCCACCCCTC TCCTGGATGA AGGACCTGCC CCGTGCGCCC AAGCCCAACC ATGTCCGCGA ACTCCTCGAC 1500
AAGCTCGCTG CCGTGCGGGA TCTCGGGCTC GATCCGCGGG CGGCCGAGCG CATCCACCCC GACCGGCTCG CGCTCCTGAT GCGCGAAGGT CGGATCACGC 1600
CGGCCTCTAC CCTCGAGCGT TATGCCCCGT CGCGGCGGCG TGCCATCGTG GTCGCGACGT TGCTCGATCT CGAACGCCGC TTGACCGATG CGGCGCTGTC 1700
CATGGCGGAC CGCCTCATCG GCGCGAGCTT CACACGCGGC AAAGCCGCGC GCGAGAGGAC CTTCGTGGCC ACCTCGCGCG ACGTCGGCCG GCTCATGCGC 1800
CTTCTCGCGG GGACCGCCGG CGCAGTCGCG ACGGCCATGA AGGAGAACGG CGACGCGCTC GCCGCGATCG ATGCCGCCGT CGGGCTCGAC AGGCTCATCG 1900
CGGCAAAGCC CCAGGCGGCC GAGATCGCCG ACGTCGCAGA GGAGGATCCG CTCGTGCGCG CCGCCGATCG CTGGATGAGG CTGCGCAAGT ACGGGCCGAT 2000
GCTGATCGAG GCGATCGACT TCAAGGCGGC GCGCGCCGAT GACCGCACGG TCGCGGCCCT GACCGCGTTG CGCGATCTGA ACCGCTCGGG CAAGCGGGAC 2100
CTTCCCAAGG GTACGCCGAT GCCGTTCAAG AAGGAATGGC GCCGGCTCGT GGCCGGGGCG GACGGCAGGC TCGATCGCCG ACTGTTCGAG ACGGCCCTGT 2200
TCGCCCATCT GAGGAACAAA TGGCGCTCGG GTGACCTGTG GGTCGAGCGC TCGACCCACT ACCGTCGCTT CGACAGTTAT CTCCTGCCCC TCGACGAAGC 2300
GCGGACTATC GTCGCCCCGC TCGGCCTGCC TTGCGACCCC GACGCCTGGC TGGCGGCCCG CGCGGAGCGG CTCGACCGGC GGCTGAAGCG CCTCGGCCGG 2400
CATCTCGGCC GCGGGACTCT CGAAGGCGTG AGCCTGAGGA ACGGCAAGCT CTCCATCGCG CCGGTCCGTG CCGACAAGAA CCCGGAGGCA GAGGCTCTCG 2500
CAGCCCGCAT CGGCGCGCTG ATGCCGCGTA TCCGCATCAC CGAACTCCTC CACGAGGTGG CGCGCGAGAC CGGGTTCCTG TCCGCCTTCA CCAACGTCCG 2600
CACCCGGCAG CCGGTCGAGG ACGGGAACGC GCTGCTGGCC GTCATCCTCG CCGACGCGAC CAATCTCGGC CTGTCCCGCA TGGCCGAAGC CAGCCAGGGG 2700
GTCACGCGCG ACCAGCTGTT CTGGACGCGC GACGCCTTCA TCCGCGACGA GACCTACAAG GCCGCCCTCG GCCGGATCGT CGATGCGCAT CATGCACTCC 2800
CGATCGCGGC CGTGTGGGGC GAGGGCACCA CCGCGGCGAG CGACGGCCAG TTCTTCCGCT CCGGCAAGCG CGGCGACGGT GCCGGCCAGG TCAACGCCCG 2900
CCACGGCATC GAGCCGGGCT ACTGCTTCTA CACCCACACC TCCGACCAGC ACGGGCCGAT GCGTTCGGTC AGCATGGCGG CGGCCGAGCA CGAGGCCCCC 3000
TACGTGCTCG ACGGGCTTCT GCACCACGGC ACCGGCCTGA CCATCGCCGA GCACTACACC GATACCGGGG GCAGTTCGGA TCACGTCCAC TTCCTGTGCG 3100
ACAGCCTGGG CATCCGGTTC TGTCCGCGGC TGCGCGACTT CCCCGATCGG CGGCTCGCCT GCCTGGAGCC ACCGTCACGC TACCCGGCGC TCGGCGGCCT 3200
CCTGGGCAAG CGGGTCAAGG CCGATCTGAT CCGCGCGCAC TGGAACGACA TCGTCCGGCT GGTCGCCACC CTGAAGGCGG GCGTCGTCGC GCCCTCCACG 3300
ATGTTGAAGA AGCTCGCGGC CTACGAGCGG CAGAACCAGC TGGACCTCGC GATCGGGGAA GCCGGCCGTC TCGTGCGGGC CGAGTTCATG ATCGACTGGA 3400
TGGAGGGACC GGCCCTGCGA CGGCGCAGCC AGGCCGGGCT CAACAAATCC GAGCAACGCC ATACGCTGGC GAGCGTCGTG TGCACCTACG GGCAGGGCCG 3500
CATCGCCGAT CGGAGCCAGG AGGTGCAGGA GTACCGGGCC TCGGGGCTCA ACCTGGTGAT CGCCGCCATC GTGTACTGGA ACTCGACCTA CATGGCCGAC 3600
GCGGTCGCGC ATCTGCGTCG CAGCGGTGAT CCCACCCCCG ATCGTCTGCT CGCCCACACG TCGCCGGTCG AATGGGAGCA CATCGGCTTC TCGGGCGACT 3700
TCCTGTGGCA CCGCGCCGCC ATGATGCCTG CCAGCCGGCG AAGGCTCAAT CTCACCAAGA CCCAGCCCGC AGCCGCTTGA ACCACGTTCA CTGAACGTTC 3800
GCACAAAGCG TAACTGGGCG AACAGTCAAC ACGATGCCCT C

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpR TnMex38 90-659 Accessory Gene Resolvase -
tnpA TnMex38 817-3780 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnMex38 570 90-659 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MGDVLGYARV STGDQDVAGQ RLRLEHAGAI RVFTDVRSGK SMDRPGLSEL LAYARKGDTL AVVRLDRLGR SLAELLSSVT LLKERGIALL SLEEKIDTGS
AAGELVFHVF GAIAHFERRL IAERTKDGIA AARAKGRRPG RQPIDPDKVG AALKLVQAGL SPTEAARQVG LGRATVYREM ALAGVARPA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnMex38 2964 817-3780 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MARRSLLSTA ERARLFGIPV DRDGLARHYT FDRQDLALIA TRRGDANRIG FAVQLALLRH PGFGLSPAIT VEPALVTRIA EQLAIDPRAF TAYAGRSPTI
SDHARALERV LGLRPCARAD LPSMITAAAR AAWPTDQGEP IAVAVMAALR DSGIVLPAPD TIERAGLAGR ARARKQVAAA LLAGMTDALA AQLDALLAID
PKIGRPPLSW MKDLPRAPKP NHVRELLDKL AAVRDLGLDP RAAERIHPDR LALLMREGRI TPASTLERYA PSRRRAIVVA TLLDLERRLT DAALSMADRL
IGASFTRGKA ARERTFVATS RDVGRLMRLL AGTAGAVATA MKENGDALAA IDAAVGLDRL IAAKPQAAEI ADVAEEDPLV RAADRWMRLR KYGPMLIEAI
DFKAARADDR TVAALTALRD LNRSGKRDLP KGTPMPFKKE WRRLVAGADG RLDRRLFETA LFAHLRNKWR SGDLWVERST HYRRFDSYLL PLDEARTIVA
PLGLPCDPDA WLAARAERLD RRLKRLGRHL GRGTLEGVSL RNGKLSIAPV RADKNPEAEA LAARIGALMP RIRITELLHE VARETGFLSA FTNVRTRQPV
EDGNALLAVI LADATNLGLS RMAEASQGVT RDQLFWTRDA FIRDETYKAA LGRIVDAHHA LPIAAVWGEG TTAASDGQFF RSGKRGDGAG QVNARHGIEP
GYCFYTHTSD QHGPMRSVSM AAAEHEAPYV LDGLLHHGTG LTIAEHYTDT GGSSDHVHFL CDSLGIRFCP RLRDFPDRRL ACLEPPSRYP ALGGLLGKRV
KADLIRAHWN DIVRLVATLK AGVVAPSTML KKLAAYERQN QLDLAIGEAG RLVRAEFMID WMEGPALRRR SQAGLNKSEQ RHTLASVVCT YGQGRIADRS
QEVQEYRASG LNLVIAAIVY WNSTYMADAV AHLRRSGDPT PDRLLAHTSP VEWEHIGFSG DFLWHRAAMM PASRRRLNLT KTQPAAA

 References     

Vuilleumier S, Chistoserdova L, Lee MC, Bringel F, Lajus A, Zhou Y, Gourion B, Barbe V, Chang J, Cruveiller S, Dossat C, Gillett W, Gruffaz C, Haugen E, Hourcade E, Levy R, Mangenot S, Muller E, Nadalig T, Pagni M, Penny C, Peyraud R, Robinson DG, Roche D, Rouy Z, Saenampechek C, Salvignol G, Vallenet D, Wu Z, Marx CJ, Vorholt JA, Olson MV, Kaul R, Weissenbach J, Médigue C, Lidstrom ME. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS One. 2009;4(5):e5584. doi: 10.1371/journal.pone.0005584. Epub 2009 May 18. PubMed ID: 19440302