ISMex22

  • Family Tn3
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) : TnMex22
Accession number :
Transposition : ND
Origin : Methylobacterium extorquens
Host : Methylobacterium extorquens AM1
DNA section
IS Length : 3864 bp

Ends


IR Length : 36 bp

IRL : GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAGGGGCTTGCGAGCG
IRR : GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAATGCGCTCAGGGTG
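The relationship between the two ends can be checked with a short script. The Python sketch below assumes that IRR is written as the reverse complement of the element's right end so that it aligns with IRL; the record does not state this convention explicitly. The helper names `reverse_complement` and `shared_prefix_length` are illustrative only.

```python
# Terminal inverted repeats as listed in this record.
IRL = "GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAGGGGCTTGCGAGCG"
IRR = "GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAATGCGCTCAGGGTG"
# Last 50 bp of the DNA sequence given in this record (top strand).
RIGHT_END = "CACCCTGAGCGCATTACCGTATATCCGTGCCCCTTTAATGTTCAGGGGCC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count leading positions at which the two strings are identical."""
    count = 0
    for x, y in zip(a, b):
        if x != y:
            break
        count += 1
    return count


if __name__ == "__main__":
    # Does the listed IRR correspond to the right end read on the other strand?
    print(reverse_complement(RIGHT_END) == IRR)   # True
    # Length of the perfect match between IRL and IRR, counted from the tips.
    print(shared_prefix_length(IRL, IRR))         # 36
```

Under that assumption, the two listed ends agree perfectly for 36 positions and diverge at position 37, which matches the stated IR length.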

Insertion site


Left flank   Direct repeat   Right flank   DR Length
CGAGCAGGCG   ATGGGT          CAGCGACGCG    6
TGATCCCGCG   TTCAC           GGGGCGGCGG    5
AGGCCACCCC   ATGAT           ATGATCCTGT    5
GGTGCTTCCC   ACCGT           GGGCAGCGCG    5
CAGGTGCTGA   TGGA            TGCCCTGCGT    4
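As a consistency check on the insertion-site data, the Python sketch below takes each site as one string (left flank, direct repeat, and right flank joined together) plus its DR length, and separates the three parts. It assumes the two flanks are of equal length, which the record does not state; `InsertionSite` and `split_site` are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class InsertionSite(NamedTuple):
    left_flank: str
    direct_repeat: str
    right_flank: str


def split_site(site: str, dr_length: int) -> InsertionSite:
    """Split flank + DR + flank, assuming both flanks have the same length."""
    flank, remainder = divmod(len(site) - dr_length, 2)
    if flank < 0 or remainder:
        raise ValueError("site length is inconsistent with equal-length flanks")
    return InsertionSite(
        site[:flank],
        site[flank:flank + dr_length],
        site[flank + dr_length:],
    )


# (joined site string, DR length) for the five insertion sites in this record.
SITES = [
    ("CGAGCAGGCGATGGGTCAGCGACGCG", 6),
    ("TGATCCCGCGTTCACGGGGCGGCGG", 5),
    ("AGGCCACCCCATGATATGATCCTGT", 5),
    ("GGTGCTTCCCACCGTGGGCAGCGCG", 5),
    ("CAGGTGCTGATGGATGCCCTGCGT", 4),
]

if __name__ == "__main__":
    for site, dr_length in SITES:
        parts = split_site(site, dr_length)
        print(parts.left_flank, parts.direct_repeat, parts.right_flank, dr_length)
```

With these inputs every site resolves to 10 bp flanks on either side of a 4 to 6 bp direct repeat.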

DNA sequence

GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAGGGGCTTGCGAGCGGCCGTCGGCGCCGCGGGCTAGGCTCGCATGCGTCTCTGGGCAAGAAGCTT
TTCGCCGTCAGGGCGGAGTTCCCCGCGCGGCCCGACGAACCGGTAGAGGGTTTGTCGGGTGATGCCGAGTTCGACGCAGAGGTCGGCGACACAGGTTTCA
GGGTTTCCCATCGCGGCCTGAGCCAAGCGCAGCTTGGCGGGCGTCATCTTGAAGGGGCGGCCGCCCTTGCGGCCTCGTGCCCGTGCGGCAGCCAGCCCGG
CCTTTGTGCGCTCGGAGATCAGCTCGCGCTCGAACTCGGCCAAGCCGGCGAAGATCGCGAACACCAAGCGGCCGTTGGCCGTGGTGGTGTCGATCGAGGC
GCCTTCGCCCGACAGCACCTTCAGGCCGACGTGGCGCCGAGTGAGGTCTCCAACCAGGTTGACCAAGTGGCGCAGGTCGCGGCCGAGCCGGTCCAGCTTC
CACACCGCGAGCGTATCGCCGGCGCGCAGCGCCTTGAGGCAGGCCTCGAGCCCGGGACGGTCGTCGCTCCGTCCGGACGCGGCATCCTCGTAGATGTGCT
CGGGATCGACCCCGGCCTTCCTCAGCGCATCGCGCTGCAGGTCGTGCACCTGGCTGCCGTCGGCCTTCGACACCCGCGCGTAGCCGATCAGCACCGTCAT
TGATACGTCCGTTGGCGTGACACTGCCGACCGTGCCCTGTTGAGGCCGCTCGTTTGTCACATAATCCGTCTTGCAACGTAAGCTCTGCCAACGGCGTCTG
CGCGGGATTTCGTGACACCCTGGAAGAGCCGATGCCGCGCCGCCTGATCCTGACCGATGCCGAGCGGCGGACGATCCTCGCCCTGCCGACCGATGAGGCG
ACTTTGATCCGTCACTGGAGCCTCGATGACCAGGATCTCGCGCTCCTCGACACGCGGCGGCGCGACGACACCCGGCTCGGCCTGGCGCTCCAGCTCTGCG
CCCTGCGCTACCCCGGCCGCCTGATCCGCCCCGGTGAGACGATCCCCGAGGCCGCGGCGGTGTTCCTGGCCGACCAGCTCGGGGGCGATCCGGACGCGCT
CGCCAGCTTCGCGCGCCGCGCCCCCACCCGCTACGAGCAACTGACGATCCTGCGCCGACGCTTCGGCTTCACCGACCTGTGCCGCCCGTTGCGCGGCGAT
CTCGTCGCCTTCGCACGGGGCATCGCTCTGGCGGTTGCCAAGGATCGCCTCGTCGTCACGGCCCTGGCCGAGGAGATGCGGCGGCGGCGCATCGTCATCC
CCGGCATCACGGTGCTGGAGCGCCTCGCGGCTCAGGCCTGCACCGAGGCCGAGGACGCTCTTCTGGCCGACGTCGCGGGGCGGCTGACGCCCGACCTCGT
CATCCGCATGGAGGCGCTGCTCACTGTGGGACCGCTCGCCATGGGACCACGACACGCCCGGCAGAGCGGGATCTCCTGGCTGCGCGAGCCGCCGGGATCC
GCCGGCACGGCGGCCATGCGCGGCCTCGTCGACCGCCTCGAAGCCGTGCGTCACGTCGGCGTTCCCGCAACCGTGCTCGGGGGCGTTCCGGCCCACCGCA
TCCGCCGCATGGCGCAGGAAGGCCGTCGCCTCACGGCCCAGAACTTCGCGCAAATGCGCCCCAGCCGCCGGCACGCGACCTTGGCCGCCTTCCTGCACGA
CACGCAGACGGCGCTGACCGATGCGGCGATCGGCATGTTCGAGATCCTGGTCGGCCGCGCGTTCCGGCAGGCCGAGGCCGATCGTGAGGCACATCTCACC
GCCAGCGTCGTCGCGGCGGCCGAGGCGCTCGACTTCTTCGCAGGGTTCGGCGACGCCCTTGTGGCCCACAAAGGCGTCGGCCTGTCGCTCGATGCGGCGA
TCACGACCGTCGCGACTTGGGAGGCGCTCGCCCGAGCCACCGCGGCGGCCCAGGCCAACAGGCAGGCCCGGCACGGTGACGACACGATCGCCTTCCTGCG
TCGGCATCATGGCCGCATCCGCGCCTTCGCGGCCCCCTTCCTGACGCGCTTCACGTTCGAGGCCGCCCGGCCCGGCATGGCCCTCGTCACCGCCGTCTCC
CAACTCGGGGAGGCCTGGAAGGCCGGGCGCCGCTCACCGGGCCAGGCCTGGATCGACGCCGCCTTGTCGTTGCTCGACCGGCGCTGGTCCAGGCACGTCC
GTGCCCCGGACGGTACCATCGACCGCAAGATGCTGGAGATCTTCCTCGTCGTCGAGCTGAAGAACCGGATCACCGCCGGCGAGGTCTGGGTGGCGGGGTC
ACGGACCTACCGGGCGCTCGAGGAGAAGCTGATCCCGCCGCAGACCTTCGCGATCATCAAGGCGGAGGCCCGCGTACCCGTCGCTATCCCGGTCGATGTG
GAGATCTACCTGGCCGAGAAGGCCGCCGCGCTCGAAGGGAAGCTGCAGGCGGCGGCGCGCCGCCTGAAGACGGGACGCGGCGAGACGCGCATCGGCGCCA
AGGGTCTACGGGTGCCGGCCGTCAGGACAGCGGAGACCGAGGCGGCCGTCGCCCTGGCCCGGCAGGTGGCCGCGACCATGCCGCCGATCCGGCTCACCGA
CCTCATGGCCGACGTCGACCGGATGACCGGCTTCAGTGCCCTGTTCGAACATCTGCAGACCGGACGGCCGCCGGCCGATCGGCGCGTCTTCCTCGCCGCC
CTGATCGCCGAGGCGACCAACCTCGGCTTCGGCAAGATGGCCTTGGCCTGCCCCGGCCTCACGCGGCGCCAGCTGCAGCAGGTGGCGATCTGGCACTTCC
GGGAAGACACCTTCGCCCTGGCTCTGGCCCGGCTGGTCGAGGCCCAACACGCCGCCCCGTTCTCCGCCACCTTCGGATCGCACGCCATCGCGTCGTCCGA
CGGCCAGCACATCTACCTGGGCGACGGCGGCGAGATCGCCGGCGGCGTCAACGGCCACTACGGCTCCGACCCGATCACCAAGCTCTACACCACGATCTCG
GGCCGCTATGCGCCCTTCCACGTCAAGATCATCGCCGCCACGGCGAGCGAGGCCGTGCACGTGCTCGACGCGTTGCTTGAGACTGAGGCCGGCGCGGCCG
TCACCCGGCACCATGTCGATGGCGGCGGCGTCAGCGACCTGGTGTTCGCGCTCTGCCATGGGCTCGGCTTCGCCTTCGTGCCGCGCATCCCCGATCTCGA
CGGCCGCTGCCTCTACGGCTTTGCACCAGCCCGGCACTACGGCGTGCTGCAATCGGTCATGGGCGAGCGCCTCGACGCCGGCCTGATCCGCCGCCATTGG
GATGACATCCTGCGCCTTCTGACCTCGCTCAGGACCCGCACCGTCAGCGCCTCGCTGGTGCTGCGACAGCTGTCGGCGACGCCGCGCCAGAGTGGCCTCG
TCCAGGCGCTGCGGCAGATGGGGCGCGTCGAGCGCACCCTCTTCACCCTCGACTGGATCGGTGACGAACAGCTCCGCAAAGGTACCACGGCCGAACTCAA
CAAGGGCGAGCGCCGCAACGGCCTCGTGCGCGCCGTCAACCTGCATCGGCTCGGCCGCTTCCGCGACCGCAGCCAGGACAGCCTGGCGATCCGGGCCTCC
GCCCTCAACCTGGTGGTCACCGCCATCATCTACTGGAACACGATCTACACGGGCCGCGTCGTCGACGCCTTGCGAGCCAGGGGTGCACTCCTTCCCGACC
ACCTCCTCACCGGCCTGTCGCCCCTCGGCTGGGAGCATATCGGCCTCACCGGCGACTATCTCTGGGAGGAAACGCCCGGCATCGATCAGACCGGGTTCCG
GGCTATCCCGATCACACCCTGAGCGCATTACCGTATATCCGTGCCCCTTTAATGTTCAGGGGCC
Protein section
ORF number : 2

 

ORF 1
Length            Begin   End   Strand   Fusion ORF
633 bp / 210 aa   68      700   -        No
ORF function : Accessory Gene
AG : Tn3 resolvase

ORF sequence :

MTVLIGYARVSKADGSQVHDLQRDALRKAGVDPEHIYEDAASGRSDDRPGLEACLKALRAGDTLAVWKLDRLGRDLRHLVNLVGDLTRRHVGLKVLSGEG
ASIDTTTANGRLVFAIFAGLAEFERELISERTKAGLAAARARGRKGGRPFKMTPAKLRLAQAAMGNPETCVADLCVELGITRQTLYRFVGPRGELRPDGE
KLLAQRRMRA

 

ORF 2
Length             Begin   End    Strand   Fusion ORF
2991 bp / 996 aa   832     3822   +        No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MPRRLILTDAERRTILALPTDEATLIRHWSLDDQDLALLDTRRRDDTRLGLALQLCALRYPGRLIRPGETIPEAAAVFLADQLGGDPDALASFARRAPTR
YEQLTILRRRFGFTDLCRPLRGDLVAFARGIALAVAKDRLVVTALAEEMRRRRIVIPGITVLERLAAQACTEAEDALLADVAGRLTPDLVIRMEALLTVG
PLAMGPRHARQSGISWLREPPGSAGTAAMRGLVDRLEAVRHVGVPATVLGGVPAHRIRRMAQEGRRLTAQNFAQMRPSRRHATLAAFLHDTQTALTDAAI
GMFEILVGRAFRQAEADREAHLTASVVAAAEALDFFAGFGDALVAHKGVGLSLDAAITTVATWEALARATAAAQANRQARHGDDTIAFLRRHHGRIRAFA
APFLTRFTFEAARPGMALVTAVSQLGEAWKAGRRSPGQAWIDAALSLLDRRWSRHVRAPDGTIDRKMLEIFLVVELKNRITAGEVWVAGSRTYRALEEKL
IPPQTFAIIKAEARVPVAIPVDVEIYLAEKAAALEGKLQAAARRLKTGRGETRIGAKGLRVPAVRTAETEAAVALARQVAATMPPIRLTDLMADVDRMTG
FSALFEHLQTGRPPADRRVFLAALIAEATNLGFGKMALACPGLTRRQLQQVAIWHFREDTFALALARLVEAQHAAPFSATFGSHAIASSDGQHIYLGDGG
EIAGGVNGHYGSDPITKLYTTISGRYAPFHVKIIAATASEAVHVLDALLETEAGAAVTRHHVDGGGVSDLVFALCHGLGFAFVPRIPDLDGRCLYGFAPA
RHYGVLQSVMGERLDAGLIRRHWDDILRLLTSLRTRTVSASLVLRQLSATPRQSGLVQALRQMGRVERTLFTLDWIGDEQLRKGTTAELNKGERRNGLVR
AVNLHRLGRFRDRSQDSLAIRASALNLVVTAIIYWNTIYTGRVVDALRARGALLPDHLLTGLSPLGWEHIGLTGDYLWEETPGIDQTGFRAIPITP*
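
The ORF coordinates, nucleotide lengths, and protein lengths in this record can be cross-checked arithmetically. The Python sketch below assumes that Begin and End are 1-based inclusive positions on the 3864 bp element and that each nucleotide length includes the stop codon; neither convention is stated in the record, and the names `ORFS`, `span_bp`, and `coding_aa` are illustrative.

```python
IS_LENGTH = 3864  # bp, from the DNA section of this record

# Values copied from the ORF tables in this record.
ORFS = {
    "ORF 1": {"begin": 68, "end": 700, "length_bp": 633, "length_aa": 210},
    "ORF 2": {"begin": 832, "end": 3822, "length_bp": 2991, "length_aa": 996},
}


def span_bp(begin: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - begin + 1


def coding_aa(length_bp: int) -> int:
    """Residue count, assuming length_bp includes one stop codon."""
    return length_bp // 3 - 1


if __name__ == "__main__":
    for name, orf in ORFS.items():
        ok = (
            span_bp(orf["begin"], orf["end"]) == orf["length_bp"]
            and orf["length_bp"] % 3 == 0
            and coding_aa(orf["length_bp"]) == orf["length_aa"]
            and 1 <= orf["begin"] < orf["end"] <= IS_LENGTH
        )
        print(name, "consistent" if ok else "inconsistent")
```

Under these assumptions both ORFs are internally consistent: 700 - 68 + 1 = 633 bp and 3822 - 832 + 1 = 2991 bp, giving 210 and 996 residues once the stop codon is excluded.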

 

Comments
ISMex22 is present as a single copy in each of the five replicons of Methylobacterium extorquens AM1.
ISMex22 shares 64% amino acid similarity with ISThsp9 and 54% (ORFB) with ISSod9.
References
[1] Stephane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE, submitted.
[2] Ming-Chun Lee (2009). Direct submission.