ISMex38

  • Family Tn3
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s) TnMex38
Accession numberTranspositionOriginHost
ND Methylobacterium extorquens
Methylobacterium extorquens AM1
Methylobacterium chloromethanicum CM4 plasmid pMCHL01
DNA section
IS Length : 3841 bp

Ends


IR Length : 31/36

IRL : GAGAGCATCGTGTTGGTTGTTCGCCCATTTACGGTTGGCAGGACCGGGCC
IRR : GAGGGCATCGTGTTGACTGTTCGCCCAGTTACGCTTTGTGCGAACGTTCA

Insertion site


Left flankDirect repeatRight flankDR Length
AGGATCTCTTGGTAGAGACAGCC5

DNA sequence

GAGAGCATCGTGTTGGTTGTTCGCCCATTTACGGTTGGCAGGACCGGGCCGTGAACGGGACATGCTTTTCTGATGGGCGCAACGGTCGGCTACGCCGGTC
GGGCGACGCCCGCCAAGGCCATTTCGCGGTAGACGGTCGCGCGTCCGAGCCCGACCTGGCGAGCCGCCTCGGTCGGTGAGAGGCCGGCCTGGACGAGCTT
CAGGGCGGCGCCGACCTTGTCGGGATCGATCGGCTGCCGACCCGGGCGCCGACCCTTGGCCCGGGCCGCCGCGATGCCGTCCTTGGTGCGCTCGGCGATG
AGCCGCCGCTCGAAGTGCGCGATGGCGCCGAACACGTGGAAGACCAGCTCGCCGGCGGCCGAGCCGGTGTCGATCTTCTCCTCGAGGCTGAGGAGTGCGA
TGCCGCGCTCCTTGAGCAGGGTCACGCTCGACAGCAGCTCGGCGAGGGATCGGCCCAGACGATCGAGGCGCACAACAGCCAGGGTGTCCCCTTTTCGAGC
ATAGGCCAGGAGTTCGCTCAGACCGGGTCGATCCATGGATTTGCCGGACCGCACGTCCGTGAACACCCGGATGGCGCCGGCGTGTTCCAGCCGCAGCCTC
TGTCCGGCGACGTCCTGGTCCCCCGTCGACACCCGGGCATAGCCCAGCACATCGCCCATCGACGCCCTCTGCCCGTCCCGCAACCGGCCGTTCTGGGGAC
GGGTGCGCCGACGATCCGCGACGCGCCTCGATCCCGTCCACCAACTATGTCCCGTTAAGCCGCACTGTCTAGCTTCGACCGCGACCTTTCCTGGACGGTT
CGGGCATAGCGGGACGATGGCAAGACGGTCGCTTCTGAGCACGGCGGAGCGCGCGCGCCTGTTCGGCATCCCGGTCGATCGCGATGGGCTGGCGCGGCAC
TACACCTTCGATCGCCAGGACCTCGCCCTCATCGCCACGCGCCGCGGCGACGCCAACCGGATCGGCTTCGCGGTGCAGCTCGCGCTGCTGCGTCATCCTG
GCTTCGGCTTGTCCCCGGCGATCACGGTCGAGCCGGCACTGGTCACCCGCATCGCCGAACAGCTCGCGATCGACCCGAGGGCCTTCACCGCCTATGCGGG
GCGCAGTCCCACGATATCCGATCACGCCCGCGCCCTCGAACGGGTGCTGGGACTTCGCCCCTGCGCGAGGGCCGACCTGCCGTCGATGATCACGGCCGCG
GCCCGGGCGGCATGGCCGACCGACCAGGGGGAGCCGATCGCGGTCGCGGTGATGGCCGCGTTGCGCGACAGCGGCATCGTCCTGCCGGCACCCGACACGA
TCGAGCGCGCGGGGCTCGCCGGTCGCGCGCGTGCGAGGAAACAGGTCGCCGCCGCGCTGCTCGCGGGCATGACCGACGCGCTCGCGGCGCAGCTCGACGC
CCTTCTGGCGATCGATCCGAAGATCGGGCGGCCACCCCTCTCCTGGATGAAGGACCTGCCCCGTGCGCCCAAGCCCAACCATGTCCGCGAACTCCTCGAC
AAGCTCGCTGCCGTGCGGGATCTCGGGCTCGATCCGCGGGCGGCCGAGCGCATCCACCCCGACCGGCTCGCGCTCCTGATGCGCGAAGGTCGGATCACGC
CGGCCTCTACCCTCGAGCGTTATGCCCCGTCGCGGCGGCGTGCCATCGTGGTCGCGACGTTGCTCGATCTCGAACGCCGCTTGACCGATGCGGCGCTGTC
CATGGCGGACCGCCTCATCGGCGCGAGCTTCACACGCGGCAAAGCCGCGCGCGAGAGGACCTTCGTGGCCACCTCGCGCGACGTCGGCCGGCTCATGCGC
CTTCTCGCGGGGACCGCCGGCGCAGTCGCGACGGCCATGAAGGAGAACGGCGACGCGCTCGCCGCGATCGATGCCGCCGTCGGGCTCGACAGGCTCATCG
CGGCAAAGCCCCAGGCGGCCGAGATCGCCGACGTCGCAGAGGAGGATCCGCTCGTGCGCGCCGCCGATCGCTGGATGAGGCTGCGCAAGTACGGGCCGAT
GCTGATCGAGGCGATCGACTTCAAGGCGGCGCGCGCCGATGACCGCACGGTCGCGGCCCTGACCGCGTTGCGCGATCTGAACCGCTCGGGCAAGCGGGAC
CTTCCCAAGGGTACGCCGATGCCGTTCAAGAAGGAATGGCGCCGGCTCGTGGCCGGGGCGGACGGCAGGCTCGATCGCCGACTGTTCGAGACGGCCCTGT
TCGCCCATCTGAGGAACAAATGGCGCTCGGGTGACCTGTGGGTCGAGCGCTCGACCCACTACCGTCGCTTCGACAGTTATCTCCTGCCCCTCGACGAAGC
GCGGACTATCGTCGCCCCGCTCGGCCTGCCTTGCGACCCCGACGCCTGGCTGGCGGCCCGCGCGGAGCGGCTCGACCGGCGGCTGAAGCGCCTCGGCCGG
CATCTCGGCCGCGGGACTCTCGAAGGCGTGAGCCTGAGGAACGGCAAGCTCTCCATCGCGCCGGTCCGTGCCGACAAGAACCCGGAGGCAGAGGCTCTCG
CAGCCCGCATCGGCGCGCTGATGCCGCGTATCCGCATCACCGAACTCCTCCACGAGGTGGCGCGCGAGACCGGGTTCCTGTCCGCCTTCACCAACGTCCG
CACCCGGCAGCCGGTCGAGGACGGGAACGCGCTGCTGGCCGTCATCCTCGCCGACGCGACCAATCTCGGCCTGTCCCGCATGGCCGAAGCCAGCCAGGGG
GTCACGCGCGACCAGCTGTTCTGGACGCGCGACGCCTTCATCCGCGACGAGACCTACAAGGCCGCCCTCGGCCGGATCGTCGATGCGCATCATGCACTCC
CGATCGCGGCCGTGTGGGGCGAGGGCACCACCGCGGCGAGCGACGGCCAGTTCTTCCGCTCCGGCAAGCGCGGCGACGGTGCCGGCCAGGTCAACGCCCG
CCACGGCATCGAGCCGGGCTACTGCTTCTACACCCACACCTCCGACCAGCACGGGCCGATGCGTTCGGTCAGCATGGCGGCGGCCGAGCACGAGGCCCCC
TACGTGCTCGACGGGCTTCTGCACCACGGCACCGGCCTGACCATCGCCGAGCACTACACCGATACCGGGGGCAGTTCGGATCACGTCCACTTCCTGTGCG
ACAGCCTGGGCATCCGGTTCTGTCCGCGGCTGCGCGACTTCCCCGATCGGCGGCTCGCCTGCCTGGAGCCACCGTCACGCTACCCGGCGCTCGGCGGCCT
CCTGGGCAAGCGGGTCAAGGCCGATCTGATCCGCGCGCACTGGAACGACATCGTCCGGCTGGTCGCCACCCTGAAGGCGGGCGTCGTCGCGCCCTCCACG
ATGTTGAAGAAGCTCGCGGCCTACGAGCGGCAGAACCAGCTGGACCTCGCGATCGGGGAAGCCGGCCGTCTCGTGCGGGCCGAGTTCATGATCGACTGGA
TGGAGGGACCGGCCCTGCGACGGCGCAGCCAGGCCGGGCTCAACAAATCCGAGCAACGCCATACGCTGGCGAGCGTCGTGTGCACCTACGGGCAGGGCCG
CATCGCCGATCGGAGCCAGGAGGTGCAGGAGTACCGGGCCTCGGGGCTCAACCTGGTGATCGCCGCCATCGTGTACTGGAACTCGACCTACATGGCCGAC
GCGGTCGCGCATCTGCGTCGCAGCGGTGATCCCACCCCCGATCGTCTGCTCGCCCACACGTCGCCGGTCGAATGGGAGCACATCGGCTTCTCGGGCGACT
TCCTGTGGCACCGCGCCGCCATGATGCCTGCCAGCCGGCGAAGGCTCAATCTCACCAAGACCCAGCCCGCAGCCGCTTGAACCACGTTCACTGAACGTTC
GCACAAAGCGTAACTGGGCGAACAGTCAACACGATGCCCTC
Protein section
ORF number : 2

 

ORF 1
LengthBeginEndStrandFusion ORF
780 bp259 aa90869-No
ORF function : Accessory Gene
AG : Tn3 resolvase

ORF sequence :

MPNRRARSAVLRSDRLAIVPLCPNRPGKVAVEARQCGLTGHSWWTGSRRVADRRRTRPQNGRLRDGQRASMGDVLGYARVSTGDQDVAGQRLRLEHAGAI
RVFTDVRSGKSMDRPGLSELLAYARKGDTLAVVRLDRLGRSLAELLSSVTLLKERGIALLSLEEKIDTGSAAGELVFHVFGAIAHFERRLIAERTKDGIA
AARAKGRRPGRQPIDPDKVGAALKLVQAGLSPTEAARQVGLGRATVYREMALAGVARPA

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
2964 bp987 aa8173780+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MARRSLLSTAERARLFGIPVDRDGLARHYTFDRQDLALIATRRGDANRIGFAVQLALLRHPGFGLSPAITVEPALVTRIAEQLAIDPRAFTAYAGRSPTI
SDHARALERVLGLRPCARADLPSMITAAARAAWPTDQGEPIAVAVMAALRDSGIVLPAPDTIERAGLAGRARARKQVAAALLAGMTDALAAQLDALLAID
PKIGRPPLSWMKDLPRAPKPNHVRELLDKLAAVRDLGLDPRAAERIHPDRLALLMREGRITPASTLERYAPSRRRAIVVATLLDLERRLTDAALSMADRL
IGASFTRGKAARERTFVATSRDVGRLMRLLAGTAGAVATAMKENGDALAAIDAAVGLDRLIAAKPQAAEIADVAEEDPLVRAADRWMRLRKYGPMLIEAI
DFKAARADDRTVAALTALRDLNRSGKRDLPKGTPMPFKKEWRRLVAGADGRLDRRLFETALFAHLRNKWRSGDLWVERSTHYRRFDSYLLPLDEARTIVA
PLGLPCDPDAWLAARAERLDRRLKRLGRHLGRGTLEGVSLRNGKLSIAPVRADKNPEAEALAARIGALMPRIRITELLHEVARETGFLSAFTNVRTRQPV
EDGNALLAVILADATNLGLSRMAEASQGVTRDQLFWTRDAFIRDETYKAALGRIVDAHHALPIAAVWGEGTTAASDGQFFRSGKRGDGAGQVNARHGIEP
GYCFYTHTSDQHGPMRSVSMAAAEHEAPYVLDGLLHHGTGLTIAEHYTDTGGSSDHVHFLCDSLGIRFCPRLRDFPDRRLACLEPPSRYPALGGLLGKRV
KADLIRAHWNDIVRLVATLKAGVVAPSTMLKKLAAYERQNQLDLAIGEAGRLVRAEFMIDWMEGPALRRRSQAGLNKSEQRHTLASVVCTYGQGRIADRS
QEVQEYRASGLNLVIAAIVYWNSTYMADAVAHLRRSGDPTPDRLLAHTSPVEWEHIGFSGDFLWHRAAMMPASRRRLNLTKTQPAAA

 

Blast result :
Comments
ISMex38 is 93% (ORFA) and 74% (ORFB, the transposase) aa similar to ISAli20.
References
1] Stephane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources.(2009) PLoS ONE Submitted.