ISMdi18

  • Family IS1595
  • Group ISNwi1
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
ND Methylobacterium dichloromethanicum
Methylobacterium dichloromethanicum DM4
DNA section
IS Length : 3479 bp

Ends


IR Length : 23/24

IRL : GGCGATTATATCGTTGACACATACGGCAACCGCAGGTAGGTTTTAGCTCT
IRR : GGCGATTATATCGTTCACACATACACCTTGACCGACCACGACCGTTGGGC

Insertion site


Left flankDirect repeatRight flankDR Length
GAGGGCCGCTTAATTTTTAGAGGCGGAT8

DNA sequence

GGCGATTATATCGTTGACACATACGGCAACCGCAGGTAGGTTTTAGCTCTAGGAGCTAAACCGATGTCCGCCCTTTCCGCTGCATTCTTCCACGACGAGG
CCGCCGCTTTCGCCAAGCTGGAAAGCCTTCTGTGGCCGGAAGGCCCGGTCTGCCCTCACTGCGGCGGCATGGGCCGGATCACGAACGTAAAGGGCGGTCG
CATGGGCCTGCGCCGCTGCGGCGACTGCAAGAAGCAGTTCACGGTGACGGTCGGCACGGTTTTCGAGTCCAGCCATGTGAAGCTGCACCTGTGGCTCCAG
GCTGCGCACCTTCTTGCGTCCAGCAAGAAGGGCTTCTCCGCGCACCAACTGCACCGGACCCTCGGCGTCACCTACAAGACCGCGTGGTTCATGTTCCATC
GCCTGCGTGAGGCGATGCGCGACGGCGCGCTTGCCCCGATGGGCGGTCTGGACGGCCCGGCCGGTGGCGCTGGCATCGTGGAAGCCGACGAGACGTTCAT
CGGCCGCGAGCCCGGCAAGCCGAAGAAGCGCGCCTATCACCACAAGATGAAGGTGCTGAGCCTTGTGGATCGCGACACGAAGCAGGTCCGCTCCGTCGTC
GTGGATGATCTCAAGCCCGACACGGTGAAGCCGATCCTCGCCGCGAACATCGCCAAGGAAGCCAGCCTCTTCACCGACGAGGCGGGCCACTACGTGAAGC
TCGGCAAGGACTTCATTGAGCATCAGATCACGTCTCACGGCAAAGGCGAGTACGTGCGCGGCATCGTCCATACGAACACCGTCGAAGGCTACTTCTCCGT
CTTCAAGCGCGGCATGAAGGGTGTCTATCAGCACTGCGGCAAGCAGCACTTACACCGCTACCTCGCGGAGTTCGATTTCCGGTACAACAACCGGGTGAAG
CTCGGTGTGGGCGATGTCGAGCGGGCCGAGCGGGCGCTTCGGGGCGTCGTTGGGAAGCGCCTTACGTACCAAACAACTCGTTGACCGGAGACGGCGTTAA
CCCCATATTCCCCCCTGTCCCCGAGCGCGAAAACCTGGGAGATATCCTGGACTGCCGCAGTAGCGGCTAGGCTCGGGGGCCGCGCTTTAAGTTGAACTTC
GGGCGCACATTCCAAACAGATAATCGGTGTTGCGAAATTGAAGTGCTGGCCTCTAAGCTATGCAGCTTCGTTTTTTCGAACGCATAGATCGCTAACTTGC
GTTTCATATCGCTCGTTTCTGGCAAAAGGTGCCTACCGTTTCGATATGCGGTCAATGCACCTAGCGTTACATCTAAGAACTGAAGAAATGGCGTTTGCTT
AGAGTCTCTGGGTTCAATTGTTCGGACACAATTAGGCGGATGATTGAACTCGCTTACAGCGCCTTCGTTCAGCTTTTGCTTATAGCCCGGCAATAGAGAC
GTGCAGTTGCCATTATCGGGGCGAATGTGGATGTCATACTCCGCGCCGTAGTACCTAACTGCGCGGTGCAGCATTAATTGATAATGGGCCTTACTAACAG
TGTCTACAAGCTTGCGCTCTCCACTGATACTATGATCATACTTGCCCATCTCTTGAAATCGAATGTGGAAGTTGACGATATTTGTCTCTATTAGCTTAAA
AAGCAAATCAATATAAGCAATATGGGCGCTATCTCGGCGTTTTTTGGCTGTAGCCCATTTGATCTCTGAAGTAATTTCATAATTGTAATTAATTTTCTGA
ATATCTGATAATATATAATCAGCCATTCCCAAATCGATCGCAAGACCGCCGATGCCCATAAACGCATCCTTCTGGCTTGTCTCGTCACAATAATATAAGA
TGCGTGGACGCGGCTCTTGCAAAGTGGGGTCTAGCAATGCTGGAAAAGACGCGTACGGAGGCGTACCGCCATGGGCTCAGGAAAATAATCGAGAAAAGCC
CCAAGGTGCAATCAGGAAATGCGCGGGCACCCGCAGGGGCGTGTGGCGACAAGCAAGCGGCGGAAACCCCTAGCACACAAATCGCGCGGAGATCTGACGC
TTCTCGCGAACGCTGACGCGAAGGAACATGCTGCGCGGCATGTATCAACTCCGAAGCGGAGGGCCTGGAGCCTCTTGGTTGATGGCGCCGCTTAGGCATC
TTCCGTGGACGGCGGCGGAAGCGCCTCGGCTGGCTTCACTCCACGTGCCAGCCGGTCAAGGAACAGGGGCGCGCTCGCACCTAGATAAAATGATGTGAGT
GCGTTGGGCGCCTCGGCTATGGCAGGGATAGTGCCGCTAGCCAGTGCTAAGAGCGCCCGCACACAAAGGTAGAATGGCTGTTTGTAGCGCTCAGGGCAGC
GCCCATTCAGATCGCTGCACTCCTTGAACGCCGCGCCTAGTTCGACAGCGACACTTCCAATGACGCCGAGCCAGTAGGGAGATAGCTCGCCTACGAACGG
AAGGAGAGCCACTTAGGCGTGCTTAGAACGCTCGCGCTGACTCCGAGTGCTCCCGCCCAGCCGCAGATAGGAAGCGAGAGGGCTGAGACGATGCACGACG
ATGCGCGCCTGGGTGGGCCGGTGCGACAGCCGCAAGGCGGCAACCACGACGTTGGCGGCAAAAAGCAGTACGCCCACGCCGACGAGGGCTGTGAACACAA
TTTGCGTCACGAGTTCCATCCTGTCCCTCCAGCTTGGCTGTAAGCGGAGGATGATGCTTTTTGTCTTTCTAGGATGTGAATAACTGGCACTGCCCCCTTG
CGGCGTCAATGCCGCGTGCCGCCGGATTTTTGCAGGCCGATGTGATGAGGCGTACGGTTGTCCGTTTTTGAAAGGCCGAACTGATGCCGACAGACACGCC
GACCAAGCCCAAACCTTCCAAGCTCGGCGCCGAGAAGCTGACGCCAGAGGAACAGAAGCGTCGCTTCATTGAAGCCGCGCGGGAAGCGGGTGTGAGCGAG
AACGAAGCGGACTTCGATGCGGCGTTGAAGCAGATCGCGAAGCCGAAGCAGCCCAACTCCAGCCAAGGGTAGAGTGTGGCACTCTCACGCTAGCCCACGT
CGCGCAGTCGCGTTAGCAGCGCCGTAGAAAGGGGCTGCGAAATGTTAACGAACTCGATCATCGGCATGATACGCCGCCGCTGGCGGACCGAGCGTCCTTT
CCAAGGGCACCCGAAGGAAAGCTTGCTGCGGAAGATCCAGAACGATGCGGTTCGCGAGGACGTCAAGGAGCACGCGCGTCGCGAGCTACAGCTTCGCAAA
GATCAGCGCCTAATCGGCTGAGCTGAGCACTGCCGCCATTTGTCGATGCAGGTCGTCGGCAACGGACTGGGGAAGCAGCACGACCGCCTCGGTTCCGTCC
GCCTTCTCCAAGCGAACGCCCAACGTCCGGCCGCCGTCGATCAACTCGACGCTGATAACCCGACCAACCGCAAACGCGTCGGGGGCGTCATCACTCAACA
TGCACTGAACTCCACGAAGGGCGAGACTCGCCCAACGGTCGTGGTCGGTCAAGGTGTATGTGTGAACGATATAATCGCC
Protein section
ORF number : 5

 

ORF 1
LengthBeginEndStrandFusion ORF
783 bp260 aa202984+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MGLRRCGDCKKQFTVTVGTVFESSHVKLHLWLQAAHLLASSKKGFSAHQLHRTLGVTYKTAWFMFHRLREAMRDGALAPMGGLDGPAGGAGIVEADETFI
GREPGKPKKRAYHHKMKVLSLVDRDTKQVRSVVVDDLKPDTVKPILAANIAKEASLFTDEAGHYVKLGKDFIEHQITSHGKGEYVRGIVHTNTVEGYFSV
FKRGMKGVYQHCGKQHLHRYLAEFDFRYNNRVKLGVGDVERAERALRGVVGKRLTYQTTR

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
540 bp179 aa16061067-No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

LLFKLIETNIVNFHIRFQEMGKYDHSISGERKLVDTVSKAHYQLMLHRAVRYYGAEYDIHIRPDNGNCTSLLPGYKQKLNEGAVSEFNHPPNCVRTIEPR
DSKQTPFLQFLDVTLGALTAYRNGRHLLPETSDMKRKLAIYAFEKTKLHSLEASTSISQHRLSVWNVRPKFNLKRGPRA

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
186 bp61 aa25982413-No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

VFTALVGVGVLLFAANVVVAALRLSHRPTQARIVVHRLSPLASYLRLGGSTRSQRERSKHA

 

Blast result :
ORF 4
LengthBeginEndStrandFusion ORF
189 bp62 aa27842972+No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

MPTDTPTKPKPSKLGAEKLTPEEQKRRFIEAAREAGVSENEADFDAALKQIAKPKQPNSSQG

 

Blast result :
ORF 5
LengthBeginEndStrandFusion ORF
240 bp80 aa32403479+No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

LSMQVVGNGLGKQHDRLGSVRLLQANAQRPAAVDQLDADNPTNRKRVGGVITQHALNSTKGETRPTVVVGQGVCVNDIIA

 

Blast result :
Comments
ISMdi18 is 98% aa similar to ISMpo2.
The first ORF is the transposase, the others are passengers genes annotated as hypothetical protein.
References
1] Stephane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE Submitted.