ISEc86

  • Family IS3
  • Group IS150
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
CP039405 ND Escherichia coli
Escherichia coli 377323_2f plasmid unnamed
DNA section
IS Length : 1420 bp

Ends


IR Length : 19/25

IRL : TGAAGTGCGCCCCAAAGGTTAGACACCAGACCAAAAACTGAGGATGCATG
IRR : TGAACTGCGCCCCAGATCCTGGACAACTTATGCAGCCTGACTCAATCGAT

Insertion site


Left flankDirect repeatRight flankDR Length
ATTATCCAACAAAATGATGGAC2

DNA sequence

TGAAGTGCGCCCCAAAGGTTAGACACCAGACCAAAAACTGAGGATGCATGGAAAAGTTTTCATTGGAGTTCAAAAAGGTAGTATCAGAAAAGTACCTGGA
GGGAGAGCTGAGTCTGAAATCTGTAGCAAGGATGTACGGTATCAGTCCCTGCACAGTAAGAAAATGGTCTTATGCCTATCGGGAACATGGGATAAGCATA
CTTACGGGTAAAAAAGGACGCTATTCTGCTGAGTTAAAACTCACAGTAGTAAAAGAAGTCGTGGATGACTGTTTTTCTGTTCGTGAAGTAGCGGTAAAAC
ACGGAATACCAGCCTTTGGTACTGTATGCAACTGGCTTGAAAAATATAGAAAATATGGCGAAGATGCTTTCATTCGAAAAAATAAAAAGAGTATTCCTGT
GCCGGATAAAAGCGCCATTTCAGCACCACCACTACCAACATTAACCAAAGATGAGAGAGAAGAACTTGAACAACTCAGGGTCGAGAATACCTACTTAAAA
AAGCTGAAAGCCTTGGTTCAGCAGAAAACATGCTCAACGTTCAGGAAAAAGTGAGCATTGTGAATGAACTAAGGCAGGAATGGCCATTGTCCCGATTGTT
AATTGTTTCCGGGTTGCCCCGGAGTACATTTTACTATCATGTCAGACGGCTTGCGGCTCCTGACCGGTATCAGTCTGCCAGAGCGCTTCTGCTGAAAATT
TATCACCAGCACAAAGGCCGCTATGGTTACAGACGCATCAGGCTGGCATGTCGTAACGAGGGGGTTTTACTTAATGGTAAAACCGTCAGAAAACTGATGA
AGGAACTGGGTATTAGCAGCCTGATCCGCGTTAAAAAATACCGTTCATATAAAGGAGAACAAGGACGAATTTGCGATAATCTCCTGAAACGCAATTTTGA
TGCAAAACGCCCCAACGAGAAGTGGGTTACAGATGTCACTGAATTTAAAGTGAATGGCAAGAAGCTGTATTTGTCACCCATCATGGATCTTTATAATGGA
GAAATAATTTCCTATAACCTCGCTACTCGCCCTCAACCTTCAATGGTGCAGACTATGCTTACGGATGCGCTGAAACAGCTGTCGAAAGATGAACACCCCA
TACTGCACAGCGATCAAGGTTGGCAATATCAGATGTCCCGGTGGCAACGATGGTTAAAGGATAGCGGTATAGTCCAGAGTATGTCTCGCCGGGGAAATTG
TCTTGATAATGCAGTTATAGAGAGCTTTTTTGGAACATTGAAATCAGAATGTTATTACCTCAATGAATATAAAAATGTGGAGGACTTAAAGCGAGACATC
ATTGATTACATAAACTATTATAATCAATTGAGGATAAAGGAAAAACTTGGCGGTCTTAGCCCGGTACAATATCGATTGAGTCAGGCTGCATAAGTTGTCC
AGGATCTGGGGCGCAGTTCA
Recoding section
  • Recoding by frameshift
  • Frame -1
  • Type translational
  • Experimentally demonstrated No

Stimulators :

  • Shine-Dalgarno sequence :
  • Secondary structure :

Recoding motif :

Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
507 bp168 aa48554+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MEKFSLEFKKVVSEKYLEGELSLKSVARMYGISPCTVRKWSYAYREHGISILTGKKGRYSAELKLTVVKEVVDDCFSVREVAVKHGIPAFGTVCNWLEKY
RKYGEDAFIRKNKKSIPVPDKSAISAPPLPTLTKDEREELEQLRVENTYLKKLKALVQQKTCSTFRKK

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
924 bp307 aa4701393+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

TTQGREYLLKKAESLGSAENMLNVQEKVSIVNELRQEWPLSRLLIVSGLPRSTFYYHVRRLAAPDRYQSARALLLKIYHQHKGRYGYRRIRLACRNEGVL
LNGKTVRKLMKELGISSLIRVKKYRSYKGEQGRICDNLLKRNFDAKRPNEKWVTDVTEFKVNGKKLYLSPIMDLYNGEIISYNLATRPQPSMVQTMLTDA
LKQLSKDEHPILHSDQGWQYQMSRWQRWLKDSGIVQSMSRRGNCLDNAVIESFFGTLKSECYYLNEYKNVEDLKRDIIDYINYYNQLRIKEKLGGLSPVQ
YRLSQAA

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
1346 bp448 aa481393+Yes
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MEKFSLEFKKVVSEKYLEGELSLKSVARMYGISPCTVRKWSYAYREHGISILTGKKGRYSAELKLTVVKEVVDDCFSVREVAVKHGIPAFGTVCNWLEKY
RKYGEDAFIRKNKKSIPVPDKSAISAPPLPTLTKDEREELEQLRVENTYLKKAESLGSAENMLNVQEKVSIVNELRQEWPLSRLLIVSGLPRSTFYYHVR
RLAAPDRYQSARALLLKIYHQHKGRYGYRRIRLACRNEGVLLNGKTVRKLMKELGISSLIRVKKYRSYKGEQGRICDNLLKRNFDAKRPNEKWVTDVTEF
KVNGKKLYLSPIMDLYNGEIISYNLATRPQPSMVQTMLTDALKQLSKDEHPILHSDQGWQYQMSRWQRWLKDSGIVQSMSRRGNCLDNAVIESFFGTLKS
ECYYLNEYKNVEDLKRDIIDYINYYNQLRIKEKLGGLSPVQYRLSQAA

 

Blast result :
Comments
ISEc86 is 69% aa (transposase) similar to ISEcB1.
The third ORF is the putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
References
1] Dongsheng Zhou (2020) Direct submission.
2] Greig,D.R., Dallman,T.J., Gally,D.L. and Jenkins,C. (2019) Direct GenBank submission.