ISShdy3

  • Family Tn3
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
CP026830 ND Shigella dysenteriae
Shigella dysenteriae ATCC 12039
DNA section
IS Length : 3867 bp

Ends


IR Length : 55

IRL : GGGGGTTGGGGAGCAATGGAACAGTAAACGCCGTTAAGAAGTCAATTTAA
IRR : GGGGTTTGGGGAGCAATGGAACAGTAAACGCCGTTAAGAGGTTAATTAAA

Insertion site


Left flankDirect repeatRight flankDR Length
TGCGACCCAGACCTGATTAGGTGCTGTGGGCGTAA4
0
0

DNA sequence

GGGGGTTGGGGAGCAATGGAACAGTAAACGCCGTTAAGAAGTCAATTTAAATAGCCAAAAATTAACTAGCCAATTTACATTGCAGGTTGTAATTTTTAAC
CACCTTTTTTACGATAGCCAAATTACCTATACTCATTTTGGCTACAGTAAATGAAAATTGCTCGCATCTATGAGAGAGTCAGTACATCAGAGCAGGATCT
GACACGCCAGGCTGACATCGAAAAAACGGCCATTGCCAGTGGTTTTTATATCGCCGGTATTTATCGAGAAAAAGCATCGGGTGCCAGGGCTGACAGACCT
GAACTTCTCCGCATGATCGCTGATCTTCAGCCTGGTGATGTGGTGATAGCCGAAAAAATTGACCGTATCAGTCGGCTCCCCCTTCCTGAAGCTGAAAAAC
TAATTGCTTCCATAAGAGACAAAGGTGCCCGGCTGGCTATTCCTGGCATAGTGGATTTGTCAGACGTGGTAGCAGAAAATGATGGTGTCTCCCGCATTGT
CCTTGAATCTGTTCAGGAGTTGTTGCTGAAACTTGCTCTTCAGACTGCACGAGATGACTACGAAATACGGCGAGAAAGACAGCGACAGGGGGTTCAGCTC
GCCAAAGCCGCAGGCAAGTACACCGGCAGGAAGGCCGATCTCGTGACTCATGAGCGTATCATTACCCTCCGGCAGTCCGGATTAACCATTGAGCGCACAG
CCACACTGGCCGGATGCAGCATCAGCCAGGTCAAACGCGTCTGGGCAATCCATCAGACGCAAAAAAATTCGTGATATATCCAGCTTAGGATGCTTTACGA
TGCCAGTTAATTTTCTTTCCGAAGATCAAAAAGCAAGTTACGGTAACCATCATGGCGAACTGACAAAGGAAACATTGGCGCGTTATTTTCATCTTGATGA
TTTTGACCGGATGAATATTTCAGAAAAACGCGGAGACCATAACCGGTTAGGCTATGCCGTATTATTATGTACAGTCCGCTACCTCGGTCGCTTCCCCGAT
TTAGCGACATCAATTCCGATAGCTGTCATTGATTTTTTGGCAGAGCAACTGCATATTGAGAATGGTAGTGAGCAGGTTAATTTATATAACTCAGGAAAAC
AGCGCCGACAACACATTGTTGAAATCACAAAAATATATGGTTACACAGAGTTTACTGACTCCCGAGTTGTTTTTTCACTGACCCGCTGGTTATATTCTTT
ATGCTGGACAGGAACAAGCCGACCAGGTATTCTTTTTGAAAGATGTACCGGTTGGCTTTTATCTCATAAAGTATTATTACCCGGATATTCGTTGCTCGAA
CGCTATATAGCCCGATTACGCAACAGAGTTGAAAACCGTCTCTGGCATTCACTGGCAGCTTGCATTGATGAGTCTCAGACTCAACAGCTTCTGGATCTTC
TTTCGGTTCCTGCCGGAAGCCGCTATTCGTTACTTGATCAATTACGTGCCGGACCTACTAAAGTGAATGCCACATCACTGGTACAGGCTATAGGTCGCCT
TCAGACGATACGTAGCCTGGGAGTGACATTACCTGCAATTACGCCAGTTTCGGATATCCGTATTGCGGCCATGGCACGTTATGCCTCAACGGCTAAGATA
ACAGCGCTTCAGCGCTTACCAGAAAAAAGAAAACTGGCTACGCTTGTTGCATTCTCTTGCTGCATGGAGGCGACAGCTCAGGATGATGCTCTTGAATTAC
TTGAAGCGCTACTCCGGGATCTGTTTAATGAAGCGGTTCAAGCAGATAAACGTAACCGCCAACGAACACTTAAAGATCTGGATCGTGCAGCAGAAATACT
GGCAAAAGCATGTCGGATGCTTCTTGACGATAAATTATCTGATACTGATGTTCGTGATAGTATCTTCAATATCATTCCCGAAGATGTACTGACTCATGCA
GTAAATAATGTGACATCAATTATTCGACCAGATAATAACGTTTACTTCAATGAACTTGATTCTAAATTCAAAACGGTGCGGCGTTTCTTACCTGACTTAC
TTTCCAGAATACATTTTGAGGGCAATGCCTCAGCCAAAACATTAATTGAAGCGCTTTGCTGGATTGAAGTTAATTTAAAAAAGAAAAAAACAGATAACGA
TGCACCACGCGAAATAATCAATAAACCGTGGCAACAGCATGTGATAAGAAAAGATGGTAGTATTGATTTCCACGCCTATACATTTTGTGCACTTAAGGAA
CTTCAGCTTGCACTGAAAAAACGGGATATTTATGTCAATCCCAGCTGGCGTTATGCAGATCCCCGGGCAGGACTCATCGAAGGTAAGGAATGGGAAGCTC
TGCATTCAATAATCTGCCGCTCTCTTGGATTATCATCAACACCTGGTGCCACTCTGTCAGCAATCGCAACAGAACTGGATTCAACATATCGTGATGTACT
GAACCGACTACCGGAAAATCCGGCTGTCAGATTTGCGGAAAATGGCGATAAAACTGAGTTGATCCTGAGTCCTCTGGATGCCGTTGAAGAAACACCATCA
CTGATCGCACTACGGCAGCGGGTGGCCAATATGTTGCCACGTGTTGATTTACCTGAACTATTACTGGAGATCGATGCCCGGACACATTTCACTGATGCCT
TTACCCATGCATCAGAACAAAACTCCCGCGTCTCGGACCTGAATATCAGCATTTGTGCCATGCTGATGGCAGAAGCCTGTAACACCGGTCCTGAGCCTTT
TATACGTAATGATGTCGCAGCACTGAAACGTGACCGTCTGACCTGGACAGACAGTAATTACATCAGGGATGAAACTATTCGGGCAGCCAACGCTATTCTG
GTTGCAGCACAAAGAGAAGTTCCCCTAGCGAGTCTCTGGGGTAGTGGTGAGGTCGCGTCAGCCGATGGTATGCGATTTGTTGTTCCTGTGCATACAGTAC
ACACCGGCCCCAACCCTAAATACTTCAAGGAAGGACGTGGAGTTACATGGTACAACCTGATATCTGACCAGTATTCGGGGATAAACGACATCGTTGTACC
TGGAACGCTCAAAGATAGTCTGGTCATCCTGGCTGTTATTCTGGAGCAACAAACGGATCAAATTCCATACCAGATAATGACCGATACGGGGGCATACAGC
GATGTTATTTTCGGGCTTTTCCGCCTGCTGGGATATCGTTTTTGTCCAAGGCTTGCGGATATGGGGGGAGCCAGATTTTGGCGCATCGATCCTAAAGCTG
ATTACGGTCCGTTTAACGCAATATCTTCTCATCGCCTTAATTTTGGGAAAAAAACGGAGCCACACTGGGATGATATTCTGAGGCTGATAGCCTCCCTCAA
ACTTGGACGACTGAACGTAATGTCCATAATGAAAACACTGCAAACAGGTGACAGGCCAACCAGTCTAGCGCAGGCTATAGCCGAAATAGGACGCGCCGAT
AAAACTATCCATATGCTGACTTACCTCGATGATGAAAACAAACGACGGAGAACGTTACAACAACTCAACCGTGGAGAGGGGCACCATGCAGTGGCCAGAA
ATGTCTTTCATGGTAAGCGAGGAGAACTGAGACAGGCTTACCGTGAAGGCCAAGAAGATCAACTTGGAGCGTTAGGTCTGGTGCTCAATATTATTGTTCT
GTGGAACACTATTTACATGGATGCAGCAATTCAGCAGCTCAGGCGTGAGGGGTATCCGGTCATGGATTCAGACGTTGAAAAACTGTCACCATTGCAGTGC
GGGCATATTAATATGCAGGGGCGCTATTCATTTACAGTGCCGGAATCGGTCAGTAAGGGTGAGCTGAGAGCGTTCAATGAATAGATGTAATTTACTGATT
AATAAATAATTTGCTATTTTAATTAACCTCTTAACGGCGTTTACTGTTCCATTGCTCCCCAAACCCC
Protein section
ORF number : 2

 

ORF 1
LengthBeginEndStrandFusion ORF
624 bp207 aa151774+No
ORF function : Accessory Gene
AG : Tn3 resolvase

ORF sequence :

MKIARIYERVSTSEQDLTRQADIEKTAIASGFYIAGIYREKASGARADRPELLRMIADLQPGDVVIAEKIDRISRLPLPEAEKLIASIRDKGARLAIPGI
VDLSDVVAENDGVSRIVLESVQELLLKLALQTARDDYEIRRERQRQGVQLAKAAGKYTGRKADLVTHERIITLRQSGLTIERTATLAGCSISQVKRVWAI
HQTQKNS

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
2985 bp994 aa8003784+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MPVNFLSEDQKASYGNHHGELTKETLARYFHLDDFDRMNISEKRGDHNRLGYAVLLCTVRYLGRFPDLATSIPIAVIDFLAEQLHIENGSEQVNLYNSGK
QRRQHIVEITKIYGYTEFTDSRVVFSLTRWLYSLCWTGTSRPGILFERCTGWLLSHKVLLPGYSLLERYIARLRNRVENRLWHSLAACIDESQTQQLLDL
LSVPAGSRYSLLDQLRAGPTKVNATSLVQAIGRLQTIRSLGVTLPAITPVSDIRIAAMARYASTAKITALQRLPEKRKLATLVAFSCCMEATAQDDALEL
LEALLRDLFNEAVQADKRNRQRTLKDLDRAAEILAKACRMLLDDKLSDTDVRDSIFNIIPEDVLTHAVNNVTSIIRPDNNVYFNELDSKFKTVRRFLPDL
LSRIHFEGNASAKTLIEALCWIEVNLKKKKTDNDAPREIINKPWQQHVIRKDGSIDFHAYTFCALKELQLALKKRDIYVNPSWRYADPRAGLIEGKEWEA
LHSIICRSLGLSSTPGATLSAIATELDSTYRDVLNRLPENPAVRFAENGDKTELILSPLDAVEETPSLIALRQRVANMLPRVDLPELLLEIDARTHFTDA
FTHASEQNSRVSDLNISICAMLMAEACNTGPEPFIRNDVAALKRDRLTWTDSNYIRDETIRAANAILVAAQREVPLASLWGSGEVASADGMRFVVPVHTV
HTGPNPKYFKEGRGVTWYNLISDQYSGINDIVVPGTLKDSLVILAVILEQQTDQIPYQIMTDTGAYSDVIFGLFRLLGYRFCPRLADMGGARFWRIDPKA
DYGPFNAISSHRLNFGKKTEPHWDDILRLIASLKLGRLNVMSIMKTLQTGDRPTSLAQAIAEIGRADKTIHMLTYLDDENKRRRTLQQLNRGEGHHAVAR
NVFHGKRGELRQAYREGQEDQLGALGLVLNIIVLWNTIYMDAAIQQLRREGYPVMDSDVEKLSPLQCGHINMQGRYSFTVPESVSKGELRAFNE

 

Blast result :
Comments
ISShdy3 is 75% aa similar to ISXc4.
The sequence of ISShdy3 was reconstituted from sequences carried by various invasion plasmids, mostly the plasmid harbored by the S. dysenteriae strain ATCC 12039 (serotype 10). In the plasmid of this strain, ISShdy3 is inserted with an ISSfl11, with duplication of a 4-bp target; however, it is now interrupted at position 3207 by insertion of an IS2 and two As are missing after position 1626, as suggested by comparison with sequence carried by homologous plasmids. Similar IS are carried by plasmids in, e.g., Salmonella enterica subsp. enterica serovars Poona and Birkenhead and Serratia fonticola.
References
1] Claude Parsot (2021) Direct submission.
2] Kim,J., Lindsey,R.L., Garcia-Toledo,L., Loparev,V.N., Rowe,L.A., Batra,D., Juieng,P., Stoneburg,D., Martin,H., Knipe,K., Smith,P. and Strockbine,N. (2018) Genome Announc 6 (15), e00282-18.