ISShdy3
- Family Tn3
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP026830 | ND | Shigella dysenteriae | Shigella dysenteriae ATCC 12039 |
DNA section
IS Length : 3867 bp
Ends
IR Length : 55
IRL : GGGGGTTGGGGAGCAATGGAACAGTAAACGCCGTTAAGAAGTCAATTTAA
IRR : GGGGTTTGGGGAGCAATGGAACAGTAAACGCCGTTAAGAGGTTAATTAAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGCGACCCAGACCTG | ATTA | GGTGCTGTGGGCGTAA | 4 |
0 | |||
0 |
DNA sequence
GGGGGTTGGGGAGCAATGGAACAGTAAACGCCGTTAAGAAGTCAATTTAAATAGCCAAAAATTAACTAGCCAATTTACATTGCAGGTTGTAATTTTTAAC
CACCTTTTTTACGATAGCCAAATTACCTATACTCATTTTGGCTACAGTAAATGAAAATTGCTCGCATCTATGAGAGAGTCAGTACATCAGAGCAGGATCT
GACACGCCAGGCTGACATCGAAAAAACGGCCATTGCCAGTGGTTTTTATATCGCCGGTATTTATCGAGAAAAAGCATCGGGTGCCAGGGCTGACAGACCT
GAACTTCTCCGCATGATCGCTGATCTTCAGCCTGGTGATGTGGTGATAGCCGAAAAAATTGACCGTATCAGTCGGCTCCCCCTTCCTGAAGCTGAAAAAC
TAATTGCTTCCATAAGAGACAAAGGTGCCCGGCTGGCTATTCCTGGCATAGTGGATTTGTCAGACGTGGTAGCAGAAAATGATGGTGTCTCCCGCATTGT
CCTTGAATCTGTTCAGGAGTTGTTGCTGAAACTTGCTCTTCAGACTGCACGAGATGACTACGAAATACGGCGAGAAAGACAGCGACAGGGGGTTCAGCTC
GCCAAAGCCGCAGGCAAGTACACCGGCAGGAAGGCCGATCTCGTGACTCATGAGCGTATCATTACCCTCCGGCAGTCCGGATTAACCATTGAGCGCACAG
CCACACTGGCCGGATGCAGCATCAGCCAGGTCAAACGCGTCTGGGCAATCCATCAGACGCAAAAAAATTCGTGATATATCCAGCTTAGGATGCTTTACGA
TGCCAGTTAATTTTCTTTCCGAAGATCAAAAAGCAAGTTACGGTAACCATCATGGCGAACTGACAAAGGAAACATTGGCGCGTTATTTTCATCTTGATGA
TTTTGACCGGATGAATATTTCAGAAAAACGCGGAGACCATAACCGGTTAGGCTATGCCGTATTATTATGTACAGTCCGCTACCTCGGTCGCTTCCCCGAT
TTAGCGACATCAATTCCGATAGCTGTCATTGATTTTTTGGCAGAGCAACTGCATATTGAGAATGGTAGTGAGCAGGTTAATTTATATAACTCAGGAAAAC
AGCGCCGACAACACATTGTTGAAATCACAAAAATATATGGTTACACAGAGTTTACTGACTCCCGAGTTGTTTTTTCACTGACCCGCTGGTTATATTCTTT
ATGCTGGACAGGAACAAGCCGACCAGGTATTCTTTTTGAAAGATGTACCGGTTGGCTTTTATCTCATAAAGTATTATTACCCGGATATTCGTTGCTCGAA
CGCTATATAGCCCGATTACGCAACAGAGTTGAAAACCGTCTCTGGCATTCACTGGCAGCTTGCATTGATGAGTCTCAGACTCAACAGCTTCTGGATCTTC
TTTCGGTTCCTGCCGGAAGCCGCTATTCGTTACTTGATCAATTACGTGCCGGACCTACTAAAGTGAATGCCACATCACTGGTACAGGCTATAGGTCGCCT
TCAGACGATACGTAGCCTGGGAGTGACATTACCTGCAATTACGCCAGTTTCGGATATCCGTATTGCGGCCATGGCACGTTATGCCTCAACGGCTAAGATA
ACAGCGCTTCAGCGCTTACCAGAAAAAAGAAAACTGGCTACGCTTGTTGCATTCTCTTGCTGCATGGAGGCGACAGCTCAGGATGATGCTCTTGAATTAC
TTGAAGCGCTACTCCGGGATCTGTTTAATGAAGCGGTTCAAGCAGATAAACGTAACCGCCAACGAACACTTAAAGATCTGGATCGTGCAGCAGAAATACT
GGCAAAAGCATGTCGGATGCTTCTTGACGATAAATTATCTGATACTGATGTTCGTGATAGTATCTTCAATATCATTCCCGAAGATGTACTGACTCATGCA
GTAAATAATGTGACATCAATTATTCGACCAGATAATAACGTTTACTTCAATGAACTTGATTCTAAATTCAAAACGGTGCGGCGTTTCTTACCTGACTTAC
TTTCCAGAATACATTTTGAGGGCAATGCCTCAGCCAAAACATTAATTGAAGCGCTTTGCTGGATTGAAGTTAATTTAAAAAAGAAAAAAACAGATAACGA
TGCACCACGCGAAATAATCAATAAACCGTGGCAACAGCATGTGATAAGAAAAGATGGTAGTATTGATTTCCACGCCTATACATTTTGTGCACTTAAGGAA
CTTCAGCTTGCACTGAAAAAACGGGATATTTATGTCAATCCCAGCTGGCGTTATGCAGATCCCCGGGCAGGACTCATCGAAGGTAAGGAATGGGAAGCTC
TGCATTCAATAATCTGCCGCTCTCTTGGATTATCATCAACACCTGGTGCCACTCTGTCAGCAATCGCAACAGAACTGGATTCAACATATCGTGATGTACT
GAACCGACTACCGGAAAATCCGGCTGTCAGATTTGCGGAAAATGGCGATAAAACTGAGTTGATCCTGAGTCCTCTGGATGCCGTTGAAGAAACACCATCA
CTGATCGCACTACGGCAGCGGGTGGCCAATATGTTGCCACGTGTTGATTTACCTGAACTATTACTGGAGATCGATGCCCGGACACATTTCACTGATGCCT
TTACCCATGCATCAGAACAAAACTCCCGCGTCTCGGACCTGAATATCAGCATTTGTGCCATGCTGATGGCAGAAGCCTGTAACACCGGTCCTGAGCCTTT
TATACGTAATGATGTCGCAGCACTGAAACGTGACCGTCTGACCTGGACAGACAGTAATTACATCAGGGATGAAACTATTCGGGCAGCCAACGCTATTCTG
GTTGCAGCACAAAGAGAAGTTCCCCTAGCGAGTCTCTGGGGTAGTGGTGAGGTCGCGTCAGCCGATGGTATGCGATTTGTTGTTCCTGTGCATACAGTAC
ACACCGGCCCCAACCCTAAATACTTCAAGGAAGGACGTGGAGTTACATGGTACAACCTGATATCTGACCAGTATTCGGGGATAAACGACATCGTTGTACC
TGGAACGCTCAAAGATAGTCTGGTCATCCTGGCTGTTATTCTGGAGCAACAAACGGATCAAATTCCATACCAGATAATGACCGATACGGGGGCATACAGC
GATGTTATTTTCGGGCTTTTCCGCCTGCTGGGATATCGTTTTTGTCCAAGGCTTGCGGATATGGGGGGAGCCAGATTTTGGCGCATCGATCCTAAAGCTG
ATTACGGTCCGTTTAACGCAATATCTTCTCATCGCCTTAATTTTGGGAAAAAAACGGAGCCACACTGGGATGATATTCTGAGGCTGATAGCCTCCCTCAA
ACTTGGACGACTGAACGTAATGTCCATAATGAAAACACTGCAAACAGGTGACAGGCCAACCAGTCTAGCGCAGGCTATAGCCGAAATAGGACGCGCCGAT
AAAACTATCCATATGCTGACTTACCTCGATGATGAAAACAAACGACGGAGAACGTTACAACAACTCAACCGTGGAGAGGGGCACCATGCAGTGGCCAGAA
ATGTCTTTCATGGTAAGCGAGGAGAACTGAGACAGGCTTACCGTGAAGGCCAAGAAGATCAACTTGGAGCGTTAGGTCTGGTGCTCAATATTATTGTTCT
GTGGAACACTATTTACATGGATGCAGCAATTCAGCAGCTCAGGCGTGAGGGGTATCCGGTCATGGATTCAGACGTTGAAAAACTGTCACCATTGCAGTGC
GGGCATATTAATATGCAGGGGCGCTATTCATTTACAGTGCCGGAATCGGTCAGTAAGGGTGAGCTGAGAGCGTTCAATGAATAGATGTAATTTACTGATT
AATAAATAATTTGCTATTTTAATTAACCTCTTAACGGCGTTTACTGTTCCATTGCTCCCCAAACCCC
CACCTTTTTTACGATAGCCAAATTACCTATACTCATTTTGGCTACAGTAAATGAAAATTGCTCGCATCTATGAGAGAGTCAGTACATCAGAGCAGGATCT
GACACGCCAGGCTGACATCGAAAAAACGGCCATTGCCAGTGGTTTTTATATCGCCGGTATTTATCGAGAAAAAGCATCGGGTGCCAGGGCTGACAGACCT
GAACTTCTCCGCATGATCGCTGATCTTCAGCCTGGTGATGTGGTGATAGCCGAAAAAATTGACCGTATCAGTCGGCTCCCCCTTCCTGAAGCTGAAAAAC
TAATTGCTTCCATAAGAGACAAAGGTGCCCGGCTGGCTATTCCTGGCATAGTGGATTTGTCAGACGTGGTAGCAGAAAATGATGGTGTCTCCCGCATTGT
CCTTGAATCTGTTCAGGAGTTGTTGCTGAAACTTGCTCTTCAGACTGCACGAGATGACTACGAAATACGGCGAGAAAGACAGCGACAGGGGGTTCAGCTC
GCCAAAGCCGCAGGCAAGTACACCGGCAGGAAGGCCGATCTCGTGACTCATGAGCGTATCATTACCCTCCGGCAGTCCGGATTAACCATTGAGCGCACAG
CCACACTGGCCGGATGCAGCATCAGCCAGGTCAAACGCGTCTGGGCAATCCATCAGACGCAAAAAAATTCGTGATATATCCAGCTTAGGATGCTTTACGA
TGCCAGTTAATTTTCTTTCCGAAGATCAAAAAGCAAGTTACGGTAACCATCATGGCGAACTGACAAAGGAAACATTGGCGCGTTATTTTCATCTTGATGA
TTTTGACCGGATGAATATTTCAGAAAAACGCGGAGACCATAACCGGTTAGGCTATGCCGTATTATTATGTACAGTCCGCTACCTCGGTCGCTTCCCCGAT
TTAGCGACATCAATTCCGATAGCTGTCATTGATTTTTTGGCAGAGCAACTGCATATTGAGAATGGTAGTGAGCAGGTTAATTTATATAACTCAGGAAAAC
AGCGCCGACAACACATTGTTGAAATCACAAAAATATATGGTTACACAGAGTTTACTGACTCCCGAGTTGTTTTTTCACTGACCCGCTGGTTATATTCTTT
ATGCTGGACAGGAACAAGCCGACCAGGTATTCTTTTTGAAAGATGTACCGGTTGGCTTTTATCTCATAAAGTATTATTACCCGGATATTCGTTGCTCGAA
CGCTATATAGCCCGATTACGCAACAGAGTTGAAAACCGTCTCTGGCATTCACTGGCAGCTTGCATTGATGAGTCTCAGACTCAACAGCTTCTGGATCTTC
TTTCGGTTCCTGCCGGAAGCCGCTATTCGTTACTTGATCAATTACGTGCCGGACCTACTAAAGTGAATGCCACATCACTGGTACAGGCTATAGGTCGCCT
TCAGACGATACGTAGCCTGGGAGTGACATTACCTGCAATTACGCCAGTTTCGGATATCCGTATTGCGGCCATGGCACGTTATGCCTCAACGGCTAAGATA
ACAGCGCTTCAGCGCTTACCAGAAAAAAGAAAACTGGCTACGCTTGTTGCATTCTCTTGCTGCATGGAGGCGACAGCTCAGGATGATGCTCTTGAATTAC
TTGAAGCGCTACTCCGGGATCTGTTTAATGAAGCGGTTCAAGCAGATAAACGTAACCGCCAACGAACACTTAAAGATCTGGATCGTGCAGCAGAAATACT
GGCAAAAGCATGTCGGATGCTTCTTGACGATAAATTATCTGATACTGATGTTCGTGATAGTATCTTCAATATCATTCCCGAAGATGTACTGACTCATGCA
GTAAATAATGTGACATCAATTATTCGACCAGATAATAACGTTTACTTCAATGAACTTGATTCTAAATTCAAAACGGTGCGGCGTTTCTTACCTGACTTAC
TTTCCAGAATACATTTTGAGGGCAATGCCTCAGCCAAAACATTAATTGAAGCGCTTTGCTGGATTGAAGTTAATTTAAAAAAGAAAAAAACAGATAACGA
TGCACCACGCGAAATAATCAATAAACCGTGGCAACAGCATGTGATAAGAAAAGATGGTAGTATTGATTTCCACGCCTATACATTTTGTGCACTTAAGGAA
CTTCAGCTTGCACTGAAAAAACGGGATATTTATGTCAATCCCAGCTGGCGTTATGCAGATCCCCGGGCAGGACTCATCGAAGGTAAGGAATGGGAAGCTC
TGCATTCAATAATCTGCCGCTCTCTTGGATTATCATCAACACCTGGTGCCACTCTGTCAGCAATCGCAACAGAACTGGATTCAACATATCGTGATGTACT
GAACCGACTACCGGAAAATCCGGCTGTCAGATTTGCGGAAAATGGCGATAAAACTGAGTTGATCCTGAGTCCTCTGGATGCCGTTGAAGAAACACCATCA
CTGATCGCACTACGGCAGCGGGTGGCCAATATGTTGCCACGTGTTGATTTACCTGAACTATTACTGGAGATCGATGCCCGGACACATTTCACTGATGCCT
TTACCCATGCATCAGAACAAAACTCCCGCGTCTCGGACCTGAATATCAGCATTTGTGCCATGCTGATGGCAGAAGCCTGTAACACCGGTCCTGAGCCTTT
TATACGTAATGATGTCGCAGCACTGAAACGTGACCGTCTGACCTGGACAGACAGTAATTACATCAGGGATGAAACTATTCGGGCAGCCAACGCTATTCTG
GTTGCAGCACAAAGAGAAGTTCCCCTAGCGAGTCTCTGGGGTAGTGGTGAGGTCGCGTCAGCCGATGGTATGCGATTTGTTGTTCCTGTGCATACAGTAC
ACACCGGCCCCAACCCTAAATACTTCAAGGAAGGACGTGGAGTTACATGGTACAACCTGATATCTGACCAGTATTCGGGGATAAACGACATCGTTGTACC
TGGAACGCTCAAAGATAGTCTGGTCATCCTGGCTGTTATTCTGGAGCAACAAACGGATCAAATTCCATACCAGATAATGACCGATACGGGGGCATACAGC
GATGTTATTTTCGGGCTTTTCCGCCTGCTGGGATATCGTTTTTGTCCAAGGCTTGCGGATATGGGGGGAGCCAGATTTTGGCGCATCGATCCTAAAGCTG
ATTACGGTCCGTTTAACGCAATATCTTCTCATCGCCTTAATTTTGGGAAAAAAACGGAGCCACACTGGGATGATATTCTGAGGCTGATAGCCTCCCTCAA
ACTTGGACGACTGAACGTAATGTCCATAATGAAAACACTGCAAACAGGTGACAGGCCAACCAGTCTAGCGCAGGCTATAGCCGAAATAGGACGCGCCGAT
AAAACTATCCATATGCTGACTTACCTCGATGATGAAAACAAACGACGGAGAACGTTACAACAACTCAACCGTGGAGAGGGGCACCATGCAGTGGCCAGAA
ATGTCTTTCATGGTAAGCGAGGAGAACTGAGACAGGCTTACCGTGAAGGCCAAGAAGATCAACTTGGAGCGTTAGGTCTGGTGCTCAATATTATTGTTCT
GTGGAACACTATTTACATGGATGCAGCAATTCAGCAGCTCAGGCGTGAGGGGTATCCGGTCATGGATTCAGACGTTGAAAAACTGTCACCATTGCAGTGC
GGGCATATTAATATGCAGGGGCGCTATTCATTTACAGTGCCGGAATCGGTCAGTAAGGGTGAGCTGAGAGCGTTCAATGAATAGATGTAATTTACTGATT
AATAAATAATTTGCTATTTTAATTAACCTCTTAACGGCGTTTACTGTTCCATTGCTCCCCAAACCCC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
624 bp | 207 aa | 151 | 774 | + | No |
AG : Tn3 resolvase
ORF sequence :
MKIARIYERVSTSEQDLTRQADIEKTAIASGFYIAGIYREKASGARADRPELLRMIADLQPGDVVIAEKIDRISRLPLPEAEKLIASIRDKGARLAIPGI
VDLSDVVAENDGVSRIVLESVQELLLKLALQTARDDYEIRRERQRQGVQLAKAAGKYTGRKADLVTHERIITLRQSGLTIERTATLAGCSISQVKRVWAI
HQTQKNS
VDLSDVVAENDGVSRIVLESVQELLLKLALQTARDDYEIRRERQRQGVQLAKAAGKYTGRKADLVTHERIITLRQSGLTIERTATLAGCSISQVKRVWAI
HQTQKNS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
2985 bp | 994 aa | 800 | 3784 | + | No |
Chemistry : DDE
ORF sequence :
MPVNFLSEDQKASYGNHHGELTKETLARYFHLDDFDRMNISEKRGDHNRLGYAVLLCTVRYLGRFPDLATSIPIAVIDFLAEQLHIENGSEQVNLYNSGK
QRRQHIVEITKIYGYTEFTDSRVVFSLTRWLYSLCWTGTSRPGILFERCTGWLLSHKVLLPGYSLLERYIARLRNRVENRLWHSLAACIDESQTQQLLDL
LSVPAGSRYSLLDQLRAGPTKVNATSLVQAIGRLQTIRSLGVTLPAITPVSDIRIAAMARYASTAKITALQRLPEKRKLATLVAFSCCMEATAQDDALEL
LEALLRDLFNEAVQADKRNRQRTLKDLDRAAEILAKACRMLLDDKLSDTDVRDSIFNIIPEDVLTHAVNNVTSIIRPDNNVYFNELDSKFKTVRRFLPDL
LSRIHFEGNASAKTLIEALCWIEVNLKKKKTDNDAPREIINKPWQQHVIRKDGSIDFHAYTFCALKELQLALKKRDIYVNPSWRYADPRAGLIEGKEWEA
LHSIICRSLGLSSTPGATLSAIATELDSTYRDVLNRLPENPAVRFAENGDKTELILSPLDAVEETPSLIALRQRVANMLPRVDLPELLLEIDARTHFTDA
FTHASEQNSRVSDLNISICAMLMAEACNTGPEPFIRNDVAALKRDRLTWTDSNYIRDETIRAANAILVAAQREVPLASLWGSGEVASADGMRFVVPVHTV
HTGPNPKYFKEGRGVTWYNLISDQYSGINDIVVPGTLKDSLVILAVILEQQTDQIPYQIMTDTGAYSDVIFGLFRLLGYRFCPRLADMGGARFWRIDPKA
DYGPFNAISSHRLNFGKKTEPHWDDILRLIASLKLGRLNVMSIMKTLQTGDRPTSLAQAIAEIGRADKTIHMLTYLDDENKRRRTLQQLNRGEGHHAVAR
NVFHGKRGELRQAYREGQEDQLGALGLVLNIIVLWNTIYMDAAIQQLRREGYPVMDSDVEKLSPLQCGHINMQGRYSFTVPESVSKGELRAFNE
QRRQHIVEITKIYGYTEFTDSRVVFSLTRWLYSLCWTGTSRPGILFERCTGWLLSHKVLLPGYSLLERYIARLRNRVENRLWHSLAACIDESQTQQLLDL
LSVPAGSRYSLLDQLRAGPTKVNATSLVQAIGRLQTIRSLGVTLPAITPVSDIRIAAMARYASTAKITALQRLPEKRKLATLVAFSCCMEATAQDDALEL
LEALLRDLFNEAVQADKRNRQRTLKDLDRAAEILAKACRMLLDDKLSDTDVRDSIFNIIPEDVLTHAVNNVTSIIRPDNNVYFNELDSKFKTVRRFLPDL
LSRIHFEGNASAKTLIEALCWIEVNLKKKKTDNDAPREIINKPWQQHVIRKDGSIDFHAYTFCALKELQLALKKRDIYVNPSWRYADPRAGLIEGKEWEA
LHSIICRSLGLSSTPGATLSAIATELDSTYRDVLNRLPENPAVRFAENGDKTELILSPLDAVEETPSLIALRQRVANMLPRVDLPELLLEIDARTHFTDA
FTHASEQNSRVSDLNISICAMLMAEACNTGPEPFIRNDVAALKRDRLTWTDSNYIRDETIRAANAILVAAQREVPLASLWGSGEVASADGMRFVVPVHTV
HTGPNPKYFKEGRGVTWYNLISDQYSGINDIVVPGTLKDSLVILAVILEQQTDQIPYQIMTDTGAYSDVIFGLFRLLGYRFCPRLADMGGARFWRIDPKA
DYGPFNAISSHRLNFGKKTEPHWDDILRLIASLKLGRLNVMSIMKTLQTGDRPTSLAQAIAEIGRADKTIHMLTYLDDENKRRRTLQQLNRGEGHHAVAR
NVFHGKRGELRQAYREGQEDQLGALGLVLNIIVLWNTIYMDAAIQQLRREGYPVMDSDVEKLSPLQCGHINMQGRYSFTVPESVSKGELRAFNE
Blast result :
Comments
ISShdy3 is 75% aa similar to ISXc4.
The sequence of ISShdy3 was reconstituted from sequences carried by various invasion plasmids, mostly the plasmid harbored by the S. dysenteriae strain ATCC 12039 (serotype 10). In the plasmid of this strain, ISShdy3 is inserted with an ISSfl11, with duplication of a 4-bp target; however, it is now interrupted at position 3207 by insertion of an IS2 and two As are missing after position 1626, as suggested by comparison with sequence carried by homologous plasmids. Similar IS are carried by plasmids in, e.g., Salmonella enterica subsp. enterica serovars Poona and Birkenhead and Serratia fonticola.
The sequence of ISShdy3 was reconstituted from sequences carried by various invasion plasmids, mostly the plasmid harbored by the S. dysenteriae strain ATCC 12039 (serotype 10). In the plasmid of this strain, ISShdy3 is inserted with an ISSfl11, with duplication of a 4-bp target; however, it is now interrupted at position 3207 by insertion of an IS2 and two As are missing after position 1626, as suggested by comparison with sequence carried by homologous plasmids. Similar IS are carried by plasmids in, e.g., Salmonella enterica subsp. enterica serovars Poona and Birkenhead and Serratia fonticola.
References
1] Claude Parsot (2021) Direct submission.
2] Kim,J., Lindsey,R.L., Garcia-Toledo,L., Loparev,V.N., Rowe,L.A., Batra,D., Juieng,P., Stoneburg,D., Martin,H., Knipe,K., Smith,P. and Strockbine,N. (2018) Genome Announc 6 (15), e00282-18.
2] Kim,J., Lindsey,R.L., Garcia-Toledo,L., Loparev,V.N., Rowe,L.A., Batra,D., Juieng,P., Stoneburg,D., Martin,H., Knipe,K., Smith,P. and Strockbine,N. (2018) Genome Announc 6 (15), e00282-18.