ISShdy4
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Shigella dysenteriae | Shigella dysenteriae |
DNA section
IS Length : 2506 bp
Ends
IR Length : 24
IRL : GTAAGCGTAAACTGACCGCCGTATGTAGCCATTAAGCCTGTATTGGTAAC
IRR : GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTGCTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTCCGGTGGC | GAACGTGA | AGTCATACT | 8 |
DNA sequence
GTAAGCGTAAACTGACCGCCGTATGTAGCCATTAAGCCTGTATTGGTAACGTAGGTGCCCACCTTTTCAACTAGTGGACACCTGTTATGGAACAGAAAGC
ATTATCTGCAGAACCCCGCAGATCATTTTCAAATGAGTTTAAACTTCAAATGGTTAAACTGGCTTCACAACCAGGAGCCTCTGTTGCCCGTATTGCCCGG
GAACACGATATCAATGATAACCTGCTGTTCAAATGGCTCAGGCTCTGGCAGAACGAAGGGCGCATATCGCGGCGTCTTCCGGTAACAACCTCTTCTGACA
CTGGCGTTGAATTATTACCTGTGGAGATAACGCCGGATGAGCCGAAAGAACCGGTGGCTGCTCTTACTCCGTCTTTATCTACTCAGACTACAGTTAGTGC
CAGCTCCTGCAAGGTGGAGTTCCGTCACGGTAACATGACGCTGGAAAATCCTTCACCAGAGCTGCTCACAGTGTTGATCCGTGAACTGACCGGGTGGGGA
CGATGATCTCACTCCCGTCAGGCACCCGCATCTGGCTCGTTGCTGGGATAACCGATATGCGTAAGTCTTTCAACGGGCTGGGTGAACAGGTACAGCATGT
GCTGAATGATAATCCCTTCTCCGGTCACCTGTTCATCTTCCGTGGCCGACGGGGTGACATGATTAAAATCCTGTGGGCTGATGCTGATGGTCTGTGCCTG
TTCACCAGACGCCTGGAGGAAGGCCAGTTTATCTGGCCTGCTGTGCGTGACGGCAAGGTATCCATTACCCGCTCGCAACTGGCAATGCTCCTCGATAAGC
TGGACTGGCGTCAGCCAAAAACATCCCGCCTTAACGCACTGACAATGTTGTAAAAATGTCATGGCCGGATTATAAAAACGGCCATGAATCAAAAATACCT
CATTCGCATTGCAGAACTGGAATGCCAGCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAGAGACGGAGGCCTTCCTGCGCTCTGCACTGGCCCGC
GCCGAAGAAAAGATCGAAGAAGATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGTACCCGTTCTGAAAAAC
TGCGTCGTGAAGTTGAACAGGCTGAGGCCCTGCTGAAACAACGCGAACAGGACAGTGATCGTTACAGTGGGCGGGAAGACGATCCGCAGGTTCCCCGCCA
GTTGCGACAGTCTCGTCATCGTCGCCCGTTACCGGAGCATCTGCCCCGCGAAATAAATCGCCTGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGT
GAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCTCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAA
AATGTGACTGCATCGTTGAAGCACCGGCACCATCCCGTCCGATAGAGCGTGGTATCGCGGGCCCGGGGTTACTTGCCCGCGTGTTAACGGGAAAATACTG
CGAACACCTGCCACTGTATCGTCAGAGTGAAATTTTTGCCCGTCAGGGTGTCGAACTGAGCCGTGCATTACTCTCCAACTGGGTTGACGCGTGCTGCCAG
TTAATGACGCCGCTGAATGATGCTCTGTACCGTTATGTGATGAACAGCCGCAAAGTTCACACTGATGACACACCAGTAAAAGTGCTGGCACCGGGCAGGA
AGAAGGCGAAAACAGGATATATCTGGACGTATGTCCGGGATGACAGGAATGCCGGTTCGCCAGAGCCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCA
TCAGGGTAAACATCCGGAGCAGCACCTTAGTCCCTTCCGGGGTATCCTGCAGGCAGATGCGTTTAATGGTTACGATCGGCTGTTCAGTGCCGAACGAGAA
GGCGGCGCGTTGACGGAAGCAGGATGCTGGGCTCATGCGCGGCGCAAAGTCCACGATGTATATATCAGTACCAAAAGCGCGACAGCGGAAGAAGCCCTGA
AACTAATCGGTGAGCTGTACGCCATCGAGCACGAAATACGCGGGTTGCCGGTGTCTGAACGCCTGGCGGTCAGGCAAATGCAGAGTAAACCGCTACTGAC
TTCCCTGTATAAGCTGATGCAGGAGAAAGAACAGACGTTATCGAAAAAATGCCGTCTGAGAGATGCGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTG
TGCAACTTCAGTGATGATGGTCTGGCTGAGGCGGATAATAATGCCGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGAAAAACTTTATGTTCTTCG
GCAGCGATCACGGTGGAGAGCGTGGTGCGCTACTGTACGGGCTGATCGGCACCTGCCGACTGAACGGTATCGATCCGGAAGCGTATCTGCGCTATATCCT
GAGCGTACTGCCGGAATGGCCTTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGCACTCACCAATAAATAAGCGTCAATACGGTGCTCCGTTGAC
GCTTAC
ATTATCTGCAGAACCCCGCAGATCATTTTCAAATGAGTTTAAACTTCAAATGGTTAAACTGGCTTCACAACCAGGAGCCTCTGTTGCCCGTATTGCCCGG
GAACACGATATCAATGATAACCTGCTGTTCAAATGGCTCAGGCTCTGGCAGAACGAAGGGCGCATATCGCGGCGTCTTCCGGTAACAACCTCTTCTGACA
CTGGCGTTGAATTATTACCTGTGGAGATAACGCCGGATGAGCCGAAAGAACCGGTGGCTGCTCTTACTCCGTCTTTATCTACTCAGACTACAGTTAGTGC
CAGCTCCTGCAAGGTGGAGTTCCGTCACGGTAACATGACGCTGGAAAATCCTTCACCAGAGCTGCTCACAGTGTTGATCCGTGAACTGACCGGGTGGGGA
CGATGATCTCACTCCCGTCAGGCACCCGCATCTGGCTCGTTGCTGGGATAACCGATATGCGTAAGTCTTTCAACGGGCTGGGTGAACAGGTACAGCATGT
GCTGAATGATAATCCCTTCTCCGGTCACCTGTTCATCTTCCGTGGCCGACGGGGTGACATGATTAAAATCCTGTGGGCTGATGCTGATGGTCTGTGCCTG
TTCACCAGACGCCTGGAGGAAGGCCAGTTTATCTGGCCTGCTGTGCGTGACGGCAAGGTATCCATTACCCGCTCGCAACTGGCAATGCTCCTCGATAAGC
TGGACTGGCGTCAGCCAAAAACATCCCGCCTTAACGCACTGACAATGTTGTAAAAATGTCATGGCCGGATTATAAAAACGGCCATGAATCAAAAATACCT
CATTCGCATTGCAGAACTGGAATGCCAGCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAGAGACGGAGGCCTTCCTGCGCTCTGCACTGGCCCGC
GCCGAAGAAAAGATCGAAGAAGATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGTACCCGTTCTGAAAAAC
TGCGTCGTGAAGTTGAACAGGCTGAGGCCCTGCTGAAACAACGCGAACAGGACAGTGATCGTTACAGTGGGCGGGAAGACGATCCGCAGGTTCCCCGCCA
GTTGCGACAGTCTCGTCATCGTCGCCCGTTACCGGAGCATCTGCCCCGCGAAATAAATCGCCTGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGT
GAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCTCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAA
AATGTGACTGCATCGTTGAAGCACCGGCACCATCCCGTCCGATAGAGCGTGGTATCGCGGGCCCGGGGTTACTTGCCCGCGTGTTAACGGGAAAATACTG
CGAACACCTGCCACTGTATCGTCAGAGTGAAATTTTTGCCCGTCAGGGTGTCGAACTGAGCCGTGCATTACTCTCCAACTGGGTTGACGCGTGCTGCCAG
TTAATGACGCCGCTGAATGATGCTCTGTACCGTTATGTGATGAACAGCCGCAAAGTTCACACTGATGACACACCAGTAAAAGTGCTGGCACCGGGCAGGA
AGAAGGCGAAAACAGGATATATCTGGACGTATGTCCGGGATGACAGGAATGCCGGTTCGCCAGAGCCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCA
TCAGGGTAAACATCCGGAGCAGCACCTTAGTCCCTTCCGGGGTATCCTGCAGGCAGATGCGTTTAATGGTTACGATCGGCTGTTCAGTGCCGAACGAGAA
GGCGGCGCGTTGACGGAAGCAGGATGCTGGGCTCATGCGCGGCGCAAAGTCCACGATGTATATATCAGTACCAAAAGCGCGACAGCGGAAGAAGCCCTGA
AACTAATCGGTGAGCTGTACGCCATCGAGCACGAAATACGCGGGTTGCCGGTGTCTGAACGCCTGGCGGTCAGGCAAATGCAGAGTAAACCGCTACTGAC
TTCCCTGTATAAGCTGATGCAGGAGAAAGAACAGACGTTATCGAAAAAATGCCGTCTGAGAGATGCGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTG
TGCAACTTCAGTGATGATGGTCTGGCTGAGGCGGATAATAATGCCGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGAAAAACTTTATGTTCTTCG
GCAGCGATCACGGTGGAGAGCGTGGTGCGCTACTGTACGGGCTGATCGGCACCTGCCGACTGAACGGTATCGATCCGGAAGCGTATCTGCGCTATATCCT
GAGCGTACTGCCGGAATGGCCTTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGCACTCACCAATAAATAAGCGTCAATACGGTGCTCCGTTGAC
GCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
420 bp | 135 aa | 87 | 506 | + | No |
AG : IS66 TnpA
ORF sequence :
MEQKALSAEPRRSFSNEFKLQMVKLASQPGASVARAREHDINDNLLFKWLRLWNEGRISRRLPVTTSSDTGVELLPVEITPDEPKEPVAALTPSLSTQTT
VASSCKVEFRHGNMTLENSPELLTVLIRELTGWGR
VASSCKVEFRHGNMTLENSPELLTVLIRELTGWGR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
351 bp | 116 aa | 503 | 853 | + | No |
AG : IS66 TnpB
ORF sequence :
MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
DWRQPKTSRLNALTML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1593 bp | 530 aa | 884 | 2476 | + | No |
Chemistry : DDE
ORF sequence :
MNQKYLIRIAELECQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDD
PQVPRQLRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARV
LTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWF
AYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQ
SKPLLTSLYKLMQEKEQTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEA
YLRYILSVLPEWPSNRVDELLPWNVALTNK
PQVPRQLRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARV
LTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWF
AYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQ
SKPLLTSLYKLMQEKEQTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEA
YLRYILSVLPEWPSNRVDELLPWNVALTNK
Blast result :
Comments
ISShdy4 is 96% aa (transposase) similar to ISEc43.
References
1] Claude Parsot (2021) Direct submission.