ISShdy4

  • Family IS66
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number :
Transposition : ND
Origin : Shigella dysenteriae
Host : Shigella dysenteriae
DNA section
IS Length : 2506 bp

Ends


IR Length : 24

IRL : GTAAGCGTAAACTGACCGCCGTATGTAGCCATTAAGCCTGTATTGGTAAC
IRR : GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTGCTA
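
As an illustration that is not part of the original record, the following minimal Python sketch compares the two displayed ends over the annotated IR length of 24. It assumes that IRR is displayed as the reverse complement of the element's right end, which agrees with the last nucleotides of the DNA sequence given in this record. The helper names `reverse_complement` and `ir_matches` are hypothetical.

```python
# IRL and IRR as displayed in this record (50 nt each); annotated IR length is 24.
IRL = "GTAAGCGTAAACTGACCGCCGTATGTAGCCATTAAGCCTGTATTGGTAAC"
IRR = "GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTGCTA"
IR_LENGTH = 24


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def ir_matches(irl: str, irr: str, length: int) -> int:
    """Count identical positions between IRL and IRR over the first `length` nt."""
    return sum(a == b for a, b in zip(irl[:length], irr[:length]))


matches = ir_matches(IRL, IRR, IR_LENGTH)
print(f"{matches}/{IR_LENGTH} IR positions match")  # 20/24 IR positions match

# Assumption: IRR is shown reverse-complemented, so undoing that should give
# the right-hand end of the element as it appears in the DNA sequence.
print(reverse_complement(IRR)[-6:])  # GCTTAC
```

Under these assumptions the two repeats differ at positions 9, 13, 16 and 18 of the 24 compared.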

Insertion site


Left flank / Direct repeat / Right flank : CTCCGGTGGCGAACGTGAAGTCATACT
DR Length : 8

DNA sequence

GTAAGCGTAAACTGACCGCCGTATGTAGCCATTAAGCCTGTATTGGTAACGTAGGTGCCCACCTTTTCAACTAGTGGACACCTGTTATGGAACAGAAAGC
ATTATCTGCAGAACCCCGCAGATCATTTTCAAATGAGTTTAAACTTCAAATGGTTAAACTGGCTTCACAACCAGGAGCCTCTGTTGCCCGTATTGCCCGG
GAACACGATATCAATGATAACCTGCTGTTCAAATGGCTCAGGCTCTGGCAGAACGAAGGGCGCATATCGCGGCGTCTTCCGGTAACAACCTCTTCTGACA
CTGGCGTTGAATTATTACCTGTGGAGATAACGCCGGATGAGCCGAAAGAACCGGTGGCTGCTCTTACTCCGTCTTTATCTACTCAGACTACAGTTAGTGC
CAGCTCCTGCAAGGTGGAGTTCCGTCACGGTAACATGACGCTGGAAAATCCTTCACCAGAGCTGCTCACAGTGTTGATCCGTGAACTGACCGGGTGGGGA
CGATGATCTCACTCCCGTCAGGCACCCGCATCTGGCTCGTTGCTGGGATAACCGATATGCGTAAGTCTTTCAACGGGCTGGGTGAACAGGTACAGCATGT
GCTGAATGATAATCCCTTCTCCGGTCACCTGTTCATCTTCCGTGGCCGACGGGGTGACATGATTAAAATCCTGTGGGCTGATGCTGATGGTCTGTGCCTG
TTCACCAGACGCCTGGAGGAAGGCCAGTTTATCTGGCCTGCTGTGCGTGACGGCAAGGTATCCATTACCCGCTCGCAACTGGCAATGCTCCTCGATAAGC
TGGACTGGCGTCAGCCAAAAACATCCCGCCTTAACGCACTGACAATGTTGTAAAAATGTCATGGCCGGATTATAAAAACGGCCATGAATCAAAAATACCT
CATTCGCATTGCAGAACTGGAATGCCAGCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAGAGACGGAGGCCTTCCTGCGCTCTGCACTGGCCCGC
GCCGAAGAAAAGATCGAAGAAGATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGTACCCGTTCTGAAAAAC
TGCGTCGTGAAGTTGAACAGGCTGAGGCCCTGCTGAAACAACGCGAACAGGACAGTGATCGTTACAGTGGGCGGGAAGACGATCCGCAGGTTCCCCGCCA
GTTGCGACAGTCTCGTCATCGTCGCCCGTTACCGGAGCATCTGCCCCGCGAAATAAATCGCCTGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGT
GAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCTCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAA
AATGTGACTGCATCGTTGAAGCACCGGCACCATCCCGTCCGATAGAGCGTGGTATCGCGGGCCCGGGGTTACTTGCCCGCGTGTTAACGGGAAAATACTG
CGAACACCTGCCACTGTATCGTCAGAGTGAAATTTTTGCCCGTCAGGGTGTCGAACTGAGCCGTGCATTACTCTCCAACTGGGTTGACGCGTGCTGCCAG
TTAATGACGCCGCTGAATGATGCTCTGTACCGTTATGTGATGAACAGCCGCAAAGTTCACACTGATGACACACCAGTAAAAGTGCTGGCACCGGGCAGGA
AGAAGGCGAAAACAGGATATATCTGGACGTATGTCCGGGATGACAGGAATGCCGGTTCGCCAGAGCCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCA
TCAGGGTAAACATCCGGAGCAGCACCTTAGTCCCTTCCGGGGTATCCTGCAGGCAGATGCGTTTAATGGTTACGATCGGCTGTTCAGTGCCGAACGAGAA
GGCGGCGCGTTGACGGAAGCAGGATGCTGGGCTCATGCGCGGCGCAAAGTCCACGATGTATATATCAGTACCAAAAGCGCGACAGCGGAAGAAGCCCTGA
AACTAATCGGTGAGCTGTACGCCATCGAGCACGAAATACGCGGGTTGCCGGTGTCTGAACGCCTGGCGGTCAGGCAAATGCAGAGTAAACCGCTACTGAC
TTCCCTGTATAAGCTGATGCAGGAGAAAGAACAGACGTTATCGAAAAAATGCCGTCTGAGAGATGCGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTG
TGCAACTTCAGTGATGATGGTCTGGCTGAGGCGGATAATAATGCCGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGAAAAACTTTATGTTCTTCG
GCAGCGATCACGGTGGAGAGCGTGGTGCGCTACTGTACGGGCTGATCGGCACCTGCCGACTGAACGGTATCGATCCGGAAGCGTATCTGCGCTATATCCT
GAGCGTACTGCCGGAATGGCCTTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGCACTCACCAATAAATAAGCGTCAATACGGTGCTCCGTTGAC
GCTTAC
Protein section
ORF number : 3

 

ORF 1
Length : 420 bp, 135 aa
Begin : 87
End : 506
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MEQKALSAEPRRSFSNEFKLQMVKLASQPGASVARAREHDINDNLLFKWLRLWNEGRISRRLPVTTSSDTGVELLPVEITPDEPKEPVAALTPSLSTQTT
VASSCKVEFRHGNMTLENSPELLTVLIRELTGWGR

 

ORF 2
Length : 351 bp, 116 aa
Begin : 503
End : 853
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MISLPSGTRIWLVAGITDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDMIKILWADADGLCLFTRRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
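
As a sanity check on the Begin coordinate, which is not part of the original record, the sketch below translates the first eight codons of ORF 2. It assumes the standard genetic code (NCBI translation table 1) and 1-based coordinates on the DNA sequence shown in this record. The excerpt covers positions 503 to 526, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate complete codons of `dna` in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 503-526 of the DNA sequence in this record (start of ORF 2).
ORF2_START = "ATGATCTCACTCCCGTCAGGCACC"
print(translate(ORF2_START))  # MISLPSGT
```

The output agrees with the first eight residues of the ORF 2 sequence listed above.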

 

ORF 3
Length : 1593 bp, 530 aa
Begin : 884
End : 2476
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MNQKYLIRIAELECQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDD
PQVPRQLRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARV
LTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWF
AYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQ
SKPLLTSLYKLMQEKEQTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEA
YLRYILSVLPEWPSNRVDELLPWNVALTNK
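
The ORF coordinates listed in this section can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record, assuming 1-based inclusive Begin and End values. The `Orf` class and the `span` and `overlap` helpers are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int      # 1-based, inclusive
    end: int        # 1-based, inclusive
    length_bp: int  # length as listed in the record


# Values copied from the three ORF entries of this record.
ORFS = [
    Orf("ORF 1", 87, 506, 420),
    Orf("ORF 2", 503, 853, 351),
    Orf("ORF 3", 884, 2476, 1593),
]


def span(orf: Orf) -> int:
    """Number of nucleotides covered by an inclusive coordinate pair."""
    return orf.end - orf.begin + 1


def overlap(a: Orf, b: Orf) -> int:
    """Number of nucleotides shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


for orf in ORFS:
    print(orf.name, span(orf), span(orf) == orf.length_bp)

print(overlap(ORFS[0], ORFS[1]))  # 4
print(overlap(ORFS[1], ORFS[2]))  # 0
```

With these coordinates each listed length equals End minus Begin plus one, ORF 1 and ORF 2 share 4 nt, and ORF 2 and ORF 3 do not overlap.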

 

Comments
The ISShdy4 transposase shares 96% amino acid similarity with the ISEc43 transposase.
References
[1] Claude Parsot (2021) Direct submission.