ISShdy2

  • Family IS66
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number :
Transposition : ND
Origin : Shigella dysenteriae
Host : Shigella dysenteriae CDC 08-3330
DNA section
IS Length : 2496 bp

Ends


IR Length : 25/26

IRL : GTAAGCATCCGGTGAACTCGCATTGCCGGCCGTCGTGAACCACGCCATTC
IRR : GTAAGCATCCGGTGAACTCGCATGGCGCGATCACCGGCGTTATGGCAGGG
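
The IR length of 25/26 can be read as 25 identical positions over a 26-nt terminal inverted repeat. That reading is an assumption; the record does not define the notation. A minimal Python sketch that counts identities between the IRL and IRR strings listed above (the helper name `count_matches` and the 26-nt window are illustrative):

```python
# IRL and IRR copied from the record above.
IRL = "GTAAGCATCCGGTGAACTCGCATTGCCGGCCGTCGTGAACCACGCCATTC"
IRR = "GTAAGCATCCGGTGAACTCGCATGGCGCGATCACCGGCGTTATGGCAGGG"


def count_matches(a: str, b: str, window: int) -> int:
    """Count identical positions in the first `window` nucleotides of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# Assumption: "25/26" means 25 matches within the first 26 positions.
matches = count_matches(IRL, IRR, 26)
print(f"{matches}/26 matching positions")  # prints "25/26 matching positions"
```

Under this reading the single mismatch falls at position 24 (T in IRL, G in IRR).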

Insertion site


Left flank / Direct repeat / Right flank : GCTCGACACGGACTCATTGTCACCACCC
DR Length : 8
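
The insertion-site row gives the left flank, direct repeat and right flank as one 28-nt string with a DR length of 8. The record does not mark the boundaries, so the sketch below assumes the two flanks have equal length; the helper `split_site` is hypothetical and the resulting split should be treated as a guess, not as curated data.

```python
SITE = "GCTCGACACGGACTCATTGTCACCACCC"  # concatenated flank/DR/flank string from the record
DR_LENGTH = 8                          # DR length from the record


def split_site(site: str, dr_length: int) -> tuple[str, str, str]:
    """Split a site string into (left flank, DR, right flank).

    Assumption: both flanks have the same length.
    """
    flank, remainder = divmod(len(site) - dr_length, 2)
    if remainder:
        raise ValueError("flanks cannot be of equal length")
    return (
        site[:flank],
        site[flank:flank + dr_length],
        site[flank + dr_length:],
    )


left, dr, right = split_site(SITE, DR_LENGTH)
print(left, dr, right)  # prints "GCTCGACACG GACTCATT GTCACCACCC"
```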

DNA sequence

GTAAGCATCCGGTGAACTCGCATTGCCGGCCGTCGTGAACCACGCCATTCTGCAGGCTCTTCCTTTATATGCAAAAGTGAGAAACCTGATGGCAAAAAAA
ATGACCCCGGCGCAGCGACGCCAGCACTACGACGCCTGGCGCGTCAGCGGCATGTCCCGGGCCGCGTATGCCCGGCTACACGGCATCAACAACAAAACCT
TCTGGCATCTCTGCCGGGCCCTCTCCGCTGACGACGCCCGTGGCACCGCTCCGGCTGATAACAGACCGGCTGTCCTGCCGGTCACCCTCTCCGTCAGTGA
CACTGCCACCCTGAAACTCCAGCGCGCCTGCGTCACCTCCACCCCGGCCGGCATCGCCGCCATCATCCGGGAGCTTCACCTGTGCTGAACCCTCATGCCC
TCTGGCTCGTCCGCGAGCCTGCCGACATGCGCGCCGGCATCGACTCCCTGACGCGGCTCGCCACTCAGGCCGCCGGGCATCCGCCCCGGGAAGGTGAAGC
CTTCCTCTTCACCGGGAAAAAACGCACCCGCATGAAACTCCTGATGTGGGACCGACACGGCGTCTGGCTCTGCACCCGCCGCCTGCACCAGGGCGCCTTC
CGCTGGCCCCGCGACGGCGACACCACCTGGTCACTCACGGCGGAACAGTTCGCCTGGCTTACTGCCGGCATTGACTGGCTGCGTCTCTCTGCCGGCCCCC
TGCAGAGGTGGACTGAATAACCTCCTGAGCAAAAATCATAAATAATCATCCTGTTATCAGGATGGTGTTCCCCCTCATCTGTCATCATGGCTGCATGACA
GATGATATCCTGAACTCCACACAAAACCCCGATGAACTGCGCCGTATGGTAACAGCGCTGCTGACGGCACAGGCATGCGAATATGAGCAGCGCATTCATG
ACCTTAATGTCGCCATGCAGGCAGAAAAGTTGACGCTGGAGCAGCGCCTCCATGACCTTAATGCCGCCATGCAGGCAGAAAAAACCGTATATGAACAGCG
CATCCGCGAGCTGGAAGACGCCCTGAAGCTGGCACAGCAGTGGCGCTTCGGCCGCAAAAGTGAGCGCCTGCCGGCCAGCCAGAAACCACTGGCTGACGAG
GACGCGGCCAGCGATGAGGCTGATATCACCCGGCAACTGAGCGACCTGCTGCCGGAGAAGGAGAAAACCGGGAAGAAGCCGGCCCGCCAGCCCCTGCCGG
CACACCTGCCGCGCCAGGAAACCGTACTCATGCCTGAGACCGGCAGCACCTGTCCGGACTGCGGCAGTGAAATGCGACATATCCGCGACGAGGTGAATGA
GGTGCTGGAATATGTACCGGCACACTTCGTGGTGAAACGGACCGTGAGACCGCAGTACAGCTGCCCGTGCTGCGACACGGTGCACAGTGCCGTGCTGCCG
TCGGCAGTCATCGACAAAGGGCAGCCGGGCCCGGGTCTGCTGGCGCAGGTGGTGACCGCGAAGGTGCTGGAACACCTGCCACTGCAGCGGCAGCAGAAGA
TATACGCCCGTGAAGGGGTACAGCTGCCGGAAAGCACGCTGACGGACTGGTTCGGGCAGACGGCGGCGGTGCTGTCGCCGCTGGCGGCAGCCCTGAAACG
TGACCTGCTCAGGCAACCGGTGCTGCAGGCGGACGAGACGCCGCTGCAGATACTGGATACGCGGAAAGGGAAAGTCCGGAAAGGGTACCTGTGGGCATAC
GTGAGCGCGGCGGGCAGTGCCCGGGACATCGTGGTGTACGACTGCCGGCCGGGGCGTGCGGGGCAGTATGCGTGTGAGATGCTGAGCGGGTGGTCGGGGA
CTCTGGTGGCTGACGGTTATGCCGGTTACCGGGCGCTGTTCCGTGACGGGCAGGAAGGGGCCCCTCCGGTGGCCCCGGGTATCCGTGAGGCGGGGTGCAT
GGCGCACGTGCGCAGAAAGTTCATGGAGCTGTACAAAATGAACGGCAGTCCGGGGGCGAAGGAGGCGCTGAAACAGATACGGGCGCTGTATATCCTGGAG
CGGAGCATCCGGAACCGTCCGGCGGAGCAGAAACGGCGATGGCGGCGGCGGTACGCGAAGCCGCAGATGGAGGCGTTCCACAGCTGGCTGAGGGCGACGG
AAAAGACGAGCGCGCCGGGTGGCAGGCTGCACGGTGCGGTGAGGTATGCCCTGAAGCGTTGGCCGGCGCTGGAAACATACCTGAATGACGGACGGGTACC
GCTGGACAACAACCGGTGTGAGCAGATGATGCGTCCGGTGGCGCAGGGGCGGAAGTCATGGCTGTTCGCGGGTTCGCAGCGGGGAGGAGAGCGGCTGGCG
GAGCTGCTGACGCTGCTGCACACGGCGAGGCTGAACGGTCTGGAGCCAGTAGCCTGGCTGCGTGATGTGCTGGAGAAGTTGCCGTCATGGCCGGCGTCCC
GGCTGGATGAACTGCTGCCTTACCGCCGTCCGGCGGACTGAATACCCCCTGCCATAACGCCGGTGATCGCGCCATGCGAGTTCACCGGATGCTTAC
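
The Ends section and the DNA sequence can be cross-checked: IRL should equal the first 50 nt of the sequence, and IRR should equal the reverse complement of the last 50 nt. A minimal Python sketch, with both terminal segments copied by hand from the sequence above (the constant names and `reverse_complement` helper are illustrative):

```python
IRL = "GTAAGCATCCGGTGAACTCGCATTGCCGGCCGTCGTGAACCACGCCATTC"
IRR = "GTAAGCATCCGGTGAACTCGCATGGCGCGATCACCGGCGTTATGGCAGGG"

# First and last 50 nt of the 2496-bp sequence, copied from the record.
LEFT_END = "GTAAGCATCCGGTGAACTCGCATTGCCGGCCGTCGTGAACCACGCCATTC"
RIGHT_END = "CCCTGCCATAACGCCGGTGATCGCGCCATGCGAGTTCACCGGATGCTTAC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


irl_ok = LEFT_END == IRL
irr_ok = reverse_complement(RIGHT_END) == IRR
print(irl_ok, irr_ok)  # prints "True True"
```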
Protein section
ORF number : 3

 

ORF 1
Length : 300 bp, 99 aa
Begin : 89
End : 388
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MAKKMTPAQRRQHYDAWRVSGMSRAAYARLHGINNKTFWHLCRALSADDARGTAPADNRPAVLPVTLSVSDTATLKLQRACVTSTPAGIAAIIRELHLC

 

ORF 2
Length : 294 bp, 97 aa
Begin : 427
End : 720
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MRAGIDSLTRLATQAAGHPPREGEAFLFTGKKRTRMKLLMWDRHGVWLCTRRLHQGAFRWPRDGDTTWSLTAEQFAWLTAGIDWLRLSAGPLQRWTE

 

ORF 3
Length : 1680 bp, 559 aa
Begin : 762
End : 2441
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MVFPLICHHGCMTDDILNSTQNPDELRRMVTALLTAQACEYEQRIHDLNVAMQAEKLTLEQRLHDLNAAMQAEKTVYEQRIRELEDALKLAQQWRFGRKS
ERLPASQKPLADEDAASDEADITRQLSDLLPEKEKTGKKPARQPLPAHLPRQETVLMPETGSTCPDCGSEMRHIRDEVNEVLEYVPAHFVVKRTVRPQYS
CPCCDTVHSAVLPSAVIDKGQPGPGLLAQVVTAKVLEHLPLQRQQKIYAREGVQLPESTLTDWFGQTAAVLSPLAAALKRDLLRQPVLQADETPLQILDT
RKGKVRKGYLWAYVSAAGSARDIVVYDCRPGRAGQYACEMLSGWSGTLVADGYAGYRALFRDGQEGAPPVAPGIREAGCMAHVRRKFMELYKMNGSPGAK
EALKQIRALYILERSIRNRPAEQKRRWRRRYAKPQMEAFHSWLRATEKTSAPGGRLHGAVRYALKRWPALETYLNDGRVPLDNNRCEQMMRPVAQGRKSW
LFAGSQRGGERLAELLTLLHTARLNGLEPVAWLRDVLEKLPSWPASRLDELLPYRRPAD
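
The ORF coordinates can be checked for internal consistency. The sketch below assumes that Begin and End are 1-based inclusive positions on the IS sequence and that the length in bp includes the stop codon; neither convention is stated in the record. The tuple layout and helper names are illustrative.

```python
# (name, begin, end, length_bp, length_aa) as listed for each ORF in the record.
ORFS = [
    ("ORF 1", 89, 388, 300, 99),
    ("ORF 2", 427, 720, 294, 97),
    ("ORF 3", 762, 2441, 1680, 559),
]


def span_bp(begin: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return end - begin + 1


def coding_aa(length_bp: int) -> int:
    """Amino acids encoded, assuming the last codon is a stop codon."""
    return length_bp // 3 - 1


checks = {
    name: span_bp(begin, end) == bp and bp % 3 == 0 and coding_aa(bp) == aa
    for name, begin, end, bp, aa in ORFS
}
print(checks)  # prints "{'ORF 1': True, 'ORF 2': True, 'ORF 3': True}"
```

All three ORFs lie on the + strand and do not overlap (388 < 427 and 720 < 762).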

 

Comments
ISShdy2 shares 50% (ORFA), 67% (ORFB) and 63% (ORFC, the transposase) amino acid similarity with ISEc49.
This copy was found inserted into an IS10. The PacBio genome has not yet been deposited but will soon be available in the EBI-ENA database.