ISSso1

  • Family IS110
  • Group IS1111
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
ND Shigella sonnei
Shigella sonnei 53G
DNA section
IS Length : 1443 bp

Ends


IR Length : 13

IRL : aaatgtaATGAACGCATCCCACACAATTAAACACGGAGCAGTATGATGGT
IRR : ----tatATGAACGCATCCCGTTTGCAAGGCATCCATTGTTTTGACATGA

Insertion site


Left flankDirect repeatRight flankDR Length

DNA sequence

AAATGTAATGAACGCATCCCACACAATTAAACACGGAGCAGTATGATGGTTAAAAGGGTGAAACTGGCAGGTGTGGAGCCTCGCAGATTTCCGGCATCAT
AGTGCCCAATACGGGATGCATAAAGCTCCCTCAGACTGGAGACTCCACAAATGAAATATACACCGGTTGGCGTTGATATCGCAAAACATGTCATTCAGAT
TCACTTCATCAATGAGCACACAGGTGAAGTGGTTGATAAACAGTTGCGTAGACAGGATTTTCTGACGTTCTTCGGCAACCGTGAGCCATGCCTGATTGGT
ATGGAGGCCTGTGGAGGTTCTCAGCACTGGGCACGGGAACTGACAAAACTTGGTCATAAAGTCCGGTTGTTGCAGGCCCGCTTCGTTAAGGCATTCGTCA
TGGGCAATAAGAATGATGTGATGGATGCCCGGGCTATCTGGATGGCGGTTCAGCAGCCGGGTAAAGAAATCGCCGTAAAAACAGAAGAACAGCAGTCGGT
ACTGGTTCTGCACCGTACCCGCATGCAACTGGTGAAGTTCCGGACCGCACAAATTAATGCCCTGCACGGGACGTTACTGGAGTTTGGTGAAACCATCCAC
AAAGGCCGGGCAGCGATGGAGCGGGAGTTCCCCGAAGCACTGGAACGGATGAAAGAGAGACTGCCACCGTATCTCATTATGGTTCTGGAAAACCAGTACA
ACCGACTGAATGAGCTGGACTCACTGATAGAGGATATTGAAAAACAGCTTACCAGCGTGGCGAGGCAGAATGAAACCTGTAAGCGGTTGCTGGATATTCC
TGGCGTTGGACCACTTATTGCGACGGCAGCGGTGGCCACCATGGGGGAAGCATCAGCGTTTAAATCGGGGCGAGAGTTCGCCGCATATGTTGGTCTGGTT
CCAAAACAAACTGGCTCCGGAGGGAAAGTACGTCTGCTGGGGATAAGCAAACGTGGTGACACTTATCTCAGGACATTATTTATCCACGGTGCAAGAGCGG
TGGCATTAGTAGCTAAAGAGCCTGGCCCGTGGATAACCGAACTGAAAAAACGTCGTCCAGCCAGTGTGGCAATCGTCGCCATGGCAAACAAGCTGGCACG
AACAGTATGGGCGATAACCGCCCATGACCGTAAGTATGACAGGAACCACGTCAGTATCAGACCATATTAATCGCTGATACCATTAAACAATGAACTCTTA
ACAAAAGGGTGAATGCTGAAAGGTTGCTATGGCGGCCAGAGTGATGACAAAGACAGGTAAGACCGTGACTCACTAAACCTGAACAGTATTTTGGGCTTGA
AGTCCGCCGTGAAAATAAGGGGTGAGTCGGCGAATTACATAGGGGCTCGCAGCGTTACGGCTGCAATAAAGCCGGATATAAAGCTGCAACCTACCCGTCA
TGTCAAAACAATGGATGCCTTGCAAACGGGATGCGTTCATATA
Protein section
ORF number : 1

 

ORF 1
LengthBeginEndStrandFusion ORF
1020 bp339 aa1511170+No
ORF function : Transposase
Chemistry : DEDD

ORF sequence :

MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIGMEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIHKGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLVPKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY

 

Blast result :
Comments
No uninterrupted target sequence was found, so the ends of the IS have been defined by analogy with other IS1111 family elements, assuming that 7 nt separate IRl from the left-hand end of the element and 4 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.

The transposase protein is 39% identical to that of IS1111.
References
1] The Welcome Trust Sanger Institute http://www.sanger.ac.uk
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384