ISSso1
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Shigella sonnei | Shigella sonnei 53G |
DNA section
IS Length : 1443 bp
Ends
IR Length : 13
IRL : aaatgtaATGAACGCATCCCACACAATTAAACACGGAGCAGTATGATGGT
IRR : ----tatATGAACGCATCCCGTTTGCAAGGCATCCATTGTTTTGACATGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
AAATGTAATGAACGCATCCCACACAATTAAACACGGAGCAGTATGATGGTTAAAAGGGTGAAACTGGCAGGTGTGGAGCCTCGCAGATTTCCGGCATCAT
AGTGCCCAATACGGGATGCATAAAGCTCCCTCAGACTGGAGACTCCACAAATGAAATATACACCGGTTGGCGTTGATATCGCAAAACATGTCATTCAGAT
TCACTTCATCAATGAGCACACAGGTGAAGTGGTTGATAAACAGTTGCGTAGACAGGATTTTCTGACGTTCTTCGGCAACCGTGAGCCATGCCTGATTGGT
ATGGAGGCCTGTGGAGGTTCTCAGCACTGGGCACGGGAACTGACAAAACTTGGTCATAAAGTCCGGTTGTTGCAGGCCCGCTTCGTTAAGGCATTCGTCA
TGGGCAATAAGAATGATGTGATGGATGCCCGGGCTATCTGGATGGCGGTTCAGCAGCCGGGTAAAGAAATCGCCGTAAAAACAGAAGAACAGCAGTCGGT
ACTGGTTCTGCACCGTACCCGCATGCAACTGGTGAAGTTCCGGACCGCACAAATTAATGCCCTGCACGGGACGTTACTGGAGTTTGGTGAAACCATCCAC
AAAGGCCGGGCAGCGATGGAGCGGGAGTTCCCCGAAGCACTGGAACGGATGAAAGAGAGACTGCCACCGTATCTCATTATGGTTCTGGAAAACCAGTACA
ACCGACTGAATGAGCTGGACTCACTGATAGAGGATATTGAAAAACAGCTTACCAGCGTGGCGAGGCAGAATGAAACCTGTAAGCGGTTGCTGGATATTCC
TGGCGTTGGACCACTTATTGCGACGGCAGCGGTGGCCACCATGGGGGAAGCATCAGCGTTTAAATCGGGGCGAGAGTTCGCCGCATATGTTGGTCTGGTT
CCAAAACAAACTGGCTCCGGAGGGAAAGTACGTCTGCTGGGGATAAGCAAACGTGGTGACACTTATCTCAGGACATTATTTATCCACGGTGCAAGAGCGG
TGGCATTAGTAGCTAAAGAGCCTGGCCCGTGGATAACCGAACTGAAAAAACGTCGTCCAGCCAGTGTGGCAATCGTCGCCATGGCAAACAAGCTGGCACG
AACAGTATGGGCGATAACCGCCCATGACCGTAAGTATGACAGGAACCACGTCAGTATCAGACCATATTAATCGCTGATACCATTAAACAATGAACTCTTA
ACAAAAGGGTGAATGCTGAAAGGTTGCTATGGCGGCCAGAGTGATGACAAAGACAGGTAAGACCGTGACTCACTAAACCTGAACAGTATTTTGGGCTTGA
AGTCCGCCGTGAAAATAAGGGGTGAGTCGGCGAATTACATAGGGGCTCGCAGCGTTACGGCTGCAATAAAGCCGGATATAAAGCTGCAACCTACCCGTCA
TGTCAAAACAATGGATGCCTTGCAAACGGGATGCGTTCATATA
AGTGCCCAATACGGGATGCATAAAGCTCCCTCAGACTGGAGACTCCACAAATGAAATATACACCGGTTGGCGTTGATATCGCAAAACATGTCATTCAGAT
TCACTTCATCAATGAGCACACAGGTGAAGTGGTTGATAAACAGTTGCGTAGACAGGATTTTCTGACGTTCTTCGGCAACCGTGAGCCATGCCTGATTGGT
ATGGAGGCCTGTGGAGGTTCTCAGCACTGGGCACGGGAACTGACAAAACTTGGTCATAAAGTCCGGTTGTTGCAGGCCCGCTTCGTTAAGGCATTCGTCA
TGGGCAATAAGAATGATGTGATGGATGCCCGGGCTATCTGGATGGCGGTTCAGCAGCCGGGTAAAGAAATCGCCGTAAAAACAGAAGAACAGCAGTCGGT
ACTGGTTCTGCACCGTACCCGCATGCAACTGGTGAAGTTCCGGACCGCACAAATTAATGCCCTGCACGGGACGTTACTGGAGTTTGGTGAAACCATCCAC
AAAGGCCGGGCAGCGATGGAGCGGGAGTTCCCCGAAGCACTGGAACGGATGAAAGAGAGACTGCCACCGTATCTCATTATGGTTCTGGAAAACCAGTACA
ACCGACTGAATGAGCTGGACTCACTGATAGAGGATATTGAAAAACAGCTTACCAGCGTGGCGAGGCAGAATGAAACCTGTAAGCGGTTGCTGGATATTCC
TGGCGTTGGACCACTTATTGCGACGGCAGCGGTGGCCACCATGGGGGAAGCATCAGCGTTTAAATCGGGGCGAGAGTTCGCCGCATATGTTGGTCTGGTT
CCAAAACAAACTGGCTCCGGAGGGAAAGTACGTCTGCTGGGGATAAGCAAACGTGGTGACACTTATCTCAGGACATTATTTATCCACGGTGCAAGAGCGG
TGGCATTAGTAGCTAAAGAGCCTGGCCCGTGGATAACCGAACTGAAAAAACGTCGTCCAGCCAGTGTGGCAATCGTCGCCATGGCAAACAAGCTGGCACG
AACAGTATGGGCGATAACCGCCCATGACCGTAAGTATGACAGGAACCACGTCAGTATCAGACCATATTAATCGCTGATACCATTAAACAATGAACTCTTA
ACAAAAGGGTGAATGCTGAAAGGTTGCTATGGCGGCCAGAGTGATGACAAAGACAGGTAAGACCGTGACTCACTAAACCTGAACAGTATTTTGGGCTTGA
AGTCCGCCGTGAAAATAAGGGGTGAGTCGGCGAATTACATAGGGGCTCGCAGCGTTACGGCTGCAATAAAGCCGGATATAAAGCTGCAACCTACCCGTCA
TGTCAAAACAATGGATGCCTTGCAAACGGGATGCGTTCATATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1020 bp | 339 aa | 151 | 1170 | + | No |
Chemistry : DEDD
ORF sequence :
MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNREPCLIGMEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAV
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIHKGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLVPKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
QQPGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIHKGRAAMEREFPEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQL
TSVARQNETCKRLLDIPGVGPLIATAAVATMGEASAFKSGREFAAYVGLVPKQTGSGGKVRLLGISKRGDTYLRTLFIHGARAVALVAKEPGPWITELKK
RRPASVAIVAMANKLARTVWAITAHDRKYDRNHVSIRPY
Blast result :
Comments
No uninterrupted target sequence was found, so the ends of the IS have been defined by analogy with other IS1111 family elements, assuming that 7 nt separate IRl from the left-hand end of the element and 4 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
The transposase protein is 39% identical to that of IS1111.
The transposase protein is 39% identical to that of IS1111.
References
1] The Welcome Trust Sanger Institute http://www.sanger.ac.uk
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384