ISYen1
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AE005940 | ND | Yersinia enterocolitica | Yersinia enterocolitica 8081 |
DNA section
IS Length : 1376 bp
Ends
IR Length : 11
IRL : taatgaaATGGACGCTCCAACTTTTACGGCATCAGAGTGCCTAAATGTGA
IRR : ----tatATGGACGCTCCCAATAAACCAAGTGTTTTTTCTTGAAAATAGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
TAATGAAATGGACGCTCCAACTTTTACGGCATCAGAGTGCCTAAATGTGAGTGTTAAAACTCAGAAGAAGGAGCGTCCACATGAAAGTATCTACTCTTGG
TATCGACTTGGCAAAAAATGTTTTCCAGCTTCATGGTGTCGGCTGCAACGGTCAAACTGTTCTTAAGAAGAAACTCACCCGCGATAAATTCCTTCCTTTT
CTCATGCAACTTGAACCTTGCTTGATTGGCATGGAGGCCTGTGCTTCCAGTCATCATTTTGCGCGTGTTTTACGGCAGTATGGGCATGAGGTTAAACTCA
TTCCCCCTCAGTATGTGAAACCTTATGTCAAAACGAATAAGACAGATGCCGCTGATGCAGAAGCCATTTGTGAAGCTGTTGCACGGCCTAATATGCGTTT
TGTTCAGATTAAAACGGCAGAGCAACAAGCTATTCTGGTGCTGCATACCGAGCGAAACATCCTTATCCGCGAACGCACGGCTTGTGCTAATAGTATGCGG
GCCATTTTGGCTGAATTTGGCATCATCATGCCTCGCACATTAAGTCAGCTGTATAAGAAAATCCCTGAAATACTGGAAGAATATGATAACGAGTTATCAC
CTTTTGTCCGTTGTAGTGTCGCGCGTCAACTTGAACACCTTCAGGGTGTGGAAGATCAAATCACGTTGATCGAACAAGAACTTAGCAGTTGGGCAAAAAC
ACAACCCGCCTGCCAGCGGGTCTTGAAAGTCCCTGGTGTGGGATTGATGACGGCGACCTACCTTGTGGCGTCAGTGGGGAATGGGCAACAATTTCATTCA
GCGAAACAGTTTGCCGCCTGGTTGGGATTGGTGCCGAGAGAGTTCTCCAGTGGCGGTAAGCAGAGATTGGGCCGAATCAGCAAAAGAGGTGACCGCTATT
TCCGTTATCTTCTGGTCCATGGTGCGCGGGCAGTTGCAGCCGTTATTGAAAGACACAAAGACAATATGCCGTGGCTTTACAGGCTGTTGAGTAAAAAGGC
CTATAACGTGGCCGTTGTGGCACAGGCCAATAAAACGGCACGTATTTTGTGGTCGATGCTGGTTCAGCATACGGAATATCGAACCTTATCCGTGGTTTGA
GATAATCTCAAACACGGATAAGGCTCAAGAACAAAAGTGAATACGTTTCAGGTAATTGCAAGTGTAATTATTGAGATGGCAAAACAGGTAAGACCGCAAG
TGAGAAACTCCGTGAGCCGCTGGGGCATTTGGCCCGTAAGGATGATAGGAACTCACTTGGCGGATTTCATCATGGTTCGGGCGCAATCGCGCCCATAAAG
AAACCGGATATATGGCTGCAATACCTGAGCGCTATTTTCAAGAAAAAACACTTGGTTTATTGGGAGCGTCCATATA
TATCGACTTGGCAAAAAATGTTTTCCAGCTTCATGGTGTCGGCTGCAACGGTCAAACTGTTCTTAAGAAGAAACTCACCCGCGATAAATTCCTTCCTTTT
CTCATGCAACTTGAACCTTGCTTGATTGGCATGGAGGCCTGTGCTTCCAGTCATCATTTTGCGCGTGTTTTACGGCAGTATGGGCATGAGGTTAAACTCA
TTCCCCCTCAGTATGTGAAACCTTATGTCAAAACGAATAAGACAGATGCCGCTGATGCAGAAGCCATTTGTGAAGCTGTTGCACGGCCTAATATGCGTTT
TGTTCAGATTAAAACGGCAGAGCAACAAGCTATTCTGGTGCTGCATACCGAGCGAAACATCCTTATCCGCGAACGCACGGCTTGTGCTAATAGTATGCGG
GCCATTTTGGCTGAATTTGGCATCATCATGCCTCGCACATTAAGTCAGCTGTATAAGAAAATCCCTGAAATACTGGAAGAATATGATAACGAGTTATCAC
CTTTTGTCCGTTGTAGTGTCGCGCGTCAACTTGAACACCTTCAGGGTGTGGAAGATCAAATCACGTTGATCGAACAAGAACTTAGCAGTTGGGCAAAAAC
ACAACCCGCCTGCCAGCGGGTCTTGAAAGTCCCTGGTGTGGGATTGATGACGGCGACCTACCTTGTGGCGTCAGTGGGGAATGGGCAACAATTTCATTCA
GCGAAACAGTTTGCCGCCTGGTTGGGATTGGTGCCGAGAGAGTTCTCCAGTGGCGGTAAGCAGAGATTGGGCCGAATCAGCAAAAGAGGTGACCGCTATT
TCCGTTATCTTCTGGTCCATGGTGCGCGGGCAGTTGCAGCCGTTATTGAAAGACACAAAGACAATATGCCGTGGCTTTACAGGCTGTTGAGTAAAAAGGC
CTATAACGTGGCCGTTGTGGCACAGGCCAATAAAACGGCACGTATTTTGTGGTCGATGCTGGTTCAGCATACGGAATATCGAACCTTATCCGTGGTTTGA
GATAATCTCAAACACGGATAAGGCTCAAGAACAAAAGTGAATACGTTTCAGGTAATTGCAAGTGTAATTATTGAGATGGCAAAACAGGTAAGACCGCAAG
TGAGAAACTCCGTGAGCCGCTGGGGCATTTGGCCCGTAAGGATGATAGGAACTCACTTGGCGGATTTCATCATGGTTCGGGCGCAATCGCGCCCATAAAG
AAACCGGATATATGGCTGCAATACCTGAGCGCTATTTTCAAGAAAAAACACTTGGTTTATTGGGAGCGTCCATATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1020 bp | 339 aa | 81 | 1100 | + | No |
Chemistry : DEDD
ORF sequence :
MKVSTLGIDLAKNVFQLHGVGCNGQTVLKKKLTRDKFLPFLMQLEPCLIGMEACASSHHFARVLRQYGHEVKLIPPQYVKPYVKTNKTDAADAEAICEAV
ARPNMRFVQIKTAEQQAILVLHTERNILIRERTACANSMRAILAEFGIIMPRTLSQLYKKIPEILEEYDNELSPFVRCSVARQLEHLQGVEDQITLIEQE
LSSWAKTQPACQRVLKVPGVGLMTATYLVASVGNGQQFHSAKQFAAWLGLVPREFSSGGKQRLGRISKRGDRYFRYLLVHGARAVAAVIERHKDNMPWLY
RLLSKKAYNVAVVAQANKTARILWSMLVQHTEYRTLSVV
ARPNMRFVQIKTAEQQAILVLHTERNILIRERTACANSMRAILAEFGIIMPRTLSQLYKKIPEILEEYDNELSPFVRCSVARQLEHLQGVEDQITLIEQE
LSSWAKTQPACQRVLKVPGVGLMTATYLVASVGNGQQFHSAKQFAAWLGLVPREFSSGGKQRLGRISKRGDRYFRYLLVHGARAVAAVIERHKDNMPWLY
RLLSKKAYNVAVVAQANKTARILWSMLVQHTEYRTLSVV
Blast result :
Comments
The IR of this IS are not at its termini. In the IS sequence as given 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end.
There are 7 copies of this IS in the Yersinia enterocolitica 8081 genome and, unlike most IS1111 family elements, the first nt of the sequence shown (T) does not always match the 4th nt to the right of IRr (not shown in above sequence), which can be T, C or G.
The transposase is 69% identical to that of ISEch3 and 41% identical to that of IS1111.
By analogy with IS4321, ISYen1 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
There are 7 copies of this IS in the Yersinia enterocolitica 8081 genome and, unlike most IS1111 family elements, the first nt of the sequence shown (T) does not always match the 4th nt to the right of IRr (not shown in above sequence), which can be T, C or G.
The transposase is 69% identical to that of ISEch3 and 41% identical to that of IS1111.
By analogy with IS4321, ISYen1 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
References
1] The Welcome Trust Sanger Institute http://www.sanger.ac.uk
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384