IS5
- Family IS5
- Group IS5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
J01735 | Y | Escherichia coli | Neisseria gonorrhoeae plasmid pGF1; Escherichia coli K-12; Klebsiella pneumoniae; Citrobacter freundii ATCC8090; Shigella flexneri SF-9 |
DNA section
IS Length : 1195 bp
Ends
IR Length : 15/16
IRL : GGAAGGTGCGAACAAGTCCCTGATATGAGATCATGTTTGTCATCTGGAGC
IRR : GGAAGGTGCGAATAAGCGGGGAAATTCTTCTCGGCTGACTCAGTCATTTC
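The 15/16 figure can be checked against the two ends listed above. The following is a minimal Python sketch and not part of the original record. It assumes that IRR is written as the reverse complement of the element's right end, so that both ends read from the tip inward, and that the inverted repeat covers the first 16 positions. The helper names are illustrative only.

```python
# Terminal sequences copied from the Ends section of this record.
IRL = "GGAAGGTGCGAACAAGTCCCTGATATGAGATCATGTTTGTCATCTGGAGC"
IRR = "GGAAGGTGCGAATAAGCGGGGAAATTCTTCTCGGCTGACTCAGTCATTTC"
# Last 50 bp of the DNA sequence given in this record (top strand).
RIGHT_END = "GAAATGACTGAGTCAGCCGAGAAGAATTTCCCCGCTTATTCGCACCTTCC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a, b, span):
    """Count identical positions within the first `span` bases of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


# Assumption being tested: IRR is the right end read on the bottom strand.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR
# Identical positions within the assumed 16 bp inverted repeat.
ir_matches = count_matches(IRL, IRR, 16)

print(irr_matches_right_end, ir_matches)
```

Under these assumptions the two ends differ at a single position within the first 16 bases, which is consistent with the stated IR length of 15/16.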
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAACGCAGAA | CTAG | GACAACTCTT | 4 |
TTTACAACCA | CTAA | ACCCACAGTA | 4 |
NNNNCCAGGT | CTAA | TTCACGNNNN | 4 |
NNNNTGATTT | CAAG | AATCGGNNNN | 4 |
AAAGCTGGAG | CTAG | CTTGTTAGTG | 4 |
GAATATGCAA | CTAG | TCCGTTTGGG | 4 |
ACATCCGATA | CTAG | CATTTATCGT | 4 |
ACACTTAAAC | TTAG | TTTTCCTTTA | 4 |
CGCACCTTCC | CTAG | AGTTTATTTT | 4 |
GGGAATGGCC | CTAG | GGAAGGTGCG | 4 |
ATCCATCCCA | CTAG | GAGTTGGGTA | 4 |
CTGGTTTGGG | CTAG | AATGGCGCGC | 4 |
TTAAGGTGGT | CTAG | CCATTGTCAG | 4 |
TGAATATCGC | CTAG | ACATGGTTCC | 4 |
TACCTCAGTA | CTAG | TCTCGATGGT | 4 |
GTTATAAAAC | CTTT | GTTCATTTGA | 4 |
AAAAACAGTT | CTAG | TACAAGTTCT | 4 |
TAAATTTGTA | CTTA | ACCACTCATT | 4 |
CAGGTGATAA | CTAG | AACAGTTTTT | 4 |
AGTGTTTGTA | CTAG | CTACACTGAA | 4 |
ATAAGTTCAA | CTAG | AGGATGTTGA | 4 |
CAAGAAGGCC | TTAG | CAAATTATAT | 4 |
GCTGCCCTCG | CTAA | AAAAGCAATA | 4 |
CTGGATTTCG | TTAG | GTCCTCGGTA | 4 |
GAAGTCCTAT | | CTTGAGCAGA | 0 |
AATGTGTTAG | | CTAGCTTAAT | 0 |
GGCAACCAAG | | CTAACAGATA | 0 |
TCTCAGTTTG | | CTAGTGAACA | 0 |
TCAACACTAG | | CTATGTCACC | 0 |
AATGCGTAAG | | CTAGGAGGTA | 0 |
AGAGCGTTAG | | CTAGACAAAT | 0 |
GCCTGACTAG | | TTAGATAATG | 0 |
ACTCCTCTAG | | CTTATTCGCA | 0 |
CTCGGTCTGG | | CTAGACATGA | 0 |
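The Comments section of this record states that IS5 prefers 5'-CWAR-3' target sequences. As a rough illustration, the sketch below tallies the 4 bp direct repeats from the table above and checks each against that IUPAC motif (W = A or T, R = A or G). It is an illustrative Python sketch, not part of the original record. The list was transcribed by hand from the 24 rows with a DR length of 4, and the function and variable names are hypothetical.

```python
import re
from collections import Counter

# Direct repeats from the 24 table rows that report a 4 bp duplication.
DIRECT_REPEATS = [
    "CTAG", "CTAA", "CTAA", "CAAG", "CTAG", "CTAG", "CTAG", "TTAG",
    "CTAG", "CTAG", "CTAG", "CTAG", "CTAG", "CTAG", "CTAG", "CTTT",
    "CTAG", "CTTA", "CTAG", "CTAG", "CTAG", "TTAG", "CTAA", "TTAG",
]

# Only the IUPAC codes needed for the CWAR motif are listed here.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]", "R": "[AG]"}


def iupac_to_regex(motif):
    """Compile an IUPAC motif into a regular expression."""
    return re.compile("".join(IUPAC[ch] for ch in motif))


CWAR = iupac_to_regex("CWAR")
counts = Counter(DIRECT_REPEATS)
matching = [dr for dr in DIRECT_REPEATS if CWAR.fullmatch(dr)]

print(counts.most_common())
print(len(matching), "of", len(DIRECT_REPEATS), "direct repeats match CWAR")
```

In this tally CTAG is by far the most frequent duplication, and the repeats that do not fit the motif (TTAG, CTTT, CTTA) each differ from it at one position.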
DNA sequence
GGAAGGTGCGAACAAGTCCCTGATATGAGATCATGTTTGTCATCTGGAGCCATAGAACAGGGTTCATCATGAGTCATCAACTTACCTTCGCCGACAGTGA
ATTCAGCAGTAAGCGCCGTCAGACCAGAAAAGAGATTTTCTTGTCCCGCATGGAGCAGATTCTGCCATGGCAAAACATGGTGGAAGTCATCGAGCCGTTT
TACCCCAAGGCTGGTAATGGCCGGCGACCTTATCCGCTGGAAACCATGCTACGCATTCACTGCATGCAGCATTGGTACAACCTGAGCGATGGCGCGATGG
AAGATGCTCTGTACGAAATCGCCTCCATGCGTCTGTTTGCCCGGTTATCCCTGGATAGCGCCTTGCCGGACCGCACCACCATCATGAATTTCCGCCACCT
GCTGGAGCAGCATCAACTGGCCCGCCAATTGTTCAAGACCATCAATCGCTGGCTGGCCGAAGCAGGCGTCATGATGACTCAAGGCACCTTGGTCGATGCC
ACCATCATTGAGGCACCCAGCTCGACCAAGAACAAAGAGCAGCAACGCGATCCGGAGATGCATCAGACCAAGAAAGGCAATCAGTGGCACTTTGGCATGA
AGGCCCACATTGGTGTCGATGCCAAGAGTGGCCTGACCCACAGCCTGGTCACCACCGCGGCCAACGAGCATGACCTCAATCAGCTGGGTAATCTGCTGCA
TGGAGAGGAGCAATTTGTCTCAGCCGATGCCGGCTACCAAGGGGCGCCACAGCGCGAGGAGCTGGCCGAGGTGGATGTGGACTGGCTGATCGCCGAGCGC
CCCGGCAAGGTAAGAACCTTGAAACAGCATCCACGCAAGAACAAAACGGCCATCAACATCGAATACATGAAAGCCAGCATCCGGGCCAGGGTGGAGCACC
CATTTCGCATCATCAAGCGACAGTTCGGCTTCGTGAAAGCCAGATACAAGGGGTTGCTGAAAAACGATAACCAACTGGCGATGTTATTCACGCTGGCCAA
CCTGTTTCGGGCGGACCAAATGATACGTCAGTGGGAGAGATCTCACTAAAAACTGGGGATAACGCCTTAAATGGCGAAGAAACGGTCTAAATAGGCTGAT
TCAAGGCATTTACGGGAGAAAAAATCGGCTCAAACATGAAGAAATGAAATGACTGAGTCAGCCGAGAAGAATTTCCCCGCTTATTCGCACCTTCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
981 | 326 | 69 | 1049 | + | No |
Chemistry : DDE
ORF sequence :
MSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALP
DRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANE
HDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKND
NQLAMLFTLANLFRADQMIRQWERSH
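The ORF coordinates and the protein sequence can be cross-checked against the DNA sequence. The sketch below is an illustrative Python example, not part of the original record. It assumes 1-based inclusive coordinates, an End position that includes the stop codon, and the standard genetic code. It uses only the first 100 bp of the DNA sequence, and the names are hypothetical.

```python
# First 100 bp of the DNA sequence given in this record.
DNA_START = (
    "GGAAGGTGCGAACAAGTCCCTGATATGAGATCATGTTTGTCATCTGGAGC"
    "CATAGAACAGGGTTCATCATGAGTCATCAACTTACCTTCGCCGACAGTGA"
)
ORF_BEGIN, ORF_END = 69, 1049  # from the ORF table; assumed 1-based, inclusive

# Standard genetic code in the usual compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons of a DNA string; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


orf_length_bp = ORF_END - ORF_BEGIN + 1
# Assumption: the final codon is the stop codon and is not counted.
protein_length_aa = orf_length_bp // 3 - 1
# Translate the first 10 codons starting at the annotated Begin position.
n_terminus = translate(DNA_START[ORF_BEGIN - 1:ORF_BEGIN - 1 + 30])

print(orf_length_bp, protein_length_aa, n_terminus)
```

Under these assumptions the coordinates give 981 bp and 326 aa, and the translated N-terminus agrees with the start of the ORF sequence listed above (MSHQLTFADS...).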
Comments
IS5 was originally isolated as an insertion into the lambda KH100 cI gene (Blattner et al., 1974), and was also shown by hybridization to be present in Citrobacter freundii (Nisen et al., 1979) and Shigella flexneri (Shoner and Kahn, 1981). The IS5 copy number varies from 10 to 23 among Escherichia coli K-12 strains, and varies extensively among other E. coli strains (ECOR and Murray collections), many of which are devoid of IS5 or carry only a single copy (Green et al., 1984). Twenty-three IS5 copies, together with their orientations, have also been mapped on the E. coli W3110 chromosome (Umeda and Ohtsubo, 1990): most copies were scattered over the entire chromosome, with only 4 elements clustered. IS5 is thought to be polar in both orientations. An IS distribution study among 23 closely related O1:K1 pathogenic E. coli strains showed an average of 9 IS2 copies/genome, as compared to 41, 17, 4, and 3 copies/genome for IS1, IS4, IS5 and IS30, respectively (Lawrence et al., 1989). IS5 shows a clear preference for 5'-CWAR-3' target sequences. The role of the putative second and third ORFs (ins5B and ins5C), located on the other strand (Rak and von Reutern, 1984), remains to be confirmed. IS5 has also been shown to behave as a transcriptional enhancer of the otherwise cryptic E. coli bgl operon (Schnetz and Rak, 1992). Muramatsu et al. (1988) have shown that IS5 contains a sharply curved DNA structure (sequence-directed bent DNA) centered about 60-70 bp from its right end, in an AT-rich region.
IS5 is also present in Neisseria gonorrhoeae (accession number U20421); its DNA sequence is 100% identical to the sequence in the database.
References