Jump to content

IS Families/IS5 and related IS1182 families

From TnPedia

Original Identification

IS5 was originally isolated as an insertion into the immunity region of bacteriophage lambda and subsequently found as a cause of mutation in a number of E. coli genes[1][2][3][4]. Together with IS1, it was also identified as an activator (by insertion) of expression of the usually cryptic beta-glucosidase gene of E. coli[5][6][7][8][9][10].

Presence in Compound Transposons

Several members are associated with compound transposons. These include IS903 and IS602, which form part of the kanamycin resistance transposons Tn903[11], and Tn602[12] respectively, and ISVa1 and ISVa2 which form part of a transposon carrying iron transport genes[13].

Distribution

The IS5 family, like the IS4 family, is also a relatively heterogeneous group which now requires reanalysis. It also includes sequences from both eubacteria and the archaea. There are now a large number of identified members of the IS5 family (>550 members) and of a closely related IS1182 family (>150 members) which have allowed a more detailed analysis and a separation into various subgroups and families. The IS5 family is partitioned into 6 subgroups: IS5, IS903, ISL2, ISH1, IS1031 and IS427[14] (Table Characteristics of IS families; Fig.5.1). Some of these may prove to be emerging families. Members of the IS5 subgroup appear to be composed of two groups with different lengths: one of 1060-1300 bp and a second of 1460-1610 (Fig.5.2 A).

Fig. IS5.1. Correspondence between the IS IRs and different IS5 family subgroups.
Fig. IS5.2. IS5 family IS5 subgroups A) distribution of IS length (base pairs); B) distribution of the length of transposase (amino acid residues).

Diversity

The transposases of these are also of different lengths (Fig.5.2 B) and transposase length is correlated with that of the IS. The lengths of the IS1013 subgroup are between ~900 and ~1200 bp with the majority between 103 and 1090 bp (Fig.5.3), those of the IS427 group are between 800 and 1070 bp in length with most having lengths in the range of 810 900 bp (Fig.5.3). Members of the IS903 subgroup are generally about 1030-1090 bp long (Fig.5.4 A ), those of the ISH1 subgroup are about 850 - 1200 bp long (note that this subgroup includes a number of Miniature Inverted repeat Transposable Elements (MITES) (Fig.5.4 B) and ISL2 members are 820 to 1260 bp long with a majority of about 820-970 bp (Fig.5.4 C). There are a large number of additional IS5 family members whose attribution to subgroups has yet to be established.

Fig. IS5.3. IS5 family IS1031 and IS427 subgroups. Top: distribution of IS length (base pairs) IS1031; Bottom: distribution of IS length (base pairs) IS427. The number of examples used in the sample is shown above each column.
Fig. IS5.4. Length (base pairs) distribution of IS5 family IS903, ISH1 and ISL2 subgroups. The number of examples used in the sample is shown above each column.

There is a distant relationship, about 30% similarity, between IS5 and the Pif/Harbinger group of eukaryotic TE[15].

Organization

Although the majority of members have a single Tpase orf, about 20% may express Tpase by frameshifting since it is distributed between two translation phases similar to most of the IS427 subgroup (82/116)[14]. In these cases if frameshifting indeed occurs the frameshifting signals appear more appropriate for a programmed transcriptional realignment frameshift mechanism (PTR) rather than for classical translation frameshifting (PRF) since there are no obvious downstream enhancement signals[16].

Similar split reading frames have also been identified in several of the other subgroups: IS1031 (13/65 members); ISL2 (7/43); and few in the IS5 subgroup (7/149). There is no experimental evidence that these frameshift signals are functional but many of these IS are in multiple copies suggesting that the derivatives are active. In view of their diversity compared to families such as IS3, the subgroups will certainly be partitioned into additional groups as more ISs are identified.

At present, the IS903 and the archaeal ISH1 subgroups whose IR are quite similar (Fig.5.5) do not contain members with potential frameshifting.

Fig. IS5.5. WebLogo showing the most common ISH1 and IS903 ends. The left (IRL) and right IRR inverted terminal repeats are shown in WebLogo format. From top to bottom: IS1031, IS427, ISL2, ISH1, IS903, IS5 subgroups.

In addition to their Tpases and the presence or absence of potential frameshifting, a further distinction between these elements resides in their target specificities.

Certain IS427 subgroup members and IS1182 family members do not carry a termination codon for their Tpases but generate this on insertion into a specific target sequence, CTAG, which is duplicated on insertion. Other IS such as IS1031, duplicate a sequence TNA while others such ISL2 appear to duplicate ANT.

The lengths of the entire group range from 789 bp (e.g., ISMbu1) to 1643 bp (eg., IS493). The latter carries a second open reading frame upstream of the "Tpase" frame inessential for transposition[17]. IS4811 (Tn4811[18], which is greater than 5kb, clearly contains a number of passenger genes including one with a consensus ATP/GTP-binding motif; an oxidoreductase-like protein; and one related to bacterial transcription regulators of the AraC family. Another, IS881 from Streptomyces, is interrupted by a group II intron.

The major feature which defines this group is the similarities between their putative Tpases[19]. This includes the N2, N3 and C1 domains carried by the IS4 group[20]. However, IS5 family Tpases exhibit a spacing between the N3 and C1 domains of approximately 40 residues, a distance more consistent with the canonical DDE motif[14].

Analysis of the largely increased number of members generally confirms these subgroups. Members within each group also generate distinct DRs of similar lengths (IS5, 4 bp; ISL2, 2-3 bp; IS1031, 3-4 bp; IS903, 8-9 bp; and IS427, 2-3 bp).

The IS903 and ISH1 subgroups have similar terminal IRs (Fig.5.5) but appear distinct by correlation with the length of the target duplication and, to a lesser extent, by the typical length of the entire IS (Fig.5.4).

Several members exhibit GATC sites within their terminal 50 bp. This includes all members of the IS903 subgroup and many members of the IS1031 and IS427 subgroups. IS903 transposition activity has been shown to be modulated by Dam in vivo (cited in [21]).

A preferred target sequence, YTAR (often CTAG), is observed for two subgroups, IS5 and IS427, and for two members of the ISL2 group (eg., IS112 and IS1373) in which either all four base pairs or the central TA are duplicated on insertion.

It is important to underline that, in many cases, the sequence of the original target site before insertion is not available. This can introduce ambiguities not only in estimating the number of duplicated target base pairs but also in defining the IRs. It is particularly important in several cases where the target repeat is symmetrical (e.g. CTAG) and where it is impossible to distinguish whether the element duplicates 2 or 4 bp and therefore to determine the exact ends of the element. Alignment of the ends of these elements in subgroups has permitted a number of ambiguities to be resolved. Members of the ISL2 group which generate 3 bp DRs exhibit a preference for ANT while those from the IS1031 group (which generate exclusively a 3 bp DR) exhibits a preference for insertion sites with the sequence TNA. Neither the small ISH1 group (8 bp DRs) nor the IS903 group (9 bp DRs) exhibit marked target specificity (see IS903 and also Target specificity). Only two of these elements, IS5 and IS903, have received significant attention.

IS5 group

In spite of the historical importance of IS5 in generating mutations, the published work concerning this element is largely directed to an understanding of its coding capacity and expression properties. IS5 carries one large orf, ins5A, spanning the entire element and shown to be essential for transposition IS5 (see [7]), and two small orfs (ins5B and 5C[22][23][24][25], whose relevance to transposition remains to be demonstrated. Nothing is known about the transposition mechanism of this element.

Mechanism: IS903

The only IS5 family member which transposition mechanism has been addressed at present is IS903. The ends of IS903 carry IRs of 18 bp which exhibit the typical two-domain organization[26] . Transposase has been shown to bind specifically to the ends using a region located in the amino-terminal portion of the protein[27][28]. In addition, a region possibly involved in the formation of higher order multimers has been identified and residues probably involved in catalysis have been pinpointed among the conserved residues in the catalytic DDE domain[28]. Insertion generates a 9 bp target duplication.

An elegant genetic analysis provided strong evidence that IS903 is not only capable of undergoing direct insertion but can also generate adjacent deletions in a duplicative manner. Moreover, point mutations in the terminal base pair of the IRs decrease overall transposition frequency but increase the frequency of cointegrate formation[29]. Similarly, mutation of the first nucleotide flanking an IR also influences the level of cointegrate formation[30]. The level of cointegrate formation can also be increased by mutation of the Tpase. The molecular nature of these effects requires further investigation.

Factors affecting IS903 target site choice have been addressed in some detail. Initial studies[31] identified that insertion into the conjugative plasmid pOX38 showed no consensus in the 9 bp target duplication produced on insertion but the alignment of the target sequences indicated a preference for sites with symmetry on either side. A cloned copy of one native symmetric site into a second conjugative plasmid, pUB307, confirmed its attractiveness for insertion. More extensive studies provided a consensus symmetric target sequence which, when cloned into a target replicon, proved highly efficient[32]. The preferred target was a 21 bp palindrome cantered on the 9 bp target duplication. It could be dissected into: the 5 bp flanking sequences, the most important for site-specific insertion; the 7 bp palindromic core within the target duplication; the dinucleotide pair at the transposon-target junction; and the local DNA context.

Insertion into pUB307 itself showed a strong preference for a single orientation. By inverting either the vegetative (oriV) or transfer, oriT, origins, it was concluded that orientation was determined by the direction of conjugative transfer. This of course implies that the ends of IS903 are not equivalent. It also implies, as is the case for Tn7[33][34][35][36][37] and members of the IS200/IS608 family[38][39][40], that transposition targets replication forks.

The requirement the most abundant nucleoid proteins in transposition[41]. Most notably, H-NS was required for efficient transposition. Similar results were obtained for IS10 and Tn522 suggesting a more general role for H-NS in bacterial transposition. H-NS exerts its effect on target capture: IS903. Targeting preferences in the E. coli chromosome were dramatically altered in the absence of H-NS.

Several other host mutants were identified exhibiting a unique population pattern[42]: a ring phenotype with predominant papillae located just inside the edge of the colony, implying a spatial triggering of transposition within the. These mutants were found to be in pur genes, whose products are involved in purine biosynthesis. The genetic evidence was consistent with a requirement for GTP in IS903 transposition. These observations suggest that transposition occurs in later stages of colony growth. Transposition may occur within the colony edge in response to either a gradient of exogenous purines across the colony and may also reflect the developmental stage of the cells.

IS903 transposase like those of a variety of other IS, exhibits a strong preference for action in cis: complementation of defective transposons in trans occurs at less than 1% [42]. Transposition is extremely sensitive to the distance between the 3' end of the transposase gene and the nearest transposon IR. Insertion of 1 kb of DNA reduces transposition to 1-2%. There is a strong correlation between the stability of transposase and its ability to act in trans. wild-type transposase has a half-life of about 3 min. Fusion with α-galactosidase stabilizes the protein and results in an increase in its capacity to act in trans. A similar effect was noted in a lon mutant strain where trans activity was increased by a factor of 10-100. Further studies identified a class of transposase mutants specifically enhanced in trans activity and reduced in cis activity without increasing the overall transposition frequency. This was correlated with an increase in transposase half-life compared to the wildtype Derbyshire[43]. A second class of mutants with enhanced cis activity resulted in increased levels of transposase expression (as for IS10[44]).

IS1182

IS1182 family members exhibit a diverse set of target specificities. Some duplicate 4 bp. These are of two types: those specific for CTAG and those that show no apparent target sequence specificity. Yet others target palindromic sequences. These are also of different types: some insert at the 3’ foot of a stem-loop and duplicate the entire structure while others insert 3’ of the loop and simply duplicate the loop (P. Siguier, E. Gourbeyre and M. Chandler, unpublished) (Fig.IS1182.1).

Fig. IS1182.1. The IS1182 family's main characteristics. Top: The left (IRL) and right IRR inverted terminal repeats are shown in WebLogo format. Bottom: distribution of IS length (base pairs) IS1182 family members. The number of examples used in the sample is shown above each column.

ISDol1 group (ISNCY)

Another small group, ISDol1, with 58 members from a large number of bacterial species has emerged from the ISNCY “orphan” group. Members have a length of between 1600-1900 bp (Fig.ISDol.1) and generate DRs of 6-7bp.

Fig. ISDol1.1. The IS1182 family's main characteristics. Top: The left (IRL) and right IRR inverted terminal repeats are shown in WebLogo format. Bottom: distribution of IS length (base pairs) IS1182 family members. The number of examples used in the sample is shown above each column.

Bibliography

  1. Blattner et al.. Deletions and insertions in the immunity region of coliphage lambda: revised measurement of the promoter-startpoint distance. Virology. 1974. 62. pp. 458-71. doi: 10.1016/0042-6822(74)90407-3. PMID: 4432374.
  2. Charlier et al.. Heteroduplex analysis of regulatory mutations and of insertions (IS1, IS2, IS5) in the bipolar argECBH operon of Escherichia coli. Molecular & general genetics : MGG. 1978. 161. pp. 175-84. doi: 10.1007/BF00274186. PMID: 353507.
  3. Charlier et al.. Bidirectional polarity of IS2 elements and the polar effect of an IS5 insertion in the argECBH gene cluster of Escherichia coli [proceedings]. Archives internationales de physiologie et de biochimie. 1978. 86. pp. 909-10. PMID: 84614.
  4. Chow & Broker. Adjacent insertion sequences IS2 and IS5 in bacteriophage Mu mutants and an IS5 in a lambda darg bacteriophage. Journal of bacteriology. 1978. 133. pp. 1427-36. doi: 10.1128/jb.133.3.1427-1436.1978. PMID: 641012.
  5. Reynolds et al.. Insertion of DNA activates the cryptic bgl operon in E. coli K12. Nature. 1981. 293. pp. 625-9. doi: 10.1038/293625a0. PMID: 6270569.
  6. Schnetz et al.. Beta-glucoside (bgl) operon of Escherichia coli K-12: nucleotide sequence, genetic organization, and possible evolutionary relationship to regulatory components of two Bacillus subtilis genes. Journal of bacteriology. 1987. 169. pp. 2579-90. doi: 10.1128/jb.169.6.2579-2590.1987. PMID: 3034860.
  7. 7.0 7.1 Schnetz & Rak. IS5: a mobile enhancer of transcription in Escherichia coli. Proceedings of the National Academy of Sciences of the United States of America. 1992. 89. pp. 1244-8. doi: 10.1073/pnas.89.4.1244. PMID: 1311089.
  8. Schnetz & Rak. Regulation of the bgl operon of Escherichia coli by transcriptional antitermination. The EMBO journal. 1988. 7. pp. 3271-7. doi: 10.1002/j.1460-2075.1988.tb03194.x. PMID: 2846278.
  9. Schnetz. Silencing of Escherichia coli bgl promoter by flanking sequence elements. The EMBO journal. 1995. 14. pp. 2545-50. doi: 10.1002/j.1460-2075.1995.tb07252.x. PMID: 7781607.
  10. Schnetz & Wang. Silencing of the Escherichia coli bgl promoter: effects of template supercoiling and cell extracts on promoter activity in vitro. Nucleic acids research. 1996. 24. pp. 2422-8. doi: 10.1093/nar/24.12.2422. PMID: 8710516.
  11. Grindley & Joyce. Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903. Proceedings of the National Academy of Sciences of the United States of America. 1980. 77. pp. 7176-80. doi: 10.1073/pnas.77.12.7176. PMID: 6261245.
  12. Stibitz & Davies. Tn602: a naturally occurring relative of Tn903 with direct repeats. Plasmid. 1987. 17. pp. 202-9. doi: 10.1016/0147-619x(87)90028-x. PMID: 2819910.
  13. Tolmasky & Crosa. Iron transport genes of the pJM1-mediated iron uptake system of Vibrio anguillarum are included in a transposonlike structure. Plasmid. 1995. 33. pp. 180-90. doi: 10.1006/plas.1995.1019. PMID: 7568465.
  14. 14.0 14.1 14.2 Mahillon & Chandler. Insertion sequences. Microbiology and molecular biology reviews : MMBR. 1998. 62. pp. 725-74. doi: 10.1128/MMBR.62.3.725-774.1998. PMID: 9729608.
  15. Zhang et al.. PIF- and Pong-like transposable elements: distribution, evolution and relationship with Tourist-like miniature inverted-repeat transposable elements. Genetics. 2004. 166. pp. 971-86. doi: 10.1534/genetics.166.2.971. PMID: 15020481.
  16. Sharma et al.. A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment. Molecular biology and evolution. 2011. 28. pp. 3195-211. doi: 10.1093/molbev/msr155. PMID: 21673094.
  17. Baltz et al.. Transposition of Tn5096 and related transposons in Streptomyces species. Gene. 1992. 115. pp. 61-5. doi: 10.1016/0378-1119(92)90541-v. PMID: 1319378.
  18. Chen et al.. Discovery and characterization of a new transposable element, Tn4811, in Streptomyces lividans 66. Journal of bacteriology. 1992. 174. pp. 7762-9. doi: 10.1128/jb.174.23.7762-7769.1992. PMID: 1332944.
  19. Rezsöhazy et al.. The IS4 family of insertion sequences: evidence for a conserved transposase motif. Molecular microbiology. 1993. 9. pp. 1283-95. doi: 10.1111/j.1365-2958.1993.tb01258.x. PMID: 7934941.
  20. Rezsöhazy et al.. The IS4 family of insertion sequences: evidence for a conserved transposase motif. Molecular microbiology. 1993. 9. pp. 1283-95. doi: 10.1111/j.1365-2958.1993.tb01258.x. PMID: 7934941.
  21. Roberts et al.. IS10 transposition is regulated by DNA adenine methylation. Cell. 1985. 43. pp. 117-30. doi: 10.1016/0092-8674(85)90017-0. PMID: 3000598.
  22. Engler & van Bree. The nucleotide sequence and protein-coding capability of the transposable element IS5. Gene. 1981. 14. pp. 155-63. doi: 10.1016/0378-1119(81)90111-6. PMID: 6269958.
  23. Schoner & Kahn. The nucleotide sequence of IS5 from Escherichia coli. Gene. 1981. 14. pp. 165-74. doi: 10.1016/0378-1119(81)90112-8. PMID: 6269959.
  24. Rak et al.. Expression of two proteins from overlapping and oppositely oriented genes on transposable DNA insertion element IS5. Nature. 1982. 297. pp. 124-8. doi: 10.1038/297124a0. PMID: 6281651.
  25. Rak & von Reutern. Insertion element IS5 contains a third gene. The EMBO journal. 1984. 3. pp. 807-11. doi: 10.1002/j.1460-2075.1984.tb01889.x. PMID: 6327289.
  26. Derbyshire et al.. Genetic analysis of the interaction of the insertion sequence IS903 transposase with its terminal inverted repeats. Proceedings of the National Academy of Sciences of the United States of America. 1987. 84. pp. 8049-53. doi: 10.1073/pnas.84.22.8049. PMID: 2825175.
  27. Derbyshire & Grindley. Binding of the IS903 transposase to its inverted repeat in vitro. The EMBO journal. 1992. 11. pp. 3449-55. doi: 10.1002/j.1460-2075.1992.tb05424.x. PMID: 1324175.
  28. 28.0 28.1 Tavakoli et al.. Defining functional regions of the IS903 transposase. Journal of molecular biology. 1997. 274. pp. 491-504. doi: 10.1006/jmbi.1997.1410. PMID: 9417930.
  29. Tavakoli & Derbyshire. IS903 transposase mutants that suppress defective inverted repeats. Molecular microbiology. 1999. 31. pp. 1183-95. doi: 10.1046/j.1365-2958.1999.01260.x. PMID: 10096085.
  30. Tavakoli & Derbyshire. Tipping the balance between replicative and simple transposition. The EMBO journal. 2001. 20. pp. 2923-30. doi: 10.1093/emboj/20.11.2923. PMID: 11387225.
  31. Hu & Derbyshire. Target choice and orientation preference of the insertion sequence IS903. Journal of bacteriology. 1998. 180. pp. 3039-48. doi: 10.1128/JB.180.12.3039-3048.1998. PMID: 9620951.
  32. Hu et al.. Anatomy of a preferred target site for the bacterial insertion sequence IS903. Journal of molecular biology. 2001. 306. pp. 403-16. doi: 10.1006/jmbi.2000.4421. PMID: 11178901.
  33. Wolkow et al.. Conjugating plasmids are preferred targets for Tn7. Genes & development. 1996. 10. pp. 2145-57. doi: 10.1101/gad.10.17.2145. PMID: 8804309.
  34. Peters & Craig. Tn7 recognizes transposition target structures associated with DNA replication using the DNA-binding protein TnsE. Genes & development. 2001. 15. pp. 737-47. doi: 10.1101/gad.870201. PMID: 11274058.
  35. Peters & Craig. Tn7 transposes proximal to DNA double-strand breaks and into regions where chromosomal DNA replication terminates. Molecular cell. 2000. 6. pp. 573-82. doi: 10.1016/s1097-2765(00)00056-3. PMID: 11030337.
  36. Peters. Tn7. Microbiology spectrum. 2014. 2. doi: 10.1128/microbiolspec.MDNA3-0010-2014. PMID: 26104363.
  37. Parks et al.. Transposition into replicating DNA occurs through interaction with the processivity factor. Cell. 2009. 138. pp. 685-95. doi: 10.1016/j.cell.2009.06.011. PMID: 19703395.
  38. He et al.. The IS200/IS605 Family and "Peel and Paste" Single-strand Transposition Mechanism. Microbiology spectrum. 2015. 3. doi: 10.1128/microbiolspec.MDNA3-0039-2014. PMID: 26350330.
  39. Lavatine et al.. Single strand transposition at the host replication fork. Nucleic acids research. 2016. 44. pp. 7866-83. doi: 10.1093/nar/gkw661. PMID: 27466393.
  40. Ton-Hoang et al.. Single-stranded DNA transposition is coupled to host replication. Cell. 2010. 142. pp. 398-408. doi: 10.1016/j.cell.2010.06.034. PMID: 20691900.
  41. Swingle et al.. The effect of host-encoded nucleoid proteins on transposition: H-NS influences targeting of both IS903 and Tn10. Molecular microbiology. 2004. 52. pp. 1055-67. doi: 10.1111/j.1365-2958.2004.04051.x. PMID: 15130124.
  42. 42.0 42.1 Coros et al.. Genetic evidence that GTP is required for transposition of IS903 and Tn552 in Escherichia coli. Journal of bacteriology. 2005. 187. pp. 4598-606. doi: 10.1128/JB.187.13.4598-4606.2005. PMID: 15968071.
  43. Derbyshire & Grindley. Cis preference of the IS903 transposase is mediated by a combination of transposase instability and inefficient translation. Molecular microbiology. 1996. 21. pp. 1261-72. doi: 10.1111/j.1365-2958.1996.tb02587.x. PMID: 8898394.
  44. Jain & Kleckner. Preferential cis action of IS10 transposase depends upon its mode of synthesis. Molecular microbiology. 1993. 9. pp. 249-60. doi: 10.1111/j.1365-2958.1993.tb01687.x. PMID: 8412678.


How to Cite?

TnPedia Team. (2025). TnPedia: IS5 and related IS1182 families of Prokaryotic Insertion Sequences. Zenodo. https://doi.org/10.5281/zenodo.15636051

DOI badge