The Growth-Suppressive Function of the Polycomb Group Protein Polyhomeotic Is Mediated by Polymerization of Its Sterile Alpha Motif (SAM) Domain*

Background: The mechanism by which Sterile Alpha Motifs (SAMs) self-associate and polymerize to control protein function is unknown. Results: SAM polymerization in Polyhomeotic, a Polycomb group protein, is controlled by an unstructured linker sequence in Polyhomeotic. Conclusion: Polyhomeotic growth suppressive function is enhanced by increasing SAM polymerization. Significance: Functions of other SAM domain-containing proteins could be manipulated through their unstructured linkers. Polyhomeotic (Ph), a member of the Polycomb Group (PcG), is a gene silencer critical for proper development. We present a previously unrecognized way of controlling Ph function through modulation of its sterile alpha motif (SAM) polymerization leading to the identification of a novel target for tuning the activities of proteins. SAM domain containing proteins have been shown to require SAM polymerization for proper function. However, the role of the Ph SAM polymer in PcG-mediated gene silencing was uncertain. Here, we first show that Ph SAM polymerization is indeed required for its gene silencing function. Interestingly, the unstructured linker sequence N-terminal to Ph SAM can shorten the length of polymers compared with when Ph SAM is individually isolated. Substituting the native linker with a random, unstructured sequence (RLink) can still limit polymerization, but not as well as the native linker. Consequently, the increased polymeric Ph RLink exhibits better gene silencing ability. In the Drosophila wing disc, Ph RLink expression suppresses growth compared with no effect for wild-type Ph, and opposite to the overgrowth phenotype observed for polymer-deficient Ph mutants. These data provide the first demonstration that the inherent activity of a protein containing a polymeric SAM can be enhanced by increasing SAM polymerization. Because the SAM linker had not been previously considered important for the function of SAM-containing proteins, our finding opens numerous opportunities to manipulate linker sequences of hundreds of polymeric SAM proteins to regulate a diverse array of intracellular functions.

Sterile Alpha Motif (SAM) 2 domains are highly prevalent in eukaryotes, found in over 3000 proteins that exist in every cell compartment. Proteins with SAM domains function as transcription factors, scaffolding proteins, nucleic acid-binding proteins, kinases and others (1). The ability to self-associate is a common feature present in a number of SAM domains. Examination of the structures of several of these SAM domains has revealed that, despite a lack of sequence identity, they all form a head-to-tail, open-ended, left-handed helical, homo-polymer architecture (2)(3)(4)(5)(6)(7)(8) (Fig. 1, A and B). A recent analysis of all SAM domain sequences upon "threading" them into known polymer structures estimates that nearly 700 SAM domains are polymeric (9). This may be a low estimate if other SAM domains also form a similar architecture utilizing different residues for polymerization than those already identified for polymer forming SAM domains.
Polymerization of SAM domains has been shown to be vital to the proper function of a protein (5)(6)(7)(8)10) indicating a close relationship between protein function and SAM domain polymerization and suggesting protein activity could be manipulated by controlling the dynamic polymerization of the SAM domain. However, what remains largely unresolved is how SAM polymerization is regulated.
Polymeric SAM domains likely play an important role in Polycomb Group (PcG) function. The PcG is a family of gene silencing proteins that are noted for their role as developmental regulators that maintain the repressed state of Drosophila homeotic (HOX) genes (11). Of the 15 to 20 PcG proteins identified thus far, four contain SAM domains: Ph (Polyhomeotic), Scm (Sex combs on midleg), L3mbt (lethal 3 malignant brain tumor), and Sfmbt (Scm-related gene containing four mbt domains). In vitro, Ph SAM and Scm SAM can both form * This work was supported by the American Cancer Society (RSG-08-285-01-homo-polymers in addition to being able to hetero-polymerize with each other (3,4,12,13). Binding affinity measurements suggest a model whereby the two homo-polymers come together at a single junction (4). While the polymeric status of L3mbt and Sfmbt are not known, sequence analysis of their SAM domains indicate that polymerization is likely (9). For Ph and Scm SAM (like all other SAM domains that polymerize into a helical architecture), the head-to-tail homo-polymerization is mediated through two different binding surfaces on the SAM domain: the Mid-Loop (ML) binding surface, named for the loop structure in the middle of the SAM sequence that houses key SAM-SAM-interacting residues, and the End-Helix (EH) binding surface which contains binding residues on the C-terminal helix of the SAM structure (Fig. 1B). The SAM domains of Ph and Scm are at the C terminus of both proteins (Fig. 1C). It is important to note that both the N and C termini of these and other polymeric SAM domains point outward from the polymer helix axis. Therefore, it would be feasible to both form a helical polymer structure while being able to sterically accommodate the other parts of the protein that extend away from either the N or C terminus of the SAM domain.
Ph is a core member of a multi-protein PcG complex called Polycomb Repression Complex 1 (PRC1), which also includes Posterior Sex combs (Psc), RING1, and Polycomb (Pc) (14,15). Scm can associate with PRC1 though in less than stoichiometric amounts compared with the other PRC1 core components (16). Even though the isolated Ph SAM is able to polymerize as evidenced by its ability to form long filaments and the strong SAM-SAM self-association required for mediating polymerization demonstrated in both GST pull-down experiments and measurement of the K d of 200 nM (3), PRC1 is not polymeric. Electron micrographs of PRC1 isolated from a baculovirus expression system does not show the presence of long, fibrous structures that would be consistent with a polymer structure (17). Scanning force microscopy analyses show limited PRC1 oligomerization (estimated in the range of four to six PRC1 units) when PRC1 binds DNA, but longer polymers were not detected (18). Interestingly, the architecture of the PRC1/DNA complex does exhibit a left-handed helical assembly, perhaps as a result of short polymers formed through Ph SAM polymerization. Because no studies to date have been reported in which Ph SAM polymerization has been directly shown to be an important element of Ph function, we sought to determine the importance of Ph SAM in Ph function. In addition, we wanted to determine why a protein complex that contains a polymeric SAM domain appears to exhibit limited polymerization in the context of PRC1. The results of our study indicate that unstructured residues adjacent to Ph SAM play an important role in controlling polymerization, perhaps facilitated by the helical architecture which is common to all SAM polymers identified thus far. Furthermore, given that the function of proteins that contain polymeric SAM domains are dependent on polymerization, our results could have consequences toward being able to modulate protein activity, and consequently cell function, through SAM linker control of the dynamic polymerization of SAM domains.

EXPERIMENTAL PROCEDURES
Luciferase Reporter Transcription Assay-The polyhomeoticproximal (ph-p, also called ph) genes were cloned into pPAc-Flag (kind gift from Dr. Albert J. Courey). The expression vectors for ph (100 ng), lacZ (7.5 ng) (kind gift from Dr. Yuzuru Shiio), both under control of the actin 5c promoter, along with the luciferase reporter gene were transfected into 1 ϫ 10 5 Drosophila S2 cells using the Fugene HD transfection reagent (Roche Applied Science). For Fig. 6C, 50 ng and 3.75 ng of the  1KW4). B, illustration of the head-to-tail arrangement of the SAM polymer. The K d measurement was previously reported (3). C, close-up of the SAM-SAM interface of A showing the location of the residues targeted for mutagenesis. The structure shows the L1547R mutation as it was used for the crystal structure determination. Ph SAM surface mutants L1547R, H1552R, and L1565R correspond to L51R, H56R, and L69R, respectively, that were used previously (3,4). D, domain organization of the proximal Ph of Drosophila melanogaster used in this study. HD1: homology domain 1; FCS: Phe-Cys-Ser domain; SAM: sterile alpha motif. These three domains are conserved in all Ph orthologs.
ph and lacZ plasmids were used. There are three tandem repeats of the Zif268 DNA binding elements immediately upstream of a metallothionine promoter (MTp) controlling the expression of the luciferase reporter gene. Two days after transfection, the expression of the luciferase gene was induced by adding 100 M CuSO 4 . The cells were harvested the following day and lysed using 100 mM potassium phosphate buffer pH 7.8, 0.2% Triton X-100, and 0.5 mM DTT. For all individual transfections, equal volumes of lysate were used for the Dual-Light Combine Reporter Gene Assay System (Applied Biosystems) to measure both luciferase and ␤-galactosidase activities. The data are presented as the ratio of the two enzyme activities.
Transgenic Drosophila-All ph transgenes were cloned into pattB-UASp, a vector required for PhiC31 integration (19). The DNA was injected into early syncytial-stage blastoderm embryos (Rainbow Transgenics) that carries both a source of the PhiC31 integrase on the X chromosome and attP target site at chromosomal position 58A (yw; M{eGFP.vas-int.Dm} 2A ; M{RFP.attP} 58A ). Successful transformants were selected on the basis of the stable integration of the white gene into white mutant background. All transgenes include a Flag-epitope tag and are under control of the UAS promoter whose expression was driven by mating the transgenic flies with either the engrailed Gal4 (en-Gal4) or actin-Gal4 lines (see FlyBase database for descriptions of the Gal4 driver lines). Table S1 summarizes all the protein constructs used in the in vitro experiments. Except for those fused to MBP, ph genes were subcloned into a modified pET-3c vector (Novagen), transformed into BL21-pLysS cells and expression induced with 1 mM IPTG. For the MBP fused constructs, ph genes were subcloned into pBADM-41ϩ (EMBL), transformed into ARI814 cells (20), and expression induced with 0.2% arabinose. Typically, harvested bacterial cells from 1-liter cultures were resuspended with 10 ml of 50 mM Tris, pH 8.0, 100 mM NaCl, 25 mM imidazole, pH 7.5, 1 mM PMSF, and 10 mM ␤ME. Cells were lysed by sonication and initial purification of the proteins was performed using Ni affinity chromatography followed by ion exchange chromatography. Proteins that required cleavage of tag sequences were purified by Ni affinity chromatography, cleaved with either TEV or Senp2 (a kind gift from Dr. Christopher D. Lima) followed by a second Ni affinity chromatography where the non-binding fractions were collected. Further purifications were performed using ion exchange chromatography. Ph sc1398 -1484 1485-1577 expressed insolubly in bacteria. For this protein, bacterial cells were first resuspended in 50 mM Tris, pH 8, 200 mM NaCl, 10 mM imidazole, pH 7.5, 1 mM PMSF, 10 mM ␤-ME, and lysed by sonication. The soluble lysate was separated, and the pellet solubilized in the same buffer containing 6 M urea. Following non-native Ni affinity purification, the protein was refolded through dialysis into native buffer then further purified as described above for the proteins expressed in the soluble lysate.

Protein Preparation-Supplemental
Analytical Ultracentrifugation-All samples for AUC were placed in 10 mM Tris pH 8.0, 50 mM NaCl, and 1 mM TCEP. The SV data were analyzed with Ultrascan (21) (version 9.9 release 1282). All experiments were scanned in intensity mode at 280 nm, 20°C, ranging between 25-40 krpm in standard 2-chanel epon centerpieces (Beckman-Coulter). All data were initially analyzed with the 2-dimensional spectrum analysis to remove time and radially invariant noise from the data (22), followed by van Holde -Weischet analysis (23), and Genetic Algorithm-Monte Carlo analysis (24,25). All sedimentation velocity experiments were performed at the Center for Analytical Ultracentrifugation of Macromolecular Assemblies at the University of Texas Health Science Center in San Antonio using a Beckman XLI ultracentrifuge. All computations were carried out using the TeraGrid infrastructure (26).
Atomic Force Microscopy-AFM imaging was performed in oscillation (tapping) mode in air, with a Multimode Nanoscope IIIa microscope (Bruker Corp.). Ph SAM, RLink Ph-(1485-1577) and Ph-(1397-1577) at 588, 620, and 620 M stock concentration, respectively, were prepared in 10 mM Tris/HCl pH 8.0, 50 mM NaCl, and 1 mM TCEP then diluted with 5 mM Tris/HCl, pH 7.4 buffer. Between 800 and 1700 particles of at least 4 nm in length were analyzed for each sample. 3 l of the sample was deposited on a freshly cleaved muscovite mica substrate and immediately briefly spinned on a turntable to prevent clumping of particles. After 2 min at room temperature the substrate with electrostatically attached protein molecules was washed three times with 50 l of double distilled water and dried under a stream of nitrogen. The imaging was carried out with TESP probes (Veeco Probes) with resonant frequency tuned to 290 -300 kHz, the amplitude of 100 -300 mV, the setpoint between 1.5 and 1.8 V, and the scan rate of 3.05 Hz. Images of 1 ϫ 1 m fields were collected in a height mode with digital resolution of 512 ϫ 512 pixels, with at least ten distinct areas imaged for each sample. The images were subjected to a standard flattening and plain-fitting in the Nanoscope software (v. 5.12). Morphometric analysis of imaged particles was performed with the SPIP software (v. 5.1.5, Image Metrology, Denmark), with the grains (particles) detected and fiber lengths approximated automatically.
CD Spectroscopy-Ph-(1397-1507) was prepared to a concentration of 0.2 mg/ml in 10 mM sodium phosphate buffer pH 7, 5 mM NaCl. The CD spectrum of was collected at room temperature using a Jasco J-815 CD spectrometer. Data were collected between 250 to 195 nm.
NMR Spectroscopy-All NMR samples (both the 15 N and the doubly labeled 15 N, 13 C Ph-(1397-1507)) were prepared to a concentration of 1.5 mM in 10 mM NaPO4 pH 5.0, 50 mM NaCl. The concentration of 15 N Ph-(1397-1507) before the addition of potential binding proteins was 1.5 mM. Ph SAM L1565R, TEL SAM A61D (2) or chicken egg white lysozyme (Sigma-Aldrich) were placed into the same buffer as Ph-(1397-1507) before adding 0.4 and 1.6 molar equivalents and measuring a { 1 H}-15 N HSQC spectrum after each addition. Backbone assignments were made using Sparky (T. D. Goddard and D. G. Kneller, SPARKY 3, University of California, San Francisco) on data collected on a Bruker 600 MHz spectrometer at 310 K. The backbone chemical shifts were deposited to the BMRB (accession number 17552). All other spectra were processed and analyzed with NMRPipe (27) and NMRView (28). Chemical shift perturbation experiments were performed at 310 K on a 700 MHz spectrometer. The identity of the backbone amide signals of the HSQCs collected at 300 K on the 700 MHz spectrometer were sufficiently similar to that collected at 310 K on the 600 MHz spectrometer to easily correlate the signals.

Ph SAM Polymerization Is Required for Ph-mediated Gene
Repression-To investigate the role of Ph SAM polymerization in Ph-mediated repression, we utilized a transcription assay carried out in Drosophila S2 cells ( Fig. 2A). We found that targeting wild-type Ph to the luciferase reporter gene is able to repress transcription compared with Ph that is not targeted (Fig. 2B). Moreover, a SAM domain-deleted Ph protein was unable to repress transcription showing that the SAM domain is required for the repressive function of Ph. We also tested Ph with mutations within its SAM domain that disrupt SAM-SAM interactions and are thus deficient in polymerization (3) (Fig. 1, B and C). Mutating either the ML and EH binding surface (Ph L1547R/H1552R and Ph L1565R, respectively), fails to repress luciferase compared with wild-type (Fig. 2B).
We further investigated the role of Ph SAM polymerization in repression using transgenic Drosophila. We ectopically expressed Flag-tagged wild-type or either of the two SAM polymer-deficient mutant Ph proteins in the posterior compartment of the imaginal wing disc and assessed the derepression of Abdominal B (AbdB), which is normally repressed by the PcG in these cells. When exogenous wild-type Ph was expressed, there was normal wing disc development and maintenance of the repressed state of AbdB (Fig. 2C). In contrast, expression of either Ph L1547/H1552R or L1565R mutants lead to AbdB derepression, as indicated by immunostaining with anti-AbdB antibodies (Fig. 2, D and E). In addition, overexpression of the polymer-deficient Ph L1547/H1552R or L1565R mutants resulted in a dominant negative phenotype showing grossly abnormal wing discs consistent with tissue overgrowth. Taken together, these results are the first to suggest that SAM-dependent polymerization of Ph is required for transcription repression.
Mini-Ph (Ph-(1291-1577)) Forms Shorter Polymers than Ph SAM Alone (Ph-(1502-1577))-The discrepancy between the evidence for Ph SAM polymerization (both in vitro (3) and gene silencing (Fig. 2) experiments) and the lack of open-ended polymers in microscopy studies (17,18) may be due to only shorter polymers forming or polymerizing not occurring in the larger context of the entire Ph protein or when Ph is a component of PRC1. To investigate the factors that might influence Ph SAM polymerization we used sedimentation velocity (SV) to measure the level of in vitro polymerization of a series of Ph constructs (Fig. 3). We analyzed the SV data via the van Holde-Weischet (vHW) analysis (23). This approach provides sedimentation coefficient distributions from which an assessment of heterogeneity can be obtained. Ph SAM alone (Ph-(1502-1577)) shows an S-value distribution from 6 -11 S (Fig.  3A), consistent with the self-association of the 10 kDa Ph SAM into high molecular weight oligomers and provides further support for the previously observed in vitro Ph SAM polymerization (3). We next characterized a Ph construct encompassing residues 1291-1577, which we refer to as mini-Ph because it contains the three predicted structured domains: HD1 (homology domain 1), the nucleic acid binding FCS (Phe-Cys-Ser) domain (29,30), and SAM (Fig. 1D) that are conserved in all Ph homologs. In contrast to Ph SAM, the sedimentation profile of mini-Ph exhibited lower S-values ranging from 5-6 S, even though mini-Ph has a higher calculated monomeric molecular mass (32 kDa) compared with just Ph SAM (10 kDa). Thus, while the individual Ph SAM domain forms long polymers, mini-Ph appears far smaller in its oligomeric state. We next determined whether the smaller mini-Ph oligomers were smaller polymers mediated by SAM-SAM interactions. We introduced mutations intended to interfere with the SAM-SAM interaction (3) into mini-Ph and performed SV. The ML surface residue L1547 is at the edge of the SAM-SAM interaction and while the L1547R mutation does hinder polymerization, some SAM-SAM self-association ability is still retained (3). Mini-Ph L1547R shows a lower S-value distribution compared with wild-type mini-Ph indicating a smaller oligomeric state. Ph SAM with the L1565R mutations, which exhibits no ability to self-associate (3), displays an even lower S-value distribution than that of mini-Ph L1547R. These results would be predicted if SAM-SAM interactions were mediating mini-Ph oligomerization. The wide range of S-values indicate that Ph SAM and mini-Ph are heterogeneous, existing as a range of polymer sizes that contain different number of individual Ph SAM or mini-Ph units. We used a genetic algorithm-Monte Carlo analysis (24,25,31,32) to obtain molecular weight distributions for Ph SAM alone, mini-Ph and mini-Ph mutants (Fig. 3B). For Ph SAM, the polymers exhibited molecular weights as large as 5 ϫ 10 5 Da with a weight average molecular mass of 220 kDa (Fig. 3B, top), while mini-Ph polymers were much smaller (Fig. 3B, top center). In agreement with the sedimentation coefficient profiles, mini-Ph L1547R (Fig. 3B, bottom center) and L1565R (Fig. 3B, bottom) mutants exhibited even lower molecular weights than wild-type mini-Ph. Interestingly, the weight average molecular mass for wild-type mini-Ph is 178 kDa, which is close to the calculated value for a hexamer. While the heterogeneous nature of the sample prevents determination of the precise molecular weight of the oligomers, the results suggest an explanation for why a 6-fold screw axis helical architecture is present for all known structures of polymeric SAM domains (see "Discussion").
Ph SAM N-terminal Linker Can Influence Polymerization-To determine which region outside of the Ph SAM domain can influence the degree of polymerization, we prepared Ph constructs that deleted residues from the N terminus of mini-Ph and measured their level of polymerization. The sedimentation profile of Ph-(1397-1577) (Fig. 3C) (Fig. 3, C--E). We also prepared proteins with maltose-binding protein (MBP) fused to the N terminus of these construct allowing us to perform similar SV experiments at lower protein concentrations (MBP increases the extinction coefficient) while also better mimicking the configuration of the intact protein with the additional 42 kDa to the N terminus of the protein. The MBP-fused Ph constructs exhibited increasing S-values, and thus polymerization, as more of the linker was deleted (Fig. 3, F and G). From these experiments, we conclude that the Ph SAM N-terminal linker helps control Ph SAM polymerization.
A Random, Unstructured Linker Can Hinder Polymerization but Not As Well As Native Linker-The Ph sequence that links the FCS and SAM domains is not conserved among the human Ph orthologs (Fig. 4A) and is predicted to be disordered (supplemental Fig. S1). A non-conserved, unstructured linker capable of influencing the extent of SAM polymerization may do so via conformational entropy (see "Discussion"). To test whether any unstructured sequence could produce shorter Ph SAM polymers, we engineered a randomized 87-amino acid sequence (RLink) composed of residues that have been found to be enriched in intrinsically disordered proteins (M, K, R, S, Q, P, E) (33) along with several glycines to impart additional conformational entropy to the sequence, and attached it to the N terminus of Ph-(1485-1577) (RLink Ph-(1485-1577)). The sedimentation profile of RLink Ph-(1485-1577) (Fig. 4B) exhibited a lower average S-value profile than Ph SAM, indicating that RLink can indeed control polymerization. However, compared with Ph- (1397-1577), which has the native linker, RLink Ph-(1485-1577) exhibits significantly higher S-values, indicating that there are additional features of the native linker that are involved in controlling polymerization.
We used AFM as an independent technique to examine the differences in polymerization between Ph SAM, Ph-(1397-1577) and RLink Ph-(1485-1577) (Fig. 4C). At 100 nM concentration, each of the three samples showed fibers of diverse lengths, suitable for particle size analysis. We used the "fiber length" parameter as a measure of the maximal linear size of a fiber. In order to include tangled polymers in our analysis, we also used the "skeleton length" parameter, which adds the length of branches to the fiber length. The scatter plots in Fig.  4C shows clear differences in polymer lengths with Ph SAM being the longest, followed by RLink Ph-(1485-1577) then Ph-(1397-1577). These differences are also discernible in the maximal fiber lengths which were 248 nm (PhSAM), 154 nm (RLink Ph-(1485-1577)), and 99 nm (Ph-(1397-1577)). Even more striking were the maximum skeleton lengths of the polymers for each protein: 468 nm (Ph SAM), 338 nm (RLink Ph-(1485-1577)), and 99 nm (Ph-(1397-1577)). The AFM results together with those from AUC SV (Fig. 4B) indicate there are longer polymers formed when RLink is attached to the SAM domain compared with the native linker.
Amino Acid Content of the SAM Linker Is Important for Controlling SAM Polymerization-We tested two additional linker sequences to further examine the role of the SAM linker in controlling SAM polymerization. We designed a construct encompassing Ph residues 1398 -1577 where residues 1398 -1484 were replaced with a sequence containing the same amino acids but arranged in a scrambled manner (Ph sc1398 -1484 1485-1577). Likewise, the linker sequence found N-terminal to Scm SAM (Scm residues 691-777) were also scrambled and attached to Ph-(1485-1577) (Scm sc691-777 Ph-(1485-1577)). While Scm sc691-777 Ph-(1485-1577) exhibited increased polymerization compared with the Ph SAM with its native linker (Ph-(1397-1577)), Ph sc1398 -1484 1485-1577, showed the identical sedimentation profile as Ph-(1397-1577) exhibiting a sedimentation coefficient of 3 s throughout the boundary fraction (Fig. 4D). These results suggest that the amino acid content of the SAM linker plays a role in controlling SAM polymerization.
Ph SAM Linker Can Interact with Ph SAM-We investigated the structural properties of the Ph SAM native linker (Ph-(1397-1507)) to try to probe why it is more effective in controlling Ph SAM polymerization better than RLink. Ph-(1397-1507) was analyzed by circular dichrosim (CD) and NMR spectroscopies. The CD spectrum of Ph-(1397-1507) (Fig. 5A) has features that would be expected for a random coil. The one-dimensional 1 H NMR spectrum of Ph-(1397-1507) shows neither up-nor down-field shifted resonances, indicating the absence of any tertiary structure (Fig. 5B). Moreover, the backbone amide signals in the Ph-(1397-1507) { 1 H}-15 N HSQC NMR spectrum (Fig. 5C, black) are all clustered within the region expected for a random coil between 7.8 and 8.6 1 H ppm. Together, these results indicate that Ph-(1397-1507) lacks any structure in the absence of the SAM domain.
To determine whether the linker residues directly contact the SAM domain as a way to control polymerization, we added, in trans, Ph SAM with the L1565R mutation to 15 N-labeled linker (Ph-1397-1507)) and observed perturbations to its { 1 H}-15 N HSQC spectrum amide backbone signals. Ph SAM L1565R was used for this experiment because it does not self-associate (based on GST pull-down assays (3)) thus helping to eliminate complications due to aggregation that would be seen with use of the wild-type Ph SAM polymer. The titration produced substantial perturbation of a number of signals in the Ph-(1397-1507) two-dimensional { 1 H}-15 N HSQC spectrum (Fig. 5C).
Upon addition of 0.4 equivalents of Ph SAM L1565R (Fig. 5C,  green), peak broadening occurred to several residues between 1415 and 1433 to the extent they were no longer visible. Upon addition of 1.6 equivalents of Ph SAM L1565R (Fig. 5C, red), the majority of the backbone amide signals in the linker (65 residues) were no longer visible (Fig. 4A). These results indicate a direct interaction can occur, in trans, between the Ph-(1397-1507) and the Ph SAM domain. Unfortunately, we were unable to perform the complementary experiment of adding unlabeled Ph SAM linker to 15 N-labeled Ph SAM L1565R because the two-dimensional { 1 H}-15 N HSQC of the Ph SAM L1565R was not of sufficient quality to perform the experiment. Control experiments adding either a monomeric mutant SAM domain from a protein called TEL (TEL SAM) or lysozyme showed no broadening of the amide signals but did exhibit movement due to weaker binding that occurred within the fast exchange NMR timescale (supplemental Fig. S2). Interestingly, the signals that  Fig. 3A and 3C, respectively. They are also shown here to allow better direct comparison. All samples were at 37 M and spun at a rotor speed of 40K RPM. C, AFM images (top) and scatter plots of polymer fiber lengths (bottom) of Ph SAM, RLink Ph-(1485-1577), and Ph-(1397-1577). All AFM fields are 200 ϫ 300 nm, with fiber and skeleton lengths marked as red lines, as detected with the grain analysis of the SPIP software. The gray color scale in the images represents particle height, with black as a background (0 nm) and white as 10 nm. D, vHW for the scrambled linker Ph SAM chimeras. were perturbed by the addition of either the TEL SAM mutant and lysozyme are, in general, the same ones which were broadened upon addition of 0.4 molar equivalents of Ph SAM L1565R (Ph-(1415-1420) and -(1427-1433)). However, mutating these residues did not result in significant changes to polymerization or repressive ability (supplemental Fig. S3) suggesting that more extensive changes, such as with RLink (see below), are required to alter Ph SAM polymerization. We have not yet isolated 15 N-labeled Ph-scrambled linker (Ph sc1398 -1484) to determine whether it too can directly contact Ph SAM.
Ph RLink Has Enhanced Growth Suppressive Ability Compared with Wild-type-Given that Ph-mediated gene repression is dependent on Ph SAM polymerization (Fig. 2), it was of interest to determine whether it would be possible to produce the opposite outcome of an engineered Ph protein with a SAM domain that has a greater ability to polymerize and hence greater gene silencing ability. For example, the interaction between two Ph SAMs (K d , 200 nM (3)) is much weaker than for the TEL SAM self-association (2 nM (2)). Thus, a Ph protein in which the SAM domain is replaced with that of TEL SAM might produce a more stable SAM polymer leading to better repression. Also, SV experiments (Fig. 3) indicated that deleting segments of the linker leads to increased S-values, i.e. polymerization, as does replacing the native linker with RLink (Fig. 4B). We incorporated these potential polymer enhancing variations and tested their repressive ability using the luciferase reporter assay (Fig. 6A). While all constructs were capable of repressing the reporter gene better than the control (Ph that is not targeted to the promoter), only the mutant Ph in which Ph-(1398 -1484) was replaced with RLink (Ph RLink) showed slightly lower luciferase expression than wild-type suggesting that Ph gene silencing ability can be enhanced through increased polymerization. Ph RLink that contain polymer-deficient mutations were not capable of repressing luciferase as well as Ph RLink suggesting that Ph RLink-mediated repression is indeed dependent on Ph SAM polymerization (Fig. 6C). We have compared the repressive abilities of wild-type Ph and Ph RLink several times and in each instance we have observed Ph RLink repression better than wildtype though the differences in standard deviations from the three independent transfections were not always exceeded between Ph wild-type and Ph RLink. While these results do point toward Ph RLink being a better repressor than wild-type Ph, the lack of clear differences in repression levels did not allow us to confidently gauge whether Ph RLink is indeed a better repressor than wildtype in the context of this assay.
However, the effect of RLink was quite more evident in vivo. We engineered transgenic flies and expressed the mutant ph transgenes in the wing disc using the engrailed Gal4; UAS-GFP (en-Gal4) driver. The engrailed gene is negatively regulated by Ph (34), which we hypothesize maintains sufficiently low levels of wild-type Ph to not have an adverse effect on wing disc development unlike the more robust actin-Gal4; UAS-GFP (actin-Gal4) driver (supplemental Fig. S4). While wild-type ph expression using the en-Gal4 driver results in production of normal wing discs (Figs. 2C and 6D), expression of both the polymer-deficient Ph L1547R/H1552R and Ph L1565R mutants led to large tissue overgrowth (Figs. 2, D and E and 6E). Remarkably, Ph RLink, engineered with the expectation that it would increase polymerization, exhibited the opposite phenotype compared with the Ph SAM polymer-deficient mutants showing even smaller discs than those that expressed wild-type ph (Fig. 6F). Recent studies have shown that ph possesses growth suppressive properties where overgrowth phenotypes are observed in flies with mutant ph (35)(36)(37). Moreover, wild-type ph overexpression in the eye can induce a small eye phenotype (35). We have similarly obtained an enhanced growth suppressive function not through the overexpression of wild-type ph, but rather, by increasing the polymerization of Ph SAM. Potential molecular bases of the result shown in Fig. 6F are in the "Discussion."

DISCUSSION
We have found that the sequence of unstructured residues adjacent to a polymeric SAM domain can control SAM polymerization. Using this knowledge, the polymerization of a SAM domain containing protein was, for the first time, able to be increased, resulting in an in vivo phenotype opposite to that produced by polymer-deficient mutants.
The measurements of Ph SAM polymerization used in our study were performed with small Ph fragments. It cannot be stated for certain whether changes in polymerization for these small fragments also occur in the context of the full-length protein. However, the function of full-length Ph does indeed correlate with the level of polymerization observed for small Ph fragments. Thus, at least in this indirect way of assessing fulllength Ph polymerization, the observed changes to polymerization for small Ph fragments can be extended to full-length Ph. Future investigation of Ph and its oligomeric state will be required to determine how manipulating just the SAM domain affects the oligomeric state of the entire Ph protein.
The simplest interpretation of our cellular and in vivo data is that controlling polymerization can in turn modulate the repressive and growth suppressive functions of Ph. However, it is possible that the Ph mutants used here altered other roles of Ph. For example, the inability to repress transcription and the overgrowth phenotypes of the SAM polymer-deficient mutants (Figs. 2, D and E and 6F) may be the result of disrupting Ph binding to a positive regulator. Ph SAM uses the same binding surfaces that mediate its own SAM-SAM interaction to bind directly to Scm SAM (4). The Ph SAM ML binding surface binds to the EH surface of Scm SAM with much stronger affinity (K d , 54 nM) than the Ph SAM EH binds to the Scm SAM ML surface (K d ϳ1 mM, all other Ph SAM/Scm SAM combinations do not result in detectable binding (4)). Thus, the Ph SAM EH mutant still possesses a viable Scm SAM binding surface, yet both ML and EH mutants allow the same level of luciferase expression (Fig. 2B) and show the same level of wing disc overgrowth (Fig. 2, D and E). For the results obtained with Ph RLink, replacing Ph residue 1398 -1484 may have disrupted an interaction with a negative regulator. While possible, no such protein has yet to be identified. Furthermore, we have shown that Ph RLink-mediated repression is lost when polymer-deficient mutations are introduced in Ph RLink showing that the repression is dependent on SAM polymerization (Fig. 6C).
While in vitro we have been able to increase polymerization of various Ph fragments, only Ph RLink shows similar or slightly better repressive ability in our transcription assay (Fig. 6, A and  B). This discrepancy may be the consequence of the precise structural features of the polymer structure unique to Ph that are required for Ph function. The affinity of the TEL SAM-TEL SAM interaction is ten times the strength of the Ph SAM-Ph SAM interaction and would lead to greater polymerization for the isolated TEL SAM. However, the TEL SAM polymer repeat distance is 53 Å (2) compared with 45 Å for the Ph SAM polymer (3) and may not ultimately assemble the precise chromatin architecture required for Ph-mediated repression. For the linker deletion constructs, it is possible that a certain length of linker sequence is required to create a proper repressive structure. It is worth noting that the number of residues measured between the end of the FCS domain and beginning of the SAM domain in Ph is 118. The three human Ph orthologs all have similar number of linker residues ranging from 103 to 121 amino acids (Fig. 4A). Thus, shortening the linker, while resulting in longer polymers for the small Ph fragments, may have altered the polymer structure sufficiently enough to negatively affect transcription repression ability. Ph RLink has the identical number of amino acids as wild-type Ph which may allow the fully repressive competent polymer structure to form.
Like many proteins that contain a SAM domain that can polymerize, Ph functions as a transcriptional repressor. However, neither the molecular mechanisms of repression mediated by Ph nor the role played by the SAM domain is known. While SAM polymerization provided an attractive model to explain how a repressive PcG complex might spread along chromatin, the lack of open-ended polymer structures suggests SAM polymers play an alternative repressive role. Given the results presented here, it is possible that polymeric SAM-containing proteins exist in a limited number of dynamic oligomeric states, instead of participating as an open-ended polymer. That is, they are able to take on multiple, but limited, stoichiometries to accommodate specific repressive chromatin architectures associated with different genetic loci. For Ph SAM, one possible role for the SAM domain is in mediating localized, higher order repressive chromatin structures. Scanning force microscopy studies of PRC1 bound to DNA revealed the presence of a lefthanded helical architecture where the DNA is wrapped around several PRC1 units (18). We speculate that the structure of this PRC1 oligomer is mediated by short (four to six) PRC1 units. Alternatively, it has been suggested (38) that PcG mediated long range interactions (39) may occur through in trans PRC1 recruitment of different Polycomb response elements, perhaps through Ph SAM-Ph SAM interactions. Either mechanism of gene silencing would be enhanced with increased Ph SAM polymerization with RLink. The increased polymerization may result in greater difficulty in disassembling these complexes required during gene replication thus resulting in the reduced tissue growth phenotype observed for Ph RLink (Fig. 6D).
What remains to be determined is the precise mechanism or mechanisms by which the unstructured residues influence polymerization. Conformational entropy of the linker appears to play a key role in controlling SAM polymerization as attaching RLink to Ph SAM results in smaller polymers, though not as small as Ph SAM with its native linker. This result implies that any unstructured linker attached to a polymeric SAM domain could hinder polymerization. To this point, we have been able to obtain soluble forms of polymeric SAM domains, which by themselves are insoluble due to polymerization, by attaching RLink to them. 3 The reason why an unstructured stretch of amino acids results in making shorter polymer units could stem from the helical architecture of SAM polymers. Such an architecture would make it favorable for unstructured conformations of the linker attached to the first SAM unit in the polymer to occupy conformations that would be in place to sterically hinder the addition of SAM polymer units as the extending polymer returns near the same side of the helical structure as the first SAM unit. This polymer limiting mechanism would only require conservation of the helical architecture, which is the only consistent characteristic among the known SAM polymer structures (2)(3)(4)(5)(6)(7)(8). Additional factors specific to the amino acids present in the linker also appears to play a role as a linker with the same amino acid content as the native linker but arranged in a scrambled order can still hinder polymerization as well as the native linker (Fig. 4D). We have also observed direct contacts between the native linker and SAM domain in trans (Fig. 5C), though the exact consequence of this interaction has not yet been determined. Further investigation of the role of the linker will be required to determine the mechanism and precise role of the Ph SAM linker and those of other SAM polymers.