Insights into High Affinity Small Ubiquitin-like Modifier (SUMO) Recognition by SUMO-interacting Motifs (SIMs) Revealed by a Combination of NMR and Peptide Array Analysis*

Background: The SUMO-interacting motif (SIM) mediates SUMO-dependent regulation. Results: The structure of a SUMO1-specific SIM in complex with SUMO1 is solved, and the SIM sequence requirements are identified by peptide arrays. Conclusion: SIMs bound in the parallel orientation have more strictly conserved sequence than those in the antiparallel orientation. Significance: The findings will facilitate the identification of new SIMs and the design of SIM mimetics. The small ubiquitin-like modifiers (SUMOs) regulate many essential cellular functions. Only one type of SUMO-interacting motif (SIM) has been identified that can extend the β-sheet of SUMO as either a parallel or an antiparallel strand. The molecular determinants of the bound orientation and paralogue specificity of a SIM are unclear. To address this question, we have conducted structural studies of SUMO1 in complex with a SUMO1-specific SIM that binds to SUMO1 with high affinity without post-translational modifications using nuclear magnetic resonance methods. In addition, the SIM sequence requirements have been investigated by peptide arrays in comparison with another high affinity SIM that binds in the opposing orientation. We found that antiparallel binding SIMs tolerate more diverse sequences, whereas the parallel binding SIMs prefer the more strict sequences consisting of (I/V)DLT that have a preference in high affinity SUMO2 and -3 binding. Comparison of two high affinity SUMO1-binding SIMs that bind in opposing orientations has revealed common SUMO1-specific interactions needed for high affinity binding. This study has significantly advanced our understanding of the molecular determinants underlining SUMO-SIM recognition.

The small ubiquitin-like modifiers (SUMOs) regulate many essential cellular functions. Only one type of SUMO-interacting motif (SIM) has been identified that can extend the ␤-sheet of SUMO as either a parallel or an antiparallel strand. The molecular determinants of the bound orientation and paralogue specificity of a SIM are unclear. To address this question, we have conducted structural studies of SUMO1 in complex with a SUMO1-specific SIM that binds to SUMO1 with high affinity without post-translational modifications using nuclear magnetic resonance methods. In addition, the SIM sequence requirements have been investigated by peptide arrays in comparison with another high affinity SIM that binds in the opposing orientation. We found that antiparallel binding SIMs tolerate more diverse sequences, whereas the parallel binding SIMs prefer the more strict sequences consisting of (I/V)DLT that have a preference in high affinity SUMO2 and -3 binding. Comparison of two high affinity SUMO1-binding SIMs that bind in opposing orientations has revealed common SUMO1-specific interactions needed for high affinity binding. This study has significantly advanced our understanding of the molecular determinants underlining SUMO-SIM recognition.
Intricate protein interaction networks are responsible for almost all aspects of cellular functions. These networks are mediated by a small number of common modules, of which the ubiquitin-like proteins are a special type that can be covalently attached to other proteins enzymatically. Conjugation and deconjugation of these modules allows the cell to quickly turn on and off protein-protein interactions (1,2). The small ubiquitin-like modifier (SUMO) 5 family of proteins has recently been established as an important mechanism in regulating disease pathogenesis and many essential cellular functions (3)(4)(5)(6)(7). Like ubiquitination and other ubiquitin-like modifications, the attachment of SUMO to cellular proteins is catalyzed by three types of enzymes referred to generally as E1 (activation enzyme), E2 (conjugation enzyme), and E3 (ligase). SUMO modifications can be removed by a family of SUMO-specific proteases. At least three SUMO paralogues, known as SUMO1, -2, and -3, are expressed in human cells (8,9). SUMO1 has less than 50% sequence identity with the other SUMO isoforms, and SUMO2 and -3 are nearly identical.
The SUMO-interacting (binding) motif (SIM or SBM) sequences are critical to both SUMO conjugation and SUMOmediated effects. SIMs in E3 ligases control SUMO paraloguespecific modifications (10,11). In addition, SIMs in substrate proteins promote SUMO modifications (12) or protect the substrates from deconjugation enzymes (13). The SIMs in receptor proteins are responsible for recognizing SUMOylated substrates (14) and thereby determining intracellular trafficking, protein-protein interactions, and localization of SUMOylated substrates. Unlike ubiquitin-mediated protein-protein interactions, which involve many different ubiquitin-binding motifs (15), only one type of SIM is prevalent in SUMO-mediated protein-protein interactions of all SUMO paralogues (14,16). All SUMO proteins form a ubiquitin-like fold, which contains an ␣-helix and a ␤-sheet. The SIM binds to a surface between the ␣-helix and ␤-sheet and extends the ␤-sheet as either a parallel or antiparallel ␤-strand (17). The known characteristics of SIMs are that they are short (less than 10 amino acid residues) and rich in hydrophobic residues. The molecular mechanism underlining the choice of SIM bound orientation and paralogue specificity is unclear.
Previous studies have shown that a segment spanning the IR1 (internal repeat 1)-M (spacer region) IR2 (internal repeat 2) domains of the nuclear pore protein RanBP2 (Ran-binding protein 2) contains two SIMs. One SIM is located in IR1, and the other is located at the junction of the M and IR2 regions. Further analysis demonstrated that the E3 ligase activity of RanBP2 is dependent on the SIMs (10,18). The IR1 domain could not pull down SUMO1 or SUMO2 or -3, but it could stimulate modifications involving both SUMO1 and SUMO2, suggesting that this SIM binds the SUMO proteins weakly and does not have paralogue specificity. In contrast, the M-IR2 domain only pulled down SUMO1, and not SUMO2 or -3, and only stimulated SUMO1 modifications (10), indicating its specificity and high affinity for SUMO1. The high affinity and specificity of the M-IR2 SIM to SUMO1 is independent of post-translational modifications, such as phosphorylation, as reported for the Daxx SIM (19).
Structural studies have been reported for only three SIMs, from the IR1 domain of RanBP2, PIASX (protein inhibitor of activated STAT X), and Daxx, in complex with SUMO1 (10, 19 -21) and only one SIM from MCAF1 that contains a similar sequence as PIASX in complex with SUMO3 (22). Whereas the MCAF1 SIM is specific for SUMO2/3, the IR1 and PIASX SIMs do not show preferences for a specific SUMO paralogue. The Daxx SIMs were studied by two groups that reported inconsistent findings (19,21). Thus, structural studies of the SIMs thus far have not included a high affinity SUMO1-specific SIM, for which the affinity and specificity do not depend on post-translational modifications at a site outside of the SIM.
To improve our understanding of the sequence requirements of SIMs for high affinity and high specificity interactions, we carried out NMR analysis of the structure of SUMO1 in complex with the high affinity SUMO1-specific M-IR2 SIM. We found that this SIM binds SUMO1 as an antiparallel ␤-strand. In addition, pull-down studies with peptide arrays were carried out for the M-IR2 SIM in order to better understand the SIM sequence requirements. Furthermore, the sequence requirements were also investigated for the parallel orientation based on the PIASX SIM, which binds SUMO1 with affinity similar to the M-IR2 SIM. Binding orientation-dependent and -independent features have been identified for high affinity interaction with SUMO1. These studies were supplemented with isothermal titration calorimetry and biochemical pull-down analyses. The molecular insights obtained here provide further understanding of the sequence requirements and mechanism underlying SUMO-SIM recognition.

NMR Resonance Assignments and Derivation of Structural
Constraints-The 1 H resonances of SUMO1-bound M-IR2 (DNEIEVIIVWEKK) SIM peptide (1:1 molar ratio) were assigned using standard homonuclear TOCSY and NOESY spectra using perdeuterated/ 15 N-labeled SUMO1. Such a sample in 100% D 2 O allowed for unambiguous assignment of the 13 C-1 H resonances of SUMO1-complexed M-IR2 on natural abundance 13 C. The 13 C␣ shifts were used to deduce the dihedral constraints of the bound SIM for structural calculation.
The structure determination used 13 C/ 15 N/ 2 D-enriched SUMO1 in complex with unlabeled synthetic peptide. For the SUMO1⅐M-IR2 complex, a full suite of sequential assignment experiments, HNCA, HNCOCA, HNCACB, HNCOCACB, and 15 N-edited NOESYHSQC, was acquired to obtain the backbone resonance assignments of SUMO1 in the complex. The assigned backbone ( 15 N, 13 C, 1 H) shifts allowed for calculation of the dihedral constraints of bound SUMO1 using TALOSϩ (23). NOE distance constraints used in the structure calculation were obtained from the following data sets at 250 ms mixing time: three-dimensional 15 (Table 1).
Structure Calculations-The program HADDOCK (high ambiguity-driven docking) (24) was used for docking calculations for the structure of the complex between SUMO1 and the M-IR2 SIM. No ambiguous constraints were used. For the dihedral constraints, all ⌽ and ⌿ angles predicted by TALOS for M-IR2 (residues 2706 -2716) and angles of SUMO1 residues that experienced significant amide chemical shift changes upon binding the M-IR2 SIM (regions 20 -27 and 33-56) were used. Similarly, the side chains of these regions were allowed to be flexible in the docking calculation. The HADDOCK calculation started with rigid body dockings of 1000 structures, of which 200 were further refined, and the 10 with the lowest energy were chosen for analysis.
NMR Studies to Extract Binding Affinities-Two-dimensional 15 N-1 H HSQC NMR spectra of SUMO1 or -2 (100 -200 M) were recorded at 298 K at each incremental titration of the SIM peptide (0.6 -2 mM) until no additional chemical shift changes were observed. We extracted the binding dissociation constant (K d ) by integrating line shapes of the one-dimensional 15 N slices of the two-dimensional 15 N-1 H HSQC spectra of SUMO at each titration point and fitted them to the Bloch-McConnell equations for two-state chemical exchange (free versus bound) using the LineShapeKin package (25).
Peptide Array Immunoblotting-Peptide arrays were synthesized using SPOT technology as described previously (26). The peptide array membrane was blocked with 5% skim milk in TBST (0.1 mM Tris-HCl, pH 7.4, 150 mM NaCl, and 0.1% Tween 20) for 1 h. For the PIASX-SIM peptide array, GST-SUMO1 or GST-SUMO3 protein was added directly to the blocking buffer to a final concentration of 1 g/ml and incubated with the membrane at room temperature for 1 h. The membrane was then washed three times for 5 min each with TBST before a rabbit anti-GST antibody (Santa Cruz Biotechnology, Inc., Santa Cruz, CA) was added. The membrane was allowed to incubate at room temperature for 30 min prior to three 5-min washes with TBST. A goat anti-rabbit GST-horseradish peroxidase conjugate was then added and incubated with the membrane for another 30 min. After three final 5-min washes in TBST, the peptide array membrane was visualized by enhanced chemiluminescence.
For peptide array based on the M-IR2 SIM, competitive SUMO1 and SUMO3 binding was used as described below. Peptide arrays were first treated with blocking buffer (LI-COR) for 2 h and then incubated with SUMO1 and SUMO3 (10 g/ml) in PBS overnight. The array was then washed three times with 500 mM NaCl in phosphate buffer (pH 7.5) to remove unbound SUMO. Subsequently, mouse anti-SUMO1 (1:1000; Abgent) and rabbit anti-SUMO2/3 (1:2000; Abcam) were incubated with the array for 2 h at room temperature. Unbound antibodies were removed by three washes in PBST. Secondary antibodies (donkey anti-mouse CW800 and donkey anti-rabbit CW680 (1:2000; LI-COR) were incubated with the array for 1 h at room temperature. After three washes with PBST, the array was imaged using the Odyssey imaging system (LI-COR).

Structure of Highly Specific SUMO1-binding SIM in Complex
with SUMO1-As discussed in the Introduction, the M-IR2 domain of RanBP2 harbors a high affinity SIM that is specific to SUMO1 in the absence of a post-translational modification. We conducted solution structural studies of this SIM in complex with SUMO1. The boundary of the SIM spans the segment DNEKECIIVWEKK (residues 2705-2717), identified from chemical shift perturbation analysis of 13 C/ 15 N-labeled M-IR2 domain in complex with SUMO-1 and confirmed by a synthetic peptide (supplemental Table S1). In addition, substituting Cys in this segment with a Val to avoid potential complications due to oxidation of the Cys side chain -SH group did not change the SUMO1 binding affinity and specificity (supplemental Fig. S1). We also substituted the first N-terminal Lys of this peptide to an Ile (K2708I) in order to resolve resonance overlap. Consistent with the biochemical pull-down data (supplemental Fig.  S1), the K2708I peptide did not alter binding specificity to SUMO1, as indicated by the nearly identical SUMO NMR spectra between SUMO bound to K2708I and to the wild-type SIM (supplemental Figs. S2 and S3). Measurement of the binding affinity by isothermal titration calorimetry was not successful due to a small enthalpy change, suggesting an entropy-driven mechanism for the recognition. Thus, using NMR chemical shift perturbation and line shape analysis, the K d was estimated to be ϳ1.8 M for SUMO1 and 22 M for SUMO2, consistent with the high affinity and specificity of M-IR2 for SUMO1 (Fig.  4C and supplemental Table S1). Structure determination was carried out using the K2708I peptide (DNEIEVIIVWEKK).
The bound SIM showed a well defined conformation in the middle of the peptide, with flexible termini, as shown by superimposition of the 10 best structures ( Fig. 1A and Table 1). This is consistent with changes of line widths and chemical shifts of the 1 H␣ and 13 C␣ resonances between the free and bound peptide (Fig. 1B). Residues Asp-2705, Asn-2706, and Lys-2717 had sharp resonances, suggesting that they undergo fast dynamics and do not form tight interactions with SUMO1 (Fig. 1B). However, the negatively charged residues Asp-2705 to Glu-2707 at the N terminus are adjacent to a positively charged patch on the SUMO1 surface, and thus, these residues probably contribute to the electrostatic interactions. Molecular Details of SUMO1-SIM Interactions-In order to identify features that contribute to the high affinity SUMO1 binding, the structure was compared with that of another high affinity SUMO1-binding SIM that bound in the opposing orientation, the SIM from PIASX, as we previously characterized (16). Both the M-IR2 and PIASX SIMs have K d values in the low micromolar range for binding SUMO1, and their affinities for SUMO1 are among the strongest of the characterized SIMs. Both SIM peptides bind between the ␣-helix and a ␤-strand of SUMO1, but they extend the ␤-sheet in opposing orientations (Fig. 2). The antiparallel binding M-IR2 SIM backbone displayed an extended conformation, whereas the parallel binding PIASX SIM showed a curved backbone that appeared to accommodate optimal side chain contacts. This comparison indicates that there are key differences in the binding mechanisms for these two high affinity SIMs.
The 2710 VIIV 2713 segment of the M-IR2 SIM and 4 VIDL 7 segment of the PIASX SIM form interactions with the same residues on SUMO1, which include residues 35-39 on the ␤-strand and residues 46, 47, 50, and 54 on the ␣-helix (Fig. 2). The hydrophobic SUMO residues Phe-36 and Leu-47 form the bottom of the binding groove with residues Lys-37, Lys-46, and Ser-50 forming the sides. For both SIMs, three notable backbone hydrogen bonds are formed between the core residues of the SIMs and residues His-35 and Lys-37 of the SUMO1 ␤-strand.  (21), although it does not appear to participate in binding the SIM-C of Daxx (19), which was reported to undergo dynamic exchange between parallel and antiparallel bound orientations (21).
The ␤-strand adjacent to the ␤-strand that forms the SIMbinding groove contributes to binding of both SIMs. Glu-2709 (M-IR2) forms a hydrogen bond with the aromatic -OH group of SUMO1 Tyr-21, but the equivalent H17 side chain of SUMO2/3, which lacks the -OH group, may not form this hydrogen bond with the Glu-2709 side chain (Fig. 2C). In the PIASX SIM, which binds all SUMO paralogues with similar affinities, the T8 methyl group forms hydrophobic interactions with the SUMO1 Tyr-21 aromatic ring (Fig. 2D) (16). His-17 of SUMO2/3 is expected to form similar interactions with Thr-8. Similar contacts involving Tyr-21 were not observed in the Daxx SIM-C complex with SUMO1, which is the only other SUMO1-SIM structure available to date (19,21).
Flexibilities at SUMO-SIM Interface-NMR resonances contain information regarding conformational flexibilities. The interface of the parallel M-IR2 SIM in complex with SUMO1 is more flexible than that of the antiparallel PIASX SIM in complex with SUMO1. Upon forming a complex with SUMO1, a key interacting residue, Trp-2714 of the M-IR2 SIM peptide, displayed two resonances for the aromatic side chain epsilon H N , indicating that this side chain is in at least two distinct conformational states that are in exchange on a time scale slower than the NMR chemical shift (Fig. 3A). This is not due to exchange between the parallel and antiparallel orientations, as observed for the Daxx SIM (21), because other SIM residues did not produce two sets of resonances, and no evidence of another bound orientation was observed in intermolecular NOEs. The intensity of each peak was proportional to the population of each conformational state, and their relative ratios changed with temperature, consistent with the temperature-based modulations of exchange kinetics and population between the different conformational states due to changes of the energy barrier (Fig. 3A). Additionally, more severe line-broadening effects of SUMO1 resonances were observed when SUMO1 was in complex with the M-IR2 SIM than with the PIASX SIM (Fig. 3, B and C) (16). The line-broadening effect on SUMO1 upon SIM residue was replaced with each of the 20 naturally occurring amino acid residues. To investigate the SUMO paralogue specificity, SUMO1 and SUMO3 were used in competition for binding the peptide array, followed by simultaneous identification of bound SUMO1 and SUMO3 by their respective antibodies ( Fig. 4A and supplemental Fig. S5). However, the peptides in the array did not bind SUMO2/3 well, suggesting that single amino acid substitution did not lead to enhanced SUMO2/3 binding. The peptide array results were validated by introducing mutations into the entire M-IR2 domain, followed by pulldown experiments (supplemental Fig. S1). In addition, a series of peptides were synthesized, and their interactions with either SUMO1 or SUMO2 were investigated by NMR studies (supplemental Table S1 and Fig. S4). This array did not show a strict sequence requirement of the M-IR2 SIM. The essential requirement is that the central segment of five residues, WVIIV, be made of bulky hydrophobic or aromatic residues with propensities to form ␤-strands (27) (Fig. 4B). These residues contact the key residues in SUMO1, including Phe-36, Lys-37, Lys-46, Leu-47, and Ser-50 (Fig. 2C). The specificity of the M-IR2 SIMs for SUMO1 is also demonstrated by the much larger chemical shift perturbation generated upon binding SUMO1 than binding SUMO3 (Fig. 4C).
A similar peptide array experiment was conducted for the PIASX SIM, which binds SUMO1 in the opposing orientation. Because the PIASX SIM ( 1 KVDVIDLTIE 10 ) binds to all SUMO paralogues with similar affinities (14,17), the peptide array was synthesized in two identical copies on cellulose membranes (28); one array was used for binding SUMO1, and the other was used for binding SUMO3. Each position of the PIASX SIM (vertical sequences, Fig. 5, A and B, and supplemental Fig. S6) was substituted by each of the 20 amino acid residues (horizontal sequences). Because SUMO2 and -3 have nearly identical amino acid sequences in the SIM-binding surface, SUMO3 was used as a representative of the two. Binding of GSTtagged SUMO1 or SUMO3 was detected by an anti-GST Western blot, and the relative binding affinities are indicated by the signal strength (Fig. 5, A and B, and supplemental Fig.  S6). The peptide array results were validated by measuring the binding affinities of synthetic peptides to SUMO1 and SUMO2 by isothermal titration calorimetry (supplemental Table S2 and supplemental Fig. S7). In striking contrast to the M-IR2 SIM, residues in the middle of the PIASX SIM were very restricted, with the Asp-6 and Leu-7 residues being strictly required for high affinity binding to both SUMO1 and SUMO2/3 (Fig. 5C). This array has identified the sequence requirements of SUMO2/3-specific SIM. Substitution of Val-2 of the PIASX SIM by a Gly or an Ala significantly reduced SUMO1 but not SUMO2 binding affinity ( Fig. 5 and supplemental Fig. S6). The sequence requirements for binding SUMO-2/3, (I/V)DLT, identified by peptide array, have been validated by recent findings of SUMO2/3-specific SIM sequences (11,12,22).
SUMO Paralogue-specific SIMs Confer Paralogue Specificity in Cells-To examine the cellular SUMO paralogue specificity of the SIMs identified from the studies described above, colocalization of the SIMs with SUMO1 and SUMO2/3 was investigated in cells. MCF-7 cells were transfected with pcDNA-FLAG-SIM1-GFP and pcDNA-FLAG-SIM2-GFP, for which SIM1 corresponds to the SUMO1-specific SIM (M-IR2) used for structure determination discussed above, and SIM2 corresponds to the Val-2 to Gly mutation of the PIASX SIM that demonstrated specificity for SUMO2/3. The SIM1 and SIM2 fusion proteins were identified with anti-FLAG M2 and FITC-conjugated anti-mouse antibodies, whereas rabbit anti-SUMO1, rabbit anti-SUMO2/3, and Texas Red-conjugated donkey anti-rabbit antibodies were used to identify SUMO-1 and SUMO2/3. The subcellular localizations of the SUMO paralogues are distinct (29); SUMO1 mainly associates with the . The labeled residues represent part of the SIM recognition region on SUMO1, except for Leu-65 (L65), which is not at the binding interface and was used as an internal control for relative peak intensity analysis. The selective broadening of resonances encompassing the recognition ␣-helix and ␤-sheet of SUMO1 was normalized against Leu-65 and is shown in the bar charts. Differential peak intensities reflect differences in line-broadening effects, due to chemical exchange dynamics on the microsecond to millisecond time scale. Missing bars indicate overlapped resonances. The average peak intensity is depicted by horizontal lines.
nuclear membrane and nucleoli, whereas SUMO2 and -3 localize to PML nuclear bodies and the nucleoplasm. The paraloguespecific SIMs specifically colocalized with SUMO1 and SUMO2/3 in these locations in MCF-7 cells. SIM1 was mainly recruited to the nuclear membrane and nucleoli, whereas SIM2 colocalized strongly with nuclear foci that contained SUMO2, some of which also contained SUMO1 (Fig. 6). In addition, in 293T cells, a cell line with faster proliferation, ϳ50% of SIM1 was also present in the nucleoplasm. The difference in SIM1 localization in rapidly proliferating cells was probably due to dynamic SUMO1 localization at different points in the cell cycle (30,31). In contrast to SIM1, the subcellular localization of SIM2 in 293T and MCF-7 cells was similar. Taken together, these results suggest that the paralogue-specific SIMs identified by the studies described above confer the same paralogue specificity in cells and can be used to probe SUMO paralogue-specific protein-protein interactions, as shown previously (32).

DISCUSSION
Insights into Orientation-dependent SIM Sequence Requirements-The combination of structural analysis and peptide arrays described here has revealed striking differences in the sequence requirements of the SIMs bound in the opposing orientations. This is probably due to the fact that antiparallel ␤-strand orientation usually has a more favorable hydrogen bond geometry than the parallel ␤-strand. That the antiparallel M-IR2 SIM tolerates more diverse amino acid sequences than the parallel PIASX SIM indicates that backbone hydrogen bonds, as well as the ␤-strand propensity of the SIM residues (27), make more significant contributions than specific side chain interactions to formation of the complex. In contrast, the side chain interactions between the PIASX SIM and SUMO1 appear to make more significant contributions to binding than the M-IR2 SIM, as suggested by the more strict sequence requirements. The importance of side chain interactions in the SUMO1 complex with the PIASX SIM relative to that with the M-IR2 SIM is also evident from the structures, in which each of the PIASX SIM side chains fits well into a specific pocket, whereas SIM backbone "twists" to accommodate the side chain contacts (Fig. 2B). In contrast, the backbone of the M-IR2 SIM appears to be more linear and "relaxed" (Fig. 2A). The less restricted SIM residues in the M-IR2 SIM as compared with the PIASX SIM are also consistent with the greater conformational flexibility observed at the interfaces of SUMO1 with M-IR2 than with PIASX.
Common Features for High Affinity SUMO1 Binding-Comparison of the structures of SUMO1 in complexes with the M-IR2 and PIASX SIMs, which both bind to SUMO1 with similarly high affinities, has also suggested key interactions for high affinity SUMO1 recognition. Although the M-IR2 and PIASX SIMs bind in opposing orientations, the interaction of Trp-2714 of M-IR2 or Val-2 of PIASX with His-35 of SUMO1 is required for SUMO1 binding affinity. As shown by peptide arrays, substitution of Val-2 by bulky and hydrophobic residues (Ile, Tyr, and Trp) is required for maintaining the binding affinity for SUMO1 but not for SUMO2 ( Fig. 5 and supplemental Fig. S6). Similarly, substitution of Trp-2714 by residues that are not bulky and hydrophobic resulted in severe reduction of the binding affinity to SUMO1 (Fig. 4). Because His-35 is not homologous among the SUMO paralogues, the findings suggest that it is a key residue for specificity and affinity for SUMO1. All SUMO2/3-specific SIM sequences identified to date contain the highly conserved (V/I)DLT sequence and not the hydrophobic residue required for interactions with His-35 (11,12,22).
Paralogue Specificity of SUMO-SIM Interactions-Although the M-IR2 SIM has preference for SUMO1, it also binds SUMO2 or -3 but with 10-fold lower affinity (supplemental Figs. S2-S4). The central residues of the M-IR2 SIM bind the same SUMO1 core residues, including Phe-36, Lys-37, Lys-46, Leu-47, and Ser-50, that also participate in binding other SIMs, regardless of the bound orientation. These residues are identical across the SUMO paralogues. Thus, it is expected that the SIMs with preference for one SUMO paralogue can also bind another paralogue although with a lower affinity. The 10-fold difference of the M-IR2 SIM for binding SUMO1 compared with SUMO2 or -3 corresponds to ϳ1.4 kcal/mol in free energy. Such a difference can result from factors not detected in the structures. For example, formation of a SUMO-SIM complex probably involves the removal of water molecules bound to the hydrophobic surface of a SUMO protein. If the number of water molecules removed from SUMO1 and SUMO2 or -3 differs by one, the difference in the free energy of the interaction can be as much as 2 kcal/mol (33), more than enough to account for the observed difference in specificity. Further studies are required FIGURE 6. In vivo SUMO paralogue specificity of the SIMs. A and B, colocalization of FLAG-tagged and GFP-fused SUMO1-specific SIM (SIM1; green) (A) and SUMO2/3-specific SIM (SIM2; green) (B) with SUMO1 (red) in MCF-7 cells. The SIM1 used is identical to that used for the structure determination (Fig. 1). SIM2 is the V2G mutant of the PIASX SIM. C and D, colocalization of SIM1 (green) (C) or SIM2 (green) (D) with SUMO2/3 (red) in MCF-7 cells. A-D, nuclear DNA was stained with DAPI (blue). Scale bars, 5 m.
to understand the differences of the different SUMO paralogues.
In summary, this study has significantly improved our understanding of the molecular mechanism underlying SIM sequence requirements and preferences for SUMO1. The M-IR2 SIM confers SUMO1 specificity in cells, and the V2G substitution of the PIASX SIM confers SUMO2/3 specificity in cells. Thus, the molecular insights obtained can form the basis for the design of SUMO paralogue-specific reagents that will be useful for cellular studies and future medical applications.