Phosphoproteomics reveals that glycogen synthase kinase-3 phosphorylates multiple splicing factors and is associated with alternative splicing

Glycogen synthase kinase-3 (GSK-3) is a constitutively active, ubiquitously expressed protein kinase that regulates multiple signaling pathways. In vitro kinase assays and genetic and pharmacological manipulations of GSK-3 have identified more than 100 putative GSK-3 substrates in diverse cell types. Many more have been predicted on the basis of a recurrent GSK-3 consensus motif ((pS/pT)XXX(S/T)), but this prediction has not been tested by analyzing the GSK-3 phosphoproteome. Using stable isotope labeling of amino acids in culture (SILAC) and MS techniques to analyze the repertoire of GSK-3–dependent phosphorylation in mouse embryonic stem cells (ESCs), we found that ∼2.4% of (pS/pT)XXX(S/T) sites are phosphorylated in a GSK-3–dependent manner. A comparison of WT and Gsk3a;Gsk3b knock-out (Gsk3 DKO) ESCs revealed prominent GSK-3–dependent phosphorylation of multiple splicing factors and regulators of RNA biosynthesis as well as proteins that regulate transcription, translation, and cell division. Gsk3 DKO reduced phosphorylation of the splicing factors RBM8A, SRSF9, and PSF as well as the nucleolar proteins NPM1 and PHF6, and recombinant GSK-3β phosphorylated these proteins in vitro. RNA-Seq of WT and Gsk3 DKO ESCs identified ∼190 genes that are alternatively spliced in a GSK-3–dependent manner, supporting a broad role for GSK-3 in regulating alternative splicing. The MS data also identified posttranscriptional regulation of protein abundance by GSK-3, with ∼47 proteins (1.4%) whose levels increased and ∼78 (2.4%) whose levels decreased in the absence of GSK-3. This study provides the first unbiased analysis of the GSK-3 phosphoproteome and strong evidence that GSK-3 broadly regulates alternative splicing.

Glycogen synthase kinase-3 (GSK-3) 2 is a ubiquitously expressed, highly conserved serine/threonine kinase that plays a central role in multiple signaling pathways, most prominently as an antagonist of insulin/AKT and Wnt/␤-catenin signaling pathways (1). Unlike most protein kinases, GSK-3 is constitutively active and in general is inhibited by upstream signaling. GSK-3 was first identified as a protein kinase that phosphorylates and inhibits glycogen synthase, the rate-limiting enzyme in glycogen synthesis (2) and was later shown to antagonize canonical Wnt signaling (1). However, GSK-3 is now known to regulate a broad range of cellular processes, including cytoskeletal organization, circadian rhythm, cell growth and survival, immune responses, and developmental processes (1).
In metazoans, GSK-3 is encoded by two similar genes, Gsk3a and Gsk3b, with 98% amino acid sequence identity in the catalytic domains of mammalian GSK-3␣ and GSK-3␤. The two genes are partially redundant, and mice with complete loss of Gsk3a are viable due to compensation by Gsk3b. However, Gsk3b loss-of-function mutations in mice are embryonic or neonatal lethal (3,4), and the Gsk3a;Gsk3b DKO is lethal in early embryogenesis (5,6). Furthermore, DKO mouse ESCs maintain expression of pluripotency markers and are unable to differentiate into most embryonic lineages either in vitro (as embryoid bodies) or in vivo (as teratomas) (6). Here, we will refer to the two genes together as Gsk3 (or GSK-3 for the protein) unless referring to isoform-specific features.
Hormones and growth factors, such as insulin, activate receptor tyrosine kinases that in turn initiate PI3K/AKT signaling. AKT phosphorylates serine 21 of GSK-3␣ and serine 9 of GSK-3␤, thereby inhibiting activity. GSK-3 tends to phosphorylate substrates that are prephosphorylated ("primed") at the ϩ4-position; the phosphorylated N terminus of GSK-3 mimics these primed substrates, creating a pseudosubstrate inhibitor of catalytic activity (7). The (S/T)XXX(S/T) consensus was first demonstrated in glycogen synthase; phosphorylation by casein kinase II at serine 657 leads to processive phosphorylation by GSK-3 at serines 653, 649, 645, and 641 (8). The requirement of the consensus sequence has since been demonstrated for multiple GSK-3 substrates and is often used to predict GSK-3 phosphorylation sites (7, 9 -12). However, several established GSK-3 substrates do not have a serine or threonine at the ϩ4-position, including cyclin D1, c-Myc, and protein phosphatase-1 inhibitor-2.
GSK-3 is also inhibited by mechanisms that are independent of its N-terminal phosphorylation. Most importantly, Wnt-mediated inhibition of GSK-3 catalytic activity does not require N-terminal phosphorylation, as mice with serine-to-alanine knock-in mutations in Gsk3a and Gsk3b display normal Wnt signaling (6,13,14). Inhibition of GSK-3 by Wnts may instead involve rapid dissociation of APC (15,16), interaction with the phosphorylated C terminus of LRP5/6 (17), and, with slower kinetics, sequestration of GSK-3 into multivesicular bodies (18).
Although GSK-3 has been proposed to phosphorylate many substrates, these experiments were conducted using a variety of approaches and experimental settings, including in vitro kinase assays and pharmacological inhibition in diverse cell types. In vitro protein kinase assays may not recapitulate in vivo regulation, especially for substrates that require priming phosphorylation by unknown protein kinases, and off-target effects of small molecule inhibitors (20) can confound the analysis. Furthermore, the range of substrates and cellular processes regulated by GSK-3 in a single cell type has not previously been examined using specific inhibition of GSK-3 (e.g. by gene knockouts). We therefore used a largescale proteomic approach to characterize the GSK-3 phosphoproteome and determine the range of GSK-3 substrates within a single cell type, comparing wild type with cells with a complete loss of Gsk3 (6). Specifically, stable isotope labeling by amino acids in culture (SILAC) (21) was used to metabolically label the proteome of wild-type and Gsk3a/b double-knock-out (DKO) mouse embryonic stem cells (ESCs). To our knowledge, this is the first study to conduct a global analysis of GSK-3-dependent phosphorylation.

Global phosphoproteomic analysis of GSK-3-dependent phosphorylation
To characterize the GSK-3 phosphoproteome, we performed SILAC and quantitative MS using wild-type and Gsk3 DKO mouse ESCs (Fig. 1A). DKO ESCs were cultured in heavy isotope-labeled amino acids for at least 4 passages to achieve complete labeling of the proteome (supplemental Fig. 1). Wild-type ESCs were cultured in parallel in light amino acid medium.
Before protein isolation, expression of the stem cell markers Oct4 and Nanog was assessed by flow cytometry to establish homogeneity of the two-cell populations; in the three replicates, 70 -92% of wild-type cells and 75-90% of DKO cells expressed high levels of both markers (Fig. 1B). Deletion of GSK-3␣ and GSK-3␤ and activation of Wnt signaling by ␤-catenin accumulation were confirmed by Western blotting (Fig. 1C). Whole-cell lysates in biological triplicate were digested with trypsin, and phosphorylated peptides were enriched using titanium dioxide (TiO 2 ) chromatography and analyzed by LC-MS/MS on an LTQ-orbitrap (Fig. 1A). The unbound peptides from the TiO 2 flow-through were used for non-modified, proteomic identification. The level of each phosphopeptide was normalized to total protein abundance. (All MS raw files are available at https://chorusproject.org, 3 project number 1290.) Overall, we identified 5858 phosphosites belonging to 1939 phosphoproteins. Most phosphosites were identified from peptides carrying a single phosphorylation (Fig. 1D, right). The relative frequency of phosphorylation of serine/threonine/tyrosine was 88:11:0.4, comparable with the 80:20:1 distribution known from most mammalian reports (Fig. 1D, left) (22). The overwhelming majority (80%) of identified phosphorylation sites could be localized with a probability of Ն85% to specific positions on the amino acid sequence (supplemental Fig. 2).
Known GSK-3 substrates were identified in this analysis, with reduced phosphorylation in DKO cells observed for the microtubule associated proteins MAP1b, MAP2, and tau (although tau was expressed at low abundance in ESCs, and the phosphopeptide was detected in only one biological replicate) as well as the GSK-3 substrates c-Jun and cryptochrome-2 (phosphorylated Cry2 was present in wild-type cells but was undetectable in cells lacking Gsk3). Furthermore, the level of ␤-catenin protein increased in DKO cells, as predicted and consistent with the Western blotting (Fig. 1C). However, ␤-catenin phosphopeptides were not detected in our analysis, most likely because the phosphorylated form of ␤-catenin is rapidly degraded by the proteasome (23).
Using a p value Ͻ 0.05, we found 404 phosphopeptides representing 269 unique proteins that were differentially phosphorylated in DKO compared with wild-type ESCs. Filtering the data set to include only phosphopeptides with an absolute -fold change of Ն1.5-fold, we found 89 phosphopeptides from 65 unique proteins (3.4% of all phosphoproteins identified) that had significantly reduced phosphorylation in DKO ESCs compared with wild type (Fig. 2 (A and B) and supplemental Table  1). Of these phosphorylation events, 88% were on a serine residue and 12% were on threonine. It should be noted that SILAC does not distinguish whether these phosphorylation events are mediated directly or indirectly by GSK-3; for example, we also detected an increase in phosphorylation of 72 phosphopeptides representing 59 unique proteins in the DKO ESCs (Fig. 2A and  supplemental Table 2), consistent with indirect regulation of these proteins by GSK-3.
Notably, 143 phosphopeptides from 120 proteins were identified in wild-type cells in two or more biological replicates but were undetectable in Gsk3 DKO ESCs (Fig. 2B). The absence of these phosphopeptides in DKO cells may indicate an absolute requirement for GSK-3-dependent phosphorylation and may further suggest that as many as 185 proteins are phosphorylated in a GSK-3-dependent manner in ESCs. In addition, 65 phosphopeptides from 53 proteins were identified in Gsk3 DKO ESCs but were undetectable in wild-type cells, which may represent indirect regulation by GSK-3.

Phosphoproteome analysis identifies novel candidate GSK-3 substrates
To analyze GSK-3-dependent phosphorylation events further, we selected the 89 phosphosites that were reduced in DKO compared with wild type as high-confidence GSK-3-dependent phosphorylation sites. GSK-3 commonly engages with "primed" substrates that have been prephosphorylated at a serine or threonine 4 residues C-terminal to the GSK-3 site, providing the loose consensus (S/T)XXX(S/T) in which the first serine or threonine (S/T) is a GSK-3 site and (S/T) at the ϩ4-position has been previously phosphorylated by a "priming" protein kinase (Fig. 2C). However, the prevalence of this consensus has not previously been addressed in a systematic manner in a single cell type. We therefore asked how many of the high-confidence sites identified in ESCs contain the putative GSK-3 consensus sequence. Among GSK-3-regulated phosphopeptides, serine is the most common residue at the ϩ4-position, occurring in 20 phosphopeptides, and threonine is present at the ϩ4-position in an additional six phosphopep- Figure 1. Global phosphoproteome analysis using SILAC. A, experimental work flow. Cells were cultured for four passages in medium supplemented with either heavy (Arg 10/Lys 8) or light (Arg 0/Lys 0) amino acids to ensure complete incorporation. Whole-cell lysates were digested with trypsin, and phosphorylated peptides were enriched using titanium oxide columns. Flow-through proteins were used to identify the proteome. B, flow cytometry for the ESC pluripotency markers Oct4 and Nanog indicates homogeneity of the wild-type and DKO cell populations. C, Western blotting to assess Gsk3 knock-out (middle) and Wnt signaling activation by ␤-catenin protein accumulation (top). GAPDH is a loading control. D, overview of phosphorylation sites identified in the screen.

GSK-3 phosphoproteome
tides (Fig. 2C). These data indicate that 29% of the GSK-3-dependent phosphosites have the potential to be primed by phosphorylation at the ϩ4-position. (The data in Fig. 2C refer to phosphopeptides that were reduced but still detectable in DKO cells). Similarly, 30% of phosphopeptides had a serine or threonine at the ϩ4-position when we included the 143 phosphopeptides that were detected only in Gsk3 wild-type cells (data not shown). In contrast, serine and threonine together accounted for 18% of residues at the ϩ4-position of GSK-3independent phosphopeptides (supplemental Fig. 3A). These data suggest a modest enrichment for serine and threonine at the ϩ4-position relative to GSK-3-dependent phosphorylation sites.
We confirmed this analysis using the pLogo generator tool, which employs iterative comparisons of selected peptide sequences to background peptide sequences to extract significantly enriched motifs (24,25). Although we did not observe statistically significant enrichment of any residues, the most common residues at the ϩ4-position in the set of GSK-3-dependent phosphopeptides were serine and threonine (supplemental Fig. 3B). Taken together, these analyses suggest a bias for serine or threonine at the ϩ4-position but also indicate that the majority of GSK-3-dependent phosphorylation sites do not contain the (S/T)XXX(S/T) consensus.
As an alternative approach, we performed in vitro GSK-3 phosphorylation using denatured cell lysates from DKO ESCs as a source of substrates. With this approach, substrates that require prephosphorylation at the ϩ4-position should be phosphorylated by endogenous kinases, and GSK-3 sites should be unphosphorylated. To distinguish sites phosphorylated by GSK-3, we used [␥- 18 O]ATP for in vitro labeling. Filtering for a Ն2-fold increase in 18 O-phosphoryl groups (p Ͻ 0.05), we identified 370 peptides that could be phosphorylated by GSK-3 in vitro, and 23% of these had serine or threonine at the ϩ4-position. (These data are available at https://chorusproject.org, 3 project 1290.) Thus, consistent with the SILAC data and published data, GSK-3 readily phosphorylates substrates lacking the (S/T)XXX(S/T) motif.
As 20% of the human proteome contains a (S/T)XXX(S/T) motif, a large proportion of the proteome has been predicted to include potential GSK-3 substrates (10,18). We therefore asked whether the (S/T)XXX(S/T) motif predicts GSK-3-dependent phosphorylation. Similar to published predictions, 1075 (18%) of all phosphopeptides detected here have a serine or threonine at the ϩ4-position, but only 26 (2.4%) of these phosphopeptides showed reduced phosphorylation in DKO ESCs (Fig. 2D). Broken red lines indicate peptides with an absolute change of 1.5-fold or greater, p value Յ 0.05. B, distribution of phosphopeptides that were significantly reduced in Gsk3 DKO (red), phosphopeptides present in wild type but undetectable in DKO (yellow), and phosphopeptides that did not change significantly (blue). C, frequency of each amino acid at the ϩ4-position relative to the GSK-3-dependent phosphorylation site (inset shows reported GSK-3 consensus sequence with serine or threonine at ϩ4-position (potential priming sites)). D, frequency of GSK-3-dependent phosphopeptides with serine (1.9%) or threonine (0.6%) at the ϩ4-position as a percentage of all 1075 identified phosphopeptides with serine or threonine at the ϩ4-position.

GSK-3 phosphoproteome
We next conducted a gene ontology (GO) analysis using the Database for Annotation, Visualization, and Integrated Discovery (DAVID, version 6.8) to categorize the 65 candidate GSK-3 substrates based on previously reported functional terms ( Fig.  3A and supplemental Table 9) (26). Significantly enriched (p Ͻ 0.05) GO terms include the following: mRNA processing (10 genes), RNA splicing (8 genes), regulation of transcription (14 genes), cell cycle (7 genes), and DNA repair (6 genes). Similar categories were enriched when GO and DAVID analyses were repeated to include the 120 phosphoproteins detected in wildtype but not in Gsk3 DKO cells (supplemental Table 9), including mRNA processing (14 genes), RNA splicing (12 genes), negative regulation of transcription (19 genes), transcription regulation (35 genes), and cell cycle (13 genes).
To identify potential relationships between the identified substrates, we analyzed direct and indirect interactions among the proteins enriched in the GO analysis using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database (27). This tool provides a confidence score for each interaction based on known experimental data, pathway knowledge from curated databases, text mining, and co-expression. The analysis enriched for three networks of proteins with related function: alternative splicing, transcriptional regulators, and cell division (Fig. 3B).
Identification of GSK-3-dependent phosphoproteins associated with transcriptional regulation is consistent with published observations that multiple transcription factors are phosphorylated by GSK-3, leading to changes in protein stability, DNA binding, and subcellular localization (12,28). For example, the transcription factor c-Jun, a known GSK-3 substrate, was included in this cluster; phosphorylation by GSK-3 inhibits c-Jun DNA binding activity and also targets c-Jun for proteasomal degradation (29 -32). Although GSK-3 phosphorylation of c-Jun at threonine 239 has been shown previously to target
We also identified a network of candidate substrates with functions related to cell division (Fig. 3B, Cell Division). Ajuba (Jub), a regulator of Aurora A kinase (36,37), was one of the strongest candidates in this cluster, with a 3-fold reduction in GSK-3-dependent phosphorylation at serine 129 (Fig. 3B), a site previously shown to be phosphorylated by other protein kinases to regulate entry into mitosis (37,38). Septin-9 (SEPT9), a member of the septin family of cytoskeletal proteins that regulates multiple functions, including cytokinesis and chromosome segregation (39,40), is phosphorylated at serine 85, also a GSK-3 consensus site, and this phosphorylation was reduced almost 2-fold in DKO ESCs. A weaker signal suggestive of GSK-3-dependent phosphorylation was also observed at serine 82 in SEPT9, and, although not detected in all replicates, reduced phosphorylation was also observed in SEPT1 serine 247 and SEPT5 serine 225. Other candidates included cyclin-dependent kinase 1 (CDK-1) and CDC20, an activator of the anaphase-promoting complex and component of the spindle assembly checkpoint.

Phosphoproteome analysis identifies a role for GSK-3 in alternative splicing
Multiple proteins involved in alternative mRNA splicing were identified in the STRING analysis (Fig. 3B). RNA splicing and mRNA processing were also the most significantly enriched terms in GO analysis ( Fig. 3A) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Moreover, previous work has shown that GSK-3 phosphorylates PSF, a proline-and glutamine-rich splicing factor, to regulate alternative splicing of CD45 in human T-cells (41), but broader roles for GSK-3 in the phosphorylation of splicing factors and regulation of alternative splicing have not been addressed.
Many of the identified splicing factors contain arginine-serine (RS) dipeptide motifs, including multiple SR family members, SR family-related factors, and other RS-containing splicing factors. These include, for example, RNA-binding protein 8A (RBM8A, also known as Y14), SRSF9, TRA2B (SRSF10), CELF1, and RBM39 ( Fig. 3B and Table 1). We selected RBM8A, a component of the exon-junction complex, for further characterization. The SILAC/MS data identified GSK-3-dependent phosphorylation of RBM8A (without a change in overall protein level, supplemental Fig. 4) at serine 166 and/or serine 168, which lie within adjacent RS motifs. Prior work has shown that phosphorylation of these residues inhibits RBM8A interaction with other exon-junction complex components (42,43). As these sites do not have a ϩ4 priming site, GSK-3 should be able to phosphorylate recombinant RBM8A in vitro without requiring prephosphorylation. Recombinant RBM8A was phosphor-ylated by GSK-3, as detected by reduced mobility (Fig. 4A, red arrow) in a Phos-tag/polyacrylamide gel, in which a phosphatebinding moiety slows mobility of phosphorylated species relative to non-phosphorylated forms (44).
To confirm phosphorylation within the RS motif, we performed MS of in vitro phosphorylated RBM8A and identified serine 168 as a site of direct phosphorylation by GSK-3 (Fig. 4B  and supplemental Table 3), consistent with the observations from SILAC. GSK-3 also phosphorylated SRSF9 in vitro, as detected by reduced mobility in standard SDS-PAGE (not shown) and by MS (supplemental Table 3). In parallel, we also assessed GSK-3 phosphorylation of PSF (41). Prior work had shown that PSF function is sensitive to a small molecule GSK-3 inhibitor, and mutagenesis studies suggested threonine 679 (equivalent to threonine 687 in the standard human nomenclature) as a likely site of phosphorylation. MS after in vitro phosphorylation of recombinant PSF confirmed that threonine 679 is phosphorylated directly by GSK-3 ( Fig. 4C and supplemental Table 3). We also detected in vitro phosphorylation of PSF at serine 33 and threonine 468 (supplemental Table 3). Overall, we demonstrate that GSK-3 directly phosphorylates three splicing factors, RBM8A, SRSF9, and PSF.
We also selected nucleophosmin 1 (NPM1) and PHF6 for further validation, as they appeared in multiple enriched functional categories related to RNA biosynthesis, and both are commonly mutated in acute myeloid leukemia (AML). NPM1 is a nucleolar protein with diverse functions in the cell, including ribosome biogenesis, chromatin remodeling, and mRNA processing, in which it has been reported to function as a negative regulator of alternative splicing (45,46). We observed 11 NPM1 phosphopeptides that have also been identified in previous phosphoproteomic screens (47)(48)(49). Furthermore, phosphorylation at one of these sites, serine 225, was significantly reduced in Gsk3 DKO cells. We confirmed a reduction in NPM1 phosphorylation in DKO compared with wild-type cell lysates using Phos-tag gels and immunoblotting for NPM1 (Fig.  5A). The appearance of multiple bands with reduced mobility compared with the non-phosphorylated band (37 kDa) is consistent with the known multiple phosphorylated species of NPM1 in ESCs (47)(48)(49). We next performed in vitro kinase assays using DKO cell lysates and recombinant GSK-3 and detected a GSK-3-dependent increase in a phosphorylated form of NPM1 (Fig. 5B, indicated by a red arrow, compare lanes  1-4). We also observed GSK-3-dependent phosphorylation of Table 1 GSK-3-dependent phosphorylation associated with RS motifs in splicing factors detected by SILAC Phosphorylation sites are underlined, and overlapping or adjacent RS dipeptide motifs are in boldface type. (MS could not distinguish phosphorylation of RBM8A at serine 166 from serine 168, but both sites have been reported previously to be phosphorylated (42,43). Similarly, MS could not distinguish phosphorylation of TRA2B at serine 83 from serine 85 and/or serine 87. For a complete list of phosphopeptides, see supplementary Table 1A).

GSK-3 phosphoproteome
recombinant NPM1 in an in vitro kinase assay, confirming that NPM1 can be directly phosphorylated by GSK-3 (Fig. 5C), although MS detected in vitro phosphorylation at serine 220 rather than serine 225. MS analysis also confirmed direct, GSK-3-mediated phosphorylation of PHF6 at serine 155 after in vitro phosphorylation ( Fig. 4D and supplemental Table 3).

GSK-3 regulates alternative splicing of multiple genes
To address whether GSK-3 modulates alternative splicing in ESCs, as observed previously using the GSK-3 inhibitor SB216763 in T-cells (41,50), we examined global changes in splicing after complete loss of both Gsk3 isoforms by performing deep sequencing of poly(A) selected RNA from wild-type and Gsk3 DKO ESCs. We then used MAJIQ software to identify and quantify local splice variants (LSVs) and to measure the relative LSV abundance (change in percentage spliced in (dPSI)) between wild-type and DKO ESCs (Fig. 6A). This analysis identified 194 significant splicing differences (dPSI Ͼ20) in 188 genes (supplemental Table 4). When we reduced the stringency of detection to dPSI Ͼ 10, we observed 620 splicing events altered between wild-type and DKO ESCs (supplemental Table 5). Comparison of overall differential gene expression between wild type and DKO identified 321 genes with a log 2 -fold change Ͼ 1.5 (adjusted p Ͻ 0.05) in mRNA levels (supplemental Table 6). Expression of multiple known Wnt target genes was increased in DKO cells, as expected, including brachyury (T) , Sp5, Axin2, Cdx1, Snail1, Wif1, Ido1 and -2, and Dkk1. Notably, only six genes exhibited GSK-3-dependent reg-ulation of both splicing and mRNA abundance (Fig. 6B), indicating that the splicing changes we detected are not a secondary effect of altered transcription or mRNA stability. Cassette exons, intron retention, and alternative first exon use were among the most common splicing changes in wild-type versus DKO ESCs (Fig. 6C). Whereas the latter could be driven by alternative promoter choice due to changes in transcription factors, regulation of cassette exons and intron retention most likely result from altered activity of RNA-binding proteins. Taken together, these data confirm that GSK-3 regulates alternative splicing in mouse ESCs, as observed previously with pharmacological inhibition of GSK-3 in human T cells (41,50).

Protein abundance in Gsk3 WT and DKO ESCs
As GSK-3 regulates the stability of many of its target proteins (23,51), we also analyzed changes in total protein abundance between the wild-type and DKO ESCs. Overall, our analysis identified 3299 proteins, and 459 of these had significantly different abundance in wild type compared with DKO. To select high-confidence candidates, we filtered the data for an absolute -fold change of Ն1.5 and then removed candidates that showed a significant (p Ͻ 0.05) corresponding change in mRNA abundance in the RNA-Seq data. This analysis identified 47 proteins (1.4%) with increased abundance in the DKO and 78 (2.4%) with reduced abundance in DKO, compared with wild type (Fig. 7 (A  and B) and supplemental Table 7). In addition, 32 proteins were detected only in DKO ESCs, and 96 proteins were detected only in Gsk3 wild-type ESCs, without a significant difference in RNA    Table 8). The fraction of proteins with increased abundance in DKO cells agrees closely with reports showing GSK-3-dependent degradation or polyubiquitination of ϳ0.4 -1.3% of queried proteins (23,52). The decrease in abundance of proteins in DKO cells is also consistent with previous reports that GSK-3 phosphorylation stabilizes some targets, including the nuclear hormone Reverb␣ and the NF-B1 subunit p105 (10,53,54).

GSK-3 phosphoproteome
Of the proteins that increase upon Gsk3 deletion, we detected ␤-catenin, a well-known substrate of GSK-3 (Fig. 1C), which increased ϳ4-fold in DKO cells without a change in mRNA abundance. We also observed an increase in prolyl oligopeptidase (PREP or PO), which in other cell types modulates sensitivity to lithium, a well-characterized inhibitor of GSK-3 as well as inositol monophosphatase (55). Prep RNA abundance did not change significantly by RNA-Seq, consistent with posttranscriptional regulation of PREP protein abundance, although a small increase in RNA abundance that did not achieve statistical significance could also account for this increase in PREP. MS did not detect a change in PREP phosphorylation in DKO cells compared with WT, which could indicate that, similar to ␤-catenin, the phosphorylated form of PREP is highly unstable and falls below our limit of detection, or it may indicate that the regulation of PREP abundance by GSK-3 is indirect.

Discussion
GSK-3 is a central regulator of multiple cellular processes and signaling pathways, and the full range of GSK-3 regulated functions is not known. The identification of GSK-3 substrates has so far been based on a diverse range of assays performed in different experimental models, leaving an incomplete picture of the overall GSK-3 phosphoproteome. We have used SILAC to characterize the GSK-3 phosphoproteome in ESCs. To identify global changes in phosphorylation, we compared the phosphorylation state of proteins in ESCs lacking Gsk3a and Gsk3b with wild-type ESCs. This approach provides a quantitative, unbiased picture of the spectrum of GSK-3-dependent phosphoproteins. Phosphorylation of 65 proteins was significantly reduced in DKO ESCs compared with wild-type cells, and phosphorylation of an additional 120 proteins was detected only in wild-type ESCs, consistent with an absolute requirement for GSK-3-mediated phosphorylation. The identified GSK-3-dependent phosphoproteins span a wide array of functions, including splicing factors, translational initiation factors and other RNA-regulatory proteins, microtubule-associated proteins, transcription factors, chromatin modifiers, and regulators of cell division.
One of the most striking findings in the phosphoproteomic data set was the identification of multiple proteins involved in Figure 7. Protein abundance changes in DKO compared with wild type. A, volcano plot of log 2 (-fold change in protein abundance) in DKO/wild-type versus Ϫlog 2 (p value) demonstrates multiple proteins that are both increased and decreased in abundance in a Gsk3 dependent manner. B, percentage of proteins that are increased (red) or decreased (yellow) 1.5-fold (p Ͻ 0.05) in abundance in Gsk3 DKO cells without a statistically significant change in RNA abundance. The percentage of proteins detected only in DKO cells (green) or only in Gsk3 wild-type ESCs (gray) without a statistically significant change in RNA abundance is also shown. C, PREP/PO total protein levels are increased in DKO compared with wild type without a significant change in RNA abundance.

GSK-3 phosphoproteome
RNA splicing, including RBM8A, RBM39 (also known as CAPER), SRRM1, SRRM2, SRSF9, TRA2B (SRSF10), CELF1, WBP11/SIPP1 (splicing factor that interacts with PQBP-1 and PP1), PPP4R2, and CDK13. We used in vitro phosphorylation assays to confirm that GSK-3␤ can directly phosphorylate the splicing factors SRSF9 and RBM8a, a component of the exon junction complex that is involved in nonsense-mediated decay in addition to its role in alternative splicing. We also confirmed direct GSK-3␤ phosphorylation of the splicing factor PSF at threonine 679 (equivalent to threonine 687 in humans), shown previously to regulate the alternative splicing of CD45 (41). In that work, GSK-3 was proposed to phosphorylate threonine 687 of PSF in unstimulated T-cells, promoting its interaction with TRAP150. Upon activation of T-cell receptor signaling, GSK-3 is inhibited, reducing PSF phosphorylation and allowing PSF to dissociate from TRAP150, bind to CD45 preRNA, and suppress variable exon inclusion (41). Here we have demonstrated direct phosphorylation of PSF by GSK-3␤ and confirmed phosphorylation at the predicted threonine residue. An important question for future study is whether GSK-3␣ is also able to phosphorylate these substrates directly; although the amino acid sequence of GSK-3␣ is 98% identical to GSK-3␤ in the catalytic domain, the two genes are not completely redundant, as Gsk3a KO mice are viable, whereas the Gsk3b KO is embryonic or perinatal lethal (3)(4)(5).
The finding that multiple splicing factors are phosphorylated in a GSK-3-dependent manner led us to ask whether GSK-3 might play a more general role in alternative splicing. Deep sequencing of wild-type and Gsk3a/b DKO ESCs revealed 188 genes that exhibit Ն20% dPSI in splice form expression (Ͼ95% probability) between wild-type and Gsk3 KO cells. These data demonstrate that GSK-3 regulates alternative splicing of multiple genes in mouse ES cells and confirm previous observations using pharmacological inhibition of GSK-3 in human T cells (41,50). Formally, one possibility is that GSK-3 affects alternative splicing through changing transcription rates, given that GSK-3 phosphorylates a number of transcription factors (Fig.  3B), and transcription efficiency can affect splicing patterns (56,57). However, only 6 of the 188 genes that are alternatively spliced in the DKO ESCs exhibit changes in mRNA abundance between DKO and wild-type ESCs. This separation of differential expression and differential splicing is typical of what has been observed in cellular conditions known to directly regulate splicing (58,59) and is consistent with a direct effect of GSK-3 on RNA-binding proteins that we observe here.
These findings are also consistent with a previous report that GSK-3 can phosphorylate SRSF2 (SC35) in vitro and that inhibition of GSK-3 in cultured neurons increases the inclusion of exon 10 in the microtubule associated protein tau (60). Taken together, the data presented here identifying GSK-3-dependent phosphorylation of multiple splicing factors, published work demonstrating GSK-3 phosphorylation of the specific factors PSF and SRSF2, and our finding that genetic loss of Gsk3 significantly disrupts alternative splicing of Ͼ190 genes argue strongly for a broad role for GSK-3 in the regulation of alternative splicing. Interestingly, both insulin and Wnt signaling, which function by inhibiting GSK-3, alter splicing of 149 (insulin) and 85 (Wnt/wingless) genes in Drosophila S2 cells (61), raising the possibility that these pathways could regulate gene expression through modulation of GSK-3 activity and alternative splicing in other organisms.
Somatic mutations in several of the splicing factors identified in our screen occur frequently in AML and myelodysplastic syndrome (MDS) (62)(63)(64)(65)(66)(67). Knock-out of Gsk3 (DKO) in hematopoietic cells in mice causes a dramatic myeloproliferative and myelodysplastic disorder with features of AML (68). Whereas the signaling networks downstream of GSK-3 that mediate these neoplastic phenotypes are probably complex, disruption of splicing factor function through altered GSK-3-dependent phosphorylation, with subsequent disruption of splicing of mRNAs that encode hematopoietic regulators, could contribute to the progression of these neoplasms. Consistent with this, mRNA splicing is disrupted in MDS and AML progenitor cell populations compared with normal hematopoietic stem and progenitor cells, and drugs that modulate alternative splicing have been proposed as therapies for MDS and AML (67,69,70). Furthermore, inhibition of GSK-3 in primary human T cells alters the splicing of RUNX1 and ASXL1, which are both commonly mutated in AML. GSK-3 inhibition increases the frequency of splice forms that encode nonfunctional proteins, with 80% of alternatively spliced ASXL1 transcripts encoding an early termination codon that probably generates either a nonfunctional protein or an mRNA subject to nonsense-mediated decay (50). Predominance of these nonfunctional mRNAs could mimic somatic mutations and similarly contribute to AML pathogenesis. Thus, the hematopoietic neoplasms that arise from Gsk3 DKO mouse bone marrow could be due, in part, to disruption of splicing factor function.
We also identified GSK-3-dependent phosphorylation of NPM1, one of the most commonly mutated genes in AML, and PHF6, which is mutated in ϳ3% of AML (62). Although the functions of NPM1 and PHF6 are not well-characterized, both are nucleolar proteins that regulate rRNA biosynthesis and may also affect splicing (34,35,45).
GSK-3 is a core negative regulator of canonical Wnt signaling with a primary role to phosphorylate ␤-catenin and target it for proteasomal degradation. However, emerging data suggest that "canonical" Wnt signaling (Wnt signaling that depends on the Axin-APC-GSK-3 complex) regulates multiple processes that are independent of ␤-catenin and the transcriptional response to ␤-catenin/Tcf activation, including mitotic spindle orientation (71), mTORC1 activity (15,72,73), and protein stability (18,23,51,52,74). For example, Wnt-dependent stabilization of proteins (Wnt/STOP), which occurs primarily at the onset of mitosis, leads to stabilization of multiple proteins in addition to ␤-catenin, consistent with a role for GSK-3 in the canonical pathway that bypasses ␤-catenin/TCF transcription (51) (23,51,52,75,76). However, the overlap between the set of GSK-3-dependent phosphopeptides and the group of proteins that either increased (12 proteins) or decreased (4 proteins) in abundance was limited. In cases where GSK-3 phosphorylation targets proteins for degradation, this may indicate that the phosphorylated form is below the limit of detection by SILAC, as observed with ␤-catenin, but it could also represent indirect regulation of target stability by GSK-3.

GSK-3 phosphoproteome
The observation that 20% of proteins in the human proteome contain the (S/T)XXX(S/T) motif led to the suggestion that the stability of a large number of proteins could depend on GSK-3 phosphorylation (10,18). In support of Wnt stabilization of proteins, a microarray analysis of ubiquitinated proteins identified 119 of Ͼ9000 interrogated proteins (1.3%) that showed GSK-3-dependent polyubiquitination, indicative of proteasomal targeting (52). Similarly, an in vitro screen for GSK-3dependent degradation identified 42 proteins (ϳ0.6% of those screened) that were stabilized by GSK-3 inhibitors (23). In close agreement with these studies, we identified 47 proteins whose abundance increased Ͼ1.5-fold in Gsk3 DKO ESCs without a significant increase in the mRNAs encoding those proteins (based on RNA-Seq data) as well as 32 proteins that were detected only in DKO cells, consistent with GSK-3-mediated destabilization of 1.4 -2.4% of the proteome. However, GSK-3 phosphorylation also stabilizes multiple proteins, including the circadian regulators Reverb␣ (53), Timeless (77), and cryptochrome (78) as well as the apoptosis signal-regulating kinase-1 (ASK1) (79) and the p105 precursor of the NF-B p50 subunit (54). Consistent with a positive role for GSK-3 in protein stabilization, we found 78 proteins that were reduced in abundance by Gsk3 deletion (without a significant change in the corresponding mRNAs) and another 96 proteins that were only detectable in cells expressing Gsk3. Therefore, up to 5% of the proteome may be stabilized by GSK-3. Small changes in RNA that do not reach statistical significance in the RNA-Seq data could, in principle, account for statistically significant changes in protein abundance. Nevertheless, overall, the analysis presented here together with published work from others indicates that the abundance of a substantial fraction of the proteome is posttranscriptionally regulated by GSK-3.
The priming mechanism for GSK-3 substrates has been elegantly established for glycogen synthase (80), ␤-catenin (81), and a handful of other substrates, and the structural basis for GSK-3 recognition of primed substrates has also been worked out (7,(82)(83)(84). The identification of a loose consensus has led to identification of additional proteins that contain this consensus sequence but may also have biased the search for GSK-3 substrates to proteins that contain the consensus. Our unbiased phosphoproteome analysis revealed that 26 of 68 of the high-confidence candidate substrates contain the consensus sequence, with either serine or threonine at the ϩ4-position, supporting the importance of the (S/T)XXX(S/T) consensus sequence but also indicating that this consensus is neither essential nor sufficient to direct GSK-3-dependent phosphorylation. Moreover, of all of the phosphopeptides identified by SILAC, 1075 peptides have a serine or threonine at the ϩ4-position, and only 4.6% of these demonstrated GSK-3-dependent phosphorylation. Published evidence also supports GSK-3-dependent phosphorylation of sites that lack serine or threonine at the ϩ4-position, including c-Myc, the protein phosphatase-1 inhibitor-2 (I-2), cyclin D1, and tau. In a published tabulation of over 100 putative GSK-3 substrates (12), only 27% were known to be primed, consistent with our finding of ϳ70% of GSK-3-dependent phosphorylation occurring at sites lacking the consensus sequence.
These observations suggest additional complexity to the regulation of GSK-3-dependent phosphorylation.
Our analysis of in vivo phosphorylation does not distinguish direct from indirect phosphorylation. We therefore showed direct phosphorylation of RBM8A, SRSF9, PHF6, PSF, and NPM1 by GSK-3. We were able to test these by in vitro phosphorylation because the tested sites do not require a priming step. An in vitro analysis of substrates that absolutely require priming will be challenging without first identifying the priming kinase. In addition, in vitro kinase reactions are inherently limited by the artificial conditions of the assay. Therefore, these assays can only establish that a putative substrate can be phosphorylated by GSK-3, and establishing direct phosphorylation in vivo is a more challenging task that, to our knowledge, has not yet been feasible for GSK-3 substrates. Our phosphoproteomic screen nevertheless identifies multiple targets that demonstrate GSK-3-dependent phosphorylation. These will be interesting to pursue in more depth in the setting of Wnt signaling, stem cell homeostasis, leukemia pathogenesis, and other GSK-3regulated biological processes.

Cell culture
Gsk-3a/b double knock-out and wild-type (floxed) E14 ESCs (6) were kindly provided by Drs. Bradley Doble (McMaster Stem Cell and Cancer Institute, Hamilton, Canada) and James Woodgett (Lunenfeld-Tanenbaum Research Institute, Toronto, Canada). All media and supplements were obtained from Gibco (Invitrogen) unless otherwise noted. E14 mESCs were cultured at 37°C in 5% CO 2 in DMEM supplemented with 15% FBS (HyClone Defined), 0.1 mM Eagle's minimum essential medium nonessential amino acids, 2 mM Glutamax, 0.055 mM ␤-mercaptoethanol, and 1000 units/ml LIF (Chemicon) on 0.1% gelatin-coated plates with irradiated mouse embryonic fibroblasts (Global Stem). Cells were routinely tested for mycoplasma using the MycoAlert detection kit (Lonza). For SILAC labeling experiments, cells were cultured with DMEM without arginine or lysine supplemented with 0.798 mM isotope heavy L-lysine ( 13 C 6 , 15 N 2 ) and 0.398 mM heavy L-arginine ( 13 C 6 , 15 N 4 ) (Cambridge Isotope Laboratories) and 15% KnockOut Serum Replacement with the same medium components as standard culture. In case of light media, standard L-lysine and L-arginine were used. During isolation, ES cells were dissociated into a single cell suspension with TrypLE, and mouse embryo fibroblasts were removed by absorption to plastic dishes for two consecutive 30-min incubations. Proteomic experiments were conducted in biological triplicate. For protein stabilization experiments, cells were treated with MG-132 or cycloheximide at the indicated concentrations and times. MG-132 and cycloheximide were added to mouse embryo fibroblast medium during feeder separation.

Proteomics and phosphoproteomics analysis using nLC-MS/MS
All chemicals used for preparation of nLC-MS/MS samples were of at least sequencing grade and were purchased from Sigma-Aldrich, unless otherwise stated. Light and heavy labeled cells were separately lysed using 6 M urea, 2 M thiourea, GSK-3 phosphoproteome 50 mM ammonium bicarbonate, pH 8.2, and phosphatase and protease inhibitor mix (Thermo Fisher Scientific) by vortexing. After protein quantification using Bradford, equal amounts of light and heavy protein lysates were mixed. Proteins were first digested using endopeptidase Lys-C (Wako, MS grade) for 3 h, after which the solution was diluted 10 times with 20 mM ammonium bicarbonate. Subsequently, samples were reduced using 10 mM DTT for 1 h at room temperature and alkylated with 20 mM iodoacetamide in the dark for 30 min at room temperature. Further digestion was performed using trypsin (Promega) at an enzyme/substrate ratio of ϳ1:50 overnight at room temperature. After digestion, the samples were concentrated to ϳ100 l by lyophilization. Phosphopeptide enrichment using TiO 2 chromatographic resin was performed as described previously (85). The lyophilized phosphorylated peptide samples were reconstituted in 0.1% TFA and desalted using Poros Oligo R3 RP (PerSeptive Biosystems) P200 columns. Unbound peptides from the TiO 2 flow-through and subsequent TiO 2 washes were combined and lyophilized to produce the non-modified peptide fraction. The non-modified peptide fraction was resuspended in 0.1% TFA and desalted using a Sep-Pak tC18 Plus Light Cartridge (Waters). Afterward, samples were dried to completely remove traces of acetonitrile from stage tip elution. Dried samples were resuspended in buffer A (0.1% formic acid) and loaded onto an Easy-nLC system (Thermo Fisher Scientific), coupled online with a Q-Exactive or an Orbitrap Fusion Tribrid mass spectrometer (both from Thermo Scientific). Peptides were loaded into a picofrit fused silica capillary column (75-m inner diameter) packed in-house with reversed-phase Repro-Sil Pur C18-AQ 3-m resin of about 18 cm. A gradient of 165 min was set for peptide elution from 2 to 28% buffer B (100% acetonitrile, 0.1% formic acid) at a flow rate of 300 nl/min. Both instruments were programmed in a data-dependent acquisition mode. For the Q-Exactive, the full MS scan range was 360 -1200 m/z in the orbitrap with a resolution of 70,000 (200 m/z) and an AGC target of 5 ϫ 10e5. MS/MS was performed in the orbitrap selecting the top 12 ions, a resolution of 17,500, an AGC target of 5 ϫ 10e4, and a collision energy of 22. For the Orbitrap Fusion Tribrid, the full MS scan was 350 -1200 m/z in the orbitrap with a resolution of 120,000 (200 m/z) and an AGC target of 5 ϫ 10e5. MS/MS was performed in the ion trap using the top speed mode (3 s), an AGC target of 10e4, and a higher-energy collisional dissociation collision energy of 32. MS raw files were analyzed by the MaxQuant software version 1.5.2.8. MS/MS spectra were searched by the Andromeda search engine against the mouse UniProt FASTA database (version November 2015) (86,87). The search for total proteome included variable modifications of methionine oxidation and N-terminal acetylation and fixed modification of carbamidomethyl cysteine. Analysis of the phosphoproteome included carbamidomethylation on cysteine residues as a fixed modification, whereas phosphorylation on serine, threonine, and tyrosine residues was set as variable modification. Trypsin was specified as the digestive enzyme. SILAC labeling was used as quantification. Match between runs was enabled and set to a 1-min window. All other values were kept as default. Protein tables were filtered to eliminate the identifications from the reverse database, only identified by site and common contami-nants. All MS raw files are available at https://chorusproject.org 3 at project number 1290.

Statistical analysis and downstream bioinformatics
Each analysis was performed with three biological replicates. Heteroscedastic t test was used to assess the significant differences in peptide/protein abundance (p value Ͻ 0.05). Data distribution was assumed to be normal, but this was not formally tested. The protein interaction network was extracted from the STRING database (27) and visualized using Cytoscape (88). GO and KEGG pathway analysis were performed using DAVID version 6.8. The list of proteins with GSK-3-dependent phosphorylation (supplemental Table 1) was submitted as the gene list, and automatic Mus musculus background was used. Categories with a p value of Ͻ0.05 and DAVID enrichment score Ͼ1.0 were selected.

RNA-Seq
Total RNA was isolated by TRIzol from wild-type and DKO mESC cells after a similar number of passages. RNAs were then further purified by DNase treatment and the RNeasy kit (Qiagen). Purity was confirmed by bioanalyzer (RNA integrity number (RIN) Ͼ8). RNAs were sent to the Genomics Division of the Iowa Institute of Human Genetics for poly(A)-selection, generation of stranded Illumina RNA-Seq library, and sequencing on three lanes of a HiSeq4000. 269 million unique, mappable reads (402 million total) were obtained for wild-type mESCs, and 255 million unique, mappable reads (383 million total) were obtained for DKO mESCs.

Quantification of alternative spliced and differential expressed genes
Alternative splicing and differential expression analyses relied on RNA-Seq reads mapped to the reference mouse genome (mm10) using STAR with the option alignSJoverhang-Min 8 (89) (see above for numbers). Alternative splicing events were analyzed using MAJIQ and VOILA with the default parameters (90). Briefly, uniquely mapped, junction-spanning reads (217 million for wild-type and 203 million for DKO mESCs) were used by MAJIQ to construct splice graphs for transcripts by using the Ensembl transcriptome annotation (release 82) supplemented with de novo-detected junctions. Here, de novo refers to junctions that were not in the Ensembl transcriptome database but had sufficient evidence in the RNA-Seq data (default: at least three reads mapping to at least two different start positions). The resulting gene splice graphs were analyzed for all identified LSVs, defined as splits in a splice graph to or from a given exon. For every junction in each LSV, MAJIQ then quantified the expected PSI value in control and GSK3DKO samples and expected dPSI between WT and GSK3DKO samples. Results from VOILA were then filtered for high-confidence changing LSVs (whereby one or more junctions had at least a 95% probability of expected dPSI of at least an absolute value of 20 PSI units (noted as "20% dPSI") between WT and GSK3DKO) and candidate changing LSVs (95% probability, 10% dPSI).
Finally, to correct for overlapping cases where two LSVs capture the same alternativity in isoform usage at one locus, LSVs with a changing junction were grouped together into events based on shared changing junctions. Each group was assigned GSK-3 phosphoproteome an event ID. For the high confidence results (dPSI Ն 20%), the events were further categorized as single-exon cassette, multiexon cassette, alternative 5Ј-and/or 3Ј-splice site, de novo intron removal, intron retention, or alternate transcript start or end events. No mutually exclusive exon usage events were observed. All remaining events that did not fit into these categories were classified as complex.
Counts of STAR-aligned reads that overlapped transcript exons were quantified with the GenomicRanges package in R (count mode: Union) (91). Counts were then normalized, and differentially expressed genes between WT and GSK3 DKO with a log 2 -fold change of Ն1.5 and a Benjamini and Hochberg corrected p value of Ͻ0.05 were identified using default DESeq parameters in R (92).

Phos-tag electrophoresis and immunoblotting
Cells were lysed in 6 M urea, 2 M thiourea, 50 mM ammonium bicarbonate, pH 8.2, with protease inhibitor mixture (Sigma) and phosphatase inhibitor mixtures 2 and 3 (Sigma), used at 1:100. Supernatants were collected after centrifugation at 14,000 rpm for 10 min at 4°C. Protein was quantified by a Bradford assay. Samples were diluted in standard 2ϫ Laemmli sample buffer and heated at 95°C for 5 min before PhosTag analysis. PhosTag gels were prepared using PhosTag TM acrylamide reagent (Wako Pure Chemical Industries, Ltd., catalogue no. AAL-107) with ZnCl 2 based on the manufacturer's protocol (44). The composition for optimal separation of NPM1 was 30 M PhosTag TM acrylamide reagent and 10% acrylamide using a acrylamide/bisacrylamide solution (29:1). The composition for optimal separation of RBM8A was 30 M PhosTag TM acrylamide reagent and 12% acrylamide. Gels were run at constant 70 V for 3-4 h, until dye front was off the gel. PhosTag gels were soaked in transfer buffer with 1 mM EDTA for 15 min and then washed for 15 min in transfer buffer without EDTA. Gels were transferred using PVDF membrane, at 150 mA for 16 h. Gels were blocked with 5% nonfat milk. The following antibodies were used at 1:1000: NPM1 (Bethyl Laboratories) and RBM8A (Bethyl Laboratories).

In vitro protein kinase assays
Recombinant proteins were incubated in kinase reaction buffer (100 mM Tris, pH 7.5, 5 mM DTT, 10 mM MgCl 2 ) with 800 M ATP for 60 min at 30°C. Reactions were stopped by adding standard 2ϫ Laemmli sample buffer and incubating at 95°C for 5 min. Samples were run on PhosTag gels following the reaction. 0.4 g of recombinant protein was used; RBM8A (ProSpecBio), NPM1 (ProSpecBio), PHF6 (Origene), and PSF (41) were expressed in and purified from bacteria, and SRSF9 (Abnova) was translated in vitro in wheat germ extract and then purified. Recombinant GSK-3 was used at 50 units/20-l reaction (New England Biolabs). For MS analysis, MS proteolysis methods were followed starting at the 10 mM DTT step. Peptides were analyzed following similar methods as indicated under "Proteomics and phosphoproteomics analysis using nLC-MS/MS." For in vitro phosphorylation of proteins in ESC cell lysates, wild-type and DKO ESCs (in triplicate) were lysed in 8 M urea (6 M urea, 2 M thiourea, 50 mM ammonium bicarbonate, pH 8.2, phosphatase and protease inhibitors) as described above for SILAC. Samples were diluted in protein kinase buffer (50 mM Tris, pH 7.5, 10 mM MgCl 2 , 0.1 mM EDTA, and 2 mM DTT with 0.8 mM [␥- 18 O]ATP (Cambridge Isotopes catalog no. OLM-7858-20)) to a final protein concentration of ϳ0.5 mg/ml in a 200-l total reaction volume (urea Ͻ0.4 M). Recombinant GSK-3␤ (25 units) or water (control) was added, and samples were incubated for 60 min at 30°C and then reduced, alkylated, trypsinized, and processed for MS as described above for SILAC.

Intracellular flow cytometry
Up to 5 million cells were fixed in 1.6% paraformaldehyde for 20 min at 37°C and washed in FACS buffer. (Cells were permeabilized with 1ϫ saponin buffer (Biolegend 421002).) To stain for Oct4 and Nanog, cells for were incubated with primary antibody at 1:200 for 1 h at room temperature. Antibodies were as follows: Oct4 (Santa Cruz Biotechnology, Inc.) and Nanog (Abcam). Cells were incubated with secondary antibody at 1:400 for 30 min at room temperature. Antibodies were as follows: goat anti-mouse IgG-PE and IgG-CFL488 (both from Santa Cruz Biotechnology). Samples were resuspended in FACS buffer (1ϫ PBS ϩ 0.5% BSA, 0.05% azide). Analysis was performed using a FACSCanto flow cytometer (BD Biosciences), and data were analyzed using FlowJo software (Treestar). supervised MS experiments and contributed to experimental design and interpretation of data and editing of the manuscript. P. S. K. conceived and coordinated the study, contributed to experimental design and interpretation of data, and wrote the manuscript. All authors reviewed the results and approved the final version of the manuscript.