O-GlcNAcase Is an RNA Polymerase II Elongation Factor Coupled to Pausing Factors SPT5 and TIF1β*

We describe here the identification and functional characterization of the enzyme O-GlcNAcase (OGA) as an RNA polymerase II elongation factor. Using in vitro transcription elongation assays, we show that OGA activity is required for elongation in a crude nuclear extract system, whereas in a purified system devoid of OGA the addition of rOGA inhibited elongation. Furthermore, OGA is physically associated with the known RNA polymerase II (pol II) pausing/elongation factors SPT5 and TRIM28-KAP1-TIF1β, and a purified OGA-SPT5-TIF1β complex has elongation properties. Lastly, ChIP-seq experiments show that OGA maps to the transcriptional start site/5′ ends of genes, showing considerable overlap with RNA pol II, SPT5, TRIM28-KAP1-TIF1β, and O-GlcNAc itself. These data all point to OGA as a component of the RNA pol II elongation machinery regulating elongation genome-wide. Our results add a novel and unexpected dimension to the regulation of elongation by the insertion of O-GlcNAc cycling into the pol II elongation regulatory dynamics.

We describe here the identification and functional characterization of the enzyme O-GlcNAcase (OGA) as an RNA polymerase II elongation factor. Using in vitro transcription elongation assays, we show that OGA activity is required for elongation in a crude nuclear extract system, whereas in a purified system devoid of OGA the addition of rOGA inhibited elongation. Furthermore, OGA is physically associated with the known RNA polymerase II (pol II) pausing/elongation factors SPT5 and TRIM28-KAP1-TIF1␤, and a purified OGA-SPT5-TIF1␤ complex has elongation properties. Lastly, ChIP-seq experiments show that OGA maps to the transcriptional start site/5 ends of genes, showing considerable overlap with RNA pol II, SPT5, TRIM28-KAP1-TIF1␤, and O-GlcNAc itself. These data all point to OGA as a component of the RNA pol II elongation machinery regulating elongation genome-wide. Our results add a novel and unexpected dimension to the regulation of elongation by the insertion of O-GlcNAc cycling into the pol II elongation regulatory dynamics.
RNA polymerase II is the species of polymerase that transcribes the protein-coding genes in the cell. Largely based on in vitro transcription data and bacterial transcriptional regulation, it was thought that the pathway of transcription initiation was the de novo assembly and recruitment of general factors and pol II 4 into a preinitiation complex at promoters. In other words, transcription was solely controlled by whether initiation occurred or not. However, in the late 1980s, a transcriptionally engaged pol II was found on several genes, located roughly at ϩ50 relative to the transcriptional start site (TSS) (1)(2)(3). More recently, genome-wide approaches have shown that paused pol II exists on at least 40% of promoters in the genome (4 -9).
These data show that the regulation of elongation is a significant and widespread method by which cells regulate gene expression.
Biochemically, the establishment of a paused polymerase requires the recruitment of DSIF and NELF to pol II early in elongation to prevent pol II from productive elongation (10 -13). Release of the paused polymerase is achieved with P-TEFb phosphorylation of DSIF and NELF, ejecting NELF from pol II and converting DSIF into a positive elongation factor (14,15). The more recent discoveries of additional factors likely involved in pause establishment and release include human capping enzyme (16), the TFIIH-associated kinase, CDK7 (17,18), the TFIIH ERCC3 helicase (19), Mediator (20 -22), Integrator (23,24), ELL (25), TFIIS (26), TRIM28-KAP1-TIF1␤ (27,28), Top1 (29), SEC (30), PAF complex (31,32), and Gdown1 (33). The plethora of factors suggests that more complicated dynamics are at play in regulating pausing and elongation. Additionally, it is not clear, and indeed difficult to know, whether all of the factors involved in pol II pausing and elongation have been identified. This difficulty is due mostly to the absence of a human cell-based in vitro transcription system that recapitulates a paused polymerase and with which the full complement of factors can be functionally identified. Lastly, it is not clear how all of these factors integrate into a unified mechanistic and regulatory model of pol II pausing.
The requirements for serine/threonine phosphorylation in pausing release imply that O-GlcNAcylation might also be a participatory factor; the O-GlcNAcylation of proteins is a very common post-translational modification, occurring on serine and threonine residues, and is often mutually exclusive with the phosphorylation of those same residues (34). O-GlcNAc transferase (OGT) utilizes UDP-GlcNAc as a high energy donor substrate and the ␤-N-acetylglucosaminidase or O-GlcNAcase (OGA) removes GlcNAc from protein substrates (35)(36)(37). Physiologically, O-GlcNAcylation likely plays significant causal roles in type 2 diabetes and cancer. OGA is essential for tumor cell viability and metabolic homeostasis (38 -40). Additionally, OGT knock-out mice are embryonic lethal, and brain-specific OGT knock-out disrupts eating behavior in mice (41,42), underscoring both the essential nature of OGT and the nutrient-sensing properties of the O-GlcNAcylation regulatory space. Lastly, ϳ2-5% of glucose flux into the cell is diverted to the hexosamine biosynthetic pathway for the synthesis of UDP-GlcNAc, and it has been hypothesized then that UDP-GlcNAc levels represent a nutrient sensor and reflect the intracellular glucose concentration (35).
We describe here the unexpected involvement of O-GlcN-Acylation in RNA pol II pausing and elongation. We had previously found requirements for both OGT and OGA in transcription in vitro (43), although the point at which the enzymes acted was not determined. Here we examined OGA function further using in vitro transcription assays. In vitro elongation assays showed a requirement for OGA during elongation in crude nuclear extracts. Secondly, we identified an ϳ600-kDa complex that contains OGA and the pausing/elongation factors SPT5 and TRIM28-KAP1-TIF1␤ (27,28). In vivo ChIP-seq analysis shows that OGA localizes to TSS and OGA peaks overlap considerably with pol II, SPT5, and TRIM28-KAP1-TIF1␤ peaks. These data suggest a novel role for OGA activity in the regulation of RNA pol II pausing and elongation.

OGA Activity Is Required for Efficient Elongation in Vitro-
Previous results showed that chemical inhibition of OGA blocked RNA pol II-dependent transcription in vitro (43). However, it was not clear exactly when in the transcription process OGA activity was required. A hint came from our recent study of the effects of OGT and OGA inhibitors on PIC formation and pol II promoter recruitment (44). The absence of an effect of the OGA inhibitor PUGNAc suggested that the inhibition of transcription might occur after PIC formation, thus implicating early elongation steps as being OGA-dependent.
To address a possible OGA function during elongation, we adopted the elongation assay established by Price and co-workers (46). We used nuclear extracts made from HeLa cells (45) and assayed transcription activity using the CMV promoter and a pulse-chase assay (46). We used this assay to test various kinase and OGA inhibitors specifically for effects on elongation by adding the inhibitor(s) after allowing PIC formation to occur for 30 min. Inhibitors were then added for 10 min followed by NTPs to initiate transcription. We found as expected a fulllength 548-nucleotide run-off transcript using untreated HeLa nuclear extracts (Fig. 1A). We next tested inhibitors of TFIIH CDK7 kinase activity (THZ) and the P-TEFb CDK9 inhibitor flavopiridol (FP) in our assay (47,48). Both THZ and flavopiridol have been shown previously to block transcription elongation in vitro (48,49). Indeed, both inhibitors blocked elongation in our assay (Fig. 1A). We next tested two OGA inhibitors: PUGNAc and Thiamet G. PUGNAc is considered a more general hexosaminidase inhibitor and can inhibit, for example, lysosomal hexosaminidases as well as OGA (however, due to the use of a nuclear extract and the neutral pH of the transcription assay, we do not expect any lysosomal hexosaminidase activity), whereas Thiamet-G is specific for OGA (50 -52). Both inhibited transcription elongation of the full-length product but not smaller (Ͻ75 nucleotides) RNAs (Fig. 1A). These data show that OGA activity in vitro is required for efficient elongation in crude nuclear extracts.
Because TFIIH, P-TEFb, and OGA activities are required for elongation (Fig. 1A), we asked whether one acted before the other or whether their activities were additive or synergistic. To do this, we compared the P-TEFb or OGA inhibitors separately (as in Fig. 1A) to the effects of mixing both inhibitors together. If the effects of the drugs are temporally distinct, then mixing inhibitors A and B together would have the same effect as either A or B by itself. For example, if the mixing of A and B has the same effect as A, then we would conclude that A acts before B. Conversely, an additive/synergistic effect from mixing A and B would appear as an elongation defect more severe than observed for either A or B by itself. Our results are shown in Fig.  1A, lanes 6 -8. The combination of THZ and FP showed a THZ effect, indicating that CDK7 acted before P-TEFb (49) (compare lanes 2 and 4 to lane 6 in Fig. 1A; see similar results in Fig.  1B). Both OGA inhibitor (PUGNAc and Thiamet G) combinations with FP showed FP effects and not PUGNAc or Thiamet G effects, indicating that P-TEFb acts before OGA in the elongation process (Fig. 1A, compare lanes 3, 4, and 5 to lanes 7 and 8).
We also assayed the combination of THZ and Thiamet G (Fig.  1B). Strikingly, THZ effects are predominant to those of Thiamet G (compare lanes 2 and 3 to lanes 5 and 7, respectively). Overall, these experiments indicate that TFIIH/CDK7 acts first, P-TEFb acts second, and OGA acts third during the elongation process, and equally importantly, they do not appear to be acting in any sort of additive or synergistic way, suggesting each enzyme represents a distinct step in the elongation process.
We next asked whether OGA protein could affect elongation. To that end, we purified recombinant OGA from bacterial extracts via a His tag and titrated rOGA into an elongation assay designed to specifically measure later elongation events. PICs were formed on immobilized CMV templates and given a 30-s pulse of [ 32 P]CTP/cold GTP/UTP/ATP. Elongation was stopped with EDTA, and complexes were washed. The elongation was then chased with cold NTPs. A representative elongation product after the cold NTP chase is shown in Fig. 1C, lane 1. TFIIF is an established initiation and elongation factor (54 -56), and we exploited its elongation properties by adding it to the chase step where it efficiently promoted elongation (compare lanes 1 and 2, Fig. 1C) (46). To this TFIIF-dependent elongation assay we titrated rOGA, and we found a clear reduction in elongation (lanes 3-5, Fig. 1C).
We then asked whether OGA catalytic activity was required for the block to elongation. We purified a catalytically inactive rOGA (D174A) (57) from bacteria and added identical amounts (Fig. 1D) of the wild type or mutant rOGA to the TFIIF-dependent elongation assay and found that the Asp-174 mutant had no effect on elongation (Fig. 1C, lanes 6 -9). The data in Fig. 1 show that OGA can have both positive and negative elongation functions, depending on the assay used.
OGA Is Associated with the Pausing and Elongation Factor SPT5 and TRIM28-KAP1-TIF1␤-The requirement for OGA activity in transcription elongation processes in vitro and in vivo suggested that OGA might interact with members of the elongation machinery. The now classic purification scheme of nuclear extracts for the components of the transcription machinery was done by first fractionating a nuclear extract over a P11 resin (58). This scheme fractionates TFIIA in the flowthrough, DSIF and NELF in the 0.1 and 0.3 M fraction, RNA pol II, TFIIH, TFIIB, TFIIE, TFIIF, and Mediator in the 0.5 M fraction, and TFIID and some Mediator in the 1 M fraction (59). To determine OGT and OGA chromatographic behavior relative to the other transcriptional machinery, we ran a HeLa nuclear extract over a P11 resin and collected fractions from the flowthrough, 0.1 M KCl elution, the 0.3 M, 0.5 M, and 1 M KCl step elutions. We then analyzed each step of elution for the presence of OGT and OGA by Western blot. We found that OGA eluted solely in the flow-through/0.1 M fraction ( Fig. 2A). In contrast, OGT was distributed in all four of the KCl step elutions ( Fig. 2A and Ref. (43). As evidence of the proper fractionation of the extract, RNA pol II was found predominantly in the 0.5 M KCl elution as expected (58).
We next looked for the coelution of OGA with elongation factors that are present in the P11 flow-through fraction. The most obvious factor was DSIF, which was mentioned by Handa and co-workers (10, 12) as existing in both the 0.1 and 0.3 M P11 elutions. We separated our 0.1 M material over a Superose 6 gel filtration column and probed for both OGA and the SPT5 subunit of DSIF by Western blot. We found that these two factors coeluted at ϳ500 -600 kDa (Fig. 2B). We then assayed for an interaction between the 2 by immunoprecipitating the 500-kDa Superose 6 fraction with anti-SPT5 and anti-OGA antibodies and probing both immunoprecipitates for OGA. The anti-OGA immunoprecipitation precipitated OGA, as expected, whereas the SPT5 immunoprecipitation also precipitated OGA (Fig.  2C), demonstrating that they are found in the same complex, interacting either directly or indirectly. To confirm this interaction, we immunoprecipitated the Superose 6 fractions containing OGA and SPT5 and analyzed them by mass spectrometry. We found that the SPT5 antibody coimmunoprecipitated OGA and vice versa as indicated by the peptide numbers in Fig.   FIGURE 1. OGA is a positive elongation factor in crude nuclear extracts and is required for efficient elongation. A, lanes 1-5, the addition of hexosaminidase inhibitor PUGNAc and OGA-specific inhibitor Thiamet G (after PIC formation) both block efficient elongation in crude nuclear extract elongation assays. Shown are pulse-chase elongation assays using a CMV promoter template. Transcription was initiated by the addition of [ 32 P]CTP/GAT for 30 s and chased for 5Ј with 1.2 mM cold CTP (46). Labeled RNA was isolated and separated on an 8% urea, Tris borate-EDTA polyacrylamide gel. The full-length labeled RNA run-off product of 548 nucleotides is indicated by the arrow. THZ is an inhibitor of the CDK7 kinase subunit of TFIIH, and FP is a P-TEFb inhibitor (47,48). PUGNAc and Thiamet G were added after a 30 h PIC formation step as our previous work suggested that OGA might be required for PIC formation (43). THZ and FP were added either concomitantly with PIC formation or afterward (as with PUGNAc and OGA). Lanes 6 -8, epistatic assays indicate that OGA acts after P-TEFb and CDK7/TFIIH. In vitro transcriptions were done as in A, but in addition to assaying THZ, FP, PUGNAc, or Thiamet G separately, inhibitors were paired to assay for a temporal defect. nt, nucleotides. B, in a separate experiment, done as in panel A, we also assessed the epistatic relationship between THZ and Thiamet G (in addition to a THZ/FP and Thiamet G/FP assay). Note the identical effects of THZ, whether assayed by itself or in combination with FP or Thiamet G (compare lanes 2 (THZ), 5 (THZϩFP), and 7 (THZϩThiamet G) to the effect of FP or Thiamet G in lanes 3 and 4, respectively). C, rOGA inhibits RNA pol II elongation on immobilized templates. PICs were formed on immobilized templates followed by a 30-s pulse of [ 32 P]CTP and 0.5 mM GTP/ATP/UTP. Beads were washed and then incubated with or without rTFIIF and wild type or mutant OGA (mOGA; D174A) as indicated. This incubation step was followed by a 7-min incubation with 0.5 mM NTPs. Labeled RNA was isolated and run on an 8% polyacrylamide/urea/Tris borate-EDTA gel. D, Western blots of recombinant wild type or mutant Asp-174 catalytically inactive OGA showing that equal amounts were added to the transcription assays in panel C. 2C. These experiments show that OGA and SPT5 are physically interacting, thus connecting OGA to the elongation machinery.
In addition to confirming the interaction between OGA and SPT5, mass spectroscopy analysis indicated that the pausing/ elongation factor TRIM28-KAP1-TIF1␤ (27, 28) also immunoprecipitated with OGA (Fig. 2D). To confirm this interaction, we were able to detect OGA by immunoprecipitating TRIM28-KAP1-TIF1␤ (hereafter referred to as TIF1␤) from the P11 0.1 M fraction (Fig. 2D). Thus, we have multiple confirmations of the interaction of OGA with two RNA pol II factors that are known participants in elongation, SPT5 and TIF1␤, consistent with an OGA-dependent elongation function.
An OGA-SPT5-TIF1␤ Complex Has Elongation Properties-To further examine the relationship between OGA, SPT5, and TIF1␤, we attempted to construct various FLAG-tagged cell lines. Although we were unable to establish tagged OGT and OGA cells lines, we were successful in creating a FLAG-SPT5 HeLa cell line. We grew ϳ10 liters of the FLAG-SPT5 cell line, made a nuclear extract, and fractionated the extract over a P11 column. We then isolated FLAG-tagged SPT5 from the P11 0.1 M fraction using M2-agarose beads and eluted the complex from the resin with FLAG peptide (Fig. 3A). To confirm the content of the F-SPT5 complex, we immunoblotted the FLAGtagged material with OGA, SPT5, and TIF1␤ antibodies and found that all three were present in the complex (Fig. 3B). Hereafter, we refer to this complex as the OGA elongation complex (OEC).
We examined the OEC for elongation properties by titrating the OEC into the TFIIF-dependent elongation assay (Fig. 1C). We found that the OEC inhibited elongation of longer RNAs (Ͼ200 nucleotides) while having little effect on shorter RNAs (Fig. 3C). To confirm the specificity of this effect, we incubated a P11 0.1 M fraction that did not contain FLAG-SPT5 with M2-agarose and eluted bound material from that resin. The addition of that material (neg. control lane, Fig. 3D) had no effect on TFIIF-dependent elongation. These results indicate that the OEC has properties characteristic of a complex regulating pol II elongation.
Lastly, we asked whether OGA catalytic activity was required for OEC activity. We again used the TFIIF-dependent elongation assay and assayed two different OEC amounts, both of which inhibited elongation (compare lanes 2 and 3 and lanes 2 and 5, Fig. 3E). We then incubated each OEC amount with PUGNAc before adding it to the elongation assay (compare lanes 3 and 4 and lanes 5 and 6, Fig. 3E). In both cases, the PUGNAc did not change the level of elongation relative to the untreated OEC, indicating that OGA activity is not required for OEC activity in this assay.
OGA and O-GlcNAc Map to TSS/5Ј Ends of genes in Vivo-Using the ChIP-seq assay we looked in vivo at the distribution of OGA across the genome and its overlap with pol II, SPT5, and TIF1␤. Remarkably, we found discrete peaks of OGA that mapped to TSSs in metagene analysis (Fig. 4A): 46% of OGA peaks overlapped with pol II (Table 1), 70% of OGA peaks overlapped with SPT5, and 93% of OGA peaks overlapped with TIF1␤ ( Fig. 4B and Table 2). OGA overlapped with only 6% of promoter pol II (Table 3); however, of those overlaps, 100% also contained SPT5, and 91% contained SPT5 and TIF1␤ (Table 4). In conclusion, there is a clear partitioning of OGA, SPT5, and TIF1␤ to pol II promoter peaks. Visual inspection of individual genes, randomly selected, showed that OGA, pol II, SPT5, and TIF1␤ peaks all tightly correlated with each other (Fig. 4, C-F), including prominent peaks on paused genes such as the heat shock gene HSP90AA1 (Fig. 4C). These tight co-localizations between the three proteins are consistent with their biochemical interactions in Figs. 2 and 3.
Our analysis also showed a significant number of pol II peaks within gene bodies. These peaks are not at all well understood, and it not clear whether they represent paused or arrested/ stalled pol II. Our mapping shows that at gene-body pol II peaks, there are coincident peaks of OGA, SPT5, and TIF1␤ (Fig. 4, D and F, and Table 4). 99% of pol II peaks had an overlapping SPT5 peak, and 72% had an overlapping TIF1␤ peak. Of those, 100% had both OGA and SPT5, and 93% had OGA, SPT5, and TIF1␤. As with promoter pol II, the overlap of gene body pol II and OGA also shows enormous overlaps with SPT5 and TIF1␤. Although there are many more SPT5 and TIF1␤ peaks, virtually all of the OGA peaks co-localize with these two elongation factors (Table 3). Because of the presence of SPT5 and TIF1␤ (as well as OGA), these data suggest that these pol II peaks are likely paused in some manner and should be considered legitimate concentrations of SPT5-dependent paused pol II within gene bodies genome-wide due to the prior assignment of both SPT5 and TIF1␤ as pausing factors (27,28).
Lastly, we investigated the genome-wide distribution of O-GlcNAc itself by ChIP-seq using the anti-GlcNAc antibody RL2. Previous work in Caenorhabditis elegans showed that O-GlcNAc was considerably enriched at TSSs (60). We found similar and striking enrichments of O-GlcNAc at TSSs in human BJAB cells (Fig. 4G and Table 3). 70% of pol II promoter peaks overlapped with an O-GlcNAc peak, and ϳ30% ofok;1 O-GlcNAc peaks overlapped with promoter pol II. Thus, all components of the O-GlcNAcylation regulatory space are localized at promoters: O-GlcNAc (Fig. 4G), OGA (Fig. 4A), and OGT (61).

Discussion
We describe here the unexpected discovery of a novel elongation factor, O-GlcNAcase, which is the enzyme responsible for O-GlcNAc removal from a protein substrate, as a regulator of RNA pol II pausing and elongation. Thus, the O-GlcNAcylation system is part of a novel elongation regulatory phenomenon, one that is catalytic and adds an entirely new regulatory space governing both pausing and elongation by inserting O-GlcNAcylation regulatory steps into the elongation process. We show for the first time that OGA inhibition blocks elongation, that OGA is in physical contact with the elongation machinery, and that OGA maps to promoters genome-wide.
OGA Is an Elongation Factor-Inhibition of OGA in a crude nuclear extract (Fig. 1) results in an inhibition of elongation that is qualitatively very similar to the well known inhibition of P-TEFb-dependent elongation by flavopiridol and 5,6-Dichlorobenzimidazole riboside (62). Epistatic assays also show that OGA activity is part of a sequence of enzymatic activities in vitro: CDK7/TFIIH activity, followed by P-TEFb activity, followed by OGA activity (Fig. 1). In contrast to the crude nuclear extract system, in a partially purified elongation system where OGA has been removed by washes of the DNA template, FIGURE 3. OEC has elongation factor properties. A, purification of OEC. Shown is a silver-stained PAGE of FLAG-tagged SPT5/OEC as well as input and flow-through of the M2-agarose resin used to capture the FLAG epitope. B, OEC contains OGA, SPT5, and TIF1␤. F-SPT5/OEC was run on SDS-PAGE and subjected to immunoblotting with the indicated antibodies on the right. C, OEC inhibits transcription elongation. F-SPT5/OEC was titrated into TFIIF-dependent elongation assays (see Fig. 1C and lanes 1 and 2 of panel D for examples of the assay). D, a non-FLAG eluate from M2-agarose does not inhibit elongation. An untagged P11 0.1 M fraction was incubated with M2-agarose and subsequently eluted with FLAG peptide. The resulting eluate was used as a control in the TFIIF elongation assays (compare lanes 3 and 4; lane 3 contains the F-SPT5/OEC activity). Equal amounts of protein input onto the M2 resin, and equal volumes of each eluate were used. E, OEC activity is not dependent on OGA catalytic activity. The OEC inhibition in the TFIIF-dependent elongation assay was assessed in the presence of the OGA inhibitor PUGNAc. Two different OEC concentrations were used (indicated by ϩ and ϩϩ) and were incubated with PUGNAc before adding to the elongation assay, as in Fig. 3D. elongation is inhibited by the addition of rOGA (Fig. 1). All of these experiments also argue for a direct effect of OGA on transcription elongation. But here, when OGA is assayed without other components of a nuclear extract (which were by definition removed during template washing), the assay revealed a possible positive function for OGA in elongation under different biochemical conditions. In any case it is clear that OGA and OGA-containing complexes can regulate pol II elongation (Figs. 1 and 3).
OGA, SPT5, and TIF1␤-Our finding of a physical interaction between OGA and SPT5 (Fig. 2), a component of the elongation machinery, further suggests that the relationship between the regulation of O-GlcNAcylation and elongation is a bone fide one. SPT5 is a well known elongation factor required for establishing a paused pol II and, after phosphorylation, promoting productive elongation (62). To our knowledge, SPT5 has only been documented in a complex with SPT4, and we have yet to find any indication of SPT4 in the OEC. This suggests that SPT5 then has elongation functions other than those as part of DSIF.
Our finding of TIF1␤ in the OEC is also quite significant. TIF1␤ is also known as TRIM28 and KAP1 and was recently described as a factor participating in pol II pausing on the human HSPA1B promoter (27). A subsequent analysis suggested similar functions (28,63). We have elected to use the TIF1␤ designation to emphasize the relationship to elongation, as TIF1␥ also has elongation properties, including recruiting P-TEFb to promoters (64,65). Additionally, TIF1␣, -␤, and -␥, all, are quite similar in primary structure (66).
We also assayed whether OGA catalytic activity was required for the OEC's ability to inhibit elongation. Somewhat surprisingly, we found that PUGNAc did not affect OEC function. This suggests that the OGA is inactive in the complex and that the OEC inhibitory functions are due to the activity of SPT5 and TIF1␤. It is thus possible that OGA activity is regulated by the other factors in the OEC and that it is then activated at some point in the elongation process. To examine this further, a reconstituted elongation assay, based on the nuclear extract system, will be needed.
ChIP-seq Analysis-The cross-linking of OGA to promoters and its overlap with pol II positions across the genome is a remarkable result, further supporting the functional data that OGA is an elongation factor. Approximately half of the OGA peaks overlap with pol II, suggesting that OGA significantly partitions with pol II. Additionally, there are considerable overlaps of both SPT5 and TIF1␤ with OGA. These correlations are consistent with the physical isolation of both SPT5 and TIF1␤ with OGA. Furthermore, the overlaps of these three factors with pol II in the individual gene tracks further implicates OGA with pol II pausing and elongation. Finally, the localization of all of the components of the O-GlcNAcylation system at promoters, OGT (61), OGA (Fig. 4), and O-GlcNAc itself is quite striking (Fig. 4). Thus, promoters (and to a similar extent pol II peaks) appear to be focal points of O-GlcNAc information and regulation. We suggest that the promoter itself is acting, via the presence of the O-GlcNAcylation system, as a nutrient sensor, where activities of both OGT and OGA dynamically sample the levels of intracellular UDP-GlcNAc, and thus indirectly, the flux of glucose into the cell. This information would then be transmitted to various steps in PIC formation and pol II recruitment (44) and the regulation of elongation (this work). Others have proposed that UDP-GlcNAc concentrations are governed      by nutrient flux, as glucose, glutamine, acetyl CoA, and UTP are all necessary for UDP-GlcNAc biosynthesis (35,67). It is likely though that the UDP-GlcNAc is not the only component of the nutrient-sensing mechanism. OGT obviously must participate too, but more importantly, it is the set of O-GlcNAcylated proteins that are undergoing a dynamic cycling of O-GlcNAcylation that are the nutrient sensors. By constantly adding and removing O-GlcNAc from a protein(s), cycling allows the OGT and the target protein to continuously sample the nutrient state of the cell and respond to any changes in the supply and concentration of those nutrients. Two-step Model of Pausing/Pause Release-The general implication of these data suggest that the removal of O-GlcNAc is necessary for proper elongation. It is well known that O-GlcNAc and phosphorylation are often found in a mutually exclusive relationship. It follows then that removal of O-GlcNAc might be a necessary prior event to the subsequent phosphorylation of elongation factors (for example, DSIF and NELF) by various kinases (such as P-TEFb). In that regard we have noted that virtually all of the known elongation machinery is O-GlcNAcylated, as determined by mass spectroscopy (68). Nevertheless, we do not have a good grasp on the functional targets of either OGT or OGA in elongation. However, it is clear that OGT is required for elongation events just as much as OGA is, as OGA needs O-GlcNAcylated substrates. As such, we predict that OGT is also functioning as an elongation factor and is coupled to other elongation factors. Determining the mechanistic function and regulation of OGA in the elongation processes will likely require the establishment of a more highly purified O-GlcNAc-dependent transcription system. Additionally, the elucidation of these processes will likely require the identification of OGT and OGA substrate(s) in the elongation machinery.

Peaks
The OGA requirement may also impinge on the phosphorylation state of RNA pol II, as we have shown that both serine 2 and serine 5 residues of the pol II CTD can be O-GlcNAcylated (44). It is likely that OGA inhibition leads to altered phosphorylation states of the CTD, which would then influence pausing (in the case of serine 5) and various post-transcriptional events (in the case of serine 2). Again, it might be possible to test some of these predictions with highly purified pol II added into an O-GlcNAc-dependent reconstituted in vitro transcription system.
A Possible Connection between Cancer and Diabetes Phenotypes and OGA Activity Genome-wide-We suggest that the dysregulation of O-GlcNAcylation in cancer cells (69) results in the concomitant dysregulation of gene expression genomewide via altered pol II elongation. The same idea can also be invoked to explain O-GlcNAcylation functions in type 2 diabetes (35), thus reframing both disease phenotypes as a defect in pol II elongation genome-wide. Indeed, increased O-GlcNAcylation and glucose flux resulted in increased expression of a subset of genes in beta cells of the pancreas (70).
Additional Functions of O-GlcNAcylation during Pausing and Elongation-One major question that arises from this work is why would a cell employ an apparently continuous cycling of O-GlcNAc on and off of a set of protein targets? When looked at in isolation, this appears to be nothing more than a waste of energy and resources, a classic futile cycle. However, the UDP-GlcNAc substrate for OGT is a high energy donor, contributing a ⌬G (Ϫ8.8 kcal/mol; Ref. 71) similar to that of ATP hydrolysis (standard ⌬G ϭ Ϫ7.3 kcal/mol; Ref. 72), which may be a part of an energy requirement for pol II elongation and pausing.
Summary-Our studies on the regulation of O-GlcN-Acylation in RNA pol II-dependent transcription provide novel, unforeseen, and fascinating insights into the regulation of initiation and elongation. Our results have described a novel regulatory space that regulates RNA pol II transcription at promoters and during elongation. Furthermore, these studies also connect the nutrient state of the cell to the transcriptional machinery and all that is implied by the metabolic dysregulation seen in human disease, from diabetes to cancer (73). Lastly, these studies underscore the immense power of transcription biochemistry and in vivo approaches that together provide avenues of investigation that neither one can supply on its own.

ChIP-seq Bioinformatics Analysis
ChIP-seq reads were mapped to human genome (hg19) using BWA (PMID: 19451168). Low quality reads and adaptor sequences were removed by Trim-Galore. Peaks regions were identified by Macs2 using default options. Overlaps between peaks were calculated using Bedtools (PMID: 20110278.) Promoter regions were defined as Ϫ30 and ϩ300 in reference to the TSS, and the gene bodies were defined as ϩ301 of the TSS to the transcription termination site defined in Gencode release 19 (PMID: 22955987). Elongation rates were calculated by the ratio between the read density of 5Ј promoter region (between Ϫ30 and ϩ300 from TSS) and the read density of gene body (between ϩ301 and ϩ2000 from TSS). Genes with Ͻ2000 nucleotides and overlapping with other genes were omitted from the analyses.

In Vitro Transcriptions
In vitro transcriptions were done essentially as described (46). Nuclear extracts were prepared essentially as described (45). All drug inhibitors were added after PIC formation for 15 min before nucleotide addition. rOGA was made as described (43). Additions of flavopiridol and THZ were done as described (49). rTFIIF was expressed in Escherichia coli and purified as described previously (58).

Chromatography
Nuclear extract was separated by P11 on an AKTA Purifier using the indicated step elutions. 0.1 M fraction was loaded onto a Superose 6 column and developed in BC100. FLAG-SPT5 was purified from Flag-SPT5 expressing HeLa cells (10L) nuclear extract that was fractionated over a P11 column. The 0.1 M fraction was then subjected to an affinity purification step using M2-agarose beads (Sigma). FLAG-SPT5 was eluted using 0.5 mg/ml FLAG peptide (Sigma) in BC100 buffer.

Mass Spectrometry
On Bead Trypsin Digestion-The beads were resuspended in 25 mM NH 4 HCO 3 , pH 8.4, and heated at 95°C for 5 min to denature the proteins. The samples were digested overnight with 2 g of trypsin at 37°C. After centrifugation the supernatant containing the peptides was collected, the beads washed twitch with 25 mM NH 4 HCO 3 , pH 8.4, and the supernatant was collected for maximal recovery of peptides. The tryptic digest was lyophilized and then reconstituted in 25% acetonitrile, 0.1% FA for fractionation using strong cation exchange liquid chromatography (LC). The strong cation exchange fractionation was performed as described (53).
LC-MS/MS Analysis-All LC-MS/MS analysis was performed on an LTQ Velos Pro ion trap mass spectrometer (Thermo Scientific, San Jose, CA). Dried tryptic digest were resuspended in 12 l of 0.1% TFA, and 6 l was loaded onto Acclaim TM PepMap TM 100 C18 LC column (Thermo Scientific) using a Thermo Easy nLC 1000 liquid chromatography system (Thermo Scientific) connected online to the mass spectrometer. After sample injection, the column was washed, and peptides were eluted using a linear gradient. The mass spectrometer was operated in a data-dependent mode with each full MS scan followed by 15 data-dependent MS/MS acquisitions.
Protein Identification-Acquired MS/MS spectra were searched against a human UniProt protein database, 2014, using a SEQUEST and Fixed Value PSM validator algorithms in the Proteome Discoverer 1.4 software (Thermo Scientific). The precursor ion tolerance was set at 1.5 Da, and the fragment ions tolerance was set at 0.6 Da along with methionine oxidation included as dynamic modification. Only fully tryptic peptides with up to two miscleavages with charge state-dependent cross-correlation Xcorr Ն 2.1 for [MϩH] 1ϩ , Ն 2.5 for [Mϩ2H] 2ϩ , and Ն 3.2 for [Mϩ3H] 3ϩ and delta correlation (⌬C n ) Ն 0.08 were considered as positive identification.