Originally published In Press as doi:10.1074/jbc.M505292200 on September 7, 2005
J. Biol. Chem., Vol. 280, Issue 45, 37681-37688, November 11, 2005
Different Strategies for Carboxyl-terminal Domain (CTD) Recognition by Serine 5-specific CTD Phosphatases*
Stéphane Hausmann
,
Hisashi Koiwa
,
Shankarling Krishnamurthy£,
Michael Hampsey£, and
Stewart Shuman, An American Cancer Society Research Professor
1
From the
Molecular Biology Program, Sloan-Kettering Institute, New York, New York 10021,
Department of Horticultural Sciences, Texas A&M University, College Station, Texas 77843, and the £Department of Biochemistry, Robert Wood Johnson Medical School, Piscataway, New Jersey 08854
Received for publication, May 13, 2005
, and in revised form, August 31, 2005.
 |
ABSTRACT
|
|---|
The phosphorylated carboxyl-terminal domain (CTD) of RNA polymerase II, consisting of (1YSPTSPS7)n heptad repeats, encodes information about the state of the transcriptional apparatus that can be conveyed to factors that regulate mRNA synthesis and processing. Here we describe how the CTD code is read by two classes of protein phosphatases, plant CPLs and yeast Ssu72, that specifically dephosphorylate Ser5 in vitro. The CPLs and Ssu72 recognize entirely different positional cues in the CTD primary structure. Whereas the CPLs rely on Tyr1 and Pro3 located on the upstream side of the Ser5-PO4 target site, Ssu72 recognizes Thr4 and Pro6 flanking the target Ser5-PO4 plus the downstream Tyr1 residue of the adjacent heptad. We surmise that the reading of the CTD code does not obey uniform rules with respect to the location and phasing of specificity determinants. Thus, CTD code, like the CTD structure, is plastic.
 |
INTRODUCTION
|
|---|
The carboxyl-terminal domain (CTD)2 of the largest subunit of RNA polymerase II is a landing pad for proteins and multiprotein complexes that regulate transcription and catalyze mRNA processing. The CTD is composed of a tandemly repeated heptapeptide of consensus sequence 1YSPTSPS7. CTD positions Ser5 and Ser2 undergo waves of phosphorylation and dephosphorylation during the transcription cycle. The potential complexity of the CTD serine phosphorylation array comprises 4n different structures, where n is the number of heptad repeats (n varies from 15 in microsporidia to 52 in mammals). Thus, the CTD is a vast repository of information about the state of the transcriptional apparatus (a so-called "CTD code" (1)) that can be read by trans-acting factors. Studies of the interaction of mRNA capping enzymes with the phosphorylated CTD have illuminated its atomic structure and how CTD length, amino acid sequence, and the phosphorylation array influence CTD-PO4 effector functions (2-4). Three important points about the CTD code have emerged: (i) the phosphorylated CTD is structurally plastic and can assume markedly different conformations depending on its binding partner (4-7); (ii) CTD primary structure is recognized independent of CTD phosphorylation state (2, 3, 8, 9); and (iii) encoded CTD-PO4 information can be assembled from multiple noncontiguous repeats (4).
The information content of the CTD reflects the instantaneous balance between the activities of CTD kinases and CTD phosphatases at either some or all of the serine phosphorylation sites. A subject of intense interest (and debate) is whether and how CTD kinases and phosphatases are themselves responsive to specificity determinants within the CTD. Among the CTD phosphatases, some enzymes fail to discriminate between Ser2-PO4 and Ser5-PO4 substrates (9, 10), whereas others display a preference (2- to 10-fold) for either Ser2-PO4 or Ser5-PO4 (11, 12). Recently, we described two types of CTD phosphatases that act exclusively on Ser5-PO4: (i) the paralogous plant enzymes CPL1 and CPL2, which were identified genetically as regulators of osmotic stress and abscisic acid-responsive transcription in Arabidopsis thaliana (13, 14), and (ii) yeast Ssu72, which interacts with TFIIB and with proteins involved in RNA 3'-end formation (15-20).
CPL1/2 and Ssu72 belong to different enzyme families. The plant CPL1/2 proteins, like the prototypal CTD phosphatase Fcp1, are members of the DXDXT superfamily of metal-dependent phosphotransferases that act via an aspartyl-phosphoenzyme intermediate (8, 11, 13, 21-23). Ssu72 belongs to the CXXXXXR superfamily of metal-independent phosphohydrolases that act via a cysteinyl phosphoenzyme intermediate (15, 24, 25). This situation, whereby nature has selected two entirely different structural solutions to perform the same site-specific chemical transformation of the CTD, provides an opportunity to address key questions about how the CTD code is read. Here we demonstrate that CPL1/2 and Ssu72 rely on different positional cues in the CTD primary structure to recognize and hydrolyze Ser5-PO4.
 |
EXPERIMENTAL PROCEDURES
|
|---|
Recombinant ProteinsThe CPL1-(1-646) and CPL2-(1-649) coding sequences were amplified by PCR using primers designed to introduce BglII sites at the start codon and immediately 3' of the stop codon. The BglII-digested PCR products were inserted into pET28-His10Smt3 that had been linearized with BamHI. The resulting expression plasmids pET-His10Smt3-CPL1-(1-646) and pET-His10Smt3-CPL2-(1-649) encode the respective CPL proteins fused in-frame to an amino-terminal His10Smt3 domain consisting of a His10 leader (MGHHHHHHHHHHSSGHIEGRH) followed by the 98-amino acid Saccharomyces cerevisiae Smt3 protein and a single serine. (Smt3 is the yeast ortholog of the small ubiquitin-like modifier SUMO.) pET-His10Smt3 plasmids encoding mutated versions CPL1-(1-646)-D161A and CPL2-(1-649)-D144A were generated by two-stage overlap extension PCR and cloning of the BglII-digested mutated PCR products into pET28-His10Smt3. The pET-His10Smt3-CPL plasmids were transformed into Escherichia-coli BL21-CodonPlus(DE3). Cultures (500 ml) derived from single transformants were grown at 37 °C in Luria Bertani medium containing 50 µg/ml kanamycin and 50 µg/ml chloramphenicol until the A600 reached 0.6. The cultures were adjusted to 0.2 mM isopropyl-1-thio-
-D-galactopyranoside and 2% ethanol, and incubation was continued for 20 h at 17 °C. Cells were harvested by centrifugation and stored at -80 °C. All subsequent procedures were performed at 4 °C. Thawed bacteria were resuspended in 25 ml of buffer A (50 mM Tris-HCl, pH 8.0, 200 mM NaCl, 10% glycerol). Phenylmethylsulfonyl fluoride and lysozyme were added to final concentrations of 500 µM and 100 µg/ml, respectively. After incubation on ice for 30 min, Triton X-100 was added to a final concentration of 0.1%, and the lysate was sonicated to reduce viscosity. Insoluble material was removed by centrifugation. The soluble extracts were mixed for 30 min with 1 ml of Ni2+- nitrilotriacetic acid-agarose (Qiagen) that had been equilibrated with buffer A containing 0.1% Triton X-100. The resin was recovered by centrifugation, resuspended in buffer A, and poured into columns. The columns were washed with 10 ml of 20 mM imidazole in buffer A and then eluted stepwise with 1.5 ml of buffer A containing 50, 100, 250, and 500 mM imidazole. The polypeptide compositions of the column fractions were monitored by SDS-PAGE. The recombinant His10Smt3-CPL polypeptides were recovered predominantly in the 250 mM imidazole fractions. The 250 mM imidazole eluates were dialyzed against buffer A containing 2 mM dithiothreitol and 0.01% Triton X-100 and then stored at -80 °C. The CPL1 and CPL2 concentrations were determined by SDS-PAGE analysis of serial dilutions of the CPL preparations in parallel with serial dilutions of a BSA standard. The gels were stained with Coomassie Blue, and the staining intensities of the His10Smt3-CPL and BSA polypeptides were quantified using a Fujifilm FLA-5000 digital imaging and analysis system. CPL1 and CPL2 concentrations were calculated by interpolation to the BSA standard curve.
Wild-type Ssu72 and the C15S mutant were produced in E. coli as glutathione S-transferase fusions and purified from soluble bacterial extracts by glutathione-Sepharose affinity chromatography as described (15). Protein concentration was measured with the Bio-Rad dye reagent using BSA as the standard.
Velocity SedimentationAliquots (40 µg) of the nickel-agarose preparations of CPL1 and CPL2 were mixed with catalase (45 µg), BSA (45 µg), and cytochrome c (45 µg), and the mixtures were applied to 4.8-ml 15-30% glycerol gradients containing 50 mM Tris-HCl (pH 8.0), 0.2 M NaCl, 1 mM EDTA, 2 mM dithiothreitol, 0.01% Triton X-100. The gradients were centrifuged in a SW50 rotor at 50,000 rpm for 17 h at 4 °C. Fractions (
0.26 ml) were collected from the bottoms of the tubes.
CTD PhosphopeptidesCTD Ser-PO4 peptides were synthesized and purified by the Sloan-Kettering Microchemistry Core Laboratory as described previously (2, 8). The peptides were dissolved in 10 mM Tris-HCl (pH 7.4), 1 mM EDTA and stored at 4 °C. The molar concentrations of the phosphopeptides were initially estimated from the absorbance at 274 nM using an extinction coefficient of 1.4 x 103 M-1 for tyrosine. The content of Ser-PO4 was then determined for each peptide, measuring the release of inorganic phosphate after digestion with calf intestinal phosphatase as described (8).
CTD Phosphatase AssayReaction mixtures (25 µl) containing 50 mM Tris acetate (pH 5.5), 10 mM MgCl2, CTD phosphopeptide, and CPL1 or CPL2 were incubated for 60 min at 37 °C. Reaction mixtures (25 µl) containing 50 mM Tris acetate (pH 7.0), 5 mM dithiothreitol, CTD phosphopeptide, and Ssu72 as specified were incubated for 120 min at 30 °C. The reactions were quenched by adding 0.5 ml of malachite green reagent (BIOMOL Research Laboratories, Plymouth Meeting, PA). Release of phosphate was determined by measuring A620 and interpolating the value to a phosphate standard curve. The amounts of CTD substrate in the CTD phosphatase reactions are expressed as input phosphoserine, as determined by CIP digestion performed in parallel with each assay.
 |
RESULTS AND DISCUSSION
|
|---|
Physical and Biochemical Characterization of CPL1 and CPL2CPL1 and CPL2 were shown previously to dephosphorylate in vitro Ser5 of the Arabidopsis CTD (consisting of 34 heptad repeats), but not Ser2 (13).Exclusive hydrolysis of Ser5-PO4, but not Ser2-PO4, was also demonstrated using defined synthetic CTD phosphopeptides. Physical and biochemical characterization of the plant phosphatases was hampered initially by the poor solubility of the recombinant proteinsinE. coli and their susceptibility to proteolysis invivo. Deletion analysis showed that the carboxyl-terminal segments of CPL1 and CPL2 could be removed without diminishing phosphatase activity or affecting the Ser5 specficity (13).3 Here, we produced the catalytically active amino-terminal domains as His10Smt3-CPL fusions; this maneuver improved their yield and solubility compared with previous expression strategies. The His10Smt3-tagged proteins CPL1-(1-646) and CPL2-(1-649) were isolated from soluble bacterial extracts by adsorption to nickel-agarose and elution with imidazole. SDS-PAGE revealed the presence of polypeptides corresponding to the intact fusion proteins (Fig. 1A, arrows). The apparent sizes of the Smt3-CPL fusion proteins by SDS-PAGE (97-98 kDa) were larger than their calculated molecular masses of 87 kDa; this is because the Smt3 domain migrates aberrantly during SDS-PAGE, appearing
10 kDa larger than its predicted size. The CPL1 and CPL2 preparation contained several polypeptides in the 60-90-kDa range and a cluster of smaller polypeptides migrating at 20-27 kDa (the latter correspond to His10Smt3- and His10Smt3-peptide fusions arising via proteolysis) (Fig. 1A and data not shown).
The quaternary structure of recombinant CPL1 and CPL2 was investigated by zonal velocity sedimentation through a 15-30% glycerol gradient (Fig. 1, B and C). Marker protein catalases (native size 248 kDa), BSA (66 kDa), and cytochrome c (12 kDa) were included as internal standards in the gradient. The tagged CPL1 and CPL2 proteins sedimented as discrete peaks coincident with BSA. CPL1 and CPL2 were clearly separated from the cluster of low molecular weight polypeptides, which cosedimented with cytochrome c. The CTD Ser5 phosphatase activity profile (measured by the release of Pi from a synthetic phosphopeptide) paralleled the abundance of the intact CPL1 and CPL2 polypeptides. These results are consistent with a monomeric quaternary structure for the catalytic domains of CPL1 and CPL2.
CPL1- and CPL2-catalyzed release of Pi from a 28-amino acid tetraheptad CTD Ser5-PO4 substrate was proportional to enzyme concentration (expressed as the amount of full-length CPL fusion protein); 95% of the input Ser5-PO4 residues were hydrolyzed at saturating CPL levels (Fig. 2A). From the slope of the titration curves we estimated that CPL1 and CPL2 hydrolyzed a 6400- and 9300-fold molar excess of Ser5-PO4/enzyme, respectively, during the 60-min reaction. Neither CPL1 nor CPL2 catalyzed Pi release from a tetraheptad CTD Ser2-PO4 substrate (Fig. 2A). A kinetic analysis is shown in Fig. 2B of the reaction of CPL2 with the tetraheptad CTD-Ser5-PO4 substrate at two different levels of input enzyme. Product accumulated steadily with a pseudo-first order profile. The initial rate was proportional to enzyme concentration; the turnover number was 13 s-1.
Reaction of CPL1 and CPL2 with a 14-amino acid diheptad CTD Ser5-PO4 substrate YSPTSPSYSPTSPS resulted in Pi release proportional to the amount of input enzyme (Fig. 2C). 85% of the input phosphoserine was hydrolyzed at saturating enzyme levels (Fig. 2C and data not shown). CPL1 and CPL2 released an
11,000-fold molar excess of Pi/enzyme during the 60-min reaction. In contrast, CPL1 and CPL2 were inert in hydrolyzing a diheptad CTD Ser2-PO4 substrate YSPTSPSYSPTSPS (Fig. 2C). This experiment showed that two heptad repeats suffice for Ser5-specific CTD phosphatase activity.

View larger version (28K):
[in this window]
[in a new window]
|
FIGURE 1. Purification and velocity sedimentation of CPL1 and CPL2. A, aliquots of the nickel-agarose fractions containing 2.5 µg of the His10Smt3-CPL1-(1-646) and His10Smt3-CPL2-(1-649) polypeptides (arrows) were analyzed by SDS-PAGE. The gel was stained with Coomassie Blue dye. The positions and sizes (kDa) of marker polypeptides are indicated on the left. B and C, sedimentation analysis of CPL1 (B) and CPL2 (C) was performed as described under "Experimental Procedures." Aliquots (20 µl) of odd-numbered glycerol gradient fractions were analyzed by SDS-PAGE. The Coomassie Blue-stained gels are shown (top panels). The phosphatase activity profiles are shown in the bottom panels. Phosphatase reaction mixtures containing 2 µl of a 1/8 dilution of the gradient fractions and 2.9 nmol CTD phosphopeptide SPSYSPTSPS were incubated for 60 min at 37 °C.
|
|

View larger version (10K):
[in this window]
[in a new window]
|
FIGURE 2. Ser5 specificity of CPL1 and CPL2. A, reaction mixtures containing 23 µM tetraheptad Ser5-PO4 substrate (YSPTSPS)4 (2.3 nmol Ser-PO4) or 25 µM tetraheptad Ser2-PO4 substrate (YSPTSPS)4 (2.5 nmol Ser-PO4) and either CPL1 (circles) or CPL2 (squares) as specified were incubated for 60 min at 37 °C. B, reaction mixtures containing (per 25 µl) 29 µM tetraheptad Ser5-PO4 substrate (2.9 nmol Ser-PO4) and either 25 or 50 ng of CPL2 were incubated at 37 °C. Aliquots (25 µl) were withdrawn at the times specified and quenched immediately. C, reaction mixtures containing the diheptad Ser5-PO4 substrate YSPTSPSYSPTSPS (5.4 nmol Ser-PO4) or diheptad Ser2-PO4 substrate YSPTSPSYSPTSPS (5.1 nmol Ser-PO4) and either CPL1 (circles) or CPL2 (squares) as specified were incubated for 60 min at 37 °C.
|
|
CPL1 and CPL2 are putative members of the DXDXT phosphotransferase family. The signature feature of these enzymes is the formation of an intermediate in which phosphate is attached covalently to the first Asp residue of the DXDXT motif (23). We mutated the presumptive Asp nucleophiles of CPL1 (Asp161) and CPL2 (Asp144) to alanine, produced the His10Smt3-CPL1-(1-646)-D161A and His10Smt3-CPL2-(1-649)-D144A proteins in bacteria, and isolated them from soluble extracts by affinity chromatography. The polypeptide compositions of the mutant proteins were virtually identical to those of the wild-type CPL1 and CPL2 (not shown). The CPL1-D161A and CPL2-D144A preparations displayed no detectable CTD-Ser5 phosphatase activity at levels of input protein that were saturating for the respective wild-type enzymes (not shown). These results suggest that the plant CPLs are bona fide acyl-phosphatases and, in conjunction with the sedimentation data, prove that the observed CTD-Ser5-specific CTD phosphatase activity is intrinsic to the recombinant CPLs.
CTD Ser5 Phosphatase Activity of Ssu72The specificity of Ssu72 was characterized using a recombinant glutathione S-transferase-Ssu72 fusion protein purified from a soluble bacterial lysate by glutathione-affinity chromatography (Fig. 3D). Ssu72 hydrolyzed Pi from the tetraheptad Ser5-PO4 substrate; activity reached a plateau at 2.5-5 µg of input protein (Fig. 3A), at which point 55% of the input Ser5-PO4 residues had been hydrolyzed. Product accumulated steadily during a 3-h reaction (Fig. 3B), which signified that the low end point was not dictated by inactivation of the enzyme during the incubation. Additional experiments showed that, although adding more Ssu72 to the reaction after the end point had been achieved did not result in further Pi release, adding more peptide without more enzyme did result in hydrolysis of
50% of the additional substrate (not shown). Ssu72 failed to release Pi from the tetraheptad Ser2-PO4 substrate (Fig. 3, A and B).

View larger version (13K):
[in this window]
[in a new window]
|
FIGURE 3. Ser5 specificity of Ssu72. A, reaction mixtures containing 29 µM tetraheptad Ser5-PO4 substrate (2.9 nmol Ser-PO4) or 30 µM tetraheptad Ser2-PO4 substrate (3.0 nmol Ser-PO4) and Ssu72 as specified were incubated for 2 h at 30°C. B, reaction mixtures containing (per 25 µl) 24 µM tetraheptad Ser5-PO4 substrate (2.4 nmol Ser-PO4) or 32 µM tetraheptad Ser2-PO4 (3.2 nmol Ser-PO4) substrate and 1.25 µg of Ssu72 were incubated at 30 °C. Aliquots (25 µl) were withdrawn at the times specified and quenched immediately. C, reaction mixtures containing the diheptad Ser5-PO4 substrate (5.1 nmol Ser-PO4) or diheptad Ser2-PO4 substrate (4.2 nmol Ser-PO4) and Ssu72 as specified were incubated for 2 h at 30 °C. D, aliquots (5 µg) of the glutathione S-transferase-tagged wild-type Ssu72 and C15S mutant proteins were analyzed by SDS-PAGE. The polypeptides were visualized by staining the gel with Coomassie Blue dye. The positions and sizes (kDa) of marker polypeptides are indicated on the left.
|
|
Ssu72 also hydrolyzed the diheptad CTD Ser5-PO4 substrate YSPTSPSYSPTSPS. 50% of the input phosphoserine was hydrolyzed at saturating enzyme levels (Fig. 3C). We estimated Ssu72 hydrolyzed a 150-fold molar excess of Ser5-PO4/enzyme during the 120-min reaction, which corresponds to a turnover number of 1.2 min-1. Ssu72 was unreactive with the diheptad CTD Ser2-PO4 substrate YSPTSPSYSPTSPS (Fig. 3C). Ssu72 resembles the low molecular weight cysteinyl phosphatase enzymes. The presumptive catalytic mechanism entails attack of a cysteine thiolate nucleophile on the substrate to form a cysteinyl-phosphoenzyme intermediate. Here we found that the purified recombinant active site mutant Ssu72-C15S (Fig. 3D) was incapable of hydrolyzing the CTD-Ser5-PO4 peptide substrate (not shown).
CPLs and Ssu72 Recognize Different Specificity Determinants in the CTDTo probe the role of CTD primary structure in Ser5 phosphatase activity, we tested mutated versions of the tetraheptad CTD Ser5-PO4 peptide (YSPTSPS)4, wherein every Tyr1, Ser2, Pro3, Thr4, or Pro6 was replaced by alanine. The titration profiles of the reaction of CPL1 with the S2A, T4A and P6A substrates were similar to that of the wild-type CTD (Fig. 4A). However, the CPL1 activity profiles with the P3A and Y1A substrates displayed a shift to the right (Fig. 4A). The specific activities of CPL1 with the P3A and Y1A substrates were 16 and 4%, respectively, of the activity with the wild-type (YSPTSPS)4 substrate. CPL2 hydrolyzed the S2A and T4A CTDs as well as the wild-type CTD. Activity was reduced modestly by the P6A change (to 27% of wild-type) and drastically by the P3A (3%) and Y1A (1%) CTD mutations (Fig. 4B). These experiments demonstrate that Tyr1 and Pro3 are the critical determinants of CPL phosphatase activity at position Ser5.
Ssu72 responded quite differently to changes in CTD primary structure (Fig. 4C). The P3A change that was deleterious to the CPLs had virtually no ill effect on Ssu72, whereas the T4A mutation, which was well tolerated by the CPLs, was inimical to Ssu72, reducing its specific activity to 3% of the wild-type CTD level (Fig. 4C). Moreover, the P6A lesion abolished Ser5 phosphatase activity of Ssu72, though it had no effect on CPL1 and only a modest effect on CPL2. Ssu72 activity was reduced severely (to 4% of wild-type) by the Y1A change and was unaffected by S2A. We conclude that: (i) Tyr1, Thr4, and Pro6 are the critical determinants of Ssu72 phosphatase, and (ii) different CTD phosphatases with the same exquisite specificity for Ser5-PO4 achieve their specificities by recognizing different structural cues in the CTD.
By titrating the wild-type (YSPTSPS)4 substrate against fixed amounts of CPL1, CPL2, and Ssu72, we determined apparent Km values of 60, 140, and 280 µM, respectively (not shown). These values are higher than the substrate concentrations (
30 µM) used in the specific activity determinations in Fig. 4. Thus, we retested the activity of CPL1 and CPL2 with the wild-type, Y1A, P3A, and P6A tetraheptad Ser5-PO4 peptides at substrate concentrations (240-260 µM) in excess of the Km for wild-type peptide (Fig. 5). The hierarchy of mutational effects at higher substrate was similar to what was observed at limiting substrate. The extents of phosphate release by CPL1 with the P6A, P3A, and Y1A substrates were 96, 27, and 13%, respectively, of the activity with the wild-type (YSPTSPS)4 substrate (Fig. 5, left panel). Phosphate release by CPL2 with the P6A, P3A, and Y1A substrates was 64, 7, and 4% of activity with the wild-type peptide (middle panel). Thus, raising the CTD concentration above Km only slightly mitigated the effects of the Y1A or P3A changes on CPL1 activity, or the effects of Y1A, P3A, and P6A on CPL2 function, vis-à-vis their activity with the wild-type tetraheptad substrate. A parallel experiment with Ssu72 at the higher substrate concentrations showed that phosphate release with the Y1A, T4A, and P6A substrates was 8, 12, and 3% of its activity with the wild-type peptide (Fig. 5, right panel).
Effects of a Lysine Substitution at Position 7 of the CTD HeptadThe identity of the amino acid at position 7 of the CTD is either highly conserved (e.g. all serines in the Schizosaccharomyces pombe CTD (GenBankTM accession code CAB57941
[GenBank]
) and Encephalitozoon cuniculi CTD (GenBankTM CAD26175
[GenBank]
)) or subject to variation from one heptad to another within the same CTD. In the case of human RNA polymerase II (GenBankTM P24928
[GenBank]
), one-half (26/52) of the heptads have a consensus Ser7, and the others are substituted, most commonly with Lys7 (7 heptads) or Asn7 (5 heptads). The CTD of the malaria parasite Plasmodium falciparum (GenBankTM Z98551
[GenBank]
) is dominated by Lys7-containing heptads (11/15 repeats).

View larger version (24K):
[in this window]
[in a new window]
|
FIGURE 4. CPLs and Ssu72 recognize different specificity determinants in the CTD. Reaction mixtures contained 29-37 µM wild-type or mutated tetraheptad CTD phosphopeptides (2.9-3.7 nmol Ser-PO4) and either CPL1 (A), CPL2 (B), or Ssu72 (C) as specified.
|
|

View larger version (8K):
[in this window]
[in a new window]
|
FIGURE 5. Effect of Y1A, P3A, T4A, and P6A mutations at higher substrate concentrations. Reaction mixtures contained 240-260 µM wild-type or mutated tetraheptad CTD phosphopeptides (24-26 nmol Ser-PO4) and either 50 ng of CPL1 (left panel), 50 ng of CPL2 (middle panel), or 2.5 µg of Ssu72 (right panel).
|
|
To gauge the role, if any, of the position 7 side chain as a CTD phosphatase specificity determinant, we tested a tetraheptad CTD Ser5-PO4 peptide (YSPTSPK)4 in which every Ser7 was replaced by lysine. The titration profiles of the reaction of CPL1 and CPL2 with the S7K substrates were similar to that of the wild-type CTD (Fig. 6, A and B). Ssu72 was
30% as active with the S7K CTD substrate as it was with the wild-type CTD (Fig. 6C). Thus, neither the CPLs nor Ssu72 depend on Ser7 for their Ser5-specific phosphatase activity.
Different Phasing of Specificity Determinants for CPLs and Ssu72 Further insight into the location of the specificity determinants emerged from an analysis of the reaction of CPLs and Ssu72 with diheptad monophosphorylated CTDs, which contained Ser5-PO4 in either the amino-terminal heptad (YSPTSPSYSPTSPS) or the carboxyl-terminal heptad (YSPTSPSYSPTSPS). Ssu72 displayed a 10-fold preference for hydrolysis of Ser5-PO4 within the amino-terminal heptad (Fig. 7C). This result implies that the recognition of Tyr1 by Ssu72 occurs on the carboxyl side of the Ser5-PO4 moiety that is being hydrolyzed, i.e. Ssu72 has higher activity on the proximal repeat because Ssu72 recognizes the Thr4 and Pro6 flanking the Ser5-PO4 and the Tyr1 in the distal repeat (see Fig. 9). Ssu72 is only weakly active on the distal repeat because there is no Tyr1 available on the carboxyl side of the phosphoserine. From a substrate titration experiment at a fixed level of Ssu72, we determined an apparent Km of 850 µM for the YSPTSPSYSPTSPS peptide (data not shown); thus, the affinity of the enzyme for a tetraheptad CTD with four phosphoserines is
3-fold higher than for a diheptad with a single phosphoserine.

View larger version (10K):
[in this window]
[in a new window]
|
FIGURE 6. Effect of changing Ser7 to lysine. Reaction mixtures contained 28 µM wild-type or 30 µM S7K tetraheptad CTD phosphopeptides (2.8 and 3 nmol Ser-PO4) and either CPL1 (A), CPL2 (B), or Ssu72 (C) as specified.
|
|

View larger version (12K):
[in this window]
[in a new window]
|
FIGURE 7. Different phasing of the specificity determinants of CPLs and Ssu72. The diheptad monophosphorylated CTD phosphopeptides are depicted with Ser5-PO4 positions highlighted in shaded boxes. Reaction mixtures contained 2.5-2.6 nmol CTD phosphopeptide and either CPL1 (A), CPL2 (B), or Ssu72 (C) as specified.
|
|
CPL1 and CPL2 display the opposite preference; they are 2- to 3-fold more active in hydrolyzing Ser5-PO4 when it is located in the carboxyl-terminal heptad (Fig. 7, A and B). These results suggests that: (i) the CPLs recognize the Tyr1 residue upstream of the phosphorylated Ser5 that is being hydrolyzed, and (ii) 2 amino acids downstream of Ser5-PO4 suffice. From substrate titration experiments, we determined apparent Km values of CPL1 and CPL2 for the preferred YSPTSPSYSPTSPS peptide of 100 and 570 µM, respectively (data not shown). The disfavored CPL substrate does contain a Tyr1 upstream of Ser5-PO4 in the first repeat; perhaps the free amine of the terminal Tyr1 (which adds a positive charge not present normally) might account for the lower activity of this substrate. Alternatively, 1 or more amino acids on the upstream side of Tyr1 might contribute to substrate recognition, either directly or by limiting the flexibility of the Tyr1 residue. To address these issues, we tested a series of incrementally truncated 12-mer (PTSPSYSPTSPS), 10-mer (SPSYSPTSPS), and 8-mer (SYSPTSPS) peptides containing a complete Ser5-PO4 heptad at their carboxyl termini and either 5, 3, or 1 amino acids from an upstream heptad (Fig. 8). CPL1 and CPL2 readily hydrolyzed the 12- and 10-mer substrates, whereas their activities with the 8-mer were reduced by factors of 2 and 3, respectively (Fig. 8). We surmise that CPL1/2 activity is optimal with as few as 3 amino acids on the amino-terminal side of Tyr1. Apparent Km values of CPL1 and CPL2 for the 12-mer PTSPSYSPTSPS were 180 and 950 µM, respectively (not shown).
Properties of the CTD CodeHere we have begun to illuminate how the CTD code is read by two classes of protein phosphatases that specifically dephosphorylate Ser5. The instructive findings are that the plant CPLs and yeast Ssu72 recognize entirely different constellations of specificity determinants in the CTD. Whereas the CPLs rely on Tyr1 and Pro3 located on the upstream side of the Ser5-PO4 target site, Ssu72 recognizes Thr4 and Pro6 flanking the target Ser5-PO4 plus the downstream Tyr1 residues of the adjacent heptad (Fig. 9). The two classes of phosphatases not only see different amino acids, they see them in entirely different registers with respect to Ser5-PO4.

View larger version (16K):
[in this window]
[in a new window]
|
FIGURE 8. Minimizing the substrate for plant CPLs. The sequences of the CTD phosphopeptides are shown with Ser5-PO4 positions highlighted in shaded boxes. Reaction mixtures containing 3.1-3.3 nmol CTD peptide and CPL1 (A) or CPL2 (B) as specified were incubated for 60 min at 37 °C.
|
|
The simple interpretation of our results is that: (i) the functionally relevant CTD structural elements comprise the respective binding sites for the CPLs and Ssu72, and (ii) those CTD elements are likely to be disposed on the surface of the CTD Ser5-PO4 substrate to which the phosphatase docks. The most plausible scenario is that the proper CTD conformation for catalysis is templated by interaction with the phosphatase via an induced fit mechanism, as shown for other CTD-binding proteins. Although we do not yet have an atomic structure of the CTD bound to either of the phosphatases studied here, we can make an educated guess as to what conformations the phosphatases might select for, based on principles emerging from the structure of the mRNA capping enzyme bound to the Ser5-PO4 CTD (4). Interactions with the phosphorylated CTD occur at two distinct CTD-docking sites (CDS1 and CDS2) on the guanylyltransferase surface. CDS1 engages a CTD segment TSPSYSP, whereas CDS2 interacts with a downstream segment SYSPTSP located two heptads away. Thus, the capping enzyme samples two distinct CTD structures that are phased in different registers with respect to Ser5-PO4. The CTD bound at sites CDS1 and CDS2 adopts a
-like extended conformation, whereby every other residue is oriented toward or away from the protein surface (Fig. 9).
It is remarkable that the CTD segments and the CTD conformations at CDS1 and CDS2 of the capping enzyme satisfy reasonably well the specificity parameters elucidated here for Ssu72 and the CPLs, respectively. To wit, the Tyr1, Pro3, and Ser-PO4 moieties that comprise the recognition code for CPL1/2 project onto a common surface within CDS2, whereas Ser2 and Thr4, which are not needed for CPL1/2 activity, project away from the docking site (Fig. 9). Pro6 also projects onto the protein side of the interface in CDS2; this residue is not critical for CPL1 phosphatase activity but does play a modest role in CPL2 function. We envision that the CTD adopts a CDS2-like conformation in its interaction with the CPL1 and CPL2 phosphatases.
The Ser5-PO4, Pro6, and Tyr1 moieties that Ssu72 recognizes are located on the protein side of the interface in CDS1, whereas Ser2 and Pro3, to which Ssu72 is indifferent, project away from the protein surface (Fig. 9). We therefore speculate that the CTD adopts a CDS1-like conformation in its interaction with Ssu72. Although Thr4 does not contribute to the CTD-protein interface at CDS1 of the capping enzyme, the Thr O
is in position to donate a hydrogen bond to the vicinal phosphate group at Ser5 (4).
The finding of at least two CTD coding elements that signal specific hydrolysis of Ser5-PO4 has implications for understanding CTD information content. First, the functional "footprints" of CPL1/2 and Ssu72 on the CTD primary structure collectively span more than one heptad repeat, a state of affairs that is consistent with recent genetic evidence that the essential functional unit of the CTD is contained with a contiguous diheptad pair (26). Second, there is no simple rule for what is recognized. The CTD phosphorylation sites are located within Ser-Pro dipeptides. With respect to CTD Ser5-specific dephosphorylation, Ssu72 is identified here as a stringently Pro6-directed phosphatase, whereas CPL1 is Pro6-independent and CPL2 is only modestly affected by elimination of Pro6. Third, even though the CTD is structurally plastic, there are likely to exist a set of CTD conformational states that are utilized reiteratively by different CTD-binding proteins, even where the proteins are themselves not structurally related (e.g. the CTD phosphatases and the mRNA capping enzyme). Fourth, the information content of the CTD can be amplified by blending multiple distinct CTD coding elements (such as those seen here for the CPLs and Ssu72) to form a bipartite, or even higher order, recognition site for CTD-associated factors (such as the capping enzyme).
 |
FOOTNOTES
|
|---|
* This work was supported by National Institutes of Health Grants GM52470 (to S. S.) and GM68887 (to M. H.) and National Science Foundation Grant MCB0421889 (to H. K.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 
1 To whom correspondence should be addressed. E-mail: s-shuman{at}ski.mskcc.org.
2 The abbreviations used are: CTD, carboxyl-terminal domain; CDS, CTD-docking site; BSA, bovine serum albumin. 
3 S. Hausmann, unpublished information. 
 |
REFERENCES
|
|---|
- Buratowski, S. (2003) Nature Struct. Biol. 10, 679-680[CrossRef][Medline]
[Order article via Infotrieve]
- Ho, C.K., and Shuman, S. (1999) Mol. Cell 3, 405-411[CrossRef][Medline]
[Order article via Infotrieve]
- Pei, Y., Hausmann, S., Ho, C. K., Schwer, B., and Shuman, S. (2001) J. Biol. Chem. 276, 28075-28082[Abstract/Free Full Text]
- Fabrega, C., Shen, V., Shuman, S., and Lima, C. D. (2003) Mol. Cell 11, 1549-1561[CrossRef][Medline]
[Order article via Infotrieve]
- Noble, C. G., Hollingworth, D., Martin, S. R., Ennis-Adeniran, V., Smerdon, S. J., Kelly, G., Taylor, A. A., and Ramos, A. (2005) (2005) Nature Struct. Mol. Biol. 12, 144-151
- Meinhart, A., and Cramer, P. (2004) Nature 430, 223-226[CrossRef][Medline]
[Order article via Infotrieve]
- Verdecia, M. A., Bowman, M. E., Lu, K. P., Hunter, T., and Noel J. P. (2000) Nature Struct. Biol. 7, 639-643[CrossRef][Medline]
[Order article via Infotrieve]
- Hausmann, S., Erdjument-Bromage, H., and Shuman, S. (2004) J. Biol. Chem. 279, 10892-10900[Abstract/Free Full Text]
- Hausmann, S., Schwer, B., and Shuman, S. (2004) Biochemistry 43, 7111-7120[CrossRef][Medline]
[Order article via Infotrieve]
- Lin, P. S., Dubois, M. F., and Dahmus, M. E. (2002) J. Biol. Chem. 277, 45949-45956[Abstract/Free Full Text]
- Hausmann, S., and Shuman, S. (2002) J. Biol. Chem. 277, 21213-21220[Abstract/Free Full Text]
- Yeo, M., Lin, P. S., Dahmus, M. E., and Gill, G. N. (2003) (2003) J. Biol. Chem. 278, 26078-26085[Abstract/Free Full Text]
- Koiwa, H., Hausmann, S., Bang, W. Y., Ueda, A., Kondo, N., Hiraguri, A., Fukuhara, T., Bahk, J. D., Yun, D. J., Bressan, R. A., Hasegawa, P. M., and Shuman, S. (2004) Proc. Natl. Acad. Sci. U. S. A. 101, 14539-14544[Abstract/Free Full Text]
- Koiwa, H., Barb, A. W., Xiong, L., Li, F., McCully, M. G., Lee, B., Sokolchik, I., Zhu, J., Gong, Z., Reddy, M., Sharkhuu, A., Manabe, Y., Yokoi, S., Zhu, J. K., Bressan, R. A., and Hasegawa, P. M. (2002) Proc. Natl. Acad. Sci. U. S. A. 99, 10893-10898[Abstract/Free Full Text]
- Krishnamurthy, S., He, X., Reyes-Reyes, M., Moore, C., and Hampsey, M. (2004) Mol. Cell 14, 387-394[CrossRef][Medline]
[Order article via Infotrieve]
- Sun, Z., and Hampsey, M. (1996) Mol. Cell. Biol. 16, 1557-1566[Abstract]
- He, X., Khan, A. U., Cheng, H., Pappas, D. L., Hampsey, M., and Moore, C. L. (2003) Genes Dev. 17, 1030-1042[Abstract/Free Full Text]
- Dichtl, B., Blank, D., Ohnacker, M., Friedlin, A., Roeder, D., Langen, H., and Keller, W. (2002) Mol. Cell 10, 1139-1150[CrossRef][Medline]
[Order article via Infotrieve]
- Nedea, E., He, X., Kim, M., Pootoolal, J. H., Zhong, G., Canadien, V., Hughes, T., Buratowski, S., Moore, C. L., and Greenblatt, J. (2003) J. Biol. Chem. 278, 33000-33010[Abstract/Free Full Text]
- Steinmetz, E. J., and Brow, D. A. (2003) Mol. Cell. Biol. 23, 6339-6349[Abstract/Free Full Text]
- Hausmann, S., and Shuman, S. (2003) J. Biol. Chem. 278, 13627-13632[Abstract/Free Full Text]
- Kamenski, T., Heilmeier, S., Meinhart, A., and Cramer, P. (2004) Mol. Cell 15, 399-407[CrossRef][Medline]
[Order article via Infotrieve]
- Lahiri, S., Zhang, G., Dunaway-Mariano, D., and Allen, K. N. (2003) Science 299, 2067-2071[Abstract/Free Full Text]
- Ganem, C., Devaux, F., Torchet, C., Jacq, C., Quevillon-Cheruel, S., Labesse, G., Facca, C., and Faye, G. (2003) EMBO J. 7, 1588-1598[CrossRef]
- Meinhart, A., Silberzahn, T., and Cramer, P. (2003) J. Biol. Chem. 278, 15917-15921[Abstract/Free Full Text]
- Stiller, J. W., and Cook, M. S. (2004) Eukaryotic Cell 3, 735-740[Abstract/Free Full Text]

CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
R. Becker, B. Loll, and A. Meinhart
Snapshots of the RNA Processing Factor SCAF8 Bound to Different Phosphorylated Forms of the Carboxyl-terminal Domain of RNA Polymerase II
J. Biol. Chem.,
August 15, 2008;
283(33):
22659 - 22669.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Liu, A. L. Greenleaf, and J. W. Stiller
The Essential Sequence Elements Required for RNAP II Carboxyl-terminal Domain Function in Yeast and Their Evolutionary Conservation
Mol. Biol. Evol.,
April 1, 2008;
25(4):
719 - 727.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Reyes-Reyes and M. Hampsey
Role for the Ssu72 C-Terminal Domain Phosphatase in RNA Polymerase II Transcription Elongation
Mol. Cell. Biol.,
February 1, 2007;
27(3):
926 - 936.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Ansari and M. Hampsey
A role for the CPF 3'-end processing machinery in RNAP II-dependent gene looping
Genes & Dev.,
December 15, 2005;
19(24):
2969 - 2978.
[Abstract]
[Full Text]
[PDF]
|
 |
|
Copyright © 2005 by the American Society for Biochemistry and Molecular Biology.