|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 15, 10846-10852, April 13, 2007
A Novel Protein-processing Domain in Gli2 and Gli3 Differentially Blocks Complete Protein Degradation by the Proteasome*![]() ![]() 1
From the
Received for publication, September 6, 2006 , and in revised form, January 23, 2007.
The proteasome usually completely degrades its target proteins, but it can also degrade a handful of proteins in a limited and site-specific manner. The molecular mechanism for such limited degradation is unknown. The repressor forms of Gli2 and Gli3 transcription factors are generated from their full-length proteins through limited proteasome-mediated protein degradation. In this study, we have taken advantage of the fact that Gli3 is efficiently processed, whereas Gli2 is not, and identified a region of 200 residues in their C termini that determine differential processing of the two proteins. This region, named processing determinant domain, functions as a signal for protein processing in the context of not only Gli2 and Gli3 protein sequences but also a heterologous hybrid protein, which would otherwise be completely degraded by the proteasome. Thus, the processing determinant domain constitutes a novel domain that functions independently. Our findings explain, at the molecular level, why Gli2 and Gli3 are differentially processed and, more importantly, may help understand a probably general mechanism by which the proteasome degrades some of its target proteins partially rather than completely.
The ubiquitin-proteasome proteolytic system plays an important role in a wide variety of basic cellular processes. In the majority of these processes, the target proteins are completely degraded. In a few cases, however, the proteasome only partially degrades its target proteins. For instance, the generation of the p50 and p52 subunits of the transcriptional regulator NF- B from their full-length precursors p105 and p100 is mediated by the proteasome in a site-specific manner (16). Although the detailed mechanism of NF- Bp100/p105 processing still needs to be worked out (7, 8), such limited degradation is thought to be due to the presence of a glycine-rich region (GRR)2 located immediately upstream of the NF- B precursor cleavage site. The GRR region is proposed to serve as a "stop signal" to block further degradation of p50 or p52 (9, 10). More recently, it has been shown that a tightly folded domain upstream of GRR is also required as a part of the processing signal (34). Nevertheless, it is debatable whether GRR can function as an independent processing signal because fusing GRR with several proteins either closely related or unrelated to NF- B but known to be the substrates of the ubiquitin-proteasome system does not cause the proteins to be processed (10).
The generation of Gli3/Ci (Cubitus interruptus) transcription repressors from their full-length precursors in the Hedgehog (Hh) signaling pathway may be another instance of a protein being partially degraded by the proteasome. Several lines of evidence support this view. First, Ci/Gli3 processing requires the sequential phosphorylation of numerous serine residues at their C-terminal regions by at least three kinases: protein kinase A (PKA), casein kinase 1 (CK1), and glycogen synthase kinase 3 (GSK3) (1114). Second, Ci/Gli3 processing is also dependent on the proteasome (13, 15, 16) and on Slimb/ Gli2 is another member of the Gli/Ci family of transcription factors and shares with Gli3 a 44% sequence identity and conserved PKA, CK1, and GSK3 phosphorylation sites, but the protein is inefficiently processed in vivo (22), and its processing is undetectable in cell culture under conditions that have been used previously (2224). The molecular basis for such differential processing between Gli2 and Gli3 is not known. In this study, by taking advantage of the fact that Gli2 processing is inefficient, but Gli3 processing is readily detected in cell culture, we identified a specific region in Gli2 and Gli3 C termini that determines the extent to which these proteins are processed. This specific region, named processing determinant domain (PDD), can function as a degradation block in the context of not only Gli2 and Gli3 protein sequences but also an unrelated heterologous protein that is the target of the proteasome. Thus, the PDD is a novel domain that can function independently. Our study, at the molecular level, explains why Gli2 and Gli3 are differentially processed in vivo and why the majority of Gli3 is degraded partially but not completely. Our study also provides a potentially general mechanism by which the proteasome partially degrades its target proteins.
DNA ConstructsExpression constructs for mouse Gli2 (mGli2) and human Gli3 were described previously (24). Gli2-3CT contained the Gli3 C-terminal sequence starting from residue 648, whereas the Gli3-2CT chimera contained the Gli2 C terminus from residue 585. For all other chimeric constructs, amino acid residues that were swapped between Gli2 and Gli3 or deleted from Gli3 are indicated in diagrams in Figs. 2, 3, 4. All constructs were generated by a combination of PCR and restriction digestions. I B cDNA was obtained by reverse transcription-PCR. I B m, which contained S32A and S36A amino acid substitutions, was engineered by PCR. 3-HA-tagged triple hybrid expression constructs, HA-Tub-Gli2PDD-I B , HA-Tub-Gli2PDD-I B m, HA-Tub-Gli3PDD-I B , and HA-Tub-Gli3PDD-I B m, were created by inserting either the Gli2 sequence (585780 residues) or Gli3 sequence (648844 residues) between -tubulin at their N termini and I B or I B m at their C termini. Two glycine residues were inserted between -tubulin and PDD to avoid the possible steric interference. A mouse Ikk cDNA fragment encoding 1680 residues, which is catalytically active, was obtained from Hao Wu at Weill Medical College and cloned into NotI and EcoRV sites of the pRK expression vector. Nucleotide sequences for all constructs created by PCR were verified by sequencing analysis. Cell Culture, Transfection, Protein Analysis, and Cell StainingCell culture conditions, cell staining, methods of transfection, pharmacological treatment, and protein analysis for HEK293 cells and chick limb bud cells were as described (22, 24, 25).
The Efficiency of Gli2 and Gli3 Processing Is Determined by Their C TerminiWe have previously shown that the majority of Gli3 protein is proteolytically processed in vivo and that Gli3 processing can be induced by PKA stimulation in cultured cells (24). We have recently demonstrated that there is a very low level of Gli2 processing in vivo (22), but in cultured cells, it is not detectable (2224) (Fig. 1A). The undetectability of Gli2 processing in cultured cells could be simply due to the inefficient processing of the protein. To test this possibility, we enriched the Gli2 protein using Sepharose conjugated with Gli-binding oligonucleotides prior to immuoblotting detection (22). Indeed, a weak Gli2 processing was detected in cultured cells treated with forskolin (Fig. 1C, compare lane 4 to lane 3). Thus, the findings made in cultured cells regarding Gli2/Gli3 protein processing recapitulate those found in vivo. Besides the weak processing, full-length Gli2 protein, Gli2185, is also readily degraded by proteasome, whereas full-length Gli3 protein, Gli3190, is more stable than Gli2185 (22, 26). Biochemical analysis has demonstrated that both Gli2 degradation and Gli3 processing are dependent on ubiquitination and proteasome activity and are inhibited by Hh signaling (13, 14, 22, 26, 27). These findings have led us to hypothesize that the Gli2 polypeptide sequence has evolved to inhibit its processing to fulfill its main role as a transcriptional activator, whereas the Gli3 polypeptide sequence has been evolutionally selected for enhanced processing to accomplish its potent repressing function. Based on this hypothesis, the different level of processing between Gli2 and Gli3 must be controlled by their amino acid sequences. To test this prediction, we swapped the entire C-terminal regions between Gli2 and Gli3 to generate chimeric constructs, Gli2-3CT and Gli3-2CT (Fig. 1B, lower panel). When the chimeras were tested for proteolytic processing in a chicken limb bud primary culture, Gli3-2CT did not show any processed protein band that was detectable, whereas Gli2-3CT was processed as efficiently as Gli3 (Fig. 1B), indicating that the sequences responsible for the efficiency of Gli2 and Gli3 protein processing reside in the C termini of the proteins.
To define a specific region within the Gli3 C terminus that is required for processing, a series of C-terminally truncated Gli3 constructs were engineered and tested for processing (Fig. 2A). Both Gli3-(11260) and Gli2-(11048), which contain all six PKA sites, were efficiently processed. Furthermore, Gli3-(1946), which contains the first four PKA sites, was still processed, whereas Gli3-(1860), which retains only the first two PKA sites, was no longer processed (Fig. 2B). These results are consistent with the observations that the PKA sites are required for Gli3 processing (13, 14, 24) and indicate that the region from the beginning of the C terminus to the fourth PKA site is necessary for the extent of Gli3 processing. This region is designated 3CTN (Gli3CT N-terminal region) from now on for the sake of simplicity.
Although the Gli2-3CT chimeric protein shows very efficient processing, the finding itself does not exclude the possibility that the Gli2 N terminus and zinc finger domain are involved in its processing. To rule out this possibility, we fused the Gli3 C-terminal sequence from residue 645 to 1050 with glutathione S-transferase (GST) and examined the processing of the fusion protein. Indeed, the fusion protein could still be induced to be efficiently processed by PKA, and its processing was also enhanced by the co-expression of Two Amino Acid Changes Are Sufficient to Process Gli2 EfficientlyWe then asked if the 3CTN substitution for the equivalent region of Gli2 would be sufficient to make Gli2 efficiently processed. To this end, a chimeric construct, Gli2-3CTN, was engineered and tested for processing. For the convenience of cloning, 3CTN in the chimeric construct contains the sequence from residue 648 to 915, not to 945 as in Gli3, because this still includes the first four PKA sites (Fig. 3A). As predicted, the Gli2-3CTN was processed as efficiently as the Gli2-3CT chimera (Fig. 3B, lanes 2, 11, and 20), indicating that 3CTN and its Gli2 equivalent region determine how efficiently the proteins are processed.
To define a smaller region within 3CTN that makes Gli2 processed efficiently, we created six more constructs: Gli2-3CTN
To define the minimal region required for making Gli2 efficiently processed, more sequence from either the N- or C-terminal end of the 648748-amino acid region of Gli3 (i.e. 3CTN
To further elucidate the molecular basis of the differential processing of Gli2 and Gli3 proteins, we compared the 616626 amino acids of Gli2 with the 681692 amino acids of Gli3 and found that although both sequences contain a consensus sequence for casein kinase 2 (CK2) phosphorylation ((S/T)XX(E/D)), 620TVED623 for Gli2, and 684SKRE687 for Gli3, the 2 middle residues of each, VE and KR, are significantly different in property. We thus reasoned that the difference in the 2 residues might determine how efficiently the two proteins are processed. To test this hypothesis, we mutated 621VE622 to 621KR622 in Gli2 to generate a Gli2-VE>KR mutant construct (Fig. 3A). We found that the mutant protein did indeed exhibit a level of processing similar to those of both Gli2-3CTN N5 and Gli2-3CTN C4 mutants (Fig. 3B, right panel), thus indicating that these 2 residues are required to determine the extent of Gli2 processing. A Region of the First 196 Residues of the Gli2 C Terminus Is Sufficient to Inhibit Gli3 ProcessingThe fact that a Gli2 alteration of 621VE622 to 621KR622 makes processing efficient prompted us to test whether the difference in these 2 residues can explain the differential processing of Gli2 and Gli3 proteins. To address this question, we generated a reciprocal construct, Gli3-KR> VE. The mutant protein displayed a level of processing similar to those of wild type Gli3 and Gli2-3CT. Interestingly, the processed protein product was significantly larger than Gli383 in size, as measured by its migration (Fig. 4B, compare lane 5 with lane 3), indicating that the two amino acid changes caused the processing site to shift C-terminally. From these results, we conclude that the 685KR686 residues are required for Gli3 to be efficiently processed at the correct position.
Because mutations in the 2 residues did not block Gli3 processing, we wanted to determine the minimal Gli3 sequence that, when replaced by Gli2, would prevent the Gli3 protein from being processed in cultured cells. Based on the results in Fig. 3, we created three more Gli3-Gli2 chimeric constructs, designated Gli3-2CTN Subcellular Localization Does Not Determine the Differential Processing of Gli2 and Gli3The extent of processing of the Gli2-Gli3 chimeric molecules described above may not necessarily be due to changes in their intrinsic nature but rather to changes in their subcellular localizations. To rule out this possibility, we examined the subcellular localization of Gli2, Gli3, and three representative chimeras: Gli2-3CT, Gli3-2CT, and Gli2-3CTN, in transfected chick limb bud primary cells. As shown in Fig. 5, Gli2 was exclusively localized in the nucleus, whereas Gli3 was predominantly found in the cytoplasm. Although our observation for Gli2 localization is consistent with a recent report, the predominantly cytoplasmic localization of Gli3 is different (28). We believe that the observed Gli3 localization more closely reflected its subcellular localization in vivo, because the Gli3 construct we used was not fused with any epitope tag. The localization of the three chimera proteins did not correlate to whether or how extensively they were processed. Most Gli2-3CT-expressing cells showed cytoplasmic staining (5 nuclear versus 15 cytoplasmic), and almost all Gli3-2CT-transfected cells exhibited nuclear staining (18 nuclear versus 2 cytoplasmic), whereas Gli2-3CTN was found in the nucleus in most of the transfected cells (19 nuclear versus 1 cytoplasmic). These results suggest that the sequences that determine the localization of Gli2 and Gli3 proteins are located within their C-terminal regions and also argue strongly against the notion that subcellular localization determines how efficiently Gli2 and Gli3 proteins are processed. We thus conclude that the differential processing of Gli2 and Gli3 proteins is determined by the PDD of the Gli2 and Gli3 C termini.
The Gli3 PDD Functions Independently as a Signal for Protein ProcessingWe next wanted to know whether the Gli3 PDD could serve as a functional domain that independently mediates protein processing. To address this question, we fused an HA-tagged -tubulin to the N-terminal end and I B to the C-terminal end of either the Gli2 or Gli3 PDD to create HA-Tub-Gli2PDD-I B and HA-Tub-Gli3PDD-I B (Fig. 6A), respectively. I B was chosen because the mechanism of its degradation has been well understood. I B is first phosphorylated at Ser-32 and Ser-36 by the I B kinase (Ikk), and the phosphorylated I B is then bound and ubiquitinated by SCF TrCP ubiquitin ligase and is subsequently completely degraded by the proteasome (29, 30). Thus, if Gli3 PDD functions independently as a processing signal, we predict that, upon phosphorylation by Ikk, the HA-Tub-Gli3PDD-I B fusion protein would be processed approximately up to the first 50 residues of PDD from the C-terminal end based on the approximate processing site of the full-length Gli3 protein (24). In contrast, the HA-Tub-Gli2PDD-I B fusion protein would be completely degraded by the proteasome and/or processed at a very low efficiency.
To test the above prediction, HEK293 cells, which are known to contain all necessary components for I
Because the proteasome usually degrades its protein substrates completely, it is not clear how Gli2/Gli3 undergo a site-specific degradation by the proteasome and how Gli2/Gli3 are each processed at precise sites and to different extents. In the current study, we have identified the PDD necessary to and sufficient for the efficient processing of Gli3 but inefficient processing of Gli2 in the cell culture. The PDD contains the first 197 amino acid residues of the Gli2/Gli3 C termini and is equivalent to a simple sequence region in Ci, which serves as a part of the processing signal (34). The Ci processing signal consists of the zinc finger DNA binding domain and the subsequent simple sequence region. The zinc finger DNA binding domain is required for Ci processing, because it is thought to form a tightly folded domain. In the case of Gli2/Gli3 processing, the zinc finger DNA binding domain is clearly not required because replacing the Gli2/Gli3 N termini and zinc finger DNA binding domain with GST (Fig. 2) or -tubulin (Fig. 6) does not prevent the fusion proteins from processing. However, it is possible that the ability of GST or tubulin to block the proteasome progression is because of the potential presence of a tightly folded domain in GST and -tubulin, which would be functionally equivalent to the zinc finger DNA binding domain in terms of preventing the proteasome movement. The PDD is not at all similar to and much larger in size than the GRR found immediately upstream of the processing site of NF- B. Based on the estimated processing site of Gli3, approximately only the first 50 residues of the Gli3 PDD is mapped upstream of the processing site, whereas the rest is downstream. This is very different from the NF- B GRR, which is completely located N-terminal to the processing site. The GRR sequence is thought to act as a "stop signal" to inhibit further degradation of NF- Bby the proteasome (9, 10). A data base search for proteins that share a sequence similar to that of the Gli3 region did not yield any additional information. This is consistent with a recent finding that the protein processing signal depends on the complexity of the simple sequence rather than on amino acid identity (34). It is also probably in part because only a few proteins are known to be processed by the proteasome.
Mechanistically, as recently proposed for Ci (34), approximately the first 50 residues of Gli3 PDD may have a low affinity for proteasome binding, resulting in the release of the proteasome from the Gli2/Gli3 protein substrates, and the actual domain that inhibits the proteasome movement is located immediately upstream of the 50 residues. Alternatively, the first 50 residues may themselves form a certain secondary structure that prevents the proteasome from unfolding the protein, thus blocking further degradation of the protein, since proteasome can only degrade unfolded proteins. Nevertheless, the structure of the 50 residues may be influenced by the rest of the PDD sequence, because the replacement of the first half of Gli3 PDD with the Gli2 equivalent region, which contains 100 residues (Gli3-2CTN C3 in Fig. 4), was not sufficient to prevent the chimeric fusion from being processed. The requirement of amino acid sequence downstream of processing site can be explained if Gli3 protein is progressively processed from an internal site rather than from its C-terminal end. It has been shown that proteasome has an endoproteolytic activity that degrades its protein targets from its internal sequence (3133). In the case of Gli2, the secondary structure formed by the corresponding region may not effectively prevent the proteasome from unfolding the protein because its amino acid sequence is different from that of Gli3. As a result, the majority of Gli2185 is degraded completely by the proteasome, whereas only a small fraction is processed in embryos and cultured cells (22) (Fig. 1C). The difference between Gli2 and Gli3 PDD sequences should have been favorably selected during evolution so that the function of Gli2 and Gli3 could become more specialized to regulate more complex biological processes in vertebrates.
The results from our analysis of the processing of Gli2-Gli3 chimeric molecules are consistent with the above hypothesis. Our observation that a Gli2 to Gli3 change of only 2 amino acids, at residues 620 and 621 of the C terminus (Gli2-VE>KR), is sufficient to make Gli2 efficiently processed suggests that these 2 residues play a critical role in determining how efficiently a protein undergoes processing or degradation. These 2 residues also control the location of the processing site, because the reciprocal amino acid substitutions in Gli3 result in a slight shift of the processing site toward the C-terminal end (Fig. 4B). Nevertheless, the role that these 2 amino acids play in processing must be placed in the context of surrounding sequences because the Gli3 protein with changes in these 2 amino acids is still processed efficiently; processing becomes undetectable only when the first 197 amino acid residues of the Gli3 C terminus are replaced by the equivalent Gli2 sequence. These findings support the notion that the complexity of the PDD sequence rather than amino acid identity determines the extent and position of Gli2/Gli3 processing, as recently suggested for NF-
The GRR from NF-
* This work was supported by National Institutes of Health Grant R01 GM70820 (to B. W.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 1 To whom correspondence should be addressed: Dept. of Genetic Medicine, Weill Medical College of Cornell University, 1300 York Ave., W404, New York, NY 10021. Tel.: 212-746-5357; Fax: 212-746-8318; E-mail: baw2001{at}med.cornell.edu.
2 The abbreviations used are: GRR, glycine-rich region; PDD, processing determinant domain; HA, hemagglutinin; Hh, Hedgehog; PKA, protein kinase A; CK1, casein kinase 1; GSK3, glycogen synthase kinase 3; Ikk, I
We thank Wendy Wang for reading the manuscript and Hao Wu for a Ikk cDNA.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||