![]()
|
|
||||||||
J. Biol. Chem., Vol. 279, Issue 6, 4768-4781, February 6, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

¶
||




**
From the
Department of Plant Biology and the ||Computational Biology Service Unit, Cornell Theory Center, Cornell University, Ithaca, New York 14853
Received for publication, August 19, 2003 , and in revised form, October 30, 2003.
| ABSTRACT |
|---|
|
|
|---|
320-kDa homo-tetradecameric ClpP2 complex, without association of ClpR or ClpS proteins. It is surprising that the Clp core composition appears identical in all three plastid types, despite the remarkable differences in plastid proteome composition. This suggests that regulation of plastid proteolysis by the Clp machinery is not through differential regulation of ClpP/R/S gene expression, but rather through substrate recognition mechanisms and regulated interaction of chaperone-like molecules (ClpS1,2 and others) to the ClpP/R core. | INTRODUCTION |
|---|
|
|
|---|
To maintain a correct stoichiometry between different proteins and pathways, to remove and recycle damaged or mis-folded proteins, and to control gene expression by proteolysis of transcription or translation factors, different proteolytic systems are present in the plastid. Members of at least five protease families are present in plastids, but their structures, functions, substrates, and biological importance are poorly understood (2).
A very prominent group of proteases in plants is the Clp protease family. Our latest analysis of the Arabidopsis thaliana nuclear genome indicates the presence of at least 26 Clp-related genes, with 15 genes encoding for plastid-localized proteins (3) (Fig. 1). An additional gene, ClpP1, is located on the plastid genome and is essential in tobacco (4, 5) and Chlamydomonas reinhardtii (6). Reduced expression of the ClpP1 gene in C. reinhardtii suggested that ClpP1 is involved in degradation of photosynthetic complexes (7). Transcript analysis suggested that in plants the Clp genes are constitutively expressed, with only minor altered gene expression under specific stress conditions or during senescence (8-10). In the photosynthetic cyanobacterium Synechococcus, the ClpP1-3, ClpR, ClpX, and ClpC genes are essential for autotrophic growth (11, 12).
|
The functional role of Clp complexes in E. coli and other bacteria, such as Bacillus subtilis, involves (i) degradation of mis-folded proteins, (ii) stress responses, and (iii) gene regulation by proteolysis of transcription factors. Recognition of proteins by the Clp machinery in bacteria occurs via N- or C-terminal tags or via internal domains (see Refs. 19-21 and for discussion see Refs. 22 and 23).
In this study, we have investigated the Clp machinery in non-photosynthetic plastids of roots and petals, in chloroplasts of leaves, as well as in mitochondria. All three plastid types were found to contain a single tetradecameric Clp protease core complex (
325 kDa and pI
5.0) consisting of one to three copies of five different serine-type protease Clp proteins (ClpP1,3-6) and four non-proteolytic ClpR proteins (ClpR1-4). In addition, two chaperone-like proteins, ClpS1,2, unique to land plants, were found to interact tightly with this core complex. In contrast, mitochondria contained a single homo-tetradecameric ClpP2 complex, without association of ClpR or ClpS proteins.
Large scale crystallization projects and three-dimensional homology modeling studies have shown that the structure of a protein can be predicted accurately from the x-ray crystal structure of a homologous protein. This removes the need to crystallize each protein or protein complex to know their structure (24, 25). Given the high level of sequence identity between the ClpP,R and the E. coli ClpP sequences, we decided to build models for each of the ClpP,R proteins. Excluding the C-terminal regions, the C
r.m.s.d.1 in our ClpP,R models are expected to be on average around 1.5 Å, with 70% of the side chain positions modeled correctly. The three-dimensional modeling provided suggestions for the individual contributions of the different ClpP/R proteins, as well as molecular details of the overall oligomer. ClpS1,2 models were built by threading, based on sequence identity to the N terminus of ClpA. This showed that ClpS1,2 cannot be part of this ring structure but that they likely attach to specific pockets on top (the axial site) of the core, explaining the tight interaction observed experimentally. The coordinates of our Clp models can be downloaded from the Protein Data Bank (www.rcsb.org/pdb/), as well as via cbsu.tc.cornell.edu/vanwijk/.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Protein Separation by Colorless Native Gels (CN-PAGE) and Native IEFConcentrated protein samples were supplemented with aminocaproic acid, BisTris/HCl, pH 7.0, and glycerol to a final concentration of 5 and 50 mM and 20%, respectively. Samples were immediately loaded on native gradient gels (6-15% acrylamide), according to Ref. 28. After separation, gel lanes were cut, and the proteins were denatured, reduced, alkylated, and run on Tricine-SDS-PAGE acrylamide gradient gels, according to Refs. 3 and 29. For native IEF, chloroplast stroma (10-15 mg) was loaded on 2.2-ml (0.1-1 M) sucrose gradients, spun for 2.5 h at 259,000 x g, and collected in 0.2-ml fractions. The fractions were concentrated and loaded on IEF tube gels with pI range of 3 to 10 or 4 to 7, principally as described previously (30). Proteins were focused for 20 kV-h, subsequently solubilized, reduced, and alkylated as described earlier (3). The gels were then loaded on Tricine-SDS-PAGE gels, as described for CN-PAGE above, and proteins were separated.
Protein Identification by MS and BioinformaticsStained protein spots were excised, washed, digested with modified trypsin, and peptides extracted automatically (ProPic and Progest, Genomic Solutions, Ann Arbor, MI), and peptides were applied to the MALDI-TOF target plates as described previously (1, 26). Automated MS or MS/MS analyses were carried out as follows. A Voyager-DE-STR (PerSeptive Biosystems) generated the MALDI-TOF-MS spectra in reflectron mode, which were then annotated and internally calibrated. For nano-LC-ESI-MS/MS, a quadrupole-time of flight (micromass) was used. Separation of peptide mixtures was achieved by using a µ-Guard Pre-Column (LC Packings: inner diameter 300 µm, 1 mm long) followed by a PepMap C18 Reverse Phase Nano Column (LC Packings: inner diameter 180 µm, 15 cm long). Peptides were eluted by running a gradient using 95% H2O, 5% acetonitrile, 0.1% formic acid as solvent A and 95% acetonitrile, 5% H2O, 0.1% formic acid as solvent B, at a flow rate of 0.75 µl. The MS or MS/MS spectra were used to search the latest versions of the NCBI BLAST non-redundant data base or Arabidopsis MIPS or TIGR data bases automatically via Knexus using the search engines ProFound for MS and Sonar-MS for MS/MS data. Alternatively, MS/MS data were manually submitted to Mascot (www.matrixscience.com).
Criteria for positive identification of proteins by peptide mass finger printing were at least 5 matching unique peptides (with mass accuracy <50 ppm, but typically <10 ppm), allowing no missed cleavage or modifications, and requiring at least 15% coverage of the predicted protein sequence. Criteria for nano-LC-ESI-MS/MS were at least two matching sequence tags of high quality (clear Y-ion series and several complementary B ions; manually verified). In case of identification of closely related paralogues or orthologues, it was verified that these sequences were unique. The proteome data will become available via cbsu.tc.cornell.edu/vanwijk/and as Tables 1-4 in the Supplemental Material.
ClpP2:GFP Fusion and Transient Expression AssayThe ClpP2 cDNA was amplified from an A. thaliana cDNA library (CD4-7; Arabidopsis Biological Resource Center). The cDNA was then cloned as an NcoI/SpeI fragment into p1302, a pCAMBIA vector (www.cambia.org), containing an mGFP5 expression cassette driven by a CaMV35S promoter. The resulting plasmid was sequenced and used for particle bombardment of Nicotiana tabacum cv. petit havana leaves on agar plates. Leaves were bombarded twice with plasmid-coated 1.1-µm tungsten particles using a model PDS 1000/He Biolistic Particle Delivery System and were kept in the dark for 3 days before imaging. Confocal laser scanning microscopy was performed on an MRC600 attached to a Zeiss Axiovert10 and COMOS program. Images were colored and projected using the Confocal Assistant 4.02 program. Equipment and software were from Bio-Rad.
Homology Modeling and ThreadingReliable pairwise alignments were obtained by BLAST. Three-dimensional models were generated by comparative protein modeling through satisfaction of spatial restraints with the program MODELLER (31), minimizing the violations of distance and dihedral angle restraints from the template structure (i.e. the x-ray crystal structure of eClpP, 1TYF [PDB] ).
The main chain atoms of the models are expected to be within 0.4 Å of the equivalent atoms in the template structure in the conserved regions. Excluding the C-terminal regions, the C
r.m.s.d. in the ClpP,R models are expected to be on average around 1.5 Å, with 70% of the side chain positions modeled correctly (32). The models of residues in the C-terminal part of ClpP,R proteins as well as those in the insertion loop (L1) of ClpR1,3,4 were built based on the x-ray crystal structures of structural neighbors (Protein Data Bank codes 1NZY
[PDB]
and 1HNO
[PDB]
) of the structure of eClpP, 1TYF
[PDB]
. These parts of the models are expected to be within a 6-Å r.m.s.d. of the real structure.
By using the threading program 3DPSSM (33) and MODELLER, we built the models for ClpS1,2, respectively, based on the N-terminal domain. All models were validated using the WHAT_CHECK verification routines (34) from the program WHAT_IF (35). The coordinates of our Clp models can be downloaded from the Protein Data Bank (www.rcsb.org/pdb/), as well as via cbsu.tc.cornell.edu/vanwijk/. The PDB codes are as follows: 1R8Z (ClpP1), 1R8V (ClpP2), 1R90 (ClpP3), 1R91 (ClpP4), 1R92 (ClpP5), 1R93 (ClpP6), 1R96 (ClpR1), 1R97 (ClpR2), 1R98 (ClpR3), 1R99 (ClpR4), 1R9A (ClpS1), and 1R9B (ClpS2).
| RESULTS |
|---|
|
|
|---|
|
NomenclatureThroughout this paper we will often refer to Clp proteins and genes from E. coli. For the sake of clarity and to save space, we will refer to all E. coli Clp proteins as eClp (e.g. eClpP, eClpA, etc). Unless otherwise specified, all other proteins are from A. thaliana.
Clp Cores and Chaperones in Chloroplasts of A. thalianaIn an earlier study, we identified an
350-kDa Clp core complex copurified with the thylakoid by "blue native" gel electrophoresis (BN-PAGE), followed by SDS-PAGE and MS (3). Six Clp proteins (ClpP1-P6), three ClpR proteins (ClpR1-3), and a completely new protein, assigned ClpS1, were identified in this core complex. The heterogeneity of the Clp complex and the identification of a new Clp protein, ClpS1, was extremely surprising. At that time, we did not identify ClpR4 and ClpS2, both predicted to be targeted to the chloroplast. In the current study, by using native isoelectric focusing and a very "mild" native mass gel system, we identified the stromal Clp core complexes in chloroplasts, as well as non-photosynthetic plastids in roots and petals, and we reanalyzed the composition of the thylakoid-associated ClpPRS core.
Fig. 2, A and B, shows the two-dimensional separation of stromal and thylakoid-associated Clp cores with "colorless native" gel electrophoresis (CN-PAGE) as the first dimension. We changed from BN-PAGE to CN-PAGE, because this resulted in greater stability of several chloroplast protein complexes.2 All stained protein spots were analyzed by a combination of matrix-assisted laser desorption ionization time-of-flight MS (MALDI-TOF MS) and nano-LC-ESI-MS/MS. The purification of Clp complexes, their gel-based separation, and MS analysis were repeated several times. The relevant MS/MS data are listed in Table 1, A and B, of the Supplemental Material, whereas complementary MALDI-TOF MS data are not shown. The criteria for positive identification are described under "Experimental Procedures."
|
We have indicated the Clp complex in Fig. 2, A and B, and also numbered the most abundant proteins around the Clp complex. About 95% of the Clp core was soluble in the stroma, and 5% was associated with the membrane. In both complexes, four ClpR proteins (ClpR1-4), two ClpS proteins (ClpS1,2), as well as five of six ClpP proteins (ClpP1 and ClpP3-P6) were identified by MS. Thus we now also identified ClpS2 and ClpR4 in the core complex. ClpP2 was also identified in the thylakoid-associated complex but not in the stromal complex. Further below, we demonstrate that this was because of a minor mitochondrial contamination. Protein spots in the area around the Clp complex were all identified by MS and were unrelated to the Clp machinery (Fig. 2, A and B).
The ClpC chaperone was identified in the stroma as a dimeric complex of
200 kDa (Fig. 2C). A very large number of sequence tags generated by MS/MS were identified, matching to both the C1 and C2 protein (Table 1C, Supplemental Material). A number of tags were unique for ClpC1, and none were unique to ClpC2. Thus the dimer certainly contained ClpC1 and probably not (or very little) ClpC2. We searched for ClpD and ClpB3, and a small gel spot with ClpB3 was found in the interface of the stacking gel of the CN-PAGE dimension (above 1000 kDa) (not shown).
Separation of Stromal Complexes by Sucrose Gradient Fractionation and Native Isoelectric Focusing (NIEF)The chloroplast Clp complex always migrated on both the BN-PAGE and CN-PAGE gels as a single heteromeric complex of
325 kDa. However, it is still possible that the complex observed in the gels is in fact a mixture of different complexes with approximately the same mass. To elucidate this important point, stromal complexes were separated based on their native pI values, rather than native mass. If indeed Clp core complexes of different composition exist, they are likely to migrate at different pI values, because the predicted pI of the different processed ClpP,R and S subunits vary widely between 4.7 and 8.6 (Table II).
|
250 and 450 kDa) were collected and separated on native IEF tube gels with a pI from 3 to 10, followed by SDS-PAGE. Extensive MALDI-TOF MS and nano-LC-ESI-MS/MS analysis of all visible protein spots indicated that all Clp complexes migrated between pI 4 and 6 in a narrow pI range (data not shown). The sucrose gradient fractions were then analyzed on a narrower pI range gel (from 3.5 to 7) to improve resolution. Fig. 3A shows the chloroplast Clp complex migrating at pI of
5.0. Extensive MS analysis showed that it contained the same set of 11 Clp proteins, similarly as the
325-kDa complex separated based on native mass (Fig. 2A). This suggests that there is a single Clp core complex with the same subunit composition. It is, however, possible that there is a mixture of Clp core complexes with the same composition but with different spatial arrangements (positions of the Clp subunits in the rings) or, less likely, that minor differences in composition do not affect the native pI value. But it is unlikely that there are Clp complexes with a strongly reduced complexity, because they would likely have been separated, based on either native mass or native pI value.
|
26 kDa, whereas the apparent molecular mass of the ClpS proteins is
22 kDa. Together, this suggest that the ClpR2:ClpS1:ClpS2 molar ratio is
1.5 x 31/42 x 22/26 =
0.93:1:1. We thus conclude that ClpR2, ClpS1, and CpS2 are present in approximately equal molar ratios.
Because the other spots contained mixtures of the other Clp proteins, we sought further separation. Purified Clp cores on native gels were separated first by SDS-PAGE with 9-15% acrylamide, and then the gel lane was incubated in SDS and urea and separated by SDS-PAGE with 13-20% acrylamide (Fig. 3C). Protein spots were stained with Coomassie, but because staining was relatively weak, they were destained and then restained with MS-compatible silver stain (38). Both images were quantified, and spots were subsequently identified by nano-LC-ESI-MS/MS. Now ClpR2, ClpR4, and ClpP4 were separated as individual spots, whereas ClpP1, -P5, and -P6 were still partially overlapping, as were ClpP3, -R1, and -R3. The staining volume of ClpR2 was about 30% of ClpR4 and -P4. The staining volume of ClpR4 and ClpP4 and their molecular masses are approximately identical. However, ClpR4 has 21 lysine residues, whereas ClpP4 has only 12. This suggests that the molar ratio between ClpP4 and ClpR4 is about 2:1. ClpR2 has 11 Lys residues, nearly the same as ClpP4, and ClpR2 is only 1-2 kDa smaller than ClpP4. Given the 30% staining volume of ClpR2 as compared with ClpP4, this suggests that the molar ratio between ClpP4 and ClpR2 is about 3:1. In our earlier paper (3), we suggested that ClpP5 is probably more abundant than ClpP6 and ClpP1, which was based on multiple strongly dominating peptide ions derived from ClpP5 in both MALDI-TOF MS and ESI-MS/MS. Indeed as shown in Fig. 3C, ClpP5 was found in three adjacent spots, whereas P1 and P6 were found together in one spot. Most likely, ClpP1 and -P6 run closely together in this spot, with the presence of ClpP5 being a minor contamination from the adjacent pure ClpP5 spots. Considering that ClpP5 has only 8 Lys and that the total staining volume of the two spots with pure ClpP5 is about 50% of those of R4 and P4, we suggest that the molar ratio between ClpP5 and ClpP4 is at least
0.5 x 12/8 x 27/23 =
0.88:1. Taken these data together, we conclude that the approximate molar stoichiometries between P4:P5:R4:R2:S1:S2 is about 3:3:2:1:1:1:1.
Considering all gels, the low lysine content of ClpP1 and ClpP6, and the comparable number of sequence tags obtained by MS/MS from both P1 and P6, we speculate that ClpP1 and -P6 are present at approximately equal molar ratios to each other and ClpS1 and ClpS2. ClpP3,R1,R3 clustered together in most gels and the staining volume both in Coomassie and silver stain seems quite similar or somewhat less than of ClpP4 + -R4. We tentatively suggest that the molar ratio between the sum of P3 + R1 + R3 is comparable with the sum of ClpP4 + -R4. This roughly leads to a approximate stoichiometry of P4:P5:R4:R2:(P3 + R1 + R3):P1:P6:S1:S2 = 3:3:2:1:(3):1:1:1:1, totaling 16 proteins per core (including S1 and S2).
Clp Cores in Plastids of Roots and Petals in the Genus BrassicaPlastids have a large number of biochemical functions and are present in both photosynthetic tissues (leaves and stems) and non-photosynthetic tissues, such as roots and flowers. The proteome composition between non-photosynthetic and photosynthetic plastids is very different3 and might require different proteolytic systems. One could expect that the expressed Clp machinery is different in these non-photosynthetic plastid types, as compared with the chloroplast Clp complex as an adjustment to the substrate requirements. We therefore determined the Clp composition in non-photosynthetic plastids from roots and (white) flower petals from B. rapa and B. oleracea, respectively (Fig. 4, A and B). Due to the much larger size of their roots and flowers, respectively, they are a much better source for purification of non-photosynthetic plastids than A. thaliana. These Brassica species and A. thaliana belong to the Brassicaceae family, and their proteomes share a very high degree of protein sequence identity. Alignment of the genomic sequences of several Clp genes in B. oleracea (www.tigr.org/tdb/e2k1/bog1/) and A. thaliana indicates an average sequence identity of more than 90% between processed Clp orthologues (not shown). We are therefore confident that the correct ("true") orthologue was identified when searching the A. thaliana data base with MS/MS data from B. rapa and B. oleracea proteins.
|
250 most abundant proteins were identified by MALDI-TOF MS and nano-LC-ESI-MS/MS (not shown). The protein patterns of these two-dimensional gels showed that the Clp protease is the most abundant stromal protease in non-photosynthetic plastids, similarly as in the chloroplast (Figs. 2 and 4).3
Fig. 4, A and B, shows the region of complexes between 200-500 kDa, and at this mass resolution, one Clp core was found for both organs. ClpP1,3-6, ClpR1-4, and ClpS1,2 were all identified in these complexes by MS/MS. The MS/MS data are summarized in the Table 2, A and B, of the Supplemental Material. ClpP2 was found in root plastids but not in flower plastids, most likely due to a minor contamination of mitochondria (see below). Similarly as for the chloroplast stroma, ClpC was identified as an
200-kDa complex in both plastid types (spot 8 in lower panels of Fig. 4, A and B), and similarly as in chloroplast, the ClpC1 (rather than ClpC2) was the main species. ClpB3 was identified as a soluble
200-kDa dimer in petal plastids (spot 7 in Fig. 4A, lower panel). We do want to point out that glucan phosphorylase (At3g29330) was present as a
200-kDa complex on the stromal maps of all three plastid types (Figs. 2C and 4, A and B, lower panels), and forms an excellent reference for this region of the gels.
We then separated stromal proteomes of both non-photosynthetic plastids by NIEF (Fig. 4C). In both plastids, the Clp core complex migrated as a single complex in a very focused manner, strongly supporting the results from CN-PAGE.
The Clp Core in Mitochondria of Potato TubersTo investigate the Clp machinery in plant mitochondria, potato tubers were used as a rich and established source of pure mitochondria. The soluble matrix proteome was separated by CN-PAGE, and all stained protein spots were analyzed by MS (not shown). ClpP2 was identified by nano-ESI-MS/MS only in one location on the two-dimensional gel (Fig. 5), suggesting that ClpP2 forms an
320-kDa complex (Supplemental Material Table 3). The sequence alignment between ClpP2 of A. thaliana and Solanum tuberosum (constructed from overlapping ESTs; e.g. gi 21917555) indicated an 86% identity over 196 amino acid residues, with much less sequence identity to the other A. thaliana Clp proteins (not shown). Protein spots in the area above and below ClpP2 were all identified by MS/MS and were 23/26-kDa heat shock proteins and ATP synthase subunits, all unrelated to the Clp machinery (Supplemental Material Table 3). No evidence was found for accumulation of any other ClpP,R,S proteins, other than ClpP2, as expected from localization prediction (Table I) and Western blotting for ClpP2 (39). Thus, the plant mitochondrial Clp core is homomeric in composition, similar as in other eukaryotes (40). The presence of ClpX chaperones subunits (ClpX1-3) is under investigation.
|
Compatibility between the ClpP/R Proteins and Unique ContributionsTo investigate the structural compatibility of the different ClpP/R proteins, unique structural contributions of the different ClpP/R/S subunits, and to determine the location of ClpS in the Clp complex, we used the structural information of the homotetradecameric ClpP core complex at 2.3-Å resolution (PDB code 1TYF [PDB] ) (15, 42) and the ClpA N-domain (PDB code 1K6K [PDB] ) at 1.8-Å resolution (43) in E. coli by x-ray crystallography.
The six ClpPs and four ClpRs proteins from A. thaliana are homologous to eClpP for most of the sequence lengths, with the exception of 9 or 10 residue insertions in ClpR1,3,4 and extended C termini for several ClpP/R proteins (Table II and Fig. 6). The x-ray crystal structure of 1TYF
[PDB]
was used as a template to build three-dimensional models for all ClpP/R proteins. The 9/10 residue insertions (assigned L1 insertions) occur at equivalent sequence positions with reference to the template 1TYF
[PDB]
(positions 70-80, in Fig. 6). To model these insertions and the extended C termini, we identified structural neighbors of 1TYF
[PDB]
using the combinatorial extension method (44). Among these structures are the monomeric units of both, 4-chlorobenzoyl-coenzyme A dehalogenase (PDB code 1NZY
[PDB]
) and D3,D2-enoyl-CoA isomerase ECI1 (PDB code 1HNO
[PDB]
). These proteins show very little sequence identity with eClpP, despite their very high structural similarities with 1TYF
[PDB]
(see Fig. 6). Modeling the non-catalytic monomers ClpR1,3,4 (Fig. 7A for ClpR1) indicates that the insertion loop L1 replaces some of the residues forming the oxyanion hole in the catalytic units and lies over the catalytic clefts between the head domains and the handles, i.e. they fill the volume between strands 4 and 9 (strand numbering follows the convention of Ref. 15) that constitutes the substrate-binding pocket in the catalytic units. Sequence alignments of the monomers of 1TYF
[PDB]
, 1NZY
[PDB]
, 1HNO
[PDB]
based on the structural alignment obtained from the combinatorial extension method and the BLAST alignments of the ClpP/R sequences with the eClpP sequence are shown in Fig. 6. Information about the secondary structure elements (
-helices and
-strands) is also indicated.
|
|
-helical fragments. Modeling suggests that the extended C termini present in ClpP3-5 and ClpR1-4 can form
-helical structures that can fold over the top of the barrel-like core structure. An example for a three-dimensional model is shown for ClpR1 superimposed on eClpP (Fig. 7A).
After constructing reliable three-dimensional models for all ClpP and ClpR monomers, we created models for different hetero-tetradecameric plastid Clp cores, as well as for the homo-tetradecameric ClpP2 core from mitochondria (Fig. 7, B and C). All possible contacts between different types of monomers (each unit is in contact with 4 others) were generated and indicated that the different ClpP and CpR proteins could be part of the same tetradecameric ring structure, without obvious incompatibilities. The root mean square deviations of the C
atomic positions (C
r.m.s.d.) obtained after optimal superposition of the structures of the template 1TYF
[PDB]
and the ClpP/R models are low (see Table II), showing that the models are highly reliable. All models were carefully analyzed by visual inspection and validated using the program WHAT_CHECK (34).
Putative Lateral Openings in the Clp CoreSequence conservation of two Arg residues present in all ClpP/R proteins motivated us to look into their three-dimensional environment in 1TYF [PDB] and the modeled proteolytic ClpPR cores. Careful analysis of this part of the structures suggests putative lateral openings that connect the interior of the central cavity with the outside (Fig. 8, A and B). Two types of putative exits/entrances can be observed.
8 Å in diameter and are remarkably polar. The type of residues lining the opening varies, depending on the ClpP/R monomers that contribute to the site. However, two conserved Arg residues are present in all ClpP/R and contributed to these openings by opposite monomers (A and N in Fig. 8A). One of these Arg residue precedes the catalytic Asp residue (Asp-171 using the 1TYF
[PDB]
[PDB]
residue numbering) in all ClpP proteins. We also observed that the insertion loops L1 in ClpR1,3,4 can contribute with additional polar residues to flank the internal lips of the polar opening when the monomers are located in any of the two positions shown in blue in the diagram shown in Fig. 8C. All combinations of monomers maintain the polar character of this putative opening with the only exception of ClpR3. A two-residue insertion (Leu-123 and Met-124) in ClpR3 may block or partially close this opening when ClpR3 occupies the two positions shown in red in the diagram (Fig. 8C). We suggest that these polar openings may provide fast exit routes for the proteolytic products (see "Discussion").
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Surprisingly, in all three plastid types, we always detected a Clp complex of
325 kDa containing 11 different Clp proteins, namely ClpP1, ClpP3-6, ClpR1-4, and ClpS1-2. ClpR4 and ClpS2 had not yet been identified in any Clp complex before. This complex migrated at a pI of
5.0 in native IEF tube gels. Considering all gels and image analysis, amino acid content, and the number of peptide sequence tags from MS/MS, we suggest a tentative stoichiometry of P4:P5:R4:R2:(P3 + R1 + R3):P1:P6:S1:S2 = 3:3:2:1:(3):1:1:1:1, totaling 16 proteins per core (including S1 and S2). The take-home message is that the analysis clearly excludes that plastids contain significant levels of homomeric cores or cores with only a small number of ClpP/R forms, because they would separate based on either their native mass or pI.
In contrast, the mitochondrial Clp complex is an
320-kDa homotetradecamer of ClpP2, without other ClpP/R orthologues and without association of ClpS1,2. Mitochondrial localization was independently confirmed by transient expression essays of a ClpP2-GFP fusion in leaves, in agreement with Western blot data (39).
Thus all plastid-localized ClpP/R/S gene family members are expressed in roots, leaves, and petal plastids and are assembled into heteromeric structures. The presence of all plastid-localized ClpP/R proteins in one well focused complex, as shown by two different separation principles (by native mass and native IEF), is surprising and suggests that each paralogue is required for structure and/or function. This suggestion is supported by preliminary data from T-DNA insertion lines of A. thaliana indicating that several of the nuclear-encoded ClpP and ClpR proteins are essential.4 The chloroplast-encoded ClpP1 is also known to be essential in tobacco (4, 5) and C. reinhardtii (6). This also implies that regulation of plastid and mitochondrial proteolysis in not through regulation of Clp gene expression but rather through substrate recognition mechanisms and interaction of chaperone-like molecules (ClpS1,2 and others) to the Clp core. Western blot analysis of protein extracts of different plant tissues indicated constitutive expression of the ClpP proteins (8, 10), without significant expression changes under different stress conditions. This is in agreement with our results of a single ClpP/R complex and suggests that the heteromeric ClpP/R cores observed in our study do not change their composition significantly when plants are exposed to stress.
Thus to understand regulation of Clp proteolysis, it will be critical to obtain detailed insight in the ClpP/R core structure, and its association with ClpS1,2 and other chaperones. Therefore, homology models were created for all ClpP and ClpR monomers and organized in different configurations in tetradecameric structures. This showed that the different ClpP and CpR proteins could be part of the same heptameric ring structure and that there were no strong incompatibilities between heptameric rings of different composition forming tetradecameric complexes.
The models were further used to determine how the different ClpP/R proteins contribute to the structures and potentially affect proteolysis, as well as how they affect interactions with the chaperones and ClpS1,2. From a structural perspective, the obvious difference between the ClpP proteins is the length of their C termini. In particular, the C termini of ClpP3 and ClpP4 are 52 and 41 amino acid residues longer than ClpP1 and eClpP. Modeling based on the structural neighbors of 1TYF
[PDB]
suggests that these C termini may form
-helical structures that can fold over the axial side of the core into a specific P1 pocket (or new tripeptide motif), thereby potentially interfering with docking of the hexameric chaperones. This is a novel feature for the plastid ClpPR core. There are no other differences between the ClpP isoforms that can be directly related to function or structure, but it is important to point out that the overall sequence identities between the ClpP proteins are not more than 60%.
The presence of the four non-catalytic ClpR proteins is very intriguing. What is the difference between the four gene products and why are these non-catalytic subunits part of the complex? Clearly, the ClpR proteins have lost the catalytic protease site, and they do therefore not directly contribute to proteolysis themselves. However, three of the four ClpR proteins (ClpR1,3,4) have a 9 or 10 amino acid insert (the L1 inserts), possibly affecting the presentation of the substrate to the neighboring catalytic triads in the ClpP proteins. In addition, the L1 inserts do contribute residues to the putative lateral polar openings that might stimulate (ClpR1,4) or inhibit (ClpR3) exit of peptide fragments from the core (see more below). The four ClpR proteins differ also in the length of their C termini, similarly as the ClpP subfamily and can potentially influence chaperone interaction to the ClpP/R core. Thus the ClpR proteins likely have a regulatory role. It is important to note that the actual mechanism of proteolysis in not known for any type Clp core complex in any organism, and it is therefore difficult to predict the impact of the presence of the ClpR proteins on efficiency of proteolysis, as well as the size of the proteolytic end products. It is conceivable that some of the differences in the primary sequence of the ClpP/R proteins are needed for the assembly process of the core, especially because there are no strong structural constraints in varying the ClpP/R composition.
The two axial pores of the Clp proteolytic core have been suggested as entrances to the central chamber, as well as exits for proteolytic products (15). This axial exit route requires that either the chaperones are removed from the core prior to product release or that one side of the core remains unoccupied. Conservation of two Arg residues in all ClpP/R and eClpP and re-analysis of the 1TYF [PDB] and the three-dimensional models of plastid and mitochondrial cores suggested conserved putative highly polar lateral openings formed by residues belonging to four subunits located at the interface between the two rings. These polar openings seem to be better candidates as a route to clear the products of catalysis, due to their proximity to the catalytic sites, thus avoiding two-way traffic through the axial pores. All combinations of ClpP/R isomers (with the exception of ClpR3) maintain the polar characteristics of this opening. A second type of putative openings is less attractive and may be less likely to be real openings, given the hydrophobicity of the residues lining the putative openings.
We showed that Clp cores in all plastid types (but not in mitochondria) contain ClpS1,2; the stoichiometry analysis suggested (on average) one copy of each ClpS per core. This indicates that the interaction between ClpS and the core is quite strong, because it "survives" both native PAGE and native IEF. The three-dimensional models strongly indicate that ClpS1,2 do not fit in the tetradecameric ring structure but that they do fit well on the axial sites of the ClpPR cores. This interaction involves the hydrophobic (aromatic) P1 pocket in the core and the S1,2 loops LL1 and LL2 and a number of additional conserved residues in ClpS1,2. This predicted interaction site would indeed give the strong interaction as observed experimentally in this study.
ClpS1,2 are found from early land plants (the moss Physcomitrella) to higher plants. However, it seems to be absent in prokaryotes, including photosynthetic bacteria. ClpS also seems absent from the unicellular algae C. reinhardtii5 for which the genome was recently sequenced (genome.jgi-psf.org/chlre1/chlre1.home.html). Thus, ClpS is not a peculiarity of the Brassicaceae and suggests that ClpS1,2 has a functional role in plastid proteolysis in plants.
The binding of ClpS1,2 at this axial site will likely compete directly with the binding of a ClpC hexamer. What could be the benefit of this mutual exclusion? If ClpC hexamers cannot bind to the core, proteolysis of folded or aggregated proteins will be prevented. Binding of ClpS might also function to keep the axial openings free for release of proteolytic products. Experiments are in progress to test these hypotheses.
The ClpB,C ChaperonesThe chaperone ClpC1 was identified as a dimer in the three plastid types, suggesting that this conformation is the most stable unit. Also the ClpC homologue, eClpA, crystallizes as dimers. Formation of hexamers and interaction with the Clp core requires ATP in E. coli (for ClpA) (16) and chloroplasts (for ClpC) (46). We found that the concentration of ClpC1 was in the same order of magnitude as the concentration of the ClpP/R core. Interestingly, we did clearly identify ClpC1 but not ClpC2. ClpC1 was also found to associate transiently with the inner envelope when involved in protein import (47, 48), although the majority is located in the soluble stroma, as determined by Western blots (8). We are not aware of any study in which the ClpC2 protein was identified specifically. The presence of this second ClpC gene only became clear recently, after sequencing of the A. thaliana genome. We did not find expression of the chaperone ClpD (also named ERD1), which is not surprising because it is reported to be expressed during drought stress and/or during senescence, although the literature is not consistent (compare Refs. 9 and 49). ClpB3 was identified as a 200-kDa dimer in petal plastids, and it overlapped with a 200-kDa dimer of glucan phosphorylase, visible in all three plastid types. This might have masked ClpB3 in chloroplasts and root plastids. However, based on the data from the E. coli homologue and the absence of the "IGF" motif, ClpB3 is unlikely to interact with the Clp core and is not playing a role in Clp-mediated proteolysis.
Conclusions for Clp Proteolysis in Higher Plant Plastids and MitochondriaThe Clp machinery in plastids of higher plants has acquired a surprising complexity. In contrast, the much simpler Clp machinery in plant mitochondria still resembles an E. coli eClpP core. Together, our data show that Clp core complexes present in different plastid types have a similar size and overall composition and that the Clp complex is the most abundant protease complex in the stroma. The essentiality of ClpP1 in tobacco and C. reinhardtii (4, 6), and other ClpP and ClpR genes in A. thaliana,4 together with the Clp abundance and complexity, suggests a very central role for Clp in plastid homeostasis, equivalent to the central role of the proteasome in the cytosol.
Our findings also show that regulation of plastid proteolysis by the Clp machinery is not through differential regulation of ClpP/R/S gene expression. Instead proteolysis could be regulated by substrate recognition mechanisms and interaction of chaperone-like molecules (ClpS1,2 and others) to the ClpP/R core. This is why the detailed insight obtained by three-dimensional homology modeling of the Clp machinery is so important, as it points to several regulatory mechanisms.
Given the essential nature of most (if not all) ClpP/R genes, the assembly of all ClpP/R subunits in one complex and the expression of the ClpP/R machinery in both photosynthetic and non-photosynthetic plastids, it will require a very significant effort to identify Clp substrates, mechanisms of substrate recognition and proteolysis, and the functional role of contribution of all Clp family members. Clearly, the combined experimental and theoretical analysis and the resulting working models for the Clp machinery presented in this paper will form a strong basis for future experimental studies.
| FOOTNOTES |
|---|