JBC

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M309212200 on October 30, 2003

J. Biol. Chem., Vol. 279, Issue 6, 4768-4781, February 6, 2004
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow HTML Page - index.htslp
Right arrow All Versions of this Article:
279/6/4768    most recent
M309212200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Peltier, J.-B.
Right arrow Articles by van Wijk, K. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Peltier, J.-B.
Right arrow Articles by van Wijk, K. J.

Clp Protease Complexes from Photosynthetic and Non-photosynthetic Plastids and Mitochondria of Plants, Their Predicted Three-dimensional Structures, and Functional Implications*

Jean-Benoît Peltier{ddagger}§, Daniel R. Ripoll§||, Giulia Friso{ddagger}, Andrea Rudella{ddagger}, Yang Cai{ddagger}, Jimmy Ytterberg{ddagger}, Lisa Giacomelli{ddagger}, Jaroslaw Pillardy||, and Klaas J. van Wijk{ddagger}**

From the {ddagger}Department of Plant Biology and the ||Computational Biology Service Unit, Cornell Theory Center, Cornell University, Ithaca, New York 14853

Received for publication, August 19, 2003 , and in revised form, October 30, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Tetradecameric Clp protease core complexes in non-photosynthetic plastids of roots, flower petals, and in chloroplasts of leaves of Arabidopsis thaliana were purified based on native mass and isoelectric point and identified by mass spectrometry. The stoichiometry between the subunits was determined. The protease complex consisted of one to three copies of five different serine-type protease Clp proteins (ClpP1,3-6) and four non-proteolytic ClpR proteins (ClpR1-4). Three-dimensional homology modeling showed that the ClpP/R proteins fit well together in a tetradecameric complex and also indicated unique contributions for each protein. Lateral exit gates for proteolysis products are proposed. In addition, ClpS1,2, unique to land plants, tightly interacted with this core complex, with one copy of each per complex. The three-dimensional modeling show that they do fit well on the axial sites of the ClpPR cores. In contrast to plastids, plant mitochondria contained a single ~320-kDa homo-tetradecameric ClpP2 complex, without association of ClpR or ClpS proteins. It is surprising that the Clp core composition appears identical in all three plastid types, despite the remarkable differences in plastid proteome composition. This suggests that regulation of plastid proteolysis by the Clp machinery is not through differential regulation of ClpP/R/S gene expression, but rather through substrate recognition mechanisms and regulated interaction of chaperone-like molecules (ClpS1,2 and others) to the ClpP/R core.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Plastids are essential organelles of prokaryotic origin that are present in every plant cell and differentiate from proplastids into non-photosynthetic plastids in roots and flowers and photosynthetic plastids in leafs and stems. Plastids are responsible for synthesis of key molecules required for the architecture and functions of plant cells.

To maintain a correct stoichiometry between different proteins and pathways, to remove and recycle damaged or mis-folded proteins, and to control gene expression by proteolysis of transcription or translation factors, different proteolytic systems are present in the plastid. Members of at least five protease families are present in plastids, but their structures, functions, substrates, and biological importance are poorly understood (2).

A very prominent group of proteases in plants is the Clp protease family. Our latest analysis of the Arabidopsis thaliana nuclear genome indicates the presence of at least 26 Clp-related genes, with 15 genes encoding for plastid-localized proteins (3) (Fig. 1). An additional gene, ClpP1, is located on the plastid genome and is essential in tobacco (4, 5) and Chlamydomonas reinhardtii (6). Reduced expression of the ClpP1 gene in C. reinhardtii suggested that ClpP1 is involved in degradation of photosynthetic complexes (7). Transcript analysis suggested that in plants the Clp genes are constitutively expressed, with only minor altered gene expression under specific stress conditions or during senescence (8-10). In the photosynthetic cyanobacterium Synechococcus, the ClpP1-3, ClpR, ClpX, and ClpC genes are essential for autotrophic growth (11, 12).



View larger version (40K):
[in this window]
[in a new window]
 
FIG. 1.
Summarizing overview of the Clp proteins in A. thaliana. A, Clp gene locations on the five A. thaliana chromosomes. ClpP1 is located on the plastid genome. Proteins predicted to the chloroplast are in green and those in mitochondria in blue. The localization predictions of ClpP2, ClpX1, and -X3 are ambiguous. B, sequence alignment around the (I/L)GF motif for each of the groups of plastid and mitochondrial chaperones. This motif is implied in interaction between chaperones and core in case of ClpX and ClpA homologues.

 
Clp proteins (for caseinolytic protease) were first identified in Escherichia coli (13). In E. coli, ClpP complexes were shown to be ATP-dependent serine-type proteases, consisting of a catalytic core, flanked at one or both ends by a hexamer of regulatory ClpA/X chaperone subunits, and belonging to the family of ATP-dependent HSP100 proteins (14). This catalytic core is composed of two homo-heptameric rings of ClpP subunits, all encoded by a single gene. The diameter of the cavity is 50 Å, and the diameter of entrances at either end of the complex was reported as 10 Å (15). This suggested that only peptides and proteins in an unfolded state can enter the cavity, which is in agreement with biochemical data showing that the ATPase and ATP hydrolysis are required for the degradation of folded proteins (16). However, E. coli ClpXP can accommodate 2 or 3 polypeptide chains at the same time, leading to an estimation of the opening of 20-25 Å wide (17). Clp complexes in bacteria can also be formed by ClpQ,Y (HslV and HsIU) proteins, which are unrelated to ClpP (18). No ClpQ,Y homologues has been found in any plant species.

The functional role of Clp complexes in E. coli and other bacteria, such as Bacillus subtilis, involves (i) degradation of mis-folded proteins, (ii) stress responses, and (iii) gene regulation by proteolysis of transcription factors. Recognition of proteins by the Clp machinery in bacteria occurs via N- or C-terminal tags or via internal domains (see Refs. 19-21 and for discussion see Refs. 22 and 23).

In this study, we have investigated the Clp machinery in non-photosynthetic plastids of roots and petals, in chloroplasts of leaves, as well as in mitochondria. All three plastid types were found to contain a single tetradecameric Clp protease core complex (~325 kDa and pI ~5.0) consisting of one to three copies of five different serine-type protease Clp proteins (ClpP1,3-6) and four non-proteolytic ClpR proteins (ClpR1-4). In addition, two chaperone-like proteins, ClpS1,2, unique to land plants, were found to interact tightly with this core complex. In contrast, mitochondria contained a single homo-tetradecameric ClpP2 complex, without association of ClpR or ClpS proteins.

Large scale crystallization projects and three-dimensional homology modeling studies have shown that the structure of a protein can be predicted accurately from the x-ray crystal structure of a homologous protein. This removes the need to crystallize each protein or protein complex to know their structure (24, 25). Given the high level of sequence identity between the ClpP,R and the E. coli ClpP sequences, we decided to build models for each of the ClpP,R proteins. Excluding the C-terminal regions, the C{alpha} r.m.s.d.1 in our ClpP,R models are expected to be on average around 1.5 Å, with 70% of the side chain positions modeled correctly. The three-dimensional modeling provided suggestions for the individual contributions of the different ClpP/R proteins, as well as molecular details of the overall oligomer. ClpS1,2 models were built by threading, based on sequence identity to the N terminus of ClpA. This showed that ClpS1,2 cannot be part of this ring structure but that they likely attach to specific pockets on top (the axial site) of the core, explaining the tight interaction observed experimentally. The coordinates of our Clp models can be downloaded from the Protein Data Bank (www.rcsb.org/pdb/), as well as via cbsu.tc.cornell.edu/vanwijk/.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Plant Material and Protein Preparations—A. thaliana (Col 0) was grown under 10-h light/14-h dark cycles at 25/17 °C, and leaves were collected from mature rosettes. Flowers were collected from Brassica oleracea grown with supplemental lightning. Brassica rapa and potato tubers were purchased at local stores. Thylakoids and peripheral complexes from A. thaliana were purified according to Ref. 26, with minor modifications to improve thylakoid purity. Intact plastids were isolated from the various tissues by brief homogenization in grinding medium (50 mM Hepes-KOH, pH 8.0, 330 mM sorbitol) and filtered through nylon mesh. The crude plastids were then collected by a 2-min spin at 1100 x g and further purified on 40-85% Percoll cushions (Percoll in 0.6% Ficoll, 1.8% polyethylene glycol, and 0.6% bovine serum albumin) by a 10-min spin at 3750 x g and one additional wash in the grinding medium. Plastids were subsequently lysed in 10 mM Hepes, pH 8.0, 5 mM MgCl2 with a mixture of protease inhibitors under mild mechanical disruption. The stromal content was collected, concentrated, and immediately used for two-dimensional analysis. Intact potato mitochondria were purified according to Ref. 27 with minor modifications.

Protein Separation by Colorless Native Gels (CN-PAGE) and Native IEF—Concentrated protein samples were supplemented with aminocaproic acid, BisTris/HCl, pH 7.0, and glycerol to a final concentration of 5 and 50 mM and 20%, respectively. Samples were immediately loaded on native gradient gels (6-15% acrylamide), according to Ref. 28. After separation, gel lanes were cut, and the proteins were denatured, reduced, alkylated, and run on Tricine-SDS-PAGE acrylamide gradient gels, according to Refs. 3 and 29. For native IEF, chloroplast stroma (10-15 mg) was loaded on 2.2-ml (0.1-1 M) sucrose gradients, spun for 2.5 h at 259,000 x g, and collected in 0.2-ml fractions. The fractions were concentrated and loaded on IEF tube gels with pI range of 3 to 10 or 4 to 7, principally as described previously (30). Proteins were focused for 20 kV-h, subsequently solubilized, reduced, and alkylated as described earlier (3). The gels were then loaded on Tricine-SDS-PAGE gels, as described for CN-PAGE above, and proteins were separated.

Protein Identification by MS and Bioinformatics—Stained protein spots were excised, washed, digested with modified trypsin, and peptides extracted automatically (ProPic and Progest, Genomic Solutions, Ann Arbor, MI), and peptides were applied to the MALDI-TOF target plates as described previously (1, 26). Automated MS or MS/MS analyses were carried out as follows. A Voyager-DE-STR (PerSeptive Biosystems) generated the MALDI-TOF-MS spectra in reflectron mode, which were then annotated and internally calibrated. For nano-LC-ESI-MS/MS, a quadrupole-time of flight (micromass) was used. Separation of peptide mixtures was achieved by using a µ-Guard Pre-Column (LC Packings: inner diameter 300 µm, 1 mm long) followed by a PepMap C18 Reverse Phase Nano Column (LC Packings: inner diameter 180 µm, 15 cm long). Peptides were eluted by running a gradient using 95% H2O, 5% acetonitrile, 0.1% formic acid as solvent A and 95% acetonitrile, 5% H2O, 0.1% formic acid as solvent B, at a flow rate of 0.75 µl. The MS or MS/MS spectra were used to search the latest versions of the NCBI BLAST non-redundant data base or Arabidopsis MIPS or TIGR data bases automatically via Knexus using the search engines ProFound for MS and Sonar-MS for MS/MS data. Alternatively, MS/MS data were manually submitted to Mascot (www.matrixscience.com).

Criteria for positive identification of proteins by peptide mass finger printing were at least 5 matching unique peptides (with mass accuracy <50 ppm, but typically <10 ppm), allowing no missed cleavage or modifications, and requiring at least 15% coverage of the predicted protein sequence. Criteria for nano-LC-ESI-MS/MS were at least two matching sequence tags of high quality (clear Y-ion series and several complementary B ions; manually verified). In case of identification of closely related paralogues or orthologues, it was verified that these sequences were unique. The proteome data will become available via cbsu.tc.cornell.edu/vanwijk/and as Tables 1-4 in the Supplemental Material.

ClpP2:GFP Fusion and Transient Expression Assay—The ClpP2 cDNA was amplified from an A. thaliana cDNA library (CD4-7; Arabidopsis Biological Resource Center). The cDNA was then cloned as an NcoI/SpeI fragment into p1302, a pCAMBIA vector (www.cambia.org), containing an mGFP5 expression cassette driven by a CaMV35S promoter. The resulting plasmid was sequenced and used for particle bombardment of Nicotiana tabacum cv. petit havana leaves on agar plates. Leaves were bombarded twice with plasmid-coated 1.1-µm tungsten particles using a model PDS 1000/He Biolistic Particle Delivery System and were kept in the dark for 3 days before imaging. Confocal laser scanning microscopy was performed on an MRC600 attached to a Zeiss Axiovert10 and COMOS program. Images were colored and projected using the Confocal Assistant 4.02 program. Equipment and software were from Bio-Rad.

Homology Modeling and Threading—Reliable pairwise alignments were obtained by BLAST. Three-dimensional models were generated by comparative protein modeling through satisfaction of spatial restraints with the program MODELLER (31), minimizing the violations of distance and dihedral angle restraints from the template structure (i.e. the x-ray crystal structure of eClpP, 1TYF [PDB] ).

The main chain atoms of the models are expected to be within 0.4 Å of the equivalent atoms in the template structure in the conserved regions. Excluding the C-terminal regions, the C{alpha} r.m.s.d. in the ClpP,R models are expected to be on average around 1.5 Å, with 70% of the side chain positions modeled correctly (32). The models of residues in the C-terminal part of ClpP,R proteins as well as those in the insertion loop (L1) of ClpR1,3,4 were built based on the x-ray crystal structures of structural neighbors (Protein Data Bank codes 1NZY [PDB] and 1HNO [PDB] ) of the structure of eClpP, 1TYF [PDB] . These parts of the models are expected to be within a 6-Å r.m.s.d. of the real structure.

By using the threading program 3DPSSM (33) and MODELLER, we built the models for ClpS1,2, respectively, based on the N-terminal domain. All models were validated using the WHAT_CHECK verification routines (34) from the program WHAT_IF (35). The coordinates of our Clp models can be downloaded from the Protein Data Bank (www.rcsb.org/pdb/), as well as via cbsu.tc.cornell.edu/vanwijk/. The PDB codes are as follows: 1R8Z (ClpP1), 1R8V (ClpP2), 1R90 (ClpP3), 1R91 (ClpP4), 1R92 (ClpP5), 1R93 (ClpP6), 1R96 (ClpR1), 1R97 (ClpR2), 1R98 (ClpR3), 1R99 (ClpR4), 1R9A (ClpS1), and 1R9B (ClpS2).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The Complexity of the Clp Gene Family in A. thaliana—Our latest analysis of the sequenced A. thaliana nuclear genome indicates the presence of 26 Clp-related genes, with 19 genes predicted to encode for plastid- or mitochondrial proteins (Table I and Fig. 1A). An additional ClpP gene (P1) is located on the plastid genome. The subcellular localizations of 16 of these organellar proteins have been experimentally determined in this study and/or in earlier studies (2, 3, 10). Recently, a new protein, ClpS, was identified in E. coli. This bacterial ClpS binds to the N terminus of ClpA (36) and affects substrate specificity. Interestingly, a homologue is also present in the A. thaliana genome (assigned here ClpT), and sequence analysis suggests chloroplast localization (see Table I). The bacterial ClpS protein is not related to ClpS1, which we identified as part of the chloroplast ClpPR complex (3).


View this table:
[in this window]
[in a new window]
 
TABLE I
Overview and analysis of the mitochondial and plastid Clp proteins in A. thaliana

 
The Clp gene family can be divided into three groups of ClpP/R (proteases), ClpC/D/B/F/S/Z/X (chaperones), and ClpT (modulator). The chaperone genes are predominantly located on chromosome 5, whereas the ClpP, -R, and -T genes are mostly on chromosome 1 (Fig. 1A). Analysis of the mitochondrial and plastid chaperones (ClpC1,2/D/B3/S1,2/X1,2,3) shows conservation of a I(L)GF motif combined with an upstream basic residue (Lys/Arg), in all sequences but ClpB3 (Fig. 1B). In ClpX of E. coli, this motif is implicated in interaction with the Clp core (37). This suggests that eight of nine plastid and mitochondrial chaperones might interact with Clp cores, as will be further discussed below.

Nomenclature—Throughout this paper we will often refer to Clp proteins and genes from E. coli. For the sake of clarity and to save space, we will refer to all E. coli Clp proteins as eClp (e.g. eClpP, eClpA, etc). Unless otherwise specified, all other proteins are from A. thaliana.

Clp Cores and Chaperones in Chloroplasts of A. thaliana—In an earlier study, we identified an ~350-kDa Clp core complex copurified with the thylakoid by "blue native" gel electrophoresis (BN-PAGE), followed by SDS-PAGE and MS (3). Six Clp proteins (ClpP1-P6), three ClpR proteins (ClpR1-3), and a completely new protein, assigned ClpS1, were identified in this core complex. The heterogeneity of the Clp complex and the identification of a new Clp protein, ClpS1, was extremely surprising. At that time, we did not identify ClpR4 and ClpS2, both predicted to be targeted to the chloroplast. In the current study, by using native isoelectric focusing and a very "mild" native mass gel system, we identified the stromal Clp core complexes in chloroplasts, as well as non-photosynthetic plastids in roots and petals, and we reanalyzed the composition of the thylakoid-associated ClpPRS core.

Fig. 2, A and B, shows the two-dimensional separation of stromal and thylakoid-associated Clp cores with "colorless native" gel electrophoresis (CN-PAGE) as the first dimension. We changed from BN-PAGE to CN-PAGE, because this resulted in greater stability of several chloroplast protein complexes.2 All stained protein spots were analyzed by a combination of matrix-assisted laser desorption ionization time-of-flight MS (MALDI-TOF MS) and nano-LC-ESI-MS/MS. The purification of Clp complexes, their gel-based separation, and MS analysis were repeated several times. The relevant MS/MS data are listed in Table 1, A and B, of the Supplemental Material, whereas complementary MALDI-TOF MS data are not shown. The criteria for positive identification are described under "Experimental Procedures."



View larger version (60K):
[in this window]
[in a new window]
 
FIG. 2.
Clp complexes of A. thaliana chloroplasts. Proteins were stained and identified by MS. ESI-MS/MS data for Clp spots 1-6 and 16-23 are summarized in Supplemental Material Table 1, A and B. Stromal (A) and thylakoid-associated complexes (B) were first separated on native gels, followed by denaturation and separation by SDS-PAGE. Spots are as follows: 1, RbcL (Atcg00490); 2, RbcS (At1g67090); 3, ATP {epsilon} (Atcg00470); 4, ATP {gamma} (At4g04640); 5, ATP {delta} (At1g15700); 6, ATP {alpha} (Atcg00120) and ATP {beta} (Atcg00480); and 7 and 8, different glyceraldehyde-3-phosphate dehydrogenases, and spots 9-15 are the different Clp proteins. C, ~200-kDa ClpC1 dimers identified on the native gel of stromal complexes.

 

We have indicated the Clp complex in Fig. 2, A and B, and also numbered the most abundant proteins around the Clp complex. About 95% of the Clp core was soluble in the stroma, and 5% was associated with the membrane. In both complexes, four ClpR proteins (ClpR1-4), two ClpS proteins (ClpS1,2), as well as five of six ClpP proteins (ClpP1 and ClpP3-P6) were identified by MS. Thus we now also identified ClpS2 and ClpR4 in the core complex. ClpP2 was also identified in the thylakoid-associated complex but not in the stromal complex. Further below, we demonstrate that this was because of a minor mitochondrial contamination. Protein spots in the area around the Clp complex were all identified by MS and were unrelated to the Clp machinery (Fig. 2, A and B).

The ClpC chaperone was identified in the stroma as a dimeric complex of ~200 kDa (Fig. 2C). A very large number of sequence tags generated by MS/MS were identified, matching to both the C1 and C2 protein (Table 1C, Supplemental Material). A number of tags were unique for ClpC1, and none were unique to ClpC2. Thus the dimer certainly contained ClpC1 and probably not (or very little) ClpC2. We searched for ClpD and ClpB3, and a small gel spot with ClpB3 was found in the interface of the stacking gel of the CN-PAGE dimension (above 1000 kDa) (not shown).

Separation of Stromal Complexes by Sucrose Gradient Fractionation and Native Isoelectric Focusing (NIEF)—The chloroplast Clp complex always migrated on both the BN-PAGE and CN-PAGE gels as a single heteromeric complex of ~325 kDa. However, it is still possible that the complex observed in the gels is in fact a mixture of different complexes with approximately the same mass. To elucidate this important point, stromal complexes were separated based on their native pI values, rather than native mass. If indeed Clp core complexes of different composition exist, they are likely to migrate at different pI values, because the predicted pI of the different processed ClpP,R and S subunits vary widely between 4.7 and 8.6 (Table II).


View this table:
[in this window]
[in a new window]
 
TABLE II
Properties of the ClpP, -R, -S proteins and sequence comparisons to related proteins

Sequence and structure similarities (computed using the CE method (44)) of the ClpP,R monomers with eClpP (1TYF [PDB] ) and ClpS1,S2 monomers with the N-terminal domain of eClpA (1K6K [PDB] ), as well as predicted pI and the number of residues in the mature forms of the Clp proteins, that contribute to staining intensity using silver nitrate or Coomassie.

 
Stromal complexes were first separated by sucrose density gradient centrifugation, to allow removal of the majority of the very abundant 550-kDa ribulose-bisphosphate carboxylase/oxygenase complex. This helped to improve the identification of a possible heterogeneous Clp core population. The fractions containing the Clp complexes (between ~250 and 450 kDa) were collected and separated on native IEF tube gels with a pI from 3 to 10, followed by SDS-PAGE. Extensive MALDI-TOF MS and nano-LC-ESI-MS/MS analysis of all visible protein spots indicated that all Clp complexes migrated between pI 4 and 6 in a narrow pI range (data not shown). The sucrose gradient fractions were then analyzed on a narrower pI range gel (from 3.5 to 7) to improve resolution. Fig. 3A shows the chloroplast Clp complex migrating at pI of ~5.0. Extensive MS analysis showed that it contained the same set of 11 Clp proteins, similarly as the ~325-kDa complex separated based on native mass (Fig. 2A). This suggests that there is a single Clp core complex with the same subunit composition. It is, however, possible that there is a mixture of Clp core complexes with the same composition but with different spatial arrangements (positions of the Clp subunits in the rings) or, less likely, that minor differences in composition do not affect the native pI value. But it is unlikely that there are Clp complexes with a strongly reduced complexity, because they would likely have been separated, based on either native mass or native pI value.



View larger version (34K):
[in this window]
[in a new window]
 
FIG. 3.
Separation of stromal Clp components by IEF and their stoichiometry. Proteins were identified by MS. A, chloroplast stroma was fractionated on sucrose gradients, and the fraction from 250-450 kDa was separated by NIEF. After focusing, protein complexes were run on SDS-PAGE, and proteins were visualized by MS-compatible silver staining. ESI-MS/MS data for Clp spots 1-6 are summarized in Supplemental Material Table 1. B, chloroplast stromal complexes were separated by CN-PAGE, followed by denaturation and separation on SDS-PAGE with 13-20% acrylamide and stained with Coomassie. C, the Clp core was separated first on SDS-PAGE from 9 to 15% acrylamide, then incubated with SDS and urea, and loaded on a 13-20% acrylamide SDS-PAGE and stained with silver nitrate.

 
Stoichiometry of Plastid ClpP/R/S Proteins—The E. coli eClpP core is formed by two rings of seven eClpP proteins each. Based on our modeling work in this paper (see further below), we infer that the plastid ClpP/R proteins form a similar ring structure and that ClpS1 and -S2 cannot form part of this ring structure but instead are peripherally associated. Thus the plastid ClpP/R core must contain 14 ClpP/R proteins. Given the presence of 9 different ClpP/R proteins, this means that a number of ClpP/R proteins are present in more than one copy. We thus set out to determine the stoichiometry of the core subunits in the Clp complex. This is a rather challenging task because the 11 proteins migrate very closely together in the range of 22 to 29 kDa. Stromal Clp cores were separated by CN-PAGE, followed by high resolution SDS-PAGE with 9-15% acrylamide (Fig. 3B). Coomassie staining and nano-LC-ESI-MS/MS analysis indicated that ClpS1, ClpS2, and ClpR2 now separated as individual spots. The integrated staining intensities (staining volumes) of these pure proteins were quantified by image analysis and indicated that the staining volumes of ClpS1 and ClpS2 are very comparable. For the conversion of these staining volumes into molar ratios, it is important to take the amino acid composition into consideration, because both silver and Coomassie stain bind preferentially to either Lys (for silver) or Lys, His, Tyr, and Arg residues (for Coomassie). Table II lists the number of these residues for all Clp core proteins. ClpS1 and -S2 have both a total of 31 Lys + Arg + His + Tyr amino acid residues, and the Coomassie staining volumes are thus proportional to their molar ratios. This indicates that ClpS1 and -S2 are present in equal molar ratios in the Clp core population. The staining volume of the pure ClpR2 spot was about 1.5 times higher than for ClpS1 or ClpS2 (Fig. 3B). However, ClpR2 has a total of 42 Lys, Arg, His, and Tyr residues, and its molecular size is ~26 kDa, whereas the apparent molecular mass of the ClpS proteins is ~22 kDa. Together, this suggest that the ClpR2:ClpS1:ClpS2 molar ratio is ~1.5 x 31/42 x 22/26 = ~0.93:1:1. We thus conclude that ClpR2, ClpS1, and CpS2 are present in approximately equal molar ratios.

Because the other spots contained mixtures of the other Clp proteins, we sought further separation. Purified Clp cores on native gels were separated first by SDS-PAGE with 9-15% acrylamide, and then the gel lane was incubated in SDS and urea and separated by SDS-PAGE with 13-20% acrylamide (Fig. 3C). Protein spots were stained with Coomassie, but because staining was relatively weak, they were destained and then restained with MS-compatible silver stain (38). Both images were quantified, and spots were subsequently identified by nano-LC-ESI-MS/MS. Now ClpR2, ClpR4, and ClpP4 were separated as individual spots, whereas ClpP1, -P5, and -P6 were still partially overlapping, as were ClpP3, -R1, and -R3. The staining volume of ClpR2 was about 30% of ClpR4 and -P4. The staining volume of ClpR4 and ClpP4 and their molecular masses are approximately identical. However, ClpR4 has 21 lysine residues, whereas ClpP4 has only 12. This suggests that the molar ratio between ClpP4 and ClpR4 is about 2:1. ClpR2 has 11 Lys residues, nearly the same as ClpP4, and ClpR2 is only 1-2 kDa smaller than ClpP4. Given the 30% staining volume of ClpR2 as compared with ClpP4, this suggests that the molar ratio between ClpP4 and ClpR2 is about 3:1. In our earlier paper (3), we suggested that ClpP5 is probably more abundant than ClpP6 and ClpP1, which was based on multiple strongly dominating peptide ions derived from ClpP5 in both MALDI-TOF MS and ESI-MS/MS. Indeed as shown in Fig. 3C, ClpP5 was found in three adjacent spots, whereas P1 and P6 were found together in one spot. Most likely, ClpP1 and -P6 run closely together in this spot, with the presence of ClpP5 being a minor contamination from the adjacent pure ClpP5 spots. Considering that ClpP5 has only 8 Lys and that the total staining volume of the two spots with pure ClpP5 is about 50% of those of R4 and P4, we suggest that the molar ratio between ClpP5 and ClpP4 is at least ~0.5 x 12/8 x 27/23 = ~0.88:1. Taken these data together, we conclude that the approximate molar stoichiometries between P4:P5:R4:R2:S1:S2 is about 3:3:2:1:1:1:1.

Considering all gels, the low lysine content of ClpP1 and ClpP6, and the comparable number of sequence tags obtained by MS/MS from both P1 and P6, we speculate that ClpP1 and -P6 are present at approximately equal molar ratios to each other and ClpS1 and ClpS2. ClpP3,R1,R3 clustered together in most gels and the staining volume both in Coomassie and silver stain seems quite similar or somewhat less than of ClpP4 + -R4. We tentatively suggest that the molar ratio between the sum of P3 + R1 + R3 is comparable with the sum of ClpP4 + -R4. This roughly leads to a approximate stoichiometry of P4:P5:R4:R2:(P3 + R1 + R3):P1:P6:S1:S2 = 3:3:2:1:(3):1:1:1:1, totaling 16 proteins per core (including S1 and S2).

Clp Cores in Plastids of Roots and Petals in the Genus Brassica—Plastids have a large number of biochemical functions and are present in both photosynthetic tissues (leaves and stems) and non-photosynthetic tissues, such as roots and flowers. The proteome composition between non-photosynthetic and photosynthetic plastids is very different3 and might require different proteolytic systems. One could expect that the expressed Clp machinery is different in these non-photosynthetic plastid types, as compared with the chloroplast Clp complex as an adjustment to the substrate requirements. We therefore determined the Clp composition in non-photosynthetic plastids from roots and (white) flower petals from B. rapa and B. oleracea, respectively (Fig. 4, A and B). Due to the much larger size of their roots and flowers, respectively, they are a much better source for purification of non-photosynthetic plastids than A. thaliana. These Brassica species and A. thaliana belong to the Brassicaceae family, and their proteomes share a very high degree of protein sequence identity. Alignment of the genomic sequences of several Clp genes in B. oleracea (www.tigr.org/tdb/e2k1/bog1/) and A. thaliana indicates an average sequence identity of more than 90% between processed Clp orthologues (not shown). We are therefore confident that the correct ("true") orthologue was identified when searching the A. thaliana data base with MS/MS data from B. rapa and B. oleracea proteins.



View larger version (54K):
[in this window]
[in a new window]
 
FIG. 4.
Identification of Clp complexes in non-photosynthetic plastids of the genus Brassica. Native soluble stroma from B. rapa roots (A and C) and B. oleracea petals (B and C) were separated by CN-PAGE (A and B) or by NIEF (C), followed by denaturation and separation of individual proteins by SDS-PAGE. Proteins were identified by ESI-MS/MS and homology-based searches. ESI-MS/MS data for Clp spots are summarized in Supplemental Material Tables 4, A and B. ~325-kDa Clp core complexes and ~200-kDa ClpC1 dimers in B. rapa roots (A) and B. oleracea petals (B). Gels were stained with Coomassie (A and B) or the fluorescent dye Sypro Ruby in case of the lower panel in A. C, single Clp core complexes with pI ~5.0 in B. rapa roots and B. oleracea petals. Proteins were visualized by Coomassie (B. rapa) or silver staining (B. oleracea).

 
Plastids were isolated from B. rapa roots and B. oleracea petals (petals were collected manually from plants in the green house) using newly developed protocols. The soluble stromal proteomes from both plastid types were separated by CN-PAGE in the 1st dimension, followed by SDS-PAGE. The ~250 most abundant proteins were identified by MALDI-TOF MS and nano-LC-ESI-MS/MS (not shown). The protein patterns of these two-dimensional gels showed that the Clp protease is the most abundant stromal protease in non-photosynthetic plastids, similarly as in the chloroplast (Figs. 2 and 4).3

Fig. 4, A and B, shows the region of complexes between 200-500 kDa, and at this mass resolution, one Clp core was found for both organs. ClpP1,3-6, ClpR1-4, and ClpS1,2 were all identified in these complexes by MS/MS. The MS/MS data are summarized in the Table 2, A and B, of the Supplemental Material. ClpP2 was found in root plastids but not in flower plastids, most likely due to a minor contamination of mitochondria (see below). Similarly as for the chloroplast stroma, ClpC was identified as an ~200-kDa complex in both plastid types (spot 8 in lower panels of Fig. 4, A and B), and similarly as in chloroplast, the ClpC1 (rather than ClpC2) was the main species. ClpB3 was identified as a soluble ~200-kDa dimer in petal plastids (spot 7 in Fig. 4A, lower panel). We do want to point out that glucan phosphorylase (At3g29330) was present as a ~200-kDa complex on the stromal maps of all three plastid types (Figs. 2C and 4, A and B, lower panels), and forms an excellent reference for this region of the gels.

We then separated stromal proteomes of both non-photosynthetic plastids by NIEF (Fig. 4C). In both plastids, the Clp core complex migrated as a single complex in a very focused manner, strongly supporting the results from CN-PAGE.

The Clp Core in Mitochondria of Potato Tubers—To investigate the Clp machinery in plant mitochondria, potato tubers were used as a rich and established source of pure mitochondria. The soluble matrix proteome was separated by CN-PAGE, and all stained protein spots were analyzed by MS (not shown). ClpP2 was identified by nano-ESI-MS/MS only in one location on the two-dimensional gel (Fig. 5), suggesting that ClpP2 forms an ~320-kDa complex (Supplemental Material Table 3). The sequence alignment between ClpP2 of A. thaliana and Solanum tuberosum (constructed from overlapping ESTs; e.g. gi 21917555) indicated an 86% identity over 196 amino acid residues, with much less sequence identity to the other A. thaliana Clp proteins (not shown). Protein spots in the area above and below ClpP2 were all identified by MS/MS and were 23/26-kDa heat shock proteins and ATP synthase subunits, all unrelated to the Clp machinery (Supplemental Material Table 3). No evidence was found for accumulation of any other ClpP,R,S proteins, other than ClpP2, as expected from localization prediction (Table I) and Western blotting for ClpP2 (39). Thus, the plant mitochondrial Clp core is homomeric in composition, similar as in other eukaryotes (40). The presence of ClpX chaperones subunits (ClpX1-3) is under investigation.



View larger version (42K):
[in this window]
[in a new window]
 
FIG. 5.
Identification of the ClpP2 core complex from mitochondria of potato tubers. Gels were stained with Coomassie and proteins identified by ESI-MS/MS and homology-based searches. ESI-MS/MS data for spots 1-10 are summarized in Supplemental Material Table 2. We report here orthologues in A. thaliana. Spots 1 and 2, ATP synthase {delta} (At5g13450); spots 3 and 4, Hsp23-kDa (At4g25200); spot 6 and 7, Hsp 26.5kDa (At1g52560); spot 8, cytochrome b5 reductase (At5g20080); spot 9, formate dehydrogenase (At5g14780); spot 10, ATP synthase {alpha} (At2g07698) and {beta} subunits (At5g08670). B and C, cells expressing ClpP2:GFP showing that ClpP2 (B) and the positive control coxIV:mGFP4 (C) are exclusively targeted to mitochondria. The figures show a projection of 9 transverse optical sections taken at 1-µm intervals along the optical axis. The pseudo colors applied are green for GFP and red for chlorophylls.

 
Because ClpP2 was found in the thylakoid fraction and in root plastids, and because the subcellular localization prediction is ambiguous, we verified localization of ClpP2 by using an approach not dependent on purification of organelles. We therefore cloned the full-length cDNA directly upstream of the gene encoding for green fluorescent protein, and we transformed tobacco leaves with this construct under the control of a strong constitutive promoter by biolistic bombardment. The ClpP2:GFP fusion protein was transiently expressed in tobacco leaves. Cells expressing ClpP2:GFP showed green fluorescence in globular structures identified as mitochondria (Fig. 5B). Clearly, all the green fluorescence was excluded from the chloroplasts, identifiable by the red autofluorescence. As a positive control for mitochondrial targeted GFP, we used the coxIV:mGFP4 fusion protein from Ref. 41 (Fig. 5C).

Compatibility between the ClpP/R Proteins and Unique Contributions—To investigate the structural compatibility of the different ClpP/R proteins, unique structural contributions of the different ClpP/R/S subunits, and to determine the location of ClpS in the Clp complex, we used the structural information of the homotetradecameric ClpP core complex at 2.3-Å resolution (PDB code 1TYF [PDB] ) (15, 42) and the ClpA N-domain (PDB code 1K6K [PDB] ) at 1.8-Å resolution (43) in E. coli by x-ray crystallography.

The six ClpPs and four ClpRs proteins from A. thaliana are homologous to eClpP for most of the sequence lengths, with the exception of 9 or 10 residue insertions in ClpR1,3,4 and extended C termini for several ClpP/R proteins (Table II and Fig. 6). The x-ray crystal structure of 1TYF [PDB] was used as a template to build three-dimensional models for all ClpP/R proteins. The 9/10 residue insertions (assigned L1 insertions) occur at equivalent sequence positions with reference to the template 1TYF [PDB] (positions 70-80, in Fig. 6). To model these insertions and the extended C termini, we identified structural neighbors of 1TYF [PDB] using the combinatorial extension method (44). Among these structures are the monomeric units of both, 4-chlorobenzoyl-coenzyme A dehalogenase (PDB code 1NZY [PDB] ) and D3,D2-enoyl-CoA isomerase ECI1 (PDB code 1HNO [PDB] ). These proteins show very little sequence identity with eClpP, despite their very high structural similarities with 1TYF [PDB] (see Fig. 6). Modeling the non-catalytic monomers ClpR1,3,4 (Fig. 7A for ClpR1) indicates that the insertion loop L1 replaces some of the residues forming the oxyanion hole in the catalytic units and lies over the catalytic clefts between the head domains and the handles, i.e. they fill the volume between strands 4 and 9 (strand numbering follows the convention of Ref. 15) that constitutes the substrate-binding pocket in the catalytic units. Sequence alignments of the monomers of 1TYF [PDB] , 1NZY [PDB] , 1HNO [PDB] based on the structural alignment obtained from the combinatorial extension method and the BLAST alignments of the ClpP/R sequences with the eClpP sequence are shown in Fig. 6. Information about the secondary structure elements ({alpha}-helices and {beta}-strands) is also indicated.



View larger version (43K):
[in this window]
[in a new window]
 
FIG. 6.
Structure alignment of the models for the ClpP/R monomers with 1TYF [PDB] , 1NZY [PDB] , and 1HNO [PDB] . The alignments are based on the structure alignment between the 1TYF [PDB] monomer and its structural neighbors.a, 1TYF [PDB] eClpP;b, ConsRes indicates conserved residues in all the ClpP/R sequences;c, Cat.Site indicates residues forming the catalytic site;d, Put_Open indicates residues that are involved in the putative opening in the Clp core;e, SS indicates the secondary structure information of the specified molecule: E, extended; H, helix; T, turn.

 



View larger version (52K):
[in this window]
[in a new window]
 
FIG. 7.
Homology modeling of ClpP/R. A, ribbon representation of the model of the ClpR1 monomer (in yellow) superimposed on the x-ray crystallographic structure of a monomer of eClpP (PDB code 1TYF [PDB] ) in red. Catalytic residues Ser-97, His-122, and Asp-171 in 1TYF [PDB] are shown using a "ball and stick" representation. The catalytic triad is missing in the ClpR monomers. Residues at the C terminus of ClpR1 do not have matching residues in 1TYF [PDB] and were modeled based on structural neighbors of 1TYF [PDB] . B and C, homology model of the mitochondrial homomeric ClpP2 core complex using a ribbon representation. One of the monomers is colored mostly yellow, with the modeled C-terminal portion in green, sitting on top of a P1 pocket. On the remaining monomers, the ribbons are colored according to the positions of residues in the sequence starting from blue at the N termini and ending with red at the C termini. B, top view; C, front view.

 
The extended C termini the ClpP/R proteins could be modeled using 1NZY [PDB] and 1HNO [PDB] because they share a well defined pattern at their C-terminal ends formed by two to four {alpha}-helical fragments. Modeling suggests that the extended C termini present in ClpP3-5 and ClpR1-4 can form {alpha}-helical structures that can fold over the top of the barrel-like core structure. An example for a three-dimensional model is shown for ClpR1 superimposed on eClpP (Fig. 7A).

After constructing reliable three-dimensional models for all ClpP and ClpR monomers, we created models for different hetero-tetradecameric plastid Clp cores, as well as for the homo-tetradecameric ClpP2 core from mitochondria (Fig. 7, B and C). All possible contacts between different types of monomers (each unit is in contact with 4 others) were generated and indicated that the different ClpP and CpR proteins could be part of the same tetradecameric ring structure, without obvious incompatibilities. The root mean square deviations of the C{alpha} atomic positions (C{alpha} r.m.s.d.) obtained after optimal superposition of the structures of the template 1TYF [PDB] and the ClpP/R models are low (see Table II), showing that the models are highly reliable. All models were carefully analyzed by visual inspection and validated using the program WHAT_CHECK (34).

Putative Lateral Openings in the Clp Core—Sequence conservation of two Arg residues present in all ClpP/R proteins motivated us to look into their three-dimensional environment in 1TYF [PDB] and the modeled proteolytic ClpPR cores. Careful analysis of this part of the structures suggests putative lateral openings that connect the interior of the central cavity with the outside (Fig. 8, A and B). Two types of putative exits/entrances can be observed.

  1. The first type (Fig. 8, A and B, pink arrows) is located at the interface of the two rings, and it is formed by residues contributed by four different units, i.e. two from each ring (Fig. 8C), with a total of 7 openings per core. These openings measure ~8 Å in diameter and are remarkably polar. The type of residues lining the opening varies, depending on the ClpP/R monomers that contribute to the site. However, two conserved Arg residues are present in all ClpP/R and contributed to these openings by opposite monomers (A and N in Fig. 8A). One of these Arg residue precedes the catalytic Asp residue (Asp-171 using the 1TYF [PDB] [PDB] residue numbering) in all ClpP proteins. We also observed that the insertion loops L1 in ClpR1,3,4 can contribute with additional polar residues to flank the internal lips of the polar opening when the monomers are located in any of the two positions shown in blue in the diagram shown in Fig. 8C. All combinations of monomers maintain the polar character of this putative opening with the only exception of ClpR3. A two-residue insertion (Leu-123 and Met-124) in ClpR3 may block or partially close this opening when ClpR3 occupies the two positions shown in red in the diagram (Fig. 8C). We suggest that these polar openings may provide fast exit routes for the proteolytic products (see "Discussion").
  2. The second type of putative opening (Fig. 7, A and B, yellow arrows) occurs between the two closest catalytic sites located in neighboring monomers on the same ring, thus with 14 putative openings per core. The walls of these putative openings are formed largely by residues located in helices D and E in the first monomer and residues in strands 6, 8, and 10 in the second monomer. A large number of hydrophobic residues flank the internal walls of these putative channels, and fluctuations of the side chains of these residues can provide an oval-like opening with maximum and minimum radii of about 10 and 7 Å, respectively.



View larger version (57K):
[in this window]
[in a new window]
 
FIG. 8.
A, stereo view of the interface between four ClpP2 monomers, indicated by A and G (upper ring) and H and N (lower ring), forming part of a homomeric ClpP2 core complex. The ribbon representation of the monomers is colored according to the hydrophobic/hydrophilic properties of the residues. Color scale: polar (blue), amphiphilic (white), hydrophobic (red), and catalytic residues (yellow) in "stick" representation. The pink and yellow arrows indicate two types of putative lateral openings to the central chamber. B, stereo view of the molecular surface of the same four monomers using the colors as in A. C, schematic diagram showing alternative positions of ClpR monomers that can affect the character of a putative polar lateral openings.

 
Modeling of ClpS1,2 and Their Interaction with the Plastid ClpPR Core—Searches with BLAST and the 3DPSSM threading tool for ClpS1,2 indicate excellent sequence-to-structure alignments (95% certainty) with the N-terminal domain of eClpA (PDB 1K6K [PDB] ), recently deposited by Guo et al. (43). This N-terminal domain is assigned here as the "N domain." The alignments (Fig. 9) were used to build three-dimensional models of ClpS1,2.



View larger version (44K):
[in this window]
[in a new window]
 
FIG. 9.
Alignment of the ClpS sequences with the N-terminal domain of eClpA. NT-eClpA denotes the N-terminal domain of eClpA (PDB code 1K6K [PDB] ); COG542-eClpA indicates the consensus N-terminal domain of eClpA structures found by BLAST (NCBI CD, COG0542.1,ClpA; PSSM-Idm 10413); N domain denotes the Clp N-terminal domain found in one or two copies at the N terminus of ClpA and ClpB proteins from bacteria or eukaryotes. Color code: positions with polar residue conservation in blue; charge conservation in red, and hydrophobicity conservation in green. Conserved residues are shown in boldface. The magenta (LL1) and orange (LL2) rectangles indicate residues from two loop regions on ClpS1,2. The equivalent loops in NT-eClpA possibly provide most of the interacting surface when bound to eClpS.

 
The models of ClpS1,2 (ClpS1 in Fig. 10) show that most of the hydrophobic residues (in green, Fig. 10A) constitute the central core of the structure and are probably important for structural reasons only. On the other hand, conserved residues Arg-18, Glu-28, Arg-57, and Ser-118 may be involved in functional aspects of ClpS. Our conclusions are based on the following: (a) the accessibility of these residues (surface-exposed residues), and (b) on the x-ray experimental structure of the N-terminal domain of eClpA bound to eClpS (PDB code 1MBU [PDB] ) of Ref. 45 that shows the equivalent Glu-28 and Ser-118 (same residue numbers in monomers A and B of 1MBU [PDB] ) interacting with eClpS. The latter x-ray crystal complex also shows that two loops in eClpA provide most of the surface of interaction with eClpS. The equivalent loops in ClpS involve loop LL1 from Arg-18 to Glu-28 in ClpS1 and loop LL2 from Asp-68 to Leu-80 in ClpS2. Residues in these two loops are shown in magenta and orange, respectively (Fig. 10C). It should be noted that only the first loop, LL1, involving residues Arg-18 to Glu-28, is inside the region recognized as the N domain.



View larger version (59K):
[in this window]
[in a new window]
 
FIG. 10.
ClpS1,2 structure and modeled Clp core interactions. A, stereo view of a model of ClpS1. The model was generated by using the structure of the N-terminal domain of eClpA as the template and the sequence alignment obtained using the three-dimensional PSSM threading tool. Only those residue side chains satisfying conservation of hydrophobic, charged, or polar properties are displayed. Color code see the legend to Fig. 9. Conserved residues are in thicker lines. B, hypothetical mode of interaction of ClpS with the Clp core. A solvent-accessible surface representation of a ClpS1 monomer (in magenta) bound into one of the P1 pockets formed at surface of two ClpP,R monomers (ClpP3 in red and ClpP6 in orange) in the catalytic core. The remaining 12 ClpP,R monomers in the Clp core are represented using green ribbons. C, another stereo view of the "docking" model of ClpS1 into the P1 pocket formed by the two neighbor ClpP3 and ClpP6 monomers using an atomic representation of residues that may be involved in the interaction. The remaining 12 ClpP,R monomers forming the catalytic core were omitted. Asp-68 indicates the position of a conserved Asp residue in the Clp,Rs. Similarly, Arg-18 corresponds to a conserved residue in ClpS. The N terminus of the monomer is indicated by N. The model of ClpS1 bound the Clp core was generated manually by rigid docking of the backbones of both structures, followed by subsequent modification of the positions of few residue side chains to overcome atomic clashes.

 
Given that the modeled structures of the ClpS proteins are quite different in shape from the ClpP/R proteins, it is highly unlikely that ClpS1,2 is part of the heptameric rings. In contrast, the surfaces of the Clp core and ClpS1,2 show a number of complementarities in shape and hydrophobicity/polarity. Our models of the proteolytic core show the hydrophobic P1 pockets (15) near the axial entrances as possible regions for binding. As we mentioned above, our models for all the ClpP,R monomers, with the exception of ClpP1 and ClpP6, indicate that the P1 pockets may be occluded by the C termini of the monomers forming the site (Fig. 7, B and C, and Fig. 10). When exposed, the P1 grooves always show very good surface complementarity with the epitope formed by loops LL1 and LL2 in ClpS1,2 (Fig. 10, B and C). In addition, the highly aromatic characters of the P1 pockets on the core complex and the ClpS1,2 epitope (with two Tyr, two Glu, and four additional Pro residues; see alignment in Fig. 9) can provide the elements for a tight binding between the two molecules, whereas some of the conserved charged residues, e.g. Arg-18 and Glu-28 in ClpS1,S2 and Glu-64 in the ClpP,R, may provide the steering guidance to orient the two molecules during binding. The (I/L)GF motif, as mentioned earlier, might be involved in a different way of interaction.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The Surprising Complexity of Clp Cores in Plastids and Mitochondria—Chloroplasts have an extensive thylakoid membrane system, and their soluble stromal proteome is dominated by the Calvin cycle enzymes, whereas leucoplasts and root plastids are non-photosynthetic, have no thylakoid membranes, and contain very little Calvin cycle enzymes. Thus these are very different plastid types, with potentially different needs for proteolysis. Because the Clp family in Arabidopsis and other higher plants is much more complex than in bacteria, one can postulate that the Clp genes are differentially expressed in dependence of tissue type. In this study, we have therefore analyzed the Clp machinery in these three very different plastid types, as well as in mitochondria.

Surprisingly, in all three plastid types, we always detected a Clp complex of ~325 kDa containing 11 different Clp proteins, namely ClpP1, ClpP3-6, ClpR1-4, and ClpS1-2. ClpR4 and ClpS2 had not yet been identified in any Clp complex before. This complex migrated at a pI of ~5.0 in native IEF tube gels. Considering all gels and image analysis, amino acid content, and the number of peptide sequence tags from MS/MS, we suggest a tentative stoichiometry of P4:P5:R4:R2:(P3 + R1 + R3):P1:P6:S1:S2 = 3:3:2:1:(3):1:1:1:1, totaling 16 proteins per core (including S1 and S2). The take-home message is that the analysis clearly excludes that plastids contain significant levels of homomeric cores or cores with only a small number of ClpP/R forms, because they would separate based on either their native mass or pI.

In contrast, the mitochondrial Clp complex is an ~320-kDa homotetradecamer of ClpP2, without other ClpP/R orthologues and without association of ClpS1,2. Mitochondrial localization was independently confirmed by transient expression essays of a ClpP2-GFP fusion in leaves, in agreement with Western blot data (39).

Thus all plastid-localized ClpP/R/S gene family members are expressed in roots, leaves, and petal plastids and are assembled into heteromeric structures. The presence of all plastid-localized ClpP/R proteins in one well focused complex, as shown by two different separation principles (by native mass and native IEF), is surprising and suggests that each paralogue is required for structure and/or function. This suggestion is supported by preliminary data from T-DNA insertion lines of A. thaliana indicating that several of the nuclear-encoded ClpP and ClpR proteins are essential.4 The chloroplast-encoded ClpP1 is also known to be essential in tobacco (4, 5) and C. reinhardtii (6). This also implies that regulation of plastid and mitochondrial proteolysis in not through regulation of Clp gene expression but rather through substrate recognition mechanisms and interaction of chaperone-like molecules (ClpS1,2 and others) to the Clp core. Western blot analysis of protein extracts of different plant tissues indicated constitutive expression of the ClpP proteins (8, 10), without significant expression changes under different stress conditions. This is in agreement with our results of a single ClpP/R complex and suggests that the heteromeric ClpP/R cores observed in our study do not change their composition significantly when plants are exposed to stress.

Thus to understand regulation of Clp proteolysis, it will be critical to obtain detailed insight in the ClpP/R core structure, and its association with ClpS1,2 and other chaperones. Therefore, homology models were created for all ClpP and ClpR monomers and organized in different configurations in tetradecameric structures. This showed that the different ClpP and CpR proteins could be part of the same heptameric ring structure and that there were no strong incompatibilities between heptameric rings of different composition forming tetradecameric complexes.

The models were further used to determine how the different ClpP/R proteins contribute to the structures and potentially affect proteolysis, as well as how they affect interactions with the chaperones and ClpS1,2. From a structural perspective, the obvious difference between the ClpP proteins is the length of their C termini. In particular, the C termini of ClpP3 and ClpP4 are 52 and 41 amino acid residues longer than ClpP1 and eClpP. Modeling based on the structural neighbors of 1TYF [PDB] suggests that these C termini may form {alpha}-helical structures that can fold over the axial side of the core into a specific P1 pocket (or new tripeptide motif), thereby potentially interfering with docking of the hexameric chaperones. This is a novel feature for the plastid ClpPR core. There are no other differences between the ClpP isoforms that can be directly related to function or structure, but it is important to point out that the overall sequence identities between the ClpP proteins are not more than 60%.

The presence of the four non-catalytic ClpR proteins is very intriguing. What is the difference between the four gene products and why are these non-catalytic subunits part of the complex? Clearly, the ClpR proteins have lost the catalytic protease site, and they do therefore not directly contribute to proteolysis themselves. However, three of the four ClpR proteins (ClpR1,3,4) have a 9 or 10 amino acid insert (the L1 inserts), possibly affecting the presentation of the substrate to the neighboring catalytic triads in the ClpP proteins. In addition, the L1 inserts do contribute residues to the putative lateral polar openings that might stimulate (ClpR1,4) or inhibit (ClpR3) exit of peptide fragments from the core (see more below). The four ClpR proteins differ also in the length of their C termini, similarly as the ClpP subfamily and can potentially influence chaperone interaction to the ClpP/R core. Thus the ClpR proteins likely have a regulatory role. It is important to note that the actual mechanism of proteolysis in not known for any type Clp core complex in any organism, and it is therefore difficult to predict the impact of the presence of the ClpR proteins on efficiency of proteolysis, as well as the size of the proteolytic end products. It is conceivable that some of the differences in the primary sequence of the ClpP/R proteins are needed for the assembly process of the core, especially because there are no strong structural constraints in varying the ClpP/R composition.

The two axial pores of the Clp proteolytic core have been suggested as entrances to the central chamber, as well as exits for proteolytic products (15). This axial exit route requires that either the chaperones are removed from the core prior to product release or that one side of the core remains unoccupied. Conservation of two Arg residues in all ClpP/R and eClpP and re-analysis of the 1TYF [PDB] and the three-dimensional models of plastid and mitochondrial cores suggested conserved putative highly polar lateral openings formed by residues belonging to four subunits located at the interface between the two rings. These polar openings seem to be better candidates as a route to clear the products of catalysis, due to their proximity to the catalytic sites, thus avoiding two-way traffic through the axial pores. All combinations of ClpP/R isomers (with the exception of ClpR3) maintain the polar characteristics of this opening. A second type of putative openings is less attractive and may be less likely to be real openings, given the hydrophobicity of the residues lining the putative openings.

We showed that Clp cores in all plastid types (but not in mitochondria) contain ClpS1,2; the stoichiometry analysis suggested (on average) one copy of each ClpS per core. This indicates that the interaction between ClpS and the core is quite strong, because it "survives" both native PAGE and native IEF. The three-dimensional models strongly indicate that ClpS1,2 do not fit in the tetradecameric ring structure but that they do fit well on the axial sites of the ClpPR cores. This interaction involves the hydrophobic (aromatic) P1 pocket in the core and the S1,2 loops LL1 and LL2 and a number of additional conserved residues in ClpS1,2. This predicted interaction site would indeed give the strong interaction as observed experimentally in this study.

ClpS1,2 are found from early land plants (the moss Physcomitrella) to higher plants. However, it seems to be absent in prokaryotes, including photosynthetic bacteria. ClpS also seems absent from the unicellular algae C. reinhardtii5 for which the genome was recently sequenced (genome.jgi-psf.org/chlre1/chlre1.home.html). Thus, ClpS is not a peculiarity of the Brassicaceae and suggests that ClpS1,2 has a functional role in plastid proteolysis in plants.

The binding of ClpS1,2 at this axial site will likely compete directly with the binding of a ClpC hexamer. What could be the benefit of this mutual exclusion? If ClpC hexamers cannot bind to the core, proteolysis of folded or aggregated proteins will be prevented. Binding of ClpS might also function to keep the axial openings free for release of proteolytic products. Experiments are in progress to test these hypotheses.

The ClpB,C Chaperones—The chaperone ClpC1 was identified as a dimer in the three plastid types, suggesting that this conformation is the most stable unit. Also the ClpC homologue, eClpA, crystallizes as dimers. Formation of hexamers and interaction with the Clp core requires ATP in E. coli (for ClpA) (16) and chloroplasts (for ClpC) (46). We found that the concentration of ClpC1 was in the same order of magnitude as the concentration of the ClpP/R core. Interestingly, we did clearly identify ClpC1 but not ClpC2. ClpC1 was also found to associate transiently with the inner envelope when involved in protein import (47, 48), although the majority is located in the soluble stroma, as determined by Western blots (8). We are not aware of any study in which the ClpC2 protein was identified specifically. The presence of this second ClpC gene only became clear recently, after sequencing of the A. thaliana genome. We did not find expression of the chaperone ClpD (also named ERD1), which is not surprising because it is reported to be expressed during drought stress and/or during senescence, although the literature is not consistent (compare Refs. 9 and 49). ClpB3 was identified as a 200-kDa dimer in petal plastids, and it overlapped with a 200-kDa dimer of glucan phosphorylase, visible in all three plastid types. This might have masked ClpB3 in chloroplasts and root plastids. However, based on the data from the E. coli homologue and the absence of the "IGF" motif, ClpB3 is unlikely to interact with the Clp core and is not playing a role in Clp-mediated proteolysis.

Conclusions for Clp Proteolysis in Higher Plant Plastids and Mitochondria—The Clp machinery in plastids of higher plants has acquired a surprising complexity. In contrast, the much simpler Clp machinery in plant mitochondria still resembles an E. coli eClpP core. Together, our data show that Clp core complexes present in different plastid types have a similar size and overall composition and that the Clp complex is the most abundant protease complex in the stroma. The essentiality of ClpP1 in tobacco and C. reinhardtii (4, 6), and other ClpP and ClpR genes in A. thaliana,4 together with the Clp abundance and complexity, suggests a very central role for Clp in plastid homeostasis, equivalent to the central role of the proteasome in the cytosol.

Our findings also show that regulation of plastid proteolysis by the Clp machinery is not through differential regulation of ClpP/R/S gene expression. Instead proteolysis could be regulated by substrate recognition mechanisms and interaction of chaperone-like molecules (ClpS1,2 and others) to the ClpP/R core. This is why the detailed insight obtained by three-dimensional homology modeling of the Clp machinery is so important, as it points to several regulatory mechanisms.

Given the essential nature of most (if not all) ClpP/R genes, the assembly of all ClpP/R subunits in one complex and the expression of the ClpP/R machinery in both photosynthetic and non-photosynthetic plastids, it will require a very significant effort to identify Clp substrates, mechanisms of substrate recognition and proteolysis, and the functional role of contribution of all Clp family members. Clearly, the combined experimental and theoretical analysis and the resulting working models for the Clp machinery presented in this paper will form a strong basis for future experimental studies.


    FOOTNOTES
 
The atomic coordinates and structure factors (codes 1R8Z, 1R8V, 1R90, 1R91, 1R92, 1R93, 1R96, 1R97, 1R98, 1R99, 1R9a, 1R9B) have been