Substrate Specificity of Human Carboxypeptidase A6*

Carboxypeptidase A6 (CPA6) is an extracellular matrix-bound metallocarboxypeptidase (CP) that has been implicated in Duane syndrome, a neurodevelopmental disorder in which the lateral rectus extraocular muscle is not properly innervated. Consistent with a role in Duane syndrome, CPA6 is expressed in a number of chondrocytic and nervous tissues during embryogenesis. To better characterize the enzymatic function and specificity of CPA6 and to compare this with other CPs, CPA6 was expressed in HEK293 cells and purified. Kinetic parameters were determined using a panel of synthetic carboxypeptidase substrates, indicating a preference of CPA6 for large hydrophobic C-terminal amino acids and only very weak activity toward small amino acids and histidine. A quantitative peptidomics approach using a mixture of peptides representative of the neuropeptidome allowed the characterization of CPA6 preferences at the P1 substrate position and suggested that small and acidic P1 residues significantly inhibit CPA6 cleavage. Finally, a comparison of available kinetic data for CPA enzymes shows a gradient of specificity across the subfamily, from the very restricted specificity of CPA2 to the very broad activity of CPA4. Structural data and modeling for all CPA/B subfamily members suggests the structural basis for the unique specificities observed for each member of the CPA/B subfamily of metallocarboxypeptidases.

The M14 family of metallocarboxypeptidases (CPs) 2 is a large family of enzymes that functions in the cleavage of amino acids from the C termini of a variety of peptide and protein substrates (1). This family can be divided into three subfamilies based on sequence and structural similarities. The CPA/B subfamily was the first to be described, each member containing a well conserved CP domain as well as an N-terminal prodomain that is proteolytically removed for full activity (2,3). Members of the CPN/E subfamily do not have a prodomain, but rather have a C-terminal transthyretin-like subdomain thought to be involved in protein folding (4). CPE is one well characterized member of this subfamily that processes neuropeptides and hormones in the secretory pathway (5). The cytosolic carboxypeptidase subfamily is the most recently identified, each member containing conserved CP and N-terminal domains and, in many cases, large N-and C-terminal extensions (6,7). Recent work suggests a role for these enzymes in cytosolic peptide degradation and autophagy (8).
The CPA/B subfamily is composed of nine members, six having specificity toward C-terminal aliphatic and aromatic amino acids (CPA1-6), two having specificity toward C-terminal basic amino acids (CPB1, CPB2), and one having predicted specificity toward C-terminal acidic amino acids (CPO) (1,3,9). CPA1 is the best characterized member of this subfamily, as it is a highly expressed pancreatic enzyme involved in food digestion in the gut and was one of the first proteins to have its crystal structure solved (10,11). In recent years more has been learned about the enzymatic specificity, gene expression, and physiological roles of other members of this subfamily. X-ray crystal structures have been solved for five of the nine members of this subfamily (3,(12)(13)(14)(15)(16)(17). CPA2 and CPB1 are pancreatic metallocarboxypeptidases that function in the digestion of food (18). The role of CPA3 in the protective responses of mast cells has been characterized (19). CPB2, which is also known as thrombin-activatable fibrinolysis inhibitor, has been extensively studied for its role in reducing fibrinolysis (20,21). Recently, the enzymatic characteristics of CPA4, an enzyme implicated in prostate cancer (22), have been determined and compared with those of CPA1 and CPA2 (23).
CPA6 was first identified in a bioinformatics search for additional members of the M14 metallo-CP family in the human genome (9). Soon after its identification, the CPA6 gene was shown to be present within a previously identified Duane syndrome genomic locus and disrupted in a Duane syndrome patient (24). This implicated CPA6 in the etiology of Duane syndrome, which is a neurodevelopmental disorder in which the sixth cranial nerve does not innervate its target extraocular muscle, the lateral rectus, resulting in a defect in eye abduction (25). Analysis of the CPA6 mRNA expression pattern in the mouse indicated that CPA6 is found in a number of tissues in the embryonic E14.5 mouse, including the developing vertebrae, dorsal root ganglia, skin, cerebellum, and a condensation posterior to the eye (26). We have recently shown in zebrafish that this tissue condensation near the eye is developing chondrogenic tissue near the lateral rectus muscle that may have a role in the innervation of this muscle. 3 CPA6 may also play a role in developmental processes in the adult. CPA6 expression is found in the adult mouse olfactory bulb (26), where neurons develop continuously throughout adulthood, as well as in human and mouse bone marrow and the chicken Bursa of Fabricius (Unigene data base), both locations of B-cell maturation.
CPA6 is a secreted enzyme that binds tightly to the extracellular matrix (ECM) (27), suggesting that endogenous substrates of CPA6 might also be found at the ECM. CPA6 was predicted to have specificity for hydrophobic C-terminal amino acids (9). This was confirmed in a preliminary characterization of CPA6, which investigated the enzymatic activity of CPA6 in the ECM and did not use purified enzyme (27). Furthermore, these previous studies were not quantitative and did not provide kinetic values for substrate cleavage, which are important for comparing among related enzymes. Here we report the purification of human CPA6 and the characterization of its enzymatic activity and specificity. We compare these results with similar experiments using CPA1, CPA2, and CPA4 and perform modeling of the substrate binding pocket of CPA6 to compare with x-ray crystal structure information available for related enzymes. Taken together, these studies provide a comprehensive analysis of the substrate specificity of CPA6 as well as a broader perspective on metallo-CPs in general.

EXPERIMENTAL PROCEDURES
Purification of CPA6-Human CPA6, containing C-terminal HA and His 6 tags (hCPA6-HAH6), was stably expressed in HEK293 cells (American Type Culture Collections (ATCC), Manassas, VA) grown in minimum essential medium (Mediatech, Inc., Manassas, VA) supplemented with 10% horse serum, glutamine, and penicillin/streptomycin. Over a period of 10 days these cells were adapted to SFM4 HEK293 serum-free medium (Thermo Scientific Hyclone, Logan, UT) supplemented with 0.5% horse serum (serum free medium resulted in significantly lower expression levels) and then transferred to spinner flasks for suspension culture (maximum 0.5 liters in a 2-liter flask for proper aeration). Medium was collected and replaced each week.
Carboxypeptidase Assays-Most of the 3-(2-furyl)acryloyl peptide (FA) carboxypeptidase substrates used in the present study were synthesized as described (23); some were purchased from Bachem (Torrance, CA). Substrates were dissolved in 50 mM Tris-HCl, pH 7.5, 150 mM NaCl. Human CPA6 was purified from stably expressing HEK293 cells as described above. Bovine CPA1 was purchased from Sigma. Cleavage of substrate was performed at 25°C using a total volume of 100 l in a polystyrene 96-well plate and was measured as a decrease in absorbance at 342 nm. Assays were performed two to four times in triplicate and kinetic parameters were determined by fitting to a Michaelis-Menten curve using nonlinear regression analysis. When determining the pH optimum, substrate was dissolved in 50 mM Tris acetate buffer containing 150 mM NaCl at the indicated pH values.
Peptidomics-Quantitative peptidomics experiments were performed as described (23) with minor modifications. In brief, peptides purified from mouse brain were incubated for 90 min with 100, 10, or 1 nM purified CPA6, or incubated in the absence of enzyme. Following the incubation, the reaction was quenched, peptides were labeled with stable isotopic tags, pooled, and subjected to liquid chromatography and mass spectrometry, as described (23). In the present study, the peptidomics analysis was performed twice, each time testing 3 concentrations of enzyme and one control incubation, but with different isotopic tags used for each enzyme concentration; this was done to control for potential variations in the labeling efficiency with the isotopic reagents.
Modeling-Models were made using SWISS-MODEL (expasy) (28) and incorporate active site side chain rotamers most consistent with known CPA structures. Ramachandran plots were produced for all models to verify proper amino acid stereochemistry, and local and overall model quality was verified using Prosa-web. All images were drawn using PyMol.

RESULTS
Initially, several expression systems were tested for the production of enzymatically active CPA6. A number of CPs related to CPA6 have been expressed and secreted in high levels from insect cells using a baculovirus expression system (CPA5 (9), CPE (29), and CPM (30)), and from Pichia pastoris yeast cells (CPA4 (31) and CPB1 (32)). However, neither of these systems was successful for CPA6. Expression of a His 6 -tagged CPA6 in P. pastoris was not detected in either cell extracts or in the medium. Although CPA6 was strongly expressed in Sf9 insect cells using the baculovirus expression system, protein was not secreted in appreciable quantities. The small amount of intracellular CPA6 that was soluble in Sf9 cells could not be purified on the metal chelate resin, suggesting it did not have an exposed His 6 tag due to improper folding. Previously, CPA6 was found to be secreted from HEK293T cells in an active form, in which the propeptide was cleaved (27). In these previous studies, CPA6 was largely retained in the extracellular matrix of the HEK293T cells. Therefore, we stably expressed CPA6 in mammalian HEK293 cells and tested if the protein was secreted into the medium when cells were grown in suspension. It was found that CPA6 was secreted into low serum-containing medium as an active enzyme (supplemental Fig. S1).
CPA6 was purified through a two-step affinity chromatography protocol ( Fig. 1A and supplemental Fig. S1). The first purification step was metal affinity chromatography, making use of a His 6 tag at the C terminus of CPA6. To avoid the enzymatic removal of the His 6 tag by CPA6, as discovered in a previous study (27), we supplemented the medium with 2 mM benzylsuc-cinic acid, a CPA-specific enzyme inhibitor. HEK293-conditioned medium was stirred batchwise with Talon metal affinity resin. The resin was then collected, washed, and eluted in low pH buffer. Because CPA6 binds with high affinity to heparin (27) the Talon resin eluate was passed through a heparin-agarose affinity chromatography column and CPA6 was eluted in high salt. The resulting product was analyzed by SDS-PAGE and Coomassie Blue staining and determined to be greater than 95% pure, showing the presence of major bands at 35 and 50 kDa, corresponding to the active enzyme and proenzyme, respectively (Fig. 1B). Several weak high molecular weight bands were also seen, but could not be eliminated even by size exclusion chromatography. These may be oligomeric forms of CPA6.
To confirm the correct folding and tertiary structure of the enzyme, purified CPA6 was passed through an affinity column composed of potato carboxypeptidase inhibitor (PCI) coupled to Sepharose 4B. PCI is a specific and potent inhibitor of CPA/B enzymes through extensive surface and active site interactions (33), and has been found to inhibit CPA6 with a K i in the low nanomolar range (27). All CPA6 loaded onto the PCI-Sepharose column bound strongly, as judged by Western blot and enzymatic activity of the flow-through and washes, and eluted in pH 12 buffer (supplemental Fig. S2). Although the high pH eluate was immediately neutralized, it had no detectable enzymatic activity, likely due to this brief exposure to strongly alkaline conditions. Therefore, this affinity column was not useful in the purification scheme. However, the PCI binding strongly suggests correct folding of the enzyme.
The pH optimum and kinetic properties of CPA6 were evaluated using a panel of CP substrates recently synthesized in our laboratory (23). These substrates consisted of the 3-(2-furyl)acryloyl (FA) chromogenic group conjugated to a C-terminal dipeptide (34). As a penultimate phenylalanine is thought to be preferred by CPA6 (27), each FA substrate contained a penultimate phenylalanine followed by a hydrophobic C-terminal amino acid (Phe, Tyr, Trp, Met, Leu, Ile, Val, Ala, or His).
The pH optimum for purified human CPA6 was determined using the chromogenic substrate FA-Phe-Phe and compared with the pH optimum of bovine CPA1. Both enzymes exhibited an optimum at pH 7.5-8.0 (Fig. 2), consistent with a role outside of the secretory system. Outside of this optimum CPA6 exhibited a broader range of activity than that seen for CPA1. The pH optimum of both enzymes for the substrate FA-Phe-His was also determined. Differences were suspected for this substrate as the side chain of histidine has a pK a of 6.04 and therefore would be protonated at lower pH values. The preference of both CPA1 and CPA6 for uncharged substrates was supported here by the observation of a narrower pH optimum shifted slightly toward a higher pH (Fig. 2).
A comparison of the K m and k cat of CPA6 in the binding and cleavage of a number of different C-terminal amino acids was performed using the above described panel of CP substrates, along with the commercially available FA-Arg-Leu. The substrates fell into three groups in regards to their ability to be cleaved by CPA6 (Table 1) (2), collection of conditioned medium (3) and affinity chromatography using metal-affinity (4) and heparin (5) columns. B, purified CPA6 was resolved by SDS-PAGE and analyzed by Coomassie Blue staining.

CPA6 Enzymatic Characterization
the previous group. Finally, the last three substrates, with penultimate Phe and C-terminal His, Val, and Ala exhibited very low k cat (Ͻ0.5/s) and variable affinities.
The above results showed the ability of CPA6 to cleave different C-terminal (P1Ј) amino acids, focusing on aliphatic/aro-matic residues likely to be substrates. These results gave little information on the effects of penultimate (P1) amino acids or those even farther from the C terminus. Therefore we applied a quantitative peptidomics technique to address this question. Different amounts of purified CPA6 enzyme were incubated with a peptide mixture extracted from mouse brain, representative of the peptidome of the mouse brain that might be encountered by secreted CPA6. After incubation with enzyme, the peptides were differentially labeled with isotopic tags, combined, and analyzed by liquid chromatography/mass spectrometry (LC-MS) (23,35,36).
Over 100 peptides were detected through these LC-MS analyses, with close to 50 being identified through a combination of tandem mass spectrometry (MS/MS) and close matches with previously identified peptides; the criteria used for the matches included an observed monoisotopic mass within 0.004% of the theoretical mass, an expected charge equal to the number of basic residues plus the N terminus, and a correct number of isotopic tags incorporated. A number of peptides were identified in multiple LC-MS runs. In many cases the peptides in the peak set exhibited roughly equal peak heights, indicating that these peptides were not substrates or products of CPA6 under the reaction conditions used ( Fig. 3A and supplemental Table S1). Some peptides exhibited a decrease in peak intensity upon incubation with medium amounts of enzyme and complete or near complete decrease with high amounts of enzyme; these are good substrates for CPA6 ( Fig.  3B and Table 2). Some peptides showed little decrease in intensity with medium amounts of enzyme  and no more than 70% decrease in intensity upon incubation with high amounts of enzyme; these are weak substrates of CPA6 ( Fig. 3C and Table 3). Sometimes peak intensities increased with increasing amounts of CPA6 enzyme. These peptides are products formed upon cleavage of a substrate by CPA6 ( Fig. 3D and Table 4). Analysis of both the C-terminal and penultimate residues of CPA6 substrate and non-substrate peptides indicated preferences at both positions, with preferred substrates having large hydrophobic C termini and large hydrophobic or basic penultimate residues. Specifically, good substrates of CPA6 contained C-terminal Tyr, Leu, Met, and Phe (Fig. 4A) and penultimate amino acids in these substrates were, with one exception, large hydrophobic or basic residues (Table 2 and Fig.  4B). Analysis of the downstream residues of the products indicated that formation of these peptides required cleavage of C-terminal Leu and Met (Table 4). Weak substrates of CPA6 contained C-terminal Val, Ala, Gln, and Leu (Fig. 4A). In the two cases in which a C-terminal Leu was a poor substrate, the penultimate residue was Ala (Table 3). No peptides with C-terminal basic or acidic residues were identified as substrates. However, many such peptides were detected in the study, and all were found to be non-substrates. These non-substrates included peptides with C-terminal Arg, Glu, Asp, Asn, Ser, and Pro (supplemental Table S1), consistent with predictions that CPA6 would not cleave charged or polar residues. However, some of the non-substrates contained C-terminal amino acids that had been found to be substrates in other peptides or chromogenic substrates; these residues included Leu, Ile, Val, Ala, Phe, and Gln (Fig. 4A). In cases in which the C-terminal residue was Leu or Phe, a residue known to be efficiently cleaved by CPA6, the penultimate residue was either Asp or Pro (supplemental Table S1). In cases in which the C-terminal residue was not a large hydrophobic residue, but yet known to be cleaved (Val, Gln, Ile, and Ala), the penultimate residue appeared to determine cleavage, with Gln, Thr, His, Gly, Ala, as well as Asp . Peptidomics analysis of CPA6 preferences at substrate P1 and P1 positions. A, analysis of the P1Ј residue. All good substrates have C-terminal Leu, Met, Phe, or Tyr, whereas weak substrates have C-terminal Leu, Val, Ala, or Gln. Many different C-terminal amino acids can be found in the nonsubstrate category. B, analysis of the P1 residue of peptides with permissive P1Ј residues. Altogether, 28 peptides were detected with C-terminal hydrophobic residues (Leu, Ile, Val, Met, Ala, Phe, and Tyr) or Gln; all of these residues were found to be cleaved either for some peptides in the peptidomics analysis, or with small synthetic substrates. Of these 28, only 7 were good substrates, 10 were weak substrates, and 11 were not cleaved. Analysis of the sequences of these peptides indicates that most good substrates contain hydrophobic or basic amino acids in the P1 position, whereas the majority of non-substrates contain Asp or Gln in this position.

CPA6 Enzymatic Characterization
and Pro being unfavorable for cleavage (Fig. 4B). On occasion non-substrate peptides were found with P1 and P1Ј residues that were conductive to cleavage in other substrate peptides. These may have amino acids in P2 and P3 positions that impart specificity.
With this data on the specificity of CPA6, and recently published data regarding the specificity of CPA1, -2, and -4 obtained from studies performed using the same batch of synthetic substrates (23), we now have comparable data on the enzymatic activity and specificity of four of the six mammalian CPA enzymes (supplemental Table S2). Based on this comparison, it appears that the activity of CPA6 toward its better substrates is about 10 -100-fold lower than the activities reported for CPA1, -2, and -4. Because a metal affinity resin was used in the purification of CPA6, the possibility remained that a portion of CPA6 was properly folded but enzymatically inactive due to chelation of the critical active site zinc. This possibility was investigated by incubating purified CPA6 with nanomolar concentrations of zinc, which would be expected to restore activity if zinc chelation was a problem. No large changes in CPA6 activity were detected with nanomolar levels of zinc (supplemental Fig. S3). However, as for many CPs (37), moderate concentrations of zinc (10 M to 1 mM) resulted in inhibition of CPA6 activity (supplemental Fig. S3). Another possible explanation for the weak CPA6 activity was that CPA6 might require interaction with components of the ECM for optimal activity. Because ECM-bound CPA6 was previously solubilized by heparin (27), likely by competing with heparin sulfate proteoglycans for interaction with CPA6, heparin was added to enzymatic reactions of CPA6 and the FA-Phe-Phe substrate. No change in activity was detected (results not shown). Finally, a direct comparison of enzymatic activity was made between equivalent molar amounts of purified CPA6 and CPA6 found within its native environment of the ECM following secretion from HEK293T cells. Dilutions of purified HA-tagged CPA6 were analyzed by Western blot alongside ECM extracts from cells transfected with the same HA-tagged CPA6 to determine the approximate amount of CPA6 present within the ECM. Both ECM-bound and purified CPA6 were then incubated with 0.4 mM FA-Phe-Phe substrate and activities were compared (supplemental Fig.  S4). These results suggested that CPA6 exhibited comparable enzymatic activity under both conditions. Thus, components of the ECM did not appear to have a large effect on CPA6 enzyme activity. Furthermore, this result indicated that the large difference in activity between purified CPA6 and other CPA enzymes was not due to inactivation of the CPA6 during purification.
When the absolute reaction rate was disregarded and the reaction rates of each enzyme were compared across the panel of substrates relative to FA-Phe-Phe, these four peptidases could be arranged in a hierarchy of substrate cleavage, CPA2 Ͻ CPA6 Ͻ CPA1 Ͻ CPA4, from very narrow to quite broad substrate specificity (corresponding, to a large extent, to K m differences for each substrate). Although CPA2 had very restricted specificity for aromatic amino acids such as Phe and Trp, CPA6 was able to cleave aromatic amino acids as well as Leu and Met, with negligible activity toward other substrates. CPA1 appeared to cleave practically all amino acids tested, albeit with reduced activity toward Trp, Ala, and Met. Finally, CPA4 had specificity similar in nature to CPA1, but with the additional ability to cleave Met.
An explanation for differences in substrate specificity exhibited by CPA1, -2, -4, and -6 was likely to be found in the binding pockets of these enzymes. X-ray crystal structures available for CPA1, -2, and -4, as well as for CPB1 and -2, were analyzed and used to model the active sites of CPA3, -5, and -6 ( Fig. 5 and Table S3). The strong affinity of CPA2 for large aromatic amino acids could be seen in a much larger active site pocket when compared with other CPs, given more space by Ala 268 , rather than Thr 268 as seen in CPA1 and CPA4 (Fig. 5). The exclusion of smaller branched amino acids such as Leu and Ile by CPA2 might be mediated by the presence of the longer Met 203 at the neck of the pocket, rather than Leu 203 of CPA1 (Fig. 5). This Met 203 was also present in CPA4 and CPA5 (Fig. 5 and Table  S3) and might account for the reduced affinity of CPA4 for Val and Ile when compared with CPA1. Although CPA1 and CPA2 had a Gly at the bottom of the active site pocket in position 253 (not shown), several enzymes had either a Ser (CPA3, CPA4, and CPA6) or an Ile (CPA5; Fig. 5), either of which should interfere with the binding of very large amino acids such as Trp. Finally, it might be noted that the bottom of the active site pocket in both CPA4 and CPA6 may exhibit significant electronegativity due to the presence of a number of polar residues: Thr 243 , Thr 268 , and Ser 253 of CPA4, and Ser 207 and Ser 253 of CPA6. In fact, Ser 207 was found in both CPB1 and CPB2 and appears to contribute to the electronegative pocket suitable for FIGURE 5. Comparative modeling of CPA6 and other CPA/B enzymes suggests residues critical for substrate binding. CPA/B residues involved in forming the specificity pockets (having side chains within 6 Å of bound substrate in 3CPA structure) of all members of the subfamily are shown. As is the convention in the field, all residues are numbered according to the corresponding active site residues in active bovine CPA1 (following propeptide cleavage). The GY dipeptide ligand (yellow) from the 3CPA crystal structure is overlaid on each structure for comparison, partially hiding residue 268. The CPA4 2PCU and CPB2 3D67 structures are shown with co-crystallized ligands aspartate and GEMSA, respectively, shown in cyan. GEMSA is also modeled into the CPB1 active site for comparison sake. DECEMBER 3, 2010 • VOLUME 285 • NUMBER 49 binding basic residues in these enzymes. This suggests a possible reason for the cleavage of polar amino acids such as Gln by CPA4 and -6 and for the relatively high affinity of CPA6 for FA-Phe-Tyr when compared with other substrates. The significance of a unique Met at position 255 at the base of the specificity pocket in CPA6 is unknown at this time. Modeling does not suggest any function, as it appears that the amino acid at position 253 may play a more important role in the nature of the specificity pocket in A-like enzymes. In addition, this residue in CPA6 orthologs in a number of fishes (fugu, stickleback, and medaka) is replaced by Ile, as found in CPA1, -2, and -4.

DISCUSSION
The metallocarboxypeptidase family is a large family of proteases, many with overlapping substrate specificity profiles. For example, six different CPs (CPD, CPE, CPZ, CPN, CPB1, and CPB2) are all able to cleave C-terminal basic residues Arg and Lys (38). Their unique functions are thought to depend largely on unique spatiotemporal and subcellular distributions. In a similar manner, mammalian genomes contain six CPA genes all producing enzymes able to cleave C-terminal hydrophobic amino acids (9). Many of these genes have unique expression profiles and all of their products are regulated through propeptide cleavage.
Although the CPAs are often thought to have similar substrate specificity toward hydrophobic amino acids, it has been known for some time that CPA2 is unique in that it has a strong preference for large amino acids such as tryptophan and phenylalanine (39). Recently, the substrate specificity of CPA1, CPA2, and CPA4 toward a large number of hydrophobic amino acids was compared, indicating a number of differences that may be critical in understanding the natural substrates of these enzymes (23). This study confirmed the preference of CPA2 for large hydrophobic amino acids, and showed a rather broad cleavage spectrum for CPA4. CPA1 also exhibited broad substrate specificity, but when compared with CPA4 showed significantly lower activity toward Met and Ile when compared with its activity toward Phe.
In the present study we have purified CPA6 and determined its substrate specificity. We show that CPA6, rather than cleaving any hydrophobic amino acid, exhibits substrate cleavage preferences for large hydrophobic amino acids. Reaction rates (k cat /K m ) for the cleavage of nine different C-terminal hydrophobic amino acids by CPA6 spanned 3 orders of magnitude, with C-terminal Ile, Val, Ala, and His exhibiting reaction rates ϳ100-fold lower than the bulkier amino acids Phe, Leu, Tyr, Met, and Trp. Our results also suggest that CPA6 does not cleave substrates with Gly, Pro, Asp, or Glu in the penultimate position. This is similar to many other CPs which, regardless of C-terminal specificity, have been shown not to cleave substrates containing a penultimate Pro (CPM (40), thrombin-activatable fibrinolysis inhibitor (41), CPZ (42), and CPA4 (23)). CPA4 has also been been shown not to cleave substrates with acidic residues in the penultimate P1 position (23).
It was surprising that C-terminal histidine was such a poor substrate for CPA6 in the present study. Previously we had found that CPA6 bound to the ECM cleaved histidine as part of a C-terminal His 6 tag (27). Our current results suggest two pos-sibilities: a His 6 tag as a part of a larger protein is a better substrate for CPA6 than small chromogenic substrates, and/or the localization of a substrate to the ECM, possibly at locally high concentrations, is a stronger determinant for cleavage by CPA6 than simply C-terminal amino acid affinity. This second possibility is supported by another observation; although a CP inhibitor was typically included in the culture medium of the CPA6expressing HEK293 cells to prevent His 6 tag removal, the absence of this inhibitor in suspension culture did not result in large-scale removal of His 6 tags. 3 This was in contrast to results obtained from CPA6 bound to ECM, in which most histidines were cleaved in the absence of the CP inhibitor (27).
Because CPA6 is present in the ECM and expressed during development, it is possible that it processes proteins or peptides involved in morphogenesis. Several extracellular morphogens have conserved hydrophobic C-terminal amino acids that might be cleaved by an enzyme such as CPA6; examples include Wnt1, Wnt6, several bone morphogenetic proteins, semaphorin 3a, and fibroblast growth factor-4 and -6. A few of these or their relatives have been shown to be functionally modified through C-terminal proteolytic processing. For example, the C-terminal Arg residue of Wnt4 is cleaved by CPZ (43), an ECM metallocarboxypeptidase with specificity toward basic amino acids (42,44,45). This processing step enhances the activity of Wnt4 and its effect on growth plate chondrocyte terminal differentiation (43). Wnt4 is a member of the large highly conserved family of ECM-bound Wnt ligands. Two members of this family, Wnt1 and Wnt6, have C-terminal sequences similar to Wnt4, but with a hydrophobic Leu at their C termini rather than the Arg found in Wnt4 and many other members. Removal of this C-terminal Leu by an extracellular enzyme such as CPA6 might activate Wnt1 and Wnt6 in a manner similar to Wnt4. Wnt6 in particular has an expression pattern in the developing limbs and somites consistent with that observed for CPA6 (46 -48).
A number of potential neuropeptide substrates were identified in the present study. These included peptides derived from proenkephalin (Leu-enkephalin, Met-enkephalin, heptapeptide, and octapeptide), pro-SAAS (Little SAAS and PEN), and chromogranin B (600 -613, 64 -86, 438 -446, and 438 -454). The mRNAs encoding all of these proteins are expressed in the mitral and granular cell layers of the mouse olfactory bulb (Allen brain atlas), which is the major location of CPA6 mRNA in adult mouse (26). Thus, these identified peptide substrates of CPA6 are possible in vivo targets of CPA6. However, little is known about the biological functions of many of these peptides. Removal of C-terminal Leu and Met from enkephalin virtually eliminates activity (49); however, a role for enkephalins in the olfactory bulb is unknown at this time. Pro-SAAS-derived peptides are not yet well characterized, although they may be involved in body weight regulation (50). The physiological effect of C-terminal Leu removal from Little SAAS is not known. A number of peptides have been identified from chromogranin B, some proposed to have antimicrobial functions, and others of unknown function (51). Chromogranin B 600 -613, shown in this study to be cleaved by CPA6, has been previously found to undergo carboxypeptidase-like cleavages resulting in C-terminal-truncated forms of this peptide (52).
However, to our knowledge the functions of these peptides remain unknown.
Ultimately it is the determination of the unique substrate specificities of each CP enzyme and the identification of substrates that will allow us to understand the functions of these enzymes. One step toward this understanding has been the clarification of the structural basis for the activity and specificity of CPs through x-ray crystallography. Although many members of the CPA/B subfamily of CPs have had their structures solved, CPA3, CPA5, and CPA6 have not. Previous studies have reported structural models for these enzymes (9,53), but focused largely on overall tertiary structure rather than analysis of the enzyme active sites. Here we have described a detailed analysis of the modeled substrate binding pockets of CPA3, CPA5, and CPA6 alongside crystallized members of this subfamily. Although we have kinetic data for CPA6 to support the modeling, no such data yet exists for CPA3 and CPA5. However, these structural models might be used to predict the substrate specificity of CPA3 and CPA5. The specificity pocket of CPA3 (Fig. 5) suggests similarity to the substrate binding profile of CPA1 and CPA6, with Ala 268 making the pocket deeper, yet Ser 253 making it shorter, resulting in exclusion of amino acids at the extremes in size. Experiments have shown CPA3 to be able to cleave the C-terminal residues of neurotensin (Leu), endothelin-1 (Trp), sarafotoxin 6b (Ile-Trp), kinetensin (Leu), Leuenkephalin (Leu), and xenopsin (Leu) (54). However, another study indicated that purified human CPA3 displayed no activity toward carboxyl-terminal Trp or Ala, but more activity than bovine CPA1 against carboxyl-terminal Leu residues and about equal activity toward Phe and Tyr residues (55), suggesting that, indeed, intermediate sized residues may be optimal. CPA5, in contrast to CPA3, must certainly prefer smaller substrates, as it contains a Ser 268 restricting depth, Ile 253 restricting length, and Met 203 restricting width (Fig. 5). At this point there is little data regarding the specificity of CPA5 although it has been shown to cleave FA-Arg-Leu (9).
In conclusion, the data presented confirm a role for CPA6 in the cleavage of large hydrophobic amino acids. These data also suggest that less optimal substrates, including His and small hydrophobic amino acids, are also cleaved by CPA6, possibly more so if substrates containing these C-terminal amino acids are proteins bound to the ECM along with CPA6. We show that the substrate P1 residue has a large effect on cleavage, with small and acidic residues significantly inhibiting this process. Finally, modeling illustrates the composition of the specificity pocket of CPA6, as well as CPA3 and CPA5, and has enabled some predictions to be made in regard to optimal substrates for these enzymes.