Solution Structure of the Squash Aspartic Acid Proteinase Inhibitor (SQAPI) and Mutational Analysis of Pepsin Inhibition

The squash aspartic acid proteinase inhibitor (SQAPI), a proteinaceous proteinase inhibitor from squash, is an effective inhibitor of a range of aspartic proteinases. Proteinaceous aspartic proteinase inhibitors are rare in nature. The only other example in plants probably evolved from a precursor serine proteinase inhibitor. Earlier work based on sequence homology modeling suggested SQAPI evolved from an ancestral cystatin. In this work, we determined the solution structure of SQAPI using NMR and show that SQAPI shares the same fold as a plant cystatin. The structure is characterized by a four-strand anti-parallel β-sheet gripping an α-helix in an analogous manner to fingers of a hand gripping a tennis racquet. Truncation and site-specific mutagenesis revealed that the unstructured N terminus and the loop connecting β-strands 1 and 2 are important for pepsin inhibition, but the loop connecting strands 3 and 4 is not. Using ambiguous restraints based on the mutagenesis results, SQAPI was then docked computationally to pepsin. The resulting model places the N-terminal strand of SQAPI in the S′ side of the substrate binding cleft, whereas the first SQAPI loop binds on the S side of the cleft. The backbone of SQAPI does not interact with the pepsin catalytic Asp32–Asp215 diad, thus avoiding cleavage. The data show that SQAPI does share homologous structural elements with cystatin and appears to retain a similar protease inhibitory mechanism despite its different target. This strongly supports our hypothesis that SQAPI evolved from an ancestral cystatin.

Known proteinaceous aspartic acid proteinase inhibitors (PIs) 6 are rare and unevenly distributed among classes of organisms in contrast to proteinaceous inhibitors of serine and cys-teine proteinases (1). A novel proteinaceous aspartic acid proteinase inhibitor (SQAPI) has been characterized from squash phloem exudate (Cucurbita maxima Duchesne) (2,3). The protein is distinct from the other six families of proteinaceous aspartic PIs that have been identified and had their genes cloned: the potato plant Kunitz inhibitors (4), the Ascaris inhibitors (5), the yeast proteinase A inhibitor-3 (6), a domain of the sea anemone thyroglobulin type-1 inhibitor (7,8), the pig serpin inhibitor (9), and the Bacillus sp. peptide, ATBI (10). The structures for some of these proteins have been solved (8,(11)(12)(13) and are, to date, very different from each other and exhibit distinct, and in some cases, novel inhibitory mechanisms. However, no structural information on SQAPI has been published to date, except that based on modeling (14).
It is likely that plant PIs represent examples of recent evolutionary change. Several exclusively plant PIs are restricted highly in their distribution, often to a single family (1,15). The squash serine PI family appears limited to the Cucurbitaceae, the Ryan family (proteinase inhibitor II family) to the Solanaceae, the trypsin/␣-amylase inhibitor family to the Graminaceae, the mustard seed inhibitor family to the Brassicaceae. Most recently, it has been proposed that the SQAPI family also is restricted highly in distribution to Cucurbitales (14). Such limited distribution among plant families may indicate a recent evolutionary history. However, the ancestral noninhibitor proteins from which neofunctionalization has occurred are unknown. Two inhibitor families unique to plants have a wider distribution (15), with the plant Kunitz and the Bowman-Birk inhibitor families being found in legumes and cereals and possibly other species. Only the Bud family (proteinase inhibitor I family) serpins and cystatin inhibitors have homologs outside of the plant kingdom and are distributed widely in plants (15).
An additional feature of proteinase inhibitors is the extent of subfunctionalization (i.e. evolution from a different class of PI) that has occurred (15). This is true particularly of the aspartic PIs, where four of the six known families appear to have evolved from PIs of other classes: Kunitz and serpin aspartic proteinase inhibitors belong to already established families of serine proteinase inhibitors (16); thyroglobulin type-1 inhibitors appear to be more common as a cysteine inhibitor family (MEROPS, the peptidase database), and we recently proposed that the SQAPI family has evolved from the phytocystatin inhibitor family (14).
The evidence for the hypothesis that SQAPI evolved from an ancestral cystatin was based on amino acid sequence homology, phylogenetic analysis, and three-dimensional modeling (14). Moreover, evidence from hypervariability in the putative contact regions and retention of predicted secondary structural elements suggested that the cysteine proteinase inhibitor mechanism also was recruited. Nevertheless, final acceptance of such a hypothesis must rest on direct structural determination, for understanding both the evolutionary history and the inhibitory mechanism of SQAPI. Here, we report the NMR solution structure of recombinant SQAPI and show that it does indeed have a close similarity to the structure of phytocystatin. Despite their different targets, our mutational data support a similar inhibitory mechanism. SQAPI, therefore, represents a model system that may provide an understanding of the evolutionary process of subfunctionalization.

EXPERIMENTAL PROCEDURES
Expression and Purification of rSQAPI for NMR Studies-Escherichia coli BL21 (DE23) pLysS cells (17) containing the previously described His-tagged rSQAPI (HDVA isoform GenBank TM AAT67162) expression vector (3) were grown at 37°C in LB medium containing ampicillin (100 g/ml) and chloramphenicol (35 g/ml). When the culture reached an A 600 of 0.5, solid glucose was added (5 g/liter), and the culture was incubated for a further hour. Cells were harvested, transferred to a modified M9 minimal medium (18) and incubated for 30 min to deplete nutrients carried over from the rich medium. The cells were transferred to fresh minimal medium (300 ml) containing glucose (8 g/liter) and NH 4 Cl (1.67 g/liter) in a baffled flask to ensure good aeration. For production of 15 N-labeled protein, 15 N-enriched NH 4 Cl (99%, product code: NLM-467-1, Spectra Stable Isotopes) was used. For production of 15 N 13 C-labeled protein, 15 N-enriched NH 4 Cl, and 13 C-labeled D-glucose (100% [U-13 C]C-6, 99% product code: CLM-1396-1 Cambridge Isotope Laboratories) were used, and the glucose concentration was 4 g/liter. Once the A 600 began to increase, protein expression was induced with isopropyl 1-thio-␤-D-galactopyranoside (0.32 mM), and when the A 600 reached a plateau, the cells were harvested (typically ϳ2 h after induction).
Cells were disrupted using a French press in 20 mM sodium phosphate buffer, pH 7.8, containing 500 mM NaCl. The cellfree supernatant was applied to a nickel affinity column (Pharmacia Biotech; HiTrap chelating), and the column was washed successively with 10 bed volumes of loading buffer, 20 bed volumes of 20 mM sodium phosphate buffer, pH 6.0, containing 500 mM NaCl, and 20 bed volumes of 20 mM sodium phosphate buffer, pH 6.0, containing 500 mM NaCl and 300 mM imidazole. His-tagged rSQAPI was eluted from the column with 20 mM sodium phosphate buffer, pH 6.0, containing 500 mM NaCl and 50 mM EDTA. Fractions containing His-tagged rSQAPI were pooled and concentrated, and the buffer was exchanged into 10 mM KH 2 PO 4 , pH 3.0, containing 0.2% NaN 3 .
For removal of the N-terminal fusion peptide, His-tagged rSQAPI (Ͻ3 mg/ml) in 50 mM Tris-HCl buffer, pH 8.0, containing 1 mM CaCl 2 and 0.1% Tween was incubated for 18 h at 37°C with 1 unit of enterokinase (Ekmax; Invitrogen) per 20 g of fusion protein. Purification of rSQAPI from the free His tag, the enterokinase, and any unprocessed His-tagged rSQAPI exploited the intrinsic affinity of rSQAPI for the nickel column. The reaction mixture was applied to the nickel affinity column (1 ml bed volume) pre-equilibrated with 20 mM sodium phosphate buffer, pH 7.8, containing 500 mM NaCl. The column was washed with 10 bed volumes of the equilibration buffer to remove the enterokinase, 10 bed volumes of 20 mM sodium phosphate buffer, pH 6.0, containing 500 mM NaCl, 10 bed volumes of 20 mM sodium phosphate buffer, pH 6.0, containing 30 mM imidazole and 500 mM NaCl, and then rSQAPI was eluted with 20 mM sodium phosphate buffer, pH 6.0, containing 500 mM NaCl and 300 mM imidazole. Fractions containing rSQAPI were identified by SDS-PAGE and pooled, and the buffer was exchanged into 10 mM KH 2 PO 4 , pH 3.0, containing 0.2% NaN 3 , and concentrated to between 0.06 and 0.2 mM rSQAPI. The concentration of rSQAPI was determined using the calculated extinction coefficient (3).
Preparation and Analysis of SQAPI Mutants-The gene for the HDVA variant of SQAPI was recloned into pET30 (19) and expressed with a His tag to aid purification (14). SQAPI was assayed as described previously (2,3). Equilibrium inhibition constants were measured by titration of the inhibitor against pepsin in 100 l 0.75% lactic acid-HCl, pH 2.0, using 500 ng of fluorocasein (Enzchek, Molecular Probes) at 480 nm excitation and 520 nm emission at room temperature in a fluorimetric microplate reader (Wallac Victor 1420 multichannel counter). Alternatively, the synthetic peptide 4-methylcoumarin-Arg-Lys-Pro-Ile-Glu-Phe-Phe-Ile-Leu-Lys(dinitrophenol)-Arg-OH (custom synthesized by Auspep, Parkville, Victoria, Australia) was used to assay pepsin at 0.2 mM. Substrate cleavage C-terminal of the second lysine in this synthetic substrate was followed by measuring the increase in fluorescence at 340 nm excitation and 430 nm emission. Pepsin and SQAPI were preincubated at 20°C for 15 min before pepsin assay. Each mutant and the wild-type SQAPI was quantified by electrophoresis using an Experion (Bio-Rad) electrophoresis unit. Pepsin was calibrated by titration against pepstatin (Sigma) at high pepsin concentrations. The pepsin concentration used was constant within a titration but varied from 1 to 7 nM depending on the replicate experiment. Inhibition constants were not corrected for the presence of substrate, as initial rates were measured, and previous assays had shown that the dissociation rate constant for SQAPI with pepsin is extremely slow (3). Data were fitted to equation (2), 2 Ϫ 4 E T I T ))/(2 E T ) using nonlinear regression in the Origin (Northampton, MA) graphics package. E T and I T represent the total concentration of proteinase and inhibitor in the assay, whereas v i and v o are the rates of proteinase activity in the presence and absence of inhibitor, respectively. K I is the apparent inhibition constant.
NMR-NMR experiments were performed in Shigemi tubes containing 0.6 mM SQAPI in 95% H 2 O, 5% 2 H 2 O with 10 mM K 2 HPO 4 , pH 3, 0.2% NaN 3 , and 1 mM EDTA. Spectra were recorded on a 700-MHz Bruker Avance spectrometer using a triple resonance Bruker CryoProbe TM . A series of 15 N-HSQC spectra were recorded at 10°C, 25°C, 40°C, and 50°C. Subse-quent spectra for structure determination were recorded at 50°C. The following spectra were acquired for assignment and structure determination purposes: 2D 1 H, 1 H-NOESY (mixing time 120 ms), 15 13 C-NOESY-HSQC (mixing time 120 ms) and a 15 N-NOESY-HSQC (mixing time 120 ms). Standard pulse sequences were used for data acquisition, and water suppression was achieved using either WATERGATE (20) or field gradient coherence selection schemes (21). The chemical shifts were referenced according to published methods (22), where the 1 H chemical shift is referenced to the water peak, whereas the 13 C and 15 N chemical shifts were referenced by the 13 C/ 1 H and 15 N/ 1 H gyromagnetic ratios. NMR data were processed in TOPSPIN (version 2.0; Bruker Biospin TM ). The processed spectra were analyzed using XEASY (version 1.4) software (23). 1 H, 13 C, and 15 N chemical shifts assignments have been deposited in the BioMagResBank Database (24) with accession no. 16913.
Hydrogen bond restraints were assigned based on exchange rates of 15 N-bound protons measured by running a series of two-dimensional 15 N-HSQC spectra at 8 min, 20 min, 1 h, and 2 h after dissolving the protein in 2 H 2 O at 10°C. Hydrogen bonds only were included in later rounds of structure calculations in regions of a well defined structure that initially converged on the basis of NOE restraints alone.
Structure Determination-Automated NOE cross-peak picking and structure determination were performed with the Atnos/Candid program (25,26) using the CNS (27) simulated annealing algorithm. Initial structures were generated from an extended strand conformation using simulated annealing with torsion angle dynamics for the high temperature and fast cooling stages, followed by Cartesian dynamics for a second slow cooling stage. The Atnos/Candid program incorporated 64 dihedral restraints derived from the C ␣ and C ␤ chemical shifts. These initial structures were then used as starting structures for a second round of structure generation in Atnos/Candid using Cartesian dynamics for both the high temperature and cooling phases. During this phase 137 backbone dihedral restraints generated from TALOS (28) were included. The structures were then refined in CNS with hydrogen bond restraints added for those residues with slow exchanging amides that had a unique hydrogen bond acceptor as determined by structural convergence. The final structures were refined in a layer of TIP3 water molecules in CNS. The 10 lowest energy structures with no NOE violations Ͼ0.25 Å, bond violations Ͼ0.05 Å, and no angle, improper or dihedral violations Ͼ5°were selected to represent the solution structure of SQAPI.
Docking Procedures-SQAPI was docked with porcine pepsin (Protein Data Bank code 5PEP (29)) using the program HADDOCK (30). This program allows the specification of intermolecular ambiguous distance restraints between residues  AUGUST 27, 2010 • VOLUME 285 • NUMBER 35

JOURNAL OF BIOLOGICAL CHEMISTRY 27021
on each docking partner and then uses randomization and energy minimization in CNS. Residues for which ambiguous restraints are generated are divided into two classes, termed active and passive. Active residues are usually those directly implicated in binding, whereas passive residues are near neighbors to active residues. Intermolecular ambiguous distance restraints are generated for each active residue such that the restraint is satisfied if the active residue on one binding partner is close to either an active or passive residue on the other. For SQAPI, the active residues were selected based on the mutagenesis data and comprised Ala 5 , Ile 6 , and Gly 7 of the N-terminal region and Ile 49 , Pro 50 , Trp 52 , and Asp 53 of the first loop. Residues Gly 3 , Pro 4 , Glu 8 , Val 9 , Ile 46 , Lys 47 , Gly 48 , His 51 , and Tyr 55 were designated as passive. For porcine pepsin, the designated active residues were the surface-exposed active site cleft residues within 10 Å of Asp 215 of the catalytic diad, namely Asp 32 , Gly 34 , Tyr 189 , Ile 213 , Asp 215 , Gly 217 , Thr 218 , Ser 219 , and Ile 301 . The following pepsin residues were classified as passive because of their close proximity to the active residues: Thr 12 , Glu 13 , Ser 35 , Thr 74 , Gly 76 , Thr 77 , Phe 111 , Ile 128 , Glu 288 , Met 290 , and Val 292 . Initially, 1000 structures were generated by randomization, followed by rigid body energy minimization. From these 1000 structures, the 100 with the lowest energy were refined with simulated annealing, with some flexibility allowed for the interface residues. Final minimization was by semiflexible simulated annealing in a layer of TIP3 water molecules. SQAPI N-terminal residues 1-10, which are disordered in the solution structure, were allowed to remain flexible throughout docking.

RESULTS
Protein Expression and Purification-Single-and double-labeled SQAPI was expressed and purified with a final yield of ϳ30 mg/liter of minimal media. Purity was assessed as Ͼ95% by SDS-PAGE. Enterokinase was used to cleave the His 6 tag containing leader sequence from SQAPI and was expected to cleave the fusion protein 10 residues upstream from the beginning of the native SQAPI sequence.
Resonance Assignment-A preliminary peak count in the 15 N-HSQC spectra recorded at 37°C found 99 cross-peaks. This tally was sufficiently close to the expected 101 peaks to commence acquisition of three-dimensional spectra for assignments. However, during the backbone assignments, it became apparent that the peaks corresponding to the region His 51 -Ser 61 were not visible at 37°C. Moreover, no peaks were present that could be assigned to the 10 residues of the vector encoded leader sequence. In addition, multiple resonances were assigned for the first five residues of SQAPI. To clarify the cause of this N-terminal peak doubling, an aliquot of the 15 N rSQAPI sample was subjected to N-terminal sequencing. This revealed that 74% of the rSQAPI had been cleaved, by enterokinase, at the arginine immediately preceding the native SQAPI sequence, and 26% had been cleaved at Met 1 , the two forms therefore differing by one amino acid. This sample heterogeneity accounts for both the duplication of resonances observed for the first five residues of SQAPI in the NMR spectra and the misleading 15 N-HSQC peak tally. This minor sample heterogeneity had no significant effect on structure determination, as NOEs from both N-terminal variants were indicative of an unstructured N-terminal region with only intra or sequential NOEs present. HSQC spectra were then recorded at 10, 25, 37, and 50°C. At 50°C, peaks that were ultimately assigned to residues His 51 -Ser 61 became visible in the spectrum. Spectra for assignments and structure determination were then acquired at this new temperature. An assay at 50°C showed that SQAPI retains its ability to inhibit pepsin at this temperature (data not shown). Subsequently near complete (Ͼ99%) backbone and side chain assignments were achieved for nonlabile resonances of SQAPI at 50°C.
Structure-Stereoviews of the 10 lowest energy SQAPI structures refined in water and a schematic view of the closest to average structure are shown in Fig. 1. Structural statistics for the ensemble are given in Table 1. SQAPI assumes a cystatin-like fold consisting of a twisted four-strand antiparallel ␤-sheet gripping an ␣-helix like the fingers of a hand gripping a tennis racquet. With the exception of the first 10 residues, which are relatively disordered, the solution structure is well defined with a backbone heavy atom root mean square deviation of 0.44 Å for residues 11-95. There is a well defined turn from Gly 11 through to the ␣-helix that extends from Pro 17 -Gln 29   Pepsin Inhibition Assays-To obtain further information on the interactions of SQAPI with pepsin, a series of SQAPI point mutants with stepwise glycine substitution along the Lys 47 -Asn 54 and Ala 81 -Asn 84 loops were prepared, and the inhibition constants were determined by titration of SQAPI against a constant concentration of pepsin. A typical assay is shown in Fig.  2A for wild-type SQAPI and one mutation and the effect of each residue change on the K i values is shown in Fig. 2B. The K i values were determined using two different substrates that had little effect on the K i values obtained, with those with the peptide substrate (listed first) generally being marginally lower: for Two other mutants were made by truncation of the N-terminal residues up to and including Ile 6 or Gly 7 (Fig. 2C). For the ⌬-Ile 6 construct, partial inhibition of pepsin was only evident at higher SQAPI concentrations. The ⌬-Gly 7 construct was almost completely inactive.
Docking-Earlier studies (2, 3) showed that SQAPI was a strong inhibitor of pepsin. Therefore, a simulated interaction was performed as described under "Experimental Procedures." The 10 lowest energy solutions depicted SQAPI bound to pepsin in a similar orientation. The lowest energy conformation was selected to best represent the structure of the pepsin-SQAPI complex (Fig. 3, A and  B). The structure shows the N-terminal region of SQAPI binding the SЈ side of the substrate binding cleft, whereas the Lys 47 -Asn 54 loop binds on the S side of the cleft. Whereas the N terminus of isolated SQAPI is disordered in solution, in the SQAPI-pepsin model, the N terminus forms a ␤-strand. In the model, the pepsin catalytic site Asp 32 -Asp 215 diad does not contact the peptide backbone of SQAPI, a point that will be further discussed below.

DISCUSSION
The solution structure of aspartic proteinase inhibitor SQAPI (Fig. 1) that we determined in the present study shares its fold with cystatin (31), suggesting the PI is derived from the ancestral cystatin protein, a family of cysteine proteinase inhibitors that is widely distributed through plants and animals (32). SQAPI, on the other hand, has been found only within the Cucurbitales (14), indicating a recent origin. This is the first example of recruitment of the cystatin structure to an inhibitor of a different proteinase class. Six families of proteinaceous aspartic PIs have been identified and all are distinct from SQAPI and from one another, both in structure and in mechanism. SQAPI appears to have retained a similar inhibitory mechanism to its FIGURE 2. Inhibition of pepsin activity by wild-type and mutant SQAPIs. A, inhibition titration curve for wild-type SQAPI (squares) and the W52G mutant (triangles). The lines shown are the fitted curves. The pepsin concentration was 1.1 nM, and the assay used the synthetic pepsin peptide substrate. B, inhibition constants (K i ) versus amino acid change using the pepsin synthetic peptide assay. Pepsin concentration was 1.1 nM. The bars represent the S.E. of the K i as estimated by nonlinear regression. C, inhibition by wild-type SQAPI (squares) and by the two truncation mutants (circles and triangles for ⌬1-6 SQAPI and ⌬1-7 SQAPI, respectively). The lines shown are the fitted curves. Pepsin concentration was 3.5 nM, and the assay used the synthetic pepsin peptide substrate. Calculated K i values are 1.0 Ϯ 0.1 for wild-type SQAPI, 51 Ϯ 4 for ⌬1-6 SQAPI, and 840 Ϯ 130 for ⌬1-7 SQAPI.
cystatin counterparts. Our mutagenesis and docking results implicate the N-terminal strand and the first loop of SQAPI in aspartic protease inhibition. N-terminal deletion up to and including Ile 6 resulted in a ⌬1-6SQAPI variant that was only a weak inhibitor. Deleting one residue further to produce the ⌬1-7SQAPI variant almost completely abolished activity, indicating a key binding determinant in this region. Of the five residues examined using mutagenesis in loop 1 ( 49 IPHWD 53 ), four are invariant across 25 SQAPI variants within the Cucurbitaceae (14). These correspond to the four residues for which the Gly substitution mutants showed a significantly increased K i of between 10-and 20-fold. The single substitution that showed only a very small 1.3-1.8fold change in K i was His 51 . This residue is highly variable (11 with Asp, eight with His, five with Lys, and one with Asn (14)). The relatively small effect on pepsin inhibition by the H51G mutant is consistent with its established variability. The importance of the tryptophan residue confirms earlier observations where tryptophan fluorescence was completely quenched upon pepsin binding (14).
We have generated a model of the inhibition of pepsin by SQAPI using the program HADDOCK, which utilizes ambiguous interaction restraints (Fig. 3, A and B). The docking was restrained based on our mutagenesis results for SQAPI, whereas for pepsin, in the absence of mutagenesis data, the residues within 10 Å of the active site were used to generate restraints. This approach assumes that SQAPI is an active site inhibitor of pepsin as opposed to an allosteric inhibitor. The resultant model supports the proposal that the mechanism for SQAPI binding is similar to that of cystatins, where the N-terminal strand binds on the SЈ side of the active site cleft and loop 1 to the S side (Fig. 3, A and B). The backbone of SQAPI does not interact with the catalytic diad, thus avoiding SQAPI being inactivated by catalytic cleavage. The majority of the intermolecular contacts are hydrophobic in nature with the exception of SQAPI Asp 53 , for which hydrogen bonding between its carboxylate side chain and the backbone amides of pepsin residues Ser 110 and Phe 111 are predicted.
The NMR resonances for residues His 51 -Ser 61 , which include the important binding determinants Trp 52 and Asp 53 , are only visible in the 15 N-HSQC spectra at elevated temperatures. This indicates a degree of dynamic broadening at lower temperatures attributable to intermediate conformational exchange. Intermediate exchange occurs when the NMR frequency differences arising between states are commensurate with the lifetime of the states and is usually in the order of milliseconds. Such intermediate timescale motions are commonly described for regions of proteins involved in catalysis (33) or protein-protein interactions (34). This raises the prospect of multiple conformations occurring in this region of the protein. Elevating the temperature to 50°C may have either averaged the states by inducing fast exchange or may simply favor one particular state. If multiple states of this loop exist, they cannot be separately characterized via the present analysis. Nevertheless, we have shown that SQAPI retains activity at 50°C, the temperature at which the structure was determined.
Although loop 2 of cystatin, analogous to the Ala 81 -Asn 84 loop of SQAPI, contributes binding energy to interactions with some cysteine proteinases, it does not interact strongly with others (35). The sequence Ala 81 -Lys 87 is invariant across all 11 SQAPI variants for which a C-terminal sequence is available (3), leading us to hypothesize a role for SQAPI loop 2 in aspartic peptidase inhibition. To investigate this possibility, we mutated the SQAPI residues Ser 82 , Asp 83 , and Asn 84 . The absence of variation in K i upon glycine substitution suggests that these invariant residues are not important for pepsin binding. Our docking results utilized ambiguous interaction restraints based upon loss of inhibition seen for N-terminal and Lys 47 -Asn 54 loop mutants. As a consequence, our model includes these binding determinants at the interface. In agreement with the mutagenesis data, our model does not support the direct involvement of the Ala 81 -Asn 84 loop in inhibition. However, peptide contacts are predicted by our model between the nearby residues Leu 78 and Lys 80 from the third ␤-strand of SQAPI and Gly 109 and Ser 110 of pepsin.
With SQAPI, four of the seven known proteinaceous aspartic proteinase inhibitors have been recruited from ancestral and widely distributed forms of inhibitors of other proteinase classes (36). The other three inhibitors, each a unique protein, have highly restricted distribution, suggestive of recent evolution. The paucity of aspartic proteinase inhibitors and their recent evolution may indicate that aspartic proteinases themselves are a more recent evolutionary event than serine and cysteine proteinases.