![]()
|
|
||||||||
J. Biol. Chem., Vol. 278, Issue 44, 43357-43362, October 31, 2003
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



From the
European Molecular Biology Laboratory (EMBL) Hamburg c/o DESY, D-22607 Hamburg, Germany, the **Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61704 Poznan, Poland and Institute of Biochemistry and Molecular Biology, University Hospital, Hamburg-Eppendorf, c/o DESY, 22603 Hamburg, Germany, the ¶Laboratoire de Cristallographie et Modelisation des Matériaux Mineraux et Biologiques LCM3B CNRS UHP Faculté des Sciences, 54506 Vandoeuvre-les-Nancy Cedex France, and the ||Novozymes Protein Design, 2cs.01 DK-2880 Bagsvaerd, Denmark
Received for publication, June 30, 2003 , and in revised form, August 21, 2003.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Catalysis takes place in a cleft from which a binding pocket (specificity pocket) protrudes into the interior of the enzyme. A catalytic triad (Ser195-His56-Asp99) constitutes the core of the reactive center. For the stabilization of the charges developing on the reaction intermediates during catalysis an oxyanion hole is formed by the main-chain N-H of residues 192195 in an arrangement known as a nest (1). The specificity pocket accommodates long, basic amino acid side chains. At its bottom, an aspartate residue forms a tight salt bridge with the substrate. The difference in specificity between various serine proteases is determined by the shape and the width of the specificity pocket (2, 3).
The catalytic mechanism of trypsin isolated from various sources has been investigated in great depth (410). Yet several questions remain unanswered. One is the identification of the catalytically active water molecules. Also, the catalytic triad appears to function in different ways in different proteins and display different features, particularly the presence of the so-called short hydrogen bonds (11).
Trypsin from Fusarium oxysporum shows auto-proteolysis activity even at crystallization conditions. A peptide fragment can always be found in the active site of the crystalline enzyme (12). This together with the fact that its crystals diffract extremely well has made it especially interesting for mechanistic studies and offered an excellent tool for the investigation of the catalytic activity at the electronic level (13, 14).
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Co-crystallization with the Inhibitor DFP1DFP (diisopropyl fluorophosphate, Mw = 184 g/mol) was purchased from Aldrich Chemicals. 5 µl of the inhibitor were dissolved in 250 µl of isopropyl alcohol. 1 µl of this solution was added to a drop containing 10 µl of protein solution and 20 µl of precipitant solution. The approximate molar ratio of inhibitor to protein was 20:1.
Co-crystallization with the Inhibitor PMSFAn aqueous stock solution of a concentration of 8 mg/ml PMSF (Mw = 174 g/mol) from Sigma was prepared, and 1 µl of this solution was added to the drops containing 10 µl of protein and 20 µl of precipitant solution. The molar ratio of inhibitor to protein was again 20:1. Nucleation was initiated as for the native crystals in both cases.
SoakingNative trypsin crystals were soaked in a drop containing the precipitant solution at pH 4 for several minutes prior to cryo-freezing.
Cryo-freezing ConditionsIn previous publications, the cryo-freezing conditions involved a gradual increase of glycerol content in the mother liquor up to 20% (v/v) glycerol (12). Now, we transferred the crystals into light paraffin oil directly from their drops. In order to avoid ice formation, all traces of the mother liquor were carefully removed from the crystal surface by moving it around in the oil drop. Crystals were then flash frozen directly in the cryo-stream at 100 K.
Data Collection and ProcessingData were collected on beamlines X11 and X13 at EMBL Hamburg, equipped with MAR CCD detectors, at wavelengths of 0.8110 and 0.8020 Å, respectively. A summary of the data statistics is given in Table I. Data were processed with denzo/scalepack or HKL2000 (16), intensities were truncated to structure factors using the CCP4 suite of programs (17).
|
RefinementInitial refinement and addition of solvent were carried out with ARP/wARP (18) and refmac (19). Subsequent steps were performed with SHELXL (20), including the addition of hydrogen atoms and anisotropic B factors, until convergence was reached. Refinement was carried out against diffracted intensities. The same refinement protocol was used for all structures described here. Heterocompounds were introduced in the refinement as early as possible. Occupancies for alternate conformers, and the substrate were set according to their anisotropic atomic displacement parameters (ADP) and individually refined in a last round. Solvent molecules with ADP higher than 60 Å2 were removed; those with ADP higher than 45 Å2 were assigned occupancy of 0.5.
Complementary MethodsAb initio quantum chemical calculations on the pH 4 structure were performed using GAMESS (21) with a 631G** base set and a Hückel model for the construction of the initial molecular orbitals. The active site residues, the substrate and side chains involved in direct contact with the substrate were chosen as the geometrical template. Only relevant parts of the interacting compounds (i.e. atoms at an H-bonding distance) were included in the calculations. The substrate arginine was treated as alanine since only its main chain atoms interacted with the active site. Aspartate 99 was modeled as acetic acid. The system included a total of 64 atoms (265 electrons, with a net charge of1). Atoms of chemically inert groups not involved in direct contact with the substrate were fixed at their positions. Solvent effects were neglected as the occupied active site is a closed cavity without considerable contact to the solvent space. Hydrogen atoms (at the two water molecules in the active site) were inserted into the structure according to standard stereochemistry and existing H-bonding partners. The following cases were subjected to ab initio quantum chemical calculations: QC1, active site in the native state; QC2, Ser195 de-protonated; QC3, W1 as hydroxyl ion; PMSF, covalently bound.
Multipolar Atoms RefinementThe 0.80 Å structure was refined using the charge density refinement program MoPro (20). The program describes the electron density of the atoms with the Hansen and Coppens model (22), which includes atomic charges, expansion/contraction coefficients, and multipoles.
The structure refined with SHELXL was used as the starting model. The charge density parameters were transferred to the structure from our data bank describing the electron density of atoms in proteins (23, 24). Protein atoms with ADPs higher than 15 Å2 and multiple conformers were considered spherical and were only assigned a charge and a contraction/expansion coefficient. Water molecules were described as a neutral oxygen atom and were refined in conventional manner.
The contribution of the bulk solvent to the crystal diffraction was added to the structure factors (25, 26) in Equation 1.
![]() | (Eq. 1) |
These parameters were also refined in MoPro. The following weighting scheme was applied to the reflections shown in Equation 2.
![]() | (Eq. 2) |
The three different types of structural parameters were refined in an iterative way until convergence: scale factor with solvent parameters Ksol and Bsol, atomic coordinates, ADPs. Hydrogen atoms were not refined and their coordinates were constrained according to standard stereochemistry while their isotropic B-factors were set to the Beq of the parent atom. Usual distance restraints were applied on the structure according to the Engh & Huber stereochemical dictionary (target rmsd deviations of 0.02 and 0.05 Å for bond and angle distances, respectively).
| RESULTS |
|---|
|
|
|---|
(2.05 Å) (Fig. 1), which implies covalent interaction. The geometry around the substrate carbonyl carbon was roughly tetrahedral. No electron density could be found for the N-terminal fragment of the substrate. The water molecules W1 and W2 were fully occupied as opposed to the substrate, which showed partial occupancy with the total ranging from 0.4 (pH 5) to 0.8 (pH 5/borax and pH 4).
|
|
DFP and PMSF were bound covalently to Ser195. The phosphate groups were always well visible. The respective isopropyl and phenyl moieties displayed weaker electron density and did not point into the specificity pocket but in the direction of His56. As an effect, the histidine side chain is statically disordered showing two conformations, one of which is rotated away from the inhibitors thus disrupting the catalytic triad (Fig. 1). In the DFP and PMSF structures, one oxygen atom in the sulfonyl or phosphate groups takes the position of W1, another one the position of W2.
Like the other native structures, the pH 5/borax structure also had a peptide bound into the active site. This structure had the highest quality data set and was used for multipole refinement.
Co-crystallization with DFP and PMSF changed the space group from P1 to P21 but left the internal protein structure unchanged and the crystal contacts mostly preserved.
Disorder and MobilityOne-fifth of the total number of residues of F. oxysporum trypsin shows at least two conformers. Their occupancies relate to the occupancy of the substrate or of the inhibitor side chain. In all the structures except native pH 5, the substrate- or inhibitor-bound state seems to correspond to the protein conformer with higher occupancy, with an average value of 0.65 in all structures. The conformers with corresponding lower occupancy (0.35) correspond to the empty active site. The rmsd for the two visible main chain conformers of disordered residues lies between 1 and 2 Å There was no evidence for the presence of other, intermediate conformational states.
Most of the disordered residues are spatially linked to each other. The effect starts in the loops of the active site cleft and is then transferred to the adjacent regions. The patterns of disorder are different for the native and the inhibited structures. The inhibited structures, mimicking a reaction intermediate, show higher anisotropy of the active site residues. In the native structures at pH 4 and pH 5, disorder arises in the specificity pocket, which is not the case in the DFP and PMSF structures where this site is empty. Despite the apparent movement in and around the active site, Ser195 shows only one conformation as mentioned above.
An analysis of corresponding conformers showed that there is good correlation between rmsd and ADP in the native structures indicating disorder of mostly dynamic origin. Disorder in the inhibited structure appears to be mostly static.
Quantum Chemical Calculations on the pH 4 StructureA stable match between the crystal structure and the theoretical model could only be achieved under the assumption that the catalytic serine is de-protonated and the attacking water molecule W1 is actually a water molecule and not a hydroxyl ion. In all other cases, the geometry was directed toward either the products of peptide cleavage (QC3) or the covalent intermediate (see Fig. 2 for an overview of the catalytic pathway).
|
This implies that the crystal structure represents a state close to a tetrahedral transition state. Bonds of respective order 0.99 and 0.47 are formed between the substrate C and the Ser195 O
and from the substrate C to W1, which is held in place from the other side by His56. His56 is deprotonated and acts as a Lewis base, activating the catalytic water molecule, which is indicated by a strong H-bonding contact of bond order 0.20.3.
Water molecule W2 in the oxyanion hole forms a very strong H bond to the carbonyl oxygen of the arginine substrate in the model matching the crystal structure. This is accompanied by a high negative charge (0.99) on the W2 oxygen atom.
Multipole RefinementFor the ultra-high resolution structures (pH 5 and pH 5/borax), the protonation state in the active site could be visualized directly in residual electron density maps. His56 was indeed not protonated (N
1 is non-protonated while the hydrogen atom H
1 is bound to N
2). The multipole charge density parameters were transferred from the data base and Fig. 3A shows the corresponding deformation electron density on the catalytic histidine. The small peak near N
2 indicates the electron lone-pair on the nitrogen atom or a partial protonation (which at this pH would be quite probable), or a rotation of the histidine ring by 180° around the C
C
bond. The electrostatic potential derived from the multipoles transfer is shown for the His56-Asp99 interaction (Fig. 3B).
|
| DISCUSSION |
|---|
|
|
|---|
Mobility in the CrystalThe walls of the active site cleft at the subsites S2, S3 (residues 144150) are always in double conformation, subsites S2' and S3' (residues 9395) only in the DFP and PMSF structures. This can be understood as an effect of the release of the N-terminal fragment of the substrate (sites S2, S3) and, for the C-terminal part of the active site cleft, adjustments in the positioning of the substrate just before the nucleophilic attack of water W1.
In the pH 4 and pH 5 structures, Cys191, Tyr225, and the stretch between amino acids 215 and 220 show main chain double conformations. Cys191 and Tyr225 are part of the wall of the specificity pocket. Cys216 forms a disulfide bridge with Cys191 and hence moves together with it. Apparently, this is related to the binding and release of the substrate arginine side chain, which also shows multiple conformers. Movement upon binding or release is supported by the fact that in the case of the covalently bound inhibitors, which did not reach into the pocket, the specificity pocket was completely rigid. An induced fit model was also supported by interpretation of the structures with inhibitors. In both cases the protein showed two different conformers in the subsites S2, S3, S2', and S3', whose occupancies relate to the occupancies of DFP (0.7) and PMSF (0.6).
Two important residues involved in this movement are Asp189 and Ser190, both located at the very bottom of the pocket, highly conserved and responsible for the preference of trypsin for basic residues (2, 3). Asp189 forms a strong ion pair with the substrate arginine side chain and its movement as well as the disorder of the other residues in the specificity pocket appears to be induced directly by the substrate. Changes in the substrate position would be transmitted to the catalytic center or vice versa. Re-positioning must occur during the formation and breaking of the covalent bond to Ser195. NMR experiments on complexes showed that neither active site nor specificity pocket is fully pre-formed in the native structures. Only binding to specificity subsites creates an environment suitable for a tetrahedral transition state (27). This seems to be the case also in F. oxysporum trypsin, in contrast to the widely accepted lock-and-key model for serine protease substrate binding (28).
Increased anisotropy was observed for the residues of the catalytic triad in the inhibited structures. The shape of the thermal ellipsoids of the atoms in His56 and Asp99 indicated movement toward Ser195 and the atoms in Ser195 tend to move perpendicular to the plane formed by C
, O
, and P or S (Fig. 4). In addition, some residues at sites S2' and S3' show considerable shift of one conformer toward the active site, shortening the cleft. This is not observed in the structures with the substrate. We therefore assume that a re-arrangement in the substrate binding region is required before de-acylation. In the covalent intermediate, the Ser195 ester carbonyl can have any orientation (10). It has to be adjusted, fixed, and a water molecule has to approach the covalent intermediate and position itself in a location suitable for catalysis. The water molecule W1 in the pH 4 and pH 5 structures is already at its nucleophilic attack and steric effects might explain why the reaction did not proceed further. Embedding of protein molecules in crystals makes large movement impossible and impedes product release or binding: the active site of trypsin was in principle accessible in the crystals, but it was impossible to bind DFP to the protein by soaking; DFP could not be seen in the active site of trypsin in soaked crystals. This again strongly indicates an induced fit mechanism for substrate binding and the presence of an "open" and "closed" state or an "empty" and "loaded" conformation of the protein molecule.
|
Reaction Partners and Water MoleculesW1, attached to His56, and W2 in the oxyanion hole were the two candidates for participation in catalysis. The role of the water molecule W1 as the nucleophile is easy to see. The location of water molecule W2 in the oxyanion hole seemed at first glance a bit exotic for steric reasons and in view of the commonly accepted direct stabilization of the carbonyl oxygen by the oxyanion hole. However, in the bovine trypsin, a water molecule has been found in the same position (29). Its proposed role was to protonate the carbonyl oxygen atom of the substrate, to take up the excess electrons and facilitate the shift of electrons toward the oxyanion hole (5). This role was controversial, as this water molecule does not exist in all known serine protease structures.
In F. oxysporum trypsin the distance of the substrate carbonyl to W2 is much shorter than expected for even the strongest of H bonds. This is exactly the property that can help facilitate catalysis. In short H bonds, the energy barrier for the proton transfer from one binding partner to the other becomes very small (30, 31). There has been a very lively debate over the influence of "low barrier hydrogen bonds" (30, 32, 33) on catalysis through an energy contribution of up to 20 kcal/mol. In these studies, a strong hydrogen bond was defined as of an OO distance of 2.4 Å, while the hydrogen bond between W2 and the substrate carbonyl oxygen atoms is 1.83 Å long. The proton between these two oxygen atoms should be able to fluctuate easily. Ab initio calculations show that the bond order to the carbonyl oxygen is 0.82 (see Table III). This means that the proton has almost been transferred from the water molecule to the substrate. The hydrogen atoms of the water molecules W1 and W2 were not seen in the electron density maps.
|
Quantum ChemistryThe QC2 theoretical model matching the crystal structure shows the most extreme charge distribution as compared with the other models. The structure is close to a tetrahedral transition state. Its stability can be attributed mostly to the capability of the water molecule in the oxyanion hole to take up a large excess of electrons. It achieves that by attachment to the carbonyl oxygen via a very strong H bond with bond order 0.82. As the bond order between the shared hydrogen atom and the W2 oxygen atoms is only 0.11, the water molecule should be pictured rather as a HOH+-O arrangement at an extremely close distance with the internal structure of the water molecule maintained. In the other models (QC1 and QC3), W2 remains a water molecule with strong internal bonds and external H bridges of normal strength. This indicates that, after the nucleophilic attack of W1, W2 regains its original state and functions as the Henderson water is supposed to function (29). The QC3 model corresponds to the end-state of peptide cleavage, step 5 in Fig. 2.
W1 has been identified before in crystal structures (4, 29) but its chemical state was never clear. Quantum chemical calculations show that W1 performs its nucleophilic attack as a water molecule. When a hydroxyl ion was input into the calculations, the system was immediately converted into the products and the arginine pushed out of the active site. This is understandable as a hydroxyl ion is among the hardest nucleophiles. Its proximity to a carbon carrying a positive partial charge would naturally result in immediate attachment, excluding the chance to observe such a state in a crystal structure. Our results show that not only one, but both water molecules are catalytically active at the same time.
Electronic Details Revealed at Ultra-high ResolutionThe beauty of ultra-high resolution protein crystallography is that one can attain electronic detail by crystallography alone (34). At ultra-high resolution the individual atoms in a molecule can no longer be considered spherical. The presence of lone-pair and bonding electrons has then to be directly included into the refinement.
The electron density in the active site of the ultra-high resolution trypsin structures shows that His56 is definitely deprotonated. Because of a relatively high thermal motion on the Ser195 O
, the electron density cannot give evidence for an attached proton on this side chain. However, the serine 195 C
O
distance is 1.22 Å (pH 5-borax structure), which is significantly smaller than the standard protonated serine C
O
distance of 1.417 ± 0.020 in the Engh and Huber (35) stereochemistry dictionary. This implies the following. First, the pKa of His56 is lower than 6.5. This is already hinted at by the presence of enzymatic activity under crystallization conditions despite a basic pH optimum and proves that His56 acts as a general base. A rotation of the imidazole ring by 180° could play a role in the substrate binding and product release steps. Second, Ser195 is de-protonated as well and is capable of covalent attachment to the substrate. And third, the assumptions made on the protonation state in the active site were correct and therefore the ab initio calculations have produced a realistic model.
Implications for Catalysis and ConclusionsUsing the atomic resolution structures and the results from the ab initio calculations, it could be shown directly that trypsin de-acylation does occur via a tetrahedral intermediate with subsequent hydrolysis in true SN2 fashion. The water molecule W1 is the obvious candidate to perform hydrolysis, while the preparation for this step is performed by W2 by protonating the carbonyl oxygen. Despite the requirement for W1 to carry a negative partial charge in order to perform a nucleophilic attack, complete de-protonation does not occur. Its attachment to the substrate and proton abstraction is concerted.
The lowered pKa of His56 explains why the enzyme was still active in the crystallization solution. The fact that the His56 side chain swings out upon contact with the inhibitors implies that the catalytic triad is not rigid and tight. The inhibitors are larger than the substrate intermediate and remove the water molecules W1 and W2 from their normal positions. Therefore, no activator (W2) and no nucleophile (W1) for hydrolysis are present. Hence, with their modifications (fluorine instead of oxygen and therefore strong withdrawal of electrons), the inhibitors have triple effect. Ab initio calculations on the PMSF structure (result not shown) yielded a charge distribution around the Ser195 O
similar to the native, which indicates that inhibitors are good models for the tetrahedral intermediates. F. oxysporum trypsin is a dynamic enzyme, in which substrate binding and product release involves an induced fit mechanism and where the "loaded" and "empty" states can be structurally distinguished.
Atomic and ultra-high resolution crystal structure determination and ab initio quantum chemical calculations have now become indispensable tools for comprehensive structure interpretation, acquiring snapshots along the reaction pathway and the assignment of function to the residues involved in catalysis. The motion of the macromolecule in the crystal structure, revealed from the anisotropic atomic displacement parameters, complements the time-average structural picture with dynamics thus opening a new prospective in the structure-functional analysis of biological processes.
| FOOTNOTES |
|---|
* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
To whom correspondence should be addressed. E-mail: andrea{at}embl-hamburg.de.
1 The abbreviations used are: DFP, diisopropyl fluorophosphate; PMSF, phenylmethylsulfonyl fluoride; rmsd, root-mean-square-deviation. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
E. S. Radisky, J. M. Lee, C.-J. K. Lu, and D. E. Koshland Jr. Insights into the serine protease mechanism from atomic resolution structures of trypsin reaction intermediates PNAS, May 2, 2006; 103(18): 6835 - 6840. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Molecular and Cellular Proteomics |
| Journal of Lipid Research | ASBMB Today |