|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 19, 14300-14308, May 11, 2007
Structural and Mechanistic Analyses of endo-Glycoceramidase II, a Membrane-associated Family 5 Glycosidase in the Apo and GM3 Ganglioside-bound Forms*![]() ![]() ![]() ![]() ![]() ![]() 1
From the
Departments of
Received for publication, December 14, 2006 , and in revised form, February 8, 2007.
endo-Glycoceramidase, a membrane-associated family 5 glycosidase, deviates from the typical polysaccharide substrate specificity of other soluble members of the family, preferentially hydrolyzing glycosidic linkages between the oligosaccharide and ceramide moieties of gangliosides. Here we report the first x-ray crystal structures of an endo-glycoceramidase from Rhodococcus sp., in the apo form, in complex with the ganglioside GM3 (Svennerholm ganglioside nomenclature (Svennerholm, L. (1964) J. Lipid Res. 5, 145-155)), and trapped as a glycosyl-enzyme intermediate. These snapshots provide the first molecular insight into enzyme recognition and association with gangliosides, revealing the structural adaptations necessary for glycosidase-catalyzed hydrolysis and detailing a novel ganglioside binding topology. Consistent with the chemical duality of the substrate, the active site of endo-glycoceramidase is split into a wide, polar cavity to bind the polyhydroxylated oligosaccharide moiety and a narrow, hydrophobic tunnel to bind the ceramide lipid chains. The specific interactions with the ceramide polar head group manifest a surprising aglycone specificity, an observation substantiated by our kinetic analyses. Collectively, the reported structural and kinetic data provide insight toward rational redesign of the synthetic glycosynthase mutant of endo-glycoceramidase to enable facile synthesis of nonnatural, therapeutically useful gangliosides.
Gangliosides are prominent components of mammalian cell membranes that are particularly prevalent in neuronal membranes (1). They are implicated in numerous signaling roles, with involvement in the cell cycle, differentiation, communication, recognition, and apoptosis (2). In mammalian systems, gangliosides are synthesized by the sequential, glycosyltransferase-catalyzed, addition of monosaccharides to a ceramide (1). In a similar stepwise manner, they are catabolized by exo-glycosidases that cleave monosaccharides from the nonreducing oligosaccharide end. In contrast, endo-glycosidases that directly hydrolyze the glycosidic linkage between the ceramide and oligosaccharide (Fig. 1a) have been discovered in nonmammalian cells (3-9), where they presumably play a nutritional role for the organism. One such enzyme, endo-glycoceramidase II (EGC)2 from Rhodococcus, is a family 5 glycosidase (www.cazy.org) that hydrolyzes gangliosides with a net retention of stereochemistry at the anomeric center via the double displacement mechanism common to other retaining glycosidases (Fig. 1b) (10).
EGC is noticeably distinct in function from other family 5 glycosidases, including the prototypical The potential of gangliosides as therapies for cancer (16, 17), diabetes (18), and neurodegenerative diseases, such as Alzheimer (19, 20) and Parkinson diseases (21, 22), has prompted interest in the development of efficient, large scale syntheses to replace the current method of ganglioside isolation from contaminant-prone sources (e.g. bovine brain). Toward this end, protein engineers have recently harnessed EGC activity in order to address the difficulties in making these synthetically challenging molecules. By creating the glycosynthase (23, 24) of EGC, a nucleophile mutant of the enzyme capable of synthesizing but not hydrolyzing gangliosides, it was possible to synthesize a range of glycosphingolipids from their constituent oligosaccharide and lipid moieties in excellent yields (15). This latter study probed the ability of EGC to couple a broad range of oligosaccharide and lipid substrates, as did earlier complementary studies on the hydrolytic specificity of the wild-type enzyme (25). The structural basis of this specificity has yet to be ascertained.
Given the importance of gangliosides and their turnover in cellular regulation, it is surprising that the only structurally characterized glycolipid-processing enzyme to date is human glucocerebrosidase, whose deficiency results in Gaucher's disease (26, 27). This most likely arises from the inherent difficulties in handling membrane-associated proteins and hydrophobic substrates, such as ceramides. Through trials of various detergents in crystallization and organic solvents to solubilize the substrates, we have succeeded in solving the x-ray crystal structures of apo-EGC, the Michaelis complex of the ganglioside substrate GM3 with a hydrolytically inactive nucleophile mutant, and a trapped glycosyl-enzyme intermediate. Together these structures detail the reaction coordinate, confirming the proposed double displacement mechanism and the identities of the catalytic glutamic acid residues. The GM3 complex, itself the product complex for the glycosynthase, details a novel glycosphingolipid binding topology, which provides a structural basis for the previously observed aglycone specificity of the enzyme. The crystal structures further offer valuable information for rational redesign of EGC toward the synthesis of novel ganglioside-derived therapeutics. This study represents a significant advance in our understanding of this unusual glycosidase activity, permitting insight into how glycosphingolipid hydrolysis is accomplished by this black sheep of the family 5 glycosidases.
Protein Expression and PurificationThe Escherichia coli codon-optimized gene for endo-glycoceramidase II from Rhodococcus sp. strain M-777, lacking the 30-residue N-terminal secretion signal sequence (egc), was synthesized (Blue Heron Biotechnology, Inc.) and subcloned into pET28a using the NdeI/XhoI restriction sites (15). E. coli BL21(Tuner) cells containing the egc/pET28a expression plasmid were grown at 37 °C to stationary phase in TYP medium containing 30 µg·ml-1 kanamycin. Protein production was induced by the addition of isopropyl -D-thiogalactopyranoside to a final concentration of 0.1 mM following reduction of the temperature to 20 °C. The resulting His6·EGC (residues 31-490) was purified by Ni(II) affinity chromatography to >95% purity as determined by SDS-PAGE analysis. For brevity, this recombinant protein is hereafter referred to as EGC.
The activity of EGC was confirmed by a colorimetric assay with 2,4-dinitrophenyl
CrystallizationCrystals of apo-EGC were grown by the vapor diffusion method. Sitting drops of 1 µl of protein (10 mg·ml-1), containing 0.1% (v/v) Triton X-100, and 1 µl of well solution were equilibrated at 21 °C against a well solution of 20% (w/v) polyethylene glycol 3350, 0.175 M NaCl, and 0.1 M Tris-HCl, pH 8.5. Crystals belonging to the space group P21, with unit cell dimensions a = 53.8 Å, b = 92.9 Å, c = 94.5 Å, Data Collection and Structure DeterminationX-ray data were collected at 100 K using a nitrogen stream. Data from selenomethionine-derivatized crystals and GM3 co-crystals were collected at Beamline 8.2.2 of the Advanced Light Source (Berkeley, CA), using an ADSC Q315 CCD detector. All other data were collected on an in-house rotating anode x-ray generator, using a Mar345 detector. The data were processed using either MOSFLM (28) and the CCP4 suite of programs (29), or DENZO and SCALEPACK (30). 18 selenium sites were located, and phases were calculated using the program SOLVE (31). The phases had a figure of merit of 0.40 over the resolution range to 2.0 Å, which increased to 0.72 following density modification with the program RESOLVE (32).
RefinementAn initial model incorporating
Structure Determination and Refinement of EGC Complexes Initial phases for the C2 crystal form of EGC in complex with GM3 were calculated by molecular replacement using the program MOLREP (37). A monomer of EGC from the apo structure was used as the search model. The structure of the glycosyl-enzyme intermediate, resulting from the 2,4-dinitrophenyl -D-lactoside soaking experiment, was solved using the native EGC structure as the initial source of phase information. All structures of EGC complexes were refined using REF-MAC5. Energy-minimized ligand models and their refinement restraints were created using the PRODRG server (38). Ligands were fitted to active site difference electron density, and further rounds of restrained refinement were carried out. Omit mFo - DFc maps were calculated following removal of ligand atoms and random model perturbation using PDBSET. All structures possessed one or more features in the electron density that, based upon coordination geometry and distance, were modeled as sodium ions originating from the crystallization conditions (39). Both the apo-EGC and the EGC·GM3 structures exhibited an unidentified chain of electron density at the periphery of the protein, distant from the active site that was unmodeled. This density resided on a 2-fold symmetry axis in the EGC·GM3 structure. Final models were validated using MolProbity (40). Figures were prepared using PyMOL (41), electrostatic potentials were calculated using APBS (42), and hydrophobic potentials were calculated using GRASP (43).
Enzyme KineticsEnzyme concentrations were measured by absorbance at 280 nm, using the extinction coefficient 280 = 89,600 cm-1·M-1, determined utilizing the methods of Gill and von Hipple (44). Michaelis-Menten kinetic parameters for the glycosynthase were determined by varying the concentration of either the donor or acceptor while utilizing a fixed concentration of 7 mM for the other species. Reactions were conducted at 37 °C in 25 mM sodium acetate, pH 5.0, containing 7% 1,2-dimethoxyethane. Reaction rates were measured using a VWR Symphony fluoride electrode interfaced with Logger Pro 2.2.1 analysis software (Vernier, Inc.). Before the addition of enzyme, the rate of spontaneous hydrolysis of the glycosyl fluoride was measured, and this rate was subsequently subtracted from the enzymatic rate. Glycosynthase reactions were initiated by the addition of EGC/E351S (100 µl) to a final concentration of 1-7 µM. Total reaction volume was 600 µl. Initial rates were plotted against substrate concentration, and Michaelis parameters were determined by non-linear regression analysis using Grafit 4.0 (Erithacus Software (45)). In cases where it was not possible to achieve enzyme saturation, kcat/KM values were determined from the gradient of the rate versus [S] plot.
Architecture of EGCRecombinant EGC, lacking the 30-amino acid N-terminal signal sequence, was overexpressed in E. coli and purified by Ni(II) affinity chromatography. Diffraction quality EGC crystals were subsequently obtained in the presence of 0.05% (v/v) Triton X-100. Smaller EGC crystals were obtained with a number of other detergents, including octyl -D-glucopyranoside, decyl -D-maltopyranoside, and dodecyl -D-maltopyranoside; however, no crystals were observed in the absence of detergent. The structure of EGC was determined by using the single-wavelength anomalous dispersion method with selenomethionine-derivatized protein. The EGC crystals belonged to the space group P21 and contained two molecules per asymmetric unit.
Each monomer of EGC is arranged in two distinct domains (Fig. 2a). The N-terminal domain (residues 43-407) assumes an (
Two molecules of EGC are present per asymmetric unit (root mean square deviation of 0.31 Å over 413 common C
EGC Active Site StructureThe EGC active site channel is formed by the loops of the ( Inspection of the mFo - DFc map of the active site revealed density attributable to a molecule of Tris bound between the predicted catalytic residues. The Tris molecule is coordinated by the side chains of Lys66, Asp137, Asn232, Glu233, and Glu351 and appears to partially mimic the coordination of a sugar bound in the -1 subsite. EGC·GM3 Substrate ComplexIn order to prevent substrate hydrolysis, crystallization experiments with the substrate GM3 were carried out with a nucleophile knock-out mutant, E351S. This is the same mutant as that which was employed as the glycosynthase in previous studies. Soaking EGC/E351S crystals with GM3 led to a significant deterioration in diffraction quality. EGC/E351S was therefore co-crystallized with GM3. EGC·GM3 crystals belonged to the space group C2 and contained one molecule per asymmetric unit. The EGC·GM3 structure was solved by molecular replacement to a resolution of 1.1 Å. Inspection of the mFo - DFc map revealed density attributable to GM3 bound in the proposed active site channel. GM3 was modeled and refined to unitary occupancy, with an average B-factor of 9.3 Å2. There was no indication of residual Tris binding.
The overall structure and fold of EGC appears broadly unchanged following binding of GM3, as reflected in a root mean square deviation of 0.47 Å over 404 common C The three pyranoside rings of GM3 fold into a distinctive curve (Fig. 3), with the 6-OH of glucose interacting directly with the 9-OH of sialic acid over a distance of 2.8 Å. Notably, this forms the only non-solvent-binding interaction for the sialic acid moiety. The coordination of the GM3 is described in detail in Fig. 3c. In common with many carbohydrate-binding proteins, the substrate binding site of EGC is lined with a number of aromatic amino acids that form hydrophobic interactions with the sugar rings; the glucose of GM3 interacts with the side chains of Tyr306 and Trp382, whereas the galactose ring interacts with the side chain of Trp178. The side chain of Trp389 constitutes an additional boundary to the active site cavity. There is no observable distortion of the GM3 glucose moiety from a chair conformation, as might have been expected in a Michaelis complex (49), suggesting that the side chain of the nucleophile Glu351 may be required for substrate distortion. Both hydrocarbon chains of the GM3 ceramide are well defined in the electron density. The electron density deteriorates after the chains exit the enzyme active site channel, presumably due to increased flexibility. The acyl and sphingosine chains of the ceramide are stacked vertically with respect to the hydrophobic channel, with the acyl chain located below the sphingosine chain. The ceramide-binding channel is lined by the hydrophobic side chains of amino acids Leu180, Tyr182, Ile183, Phe235, Ile276, and Leu308. Upon binding of GM3, the side chains of Arg177 and Asp311 localize to form a "cap" over the ceramide-binding channel. Presumably as a consequence of this conformational change, the flexible 311-315 loop region becomes well defined in the electron density. Furthermore, there is a slight conformational change in part of the lid subdomain (residues 166-193) to accommodate the change in Arg177.
EGC·Lac Glycosyl-Enzyme IntermediateThe glycosyl-enzyme intermediate was trapped by soaking crystals of the general acid/base knock-out mutant, EGC/E233A, with an activated sugar donor, 2,4-dinitrophenyl
Attempts to soak EGC/E233A crystals with 2,4-dinitrophenyl -D-sialyllactoside gave rise to electron density suggestive of a sialyllactosyl glycosyl-enzyme intermediate, but only in low occupancy. Presumably, crystal contacts restrict the soaking of this larger trisaccharide, and the poor occupancy prevented confident modeling of this complex. Probing the Ceramide-binding SiteTo further assess the basis of lipid specificity in EGC, the kinetic parameters of various lipid acceptors for the synthetic glycosynthase reaction were determined. These are presented in Table 2.
In common with the structures of other family 5 glycosidases, the catalytic domain of EGC forms an ( / )8 fold, with the catalytic center positioned in a channel on the upper surface of the barrel. It is the constitution of the EGC active site channel, however, which distinguishes it markedly from other members of family 5 and provides insight into the unusual substrate selectivity exhibited by this enzyme. Unlike those family 5 glycosidases that have adapted to accept entirely polar polysaccharides, the active site of EGC displays a split personality to accommodate both the hydrophilic sugar and hydrophobic ceramide components of its glycosphingolipid substrates.
The glycone binding site described by the EGC·GM3 complex, on the nonreducing (minus subsites) side of the catalytic center, exhibits many of the features common to sugar-binding proteins. These include many polar contacts between enzyme and ligand to "solvate" the hydroxyls and a number of hydrophobic interactions with aromatic side chains that further stabilize the bound sugars. This cavity is broader, however, than the linear channels typically observed in the -glucanases, -mannanases, and -xylanases of family 5 (see comparison of EGC and Cel5A -glucanase (11) (Fig. 2)), presumably to allow binding of the branched sugar moieties that EGC is known to process. The flexible loop region (residues 145-156) to the side of the active site is positioned close to the axial 4-OH of the GM3 galactose moiety, the location where the additional sugar groups branch off in the more extended GM1 and GM2 gangliosides. Indeed, studies probing substrate glycone selectivity suggested only slightly lower tolerance of the larger GM1 and GM2 substrates (25), and these larger oligosaccharides were seen to be efficiently processed by the glycosynthase mutant (15). It is possible, therefore, that this loop may facilitate an induced fit mechanism, allowing expansion of the active site for these alternative substrates. The striking curve defined by the three sugar moieties of GM3 may be a determinant of the glycone selectivity of EGC; it is possible that an intramolecular self-association of the substrate into a predefined shape assists recognition and binding. To accommodate the binding of ceramide-based aglycone moieties, the active site channel downstream of the catalytic center is lined with hydrophobic residues. Interestingly, the lipid-binding topology observed in the EGC·GM3 complex differs from that of other glycosphingolipid-binding proteins, which typically encapsulate the hydrophobic tails, revealing only the polar sugars and lipid head groups to solvent. For the CD1 family of antigen-presenting proteins (50-59), although each member exhibits specific adaptations consistent with varying substrate specificities, the overall protein architecture and mode of lipid binding is similar; the lipid tails are fully enclosed within complex networks of hydrophobic channels specific to the length and shape of the lipid chains. The polar sugars and lipid head groups are hydrogen-bonded at the protein surface for presentation to T-cell receptors. Similarly, the structures of human and bovine glycolipid transfer proteins (60, 61) reveal analogous modes of substrate binding; the polar glycone moieties are positioned at the protein surface, whereas the lipid tails are bound within a single hydrophobic tunnel completely enclosed within the protein environment. Structural studies of the membrane-lipid activator proteins saposin B (62) and human GM2 activator protein (GM2AP) (63, 64) further exemplify lipid substrate binding by encapsulation within hydrophobic protein cores. All these classes of glycosphingolipid-binding protein, regardless of function, thus appear to utilize a comparable glycosphingolipid binding pocket. In contrast, the structure of the EGC·GM3 complex reveals a novel lipid binding topology, which most probably reflects the requirement for EGC to more intimately associate with both polar and apolar components of its substrates to facilitate endo-hydrolysis. It is also possible that this topology allows the association of the activator protein employed by this enzyme.
The EGC activator protein is considerably larger (
An interesting model for the interaction of an activator protein with its cognate enzyme is that deduced for the human The unique lipid-binding topology exhibited by EGC is manifested in the high degree of selectivity for lipid substrates with the correct head group structure. In the glycosynthase-catalyzed glycosylation of sphingolipids, a dramatic reduction in catalytic efficiency is observed when modified lipids are employed. The measured KM value for D-erythro-sphingosine was 40 µM, which was increased by 10-fold for the hydrogenated analog D-erythro-dihydrosphingosine, although kcat was roughly the same for both compounds, 0.1 s-1. When more substantial changes were introduced into the head group, the catalytic efficiency was dramatically reduced, making enzyme saturation impossible. When L-threo-sphingosine and its hydrogenated analog, L-threo-dihydrosphingosine, in which both stereocenters of the head group are inverted relative to D-erythro-sphingosine, were employed, a 2000-fold reduction in kcat/KM was observed. These observations illustrate the strict stereoselectivity of EGC toward the lipid head group. These restraints are consistent with the observed specific bond interactions and steric constraints imposed by the substrate-binding channel on the ceramide. Hydration of the double bond in D-erythro-sphingosine, giving phytosphingosine, proved to be even more catastrophic, resulting in a 10,000-fold reduction in kcat/KM. Combined, these structural and biochemical studies illustrate the essential adaptations of EGC from the typical family 5 scaffold. The broad glycan tolerance is explained by the wide and potentially flexible sugar-binding site; the narrow hydrophobic ceramide-binding channel enables intimate association with the ganglioside and thus endo-hydrolysis; and encapsulation of the lipid ensures close contacts with the lipid head group facilitating stereoselectivity. Furthermore, the structure permits speculation on the potential membrane and activator protein interactions that allow the enzyme to function on membrane-associated substrates. Importantly, the structural knowledge also opens the possibilities of rational design of EGC glycosynthase substrate specificity toward the synthesis of nonnatural, therapeutically useful glycosphingolipids.
The atomic coordinates and structure factors (code 2OSW, 2OSX, and 2OSY) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by Neose Technologies and by the Royal Society (United Kingdom), Howard Hughes Medical Institute, Canadian Institutes of Health Research, Natural Sciences and Engineering Research Council of Canada, and the Protein Engineering Network of Centres of Excellence. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 1 To whom correspondence should be addressed: Dept. of Biochemistry and Molecular Biology, University of British Columbia, 2350 Health Sciences Mall, Vancouver, British Columbia V6T 1Z3, Canada. Tel.: 604-822-0789; Fax: 604-822-5227; E-mail: natalie{at}byron.biochem.ubc.ca.
2 The abbreviations used are: EGC, endo-glycoceramidase II; Lac, lactosyl/lactoside; GM2AP, GM2 activator protein.
We thank Dr. Andrew Lovering and Dr. Paula Lario for assistance with data collection, Dr. Calvin Yip for static light scattering analysis, Dr. Bernard Henrissat for helpful discussion, and the staff at Beamline 8.2.2 of the Advanced Light Source (Berkeley, CA) for expert assistance. We also thank Neose Technologies for providing reagents.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||