|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 281, Issue 40, 30175-30185, October 6, 2006
Structure-based Functional AnnotationYEAST ymr099c CODES FOR A D-HEXOSE-6-PHOSPHATE MUTAROTASE*![]() ![]() ![]() ![]() ![]() 1
From the
Received for publication, May 10, 2006 , and in revised form, July 12, 2006.
Despite the generation of a large amount of sequence information over the last decade, more than 40% of well characterized enzymatic functions still lack associated protein sequences. Assigning protein sequences to documented biochemical functions is an interesting challenge. We illustrate here that structural genomics may be a reasonable approach in addressing these questions. We present the crystal structure of the Saccharomyces cerevisiae YMR099cp, a protein of unknown function. YMR099cp adopts the same fold as galactose mutarotase and shares the same catalytic machinery necessary for the interconversion of the and anomers of galactose. The structure revealed the presence in the active site of a sulfate ion attached by an arginine clamp made by the side chain from two strictly conserved arginine residues. This sulfate is ideally positioned to mimic the phosphate group of hexose 6-phosphate. We have subsequently successfully demonstrated that YMR099cp is a hexose-6-phosphate mutarotase with broad substrate specificity. We solved high resolution structures of some substrate enzyme complexes, further confirming our functional hypothesis. The metabolic role of a hexose-6-phosphate mutarotase is discussed. This work illustrates that structural information has been crucial to assign YMR099cp to the orphan EC activity: hexose-phosphate mutarotase.
Despite the overwhelming impact of systematic genome sequencing on the understanding of the biology and evolution of organisms, enormous gaps are present in our current knowledge of the function of proteins. On the one hand, it is well known that many of the newly generated protein sequences cannot be assigned with a precise function. On the other hand, a significant portion of biochemical functions experimentally determined are still not linked to a protein sequence. The awareness of the latter point, which is not well recognized, presents an important challenge. For instance, systematic analysis of the well structured and much used Enzyme Commission (EC) data base revealed that more than 39% of the experimentally identified enzyme activities (corresponding to 1529 EC numbers) are not associated with protein sequences in major public databases (1, 2). There is clearly an urgent need to fill this gap in our biochemical knowledge, and an organized combined bioinformatics and experimental effort will be needed to bridge it (3). Structural genomic endeavors, through systematic approaches at all experimental steps of the structure determination process, to fill the gap between protein sequence and structure space. Apart from being the main source of novel fold discovery, structural genomics may also generate valuable information on proteins whose function was not identified before (4, 5). It is certain that a considerable fraction of these targets is endowed with enzymatic activities, and the identification of the biochemical function of these proteins offers a high value structural genomics spin-off. In our yeast structural genomics project we have focused on non-membrane proteins of unknown fold (6). Our initial target list consisted of a mix of proteins with known and unknown functions. In this article, we demonstrate how biochemical investigations based on initial structure determination may clearly define biochemical function. YMR099cp is a 34-kDa protein that is predicted from Psi-Blast sequence analysis to adopt an aldose-1-epimerase fold, which does not share significant homology with proteins whose biochemical function was characterized. However, no biochemical function could be deciphered from these observations. We report here on the crystal structure of YMR099cp and show that it adopts the galactose mutarotase fold (11). Analysis of the putative active site suggested that its catalytic machinery may perform a mutarotation reaction but that YMR099cp probably has different substrate specificity from the well studied galactose mutarotase with which it shares the same fold. Our present biochemical and biophysical investigation of the catalytic activity indicate that YMR099cp corresponds to the glucose-6-phosphate 1-epimerase (or mutarotase) previously identified in S. cerevisiae, using biochemical experiments, but whose gene has not been identified since (12-15). Our results highlight how structural genomics may suggest biochemical experiments that allow for determination of the precise function of a target protein.
Cloning, Expression, and PurificationThe YMR099c open reading frame was amplified by PCR using the genomic DNA of the S. cerevisiae strain S288C as a template. An additional sequence coding for a His6 tag was introduced at the 3'-end of the gene during amplification. The PCR product was then cloned into a derivative of pET9 vector (Stratagene). Expression was done at 37 °C using the transformed Escherichia coli Gold (DE3) strain and 2xYT medium (BIO101 Inc.) supplemented with kanamycin at 50 µg/ml. Cells were harvested by centrifugation, resuspended in 30 ml of 20 mM Tris-HCl, pH 7.5, 200 mM NaCl, 5 mM -mercaptoethanol, and stored overnight at -20 °C. Cell lysis was completed by sonication. The His-tagged protein was purified on a Ni-nitrilotriacetic acid column (Qiagen Inc.) followed by gel filtration on a SuperdexTM200 column (Amersham Biosciences).
Glucose-6-phosphate Epimerase Activity MeasurementsBecause of the specificity of glucose-6-phosphate dehydrogenase for Fluorescence Measurements for Binding of LigandsAll phosphorylated sugars used for these measurements were purchased from Sigma. Excitation was performed at 295 nm and emission scanned from 300-500 nm with a Cary Eclipse fluorospectrophotometer (Varian). Measurements were performed at 20 °C with 1 µM YMR099cp in 20 mM Tris-HCl, pH 7.5. Successive aliquots of ligands (from 0 to 20 mM) were added to the protein. Binding of ligands to YMR099cp was quantified as a difference of tryptophan fluorescence at 350 nm as a function of ligand concentration. Experiments were performed in triplicate. NMR SpectroscopyAll NMR spectra were recorded in 99% D2O with a Bruker DRX400 instrument (400 MHz), with ligands and protein concentrations of 20 mM and 25 µM, respectively. Standard Bruker software was used to acquire and process the NMR data. 1D-1H NMR was acquired with 16,000 data points and 16 scans. Two-dimensional NOESY was recorded with 256 experiments of 2048 data points and 8 scans per t1 experiment and a mixing time of 200 ms. Sine-squared-bell apodization was applied to spectra in both dimensions and processed in States-TPPI mode, followed by symmetrization.
Crystallization and Resolution of the StructureCrystallization trials were performed at 18 °C. Crystals for the apo form of the protein were grown from a mixture in a 1:1 ratio of 20 mg/ml protein solution in 20 mM Tris-HCl, pH 7.5, 200 mM NaCl, and 10 mM
Diffraction data were collected from a flash-cooled crystal at 100 K on beamline ID23-1 at the European Synchrotron Radiation Facility (ESRF, Grenoble, France). The apoprotein crystallized in space group P21212 with cell dimensions: a = 44.9 Å, b = 74.2 Å, c = 106.5 Å, corresponding to one molecule per asymmetric unit and a solvent content of 55%. Image processing, data reduction, and scaling were carried out using the XDS package (16). The structure of the Haemophilus influenzae Hi1317 protein (PDB code 1JOV; 26% sequence identity) was used as search model for molecular replacement trials using the MolRep program in the 20-4-Å resolution range (17). This initial model was automatically rebuilt using the Arp/wARP program using data to 1.7-Å resolution (18). The resulting model was improved by careful analysis of the 2Fo - Fc and Fo - Fc electron density maps using the molecular graphics program TURBO-FRODO. The refinement of this model was further carried out with the program REFMAC in the 20-1.7-Å resolution range (19). The final model for the apostructure includes all the residues from Pro2 to Lys288, 330 water molecules, 1 sodium, and 2 sulfate ions, one Hepes molecule, and one glycerol molecule from the cryoprotection solution. The sugarbound structures (Glc6P and Gal6P) crystallizing in a different space group (C2221) were solved by molecular replacement and refined to 1.6 Å. Models were constructed for the Glc6P and Gal6P YMR099cp complexes for residues Pro2-Glu289. Both complexes contain one hexose 6-phosphate sugar and in the case of Gal6P a barium ion. The Glc6P and Gal6P complex models were completed with 253 and 378 water molecules, respectively. The statistics on data collection and refinement are provided in Table 1. The atomic coordinates and structure factors for the YMR099c native protein and the Glc6P and Gal6P complexes have been deposited into the Brookhaven Protein Data Bank under the accession numbers 2CIQ, 2CIR, and 2CIS, respectively.
Structure of the YMR099cpThe crystal structure of the S. cerevisiae gene product YMR099cp (297 amino acids) has been solved using the molecular replacement method using the structure of a protein of unknown function (H. influenzae Hi1317; PDB code: 1JOV) and refined to 1.7-Å resolution (Table 1). All 2Fo - Fc electron density maps are of excellent quality, except for the N-terminal methionine and 9 C-terminal amino acids that have not been built because they were not defined in electron density. One copy of YMR099cp is present in the asymmetric unit, confirming gel filtration elution profile that clearly indicates a monomeric state of the protein in solution (data not shown). YMR099cp is made of a single globular domain of approximate dimensions 37 x 52 x 53 Å3. Its structure can be described as a -sandwich made of 22 -strands and two short -helices (Fig. 1A). Strands are organized in four antiparallel -sheets arranged in two parallel layers. The first one is made of three sheets: S1 composed of -strands 1-4, S2 (strand order: 5, 10, and 21), and S4 (strand order: 11, 20, 19, 16, 13, and 14). The second layer is formed by the sole 9-stranded -sheet S3 (strand order: 6- 9, 22, 18, 17, 12, and 15). The two helices 1 and 2 are facing toward the same side of the monomer and are part of the linkers connecting strands 7to 8 and 19 to 20. This fold has already been described for galactose mutarotases (RMSD of 2.2-2.6 Å over 240 C positions; 15-17% sequence identity (11, 20, 21)), domain 5 of -galactosidase (RMSD of 3.1 Å over 190 C positions, 7% sequence identity (22)) and the N-terminal domain of maltose phosphorylase (RMSD of 3.4 Å over 164 C positions, 13% sequence identity (23)). Structural similarity is also shared with proteins of unknown function: Caenorhabditis elegans C01B4.6 gene product (RMSD of 2.41 Å over 235 C positions, 13% sequence identity; PDB code: 1LUR
[PDB]
) and S. cerevisiae YNR071cp (RMSD of 2.67 Å over 243 C positions, 12% sequence identity; PDB code: 1YGA).
Comparison with Galactose MutarotasesThe crystal structures of YMR099cp and of the H. influenzae Hi1317 protein (which provided the model used for molecular replacement) confirm the Psi-Blast prediction that they belong to the aldose-1-epimerase family. The best structurally and biochemically characterized member of this family is galactose mutarotase, an enzyme that catalyzes the conversion of / -D-galactose (24). This is the first step of the Leloir pathway, which in most organisms converts -D-galactose to the more metabolically useful glucose 1-phosphate.
Surface projection of sequence conservation among YMR099cp-related proteins has highlighted a highly conserved pocket whose floor is made by sheet S4 and walls by helix
Functional StudiesThese structural observations led us to test YMR099cp for hexose 6-phosphate epimerase activity. For this purpose, we have used an enzyme-coupled assay. It consists in following the reduction of NAD(P) ( = 340 nm) resulting from glucose-6-phosphate dehydrogenase activity, which specifically converts -Glc6P (but not the form) into D-glucono-1,5-lactone 6-phosphate (Fig. 3A). The formation of -Glc6P from a racemic mixture has been investigated by stopped-flow at various YMR099cp concentrations (Fig. 3B), which revealed two distinct phases. The first one corresponds to the initial burst of -Glc6P because of its presence in the racemic mixture and hence is independent on the YMR099cp concentration. In the second phase, faster generation of NAD(P)H reflecting higher velocity constant (k+1) for the interconversion between glucose 6-phosphate anomers is obtained with increasing concentration of YMR099cp, demonstrating the glucose-6-phosphate epimerase activity of the protein (Fig. 3B, inset). In our experiments, YMR099cp accelerates the reaction two to three times more than previously observed in the same conditions by Wurster and Hess (Fig. 3B, inset; Ref. 12). This probably reflects the general improvement in the quality (i.e. purity and stability) of a protein sample extracted and purified within a few hours from an overexpressing E. coli strain, compared with the one of a protein sample obtained from a wild-type yeast lysate using a long and tedious purification protocol (12).
Unfortunately, this enzymatic assay could not be used to test other hexose 6-phosphate substrates because glucose-6-phosphate dehydrogenase exhibits a high degree of specificity for
We have tested both Man6P and Gal6P as mutarotase substrates of YMR099cp using NMR techniques. Man6P and Gal6P differ from Glc6P by the orientation of the hydroxyl group at position C2 and C4, respectively (Fig. 4). First, sugar binding has been monitored using a 1H-1D reference spectrum and a one-dimensional saturation difference (STD) technique (31). The 1H-1D reference spectra recorded for these different ligands (present at 20 mM) in the absence or presence of 25 µM YMR099cp show a specific line broadening for the peaks corresponding to both the and anomeric protons upon enzyme addition. This indicates that YMR099cp binds to both anomeric forms from Glc6P, Gal6P, and Man6P (Fig. 5). This was confirmed by the STD spectra recorded for these three sugars (data not shown). Second, enzyme catalyzed anomeric interconversion was investigated using two-dimensional nuclear Overhauser effect spectroscopy (NOESY) as previously described for E. coli RbsD, a protein with mutarotase and pyranase activities on D-ribose (32). Exchange cross-peaks between and Glc6P anomers could only be observed in the presence of YMR099cp enzyme (Fig. 5A). These cross-peaks are due to a fast YMR099cp-catalyzed interconversion between and Glc6P, thus confirming the glucose-6-phosphate mutarotase activity measured by enzyme-coupled assay. In addition, this validates that NOESY experiments are useful to screen for putative YMR099cp substrates. NOESY exchange cross-peaks between anomers were also observed on Gal6P and Man6P, when mixed with YMR099cp (Fig. 5, B and C). This clearly shows that YMR099cp also has mutarotase activity on Gal6P and Man6P sugars.
Altogether, these functional data clearly demonstrate that YMR099cp catalyzes the interconversion between the and anomers from at least three hexose 6-phosphate sugars (Glc6P, Gal6P, and Man6P). In addition, the velocity constant measured for the YMR099cp-catalyzed anomerization of to Glc6P (k+1 = 62.8 min-1 with 1.72 µM YMR099cp) and the affinity constant determined for Glc6P (Kd = 200 µM) are comparable to those of the glucose-6-phosphate epimerase initially described by Wurster and Hess (12-14) in identical experimental conditions (k+1 = 32 min-1 with 1.72 µM enzyme and Kd = 55 µM and 144 µM for and anomers, respectively). Complexes with Hexose 6-Phosphate SugarsTo get a better understanding of the substrate preference of YMR099cp, we have solved the structures of its complexes with Glc6P and Gal6P to 1.6-Å resolution (Fig. 6). Virtually no conformational changes are observed between the free and complexed YMR099c (RMSD value of 0.14-0.22 Å between the various structures).
For Glc6P, the 2Fo - Fc electron density map demonstrated that only the
In the unbound structure of YMR099cp, the strictly conserved His159 and Glu264 perfectly superpose with the catalytic residues identified in L. lactis galactose mutarotase (His170 and Glu304, see above). Structural and enzymatic studies on L. lactis enzyme mutants bound to galactose have unambiguously shown that His170 acts as the active site acid by protonating the O5 sugar ring oxygen (distance between His170 N
Surprisingly, the 1.6-Å resolution structure of the YMR099cp Gal6P complex unambiguously revealed that it is the form of tagatose 6-phosphate (Tag6P), the Gal6P isomer (Fig. 6B), which is present in the active site. As observed for Glc6P, the Tag6P phosphate moiety interacts with the Gln81, Arg57, and Arg86 side chains from the arginine clamp and the furan ring packs on Trp238. The Tag6P O3, O4, and O5 atoms are H-bonded to Asp203 O 2, His82 N 2, and Gln81 N 2, respectively. The Tag6P O1 atom interacts with His159 N 2 and Glu264 O 1. Finally, a barium ion (present as counter ion of Gal6P used in the co-crystallization drop) is coordinated to Tag6P O4 and O5 atoms. The presence of Tag6P in the enzyme active site was not expected because the starting material corresponds to the pyran form of galactose 6-phosphate as confirmed by 1H NMR analysis (data not shown). Hence, small amounts of Tag6P (undetectable by NMR) should be present in commercial Gal6P as observed for fructose 6-phosphate in Glu6P salts. Clearly, crystallization of YMR099cp in the presence of Gal6P is strongly selective for the Tag6P-containing complex. Fluorescence measurements have shown that YMR099cp has higher affinity for Fru6P (Kd of 114 µM), the Glc6P isomer, than for its pyran form. Fru6P and Tag6P differ only in the orientation of the hydroxyl group at position C4 (Fig. 4). By analogy, we can speculate that Tag6P, the Gal6P isomer, binds tighter to YMR099cp than Gal6P does (Kd of 0.985 mM), explaining why it is exclusively present in the crystal.
Many enzymes involved in sugar metabolism are specific for the or anomer of their substrate and therefore sugar anomerization equilibrium plays a role in the general metabolic fluxes of the cell. For example, galactokinase specifically phosphorylates -galactose at position 1 to produce -D-galactose-1-phosphate (33), while glucose dehydrogenase only uses -D-glucose as substrate (34). Although interconversion between the and anomers of these sugars occurs spontaneously in solution, the in vivo rate may be too low to support fast energy generation by metabolic pathways (the rates for spontaneous mutarotation of glucose and glucose-6-phosphate are 0.015 min-1 and 0.09 min-1, respectively (35)). Hence, aldose-1-epimerases (or mutarotases; EC 5.1.3.3
[EC]
) increasing the rate of the anomerization are required. The most studied enzyme of this family, galactose mutarotase catalyzes the first of the four steps of the Leloir pathway which converts -D-galactose (one of the lactose degradation products with -D-glucose) into the metabolically more useful glucose 1-phosphate (24, 36). This first step consists of the interconversion of -D-galactose to -D-galactose, which is then phosphorylated by galactokinase to yield -D-galactose 1-phosphate. The latter is transformed into UDP-glucose by galactose-1-phosphate uridylyltransferase to produce glucose 1-phosphate and UDP-galactose. Finally, UDP-galactose 4-epimerase regenerates UDP-glucose from UDP-galactose. In human, mutations in the genes encoding for either of the four enzymes involved in the Leloir pathway result in galactosemia, a rare but potentially lethal disease leading to cataract formation and liver dysfunction (37, 38).
In recent years, the galactose mutarotase catalytic mechanism has been dissected by the resolution of the crystal structures of the L. lactis enzyme bound to different substrates combined with site directed mutagenesis studies (11, 25, 26). A two-layered In conclusion, it has been recently reported that 40% of the enzymatic activities described by EC numbers are not associated with any protein sequence in major public databases. In this article we demonstrate that a systematic structural approach can help to fill this gap in favorable cases. We show here that the S. cerevisiae ymr099c gene product codes for an enzyme that corresponds to a hitherto orphan EC number (EC 5.1.3.15 [EC] ). The definition of the physiological role of the enzyme will need further exploration. Hopefully, this will explain why YMR099c belongs to the 1% of the genes regulated by the transcription activator GCN4 that, to a vast majority, encode for proteins involved in amino acid and nucleotide metabolism (7). Similarly, it remains to be determined if the co-purification of YMR099cp with TRZ1, a tRNA 3' processing endonuclease responsible for a 3' trailer from precursor tRNA (8-10), as any biological relevance or if this is an artifact.
The atomic coordinates and structure factors (code 2CIQ, 2CIR, and 2CIS) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by grants from the Ministère de la Recherche et de la Technologie (Programme Génopoles). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 1 To whom correspondence should be addressed. Tel.: 33-1-69-15-31-55; Fax: 33-1-69-85-37-15; E-mail: Herman.Van-Tilbeurgh{at}ibbmc.u-psud.fr.
2 The abbreviations used are: Gal6P, D-galactose 6-phosphate; Glc6P, D-glucose 6-phosphate; RMSD, root-mean-square deviation; NOESY, nuclear Overhauser effect spectroscopy.
We thank Dr. Y. Mechulam for help with stopped-flow measurements and data analysis, Dr. B. Vauzeilles for fruitful discussion, and Dr. M. Hervé for providing us with the hexose 6-phosphate sugars.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||