|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 281, Issue 9, 6070-6078, March 3, 2006
The Crystal Structure of (S)-3-O-Geranylgeranylglyceryl Phosphate Synthase Reveals an Ancient Fold for an Ancient Enzyme*![]() 1![]() 2![]() ![]() ![]() ¶||13
From the
Departments of
Received for publication, August 24, 2005 , and in revised form, December 21, 2005.
We report crystal structures of the citrate and sn-glycerol-1-phosphate (G1P) complexes of (S)-3-O-geranylgeranylglyceryl phosphate synthase from Archaeoglobus fulgidus (AfGGGPS) at 1.55 and 2.0 Å resolution, respectively. AfGGGPS is an enzyme that performs the committed step in archaeal lipid biosynthesis, and it presents the first triose phosphate isomerase (TIM)-barrel structure with a prenyltransferase function. Our studies provide insight into the catalytic mechanism of AfGGGPS and demonstrate how it selects for the sn-G1P isomer. The replacement of "Helix 3" by a "strand" in AfGGGPS, a novel modification to the canonical TIM-barrel fold, suggests a model of enzyme adaptation that involves a "greasy slide" and a "swinging door." We propose functions for the homologous PcrB proteins, which are conserved in a subset of pathogenic bacteria, as either prenyltransferases or being involved in lipoteichoic acid biosynthesis. Sequence and structural comparisons lead us to postulate an early evolutionary history for AfGGGPS, which may highlight its role in the emergence of Archaea.
The membrane lipids found in Archaea are a defining characteristic of this domain of life (1). These lipids are based on a core architecture where branched-chain saturated hydrocarbons are connected to glycerol through ether linkages (2, 3). In hyperthermophiles, two diphytanylglyceryl units are often linked covalently through their hydrocarbon tails to form tetraether lipids that completely span the membrane. In addition, archaeal membrane lipids have three general characteristics that distinguish them from their bacterial and eukaryotic counterparts (4). First, the phospholipid backbone is built upon the opposite glycerol stereoisomer, sn-glycerol-1-phosphate (G1P),4 not the sn-glycerol-3-phosphate (G3P) backbone found in bacteria and eukaryotes. Second, the hydrophobic chains are isoprenoid derivatives instead of fatty acids. Third, the isoprenoid chains are bound to G1P through ether, not ester, linkages. Of these traits, the glycerol phosphate stereochemistry is the most distinctive because ether-linked lipids are known to exist in some eukaryotes and bacteria (5, 6), and phospholipid fatty acids have recently been described in Archaea (7). To date, however, there is no known exception to the G1P backbone stereochemistry of archaeal lipids or to the G3P backbone stereochemistry found in bacterial and eukaryotic lipids.
The biosynthesis of archaeal membrane lipids is schematically illustrated in (Fig. 1). In brief, dimethylallyl diphosphate (DMAPP) and its isomer isopentenyl diphosphate are synthesized by a mevalonate-like pathway (2, 8). Long isoprenoid chains are produced from these five-carbon precursors by consecutive condensations through the action of a prenyl diphosphate synthase. The committed step in archaeal lipid synthesis occurs with the formation of an ether linkage between G1P and an isoprenoid diphosphate, usually geranylgeranyl diphosphate (GGPP). Separate enzymes catalyze the sequential transfer of isoprenoid units onto the C-3 and C-2 hydroxyls of G1P, respectively (9). The enzyme catalyzing the first reaction, (S)-3-O-geranylgeranylglyceryl phosphate synthase (GGGPS), is a cytosolic protein that has previously been purified and enzymatically characterized from Methanobacterium thermoautotrophicum (9-11) and Thermoplasma acidophilum (12). The enzyme catalyzing the second prenyltransfer, (S)-2,3-di-O-geranylgeranylglyceryl phosphate synthase, is an integral membrane protein which has recently been cloned and purified from Sulfolobus solfataricus (13). In ensuing reactions the isoprenoid double bonds are reduced and various polar headgroups may be attached to the lipid. Synthesis of the aforementioned tetraether lipids still remains largely unexplored. GGGPS, the enzyme catalyzing the committed step in archaeal lipid synthesis, displays three major points of interest. First, it represents the branch point for membrane lipid biosynthesis in Archaea and imparts the unique G1P stereochemistry of the lipid backbone. Second, GGGPS belongs to a class of prenyltransferases that catalyzes the transfer of isoprenoid groups onto nonisoprenoid acceptors; this class includes protein prenyltransferases, DMAPP-tRNA transferase, DMAPP-AMP transferase, dimethylallyltryptophan synthase, and the recently characterized aromatic prenyltransferases (11, 14). Finally, GGGPS is a homologue of PcrB, a protein of unknown function that is conserved in some Gram-positive bacteria, including the human pathogens Bacillus anthracis, Staphylococcus aureus, and Listeria monocytogenes (8, 12, 15).
To gain more insight into the unique stereochemistry of the unusual reaction catalyzed by GGGPS, and to shed light onto the function of PcrB, we have determined crystal structures of GGGPS from the sulfate-reducing hyperthermophile Archaeoglobus fulgidus in complex with G1P and citrate. The present work reveals 1) an ancient fold for this deeply rooted enzyme and 2) establishes, for the first time, a prenyltransferase function on a TIM-barrel scaffold. Our structures further unveil 3) how the "standard phosphate-binding motif" is utilized in concert with conserved active site residues to select for the G1P substrate, 4) a long apolar cavity that bears the hallmarks of a hydrocarbon ruler, and 5) a unique change to the classical ( ![]() )8-TIM-barrel fold, which prompts us to present a hypothesis for the functional consequences of this adaptation.
Cloning, Purification, and CrystallizationThe coding sequence for AfGGGPS was amplified from genomic DNA (ATCC 49558D) and cloned into the NdeI and BamHI sites of the pET-15b expression vector. Native protein was expressed in Escherichia coli BL21(DE3), grown in LB media at 37 °C, and induced at A600 0.8 with 1 mM IPTG for 8-12 h. Selenomethionine (SeMet) protein was expressed similarly but in E. coli B834(DE3) and in M9 minimal media supplemented with 40 mg/liter SeMet. Cells were broken by sonication in Buffer A (50 mM Tris, pH 8.0, 100 mM NaCl, 5 mM imidazole, 10 mM -mercaptoethanol) supplemented with 16 µg/ml benzamidine, 10 µg/ml aprotinin, 10 µg/ml leupeptin, and 1 mM phenylmethylsulfonyl fluoride. Clarified lysate was passed over nickel-nitrilotriacetic acid-agarose (Qiagen), the resin was washed extensively with Buffer A, and proteins were eluted in Buffer A containing 500 mM imidazole. The His6-tag was removed by thrombin digestion ( 12 h) in the presence of 3 mM CaCl2. Proteins were passed over a Superdex 200 column (Amersham Biosciences) in Buffer A without imidazole, and peak fractions containing AfGGGPS were pooled. Proteins were further passed over a second nickel column, and the flow-through containing AfGGGPS was concentrated using a centrifugal filter device (Millipore, 10K MWCO). Finally, the buffer was exchanged to 20 mM Tris, pH 8.0, 100 mM NaCl, 1 mM Tris(2-carboxyethyl)phosphine hydrochloride by three rounds of concentration and dilution. This procedure typically yields 30 mg of AfGGGPS per liter of culture.
AfGGGPS crystallizes readily in the R3/H3 space group. Crystals were grown at 22 °C by the hanging drop method, and microseeding was employed for consistent crystal quality. Equal volumes of protein (20-30 mg/ml) and seed solution (100 mM citrate, pH 6.0, 10 mM MgCl2, 16% polyethylene glycol 4000) were mixed over a reservoir solution of 100 mM citrate, pH 6.0, 10 mM MgCl2, 4-10% polyethylene glycol 4000. For G1P-AfGGGPS soaks, 5-10 mM rac-G1P (Sigma) was added to solutions during cryoprotection. In attempts to co-crystallize the enzyme with GGPP, the addition of 80 mM n-octyl- Data Collection and Structure DeterminationTypically 180° of data were collected from randomly oriented crystals maintained at 100 K in a cold nitrogen stream. A three-wavelength multiple anomalous dispersion data set was collected at the National Synchrotron Light Source beamline X6A on a SeMet-substituted crystal. Native data sets were subsequently collected at the Advanced Photon Source BioCARS 14BM-D. All x-ray data were processed using the DENZO/SCALE-PACK software (16) and reduced, when required, using the program TRUNCATE (17). The AfGGGPS structure was initially determined using SOLVE (18). After applying RESOLVE (19), ARP/wARP (20) built the majority of the protein structure. All subsequent structures were solved by molecular replacement methods using CNS (21), and standard refinement protocols were implemented with CNS. Guided by composite omit maps, water and substrate molecules were added at late stages of refinement. The positions of all atoms were checked manually in the program O (22). The quality of the structural models was assessed by PROCHECK (23).
Although discussed in the main text, the AfGGGPS structure in the P31 space group has not been reported herein due to its relatively limited resolution (
Figures and ModelingAll structural figures were prepared from the G1P-soaked AfGGGPS structure using the programs MOLSCRIPT (24) and RASTER3D (25). The electrostatic surface representations shown in Fig. 3, A and B, were calculated using SPOCK (26). Prior to modeling (Fig. 3D), G1P, MPD, and two water molecules were removed from the G1P-soaked structure. A Mg2+-GGPP substrate was manually placed in the active site cleft. Annealing and energy minimization protocols were run in CNS (21), keeping the positions of all experimentally determined atoms fixed. G1P was added back to the structure with its initial, i.e. experimentally determined coordinates. GGGPS AssayTo avoid the use of radioactivity (9-12), we implemented an enzyme-coupled assay to detect the free pyrophosphate liberated by the GGGPS reaction. The PiPer pyrophosphate assay (Molecular Probes) was performed as described by the manufacturer in the "background management" setting. Absorbance was monitored at 565 nm, and all reactions were incubated at 37 °C in between time points. Different AfGGGPS concentrations were prepared in 50 mM Tris, pH 7.5, at fixed substrate concentrations: 25 µM rac-G1P, 25 µM GGPP (Sigma), and 1 mM MgCl2. The protein used as a negative control, orotidine 5'-monophosphate decarboxylase from M. thermoautotrophicum (MtODCase) (27), was purified as described above for AfGGGPS.
Overall StructureUnless otherwise stated, we will be describing the G1P-soaked AfGGGPS crystal structure. The GGGPS enzymes can be divided into two groups based on phylogenetic analysis (8, 12). The Group I GGGPS from A. fulgidus (AfGGGPS) presents a dimeric structure (Fig. 2, B and C). This is consistent with our results from gel filtration and the recent characterization of a Group II GGGPS from T. acidophilum (TaGGGPS) (12). The structure of AfGGGPS in the P31 space group confirms its dimeric nature,5 as does the crystal structure of its PcrB homologue from B. subtilis (BsPcrB; Protein Data Bank code 1VIZ). Considering these results, it is not immediately obvious from sequence alignments (Fig. 2A) how the Group II GGGPS from M. thermoautotrophicum (MtGGGPS) would form its reported higher order oligomer (10, 11). We conclude that AfGGGPS and most of its sequence homologues are dimeric proteins (Fig. 6A). The dimensions of an AfGGGPS dimer are 85 x 50 x 40 Å (Fig. 2, B and C). Each monomer buries 1,400 Å2, or 13%, of its total accessible surface area.
Based on the unique three-dimensional structures of the cis-, trans-, protein, and aromatic prenyltransferase families (14, 28), it is surprising to see AfGGGPS exhibit a TIM-barrel fold (29, 30). AfGGGPS displays a central eight-stranded parallel
The residues preceding
The AfGGGPS active site lies at the C-terminal end of its
The two active sites within the AfGGGPS dimer do not behave in an equivalent fashion, irrespective of the space group or ligand(s) bound. For example, inspection of the electron density maps in the G1P-soaked structure clearly shows a molecule of G1P bound in one active site, with a "mixture" of G1P and citrate found in the other. The relative active site occupancy also correlates with the rigidity of different parts of the AfGGGPS TIM-barrel. As indicated by the average temperature factors (Fig. 2C and Table 1), a well bound substrate molecule induces stabilization within the region surrounding
Stereoselective Binding of G1PG1P is found near the top inner rim of the In total, G1P forms 14 hydrogen bonds within the AfGGGPS active site: 4 directly to the backbone, 5 through side chains, and 5 mediated by water. While the C-1 carbon and O1P oxygen of G1P are "recognized" indirectly in this binding mode, AfGGGPS conforms to the "four-location model" of stereospecificity to discriminate between the enantiomers G1P and G3P (31). The same conclusion is reached in the context of the stereocenter recognition model for stereoselectivity (32), where binding of G1P is seen to involve two locations, one direction, and steric hindrance (i.e. a third "location"). With respect to the stereospecificity for G1P, it may be surprising that only two protein side chains are absolutely conserved between the GGGPS phylogenetic groups. Perhaps this is reconciled by the fact that Tyr165 and Glu167 simultaneously select for the stereochemistry that defines G1P (Fig. 4A). For completeness, we detail the substitutions that occur in MtGGGPS and TaGGGPS as they relate to the AfGGGPS active site. Ser169 and Asn216 are replaced by glycines, resulting in the loss of two hydrogen bonds to G1P (Fig. 4A). Additionally, Lys11 is changed to leucine, Glu64 to phenylalanine, and Glu122 to methionine (Fig. 2A); these substitutions will drastically alter the charge character of the active site in the Group II proteins. How these amino acid differences affect substrate binding, catalysis, and the release of product are issues that remain to be addressed.
GGPP: Fit for a GrooveOne prominent feature of the AfGGGPS structure is a deep cleft that runs across the top of its
None of our structures showed electron density that could be interpreted as a Mg2+ ion. To identify a potential binding site for the Mg2+-diphosphate moiety of GGPP, we considered the strictly conserved "active site" residues in Group I and Group II GGGPSs. In this context, Tyr124 is too far away from the C-3 hydroxyl of G1P to promote substrate binding and catalysis. However, the side chain of Asp13 lies in a pocket adjacent to G1P, where it forms hydrogen bonds to nearby waters. In fact, a small network of water molecules exists within this pocket. We also noticed that the position equivalent to Thr39 is always either a serine or threonine (Fig. 2A). This is significant because Thr39 is found juxtaposed to Asp13 (Fig. 4B). Together, Asp13, Thr39, the surrounding backbone carbonyls and water molecules are excellent candidates to ligate the Mg2+-diphosphate group of GGPP. This scenario is supported by the fact that the amino acids around Asp13 and Thr39 are two of the most highly conserved regions within the GGGPS proteins (Fig. 2A).
In the area surrounding 3*, binding of G1P also induces rigidification of the 2- 2 loop, i.e. residues Gly38-Glu45, including Thr39 (Fig. 2C). While this stretch of the polypeptide chain is more mobile in the citrate-bound structure, soaking with G1P results in much stronger electron density. It is tempting to speculate that binding of one substrate (G1P) would generate the proper "mold" for binding of the second substrate (GGPP), implying an ordered reaction mechanism. Components of CatalysisPoulter and co-workers (9-11) have interpreted their results on MtGGGPS in favor of an electrophilic alkylation reaction over a nucleophilic attack, which is consistent with the proposals for most other prenyltransferases (28, 35). The reaction mechanisms presented for the well characterized protein farnesyl- and type I geranylgeranyltransferases, however, contain both electrophilic and nucleophilic components (36, 37). As we have only obtained crystals of AfGGGPS in complex with G1P, a discussion of the interactions between the enzyme and GGPP is based on our model of a ternary complex. In this model the C-3 hydroxyl of G1P is directed toward the postulated Mg2+-diphosphate-binding site, where the C-1 of GGPP is appropriately situated to attack the C-3 oxygen of G1P (Fig. 3D). As such, we feel that the limited conservation between Group I and Group II GGGPSs considerably narrows the choice when pinpointing essential catalytic residues. A general feature of prenyltransferase catalysis is thought to be rupture of the carbon-oxygen bond in the isoprenoid diphosphate substrate, which generates a highly electrophilic allylic carbocation that subsequently alkylates a prenyl acceptor (35). Formation of the electrophilic intermediate is aided by the presence of an allylic double bond in the prenyl donor (9, 35). In AfGGGPS, several acidic residues within the active site generate a negatively charged environment (Fig. 3B), lowering the activation energy for creation of a carbocation. At the same time, a magnesium counterion will support the pyrophosphate leaving group. Asp13, an absolutely conserved residue in all GGGPS proteins (Fig. 2A), is well positioned to act as a ligand for a Mg2+ ion.
Glu167 is another residue with a potential role in catalysis. Its O
The Missing Helix: Implications of
In its rigid state, a striking network of ordered water molecules surrounds the
A case can be made for the functional significance of a greasy slide and "swinging door" in the AfGGGPS structure. As with many acyl- and prenyltransferase systems, GGGPSs may require a significant amount of detergent for optimal enzymatic activity (12). Moreover, the release of product is the rate-limiting step in some of these systems (36, 37, 41). Thus, because AfGGGPS turns an already hydrophobic substrate (GGPP) into an even more hydrophobic product (GGG1P), there are obvious implications for the release of GGG1P. As hydrogen bonds to the C-3 hydroxyl of G1P are abolished and anchoring of the geranylgeranyl chain by the Mg2+-diphosphate moiety is released during the reaction cycle, these events are likely propagated along the AfGGGPS structure. Probably linked through the
A Possible Function for PcrBPcrB is a homologue of AfGGGPS ( A DALI search (42) shows that the top structural alignments against AfGGGPS or BsPcrB are TIM-barrel proteins that are known to bind aromatic substrates.6 It is therefore tempting to speculate that the PcrB proteins might perform a function similar to the recently characterized class of aromatic prenyltransferases (14). Alternatively, because G1P is a component of the lipoteichoic acid polymers found in Gram-positive bacteria (43), it is possible that the PcrB proteins could play a role in these biosynthetic pathways.
Nature has clearly exploited the TIM-barrel structural scaffold with a plethora of enzymatic functions (29, 30). We now add a long chain prenyltransferase to the list (Fig. 6B). This result is particularly fascinating when considering the highly specialized structures and unique folds that are known for other prenyltransferases (14, 28). Given the complete restriction of GGGPS homologues to Archaea, the extremely narrow distribution of the GGGPS TIM-barrel prenyltransferase in Nature has two immediate implications. The generation of GGGPS function on a TIM-barrel fold seems to be a uniquely archaeal invention (8), and the limited occurrence of PcrB proteins within bacteria points to a horizontal gene transfer event from Archaea (8). In addition to representing the committed step in lipid synthesis, GGGPS function alone can provide all three distinguishing characteristics of archaeal phospholipids: the unique backbone stereochemistry, an isoprenoid hydrophobic chain, and an ether linkage. Since biological membranes presumably display a vertical line of inheritance, we believe that the evolutionary history of GGGPS reflects the emergence of Archaea.
Besides BsPcrB, a DALI search (42) reveals that AfGGGPS is most closely related to the FMN-dependent oxidoreductase and phosphate-binding (FMOP) family of TIM-barrel proteins (29). Specifically, HisF and HisA are listed at positions 1 and 17, respectively.6 This becomes intriguing because a large body of evidence indicates that HisA and HisF have evolved by 2-fold duplication and fusion of a (
The atomic coordinates and structure factors (code 2F6U and 2F6X (for the citrate and G1P complex of AfGGGPS, respectively)) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by the Canada Research Chairs Program and the National Sciences and Engineering Research Council of Canada (to E. F. P.). Use of the Advanced Photon Source was supported by the United States Department of Energy (DOE) under Contract W-31-109-Eng-38, and BioCARS was supported by the National Institutes of Health (NIH) under Grant RR07707. Use of the National Synchrotron Light Source was supported by the DOE under contract DE-AC02-98CH10886, and X6A was funded by the NIH under agreement Y1 GM-0080-03. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 Present address: Division of Cancer Genomics & Proteomics, Ontario Cancer Inst., MaRS Centre, Toronto Medical Discovery Tower, 101 College St., Toronto, Ontario, M5G 1L7, Canada.
2 Received an Overseas Fellowship and a Research Fellowship for Young Scientists from the Japan Society for the Promotion of Science. Present address: Dept. of Chemistry, Graduate School of Science, Kyoto University, Sakyo-ku, Kyoto 606-8502, Japan. 3 To whom correspondence should be addressed: Division of Cancer Genomics & Proteomics, Ontario Cancer Inst., MaRS Centre, Toronto Medical Discovery Tower, 101 College St., Rm. 5-358, Toronto, Ontario M5G 1L7, Canada. Tel.: 416-581-7545; Fax: 416-581-7545; E-mail: pai{at}hera.med.utoronto.ca.
4 The abbreviations used are: G1P, sn-glycerol-1-phosphate; G3P, sn-glycerol-3-phosphate; GGGPS, (S)-3-O-geranylgeranylglyceryl phosphate synthase; Af, Archaeoglobus fulgidus; Bs, Bacillus subtilis; Mt, Methanobacterium thermoautotrophicum; Ta, Thermoplasma acidophilum; TIM, triose phosphate isomerase; GGPP, geranylgeranyl diphosphate; MPD, 2-methyl-2,4-pentanediol; DMAPP, dimethylallyl diphosphate; SeMet, selenomethionine.
5 J. Payandeh, unpublished data.
6 J. Payandeh and E. F. Pai, unpublished observations.
J. P. is grateful for the continued support of family, friends, and Emily Cowan. We acknowledge suggestions provided by Drs. Ning Wu and Alex Ghetu. We thank the staff at the BioCARS beamlines of the Advanced Photon Source and the X6A beamline at the National Synchrotron Light Source for their time commitments and expert help.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||