|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 27, 19948-19957, July 6, 2007
Structures of Staphylococcus aureus D-Tagatose-6-phosphate Kinase Implicate Domain Motions in Specificity and Mechanism*![]() ![]() 1![]() 2
From the
Received for publication, February 20, 2007 , and in revised form, April 6, 2007.
High resolution structures of Staphylococcus aureus D-tagatose-6-phosphate kinase (LacC) in two crystal forms are herein reported. The structures define LacC in apoform, in binary complexes with ADP or the co-factor analogue AMP-PNP, and in a ternary complex with AMP-PNP and D-tagatose-6-phosphate. The tertiary structure of the LacC monomer, which is closely related to other members of the pfkB subfamily of carbohydrate kinases, is composed of a large /β core domain and a smaller, largely β "lid." Four extended polypeptide segments connect these two domains. Dimerization of LacC occurs via interactions between lid domains, which come together to form a β-clasp structure. Residues from both subunits contribute to substrate binding. LacC adopts a closed structure required for phosphoryl transfer only when both substrate and co-factor are bound. A reaction mechanism similar to that used by other phosphoryl transferases is proposed, although unusually, when both substrate and co-factor are bound to the enzyme two Mg2+ ions are observed in the active site. A new motif of amino acid sequence conservation common to the pfkB subfamily of carbohydrate kinases is identified.
In those bacteria that use lactose as a carbohydrate source, the disaccharide is usually either converted to glucose through the Leloir pathway (1) and then metabolized by glycolysis (2) or catabolized directly through the D-tagatose-6-phosphate pathway (3). Staphylococcus aureus, Staphylococcus epidermidis, and Staphylococcus hominis are the only organisms known to exclusively use enzymes of the D-tagatose-6-phosphate pathway to metabolize lactose and D-galactose (4). In S. aureus, D-galactose and lactose are imported and metabolized by proteins encoded by the lactose operon, lacABCDFEG, the gene products of which have been shown to be inducible by the addition of either D-galactose or lactose (5) or by D-galactose-6-phosphate (6). In fact, the Lac operon was the first discovered example of a group of genes under the control of an operator region to which a repressor (LacR) binds (7). Lactose is converted by transglycosylation into allolactose. This binds to the repressor, inhibits binding to the operator, and allows the transcription of mRNA for enzymes involved in D-galactose metabolism and transport across the membrane (8). In the D-tagatose-6-phosphate pathway (encoded by lacABCD), the first step carried out by a D-galactose-6-phosphate isomerase encoded by lacAB (9) converts D-galactose-6-phosphate into D-tagatose-6-phosphate. The second step is an ATP-dependent phosphorylation of D-tagatose-6-phosphate by D-tagatose-6-phosphate kinase (LacC)3 that yields D-tagatose-1,6-bisphosphate (Fig. 1). A divalent cation is required for this reaction with Mg2+ giving the optimum reaction rate (10). Finally D-tagatose-1,6-bisphosphate aldolase (LacD) cleaves D-tagatose-1,6-bisphosphate to produce glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (5, 11). The pathway is essential for the survival of S. aureus in lactose-rich media such as milk, because if it does not function properly, D-galactose accumulates and bacterial growth is inhibited (12). Interfering with this pathway may therefore be a useful strategy for eradicating infection by S. aureus. Such infections can be important in breast-feeding mothers (13) and are common in livestock where, if not treated, they cause damage to milk-secreting tissues affecting both the quality and quantity of dairy products (14). Structural information concerning the enzymes in the D-tagatose-6-phosphate pathway will play a key role in elucidating a mechanism for its inhibition. Currently, such information is only available for LacD (11).
On the basis of amino acid sequence comparisons, LacC belongs to the pfkB family of carbohydrate kinases, which form a subset of the ribokinase superfamily. Members of this superfamily include ribokinase (RbsK), the minor isoform D-fructose-6-phosphate kinase (PfkB), 1-phosphofructokinase (FruK) and inosine-guanosine kinase (15). A multiple sequence alignment of five members of the ribokinase superfamily including LacC from S. aureus (Fig. 2) shows regions of sequence conservation. Two of these (Motifs I and II) are conserved even in the sequence of human adenosine kinase (16) and represent the ribokinase superfamily signature. The crystal structure of Escherichia coli ribokinase indicates that these motifs are involved in substrate recognition, catalytic mechanism, and transition state stabilization (17).
We present the structure of LacC in two crystal forms. To solve the structure, it was necessary to incorporate two additional selenomethionine (SeMet) residues by site-directed mutagenesis. Fortuitously, the crystals present a number of different states of the enzyme, and the structures provide details of apoLacC, binary complexes with ADP, or the co-factor analogue adenosine 5'-(β, -imido) triphosphate (AMP-PNP, selected because it is a relatively stable triphosphate), and a ternary complex with AMP-PNP and D-tagatose-6-phosphate. The structures illustrate aspects of co-factor and substrate recognition and suggest a mechanism for phosphoryl transfer dependent on conformational changes. We also identify a new region of amino acid sequence conservation specific to the pfkB subfamily of carbohydrate kinases.
Cloning of LacC from S. aureus—The gene encoding tagatose-6-phosphate kinase (lacC) from S. aureus (GenBankTM accession number X14827 [GenBank] ) was amplified by PCR using the specific primers 5'-CAT ATG ATT TTG ACT TTG ACA TTA AAC C-3' (forward) and 5'-GGA TCC TTA CAC CTC TAA AAC TTC-3' (reverse). The forward primer carries an NdeI restriction site and the reverse primer a BamHI restriction site. The resulting PCR product was cloned into a pET15b expression vector (Novagen) and the resulting plasmid heat shock-transformed into E. coli strain BL21 (DE3) (Stratagene). For the L124M,L125M double mutant of S. aureus LacC, PCR primers containing the mutations (forward primer, 5'-TTT ATT AAA CAT TTT GAA CAA ATG ATG GAA AAA GTT GAA GCA GTT GCT-3'; reverse primer, 5'-AGC AAC TGC TTC AAC TTT TTC CAT CAT TTG TTC AAA ATG TTT AAT AAA-3' (mutations shown in bold)) were prepared using a QuikChange® site-directed mutagenesis kit (Stratagene), the PCR product cloned into a pET15b expression vector, and the resulting plasmid purified and sequenced to check the presence of the desired mutation. The plasmid was then heat shock-transformed into E. coli strain BL21 (DE3). Recombinant LacC was produced following similar protocols to those employed for 4-diphosphocytidyl-2C-methyl-D-erythritol kinase (18). After cultivation, cells were harvested and resuspended in 50 mM Tris-HCl, pH 7.7, containing 50 mM NaCl (buffer A). Cell lysis was performed with a cell disruptor at 4 °C, and the cell suspension was centrifuged at 20,000 x g for 30 min. The filtered supernatant was applied to a Hitrap metal-chelating column (Amersham Biosciences). The pure protein, eluted in buffer A and 150 mM imidazole, was dialyzed overnight at 4 °C against buffer A and then concentrated to 34 mg/ml. An aliquot of the pure apoform of the protein was injected onto a high resolution Superdex 200 HR 10/30 size exclusion column (Amersham Biosciences). The elution profile (data not shown) was compared with that of various protein size markers (obtained from Sigma Aldrich) and indicated that LacC forms a dimer in solution. To prepare the selenomethionyl derivatives of LacC and the L124M,L125M double mutant, BL21 (DE3) cells, heat shock-transformed with the pET15b/lacC S. aureus, were grown in 3 ml of LB supplemented with 100 µg/ml ampicillin and grown at 37 °C for 3 h. The culture was harvested and cells resuspended in 25 ml of M9 medium supplemented with 100 µg/ml ampicillin and allowed to grow for another 15 h at 37 °C. These cells were used to inoculate 1 liter of M9 medium. When the absorbance at 600 nm reached 0.56 at 37 °C, solid amino acid supplements were added to the culture as follows: 100 mg/liter L-lysine, L-phenylalanine, L-threonine, 50 mg/liter L-isoleucine, L-leucine, L-valine and 60 mg/liter SeMet. Following a delay of 15 min to allow inhibition of methionine synthesis, gene expression was induced with 1 mM isopropyl 1-thio-β-D-galactopyranoside and the culture left overnight. The same purification protocol described for the native enzyme was then followed. The final concentration of the stock solution in buffer A was 17.0 mg/ml.
Crystallization and Data Collection—The wild-type protein gave crystals that diffracted only to low resolution (
Data collected from crystal form I did not allow de novo structure solution. To achieve this, we produced crystals of the SeMet derivative of the L124M,L125M double mutant to enhance the anomalous dispersion effects due to selenium incorporation. Prior to crystallization, this protein was incubated with 5 mM substrate (D-tagatose-6-phosphate) and 5 mM AMP-PNP. Diffraction quality orthorhombic crystals (crystal form II) grew in the optimized crystallization condition 0.2 M magnesium acetate, 0.1 M sodium cacodylate, pH 6.5, 10% PEG 8000, 15% PEG 550, 10 mM ATP. Cryoprotection was achieved by soaking crystals in 0.2 M magnesium acetate tetrahydrate, 0.1 M sodium cacodylate, pH 6.5, 10% PEG 8000 supplemented with 10% glycerol, and 10% PEG 550. Diffraction data, measured on European Synchrotron Radiation Facility beam-line ID14-EH2, were integrated and scaled using XDS (19) and intensities merged using SCALA software packages. Intensities were then converted to structure factors and anomalous differences derived using the program TRUNCATE. See Table 1 for a summary of statistics. Crystal Form II—The structure of crystal form II was solved using the single wavelength anomalous dispersion technique. The programs XPREP (20) and SHELXD (21) were used to determine a substructure comprising 16 selenium sites from a tetramer in the asymmetric unit. The solvent content and Matthews co-efficient are 50% and 2.5 Å3/Da, respectively. SHELXE (22) was used to determine the correct enantiomorph of the substructure, and the program SHARP (23) was used to calculate phase probability distributions. This resulted in experimental phases with an average figure of merit of 0.245 to dmin = 2.1 Å resolution. The experimental phases were improved using the program DM by density modification using a combination of non-crystallographic symmetry averaging, histogram matching and solvent flattening. Initial non-crystallographic symmetry operators were obtained using the program PROFESS. The resulting phases coupled with the observed structure factors were then input into ARP/wARP (24), which placed 1144 glycine, valine, and serine residues in the asymmetric unit. The GuiSIDE module of CCP4i (25) then docked the correct side chains to these residues. Visual inspection of the electron density using O (26) identified model fragments belonging to each of the four monomers. Rounds of refinement using REFMAC5 interspersed with map inspections in O allowed a model of all four monomers in the asymmetric unit to be constructed. The careful placement of solvent molecules and active site ligands concluded the analysis with the refinement process being monitored using Rwork and Rfree (27). It was clear early in the map interpretation that there was significant deviation from the expected non-crystallographic symmetry, and therefore all subunits were treated independently. The subunits are labeled A-D. Crystal Form I—Subunit A from crystal form II, stripped of ligands and solvent molecules, provided the search model for molecular replacement calculations to solve the structure of crystal form I. Molecular replacement was carried out using the program PHASER. The asymmetric unit of form I also presents four subunits, again labeled A-D, and each was positioned and then refined as a rigid body using REFMAC5 with data between 20.0 and 2.0 Å. The resulting electron density map provided a basis for automatic model building (ARP/wARP), and a model consisting of 992 glycine residues was built. Similar protocols to those applied in the analysis of form II concluded the structure determination.
Summaries of the composition and geometry of the final models for both crystal forms along with refinement statistics are presented in Table 1. The stereochemistry of the final models was assessed using PROCHECK. Ramachandran plots (not shown) indicate excellent main chain geometry. However, in all subunits, Asn-233 has
Crystallographic Analyses—To solve the structure of LacC, we produced the SeMet variant of an L124M,L125M double mutant. This sample (crystal form II) provided an anomalous signal large enough to be exploited for structure determination. The choice of residues to mutate was guided by the knowledge that mutation of Leu to Met has little effect on protein structure (29), coupled with secondary structure predictions for LacC (data not shown) that indicated the residues concerned are located in an -helix. Our crystal structure analyses proved this prediction to be correct. Subsequently, molecular replacement methods allowed us to determine the structure of crystal form I. Both crystal forms have been refined to high resolution (Table 1). The asymmetric unit of each form contains four subunits organized as a pair of homodimers. A New Motif for the pfkB Subfamily—Among the sequences aligned in Fig. 2, there is clearly a previously unreported third region of amino acid conservation common to fructose phosphate kinases and LacC. This conserved region, Motif III, is common to the pfkB subfamily of carbohydrate kinases but not to members of the ribokinase superfamily of proteins in general and has the sequence Ser-Gly-Ser-Leu-Pro-Xaa-Gly. Motif III may thus be assigned as being the characteristic signature of the pfkB family carbohydrate kinases, and as will be described, this motif is specifically involved in the binding of the tagatose substrate.
The Overall Structure of LacC—The primary structure of S. aureus LacC consists of 310 amino acids. The tertiary structure of a LacC monomer is composed of two domains (Fig. 3a). A large
The LacC monomer structure is closely related to other members of the ribokinase superfamily. The common structural feature in this superfamily is the large
In common with other members of the ribokinase superfamily, LacC forms a dimer similar to that seen for E. coli ribokinase (Fig. 3b) (17). Two monomers associate via interactions between their lid domains, which come together to form a β-barrel structure known as a β-clasp. The two β-sheets of the lid domains pack face to face and are rotated The Active Site of LacC—Crystal form I was obtained by co-crystallization with ATP. However, only two molecules of ADP, bound to monomers BI and DI, were identified. Monomers AI and CI represent the structure of the apoform of the enzyme. That ADP, and not ATP, is found bound to monomers BI and DI should not be surprising. In the crystal structure of E. coli ribokinase (17), ADP is found in the co-factor binding site even though the protein was incubated with AMP-PNP (often used because of its stability) before crystallization. In that case, the authors speculated that AMP-PNP had hydrolyzed before binding to the protein. Given the much greater propensity of ATP to hydrolyze, it is likely that the same situation has occurred here.
The ADP binding site is on the surface of the /β domain constructed using residues from β11, the C terminus of β12 and the turn linking it to β13, the long loop linking β14 to 8, the N terminus of the latter, and the C terminus of 9. Aliphatic side chains form a binding pocket in which the adenine base is held in an anti-conformation stabilized only by hydrophobic interactions. The ribose O-2* interacts with the main chain carbonyl and ND2 of Asn-278. A well ordered water molecule mediates hydrogen bonds between the ribose O-3* and the amide and carbonyl groups of Gly-224 and Gly-227. The ribose O-3* atom also accepts a hydrogen bond from Asn-278 ND2. The -phosphate group interacts with Ser-222 OG and via a water molecule with Lys-38 NZ. The O-3A link between - and β-phosphate groups accepts a hydrogen bond from the main chain amide of Gly-253. The β-phosphate forms hydrogen bonds with the carbonyl group of Pro-249 (we assume that a phosphate O is protonated), the amide group of Gly-253, and (via water molecules) the amide groups of Gly-251 and Asp-254.
Crystal form II, from which the first experimental phases were actually determined, was obtained from protein incubated with 5 mM substrate (D-tagatose-6-phosphate) and a co-factor analogue (AMP-PNP). Here, positive difference electron density (mFo - DFc,
In monomer CII, two different moieties were bound in the active site. One is clearly tagatose-6-phosphate (Fig. 4a, right). The OMIT (mFo - DFc, calc) difference density (Fig. 4a, left) indicated that an initial model for the second moiety should be AMP-PNP with two hydrated cations bound to its phosphate tail. However, such a model resulted in significant negative difference density centered on the -phosphate group of the AMP-PNP, an indication that this model was not correct. Clean difference density maps were eventually obtained in the active site of subunit CII by modeling AMP-PNP and ADP, each with half occupancy, coupled with the presence of two fully occupied hydrated cations. One of the two cations binds between the - and β-phosphates of AMP-PNP and (when present) ADP. The second cation is bound between the β- and -phosphates of AMP-PNP or to the β-phosphate group of ADP. Further discussion of the structure of crystal form II of LacC assumes the presence of a fully occupied AMP-PNP. Fig. 4b shows where the ligands bind to the LacC subunit, and Fig. 5 shows the key interactions in the active site that will be discussed.
Based on coordination geometry, the cation bridging the
As described for ADP binding, the co-factor binding sites are located on the surface of the core domain (Fig. 4b), and interactions between the enzyme and the adenine base are conserved. However, interactions between the sugar/phosphate tail of the AMP-PNP/ATP and the protein are different from those seen for ADP binding to LacC (not shown).
When substrate is absent (monomers AII,BII,DII), the ribose O-2* of AMP-PNP interacts with the protein in the same way as when ADP is bound. However, the AMP-PNP ribose O-3* now accepts a hydrogen bond directly from Asn-278 ND2 and the previously observed solvent-mediated hydrogen bonds between the ribose O-3* and the protein are replaced by a direct interaction of this group with the carbonyl oxygen of Gly-227. This difference is due to a change in the position of the loop involving residues Ser-222 to Gly-227 which, when AMP-PNP rather than ADP is bound, is shifted toward the co-factor by
The majority of the hydrogen bonds and metal ion-co-factor interactions are maintained when AMP-PNP/ATP is bound in the presence of substrate (Fig. 5). However, in this case, a major difference is that a second Mg2+ ion bridges the In monomer CII, the D-tagatose-6-phosphate substrate binds to residues from both domains of LacC and to residues in the linker region (Figs. 4b and 5). The substrate O-1 group, the phosphorylation site, forms hydrogen bonds with Asp-254 OD2 and Lys-38 NZ, with the latter also donating a hydrogen bond to the D-tagatose-6-phosphate O-5 hydroxyl and interacting with the substrate phosphate via a water molecule (not shown). The substrate O-2 interacts with Asn-41 ND2, whereas its O-3 and O-4 groups form hydrogen bonds with Asp-12 OD2 and OD1, respectively. As well as the solvent-mediated interaction with Lys-38, the substrate phosphate also forms direct hydrogen bonds with the side chains of Arg-88 and (from the partner subunit) Arg-27, the side chain hydroxyl of Ser-136, and the amide of Gly-135. The dimeric nature of LacC is thus important to construct the active site, in particular to position the basic Arg-27 so that it can participate in substrate binding.
The Catalytic Mechanism of LacC—As described above, monomer CII represents the structure of LacC with both the substrate and co-factor simultaneously bound. The active site comprises residues from the lid domain, the linking fragment, and the /β domain. Amino acids in all three conserved motifs (Fig. 2) play a role in the binding of substrate and/or co-factor. In particular, and as also seen in the structure of ribokinase (17), the amino acids in Motif I form an -helix of which the N terminus and the side chain of Lys-38 point toward the substrate phosphorylation site. From Motif II, the main chain amides of Gly-251 and Gly-253 form hydrogen bonds with the -phosphate of the co-factor, and the side chain of Asp-254 points toward the substrate phosphorylation site. Ser-136 from Motif III donates a hydrogen bond to the substrate's phosphate group. This phosphate is present only in substrates processed by 1-phosphofructokinase, D-fructose-6-phosphate kinase, and D-tagatose-6-phosphate kinase, and this explains the conservation of this serine in these sequences only.
The catalytic center (Fig. 5) is located at the interface between the co-factor and substrate. The
This mechanism is similar to that proposed for other phosphoryl transferases. However, unlike other enzymes that carry out the reaction in an Mg2+-dependent manner (36-41), the crystal structures described here indicate that the active site of LacC contains two, rather than one, Mg2+. The cation that bridges the β- and Conformational Changes in LacC—LacC monomers adopt distinct conformations in which the relative orientations of the lid and core domains vary. Conformational changes upon substrate binding are a well known strategy for nucleotide-dependant transferases to position appropriate amino acids around the reacting partners (44). In the crystal structures described here, monomer CII is found in a fully closed conformation. But which domain moves when this conformation is formed?
The conservation of the distances ( The phenomenon of domain closure has been observed in many multidomain enzymes. In one such enzyme, phosphoglycerate kinase, complete domain closure may only be achieved with binding of both substrate and AMP co-factor (45). Such a situation also seems to be the case for LacC. However, domain closure is not common to all enzymes that possess a lid domain. Thus, the crystal structures of 2-keto-3-deoxygluconate kinase from Thermus thermophilus in complex with ATP or ATP and substrate revealed that only local structural (and not large scale conformational) changes are required to enable catalysis (46).
The conformational changes observed in monomer CII of LacC are also critical in allowing the recruitment of the Mg2+ ion that bridges the
The structure of LacC from S. aureus has been determined in two different crystal forms. The fold is characteristic of the ribokinase superfamily and consists of two domains, a core /β domain, and a lid domain comprising β-strands that is involved in dimer formation. These two domains are linked by unstructured polypeptides, which confer flexibility in the relative orientation of the two domains. As the dimer interface forms a rigid β-clasp structure, the conformational changes observed involve a movement of the core domain toward the lid domain. Domain closure has to be complete to bring substrate and co-factor close enough to react, to provide a fully formed substrate binding pocket and to allow the recruitment of a second Mg2+ ion that may be important in catalysis. Our study provides understanding of how LacC recognizes and binds its substrates and allows us to propose a mechanism for the catalysis of phosphoryl transfer. The structural information may, in addition, provide the basis for the design of a specific LacC inhibitor, which might have implications for human and animal health.
The atomic coordinates and structure factors (code 2JGV and 2JG1) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 Supported by the Wellcome Trust and the Biotechnology and Biological Sciences Research Council (Structural Proteomics of Rational Targets). 2 To whom correspondence should be addressed: Macomolecular Crystallography Group, European Synchrotron Radiation Facility, 6 Rue Jules Horowitz, 38043 Grenoble Cedex, France. Tel.: 33-4-76-88-23-94; Fax: 33-4-76-88-29-04; E-mail: leonard{at}esrf.fr.
3 The abbreviations used are: LacC, D-tagatose-6-phosphate kinase; AMP-PNP, adenosine 5'-(β,
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||