|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 51, 37112-37123, December 21, 2007
The Active Site of an Algal Prolyl 4-Hydroxylase Has a Large Structural Plasticity*![]() ![]() ![]() ![]() ![]() 1
From the
Received for publication, August 8, 2007 , and in revised form, October 16, 2007.
Prolyl 4-hydroxylases (P4Hs) are 2-oxoglutarate dioxygenases that catalyze the hydroxylation of peptidyl prolines. They play an important role in collagen synthesis, oxygen homeostasis, and plant cell wall formation. We describe four structures of a P4H from the green alga Chlamydomonas reinhardtii, two of the apoenzyme at 1.93 and 2.90Å resolution, one complexed with the competitive inhibitor Zn2+, and one with Zn2+ and pyridine 2,4-dicarboxylate (which is an analogue of 2-oxoglutarate) at 1.85Å resolution. The structures reveal the double-stranded β-helix core fold (jellyroll motif), typical for 2-oxoglutarate dioxygenases. The catalytic site is at the center of an extended shallow groove lined by two flexible loops. Mutagenesis studies together with the crystallographic data indicate that this groove participates in the binding of the proline-rich peptide-substrates. It is discussed that the algal P4H and the catalytic domain of collagen P4Hs have notable structural similarities, suggesting that these enzymes form a separate structural subgroup of P4Hs different from the hypoxia-inducible factor P4Hs. Key structural differences between these two subgroups are described. These studies provide first insight into the structure-function relationships of the collagen P4Hs, which unlike the hypoxia-inducible factor P4Hs use proline-rich peptides as their substrates.
Prolyl 4-hydroxylases (P4Hs)2 are 2-oxoglutarate dioxygenases that catalyze the hydroxylation of proline residues in peptide linkages, requiring also Fe2+, 2-oxoglutarate, O2, and ascorbate (1–3) (Fig. 1). The animal P4Hs have been grouped in two categories; collagen P4Hs (C-P4Hs), which have a central role in the synthesis of all collagens (2, 4), and hypoxia-inducible factor (HIF) P4Hs, which play a key role in the response of cells to hypoxia (5–6). A third class of P4Hs is found in plants, playing an important role in the synthesis of cell-wall glycoproteins (7–8).
The vertebrate C-P4Hs are
The catalytic domains of all P4Hs contain four conserved residues; a histidine and an aspartate in a His-X-Asp motif, which together with a second, distal histidine bind the Fe2+ ion, and a basic residue that binds the 5-carboxylate group of the 2-oxoglutarate (21). The structure of the catalytic domain of HIF-P4H-2 complexed with Fe2+ and a 2-oxoglutarate analogue, {[(4-hydroxy-8-iodoisoquinolin-3-yl)carbonyl]amino}acetic acid, was solved recently (15) and shown to contain an 8-stranded β-helix core fold (jellyroll motif, also referred to as the double-stranded β-helix fold) typical of the 2-oxoglutarate dioxygenases (3, 22). The structure of the C-P4H tetramer is currently unknown, but that of the peptide-substrate binding domain of the human C-P4H-I Detailed knowledge of the structure of the C-P4H catalytic domain would help in the rational design of potent inhibitors for the treatment of fibrotic diseases. The catalytic domain of HIF-P4H-2 is not a suitable model for this purpose because of its low amino acid sequence identity (14%, starting from HIF-P4H-2 Thr-185, Fig. 2) and the remarkably different substrate specificity. We report here on the determination of several apo and liganded structures of an algal P4H of C. reinhardtii (Cr-P4H-1) (8), which shows 26% sequence identity to the catalytic domain of C-P4H-I (starting from Cr-P4H-1 Val-29, Fig. 2), and which, like the C-P4Hs, uses proline-rich peptide-substrates (8). Furthermore, Cr-P4H-1 and the C-P4Hs are inhibited by both pyridine 2,4-dicarboxylate and pyridine 2,5-dicarboxylate, whereas the latter compound does not inhibit the HIF-P4Hs (1, 2, 8, 25). Therefore, the structure of Cr-P4H-1 seems important to gain a better understanding of the properties of the catalytic domain of the C-P4Hs. The Cr-P4H-1 structure identifies the residues involved in cosubstrate binding and reveals a groove formed by an extension after the second jellyroll β-strand. The catalytic site is in a pocket in the middle of this shallow groove. Mutagenesis studies and crystallographic data on the crystal contact interactions of loops of neighboring molecules in this groove also suggest that it is important for peptide-substrate binding.
Expression, Purification, and Analysis of Wild-type and Mutant Recombinant Cr-P4H-1 Polypeptides—Because NMR analysis of full-length Cr-P4H-1 (lacking the signal peptide and, thus, starting from Ala-17) with an N-terminal histidine tag (8) indicated that the N terminus of the polypeptide was disordered (data not shown), a truncated construct starting from Val-29 and containing a Met-His6-Met sequence at the N terminus (referred to as His6-Cr-P4H-1-V29) was generated, expressed in Escherichia coli, and purified to homogeneity as described previously (8). SeMet-labeling of His6-Cr-P4H-1-V29 was performed by expressing it in a methionine-requiring auxotrophic E. coli strain B384 (DE3) using 0.5 mM isopropyl 1-thio-β-D-galactopyranoside induction for 20 h at 20 °C. Mutations were introduced into the His6-Cr-P4H-1-V29 plasmid with a QuikChangeTM site-directed mutagenesis kit (Stratagene). The mutagenesis oligos were the following: Y134F, 5'-CTGCAGGTTCTGCACTTCCATGACGGTCAGAAG-3'+ 5'-CTTCTGACCGTCATGGAAGTGCAGAACCTGCAG-3'; Y168F, 5'-GTCACGATGCTCATGTTCCTGACCACGGTG-3'+ 5'-CACCGTGGTCAGGAACATGAGCATCGTGAC-3'; R93A, 5'-GACAGCGAGATCGCCACCAGCACCGGCAC-3'+ 5'-GTGCCGGTGCTGGTGGCGATCTCGCTGTC-3'; S95A, 5'-GAGATCCGCACCGCCACCGGCACCTGG-3'+ 5'-CCAGGTGCCGGTGGCGGTGCGGATCTC-3'; W99A, 5'-ACCAGCACCGGCACCGCGTTTGCTAAGGGCGAG-3'+ 5'-CTCGCCCTTAGCAAACGCGGTGCCGGTGCTGGT-3'; E127A, 5'-CTGGAGAACCATGCGGGGCTGCAGGTTC-3'+ 5'-GAACCTGCAGCCCCGCATGGTTCTCCAG-3'; Q130A, 5'-CCATGAGGGGCTGGCGGTTCTGCACTACC-3'+ 5'-GGTAGTGCAGAACCGCCAGCCCCTCATGG-3'; Y140A, 5'-CCATGACGGTCAGAAGGCTGAGCCACACTATGAC-3'+ 5'-GTCATAGTGTGGCTCAGCCTTCTGACCGTCATGG-3'; R161A, 5'-GAGCACGGCGGCCAGGCCGTGGTCACGATGCTC-3'+ 5'-GAGCATCGTGACCACGGCCTGGCCGCCGTGCTC-3'; W243F, 5'-TGGAGCGCCACCAAGTTCATCCACGTGGCCCCC-3'+ 5'-GGGGGCCACGTGGATGAACTTGGTGGCGCTCCA-3'; H245A, 5'-GCCACCAAGTGGATCGCCGTGGCCCCCATCGG-3'+ 5'-CCGATGGGGGCCACGGCGATCCACTTGGTGGC-3'. Because co-crystallization experiments of the His6-Cr-P4H-1-V29 with ligands failed, another construct having an N-terminal SUMO partner fused to the Gly-30 of Cr-P4H-1 (referred to as Cr-P4H-1-G30) was generated using the ChampionTM pET SUMO protein expression system (Invitrogen). The fusion protein was expressed and purified as described (8), after which the SUMO partner was cleaved by digestion overnight with SUMO protease (Invitrogen or LifeSensors Inc.) at 4 °C in 0.01 M Tris-HCl, 0.15 M NaCl, 0.1 M glycine, and 2 mM β-mercaptoethanol, pH 7.8. The sample was reapplied to a nickel nitrilotriacetic acid column after which the cleaved Cr-P4H-1-G30 obtained in the flow-through was applied to a SuperDex HR75 column by which the buffer was changed into the storage buffer of 0.01 M Tris-HCl, 0.1 M NaCl, 0.1 M glycine, pH 7.8. The catalytic properties of the purified Cr-P4H-1 polypeptides were determined using poly-L proline, Mr 5000, as a substrate, as described previously (8). Crystallization—Crystals were grown using vapor diffusion methods with drops made by mixing equal volumes of protein (dissolved in storage buffer) and well solution (Table 1). The SeMet-labeled His6-Cr-P4H-1-V29 produced bipyramidal crystals (referred to as SeMet(apo) crystals). These crystals were used to solve the initial apo structure of Cr-P4H-1 (referred to as SeMet(apo) crystal form). Needle-like crystals, referred to as the needle(apo) crystals, were also obtained with the non-labeled construct by using 0.24 mM N-octanoylsucrose (Hampton Research) as an additive (Table 1). Subsequent co-crystallization experiments with Zn2+ and pyridine 2,4-dicarboxylate (Sigma) were performed with the Cr-P4H-1-G30 construct, resulting in a third crystal form referred to as Zn(complex) (Table 1).
Crystal Structure Determination and Validation—The SeMet(apo) and needle(apo) crystals were soaked for 1 min in a solution containing 20% glycerol, 2 M (NH4)2SO4, 2% polyethylene glycol 400, 0.1 M triethanolamine, pH 7.5, and subsequently flash-frozen in liquid nitrogen. A three-wave-length multiple-wavelength anomalous dispersion dataset to 2.8 Å resolution was collected from a SeMet(apo) crystal using synchrotron radiation at the BW7A beamline, EMBL, DESY, Hamburg. An additional high resolution dataset was collected from a second SeMet(apo) crystal to 1.93 Å resolution. The dataset for the needle(apo) crystal was collected at the high-throughput macromolecule beamline ID14-3, European Synchrotron Radiation Facility (ESRF), Grenoble. In the case of the Zn(complex) crystal form, the cryo solution consisted of 20% glycerol, 2 M (NH4)2SO4, 0.1 M Tris, pH 8.5, 2 mM ZnSO4, and 2 mM pyridine 2,4-dicarboxylate. An 20-s soak in the cryoprotectant before freezing the crystal in liquid nitrogen resulted in the optimal diffraction pattern and allowed data collection to 1.85 Å at the X12 beamline, EMBL, DESY, Hamburg. All the datasets were processed using XDS (26). The data collection statistics are summarized in Table 1. The initial phaseset (2.8 Å resolution, average figure-of-merit being 0.69) was obtained from the SeMet(apo) multiple-wavelength anomalous dispersion dataset using SOLVE (27), and the initial model was constructed using RESOLVE (28). The data were subsequently processed with programs of the CCP4 package (29). The model was further refined using the data to 1.93 Å resolution, with REFMAC5 (30) and the iterative model building programs O (31) and COOT (32). The needle(apo) and Zn(complex) crystal forms were solved by molecular replacement using Phaser (33) and molecule A of the SeMet(apo) as a model. Both structures were refined using REFMAC5 (30), and the models were completed using COOT (32). In most of the apo subunits there was a strong electron density in the 2-oxoglutarate pocket. This density could not be modeled as a water molecule, but a chloride ion placed at this position refined to normal B-values. The presence of a chloride ion also agrees with the geometry of this site, with distances to the surrounding atoms of Nz (Lys-237) and Og1 (Thr-178) being 3.1 and 3.0 Å, respectively. The final refinement statistics for all three crystal forms are shown in Table 1. Non-crystallographic symmetry restraints were used in the refinement of the needle(apo) crystal form.
The structures were validated using COOT (32), WHATIF (34), and Molprobity (35). The figures were prepared using ICM (MOLSOFT, LLC) (see Fig. 5, A and B) and SwissPDB-viewer (36). For the structure superimpositions with the program O (31), the central residues Val-131—Leu-132, His-143—Asp-145, Leu-166—Met-167, Thr-178—Val-179, Leu-202—Ala-203, Leu-210—Met-211, His-227—Gly-228, and Lys-237—Trp-238 of the Cr-P4H-1 jelly roll β-strands were used. The superimpositions with HIF-P4H-2 (PDB code 1G19) and clavaminate synthase (CAS) (PDB codes 1GVG and 1DRY) resulted in root mean square differences for the corresponding C
Determination of the Cr-P4H-1 Structures—Four structures of N-terminal-truncated Cr-P4H-1 were obtained, two of the apoenzyme from two crystal forms referred to as the SeMet(apo) and needle(apo), one of a binary complex with the competitive inhibitor Zn2+, and one of a ternary complex with Zn2+ and pyridine 2,4-dicarboxylate (Tables 1 and 2). These two complex structures were obtained from the third crystal form referred to as the Zn(complex) form (Tables 1 and 2). The SeMet(apo) structure was determined by the multiple-wave-length anomalous dispersion phasing method and was used to solve the needle(apo) and the Zn(complex) crystal forms by molecular replacement.
The SeMet(apo) crystal form was refined at 1.93 Å resolution, whereas the needle(apo) and Zn(complex) forms were refined at 2.90 and 1.85 Å, respectively (Table 1). Each crystal form has multiple copies per asymmetric unit, representing altogether four conformational states differing with respect to the presence of ligands and the conformation of some of the active site loops (Table 2). The cores of each conformational state superimpose well, with the root mean square deviations of C atoms in a pairwise comparison being 0.3–0.5Å. Analysis of each of the structures by means of Molprobity (35) shows that there are no Ramachandran outliers (Table 1). The SeMet(apo) form has four similar molecules per asymmetric unit, of which molecule D is best defined, whereas the needle(apo) form has three similar molecules per asymmetric unit, of which A is best defined (Table 2). The Zn(complex) form has two molecules of different conformation per asymmetric unit; molecule A (the ternary complex) has the competitive inhibitor pyridine 2,4-dicarboxylate in the 2-oxoglutarate-binding pocket and a Zn2+ ion at the iron-binding site, whereas molecule B (the binary complex) has only the bound Zn2+ (Table 2). The ternary complex is the best defined structure in terms of its corresponding electron density map and will mainly be used for the subsequent description of the overall fold.
The Overall Structure—Cr-P4H-1 consists of 13 β-strands and 3
The Flexible Loops—The complete polypeptide chain is well defined in the ternary complex, and there are no loops with high B-factors. In contrast, both apo structures and the binary complex have two major regions with fragmented or missing electron densities; the β3-β4 hairpin loop (Val-75—Ser-95) and the βI-βII loop (His-135—Lys-139). The only apo structure for which the latter region could be built is molecule A of the needle(apo) form (Fig. 3C). Moreover, the whole βII, the His-X-Asp motif (His-143/Asp-145), and the extended βII-βIII loop (Tyr-146–Gly-158) are not seen in molecules A, B, and C of the SeMet(apo) form. When ordered, the βII-βIII loop can exist either in an open conformation, as seen in the SeMet(apo) (molecule D) and needle(apo) forms and the binary complex, or adopt a closed conformation folding back over the catalytic site as seen in the ternary complex (Fig. 3C). The tip of the loop, Gly-154, moves by 19 Å between the open and closed conformations. Interestingly, the open conformation of the βII-βIII loop in the binary complex and the needle(apo) structure is due to binding of a loop of a neighboring molecule in the groove formed between the open βII-βIII loop and the strands β5, βI, and βII (Table 2). The Catalytic Site—The catalytic site is best defined in the binary and ternary complexes with bound Zn2+ and with Zn2+ and pyridine 2,4-dicarboxylate, respectively (Figs. 3A and 4). The positions of the Zn2+ ion and the pyridine 2,4-dicarboxylate molecule are clearly defined by the electron density maps. Zn2+ and pyridine 2,4-dicarboxylate are competitive P4H inhibitors with respect to Fe2+ and 2-oxoglutarate, respectively (2, 25, 37), with Ki values for Cr-P4H-1 of 10 and 70 µM (8). Zn2+ takes the position of Fe2+ and is bound to His-143 (the proximal histidine) and Asp-145 of the His-X-Asp motif of βII and His-227 (the distal histidine) of βVII, with bond distances of 2.1, 2.1, and 2.2Å, respectively (Fig. 4). These are the only three protein-Zn2+ interactions in the binary complex, whereas pyridine 2,4-dicarboxylate provides an additional interaction with the Zn2+ in the ternary complex via a bidentate coordination through its 1-carboxylate moiety (2.1 Å) and the N atom (2.2 Å) (Fig. 4). The coordination of Zn2+ by these five ligands in the ternary complex is approximately octahedral, with the sixth site (opposite the proximal His-143) empty. The inhibitor is positioned so that its 5-carboxylate moiety is bound in a deeply buried pocket inside the jellyroll core. This moiety forms four direct hydrogen bonds with the side chains of Tyr-134 (2.9Å) of βI, Thr-178 (2.7Å) of βIV, and Lys-237 (2.8Å) and Ser-239 (3.3Å) of βVIII (Fig. 4). Lys-237 is a conserved basic residue corresponding to Lys-493 in the human C-P4H-I, which has been shown by site-directed mutagenesis to bind the 5-carboxylate of 2-oxoglutarate (21). Hydrogen bonds are also formed between the 1-carboxylate of the inhibitor and the side chains of Thr-241 (2.7Å) and Trp-243 (3.3Å) of βVIII and a water molecule (2.9 Å) (Fig. 4). The only apolar residues close to the bound inhibitor are Tyr-140 and Leu-166. Tyr-140 covers the binding pocket by a stacking interaction with the aromatic ring of the inhibitor. Interestingly, in the binary complex with no bound inhibitor Tyr-140 has flipped outwards and points to the bulk solvent, similar to the situation in molecule D of the SeMet(apo) structure (Fig. 3C and Table 2). At the back of the molecule, the pocket for binding of the 5-carboxylate moiety is sealed off from the bulk solvent by the hydrophobic side chains of Tyr-168 of βIII, Leu-200, Leu-202, and Pro-204 of βV, and Leu-210 of βVI. Residues from the N-terminal part of Cr-P4H-1, such as Val-43 and Leu-45 of β1 and Leu-53 of β2, further extend this cluster of hydrophobic residues at the back of the molecule. The outer rim of the 2-oxoglutarate binding pocket, near the catalytic site, is shaped by Thr-241 and Trp-243 of βVIII, Gln-130 and Leu-132 of βI, Arg-93 of β5, Tyr-140 of βII, and the metal ion binding residues (Fig. 5, A and B).
In the needle(apo) structure the side chains of the three iron binding residues superimpose well on those in the binary and ternary complexes (Fig. 3C). It appears that in this crystal form a loosely bound water molecule has replaced the Zn2+, having distances of 3.0, 3.1, and 2.7Å to His-143, Asp-145, and His-227, respectively. In contrast, in the SeMet(apo) form the iron binding residues are poorly ordered with the exception of the distal histidine, His-227 (βVII). The metal binding residues His-143 and Asp-145 of βII are only visible in subunit D but have high B factors. The two iron binding histidines point inwards, whereas Asp-145 points away from the catalytic site leading to a highly irregular structure (Fig. 3C) which is different from those seen in the binary and ternary complex. Unlike the metal ion binding residues, all the 2-oxoglutarate binding residues superimpose well when the apo structures are compared with the binary and ternary structures. The 2-oxoglutarate binding pocket is filled with water molecules and a chloride ion (Table 2). The chloride ion is hydrogen-bonded to Lys-237 and Thr-178, thus replacing an oxygen of the 5-carboxylate moiety of pyridine 2,4-dicarboxylate. The other 5-carboxylate oxygen atom is replaced by a water molecule, which is hydrogen-bonded to Tyr-134 and Ser-239 in the apo structures. The Putative Peptide Binding Groove—The βII-βIII loop and the β3-β4 hairpin loop shape a long shallow groove that stretches from Trp-99 (β5) to βII. The shape of this possible peptide binding groove is illustrated in Fig. 5, A and B. Its center, near the catalytic site, is dominated by Trp-243, which is covered by the Arg-161–Glu-127 salt bridge on the bulk solvent side and by the side chains of Gln-130 and Thr-241 on the bulk protein side. Adjacent to Gln-130 and Thr-241 (above in the standard view of Fig. 5B) this peptide binding pocket is shaped by residues of βI (for example, Leu-132) and βII (for example Tyr-140). Farther above the Trp-99/Trp-243 center are the side chains of Arg-93 and Ser-95, which are located immediately after the β3-β4 hairpin loop. Arg-93 is stacked to the Tyr-140 side chain, the hydroxyl group of which points into the peptide binding groove. The Tyr-140 side chain is also stacked to the catalytic His-143 side chain. On the bulk solvent side, below the Trp-99/Trp-243 center (Fig. 5B), Glu-127 is also hydrogen-bonded to His-245, which is subsequently hydrogen-bonded to the main chain oxygens of Gly-157 and His-159 of the βII-βIII loop. These hydrogen bonds do not exist in the ternary complex with the closed βII-βIII loop, but instead there are hydrogen bonds between Arg-93 and Gln-130 and the closed βII-βIII loop. Due to the crystallographic contacts, the putative peptide-substrate binding groove in the needle(apo) and the binary complex is occupied by a loop region of a neighboring molecule. The modes of binding of the polypeptide chain in this groove in the needle(apo) and binary complex are different from each other and also different from the mode of binding of the closed βII-βIII loop of the ternary complex. In all these complexes, however, hydrogen bonds are formed between the polar side chains adjacent to the Trp-99/Trp-243 center and the peptide bound in the groove. Mutagenesis Studies—Based on the structural analyses, we selected for mutagenesis studies nine solvent exposed and two buried residues, predicted to be involved in peptide-substrate and 2-oxoglutarate binding, respectively (Table 3). Analysis of the purified mutant enzymes by circular dichroism (CD) spectropolarimetry confirmed that these mutations had no substantial effects on the secondary structures and thermal stabilities of the variants as compared with the wild-type enzyme (data not shown). The structures suggested that tyrosines 134 and 168 might be involved in 2-oxoglutarate binding, and their mutations to phenylalanine indeed increased the Km values for 2-oxoglutarate 6- and 3.5-fold, respectively, whereas they did not affect the Km for poly(L-proline) (Table 3). Mutations of Arg-93, Ser-95, Tyr-140, Arg-161, and His-245 to alanines totally inactivated the enzyme (Table 3), in line with the proposal that these residues may have a crucial role in peptide binding. Likewise, the E127A mutation reduced the enzyme activity so dramatically that it was impossible to determine reliable kinetic constants. Mutations of the proposed peptide binding residues Trp-99 and Gln-130 to alanine and Trp-243 to phenylalanine led to about 2.5- and 1.8-fold increases and no increase in the Km for poly(L-proline), respectively, and about 3.5-, 2-, and 12-fold decreases in kcat (Table 3). These surface mutations did not affect the Km for 2-oxoglutarate.
Comparison with Structural Homologues—The structures of 13 2-oxoglutarate dioxygenases have now been characterized (Refs. 15, 22, 38, and 39 and the present data). Cr-P4H-1 is one of the smallest members of this superfamily. The closest structural homologue of Cr-P4H-1 is the catalytic domain of HIF-P4H-2 (15), with a Z-score of 15 for the DALI comparison (40) between these two structures. The DALI Z-scores with the other 2-oxoglutarate dioxygenases range from 14 (alkylated DNA repair enzyme (AlkB)) (38) to 7 (CAS) (41). The substrates of most of these enzymes are small molecules, whereas those of Cr-P4H-1, HIF-P4H-2, and the HIF asparaginyl hydroxylase (factor-inhibiting HIF (FIH)) are polypeptides and that of the (AlkB) is a polynucleotide (15, 38, 42–44).
All superfamily members have the conserved eight-stranded jellyroll core fold (Fig. 6), but in addition they can have extensions or domains at the N terminus, C terminus, or within the core between the βIV and βV (22). The longest N-terminal and C-terminal extensions are observed in HIF-P4H-2 and proline 3-hydroxylase, respectively (Fig. 6) (15, 45). The highest sequence similarity within the superfamily, apart from the four catalytically critical residues, is found in the βV-βVI region (Fig. 6). In all these structures there is a tight turn between βV and βVI, and the hydrophobic residues in this region are well conserved. βV and βVI are the edge strands of the jellyroll core fold, and this region defines the bottom of the 2-oxoglutarate binding pocket (as shown in Fig. 3, A and B). The high sequence conservation suggests that this region may be important for proper folding and stability of the jellyroll framework. A unique feature of the Cr-P4H-1-fold is the disulfide bridge (Cys-195—Cys-230) between the extended βIV-βV loop and βVII of the jellyroll core (Fig. 3A). In HIF-P4H-2, as in four other members, this loop region is very short (only 5 residues), whereas in Cr-P4H-1 it has 18 residues and forms a small helix (
The jellyroll core fold, the β2- and β5-strands, and the two N-terminal helices of Cr-P4H-1 and HIF-P4H-2 superimpose well (Fig. 7), but large deviations exist at the N terminus and C terminus and in some of the loop regions. The crystallized HIF-P4H-2 starts with an -helix, whereas the corresponding Cr-P4H-1 region is a β-strand (β1). Also the C-terminal tail of HIF-P4H-2 is longer and adopts a helical conformation. These structural differences as well as the longer 3-βI loop in HIF-P4H-2 ( 2-βI loop in Cr-P4H-1) and the longer βVI-βVII and βIV-βV loops in Cr-P4H-1 are not close to the catalytic site (Fig. 7). However, two important differences in loop structures are present near the putative peptide binding region. Closest to the catalytic site is the longer extension after βII (βII-βIII loop) in Cr-P4H-1, which is a very short turn in HIF-P4H-2 (Figs. 2 and 7). The second difference near the catalytic site concerns the β3-β4 hairpin loop, which has a different conformation in the ternary structure of Cr-P4H-1 than in the corresponding HIF-P4H-2 structure (Fig. 7). It is noteworthy that this region is fully disordered in the apo structures and the binary structure of Cr-P4H-1, indicating high structural plasticity. The Catalytic Site—Cr-P4H-1 is one of the few 2-oxoglutarate dioxygenases that has been crystallized in two apo forms without any bound ligand as well as in a binary complex with a bound metal and a ternary complex with a bound metal and a 2-oxoglutarate analogue (Table 2). The disorder in the Fe2+ binding region in the absence of bound metal ion, in the SeMet(apo) form, highlights the importance of the metal ion for obtaining the correct geometry of a competent catalytic site. In the ternary structure the N-moiety of the pyridine 2,4-dicarboxylate (corresponding to the 2-oxo moiety of 2-oxoglutarate, Fig. 1) is bound to the metal ion opposite to the carboxylate group of Asp-143 in the His-X-Asp motif (Figs. 4 and 8, A and B). This geometry is seen in most of the other structures of corresponding complexes of the superfamily members. A geometry where the 1-carboxylate is bound opposite to the distal histidine is seen in a subset of superfamily members, for example in CAS, cephalosporin synthase and phytanoyl-CoA-2 hydroxylase (22). In this subset of enzymes the putative oxygen-binding site is proposed to be opposite to the histidine of the His-X-Asp motif. This is best established for CAS. In the complex of CAS with 2-oxoglutarate, Fe2+, and nitric oxide (NO), the NO is bound to this site mimicking the binding of O2 (46). Interestingly, in CAS the binding of NO induces a small rearrangement in the mode of 2-oxoglutarate binding (46). Without NO the 1-carboxylate moiety is bound to the iron opposite the proximal histidine, whereas in the presence of NO it moves up, more opposite to the distal histidine (Fig. 8B). These observations in CAS suggest that 2-oxoglutarate may also be bound somewhat further up, more opposite to distal His-227 in the actual active complex of Cr-P4H-1 with bound O2. In CAS, the shifted position of the 1-carboxylate moiety replaces a water molecule, whereas in Cr-P4H-1 such an adjustment in the mode of 2-oxoglutarate binding would require a structural rearrangement of the Tyr-140 side chain. Interestingly, this tyrosine adopts a completely different conformation in the Cr-P4H-1 binary complex (Fig. 3C), pointing away from the catalytic site. The structural homology between Cr-P4H-1 and the other superfamily members suggests that the oxygen-binding site in Cr-P4H-1 is opposite to the proximal His-143, near the Asp-145 and Trp-243 side chains in an otherwise hydrophobic pocket that is also shaped by Thr-164, Leu-166, Phe-212, and Thr-241 (Figs. 4 and 8, A and B). Each of the latter residues protrudes out of the major sheet, whereas each of the three Fe2+ binding residues protrudes out of the minor sheet. In the ternary complex this pocket is empty, but in the HIF-P4H-2 complex structure the sixth coordination site of iron (trans to the proximal histidine) is occupied by a water molecule (Fig. 8A). The situation is different in the binary structure of Cr-P4H-1, which has no bound pyridine 2,4-dicarboxylate. In this complex the architecture of the catalytic site residues coordinating Zn2+ is the same, but the coordination sites occupied by the pyridine 2,4-dicarboxylate atoms are now occupied by two loosely bound water molecules, 2.0 and 3.0 Å away from the Zn2+ ion. No water is bound in the putative oxygen binding pocket opposite the proximal His-143 in this structure.
The 2-Oxoglutarate-binding Site—Although pyridine 2,4-dicarboxylate is a known P4H inhibitor, no 2-oxoglutarate dioxygenase structures complexed with it have been reported so far. Its 5-carboxylate moiety (Fig. 1) is bound deep inside the 2-oxoglutarate binding pocket and is salt-bridged to Lys-237 and hydrogen-bonded to Tyr-134, Thr-178, and Ser-239 (Fig. 4). Each of the hydrogen-bonding residues is a common 2-oxoglutarate binding residue in the superfamily (Fig. 6). Lys-237 (βVII) corresponds to an arginine in all other superfamily members except for FIH, in which case Lys-214 of another β-strand (βIV) is the corresponding residue (42). In some superfamily members (such as HIF-P4H-2 and CAS but not Cr-P4H-1), a buried water molecule also provides a hydrogen-bonding partner for the 5-carboxylate moiety (Fig. 8, A and B). The positioning of this moiety with respect to the jellyroll core is indeed similar in each superfamily member, as shown for Cr-P4H-1, HIF-P4H-2, and CAS in Fig. 8, A and B. This highlights the conserved, anchoring mode of binding of the 5-carboxylate moiety. The Y134F mutation increased the Km of Cr-P4H-1 for 2-oxoglutarate 6-fold and reduced the kcat about 20-fold, whereas Y168F increased the Km for 2-oxoglutarate 3.5-fold and reduced the kcat about 2-fold (Table 3). These data show that Tyr-134, which is directly hydrogen-bonded to the 5-carboxylate moiety, is more critical for catalytic efficiency than Tyr-168, which is linked to the inhibitor only by indirect hydrogen bonding (Fig. 4). The two polar residues involved in hydrogen bonds with the 1-carboxylate moiety of pyridine 2,4-dicarboxylate, Thr-241 and Trp-243 (Fig. 4), are located near the entrance of the 2-oxoglutarate binding pocket, which is in the center of the extended peptide binding groove (see below). These two residues, protruding out of βVIII, are also found in HIF-P4H-2, but no hydrogen bonds are formed between them and the bound inhibitor in the HIF-P4H-2 crystal structure because the structure of the inhibitor is different, missing the 1-carboxylate moiety (Fig. 8A). However, as these residues are conserved in HIF-P4H-2 and are well aligned with Thr-241 and Trp-243 of Cr-P4H-1, they are likely to be hydrogen-bonded to the 1-carboxylate moiety of 2-oxoglutarate in HIF-P4H-2 as well. The structural superimposition shows that the inhibitor bound to HIF-P4H-2 cannot become bound in the same mode to Cr-P4H-1, because the entrance of its 2-oxoglutarate binding pocket is narrower and more polar due to Leu-132 and Gln-130 (Ala-301 and Met-298 in HIF-P4H-2, respectively) (Fig. 8A). Gln-130, Thr-178, and Lys-237 of the 2-oxoglutarate binding pocket are conserved in C-P4Hs but not in HIF-P4H-2 (Figs. 2 and 8A), indicating that the 2-oxoglutarate binding pocket of algal and C-P4Hs is uniquely different from the binding pocket of the HIF-P4H-2.
The Putative Peptide Binding Groove—The extended peptide binding groove in Cr-P4H-1 stretches from Trp-99 (β5) to βII (Fig. 5, A and B) and is further shaped by the long βII-βIII loop and the β3-β4 hairpin loop together with the Arg-93 side chain located immediately after the latter loop. The corresponding regions of HIF-P4H-2 are different. In HIF-P4H-2 the βII-βIII loop is much shorter and the β2-β3 hairpin loop points in a different direction from the corresponding β3-β4 hairpin loop in Cr-P4H-1 (Figs. 2 and 7). It has been suggested that the β2-β3 hairpin loop may have a role in the substrate specificity of HIF-P4Hs (15). Arg-93 is conserved in HIF-P4H-2 (Arg-252), but points away from the catalytic site, whereas in Cr-P4H-1 it points toward the catalytic site in the center region of the putative peptide binding groove, interacting with the side chains of Ser-77 and Ser-95. Ser-77, which is exposed on the surface, is at the beginning of β3 and the somewhat buried Ser-95 is part of β5 (Fig. 5B). Ser-95 is conserved in the C-P4H Mutation of Trp-99 in the peptide binding groove to alanine increased the Km for poly(L-proline) about 2.5-fold and reduced the kcat about 3.5-fold (Table 3). The W243F mutation had no influence on the Km for poly(L-proline) but reduced the kcat about 3.5-fold (Table 3). The polar residues Glu-127, Gln-130, Tyr-140, and Arg-161 surrounding these aromatic residues were also found to be important for catalytic function. Mutation of Glu-127, Tyr-140, or Arg-161 to alanine led to complete or essentially complete inactivation, whereas the Q130A mutation led to similar but less severe changes in the catalytic properties to those brought about by the W99A mutation (Table 3). Each of the mutated polar side chains is probably important for substrate binding and are indeed hydrogen-bonded to the bound peptide from the neighboring molecule in the needle(apo) structure. The role of His-245 may also concern the stabilization of the open conformation of the βII-βIII loop via two hydrogen bonds to the main chain oxygens of this loop. This suggestion is supported by the inactivation seen by the H245A mutation (Table 3). The importance of this histidine is also highlighted by the fact that it is crucial for the catalytic activity of the human C-P4H-I (21).
Concluding Remarks—The structural analysis of Cr-P4H-1 combined with its mutagenesis data provide strong evidence that the protein surface near Trp-243 is the central region of a peptide binding groove. It is seen that Cr-P4H-1 and the catalytic domain of C-P4Hs form a separate structural subgroup distinct from HIF-P4H-2. Each of the nine residues of the predicted peptide binding surface that were selected for mutagenesis is also conserved in each of the C-P4H
The atomic coordinates and structure factors (code 2V4A, 2JIJ, and 2JIG) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by a EU-BIOXHIT grant (to R. K. W.), by the Academy of Finland Grants 200966 (to R. K. W.), 115124 (to M. K. K.), and 203574, 200965, and 211128 (to J. M.), and by the Sigrid Ju 1 To whom correspondence should be addressed: P. O. Box 3000, University of Oulu, FIN-90014 Oulu, Finland. Tel.: 358-8-5531199; Fax: 358-8-5531141; E-mail: rik.wierenga{at}oulu.fi.
2 The abbreviations used are: P4H, prolyl 4-hydroxylase; C-P4H, collagen P4H; HIF, hypoxia-inducible transcription factor; Cr-P4H-1, C. reinhardtii P4H type 1; His6-Cr-P4H-1-V29, Cr-P4H-1 construct starting from Val-29 and containing the N-terminal His tag; Cr-P4H-1-G30, Cr-P4H-1 construct starting from Gly-30; SeMet, selenomethionine; SeMet(apo), apo crystal form of Cr-P4H-1 obtained with SeMet-labeled His6-Cr-P4H-1-V29 construct; needle(apo), apo crystal form of Cr-P4H-1 obtained with His6-Cr-P4H-1-V29 construct; (Zn)complex, liganded crystal form of Cr-P4H-1 obtained with Cr-P4H-1-G30 construct; CAS, clavaminate synthase; FIH, factor-inhibiting hypoxia-inducible factor 1; SUMO, small ubiquitin-like modifier.
We thank the staff of the beamlines BW7A and X12 at the EMBL, DESY, especially Drs. Andrea Schmidt and Manfred Weiss. For the data collection at the ESRF we thank Dr. Xavier Thibault and colleagues involved in the EU-BIOXHIT project for the use of the high-throughput data collection facilities of beamline ID14-3. We thank Ville Ratas and Merja Nissilä for excellent technical assistance.
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||