Molecular Insights into the Biosynthesis of the F420 Coenzyme*

Coenzyme F420, a hydride carrier, is found in Archaea and some bacteria and has crucial roles in methanogenesis, antibiotic biosynthesis, DNA repair, and activation of antitubercular compounds. CofD, 2-phospho-l-lactate transferase, catalyzes the last step in the biosynthesis of F420-0 (F420 without polyglutamate), by transferring the lactyl phosphate moiety of lactyl(2)diphospho-(5′)guanosine to 7,8-didemethyl-8-hydroxy-5-deazariboflavin ribitol (Fo). CofD is highly conserved among F420-producing organisms, and weak sequence homologs are also found in non-F420-producing organisms. This superfamily does not share any recognizable sequence conservation with other proteins. Here we report the first crystal structures of CofD, the free enzyme and two ternary complexes, with Fo and Pi or with Fo and GDP, from Methanosarcina mazei. The active site is located at the C-terminal end of a Rossmann fold core, and three large insertions make significant contributions to the active site and dimer formation. The observed binding modes of Fo and GDP can explain known biochemical properties of CofD and are also supported by our binding assays. The structures provide significant molecular insights into the biosynthesis of the F420 coenzyme. Large structural differences in the active site region of the non-F420-producing CofD homologs suggest that they catalyze a different biochemical reaction.

The coenzyme F 420 is a hydride carrier that is found in Archaea and in high GϩC Gram-positive bacteria such as Streptomyces and Mycobacterium. This coenzyme is essential for energy metabolism in methanogenic Archaea, which involves the conversion of CO 2 , several other one-carbon compounds, and acetate to CH 4 (1)(2)(3)(4). In Streptomyces, F 420 is involved in the biosynthesis of tetracycline, lincomycin, and other natural products (5)(6)(7)(8)(9). In some cyanobacteria, a precursor of F 420 is a cofactor in DNA photolyases for DNA repair (10,11). In Mycobacterium tuberculosis, an F 420 -dependent glucose-6-phosphate dehydrogenase is required for the reductive activation of a series of nitroimidazo-oxazine compounds for their antitubercular effects (12)(13)(14).
The biosynthesis of F 420 uses lactate, 4-hydroxyphenylpyruvate (an intermediate in tyrosine biosynthesis), and a pyrimidine intermediate in riboflavin biosynthesis as precursors (1, 18 -21). The last step in the biosynthesis of F 420 -0 is the condensation of Fo and lactyl(2)diphospho-(5Ј)guanosine (LPPG), catalyzed by the enzyme 2-phospho-L-lactate transferase (CofD) (Fig. 1A) (20,21). Lactyl(2)diphospho(5Ј)adenosine can also be used as the donor of the LP group, but the enzyme is much less efficient with this substrate (30-fold higher K m and 14-fold lower k cat values) (20). The catalysis by CofD requires Mg 2ϩ ions, and the enzyme can also catalyze the phosphorylation of Fo using GTP, GDP, PP i , or PPP i as the donor (20).
The amino acid sequence of CofD is well conserved among archaeal and bacterial organisms that are known to produce F 420 (Fig. 1B). Weak sequence homologs (ϳ20% identity) are also found in some bacteria (such as Bacillus and Escherichia coli) that do not produce F 420 (Fig. 1B), but the biochemical functions of these homologs are not known (22). CofD is a member of pfam family UPF0052. There are 381 sequences in * This work was supported by Protein Structure Initiative of the National Institutes of Health Grants P50 GM062413 and U54 GM074958, National Science Foundation Grant MCB 0231319, the CNRS, and the French Foundation for Medical Research. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. this pfam family, including eukaryotic homologs. Besides these, the CofD enzymes do not share any recognizable homology with other proteins in the sequence data base (20), and there is no structural information on CofD. We report here the first crystal structures of the CofD enzyme from Methanosarcina mazei, in its free form at 3.1 Å resolution, in a ternary complex with Fo and P i at 2.5 Å resolution, and in a ternary complex with Fo and GDP at 3.0 Å resolution.

MATERIALS AND METHODS
Protein Expression and Purification-The production of CofD protein was carried out as part of the high throughput protein production process of the Northeast Structural Genomics Consortium (23). CofD corresponds to Northeast Structural Genomics Consortium Target MaR46. The fulllength cofD gene from M. mazei (strain Goe1) was cloned into a pET21d (Novagen) derivative, generating plasmid pMaR46-21.
The resulting recombinant protein contains eight non-native residues (LEHHHHHH) at the C terminus. The construct was verified by standard DNA sequence analysis. Escherichia coli BL21 (DE3) pMGK cells, a rare codon-enhanced strain, were transformed with pMaR46-21. A single isolate was cultured in MJ9 minimal media (24) supplemented with selenomethionine, lysine, phenylalanine, threonine, isoleucine, leucine, and valine for the production of selenomethionine-labeled CofD (25). Initial growth was carried out at 37°C until the A 600 of the culture reached 0.6 -0.8. The incubation temperature was then decreased to 17°C, and protein expression was induced by the addition of isopropyl ␤-D-thiogalactopyranoside at a final concentration of 1 mM. Following overnight incubation, the cells were harvested by centrifugation.
Selenomethionyl CofD was purified by standard methods. Cell pellets were resuspended in lysis buffer (50 mM NaH 2 PO 4 (pH 8.0), 300 mM NaCl, 10 mM imidazole, and 5 mM ␤-mercaptoethanol) and disrupted by sonication. The resulting lysate was clarified by centrifugation at 26,000 ϫ g for 45 min at 4°C. The supernatant was loaded onto a nickel-nitrilotriacetic acid column (Qiagen) and eluted in lysis buffer containing 250 mM imidazole. Fractions containing partially purified CofD were pooled, and buffer conditions providing monomeric samples were optimized by analytical gel filtration detected by static light scattering, following the protocol described elsewhere (23). Preparative gel filtration (Superdex 75, GE Healthcare) was then performed using a buffer containing 10 mM Tris (pH 7.5) and 5 mM dithiothreitol. The purified CofD protein was concentrated to 10 mg/ml, flash-frozen in aliquots, and used for crystallization screening. Sample purity (Ͼ97%) and molecular weight (34.77 kDa) were verified by SDS-PAGE and matrixassisted laser desorption ionization time-of-flight mass spectrometry, respectively. The yield of purified protein was ϳ50 mg/liter of culture.
Cloning, expression, and purification of MJ1256 from Methanocaldococcus jannaschii were performed as described previously (20). The D50A mutant of MJ1256 was created with the QuikChange site-directed mutagenesis kit (Stratagene), according to the manufacturer's protocol. The mutated plasmid was sequenced to verify the incorporation of the correct mutation. Protein Crystallization-The CofD free enzyme and two ternary complexes (with Fo ϩ P i and Fo ϩ GDP) were crystallized at 20°C by the hanging-drop vapor diffusion method. For the free enzyme crystals, 2 l of protein solution containing CofD (10 mg/ml in 5 mM Tris (pH 7.5), 100 mM NaCl, and 5 mM dithiothreitol) were mixed with 2 l of the reservoir solution consisting of 16% (w/v) PEG3350 and 200 mM LiNO 3 . Crystals of CofD in complex with Fo and P i were grown by mixing 2 l of the protein solution containing 5 mM Fo and 5 mM LP (only phosphate is observed in the final structure) with 2 l of 4.2 M sodium formate, whereas the crystals of CofD in complex with Fo and GDP were grown by mixing 2 l of protein solution containing 5 mM Fo, 5 mM GTP (but only GDP is observed in the final structure), and 20 mM magnesium chloride with 2 l of reservoir solution of 100 mM Hepes (pH 7.15), 1.35 M ammonium sulfate, and 0.5% (w/v) PEG8000.
The free enzyme crystals were cryo-protected by paratone, and crystals of the ternary complexes were cryo-protected by transferring to their reservoir solution supplemented with 25% (v/v) glycerol. The crystals were flash-frozen in liquid propane for data collection at 100 K.
The free enzyme crystals belong to space group P4 3 2 1 2, with cell parameters of a ϭ b ϭ 110.6 Å and c ϭ 76.0 Å. There is one molecule of CofD in the crystallographic asymmetric unit.
The two ternary complex crystals, despite being crystallized from entirely different solutions, are isomorphous to each other. They belong to space group P3 2 with cell parameters of a ϭ b ϭ 185.3 Å and c ϭ 67.8 Å for the Fo ϩ P i complex, and a ϭ b ϭ 186.5 Å, c ϭ 67.8 Å for the Fo ϩ GDP complex. There are two dimers of the ternary complex in the crystallographic asymmetric unit.
Data Collection and Structure Determination-A multiple wavelength anomalous diffraction (26) data set to 3.1 Å resolution was collected on a single crystal of the CofD free enzyme at the X4A beamline of the National Synchrotron Light Source. The diffraction images were processed with the HKL package (27). The data processing statistics are summarized in Table 1.
The selenium sites were located with the program SnB (28). SOLVE/RESOLVE (29) was used for phase calculation, phase improvement, and automated model building, but only about 10% of the residues were placed. The model building was greatly facilitated by the collection of a SAD data set to 2.5 Å resolution on a crystal of the ternary complex with Fo and P i , also at the X4A beamline of National Synchrotron Light Source. The complete atomic models, including the initiating (seleno)methionine, were built with the program XtalView (30). The structure of the ternary complex with Fo and GDP was determined by the molecular replacement method with the program COMO (31), using the structure of the ternary complex with Fo and P i as the model.
The three structures were refined with the program CNS (32). Noncrystallographic symmetry restraint was applied for all stages of the refinement of the ternary complexes. The refinement statistics are summarized in Table 1.
CofD Activity Assays-CofD activity was measured in a coupled assay with lactylphosphate guanylyltransferase (CofC, MJ1117), 4 which produces LPPG from LP and GTP. The reaction mixture contained 50 mM TES (pH 7.5), 0.1 mM GTP, 10 mM LP, 0.4 mM Fo, 2 mM MnCl 2 , 10 g of CofC, and 0.4 -0.6 g of CofD. The enzyme system is inhibited by high concentrations of GMP; therefore, 2 mM phosphoenolpyruvate and 1 unit of pyruvate kinase (Sigma P-1381) were also included in the reaction mixture to regenerate GTP from GMP. The reactions were incubated at 55°C for 30 min, after which the proteins were precipitated by addition of 1.5 volumes of methanol and removed by centrifugation. F 420 -0 formation was determined by high pressure liquid chromatography analysis on a Varian Pursuit XRs 250 ϫ 4.6 mm 5-m C18 column with linear elution gradient from 95% 25 mM sodium acetate pH 6.0, 5% MeOH to 80% MeOH at 0.5 ml/min flow rate over 40 min. F 420 -0 was detected by fluorescence (420 nm excitation and 480 nm emission).
Fluorescence Assays-All experiments were performed using a SAFAS Xenius spectrofluorometer. All spectra were corrected for buffer fluorescence. Fluorescence measurements were carried out after dilution of CofD (0.55 M final concentration) and equilibration for 2 min in 2 ml of buffer containing 25 mM Hepes (pH 8), upon excitation at 282 nm. Binding of Fo was monitored by the variation of tryptophan-intrinsic fluorescence of CofD (between 310 and 380 nm) produced after addition of increasing concentrations of effectors. Correction for the inner filter effect was performed under the same conditions by using N-acetyltryptophanamide (Sigma), following a published protocol (33). Fluorescence resonance energy transfer between tryptophan residues of CofD and bound mant-nucleotide derivatives was monitored by the fluorescence emission between 310 and 500 nm. Peak integration was carried out at each ligand concentration, and the observed changes in fluorescence intensity or fluorescence resonance energy transfer were used for the calculation of ligand affinity. Data were collected at least in triplicate for each ligand, and curve fitting of the data was performed by using Graphit 4.0.10 (Erithacus Software) as described previously (34). For quenching by Fo, the Equation 1 was used, and for fluorescence resonance energy transfer with mant nucleotides, Equation 2 was used, where F is the relative fluorescence intensity; F max is the maximal relative fluorescence intensity; F min is the minimal relative fluorescence intensity; L is the concentration of Fo; E is the total concentration of CofD; and K d is the dissociation constant of the enzyme-substrate complex.
BH3658 and YvcK contain no Trp residues. The binding assays monitored the fluorescence change of Fo, mant-, and trinitrophenylnucleotides in the presence of the proteins.

RESULTS AND DISCUSSION
Structure Determination-The structure of the free enzyme of M. mazei CofD was determined at 3.1 Å resolution by the selenomethionyl multiple wavelength anomalous diffraction method (26). However, several loops in the structure have weak electron density. The subsequent availability of a SAD data set to 2.5 Å on CofD in a ternary complex with Fo and P i greatly facilitated the model building process.
The current atomic models of the three structures of CofD, free enzyme and the ternary complexes with Fo ϩ P i and Fo ϩ GDP, contain residues 1-303 for all the protein molecules in the asymmetric unit, together with a few non-native residues from the C-terminal His tag (LEHHHHHH). All the structures have low R values and excellent geometry at their respective resolution ( Table 1). The majority (90%) of the residues are located in the most favored region of the Ramachandran plot, and no residues are located in the disallowed region (data not shown).
Crystals of the two ternary complexes contain four CofD monomers in the crystallographic asymmetric unit. The structures of the monomers in each crystal are essentially identical. There are, however, recognizable differences between the conformations of the two ternary complexes in the active site region, and these will be described later.
Overall Structure of CofD-The structure of CofD monomer contains 12 ␤-strands (named ␤1 through ␤12) and 13 ␣-helices (␣A through ␣M) (Fig. 1B and Fig. 2). Seven of the ␤-strands (␤1, ␤2, ␤5, ␤9 -␤12) form a Rossmann fold, but there are three large inserted segments between consecutive ␤-strands in this fold that decorate the C-terminal end of the central parallel ␤-sheet. The first insert, with about 75 residues and located in the connection between neighboring strands ␤2 and ␤5, contains a small ␤-hairpin (␤3 and ␤4), 3 helices, and 2 long loops (Fig. 2). The second insert (with 40 residues and located between strands ␤5 and ␤9) is composed of a three-stranded anti-parallel ␤-sheet (␤6, ␤7, and ␤8) and a helix. Finally, the third insert (with 20 residues and between ␤10 and ␤11) contains a helix and a long loop. All three inserts have important roles in forming the dimer interface and/or the active site of CofD ( Fig.  2 and see below).
Earlier gel filtration data showed that CofD is dimeric in solution (20). The structures show a highly elongated dimer (Fig. 2), and this dimeric association is observed in all three structures, in two different crystal forms. The dimer interface is formed by the first insert to the  Rossmann fold and buries about 950 Å 2 of the surface area of each monomer (Fig. 2). The ␤3-␤4 hairpin and the ␣C, ␣D helices in one monomer are in contact with their equivalents in the other monomer in this interface. As some of these residues are also involved in binding the deazariboflavin of Fo (Fig. 2, see below), it is possible that dimerization is important for stabilizing the conformation of this insert for substrate binding. Currently, it is not known whether dimerization is required for the catalytic activity of CofD.
Binding Mode of GDP-The active site of CofD is located at the C-terminal end of the central ␤-sheet of the Rossmann fold, with contributions from the three major inserts to this fold (Fig.  2). To define the binding modes of the substrates of this enzyme, we included Fo and GTP in one set of crystallization experiments and succeeded in obtaining the structure of a ternary complex with Fo and GDP. By including Fo and LP in the crystallization solution, we were able to determine the structure of another ternary complex, with Fo and P i . Although LPPG is the actual substrate of the enzyme, it is not stable in solution, with a half-life of about 2 h at room temperature (20,21). As LPPG is produced by the condensation of LP and GTP (21), we included these two compounds instead in our crystallization experiments. Although 5 mM GTP was used in the crystallization solution, only the electron density for GDP is observed in the four CofD molecules based on the crystallographic analysis (Fig. 3A). These crystals took 2 weeks to grow, and it is possible that GTP has been hydrolyzed during this time, or the ␥-phosphate is disordered in structure because of a lack of strong interactions with the enzyme (see below).
The GDP molecule is bound between the central Rossmann fold core and the third insert of CofD (Fig. 2). Many of the residues that interact with GDP are conserved among the F 420 -producing CofD homologs (Fig. 1B), confirming their functional importance. One face of the guanine base lies against the side chains of Lys-12, Pro-191, and Met-286, and the other face has van der Waals interactions with the side chains of Ile-223 and Val-228 in the long loop in the third insert (Fig.  3B). The N-1 and O-6 atoms of the guanine base are recognized by hydrogen-bonding interactions with the main chain atoms of residues Thr-284 and Met-286 (Fig. 3B), in the loop connecting the last strand (␤12) and last helix (␣M) of the Rossmann fold (Fig. 2). The 2-amino group of guanine is hydrogen-bonded to the main chain carbonyl of Ser-220 and the side chain hydroxyl of Thr-284, which in turn interacts with the side chain of Asp-262, although Thr-284 is not conserved among the CofD homologs (Fig. 1B). This pattern of specific recognition of the guanine base is entirely different from that observed in G proteins. Nonetheless, the structure shows that the adenine base cannot maintain the same hydrogenbonding interactions with the enzyme, consistent with kinetic data showing that the K m value for lactyl(2)diphospho-(5Ј)adenosine is about 30-fold higher than LPPG (20).
The ␣-phosphate of GDP has many hydrogen-bonding interactions with the enzyme, primarily with the glycine-rich loop in the connection between the first strand (␤1) and first  (41). B, schematic drawing of the interactions between GDP and CofD. C, overlay of the structure of the ternary complex with Fo and GDP (in purple) and that with Fo and P i (in yellow). The P i is located about 1 Å from the ␣-phosphate of GDP, but there is a large difference in the conformation of the glycine-rich loop, indicated with the red star.
helix (␣A) of the Rossmann fold (Fig. 1B), as well as with the side chains of Ser-192, Asn-193, and Ser-197 (Fig. 3B). These three residues are part of the PSNPXXSI motif that is conserved among the F 420 -producing CofD homologs (Fig. 1B) (20). In comparison, the ␤-phosphate of GDP does not appear to be specifically recognized by the enzyme in this structure (Fig. 3B). The 2Ј-hydroxyl of the ribose is hydrogen-bonded to the main chain atoms of Pro-221 and Val-228, whereas the 3Ј-hydroxyl interacts with the side chain of Asn-193 (Fig. 3B).
Crystallographic analysis on the crystal grown in the presence of Fo and LP revealed the presence of a phosphate group in the electron density. The lactyl group is either disordered or it has been hydrolyzed during crystallization. Another possibility is that our sample of LP contained some free phosphates. This phosphate is located about 1 Å away from the ␣-phosphate group of GDP, and the glycine-rich loop has a significant conformational difference in this complex (Fig. 3C).
Binding Mode of Fo-The deazariboflavin ring of Fo has well defined electron density in the ternary complex with Fo and P i (Fig. 4A). This group has a planar conformation in our structure, in contrast to a butterfly conformation as observed in F 420dependent dehydrogenases (35)(36)(37). In addition, the mode of recognition of Fo is highly different between CofD and F 420 -dependent dehydrogenases (35)(36)(37). The ribityl group of Fo has much weaker electron density (Fig. 4A). A satisfactory model could not be built for it based on the crystallographic analysis, and most of this group has been omitted from the current atomic model. The electron density for Fo in the ternary complex with Fo and GDP is weaker but shows similar interactions with the enzyme as those observed in the Fo ϩ P i ternary complex.
The binding site for Fo is formed by residues in the first and second inserts of CofD (Fig. 2). Fo has both hydrogen-bonding interactions with conserved residues in CofD, mediated through two water molecules, and van der Waals interactions with the enzyme. One face of the deazariboflavin group isstacked against the side chain of Trp-64 (in the first insert), and the other face has van der Waals interactions with the side chains of Ile-43, Leu-86, and Ile-152 (Fig. 4B). The 8-hydroxyl group on the phenyl ring interacts through two water molecules with several residues in CofD, including the side chains of Asp-48 and Asp-92, from the two conserved DXD motifs in F 420 -producing CofD homologs (Fig. 1B) (20). Such interactions help the enzyme distinguish between the 7,8-didemethyl-8-hydroxy-5-deazariboflavin in Fo and regular riboflavin, as the latter is not a substrate for CofD (20). Moreover, our structural analysis shows that introducing a methyl group at the 7 position of Fo will cause steric clashes with the side chain of Asp-92, which should also help the discrimination against regular riboflavin by CofD. The carbonyl groups on the pyrimidine ring of Fo are exposed to the solvent (Fig. 4B).
Large Conformational Changes upon Substrate Binding-A comparison of the structures of the free enzyme of CofD and the two ternary complexes shows that there are large conformational changes in the enzyme upon substrate binding, primarily in the second and third inserts (Fig. 5A). The largest difference is seen for the loop connecting strand ␤10 and helix ␣J, in the third insert. This loop is located away from the GDP molecule in the free enzyme structure, creating an open form of the active site (Fig. 5B). In the ternary complex, this loop moves into the active site (Fig. 5A), creating a closed form of the enzyme (Fig. 5C) and providing several side chains that are important for interactions with the guanine base and the ribose. Interestingly, the structure of the Fo ϩ P i ternary complex has a similar conformation for this loop (Fig. 5A), suggesting that a single phosphate group may be sufficient to induce active site closure. However, the glycine-rich loop assumes a different conformation in the ternary complex with P i , as discussed earlier (Fig. 3C).
Additional differences are seen in the position of helix ␣F, in the second insert. It moves closer to the active site to establish interactions with the Fo substrate (Fig. 5A). A conformational change is also observed for helix ␣L, in the connection between strands ␤11 and ␤12 in the Rossmann fold core, but this helix is located far from the active site.
Substrate Binding Assays Confirm the Structural Observations-Our structures show that CofD contains three Trp residues, Trp-63, Trp-64, and Trp-151, in the active site region (Fig. 4B), which allowed us to develop assays that can monitor ligand binding by either intrinsic Trp fluorescence or by fluorescence resonance energy transfer. In particular, Trp-64 is -stacked with the deazariboflavin moiety of Fo (Fig. 4B) and therefore is the most likely to change its fluorescence upon Fo binding. Indeed, in the absence of Fo and with the excitation wavelength set at 282 nm, CofD exhibited a fluorescence emission spectrum characteristic of a rather accessible tryptophan residue ( max emission around 340 nm; data not shown). A progressive increase of the Fo concentration from 1.25 to 20 M quenched the fluorescence intensity of CofD (Fig. 6A), from which the apparent K d for the binding of Fo to CofD can be estimated to be 1.38 Ϯ 0.07 M.
We also attempted to monitor the binding of GDP to CofD by intrinsic Trp fluorescence. The addition of GDP produced a weak specific change of CofD fluorescence, but the curve fitting of the data did not allow us to determine a precise apparent K d value (data not shown). We then used the fluorescent nucleotide mant-GDP in the binding assay. Addition of increasing amounts of mant-GDP produced a quenching of the fluorescence emission spectrum of CofD tryptophan residues (Fig. 6B). Simultaneously, a new fluorescence signal progressively developed, centered at ϳ430 nm, which is related to fluorescence resonance energy transfer between the tryptophan residues and the mant group. The fluorescence resonance energy transfer data allowed us to estimate an apparent K d of 1.59 Ϯ 0.14 M for the CofD-mant-GDP complex (Fig.  6C). A K d value of 4.6 Ϯ 0.6 M was determined for the CofD-mant-GTP complex using similar experiments. No binding of mant-ADP or   (20).
Implication for the Biosynthesis of F 420 -Our structural studies have defined the binding modes of GDP and the deazariboflavin portion of Fo, as well as located the active site region of the CofD enzyme. The structural analysis suggests that the GMP portion of the LPPG substrate of CofD is likely to have a similar binding mode. Although the exact binding modes of the LP portion of LPPG and the ribityl group of Fo are not defined from these structures, a careful examination of the active site region of CofD provides significant insights into the mechanism of catalysis by this enzyme. The reaction is known to require Mg 2ϩ ion(s), which are likely coordinated by the phosphate groups and the carboxylate group of the LPPG substrate (20). Our structures show that two acidic side chains in the active site region, Glu-34 and Asp-46 (Fig. 4B), could also be ligands to the Mg 2ϩ ions. Asp-46 is part of the first conserved DXD motif, and Glu-34 is conserved as an acidic residue among F 420 -producing CofD homologs (Fig. 1B). To confirm the functional importance of the Asp-46 residue, we mutated its equivalent residue in the M. jannaschii CofD homolog to alanine and observed a 40-fold loss in the specific activity of the enzyme.
It is likely that the conformation of the LP and ribityl groups of the substrates are only ordered in the presence of Mg 2ϩ ions, which brings the terminal hydroxyl group of Fo into proximity of the ␤-phosphate of LPPG to initiate the reaction. Our structural analysis suggests that the general base that extracts the proton from this hydroxyl group is likely a hydroxide ligated to the Mg 2ϩ ions or possibly one of the terminal oxygens on the LPPG substrate, as the side chains of acidic residues in the active site region are likely too far from this hydroxyl group.
Our structures also show that only the ␣-phosphate of GDP is tightly anchored by its interactions with the enzyme (Fig. 3B). Therefore, other compounds that contain di-or triphosphate groups could become substrates to the enzyme, by overlapping with this phosphate. This explains the observations that CofD can catalyze the phosphorylation of Fo by using GTP, GDP, PP i , and PPP i as substrates (20).
LargeStructuralDifferencesinNon-F 420 -producing CofD Homologs-CofD belongs to a large family of proteins that have been found in both F 420 -producing and non-F 420producing organisms (Fig. 1B). The crystal structures of several non-F 420 -producing CofD homologs have been determined, including the BH3568 protein from Bacillus halodurans (PDB entries 2HZB and 2O2Z, 24% sequence identity to M. mazei CofD), the LP_0780 protein from Lactobacillus plantarum (PDB entry 2P0Y, 21% identity), and the SE_0549 protein from Staphylococcus epidermidis (PDB entry 2PPV, 18% identity). The central Rossmann fold of these structures is similar. The root mean square distance between equivalent C-␣ atoms of the structures of CofD and BH3568 is 3.3 Å (Z score of 22.6 from the program Dali) (38).
However, our detailed sequence and structural and functional analyses suggest that the non-F 420 -producing CofD homologs may use completely different substrates and catalyze a different biochemical reaction compared with CofD. Evidence in support of this conclusion includes the following. 1) Many of the conserved sequence motifs in the F 420 -producing homologs (for example the two DXD motifs and the PSNPXXSI motif) are not conserved in the non-F 420 -producing homologs (Fig. 1B). Our structural data on CofD show that these sequence motifs are important for GDP binding and/or catalysis. 2) There are large differences in the conformation of the inserts, and as a result there are large differences in the active site of the non-F 420 -producing CofD homologs (Fig. 7). The structure of the first insert of BH3568 is entirely different from that in CofD (Fig. 7). As this insert is involved in dimerization, the BH3568 dimer has a significantly different organization relative to the CofD dimer as well. This difference is also expected to disrupt the Fo-binding site, and the Trp-64 residue in CofD does not have an equivalent in BH3568. The second insert in BH3568 has a similar fold but is positioned differently compared with CofD (Fig. 7). The third insert is much shorter in BH3568, and this protein contains a new insert, between strand ␤11 and helix ␣L (Fig. 7). In addition, the loop connecting strand ␤12 and helix ␣M is longer (by 4 residues, Fig. 1) and assumes a different conformation in BH3568. As residues in this loop are involved in recognizing the guanine base in CofD (Fig. 3B), the structural difference suggests that the non-F 420 -producing homologs probably cannot bind GDP. 3) We did not detect any binding of Fo, GDP, ADP, GTP, or ATP to the BH3568 protein or its close homolog YvcK from B. subtilis in our assays (22). The binding data provide strong experimental support for the observations based on the sequence and structural analyses. Further studies are needed to characterize the exact biochemical and biological functions of the non-F 420 -producing CofD homologs. On the other hand, the glycine-rich loop is conserved in these proteins (Fig. 1B), suggesting that the substrate(s) is likely to contain a phosphate group. In fact, one of the BH3568 structures (PDB entry 2O2Z) contains a sulfate at this position.
In summary, our structural and biochemical studies have provided significant molecular insights into one of the steps in the biosynthesis of the F 420 coenzyme. The structural analysis also indicates the presence of a structurally homologous family of enzymes that likely catalyze a different biochemical reaction.