Crystal Structures of Vertebrate Dihydropyrimidinase and Complexes from Tetraodon nigroviridis with Lysine Carbamylation

Background: Lysine carbamylation facilitates metal coordination for enzymatic activities. Results: Structures of dihydropyrimidinase as the apo- and holoenzyme with one and two metals and its substrate/product complexes are determined. Conclusion: The structures reveal four steps in the assembly of the holoprotein with the carbamylated lysine and two metal ions. Significance: The results illustrate how proteins exploit lysines and metals to accomplish lysine carbamylation and enzymatic functions. Lysine carbamylation, a post-translational modification, facilitates metal coordination for specific enzymatic activities. We have determined structures of the vertebrate dihydropyrimidinase from Tetraodon nigroviridis (TnDhp) in various states: the apoenzyme as well as two forms of the holoenzyme with one and two metals at the catalytic site. The essential active-site structural requirements have been identified for the possible existence of four metal-mediated stages of lysine carbamylation. Only one metal is sufficient for stabilizing lysine carbamylation; however, the post-translational lysine carbamylation facilitates additional metal coordination for the regulation of specific enzymatic activities through controlling the conformations of two dynamic loops, Ala69–Arg74 and Met158–Met165, located in the tunnel for the substrate entrance. The substrate/product tunnel is in the “open form” in the apo-TnDhp, in the “intermediate state” in the monometal TnDhp, and in the “closed form” in the dimetal TnDhp structure, respectively. Structural comparison also suggests that the C-terminal tail plays a role in the enzymatic function through interactions with the Ala69–Arg74 dynamic loop. In addition, the structures of the dimetal TnDhp in complexes with hydantoin, N-carbamyl-β-alanine, and N-carbamyl-β-amino isobutyrate as well as apo-TnDhp in complex with a product analog, N-(2-acetamido)-iminodiacetic acid, have been determined. These structural results illustrate how a protein exploits unique lysines and the metal distribution to accomplish lysine carbamylation as well as subsequent enzymatic functions.

Post-translational modification of proteins increases the diversity in the proteome (1)(2)(3). Carbamylation on lysine extends the residue length by ϳ2 Å and changes the side chain from a positive to negative charge at neutral pH. This unique modification enables the carbamylated lysine to play multifunctional roles in proteins, especially critical for many physiologically important enzymes to exhibit the proper activities (4,5). In humans, the proteins involved with lysine carbamylation are found to be related to the human diseases, such as type 2 diabetes, developmental delay, metabolic acidosis, mental retardation, hypotonia, and seizures (6 -9). In plants, to convert the atmospheric carbon dioxide to energy-rich molecules, the enzyme Rubisco 3 catalyzes the CO 2 fixation in photosynthesis (10). Rubisco has no enzymatic function until the key lysine residue is carbamylated. The roles of the carbamylated lysine are for metal binding, such as the magnesium ion, as well as to serve as a base for proton abstraction in the catalytic cycle. To counter antibiotics, bacteria have developed the ␤-lactamase resistance system to destroy the ␤-lactam antibiotics for survival (11)(12)(13). The function of the carbamylated lysine residue in class D ␤-lactamases is not only to promote the acylation step of the active-site serine but also to activate hydrolytic water for the second catalytic step of the deacylation event (13). So far, there are about 250 structures containing carbamylated lysine residues found in the Protein Data Bank). Although carbamylated lysines exist in a broad range of organisms and have been considered as a residue of catalytic capabilities in nature among the 20 canonical amino acids (14), much less is known about the carbamylation, biochemically and structurally, on the ⑀-amino group of lysine.
Lysine carbamylation does not need an additional enzyme to achieve the covalent modification, a process that has been reported to be a spontaneous and reversible (15,16). By comparison, additional specific enzymes or catalysts, such as methyltransferases and acetyltransferases, are required to catalyze methylation and acylation reactions, respectively (17,18), both with high kinetic barriers for the covalent bond formation. Thus, the spontaneous and reversible features of lysine carbamylation are unique in post-translational modifications. In fact, lysine carbamylation is difficult to establish. Most of the carbamylated lysines in proteins are identified by x-ray crystal structures.
In addition to the catalytic role, lysine carbamylation is also involved in the coordination with metal ions. For example, metal ions are found in the amidohydrolase superfamily (19), such as dihydropyrimidinase (DHP; EC 3.5.2.2) in animals. The physiological function of DHP is to hydrolyze dihydrouracil (DHU) or dihydrothymine (DHT) to N-carbamyl-␤-alanine (NC␤A) or N-carbamyl-␤-amino isobutyrate (NC␤I), respectively (20), in the second step of the pyrimidine degradative pathway. The bacterial counterpart of DHP, also called hydantoinase (HYD), is not involved in the pyrimidine catabolism, but in industry, it is used for the production of enantiomerically active N-carbamyl amino acids, which are further converted to non-proteinogenic amino acids for synthesis of antibiotics, peptide hormones, and pesticides (21,22). The active sites of HYD in the bacterial enzymes usually contain dimetal ions with the carbamylated lysine based on their crystal structures (23,24). However, animal DHP contains only a single divalentmetal ion in the active site according to several independent studies (25)(26)(27), despite the fact that a dimetal binding site potentially exists in animal DHP based on the multiple sequence alignments. In fact, before this work, there has been no structural information on the active-site environment in DHP with the single divalent metal co-existing with carbamylated lysine.
Here we describe the crystal structures of the vertebrate dihydropyrimidinase from Tetraodon nigroviridis (TnDhp) in three states: the apoenzyme as well as the holoenzyme with one divalent metal and two divalent metal ions, respectively. In addition, the apo-TnDhp in complex with a product analog, N-(2-acetamido)-iminodiacetic acid (ADA), and the dimetal TnDhp in complexes with hydantoin, NC␤A, and NC␤I have also been determined. These structures have enabled us to correlate the metal distribution within this enzyme with lysine carbamylation as well as subsequent enzymatic functions and mechanisms.

EXPERIMENTAL PROCEDURES
Molecular Cloning and Protein Purification-Healthy green spot pufferfish, T. nigroviridis, were sacrificed. Total RNA was isolated from fish liver using the RNeasy Protect minikit (Qiagen) according to the manufacturer's protocol. The RNA was transcribed into the single-stranded cDNA using the Super-Script III first-strand synthesis system for RT-PCR kit (Invitrogen). The cDNA of the recombinant TnDhp was amplified by PCR using the sense primer 5Ј-GCGCGGATCCATGGCA-GAAGCCGGCGAG-3Ј and the antisense primer 5Ј-GCGC-GAATTCTCAGTCAGACGTTCCCAGAGCAAC-3Ј (with the BamHI and EcoRI cutting sites underlined). After digestion by restriction enzymes, a fragment with BamHI and EcoRI sticky ends was purified from agarose gel using the Gene-Spin TM 1-4-3 DNA Purification Kit-V 2 (PRO TECH ) and subcloned into the BamHI/EcoRI precut pET44a(ϩ) (Novagen) plasmid by T 4 DNA ligase. pET 44a(ϩ) harboring the TnDhp cDNA was subsequently transformed into Escherichia coli BL21 (DE3) cells for recombinant protein expression and purification. The bacteria were grown in Luria-Bertani (LB) medium, and protein expression was induced with isopropyl-␤-D-thiogalactopyranoside at 25°C for 30 h. To improve the solubility of the recombinant TnDhp, the fusion protein with native E. coli NusA (Nus-tag) attached to the C terminus (28) was first prepared and then purified and eluted with an imidazole gradient through an Ni 2ϩ column. After thrombin cleavage of the Nustag, the pure recombinant TnDhp (non-amended enzyme) was eluted again through the Ni 2ϩ column.
Inductively Coupled Plasma Mass Spectrometry (ICP-MS) Assays-The metal ion compositions of the various ThDhps were determined by ICP-MS. The Centricon (molecular weight cut-off 30,000; Amicon) was used to replace the buffer solution of each sample with potassium phosphate buffer (10 mM, pH 7.0), which served as the background solution for the determination of the metal ions. The measurement on each sample was independently repeated three times with calculated S.D. values.
Activity Assays-The specific activity measurements were performed by a rapid spectrophotometric assay (27). The TnDhps activity was determined by measuring the decrease of the absorbance at 298 nm upon the hydrolysis of phthalimide as the substrate at 25°C. For routine assays, the reaction mixture (1 ml) contained phthalimide (1 mM) and BisTris propane (100 mM) at pH 7.0. Under the reaction conditions, a change in A 298 of 2.26 corresponded to the hydrolysis of 1 mol of the substrate. Substrate hydrolysis was monitored with a UV-visible spectrophotometer (Hitachi U 3300).
Crystallization of the TnDhps-All crystallizations were performed with the hanging-drop vapor diffusion method in a 48-well ADX plate (Hampton Research) at 291 K. 1-l drops of the protein solution (ϳ10 mg ml Ϫ1 ) in Tris buffer (20 mM, pH 7.0) were mixed with 1-l drops of the reservoir solution and equilibrated against the reservoir solution (150 l). Crystals of the apo-TnDhp could be obtained when the solution contained lithium sulfate monohydrate (100 mM), PEG 4000 (12%, w/v) and isopropyl alcohol (2%, v/v) in ADA buffer (100 mM, pH 6.5). Crystals of the mono-Zn TnDhp, which were purified naturally from E. coli with the addition of zinc chloride during isopropyl-␤-D-thiogalactopyranoside induction, were grown from a solution containing lithium sulfate monohydrate (100 mM), PEG 4000 (12%, w/v), and trisodium citrate dihydrate (100 mM, pH 6.5). Crystals of the di-Zn TnDhp were prepared by soaking zinc chloride (0.1 mM) into the crystals of the mono-Zn TnDhp in the crystallization buffer. Complexes of the di-Zn TnDhp were co-crystallized using a molar ratio of 3 (substrates/protein). Suitable single crystals (0.2 ϫ 0.1 ϫ 0.1 mm 3 ) were selected and transferred to the crystallization solution containing glycerol (20%) as the cryoprotectant and fast-cooled in liquid nitrogen for further x-ray diffraction experiments.
Data Collection and Processing-X-ray diffraction data from frozen crystals of apo-TnDhp were collected at 110 K using a CCD detector (Q-4R, ADSC) at wavelength 1.0 Å on the Taiwan contracted beamline BL12B2 at SPring-8 in Japan. A 240º rotation with 1.0º oscillation was measured with an exposure duration of 10 s and distance of 180 mm from the crystal to the detector, at 110 K in a nitrogen stream using a cryosystem (X-Stream, Rigaku/MSC, Inc.). The mono-Zn TnDhp data were collected on the BL44XU beamline at SPring-8 with a similar procedure as for the apo-TnDhp and recorded by a SMART-6500 CCD detector (Bruker). The x-ray diffraction data of di-Zn TnDhp as well as their substrate complexes were collected at 110 K on beamline BL13B1 equipped with a CCD detector (Q315, ADSC) at the National Synchrotron Radiation Research Center in Taiwan. All data sets were indexed, integrated, and scaled using HKL2000 (29). Details of the data statistics appear in Table 2.
Determination and Refinement of Structures-Assuming one monomer structure in the asymmetric unit, the Matthew's coefficient was estimated to be 2.82 Å 3 /Da, which corresponds to a solvent content of ϳ56%. The structure of apo-TnDhp was first solved by the molecular replacement method using the structure of DHP from Dictyostelium discoideum (Protein Data Bank code 2FTW) (5) (sequence identity 58%). This attempt generated initial phases with an R factor of ϳ45% by the program MOLREP (30). Throughout the refinement, a random selection (10%) of the data were set aside as a "free data set," and the model was refined against the remaining data with F Ն 0 as a working data set (31). After the rigid body refinement and simulated annealing with CNS version 1.3 (32), several rounds of model building with Coot (33) were performed. The refine-ment then proceeded through another cycle of individual B-factor refinement and restrained refinement by CNS and Refmac5 (34), respectively, to generate a model with R-factor of ϳ25%. Subsequently, the water molecules were located at peaks of the difference maps automatically, using the program ARP/ warp (35). After the assignment of 232 waters, the refinement converged to a final R-factor of 15.9% and R free of 19.5% for all data at a resolution of 30 to 2.0 Å. The structures of the monoand di-Zn TnDhp and complexes with various substrates were determined by molecular replacement with the program MOLREP using the structure of apo-TnDhp as the search model. The structures were refined with the procedures described above. The correctness of the stereochemistry of the models was verified using MolProbity (36). All main-chain dihedral angles of residues were in the most favored and additionally allowed regions with only glycine exceptions in the Ramachandran plots. All refinement statistics are summarized in Table 2.

Characterizations of TnDhp; ICP-MS and Reactivation
Assays-The ICP-MS data show that the non-amended TnDhp monomer contains various metals with different compositions (Table 1), of which iron is the major metal ion, implying that the reactive metal ion(s) at the active site can be largely replaced with iron(s) during protein expression in E. coli. Previously, it was found that the iron incorporated into the DHP might result from iron contamination of the LB medium (37). In fact, the enzyme containing iron as the abundant metal ion is nearly inactive, exhibiting only a low level of activity.
The metal compositions in both the zinc-amended (mono-Zn TnDhp) and the cobalt-amended TnDhp (mono-Co TnDhp) are also analyzed by ICP-MS (Table 1). These measurements are repeated three times, with similar results obtained for each metal. The metal contents suggest that the total metal capacity (metal/enzyme monomer) is ϳ1 (mol/mol), implying that the active site of TnDhp contains only one tightly bound metal ion in its functional form. The specific activity of the cobaltamended TnDhp is about 1.4-and 23-fold higher than those of the zinc-amended and the non-amended (iron-containing) TnDhp, respectively. Despite the higher activity of the cobaltamended TnDhp, we have used the zinc-amended TnDhp as the model for discussion here because the DHP in vertebrate is a natural zinc-metalloenzyme (25).
The above metal stoichiometry was confirmed by reactivity assays. Various concentrations of zinc ions were added to a solution of the apo-TnDhp, and the activity was measured. The reactivation experiment clearly indicates that the addition and incorporation of zinc ions into the apo-TnDhp could restore the enzymatic activity ( Fig. 1). However, the activity curve also shows that concentrations of zinc ions significantly higher than the 1:1 stoichiometry caused a slight inhibitory effect on the protein activity. At a concentration of 25 M, the activity decreased to 60% of the maximum value (data not shown).
To verify that the binding of zinc ions is specific, we titrated samples at various metal/protein ratios and determined the zinc contents by ICP-MS after the solutions were passed through the desalting column to remove zinc ions that were non-specifically bound (Fig. 1). The ICP-MS data show that all samples contain only one zinc ion per protein monomer, confirming that the maximum activity was obtained with only one metal ion occupying a specific site in the protein.
Overall Structure and Molecular Packing of TnDhp-The crystal structures of TnDhp in various states were determined and summarized in Table 2. The structure of apo-TnDhp was first determined. It exhibits a monomer per asymmetric unit in crystals of the space group I4 1 22, with 492 amino acids from Ala 4 to Ala 495 out of a total of 500 residues (Fig. 2, A and B). The molecular dimensions of TnDhp are about 70 ϫ 55 ϫ 50 Å 3 , with a surface area of ϳ18,912 Å 2 accessible to the solvent. The overall structure comprises 18 ␣-helices and 20 ␤-strands and folds into one minor (Ala 4 -Gly 58 , Phe 384 -Lys 405 , and Pro 431 -Gly 447 ) and one major domain (Gly 59 -Asn 383 and Thr 406 -His 430 ). After the last ␣-helix (Glu 466 -Glu 477 ), the C-terminal fragment (Val 478 -Ala 495 ) forms a long tail of 18 amino acids. As is commonly found in DHPs, the major domain folds as a distorted (␣/␤) 8 TIM barrel, whereas the minor domain forms a sandwich-like structure with 10 mixed anti-parallel/parallel ␤-strands. The electrostatic surface reveals a cleft of about 10 ϫ 10 ϫ 20 Å 3 in the center of the major domain that facilitates the entry of substrates to the active site.
The TnDhp exhibits a monomer in the asymmetric unit of the crystals; however, the crystallographic symmetry-related dimer and tetramer structures could be generated with the FIGURE 1. Reactivation of the apo-TnDhp with zinc ions. Variable amounts of Zn 2ϩ were added into a solution of the apo-TnDhp at pH 7.0 (10 mM potassium phosphate) and incubated at 4°C for 3 days. The specific activities were determined by the standard assay (•). In parallel experiments, the same samples were further desalted by passage over a HiTrap desalting column with 10 mM potassium phosphate (pH 7.0), and the metal contents in the reconstituted enzymes were analyzed by ICP-MS. The ICP-MS data points (E) 1Ј, 2Ј, and 3Ј correspond to the samples 1, 2, and 3 after desalting column, respectively. The inset presents higher molar ratio (up to 8 and 10) data with similar desalting results. Error bars, S.D. space group I4 1 22 (Fig. 2, C and D), which exhibits molecular packing similar to that of other previously reported tetramers (39). The results of the gel filtration during protein purification suggest that the TnDhp is a tetramer in the solution state. Interestingly, the dimeric interface of TnDhp shows that the long C-terminal tail (Val 478 -Ala 495 ) from one monomer winds onto the surfaces of another monomer and reaches close to its active site. Several hydrogen bonds stabilize the dimeric interaction through the C terminus. The Metal-binding Site and Post-translational Modification of Lysine by Carbamylation-The metal-free TnDhp (apo-TnDhp) was generated, and the metal content (ϳ0) was confirmed by ICP-MS (Table 1). The active site of apo-TnDhp contains four histidines (His 63 , His 65 , His 188 , and His 244 ), one aspartic acid (Asp 322 ), and one lysine (Lys 155 ) in the catalytic pocket (Fig. 3A). In the structure of the apo-TnDhp, the residue of Lys 155 is not carbamylated at the ⑀-amino group. Two water molecules, but no metals, occupy the potential metal-binding sites.
The mono-Zn TnDhp contains one metal at the active site, as confirmed by ICP-MS assays ( Table 1). The zinc-anomalous difference Fourier map clearly indicates only one zinc atom in the active site (defined as the M␣ site) (Fig. 3B). This zinc ion is coordinated with residues His 63 , His 65 , and Asp 322 , with zinc to nitrogen distances of 2.21, 2.16, and 2.13 Å, respectively. Two water molecules are also observed. One water molecule is loosely associated with the zinc atom at a distance of 4.4 Å. The other water is directly coordinated (2.2 Å) to Tyr 160 , which is the proposed key residue participating in the catalysis (39). The composite omit map shows that the electron density between the M␣ metal and Lys 155 is connected. This electron density is consistent with the carbamate of a carbamylated lysine with a reasonable distance of 2.14 Å between one of the oxygens of the carbamate and the M␣ metal. An inspection of 10 data sets, which were collected independently, reveals the same result as described above. The structure of mono-Zn TnDhp shows that one metal (M␣) is sufficient to promote the carbamylation process. Evidently, Lys 155 is chemically modified, with one metal ion at the active site of the protein.
Sequence alignment as well as the structural features of the active site uncovered above suggest that the catalytic pocket of TnDhp could potentially accommodate two metal ions. To generate the di-Zn TnDhp, we soaked crystals of the mono-Zn TnDhp in the crystallization milieu containing zinc ions at high concentrations (0.1 mM). The structure of the di-Zn TnDhp shows two zinc ions at the active site (Fig. 3C), of which one zinc atom is located at the same M␣ site as in the mono-Zn structure, and the other, dubbed the M␤ site, is bound to the two histidine residues His 188 and His 244 with zinc to nitrogen distances of 2.15 and 2.13 Å, respectively. The distance between the two zinc atoms (i.e. M␣ and M␤) is 3.74 Å in the di-zinc TnDhp. The existence and position of the second zinc was confirmed by zinc-anomalous diffraction. These results indicate that the active site could accommodate a total of two zinc atoms. A water molecule is also found at the position between the two zinc atoms (Fig. 3C). In the dimetal TnDhp, each of the zinc ions is coordinated with one of the carba- Vertebrate DHP and Complexes from T. nigroviridis OCTOBER 18, 2013 • VOLUME 288 • NUMBER 42 mate oxygens of the carbamylated Lys 155 with a zinc to oxygen distance of 1.97 Å.
Superimposed structures of the apo-TnDhp and the monoand di-Zn TnDhp reveal conformational changes upon lysine carbamylation and metal binding (Fig. 3D). In the surrounding environment of the active site, the side-chain conformations of His 63 , His 65 , and His 244 are relatively stable, whereas those of Tyr 160 , His 188 , and Asp 322 are altered. Relative to their positions in the apo-TnDhp structure, we find that Tyr 160 , His 188 , and Asp 322 have shifted their side-chain orientations by distances of ϳ1.9, 0.5, and 0.3 Å, respectively, upon binding of one metal and carbamylation in the mono-Zn structure. After carbamylation and the binding of the two zinc ions, these residues move closer to the metal sites, and the movements increase to 2.0, 0.8, and 0.6 Å, respectively.
The conformation of the Lys 155 is also greatly affected by the metal binding and carbamylation (Fig. 3D). In apo-TnDhp, Lys 155 exhibits the general non-carbamylated conformation. Upon zinc binding to the M␣ site, the post-modification process takes place by stabilizing CO 2 already on the ⑀-amino group of Lys 155 . It is of interest to note that the free oxygen atom of the carbamate in the mono-Zn TnDhp structure is located at the M␤ position coordinating with His 244 (2.4 Å). Moreover, the side chain of the carbamylated Lys 155 has shifted away 1.4 Å to yield space for the second zinc atom to fit into the M␤ site in the structure of the di-Zn TnDhp.
It has been reported that cobalt atoms could substitute for zinc at the active site (40). We have confirmed that the structures of the mono-Co and di-Co TnDhp are similar to the zinc structures described above.
Comparison of Active Sites of Carbamylated Proteins-A protein generally contains several lysine residues, but carbamylation is typically found at special positions, which is extremely interesting and important. The propensity toward carbamylation must depend on the environment of the lysine, including the surrounding residues. The carbamylated lysines are usually buried within the protein structures with a small solvent-accessible surface area, ϳ6.2 Å 2 , as in apo-TnDhp, and are surrounded by hydrophobic residues, such as Val and Phe. At present, there are about 250 structures in the Protein Data Bank containing carbamylated lysines required for their functions. These structures could be classified into three groups based on the metal content: 1) no metal; 2) carbamylated lysine with one metal in "half-occupied" geometry; and 3) carbamylated lysine with two metals in full geometry (Scheme 1).
The structures of TnDhp reveal a common structural motif for lysine carbamylation with both M␣ and M␤ sites. This motif has been demonstrated to be important for maintaining the carbamate structure according to site-directed mutagenesis and activity rescue experiments (40). One metal is bound to the M␣ site sharing two histidines. Apparently, this metal coordination is sufficient to fix CO 2 to the lysine. Interestingly, there are several other systems, such as transcarboxylase (Protein Data Bank entry 1RQB) (41), pyruvate carboxylase (2QF7) (6), and pyruvate carboxylase (3BG3) (7), with the same "half-occupied" geometry utilized by the respective lysine to coordinate one cobalt, zinc, and manganese ion, respectively (Fig. 4). A carbamylated lysine with one metal has also been reported in Rubisco (4). The two highly conserved histidine residues near the carbamylated lysine exist also in Rubisco, although the metal is bound to the ribulose 1,5-bisphosphate substrate instead of the histidines. Thus, our mono-Zn TnDhp provides a new structural feature for enzymes that contain only one tightly bound metal in full geometry, differing from those structures in the Protein Data Bank.
A ligand structure comprising four histidines and one aspartic acid utilizing both the M␣ and M␤ sites is required for lysine carbamylation and other biological functions mediated by this family of proteins (Fig. 4B). Examples include the amidohydrolase superfamily (e.g. urease (42), allantoinase (43), HYD (44), and DHP (5)) and other proteins, such as arginine carboxypeptidase (45).
Lysine Carbamylation and Metal Numbers Regulate the Structure and Function of the Protein-The post-translational lysine carbamylation facilitates additional metal coordination to trigger the observed structural alteration seen in our structures for specific enzymatic activities. The molecular structure of apo-TnDhp also reveals a tunnel from the surface of the protein to the active site suitable for the substrate entrance or product release (Fig. 5B). Notably, two loops located at the entrance of the tunnel and near the active site are highly dynamic (Fig. 5A). One loop from Ala 69 to Arg 74 exhibits the largest variation among three structures (apo-, mono-Zn, and di-Zn TnDhp). In particular, Phe 70 and Met 71 swing with a maximum range of 6.9 and 7.5 Å, respectively. The other loop from Met 158 to Met 165 also exhibits a similar structural disparity, allowing Tyr 160 to move closer to the active site in the dimetal structure with a displacement of 2 Å.
In the structure of apo-TnDhp, the two loops are positioned away from the active site with distances of 12.5 and 5.7 Å measured from Phe 70 (on the first loop) and Tyr 160 (on the second loop) to M␣ and M␤, respectively. We surmise that this is the "open form" of the enzyme because there is no apparent barrier restricting the transport of the substrate from the surface to the active site (Fig. 5B). The bottleneck dimension of the tunnel in an open form is about 10 Å (Fig. 5D), which is greater than the average size (ϳ6 Å) of the TnDhp substrates, such as DHU and DHT. However, the two dynamic loops in di-Zn TnDhp move closer to the active site with distances of 8.5 and 4.5 Å from Phe 70 and Tyr 160 to M␣ and M␤, respectively, and significantly reduce the bottleneck dimension to 3 Å (Fig. 5, C and E), resulting in a "closed form" of TnDhp and limiting the substrate entry into or product release from the active site.
Interestingly, in the structure of the mono-Zn TnDhp, the orientation of the (Ala 69 -Arg 74 ) loop is in the "intermediate state" between the conformations of apo-and di-Zn TnDhp. SCHEME 1. Four stages are involved in lysine carbamylation. Stage I, the structural requirements for lysine carbamylation. "Half-occupied" geometry (at least two histidines) is needed for coordination with one metal, whereas the full geometry is employed for two-metal coordination. Stage II, the carbon dioxide interacts weakly with the lysine in a resonance structure. Stage III, the state with the carbamylated lysine coordinated to one metal. The free oxygen of the carbamate containing the electron lone pair is stabilized by a histidine. The dashed circle indicates the binding site for the second metal. The six atoms in the dashed square are coplanar. Stage IV, the carbamylated Lys 155 moves by 1.4 Å, as indicated by the arrow, upon binding of the second metal. In this stage, the seven atoms are coplanar (dashed square).  OCTOBER 18, 2013 • VOLUME 288 • NUMBER 42

JOURNAL OF BIOLOGICAL CHEMISTRY 30651
The dynamic loops in the mono-Zn structure are about 40% in the open form and 60% in the closed form based on the electron density analysis. The averaged temperature B-factor of this loop (Ala 69 -Arg 74 ) in the mono-Zn TnDhp (ϳ60.2 Å 2 ) is higher than that of either the apo-TnDhp (ϳ32.2 Å 2 ) or the di-Zn TnDhp (ϳ26.0 Å 2 ), suggesting that the (Ala 69 -Arg 74 ) loop has the propensity to undergo structural fluctuations even in the crystalline state.
It has been reported that the D-HYD from Bacillus stearothermophilus is equipped with three stereochemistry gate loops (SGL) located in the tunnel gate to regulate the substrate specificity and activity (46,47). The first (Ala 69 -Arg 74 ) and second (Met 158 -Met 165 ) dynamic loops found in TnDhp are structurally related to the gate loops SGL1 and SGL3, respectively, in B. stearothermophilus D-HYD. In this study, we have provided evidence for structural or conformational fluctuations of similar SGLs in vertebrate dihydropyrimidase through a comparison of the apo-and metal-binding structures.
The Flexible C-terminal Tail-The role of the C terminus in DHP has been investigated for a long time, but its function is still a mystery. It has been suggested that the C terminus may play a role in maintaining a multimer form of proteins. It is known that the deletion of the C-terminal fragment in D-HYD from B. stearothermophilus results in dissociation of the protein from dimers to monomers (48). In our structure of TnDhp, the C-terminal tail also maintains the dimeric form, with its terminus reaching near the dynamic (Ala 69 -Arg 74 ) loop of the active site (Figs. 2C and 5B). An inspection and comparison of the C-terminal tails in apo-, mono-Zn, and di-Zn TnDhp struc-tures show that the stability of the C-terminal tail, especially the fragment from Pro 488 to Ala 495 , affects the dynamic (Ala 69 -Arg 74 ) loop (Fig. 5A). The structure of the C-terminal fragment (Pro 488 -Ala 495 ) is rigid, with well defined electron densities in the apo-TnDhp structure (the open form) (Fig. 5B), whereas it is flexible in the monometal structure (an intermediate state). In the latter, the electron density of this region is diminished so that model building could only be allowed with the density map at a lower contour level (0.7 ). Finally, in the di-Zn TnDhp structure (the closed form), the electron density of the C-terminal tail is too indistinct to allow model building in this region of the protein structure (Fig. 5C). Thus, the flexibility of the C-terminal tail and the conformation of the dynamic loops appear to be correlated. Note that a small C terminus fragment of five residues after Ala 495 is too flexible to be identified in all of the structures, so we could not exclude the possibility that these five residues are also interacting with the first dynamic loop.
Structures of TnDhp in Complexes with Hydantoin, NC␤I, and NC␤A-To explore the substrate-binding site and probe the interactions between the substrates and the residues essential for catalysis, we have co-crystallized TnDhp in complexes with hydantoin, DHT, and DHU, separately. Although both mono-Zn and di-Zn TnDhp are prepared for co-crystallization, only the structures of the complexes with two zincs in the active site could be successfully obtained (Fig. 6). This interesting result might reflect the fact that the substrate channel of the intermediate state in mono-Zn TnDhp does not contain barriers to restrict the diffusion of substrate and products to and respectively. The C-terminal tail from another monomer (orange) is shown, and the most dynamic residues are labeled. The carbamylated lysine 155 is labeled as Kcx155. B, the structure of apo-TnDhp shows that the dynamic loops (green) in the "open form" expose the active site (blue) to the molecular surface. The C-terminal tail from another monomer (orange) is shown in close proximity to the dynamic loop. C, the structure of di-Zn TnDhp shows that dynamic loops (yellow) in the "closed form" obstruct the active site. In this state, the C-terminal tail (orange) is too flexible to be modeled, because there is no electron density seen for the last seven residues. D, a side view of the active site with the dynamic loops in the open form. The cross-section bottleneck of the tunnel is about 10 Å (light green), which is wider than the average size of substrates (6 Å). The active site pocket surrounded by key residues is shown in blue. E, a side view of the active site with the dynamic loops in the closed form. The cross-section bottleneck is reduced dramatically to ϳ3 Å (yellow). The active site pocket is shown in blue. The two zinc ions are depicted as spheres.

Vertebrate DHP and Complexes from T. nigroviridis
from the active site. The hydantoin is found in the catalytic pocket (Fig. 6A) and interacts with two zinc atoms (with distances of 2.8 and 3.0 Å), the main chain of residue Gly 294 (ϳ3.8 Å), and the side chains of Tyr 160 (ϳ3.8 Å) and Asp 322 (ϳ3.4 Å). Phe 70 , which is located in the dynamic loop (Ala 69 -Arg 74 ), described above, is in van der Waals contact with the hydantoin (ϳ3.8 Å) (Fig. 5A) and stabilizes the substrate viastacking interactions.
From the co-crystallization experiments with the DHT and DHU, it is clear from the electron densities that the C4 -N3 bonds in these substrates have been broken to open up the dihydropyrimidine ring and generate the corresponding products NC␤I and NC␤A, respectively. These observations indicate that catalytic cleavage of the substrates has taken place, but the products could not be released with the two zinc atoms present in the active site (Fig. 6, B and C). Several residues are utilized for stabilizing the NC␤I as well as the NC␤A, as listed in Table 3. The distances from Tyr 160 to the N3 and O4 atoms of NC␤I are 3.0 and 2.7 Å, and the distances to the N3 and O4 atoms of NC␤A are 3.1 and 2.3 Å, respectively. From the structure of the NC␤I in complex with the TnDhp, we conclude that the stereo-selectivity of the catalysis is the L-conformation of the substrate.
Activity assays indicate that the TnDhp processes hydantoin with a low activity (ϳ330-fold lower compared with DHT or DHU), 4 in agreement with our structural results indicating that the hydantoin is maintained in the cyclic form, but both DHT and DHU have been hydrolyzed in the structures of the complexes. The positions of the NC␤I and NC␤A are similar in the structures (Fig. 6D), with the cutting sites pointing toward Tyr 160 . However, the cutting site of hydantoin is displaced ϳ2.5 Å away from Tyr 160 compared with DHT or DHU, which may explain the low activity of the di-Zn TnDhp toward hydantoin.
The Structure of Apo-TnDhp in Complex with the Substrate Analog ADA-Crystals of the apo-TnDhp have been also grown in the crystallization buffer containing ADA, which could be regarded as a substrate analog (Scheme 2). In this apo-TnDhp structure, the electron density clearly reveals that three ADA molecules are lined up in a channel from the entrance surface to  Vertebrate DHP and Complexes from T. nigroviridis OCTOBER 18, 2013 • VOLUME 288 • NUMBER 42 the active site (Fig. 7), highlighting the potential pathway(s) for substrate entry and product release. One ADA molecule (ADA3) is coordinated with the side chains of His 231 and Asn 343 and the main chains of Phe 338 and Thr 339 . A second ADA (ADA2) is located in the middle position and interacts with the side chains of Tyr 160 and Asn 343 and the main chains of Thr 339 , Ile 341 , and Asn 343 . The residue Tyr 160 in TnDhp is structurally equivalent to Tyr 152 in Sinorhizobium meliloti DHP, which has been proposed to be associated with the catalysis process by a mutagenesis study (39). The third ADA (ADA1) at the position close to the active site is coordinated with the side chain of Asp 322 and main chain of Gly 294 . Notably, this ADA1 molecule is located at the same position with an orientation comparable with that in the substrates hydantoin, NC␤I, and NC␤A described earlier in the structures of the substrate complexes (Fig. 7B). In addition, several hydrogen bonds mediated by water molecules are found to stabilize ADA molecules between residues and ADA molecules. The detailed interactions with the corresponding distances are listed in Table 3.

Structural Insights into the Stages of Lysine Carbamylation-
The structures with varying metal contents provide insights into the different stages of lysine carbamylation (Scheme 1). The first stage is depicted by apo-TnDhp, where Lys 155 is not carbamylated. The second stage (precarbamylation) might involve the formation of an intermediate with a carbon dioxide weakly associated with the ⑀-amino group of the lysine (49). This structure is expected to be quite labile, so that the process is reversible. The third stage corresponds to the carbamylation of the lysine, culminating in the structure of the mono-Zn TnDhp, in which the lysine has been carbamylated at the active site with only one metal. Although carbon dioxide exists at a concentration of ϳ50 ppm in the solution, and reversible car-bamylation on Lys 155 (Stage II of Scheme 1) can be expected, one zinc or cobalt ion must be positioned at the active site for CO 2 fixation (Stage III of Scheme 1). The final stage is represented by the structure of the di-Zn TnDhp, where a second metal ion has bound to the active site to stabilize the carbamylated lysine.
The Potential Role of the M␤ Metal Ion-Beside the catalytic metal ion at the M␣ site, the active site of TnDhp allows binding of a second metal ion at the M␤ site. Enzymes that equip the active site with full geometry as well as a second metal ion for regulatory function are observed in the amidohydrolase superfamily (50). It has also been reported that zinc ions at high concentrations could enhance the thermal stability of DHP (51). A structural comparison of apo-TnDhp and mono-and dimetal TnDhp suggests that the metal ion at the M␤ site regulates the size of the substrate tunnel (Fig. 5). Presumably, the M␤ metal site plays the role of controlling the two dynamic loops (Ala 69 -Arg 74 and Met 158 -Met 165 ) for substrate entrance, substrate locking, and product release. Upon the substrate entering the active-site pocket, the M␤ metal ion is simultaneously positioned at the M␤ site to alter the conformations of the dynamic loops to the "closed form" and potentially lock the substrate at the right position with Phe 70 for catalysis. After initiating the catalytic turnover, the M␤ metal ion would of SCHEME 2. A, the reactions of dihydrouracil (top) and dihydrothymine (bottom). B, structure of hydantoin. C, structure of ADA. course ultimately have to leave the M␤ site to generate an "open-form" substrate tunnel for the product release and new substrate entrance. Thus, the weak metal binding exhibited by the M␤ site is obligatory for its regulatory function, making the occupancy of the M␤ site by a metal ion observable only upon soaking crystals of the mono-Zn TnDhp with zinc ions at a high concentration. This phenomenon might resolve the puzzle as to why only one metal was detected previously in vertebrate DHPs with the full geometry (26,52). In the normal enzymatic state, the M␣ metal ion is tightly bound for catalysis, but the binding of metal to the M␤ site is weak in order to provide the protein with dynamic features to control the "open/closed" conformations of the two dynamic loops. It has been proposed that the metal ion at the M␤ site is loosely bound in DHP (5).
The Importance of Protein Oligomerization-The dimerization of TnDhp results in the C-terminal tail from one monomer reaching near the dynamic (Ala 69 -Arg 74 ) loop of another monomer (Figs. 2C and 5). This feature of the structure allows us to probe the function of the C terminus. The structural analysis reveals that the conformational stability of the C-terminal tail is in the following order: apo-TnDhp Ͼ mono-Zn TnDhp Ͼ di-Zn TnDhp. It seems that the flexibility of the C-terminal tail is correlated with the conformations of the dynamic (Ala 69 -Arg 74 ) loop. The open form of the dynamic loop is linked to the greater stability (low flexibility) of the C-terminal tail, whereas the low stability (high flexibility) of the C-terminal tail results in a closed form of the dynamic loop, implying that the protein is exploiting the flexibility of the C-terminal tail to control the size of the substrate tunnel. Therefore, our structural analysis indicates that the C-terminal tail dictates not only the dimerization of the protein but also the enzymatic activity. It is possible that dimerization of TnDhp by the C-terminal tail assists the opening of the two dynamic loops (SGL1 and SGL3) for the enzyme function.
Previous reports have shown that modifications to the C termini of DHP/HYD enzymes could decrease the enzyme activity (9,48,(53)(54)(55). In the D-HYD from B. stearothermophilus and Bacillus thermocatenulatus, the truncated enzymes with a deletion of 40 amino acids on the C terminus have lost 64% of their enzymatic activities (48). In Pseudomonas putida YZ-26, a series of deletions were performed in the D-HYD and all of the truncated enzymes exhibited either low or non-detectable activities (53,54). However, CD spectroscopy showed that these deletions of the C-terminal fragment did not alter the secondary structures of the overall folding (48,54). More recently, deletions on the C terminus of DHP from human, a vertebrate similar to T. nigroviridis, have been undertaken, and these mutants also show no detectable activity (9). Evidently, the C-terminal tail plays an important role for the activity of the enzyme.
Proposed Structure-based Mechanism; Monometal Active Site-According to our structures of apo-, monometal, and dimetal TnDhp, substrate complexes, and the ADA complex, we have gained important insights into the mechanism of action of the DHP family of enzymes. The first step of the catalytic turnover is to get the substrate into the active site. This state is highlighted by the apo-TnDhp and monometal TnDhp structures with the open form of the tunnel, which provides a suitable cross-section for the substrate to diffuse into the active site (Figs. 5D and 8A).
In order to maintain the substrate in the specific position for catalysis, some structural changes have occurred. Two dynamic loops (Ala 69 -Arg 74 and Met 158 -Met 165 ) with aromatic residues Phe 70 and Tyr 160 have moved closer to the active site, locking the substrates in the active site, as seen in the dimetal TnDhp with the tunnel characteristic of the closed form (Figs. 5E and 8B).
While the substrate is bound to the active site, some residues, such as Gly 294 and Asn 343 , are employed to stabilize the substrate. For substrates, such as DHT, that belong to 5-monosubstituted dihydropyrimidines, the stereo-selectivity of DHP is the L-configuration, not the D-configuration, as seen in the structure of the NC␤I complex (Fig. 6B).
Catalytic mechanisms involving both one zinc and two zinc ions have been proposed for the DHP proteins (5,37,55). The mechanism with one zinc ion was first proposed for the DHP from calf and pig liver (55). In this proposed mechanism, the zinc ion acts as a Lewis acid, and an amino acid residue behaves both as a general base and acid to mediate the catalysis. The substrate is bound to the metal ion (Lewis acid) through the 4-oxo group; the general base, most likely the aspartic acid (Asp 322 in TnDhp), activates a water molecule for the nucleophilic attack at the C-4 carbon of the substrate, generating a tetrahedral intermediate, which is stabilized by a tyrosine residue (Tyr 160 in TnDhp) (5). The ring of the substrate in the tetrahedral intermediate then opens up to yield the products upon general acid protonation of the ring nitrogen using the same aspartic acid residue. On the other hand, the two-zinc mechanism of DHP proposed is based on the mechanism of dihydroorotase (24,38), because it is generally believed that these two protein systems share the similar catalytic mechanism (5,39). In this scheme, the M␤ zinc interacts with the carbonyl oxygen (O4) of the substrate. This is followed by nucleophilic attack through the hydroxyl group of the water molecule and assisted by the carboxylate of the aspartic acid. Both the M␣ and the M␤ zinc ions as well as a tyrosine residue (Tyr 160 in TnDhp) (5) are used to stabilize the tetrahedral intermediate state. After protonation of the amide nitrogen of the substrate assisted by the same aspartic acid, the product is generated and coordinates with the two zinc ions.
Indeed, our activity and structure analyses of TnDhp described earlier show that one stable metal in the active site could achieve the maximum activity. Thus, it seems that the one-metal mechanism is operative in the catalysis mediated by the vertebrate DHP. However, according to our results, the metal in the M␤ site is only loosely bound in the absence of substrate. The regulatory role we discussed earlier for this metal could well be related to the binding of substrate and subsequent catalysis (Fig.  8C). Hence, it might be that only one metal is detected in the active site in vertebrate DHPs, but for the biochemistry, the second metal is essential, and it is conceivable that the binding of the second metal and of the substrate reinforce each other to promote the catalysis. On the other hand, as a ligand of the M␣ metal, the role of Asp 322 as a general base/acid is diminished. Instead, Tyr 160 stabilizing the intermediate state may be important in the catalysis turnover.

Vertebrate DHP and Complexes from T. nigroviridis
After the catalytic reaction, the product can only be released through the same tunnel. To change the tunnel from the closed to the open form, extra forces will be needed to entice the dynamic loops to return to the open state. According to our structures, the C-terminal tail from the other monomer is positioned near the dynamic loop and the active site. This C-terminal region has high temperature B-factors in the x-ray structure, so the dynamic loop could potentially open via thermal fluctuations (Fig. 5, B and C). While the dynamic loop is open, the product (and possibly the second metal as well) could diffuse out of the active site, following the pathway(s) of the three ADA molecules in the tunnel of the apo-TnDhp structure (Figs. 7 and 8D). Subsequently, another cycle of the reaction can be repeated with a new substrate entering into the active site.
In summary, we have demonstrated the possible existence of four stages of the metal-mediated lysine carbamylation with various conformational changes at the active site for the biological function of a vertebrate DHP. Lysine carbamylation results in a tightly bound metal ion at the M␣ site, which participates in both CO 2 fixation and catalysis, and a weakly bound metal ion at the M␤ site introduces specific dynamic properties into the protein structure to allow the regulation of the size of the sub-strate/product tunnel. This information provides a basis to explain the longstanding question in the literature on the varying number of metal ions associated with the carbamylated lysine in this class of enzymes.