![]()
|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 47, 36149-36161, November 24, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

¶




¶

1
¶2
¶3
From the
Banting and Best Department of Medical Research and the
Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L6, Canada, the ¶Ontario Center for Structural Proteomics, Ontario Cancer Institute, Toronto, Ontario M5G 2C4, Canada, the ||National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, the **Graduate School of Biological Sciences, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara 630-0101, Japan, and 
Structural Genomics Consortium, University of Toronto, Toronto, Ontario M5G 1L6, Canada
Received for publication, June 7, 2006 , and in revised form, September 21, 2006.
| ABSTRACT |
|---|
|
|
|---|
-phosphoglucomutase, phosphonatase, and dehalogenase activities. Using a representative set of 80 phosphorylated substrates, we characterized the substrate specificities of 23 soluble HADs encoded in the Escherichia coli genome. We identified small molecule phosphatase activity in 21 HADs and
-phosphoglucomutase activity in one protein. The E. coli HAD phosphatases show high catalytic efficiency and affinity to a wide range of phosphorylated metabolites that are intermediates of various metabolic reactions. Rather than following the classical "one enzyme-one substrate" model, most of the E. coli HADs show remarkably broad and overlapping substrate spectra. At least 12 reactions catalyzed by HADs currently have no EC numbers assigned in Enzyme Nomenclature. Surprisingly, most HADs hydrolyzed small phosphodonors (acetyl phosphate, carbamoyl phosphate, and phosphoramidate), which also serve as substrates for autophosphorylation of the receiver domains of the two-component signal transduction systems. The physiological relevance of the phosphatase activity with the preferred substrate was validated in vivo for one of the HADs, YniC. Many of the secondary activities of HADs might have no immediate physiological function but could comprise a reservoir for evolution of novel phosphatases. | INTRODUCTION |
|---|
|
|
|---|
Haloacid dehalogenase (HAD)4-like hydrolases (3) represent the largest family of predicted small molecule phosphatases encoded in the genomes of bacteria, archaea, and eukaryotes, with 6,805 proteins in data bases. The great majority of these proteins have no known biochemical or biological function. In any individual genome, the number of HAD genes can range from 10 to 20 in different bacteria to 100 in humans and 115 in Arabidopsis thaliana (InterPro data base). HADs share little overall sequence similarity (15-30% identity), but they can be unequivocally identified by the presence of three short conserved sequence motifs (3) (supplemental Fig. 1). Most of the characterized HADs have phosphatase activity (CO-P bond hydrolysis), and several also catalyze dehalogenase (C-halogen bond hydrolysis), phosphonatase (C-P bond hydrolysis), and
-phosphoglucomutase (CO-P bond hydrolysis and intramolecular phosphoryl transfer) reactions (3, 4). The biochemically and structurally studied HADs include phosphoserine phosphatase SerB from Methanococcus jannaschii (5), phosphoglycolate phosphatase from Thermoplasma acidophilum (6), phosphonacetaldehyde hydrolase from Bacillus cereus (7),
-phosphoglucomutase from Lactococcus lactis (8), haloacid dehalogenases from Pseudomonas sp. YL (9), and Xanthobacter autotrophicus (10), and two E. coli phosphatases, YbiV and NagD (11, 12). However, the vast majority of HADs remains uncharacterized. Since these enzymes generally show little sequence similarity, the catalyzed reaction and, especially, the substrate specificity are hard to predict on the basis of sequence conservation and have to be determined experimentally.
The E. coli genome encodes five membrane-bound and 23 soluble HAD-like hydrolases, representing
40% of the E. coli proteins with known or predicted small molecule phosphatase activity (1). None of the E. coli HADs is essential for bacterial growth (13). The physiological substrates have been experimentally identified for three soluble E. coli HADs, namely phosphoglycolate phosphatase Gph, 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase YrbI, and trehalose 6-phosphatase OtsB (14-16) Only three enzymes (Gph, YrbI, and NagD) have been characterized biochemically (12, 14, 15), and three-dimensional structures have been resolved for YbiV and NagD (11, 12).
Using a set of 80 representative phosphorylated metabolites, we characterized the substrate specificities of all 23 soluble E. coli HADs and found that they comprise a family of promiscuous phosphatases with overlapping substrate profiles and are capable of hydrolyzing a wide range of phosphorylated metabolites, including carbohydrates, nucleotides, organic acids, coenzymes, and small phosphodonors. Genetic analysis demonstrated that the activity of one of the HADs, YniC, toward its preferred substrate was biologically important. We further show that all E. coli HADs have phosphatase activity against small phosphate donors (acetyl phosphate, carbamoyl phosphate, phosphoramidate), which resembles the autophosphorylation reaction catalyzed by CheY fold receiver domains of the two-component regulatory systems. Together with the previously reported structural similarity, these results indicate that the HAD superfamily and the receiver domain originate from an ancestral low specificity phosphatase. Clustering of E. coli HADs on the basis of their phosphatase activities (kcat/Km) was incongruent with the sequence-based phylogeny. Thus, many of the secondary activities of HADs might be of no immediate functional importance but comprise a reservoir for evolution of phosphatases with novel specificities.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
genomic DNA and cloned into a modified pET15b (Novagen) as previously described (17). Several HADs (YbjI, YaeD, and YrbI) were cloned from the E. coli K12 W3110 genomic DNA into the archive vector pCA24N (Genobase data base; available on the World Wide Web at ecoli.aist-nara.ac.jp). Purification of proteins for screening and biochemical characterization was performed as previously described (18). NagD was expressed in an insoluble form and was partially refolded from the inclusion bodies by a buffer exchange method using the Spin-Column Protein Folding Screen kit (SFC01-10) and the column PFC02 from ProFoldin Protein Folding Services according to the manufacturer's instructions (available on the World Wide Web at www.profoldin.com).
Enzymatic Screens and AssaysGeneral phosphatase screens with p-nitrophenyl phosphate (pNPP) as substrate and natural substrate phosphatase screens with 80 phosphorylated compounds from Sigma (supplemental Table 1) were performed as previously described (18). Acetyl-phosphatase activity was assayed by measuring the acetyl-phosphate concentration using the hydroxylamine protocol of Lipmann and Tuttle (19). The production of fructose in enzymatic reactions was determined using an enzyme-coupled assay with fructose dehydrogenase (F5152; Sigma), essentially as previously described (20). This assay was adapted for 96-well microplates (200-µl reaction mixtures). Haloacid dehalogenase activity was determined spectrophotometrically by measuring the release of halide ions using the mercuric thiocyanide method (21). The assays were adapted to the 96-well format (125 µl) and contained 50 mM CHES buffer (pH 9.0), 10 mM substrate, and 5 µg of protein. Six compounds were used as substrates (R-chloropropionic acid, S-chloropropionic acid, bromoacetic acid, 4-chlorobenzoic acid, 2,2-dichloropropionic acid, and 2-bromopropionic acid), and L-2-haloacid dehalogenase from Pseudomonas sp. YL (22) was used as a positive control. Phosphonatase activity was assayed using phosphonoacetate as a substrate by measuring the release of inorganic phosphate using the Malachite Green reagent as previously described (23).
-Phosphoglucomutase activity was determined using a glucose-6-phosphate dehydrogenase-coupled assay and 1 mM
-glucose 1-phosphate as substrate (24).
For Km and Vmax determination, the phosphatase assays contained substrates at concentrations 0.005-2.0 mM. Kinetic parameters were determined by nonlinear curve fitting from the Lineweaver-Burk plot using the GraphPad Prism software (version 4.00 for Windows, GraphPad Software, San Diego, CA).
Mutagenesis and Growth ExperimentsThe yniC gene was deleted from the chromosome of the E. coli K-12 W3110 strain using a one-step inactivation method described by Datsenko and Wanner (25). The obtained
YniC strain contains an unmarked gene deletion, which was verified using PCR. The YniC-overexpressing strain was prepared by subcloning (PCR) of the wild-type yniC into the KpnI/HindIII sites of the arabinose-inducible vector pBAD-33 (26). The resulting plasmid (pKC1) and the empty (no insert) vector pBAD-33 (a control) were transformed into the wild-type W3110 strain. Cells were grown aerobically at 37 °C on the MOPS-buffered minimal medium containing 0.2% of succinate as a carbon source, and the expression of YniC was induced by the addition of 0.02% arabinose. The culture growth (A600) was determined after 14 h of cultivation.
Bioinformatic AnalysesHierarchical clustering of HADs (based on their substrate profiles) and substrates (based on their HAD spectra) was calculated using cosine correlations, and groups were clustered using the average method (R Foundation for Statistical Computing; available on the World Wide Web at www.R-project.org). For hierarchical clustering of proteins across HAD substrates, we consider each protein as a point in the m-dimensional space, where m designates the total number of HAD substrates. Each protein can then be represented as a vector A = (s1,... sm) of length |A| and unit (normalized) vector A'(s1' = s1/|A|,.. .sm' = sm/|A|). We use the cosine angle between normalized vectors as a similarity measure between proteins across substrates. The protein profiles across substrates are clustered into groups using the average linkage method (R Foundation for Statistical Computing; available on the World Wide Web at www.R-project.org). For hierarchical clustering of substrates across proteins, we consider each protein as a point in the n-dimensional space, where n designates the total number of proteins. Each substrate can then be represented as a vector B = (p1,... pn) of length |B| and unit (normalized) vector B'(p1' = p1/|A|,... pn' = pn/|B|). We use the cosine angle between normalized vectors B' as a similarity measure between substrates across proteins. The substrate profiles across proteins are clustered into groups using the average linkage method (R Foundation for Statistical Computing; available on the World Wide Web at www.R-project.org).
The distance between the catalytic efficiency (kcat/Km) profiles of 18 enzymes acting on 26 different substrates (see Table 2) was computed using the square distance approach (see supplemental materials and "Experimental Procedures"). A neighbor-joining tree was constructed from the distance matrix using the NEIGHBOR program of the PHYLIP package (28). Multiple alignments of amino acid sequences were constructed using the MUSCLE program (29) and optimized manually to ensure the correct alignment of known sequence motifs of the HAD superfamily (3). Sequence-based phylogenetic trees were constructed using the following methods: (i) neighbor-joining method, as implemented in the NEIGHBOR program of the PHYLIP package (28), (ii) least squares method as implemented in the FITCH program of the PHYLIP package (28), (iii) local maximum likelihood optimization of the least squares tree using the ProtML program of the MOLPHY package (30), (iv) quartet puzzling as implemented in the TREE-PUZZLE program (31), and (v) Markov chain Monte Carlo Bayesian estimation using the MrBayes program (32, 33).
|
| RESULTS |
|---|
|
|
|---|
15 to 30% identity), but they can be identified by the presence of three short conserved sequence motifs (3) (supplemental Fig. 1). 22 soluble HADs encoded by the E. coli genome were purified to homogeneity (supplemental Fig. 2) and screened for phosphatase, dehalogenase, phosphonatase, and
-phosphoglucomutase activities. 16 previously uncharacterized HADs showed detectable activity with pNPP, a general phosphatase substrate (18) (Table 1). All of these HADs required the presence of a divalent metal cation for activity (Mg2+ > Co2+ > Mn2+ > Ni2+) and showed a broad range of activities and affinities to pNPP (Vmax 0.03-4.95 µmol/min mg of protein; Km = 0.92-17.8 mM). Several proteins, YcjU (HAD11), YigB (HAD7), OtsB (HAD17), YedP (HAD19), YaeD (HAD20), and HisB (HAD21), had no activity against pNPP. YcjU (HAD11) has been annotated in sequence data bases as a putative
-phosphoglucomutase, and indeed, we detected this activity using a specific assay with
-glucose-1-P as a substrate (3.81 ± 0.13 µmol/min/mg of protein). Low
-phosphoglucomutase activity (0.02-0.15 µmol/min/mg of protein) was also detected in YihX (HAD4), YqaB (HAD6), YbhA (HAD14), Cof (HAD18), Gph (HAD10), and YaeD (HAD20). Screening of purified E. coli HADs for hydrolytic activity against phosphonoacetate (phosphonatase substrate) or several haloacid dehalogenase substrates (chloroacetate, bromoacetate, chloropropionate, dichloropropionate, bromopropionate, and chlorobenzoate) produced no positive results (data not shown). Thus, most of the soluble E. coli HADs are phosphatases.
|
-phosphoglucomutase, whereas HisB (predicted histidinol phosphatase) and YedP might be phosphatases that are highly specific to their respective substrates. The remaining 20 HADs showed significant phosphatase activity toward a variety of substrates, in many cases, at levels predicted to be of physiological relevance (Fig. 1). To keep the background low, the screens with natural substrates were performed with substrate concentrations of 0.1-0.2 mM, which were subsequently found to be nonsaturating for most HADs (Table 2). Therefore, the observed velocities do not represent the maximum activities of these enzymes. Nevertheless, these screens not only identified the positive substrates and revealed the broad substrate ranges for most HADs but also correctly identified their preferred substrates (as shown by the subsequent saturation experiments presented in Table 2). The broad substrate spectra of HADs could not be accounted for by the presence of contaminating activities in purified preparations. Further extensive purification of HADs did not eliminate their activities toward different substrates, and alanine replacement mutagenesis of the catalytic Asp residues of two HADs, YniC and YidA, resulted in at least 99.9% abrogation of phosphatase activity.
|
|
Metabolites Hydrolyzed by the E. coli HADsE. coli HADs hydrolyze a wide range of phosphorylated metabolites, including carbohydrates, nucleotides, organic acids, and coenzymes. Hierarchical clustering of the HAD phosphatase activities (observed in the screens) against 52 substrates showed that phosphorylated carbohydrates represent the major group of the HAD substrates (Fig. 2A). The most common substrates for these enzymes were fructose-1-P, glucose-6-P, mannose-6-P, 2-deoxyglucose-6-P, fructose-6-P, ribose-5-P, and erythrose-4-P (Fig. 2A). PLP and FMN also appear to be common substrates of HADs. In addition, hierarchical clustering recognized a group of three nucleotidases, YrfG (HAD8), YjjG (HAD5), and YieH (HAD3). YrfG preferentially hydrolyzed purine nucleotides (GMP and IMP), and YjjG preferred pyrimidines (UMP, dUMP, and dTMP), whereas YieH hydrolyzed both purines and pyrimidines as secondary substrates. The fourth nucleotidase, NagD (HAD23) (which was only partially refolded and, therefore, was not included in the clustering), had an unusually broad substrate range and hydrolyzed deoxyribo- and ribonucleoside tri-, di-, and monophosphates, as well as polyphosphate and glucose-1-P (Fig. 1). Nucleotidase activity of E. coli NagD has been recently described by others, and a broad substrate range of this enzyme, with the highest catalytic efficiency toward nucleoside monophosphates, has been demonstrated (12).
Hierarchical clustering of 52 metabolites based on their ability to serve as substrates for the HADs separated these compounds into distinct groups of structurally related molecules (Fig. 2B). The dendrogram in Fig. 2B, which was based solely on the similarity of the enzyme profiles, grouped together hexoses (mannose-6-P, glucose-6-P, and glucosamine-6-P), pentoses (ribose-5-P and ribulose-5-P), purine nucleotides (GMP, dGMP, dAMP, and IMP), pyrimidine nucleotides (UMP, dUMP, dTMP, TDP, and UDP), pyrophosphate-containing metabolites (thiamine pyrophosphate, 5-phosphoribosyl-l-pyrophosphate, PPi, and polyphosphate), and amino acids (P-serine and P-threonine). This indicates that structurally related substrates are recognized by similar sets of HADs, demonstrating a direct relationship between the chemical structure of HAD substrates and their biological activity (HAD spectra).
The metabolites hydrolyzed by the E. coli HADs are intermediates of various metabolic pathways and reactions (glycolysis, pentose phosphate pathway, gluconeogenesis, and intermediary sugar and nucleotide metabolism). YbiV (HAD12), YfbT (HAD2), YqaB (HAD6), and YidA (HAD13) showed high phosphatase activity with fructose-1-P (Table 2, Fig. 1). In E. coli, this metabolite is produced by the PEP-dependent phosphotransferase system PtsI, PtsH, FruA, and FruB during transport of extracellular fructose across the cytoplasmic membrane (44, 45). At the next step, 1-phosphofructokinase FruK (EC 2.7.1.56 [EC] ) phosphorylates fructose-1-P, yielding the glycolytic intermediate fructose 1,6-bis-P (46). Analysis of the kinetic parameters of four HADs demonstrated that YidA (HAD13) had the lowest Km (highest affinity) to this substrate (Km = 0.39 ± 0.03 mM), whereas YbiV (HAD12) had the highest catalytic activity and efficiency (Table 2). YrfG (HAD8) preferentially hydrolyzed purine nucleotides (GMP, IMP), whereas YjjG (HAD5) preferred pyrimidines (UMP, dUMP, and dTMP). The observed substrate affinities of YrfG and YjjG (Km = 0.64-2.44 mM; Table 2) were within the range reported for other characterized nucleotidases (0.01-22 mM; BRENDA data base). NagD (HAD23) preferentially hydrolyzed purine nucleotides (IMP, UMP, and GMP) (12) (Fig. 1). For YieH (HAD3), Cof (HAD18), and YigL (HAD16) nucleotides were identified as secondary substrates (Fig. 1). Thus, like eukaryotes, E. coli has multiple intracellular nucleotidases with different substrate specificities, namely YrfG, YjjG, NagD, Cof, YieH, and the previously characterized SurE and YfbR (47). These E. coli nucleotidases can dephosphorylate a wide range of 2'-, 3'-, and 5'-nucleotides and, together with nucleoside kinases (Gsk, Tdk, and Udk), can establish regulatory substrate cycles controlling the nucleotide levels in the bacterial cell, similar to those described in mammals (48).
Several HADs catalyzed the dephosphorylation of two important coenzymes, pyridoxal phosphate (PLP) and FMN. PLP was the preferred substrate for E. coli Cof (HAD18) and YbhA (HAD14), and several other HADs (YbiV (HAD12), YihX (HAD4), YbjI (HAD15), and Gph (HAD10)) showed significant phosphatase activity with this molecule (0.2-1.44 µmol of Pi/min/mg of protein) (Table 2, Fig. 1). In mammalian cells, the intracellular level of PLP is regulated, to a large extent, by the PLP phosphatase (49-51), which is also an enzyme of the HAD superfamily and has Km and Vmax for this substrate similar to those determined here for E. coli HADs.
Four E. coli HADs (YbjI (HAD15), YigB (HAD7), Cof (HAD18), and YrfG (HAD8)) were capable of dephosphorylating FMN, an activity without EC number assigned (Table 2, Fig. 1). The KEGG data base indicates that in E. coli, three acid phosphatases are involved in the dephosphorylation of FMN: AppA, SurE, and AphA. However, two of them (AppA and AphA) are periplasmic nonspecific phosphatases and, therefore, not likely to be involved in the control of the intracellular FMN level, whereas no FMN dephosphorylating activity has been found in SurE in our recent work (47). We have found that FMN was a preferred substrate for two HADs, YbjI (HAD15) and YigB (HAD7) (Table 2, Fig. 1), and they represent more likely candidates for the FMN dephosphorylation function in E. coli cells. Generally, we suggest that the activity of several HADs could be important in the regulation of the intracellular levels of PLP and FMN in E. coli.
We identified four HADs (YaeD (HAD20), YbhA (HAD14), YidA (HAD13), and YbiV (HAD12)) with fructose 1,6-bis-P dephosphorylation activity (Table 2, Fig. 1). Fructose 1,6-bisphosphatase (EC 3.1.3.11 [EC] ) catalyzes the second irreversible step in gluconeogenesis producing fructose-6-P that is further converted into glucose-6-P by phosphoglucose isomerase Pgi (52). Two fructose 1,6-bisphosphatases have been characterized in E. coli, namely Fbp and GlpX, both members of the inositol polyphosphate 1-phosphatase-like superfamily (53, 54). Like Fbp and GlpX, YaeD (HAD20) and YbhA (HAD14) hydrolyzed this metabolite to yield fructose-6-P (data not shown). By contrast, YidA (HAD13) and YbiV (HAD12) yielded fructose as the final product of fructose 1,6-bis-P hydrolysis (data not shown), which is compatible with their ability to dephosphorylate both fructose 1- and 6-phosphates (Fig. 1). YaeD (HAD20) showed higher affinity to fructose 1,6-bis-P than YbhA (HAD14), whereas the latter had higher activity and catalytic efficiency (kcat/Km) with this substrate (Table 2). Given that the reported intracellular concentration of fructose 1,6-bis-P in E. coli (3-6 mM) (55-57) is significantly higher than the Km of YaeD (HAD20) and YbhA (HAD14) (Table 2), the phosphatase activity of both proteins could affect the in vivo pool of this crucial metabolite.
Three E. coli HADs showed high phosphatase activity with
-D-glucose-1-P (YidA (HAD13) and YihX (HAD4)) or
-D-glucose-1-P (YfbT (HAD2)) (Table 2, Fig. 1). These proteins were strictly specific to one anomer of glucose-1-P (
or
) and did not hydrolyze the other anomer. In E. coli, the
-form is produced from glycogen by glycogen phosphorylase GlgP or in other reactions of pentose, glucuronate, or nucleotide sugar metabolism, whereas
-D-glucose-1-P might be produced by yet unidentified maltose or trehalose phosphorylases (58). Both anomers can be used in various biosynthetic reactions or channeled to glycolysis or the Entner-Doudoroff pathway through conversion into the respective
- and
-glucose 6-phosphates by specific phosphoglucomutases,
-phosphoglucomutase Pgm (EcoCyc data base) or
-phosphoglucomutase YcjU (this work). E. coli also makes the
-glucose 1-phosphatase Agp, a periplasmic enzyme that acts primarily as a glucose scavenger (59). Although, in the KEGG data base, this periplasmic phosphatase is annotated as an enzyme responsible for intracellular dephosphorylation of
-glucose-1-P, our results indicate that both YihX (HAD4) and YidA (HAD13) have at least 2 times lower Km (higher affinity) to this substrate, and therefore, these enzymes are likely to be the principal phosphatases involved in the intracellular metabolism of
-glucose-1-P in E. coli. YfbT (HAD2) is the first phosphatase found to selectively hydrolyze
-D-glucose-1-P and showing no activity with the
-form of this substrate.
Finally, many HADs demonstrated phosphatase activities against various intermediates of two central metabolic pathways, glycolysis (gluconeogenesis) and the pentose phosphate pathway (glucose-6-P, fructose-6-P, dihydroxyacetone phosphate, 3-phosphoglyceraldehyde, 3-phosphoglycerate, 2-phosphoglycerate, PEP, 6-P-gluconate, erythrose-4-P, ribose-5-P, ribulose-5-P) (Fig. 1). In several cases, E. coli HADs showed high affinities with these substrates (Table 2), comparable with their known intracellular concentrations. The Km values of YidA (HAD13) and YfbT (HAD2) for glucose-6-P, fructose-6-P, and ribose-5-P (0.44-2.23 mM; Table 2) were comparable with the known intracellular concentrations of these metabolites (0.18-2.0 mM glucose-6-P, 0.54-0.91 mM fructose-6-P, and 0.15-0.63 mM ribose-5-P) in E. coli (55, 60, 61). The Km of YieH (HAD3) to PEP (2.89 mM) and of OtsB (HAD17) to trehalose-6-P (0.61 mM) were also within the range of the known physiological concentrations of these metabolites (0.15-8.63 mM for PEP and 0.76 mM for trehalose-6-P) (55, 60-62). Therefore, the in vivo activities of HADs could potentially modulate the metabolite pools and metabolic fluxes through a broad variety of pathways.
HADs Hydrolyze Acetyl Phosphate and Other Small PhosphodonorsThe promiscuity of the HAD phosphatase family raises interesting questions regarding the evolution of the activities and the emergence of new activities from ancestral ones. One clue into the evolution of the HAD family comes from structural analysis of phosphoserine phosphatase from M. jannaschii, which showed significant structural similarity to CheY, a receiver domain/response regulator of the two-component signal transduction system CheA-CheY (63). In two-component systems, the receiver domains of response regulator proteins are either phosphorylated by their cognate sensor histidine kinases or autophosphorylate using acetyl phosphate (acetyl-P), carbamoyl phosphate (carbamoyl-P), or phosphoramidate as phosphodonors (64, 65). The structural similarity and the inferred common ancestry of response regulators and the HADs prompted us to test if any of the E. coli HADs were able to hydrolyze small phosphodonors. All purified E. coli HADs, including even YcjU (HAD11), a
-phosphoglucomutase, showed various levels of Mg2+-dependent activity toward these substrates (imidodiphosphate was used as a phosphoramidate substrate) (Table 1). Most proteins showed higher activities (
1 µmol/min/mg of protein) with acetyl-P and lower activities with carbamoyl-P or imido-di-P, but YidA (HAD13), YbiV (HAD12), and Gph (HAD10) had high activity with all three substrates (Table 1). To investigate whether these small phosphodonors are specific substrates for HAD phosphatases, we purified seven non-HAD phosphatases from E. coli (CCA, YfdR, YcdX, SurE, CysQ, GlpX, and YfbR) and eight HADs from other organisms (Cof, YigL, and YrfG from Salmonella typhimurium; PA0006, PA0065, and PA3172 from Pseudomonas aeruginosa; RPA3639 and RPA4337 from Rhodopseudomonas palustris) and assayed them for phosphatase activity against these compounds. Although all of these proteins showed high phosphatase activity against pNPP, only HAD phosphatases were active toward small phosphodonors. Thus, the phosphohydrolase activity toward acetyl-P, carbamoyl-P, and imido-di-P is a specific property of the HADs.
With acetyl-P or imido-di-P as substrates, YidA (HAD13), YihX (HAD4), YbiV (HAD12), SerB (HAD9), and Gph (HAD10) demonstrated classical Michaelis-Menten kinetics with higher activity toward acetyl-P (except for Gph) and lower Km to imido-di-P (Table 2). With carbamoyl-P, all tested HADs (YidA, YbiV, YqaB, and Gph) showed no saturation, indicating that substrate binding was the limiting step in the hydrolysis of this substrate. The Km values of the E. coli HADs with acetyl-P (3.64-8.93 mM) were substantially higher than the reported intracellular concentrations of acetyl-P (0.04-1.3 mM) (66). Therefore, the physiological role of the acetyl phosphatase activity of these enzymes remains uncertain. The ability of the E. coli HADs to hydrolyze imido-di-P (a phoshoramidate-like substrate, N-P bond hydrolysis) is a previously unknown catalytic activity of these enzymes. This reaction yields free
(data not shown), indicating that HADs also hydrolyze phosphoramidate, the immediate product of imido-di-P hydrolysis. Very recently, the first microbial phosphoramidase, the E. coli HinT, was identified and characterized, but its physiological function remains unknown (67). Eight more E. coli enzymes can now be annotated with this activity: YieH (HAD3), YjjG (HAD5), Gph (HAD10), YbiV (HAD12), YidA (HAD13), YbhA (HAD14), YigL (HAD16), and YaeD (HAD20).
In Vivo Validation of an in Vitro ActivityTo determine whether any of the in vitro activities observed in our experiments represent the corresponding biologically relevant reactions, we performed genetic and physiological experiments for a representative in vitro phosphatase activity against 2-deoxyglucose-6-P observed in several E. coli HADs. 2-deoxyglucose-6-P is a toxic analogue of glucose-6-P, and its intracellular concentration can be very high (up to 100 mM) (68). YniC (HAD1) showed higher catalytic efficiency (kcat/Km) and affinity (Km = 0.61 mM) to this substrate than YigL (HAD16) and Cof (HAD18) (Table 2), suggesting that in vivo YniC might be the principal phosphatase responsible for the hydrolysis of this compound. It has been shown previously that 2-deoxyglucose is taken up by E. coli and phosphorylated to yield 2-deoxyglucose-6-P (69). The E. coli yniC deletion mutant was much more sensitive to the presence of 2-deoxyglucose in the growth medium (IC50 = 0.02 mM) than the wild-type strain (IC50 = 0.59 mM) (Fig. 3). By contrast, the YniC-overproducing strain grew well even in the presence of 20 mM 2-deoxyglucose (Fig. 3). These results indicate that the 2-deoxyglucose 6-phosphatase activity of YniC (HAD1) plays an important role in the resistance of E. coli to 2-deoxyglucose and suggest that the in vitro activities of HADs toward their preferred and at least some of the secondary substrates reflect their functionally relevant in vivo activities.
|
|
The present work might offer insights into even more ancient history of the HAD hydrolases and their evolutionary progenitors. We showed that small intracellular phosphodonors (acetyl-P, carbamoyl-P, imido-di-P) are universal substrates for all E. coli HADs (as well as for HADs from other organisms) but not for any of the other phosphatases we were able to test (Table 1). This functional similarity meshes well with the monophyly of the HADs and the receiver domain superfamily, which together form a distinct branch in the evolutionary tree of the Rossmann fold domains (70), because the receiver domains can autophosphorylate using the same small phosphodonors. Conceivably, the common ancestor of the HADs and the receiver domains was a broad specificity phosphatase that could use as substrates, at least, small phosphodonors and various phosphorylated sugars (Fig. 5). Subsequently, after duplication, the evolving HAD and receiver domain families apparently retained different subsets of the multiple functions of the ancestor (71, 72) (subfunctionalization), which, in some cases, was followed by neofunctionalization (i.e. emergence of new specificities in some of the paralogs) (73). Thus, the receiver domains retained the ability to hydrolyze small phosphodonors and evolved the activity against another N-P bond that occurs in the phosphohistidine of phosphorylated histidine kinases (Fig. 5). In contrast, the HADs expanded, through a series of duplications, the repertoire of phosphatase activities against a broad variety of metabolites, as demonstrated by their broad substrate profiles (Fig. 5). Conceivably, the more specialized phosphatases like OtsB, SerB, or Gph were locked into their narrow specificities at a later stage of evolution. Dehalogenase, phosphonatase, and protein phosphatase activities observed in several HADs from other organisms (7, 9, 10, 74), but not found in E. coli, represent further substrate specialization of the HADs. The present findings on the substrate profiles of the HAD superfamily phosphatases emphasize the enormous and still largely unexplored versatility of enzymes that can evolve within the same structural scaffold through duplication, subfunctionalization, and neofunctionalization (75).
| DISCUSSION |
|---|
|
|
|---|
|
A fundamental question is whether the secondary activities of HADs (and other promiscuous enzymes) are biologically relevant and maintained by selection or represent functionally neutral variations. The lack of congruence between the phylogenetic tree of the HADs and the clustering of their activities (kcat/Km or substrate profiles) (Fig. 4) seems to be most compatible with the latter possibility. However, even if most of the secondary specificities are not directly maintained by selection, the breadth of the substrate spectra of HADs might be crucial in evolutionary terms, since they supply the pool from which new specificities could evolve. The ability of HADs to hydrolyze different types of bonds and their substrate promiscuity described here might have contributed to the evolution of new enzymatic specificities involved in the regulation of bacterial metabolism and environmental response.
Note Added in ProofWhile this manuscript was under review, Burroughs et al. (Burroughs, A. M., Allen, K. N., Dunaway-Mariano, D., and Aravind, L. (2006) J. Mol. Biol. 361, 1003-1034) reported the results of the sequence- and structure-based analysis of the entire HAD superfamily. These authors presented a comprehensive classification of the HAD superfamily based on their sequences, structures, phyletic distribution patterns, and phylogenetic tree analysis. Consistent with our model, the authors suggested that the diversification of the substrate specificities of the primordial HAD hydrolases provided the raw evolutionary material for the development of new enzymes. Additionally, we want to mention two more works, which also pointed out the structural similarity between HADs and the receiver domain of two-component regulatory systems (Ridder, I. S., and Dijkstra, B. W. (1999) Biochem. J. 339, 223-226; Meng, E. C., Polacco, B. J., and Babbitt, P. C. (2004) Proteins 55, 962-976).
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains supplemental Figs. 1 and 2 and Tables 1-3. ![]()
1 A Scientist of the Canadian Institutes of Health Research. ![]()
2To whom correspondence may be addressed: Banting and Best Dept. of Medical Research, University of Toronto, 112 College St., Rm. 117, Toronto, Ontario M5G 1L6, Canada. Tel.: 416-946-3436; Fax: 416-978-8528; E-mail: aled.edwards{at}utoronto.ca. 3To whom correspondence may be addressed: Banting and Best Dept. of Medical Research, University of Toronto, 112 College St., Rm. 024, Toronto, Ontario M5G 1L6, Canada. Tel.: 416-946-0075; Fax: 416-946-0078; E-mail: a.iakounine{at}utoronto.ca.
4 The abbreviations used are: HAD, haloacid dehalogenase-like hydrolase; CHES, 2-(cyclohexylamino)ethanesulfonic acid; PLP, pyridoxal 5'-phosphate; PEP, phosphoenolpyruvate; MOPS, 4-morpholinepropanesulfonic acid; pNPP, p-nitrophenyl phosphate. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|