Originally published In Press as doi:10.1074/jbc.C600231200 on September 21, 2006
J. Biol. Chem., Vol. 281, Issue 45, 33830-33834, November 10, 2006
First Structure of a Eukaryotic Phosphohistidine Phosphatase*
Robert D. Busam,
Ann-Gerd Thorsell,
Alex Flores,
Martin Hammarström,
Camilla Persson, and
B. Martin Hallberg1
From the
Structural Genomics Consortium, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, SE-17177 Stockholm, Sweden
Received for publication, August 30, 2006
, and in revised form, September 14, 2006.
 |
ABSTRACT
|
|---|
Phosphatases are a diverse group of enzymes that regulate numerous cellular processes. Much of what is known relates to the tyrosine, threonine, and serine phosphatases, whereas the histidine phosphatases have not been studied as much. The structure of phosphohistidine phosphatase (PHPT1), the first identified eukaryotic-protein histidine phosphatase, has been determined to a resolution of 1.9Å using multiple-wavelength anomalous dispersion methods. This enzyme can dephosphorylate a variety of proteins (e.g. ATP-citrate lyase and the
-subunit of G proteins). A putative active site has been identified by its electrostatic character, ion binding, and conserved protein residues. Histidine 53 is proposed to play a major role in histidine dephosphorylation based on these observations and previous mutational studies. Models of peptide binding are discussed to suggest possible mechanisms for substrate recognition.
 |
INTRODUCTION
|
|---|
Reversible phosphorylation of residues is crucial in a variety of signaling pathways. Most of our understanding regarding these signaling events in eukaryotes comes from tyrosine, serine/threonine kinases, and phosphatases (1). Less well characterized is histidine phosphorylation-dependent signaling in eukaryotes. A little more than thirty years ago, Histone H4, the first vertebrate protein with a phosphorylated histidine residue, was identified (2). Since then there has been a measured increase in knowledge of mammalian histidine kinases (3). Unfortunately, very little information regarding eukaryotic histidine phosphatases has been available during this same period. This nescience is interesting because histidine phosphorylation is quite prevalent in the cell and likely accounts for
6% of all phosphorylation in eukaryotes (4). Thus far, only one protein (the
-subunit of heterotrimeric G proteins) in vertebrates has been identified as undergoing reversible histidine phosphorylation where both the kinase (NDPK B) and phosphatase (PHPT1)2 are known (for a recent review, see Ref. 5). However, more information regarding histidine phosphatases is slowly beginning to emerge. To date, the only other structure of a histidine phosphatase is Escherichia coli SixA (6). Under certain anaerobic respiratory conditions, SixA is involved in down-regulation of the E. coli ArcB-to-ArcA phosphorelay system. SixA shows structural homology to the well studied family of arginine-histidine-glycine (RHG) phosphatases (6) but no sequence homology to PHPT1.
Mammalian phosphohistidine phosphatase (PHPT1) was first identified and characterized as a 14-kDa protein in 2002 (7, 8). The enzyme can dephosphorylate the phosphohistidine-containing peptide succinyl-Ala-His(P)-Pro-Phe-p-nitroanilide, E. coli cheA, rabbit ATP-citrase lyase, and the rat
-subunit of G proteins (710). PHPT1 has been suggested to be highly involved in neuronal function. Unlike most phosphatases it does not require divalent cations for activity. Individual point mutations of conserved histidine and arginine residues determined that Arg45, His53, and His102 may play a role in the reaction mechanism as a result of eliminated or reduced phosphatase activity when mutated to alanines (11). PHPT1 is expressed in a variety of vertebrates but not in fungi or bacteria. The PHPT1 DNA sequence shows similarity to testes-specific proteins in Drosophila, ocnus, janus-a, and janus-b, according to the Conserved Domain Data Base (12). These proteins may be phosphatases, but little information is available on their function. In this work, we report the first crystal structure of a eukaryotic phosphohistidine phosphatase: PHPT1. The structure enables us to define a substrate-binding pocket and to model possible phosphatase-substrate docking modes. We thereby provide the structural basis for further biochemical, biophysical, and genetic studies in the rapidly developing field of eukaryotic phosphohistidine signaling.
 |
EXPERIMENTAL PROCEDURES
|
|---|
Cloning, Protein Expression, and PurificationResidues 5125 of the human PHPT1 gene (gi: 19353099) were cloned by ligation-independent cloning into a pET-28 based expression vector incorporating a TEV-cleavable N-terminal His-tag fusion (pNIC-Bsa4). After transformation and liquid culture growth using standard methods, recombinant expression of PHPT1 was induced at 291 K by addition of 0.5 mM isopropyl
-D-thiogalactopyranoside to either Terrific broth for expression of native protein or minimal medium for selenomethionine incorporation according to the methionine pathway inhibition method (13). After harvest PHTP1 was purified using IMAC on a 1 ml HiTrap chelating HP column followed by gel filtration on a Superdex 75 column (columns from GE Healthcare, Uppsala, Sweden).
Crystallization, Data Collection, and Structure SolutionInitial crystal screening with selenomethionine-derivatized protein using commercial screens gave hits in several salt conditions. After optimization, large rhombohedral crystals could be obtained by mixing protein solution (15 mg/ml) with an equal volume of reservoir (2.0 M ammonium sulfate, 0.1 M Bistris, pH 5.5). Crystals were swept into a reservoir with 15% butanediol and then dunked in liquid nitrogen. The data were indexed in space group R3 with unit cell parameters a = b = 228 Å, c = 29.4 Å. Multiple-wavelength anomalous dispersion data (Table 1) were collected around the selenium K-edge at Beamline 14.1 (BESSY II, Berlin, Germany). XDS/XSCALE (14) was used to process the data and put the three data sets on an approximately similar scale. XPREP (Bruker AXS) were used to prepare FA-values for substructure solution in SHELXD (15) that found 10 of the possible 12 sites. Identified sites were used for phase calculation in SHARP (16) followed by density modification in PIRATE3 The resulting map could be partially autotraced using ARP/wARP (17). Unfortunately, only one of the four protomers in the asymmetric unit was well ordered and numerous attempts with different refinement protocols failed to give acceptable residuals. Since twinning is common in space group R3, twinned refinement was attempted with different operators but did not improve residuals or the quality of difference maps.
A search for alternative crystal forms was initialized with native protein (23 mg/ml). Both protein with and without intact His-tag was used in this screening process. A new crystal form was obtained in 2.0 M sodium formate, 0.1 M Bistris propane, pH 7.0. After optimization, the best diffracting crystals were grown in 1.9 M sodium formate, 0.1 M Bistris propane, pH 6.5, at room temperature in 2 days by the hanging drop vapor diffusion method. Diffraction data were collected to 1.9 Å (
= 1.033 Å) on a PHPT1 crystal at Beamline ID29 (European Synchrotron Radiation Facility, Grenoble, France). The data were indexed in space group R3 with unit cell parameters a = b = 112.5 Å, c = 29.4 Å. In the new crystal form, there is only one PHPT1 molecule in the asymmetric unit. Diffraction data were processed using XDS/XSCALE (14). The large cell model was used as the starting model to obtain a molecular replacement solution using MOLREP (18). The resulting model was improved through several rounds of model building and refinement in Coot (19) and REFMAC5 (18). Progress of the refinement was monitored by the free R factor for 5% of the data (20). The final model consisted of 100 amino acid residues, 17 formate ions, and 59 waters. The N-terminal His-tag and linker (17 residues), residues 3138, and the last three C-terminal amino acids are not visible in the structure. The final residuals for this model were r = 17.1% and Rfree = 22.5%.
Structural Similarity SearchesDALI (21) was used to determine the closest structural neighbors of PHPT1. The top five structures based on statistical significance (Z-score cutoff of 3.3) were aligned and compared with PHPT1.
Modeling of LigandsThe peptides succinyl-Ala-His(P)-Pro-Phe-p-nitroanilide, Met-Gly-His(P)-Ala-Gly-Ala-Ile, and Tyr-Ser-His(P)-Asp-Asn-Ile-Ile-Cys-Gly were modeled into the putative active site using the Molegro virtual docker (22). The sequences with phosphorylated histidines MGHAGAI and YSHDNIICG were chosen based on the conserved sequence among succinyl-CoA synthetases and its structure (23) and a loop between two
-strands in the
-subunit of the G protein transducin structure (24), respectively. In the case of the
-subunit of the G protein transducin peptide, the histidine was modeled as phosphorylated on the N
2 position. Typical docking runs consisted of docking a single ligand with the PHPT1 protein with no solvent molecules. The search algorithm typically used default values for 510 runs. In some cases the population size was increased from 50 to 100. Multiple poses were returned for each run and those with phosphohistidines near His53 were considered potential ligands.
Protein Structure Accession NumbersThe atomic coordinates and structure factors have been deposited in the Research Collaboratory for Structural Bioinformatics under Protein Data Bank code 2HW4.
 |
RESULTS AND DISCUSSION
|
|---|
Structural ModelPHPT1 is a 125-amino acid human cytostolic protein that folds into a compact elbow-shaped molecule of a mixed
/
fold with novel topology (Fig. 1, A and B). The molecule is defined by six
-strands flanked by two
-helices. A single helix and three strands lie approximately perpendicular to each other near the base of the molecule. There are two central anti-parallel
-strands (
2 and
4) that extend the full length of the protein. On either side of these strands are two sets of parallel
-strands (
1,
3 and
5,
6). The core of the protein is defined by this
-sheet and the N-terminal histidine tag extends from it. Electron density for the tag was partly not interpretable so it was not modeled, but it is likely required for crystallization because of the numerous crystal contacts that is formed with this extended strand. On the surface there is a positively charged pocket that consists of conserved residues (Fig. 2, A and B). This area is a potential site for phosphatase activity because of its positive surface charge. The cavity volume is
30 Å3 and found at the base of the molecule (Fig. 1C). Two individual amino acid substitutions to alanine resulted in the loss of phosphatase activity (11). These mutations, His53
Ala and His102
Ala, map to this region of the enzyme. Additionally, many identical and conserved residues of orthologous proteins define this basic patch and surrounding atoms (Fig. 1D). Formate and sulfate ions are found in this pocket in both the small and large cell structures, respectively (Fig. 2, A and B). Both ions bind in similar locations near His53 in the positive patch but form a binding network distinct from one another. This arrangement suggests there may be multiple ways to bind substrates within the active site based on their size and charge. In total, 17 formate ions solvate PHPT1. Some of these ions are bound to surfaces distinct from the active site.
A DALI (21) search revealed that the closest structural neighbors of PHPT1 in the Protein Data Bank are of no apparent relation. Polyphosphate kinase (Z score = 4.3, r.m.s.d. 3.1 Å over 74 C
atoms), PurS (Z score = 3.4, r.m.s.d. 2.4 Å over 50 atoms), and plasma gelsolin (Z score = 3.4, r.m.s.d. 2.8 Å over 63 atoms) were the most similar. Weak structural homology to SixA (6) is found in the central
-sheet with 23 C
atoms superimposable within a distance of 3.8 Å.
Active SiteThe residues that define the active site are Lys21, Glu51, Tyr52, His53, Arg78, Tyr93, and Met95 (Fig. 2C). Almost all of these residues are identical in orthologous proteins among the animal kingdom. Therefore, these residues likely contribute important structural features to maintain the phosphatase activity of the enzyme. The active site residue His53 is located at the beginning of helix
1; near this residue, two formate ions and two water molecules form an intricate hydrogen bonding network. A total of five formate molecules are found in a pocket near His53. The pocket is defined by two loops; Glu51 and Tyr52 on a loop joining helix
1 and
-strand
4 define one side of the pocket while Tyr93 and Met95 on a loop between
5 and
2 define the other side. Furthermore, Arg78 is found in the middle of the
4 strand and may be involved in coordinating the phosphate group. The Arg78
Ala mutant resulted in a 30% decrease in activity (11), suggesting it does play a role in phosphatase substrate recognition or catalysis. There are three conserved glycines immediately upstream of Arg78; these glycines offer Arg78 a great deal of conformational flexibility, presumably needed in substrate recognition and catalysis. Furthermore, in a scenario where substrate interacts with Arg78, the nearby glycine-rich region may flex so that a tightening of the pocket around the peptide substrate occurs. Close to Arg78, Lys21 is located in the middle of the
2 strand and is involved in hydrogen bonding to a formate ion. In the pocket, only Arg78 and Lys21 are found on
-strands. These residues may act as anchors and assist in coordination of the phosphohistidine to the active site. It is not known whether the N
1 or N
2 of the imidazole ring of His53 is involved in dephosphorylation and it is unfortunately not clear from the structure in what orientation the imidazole ring should be positioned. From a conformational standpoint, as the side chain of His53 projects radially from its turning backbone with the imidazole pointing downward the bottom of the active site pocket, a substrate N
1 interaction is most likely. Interestingly, His53 is calculated to have a pKa of 3.13 using PROPKA (25). Therefore, it is likely that His53 is not protonated at the N
2 position in the crystallization solution and in the physiological context.
Substrate BindingATP-citrate lyase (ACL) is a substrate for PHPT1 and it has been suggested that PHPT1 dephosphorylates phospho-His760 (9). A BLAST (26) search revealed that a conserved domain (SucD, succinyl-CoA
-subunit) is found from residues 649777 of human ACL. The histidine maps to this domain and sequence analysis show that this residue and adjacent residues (GHAGA) are highly conserved among higher eukaryotes (data not shown). The structure of this subunit contains a phosphorylated histidine at residue His259 (23). This amino acid corresponds to His760 in CLUSTALW (27) sequence alignments of the succinyl-CoA
-subunit with ACL. This residue is found on an extended loop of 22 residues between two
-helices near the surface of the protein. Considering the length and makeup of the loop, it is entirely possible that it is flexible and could bind in the pocket of PHPT1 (Fig. 2D).
The
-subunit of G proteins is phosphorylated on histidine residue 266 (2831) and is dephosphorylated by PHPT1 (10). The structure of the
-subunit (24) shows this residue in a surface loop of eight amino acids connecting two
-strands. This small loop is modeled in the pocket of PHPT1 (Fig. 2E). In these two cases it appears that some rearrangement of these loops would be required for the substrate to bind in the pocket. It is unclear if there are additional sites of binding separate from the active site. The binding pocket of PHPT1 can accommodate a variety of ligands and studies are in place to assess possible inhibitors and binding partners that should allow for the mechanism of action to be discerned.
The structure of PHPT1 is the first glimpse at a eukaryotic histidine phosphatase. The protein is of a novel topology, and the structure reveals a positively charged active site defined by conserved residues that provide a suitable environment for binding phosphohistidine-containing substrates. His53 is proposed to be involved with the phosphatase activity of this enzyme. Furthermore, Arg78 and Lys21 may act as anchors to provide a stable scaffold for substrate and phosphohistidine interactions as well as provide a possibility for charge stabilization of transition states in the catalytic reaction. Future challenges include investigation of co-crystal structures with target peptides to understand in detail the structural basis for substrate recognition and catalysis.
 |
FOOTNOTES
|
|---|
* The Structural Genomics Consortium is a registered charity (number 1097737) funded by the Karolinska Institutet, The Swedish Governmental Agency for Innovation Systems (VINNOVA), the Knut and Alice Wallenberg Foundation, the Swedish Foundation for Strategic Research, the Wellcome Trust, GlaxoSmithKline, Genome Canada, the Canadian Institutes of Health Research, the Ontario Innovation Trust, and the Ontario Research and Development Challenge Fund and the Canadian Foundation for Innovation. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
The atomic coordinates and structure factors (code 2HW4) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/). 
1 To whom correspondence should be addressed: Structural Genomics Consortium, Dept. of Medical Biochemistry and Biophysics, Karolinska Institutet, 17177 Stockholm, Sweden. Fax: 46-852486868; E-mail: Martin.Hallberg{at}ki.se.
2 The abbreviations used are: PHPT, phosphohistidine phosphatase; Bistris, 2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol; r.m.s.d., root mean square deviation; ACL, ATP-citrate lyase. 
3 K. Cowtan, unpublished results. 
 |
ACKNOWLEDGMENTS
|
|---|
We acknowledge support during data collection from Martin Fieber-Erdmann and Laurent Terradot at Berliner Elektronenspeicherring-Gesellschaft für Synchrotronstrahlung (BESSY) and the European Synchrotron Radiation Facility, respectively.
 |
REFERENCES
|
|---|
- Pawson, T., and Scott, J. D. (2005) Trends Biochem. Sci. 30, 286290[CrossRef][Medline]
[Order article via Infotrieve]
- Chen, C. C., Smith, D. L., Bruegger, B. B., Halpern, R. M., and Smith, R. A. (1974) Biochemistry 13, 37853789[CrossRef][Medline]
[Order article via Infotrieve]
- Besant, P. G., and Attwood, P. V. (2005) Biochim. Biophys. Acta 1754, 281290[Medline]
[Order article via Infotrieve]
- Matthews, H. R. (1995) Pharmacol. Ther. 67, 323350[CrossRef][Medline]
[Order article via Infotrieve]
- Klumpp, S., and Krieglstein, J. (2005) Biochim. Biophys. Acta 1754, 291295[Medline]
[Order article via Infotrieve]
- Hamada, K., Kato, M., Shimizu, T., Ihara, K., Mizuno, T., and Hakoshima, T. (2005) Genes Cells 10, 111[Abstract/Free Full Text]
- Klumpp, S., Hermesmeier, J., Selke, D., Baumeister, R., Kellner, R., and Krieglstein, J. (2002) J. Cereb. Blood Flow Metab. 22, 14201424[CrossRef][Medline]
[Order article via Infotrieve]
- Ek, P., Pettersson, G., Ek, B., Gong, F., Li, J. P., and Zetterqvist, O. (2002) Eur. J. Biochem. 269, 50165023[Medline]
[Order article via Infotrieve]
- Klumpp, S., Bechmann, G., Maurer, A., Selke, D., and Krieglstein, J. (2003) Biochem. Biophys. Res. Commun. 306, 110115[CrossRef][Medline]
[Order article via Infotrieve]
- Maurer, A., Wieland, T., Meissl, F., Niroomand, F., Mehringer, R., Krieglstein, J., and Klumpp, S. (2005) Biochem. Biophys. Res. Commun. 334, 11151120[CrossRef][Medline]
[Order article via Infotrieve]
- Ma, R., Kanders, E., Sundh, U. B., Geng, M., Ek, P., Zetterqvist, O., and Li, J. P. (2005) Biochem. Biophys. Res. Commun. 337, 887891[CrossRef][Medline]
[Order article via Infotrieve]
- Marchler-Bauer, A., Anderson, J. B., Cherukuri, P. F., DeWeese-Scott, C., Geer, L. Y., Gwadz, M., He, S., Hurwitz, D. I., Jackson, J. D., Ke, Z., Lanczycki, C. J., Liebert, C. A., Liu, C., Lu, F., Marchler, G. H., Mullokandov, M., Shoemaker, B. A., Simonyan, V., Song, J. S., Thiessen, P. A., Yamashita, R. A., Yin, J. J., Zhang, D., and Bryant, S. H. (2005) Nucleic Acids Res. 33, D192D196[Abstract/Free Full Text]
- Van Duyne, G. D., Standaert, R. F., Karplus, P. A., Schreiber, S. L., and Clardy, J. (1993) J. Mol. Biol. 229, 105124[CrossRef][Medline]
[Order article via Infotrieve]
- Kabsch, W. (1993) J. Appl. Crystallogr. 26, 795800[CrossRef]
- Schneider, T. R., and Sheldrick, G. M. (2002) Acta Crystallogr. Sect. D Biol. Crystallogr. 58, 17721779[CrossRef][Medline]
[Order article via Infotrieve]
- de La Fortelle, E., Bricogne, G., and Carter, C. W., Jr. (1997) Methods Enzymol. 276, 472494
- Perrakis, A., Harkiolaki, M., Wilson, K. S., and Lamzin, V. S. (2001) Acta Crystallogr. Sect. D Biol. Crystallogr. 57, 14451450[CrossRef][Medline]
[Order article via Infotrieve]
- Collaborative Computational Project, No. 4 (1994) Acta Crystallogr. Sect. D Biol. Crystallogr. 50, 760763[CrossRef][Medline]
[Order article via Infotrieve]
- Emsley, P., and Cowtan, K. (2004) Acta Crystallogr. D60, 21262132
- Brunger, A. T. (1992) Nature 355, 472475[CrossRef]
- Holm, L., and Sander, C. (1993) J. Mol. Biol. 233, 123138[CrossRef][Medline]
[Order article via Infotrieve]
- Thomsen, R., and Christensen, M. H. (2006) J. Med. Chem. 49, 33153321[CrossRef][Medline]
[Order article via Infotrieve]
- Fraser, M. E., James, M. N., Bridger, W. A., and Wolodko, W. T. (2000) J. Mol. Biol. 299, 13251339[CrossRef][Medline]
[Order article via Infotrieve]
- Sondek, J., Bohm, A., Lambright, D. G., Hamm, H. E., and Sigler, P. B. (1996) Nature 379, 369374[CrossRef][Medline]
[Order article via Infotrieve]
- Li, H., Robertson, A. D., and Jensen, J. H. (2005) Proteins 61, 704721[CrossRef][Medline]
[Order article via Infotrieve]
- Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990) J. Mol. Biol. 215, 403410[CrossRef][Medline]
[Order article via Infotrieve]
- Thompson, J. D., Higgins, D. G., and Gibson, T. J. (1994) Nucleic Acids Res. 22, 46734680[Abstract/Free Full Text]
- Kowluru, A., Seavey, S. E., Rhodes, C. J., and Metz, S. A. (1996) Biochem. J. 313, 97107
- Wieland, T., Nurnberg, B., Ulibarri, I., Kaldenberg-Stasch, S., Schultz, G., and Jakobs, K. H. (1993) J. Biol. Chem. 268, 1811118118[Abstract/Free Full Text]
- Cuello, F., Schulze, R. A., Heemeyer, F., Meyer, H. E., Lutz, S., Jakobs, K. H., Niroomand, F., and Wieland, T. (2003) J. Biol. Chem. 278, 72207226[Abstract/Free Full Text]
- Hippe, H. J., Lutz, S., Cuello, F., Knorr, K., Vogt, A., Jakobs, K. H., Wieland, T., and Niroomand, F. (2003) J. Biol. Chem. 278, 72277233[Abstract/Free Full Text]
- Baker, N. A., Sept, D., Joseph, S., Holst, M. J., and McCammon, J. A. (2001) Proc. Natl. Acad. Sci. U. S. A. 98, 1003710041[Abstract/Free Full Text]
- Delano, W. L. (2002) The PyMOL Molecular Graphics System, DeLano Scientific, San Carlos, CA

CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
Copyright © 2006 by the American Society for Biochemistry and Molecular Biology.