Discovery of PPi-type Phosphoenolpyruvate Carboxykinase Genes in Eukaryotes and Bacteria *

Yoko Chiba (千葉洋子), Ryoma Kamikawa (神川龍馬), Kumiko Nakada-Tsukui (津久井久美子), Yumiko Saito-Nakano (中野由美子), and Tomoyoshi Nozaki (野崎智義) From the Faculty of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8572, Japan, the Department of Parasitology, National Institute of Infectious Diseases, 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan, and the Graduate School of Environmental Studies, Graduate School of Human and Environmental Studies, Kyoto University, Yoshida Nihonmatsu cho, Kyoto, Kyoto 606-8501, Japan

Phosphoenolpyruvate carboxykinase (PEPCK) is one of the pivotal enzymes that regulates the carbon flow of the central metabolism by fixing CO 2 to phosphoenolpyruvate (PEP) to produce oxaloacetate or vice versa. Whereas ATP-and GTPtype PEPCKs have been well studied, and their protein identities are established, inorganic pyrophosphate (PP i )-type PEPCK (PP i -PEPCK) is poorly characterized. Despite extensive enzymological studies, its protein identity and encoding gene remain unknown. In this study, PP i -PEPCK has been identified for the first time from a eukaryotic human parasite, Entamoeba histolytica, by conventional purification and mass spectrometric identification of the native enzyme, followed by demonstration of its enzymatic activity. A homolog of the amebic PP i -PEPCK from an anaerobic bacterium Propionibacterium freudenreichii subsp. shermanii also exhibited PP i -PEPCK activity. The primary structure of PP i -PEPCK has no similarity to the functional homologs ATP/GTP-PEPCKs and PEP carboxylase, strongly suggesting that PP i -PEPCK arose independently from the other functional homologues and very likely has unique catalytic sites. PP i -PEPCK homologs were found in a variety of bacteria and some eukaryotes but not in archaea. The molecular identification of this long forgotten enzyme shows us the diversity and functional redundancy of enzymes involved in the central metabolism and can help us to understand the central metabolism more deeply.
Inorganic pyrophosphate (PP i ) is composed of two molecules of phosphate (P i ) linked by a phosphoanhydride bond. PP i has been proposed as an evolutionary precursor of ATP and GTP (1) because this structurally simple compound can be formed spontaneously, and the hydrolysis of PP i produces high energy (2,3). If ancestral organisms could utilize only PP i or polyphosphates as an energy and/or phosphate donor for enzymatic reactions, there should have been an event in which new enzymes utilizing ATP/GTP arose. This further raises the question of how organisms change the major substrate: by accumulation of mutation on the enzyme to change the substrate specificity from PP i to nucleotide triphosphate or the substitution of PP i -utilizing enzymes by an independently emerging functional homolog that utilizes nucleotide triphosphates. To gain insight into the evolutionary transition from PP i to nucleotide triphosphate, the evolutionary relationship between the extant PP iand ATP/GTP-utilizing enzymes has been examined (4 -11).
In glycolytic/gluconeogenic pathways and the closely related reactions in current organisms, three enzymatic reactions that can utilize PP i as the substrate have been reported. The first one is a PP i -dependent phosphofructokinase (PFK) 3 (EC 2.7.1.90) reaction. PP i -PFK catalyzes a reversible reaction, whereas ATPdependent PFK (EC 2.7.1.11) catalyzes an irreversible reaction.
PP i ϩ D-fructose 6-phosphate N P i ϩ D-fructose 1,6-bisphosphate ATP ϩD-fructose 6-phosphate f ADP ϩ D-fructose 1,6-bisphosphate REACTIONS 1 AND 2 PP i -PFKs from an eukaryotic human parasite, Entamoeba histolytica (12), and an anaerobic bacterium, Propionibacterium freudenreichii subsp. shermanii (13), were proposed to work for fructose 1,6-bisphosphatase synthesis like ATP-PFKs. In higher plants, it has been shown that PP i -PFK works in the opposite direction, at least during internode developmental stages (14). Therefore, PP i -PFK can catalyze both PP i -utilizing and PP i -producing reactions not only in vivo but also in vitro. ATP-and PP i -PFK share a common origin, but the evolutionary history is highly complex. Change of the phosphate donors in PFKs occurred more than once, as suggested by phylogenetic analyses of ATP-and PP i -PFKs (6,9,10,15). The complex evolution of substrate utility in PFK suggests that, at least in PFK, the transition of PP i -utilizing and nucleotide triphosphate-utilizing ability can have occurred relatively easily. Indeed, mutation of just a few amino acids can change the substrate preference from PP i to ATP, based on the ratio of k cat /K m (8).
PEP-pyruvate conversion is also catalyzed by PP i -independent enzymes, PEP synthase (EC 2.7.9.2; Reaction 4) and pyruvate kinase (EC 2.7.1.40; Reaction 5). REACTIONS 4 AND 5 PPDK and PEP synthase share conserved domains, whereas overall amino acid sequence similarity is not high (16). On the other hand, neither PPDK nor PEP synthase shows substantial similarity to pyruvate kinase at the primary structure level. This case is apparently distinguishable from that of PFK because transition of substrate utility for the PEP-pyruvate conversion has not occurred by simple amino acid substitutions, as seen in PFK.

REACTIONS 6 -9
ATP-and GTP-PEPCKs and PEP carboxylase have been well characterized. ATP-PEPCKs are mainly present in bacteria, yeast, and plants, whereas GTP-PEPCKs are mostly present in higher eukaryotes, archaea, and some bacteria (21). PEP carboxylase exists in various bacteria and in limited archaea and eukaryotes. Although there is no discernible similarity in the overall structure of these proteins, the residues implicated for the binding to metal, oxaloacetate/PEP, and nucleotides are conserved between ATP-and GTP-PEPCKs (22,23), whereas conservation of these residues has not been reported in the case of PEP carboxylase. In contrast, the information on PP i -PEPCK, which is also called PEP carboxytransphosphorylase, is limited. Although PP i -PEPCK was previously purified from P. freudenreichii subsp. shermanii (19, 24 -26) and E. histolytica (27) and extensively characterized enzymologically, the gene encoding PP i -PEPCK has remained unknown since the first description in 1961 (24). Both the distribution of PP i -PEPCK among the three domains of life and the evolutionary relationship between other PEP-oxaloacetate interconverting enzymes were totally obscure.
In the present study, we identified the protein for PP i -PEPCK activity and its encoding gene from E. histolytica and P. freudenreichii. The identification of PP i -PEPCK finally enabled us to compare the amino acid sequences of all three types of PEPCKs as well as PEP carboxylase. It also allowed us to estimate the distribution of PP i -PEPCK in the tree of life. Last, we described the complex evolutionary history of proteins utilizing different substrates for the same catalytic reaction.

Experimental Procedures
Microorganisms and Cultivation-Trophozoites of E. histolytica clonal strain HM-1:IMSS Cl6 were maintained axenically in Diamond's BI-S-33 medium at 35.5°C as described previously (28). Cells were grown to the late logarithmic phase (2-3 days after inoculation), harvested by the addition of ice-cold PBS buffer to culture flasks, after discarding the medium, and followed by centrifugation at 300 ϫ g for 5 min at 4°C. For protein purification, the harvested cell pellets were stored at Ϫ80°C until use.
Enzyme Assays-PP i and oxaloacetate-producing direct PEPCK activity was assayed by measuring the oxidation of NADH by detecting the decrease of absorbance at 340 nm as described previously (27). The reaction mixture contained 10 mM potassium phosphate buffer (pH 6.5), 0.4 mM PEP, 0.1 mM CoCl 2 , 20 mM KHCO 3 , 10 mM MgCl 2 , 2.0 units/ml malate dehydrogenase (from rabbit muscle; Sigma), 0.25 mM NADH, and an enzyme solution. Reaction was started by adding PEP, and the NADH oxidizing activity without the addition of PEP was subtracted as the base line. To obtain the absolute active value, 400 l of the reaction mixture was added in 1.0-cm path length cuvettes, and absorbance was monitored by a spectrometer. To obtain the relative activity during the purification, 100 l of the mixture was put in a 96-well plate and incubated at 37°C, and 340 nm was monitored by a microplate reader. One unit of activity was defined as the amount of enzyme oxidizing 1 mol of NADH/min.
Purification of Native E. histolytica PEPCK-EhPEPCK was purified from 5 g of wet cells as follows. The cells were suspended in 15 ml of 20 mM Tris-HCl (pH 8.0) containing 0.1 mM CoCl 2 (buffer A) and 0.5 mg/ml E-64 and disrupted by freezing and thawing, and cell debris was then removed by centrifugation at 100,000 ϫ g for 1 h. After the supernatant was diluted to 30 ml with buffer A, ammonium sulfate was added to give 30% saturation. The samples were then applied to a Butyl-Toyopearl column (22 ϫ 15 cm; Tosoh, Tokyo, Japan) equilibrated with buffer A supplemented with ammonium sulfate at 30% saturation. All chromatography steps were performed using an Ä KTA purifier system (GE Healthcare) at room temperature. Proteins were eluted with a gradient of ammonium sulfate from 30 to 0% at a flow rate of 4 ml/min. The active fractions were desalted using a PD10 column (GE Healthcare) and then applied to a MonoQ HR 5/5 column (bed volume, 1 ml; GE Healthcare) equilibrated with buffer A. Proteins were eluted with a gradient of NaCl from 0 to 500 mM for 20 column volumes at a flow rate of 1.0 ml/min. The active fractions were pooled and loaded onto a Superdex 200 (10/300) column (GE Healthcare). Proteins were eluted with buffer A supplemented with 150 mM NaCl.
Identification of EhPEPCK by LC-MS/MS Analysis-ϳ130 kDa bands were excised from silver-stained gel and subjected to LC-MS/MS analysis at the W. M. Keck Biomedical Mass Spectrometry Laboratory (University of Virginia, Charlottesville, VA). The analysis was performed in almost the same manner as before (29); the only difference was that 5 l of the extract was injected to a reversed-phase capillary column, and peptides were eluted over 0.3 h.
To estimate the relative ratio of EhPEPCK1, -2, and -3 in the analyzed sample, the peak area for the selected ion chromatogram of the monoisotopic mass of the most abundant charge state (Ϯ0.02 Da) for each peptide was quantified. Three peptides conserved in all three proteins, specific to EhPEPCK1, and conserved only in EhPEPCK2 and -3, respectively, and one peptide unique to EhPEPCK2 or -3, respectively, were selected for the analysis because these peptides were detected in all three samples: immunoprecipitated samples (see below) using EhPEPCK1-HA, EhPEPCK2-HA, and purified PP i -PEPCK from the wild-type amoeba lysate. The sum of the peak area value of the three peptides conserved in all was used for normalization.
Overexpression and Purification of Recombinant PP i -PEPCKs-The plasmids for bacterial expression were introduced into E. coli BL21 CodonPlus (DE3)-RIL (EhPEPCK1 and -2; Agilent) and E. coli BL21 Star (DE3) (PfPEPCK; Life Technologies, Inc.). The hosts transformed with the expression plasmids were inoculated into 400 ml or Luria-Bertani medium in a 1-liter conical flask containing 50 g/ml ampicillin and 34 g/ml chloramphenicol if necessary. After cultivating the cells aerobically at 37°C until the A 600 reached ϳ0.5, protein expressions were induced by cooling the culture on ice for 30 min and adding 0.1 mM isopropyl thio-␤-D-galactopyranoside to the medium, followed by overnight cultivation at 15°C. Harvested cells (ϳ8 g of wet cells from 1.2 liters of culture) were disrupted by adding BugBuster (ϳ5 ml/g of wet cells; Merck), and cell debris was removed by centrifugation. Imidazole and NaCl concentrations in the supernatant were adjusted to 10 and 300 mM, respectively. The supernatant was then applied to an open column packed with Ni-NTA-agarose (1.5-ml bed volume; Qiagen). After washing the column with 16 bed volumes of 50 mM Tris-HCl, 300 mM NaCl, and 20 mM imidazole-HCl (pH 8.0), the His-tagged protein was eluted with 4 bed volumes of the same buffer containing 250 mM imidazole-HCl. The eluted proteins were further purified using a MonoQ column as described above.
Antibodies-To make anti-EhPEPCK antisera, full-length EhPEPCK1 with a His tag at the N terminus was expressed in E. coli, purified using Ni-NTA and MonoQ columns as described above, and used as the antigen. The anti-EhPEPCK1 antisera from rabbit were commercially raised by Operon Biotechnology (Tokyo, Japan). The specificity of these antibodies was confirmed using E. histolytica lysates by Western blotting (WB) analysis (data not shown).
Amoeba Transformation-The plasmids generated as described above were introduced into E. histolytica trophozoites by lipofection as described previously (31) with minor modifications. Approximately 40 l of the transfection medium containing 3-5 g of one of the plasmids was mixed with 10 l of PLUS TM reagent (Life Technologies) and kept at room tem-perature for 15 min. This mixture was combined with 20 l of Lipofectamine TM transfection reagent mixed with 30 l of transfection medium, kept at room temperature for 15 min, diluted with 400 l of transfection medium, and added to the seeded trophozoites attached to a 12-well plate after removing the medium. The plate was then incubated at 35.5°C for 5 h, and cells were transferred to a tube containing 5.5 ml of BI-S-33 medium. About 24 h after transfection, BI-S-33 medium was changed into fresh medium with 1 g/ml G418 and gradually increased for ϳ2 weeks until the G418 concentration reached 10 g/ml.
Protein Assay-Protein concentrations were measured using a Bio-Rad protein assay DC dye. Bovine ␥-globulin was used as a standard.
WB-Protein samples were subjected to SDS-PAGE after boiling at 95°C for 3 min with SDS-PAGE loading buffer. Proteins in the gel were transferred to nitrocellulose membrane and then blocked with 5% (w/v) skim milk in Tris-buffered saline with Tween 20 (TBST) overnight at 4°C. Proteins on the membrane were reacted with antibodies in TBST for 1 h at room temperature. Primary antibodies were used at a 1:5000 dilution for anti-EhPEPCK rabbit antibody and at a 1:1000 dilution for anti-cysteine synthase 1 (32) and anti-Cpn60 (33) rabbit antibodies and anti-HA mouse monoclonal antibody (clone 11MO, Covance (Princeton, NJ)). The blots were visualized using alkaline phosphatase-conjugated anti-rabbit or mouse IgG antibody (1:2000 dilution; Cell Signaling Technology, Danvers, MA) with AP Color Reagent (Bio-Rad) according to the manufacturer's protocol.
Immunoprecipitation-To examine the interaction between EhPEPCKs, 0.08 g of wet cells expressing PEPCK1-HA, PEPCK2-HA, or only HA were lysed with 1 ml of lysis buffer (50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.1 mM CoCl 2 , 0.2% Triton X-100, 0.5 mg/ml E-64) for 10 min on ice. After centrifugation at 20,000 ϫ g for 10 min, the supernatants were precleaned with 150 l of Protein G-Sepharose (GE Healthcare) for 1 h. After removing the beads by centrifugation, the precleared supernatant samples were mixed and incubated with 50 l of anti-HA monoclonal antibody-conjugated agarose (Sigma-Aldrich) for 4 h. The agarose beads were washed with 400 l of lysis buffer without E-64 five times (9300 ϫ g for 1 min each) to remove the unbound proteins, and the bound proteins were eluted by incubating with 200 l of lysis buffer containing 0.2 mg/ml HA peptide (Sigma-Aldrich) overnight. The eluted samples were collected by centrifugation at 800 ϫ g for 3 min. All of the centrifugations and incubations were done at 4°C otherwise mentioned.
Phylogenic Analysis-Homologues of PP i -PEPCK were searched by BLASTP analysis against the non-redundant protein sequence database using EhPEPCK1 as a query (May 5, 2015). In addition, we also retrieved the homologues of Mastigamoeba balamuthi and Spironucleus barkhanus by tBLASTN analysis against the whole genome shotgun reads and the expressed sequence tag database, respectively. Other eukaryotic counterparts were retrieved by tBLASTN analysis against the Marine Microbial Eukaryote Transcriptome Sequencing Project database (34). No more than two sequences with 97-100% amino acid sequence similarity were included, except for Entamoeba spp. and Giardia intestinalis (for details, see "Discussion"). Sequences were aligned by MAFFT (35), and ambiguously aligned sites were excluded, resulting in a data set comprising 158 taxa 749 positions. The data set was subjected to the maximum likelihood method with the LG ϩ ⌫ ϩ F model. The maximum likelihood tree was heuristically searched from 10 distinct maximum parsimony trees. In maximum likelihood bootstrap analyses (100 replicates), a heuristic tree search was performed from a single maximum parsimony tree per replicate. The maximum likelihood phylogenetic analyses described above were conducted by RAxML version 7.2.8 (36).

Identification of PP i -PEPCK from E. histolytica-An in silico
search of potential PEPCK homologs in the E. histolytica genome (AmoebaDB) using canonical ATP-and GTP-PEPCK from Dictyostelium discoideum (XP_645490.1 and XP_ 645396.1, respectively) failed to detect any candidate of PEPCK (data not shown), although PP i -PEPCK activity was detected from E. histolytica trophozoite lysates (0.165 units/mg of protein) as previously reported by Reeves (27). PP i -PEPCK activity was sequentially separated using hydrophobic (Butyl-Toyopearl) and anion exchange (MonoQ) columns (Table 1), and a MonoQ fraction containing peak PP i -PEPCK activity gave one major protein band at ϳ130 kDa on SDS-PAGE analysis (Fig.  1A). PP i -PEPCK activity detected in this MonoQ fraction indicated 206-fold purification with a specific activity of 34.0 units/ mg, which agrees well with the previous results of PP i -PEPCK from P. freudenreichii that showed 12.8 units/mg of activity after 99-fold purification (25). The activity level was also similar to that of ATP-PEPCK from E. coli (24 units/mg) (37). Phosphate, PEP, and MDH dependence of the NADH oxidizing activity in the MonoQ fraction was also confirmed. NADH oxidizing activity was not detected when phosphate was substituted to ADP, indicating that the fraction did not contain ATP-PEPCK activity.
The fraction containing PEPCK activity was further subjected to size exclusion chromatography using Superdex 200. The PP i -PEPCK activity was detected in the fractions corresponding to two major peaks around ϳ143 and ϳ440 kDa (Fig.  2). The integrated areas of the absorbance at 280 nm corresponding to the 440-and 143-kDa proteins (the first and second peaks, respectively) and their enzymatic activity levels are comparable, suggesting that active PP i -PEPCK is present as both monomer and trimer in E. histolytica ( Fig. 2A). PP i -PEPCK from P. freudenreichii has been previously reported to exist as a monomer, dimer, and tetramer with activity (26). Formation of several quaternary structures may be a characteristic of PP i -PEPCK. Because both of the fractions seemed to contain an identical ϳ130-kDa protein band on SDS-PAGE, the protein band from the fractions corresponding to the first peak (ϳ440 kDa) was excised and subjected to TOF MS/MS analysis. Peptides detected from the band correspond to a "cluster of hypothetical proteins," which consist of three homologous proteins: EHI_166920 (XP_654765.1), EHI_030750 (XP_650862.1), and EHI_198620 (XP_655201.1) ( Table 2). The calculated molecular mass of these proteins (131 kDa) agreed well with that expected from the SDS-PAGE result, and the detected peptides covered 63-67% of the three proteins. Hereafter these proteins are referred to as EhPEPCK1, -2, and -3, respectively. EhPEPCK1 and EhPEPCK2 or -3 showed 91% amino acid identity, whereas EhPEPCK2 and -3 showed 98% identity. Amino acid variations between EhPEPCK1 and EhPEPCK2/3 were mainly found in the first 60 residues at the N-terminal region, showing only 45% identity between EhPEPCK1 and EhPEPCK2/3. Neither of these proteins show any similarity to known proteins or domains as predicted by Pfam, indicating that the purified proteins were novel proteins.
Confirmation of PP i -PEPCK Activity of the Amebic Enzymes Expressed in E. coli-EhPEPCK1, EhPEPCK2, and PfPEPCK were expressed as soluble recombinant proteins with the histidine tag at the N terminus in E. coli. A Coomassie Brilliant Blue-stained SDS-polyacrylamide gel of the proteins purified using Ni-NTA and MonoQ chromatography showed that the full-length proteins of the expected size were purified to homogeneity (Fig. 1B). PP i -PEPCK activity of the purified EhPEPCK1, EhPEPCK2, and PfPEPCK was 8.9, 3.9, and 23.0 units/mg, respectively, confirming that the identified genes indeed encode proteins with PP i -PEPCK activity.
Expression of EhPEPCK1 and -2 with HA Tag in E. histolytica-Amebic transformants expressing either EhPEPCK1 or EhPEPCK2 with the HA tag at the carboxyl terminus (EhPEPCK1-HA or EhPEPCK2-HA) using the episomal plasmid were established. Expression of the epitope-   tagged proteins were confirmed by WB using anti-HA antibody (Fig. 3A). The molecular mass estimated from the band corresponding to EhPEPCK1-HA was slightly larger than that of EhPEPCK2-HA. We hypothesize that this subtle difference in the mobility between EhPEPCK1 and -2 is attributable to the difference in the isoelectric point between the two proteins. The isoelectric point values of EhPEPCK1 and -2 are calculated as 6.15 and 6.06, respectively. At this stage, we do not know whether the slight difference in the isoelectric point and the molecular size between EhPEPCK1 and -2 is physiologically important.
Heteromeric Configuration of EhPEPCKs-To investigate whether EhPEPCKs form homo-or heterotrimer, EhPEPCK1 and -2 were immunoprecipitated using lysates from transformants expressing corresponding EhPEPCK with the HA tag using anti-HA antibody. Both of the immunoprecipitated samples contained peptides from all of the three EhPEPCKs (Table 3). When proteins were immunoprecipitated using EhPEPCK1-HA, EhPEPCK2/3-specific peptides were also detected, whereas the relative ratio decreased about 1.7-fold compared with that of trimetric EhPEPCK purified from the wild-type E. histolytica. Similarly, the immunoprecipitated sample using EhPEPCK2-HA also contained peptides unique to EhPEPCK1, whereas the relative ratio decreased 2.0-fold. These data suggest that some proportion of the trimer consists of a mixture of EhPEPCK1, -2, and -3. The detailed composition of the trimeric complex should be analyzed in the future.
Detection of PP i -PEPCKs by WB Using Anti-EhPEPCK Antiserum-The anti-EhPEPCK antiserum reacted with recombinant PEPCK1 and -2 with comparable efficiency (Fig.  3B). Major bands of the predicted molecular mass (ϳ130 kDa) were observed in the lysate of E. histolytica (Fig. 3C), confirming the specificity of this anti-EhPEPCK antiserum. We estimated that EhPEPCKs account for ϳ1% of the total protein of the amebic lysate by comparing the band intensity of serially diluted recombinant PEPCKs and amebic lysates in WB analysis (Fig. 3B). This estimation also agrees well with the finding that about 200-fold purification of E. histolytica lysate yields highly purified PP i -PEPCK (Table 1 and Fig. 1A).
Localization of PP i -PEPCKs in E. histolytica Trophozoites-E. histolytica trophozoites were mechanically disrupted in sucrose-MOPS buffer and centrifuged to separate the cytosol and organelle/membrane fractions. The cytosolic marker, cys-  teine synthase 1 and the organelle marker, Cpn60, were demonstrated in the corresponding fractions (Fig. 4). When these fractions were reacted with anti-EhPEPCK antiserum, clear bands were detected from the supernatant fraction. Therefore, major parts of both EhPEPCK1 and EhPEPCK2/3 localize in the cytosol (Fig. 4), whereas two isoforms of GTP-PEPCK exist in the cytosol and mitochondria in the case of animals (38). In silico analysis to find organelle targeting signals (TargetP version 1.1 and SignalP 4.1) or transmembrane regions (TMHMM Server version 2.0) did not detect any targeting signals and thus was consistent with the result from the wet experiment described above. Distribution of PP i -PEPCK in the Tree of Life-Homologs of EhPEPCKs were found in limited lineages of eukaryotes and bacteria but not in archaea (Tables 4 and 5). PP i -PEPCK genes were found in all of the three genomes of Entamoeba species that were currently present in GenBank TM , although the copy numbers are different among the three. The PP i -PEPCK gene was also found in the genome data of a free living amoebozoan, M. balamuthi, whereas the other amoebozoans Acanthamoeba and Dictyostelium lacked the gene. In addition to the two amoebozoan genera bearing PP i -PEPCK, PP i -PEPCK homologs were also present in some eukaryotes, such as excavates, dinoflagel-lates, diatoms, and cryptophytes, which are distantly related to each other (39). In bacteria, a limited number of species in Actinobacteria, including P. freudenreichii used in this study, Verrucomicrobia, Bacteroidetes, Planctomycetes, Proteobacteria, Firmicutes, and Spirochates, possessed PP i -PEPCK homologs ( Table 5); many species were found to lack PP i -PEPCK even if many of them were closely related to PP i -PEPCK-bearing species.

Discussion
In the present study, we successfully identified the gene encoding PP i -PEPCK, which is one of the key enzymes in the central metabolism and connects sugar and organic acids metabolisms, from E. histolytica and P. freudenreichii. We also found that the homologs of EhPEPCK were distributed in lim- Trophozoites of mock, EhPEPCK1-HA-expressing, or EhPEPCK2-HAexpressing transformants were mechanically disrupted and centrifuged to separate into cytosolic and organelle fractions. Those fractions were reacted with anti-EhPEPCK, cysteine synthase 1 (CS1, a cytosolic marker), or Cpn60 (a mitosomal matrix protein, used as an organellar marker) antibodies. Whereas ϳ5 g of lysates were used for anti-PEPCK antibody, 20 g of lysates were reacted with anti-CS1 and anti-Cpn60 antibodies.

Identification of PP i -type Phosphoenolpyruvate Carboxykinase
ited but diverse lineages of unicellular eukaryotes and bacteria but not in archaea.
In E. histolytica, which lacks ATP/GTP-PEPCK and PEP carboxylase, PP i -PEPCK has been believed to work in the PP i -and oxaloacetate-producing direction (27,40). This is because a supply of PP i is required to replenish PP i consumed by the PP iutilizing enzymes in glycolysis, PP i -PFK and PPDK (4,12,41). However, it is uncertain whether PP i produced by PP i -PEPCK is truly essential because PP i is supplied as a by-product of many reactions in vivo (3,42). Alternatively, it would be possible that PP i -PEPCK works for the PP i -utilizing direction when there is enough supply of PP i because of the following reason. E. histolytica is able to consume asparagine and aspartate from media (43), and these amino acids can be converted to oxaloacetate by aminotransferase (Fig. 5) (44). Although the oxaloacetate catabolic pathway has not been experimentally confirmed in this organism, which lacks the tricarboxylic acid cycle, the amoeba may be capable of producing ATP from these amino acids by using the energy of PP i if oxaloacetate is further converted to PEP by PP i -PEPCK and then pyruvate by PPDK. Future experiments to investigate which direction is critical (or whether both directions are important) for viability in E. histolytica would give us deeper insight into the central metabolic pathway in this human parasite.
E. histolytica has three isoforms of PP i -PEPCK, which are strongly suggested to form heterotrimers among the isoforms. This may be worthy of mention because the two other species of Entamoeba have two isoforms, and the four strains of the other eukaryotic human gut parasite Giardia intestinalis, for example, possess the single copy of this gene in the genomes (Fig. 6). Thus, it is still a key question whether each isoform of PP i -PEPCK from E. histolytica has different roles in the heterotrimer and whether their roles are, if different, also shared among Entamoeba species. These issues should be examined in the future to consider the roles of PP i -PEPCK in the human parasite E. histolytica.
Most of the organisms possessing PP i -PEPCK homologs have canonical ATP/GTP-PEPCK genes, and some of them have PEP carboxylase as well (Table 4), although E. histolytica possesses the three cytosolic PP i -PEPCKs but lacks the others. How these functional homologs are utilized/regulated in each organism is an open question. They might work in different directions from each other, or they might work in different environmental conditions. Especially, in some eukaryotes, these functional homologs might be utilized in different cellular compartments. For instance, a potential PP i -PEPCK from the unicellular alga Thalassiosira pseudonana (XP_002285950.1) is predicted to be cytosolic according to the in silico predictions (data not shown) as well as those of E. histolytica, whereas its ATP-PEPCK was confirmed to exist in mitochondria, and two types of PEP carboxylase localize in mitochondria and the chloroplast, respectively (45). Although the localization of ATP-PEPCK and PEP carboxylase indicates that PEP-oxaloacetate conversion occurs only in organelles of the diatom, our findings suggest that this reaction important for the central carbon metabolism can also occur in the cytosol of T. pseudonana. Our findings in this study could further open a new era for deeper understanding the central metabolism of various organisms.
Identification of PP i -PEPCK also allows us to compare the amino acid sequences between PP i -PEPCK and the functional homologs, ATP/GTP-PEPCK and PEP carboxylase. PP i -PEPCK bears none of the known catalytic domains, such as oxaloacetate binding sites, kinase 1a/P-loop, kinase-2 region, and nucleotide binding sites conserved in ATP-and GTP-PEPCK (22). In addition, no similarity in primary structure was observed between EhPEPCK and the functional homologs. Therefore, it is unlikely that PP i -PEPCK shares a common origin with ATP-and GTP-PEPCK/PEP carboxylase. This is similar to the case between pyruvate kinase and PPDK/PEP synthase but different from the cases of ATP-PFK/PP i -PFK and PPDK/PEP synthase that have most probably emerged from single origins. Our findings further illuminate the complexity of evolution among the proteins, which are functionally redundant but utilize different phosphate donors (e.g. PP i and nucleotide triphosphates).
The origins and evolutionary histories of PP i -PEPCKs were one of the most interesting aspects for us. However, it was very difficult to reconstruct a rooted tree of PP i -PEPCK because we could not find any proteins with sister relationships to PP i -PEPCK. In addition, whether the current distribution of PP i -PEPCK in the tree of life has been shaped by lateral gene transfer or by vertical inheritance followed by independent gene loss basically remains unclear because of the low resolution of the PP i -PEPCK tree (Fig. 6) with one exception for PP i -PEPCK in Dehalobacter sp.; the Dehalobacter homologue was nested in the clade of Actinobacteria, although Dehalobacter belongs to Firmicutes, strongly suggesting actinobacterium-to-Dehalobacter lateral gene transfer.
In summary, PP i -PEPCK was identified at the molecular level for the first time since the activity was first demonstrated more than 50 years ago (24). It was also demonstrated that PP i -PEPCK was distributed in phylogenetically diverse but limited unicellular eukaryotes and bacteria, which would remodel the central metabolic pathways in various organisms. PP i -PEPCK has no clear evolutionary relationship with ATP-or GTP-PEPCK, in sharp contrast to the case of PP i -and ATP-PFK and that of PPDK and PEP synthase, which have evolved from single origins. These data suggest that, in order to substitute the FIGURE 5. Glycolytic pathway and oxaloacetate-malate node of E. histolytica. Arrows, reaction steps and their directions conducted by the enzymes in E. histolytica (46,47 FIGURE 6. Phylogenic tree of PP i -PEPCKs and their homologs. Sequences were aligned by MAFFT, and the data set was subjected to maximum likelihood analysis with RAxML version 7.2.8. Only bootstrap supports equal to or more than 85 are shown on each node. White letters in black boxes and black letters indicate eukaryotic and bacterial phyla, respectively. energy and phosphate donors from PP i to nucleotide triphosphates, proteins have employed various strategies.
Author Contributions-Y. C. and T. N. designed this study. Y. C. performed biochemical analysis. K. N.-T. and Y. S.-N. designed molecular experiments. R. K. and Y. C. performed the phylogenetic analysis. Y. C., R. K., and T. N. wrote the manuscript. All of the authors reviewed the results and approved the final version of the manuscript.