If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USADepartment of Chemistry and Biochemistry, University of North Carolina–Wilmington, Wilmington, North Carolina, USA
Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USADepartment of Biology, Williams College, Williamstown, Massachusetts, USA
Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USADepartment of Molecular Genetics and Cell Biology, University of Chicago, Chicago, Illinois, USA
Aldehyde dehydrogenases are versatile enzymes that serve a range of biochemical functions. Although traditionally considered metabolic housekeeping enzymes because of their ability to detoxify reactive aldehydes, like those generated from lipid peroxidation damage, the contributions of these enzymes to other biological processes are widespread. For example, the plant pathogen Pseudomonas syringae strain PtoDC3000 uses an indole-3-acetaldehyde dehydrogenase to synthesize the phytohormone indole-3-acetic acid to elude host responses. Here we investigate the biochemical function of AldC from PtoDC3000. Analysis of the substrate profile of AldC suggests that this enzyme functions as a long-chain aliphatic aldehyde dehydrogenase. The 2.5 Å resolution X-ray crystal of the AldC C291A mutant in a dead-end complex with octanal and NAD+ reveals an apolar binding site primed for aliphatic aldehyde substrate recognition. Functional characterization of site-directed mutants targeting the substrate- and NAD(H)-binding sites identifies key residues in the active site for ligand interactions, including those in the “aromatic box” that define the aldehyde-binding site. Overall, this study provides molecular insight for understanding the evolution of the prokaryotic aldehyde dehydrogenase superfamily and their diversity of function.
In all organisms, the diversification of large superfamilies of enzymes provides a foundation for the evolution of biochemical capacity and the ability to metabolize varied small molecules (
). Typically, across a superfamily, the core chemical mechanism is retained with substrate profiles becoming either specialized or promiscuous, depending on evolution and the metabolic needs of the organism (
). Classic examples of enzyme superfamilies found across all kingdoms include the cytochromes P450, alcohol dehydrogenases, aldo-keto reductases, and aldehyde dehydrogenases (
). For example, aldehyde dehydrogenases are NAD(P)(H)-dependent enzymes that metabolize a wide range of aldehydes to their corresponding carboxylic acids in prokaryotes and eukaryotes and tend to be encoded by multiple genes in the genome of a given species (
Aldehyde dehydrogenases are generally associated with the detoxification of aldehydes, which are highly reactive compounds generated through cellular metabolism (
). For example, these enzymes can scavenge aldehydes, such as malondialdehyde resulting from lipid peroxidation, and convert them to a less chemically reactive carboxylic acid (
). Aldehyde dehydrogenases and their biochemical functions are also linked to a wide variety of biochemical processes ranging from ethanol metabolism via oxidation of acetaldehyde into acetate (
The Arabidopsis thaliana REDUCED EPIDERMAL FLUORESCENCE1 gene encodes an aldehyde dehydrogenase involved in ferulic acid and sinapic acid biosynthesis.
Identification of genes involved in cell wall biogenesis in grasses by differential gene expression profiling of elongating and non-elongating maize internodes.
Over-expression of different aldehyde dehydrogenase genes in Arabidopsis thaliana confers tolerance to abiotic stress and protects plants against lipid peroxidation and oxidative stress.
Arabidopsis and tobacco plants ectopically expressing the soybean antiquitin-like ALDH7 gene display enhanced tolerance to drought, salinity, and oxidative stress.
). Recent work has also linked aldehyde dehydrogenase activity to the synthesis of indole-3-acetic acid, the primary plant hormone auxin, in the plant pathogenic microbe Pseudomonas syringae strain PtoDC3000 (
To suppress host defenses and promote diseases development, Pseudomonas syringae produces a variety of virulence factors, including phytohormones or chemical mimics of hormones, to manipulate hormone signaling in its host plants (
). P. syringae and many other plant-associated microbial pathogens can synthesize the major auxin indole-3-acetic acid (IAA), whose production is implicated in pathogen virulence (
). Previously, a mutation in Azospirilum brasilense (aldA) was identified that decreased IAA production and was linked to a gene and annotated as encoding an aldehyde dehydrogenase (
). Bioinformatic analysis of PtoDC3000 identified a set of aldehyde dehydrogenases (AldA–C) sharing 30–40% amino acid identity with each other. Subsequent metabolic, biochemical, and in planta analyses of AldA (UniProt: PSPTO_0092), AldB (UniProt: PSPTO_2673), and AldC (UniProt: PSPTO_3644) demonstrated that AldA functions as an indole-3-acetaldehyde dehydrogenase, is essential for IAA synthesis, and contributes to virulence of PtoDC3000 (
); however, its potential biochemical role in PtoDC3000 was not fully examined.
Here we investigate the substrate profile of AldC from PtoDC3000, which suggests that this enzyme functions as a long-chain aliphatic aldehyde dehydrogenase. The 2.5 Å resolution X-ray crystal of the AldC C291A mutant in complex with octanal and NAD+ and biochemical analysis of a set of site-directed mutants provide insight on substrate recognition in this enzyme. Comparison of the three-dimensional structures of AldA and AldC from PtoDC3000 reveals the sequence and structural changes that lead to distinct substrate profiles of these aldehyde dehydrogenases.
Results and discussion
Biochemical analysis of AldC as a long-chain aliphatic aldehyde dehydrogenase
Previously, three aldehyde dehydrogenases (AldA–C) from the plant pathogen P. syringae strain PtoDC3000 were identified and examined for their contribution to the synthesis of IAA and virulence (
). Steady-state kinetic parameters for AldA using indole-3-acetaldehyde showed this enzyme to be 130- and 710-fold more efficient than AldB and AldC, respectively (
Bioinformatic analysis predicted that AldC contains the PF00171 domain, a signature domain of the aldehyde dehydrogenase superfamily, ranging from Ser33 to Ile482 (Fig. 2A). Specifically, the 4 catalytic residues (Asn159, Glu257, Gly288, and Cys291) and 19 residues in the NAD+-binding site (Ile155–Asn159, Lys182, Gly219, Ile233–Ser236, Ala239, Leu242, Glu257, Leu258, Gly259, Cys291, Glu391, and Phe393) define the aldehyde dehydrogenase consensus sequence motifs of AldC. The NCBI Conserved Protein Domain Family server places AldC in the cd01738/ALDH_CddD_SSP0762 family, 1 of 42 aldehyde dehydrogenase families found in the Pseudomonas (
Figure 2Domain architecture and phylogeny of AldC from P. syringae strain PtoDC3000.A, the UniProt aldehyde dehydrogenase (ALDH) domain (green), NAD(P)(H)-binding site (blue triangles), and catalytic residues (red triangles) of PtoDC3000 AldC (WP_011104646.1) were predicted using InterPro79.0 (
). B, the PtoDC3000 AldC sequence was used as a BLAST query to identify ortholog group members of aldC (group ID POG018413) from the Pseudomonas Orthologous Groups classification system in the Pseudomonas Genome Database (RRID:SCR_006590) (Table S1). Amino acid sequences from 93 Pseudomonas species strains were multialigned using Clustal Omega (
). A clade consisting of aldehyde dehydrogenases from plant pathogenic Pseudomonas species and strains is highlighted (green) with AldC from PtoDC3000 indicated.
The Pseudomonas orthologous groups classification system in the Pseudomonas Genome Database (RRID:SCR_006590) found orthologs of AldC (group ID POG018413) from 93 Pseudomonas species and strains, including P. aeruginosa, P. putida, P. fluorescens, and P. savastanoi (Fig. 2B and Table S1). AldC belongs to a clade consisting of aldehyde dehydrogenases, which share ∼90% sequence identity, from various plant pathogenic Pseudomonas spp.: P. syringae, P. viridiflava, P. savastanoi, and P. amygdali in the P. syringae phylogenetic group. None of AldC-related enzymes from Pseudomonas had previously been experimentally characterized, and their substrates and functions had yet to be described.
To determine the substrate preference of AldC, a panel of 23 molecules (Table S2), including short- to long-chain aliphatic aldehydes (acetaldehyde, propionaldehyde, butyraldehyde, valeraldehyde (pentanal), isovaleraldehyde, 3-methylcrotonaldehyde, hexanal, trans-2-hexen-1-al, heptanal, octanal, nonanal, trans-2-nonenal), betaine aldehyde, and aromatic aldehydes (benzaldehyde, m-anisaldehyde, p-anisaldehyde, phenylacetaldehyde, cinnamaldehyde, hydrocinnamaldehyde, coniferylaldehyde, sinapaldehyde, 4-pyridinecarboxaldehyde, and indole-3-acetaldehyde) was used to screen for enzymatic activity. Spectrophotometric assays of AldC identified aliphatic aldehydes of 5–9-carbon length, as well as hydrocinnamaldehyde and 4-pyridinecarboxyaldehyde, as substrates (Fig. 3A) with octanal having the highest specific activity (Fig. 3B). These data suggest that short 2–4-carbon aldehydes, branched aliphatic aldehydes, and larger aromatic aldehydes are poor substrates for AldC. To evaluate the nicotinamide cofactor preference of AldC, the activity of the enzyme was tested with octanal and either NAD+ or NADP+, which showed a distinct preference for NAD+ (Fig. 3C).
Figure 3Screening of AldC with aldehyde substrates.A, screening of aldehyde substrates. Assays were performed as described under “Experimental procedures.” Enzymatic activity was measured spectrophotometrically (A340 nm) in assays with a fixed concentration of NAD+ (2 mm) and each aldehyde (5 mm). Spectrophotometric absorbance changes versus time are shown for AldC-catalyzed conversion of hydrocinnamaldehyde, octanal, nonanal, 4-pyridinecarboxyaldehyde, hexanal, 2-hexanal, valeraldehyde, isovaleraldehyde, and phenylacetylaldehyde. B, comparison of AldC specific activity for substrate aldehydes. Average values ± S.D. (n = 3) are shown. C, AldC nicotinamde cofactor preference. Spectrophometric data for assays of AldC using octanal (5 mm) and either NAD+ and NADP+ (each at 2 mm) is shown. D, summary of aldehyde substrates and potential reactions catalyzed by AldC.
Steady-state kinetic analysis of AldC with valeraldehyde, hexanal, heptanal, octanal, and nonanal indicates that the 8-carbon substrate (i.e. octanal) is the preferred aliphatic aldehyde substrate (Table 1). Although the kcat values of AldC vary less than 3-fold between these substrates, the Km value for octanal is the lowest (1.2 mm) with that of valeraldehyde as the highest (48.8 mm). The overall effect is a marked difference in catalytic efficiency (kcat/Km) between octanal and the other aliphatic substrates ranging from use of nonanal exhibiting a 10-fold reduction in kcat/Km to ∼80-fold less efficient use of the 5-carbon substrate valeraldehyde. Generally, the aromatic aldehyde substrates hydrocinnamaldehyde and indole-3-acetaldehyde were poorly used by AldC with catalytic efficiencies comparable with valeraldehyde and 4-pyridinecarboxaldehyde used with a kcat/Km value 6-fold lower than octanal (Table 1). Overall, the biochemical analysis of AldC suggests that this enzyme functions primarily as a long-chain aliphatic aldehyde dehydrogenase (Fig. 3D). Moreover, conservation of related homologs in 93 different Pseudomonas species and strains (Fig. 2B) indicates a likely conserved function across these organisms.
Table 1Steady-state kinetic parameters of AldC with aldehyde substrates
Three-dimensional structure of AldC C291A mutant in complex with octanal and NAD+
To understand how AldC recognizes aliphatic aldehyde substrates, crystals of the AldC C291A mutant were grown in the presence of octanal and NAD+. The analogous cysteine to Cys291 of AldC in aldehyde dehydrogenases, including AldA from P. syringae, is the essential catalytic residue (
). The AldC C291A point mutant was generated by PCR mutagenesis, and the resulting protein was expressed, purified, and crystallized with the goal of obtaining a dead-end complex of the enzyme with octanal and NAD+ bound in the active site.
The 2.5 Å resolution X-ray crystal structure of the AldC(C291A)·octanal·NAD+ complex was solved by molecular replacement using the three-dimensional structure of P. syringae AldA as a search model (
) (Table 2). Each of the two unique protein chains in the asymmetric unit form a corresponding physiological dimer through crystallographic symmetry (Fig. 4A). The secondary structure features and domains of the AldC monomer are similar to those of other aldehyde dehydrogenase family members (Fig. 4, B and C). The N-terminal Rossmann-fold domain contains a central β-sheet (β9-β8-β7-β10-β11) surrounded by α-helices to form the NAD(H)-binding site (Fig. 4, B and C, blue). The C-terminal region consists of a mixed α/β domain, which includes the catalytic cysteine residue and forms the aldehyde-binding site (Fig. 4, B and C, red). An interdomain linker region (Fig. 4, B and C, green) connects the N- and C-terminal domains of AldC. A small three-stranded β-sheet domain (Fig. 4, B and C, gold) facilitates oligomerization.
Table 2Summary of crystallographic statistics for AldC structures
Figure 4Overall structure of the AldC C291A mutant structure.A, the dimeric structure of AldC dimer is shown as a ribbon diagram. Secondary structure features of monomer A are colored, as follows: α-helices and β-strands are shown as gold cylinders and blue arrows, respectively. The ribbon diagram of monomer B is colored gray with the N and C termini labeled. Ligands are shown as space-filling models. B, domain organization of the AldC monomer. The color scheme for the AldC monomer shows the NAD(H)-binding domains in blue, the catalytic domain in red, the interdomain linker in green, and the oligomerization domain in gold. The positions of octanal and NAD+ (space-filling models) are indicated. C, topology of AldC showing the four domains described in B with the same coloring and secondary structure features numbered.
The overall fold of AldC shares structural similarity with multiple aldehyde dehydrogenases, including P. syringae AldA, from a variety of microbes and eukaryotes (Fig. S1 and Table S3). Overlays of the AldC structure with twelve aldehyde, retinal, betaine aldehyde, and indole-3-acetaldehyde (i.e. AldA) dehydrogenases (
Structural, biochemical, and computational studies reveal the mechanism of selective aldehyde dehydrogenase 1A1 inhibition by cytotoxic duocarmycin analogues.
), which are related by 38–46% amino acid identity, show the conservation of the three-dimensional fold with 0.7–1.2 Å RMSD for 400–450 Cα-atoms in these enzymes from widely varied organisms (Fig. S1). Amino acid sequence comparisons of these proteins highlight how the catalytic cysteine and residues of the NAD(H)-binding site are highly conserved with major variations in the substrate-binding site leading to functional differences in these proteins (Fig. S2).
Although the specific determinants of oligomerization in aldehyde dehydrogenases can be highly variable (
). The P. syringae AldA tetramer is formed from a dimer of dimers with an extensive interaction surface, shown in Fig. 5 (A and B) with the oligomerization region highlighted in the close-ups. Fig. 5 (C and D) shows the same region of the AldC dimer for comparison. Multiple amino acid differences between AldA and AldC in the oligomerization domain alter both the electrostatic surface charge, which is strongly basic in AldA, and the surface topology, especially in the positioning of β5, β6, and the β5-β6 loop of the oligomerization domain of the two proteins. In addition, the length of the β5-β6 loop differs between AldA (Ala143–Val153) and AldC (Val138–Thr142), which contributes to alterations in the interaction region (Fig. 5, B and D, and Fig. S2, orange box). In AldC, these changes appear to alter what would be the corresponding tetramer (i.e. dimer–dimer) interface of AldA but may only be one contributing factor to different oligomerization.
Figure 5Oligomerization interface differences in AldA (tetramer) and AldC (dimer) from P. syringae DC3000.A, electrostatic surface charge of AldA dimer. Electrostatic surface charge was generated using the APBS plugin in PyMOL (Schrödinger) with red (acidic residues) and blue (basic residues). The box corresponds to the oligomerization interface for forming a homotetramer. B, ribbon diagram of monomer A and surface view of monomer B of AldA. The box corresponds to the oligomerization interface for forming a homotetramer. The α-helix and β-sheets for oligomerization are colored blue and gold, respectively. C, electrostatic surface charge of AldC dimer. Coloring is as in A. D, ribbon diagram of monomer A and surface view of monomer B of AldC. Coloring is as in B.
Unambiguous electron densities for NAD+ and octanal define how these ligands bind to the AldC C291A mutant and indicate the location of the substrate and cofactor-binding sites (Fig. 6A). Forming a dead-end complex, NAD+ and octanal occupy two separate pockets, which are at the opposite ends of a ∼45-Å-long tunnel (Fig. 6B). The Rossmann fold of the NAD(H)-binding domain provides extensive polar and apolar interactions that position the nicotinamide ring of NAD+ in proximity to the C291A point mutation, which would be the invariant catalytic cysteine in WT AldC (Fig. 6C). The nicotinamide ring is mainly held in place by van der Waals contacts with Leu258, Leu419, and Phe456 and a critical hydrogen bond from the backbone carbonyl of Leu258 to the NH2 group of the cofactor carboxamide. The nicotinamide-ribose of NAD+ establishes multiple polar interactions with the backbone carbonyl of Gly235 and the carboxylate side chain of Glu391. In addition, the adenine ring of NAD+ is mainly stabilized by multiple van der Waals interactions with Pro216, Ile233, Leu242, and Val243, along with a hydrogen bond with a water molecule in an apolar pocket. As with nicotinamide-ribose binding, polar interactions between the adenine-ribose ring and the side-chains of Lys182 and Glu185 contribute to NAD+ binding. Interaction of Glu185 with the 2'-hydroxyl-group of the adenine-ribose is the key structural determinant of the cofactor specificity, as observed in other aldehyde dehydrogenases (
). AldC is not able to accommodate the 2′-phosphate of NADP(H) sterically and electrostatically, which is consistent with its preference for NAD(H) as cofactor (
). Sequence comparison of AldC and structurally related aldehyde dehydrogenases (Fig. S2) underscores the conservation of the NAD(H)-binding site across these enzymes from distantly related organisms.
Figure 6Active site of AldC.A, electron density for octanal (left panels) and NAD+ (right panels) in the AldC(C291A)·octanal·NAD+ structure is shown as a 2Fo − Fc omit map (1.6 σ). B, surface view of the AldC active site. Octanal (substrate) and NAD+ (cofactor) are shown as stick models. The active site tunnel is shown as a surface view from the exterior into the catalytic site and the same view rotated 90°. Hydrophobicity was calculated based on the Eisenberg hydrophobicity scale in PyMOL. The darkest red indicates the strongest hydrophobicity, with white indicating the most polar. C, amino acid residues forming interactions with octanal and NAD+ in the AldC active site. Side chains of residues interacting with octanal (green) and NAD+ (blue) are shown as stick renderings. Two water molecules interacting with the cofactor are shown as red spheres. Hydrogen bond interactions between the amino acid residues and ligands are shown as yellow dotted lines. The C291A mutation is labeled in red.
In contrast to the NAD(H)-binding site, apolar interactions dominate the octanal binding in the hydrophobic substrate-binding pocket (Fig. 6, B and C). A cluster of aromatic residues (Trp160, Tyr163, Trp450, Phe456, and Tyr468) and two apolar residues (Met114 and Leu118) provide the hydrophobic environment that accommodates octanal and other aliphatic aldehydes. As described for other aldehyde dehydrogenases (
), the substrate-binding site forms an “aromatic box” for adaptable apolar ligand interaction.
In the AldC(C291A)·octanal·NAD+ complex, the 8-carbon chain of octanal extends toward the solvent exposed, but highly apolar, opening of the aldehyde-binding site. This places the reactive aldehyde group, which hydrogen bonds with Asn159 and the backbone nitrogen of Ala291, in proximity to the C4 position of the nicotinamide ring that undergoes hydride transfer in the chemical reaction and the location of the catalytic cysteine. Multiple sequence alignment of AldC and other aldehyde dehydrogenases (Fig. S2) shows that the residues forming the aldehyde-binding site are highly variable, which account for substrate preferences in these enzymes.
Molecular docking of the valeraldehyde (pentanal), hexanal, heptanal, nonanal, hydrocinnamaldehyde, and 4-pyridinecarboxaldehyde (Fig. S3) provides insight on likely binding modes for each substrate that place the reactive aldehyde near the catalytic center of AldC and the aliphatic or aromatic portions of each into the apolar tunnel leading toward the solvent opening. Although the size of the site would allow short-chain aliphatic substrates, such as acetaldehyde, propionaldehyde, and butyraldehyde, to bind, the region of the substrate-binding site where these molecules would fit is more polar than the largely hydrophobic region of the tunnel accessible to longer-chained aliphatic substrates (i.e. heptanal, octanal, and nonenal), which is consistent with the biochemical assays of AldC. This suggests that substrate preference is governed by van der Waals contacts that maximize nonpolar surface–ligand interactions.
Site-directed mutagenesis of AldC active site residues
To examine the contribution of active site residues, a series of site-directed mutants targeting residues in the NAD(H)-binding site (i.e. Trp158, Lys182, Glu185, Thr234, Ser236, Glu257, and Glu391) and the octanal-binding site (i.e. Met114, Leu118, Trp160, Tyr163, Gln164, Arg285, Ser292, Leu419, Trp450, Phe456, and Tyr468) were generated. All of the 31 AldC point mutants (M114A, L118A, W158A, W158H, N159A, W160A, W160Y, Y163A, Q164A, Q164N, K182A, K182Q, E185A, E185Q, T234A, T234V, S236A, S236T, E257A, E257Q, R285A, S290A, S292A, E391A, E391D, L419A, L419V, W450A, F456A, F456W, and Y468A) were expressed in Escherichia coli and purified using nickel-affinity and size-exclusion chromatographies. The solubility and purity of each mutant was comparable with WT protein. Under standard assay conditions, enzymatic activities of WT AldC, the AldC C291A mutant (as described above), and the additional 31 point mutants were tested with fixed concentrations of octanal (5 mm) and NAD+ (2 mm) using up to 100 µg of each protein (Fig. 7). The initial enzyme activity screening assays showed that mutation of the catalytic cysteine (C291A) and certain residues in the NAD(H)-binding site (K182A, K182Q, T234V, E257A, and E391A), and octanal-binding site (N159A and L419A) resulted in enzymes with less than 1% of WT specific activity. For example, no activity was detected with the N159A, K182A, and C291A mutants.
Figure 7Comparison of WT and mutant AldC specific activities. Mutants proteins are grouped and colored by active site location, as follows: blue, NAD(H)-binding site; and green, octanal-binding site. Standard assay conditions were used as described under “Experimental procedures.” Average values ± S.D. (n = 3) are plotted relative to WT specific activity (1.02 ± 0.08 µmol min−1 mg protein−1).
Kinetic analysis of mutants with changes in the NAD(H)-binding site of AldC reveals critical residues for biochemical function. As noted above, low activity of the K182A, K182Q, T234V, E257A, and E391A mutants did not allow for accurate determination of their kinetic parameters (Fig. 7); however, the other mutant proteins with changes in the NAD(H) site were analyzed (Table 3). Mutation of Lys182 (K182A and K182Q) removes interaction of the lysine side-chain with a hydroxyl group of the adenine ribose of NAD+. Similarly, alanine substitutions of Glu257 and Glu391 severely disrupted AldC activity (Fig. 7), whereas the E257Q and E391D mutations, which introduced subtler alterations, displayed comparable or less than a 2-fold change in catalytic efficiency (i.e. kcat/Km) with NAD+ (Table 3). Loss of the Glu257 side-chain would remove an interaction that helps position the amide group of the nicotinamide and changes at this residue could alter the orientation of the nicotinamide group for hydride transfer (Fig. 6). Similar effects have been observed in other NAD(P)(H)-dependent enzymes (
). Mutation of Glu391 to alanine eliminates interaction of the carboxylate group to the nicotinamide ribose group, which can be partially complemented by an aspartate, as seen in the E391D mutant (Table 3). The effects of mutating the third glutamate in the NAD(H)-binding site (i.e. Glu185), which interacts with the adenine ribose group of NAD+, are comparable with changes of the other two acidic residues, but not as severe. The E185A mutant displayed a 21-fold decrease in kcat/Km with NAD+, and the E185Q mutant is similar to that of WT AldC (Table 3).
Table 3Steady-state kinetic analysis of mutant AldC. The protein and location of the mutation are indicated. Enzyme assays were performed as described under “Experimental procedures.” Average values ± S.D. (n = 3) are shown.
Substitutions of Trp158, Thr234, and Ser236 in the NAD(H)-binding site also reduced activity. The loss of steric bulk with the W158A mutant, which may change how the pyrophosphate backbone of NAD+ binds in the site, resulted in a 10-fold decrease in kcat/Km with NAD+, but the W158H mutant modestly reduced efficiency 3-fold (Table 3). Mutation of Thr234, which is positioned with its methyl group to provide surface contacts to the nicotinamide ring and its hydroxyl group oriented toward the amine of Lys168 (Fig. 6), led to a slight 5-fold reduction of kcat/Km with NAD+, but nearly inactive protein with a valine mutation (T234V). It is possible that local conformational changes impact the positioning of the nicotinamide ring to impact activity of this mutant protein. The kinetic parameters of S236A and S236T mutations with NAD+ were consistent with the loss of a hydrogen bond or changes in the side-chain position for interaction with the pyrophosphate bridge of NAD+ (Table 3).
Within the octanal-binding site (Fig. 6), mutations of Asn159, Trp160, Ser292, Leu419, and Phe456 were the most disruptive with other changes having more modest effects (Table 3). These residues are located closer to the catalytic site of AldC than those residues with lesser impact on AldC activity. The N159A and L419A mutants lacked significant activity, as noted above. Although the W160A, S292A, L419V, and F456A mutants displayed activity in the initial screen experiment (Fig. 7), the estimated Km values for these proteins with octanal exceeded the solubility limit of this substrate (∼10 mm) in assays, which prevented accurate assessment of kinetic parameters. The N159A mutant abrogates the hydrogen bond between Asn159 and the carbonyl of octanal; the loss of this interaction likely alters the position of the reactive group for catalysis and results in the undetected activity of this mutant. Ser292 is also positioned in proximity to the reactive aldehyde group of octanal with mutation to an alanine, resulting in a 30-fold decrease in specific activity (Fig. 7).
Trp160 provides surface contacts along the central portion of octanal carbon chain with the W160A mutation significantly altering the topology of the binding tunnel. Replacement of Trp160 with a tyrosine resulted a less than 2-fold change in catalytic efficiency (Table 3). Leu419 is also positioned at the intersection of the NAD(H) and octanal-binding sites and forms van der Waals contacts with Ala291 in the AldC structure. Changes of this residue to either alanine (L419A) or valine (L419V) may alter the orientation of the catalytic sulfhydryl required for the conversion of octanal to its carboxylic acid. In the AldC crystal structure, Phe456 π-stacks with Tyr468, which forms an interaction network with Tyr163 and Trp450 (Fig. 6). Mutation of Phe456 to an alanine (F456A) also increases the apparent Km of octanal, as described above, which likely results from altered packing along the aldehyde-binding site of the F456A mutant. In contrast, the F456W mutant results in a modest 2-fold decrease in kcat/Km with octanal (Table 3).
The opposites of Trp160, Tyr468, Tyr163, and Trp450 are interconnected through a series of hydrogen bonds to form one of the walls of the octanal-binding site (Fig. 6). Disruptions in this network using the Y468A, Y163A, and W450A mutants resulted in 20-, 5-, and 9-fold decreases in catalytic efficiency with octanal (Table 3). It appears that removal on one of these aromatic side chains alters AldC activity but is not sufficient to significantly disrupt substrate binding and catalysis. Alanine substitutions of the three residues situated toward the solvent entrance of the aldehyde-binding site, Met114, Leu118, and Arg285, have only slight 3–4-fold reductions in catalytic efficiency compared with WT AldC (Table 3), which is consistent with the interior hydrophobic surface as the major determinant of substrate interaction. Two other residues in the octanal-binding site, Gln164 and Ser290, also play lesser roles in substrate binding, as suggested by the modest effects of the Q164A and Q164N mutations and the S290A mutant (Table 3).
Conclusion
The genus Pseudomonas is one of the most ubiquitous and complex among the Gram-negative bacteria (
). Because many Pseudomonas species evolved to grow under unfavorable environmental conditions (i.e. severe nutrient limitation, extreme temperatures, high salinity, and low oxygen or water availability), they also evolved metabolic diversity and plasticity to use a variety of nutrient sources (i.e. carbon, nitrogen, and sulfur), to detoxify toxic organic chemicals, and to produce multiple specialized metabolites, including polymers and small molecule compounds (
). In particular, P. syringae, a species that includes many plant pathogenic strains, developed diverse bacterial virulence mechanisms to survive in the adverse environmental conditions of the phyllosphere (
). Further, recent studies suggest that PtoDC3000 uses several strategies to manipulate the auxin biology in its host plants to promote pathogenicity, including using an indole-3-acetaldehyde dehydrogenase for IAA synthesis (
). Recently, a genus-wide study identified a total of 6,510 aldehyde dehydrogenase sequences in 258 Pseudomonas strains belonging to 46 different species, but only a limited number of newly identified Pseudomonas aldehyde dehydrogenases have been biochemically characterized (
), which have been studied because of their ability to catabolize and biodegrade a range of compounds, including napthalene-derived molecules, and metabolic aldehyde dehydrogenases linked to the osmoprotectant betaine and growth in biofilms (
), less is understood about the biochemistry of aliphatic aldehyde dehydrogenases in these microbes. Here we comprehensively examined and identified AldC from PtoDC3000 as a long-chain aliphatic aldehyde dehydrogenase using an integrated approach combining phylogenetic, crystallographic, and biochemical analyses.
The biochemical activity of AldC from PtoDC3000 is consistent with the traditional role of aldehyde dehydrogenases as metabolic clean-up enzymes that convert reactive aldehydes into less active carboxylates (
); however, growing evidence suggests that these enzymes may have broader contributions to the specialized environments that different Pseudomonas species occupy. For example, recent work identified AldA as an indole-3-acetaldehyde dehyrogenase involved in the synthesis of the phytohormone IAA in the plant pathogen P. syringae DC3000 and demonstrated a role for this protein in virulence of the pathogen in plant leaves (
). Given the growth environment of this pathogenic Pseudomonas species in the interior of plant leaves, it is possible that aliphatic molecules in the apoplast could contribute to energy sources for the microbe or to the synthesis of surfactants that help establish microbial colonization in the host plant (
). The physiological contribution of AldC to the life cycle of PtoDC3000 and related plant pathogenic or plant-associated species and strains remains to be evaluated but raises the question of how these microbes develop host relationships in the natural environment. Overall, this study provides molecular insight for understanding the evolution of the prokaryotic aldehyde dehydrogenase superfamily and for the potential development of inhibitors for the pathogenic Pseudomonas species, which could be useful for pathogen control in agriculture.
Experimental procedures
Cloning and site-directed mutagenesis
The pET28a-AldC construct was previously described (
). Briefly, the full-length coding sequence of AldC (UniProt: PSPTO_3644; GenBankTM: WP_011104646.1) was amplified from P. syringae strain PtoDC3000 genomic DNA and cloned into the pBlunt II-TOPO vector (Invitrogen). The AldC fragment from the pBlunt II-TOPO-AldC was excised using NdeI and XhoI and subcloned into pET-28a (Novagen). Site-directed mutants of AldC (M114A, L118A, W158A, W158H, N159A, W160A, W160Y, Y163A, Q164A, Q164N, K182A, K182Q, E185A, E185Q, T234A, T234V, S236A, S236T, E257A, E257Q, R285A, S290A, C291A, S292A, E391A, E391D, L419A, L419V, W450A, F456A, F456W, and Y468A) were generated using the QuikChange PCR method (Agilent Technologies) with the pET28a-AldC construct as template. For protein expression, each pET28a-AldC construct was transformed into E. coli BL21(DE3) (EMD Millipore).
Protein expression and purification
Transformed E. coli BL21(DE3) cells containing either WT or mutant AldC construct were grown at 37 °C in Terrific broth with 50 µg ml−1 kanamycin until A600 nm = ∼0.6–0.9. After induction with 1 mm isopropyl-1-thio-β-d-galactopyranoside, the cells were grown at 18 °C overnight. Following centrifugation (5,000 × g for 30 min), the cell pellets were resuspended in 50 mm Tris, pH 8.0, 500 mm NaCl, 25 mm imidazole, 10% (v/v) glycerol, and 1% (v/v) Tween 20. Following lysis by sonication, the cell debris was removed by centrifugation (12,000 × g for 45 min), and the supernatant was loaded onto a Ni2+-nitriloacetic acid column (Qiagen). The column was washed with 50 mm Tris, pH 8.0, 500 mm NaCl, 25 mm imidazole, and 10% (v/v) glycerol to remove unbound proteins. Column-bound AldC protein was eluted using 50 mm Tris, pH 8.0, 500 mm NaCl, 25 mm imidazole, 10% glycerol, and 250 mm imidazole. AldC was further purified by size-exclusion chromatography with a Superdex 200 16/60 size-exclusion column (GE Healthcare) equilibrated in 25 mm Hepes (pH 7.5) and 100 mm NaCl. Fractions corresponding to the purified AldC protein were pooled and concentrated to ∼9 mg ml−1. Protein concentrations were determined using the Bradford method with BSA as a standard.
Enzyme assays and steady-state kinetic analysis
Enzymatic activity of WT and mutant AldC was measured by monitoring NADH formation (ε340 nm = 6220 m−1 cm−1; 100-μl volume) at A340 nm using an EPOCH2 microplate spectrophotometer (BioTek). The substrate and cofactor screening experiments were performed at 25 °C in a standard assay mix of 100 mm Tris·HCl (pH 8.0) and 100 mm KCl with a fixed concentration of NAD+ (2 mm) and aldehyde (5 mm). The panel of aldehydes and their chemical structures used for substrate screening is shown in Table S2. Steady-state kinetic parameters of WT and mutant AldC were determined at 25 °C in a standard assay mix with either fixed NAD+ (2.0 mm) and varied aldehyde (i.e. valeraldehyde, hexanal, heptanal, octanal, nonanal, hydrocinnamaldehyde, or 4-pyridinecarboxaldehyde; 0.01–10 mm) or with fixed octanal (5 mm) and varied cofactor (NAD+ or NADP+; 0.05–5 mm). Protein amounts ranged from 0.1 µg for WT AldC to 100 µg for less active AldC mutants. The resulting initial velocity data were fit to the Michaelis–Menten equation, v = (kcat[S])/(Km + [S]), using SigmaPlot.
Protein crystallography
Protein crystals of the AldC C291A mutant were grown by the hanging-drop vapor-diffusion method at 4 °C. Crystals of the AldC C291A mutant (9 mg ml−1) in complex with octanal and NAD+ formed grew in drops of a 1:1 mixture of proteins and crystallization buffer: 20% (w/v) PEG-1000, 100 mm Tris·HCl (pH 7.0), 2 mm octanal, and 5 mm NAD+. Crystals were stabilized in cryoprotectant (mother liquor with either 30% (v/v) glycerol or 30% (v/v) ethylene glycol) before flash-freezing in liquid nitrogen for data collection at 100 K. Diffraction data were collected at Beamline 19-ID of the Advanced Photon Source at the Argonne National Laboratory HKL3000 was used to index, integrate, and scale the collected X-ray data (
) with the three-dimensional structure of the AldA indole-3-acetaldehyde dehydrogenase from P. syringae, which shares 39% amino acid identity with AldC, as a search model (Protein Data Bank code 5IUW) (
) were used for iterative rounds of model building and refinement, respectively. Data collection and refinement statistics are summarized in Table 2. The final model included Leu6–Leu490 of chain A, Leu6–Asp489 of chain B, 1 octanal, and 1 NAD+ in each chain, and 197 waters. Atomic coordinates and structure factors for the AldC(C291A)·octanal·NAD+ complex were deposited in the RCSB Protein Data Bank (code 6X9L).
Data availability
All data are contained in the article with structural information deposited in the Protein Data Bank (code 6X9L), as noted above.
The Arabidopsis thaliana REDUCED EPIDERMAL FLUORESCENCE1 gene encodes an aldehyde dehydrogenase involved in ferulic acid and sinapic acid biosynthesis.
Identification of genes involved in cell wall biogenesis in grasses by differential gene expression profiling of elongating and non-elongating maize internodes.
Over-expression of different aldehyde dehydrogenase genes in Arabidopsis thaliana confers tolerance to abiotic stress and protects plants against lipid peroxidation and oxidative stress.
Arabidopsis and tobacco plants ectopically expressing the soybean antiquitin-like ALDH7 gene display enhanced tolerance to drought, salinity, and oxidative stress.
Structural, biochemical, and computational studies reveal the mechanism of selective aldehyde dehydrogenase 1A1 inhibition by cytotoxic duocarmycin analogues.
Author contributions—S. G. L., K. H., S. A. M., B. N. K., and J. M. J. conceptualization; S. G. L. data curation; S. G. L., K. H., and W. C. formal analysis; S. G. L., C. K. H., B. C., W. C., B. N. K., and J. M. J. supervision; S. G. L., K. H., O. A., S. O. A., A. G. B., H.-Y. S. C., M. D., C. M. F., A. R. G., J. K., J. S. K., M. C. L., J. J. L., R. L., J. E. L., C. M. M., A. F. M., S. H. P., V. P., C. D. R., A. S., S. S., J. D. T., B. R. C., C. K. H., B. C., and W. C. investigation; S. G. L., K. H., O. A., S. O. A., A. G. B., H.-Y. S. C., M. D., C. M. F., A. R. G., J. K., J. S. K., M. C. L., J. J. L., R. L., J. E. L., C. M. M., A. F. M., S. H. P., V. P., C. D. R., A. S., S. S., J. D. T., B. R. C., C. K. H., B. C., and W. C. methodology; S. G. L., K. H., S. A. M., B. N. K., and J. M. J. writing-original draft; S. G. L., K. H., S. A. M., B. N. K., and J. M. J. writing-review and editing; S. A. M., B. N. K., and J. M. J. resources; S. A. M. and J. M. J. validation; B. N. K. and J. M. J. funding acquisition; B. N. K. and J. M. J. project administration.
Funding and additional information—This work was supported by National Science Foundation Grants IOS-1645908 (to B. N. K.) and MCB-1614539 (to J. M. J.) and a grant from the Howard Hughes Medical Institute Science Education Program (to J. M. J.). C.K.H. was supported by National Science Foundation Graduate Research Fellowship DGE-1143954. This project was part of Bio4522, an upper-level laboratory course at Washington University in St. Louis. Portions of this research were carried out at the Argonne National Laboratory Structural Biology Center of the Advanced Photon Source, a national user facility operated by the University of Chicago for the Department of Energy Office of Biological and Environmental Research under Grant DE-AC02-06CH11357.
Conflict of interest—The authors declare that they have no conflicts of interest with the contents of this article.