Defining a Structural and Kinetic Rationale for Paralogous Copies of Phenylacetate-CoA Ligases from the Cystic Fibrosis Pathogen Burkholderia cenocepacia J2315*

The phenylacetic acid (PAA) degradation pathway is the sole aerobic route for phenylacetic acid metabolism in bacteria and facilitates degradation of environmental pollutants such as styrene and ethylbenzene. The PAA pathway also is implicated in promoting Burkholderia cenocepacia infections in cystic fibrosis patients. Intriguingly, the first enzyme in the PAA pathway is present in two copies (paaK1 and paaK2), yet each subsequent enzyme is present in only a single copy. Furthermore, sequence divergence indicates that PaaK1 and PaaK2 form a unique subgroup within the adenylate-forming enzyme (AFE) superfamily. To establish a biochemical rationale for the existence of the PaaK paralogs in B. cenocepacia, we present high resolution x-ray crystal structures of a selenomethionine derivative of PaaK1 in complex with ATP and adenylated phenylacetate intermediate complexes of PaaK1 and PaaK2 in distinct conformations. Structural analysis reveals a novel N-terminal microdomain that may serve to recruit subsequent PAA enzymes, whereas a bifunctional role is proposed for the P-loop in stabilizing the C-terminal domain in conformation 2. The potential for different kinetic profiles was suggested by a structurally divergent extension of the aryl substrate pocket in PaaK1 relative to PaaK2. Functional characterization confirmed this prediction, with PaaK1 possessing a lower Km for phenylacetic acid and better able to accommodate 3′ and 4′ substitutions on the phenyl ring. Collectively, these results offer detailed insight into the reaction mechanism of a novel subgroup of the AFE superfamily and provide a clear biochemical rationale for the presence of paralogous copies of PaaK of B. cenocepacia.

Aromatic compounds are ubiquitous in the environment and exist primarily in the form of recycled plant material. Specialized microbial degradative pathways have evolved to overcome the resonance stabilized aromatic nucleus, thereby serving an integral role in the global carbon cycle. Furthermore, charac-terization of the bioprocessing enzymes comprising such pathways offers opportunities to engineer novel bioremediation and biofuel production strategies. The ability to degrade aromatic compounds also is an influential factor in microbial pathogenesis. For example, the degradation of phenylacetic acid (PAA) 3 is proposed to facilitate B. cenocepacia in establishing lifethreatening infections in cystic fibrosis patients (1). Although the precise mechanism that governs this process is not yet understood, it is speculated that early PAA degradation pathway intermediates play a role in host cell damage (2,3).
The PAA pathway is a highly conserved metabolic solution for the assimilation of PAA, with the corresponding genes identified in 16% of sequenced bacterial genomes (3). Moreover, many structurally related compounds, including aromatic pollutants styrene and ethylbenze, are converted to phenylacetate prior to metabolism via the PAA pathway (4,5). Although this novel, hybrid pathway is the only known route for "aerobic" metabolism of PAA, it also exploits CoA activation, a feature typical of classical "anaerobic" aromatic degradation pathways. As such, the initial enzyme of the PAA pathway, phenylacetate-CoA ligase (PCL), plays a vital role in regulating entry of potential substrates into the pathway. An unusual feature of the PAA pathway in B. cenocepacia is that it encodes two copies of the pcl gene (paaK1 and paaK2), yet harbors only a single copy of each subsequent enzyme in the pathway (2). Although a rationale for the additional PCL in B. cenocepacia has not been established, studies of related bacterial PCLs suggest the potential for altered kinetics (6 -8).
PaaK1 and PaaK2 are members of the adenylate-forming enzyme (AFE) family PFAM00501, which incorporates enzymes involved in metabolism of short-to-long chain fatty acids, aromatic compounds, biosynthesis of siderophores and peptide antibiotics, and luciferases (9). The family maintains a general architecture of two ␣/␤ domains with the active site formed at the interface of the N-and C-terminal domains (10 -13). Sequences reveal noteworthy divergent features of PaaK1 and PaaK2 with respect to the superfamily, including a novel N-terminal sequence spanning ϳ70 residues. The general mechanism of the AFE superfamily indicates that catalysis relies on a large conformational change (9,14), yet it is rare that a single enzyme is captured in multiple conformations. Furthermore, there are very few ATP-bound structures (15) mak-ing it difficult to refine the general catalytic mechanism of these metabolically important enzymes.
The goal of our current study is to address the outstanding question of why B. cenocepacia encodes paralogous copies of PaaK1 and PaaK2. Through high resolution x-ray co-crystal structures with both ATP and the phenylacetyl adenylate intermediate, and kinetic assays, we propose a biochemical rationale for the presence of the two isozymes and include detailed analyses of the unique structural features such as the novel N-terminal microdomain. Collectively, these data offer rare insight into multifunctional roles for key active site residues and substructures and are discussed with respect to the general catalytic mechanism AFE superfamily enzymes.

EXPERIMENTAL PROCEDURES
Cloning, Expression, and Purification-The paaK1 (YP_ 002229570) gene was amplified from B. cenocepacia J2315 genomic DNA provided by Dr. Silvia Cardona (University of Manitoba, Winnipeg, Canada). A synthetic paaK2 (YP_002234323) gene with reduced GC content was ordered from GenScript. Both genes were cloned into pET28a(ϩ) (Novagen, Mississauga, ON, Canada) vector in frame with a hexahistidine tag and thrombin cleavage site. Escherichia coli BL21 Star (DE3) cells (Invitrogen) were used for protein production in Overnight Express Instant TB (EMD Chemicals) medium supplemented with 50 g ml Ϫ1 kanamycin (Sigma) at 32°C. Production of selenomethionine-derivatized PaaK1 was carried out in E. coli 834 (DE3) (a methionine auxotroph; Novagen) and grown in SelenoMet medium (AthenaES) supplemented with L-selenomethionine to a final concentration of 40 g ml Ϫ1 (AthenaES). The cells were grown at 32°C until reaching an A 600 nm of 0.6 at which isopropyl-␤,D-thiogalactopyranoside was added to a final concentration of 0.75 mM.
Crystallization and Data Collection-All crystallization trials were carried out using the sitting-drop, vapor diffusion method in 96-well plates. SeMet PaaK1 (at 12 mg ml Ϫ1 ) was initially incubated with 3 mM MgCl 2 and ATP for 1 h prior to crystallization. Diffraction quality crystals were obtained in 20 -25% PEG 3350, 200 mM potassium thiocyanate, and 5% glycerol. A single PaaK1 crystal was looped into cryo-protect-ant consisting of reservoir solution supplemented with 3 mM MgCl 2 , 3 mM ATP, and 25% glycerol for 30 s and flash cooled directly in the cryo-stream (100 K). Diffraction data were collected at the Stanford Synchrotron Radiation Lightsource on beamline 9-2 at a wavelength of 0.9792 Å.
To obtain the phenylacetyl adenylate co-structures, PaaK1 and PaaK2 were concentrated to 12 mg ml Ϫ1 protein and incubated for 1 h with 3 mM MgCl 2 , ATP, and 5 mM phenylacetic acid prior to crystallization trials. Crystals of PaaK1 were grown 10% (w/v) PEG 8000, 0.1 M Hepes, pH 7.5, and 8.0% (v/v) ethylene glycol, and crystals of PaaK2 were grown in 17% (w/v) PEG 6000, 0.1 M Hepes, pH 7.5, 0.1 M KCl, and 2.5% glycerol. A single PaaK1 crystal was flash cooled directly in the cryo-stream (100 K), and data were collected on a Rigaku R-axis IVϩϩ area detector. For PaaK2, a single crystal was looped and stepped slowly into cryo-protectant solution consisting of reservoir solution supplemented with 3 mM MgCl 2 , 3 mM ATP, 5 mM phenylacetic acid, and 10 and 15% glycerol cooled directly on the cryo-stream (100 K). Diffraction data for PaaK2 was collected on beamline 9-2 at the Stanford Synchrotron Radiation Lightsource.
Data Processing, Structure Solution, and Refinement-All data sets were processed with iMosflm (16), scaled with Scala (17), and the models were refined with REFMAC using 5% of the reflections for calculation of R free (18). Phasing of PaaK1 (1.6 Å resolution) was carried out with ShelxC/D/E (19), and automated building was performed using Buccaneer (20), all within the CCP4 suite of programs (21). Manual building was completed in Coot (22). PaaK1 (1.92 Å resolution) and PaaK2 (1.90Å resolution) adenylated intermediate co-structures were solved by molecular replacement using MOLREP (23) with the ATP-bound PaaK1 as the search model. The adenylated phenylacetate ligand and accompanying library file were generated using the Dundee Prodrg2 Server (24). The majority of the PaaK2 model was built with Buccaneer (20), though the majority of the C-terminal domain required manual building in Coot (22). The high resolution of the data also permitted modeling of small molecules such as glycerol, thiocyanate, and ␤-mercaptoethanol. Data collection and refinement statistics are presented in Table 1.
Enzyme Assays and Kinetics-Enzyme activity measurements for PaaK1 and PaaK2 were carried out using the indirect spectrophotometric assay described by Ziegler et al. (25) and adapted to a 96-well plate (11). Briefly, the assay is based on linking production of AMP by PaaK1 and PaaK2 to the oxidation of NADH to NAD, which is measured spectrophotometrically at 365 nm. The assay utilized 3 g of PaaK1 or PaaK2 per 200 l of reaction volume. Concentrations ranging from 20 to 1000 M phenylacetic acid were tested with saturating concentrations (2 mM) of co-substrates CoA and ATP. Similarly, for the substituted phenylacetate substrates, 0.1-3 mM concentrations were tested using 2 g of PaaK1 or PaaK2. Activity is reported as none detected where change in absorbance with 3 mM substrate was not discernible from that of the negative control showing spontaneous decay of NADH.

Dimerization Leads to Formation of an Intermolecular
Extended ␤-Sheet-The size exclusion elution profile of PaaK1 indicated a dimeric organization (Fig. 1a) and accordingly, PaaK1 crystallized as an intimate dimer in space group P1 (Fig.  1b). The individual PaaK1 monomers are structurally equivalent with an root mean square deviation of 0.251 Å over 376 C␣ atoms. In forming the dimer interface, ϳ1383Å 2 of surface area is buried, which is stabilized through extensive shape and chemical complementarity. Nonpolar residues comprise ϳ55% of the interface and are accompanied by 17 hydrogen bonds and two salt bridges (Arg 89 and Glu 152 ). It is noteworthy that all of the residues participating in dimerization are derived from the N-terminal domain leaving the C-terminal domain free to undergo the conformational reorganization between the adenylation (phenylacetate ϩ ATP 3 phenylacetyl adenylate ϩ pyrophosphate) and thioesterification (phenylacetyl adenylate ϩ CoA 3 phenylacetyl-CoA ϩ AMP) reactions.
The assembly of the symmetrical PaaK1 dimer results in the formation of two extended, intermolecular seven-strand ␤sheets each composed of a two-strand antiparallel ␤-sheet (Fig.  1c, purple) from one monomer and a five-strand distorted ␤-sheet from the second monomer (Fig. 1c, orange). Interestingly, the topology of the extended ␤-sheets observed in the PaaK1 dimer appear to mimic a similar arrangement defined within a single monomer of family members (Fig. 1d) (11, 26 -28). The multimeric requirement of PaaK1 to recapitulate the extended ␤-sheet may indicate a branching point in the evolution of CoA ligases.
A Novel N-terminal Microdomain in PaaK1 and PaaK2-The PaaK1 monomer is composed of a larger N-terminal domain incorporating Pro 4 -Gly 325 (Fig. 2a, green), which as described above, forms the dimer interface and a smaller C-terminal domain comprising residues Met 331 -Arg 430 (Fig. 2a,  blue). The domains are connected by a short, solvated linker ( 326 RSDDM 330 ), with the active site formed at the domain interface. The N-terminal domain of PaaK1 is comprised of three ␤-sheets sandwiched between nine ␣-helices, whereas the C-terminal domain is defined by two helices, a short two-strand antiparallel ␤-sheet, and a twisted four-stranded sheet. In the PaaK1 ATP co-structure, the C-terminal domain is properly oriented to position the invariant Lys 422 (Fig. 2a, red) within the active site, consistent with superfamily members occupying conformation 1 (11,14,15,26,28). Although the general architectural features of PaaK1 conform to the homologous members in the AFE superfamily (14), extensive structural divergence is observed within the N-terminal ϳ70 residues. Intriguingly, this region possesses negligible sequence or structural identity with any structurally characterized AFE family member, and thus, we propose that bacterial PCLs such as PaaK1 and PaaK2 constitute a separate subgroup within the superfamily. This region of PaaK1 adopts a continuous three helical bundle structure measuring ϳ35 ϫ 20 Å (Fig.  2b). Reminiscent of a leucine zipper, six leucine residues partic-ipate in hydrophobic interactions, with additional contributions from Tyr 30 , Phe 44 , and Phe 63 (Fig. 2b). Collectively, these interactions stabilize what appears to be a compact microdomain. Despite the predicted stability of this substructure, it thoroughly integrated with the larger, conserved portion of the N-terminal domain (Thr 67 -Arg 326 ) with a complexation significance score of 1.00 (29). The overall interface is formed through complementary basic and acid patches resulting in nine salt bridges and 26 hydrogen bonds. Hydrophobic interactions also have a significant presence with a hydrophobic segment defined by 295 ALPII 299 on the larger N-terminal domain inserting into the hydrophobic cavity formed at the center of the N-terminal helical bundle. Though no function is immediately apparent for this novel substructure, it may be utilized for protein-protein interactions, perhaps facilitating recruitment of subsequent PAA enzymes such as the multicomponent oxygenase, PaaABCDE. As a result of this remodeled N-terminal microdomain, PaaK1 is unable to adopt the typical familial arrangement of the initial ϳ200 residues, normally consisting of four ␤-sheets sandwiched between five helices (Fig. 2c) (11,12,26,28). Consequently, the P-loop flanking ␤-strands of PaaK1 are isolated in the monomer structure, necessitating dimerization for reconstitution of this ␤-sheet environment. However, apart from the N-terminal regions, the core N-terminal domain of PaaK is highly homologous with related family members such as benzoate-CoA ligase (Protein Data Bank code 2V7B) (11), displaying an root mean square deviation of 1.72 Å over 141 C␣ atoms. The RCSB coordinate file and structure factor codes for the 1.60 Å crystal structure of PaaK1 in complex with ATP are 2Y27 and r2y27sf, respectively. The RCSB coordinate file and structure factor codes for the 1.92 Å crystal structure of PaaK1 in complex with the adenylated phenylacetate intermediate are 2Y4N and r2y4nsf, respectively. The RCSB coordinate file and structure factor codes for the 1.90 Å crystal structure of PaaK2 in complex with the adenylated phenylacetate intermediate are 2Y4O and r2y4osf, respectively.

Pre-and Postadenylation Complexes of PaaK1 Reveal Dynamic Enzyme-Substrate Interactions-
The ability to discern dynamic information from crystal structures relies in part on being able to co-crystallize the enzyme at different stages of substrate turnover. In the case of an AFE, the first two stages can be defined as the pre-and postadenylation state. It is noteworthy that capturing these different states in the context of one enzyme is rare, with DltA from Bacillus cereus serving as the only reported example (15,30). To explore the reorganization of key structural features in PaaK1 as the enzyme progresses through the adenylation reaction, we report the high resolution structures of PaaK1 in complex with ATP (Fig. 3a) and the adenylated phenylacetate intermediate (Fig. 3b).
P-loop-The PaaK1 ATP co-structure displays a well ordered P-loop ( 93 SSGTTGKPT 101 ) that envelopes the ␤ and ␥ phosphates of ATP (Fig. 3a, inset, green residues). Five hydrogen bonds between the backbone amides of the P-loop and the ATP phosphates stabilize the P-loop and are accompanied by three additional hydrogen bonds from Ser 94 , Thr 96 , and Thr 97 side chains (Fig. 3c). Based on the stability imparted by this hydrogen bond network, it is not surprising that removal of the pyrophosphate in transitioning to the adenylate intermediate results in P-loop becoming disordered and therefore unmodeled (Fig. 3, d and e). These data are consistent with the classical expectations of the P-loop to bind the phosphates of ATP. Once the ␤ and ␥ are removed, the P-loop does not contribute to binding the adenylate intermediate within the active site.
Lys/Arg Pair Orient Phosphates for Nucleophillic Attack-Initially, the catalytically essential lysine (Lys 422 ) forms bifurcated hydrogen bonds with the ␣and ␤-phosphates of ATP (Fig. 3c), where it is proposed to shield point charges and facilitate nucleophillic attack at the ␣-phosphate (15). Despite the flexibility of the lysine residue, only the ⑀-amino of Lys 422 is reoriented in the PaaK1-phenylacetyl adenylate co-structure such that it forms two hydrogen bonds with the ␣ phosphate (Fig. 3, d and e). These results are consistent with the lysine playing an essential catalytic role in the adenylation reaction, with the high density positive charge localized to the reactive center of the ATP, the ␣-phosphate.
In contrast, the active site arginine of the conserved interdomain linker, Rx(D/K)x 6 G, exhibits a much greater degree of movement between the two PaaK1 crystal structures (Fig. 3d). In the ATP/preadenylation complex, Arg 326 of PaaK1 forms bidentate hydrogen bonds with the ␤-phosphate of ATP (Fig.  3a). In the adenylate intermediate co-structure, Arg 326 is reoriented such that it hydrogen bonds with the 2Ј ribose hydroxyl of the phenylacetyl adenylate intermediate (Fig. 3, d and e). Thus, our data is consistent with a proposed bifunctional role for Arg 326 where initial stabilization of the ATP phosphates is replaced with a stabilizing role of the adenylated intermediate (15).
Coordinated Mg 2ϩ Ion-The ␤and ␥-phosphates of ATP coordinate a single Mg 2ϩ ion that is further coordinated by four water molecules in an overall octahedral geometry (Fig. 3, a and   FIGURE 2. Novel N-terminal microdomain. a, monomer structure of PaaK1 with the larger N-terminal domain in green, the C-terminal domain in blue, and active site conserved lysine shown in red. b, PaaK1 contains a small helical bundle arrangement at the N terminus (boxed) but lacks the typical N-terminal arrangement exhibited by family members (dashed circle). Inset, helical bundle microdomain of PaaK1 is largely stabilized by hydrophobic interactions of Leu and Phe side chains. The electrostatic surface of the microdomain interface predicts a large hydrophobic patch surrounded by charged patches. c, overlays of four homologous family members (Protein Data Bank code 2V7B, benzoate-CoA ligase; Protein Data Bank code 1MDB, 2,4-dihydroxybenzoate AMP ligase; Protein Data Bank code 2D1T, firefly luciferase; Protein Data Bank code 2P2J, acetyl-CoA synthetase) demonstrate the typical ␣/␤-sandwich arrangement at the N terminus (circle). c). Abutting this arrangement is the conserved Glu 241 that hydrogen bonds with two of the coordinated water molecules (Fig. 3c). Glu 241 of the PaaK1-adenylate structure has essentially maintained its position from that of the ATP-bound structure, despite the departure of the ␤and ␥-phosphates and solvated Mg 2ϩ from the active site (Fig. 3, d and e). Instead, a single bridging water molecule enables the Glu 241 to interact indirectly with the ␣-phosphate in the PaaK1-adenylate structure (Fig. 3e).
It is clear that the ␤and ␥-phosphates require numerous interactions for correct orientation for in-line attack from the carboxylate oxygen of the phenylacetate substrate. The bifunctional role of the interdomain arginine and active site glutamate suggested by our structural data are consistent with DltA mutagenesis data where mutation of either the arginine or glutamate to glutamine reduced the k cat by ϳ20and 14-fold, respectively. The K m , however, was increased by 6-and 8-fold likely due to the critical role for these residues in anchoring the adenylate intermediate (15). Collectively, these data reveal bifunctional strategies for residues that initially contort the terminal phosphoryl groups of the ATP to adopt new catalytically important interactions as the reaction cycle progresses.
P-loop Functions to Stabilize C-terminal Domain in Conformation 2-As members of the AFE superfamily, bacterial phenylacetate-CoA ligases such as PaaK1 and PaaK2 undergo large conformational changes to remodel the single active site for two distinct part reactions (Fig. 4a). Notably, PaaK2 was captured following the domain reorientation in conformation 2 (Fig. 4, b and d), thus presenting the opportunity for additional insight into structural reorganization during the course of catalysis. In contrast to both PaaK1 complexes, the invariant lysine (Lys 429 in PaaK2) is now completely removed from the active site and shifted ϳ29 Å from its previous position (Fig. 4,  c and d). Unexpectedly, the P-loop is well ordered (Fig. 4d), which contrasts with the P-loop in the Paak1 adenylated intermediate capture in conformation 1. Stability of the reorganized P-loop in conformation 2 is mediated by hydrogen bonds between the backbone carbonyl of Thr 100 and of Ser 345 (O␥), between Glu 349 (O⑀2) and the backbone amide nitrogen of Gly 102 , as well as a bifurcated hydrogen bond between Ser 98 (O␥), Gln 346 (N⑀2), and the backbone carbonyl of Asn 341 . These data strongly suggest that the P-loop plays a critical bifunctional role in the AFE superfamily where its role in coordinating ATP in the adenylation part reaction is replaced by a role in stabilizing the C-terminal domain in conformation 2 during the thioesterification part reaction. Indeed, an ordered P-loop has been observed previously for AFE enzymes captured in conformation 2 (9, 10, 31, 32), where polar interactions take place between the P-loop and C-terminal domain and often incorporate solvent molecules. Cycling of P-loop from ordered (PaaK1: ATP Ϫ conformation 1) to disordered (PaaK1:adenylate intermediate Ϫ conformation 1) back to ordered (PaaK2:adenylate intermediate Ϫ conformation 2) offers a rare and seamless view of the entire process of structural reorganization during substrate turnover.
An Extended Aryl Binding Pocket in PaaK1 Contrasts with PaaK2-Both the PaaK1 and PaaK2 adenylate intermediate costructures displayed well ordered electron density within the active site, permitting the phenylacetyl adenylate to be accurately modeled with low average B-factors of 26.69 and 12.12 Å 2 , respectively (Figs. 3b and 4c and Table 1). Detailed structural comparison of these forms revealed diversity in the aryl binding pockets (Fig. 5a). In PaaK1, the base of the pocket is defined by Ala 147 resulting in a pocket ϳ7 Å wide by 10 Å deep. In PaaK2, an isoleucine (Ile 151 ) takes the place of Ala 147 in PaaK1 resulting in the pocket being reduced in depth to ϳ7 Å. Intriguingly, 4-chlorobenzoate-CoA ligase of Alcaligenes sp. AL3007, which accepts 4-chlorobenzoate as its native substrate, also incorporates an Ala residue (Ala 213 ) at this position (33). Therefore, although the active site pockets in both PaaK paralogs accommodate the phenyl group of the phenylacetate intermediate, the extension in PaaK1 may endow the enzyme with the ability to bind a broader range of substrates with substitutions on phenyl ring. The only additional change in the active site pocket is the substitution of Tyr 136 in PaaK1 for Phe 140 in PaaK2 (Fig. 5a). These residues overlay nearly perfectly and do not contribute to the volume difference, however the hydroxyl group of Tyr 136 would likely allow PaaK1 to better accommodate polar ring substitutions.
PaaK1 Displays a Lower K m for Phenylacetic Acid and Broader Substrate Specificity Than PaaK2-Prior to this study, there was no rationale for the presence of two copies of PaaK in B. cenocepacia. One intriguing possibility was that the paralogs would exhibit different kinetic parameters and substrate specificities as a result of substrate pocket remodeling. This hypothesis was based on the previously observed wide range of K m values and substrate specificities for orthologous PCLs, coupled with the fact that PaaK1 and PaaK2 share only 69% identity and therefore may display substitutions within the substrate binding pocket. Indeed, two residue substitutions resulted in structural divergence between the paralogs. To determine how these structural differences translate to functional profiles, kinetic analyses were undertaken using phenylacetic acid as well as a variety of ring-substituted phenylacetic acids.
The most dramatic difference between the paralogs for the native phenylacetic acid substrate was observed for the K m , with PaaK2 (150 M) more than double that of PaaK1 (62 M) (Fig. 5b). The apparent V max and calculated k cat values were more similar, with PaaK2 displaying a 20% higher k cat at 300 min Ϫ1 (Fig. 5b). Collectively, these results show that despite the additional volume of the aryl substrate binding pocket, PaaK1 maintains a higher affinity for phenylacetic acid. Furthermore, PaaK1 was also able to better accommodate the substituted phenylacetic acid substrates, especially the 4-hydroxyphenylacetic acid, for which PaaK2 did not show detectable activity. Although both paralogs were able to accept 3Ј substituted phenylacetic acid, PaaK1 exhibited a much lower K m (ϳ6.5-fold) than PaaK2, likely due to the extension of the aryl binding pocket. Not surprisingly, neither paralog was active toward 2Ј substituted phenylacetic acid, which would position the substituent toward either the adenine ring or the helical backbone defining one side of the pocket. Similarly, 3,4-substituted phenylacetic acid was not accepted by either paralog, demonstrating that PaaK1 could accommodate either a 3Ј or to a lesser extent a 4Ј substituent, but not both, within the extended pocket. In the case of 4-chlorobenzoate-CoA ligase, it was possible to expand the substrate range to 3Ј and 3Ј4Ј-substituted chlorobenzoate through mutation of the active site residue Ile 303 to Ala and Gly (33). Mutation of the corresponding Ile of PaaK1 (Ile 236 ) or PaaK2 (Ile 240 ), which is positioned between the 3Ј and 4Ј positions of the phenyl ring, would likely also improve activity with of 3Ј-and 4Ј-substituted phenylacetate. Conclusions-With PaaK1 and PaaK2 catalyzing the first and only committed step of the pathway (34), these enzymes are perfectly positioned to control the flow of phenylacetic acid into the PAA pathway thereby contributing to the nutritional requirements of B. cenocepacia in infected cystic fibrosis patients. High resolution crystal structures of PaaK1 and PaaK2 captured at different catalytic stages offer insight into the dynamic cycling of the P-loop, which likely serves to promote conformational change in the C-terminal domain, and the subtle, yet critical, repositioning of multipurpose active site residues such Arg 326 and Glu 241 . Furthermore, our data provides a clear rationale for the existence of paralogous copies of PaaK1 and PaaK2 through divergent aryl binding pocket structure and corresponding kinetic profile. Incorporation of Ala at position 147 (Ile 151 in PaaK2) and Tyr at position 136 (Phe 140 in PaaK2) in the aryl binding pocket results in a lower K m for PaaK1 with the native substrate and a broader substrate specificity profile, relative to PaaK2.
It is conceivable that either the more (PaaK1) or less (PaaK2) active copy would be more beneficial depending on carbon source availability. Thus, although PaaK2 has been shown to be sufficient for growth on phenylacetic acid as a sole carbon source (2), a second more active copy (PaaK1) may be beneficial during infection. In fact, recent microarray studies of J2315 have shown that both paaK1 and paaK2 are up-regulated 10and 8.9-fold respectively, during growth in synthetic cystic fibrosis media, whereas subsequent pathway genes such as paaF, paaG, paaJ, and paaI are only up-regulated only 2-5-fold (35). Intriguingly, high throughput RNA-seq studies using the closely related pathogenic strain HI2424 found increased transcript levels of the paaK1 ortholog during synthetic cystic fibrosis media growth (36). Because the PaaKs of J2315 are 98% identical to HI2424 with no substitutions in the aryl binding pocket, the HI2424 PaaKs likely display similar kinetic profiles further supporting a biological role for the structural and kinetic differences between PaaK1 and PaaK2 established by this study.