Evolution of Substrate Specificity within a Diverse Family of β/α-Barrel-fold Basic Amino Acid Decarboxylases

Pyridoxal 5′-phosphate (PLP)-dependent basic amino acid decarboxylases from the β/α-barrel-fold class (group IV) exist in most organisms and catalyze the decarboxylation of diverse substrates, essential for polyamine and lysine biosynthesis. Herein we describe the first x-ray structure determination of bacterial biosynthetic arginine decarboxylase (ADC) and carboxynorspermidine decarboxylase (CANSDC) to 2.3- and 2.0-Å resolution, solved as product complexes with agmatine and norspermidine. Despite low overall sequence identity, the monomeric and dimeric structures are similar to other enzymes in the family, with the active sites formed between the β/α-barrel domain of one subunit and the β-barrel of the other. ADC contains both a unique interdomain insertion (4-helical bundle) and a C-terminal extension (3-helical bundle) and it packs as a tetramer in the asymmetric unit with the insertions forming part of the dimer and tetramer interfaces. Analytical ultracentrifugation studies confirmed that the ADC solution structure is a tetramer. Specificity for different basic amino acids appears to arise primarily from changes in the position of, and amino acid replacements in, a helix in the β-barrel domain we refer to as the “specificity helix.” Additionally, in CANSDC a key acidic residue that interacts with the distal amino group of other substrates is replaced by Leu314, which interacts with the aliphatic portion of norspermidine. Neither product, agmatine in ADC nor norspermidine in CANSDC, form a Schiff base to pyridoxal 5′-phosphate, suggesting that the product complexes may promote product release by slowing the back reaction. These studies provide insight into the structural basis for the evolution of novel function within a common structural-fold.

(NSpd), a polyamine analog absent in most eukaryotic cells, is required for vibriobactin biosynthesis, a peptide iron-chelator needed for growth and virulence in Vibrio species (15).
CANSDC provides the only route to spermidine and NSpd biosynthesis in Vibrio (16) and deletion of the gene abolishes spermidine and NSpd intracellular pools leading to defects in biofilm formation (13).
The x-ray structures of several eukaryotic ODCs (17)(18)(19)(20)(21)(22), of PBCVADC (6), Vibrio vulnificus L/ODC (VvL/ODC) (5), and several bacterial DAPDCs have been reported (23,24). Comparison of the available structures identified the helix (termed "specificity helix") ( Fig. 2), which sits at the back of the substrate-binding site at the 2-fold axis of the dimer, as a key specificity element (5,6). Substrates of different size are accommodated by changes in the distance from this helix to PLP, whereas variation in its amino acid composition provides specificity of interaction with the range of substrates.
Two functional classes within the family, however, remain structurally uncharacterized, including bacterial/plant ADC and CANSDC. The bacterial ADCs contain a long insertion between the N-terminal ␤/␣-barrel domain and a C-terminal ␤-barrel domain, both of which are novel within the family, and the function of which was unknown. Furthermore, the structural basis for substrate specificity in the bacterial ADCs cannot be deduced by sequence analysis alone. Determination of the x-ray structure of a bacterial ADC has been of long standing interest as evidenced by the number of crystallization reports that have been published (25)(26)(27)(28). Despite this significant effort, solution of the structure remained elusive until now. CANSDC catalyzes the decarboxylation of carboxynorspermidine and carboxyspermidine, both substrates are considerably larger than the substrates utilized by other enzymes in the family, and structural data are needed to elucidate how the enzyme accommodates these larger substrates.
Herein we report the first x-ray structure determination of the two remaining functional enzyme types in the ␤/␣-barrel-fold basic amino acid decarboxlyase family: bacterial ADC (V. vulnificus; VvADC) and bacterial CANSDC (Campylobacter jejuni; CjCANSDC). These studies round out the structural analysis of the five characterized substrate specificity types (Table 1) in the ␤/␣-barrel decarboxylase-fold. Taken together with previous structural analysis of enzymes from this family, these studies provide a comprehensive example of how enzymes evolve to generate novel function through gene duplication and divergence.

Protein Expression and Purification of CjCANSDC and
VvADC-The gene for CjCANSDC was cloned from C. jejuni 81116 by PCR using genomic DNA as the template followed by cloning into the pET100 expression vector. The original clone of CjCANSDC contained a spontaneous single nucleotide mutation at amino acid residue 184 resulting in Glu (GAA) to Lys (AAA) mutation. This mutation was not observed in any other sequenced strain of CjCANSDC and led to loss of activity. Lys 184 was mutated to Glu by QuikChange TM mutagenesis  (5Ј-gctgttttaaaggtctttgaagagaaatttggtaaatgg-3Ј) and the corrected clone was used for further study including kinetic analysis and protein crystallization. The pET-22b expression vector for V. vulnificus CMCP6 arginine decarboxylase (VvADC) was previously described (5). Native proteins of CjCANSDC and VvADC were overexpressed in Escherichia coli BL21(DE3) and purified using Ni 2ϩ -affinity and gel filtration chromatography as previously described (5). Selenomethionine (SeMet) derivatives of CjCANSDC and VvADC were expressed in E. coli BL21(DE3) using the Met pathway inhibition method as previously described (29). Cells were grown in M9 minimal medium containing 100 g/ml of ampicillin at 37°C until A 600 nm reached 0.6 -0.7. Cells were induced with 200 M isopropyl 1-thio-␤-Dgalactopyranoside at room temperature for 4 h after the addition of SeMet stock solution (Athena Enzyme Systems, Baltimore, MD) and the feedback inhibition amino acids (100 mg/liter each of L-Thr, L-Lys, and L-Phe, and 50 mg/liter each of L-Leu, L-Ile, and L-Val). Harvested cells were lysed and proteins were purified as above in the presence of 20 mM dithiothreitol. ESI-MS analysis of the purified proteins (Protein Technology Center, UT Southwestern Medical Center) revealed that all Met sites in each protein (14 sites in VvADC and 13 sites in CjCANSDC) were replaced with SeMet (data not shown).
Synthesis of Carboxynorspermidine and Carboxyspermidine-Substrates were synthesized previously as described (13).
Steady-state Kinetic Analysis of CjCANSDC-Steady-state kinetic analysis was performed utilizing a coupled enzyme assay that links decarboxylation to the oxidation of NADH through the activity of phosphoenolpyruvate carboxylase and malate dehydrogenase as previously described (13). Data were fitted to the Michaelis-Menten equation to determine k cat and K m using Prism (GraphPad).

Crystallization and X-ray Diffraction Data Collection-
SeMet-substituted crystals of CjCANSDC were co-crystallized with 10 mM NSpd in hanging drops containing 1.5 l of protein (20 mg/ml in 50 mM HEPES, pH 8.0, 300 mM NaCl, 10% glycerol, 20 mM dithiothreitol, 0.03% Brij, and 0.5 mM EDTA) and 1.5 l of reservoir solution (2.75 M AmSO 4 , 0.1 M Bicine, pH 8.5). Rod shape crystals appeared after 4 -5 days at 20°C and grew to 80 m thick ϫ 600 m long by 2 weeks. Crystals were cryoprotected in 2.7 M AmSO 4 , 0.1 M Bicine, pH 8.5, 0.3 M NaCl, and 17% glycerol and flash frozen in liquid nitrogen. SAD data were collected at beamline 19ID of Advanced Photon Source. Crystals of SeMet-substituted CjCANSDC exhibited the symmetry of space group P4 3 2 1 2 with unit cell parameters of a ϭ b ϭ 144.2 Å and c ϭ 79.9 Å. They contained two molecules per asymmetric unit and diffracted isotropically to a d min of 1.9 Å when exposed to synchrotron radiation.
Phase Determination and Structure Refinement-Phases for SeMet-substituted CANSDC were obtained from a single wavelength anomalous dispersion experiment with data to a resolution of 1.9 Å, and 17 selenium sites were located using the program SHELXD (31). Phases were refined with the program MLPHARE (32), resulting in an overall figure-of-merit of 0.24 for data between 44.7 and 1.9 Å. Phases were further improved by density modification and 2-fold non-crystallographic averaging with the program DM (33). An initial model was automatically generated by ARP/wARP (34) and additional residues were manually modeled in Coot (35). Refinement was performed to a resolution of 1.9 Å using the program Refmac (36) with a random 5% of all data set aside for an R free calculation. The structure was refined to a R work of 0.179 and a R free of 0.218 ( Table 2). The final model contains two CjCANSDC monomers in the asymmetric unit; molecule A includes residues 2-129, 140 -382, PLP, and NSpd, and whereas molecule B contains residues 6 -129, 140 -382, and PLP. Both subunits also contain a bound glycerol. The final refined structure contains 385 waters. A Ramachandran plot generated with Molprobity indicated that 97.5% of all protein residues are in the most favored regions with the remaining 2.5% in allowed regions.
Phases for SeMet-substituted VvADC were obtained from a single-wavelength anomalous dispersion experiment with data to a resolution of 2.3 Å, and 52 selenium sites were located using the program SHELXD. Phases were refined with the program MLPHARE, resulting in an overall figure-of-merit of 0.13 for data between 49.7 and 2.3 Å. Phases were further improved by density modification and 4-fold non-crystallographic averaging with the program DM. An initial model containing ϳ91% of all residues was automatically generated by alternating cycles of the programs ARP/wARP. Refinement to a resolution of 2.3 Å was performed as described for CjCANSDC, except that the 2-fold non-crystallographic restraints on the protein main chain were applied between the A to C subunit and B to D subunit. The final R work is 0.178, and the R free is 0.239 ( Table 2). The final model contains four VvADC monomers in the asymmetric unit and a PLP and the product of L-Arg decarboxylation, agmatine, bound to each monomer; including residues 10 -639, in molecule A; residues 11-639, molecule B and D; residues 12-639 in molecule C; and 440 waters. A Ramachandran plot generated with Molprobity indicated that 97.6% of all protein residues are in the most favored regions with the remaining 2.4% in allowed regions. Phasing and model refinement statistics are provided in Table 2.
Molecular Modeling-Structures were displayed using the graphics program PyMol (52). Buried surface area was calculated by "Define secondary structure of proteins" analysis (37). Structures were superimposed by alignment of 4 ␤-strands and 2 ␣-helixes located in the most conserved part of the ␤/␣-barrel domain using LSQKab (38)  Multisequence Alignment-The amino acid sequences of VvADC and CjCANSDC were aligned with other sequences in the family for which x-ray structural data are available, including a representative of each substrate specificity type: TbODC (PDB code 1F3T), PBCVADC (PDB code 2NVA), VvL/ODC (PDB code 2PLK), MjDAPDC (PDB code 1TWI). The sequence alignment was generated from the x-ray structure alignment using the program PROMALS3D (42,43). The secondary structure elements were then annotated using PDBsum (44).
Analytical Ultracentrifugation-Sedimentation velocity experiments were performed using a Beckman Optima XL-1 with An-50 Ti rotor, charcoal-filled dual sector centerpiece, and sapphire windows. The wavelength of the absorbance optics was set at 280. Experiments were performed at 45,000 rpm and at 20°C. Three data set were collected using 3 VvADC concentrations (0.1, 0.4, and 0.8 OD 280 , respectively), 1 OD 280 ϭ 0.93 mg/ml in buffer (50 mM HEPES, pH 7.8, 150 mM NaCl, 1 mM tris(2-carboxyethyl)phosphine), and a total volume per cell of 0.4 ml. Data were analyzed using SEDFIT software to determine the oligomeric structure as described (45). Solvent density (1.0083 g/ml), partial specific volume (0.7324 ml/g), and viscosity (0.010473 g/s/cm) were calculated using the SEDNTERP program.

RESULTS
Purification and Kinetic Analysis of VvADC and CjCANSDC-VvADC and CjCANSDC were expressed and purified from E. coli as described under "Experimental Procedures." Steadystate kinetic analysis of VvADC was described previously (5). Steady-state kinetic analysis for CjCANSDC was performed using both carboxyspermidine (K m ϭ 4.1 Ϯ 0.36 mM and k cat ϭ 0.24 Ϯ 0.0077 s Ϫ1 ) and carboxynorspermidine (CANS) (K m ϭ 2.1 Ϯ 0.13 mM and k cat ϭ 0.58 Ϯ 0.013 s Ϫ1 ) as substrates. CjCANDC has similar activity on both CANS and carboxyspermidine, whereas VvCANSDC has evolved as higher activity on CANS (13). This is consistent with the cellular requirement for both norspermidine and spermidine in Vibrio species (13).
Crystallization and Structural Refinement of VvADC and CjCANSD-VvADC was crystallized in the presence of the substrate L-Arg, and the structure was refined to 2.30-Å resolution ( Table 2). A tetramer was observed in the asymmetric unit and the final refined structure contains 1 molecule of PLP and 1 molecule of the product agmatine per subunit (Figs. [3][4][5]. Noncrystallographic symmetry was constrained along the 2-fold dimers for the main chain during the refinement. Small differences in the subunits are observed in the position of the bound agmatine. CjCANSDC was co-crystallized in the presence of the product NSpd, and the structure was refined to 1.9-Å resolution ( Table 2). A dimer was observed in the asymmetric unit and the final refined structure contained 1 molecule of PLP per subunit, and a molecule of NSpd was found in one of the two active sites ( Figs. 3 and 4). Additionally, a molecule of glycerol used in the crystallization buffer was observed in both subunits.
Overall Fold and Oligomeric Structure of VvADC-The structure of the VvADC monomer is similar to that observed for other members of the ␤/␣-barrel-fold decarboxylase family, and it contains both the ␤/␣-barrel N-terminal domain, and the C-terminal ␤-barrel domain observed in the other structures (Fig. 3A). Structural alignment with T. brucei ODC shows a r.m.s. deviation of 3.1 Å for the monomer. No significant domain rotations are observed. However, in addition to the core conserved domains, VvADC contains three unique insertions: 1) the N-terminal residues (Val 14 -Gln 49 ) form a broken helix, followed by a 2-stranded anti-parallel ␤-sheet, 2) the interdomain extension (residues Lys 366 -Glu 462 ) forms a righthanded superhelix composed of a 4-helix bundle, and 3) the C terminus (Val 589 -Glu 638 ), which extends beyond that observed in T. brucei ODC, forms a 3-helical bundle, with all helices in a single plane (Fig. 3A).
The oligomeric unit required for activity is the dimer (AB or CD dimer in Fig. 4A) and the active site is formed as previously observed for ODC at the subunit and domain boundaries. Packing of the active dimer unit is similar to what has been previously observed for this fold type. The dimer interface of the A/B or C/D subunits sits between the N-terminal ␤/␣-barrel domain of one subunit, the C-terminal ␤-barrel domain of the second subunit, and the ␣17-␣20 helices and loop region composed of residues 555-560 from the ␤-barrel domains of the two subunits. Two additional interfaces are observed in the ADC dimer that have not been previously observed in enzymes from this fold type: 1) the 4-helical bundle interdomain extension interacts (helices ␣14 and ␣15) with the ␤/␣-barrel domain of the opposite subunit (helices ␣9 and ␣11), and the C-termi-nal extension 3-helical bundle (helices ␣22 and ␣23) interacts with the ␤/␣-barrel domain (helices ␣6 and ␣7) of the opposite subunit, but on the opposite face from the 4-helical bundle. These two unique helical bundles form a cradle around the ␤/␣-barrel of the opposite subunit. This results in a large buried surface area for the dimer of 12,300 Å 2 per dimer (6,180 Å 2 per monomer).
Within the asymmetric unit VvADC packs as a tetramer of two active dimer units (Fig. 5). The interface between the two dimers contains residues from all 4 subunits. They are arranged such that the 4 helical bundles from the interdomain extension of monomers B and D (or A and C for the symmetry related pair) are in direct contact with each other over a short region of helix ␣14. They in turn sit between the two ␤/␣-barrel domains from monomers A and C (or B and D) (interacting with helices ␣9-␣11) allowing the formation of an extensive packing interaction. The tetramer is packed as a donut with a large central cavity. The four active sites point into that cavity with the 4-helical bundles packed around the cavity. The additional buried surface area that occurs by formation of the dimer-dimer interface (tetramer) is 7,370 Å 2 (3,680 Å 2 buried surface area per dimer).
Sedimentation velocity analysis of VvADC was performed at three concentrations of enzyme. The calculated molecular mass of the VvADC monomer is 73,558 Da. In solution, the molecular mass determined by the sedimentation velocity analysis was on average 280 Ϯ 27 kDa for data collected at three protein concentrations in the range of 0.1, 0.4, and 0.8 OD (supplemental Fig. 3S). These data are consistent with the tetramer being the dominant solution form for VvADC. A slight amount of dissociation to the dimer (Ͻ10%) was evident from the data, but this species was not present in sufficient amounts for quantitation.
Active Site of VvADC-The VvADC crystals were grown in the presence of the substrate L-Arg, and good density (F o Ϫ F c map) was observed for the decarboxylated product agmatine before refinement (supplemental Fig. 1S). Strong electron density was also observed for PLP in all four subunits.
PLP-binding Site of VvADC-The PLP-binding site is formed between the end of ␤-strands ␤10 and ␤11 and helix ␣12 of the ␤/␣-barrel (Figs. 6A, 7A, and 8), and interactions between PLP and the protein are largely conserved in comparison with other enzymes from the family (Fig. 2 and supplemental Fig. S2). VvADC-Lys 105 (TbODC-Lys 69 ) is the catalytic Lys that forms a Schiff base with PLP, and which has been implicated in accelerating the rates of substrate binding, decarboxylation, and product release in TbODC (46). VvADC-His 255 (TbODC- VvADC Agmatine-binding Site-The substrate-binding site lies between the ␤/␣-barrel domain on one subunit, which encompasses the PLP-binding site, the interdomain region of this same subunit, which contains helix ␣18 (found in the same position as the previously described 3 10 -helix or "specificity element" (5, 6)), and the C-terminal ␤-barrel from the opposite subunit (Figs. 6A, 7A, and 8). Agmatine is observed in the substrate-binding site but it does not form a Schiff base with PLP, and represents a structure of the enzyme product Michaelis complex. The substrate N1 is not within bonding distance of the PLP C4Ј atom and thus the position of the substrate when bound to PLP in the productive reaction complex will minimally require bond rotations in the ligand to bring the N1 atom into bonding distance of the PLP cofactor. Within the ␤/␣barrel subunit the guanidinium group of agmatine (NH1 and NH2) forms a salt bridge interaction with VvADC-Asp 480 in 2 of the 4 active sites (distance 3.5-4.5 Å depending on subunit), and the wall of the binding pocket that runs down the length of the aliphatic portion of the substrate is formed by VvADC-Tyr 551 (Tyr 389 in TbODC). Additional interactions are contributed from across the subunit boundary: VvADC-Asp 512 forms a salt bridge with NE of the substrate in 3 of 4 active sites (3.2-4.1 Å depending on the subunit), N1 of agmatine forms an H-bond with Ser 513 , and VvADC-Asp 514 , whereas just outside of van der Waals range, likely provides additional charge stabilization. All three residues are invariant within the ADC enzymes that contain the insertions (5). An acidic residue at position VvADC-Asp 512 is also conserved in ODC (TbODC-Asp 361 ). The catalytic base VvADC-Cys 511 (TbODC-Cys 360 ) is observed in the down position pointing away from the ligand, but its position preserves a similar potential to play a role in catalysis as observed for TbODC (20). TbODC-Phe 397 , conserved in ODC type enzymes and implicated in decarboxylation (19), is replaced by VvADC-His 559 in the ADCs and TbODC-Tyr 323 is replaced by VvADC-Phe 475 , which sits further away from the active site.
Overall Fold and Oligomeric Structure of CjCANSDC-The monomer of CjCANSDC superimposes with TbODC with a r.m.s. deviation of 2.7 Å, and again no significant domain rotations were observed (Fig. 3B). CjCANSDC is a dimer (Fig. 4B). The CjCANSDC sequence contains additional Cterminal residues (residues Tyr 375 -Asn 382 ) that extend beyond the C-terminal domain of TbODC. This extension forms a short ␣-helix (␣12) that interacts to form part of the dimer interface on the back side of the dimer (opposite face from the active sites) (supplemental Fig. 4S). These helices in turn interact with helix ␣3 of the ␤/␣-barrel of the opposite subunit to form a 4-helix stack, with helices alternating between monomers. This generates an additional interface between the monomeric subunits not observed in ODC or ADC. The buried surface area upon dimerization is 7950 Å 2 (3970 per monomer).
Active Site of CjCANSDC-Strong electron density (F o Ϫ F c map) for the PLP cofactor was observed for both subunits of the CjCANSDC structure and density for the cocrystallized product (NSpd) was present but only in subunit A (supplemental Fig. 1S). The NSpd density is discontinuous but clearly indicates the presence of the ligand. In addition, density for glycerol, a component of the crystallization buffer, was observed in both subunits in a solvent accessible channel adjacent to the distal end (relative to PLP) of the ligand binding site. Several unusual differences in the CjCANSDC PLP-binding site are present. First, TbODC-Arg 277 (VvADC-Arg 346 ) is replaced with CjCANSDC-Glu236, which removes the charge stabilization of the PLPphosphate that has been shown to be important for PLP binding in TbODC (47). Glu 236 is mostly conserved in CANSDC members of the family, although in some species this residue is replaced with Ser (5). CjCANSDC-Glu 236 is involved in a H-bond network with His 341 and Asp 338 . Although Asp 338 is conserved among the CANSDCs, His 341 is not (5). Second, Asp 88 in TbODC is replaced with Thr 60 in CjCANSDC, which retains the potential to form an H-bond to the catalytic Lys 41 in the ligand bound structure, although in this structure the hydroxyl group of Thr 60 is not oriented toward Lys 41   CjCANSDC NSpd-binding Site-NSpd is bound in the typical substrate-binding site at the subunit interface between PLP and helix ␣11, as described above for VvADC (Figs. 6B, 7B, and 8). NSpd does not form a Schiff base with PLP and instead the ␣-amino group is turned away from C4Ј of PLP and forms an ion pair with Glu 236 . Thus as with the VvADC structure, the Michaelis complex is observed. A rotation of the bound C2-C3 would bring the amino group into position to form a Schiff base with PLP. The N3-amino group forms an ion pair with Asp 272 from the specificity helix ␣11, with the bond distances being short and consistent with a strong interaction (OD2 to N3 ϭ 2.6 Å and OD1 to N3 ϭ 3.3 Å). However, unlike all other substrate specificity types within the family, CANSDC does not position an acidic residue across the subunit boundary to form a salt bridge with the ligand (e.g. VvADC-Asp 512 and TbODCA-Asp 361 ). The equivalent residue in CjCANSDC is Leu 314 and this residue is conserved among the CANSDCs (13). Both carboxynorspermidine and carboxyspermidine have a 3-carbon linker between the ␣-amino group and N2, which is 1 carbon shorter than observed for other substrates in the family. The positioning of the ligand within the binding site places the aliphatic portion of NSpd near Leu 314 , providing structural insight into why an aliphatic residue in this position is required instead of the acidic residue typical of other enzymes in the family. The adjacent catalytic base from the same loop, Cys 313 (TbODC-Cys 360 ), is conserved and is observed in the down position.

PLP-binding Site of CjCANSDC-
CjCANSDC Glycerol-binding site-A glycerol molecule from the crystallization solvent was observed bound in a solvent accessible channel adjacent to the specificity helix ␣11 (Figs. 2, 7B, and 9). The hydroxyl residues form H-bond interactions with Asp 272 , Asp 338 , and His 341 . This channel provides the potential for a more extended substrate-binding site than observed for other enzymes in the family. The second substrate of the enzyme, carboxyspermidine, is longer by 1 carbon than NSpd suggesting that the extra chain length could be accommodated by turning into the channel occupied by glycerol in the NSpd CANSDC structure.
Comparison of VvADC and CjCANSDC to Other Members of the Family-Alignment of the VvADC and CjCANSDC structures with prior x-ray structures for enzymes in the family shows that the actives sites are highly conserved, both in structure and composition (Figs. 7 and 8), and indeed key residues in the active site overlay closely on each other with PLP bound in near identical position in all structures, despite the very low overall sequence identity between members of the family. Within this context the structural basis for specificity differences was evaluated to identify differences between the enzymes from the family. The primary structural change is observed in the specificity helix. In the VvADC and CjCANSDC structures the specificity helix is shifted further back in the pocket when compared with TbODC, which allows a larger distance between PLP and the key acidic residue that contacts ligand (Figs. 6 -8). The acidic residues (VvADC-Asp 480 ; helix ␣18) in VvADC and CjCANSDC (CjCANSDC-Asp 272 ; helix ␣11) form salt bridges with the ligand project from the start of the specificity helix. TbODC-Asp 332 also projects from the equivalent helix but originates from the C-terminal end of the helix (Figs. 2 and 8). The change in register allows equivalent salt bond interactions to be formed despite the larger distance between the helix and PLP and accommodates ligands of different size. DAPDC is the only enzyme that catalyzes decarboxylation of a dicarboxylate substrate and the binding site of this enzyme has evolved to stabilize the carboxylate on C5. However, the same strategy is in play, and an arginine residue (Arg 343 ) projects from the specificity helix from the N terminus of the helix to interact with the C5-carboxylate, and Glu 348 from the C terminus of the helix interacts with substrate N2.
Across the domain boundary, differences that provide insight into the substrate specificity spectrum in the family are also observed. VvADC-Asp 512 forms an equivalent interaction with ligand to TbODC-Asp 361 . The position of VvADC-Asp 512 has shifted away from the ligand-binding site allowing additional room to accommodate the larger agmatine. The equivalent residue in CjCANSDC (Leu 314 ) interacts with the aliphatic portion of the NSpd ligand, accommodating the shorter carbon skeleton between N1 and N3 for substrates that bind CANSDC. In VvADC TbODC-Tyr 331 is replaced with VvADC-Trp 482 and NE1 is within H-bond distance of the NH2 of agmatine. In both cases this residue projects from the specificity helix and forms A limited set of residues within the 4-Å shell are displayed. A, comparison of VvADC (monomer A (purple); monomer B (pink)) to TbODC (monomer A (tan); monomer B (gray)). B, comparison of CjCANSDC (monomer A (teal); monomer B (light teal)) to TbODC (monomer A (tan); monomer B (gray)). PLP (yellow for ODC, purple for VvADC, and teal for CjCANSDC), putrescine (Put) (tan), agmatine (Agm) (purple), and NSpd (teal) are shown as ball and stick. an interaction with its symmetry related partner from the other subunit. However, the relationship to the active site structure is reversed. In TbODC-Tyr 331 from the same subunit the ␤/␣barrel domain caps the active site, although it is not within van der Waals contact with putrescine, and in VvADC-Trp 482 is contributed from the opposite subunit. This residue is replaced with Ile 275 in CANSDC, and like TbODC the residue from the same subunit as the ␤/␣-barrel is closest to the active site, but its position on the helix does not allow for direct interaction with ligand. Furthermore, this residue is not conserved within the CANSDCs (5) and is unlikely to play a role in ligand binding.

DISCUSSION
The basic amino acid decarboxylases from the ␤/␣-barrelfold family encompass enzymes with diverse substrate specificity ranging from the smallest ligand L-Orn to the largest, carboxyspermidine. Members of the family are found in all three domains of life, and play essential roles in amino acid metabolism and polyamine biosynthesis (5,13). Completion of the ADC and CANSDC structures allows for the first time a comprehensive structural analysis of how substrate specificity has evolved within the five major specificity types within the family. These data provide significant insight into the structural basis for the observed specificity differences and provide a powerful example of how change of function evolves within the context of a conserved structural domain.
Structural comparison of the major specificity classes in the family (ODC, ADC, DAPDC, and CANSDC) shows that the overall fold and active site structure is strongly conserved, despite low overall sequence identity (in the range of 15%). This allows the active site structural elements controlling specificity to be clearly identified. At the overall structural level ODC, DAPDC, and CANSDC appear to be most similar to each other with all three sharing the same overall monomer organization and dimeric structure. ADC diverges most significantly and has acquired significant additional structural complexity augmenting the basic monomeric structure. Both the interdomain insertion (4-helical bundle) and the extended C terminus (3-helixal bundle) are unique to ADC and these participate in interactions at the dimer interface and in the formation of the tetrameric structure observed in the asymmetric unit. The additional interactions, and significantly greater buried surface area, formed at the dimer interface as the result of these insertions suggests the dimer may be more stable than observed for shorter members of the family that do not contain these features. The tetrameric structure was shown to be the relevant solution species by sedimentation velocity analysis, suggesting that the tetramer may play a unique role in the biology of the bacterial ADCs. All four active sites are oriented within the central donut of the tetramer, providing a potential mechanism for allosteric regulation. These data provide the first example of a tetrameric structure within the fold class. A tetrameric donut-like structure of Mycobacterium tuberculosis DAPDC was recently reported, however, this tetramer displays a small surface area of interaction (900 Å 2 per dimer) that only involves 2 monomers (48) in comparison to the VvADC tetramer that involves all 4 monomers (3,680 Å 2 buried surface area per dimer) in a more extensive interaction. Furthermore, solution studies with mtDAPDC were consistent with a dimeric structure.
The substrate specificity differences in the family are reflected in two key differences in the active site structures. The first is the position and amino acid composition of the specificity helix, which sits at the back of the active site. The distance between the PLP cofactor and the key residues on this helix that interact with substrate serves as a molecular ruler to restrict catalysis to the basic amino acid ligand of the correct size. For ODC, which has the shortest ligand, the helix is positioned closest to PLP, whereas for ADC, CANSDC, and DAPDC the helix has shifted away from PLP allowing accommodation of the larger ligand (Fig. 8). The functional importance of the amino acid residues that project from the helix to interact with the substrate has been shown for both ODC and chlorella virus FIGURE 9. Surface representation of CjCANSDC showing the ligand-binding sites. NSpd (pink), PLP (yellow), and glycerol (orange) are displayed as spheres. The surface was generated against the full protein minus the ligands (PLP and NSpd). The side chain of Lys 41 was also removed before the surface was generated to allow the PLP to be more visible.
ADC, where mutation of TbODC-Asp 332 to Glu increases the K m for L-Orn by 20-fold, and mutation of the equivalent residue in chlorella virus ADC increases the K m by 10-fold, whereas decreasing k cat by 100-fold (7).
The second specificity determinant is contributed from residues across the domain boundary on the loop that also hosts the key catalytic base (e.g. TbODC-Cys 360 ) (20). For ODC, ADC, and DAPDC, an acidic residue is positioned from this loop to interact with the distal amino group of the ligand. In ODC, mutation of TbODC-Asp 332 to Glu or Ala increases the K m for L-Orn by 100 -1000-fold, respectively, demonstrating the functional importance of the interaction (49). However, the CANSDC structure reveals the first example in the family where this residue is replaced by a hydrophobic amino acid (CjCANSDC-Leu 314 ), and this change allows interaction with the aliphatic portion of the substrate because of the change in register due to the shorter carbon backbone found between N1 and N2 of the carboxyspermidine and carboxynorspermidine substrates, in comparison to other enzymes in the family.
Although the overall structure diverges most for the large ADCs, the active site composition is most different for CANSDC. ODC and ADC retain almost all of the same key catalytic residues, whereas for CANSDC, in addition to the substitution of Leu 314 for an acidic residue, the residue interacting with the phosphate of PLP has also diverged (Glu or Ser at position 236 replaces TbODC-Arg 277 , and equivalent in ADC and DAPDC). This is at first a surprising change in active site structure given the demonstration that TbODC-Arg 277 is necessary for high affinity PLP binding (47). However, CjCANSDC-Glu 236 is involved in an extensive H-bonding network that may mitigate the charge replacement and provide an alternative mechanism to stabilize the PLP phosphate moiety.
An open question remains as to how CANSDC accommodates the larger carboxyspermidine substrate. The observation of a solvent accessible channel that binds glycerol in the CjCANSDC-NSpd structure suggests that the longer substrate could be accommodated by extending into this channel. The interaction between N3 and Asp 272 could potentially be preserved, and additionally Asp 338 , which interacts with glycerol, is conserved throughout the CANSDC enzymes (5) and could potentially form another interaction point for N3 of the longer carboxyspermidine substrate.
The ligands bound to the VvADC and CjCANSDC structures are not found in the typical Schiff base configuration with PLP that has been observed in most of the structures of other enzymes in the family. Both appear to be representatives of the Michaelis complex between enzyme and product. The N1 group of both ligands forms an H-bond interaction in the active site: N1 of agmatine interacts with Ser 513 in VvADC, whereas the NSpd N1 of CjCANSDC interacts with Glu 236 . These data suggest that after decarboxylation these interactions may help to facilitate product release and dissociation away from PLP.
As all polyamine biosynthetic pathways have evolved from amino acid metabolism, it is likely that ADC, ODC, and CANSDC evolved from the lysine biosynthetic enzyme DAPDC by gene duplication and functional divergence. This idea is also supported by the fact that DAPDC is found in most bacteria and euryarcheaota, whereas ADC, although widespread in bacteria, is not found in single-membrane bacteria (Firmicutes and Actinobacteria) or archaea (5). An important aspect of the evolution of ADC was the four-helical bundle interdomain insertion, raising the possibility of an ancestral form of ADC lacking the insertion. Within bacteria and archaea, ODC is found mainly in the ␣-proteobacteria, which suggests that the eukaryotic ODC may have originated from the ␣-proteobacterial ancestor of the mitochondrion. Gene duplication not only led to diversification of polyamine biosynthesis, from L-Arg and L-Orn by ADC and ODC but also led to elongation of the pathway, in the case of CANSDC. The viral PBCVADC evolved relatively recently from ODC (6,7), and in vertebrates, antizyme inhibitor, a polyamine regulatory protein, evolved by gene duplication of ODC followed by loss of catalytic activity, which was accompanied by loss of dimer formation (50). This family of decarboxylases exemplifies the evolutionary processes sculpting metabolism by elaboration of a single structural fold through gene duplication and functional divergence to produce biosynthetic diversification, pathway elongation, and the formation of regulatory proteins.