Structural basis of transcriptional regulation by CouR, a repressor of coumarate catabolism, in Rhodopseudomonas palustris

The MarR family transcriptional regulator CouR, from the soil bacterium Rhodopseudomonas palustris CGA009, has recently been shown to negatively regulate a p-coumarate catabolic operon. Unlike most characterized MarR repressors that respond to small metabolites at concentrations in the millimolar range, repression by CouR is alleviated by the 800-Da ligand p-coumaroyl–CoA with high affinity and specificity. Here we report the crystal structures of ligand-free CouR as well as the complex with p-coumaroyl–CoA, each to 2.1-Å resolution, and the 2.85-Å resolution cocrystal structure of CouR bound to an oligonucleotide bearing the cognate DNA operator sequence. In combination with binding experiments that uncover specific residues important for ligand and DNA recognition, these structures provide glimpses of a MarR family repressor in all possible states, providing an understanding of the molecular basis of DNA binding and the conformational alterations that accompany ligand-induced dissociation for activation of the operon.

Proteins of the multiple antibiotic resistance regulator (MarR) family constitute a large group of transcription factors that are widespread in bacteria and archaea. These transcription factors control gene expression to regulate such diverse processes as antibiotic resistance, stress response, aromatic carbon source catabolism, and virulence. An analysis of 19 major families of bacterial transcription factors recently implicated the MarR family as a significant contributor, comprising roughly 8% of the total proteins analyzed. With nearly 130,000 deposited bacterial genomes to date, many MarR family members have been and will be annotated as such; however, less than 1% of these sequences have been fully characterized with respect to physiological function. The founding member of this family was first identified in multidrug resistant Escherichia coli, where the MarR protein regulates an operon encoding the AcrAB-TolC multidrug efflux system, in response to a wide range of antibiotics and phenolic compounds (1)(2)(3)(4)(5)(6)(7).
Most MarR homologs act as gene repressors, and a typical organization of the locus orients the marR gene distally from the genes that are under transcriptional control. The intergenic region, typically at sites that overlap the −10 and −35 promoter elements, contains a 16- to 20-bp palindromic DNA sequence that harbors the binding site for the transcriptional regulator. Binding of a MarR homodimer to this palindromic region represses expression of both the gene that is regulated as well as the repressor itself. Dissociation of the repressor from DNA and activation of gene expression occurs when binding of a small-molecule ligand alters the conformation of the MarR homodimer. As a result of this autoregulated expression, the physiological concentrations of the repressor itself change very slightly, allowing for an exceptionally sensitive response to ligand concentrations.
Structural studies of MarR homologs reveal two sets of N- and C-terminal α-helices that facilitate dimerization and two winged helix-turn-helix motifs (wHTH) that bind to the palindromic DNA duplex via spacing that is established by dimer formation. Each monomer binds to one half-site of the palindromic DNA inverted repeat sequence, and residues along the dimerization interface help to establish the spacing between the two half-sites. Disruption of dimer formation results in a loss in DNA binding affinity and a loss of the corresponding drug resistance phenotype. A clade of MarR members responds to oxidative stress through the oxidation of Cys residues, which results in a change to a conformation that is incompatible with DNA binding. Biochemical and biophysical studies of MarR family members are limited by the low binding affinity for cognate ligands, which often results in confounding data. For example, several small-molecule ligands can bind to MarR factors at multiple sites, and the physiological relevance of these interactions is unclear (8,9).
A recent analysis of the pathway for the catabolism of the aromatic compound p-coumarate by the alphaproteobacterium Rhodopseudomonas palustris CGA009 revealed that genes encoding the enzymes for this pathway are regulated by a MarR-like transcriptional repressor. This soil bacterium utilizes plant-derived phenylpropanoids, including p-coumarate, as carbon sources by first converting them to acyl/aryl-CoA and subsequently to p-hydroxybenzoates, which are degraded aerobically by an oxidative meta-ring cleavage pathway or anaerobically by a reductive aromatic ring degradation pathway (10,11). In R. palustris, the transcription of an operon that encodes enzymes involved in the catabolism of p-coumarate to p-hydroxybenzoate is under the control of a MarR family regulator named CouR. Recombinant CouR was shown to bind to an inverted repeat sequence in the −10 region of the promoter, and DNA binding was disrupted by addition of low micromolar concentrations of p-coumaroyl-CoA (pCC) (Fig. 1, B and C) (1,10,12).
An orthologous pathway for aromatic acid catabolism has been identified in Rhodococcus jostii RHA1, which also utilizes a similar strategy of CoA-thioesterification, followed by β-oxidative deacetylation to generate hydroxybenzoates. However, the constituent enzymes encoded by this operon are different from those of R. palustris. For example, the R. jostii RHA1 cluster encodes an aryl-CoA dehydrogenase and an aryl-CoA ligase (CouL) that can utilize dihydroferulate as a substrate, suggesting that the pathway functions on dihydro-p-hydroxycinnamic acids rather than their unsaturated p-hydroxycinnamate counterparts. Last, although the MarR family repressor that regulates the R. jostii RHA1 operon has been named CouR, the sequence identity of this polypeptide to R. palustris CouR is only 36%, and the DNA-binding site is divergent in sequence and size, as it contains a 5-nucleotide (nt) spacer between the two half-sites rather than the 3-nt spacer in the R. palustris promoter (13).
The R. palustris CouR represents a rare example of a MarR family regulator for which a physiologically relevant ligand has been verified, and the cognate ligand is shown to bind to the receptor with low micromolar affinity. Competition studies indicate that only pCC (and not p-coumarate, benzoyl-CoA, or acetyl-CoA) can induce dissociation of CouR from its promoter DNA. To elucidate the mechanism for recognition of the target promoter and how binding of a large ligand affects protein-DNA interactions to relieve transcriptional repression, we determined the cocrystal structure of CouR in complex with a 23-nt duplex inverted repeat corresponding to the physiological DNA binding site (to 2.85-Å resolution, Fig. 2A) and the corresponding structure of the repressor in the inactive state, bound to pCC (to 2.1-Å resolution; Fig. 3, A and B). The functions of specific amino acids in DNA or ligand recognition were probed by analysis of structure-based site-specific variants using electrophoretic mobility shift assays (EMSAs, for DNA binding) or differential scanning fluorimetry (DSF) assays (for ligand binding). The combined biochemical and structural biological data provide a framework for understanding transcriptional regulation by CouR, which further extends the existing knowledge of MarR family repressors.

Structure determination and overall structure
Cocrystallization efforts with CouR and various synthetic oligonucleotides bearing the inverted repeat DNA recognition sequence yielded several candidates, but most of these crystals did not diffract beyond 3.5-Å resolution. Altering the length and identity of the nucleotides flanking the recognition sequence finally produced crystals that diffracted to 2.85-Å resolution at an insertion device synchrotron beamline. Ligand-free CouR and its complex with synthetic pCC each produced crystals that diffracted to 2.1-Å resolution. Because of ease of reproducibility, structure determination focused on using crystals of the CouR-DNA complex, and crystallographic phases were determined by the single-wavelength anomalous diffraction method from data collected on crystals of selenomethionine-labeled CouR. The apo-CouR and CouR-pCC structures were determined by molecular replacement using the coordinates of the polypeptide from the DNA-bound structure, followed by manual rebuilding of the polypeptide.
The overall structure of the CouR homodimer is triangular in shape with a pseudo 2-fold axis of symmetry. Each monomer consists of six α-helices and two β-strands containing a wHTH motif, where helices α3-α4 and strands β1-β2 define the DNA binding elements (Fig. 1, A and D). Dimerization between the two monomers is mediated by interlocking interactions between helices α1, α5, and α6 from each monomer, resulting in a burial of 4,487-4,497 Å² of solvent-accessible surface area (depending on the conformational state). This interface buries numerous hydrophobic residues that form the structural core, including Leu-51, Leu-55, Val-62, and Phe-66 from helix α1; Val-152, Leu-160, and Leu-164 from helix α5; and Leu-172, Leu-176, and Ile-179 from helix α6. Additional interactions include intramolecular (Asp-65–Arg-159) as well as intermolecular (Glu-46–Lys-149 and Glu-49–Arg-169) salt bridges. Most notably, electron density is sparse for the β-strands that contain the wHTH motif, and the wings show a different conformation for each of the two monomers. There is also weak density in the presumptive binding pocket for one of the monomers, which may reflect a weakly bound molecule of HEPES from the buffer, but this density was too poorly defined to be modeled.
A DALI search of the CouR monomer against the Protein Data Bank reveals a strong conservation of secondary structural elements with other transcriptional regulators. The closest homologs include a putative regulator of unknown function from Pseudomonas aeruginosa (PDB code 2NNN, Z score of 17.5, RMSD of 1.4 Å over 133 aligned Cα atoms), the Streptomyces coelicolor β-ketoadipate regulator PcaV (PDB code 4FHT, Z score of 17.1, RMSD of 2.7 Å over 141 aligned Cα atoms), E. coli MarR (PDB code 3VOE, Z score of 17.1, RMSD of 2.7 Å over 136 aligned Cα atoms), and the multidrug efflux regulator MexR from P. aeruginosa (PDB code 1LNW, Z score of 17.1, RMSD of 1.7 Å over 140 aligned Cα atoms).

Cocrystal structure of CouR with operator DNA
The cocrystal structure of CouR bound to a 23-bp duplex bearing the two half-sites was determined to 2.85-Å resolution from crystals containing two copies of the CouR dimer-DNA duplex complex in the asymmetric unit (ASU). The CouR homodimer is situated on the pseudo-palindromic duplex so that helix α4 from the wHTH motif is positioned into each of the two half-sites along consecutive major grooves that are roughly 34 Å apart. The DNA duplex is roughly B-form but is under-twisted by 1.3°, resulting in a shortening of the end-to-end distance by roughly 3.8 Å, relative to canonical B-DNA. Notably, these deviations arise from a widening of the major grooves, to accommodate binding of the CouR α4 recognition helix, and a corresponding slight narrowing of the minor groove (Fig. 2).
Each CouR monomer contacts one half of the inverted repeat sequence through interactions with the wHTH motif, with the α4 helix positioned in the major groove and strands β1-β2 of the wHTH making contacts with nucleotides on the outer periphery of the recognition sites. As in other structures of MarR family regulators in complex with their cognate operator sequences, the number of contacts between CouR and DNA bases of the inverted repeat is limited. The only base-specific interaction occurs at helix α4 between the carboxamide of Asn-107 and N6 and N7 of A15. Van der Waals contacts with DNA bases of the recognition sequence are mediated via the wedging of Pro-106 into the segment encompassing T6-T7-A8 of one strand and the corresponding bases of the complementary strand. The Pro-106–Asn-107 sequence comprises the first two residues of the α4 helix, and the combination of extensive van der Waals contacts and sequence-specific interactions mediated through these two residues establishes the orientation of the recognition helix within the major groove of each half-site.
The wing portion of the DNA-binding region, spanning residues Val-122 through Leu-135 (from strands β1-β2), mediates additional contacts with nucleotides at the periphery of the recognition sequence. Electron density for both strands as well as for the intervening loop is clear and continuous in the DNA cocrystal structure. Residues within the loop of this region are situated in the minor groove and are involved in extensive nonspecific contacts with the DNA backbone. The phosphate backbone of the DNA duplex is within hydrogen bonding distance of several residues, including the side chains of Arg-123, Arg-125, Arg-130, Arg-131, Ser-132, and His-133, as well as the main chain carbonyl of Ser-128. The side chain of Arg-131 is poised within hydrogen-bonding distance of T3, but this nucleotide is outside of the recognition half-site and is not part of the inverted repeat sequence of the naturally occurring couA operator. Arg-131 interacts with the side chain carboxylate of Asp-129 via a salt bridge, and this Asp-X-Arg pairing, along with a similar interaction with thymine, is observed across various MarR family members (13)(14)(15)(16)(17).
Outside of interactions with the wHTH motif, there are a few additional residues that make contact with the DNA backbone. Specifically, the side chain of Lys-56 (equivalent to Arg-10 in MepR and Tyr-19 in OhrR) from helix α1 is situated directly above the phosphate of T14, which is located at the inner periphery of the inverted repeat. As a result, interactions between equivalent Lys-56 residues from each monomer occur at either side of the minor groove. In the CouR cocrystal structure, Asn-107 makes direct interactions with the A15 nucleobase, and equivalent residues in MepR (Thr-63) and OhrR (Thr-70) are also involved in base pair recognition. Last, Gln-94 (equivalent to Gln-50 in MepR) is located in helix α3 and is part of a framework of interactions with the phosphate of T6, which also engages Arg-123 and His-133 from each of the strands of the wing (for a detailed CouR-DNA interaction map, see Fig. S4).

Figure 1. A, overall structure of the apo CouR homodimer labeled with secondary structural elements (one monomer is colored in plum and the other in sky blue, wHTH = α3-α4-β1-β2). B, the gene organization of the p-coumarate catabolic operon regulated by CouR in R. palustris CGA009. Vertical black bars represent CouR binding motifs (GTTATAnnnTATAAC). C, coumarate catabolism pathway in R. palustris CGA009. The plant-derived starting material p-coumaric acid is converted to its aryl-CoA thioester (pCC) by an ATP-dependent CoA ligase (CouB), followed by hydration of the alkene and retro-aldol cleavage catalyzed by CouA to afford p-hydroxybenzaldehyde, which is readily oxidized to p-hydroxybenzoic acid and shunted into aromatic acid degradation or fatty acid metabolism. D, sequence of oligonucleotide duplex used in crystallization. E, a multisequence alignment of MarR family proteins featuring CouR from R. palustris CGA009 (CouR-RP), CouR from R. jostii RHA1 (CouR-RJ), MepR, HcaR, MexR, and PcaV (3,4,17,19,31).

EMSA
To probe the CouR-DNA contacts in detail, we generated site-specific variants corresponding to a number of residues within the α4 helix, the β1-β2 wing, and two residues located outside of these elements and probed the effects of each mutation on DNA binding using EMSAs (Fig. 2D). Four distinct bands are observed when the couA promoter fragment is presented with CouR, corresponding to four distinct oligomeric states, with the highest mobility band corresponding to free DNA. The presence of two bands observed at high protein concentrations (i.e. 8 pmol lanes for CouR WT and Asn-107→Ala in Fig. 2D) is likely a result of quaternary interactions between CouR-couA promoter complexes under nondenaturing conditions. Mutation of the highly conserved Arg-131 to Ala completely abolished DNA-binding activity, as observed previously in studies of the S. aureus multidrug resistance regulator MepR (17). This result is explained by the crystal structure, as numerous contacts are made between Arg-131 and the minor groove of the DNA binding partner. The Arg-130→Ala variant also shows a significant reduction in DNA binding, reflecting the multiple roles this residue plays as both part of the Asp-X-Arg pairing common across MarR members as well as facilitating interactions with the phosphate backbone. Notably, Arg-130 is not conserved across other orthologs, and its participation in DNA interactions may be unique to CouR (Fig. 1D).
Other notable residues identified in the EMSA as important for binding include Lys-56 and Arg-125, which form electrostatic interactions with the DNA phosphate backbone, and Gln-94, which interacts with the phosphate group of T6 (Fig. 2D and Fig. S4). Alanine mutations at these residues resulted in observable decreases in the formation of higher-molecular-weight complexes. The modest effect of the Gln-94→Ala variant suggests that, although the DNA interaction mediated by this residue is important, it is not absolutely essential. Notably, the Asn-107→Ala variant results in a mobility shift commensurate with WT CouR, suggesting that the base pair interaction is not important for binding.

Cocrystal structure of CouR with the ligand pCC
CouR was observed to bind pCC in a 1:1 stoichiometric ratio without disruption of the homodimeric interface also observed in the ligand-free CouR and CouR-DNA cocrystal structures (Fig. 3, A and B). The ligand-free and pCC-bound structures are very similar, demonstrating that binding of the ligand does not cause any appreciable changes in global or local structure of CouR (Fig. 4A). A major difference between the two structures is reflected in the orientation of the β-strands of the wHTH motif, which are partially disordered in the ligand-free structure but are well-resolved in the pCC-bound structure. Moreover, the strands of each monomer in the pCC-bound structure are nearly superimposable, suggesting that ligand binding may restrict the flexibility of these strands.
Recognition of pCC appears to be driven primarily by hydrophobic interactions between the coumaroyl moiety of pCC and one of two equivalent clefts (one per monomer) in addition to some hydrogen bonding and hydrophobic interactions along the pantetheine group of pCC. The cleft exists adjacent to the dimeric interface surrounded by residues from helices α1, α2, and α5 (e.g. Phe-63, Phe-66, Pro-77, and Phe-80) of one monomer in addition to residues from helix α1 (e.g. Leu-47 and Tyr-53) of its neighboring monomer comprising the homodimer. CouR residue Asn-107, which is involved in base pair recognition of A15 of the operator, also interacts with a backbone amide of pCC.
The location and secondary structural composition of this hydrophobic cleft is analogous to those observed in structures of MarR family members bound to various phenolic acids and aldehyde ligands lacking CoA (3,8,18,19). Sufficient electron density was observed to model four pCCs bound to four CouR monomers in the crystallographic ASU. Additional interactions near the opening of the cleft include hydrophobic contacts between the side chain of Ile-103 and the geminal dimethyl group of pCC as well as hydrogen bonding between the side chain of Thr-76 and the β-phosphate of pCC (Fig. 3A). The electron density corresponding to the adenosine nucleotide of pCC is ambiguous, indicative of disorder because of the minimal interactions of this moiety with the protein and its residence within a solvent pocket in the crystal lattice. Similar disorder has been observed for the adenosine group in other CoA-bound structures (20,21). These cross-subunit interactions with the pCC ligand result in an inward constriction of the second subunit upon binding of the ligand to the first subunit (Fig. 3, A and B, and Fig. 4A).
The overall structures of the CouR monomer in either DNA- or pCC-bound forms do not differ appreciably, with an RMSD of 0.85 Å over 891 atoms. However, superimposition of CouR dimers (RMSD of 2.35 Å over 2,061 atoms) reveals notable changes that occur in the quaternary structure, especially with regard to the wHTH motif that mediates DNA binding. Specifically, a superposition of dimers of the DNA-bound CouR with that of the pCC-bound structure reveals that the wHTH motif is shifted outwards by 5 Å in the former structure, which facilitates a disposition compatible with binding of the homodimer across consecutive major grooves of the operator (Fig. 4). Binding of DNA resulted in an outward shift of helix α4, which forms a portion of the pCC binding cavity, so that the wings of the wHTH motif are also suitably positioned for interactions with the duplex. The collapse of helix α4 necessary to form a viable pCC binding pocket would result in a CouR dimer that is not optimally aligned for binding across consecutive major grooves, suggesting a mechanism for ligand-mediated attenuation of operator binding. There are no relative rotational movements of the two CouR monomers between the DNA- and pCC-bound structures, such as that observed in comparisons of ligand and DNA-bound structures of the MarR-related Rv2887 from Mycobacterium tuberculosis (22).
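The RMSD values quoted above are root-mean-square deviations over paired atoms. A minimal sketch of that calculation follows, assuming the two coordinate sets have already been superimposed and matched atom for atom; the function name and the three-atom coordinates are hypothetical and are not taken from the CouR models.

```python
import math

def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally sized lists of
    (x, y, z) tuples. The sets are assumed to be pre-superimposed."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))

# Hypothetical example: every atom is displaced by 1 Angstrom along x.
model_1 = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
model_2 = [(1.0, 0.0, 0.0), (2.5, 0.0, 0.0), (4.0, 0.0, 0.0)]
example_rmsd = rmsd(model_1, model_2)
print(example_rmsd)
```

Because the value is an average over all paired atoms, a small monomer RMSD alongside a larger dimer RMSD is what one would expect when the subunits move largely as rigid bodies relative to each other.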

DSF
On the basis of the observed crystallographic contacts between CouR and pCC, the residues involved in hydrophobic contacts (Phe-63 and Ile-103) and hydrogen-bonding interactions (Thr-76 and Asn-107) were mutated to Ala. To probe the contributions of each residue to ligand binding, we carried out DSF analyses for each variant. As the protein-ligand complex demonstrated an increased melting temperature (Tm), melting curves were monitored as a function of pCC concentration, enabling the measurement of dissociation constants (KD) between pCC and each CouR variant (23).
Analysis of WT CouR with pCC yielded a KD value of 68 ± 8 μM. The Phe-63→Ala and the Thr-76→Ala variants showed the greatest effect on the KD values for pCC binding. This result is consistent with crystallographic observations, which show that Phe-63 serves as a major contributor to the hydrophobic pocket that houses the ligand, and an Ala substitution at this site would likely compromise the integrity of the pCC binding pocket. Likewise, Thr-76 is within hydrogen bonding distance of pCC, with a distance of 2.7 Å between the Oγ of this residue and the β-phosphate of pCC (Fig. 3).
Notably, the thermal denaturation midpoint (Tm) of the Thr-76→Ala variant decreased by 10°C relative to the WT. Consequently, this variant was tested to ensure that it was properly folded and formed the requisite homodimeric assembly. Analytical size exclusion chromatographic (SEC) analysis performed on CouR WT and the Thr-76→Ala variant resulted in retention times expected for a 40-kDa homodimer (see "Experimental procedures" and Fig. S3). The Ile-103→Ala variant displays a slight binding impairment upon replacement of the sec-butyl group with a methyl group in the position found to associate with the geminal dimethyl group of pCC. No appreciable binding impairment was observed for the Asn-107→Ala mutant, suggesting that proximity of Asn-107 to the amide carbonyl of pCC may not be critical for tight binding of the pCC ligand (Fig. 3).

Discussion
Detailed biochemical studies of MarR family proteins are often hampered because of a lack of details regarding the physiological effector. Even though surrogate small molecules may function at the protein level, the relevance of binding by such effectors is sometimes not clear. Here we present a detailed characterization of CouR, a repressor of a p-coumarate catabolic pathway for which genetic and microbiological analyses have been carried out in detail (1, 12). We complement the earlier studies using biochemical and structural biological approaches to elucidate the details of a MarR member visualized in three different states: without any bound ligand, bound to the cognate operator element, and bound to the physiological effector molecule. Unlike the effectors examined in other studies of MarR family members, pCC, the effector for CouR, is not metabolically ubiquitous, and the extensive contacts made throughout the entirety of this structurally unique ligand afford an unbiased delineation of the structural details with a bound ligand. To further dissect the mechanism of ligand-induced dissociation, additional studies will be necessary to determine whether pCC binds directly to the CouR-DNA complex or to free CouR.
A prior effort also characterized a CouR-like pathway encoded in the soil bacterium R. jostii RHA1 (CouR-RJ) (13). That system is distinct in that not all of the genes proposed in the catabolic pathway are within the same operon. Based on the sequences of the different promoters, it was proposed that a single repressor may regulate transcription even though the promoters are divergently transcribed. Notably, the promoter sequence identified in R. jostii RHA1 (cATTGAnnnnnTCAATg) is entirely distinct from the R. palustris CGA009 CouR (CouR-RP)-responsive promoter (GTTATAnnnTATAAC), and the two MarR-type regulators share only 36% sequence identity. The authors provide a molecular basis for DNA binding attenuation that invokes the sequestration of two Arg residues (numbered 36 and 38 in CouR-RJ, Fig. 1D) by pCC; however, these residues are not present in CouR-RP, and similar contacts were not observed in the CouR structures presented here. Moreover, superimposing the CouR-RP DNA-bound structure with the orthologous CouR-RJ pCC-bound structure reveals that the separations of helices α4 are nearly equidistant, suggesting compatibility with major groove binding and a unique mechanism of repression (Fig. S5).
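The difference between the two operators can be checked directly from the consensus motifs quoted above. The sketch below is illustrative only and is not part of the original analysis; the helper names are hypothetical. It assumes each motif is written as half-site, lowercase n spacer, half-site, then reports the spacer length and whether the half-sites are reverse complements of each other.

```python
# Complement table covering both upper- and lowercase bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def split_operator(motif):
    """Split 'half-site + n...n + half-site' into (left, spacer_length, right)."""
    left = motif[:motif.index("n")]
    right = motif[motif.rindex("n") + 1:]
    spacer = len(motif) - len(left) - len(right)
    return left, spacer, right

def describe(motif):
    """Return (spacer length, True if the half-sites form an inverted repeat)."""
    left, spacer, right = split_operator(motif)
    is_inverted_repeat = reverse_complement(left).upper() == right.upper()
    return spacer, is_inverted_repeat

# Motifs as quoted in the text for the two CouR-responsive promoters.
for name, motif in [("CouR-RP", "GTTATAnnnTATAAC"),
                    ("CouR-RJ", "cATTGAnnnnnTCAATg")]:
    spacer, inverted = describe(motif)
    print(name, "spacer:", spacer, "nt; inverted repeat:", inverted)
```

Both motifs are perfect inverted repeats at the level of the quoted consensus, so the two operators differ in half-site sequence and spacer length (3 nt versus 5 nt) rather than in symmetry.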
The crystal structure of CouR bound to the operator supports the idea of an indirect sequence readout, first suggested by Dolan et al. (15) in the context of the SlyA-DNA structure. This is a result of primarily nonbase-specific contacts mediating DNA recognition and is corroborated by existing MarR family/DNA cocrystal structures in the Protein Data Bank (14-17). The unexpected finding that many of the protein-DNA contacts are actually nonbase-specific lends credence to the notion of an indirect sequence readout and suggests that newer in silico techniques may be necessary to predict three-dimensional topologies for correct assignment of protein-DNA binding partners. Indeed, computational techniques involving comparative and machine learning strategies have been utilized to predict protein-DNA binding interactions (24). Predictive methods will hopefully benefit from the experimental structures in this work to understand similar MarR family regulators for which experimental data are lacking.

Cloning, expression, and purification of CouR
The CouR gene was amplified by PCR from purified R. palustris genomic DNA using the following set of primers: 5′-GCACAGGATCCGTGACCTCGTCGAACAGGATC-3′ (forward) and 5′-TAACACTCGAGTCAGAACTCGCGGGCGATGG-3′ (reverse). The PCR product was digested with BamHI and XhoI and ligated into a similarly digested pET28-MBP expression vector using T4 DNA ligase (New England Biolabs).
The presence of the CouR insert was tested by restriction digestion analysis and verified by dideoxy sequencing (ACGT Inc.). The resulting construct encodes a maltose-binding protein (MBP) fusion followed by a His6 affinity tag preceding the CouR reading frame.
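As a quick consistency check on the cloning strategy, the primers can be scanned for the recognition sites of the two enzymes used in the digest. This sketch is illustrative: the recognition sequences GGATCC (BamHI) and CTCGAG (XhoI) are standard, the helper name is hypothetical, and the primer strings are written here as continuous sequences on the assumption that the space and hyphen in the printed reverse primer are line-break artifacts.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"BamHI": "GGATCC", "XhoI": "CTCGAG"}

# Primer sequences as given above, written 5' to 3' without breaks.
FORWARD = "GCACAGGATCCGTGACCTCGTCGAACAGGATC"
REVERSE = "TAACACTCGAGTCAGAACTCGCGGGCGATGG"

def find_sites(primer):
    """Map each enzyme whose site occurs in the primer to the 0-based
    index of the first occurrence."""
    return {enzyme: primer.find(site)
            for enzyme, site in SITES.items() if site in primer}

forward_sites = find_sites(FORWARD)
reverse_sites = find_sites(REVERSE)
print("forward:", forward_sites)
print("reverse:", reverse_sites)
```

Each primer carries exactly one of the two sites near its 5′ end, which is consistent with directional insertion into a vector cut with the same pair of enzymes.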

The recombinant plasmid described above was used to transform E. coli (Rosetta) for overexpression of MBP-CouR. E. coli cultures were grown with shaking at 250 rpm in LB medium supplemented with kanamycin (50 μg/ml) and chloramphenicol (25 μg/ml) until an A600 of 0.5-0.6 was reached. The cultures were induced with 0.5 mM isopropyl 1-thio-β-D-galactopyranoside after cooling on ice for 20 min and grown for an additional 16 h at 18°C. Cells were harvested by centrifugation at 3,500 rpm and resuspended in buffer A (500 mM NaCl, 25 mM Tris-HCl (pH 8.0), and 10% glycerol) before lysing with a French press at 8,000-10,000 pounds/square inch for 4 cycles. The lysate was cleared by centrifugation at 14,000 rpm for 1 h at 4°C, and the supernatant was loaded on a 5-ml HisTrap nickel-nitrilotriacetic acid affinity column (GE Healthcare) equilibrated with buffer B (1 M NaCl, 25 mM Tris-HCl (pH 8.0), and 30 mM imidazole). The column was washed with 40 ml of buffer B before eluting with a linear gradient from 0-100% buffer C (1 M NaCl, 25 mM Tris-HCl (pH 8.0), and 250 mM imidazole) over 20 min at 2 ml/min. CouR was then subjected to tobacco etch virus protease proteolysis at 4°C to remove the MBP tag while being dialyzed into 250 mM NaCl, 20 mM Tris (pH 7.5), and 1 mM DTT. The reaction ran to near-completion after 24 h (as monitored by SDS-PAGE) before dialyzing into 250 mM NaCl and 20 mM Tris-HCl (pH 7.5) to remove DTT before subtractive nickel purification. Reaction contents were loaded onto a nickel-nitrilotriacetic acid column equilibrated with buffer A, followed by a 30-ml wash with buffer A. Proteins were eluted in 4-ml increments using the following stepwise gradient of increasing buffer B: 5%, 10%, 20%, 30%, and 100%. This was followed by additional stepwise elution steps of increasing buffer C: 50% and 100%.
Fractions containing CouR were dialyzed into 100 mM NaCl, 20 mM Tris-HCl (pH 7.5) at 4°C before loading onto a 5-ml HiTrap SP HP column (GE Healthcare) equilibrated with buffer D (100 mM NaCl, 50 mM N,N-bis(2-hydroxyethyl)glycine (bicine) (pH 7.5)). The column was washed with 20 ml of buffer D before eluting with a linear gradient of increasing buffer E (1 M NaCl and 50 mM bicine (pH 7.5)) for 35 min at a flow rate of 1.5 ml/min. CouR-containing fractions indicated ≥95% purity by SDS-PAGE analysis (Fig. S1).

DSF binding assay
CouR WT and mutant proteins were purified as described above followed by gel filtration on a 120-ml Superdex 75 column (GE Healthcare) in 50 mM KCl, 0.5 mM 2-mercaptoethanol, and 20 mM HEPES (pH 7.0). The DSF assay was performed on a StepOnePlus RT-PCR instrument (Applied Biosystems) using a 96-well plate format. WT and mutant proteins were subjected to increasing concentrations of pCC from 0-10 mM while maintaining a constant protein concentration of 68 μM and 6× SYPRO orange dye (prepared from a 5,000× stock in DMSO). Samples were allowed to equilibrate for 30 min prior to initiating the melt curve. The melt curve program initiated with a 2-min hold step at 25°C followed by a ramp up to 99°C at 1.7°C/min (step and hold) and a final hold for 2 min at 99°C. First derivatives of the melting curves were used to determine the Tm for each sample well. Plots were fit to a dose-response curve using OriginPro 2015, and the KD was taken to be equal to the substrate concentration at which melting temperature achieved 50% of the total change.
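The two analysis steps described above (Tm from the first derivative of each melt curve, then KD as the ligand concentration giving half of the total Tm change) can be sketched as follows. This is not the OriginPro dose-response fit used in the study: it substitutes a simple finite-difference derivative and a log-linear interpolation, and every number below is synthetic and hypothetical.

```python
import math

def melting_temperature(temps, fluorescence):
    """Return the midpoint of the interval with the steepest signal rise,
    i.e. the maximum of the finite-difference first derivative dF/dT."""
    best_slope = float("-inf")
    best_temp = None
    for i in range(1, len(temps)):
        slope = (fluorescence[i] - fluorescence[i - 1]) / (temps[i] - temps[i - 1])
        if slope > best_slope:
            best_slope = slope
            best_temp = 0.5 * (temps[i] + temps[i - 1])
    return best_temp

def half_max_concentration(concs, tms, tm_free):
    """Ligand concentration at which Tm reaches 50% of its total change.

    concs: ascending, strictly positive ligand concentrations.
    tms: Tm at each concentration (assumed to rise with concentration).
    tm_free: Tm of the protein without ligand.
    The total change is taken as tms[-1] - tm_free, which assumes the last
    point is near saturation. Interpolation is linear in log10(conc).
    """
    midpoint = tm_free + 0.5 * (tms[-1] - tm_free)
    for i in range(1, len(concs)):
        if tms[i - 1] <= midpoint < tms[i]:
            frac = (midpoint - tms[i - 1]) / (tms[i] - tms[i - 1])
            log_lo, log_hi = math.log10(concs[i - 1]), math.log10(concs[i])
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    raise ValueError("midpoint is not bracketed by the titration")

# Synthetic melt curve: a sigmoid centred at 55.5 degrees C (hypothetical).
temps = list(range(25, 100))
signal = [1.0 / (1.0 + math.exp(-(t - 55.5) / 2.0)) for t in temps]
tm_example = melting_temperature(temps, signal)

# Synthetic titration from a hyperbolic model with hypothetical parameters
# (ligand-free Tm, maximal Tm shift, half-saturation in micromolar).
TM_FREE, DELTA_TM, K_HALF = 50.0, 8.0, 50.0
concs_um = [1, 3, 10, 30, 100, 300, 1000, 3000, 10000]
tms = [TM_FREE + DELTA_TM * c / (c + K_HALF) for c in concs_um]
kd_estimate = half_max_concentration(concs_um, tms, TM_FREE)
print(tm_example, kd_estimate)
```

The interpolated value depends on how close the highest concentration is to saturation and on the spacing of the titration points, which is one reason a fitted dose-response curve is generally preferable to interpolation.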

EMSA
EMSAs were performed as described previously (1). The same DNA probe, spanning the −300 to +17 bp relative to the couA start codon, was PCR-amplified, and 0.3 pmol of DNA probe was mixed with various amounts (2, 4, or 8 pmol) of CouR variants in a 15-μl reaction mixture (binding buffer: 20 mM Tris (pH 7.5), 50 mM KCl, 1 mM DTT, and 8% glycerol) and incubated for 25 min at room temperature. The samples were loaded on a 5% nondenaturing acrylamide Tris/glycine-EDTA gel and electrophoresed in Tris/glycine-EDTA buffer (10 mM Tris (pH 8.0), 380 mM glycine, and 1 mM EDTA) at 4°C. The gel was soaked in 10,000-fold-diluted SYBR Green I nucleic acid stain (Lonza, Walkersville, MD), and DNA was visualized under UV light.
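For orientation, the amounts quoted above can be converted to concentrations and protein:DNA molar ratios. The sketch below assumes a reaction volume of 15 μl; the function name is hypothetical, and the text does not state whether the CouR amounts refer to monomer or dimer, so the ratios are given per pmol as quoted.

```python
def nanomolar(pmol, volume_ul):
    """Convert an amount in pmol and a volume in microliters to nM.
    (pmol per microliter equals micromolar; x1000 gives nanomolar.)"""
    return pmol / volume_ul * 1000.0

VOLUME_UL = 15.0            # reaction volume, assumed to be 15 microliters
DNA_PMOL = 0.3              # probe amount per reaction
PROTEIN_PMOL = [2, 4, 8]    # CouR amounts per reaction

dna_nm = nanomolar(DNA_PMOL, VOLUME_UL)
print(f"DNA probe: {dna_nm:.1f} nM")
for amount in PROTEIN_PMOL:
    protein_nm = nanomolar(amount, VOLUME_UL)
    ratio = amount / DNA_PMOL
    print(f"CouR {amount} pmol: {protein_nm:.1f} nM, {ratio:.1f}-fold over DNA")
```

Under these assumptions the probe is at about 20 nM and CouR spans roughly 130 to 530 nM, so the protein is in molar excess over DNA in every lane.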

Analytical SEC
Analytical SEC was performed using a 40-ml Superdex 200 10/300 GL column (GE Healthcare) equilibrated with 0.1 M NaCl and 20 mM Tris (pH 7.0). A similar running buffer was used to elute protein standards prepared from commercially available lyophilized powders (Sigma-Aldrich) at 1 ml/min. CouR variants were eluted similarly (Fig. S3).

Crystallization and structure solution of the CouR-operator complex
Single-stranded DNA oligomers were ordered from Integrated DNA Technologies and were made up to 10 mM by dissolving in 20 mM Tris (pH 7.5) and 50 mM NaCl. The complementary single-stranded DNA solutions were mixed 1:1 and annealed in a thermocycler by heating at 95°C for 5 min, followed by cooling to 25°C at a rate of 1°C per minute. A series of dsDNA sequences ranging from 16 to 29 bp and containing either blunt ends or single-nucleotide overhangs were screened for cocrystallization with CouR before arriving at the optimally diffracting 23-nt sequence (23-mer) used in the final model. Crystals of CouR bound to the 23-mer were obtained by mixing 15 mg/ml CouR with 1.1 stoichiometric equivalents of the 23-mer and incubating on ice for 20 min. This solution was mixed with an equal volume of reservoir solution containing 0.1 M Tris (pH 8.5), 0.1 M ammonium phosphate, and PEG 6000 to generate 2-μl hanging drops. Thin plate-shaped crystals grew at 16°C to maximum size (about 100 × 100 μm) in less than a week.
Crystals were cryoprotected in crystallization buffer supplemented with 25% glycerol and vitrified by direct immersion into liquid nitrogen prior to data collection. Structure determination was carried out using single-wavelength anomalous dispersion data collected from crystals of selenomethionine-derivatized CouR bound to DNA using Phenix AutoSol with a Bayesian estimate of the map quality (25). Data were collected at the Advanced Photon Source, Argonne National Laboratory using the Life-Science Collaborative Access Team 21-ID-F, 21-ID-G, and 21-ID-D beamlines. After determining the phases and initial model from Phenix AutoSol, each structure was rebuilt using Phenix AutoBuild with multiple rounds of manual intervention (26). This new model was further refined using CCP4 refmac5 in combination with additional manual rebuilding in COOT (27,28).
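The duplex annealing step described in this section (95°C for 5 min, then cooling to 25°C at 1°C per minute) can be written out as a simple program to check its total run time. This is an illustrative sketch only. The step representation and function names are hypothetical and do not correspond to any particular thermocycler's software.

```python
def annealing_program(start_c=95.0, end_c=25.0, hold_min=5.0, rate_c_per_min=1.0):
    """Return the annealing steps as (kind, target temperature in C, minutes)."""
    ramp_min = (start_c - end_c) / rate_c_per_min  # time needed for the cooling ramp
    return [
        ("hold", start_c, hold_min),  # denature the single strands
        ("ramp", end_c, ramp_min),    # slow cooling so the strands pair
    ]


def total_minutes(program):
    """Sum the duration of every step in the program."""
    return sum(minutes for _kind, _temp, minutes in program)


program = annealing_program()
run_time = total_minutes(program)
```

With the stated values the ramp takes 70 min, so the full program runs for 75 min.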

Crystallization and structure solution of the CouR-pCC complex
CouR was concentrated to 24 mg/ml, combined with 3 mM pCC, and crystallized at 9°C in similar 2-μl hanging drops containing a reservoir solution of 0.17 M Li2SO4, 23.5% PEG 4000, 1 mM DTT, 0.085 M Tris-HCl (pH 8.5), and 15% glycerol. Rod-shaped crystals grew to maximum size in less than a week to about 200 × 50 μm at the largest face. Prior to data collection, crystals were directly flash-frozen in liquid nitrogen. Data were collected at the Advanced Photon Source, Argonne National Laboratory using the Life-Science Collaborative Access Team 21-ID-G beamline. The structure was solved by molecular replacement using the protein coordinates from the CouR-couR operator structure. Molecular replacement-derived phases were used to build an initial model in Phenix AutoBuild, followed by similar refinement procedures as described for the CouR-couR operator complex. Prior to fitting pCC into electron density difference maps, restraint parameters and geometry optimizations were produced by Phenix eLBOW (29). Water molecules were incorporated in the CouR-pCC structure using Phenix Refine (30).

Crystallization and structure solution of apo CouR
CouR was concentrated to 15 mg/ml and crystallized at 9°C in 1-μl sitting drops by mixing 1:1 (v/v) protein solution and reservoir solution containing 60% (v/v) Tacsimate (pH 7.0). Diamond-shaped crystals grew to maximum size in less than a week. The structure was solved by molecular replacement in a manner similar to that described for the CouR-pCC cocrystal structure. Additional model building and refinement were carried out as described above.