Conformational Changes Relevant to Channel Activity and Folding within the first Nucleotide Binding Domain of the Cystic Fibrosis Transmembrane Conductance Regulator*

Background: The CFTR chloride channel undergoes conformational changes during its gating cycle. Results: H620Q mutation associated with increased channel Po, and the corrector/potentiator CFFT-001 both lead to similar conformational shifts in NBD1. Conclusion: There is an intrinsic conformational equilibrium within NBD1 that is correlated with channel activity. Significance: Conformational fluctuations within NBD1 are fundamental to CFTR regulation. Deletion of Phe-508 (F508del) in the first nucleotide binding domain (NBD1) of the cystic fibrosis transmembrane conductance regulator (CFTR) leads to defects in folding and channel gating. NMR data on human F508del NBD1 indicate that an H620Q mutant, shown to increase channel open probability, and the dual corrector/potentiator CFFT-001 similarly disrupt interactions between β-strands S3, S9, and S10 and the C-terminal helices H8 and H9, shifting a preexisting conformational equilibrium from helix to coil. CFFT-001 appears to interact with β-strands S3/S9/S10, consistent with docking simulations. Decreases in Tm from differential scanning calorimetry with H620Q or CFFT-001 suggest direct compound binding to a less thermostable state of NBD1. We hypothesize that, in full-length CFTR, shifting the conformational equilibrium to reduce H8/H9 interactions with the uniquely conserved strands S9/S10 facilitates release of the regulatory region from the NBD dimerization interface to promote dimerization and thereby increase channel open probability. These studies enabled by our NMR assignments for F508del NBD1 provide a window into the conformational fluctuations within CFTR that may regulate function and contribute to folding energetics.

Cystic fibrosis (CF) 4 is a genetic disorder caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene (1)(2)(3). CFTR encodes a 1480-residue integral membrane protein that functions as a chloride channel. As a member of the ABC transporter superfamily of proteins, CFTR is organized into two repeated units, each comprised of a membrane-spanning domain (MSD1 and 2) and a cytosolic nucleotide binding domain (NBD1 and -2) (supplemental Fig. S1). A large intrinsically disordered regulatory (R) region, unique to CFTR, connects NBD1 to MSD2. The first ϳ30 residues of the R region found immediately C-terminal to NBD1 is called the regulatory extension (RE). Intracellular domains are formed by cytosolic extensions of the transmembrane helices and bridge the MSDs and the NBDs. Crystal structures of NBD1 (4), including the F508del variant, show a well conserved conformation of NBD1 that consists of three subdomains: the ␣-subdomain (the site of Phe-508), the ␤-subdomain (containing three anti-parallel ␤-strands), and a central ATP-binding core (Fig. 1B). A 34-residue disordered segment, called the regulatory insertion (RI), also unique to CFTR, is found between the first two ␤-strands of NBD1 (supplemental Fig. S1). The ATPbinding core is dominated by a central ␤-sheet (comprising strands S6, S7, S8, S3, S9, and S10). Two short helices, H8 and H9, at the C terminus of this NBD1 construct are arguably not part of the core NBD1, but can fold onto strands S3, S9, and S10 of this sheet (see Fig. 1B). Opening and closing of the CFTR channel involves phosphorylation of the R region and the RI by protein kinase A (5) (and potentially protein kinase C (6) and AMP kinase (7)), in combination with ATP binding and hydro-lysis at the interface between NBD1 and NBD2. During the gating cycle, ATP molecules bind at the interface of NBD1 and NBD2, as shown in bacterial NBD homodimers (8). Conformational changes in the NBD1/NBD2 heterodimer are thought to be influenced by phosphorylation of the R region and RI (9,10). It has been proposed that these ATP-and dimerization-induced conformational changes are relayed to the MSDs via the intracellular domains leading to pore opening and closing (10 -12).
The most common and severe CF-causing mutation in the CFTR gene is deletion of Phe-508 (F508del) in NBD1 (13). The F508del mutation further exacerbates the normally inefficient folding and processing of CFTR, thereby targeting it to the endoplasmic reticulum-associated protein degradation pathway (14). Low temperature (15), glycerol addition (16), introduction of revertant (17) and solubilizing (18) mutations, and removal of the RI (19) can reduce this misprocessing and degradation, such that a small population of F508del CFTR molecules can escape the quality control mechanisms of the cell and traffic to the plasma membrane. However, once at the plasma membrane, these F508del CFTR molecules show a decreased open probability (20) and instability (21), eventually leading to endocytosis and degradation.
As potential therapies for CF, small-molecule compounds have been sought to address the underlying defects of F508del and G551D, a mutation that impairs gating. Pharmacological agents called "corrector" compounds aim to overcome processing/folding defects, allowing mutated CFTR to escape cellular degradation and increasing the number of CFTR molecules at the plasma membrane. Compounds may achieve this by enhancing folding, increasing stability, mimicking or affecting the levels of a molecular chaperone, or interrupting an interaction with the degradation machinery of the cell. A corrector compound, VX-809, has been shown to restore F508del CFTR processing in primary human bronchial epithelial cells isolated from patients homozygous for F508del (22,23). "Potentiator" compounds increase the open probability of CFTR channels already transported to the plasma membrane. The compound VX-770 is a potentiator of G551D (24) and F508del-CFTR (25), as well as several other gating mutations (26,27).
There has been success in the identification and discovery of correctors, potentiators, and dual corrector/potentiators (28), but their mechanisms of action on a molecular level including potential direct interactions with CFTR have not been characterized. Even in the cases of compounds for which physiological and biochemical data suggest a direct interaction with CFTR, the domain of the CFTR molecule to which particular compounds bind is not known. Such structural level data would not only give information about how a compound functions but also facilitate the design of improved therapeutic compounds. Details on the conformational effects of binding of compounds can also provide insights into the dynamic properties and the energetic landscape of the CFTR protein, known to be critical for channel function and folding (29,30) A limitation, however, for these types of studies lies in the relatively poor solubility of most known CFTR modulator compounds in aqueous solution.
To probe the conformational effects of mutations and binding of small molecule modulators, we have applied a range of biophysical approaches to study H620Q and helix H9 variants of NBD1 and the interaction of the most soluble compound of a series of dual corrector/potentiator compounds provided by the Cystic Fibrosis Foundation, N-cyclohexyl-4-(6-methyl-3pyridinyl) pyrimidine-2-amine, referred to here as CFFT-001. NMR backbone resonance assignments (82%) carried out on a human F508del NBD1 lacking the RI and ending at residue 646 (F508del NBD1 ⌬RI⌬RE) enable us to define conformational changes due to the H620Q mutation. We also present electrophysiology data demonstrating the dual corrector/potentiator activity of CFFT-001 for F508del CFTR, consistent with it interacting directly with CFTR, possibly at NBD1, the domain harboring the F508del mutation. Addition of CFFT-001 to F508del NBD1 ⌬RI⌬RE results in NBD1 conformational changes overlapping with those observed for the H620Q mutant protein that has higher open channel probability. Differential scanning calorimetry (DSC) data show a reduction in NBD1 thermal melting temperature for H620Q or in the presence of CFFT-001, demonstrating direct binding of the compound to a less thermostable conformation. Docking of the compound onto higher energy conformations of NBD1 accessed in replica exchange simulations describes possible compound binding modes consistent with the NMR and DSC data. Together, these orthogonal methods provide evidence for mutation-induced or compound-induced conformational changes within NBD1 and generate insights into the motional and energetic properties of CFTR relevant for channel activity and folding.

Protein Expression and Purification (NMR and DSC Studies)-
Human NBD1 (387-646, ⌬405-436) constructs with or without Phe-508 and containing the H620Q mutation or deletion of helix H9(636 -646) were expressed as His 6 -SUMO fusions at 16°C in BL21(DE3) Codon Plus cells grown in minimal media with [ 15 N]NH 4 Cl and/or [ 13 C]glucose, for NMR studies, or LB for DSC studies, and purified as described previously (31,32). Purified proteins were stored at 4°C in buffer containing 12.5% glycerol before being exchanged into 50 mM sodium phosphate, pH 7.5, 150 mM NaCl, 5 mM ATP/MgCl 2 , 2% (v/v) glycerol, 2 mM DTT, 10% D 2 O with 0.05% sodium azide and 0.5% benzamidine for NMR or into 150 mM NaCl, 20 mM HEPES, pH 7.5, 10% glycerol, 10% ethylene glycol, 1 mM Tris(2-carboxymethyl)phosphine, 2 mM ATP, and 3 mM MgCl 2 for DSC. The NMR buffer has been optimized for both stability and spectral line width, precluding the use of higher concentrations of glycerol and ethylene glycol used in the DSC buffer, which has been optimized for stability alone.
NMR Spectroscopy-All NMR experiments were carried out at 20°C on a Varian 500 or 600 MHz spectrometer equipped with pulsed field gradients and either a triple resonance room temperature or cryo-probe, respectively. Data were processed and analyzed using NMRDraw (33), NMRView (34), and Sparky (35). Assignments were obtained using data from HNCO, HNCA, and HN(CO)CA experiments (36 -38) as well as WT assignments. 5 Combined N and H chemical shift changes in Hz were calculated using Equation 1.
Origin of CFFT-001-The compound N-cyclohexyl-4-(6methyl-3-pyridinyl) pyrimidine-2-amine (CFFT-001, supplemental Fig. S2) was synthesized originally in the CFTR modulator drug discovery program of EPIX Pharmaceuticals, Inc. A pharmacophore model was generated based on the previously identified corrector classes 1, 2, 3, and 5 (represented by corr-1c, corr-2b, corr-3c, and corr-5c) (39). The pharmacophore model generated with Catalyst software (Accelrys, San Diego, CA) also included a shape constraint to improve the spatial overlap between screened compounds and reference correctors. An EPIX in-house data base of ϳ4 million unique commercially available compounds was filtered using a set of one-dimensional and two-dimensional chemical property descriptors derived from the selected compound classes, resulting in a focused library of ϳ740,000 compounds. Screened compounds matching the pharmacophore model were subsequently filtered by the Catalyst fitness score. The score cutoff was selected by enrichment analysis and cutoff application reduced the library size to 2300 compounds. The remaining compounds were subsequently clustered by molecular similarity, and cluster representatives were selected based on the Catalyst fitness score. The structure activity relationship trends, which may be concluded from the reference compound data but could not be incorporated into the pharmacophore hypothesis for technical reasons, were used to prioritize compounds, as well as chemical novelty. 85 compounds were selected, purchased, and sent for biological testing, one of which was found to be weakly active in an FRT (Fischer rat thyroid) cell Ussing chamber assay. The synthesis of non-commercial analogs led to the discovery of CFFT-001 (68), an early member of the EPIX chemical series of dual-acting corrector/potentiator compounds.
Ussing Chamber Analysis-Base line-corrected short circuit current traces were analyzed from Ussing chamber recordings of transepithelial currents across monolayers of FRT cells that stably express F508del. Although all compound incubation was performed at 37°C, the recordings were carried out at 27°C, as described previously (40). This temperature was chosen to decrease the run-down of corrector effect by reducing internalization of CFTR. Cells were grown on Snapwell filters (Corning catalog no. 3801, Corning, NY), incubated for 24 h with negative control (0.3% DMSO), 10 or 30 M CFFT-001, or 10 M positive control C18 (69) (supplemental Fig. S2), which is distributed by the CF Foundation. Snapwell filters were then inserted into Physiologic Instruments Ussing chambers (Physiologic Instruments, Inc., San Diego, CA) for at least 20 min in 10 ml of buffer solution before CFTR was activated with forskolin.
Simulations and Docking-The crystal structure of F508del NBD1⌬RI⌬RE (Protein Data Bank code 2PZF) (40), comprising residues 387-646 (⌬405-436), was prepared with Discovery Studio (41). Replica Exchange Molecular Dynamics (REMD) simulations were done using the Gromacs Molecular Dynamics package (42,43) with the AMBER99SB-ILDN force field (44). The protein plus its bound ATP was submerged in TIP4P water in a rhombic dodecahedral box with an extra extension along each axis of the protein of 10 Å. Ions were added to the solution to make the system electrically neutral. The structure was minimized and equilibrated, first under NVT conditions for 100 ps and then under NPT conditions for an additional 100 ps. The REMD production phase was carried out under NPT conditions with a time step of 2 fs. REMD simulations consisted of 10 replicas covering a temperature range of 300 K to 313.6 K. The simulation of each replica was run for 2 ns. The temperature range was selected to give the same acceptance probability between all adjacent pairs over the entire temperature range based on known energy distributions for solvated proteins (45). The simulation was run with the leap-frog algorithm (46). Long range electrostatic interactions were computed using Particle Mesh Ewald electrostatics (47,48). The cut-off for van der Waals and Coulomb interactions was set to 10 Å. Periodic boundary conditions were applied. The LINCS algorithm (49) was used to constrain bond lengths. The above procedure was repeated three times starting from different random seeds.
Selected conformations from the simulation, particularly those having frayed H8 and H9 helices and long distances between His-620 and Phe-640, were searched for potential binding sites based on the receptor cavities approach within Discovery Studio (41). Docking simulations of CFFT-001 to the potential binding site between S3/S9/S10 and H8/H9 in the different NBD1 conformations from the REMD were conducted using the flexible docking procedure (50) of Discovery Studio or using the induced fit protocol of Schrodinger (51)(52)(53). Flexible residues were selected to be those facing the binding site.
Differential Scanning Calorimetry-Protein samples of 0.5 ml were diluted to 15 ml in DSC buffer and concentrated to 0.5 ml at 4°C using an Amicon Ultra-15 (10K) centrifugal filter. The procedure was repeated three times. CFFT-001 in 100% DMSO was added to the protein sample to create a 5% DMSO solution at the appropriate compound concentration. DSC was conducted with a VPCapillary DSC System (MicroCal Inc., GE Healthcare) using a scan rate of 2 K/min. A buffer-only heat capacity curve was subtracted from the protein curve, and data were analyzed with the instrument provided software. Repeated measurements for the H620Q F508del sample gave identical T m values to the first decimal place; the general reproducibility of NBD1 T m values is within Ϯ0.3°C.
Sequence Alignment-ABC family C sequences were collected with BLAST (54, 55) using one representative sequence for each of the 12 human ABC family C members (UniProt IDs P33527, Q92887, O15438, O15439, O15440, O95255, Q09428, O60706, Q5T3U5, Q96J66, and Q96J65), comparing them against the default non-redundant database (Aug. 18,2011) using an E threshold of 1e Ϫ 200. The top 200 hits were retrieved, and for the non-CFTR targets, a match to the desired member was ensured by restricting to FASTA files containing any of the following keywords: "member N," "protein N," "mrp-N," "MRP-N," "mrpN," or "MRPN," where N is the member number of the target. All sequences were then filtered by manually removing duplicate sequences at 100% identity, truncated isoforms, sequences containing unsequenced residues (X), and sequences with less than 1400 residues. A total of 294 sequences were obtained, with 80 from CFTR. NBD1 sequences were iso-lated by aligning the full-length sequences in MUSCLE and extracting the alignment spanning human CFTR residues 388 to 680. These sequences were then realigned using MUSCLE. Sequence profiles were calculated for each of the 12 members by taking amino acid and gap frequencies for each position in the alignment.
Highly conserved residues in CFTR were defined by a 95% sequence identity threshold, and their specificity to CFTR was measured by measuring the frequency of the CFTR conserved residue in each of the other 11 family C members. A member was considered a potential match when it has 10% or greater identity to CFTR at that position.

Assignment of F508del
Human NBD1 ⌬RI⌬RE-NMR studies were performed on F508del NBD1 lacking the RI and ending at residue 646 (NBD1 ⌬RI⌬RE: 387-646, ⌬405-436) because of its enhanced solubility and stability (40). NBD1 ⌬RI⌬RE that is otherwise wild-type (referred to as WT) or additionally contains F508del can be concentrated to ϳ1 mM and remains stable at 4°C for several weeks. HSQC spectra of WT and F508del NBD1 ⌬RI⌬RE are overlaid in Fig. 1A. In agreement with crystallographic results (40), the similarity of the two spectra indicates that both share the same overall structure. However, dif-ferences in peak intensity within the individual spectra reflect significant conformational heterogeneity in both the WT and the F508del NBD1 ⌬RI⌬RE domains. WT assignments 5 helped facilitate 82% assignment of backbone resonances for F508del NBD1 ⌬RI⌬RE. Unassigned residues were mapped onto the NBD1 structure and sequence ( Fig. 1, B and C). Many of the unassigned residues are located in loops connecting helical segments. The inability to assign them is likely due to the dynamic nature of these loops leading to broadening of resonances.
H620Q Variant Reduces Helicity of H8 and H9 at the C Terminus of F508del NBD1 ⌬RI⌬RE-To address possible conformational changes within NBD1 that may be relevant to channel activity and/or misfolding, we focused on the C-terminal ␤-strands (S9 and S10) which have been shown to affect processing, activity and pharmacology of CFTR (56). In particular, an H620Q variant (in S9) originally identified in CF patients has an increased P o in single channels (56 -58). Fig. 2, A and B, presents an overlay of NMR spectra for F508del NBD1 ⌬RI⌬RE and its H620Q variant (F508del H620Q NBD1 ⌬RI⌬RE). Arrows indicate a subset of peaks that shift upon H620Q mutation, including Glu-621, Gly-622, Gln-634, Leu-636, Ser-641, Leu-644, and Met-645. Mapping of the chemical shift changes (supplemental Fig. S3A) onto a ribbon diagram of the structure The ␣and ␤-subdomains and the ATP binding core are shown in blue, green, and gray, respectively. The C-terminal helices H8 and H9 are labeled. Red spheres represent the N atoms from residues not assigned in either WT or F508del (three red spheres representing residues Glu-391, Thr-390, and Pro-638 are obstructed from view). Yellow spheres are N atoms of residues not assigned in F508del. The orange sphere is the N atom of Phe-508. ATP is shown in cyan. The deletion site for the RI is indicated. C, using the same color coding as in B, these residues are shown in the amino acid sequence. In total, 82% of F508del NBD1 ⌬RI⌬RE has been assigned.
of WT NBD1 ⌬RI⌬RE (Fig. 2E) shows that in addition to effects adjacent to His-620 (Glu-621, Gly-622) there are changes in strands S3, S9, and S10. There are also significant peak shifts mapping to residues on the two short C-terminal helices (H8 and H9) that are immediately N-terminal to residues of the RE/R region in full-length CFTR. Strikingly, residues on both surfaces of these helices are affected with five of six of the H9 resonances perturbed in the variant. The general direction of the chemical shift changes (toward the center of the amide proton chemical shift range at ϳ8.0 ppm) in H8 and H9 is consistent with a conformational equilibrium shift toward more disorder (supplemental Fig. S4A). Chemical shifts in the absence of the mutation demonstrate that these helices are in equilibrium with a coil conformation (supplemental Fig. S4C) (59) and the chemical shift perturbations for the residues of the H8/H9 helices upon replacing His-620 with a Gln reflect a change in the equilibrium further toward the coil conformation. The chemical shift perturbations expected for a full helix-to-coil transition are much larger than the observed shifts, indicating a small shift in the equilibrium. An examination of the structure reveals that the His-620 side chain in S9 interacts with the Phe-640 side chain in H9 (Fig. 2D). We propose that the helices are stabilized by binding to residues on strands S3, S9, and S10 and that mutation of His-620 disrupts the interaction between the surface of the ␤-strands and H8/H9, reducing the helical structure.
Dual Corrector/Potentiator Activity of CFFT-001-To probe the conformational effects of binding of small molecule CFTR modulators, we focused on the dual corrector/potentiator N-cyclohexyl-4-(6-methyl-3-pyridinyl) pyrimidine-2-amine (CFFT-001, supplemental Fig. S2) that was originally synthesized in the CFTR modulator drug discovery program of EPIX Pharmaceuticals, Inc. funded by the Cystic Fibrosis Foundation Therapeutics (Bethesda, MD). The electrophysiological data in Fig. 3 illustrate the dual acting properties of CFFT-001. After CFTR was maximally activated by subsequent additions of 10 M forskolin, 100 M IBMX, and 20 M genistein, cells incubated for 24 h with 30 M CFFT-001 showed a robust peak current increase of 68% compared with control (Fig. 3A), pointing to an increase in membrane-localized channels and illustrating the corrector activity of the compound. ) and red to the largest shifts. The N atoms for residues whose chemical shift changes upon mutation yet cannot be identified with certainty are shown as magenta spheres. His-620 and Phe-508 are shown as green and orange spheres, respectively.
After activation of CFTR with 10 M forskolin, acute addition of CFFT-001 to 0, 3, 10, 30, or 60 M final concentrations in the half-chamber bathing the apical membrane of the FRT cells leads to a clear dose-dependent current increase (Fig. 3B), as expected for a CFTR potentiator. Consistent with the concept of an absolute biophysical limit for the maximal open probability, the additional current increases due to subsequent additions of 100 M IBMX and 20 M genistein have the inverse rank order of the CFFT-001-mediated effect: the IBMX/genistein response is largest without any acute addition of CFFT-001 and smallest for the highest dose of the compound. It should be noted that the Ͼ60% increase in inhibitor-sensitive current over the DMSO control condition observed after 24 h of incubation with 30 M CFFT-001 (Fig. 3A) is not seen after acute addition of the compound (Fig. 3B), clearly distinguishing the corrector and potentiator activities of CFFT-001 in these assays.
CFFT-001 Compound Reduces Helicity of H8 and H9 in F508del NBD1 ⌬RI⌬RE-To determine whether the dual corrector/potentiator CFFT-001 compound binds to F508del NBD1 ⌬RI⌬RE and to monitor any conformational changes that may result, we recorded NMR spectra following several titration points of CFFT-001. Spectra were obtained after addition of each of several aliquots of CFFT-001 (1:1, 2:1, and 3:1, compound:protein) to generate a series of spectra representing changes as a function of apparent compound concentration up to 750:250 M of CFFT-001:NBD1. Due to the relatively low aqueous solubility of CFFT-001 (ϳ30 M), we have not saturated the complex, and our stated compound concentrations are likely to be significantly over-estimated, consistent with expectations from the 13 M EC 50 for potentiation. Titrations beyond ϳ3:1 (apparent compound:protein concentrations) requiring Ͼ750 M apparent compound concentrations were not performed due to precipitation of CFFT-001 in the aqueous NMR buffer (only including up to 1.5% DMSO to minimize protein destabilization due to this organic solvent). Fig. 4, A and B, shows a spectral overlay for F508del NBD1 ⌬RI⌬RE in the apo state and after the final CFFT-001 titration point, with arrows pointing to a subset of peaks that shift. Comparison of this set of peaks with a subset of those resulting from the H620Q mutation (compare Fig. 2 with Fig. 4) shows significant similarities, including both the identity of perturbed resonances and the direction of the chemical shift changes. These similarities are consistent with a common effect of the H620Q substitution and CFFT-001 on H8 and H9, although additional peak shifts reflecting other conformational changes in the mutant are present as well ( Fig. 2A). To account for small shifts caused by the compound solvent DMSO, chemical shifts in the presence of DMSO at appropriate concentrations were subtracted from chemical shift changes due to CFFT-001. Mapping of the resulting chemical shift changes (Fig. 4C) onto a ribbon diagram of the structure of WT NBD1 ⌬RI⌬RE clearly shows that the greatest peak shifts map to residues on helices H8 and H9. Similar to the H620Q variant, residues on both surfaces of these helices are affected by CFFT-001, as opposed to one surface of the helix as one might expect for a direct drug interaction with the helix. The linearity of the size of peak shifts as a function of added CFFT-001 indicates equilibrium between two states that are in fast exchange on the NMR timescale, with addition of the compound shifting the equilibrium to one of the states. The direction (toward the center of the amide proton chemical shift range at ϳ8.0 ppm) of most of the amide proton chemical shift changes is consistent with a conformational equilibrium shift toward coil, shifting the underlying H8/H9 helix-coil equilibrium further toward disordered states (supplemental Fig. S4B). There are also smaller chemical shift changes in residues 458 and 462 (located in the loop between S3 and helix H1) and ␤-strands S9 and S10, a region contacted by helices H8 and H9.
Importantly, the size of the chemical shift change is not directly correlated with proximity to the compound. Rather, the size of the shift is related to changes in the immediate chemical environment of each amide proton-nitrogen pair due to either direct binding or indirect structural and dynamic effects. H8 and H9 form an unlikely interaction surface for CFFT-001, with their polar, charged, and irregular surfaces. The com-pound is hydrophobic and contains planar aromatic rings, suggesting that a flatter surface composed of hydrophobic residues in S3, S9, and S10 could constitute a more suitable site for drug binding. We propose that CFFT-001 binds to a hydrophobic surface formed by residues of these strands lying below H8 and H9, thereby disrupting the interaction between the sheet and the helices and reducing their helicity. The small shifts in the hydrophobic surface residues are attributable to the small fraction bound, the relatively minor conformational adjustments to these strands upon compound binding, and the expectation of  Compare Fig. 2, A and B, with Fig. 4, A and B. C, ribbon diagram of WT NBD1 ⌬RI⌬RE as described in Fig. 2 with the color gradient representing chemical shift changes due to compound addition. poorer sensitivity of the observed resonances of the backbone atoms to compound binding than unobserved resonances of side chain atoms likely in direct contact with the compound. The larger shifts observed for H8 and H9 are attributable to the more dramatic backbone conformational changes upon ligand binding.
In addition to the F508del NBD1 ⌬RI⌬RE protein, we have also titrated the compound into WT NBD1 ⌬RI⌬RE and saw nearly identical peak shifts (supplemental Fig. S5) and a similar reduction of helicity. Interestingly, titration of the compound into the F508del H620Q NBD1 ⌬RI⌬RE leads to resonances of H9 shifting more toward coil (Fig. 2C, arrows). Thus, the compound further pushes the conformational equilibrium shift already present in the H620Q variant, in a qualitatively additive fashion, and the mutation does not inhibit compound binding.
Effect of Deletion of H9 on Binding of CFFT-001-Although the most significant peak shifts observed upon addition of compound map to residues in the C-terminal helices, we hypothesize that CFFT-001 has a direct interaction with residues of S3, S9, and S10 that results in shifting the H8/H9 conformational equilibrium. To address this hypothesis and to test whether removal of these helices would allow greater accessibility for the compound to the surface of the sheet, we attempted to probe binding to NBD1 with both H8 and H9 deleted. The protein, however, was unstable, and sufficient yields and purity could not be attained. Notably, we were able to express and purify NBD1 in which residues 637-646 comprising H9 were deleted (Fig. 5A), consistent with the lack of density or conformational heterogeneity for residues of H9 in various crystal structures of NBD1 (supplemental Fig. S4D) (40,60,61) and our NMR data showing fraying of H9 (supplemental Fig. S4C). Comparison of spectra of F508del NBD1 ⌬RI⌬RE with F508del NBD1 ⌬RI⌬H9 (Fig. 5A) shows that deletion of residues 637-646 leads to significant chemical shift changes in addition to absence of peaks of deleted residues. Chemical shift changes can be noted in H7, H8, S3, S9, S10, and elsewhere in the protein (Fig. 5B). The largest shifts were observed for Gly-458 (in the loop between S3 and H1), confirming the loss of binding interaction to H9, and, as expected, Arg-636, at the C terminus before the truncation. We titrated CFFT-001 (1:1, 2:1, and 3:1, apparent compound: protein concentrations) into F508del NBD1 ⌬RI⌬H9 spectra and observed that the largest shift, again, was for Gly-458 (Fig.  5, A and C). This highlights the sensitivity of S3, S9, and S10 to compound binding, providing evidence for direct binding of the compound on the surface of this portion of the NBD1 core. Comparison of Figs. 5C and 4C shows a more extensive effect of compound binding (including S2 and the intersubdomain region) and more significant peak shifts in constructs lacking H9, suggestive of higher affinity binding. This is expected as moving H9 away to facilitate accessibility to the actual binding site on the surface of strands S3, S9, and S10 requires energy, so deleting it should lead to more favorable binding energy. These data also indicate that NBD1 retains the same basic fold in the absence of H9 and support our model of compound binding to the ␤-strands below the C-terminal helices and not to H9 itself (Fig. 5C).
Simulated Docking of CFFT-001 onto NBD1-To address whether the region around S3/S9/S10 and H8/H9 contains a suitable binding site for CFFT-001, we carried out docking simulations. Examination of the crystal structure of F508del NBD1 ⌬RI⌬RE (Protein Data Bank code 2PZF) did not reveal any compound binding sites in this region. Because NMR data suggest that the binding does not occur to the ground state described in the crystal structure, we utilized REMD simulations starting from the structure of the ATP-bound NBD1 to sample higher energy states. In accordance with NMR data, we identified conformations in which H8 and H9 showed substantial fraying of helical structure as well as large distances between His-620 in S9 and Phe-640 in H9. We performed a site search procedure (41) on several of these conformations, leading to the identification of potential compound binding sites in this region that were not present in the ground state. These potential binding sites were subjected to docking simulations using the flexible docking procedure in Discovery Studio (50) and the induced fit protocol in Schrodinger (52,53,62). Docking resulted in multiple binding modes to the surface of S3/S9/S10 (Fig. 6). Interestingly, H9 contributed to the binding in many cases but not usually through specific H-bonds orinteractions with the ligand.
Because CFFT-001 also binds to NBD1 in the absence of H9, we repeated the docking simulations after removing residues comprising H9 (637-646) from the previously chosen REMD trajectory conformers. Once more, multiple binding modes were obtained in which the ligand comes into close contact with the hydrophobic S3/S9/S10 surface (supplemental Fig. S6). A few specific interactions with binding site residues (e.g. -cation interactions with Lys-464) were observed in some cases. Similar results were obtained upon docking CFFT-001 to chain B of Protein Data Bank code 2PFZ, which lacks H9. In all of these binding modes, interaction of CFFT-001 with the surface of strands S3, S9, and S10 is not expected to be significantly perturbed by mutation of H620Q, which is adjacent to but not directly at the proposed binding surface (Fig. 6 and supplemental Fig. S6). Note also that ATP did not contact the compound in any of the REMD or docking simulations.
Direct Interaction between CFFT-001 and Mutations of NBD1 That Decrease Thermal Stability-Having examined conformational changes within NBD1 as a result of mutation and compound addition using NMR approaches, we used differential scanning calorimetry (DSC) to probe these perturbations. The data in Fig. 7A illustrate that both the H620Q variant and the deletion of H9 reduce the thermal stability of NBD1. The midpoint of thermal denaturation, T m , is reduced by ϳ1-2 degrees in each, comparing F508del NBD1 ⌬RI⌬RE (T m in 2 mM ATP ϭ 49.4°C and 5 mM ATP ϭ 51.0°C) to either F508del H620Q NBD1 ⌬RI⌬RE (bottom curve, T m in 2 mM ATP ϭ 47.1°C) or F508del NBD1 ⌬RI⌬H9 (middle curve, T m in 5 mM ATP ϭ 50.4°C). Furthermore, direct CFFT-001 binding to NBD1 is indicated by the consistent reduction in T m of ϳ1 degree when the compound is incubated at ϳ1 mM (apparent concentration) with the various NBD1 constructs, for F508del NBD1 ⌬RI⌬RE from 49.4 to 48.6°C, at 2 mM ATP, and for F508del NBD1 ⌬RI⌬H9 from 50.4 to 49.4°C, at 5 mM ATP. In addition, there is a clear concentration-dependent loss of thermal stability (Fig. 7, B-D). Note that although ATP concentration changes the absolute T m values, it is not the T m values but the changes in T m upon mutation or CFFT-001 binding (at constant ATP concentration) that are important. Given that ATP is already saturated by 2 mM and there are no contacts between ATP and His-620, helix H9, or the compound, the changes in T m values should not be significantly affected by the ATP concentration.
Importantly, the negative T m shift observed upon binding of CFFT-001 is diagnostic of the compound interacting and stabilizing a conformation of NBD1 that is less thermostable, compatible with binding to a state in which the C-terminal helices are released from the NBD1 core. The data are consistent with both mutation and compound binding facilitating a shift in the underlying conformational equilibrium that leads to the preferential population of the state having the C-terminal helices released from the core. These data also indicate that deletions of Phe-508 and H9 do not eliminate the CFFT-001 binding site because the T m values of NBD1 containing F508del or both mutations are equally affected by CFFT-001. Overall, the DSC data are in agreement with the effects of deletion of H9, an H620Q variant, and addition of CFFT-001 on NBD1 inferred from NMR data for these constructs.
CFTR-specific Sequence Conservation of Strands S9 and S10-CFTR differs structurally from other members of the ABC C family by the addition of the RI and R region phosphoregulatory segments that have been proposed to interact with the core of NBD1 in a phosphorylation-dependent fashion (31,63). We hypothesized that these interactions require a binding site that is unique to CFTR. We expected to find CFTR-specific conservation of surface residues in the NBD1 of CFTR that differ significantly from their counterparts in other ABC C family members and that would point to RI and R region-specific functional surfaces on NBD1 involved in regulation of channel activity and/or misfolding. We further hypothesized that these surfaces could potentially be linked to the conformational changes within NBD1 identified by our mutational and compound binding studies. As such, an analysis of sequence conservation was undertaken, with a specific focus on separating conservation common to ABC transporters from conservation that is unique to CFTR.
We prepared a sequence alignment comparing NBD1s from CFTR to sequences from each of the 11 other ABC C family members, using all 12 human sequences to identify homologs while filtering by annotation to ensure that the 12 members were kept distinct. From this, we determined the degree to which conserved CFTR residues (Ͼ95% conserved) in NBD1 match those found in other ABC C family members (see supplemental Table S1 for sequence alignment). Fig. 8 depicts the location of highly conserved residues in NBD1, colored according to the number of ABC C family members that have Ͼ10% sequence identity to CFTR at that position, with highly conserved residues that are unique to CFTR shown in red. Of these uniquely conserved residues, the largest solvent exposed cluster occurs in ␤-strands S9 and S10, with His-620, Tyr-625, Phe-626, and Tyr-627 having no matches to other ABC C family members, and Lys-615 only having a match to one. Significantly, S10 contains three unique hydrophobic aromatics in CFTR, but in all other ABC C family members, S10 has instead at least one strongly polar residue, primarily Glu, Arg, Lys, and Gln. This region of high conservation, namely S9 and S10, coincides with the hydrophobic region we have described here as the site of CFFT-001 binding immediately below helices H8 and H9 and is also the location of residues shown to affect channel activity, including His-620 (56). This high degree of CFTR-specific conservation supports the observation that this region plays a role in conformational changes related to CFTR function and/or processing.

DISCUSSION
Molecular level knowledge on the conformational changes within NBD1 that are relevant to channel activity and folding energetics of CFTR is currently limited. Mutations, deletions and pharmacological effects are most often measured in the context of a full length CFTR molecule and usually in whole cells where many proteins are involved. NMR has enabled us to probe these effects at a residue-specific level and demonstrate that the H620Q substitution associated with higher channel open probability and a dual corrector/potentiator compound give rise to similar conformational changes within NBD1. These results, as well as their synergy with other dynamic and energetic data on NBD1 (29,31,32,64), provide evidence for conformational properties of NBD1 that likely are involved in regulation of channel gating and in folding that may be tested in the future by studies on full-length CFTR.
The significance of the DSC results for the H620Q and H9 deletion can be better appreciated in the context of our current understanding of the NBD1 thermal unfolding pathway derived from a comprehensive analysis of previous DSC data on WT and F508del NBD1 (32,64). Deletion of F508 leads to a T m reduction of 6 -7°C (Fig. 7) resulting from the combined effects of a lower thermodynamic stability of the native state (⌬G) and a faster rate for the formation of an aggregation-prone intermediate. The H620Q and ⌬H9 mutations also may reduce the thermodynamic stability of NBD1, or accelerate the rate of aggregation, or both. It is reasonable to expect that either structural modification would reduce the T m of the protein because dynamic, but productive, interactions with the remainder of NBD1 have been disrupted or eliminated.
Mass action effects provide a clear, albeit unexpected, interpretation for the concentration-dependent decrease in the NBD1 T m upon CFFT-001 binding. Typically, ligand binding to the native receptor results in an apparent increase in stability and T m for receptor unfolding. However, here the T m decreases, indicating that CFFT-001 binds preferentially to a conformation that is populated to a lower extent in its absence. FIGURE 8. Sequence conservation unique to CFTR. Sequence profiles were determined for each of the 12 ABC subfamily C members, including CFTR. Conservation values were mapped onto the structure of WT CFTR NBD1 (Protein Data Bank 2PZE). Residues with Ͻ95% conservation in CFTR (by sequence identity) are colored in light blue. Residues with Ͼ95% conservation are colored in a gradient from red to white, with the residues that are unique to CFTR in red, and the residues that are present (Ͼ10% sequence identity) in other subfamily C members colored from dark pink (one matching ABC) to white (11 matching ABCs). A, ribbon diagrams in two orientations, with the N atoms of unique residues shown as spheres. B, solvent accessible surfaces for the same two orientations.
Any such conformational equilibrium shift toward a state having reduced structural contacts gives rise to a reduction in T m , based on thermodynamic principles, although compound binding makes this the more stable state. NMR data point to the release and loss of helicity for H8 and H9 upon CFFT-001 binding, yielding a conformation that would qualify as the non-native state to which CFFT-001 binds.
The H620Q variant, originally identified in CF patients demonstrating pancreatic insufficiency, has previously been shown to increase the P o of single CFTR channels, indicating an effect on the gating properties of CFTR (56 -58). Both the H620Q variant and the CFFT-001 compound cause a shift of helices H8 and H9 from a preexisting helix-coil conformational equilibrium toward the coil state. These results suggest that perturbations affecting this conformational equilibrium coincide with conditions that promote channel opening and/or impede channel closing. Because the H620Q variant displays additional peak shifts relative to those observed for compound binding, there are certainly other consequences of the mutation. The Becq group (56) has demonstrated that the region surrounding H620 is critical for trafficking and gating. Although H620Q does not affect maturation/processing, an H620P mutation does, highlighting the importance of this position and the nature of the residue in it.
Previous models for the interaction of the RE/R region with NBD1 suggest that the RE is dynamic but that it can interact with the proposed NBD1/NBD2 heterodimerization interface on the core of NBD1 (4,31,61,63). Work from our laboratory on murine NBD1 containing the RE (31) as well as isolated full-length R region with murine (63) and human 6 NBD1 shows that the RE, which consists of the first 30 residues of the R region, transiently interacts with the surface of the NBD1 core in helical conformations. The RE contains two PKA phosphorylation sites, at 660 and 670. One effect of PKA-mediated phosphorylation of the RE or R region is to reduce this interaction and the helical structure 6 (4, 31), shifting the conformational ensemble and potentially facilitating NBD heterodimerization and enhancing channel activity. Our current data on NBD1 lacking the RE, which is a mimic for the extreme of this confor-mational equilibrium in which the RE is always off of the surface, suggest that H9 may be thought of as the first part of the RE and R region with a conformational equilibrium between helical states bound to the core of NBD1 and coil states not interacting with the NBD1 core. This is consistent with the significant conformational heterogeneity of H9 in NBD1 crystal structures (supplemental Fig. S4D) and with previous definitions of the C-terminal NBD1 boundary (19,40,65). Thus, we hypothesize that conformational changes, as elicited by H620Q or CFFT-001, result in a shift in an underlying conformational equilibrium of regulatory interactions that are normally involved in gating. This shift involves release of H8 and H9 from the surface of S3, S9, and S10 that helps displace the RE/R region from the NBD1 dimerization interface (Fig. 9). This should promote NBD dimerization and lead to an enhanced open probability, increased channel activity for H620Q mutant channels and the potentiating effect of CFFT-001. It is possible that H620Q and CFFT-001 act to force an "unnatural" gating; however, the likelihood of this is low, considering that the portion of NBD1 affected appears to be a regulatory hot spot based on our sequence analysis and the observed effects of mutations here (56). Note that although phosphorylation is expected to have an overlapping effect to compound binding in similarly disrupting the R region:NBD1 interface, R region phosphorylation has numerous other effects including enhancement of R region interactions with other parts of CFTR (9) and modulations of R region binding with other binding partners. 6 It is also possible that CFFT-001 may bind more strongly to full-length CFTR in the cell where many other factors contribute toward shifting this conformational equilibrium, including phosphorylation by kinases other than PKA (66) as well as multiple binding partners for the R region that compete with NBD1 for interaction. 6 Such a scenario is consistent with our observation that PKA phosphorylation of CFTR is required to observe potentiator activity for CFFT-001 in cells.
The CFFT-001 compound may contribute to the corrector effect by binding to the core of NBD1 and stabilizing the folded state of the core NBD1 (not including H8 and H9) or by promoting NBD dimerization that could stabilize the folding of full-length CFTR. As shown by DSC, CFFT-001 does not stabilize the folded state of an isolated NBD1, including H8 and H9, 6 Z. Bozoky and J. D. Forman-Kay, manuscript in preparation. FIGURE 9. Simplified schematic model for functional dynamics within NBD1. NBD1 (dark blue) is shown as residing in two populations in a dynamic equilibrium. When channels are closed, NBD1/NBD2 heterodimers are inhibited due to the steric hindrance of the RE/R region interacting with NBD1. Addition of compound or the H620Q mutation shifts this equilibrium by reducing H9 helicity and contacts with the NBD1, subsequently leading to release of the RE/R region from the dimerization interface, relieving the inhibition and facilitating NBD1/NBD2 heterodimerization and channel opening.
but it may stabilize the NBD1 core within full-length CFTR. Binding of CFFT-001 to the uniquely conserved S9 and S10 site is consistent with CFTR-specific potentiating and/or correcting effects. Mutations of residues in these C-terminal strands of NBD1, specifically G622D and G628R, have been demonstrated to perturb the pharmacological effects of dual "MPB (benzo(c)quinolizinium)" compounds, characterized by their ability to both activate CFTR and rescue defective trafficking (56). This suggests that the C-terminal part of NBD1 impacts both gating and trafficking and plays a role in the mechanism of action for these dual acting compounds. It may also be that the mechanisms of action for the CFFT-001 corrector and/or potentiator activities occur at separate sites on the CFTR molecule and involve other domains of CFTR or proteins, making them impossible to dissect using an isolated NBD1.
The identification of S9 and S10 as uniquely conserved and as the site of potentiating mutations and potentiator/corrector compound binding may suggest that this region is an allosteric regulator of channel function or folding. With an understanding of allostery based on modulation of the energy landscape (67), NBD1 can be seen as a sensitive energetically malleable domain, with conformational sampling poised to be affected by a variety of perturbations such as compound binding (29). Thus, although the helix-coil transition for H8 and H9 may be qualitatively correlated with potentiation under some conditions, it is not necessarily quantitatively correlated with potentiation under all conditions, because the set of conformations accessible under different conditions (i.e. phosphorylated or not, nucleotide-bound state, etc.) may not be the same. A similar argument can be made for an underlying allosteric mechanism of suppression by the "3M" (G550E/R553M/R555K) mutations (17) which can improve CFTR processing in the absence of F508; even though they do not directly address the structural changes at the Phe-508 site, these mutations apparently change the equilibrium distribution of conformations accessed by NBD1 to be more similar to that of wild-type. The large number of substitutions in NBD1 that can suppress the F508del mutation supports such a general allosteric view of NBD1 with the structural changes we observe in the H620Q variant and upon CFFT-001 binding being one part of the NBD1 conformational equilibria. The conformational dynamics involving H8/H9 and the start of the R region highlighted here may facilitate other changes in CFTR that are not encompassed by the highly simplified model presented in Fig. 9, including modulation of the dynamic R region interaction hub, 6 and allosteric effects within NBD1 on ATP binding and hydrolysis or NBD/intracellular domain coupling. These may be predicted to have potentiator and corrector consequences.
We have demonstrated that both a mutation and a compound elicit similar conformational changes within NBD1 that shift an underlying equilibrium involving helical structure in H8/H9 and their interaction with the uniquely conserved CFTR-specific region of ␤-strands S3/S9/S10. Because the H620Q mutation and CFFT-001 are both associated with an increase in the open probability of CFTR channels and because H8/H9 lead directly into the RE/R region within the context of full-length CFTR, we have hypothesized that this conformational shift at H8/H9 releases the R region from the NBD1/ NBD2 dimerization interface, allowing heterodimers to form and thereby enhancing channel open probability and possibly processing. The uniquely conserved ␤-strands S9/S10 may thus be a key CFTR-specific lynchpin for integrating phosphoregulatory signals from the R region to NBD1 making it an attractive site for CFTR-specific therapeutics. This is consistent with previous evidence (56) pointing to ␤-strands S9 and S10 as a potential target for the design of more potent and selective CFTR modulators. These studies provide a foundation for further detailed analysis of the effects of mutations and other compounds on underlying conformational changes within NBD1 to provide insights into dynamic processes required for proper CFTR function and the rescue of these processes in the diseased state.