The Streptococcus gordonii Adhesin CshA Protein Binds Host Fibronectin via a Catch-Clamp Mechanism*♦

Adherence of bacteria to biotic or abiotic surfaces is a prerequisite for host colonization and represents an important step in microbial pathogenicity. This attachment is facilitated by bacterial adhesins at the cell surface. Because of their size and often elaborate multidomain architectures, these polypeptides represent challenging targets for detailed structural and functional characterization. The multifunctional fibrillar adhesin CshA, which mediates binding to both host molecules and other microorganisms, is an important determinant of colonization by Streptococcus gordonii, an oral commensal and opportunistic pathogen of animals and humans. CshA binds the high-molecular-weight glycoprotein fibronectin (Fn) via an N-terminal non-repetitive region, and this protein-protein interaction has been proposed to promote S. gordonii colonization at multiple sites within the host. However, the molecular details of how these two proteins interact have yet to be established. Here we present a structural description of the Fn binding N-terminal region of CshA, derived from a combination of X-ray crystallography, small angle X-ray scattering, and complementary biophysical methods. In vitro binding studies support a previously unreported two-state “catch-clamp” mechanism of Fn binding by CshA, in which the disordered N-terminal domain of CshA acts to “catch” Fn, via formation of a rapidly assembled but also readily dissociable pre-complex, enabling its neighboring ligand binding domain to tightly clamp the two polypeptides together. This study presents a new paradigm for target binding by a bacterial adhesin, the identification of which will inform future efforts toward the development of anti-adhesive agents that target S. gordonii and related streptococci.

The adherence of bacteria to material and host cell surfaces is essential for colonization, persistence, and pathogenicity. This process is facilitated by bacterial cell-surface components termed adhesins, which recognize and bind specific partner molecules presented on the surfaces of host cells and other microorganisms (1). The filamentous adhesins, comprising pili and fibrils, are among the largest and most complex of all bacterial surface proteins. They are found in both Gram-positive and Gram-negative bacteria, including many pathogenic species, and have attracted considerable attention due to their roles in colonization, infection, and as vaccine candidates (2,3). Characteristically, these proteins form sizable polymeric assemblies that project outwards from the bacterial cell surface, presenting an adhesive target-binding region at their tip. Fibrillar adhesins are expressed by a wide variety of bacteria and display an extraordinarily diverse array of adhesive functions. In contrast to the extensively studied pili, much still remains to be learned about the structures and functions of these proteins. Fibrillar adhesins usually include a single multidomain polypeptide chain, covalently tethered to the bacterial cell wall (1). They frequently display exquisite selectivity for their partner ligands, with their modular architectures providing a platform for multifunctionality (4 -6). Although smaller than pili, they are inherently more complex, with divergent molecular structures, functions, and adhesive properties (7)(8)(9).
Streptococcus species produce a multitude of fibrillar adhesins that promote binding to host cells and other microorganisms. These include, among others, the M proteins of Streptococcus pyogenes and the antigen I/II family polypeptides (10 -12). Streptococcus gordonii, a primary colonizer of the human oral cavity and an opportunistic pathogen, has been shown to present a number of fibrillar adhesins on its surface, including the polypeptide CshA. In vitro and in vivo binding assays, gene disruption experiments, and heterologous expression studies in the non-adherent bacterium Enterococcus faecalis have established CshA as an important determinant of S. gordonii adherence (4,13). This protein has been shown to play a role in binding both to host molecules and a range of other microorganisms (14).
CshA is a 259-kDa polypeptide (4,13) that forms ϳ60-nm peritrichous fibrils on the surface of S. gordonii (4). It shares Ͻ10% overall sequence identity to any protein of known structure, indicative of divergent or novel function. The CshA pre-protein is composed of 2508-amino acid (aa) 3 residues, organized in the form of a leader peptide (residues 1-41), a non-repetitive region (residues 42-778), 17 repeat domains (R1-R17, each ϳ101-aa residues), and a C-terminal cell wall anchor (13). CshA has been shown to bind the high molecular weight glycoprotein fibronectin (Fn) via an interaction that is mediated by the N-terminal non-repetitive region of the protein (7). Antibodies specific to this portion of CshA block Fn binding, whereas those raised against the repeat domain region elicit no effect (15). The CshA-Fn interaction has been proposed to be of general significance in promoting S. gordonii colonization at a range of sites within the host (7,15,16). These include the cardiac endothelium, where adherence by S. gordonii is known to promote the onset of infective endocarditis, a severe, potentially fatal inflammation of the inner tissues of the heart (17).
Despite the importance of CshA, little is known about the molecular structure and function of this protein. Such information would provide fundamental mechanistic insight and may inform the development of anti-adhesive agents that target S. gordonii and related streptococci.
Here we report a structural and functional description of the non-repetitive Fn-binding region of CshA. We reveal that this part of the polypeptide is composed of three distinct domains, designated herein as non-repetitive domain 1 (NR1, CshA(42-222)), non-repetitive domain 2 (NR2, CshA(223-540)), and non-repetitive domain 3 (NR3, CshA(582-814)). In vitro Fn binding assays of truncated CshA proteins heterologously expressed on the surface of the non-adherent bacterium Lactococcus lactis demonstrate that both NR1 and NR2, but not NR3, confer adhesive properties to CshA. Biolayer interferometry analysis of Fn binding by the recombinant CshA NR domains reveals that NR2 binds both cellular and plasma fibronectin with significantly lower k a , k d , and K D values than NR1. Using circular dichroism (CD) spectroscopy, small angle X-ray scattering (SAXS), and allied biophysical methods, NR1 is shown to constitute a discrete intrinsically disordered domain (IDD). The crystal structure of NR2 is also presented, which adopts a lectin-like fold with a clearly identifiable ligand-binding site. Together, our data are consistent with a two-state mechanism of Fn binding by CshA, where NR1 functions to recognize and bind Fn, forming a dissociable pre-complex, which is subsequently stabilized by a high affinity binding interaction mediated by NR2. This "catch-clamp" mechanism of Fn binding may be of general significance in other bacterial adhesins that contain intrinsically disordered domains.

Results
Reassignment of CshA Domain Architecture-CshA has previously been shown to comprise four distinct regions as follows: an N-terminal signal peptide; a Ͼ80-kDa non-repetitive region; 17 ϳ100-aa residue repeat domains, and a C-terminal cell wall anchor. In an effort to provide a more detailed description of the domain architecture of CshA, the sequence of this polypeptide was subjected to comprehensive bioinformatic analysis. Predictions of secondary and tertiary structure, disorder content, domain composition, and homology modeling of selected regions of the protein were performed. Our reassigned CshA domain composition is summarized in Fig. 1. The most significant consequence was assignment of the non-repetitive region of CshA into three distinct domains, designated NR1 (CshA(42-222)), NR2 (CshA(223-540)), and NR3 CshA(582-814)).
Fn Binding by CshA Is Confined to the NR1 and NR2 Domains-In previous studies, CshA has been shown to promote adherence by S. gordonii DL1 to the extracellular matrix protein Fn, an interaction mediated by the non-repetitive region of the protein (7,14). To determine which of the NR domains of CshA contribute to Fn binding, full-length CshA (Fl_CshA) and two truncated forms of the protein, CshA⌬NR1 and CshA⌬NR1 ϩ 2, were expressed on the surface of the nonadherent bacterium L. lactis. Genes encoding full-length CshA, CshA⌬NR1, and CshA⌬NR1 ϩ 2 were cloned into the pMSP7517 shuttle vector (18) and transformed into wild type L. lactis by electroporation. Constructs encoding the two CshA deletion mutants included the wild type CshA leader peptide FIGURE 1. Reassigned domain architecture of CshA. The CshA polypeptide comprises an N-terminal leader peptide (residues 1-40), a three-domain non-repetitive region (residues 41-819), a repeat region formed from 17 90 -102-aa domains (residues 820 -2507), and a C-terminal LPXTG cell wall anchor motif. The CshA domains NR1, NR2, NR3, and R13, produced recombinantly during this study, are highlighted (masses include vector encoded His 6 -tag and linker). sequence fused to their N termini to ensure efficient export from the cytoplasm.
Levels of surface-expressed CshA polypeptides were determined by dot immunoblot analysis of S. gordonii DL1, L. lactis MG1363, and L. lactis strains expressing Fl_CshA, CshA⌬NR1, or CshA⌬NR1 ϩ 2. Blots were probed with antibodies raised against the repetitive region of the protein ( Fig. 2A). There was no reactivity of the CshA antiserum with the L. lactis MG1363 control. However, S. gordonii DL1 and each of the three L. lactis strains engineered to express CshA proteins presented the respective polypeptides on their surfaces. Variations in the levels of CshA surface expression were observed in each of the different strains, with Fl_CshA produced in a greater amount than CshA⌬NR1 or CshA⌬NR1 ϩ 2 ( Fig. 2A).
To measure the relative binding properties of heterologously expressed Fl_CshA, CshA⌬NR1, and CshA⌬NR1 ϩ 2, the ability of lactococci expressing each of these proteins to adhere to immobilized human cellular Fn (cFn) was assessed (Fig. 2B). Levels of adherence were compared with those of wild type S. gordonii DL1 and non-CshA-expressing L. lactis MG1363. Expression of full-length CshA on the surface of L. lactis conferred the ability to bind cFn. L. lactis cells expressing CshA⌬NR1 were ϳ2-fold more adherent to cFn than L. lactis::Fl_CshA, suggesting that exposure of NR2, by deletion of NR1, promotes target binding by CshA. L. lactis cells expressing CshA⌬NR1 ϩ 2 bound cFn at a level not significantly greater (as established by analysis of variance) than L. lactis MG1363, suggesting that the NR3 domain is dispensable for cFn binding.
Kinetics of Fn Binding by Recombinant CshA NR Domains-To further probe the Fn binding properties of CshA NR domains, each of NR1, NR2, and NR3 were recombinantly overexpressed in Escherichia coli and purified to homogeneity (Ͼ95% purity). Recombinant NR2 was isolated as a mixture of monomeric (ϳ70% of total material) and dimeric (ϳ30% of total material) species that could be readily separated by sizeexclusion chromatography. Concentration of monomeric NR2 by ultrafiltration failed to yield the dimeric species, suggesting that NR2 dimerization is concentration-independent. Given this, and the absence of any prior experimental evidence to support homodimerization of the CshA non-repetitive region (4,13), monomeric NR2 was taken to be the biologically relevant form of the protein and was used in all subsequent Fn binding studies.
The kinetics of human cFn and plasma Fn (pFn) binding by recombinant CshA NR domains were determined by biolayer interferometry (Blitz, ForteBio). Both cFn and pFn were analyzed as they share many structural features and functional properties but are synthesized by different cell types and at different locations. Fn proteins were amide-coupled to the tip surface of amine-reactive second-generation (AR2G) biosensors (ForteBio) prior to analysis. Binding affinities were measured between each of the two Fn proteins and NR1, monomeric NR2, and NR3; the control proteins BSA and CshA repeat domain 13 (R13); and a commercially sourced anti-Fn antibody (Dao; Fig. 3). The R13 domain of CshA was chosen for analysis as unlike other repetitive domains of the polypeptide, recombinant protein could be readily produced in high quantity. Experimentally determined kinetic parameters for cFn and pFn binding are summarized in Table 1. Neither BSA nor CshA R13 bound either form of Fn. By contrast, the anti-Fn antibody bound both substrates with an equilibrium dissociation constant (K D ) in the low micromolar range (Table 1). Recombinant NR1 bound both cFn and pFn with a K D value approximately twice that of the anti-Fn antibody. This confirms that NR1 binds cFn and pFn but with lower affinity than the anti-Fn antibody. NR2 was found to bind cFn and pFn with a Ͼ10and Ͼ4-fold lower K D value, respectively, than NR1, and a Ͼ5and Ͼ2-fold lower K D value, respectively, than the anti-Fn antibody. These data demonstrate that NR2 binds Fn with high affinity, an observation in keeping with our heterologous expression studies, where deletion of NR1 serves to promote adherence to cFn by L. lactis strains. Kinetic parameters for Fn binding by CshA NR1 and NR2 suggest that NR1 forms a rapidly assembled (higher k a ) but readily dissociable (higher k d ) complex with Fn, whereas substrate binding by NR2 occurs over a longer time scale (lower k a ), yielding a less rapidly dissociating complex (lower k d ). Although there is variation in the kinetic parameters of cFn and pFn binding by individual CshA domains, the relative binding properties of each domain are consistent irrespective of the form of Fn used. NR3 is found to bind both cFn and pFn weakly, confirming a negligible role in defining the adhesive properties of CshA.
CshA NR1 Is a Discrete, Intrinsically Disordered Domain-Inspection of the aa sequence of the NR1 domain of CshA reveals that this protein is rich in disorder-promoting residues (10% Pro, 11% Glu, and 12% Ser) and deficient in order-promoting residues (Ͻ5% Ile and Leu, with no Cys, Phe, Trp, or Tyr). Analysis of the NR1 sequence using PONDR-FIT (19) suggests that Ͼ95% of NR1 is structurally disordered (Fig. 4A). To determine whether NR1 is indeed an IDD, the protein was subjected to biophysical and hydrodynamic characterization. Recombi-nant NR1 displays aberrant migratory behavior when analyzed by SDS-PAGE, characteristic of intrinsic disorder (20). The protein migrates with an apparent molecular mass of ϳ37 kDa (Fig. 4B), although it possesses an experimentally verified mass of 20.8 kDa (including hexahistidine tag and linker; data not shown). Similarly, the CD spectrum of the protein is consistent with an absence of secondary structure, with a single minimum centered around 200 nm and no strong negative signals above 205 nm (Fig. 4C). Deconvolution of this spectrum employing the CDSSTR algorithm (21) and reference sets 6 and 7 (22,23) gives a predicted disorder content of 79 -81%.
To further investigate the structure of NR1, this protein was analyzed by SAXS, a technique that is widely recognized as indispensable for the assignment and characterization of IDDs (24). SAXS data were processed to yield Guinier and Kratky plots and to calculate the shape and size of NR1 (Fig. 4D). The Guinier plot is linear in Q 2 in the low angle region confirming that recombinant NR1 is monodisperse in solution (data not shown). Analysis of the Guinier region (R g Q Ͻ1.3) yields a radius of gyration of ϳ67 Å, consistent with an extended conformation in solution. The Kratky plot derived from the NR1 scattering curve contains a distinct peak indicative of globular structure; however, the data do not progress to zero at high Q, suggestive of significant disorder content. Calculated dimensions of NR1 reveal an elongated structure of ϳ250 ϫ ϳ70 Å in size, consistent with a loosely compacted but predominantly disordered structure. This contrasts with theoretically calculated dimensions of ϳ560 ϫ ϳ2 Å for the fully disordered polypeptide.
Crystallization and Structure Determination of NR2-Having established that NR1 is an IDD, we next turned our attention to the NR2 domain of CshA. Both monomeric and dimeric forms of this protein were subjected to crystallization screening. Monomeric NR2 proved recalcitrant to crystallization; however, dimeric NR2 was readily crystallizable. The structure of this protein was determined to 2.7 Å resolution in space group P6 2 22 using the single wavelength anomalous dispersion (SAD) method as applied to selenomethionine (SeMet)-labeled crystals of dimeric NR2. The asymmetric unit includes two copies of NR2, each of which forms a strand-swapped homodimer with a partner monomer from a neighboring asymmetric unit (Fig. 5A). Compelling electron density was not observed for the residues comprising the vector-encoded His 6 tag and cleavable linker and the residues 313-331, 393-397, and 488 -560 in either of the two copies of NR2. These residues have consequently been omitted from our final model. Inspection of the NR2 strand-swapped dimer reveals that each of the two monomers buries its three C-terminal ␤-strands (residues 447-488) into the central core of its dimer partner (Fig. 5A). Such extensive strand-swapping is unusual even in entirely ␤-strand proteins (25). In Fl_CshA, the N and C termini of NR2 are tethered to their neighboring NR1 and NR3 domains, respectively, an organization that would disfavor NR2 strand-swapping. Furthermore, previous studies of heterologously expressed Fl_CshA have revealed no evidence of dimer or higher oligomer formation (4,13). Together, these observations imply that the observed dimerization is an artifact of protein overexpression. This provides further support for the validity of conducting our in vitro binding assays with monomeric recombinant NR2.

NR2 Is a Discrete Ligand Binding Domain with a Lectin-like
Fold-Based on our NR2 strand-swapped dimer crystal structure, a model of the biologically relevant monomeric form of the protein was produced (Fig. 5, B and C). Inspection of this model revealed a globular protein with a lectin-like fold, comprising a ␤-sandwich core decorated with ␣-helices. The sandwich is formed from two anti-parallel ␤-sheets of four and five strands respectively, with the interface between the two sheets populated by predominantly hydrophobic amino acids. The ␤-sandwich is augmented by a series of short ␣-helices co-located at one end of the protein, which pack against strands 5, 7, and 10. The model suggests that the residues 313-331, for which compelling electron density was not observed in the NR2 dimer crystal structure, form a large flexible loop that caps a highly negatively charged pocket on the surface of the protein (Fig. 5, B and C). Given the absence of any other cavities, clefts, or highly charged surfaces, it is likely that this pocket constitutes the ligand-binding site of NR2.
Identification of Structural Relatives of NR2 Supports a Role in Carbohydrate Binding-Despite minimal sequence identity to any protein of known structure, a DALI search using monomeric NR2 identifies a number of structurally related proteins. These include the N1 region of the S. gordonii adhesin Sgo0707 (Z-score ϭ 8.9 (26)), the adhesive V domains of the antigen I/II polypeptides SspB (Z-score ϭ 7.1 (27)), and SpaP (Z-score ϭ 7.0 (28)), and the endo-␤-1,4-galactanase TmCBM61 from Thermotoga maritima (Z-score ϭ 5.8 (29). Structural identity between NR2 and its relatives is predominantly restricted to the ␤-sandwich core of the protein, with significant divergence in the extent and identity of structural elements that decorate this subassembly (Fig. 6). Each of the structural homologues identified are involved in binding carbohydrates or glycoproteins, and with the exception of TmCBM61, they act as the adhesive component of a cell wall-anchored polypeptide in a Gram-positive bacterium.

Discussion
In previous studies we have established the important role played by the S. gordonii cell wall-anchored polypeptide CshA in mediating adherence to host and microbial cell surfaces (4,7,14,15). Here we extend the scope of our investigations to include detailed molecular level characterization of the Fnbinding "non-repetitive region" of this protein. This portion of CshA possesses an aa sequence distinct from that of any polypeptide for which structural or functional studies have been conducted to date, indicative of novel form or function.
In vitro cFn binding assays of heterologously expressed wild type CshA and truncated variants thereof confirm the critical role played by the non-repetitive region of CshA in facilitating cFn binding. These data also demonstrate that the adhesive properties of CshA are confined to the NR1 and NR2 domains  FEBRUARY 3, 2017 • VOLUME 292 • NUMBER 5 of the protein. Surprisingly, a CshA⌬NR1-expressing strain of L. lactis is more adherent to immobilized cFn than that expressing the wild type polypeptide. These data imply a mechanism of Fn binding by CshA where NR2 forms a dominant tight-binding interaction with its Fn substrate.

Streptococcus gordonii CshA
Results from complementary experiments investigating the kinetics of Fn binding by recombinant NR proteins are consistent with our heterologous expression studies. NR1 is found to bind both cFn and pFn with a K D value in the low micromolar range. NR2 binds both forms of Fn with greater affinity than NR1, demonstrating that this domain forms a stable, less dissociable interaction with Fn than its NR1 neighbor. Conversely, NR1 exhibits both a k a and k d value for c/pFn greater than that of NR2, indicating that NR1 engages and disengages Fn more rapidly. NR3, which was found to be dispensable for Fn binding in our heterologous expression studies, is shown using the BLitz technique to bind cFn and pFn only weakly, indicating that this domain may not significantly contribute to Fn binding in vivo. Together, our data are consistent with a mechanism of Fn binding by CshA that involves a rapidly formed but readily dissociable NR1-Fn complex, in tandem with a higher affinity, but less frequently formed NR2-dependent interaction.
To further probe the mechanism of Fn binding by CshA, recombinant NR1 and NR2 were subjected to structural characterization. NR1 is shown to exhibit all the hallmarks of an IDD, including aberrant hydrodynamic behavior and an absence of secondary structure (20,30). SAXS analysis of NR1 demonstrates that the protein adopts a disordered but partially compacted structure in solution. Such a conformation is typical of IDDs, which form loosely assembled dynamic structures, stabilized by residual local and long-range weak intramolecular forces (31). This results in inherent conformational adaptability and, in the case of IDDs that are involved in protein or ligand binding, confers an expanded capture radius, enabling target binding over greater distances than can be achieved by globular proteins (32). Target engagement by IDDs may be concomitantly accompanied by partial or complete recovery of a fold state, a process that often brings both partners into closer proximity, potentially enhancing the rate of binding by the so-called "fly-casting" mechanism (33). Given that the energy needed to recover secondary structure is abstracted from the interaction energy of binding, IDDs generally form low affinity interactions with their binding partners, characterized by remarkably fast on-and off-rates (34). Fn binding by NR1 exhibits many of these defining characteristics, including rapid complex formation and dissociation and a weak yet specific relative binding affinity. Interestingly, NR1 is devoid of sequence motifs that have been previously implicated in IDD-mediated bacterial adherence to cell-surface molecules, implying that Fn binding by CshA NR1 may proceed via a mechanism that is distinct from, for example, the tandem ␤-zipper model (35). There was no evidence of NR1-NR2 complex formation, as assessed by native PAGE, when the two domains were mixed in vitro, implying that NR2 does not bind NR1 and act to template its folding.
In contrast to NR1, the NR2 domain of CshA is shown to adopt a globular structure with a lectin-like fold. The protein presents a clearly identifiable ligand-binding site on its surface that is populated by a combination of aromatic and negatively charged residues. This composition is consistent with a role for NR2 in the binding of carbohydrates or glycoproteins, a function further supported by our in vitro binding data, and the identification of structural homologues of NR2 that mediate protein-carbohydrate interactions (26 -29, 36). The ligandbinding site of NR2 is capped by a sizeable loop, for which no electron density was observed during structure elucidation. The absence of compelling electron density in this region is indicative of flexibility. It therefore appears likely that the capping loop plays a role in gating access to the NR2-binding site, potentially acting to expose this highly charged pocket during Fn engagement. Based on available structural and functional data, it is not possible to unambiguously identify the specific glycosylation(s) upon Fn that are bound by NR2, although this represents a major focus for future investigations.
In summary, we report herein a structural and functional dissection of the Fn-binding, non-repetitive region of the S. gordonii fibrillar adhesin CshA. Uniquely, CshA appears to employ a bipartite mechanism of target binding that involves the coordinated action of a low affinity but high capture radius IDD, in partnership with a high affinity, high specificity ligand binding domain. Our data suggest a two-step catch-clamp Fn-binding mechanism, wherein the disordered NR1 domain of CshA acts to engage and "catch" Fn, via a process that is expedited by its elongated, disordered conformation. The resulting NR1-Fn pre-complex is rapidly formed and dissociated, although it may be stabilized via an NR2-mediated tight binding interaction, which functions to "clamp" CshA and Fn together (Fig. 7). Such a mechanism offers an elegant solution to the problem of forming highly specific intermolecular interactions within molecularly rich environments such as the human host. A number of fibrillar adhesins have been identified that incorporate IDDs, and as such the catch-clamp mechanism outlined herein may be of general applicability to other bacterial surface proteins that possess such domains (37). Finally, the unusual target binding mechanism exhibited by CshA, along with molecular insights provided herein, presents a framework for the development of new anti-adhesive interventions that target disease causing streptococci and related bacteria. Such interventions could be based on multicomponent agents that target both the IDD and ligand-binding steps during adherence.

Experimental Procedures
Bioinformatic Analysis and Reassignment of CshA Domain Architecture-Bioinformatic analysis and homology modeling studies were performed using non-proprietary software or publicly accessible web servers. In cases where the CshA amino acid sequence was too long for analysis, the protein sequence was sub-divided into appropriately sized regions and each was analyzed independently. Domain composition of CshA and the identification of domain boundaries were performed using Interpro (38) integrating 11 databases, including Pfam (39), PRINTS (40), PROSITE (41), and ProDom (42), to provide prediction of protein families and domains. Complementary analyses were performed using the DomPred server (43). Disorder prediction was performed using PONDR-FIT (19), secondary structure prediction performed using PSIPRED (44), and signal peptide prediction performed using RPSP (45), and the presence of coiled-coils was established using COILS (46). Where a suitable template or templates could be identifies, SWISS-MODEL (47)(48)(49)(50) or PHYRE2 (51) were used to generate homology models of identified CshA domains to allow more accurate assignment of domain boundaries.
Generation of L. lactis Heterologous Expression Strains-Plasmids (Tables 2 and 3) or PCR amplicons were purified using QIAquick Spin Miniprep or PCR purification kits, respectively (Qiagen). Oligonucleotides were synthesized by MWG Eurofins. Chromosomal DNA was extracted from S. gordonii DL1 as FIGURE 7. Schematic depicting the catch-clamp mechanism of Fn binding by CshA. The intrinsically disordered NR1 domain of the protein rapidly engages and binds Fn in a process expedited by the sizable capture radius of the domain. Fn binding results in the formation of a dissociable pre-complex that may be accompanied by the recovery of NR1 secondary structure. The resulting pre-complex is stabilized by a high affinity binding interaction mediated by the NR2 domain of CshA.  Contains the E. faecalis prgB gene, encoding aggregation substance, downstream of a nisin-inducible promoter PnisA (Erm R ) (18) described previously (53). Unless otherwise stated, DNA was PCR-amplified with PrimeSTAR GXL DNA polymerase (Clontech). Primers pMSP-cshA.F/R (Tables 2 and 3) were designed to amplify the entire cshA gene from S. gordonii DL1 chromosomal DNA, incorporating unique NcoI/XhoI restriction endonuclease sites at its termini. The purified amplimer was cloned into vector pMSP7517 (18) using ligation-independent In-Fusion HD cloning kit (Clontech). The resultant construct (pMSP::cshA) was transformed into competent E. coli Stellar TM cells (Clontech), recovered, and confirmed by sequencing. This construct was employed as template for in-frame deletion of NR1 (primers cshA.Rev2/Fwd2) or NR1 ϩ 2 (primers cshARev.primerA/cshAFwd.primerB) from cshA, based upon a circular PCR method (54) with 5Ј-phosphorylated primers. Following PCR amplification with Phusion High-Fidelity DNA polymerase (New England Biolabs), template was removed by DpnI digestion, and purified amplimers were self-ligated using T4 DNA ligase. Constructs were transformed into E. coli Stellar TM cells and confirmed by sequencing. Consequently, the three pMSP-derived constructs were electroporated into L. lactis MG1363, as described previously (55). Transformants were recovered on GM17 agar supplemented with erythromycin, and expression of CshA, CshA⌬NR1, or CshA⌬NR1 ϩ 2 on the surface of L. lactis was verified by SDS-PAGE and Western immunoblot analyses. Western Immunoblotting Analysis-Expression of CshA, CshA⌬NR1, or CshA⌬NR1 ϩ 2 on the surface of L. lactis was induced by culturing cells in the presence of 10 ng of nisin/ml. Suspensions were adjusted to A 600 nm 1.5 and spotted (2 l) onto nitrocellulose membrane. The membrane was blocked with TBS (50 mM Tris-HCl, pH 7.6, 0.15 M NaCl) containing 10% (w/v) milk powder, and subsequently probed with rabbit ␣-CshA-RR (15) followed by swine anti-rabbit IgG-horseradish peroxidase (HRP) conjugate (Dako), both diluted 1 in 1000 into TBS supplemented with 0.1% Tween 20 and 1% milk powder. The membrane was then developed using Amersham Biosciences TM ECL TM Western blotting analysis system (GE Healthcare) according to manufacturer's instructions.
Protein Expression and Purification-E. coli BL21 (DE3) cells harboring CshA domain expression plasmids were grown in LB medium supplemented with carbenicillin at 37°C to A 600 ϭ 0.4 -0.6. Protein expression was induced by adding IPTG (1 mM) and incubating cultures for 16 h at 18°C. Bacteria were harvested by centrifugation, suspended in 20 mM Tris-HCl, 150 mM NaCl, pH 8, and lysed with a cell disruptor (Z Plus Series cell disruptor, Constant Systems Ltd.) at 25,000 p.s.i. Cell supernatants were applied to a 5-ml Hi-Trap chelating column (pre-loaded with nickel, GE Healthcare) and eluted with an imidazole gradient (10 -500 mM) over 15 column volumes. Fractions containing CshA proteins were pooled, concentrated, and further purified by passage through a Hi-Load 16/60 Superdex 75 column (GE Healthcare) pre-equilibrated in 20 mM Tris-HCl, 150 mM NaCl, pH 7.5. Fractions containing CshA proteins were pooled and concentrated by ultrafiltration to 10 mg/ml.
For the production of selenomethionine-labeled NR2, a culture of E. coli BL21 (DE3) cells harboring pOPINF:cshA-NR2 was grown in LB medium supplemented with carbenicillin at 37°C for 16 h. Cells were harvested by centrifugation, washed three times, suspended in double-distilled water, and used to inoculate 1 liter of SelenoMet TM medium (Molecular Dimensions Ltd.) containing carbenicillin, supplemented with Sel-enoMet Nutrient Mix (Molecular Dimensions Ltd.). The culture was grown at 37°C with shaking to A 600 ϭ 0.6. A mix of amino acids was added to the culture (100 mg of lysine, 100 mg of phenylalanine, 100 mg of threonine, 50 mg of isoleucine, 50 mg of leucine, 50 mg of valine, 60 mg of selenomethionine), which was then incubated with shaking, for 15 min, at 37°C. IPTG was added (final concentration 1 mM) and the culture incubated for 16 h at 18°C. Cells were harvested and processed for protein purification as above with the addition of 2 mM tris(2-carboxyethyl)phosphine to all chromatography buffers.
Mass Spectrometry-For accurate intact mass determination, a sample of recombinant CshA NR1 with the His tag intact was prepared. Mass analysis was performed using an Ultrafle-Xtreme mass spectrometer (Bruker Daltonik) in positive reflectron ion mode. Data analysis was performed using the Bruker Flex Analysis software.
Biolayer Interferometry-The BLItz system (ForteBio) was used to measure binding affinity and kinetics of the interaction between cFn or pFn with CshA recombinant protein fragments. All experiments were performed at 20°C, using PBS-based, pH 7.4, Kinetics Buffer (ForteBio). c/pFn were immobilized onto Dip and Read TM Amine Reactive Second-Generation (AR2G) biosensors using N-hydroxysuccinimide/1-ethyl-3-(3dimethylaminopropyl)carbodiimide (NHS/EDC)-catalyzed amide bond formation at 20 g/ml in 10 mM acetate buffer, pH 4.0, for 5 min. Probes were quenched in 1 M ethanolamine, pH 8.5, and used for kinetic assays of CshA protein activities. The association and dissociation kinetics of the CshA proteins were performed at concentrations of 5, 15, 25, and 50 M. The kinetic dataset was globally fitted employing a 1:1 binding model.
CD Spectroscopy-CD spectra of 1 mg/ml recombinant NR1 in 20 mM phosphate, pH 7.5, 150 mM NaCl, were collected using an Aviv model 410 CD spectrometer at 20°C, using a 0.05-cm path length cuvette, with a wavelength interval of 1 nm and a data collection time of 1 s. The buffer-only spectrum was sub-tracted, and the data were converted to mean residue ellipticity. The CDSSTR algorithm (22,23) and associated reference data sets were used for data analysis, accessed via the DichroWeb server (21).
Small Angle X-ray Scattering-SAXS data of CshA NR1 were collected at Diamond Light Source (beamline I22) at a wavelength of ϭ 1 Å and a sample-to-detector distance of 2.5 m. Three sample concentrations were measured: 8, 4, and 2 mg/ml. Samples were exposed for 200 frames of 1 s. Each frame was sector-integrated with in-house beamline software. There was no evidence of concentration-dependent effects; thus data collected for the 8 mg/ml sample of NR1 were used for further analysis. Buffer scattering was subtracted using in-house software Scientific Data Analysis for non-crystalline diffraction. Particle distance distribution function (p(r)) and maximum intra-particle dimension (D max ) were calculated with GNOM (57). PRIMUS (58) was used to calculate the radius of gyration (R g ) employing the Guinier approximation and to generate a Kratky plot. The shape of the protein was evaluated using DAMMIF (59).
Protein Crystallization-Conditions supporting the growth of crystals of native and SeMet-labeled dimeric CshA NR2 were initially identified using the hanging drop method of vapor diffusion at 20°C and commercially available screens. Diffraction quality crystals of unlabeled or SeMet-labeled CshA NR2 were grown in 0.2 M Na/K tartrate, 20 -25% polyethylene glycol 3350, MMT buffer (DL-malic acid, MES, and Tris base in the molar ratios 1:2:2), pH 5.0. Crystals selected for diffraction data collection were mounted in appropriately sized litholoops (Molecular Dimensions Ltd) and flash-cooled in liquid nitrogen without additional cryoprotection prior to analysis.
Diffraction Data Collection and Structure Determination-Diffraction data were collected at Diamond Light Source, UK, on beamlines I03 and I04-1. Data were processed with iMosflm (60) and scaled with Scala (61) as implemented in the CCP4 suite (62). Two sets of CshA_NR2 SeMet-labeled data were integrated using iMosflm and scaled with Scala. These data were then prepared for merging and scaling in Pointless (61,63) and merged with Aimless (61,64) to scale together multiple observations of reflections. The structure of CshA_NR2 was initially determined in space group P6 2 22, with two molecules in the asymmetric unit, to 3.5 Å resolution, using the SAD method as applied to SeMet-labeled crystals of dimeric NR2. Identification of heavy atom sites and the resulting initial phase calculation was carried out using Phenix Autosol (65). 18 selenium sites were located with a figure of merit of 0.35. The output model was refined utilizing Refmac5 and employed as a molecular replacement search model to elucidate a higher resolution NR2 structure (2.7 Å) using PhaserMR (66), employing diffraction data collected from a crystal of unlabeled NR2. The resulting model was subjected to additional cycles of automated model building with Phenix AutoBuild (67), followed by iterative rounds of manual model building and refinement using COOT (68) and Refmac5 (69), respectively. Data collection, phasing, and refinement statistics for CshA NR2 are provided in Table 4. Protein structure figures have been prepared with PyMOL (Schrödinger, LLC).
Electrostatic Potential Calculations-For figures showing an electrostatic potential projected on the molecular surface of protein structures, the Poisson-Boltzmann electrostatic potential on the solvent-accessible surface is shown, with potential values ranging from Ϫ15 kT/e (red) to 15 kT/e (blue). This was calculated using the APBS plug-in (70) in PyMOL and a PQR file generated using PDB2PQR (71), with PARSE force-field charges after protonation state assignment by PropKa (72,73).   FEBRUARY 3, 2017 • VOLUME 292 • NUMBER 5