Targeted Protein Engineering Provides Insights into Binding Mechanism and Affinities of Bacterial Collagen Adhesins*

Background: Collagen adhesins play critical roles in bacterial colonization. Results: Several structural aspects influence binding affinity, binding ratio, and kinetics of binding to type I collagen. Conclusion: Discrete structural elements within the bacterial collagen adhesin control ligand affinity and binding mechanism. Significance: Understanding the collagen/MSCRAMM interaction sheds insight on how domain architecture influences the mechanism of collagen binding and affinity. The collagen-binding bacterial proteins, Ace and Cna, are well characterized on the biochemical and structural level. Despite overall structural similarity, recombinant forms of the Ace and Cna ligand-binding domains exhibit significantly different affinities and binding kinetics for collagen type I (CI) in vitro. In this study, we sought to understand, in submolecular detail, the bases for these differences. Using a structure-based approach, we engineered Cna and Ace variants by altering specific structural elements within the ligand-binding domains. Surface plasmon resonance-based binding analysis demonstrated that mutations that are predicted to alter the orientation of the Ace and Cna N1 and N2 subdomains significantly affect the interaction between the MSCRAMM (microbial surface components recognizing adhesive matrix molecule) and CI in vitro, including affinity, association/dissociation rates and binding ratio. Moreover, we utilized this information to engineer an Ace variant with an 11,000-fold higher CI affinity than the parent protein. Finally, we noted that several engineered proteins that exhibited a weak interaction with CI recognized more sites on CI, suggesting an inverse correlation between affinity and specificity.

Bacterial colonization of host tissues is a critical step in the infection process, and many pathogenic bacteria produce proteins that promote bacterial adhesion. One such family of proteins, the MSCRAMMs 3 (microbial surface components recognizing adhesive matrix molecules), interacts with components of the host extracellular matrices (1)(2)(3). MSCRAMMs found on Gram-positive bacteria typically belong to a family of cell wall anchored proteins that have similar structural organizations. In general, the N-terminal portion of these proteins contains the ligand-binding domain in the so-called A region, whereas the C-terminal half contains repeated motifs often known as the B domains that appear to extend and project the ligand-binding domain away from the bacterial surface (4). The C terminus also contains the cell wall sorting motifs, including the LPXTG sequence, which is essential for sortase-dependent anchoring to the cell wall, followed by a stretch of hydrophobic residues and a short segment of positively charged amino acids (5,6).
Despite a conserved structural organization, MSCRAMMs are largely species-specific, and homologues with substantial sequence similarities within the ligand-binding domains are not found in other bacteria. One exception is a family of collagen-binding MSCRAMMs, which includes Cna from Staphylococcus aureus (7), Ace from Enterococcus faecalis (8), Acm from Enterococcus faecium (9), Cne from Streptococcus equi (10), Cnm from Streptococcus mutans (11), RspA and RspB from Erysipelothrix rhusiopathiae (12), and BA0871 and BA5258 from Bacillus anthracis (13).
Cna and Ace are the most extensively studied members of this family of collagen-binding MSCRAMMs. The Ace and Cna ligand-binding regions are composed of N 1 and N 2 , subdomains that each adopt an IgG fold variant (14 -16). Structural comparisons of Cna  in the apo form and in complex with a synthetic collagen peptide led to the proposal of a multistep ligand-binding mechanism called the Collagen Hug (15). In this model, ligand binding is initiated by a low affinity interaction between the ligand and residues present within a shallow "trench" located on the N 2 subdomain. Next, the interdomain linker wraps around the triple helical collagen and repositions the N 1 subdomain. Finally, the complex is stabilized when the C-terminal N 2 extension, acting as a "latch," inserts into a cleft and complements a ␤-sheet within the neighboring N 1 subdomain (15). In the resulting complex, the MSCRAMM forms an interdomain hole through which the collagen projects. Structural comparisons and mutagenesis studies indicate that Ace employs a similar ligand-binding mechanism (16).
The crystal structures of Cna  and Ace  indicate that the hole (consisting of the interdomain linker and the N 1 and N 2 subdomains) created by ligand hugging can only accommodate a single collagen triple helix. Therefore, it is likely that Cna and Ace interact with monomeric triple helix collagen but not with fibrillar collagen (15,16). Furthermore, Cna 31-344 does not recognize gelatin or denatured collagen, indicating that the rod-like shape of the triple helix is critical for binding (15).
Expression of Ace and Cna on the surface of E. faecalis and S. aureus, respectively, results in increased bacterial adherence to immobilized collagen type I (CI) (8,(17)(18)(19). In addition, surface expression of Ace mediates E. faecalis adherence to collagen type IV (CIV), laminin, and dentin (20,21), whereas the presence of the cna gene has been associated with adherence of S. aureus clinical isolates to laminin and CIV (22). Furthermore, several studies demonstrate that Ace and Cna are virulence factors in different animal models. S. aureus strains harboring the cna gene exhibit increased virulence in animal models of staphylococcal infection, including endocarditis, arthritis, osteomyelitis, mastitis, and keratitis (18,(23)(24)(25)(26). It has also been demonstrated that ace-null strains of E. faecalis are attenuated in a murine model of urinary tract infection and a rat endocarditis model (27). In addition, recombinant forms of Cna can be used as effective vaccine components, whereas passive immunization with Cna antibodies is protective in a S. aureus septic death mouse model (28). Also, active immunization of rats with Ace 32-367 significantly decreases the incidence of endocarditis, whereas prophylactic treatment of rats with purified Ace antibodies significantly reduces the numbers of E. faecalis isolated from vegetations as well as the number infected (27).
Despite the high degree of structural similarity between Ace 32-367 and Cna 31-344 , these proteins demonstrate significantly different affinities and binding kinetics for CI in vitro (8,15,16). To understand the bases for these differences, we used a structure-based approach to engineer proteins harboring alterations within specific structural elements. Surface plasmon resonance (SPR)-based binding analysis demonstrates that sequence alterations within the N 1 subdomains of Ace and Cna significantly affect the stability of the interaction between the MSCRAMM and the ligand in vitro. The mutations, which are predicted to alter the orientation and range of motion of the N 1 and N 2 subdomains, influenced collagen affinity, rates of association/dissociation, and binding site preference. We also found that the sequence and length of the interdomain linker influenced binding of the proteins to immobilized CI and determined that although there is an optimal length for the interdomain linker, the sequence of the interdomain linker is more important.

EXPERIMENTAL PROCEDURES
Medium and Growth Conditions-Overnight cultures of Escherichia coli strains were grown at 30°C in LB medium (37°C for plasmid extraction) with shaking (200 rpm). For protein expression, E. coli TOPP3 was diluted 1:100 into fresh LB medium and grown at 37°C with shaking (250 rpm). During logarithmic growth phase, 200 M isopropyl-1-thio-␤-D-galactopyranoside was added, and growth was continued at 30°C for 16 h, after which cells were harvested by centrifugation at 3500 rpm for 20 min using a Sorvall RC 3B Plus centrifuge. Pellets were frozen at Ϫ20°C. Ampicillin (Sigma-Aldrich) or carbenicillin (Sigma-Aldrich) was used at concentrations of 100 g/ml.
DNA Manipulation and Plasmid Construction-Sequence changes to the DNA regions encoding the interdomain linker and the C-D loop of ace and cna were introduced using sitedirected mutagenesis (29) or overlap extension PCR (30) using primers listed in supplemental Table S1 in a mixture containing Phusion polymerase (Thermo Scientific, Vantaa, Finland), HF buffer (Thermo Scientific), 50 ng of DNA template, and 200 -250 nM dNTP mix (Invitrogen). Following digestion with DpnI (New England Biolabs, Ipswich, MA), PCRs generated from site-directed mutagenesis were transformed into E. coli TOPP10 using standard procedures. Purified overlap PCRs were incubated at 72°C with Taq polymerase (New England Biolabs) and 200 nM dATP (Invitrogen) and purified using DNA clean and concentrator 5 (Zymo Research, Carlsbad, CA) prior to ligation into pGEMTeasy (Promega, Madison, WI).
Sequencing was performed to verify site-directed mutagenesis and pGEMTeasy constructs containing inserts (Genewiz, Houston, TX). Following digestion with BamHI and SalI (New England Biolabs), the sequence of interest was purified from pGEMTeasy using the gel extraction kit (Zymo Research) and ligated into the previously digested expression vector, pQE30, using T4 DNA ligase (Promega). Standard procedures were followed for DNA transformation into E. coli TOPP10 cells. Final plasmids are described in supplemental Table S2.
Protein Purification-The His-tagged recombinant proteins were purified with a 5-ml nickel-charged HiTrap chelating column (GE Healthcare, Uppsala, Sweden) and a 5-ml anion exchange Sepharose column (GE Healthcare) as described by Barbu et al. (31). Fractions obtained from the anion exchange column were concentrated and dialyzed against PBS, 10 mM EDTA, pH 7.4. Recombinant proteins were Ͼ95% pure. The concentration of the recombinant protein was determined by absorbance measured at 280 nm and calculated using the extinction coefficient for each protein.
Molecular Modeling-Molecular modeling studies were performed using Coot (32) and Insight II (Accelrys, Inc., San Diego, CA). To build a model of Ace in the Cna conformation, individual N 1 and N 2 domains of Ace (Protein Data Bank (PDB) id: 2Z1P) were superimposed on corresponding N 1 and N 2 domains of the crystal structures of Cna (PDB id: 2F68). Superposition based on secondary structure matching (33) available in Coot (32) was used for the superposition. One hundred and eighteen C␣ atoms in the N 1 subdomain and 137 C␣ atoms in the N 2 subdomain aligned with root mean square deviations of 1.67 and 1.50 Å, respectively. The model of the Ace molecule in the Cna conformation showed steric clashes between the N 1 and N 2 subdomain. To build a molecular model of the AceC4 harboring the Cna N1 region (C-D loop, D-strand, and D-DЈ loop), a homology model was built using the HOMOLOGY module in the Insight II package (Accelrys, Inc.). After building a model of the Ace N 1 domain with the Cna N 1 region, this domain was superimposed on the Ace with Cna conformation to build a model of AceC4.
SPR Analysis-The interactions of Cna 31-344 , Ace  , and the mutant derivatives with immobilized rat tail CI (R&D Systems, Inc., Minneapolis, MN) were characterized using a Bia-core 3000 (GE Healthcare/Biacore) at 25°C. Sensor chip C1 and amine-coupling kits were obtained from the same company and used to covalently attach CI onto the sensor surface using the amine coupling procedure as recommended by the manufacturer. The matrix-free C1 chip was chosen because it can achieve low density immobilization and avoid mass transfer limitation. Also, the flat surface allows interactions to take place closer to the surface, which is beneficial when working with multivalent interaction partners and large molecules. Briefly, after the surface was activated for 2 min, 10 l of 5 g/ml CI solution (stock of 5 mg/ml in 0.01 N HCl was diluted in 10 mM sodium acetate, pH 5.5) was injected into the flow cell at a flow rate of 5 l/min. Approximately 440 response units (RU) of collagen were immobilized. Lower (ϳ150 and 180 RU) density collagen surfaces were also prepared by adjusting the activation time and volume of CI injected. A Cna surface was prepared on a separate sensor chip using 5 g/ml Cna 31-344 in 10 mM sodium acetate buffer, pH 5.5. The reference flow cells were prepared with activation and deactivation steps where no protein was coupled. Phosphate-buffered saline (8 mM Na 2 HPO 4 , 1.5 mM KH 2 PO 4 , pH 7.4, 2.7 mM KCl, 137 mM NaCl, and 0.005% Tween 20) was used as running buffer. Binding was performed at a relatively high flow rate of 30 l/min, although no mass transfer was observed in the system. To regenerate the sensor surface, bound protein was removed by flowing 10 mM glycine (pH 1.5) over the surface for 1 min.
Base line-corrected SPR response curves (with buffer blank run further subtracted) were used for affinity determination. For steady-state analysis, equilibrium response (R eq ) of each injection was collected and plotted against the concentration (A) of injected protein. A binding isotherm was fitted to the data using equation 1 (GraphPad Prism 4, GraphPad Software, Inc., La Jolla, CA) to obtain the equilibrium dissociation constant (K D ) and binding response maximum (R max ).
Nonequilibrium data were globally fit to a predefined twostate model using the BIAevaluation software (Version 4.1). The R max , association and dissociation rate constants (k a1 , k d1 ) for the binding state (A ϩ B 7 AB), and forward and backward rate constants (k a2 , k d2 ) for the conformational change state (AB 7 AB*) were obtained from the fitting with T-value Ն100 as criteria for acceptable fit. This model describes 1:1 binding of analyte to immobilized ligand followed by a conformational change in the complex (34). It simply assumes that the only way AB* can dissociate to release free A is through prior conversion to the form AB, where A is Cna or Ace protein in solution, B is immobilized collagen, AB is the complex formed by binding A to B, and AB* is the complex formed by conformational change from AB. The apparent dissociation constant (K D app ) was calculated from the kinetic parameters obtained from the curve fitting.
Theoretical maximum response for 1:1 binding (R max 1:1 ) was calculated using Equation 3, where R immob is the SPR response of the protein immobilized on the sensor surface and MW sol and MW immob are molecular weight of soluble and immobilized protein, respectively. Molecular weights for Ace and Cna are ϳ39,000 and ϳ36,000, respectively. Because the collagen molecule was kept at very low concentration in a low salt and low pH buffer during immobilization, the monomer (ϳ300 kDa) conformation was maintained.
Molar binding ratio (N) is defined as the number of soluble proteins bound to a single immobilized molecule and is determined by comparing the experimental value R max with R max

RESULTS AND DISCUSSION
Comparisons of the Cna  and Ace  Structures and Interactions with Collagen-Superpositions of apo crystal structures of Cna 31-344 and Ace 32-367 are shown in Fig. 1A. Structural comparisons of the two structures revealed that the interdomain orientations of the Ace 32-367 (PDB id: 2Z1P) N 1 and N 2 subdomains are altered by about 36°when compared with that of Cna 31-344 (PDB id: 2F68) (16) with the interdomain linker residues disordered in both the apo structures. As a result, the apo-Ace 32-367 structure is in a partially open conformation with the latch partially closed, whereas Cna 31-344 exhibits a latch-closed conformation (16). Consistent with previous studies, Ace 32-367 and Cna 31-344 exhibit different binding affinities and kinetics in SPR analysis of MSCRAMM binding to immobilized CI (Table 1, Fig. 1B) (8,15,16). Cna 31-344 has a higher affinity for collagen (K D app ϳ66.0 nM) than Ace 32-367 (K D ϳ43.2 M). Also, the association and complete dissociation of Ace 32-367 with immobilized CI are rapid, and the reaction follows a one-step steady-state binding model (Fig. 1, B and C, Table 1). However, the interaction of CI with Cna 31-344 is more complex, consistent with a two-state conformational change model (Fig. 1, B and C, Table 1).
This simplified two-state reaction model involves two steps: 1) the formation of the CI⅐Cna 31-344 complex (k a1 and k d1 ) and 2) a conformational change that may occur after formation of the complex (k a2 and k d2 ) (Fig. 1C, Table 1). The interaction of Cna 31-344 with collagen involves several steps. We propose that after the initial interaction between collagen and the N 2 subdomain of a Cna 31-344 molecule in the closed form, additional conformational changes occur, including opening of the N 1 -N 2 subdomains, reorientation of the N 1 subdomain, which allows the interdomain linker to wrap around the collagen, and the latching event that stabilizes the collagen⅐Cna 31-344 complex (supplemental Fig. S1). Although we believe that this model is most likely, we cannot exclude a mechanism where the open form and closed form of Cna 31-344 are in an equilibrium and only the open form is capable of binding the collagen molecule (15).
In an effort to understand the molecular bases for these biochemical differences, we compared the structures of the Ace and Cna ligand-binding domains and found significant variation in the interdomain linkers and interdomain orientation (Fig. 1A). The interdomain hole of Ace 32-367 , formed by the interdomain linker and the N 1 -N 2 subdomains, is more spa-cious than that of Cna  . Notably, structure-based sequence alignment showed that the Ace 32-367 interdomain linker is 40% longer than that of Cna 31-344 and likely contributes to the larger interdomain space (Fig. 1D). In addition, the crystal structure of Cna 31-344 in complex with a collagen peptide shows the interdomain linker tightly wrapped around the ligand, suggesting that 8 amino acids is the optimal interdomain linker length. Therefore, it is possible that the Ace 32-367 interdomain linker may be longer than required for monomeric collagen binding. Furthermore, the sequences of the Ace and Cna interdomain linkers share little sequence homology (Fig.  1D). Considering that structural data demonstrate specific contacts between the Cna 31-344 interdomain linker and residues within a synthetic collagen peptide, it is possible that sequence differences within this structural element could influence ligand binding (15).
Structural comparisons also revealed that the interdomain orientations of the Ace 32-367 N 1 and N 2 subdomains are altered by about 36°when compared with that of Cna 31-344 (16). As a result, the apo-Ace structure is in a partially open conformation with the latch partially closed (16). Although it is possible that this altered orientation represents apo-Ace  in equilibrium between the open and closed conformation (16), molecular modeling predicts that Ace 32-367 cannot adopt the interdomain orientation exhibited by Cna  . Specifically, the C-D loop at the N terminus of strand D located within the Ace 32-367 N 1 subdomain is longer than the corresponding C-D loop of Cna   (Fig. 2A). Molecular modeling of Ace 32-367 in the Cna 31-344 conformation predicts that the Ace 32-367 C-D loop located within the N 1 subdomain would clash with the A-B loop of the Ace 32-367 N 2 subdomain (Fig. 2B). Therefore, it is likely that Ace 32-367 uti-  lizes a more open conformation than Cna 31-344 to avoid an apparent steric conflict between the two subdomains. The altered interdomain orientation of the Ace 32-367 N 1 -N 2 subdomains and/or the more spacious interdomain hole of Ace 32-367 could result in a loose or "slippery" binding region, which might contribute to the lower binding affinity and less stable binding, reflected by a faster dissociation when compared with Cna 31-344 (Fig. 1B). To understand how these structural differences contribute to the observed differences in ligand binding, we evaluated the binding kinetics of several recombinant Ace and Cna proteins containing mutations that are predicted to alter the possible orientation of the N 1 and N 2 subdomains as well as the sequence and length of the interdomain linkers (Fig. 3).
Linker Length Affects Binding to Type I Collagen-To address the contribution of interdomain linker length to collagen binding, we engineered Cna and Ace proteins containing elongated and/or truncated interdomain linkers (Fig. 3) and compared the affinity and binding kinetics to immobilized CI using SPR (Figs. 4 and 5). We determined that the interdomain linker length influenced multiple aspects of in vitro CI binding, including affinity and binding ratio (number of MSCRAMMs bound to a single collagen molecule).
Cna proteins containing interdomain linkers harboring insertions or deletions demonstrated significantly lower CI binding affinities than Cna 31-344 , indicating that the length of the Cna interdomain linker is optimal for CI affinity in vitro (Fig. 4, A-C, Table 1). The addition of 3 amino acids to the Cna interdomain linker resulted in a protein (Cna ∧QIE ) with an ϳ40-fold decrease in binding affinity (Table 1, Fig. 4B), whereas deleting three residues from the interdomain linker yielded a protein (Cna ⌬EAG ) with an ϳ9-fold decrease in affinity for CI (Table 1, Fig. 4C). The length of the interdomain linker also influenced the number of sites in CI recognized by the MSCRAMM; Cna ∧QIE showed an increased binding ratio of 16, and Cna ⌬EAG exhibited a decreased binding ratio of 5. A binding ratio of 13 was calculated for Cna   (Table 1). Taken together, these data demonstrate that the native Cna interdomain linker length is optimal for CI binding in vitro as revealed by affinity calculations. However, a longer interdomain linker  may allow the protein to recognize a greater number of binding sites within the collagen molecule (Table 1).
Removing residues from the Ace interdomain linker yielded proteins with CI affinities that were only marginally higher than the parent protein, Ace 32-366 (K D ϳ43.2 M); Ace ⌬QIE exhibited a K D of ϳ29.0 M, and Ace ⌬ETGQIE exhibited a K D of ϳ30.1 M (Fig. 5, A-C, Table 1). However, deleting residues within the Ace interdomain linker negatively affected the number of sites  A, D, and E; and 400 RU for B, C, F, G, and H). Sensorgrams are shown in black with lower curve corresponding to lower concentration of Cna protein injected. Kinetic analysis for A, C, D, and E (the fitted curves are shown in red) was performed to obtain the rate constants (shown in Table 1) K D app and R max . For steady- state interactions (B, F, G, and H), a binding isotherm was created (inset) to determine equilibrium K D and R max . Drawings in each graph indicate the general location of the mutations. The presence of the mutations within the interdomain linker is represented as a filled in arc, and the mutations affecting the interdomain orientation are represented as a filled circle. OCTOBER 5, 2012 • VOLUME 287 • NUMBER 41 recognized on the collagen molecule. Although the parent protein exhibited a binding ratio (N) of 14, binding ratios of 11 and 10 were calculated for Ace ⌬QIE and Ace ⌬ETGQIE , respectively (Table 1). These results indicate that decreasing the length of the Ace interdomain linker does not strongly influence affinity of Ace for CI, but as with Cna, the longer interdomain linker may allow the protein to recognize a larger number of binding sites within CI.  G and H). For steady-state interaction (A-F), a binding isotherm was created (inset) to determine equilibrium K D and R max . Kinetic analysis was performed for G and H to obtain the rate constants (shown in Table 1) K D app and R max . Drawings in each graph indicate the general location of the mutations. The presence of the mutations within the interdomain linker is represented as a filled in arc, and the mutations affecting the interdomain orientation are represented as a filled circle.

Insights on Collagen-hugging MSCRAMMs
Because of the repetitive nature and linear structure of the CI molecule, the multiple MSCRAMM-binding sites observed in SPR likely represent independent binding sites. However, to confirm that the data are not due to aggregation of the injected proteins, we examined binding of solubilized CI to immobilized Cna 31-344 (supplemental Fig. S2). The resulting binding ratio of 1:11 demonstrates that one CI molecule was recognized by ϳ11 Cna 31-344 molecules on the sensor surface, which is close to the N of 11.7 observed in experiments utilizing soluble Cna  and immobilized CI (Table 1). Differences in binding ratio are expected because immobilization is random. Therefore, a portion of the immobilized Cna 31-344 will not be able to interact with the CI molecule. Furthermore, the extremely slow off-rate observed in the SPR sensorgrams in supplemental Fig. S2 is due to the effects of valency, which occurs when a single CI molecule is anchored by multiple immobilized Cna  .
Linker Sequence Affects Affinity and Binding Site Recognition for Type I Collagen-The Ace and Cna interdomain linker sequences share little homology, and structural analysis of the MSCRAMM⅐ligand complex revealed that amino acids within the Cna interdomain linker make specific contacts with the collagen peptide (15). Specifically, Val-172, located near the C terminus of the interdomain linker, interacts with a proline residue from the leading chain of the collagen peptide. This interaction may facilitate repositioning of the N 1 subdomain, which is an important step in the ligand binding process of Cna (15). Currently, a structure for the Ace ligand-binding domain in complex with collagen has not been resolved. Therefore, specific residues within the interdomain linker that make contact with the collagen ligand are unknown. However, based on position within the Ace interdomain linker, it is likely that the C-terminal residues Tyr-176 and/or Pro-177 play a role similar to that of Val-172.
To understand how the sequence of the interdomain linker contributes to collagen binding, we replaced the 8-amino acid interdomain linker from Cna 31-344 with 1) the full-length, 14-amino acid Ace 32-367 linker, 2) a 12-amino acid Ace linker lacking two N-terminal residues, and 3) a 10-amino acid Ace linker lacking two N-and two C-terminal residues, creating chimeric proteins CnaA1, CnaA2, and CnaA3, respectively (Fig. 3). Similarly, the Ace 32-367 interdomain linker was replaced with 1) the 8-amino acid Cna linker, 2) a 10-amino acid sequence containing the Cna interdomain linker plus two C-terminal Ace linker residues, and 3) a 12-amino acid sequence containing the Cna interdomain linker flanked by 2 amino acids each from the N and C termini of the Ace linker to generate the chimeric proteins AceC1, AceC2, and AceC3, respectively (Fig. 3). We found that changes to the length and sequence of the interdomain linkers impacted the binding kinetics of the proteins and underscored the importance of specific residues within the interdomain linker sequence.
SPR analysis revealed that the sequence of the interdomain linker influenced the CI affinity and binding ratio. CnaA1, which contains the full-length Ace interdomain linker, exhibited a higher affinity for CI (K D app ϳ19.0 nM) than the native Cna  , indicating that the sequence of the longer Ace interdomain linker can facilitate high affinity binding to CI in vitro (Fig. 4, A and D, Table 1). Additionally, removing the two N-ter-minal Ace interdomain linker residues, as in CnaA2, yielded a protein with an affinity (K D app ϳ88.5 nM) similar to that of Cna   (Fig. 4, A and E, Table 1), whereas CnaA3, which is lacking the N-and C-terminal Ace interdomain linker residues, exhibited a much lower affinity for CI (K D ϳ11.6 M) (Fig. 4, A and F, Table 1). The binding ratio for CnaA3 was the highest (N ϳ16), whereas the binding ratios of CnaA1 (N ϳ13) and CnaA2 (N ϳ10) were more similar to Cna 31-344 (N ϳ12) ( Table 1).
We determined that changes to the length and sequence of the Ace interdomain linkers resulted in modest changes in CI affinity (Table 1, Fig. 5, A, D, and E). However, the numbers of CI-binding sites were significantly altered (Table 1). Specifically, Ace 32-367 and AceC3, which harbor interdomain linkers containing the N-terminal residues, Val-164 and Thr-165, and the C-terminal residues, Tyr-176 and Pro-177, exhibited binding ratios of 14. Proteins containing interdomain linkers lacking the N-and C-terminal residues, AceC1 (lacks Ace residues Val-164, Thr-165, Tyr-176, and Pro-177) and AceC2 (lacks Tyr-176 and Pro-177), exhibited binding ratios ranging from 4 to 5.
Taken together, the SPR data obtained from the engineered Cna and Ace proteins harboring interdomain linkers with altered sequences and lengths suggest that the N-and C-terminal residues of the Ace interdomain linker positively influence binding to CI in vitro. Moreover, it is possible that the C-terminal residues, YP, present within the Ace interdomain linker sequence stack better with the CI proline residues than the corresponding SV amino acids from the Cna interdomain linker (Fig. 3). As a result, proteins containing the YP residues may be able to anchor the collagen molecule within the N 2 domain, suggesting that the ligand affinity is affected more by the sequence than by the length of the interdomain linker.
The Interdomain Orientation Affects Affinity and Binding Kinetics for Type I Collagen-Changes to the Ace interdomain sequence and length did not result in recombinant proteins that demonstrate stable interactions with immobilized CI, as was observed for the engineered Cna proteins, indicating that another aspect of the Ace 32-367 structure is responsible for the low affinity and fast on-and off-rate. Analysis of the structure of apo-Ace 32-367 reveals a protein with a partially open conformation. The Ace 32-367 N 1 subdomain contains a short D-DЈ loop, a short D strand, and a large C-D loop, whereas the Cna 31-344 N 1 subdomain contains a large D-DЈ loop, a longer D strand, and a short C-D loop (Figs. 3 and 2C). The larger C-D loop likely precludes the Ace 32-367 protein from adopting a conformation that is optimal for ligand binding, thus preventing a tight interaction with collagen (Fig. 2B). In addition, the larger D-DЈ loop found in Cna 31-344 forms part of the latching trench and may yield a stronger latching event than the shorter Ace 32-367 D-DЈ loop. The open conformation and/or weaker latching event may explain why Ace 32-367 has a lower affinity and more rapid off-rate for immobilized CI when compared with Cna 31-344. To understand how structural regions that do not interact with the ligand can influence orientation of the N 1 and N 2 subdomains, we engineered a protein that was predicted to adopt a more closed conformation than Ace  . We replaced the amino acid sequence spanning the Ace 32-367 C-D loop, D-strand, and D-DЈ loop with the corresponding residues from Cna, creating AceC4 (Fig. 3). In addition, we replaced the Insights on Collagen-hugging MSCRAMMs OCTOBER 5, 2012 • VOLUME 287 • NUMBER 41

JOURNAL OF BIOLOGICAL CHEMISTRY 34863
Cna 31-344 C-D loop, D-strand, and D-DЈ loop with the corresponding sequence from Ace to create CnaA4 (Fig. 3), which should adopt a conformation that is more open than that exhibited by Cna  . SPR analysis demonstrated that changes predicted to influence the N 1 and N 2 interdomain orientation affect several aspects of the MSCRAMM/CI interaction. The CI affinity of AceC4 was ϳ11,000-fold higher than Ace 32-367 (Fig. 5, A and G,  Table 1), whereas CnaA4 exhibited an ϳ25-fold decrease in affinity for CI when compared with Cna 31-344 (Fig. 4, A and F,  Table 1). These results suggest that a more closed conformation allows a tighter interaction with CI.
The partially open conformation observed for apo-Ace 32-367 may explain why the purified Ace proteins containing the shortest interdomain linkers (AceC1 and Ace ⌬ETGQIE ) exhibited the lowest binding ratios for CI. In an effort to assess the relationship between the interdomain linker and the interdomain orientation, we created AceC1C4, which contains the shorter Cna interdomain linker and the shorter Cna C-D loop and CnaA1A4 that harbors the full-length Ace interdomain linker and the larger Ace C-D loop (Fig. 4). AceC1C4 showed a 600-fold increase in CI affinity when compared with AceC1, which harbors the Cna linker but not the smaller C-D loop (Fig.  5, A and F, Table 1). However, CnaA1A4 (K D ϳ26.2 M), which contains the Ace interdomain linker sequence and the larger C-D loop, showed an affinity for C1 that was comparable with CnaA4 (K D ϳ37.7 M). Together, these data suggest that removing the apparent steric hindrance from the Ace N 1 and N 2 subdomains allows the Ace protein to accommodate a shorter interdomain linker.
We also found that mutations to the C-D loop region, which would likely influence the interdomain orientation of Ace  and Cna 31-344 proteins, significantly altered the overall binding kinetics of the engineered proteins, resulting in an "Ace-like" Cna and a "Cna-like" Ace. The Cna proteins (CnaA4 and CnaA1A4) containing the larger C-D loop exhibited similar affinities and binding profiles (a rapid association and complete dissociation) to CI as Ace   (Fig. 4, A, G and H, Table 1). On the other hand, SPR analysis of the AceC4 and AceC1C4 interaction with immobilized CI revealed kinetic data that were more similar to results obtained with Cna 31-344 (Figs. 4A and 5, G and H, Table 1). This is most evident when comparing the dissociation of the purified proteins. Dissociation from CI by AceC4 and AceC1C4 is slow and incomplete (Fig. 5, G and H, Table 1) and suggests an interaction with CI that is more complex than the parent protein Ace 32-367 and may involve a conformational change.
Altering the C-D loop also affected the number of CI-binding sites recognized by the proteins. The binding ratios (N) for the purified Ace proteins containing the smaller C-D loop were significantly lower (AceC4 N ϳ5, AceC1C4 N ϳ4) than the parent protein (Ace 32-367 N ϳ14), whereas the binding ratios for CnaA4 (N ϳ15.5) and CnaA1A4 (N ϳ13.6) were higher than the binding ratio for Cna  (N ϳ11.7) ( Table 1), suggesting that a more open conformation may allow the protein to recognize a greater number of binding sites within the CI molecule (Table 1).
Our data demonstrate that the lower affinity and unstable binding kinetics for immobilized CI exhibited by Ace 32-367 are in large part due to the presence of a longer loop that connects the C and D strands of the Ace 32-367 N 1 subdomain, which likely results in an altered orientation of the N 1 and N 2 subdomains when compared with Cna  . Furthermore, these results indicate that the longer Ace interdomain linker may be necessary to accommodate the altered interdomain orientation observed in the apo crystal structure of Ace (16). Surprisingly, the SPR data revealed that a more open conformation may allow the proteins to recognize a greater number of binding sites on the CI molecule as all proteins containing the Ace N 1 sequence, and presumably a more open conformation, exhibited higher binding ratios.
Concluding Remarks-Our studies point to an inverse correlation between affinity and binding site specificity. However, it is not clear whether a high affinity interaction or the ability to recognize multiple binding sites would be more advantageous for mediating bacterial attachment to a collagenous surface. A high affinity interaction between a surface-anchored protein and a host ligand would facilitate attachment and proliferation. On the other hand, recognition of a greater number of binding sites would result in increased avidity, and disassociation at low affinity motifs by a fraction of the MSCRAMMs may not result in detachment of the bacterium from the substrate. We are currently working to understand how alterations within the ligand-binding domains of Ace and Cna affect adherence of E. faecalis and S. aureus to CI, CIV, and Ln in vitro. We are also examining how the presence of the ligand-binding domains affects virulence in different animal models. These studies may also yield insight into how affinity and binding site specificity affect the host/pathogen interaction.
This study demonstrates that the MSCRAMM/CI interaction is complex as several structural features influence ligand binding affinity, specificity, and kinetics. For the Cna-like MSCRAMMs, the interdomain orientation appears to be most important followed by the interdomain linker sequence and finally interdomain linker length. Our data indicate that although the regions within Ace 32-367 that are predicted to interact with collagen should facilitate a tighter interaction, the more open orientation and larger interdomain hole adopted by Ace 32-367 are not optimal for high affinity CI binding. This structural variation may allow Ace to bind additional rodshaped ligands with a relatively larger diameter or recognize different collagen arrangements. Therefore, it is possible that small variations in the structure could dictate ligand specificity, underscoring the need to study key MSCRAMM/ligand interactions individually in submolecular detail.