![]()
|
|
||||||||
J. Biol. Chem., Vol. 282, Issue 27, 19629-19637, July 6, 2007
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1
1

2
From the
Center for Extracellular Matrix Biology, Texas A&M University System Health Science Center, Albert B. Alkek Institute of Biosciences and Technology, Houston, Texas 77030, the ||School of Optometry and Center for Biophysical Sciences and Engineering, University of Alabama at Birmingham, Birmingham, Alabama 35294, the ¶Division of Infectious Diseases, Department of Internal Medicine, and Center for the Study of Emerging and Re-emerging Pathogens, University of Texas Medical School at Houston, Houston, Texas 77030, and the
Department of Crystallography and Biophysics, University of Madras, Guindy Campus, Chennai 600025, India
Received for publication, December 5, 2006 , and in revised form, March 23, 2007.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The molecular pathogenesis of different enterococcal infections has not been elucidated but presumably involves multiple sets of virulence factors (5). As with most other bacterial infections, the adherence of E. faecalis to host tissues likely represents an early critical step in the infection process (6, 7). Extracellular pathogens, such as enterococci, staphylococci, and streptococci, often target extracellular matrix components for attachment and colonization. The adhesins on the surface of microbes mediating these interactions collectively have been named MSCRAMMs3 (8). Most MSCRAMMs on Gram-positive pathogens belong to a family of structurally related cell wall-anchored proteins. These MSCRAMMs contain an N-terminal signal peptide followed by a non-repetitive region called the A region, which in most cases is responsible for ligand binding. The A regions are composed of two or more subdomains each adopting an immunoglobulin G-like (IgG-like) fold. Following the A region is often a segment composed of repeated sequences or motifs that is referred to as the B region. The C-terminal segment has features required for cell wall attachment including an LPXTG-like motif and a hydrophobic transmembrane region followed by a short positively charged cytoplasmic tail at the end (8).
ACE, a collagen binding MSCRAMM, was the first MSCRAMM identified on E. faecalis (911). ACE has a structural organization similar to that of the Staphylococcus aureus collagen adhesin CNA, which has been characterized in some detail (1216). The CNA A region consists of three subdomains: N1,N2, and N3. The N2 subdomain, a 168-amino-acid-long segment, was earlier characterized as the minimum collagen binding region (12). The crystal structure of the N2 subdomain showed that it adopts an IgG-like jelly-roll fold, composed of two antiparallel
-sheets and two short
-helices. Collagen docking studies by molecular modeling identified a shallow groove on one of the
-sheets of N2 as a putative binding interface (17). Binding studies later showed that the N1N2 protein segment binds collagen with a considerably higher affinity than the N2 subdomain alone or the intact A region (18). The crystal structures of the N1N2 segment of CNA, both as an apo-protein and in complex with a synthetic collagen triple helical peptide, were reported recently. Based on the analyses of these two structures, a mechanism for the binding of CNA to collagen called the Collagen Hug (Fig. 1) was postulated (18). In the Collagen Hug model, the N1N2 subdomains of CNA are predicted to adopt an open conformation where a binding trench on the N2 subdomain is accessible and allows a collagen triple helix ligand to dock. As a result, the MSCRAMM appears to wrap around the collagen triple helix where the N1 and N2 subdomains of CNA create a "tunnel-like" structure that "locks" the collagen ligand in between the two subdomains. In a final step, the C-terminal extension of the N2 subdomain acts as a "latch" by inserting into a trench present on the N1 subdomain by
-strand complementation. This event is expected to stabilize the structure (18).
|
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Construction of Expression PlasmidsA series of expression plasmids was constructed using the vector pQE-30 (Qiagen Inc., Chatsworth, CA) essentially as described previously (9). Recombinant proteins expressed from this vector contain an N-terminal tail of 6 histidine residues.
Site-directed MutagenesisSite-directed mutagenesis was performed using the QuikChange site-directed mutagenesis kit (Stratagene). The introduction of specific mutations into ACE32367 was achieved by the use of pQE-30 ACE32367 (pQE-30 with a 1-kb insert containing ACE32367 from E. faecalis strain OG1RF) as template DNA with the mutagenic primers listed in supplemental Table 1. After initial denaturation at 95 °C for 30 s, the cycling parameters were 30 s at 95 °C followed by 1 min at 55 °C and 5 min at 68 °C (18 cycles).
The reaction mixtures were placed on ice for 2 min. To minimize the chance of the supercoiled double-stranded template DNA being transformed, the mixtures were digested with DpnI at 37 °C for 1 h before being transformed into competent E. coli XL1-Blue cells. Cloned sequences were confirmed with automated DNA sequencing analysis (ABI Prism DNA sequencer; Applied Biosystems) at the University of Texas Medical School DNA sequencing facility, using pQE-30 sequencing primers.
Expression and Purification of Recombinant ProteinsLarge scale expression and purification of the recombinant fragments (Fig. 2) by a nickel-charged HiTrap chelating column and an anion-exchange Sepharose column (Amersham Biosciences) were as described previously (19). Protein-containing fractions were analyzed by SDS-PAGE, appropriately pooled. Finally, except for ACE32367N124C, N328C, protein samples were dialyzed extensively against phosphate-buffered saline, pH 7.4, stored at 4 °C, and generally used within 2 weeks. Protein concentrations were determined by absorption spectroscopy at 280 nm using calculated molar absorption coefficient values (20). Molecular masses of the expressed proteins were confirmed with matrix-assisted laser desorption/ionization-time of flight mass spectrometry (Tufts University mass spectrometry facility) from protein samples in H2O.
|
Surface Plasmon Resonance (SPR) AnalysisSPR analysis was performed at 25 °C using a BIAcore 3000 system (BIAcore AB, Uppsala, Sweden) based on the method described previously (21). Analysis of the association and dissociation rates was performed using the BIAevaluation 3.0 software (BIAcore). The KD values for the binding of recombinant MSCRAMMs to the immobilized collagen were calculated by Scatchard analysis based on the responses at the steady state portion of the sensorgrams using a one-site binding non-linear regression model (19).
Structure Determination of ACE32367The crystallization of the ACE32367 segment was reported previously (22). The structure of ACE32367 was determined by the molecular replacement method using the crystal structure of the N2 subdomain of ACE as search model (23). Briefly, ACE32367 crystals were grown by the hanging-drop method using 2-methyl-2,4-pentanediol as a precipitant. Crystals diffracted to 2.5 A resolution, and the data were collected at 100 K using an R-AXIS IV image plate detector with Rigaku rotating anode generator. Data processing and scaling were performed using DENZO and SCALEPACK (24). The unit cell parameters and data reduction statistics are presented in Table 1. Phase improvement was performed using density modification by solvent flipping and density truncation without phase extension. Experimental amplitudes (202.5 Å data) and phases calculated from the model after rigid body minimization was used in a density modification calculation in CNS (25). The Fourier map calculated with the improved phases clearly revealed the electron density for the additional subdomain (N1 subdomain). Chain tracing and model building were carried out with the program QUANTA2000 (Molecular Simulations, Inc). After several rounds of refinement in CNS with manual model building, the Rcryst/Rfree converged to 22.9%/26.5%. Final refinement statistics are listed in Table 1. The 41 C-terminal residues and the 9-residue linker residues that connect N1 and N2 subdomains could not be identified from the electron density maps. In addition, breaks in the main chain were observed at three places (residues 68, 122123, and 142). The side chains for some residues in a few loop regions are not visible and were modeled as alanines. Mass spectrometry analysis of ACE32367 crystals confirmed that the protein has not been cleaved at the C-terminal during the storage or crystallization. Thus, the C-terminal missing residues and other flexible loop regions appear to be genuinely disordered in the crystal. The final model contains 282 residues out of a total of 336 residues.
|
|
|
| RESULTS |
|---|
|
|
|---|
-sheets (
I and
II), and each sheet consists of five anti-parallel
-strands.
I and
II stack face-to-face and form a "
-sandwich" with jelly roll topology. The N2 subdomain exhibits a shallow groove on the
I sheet, representing a putative collagen binding region. The N1 subdomain is also composed of 10
-strands grouped into two
-sheets forming a
-sandwich. As observed in the structure of apo-CNA31344 (18), the G' strand is not formed by the residues of the N1 subdomain. Instead, this strand is created by the C-terminal extension of the N2 subdomain corresponding to residues 320325, which complement the
I sheet in N1.Inparticular, the association of N1 and N2 generates a deep cleft at their interface, lined by the putative collagen binding trench and two loop segments of the N1 subdomain. The linker residues 165173 that connect the two subdomains are disordered and therefore not included in the refinement. However, modeling of the linker based on the observed discrete densities in the 2fo - fc map suggests that the linker forms a wall in the tunnel-like structure as observed in the CNA-collagen complex and could play a similar role in locking the collagen ligand in place. Because of the structural similarities between the structures of CNA31344 and ACE32367, we hypothesize that ACE binds its ligand by the Collagen Hug mechanism, which was previously proposed for the CNA collagen interaction (18).
|
atoms forming the core
-sheet regions of the N1 and N2 subdomains. However, superposition of either one of the corresponding subdomains on CNA from the CNA-collagen complex clearly showed a significant deviation in the position of the other subdomain. Superposition of the N1 subdomains of ACE and CNA is shown in Fig. 4. Analysis of these crystal structures using the program Dyndom (29) showed that the interdomain orientation of the N1 and N2 subdomains in ACE is altered by about 36° as compared with that of CNA. Due to this difference in the interdomain orientation, the apo-ACE structure is in a partially open conformation as compared with the closed conformation of the CNA-collagen crystal structure.
The electron density for the C-terminal residues (326331) of the latch (320331) is not observed in the ACE structure, and only two pairs of hydrogen bonds between the
-strands G' (the latch) in N2 and E in N1 are found in ACE as compared with three pairs in the crystal structure of CNA. Moreover, the hydrogen bonds that are observed between strands G' and E of apo-ACE are weaker than those observed between the corresponding strands in the crystal structures of CNA as an apoprotein and in complex with the collagen.
Examining the Putative Ligand Binding Trench by Mutagenesis AnalysisThe structure of CNA31344 in complex with a synthetic collagen-like peptide showed that the collagen peptide binds at a shallow binding cleft formed on the surface of the N2 subdomain and makes additional contacts with a few residues in the linker connecting N1 and N2. Arg-15 and Hyp-12 of the middle strand in the collagen helix make hydrogen bonds with a backbone oxygen and a side chain oxygen (OG), respectively, of Thr-169 in one copy of the CNA molecule in the asymmetric unit. In another copy of the molecule in the asymmetric unit, the backbone conformation of the linker is slightly different, and therefore, the interactions between the linker residues and the collagen triple helix are different but significant. Examination of the CNA-collagen complex suggests that bulky, hydrophilic residues would be more favorable to make interactions with the linker by reaching out to the linker residues. The interactions with the collagen triple helix at the N2 subdomain is conserved in all the CNA molecules in the asymmetric unit. Therefore, it could be hypothesized that a high affinity binding site for MSCRAMMS would be 34 GPO repeats flanked by hydrophilic residues containing GXY repeats, where GPO repeats are primarily recognized by the N2 subdomain. Residues that make contact with the collagen peptide in the CNA N2 subdomain (Tyr-175, Thr-177, Gly-178, Leu-181, Glu-183, Arg-189, Phe-191, Asn-196, Asn-223, and Tyr-223) are largely conserved in ACE (Fig. 5A). Mutational analysis was used to evaluate the importance of these residues in ACE32367 for collagen binding. Four residues (Tyr-180, Arg-193 Phe-195, and Asn-197) within the binding region and one residue (Lys-237) outside the groove were individually replaced with Ala residues. The binding of recombinant wild type ACE32367 and mutant proteins to collagen type I immobilized on a sensor chip was analyzed by SPR using a BIAcore 3000 system. Mutations at positions Tyr-180, Arg-193, and Phe-195 resulted in proteins with much lower affinity for collagen as compared with wild type ACE32367 (Fig. 6, Table 2), suggesting that the mutated residues participate in collagen binding as observed in the CNA-collagen complex. Examination of the CNA-collagen complex shows that the corresponding residues of Phe-195 and Tyr-180 in CNA (Phe-191 and Tyr-175, respectively) make stacking interactions with proline residues at position X in the (GXY)n collagen peptide repeats. In the CNA-collagen structure, these residues are a part of the collagen binding groove in the N2 subdomain. Replacing the aromatic side chain of these residues with a single carbon atom side chain would severely affect the stacking interaction. The decreased affinity for collagen exhibited by the ACE F195A and Y180A mutants suggested that similar stacking interactions may also occur in ACE. Surprisingly, a significant increase in apparent affinity for collagen as compared with wild type ACE32367 was observed when Asn-197 in ACE was replaced with Ala (Fig. 6, Table 2). Superposition of CNA and ACE structures and modeling of ACE-collagen showed that residue Asn-197 in ACE corresponds to Asn-193 in CNA that contacts collagen through a weak electrostatic interaction with the hydroxyl group of the hydroxyproline (Hyp-9, O9) in the (GPO)4 repeat. A closer examination of the complex suggested that a bulky residue at that position would constitute a steric conflict. Replacing Asn-197 with Ala therefore could potentially make room for a collagen peptide to contact additional residues in the ACE binding groove.
|
|
|
The Latch in ACE32367 Is Required for Effective Collagen BindingIn the Collagen Hug binding model, the insertion of the C-terminal latch sequence into the neighboring N1 subdomain is predicted to stabilize the closed conformation of the protein-ligand complex. To evaluate the role of the latch sequence, three truncated versions of ACE32367 were generated in which the latch sequence is completely removed (ACE32320), partially truncated (ACE32325), or intact (ACE32340) but with the C-terminal latch extension removed (Figs. 2 and 7A). Binding of the truncation mutants to collagen was analyzed by SPR. Consistent with the predicted role of the latch sequence in collagen binding, the ACE32325 and ACE32320 polypeptides showed a markedly reduced affinity for the ligand. The calculated KDs of these ACE forms with truncations extending into the latch sequence were 185 (ACE32325) and 196 µM (ACE32320) respectively. On the other hand, the affinity of ACE32340 (KD = 53 µM) was similar to that of the intact ACE32367 (KD = 45.6 µM) (Fig. 8B, Table 2).
Mutations in the Latch Sequence Affect Collagen BindingTwo critical steps in the Collagen Hug model are the redirection of the latch sequence and insertion of the sequence into the latching trench in the N1 subdomain. Modeling of amino acid replacements in the latch sequence suggested that a G323P mutation (Figs. 2 and 7A) should prevent the redirection of the latch sequence on ligand binding and that a G323R mutation (Figs. 2 and 7A), due to the introduction of a bulky residue at this position, might result in the latch not being able to fit optimally in the latching trench. Both mutations were predicted to disrupt the latching event and affect ligand binding. The proposed mutants of Ace32367 were produced, and their bindings to collagen were examined by SPR analysis. Both recombinant proteins, ACE32367 G323P and G323R, showed significantly reduced binding affinities for collagen with calculated KD of 119 and 245 µM, respectively, as compared with a KD = 46 µM determined for the wild type protein (Fig. 8C, Table 2).
| DISCUSSION |
|---|
|
|
|---|
|
Using a mutagenesis approach, we also demonstrated that the putative collagen binding cleft on N2 is in fact the ligand binding site. We identified residues that were predicted to make contact with a docked collagen molecule by molecular modeling (Fig. 5B). Mutations of some of these residues resulted in a loss of collagen binding activity, and in one case, a gain of function. These residues are conserved in CAN, and molecular modeling based on the CNA-collagen complex structure could explain the effects of the mutations on ACE/collagen binding. The residues in the shallow ligand binding trench of CNA N2 identified to make contact with collagen and the solved structure of CNA N2N3 in complex with a collagen peptide suggest that the N2 trench can accommodate a generic (GPO)n triple helical peptide. This is consistent with our earlier studies that demonstrate that CNA can bind to multiple sites along a collagen monomer with varying affinities (12). It is not clear what specific residues in collagen and the CNA or ACE contribute to this varying binding affinity for different sites. MSCRAMM can interact to define a high affinity intermediate or low affinity site. In this context, it is also possible that some residues in collagen are not compatible with MSCRAMM binding and define a non-binding site. The dynamic nature of the collagen triple helix on binding makes it difficult to use a molecular modeling approach to define the importance of different residues. We demonstrated that introduction of a proline or an arginine residue in the N2 extension or truncation of the latch sequence affected the latching function of the segment, resulting in MSCRAMM proteins with reduced affinity for collagen. The structure presented in this study shows a significant difference from CNA in the interdomain orientation, possibly due to a weak latch. A weak latch would allow the molecule to adopt an intermediate conformation between the open and closed form and provide a snapshot of a stage in the binding process. Therefore, the structure determined for ACE and the biochemical analyses of the selected ACE mutants provide strong evidence in support of the predicted multistep model of ligand binding and strand complementation as proposed in the Collagen Hug model.
Although the previously determined CNA-collagen structure and ACE crystal structure and biochemical analysis presented here provide evidence for a Collagen Hug model at an atomic level, a key biological question is the relevance of this mechanism of binding to supramolecular fibrillar arrangement in vivo. A simplistic explanation for MSCRAMM binding to collagen would be the presence of disrupted, broken collagen molecules in damaged tissues. These collagen molecules could serve as anchoring templates for the pathogens involved in wound infections. The broad specificity of CNA or ACE MSCRAMMs would help identify binding sites in any monomeric collagen extending off the fiber. It is possible that the collagen binding MSCRAMM could use the Collagen Hug binding mechanism to target any other molecules that contain a triple helical collagenous domain. CNA has been shown to contribute to the virulence of S. aureus in a septic arthritis (30) model, and this property is due to its collagen binding activity. Studies have shown that an ACE mutant is significantly less virulent in a rat endocarditis model as compared with the parent wild type model.4 Thus, the two MSCRAMMs appear to play important roles in the disease processes, but the collagenous host molecules targeted by the adhesions remain to be defined.
The Collagen Hug model and the Dock, Lock, and Latch mechanism previously proposed for the binding of a linear fibrinogen peptide to the SdrG MSCRAMM (31) have several similarities. The two MSCRAMMs are structurally very similar. In both models, the ligand binding region is situated in between the two subdomains: N2 and N3 in SdrG and N1 and N2 in ACE (or CNA). The ligand-MSCRAMM complex is apparently stabilized by insertion of the extension of the second subdomain into an incomplete
-sheet in the first subdomain. The Collagen Hug model also has some unique features. The linker region (165173) connecting the N1 and N2 subdomains of ACE32367, which is required for wrapping around the triple helix, is much longer than the one connecting the N2 and N3 subdomains of SdrG. Thus, the ACE N1 and N2 subdomains are far apart from each other, making it possible for the bulky collagen triple helix molecule to fit into the binding cleft before the insertion of latch sequence into the latching trench. It seems that various MSCRAMMs can utilize similar structural building blocks (IgG-like modules) to bind to different ligands with similar mechanisms that are modified to fit the structural specificities of the ligands.
| FOOTNOTES |
|---|
* This work was supported by National Institutes of Health Grants AI 20624 (to M. H.) and R37 AI 47923 (to B. E. M.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
The on-line version of this article (available at http://www.jbc.org) contains a supplemental table. ![]()
1 Both authors contributed equally to this work. ![]()
2 To whom correspondence should be addressed: Center for Extracellular Matrix Biology, TX A&M University System Health Science Center, Institute of Biosciences and Technology, 2121 West Holcombe Blvd., Suite 603, Houston, TX 77030, Tel.: 713-677-7552; Fax: 713-677-7576; E-mail: mhook{at}ibt.tamhsc.edu.
3 The abbreviations used are: MSCRAMM, microbial surface components recognizing adhesive matrix molecules; ACE, adhesin of collagen from enterococci; SPR, surface plasmon resonance. ![]()
4 B. Murray, personal communication. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. Sillanpaa, S. R. Nallapareddy, V. P. Prakash, X. Qin, M. Hook, G. M. Weinstock, and B. E. Murray Identification and phenotypic characterization of a second collagen adhesin, Scm, and genome-based identification and analysis of 13 other predicted MSCRAMMs, including four distinct pilus loci, in Enterococcus faecium Microbiology, October 1, 2008; 154(10): 3199 - 3211. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Leo, H. Elovaara, B. Brodsky, M. Skurnik, and A. Goldman The Yersinia adhesin YadA binds to a collagenous triple-helical conformation but without sequence specificity Protein Eng. Des. Sel., August 1, 2008; 21(8): 475 - 484. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Bowden, A. P. Heuck, K. Ponnuraj, E. Kolosova, D. Choe, S. Gurusiddappa, S. V. L. Narayana, A. E. Johnson, and M. Hook Evidence for the "Dock, Lock, and Latch" Ligand Binding Mechanism of the Staphylococcal Microbial Surface Component Recognizing Adhesive Matrix Molecules (MSCRAMM) SdrG J. Biol. Chem., January 4, 2008; 283(1): 638 - 647. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||