Originally published In Press as doi:10.1074/jbc.M404920200 on May 12, 2004
J. Biol. Chem., Vol. 279, Issue 30, 31873-31882, July 23, 2004
Crystal Structure of the C-terminal Peptidoglycan-binding Domain of Human Peptidoglycan Recognition Protein I
*
Rongjin Guan
,
Emilio L. Malchiodi
,
Qian Wang
,
Peter Schuck¶, and
Roy A. Mariuzza
||
From the
Center for Advanced Research in Biotechnology, W. M. Keck Laboratory for Structural Biology, University of Maryland Biotechnology Institute, Rockville, Maryland 20850, the
Cátedra de Inmunología, IDEHU, CONICET, FFyB, UBA, Junin 956, 4 to P, Buenos Aires, Argentina, and the ¶Division of Bioengineering and Physical Science, Office of Research Services, National Institutes of Health, Bethesda, Maryland 20892
Received for publication, May 3, 2004
 |
ABSTRACT
|
|---|
Peptidoglycan recognition proteins (PGRPs) are pattern recognition receptors of the innate immune system that bind, and in some cases hydrolyze, peptidoglycans (PGNs) on bacterial cell walls. These molecules, which are highly conserved from insects to mammals, participate in host defense against both Gram-positive and Gram-negative bacteria. We report the crystal structure of the C-terminal PGN-binding domain of human PGRP-I
in two oligomeric states, monomer and dimer, to resolutions of 2.80 and 1.65 Å, respectively. In contrast to PGRPs with PGN-lytic amidase activity, no zinc ion is present in the PGN-binding site of human PGRP-I
. The structure reveals that PGRPs exhibit extensive topological variability in a large hydrophobic groove, located opposite the PGN-binding site, which may recognize host effector proteins or microbial ligands other than PGN. We also show that full-length PGRP-I
comprises two tandem PGN-binding domains. These domains differ at most potential PGN-contacting positions, implying different fine specificities. Dimerization of PGRP-I
, which occurs through three-dimensional domain swapping, is mediated by specific binding of sodium ions to a flexible hinge loop, stabilizing the conformation found in the dimer. We further demonstrate sodium-dependent dimerization of PGRP-I
in solution, suggesting a possible mechanism for modulating PGRP activity through the formation of multivalent adducts.
 |
INTRODUCTION
|
|---|
The innate immune system is a host defense mechanism, evolutionarily conserved from insects to humans, that mediates the early recognition and control of invading microorganisms (1, 2). The basis for innate immune responses resides in the ability of the host to recognize conserved products of microbial metabolism that are unique to microorganisms and are not produced by the host (pathogen-associated molecular patterns, or PAMPs).1 Examples of PAMPs recognized by pattern recognition receptors of the innate immune system include lipopolysaccharide (LPS) of Gram-negative bacteria, DNA sequences containing unmethylated CpG dinucleotides (CpG DNA), and peptidoglycan (PGN), present in both Gram-positive and Gram-negative bacteria (1, 2).
Peptidoglycan recognition proteins (PGRPs) are pattern recognition receptors that bind and, in certain cases, hydrolyze PGNs of bacterial cell walls (19). These molecules are highly conserved from insects to mammals and show distinct specificities for PGNs from different microbes (3). PGRPs have been divided into three classes: short PGRPs (PGRP-S), which are small (20 kDa) extracellular proteins; intermediate PGRPs (PGRP-I) of 4045 kDa that are predicted to be transmembrane proteins; and long PGRPs (PGRP-L), up to 90 kDa, which may be either intracellular or transmembrane (24, 6, 7). Drosophila has 13 PGRP genes (6) and the mosquito Anopheles seven (10); humans have four PGRPs, designated PGRP-S, PGRP-I
, PGRP-I
, and PGRP-L (3, 7). Mammalian homologs of insect PGRPs have also been identified in the mouse, rat, cow, and camel (11).
In Drosophila, PGRPs activate two different signaling pathways that induce production of antimicrobial peptides. PGRP-SA interacts with the lysine-containing PGN from Gram-positive bacteria, which activates the Toll receptor pathway (2, 12). PGRP-LC and PGRP-LE recognize the diaminopimelic acid-containing PGN from Gram-negative bacteria and activate the Imd/Relish pathway (2, 13). The functions of mammalian PGRPs are much less well understood. However, mouse PGRP-S, which is present in neutrophil tertiary granules, inhibits the growth of certain Gram-positive bacteria in culture media and participates in the intracellular killing of bacteria in neutrophils (5). Mice deficient in PGRP-S exhibit increased susceptibility to intraperitoneal infections with low pathogenicity Gram-positive bacteria (15). Bovine PGRP-S (also termed oligosaccharide-binding protein), found in neutrophil and eosinophil granules, is bacteriostatic or bacteriocidal for both Gram-positive and Gram-negative bacteria (16). This PGRP kills, in addition, some microorganisms that completely lack PGN, suggesting that certain PGRPs recognize envelope components other than PGN. Human and mouse PGRP-L, like several Drosophila PGRPs (8, 17), hydrolyze the amide bond between the N-acetylmuramic acid and L-alanine moieties of PGNs (9, 18). Mouse PGRP-S has been shown to form a cytotoxic complex with the major stress protein Hsp70 that induces apoptotic death in various tumor lines at subnanomolar concentrations (19).
All insect and mammalian PGRPs contain a conserved C-terminal domain of
165 amino acids (designated the PGRP domain) with
30% sequence similarity to bacteriophage T7 lysozyme, a zinc-dependent N-acetylmuromyl-L-alanine amidase (2, 11, 20). Recently, the crystal structure of Drosophila PGRP-LB was reported (17), showing an active site cleft with a bound zinc ion, in common with T7 lysozyme. To define the structural relationship between insect and mammalian PGRPs, we determined the crystal structure of the C-terminal PGN-binding domain of human PGRP-I
. Unexpectedly, the protein was found to exist in two oligomeric states, monomer and domain-swapped dimer, in both the crystal and solution. In contrast to Drosophila PGRP-LB, no zinc ion is present in the PGN-binding site of human PGRP-I
. The structure further revealed that PGRPs are characterized by considerable variability in a large hydrophobic groove, located behind the PGN-binding site, which may serve as a binding site for host proteins or PAMPs besides PGN.
 |
EXPERIMENTAL PROCEDURES
|
|---|
DNA Synthesis, Protein Expression, and PurificationA synthetic gene encoding full-length human PGRP-I
(341 residues) was assembled in vitro using recursive PCR techniques. The DNA sequence was designed for optimum expression in Escherichia coli based on codon usage preferences. DNA fragments encoding residues 177341, or 179341, of the C-terminal PGN-binding domain (designated PRGR-I
C1 and PGRP-I
C2, respectively), or 19176 of the N-terminal PGN-binding domain (PGRP-I
N), were generated by PCR from the full-length gene and cloned into the bacterial expression vector pT77 (Novagen). The proteins were expressed as inclusion bodies in E. coli BL21(DE3) cells (Stratagene). Bacteria were grown at 37 °C to an absorbance of 0.7 at 600 nm, and isopropyl-
-D-thiogalactoside was added to a concentration of 1 mM. After incubation for 34 h, the bacteria were harvested by centrifugation and resuspended in 50 mM Tris-HCl (pH 8.0) containing 0.1 M NaCl, 2 mM EDTA and 0.5% (v/v) Triton X-100; cells were disrupted by sonication. The inclusion bodies were washed three times with 50 mM Tris-HCl (pH 8.0), 0.1 M NaCl, 2 mM EDTA, and 0.5% Triton X-100, and another three times with the same buffer without Triton X-100, then solubilized in 50 mM Tris-HCl (pH 8.0), 8 M urea, 2 mM EDTA, and 1 mM dithiothreitol.
For in vitro folding, solubilized PGRP-I
C1, PGRP-I
C2, PGRP-I
N were diluted to a final concentration of 2050 µg/ml into 1.0 M arginine, 50 mM Tris-HCl (pH 8.5), 2 mM EDTA, 5 mM cysteamine, and 0.5 mM cystamine. After 3 days at 4 °C, the folding mixture was concentrated, dialyzed against 50 mM Tris-HCl (pH 8.5), and applied to a Mono Q anion exchange column (Amersham Biosciences) equilibrated in the same buffer; the protein was eluted with a linear NaCl gradient. Further purification was carried out by size exclusion using a Superdex 75 HR column (Amersham Biosciences). Mass spectrometry and N-terminal sequencing established the identities of the recombinant proteins.
Crystallization and Data CollectionCrystallization trials of purified PGRP-I
C1 and PGRP-I
C2 were carried out in hanging drops at protein concentrations of 6 and 8 mg/ml, respectively. Crystals of PGRP-I
C1 grew at room temperature in 20% (weight/volume) PEG-MME 2000, 0.01 M NiSO4, and 50 mM Tris-HCl (pH 8.5). The PGRP-I
C2 protein crystallized at room temperature in 1.0 M sodium potassium tartrate (pH 7.0), 1 mM ZnCl2, and 0.05% (volume/volume) Triton X-100.
For data collection, crystals of PGRP-I
C1 were cryoprotected by brief soaking in mother liquor containing 15% (volume/volume) glycerol and flash-cooled in liquid propane. X-ray diffraction data to 3.30 Å resolution were recorded in-house at 100 K using an R-axis IV++ image plate detector. Higher resolution data (2.80 Å) were collected on beamline X25 of the Brookhaven National Synchrotron Laboratory using a Quantum-4 charge-coupled device detector. Crystals of PGRP-I
C2 were cryoprotected by soaking in mother liquor containing 15% (volume/volume) ethylene glycol before being flash-cooled. Data to 1.65 Å resolution were measured at 100 K on beamline X26C of the Brookhaven National Synchrotron Laboratory. The data were processed and scaled using DENZO/SCALEPACK (21) (Table I).
Structure Determination and RefinementThe structure of PGRP-I
C1 was determined by molecular replacement with the program AMoRe (22) using the 3.30 Å data set. The crystal structure of Drosophila PGRP-LB (PDB accession code 1OHT
[PDB]
) served as the search probe. The translation function search gave a clear solution with a correlation coefficient of 36.8 and Rfactor of 47.2% at a resolution range of 8.04.0 Å. After rigid-body refinement, the correlation coefficient was 45.8 and the Rfactor was 44.2%. The solution contains one monomer in the asymmetric unit. The structure of PGRP-I
C2 was solved by molecular replacement using the refined PGRP-I
C1 model as the search probe. When checking crystal packing in XtalView (23), clashes were observed involving residues 316327 in symmetry-related molecules. Rebuilding these residues according to omit electron density maps revealed swapping of the C-terminal
-helical domain (residues 316341).
Refinement of the PGRP-I
C1 model was performed with CNS (24) using the 2.80 Å data set;
A-weighted 2Fo - Fc and Fo - Fc electron density maps were calculated for model adjustment. After several rounds of rebuilding in XtalView (23), including gradual residue replacement to the human PGRP-I
sequence, the Rcryst was reduced to 34.8% and Rfree to 35.9%. Group temperature factor (B) refinement was done with further model adjustment. As individual B-factor refinement failed to reduce the Rfree, it was not applied to the final model, which comprises residues 177341, two nickel ions, one sulfate, and 11 water molecules. The final Rcryst is 21.9% and Rfree is 25.2% for all data between 30.0 Å and 2.80 Å (Table I). The PGRP-I
C2 structure was refined similarly; the final model includes residues 179341, one sodium ion, and 137 water molecules. The final Rcryst and Rfree are 19.6 are 21.2%, respectively, for data between 30.0 and 1.65 Å (Table I).
Peptidoglycan Binding AssayInsoluble PGN (100 µg) from Staphylococcus aureus (Sigma) was incubated for 2 h at 4 °C with 100 µl (1 mg/ml) of bovine serum albumin (BSA), PGRP-I
C, PGRP-I
N, or hen egg lyzosyme (HEL), as described (5, 6). The samples were centrifuged 5 min at 8,000 x g and the pellets washed twice with 300 µl of 50 mM Tris-HCl (pH 8.0), 150 mM NaCl. The pellets were suspended in 100 µl of sample buffer (62.5 mM Tris-HCl (pH 6.8), 2% SDS, and 25% glycerol), boiled 5 min, and centrifuged. Supernatants were submitted to 20% SDS-PAGE and protein visualized by Coomassie staining.
Sedimentation EquilibriumSedimentation equilibrium studies were conducted in a Optima XL-I/A analytical ultracentrifuge (Beckman Coulter, Fullerton, CA) at a temperature of 4 °C and at several rotor speeds between 20,000 rpm and 45,000 rpm. 180 µl of protein was loaded into Epon double-sector centerpieces at concentrations of 0.72 and 0.24 mg/ml in 0.5 M sodium tartrate (pH 7.0) (A), and at 3.5, 1.2 and 0.4 mg/ml in 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl (B), respectively. Equilibrium absorbance profiles were acquired at wavelengths of 250, 280, and 300 nm. In buffer A, the observed weight-average buoyant molar mass values were concentration-dependent, ranging from 3670 to 5230 Da, clearly indicating self-association. Based on the known protein monomer molar mass, the buoyant molar mass at the lowest concentrations led to the effective solvated partial specific volumes of 0.749 (A) and 0.732 ml/g (B), respectively, consistent the values of 0.750 (A) and 0.736 ml/g (B) theoretically predicted for these buffer conditions based on the amino acid composition under fully hydrated conditions (using the software SEDNTERP, kindly provided by Dr. John Philo, Alliance Protein Laboratories). The protein extinction coefficient at 280 nm was calculated as 28,730 A280/Mcm based on amino acid composition, and the extinction coefficients at all other wavelengths were calculated as part of the global multi-wavelength analysis using the software SED-PHAT (www.analyticalultracentrifugation.com) including multiple scans of the same solution at different wavelengths (25). Data analysis was performed by global least-squares analysis of data from multiple concentrations, multiple rotor speeds, and multiple data acquisition wavelengths, based on the well known superpositions of the Boltzmann distributions of ideal species in the centrifugal field, using conservation of mass constraints (25). For the model describing reversible dimerization, the monomer and dimer distributions were linked by mass action law (26). The redistribution of a known small non-participating species of
3 kDa with relative abundance of <10% of the loading concentration was included in the analysis, consideration of which only slightly influenced the quantitative aspects of the results, without affecting the qualitative conclusions. Similarly, a term for non-ideal sedimentation due to hard-sphere repulsion (second virial coefficient B2 = 3 ml/g, assuming the absence of nonideality from Donnan effects and electrostatic interactions at the buffers of high ionic strength used) (27) was considered in the analysis, leading to negligible changes in the quantitative results.
Protein Data Bank Accession CodesCoordinates and structure factors for PGRP-I
C1 and PGRP-I
C2 have been deposited under accession codes 1SK3
[PDB]
and 1SK4
[PDB]
, respectively.
 |
RESULTS
|
|---|
Overview of the StructureBased on predictions using the domain identification algorithm SMART (28), and on the distribution of cysteine residues, two versions of the putative PGN-binding C-terminal domain of human PGRP-I
were expressed in bacterial inclusion bodies and folded in vitro: 1) residues 177341 (designated PGRP-I
C1) and 2) residues 179341 (PGRP-I
C2), which differs from PGRP-I
C1 only in lacking Val177 and Cys178. The structures of PGRP-I
C1 and PGRP-I
C2 were determined by molecular replacement to resolutions of 2.80 and 1.65 Å, respectively (Table I). Both structures contain a central
-sheet composed of five
-strands: four parallel and one (
5) anti-parallel and three
-helices, similar to the zinc-dependent amidases T7 lysozyme (20) and Drosophila PGRP-LB (17) (Fig. 1A). In addition, both PGRP-I
C1 and PGRP-I
C2 contain a long N-terminal segment (residues 177198 and 179198, respectively), absent from T7 lysozyme, which is also found in Drosophila PGRP-LB (Fig. 1B); however, the structure of this PGRP-specific segment differs substantially in the human and insect proteins (see below). By analogy to T7 lysozyme (20), the putative PGN-binding site of PGRP-I
C resides in a cleft flanked by the
1-helix and four loops (
3
1,
1
4,
5
6, and
7
3) that project above the
-sheet platform.

View larger version (40K):
[in this window]
[in a new window]
|
FIG. 1. Structure of the C-terminal PGN-binding domain of human PGRP-I and comparison with Drosophila PGRP-LB. A, left and center, ribbon diagrams of human PGRP-I C in two crystal forms (PGRP-I C1 and PGRP-I C2). Right, structure of Drosophila PGRP-LB (17) (Protein Data Bank code 1OHT
[PDB]
). Secondary structure elements are numbered sequentially. The N and C termini are labeled. The view is looking down on the putative PGN-binding cleft, whose walls are formed by helix 1 and four loops ( 3 1, 1 4, 5 6, and 7 3) extending above the -sheet platform. For each structure, the N-terminal PGRP-specific segment is red and the C-terminal -helix is blue. Disulfide bonds are yellow. The zinc ion bound to Drosophila PGRP-LB is drawn as a purple sphere. B, view of the opposite face of each PGRP domain from that in A, looking down onto the PGRP-specific segment. The molecules are rotated 180° about a vertical axis and 40° about a horizontal axis compared with the orientation in A.
|
|
The PGRP-I
C1 structure includes a disulfide bond linking Cys178 in the N-terminal PGRP-specific segment to Cys300 in the
2-helix (Fig. 1B). PGRP-I
C2 does not form this disulfide, as it lacks Cys178; the unpaired Cys300 is oxidized to cysteine sulfenic acid in the crystal. The Cys178Cys300 disulfide is not required for PGN binding (see below). However, the most notable difference between PGRP-I
C1 and PGRP-I
C2 lies in the position of the
3-helix, which is distal to the core of the latter protein (Fig. 1A). This helix mediates formation of a domain-swapped dimer both in the PGRP-I
C2 crystal and in solution, as discussed later.
Disulfide Bonds in PGRP DomainsThe human PGRP-I
C domain is cross-linked by three disulfide bonds (Cys178Cys300, Cys194Cys238, and Cys214Cys220), of which only one (corresponding to Cys214Cys220) is present in Drosophila PGRP-LB (Fig. 1A). This buried disulfide, which appears to be present in all known PRGPs except Drosophila PGRP-LE, may be required for maintaining the structural integrity of the PGRP domain. The two additional disulfides in human PGRP-I
, Cys178Cys300 and Cys194Cys238, tether the N-terminal PGRP-specific segment to the
2-helix and loop
1
4, respectively (Fig. 1). These disulfides, which are conserved in the I and S classes of mammalian PGRPs (Fig. 2), may serve to stabilize these proteins in the harsh environment of the extracellular compartment or leukocyte granules, where they are believed to reside (5, 15, 16).
In insects, the Cys178Cys300 disulfide is likely found in several PGRP-S proteins (but in no PGRP-L), including Drosophila PGRP-SA and Anopheles PGRP-S3 (10). The Cys194Cys238 disulfide, however, is not present in insect PGRPs. In mammals, this disulfide distinguishes non-catalytic from catalytic PGRPs, as it is retained by all non-catalytic PGRPs, but is absent from PGRPs with demonstrated PGN-hydolyzing activity (human and mouse PGRP-L) (9, 18). The Cys194Cys238 disulfide cannot form in catalytic PGRPs (mammalian or insect) due to replacement of one, or both, of the requisite cysteines (Fig. 2).
Structural Variability in PGRP-specific SegmentsAlthough human PGRP-I
C adopts an overall fold very similar to that of Drosophila PGRP-LB, the human and insect PGRPs differ markedly in the structures of their N-terminal segments (residues 177198 in PGRP-I
C), which form a hydrophobic groove with
2-helix on the face opposite the PGN-binding site of each protein. Whereas in Drosophila PGRP-LB this region comprises two
-strands (
1 and
2) linked by a 310 helix, a corresponding 310 helix links two stretches of coil in the human PGRP-I
C structures (Fig. 1B). Superposition of the two PGRP-specific segments showed that they consist of a structurally conserved central region (residues Arg184Arg190 in PGRP-I
C), which includes the 310 helix, flanked by two variable regions, comprising residues 177183 and 191198 (Fig. 3A).
Three residues in the central region of the PGRP-I
C N-terminal segment, Arg184, Trp187, and Arg190, make extensive interactions with the main body of the PGRP domain: Arg184 and Arg190 form bidentate salt bridges with Asp239 and Asp233, respectively, while Trp187 makes a tight hydrophobic cluster with Val224, Tyr253, and Ala189 (Fig. 3B). Significantly, all three residues, and their specific interactions, are preserved in Drosophila PGRP-LB (Fig. 3C). Among all PGRPs, Trp187 is invariant and Arg184 is highly conserved; residues 189 and 190 exhibit moderate conservation (Fig. 2). By contrast, the regions flanking the central region (residues 177183 and 191198) differ at 15 out of 15 positions in human PGRP-I
C compared with Drosophila PGRP-LB, which also contains an insertion at position 30 in its sequence. Similar sequence variability is apparent in alignments of other PGRP-specific segments (Fig. 2). The result is marked differences in the topology and dimensions of the hydrophobic groove formed by the two PGRP-specific segments and the
2-helix (Fig. 4), which was proposed as a potential protein-binding site (17). The different orientations of the N terminus relative to the
2-helix make this groove much narrower at this end in the PGRP-I
C structure (8.6 Å between Val177 C
and Glu304 C
) compared with PGRP-LB (18.2 Å between Thr12 C
and Phe139 C
) (Fig. 1B). In addition, loop
5
6 protrudes into the putative protein-binding groove of PGRP-I
C, but not that of PGRP-LB. Both grooves contain a prominent hydrophobic pocket that is, however, deeper and wider in PGRP-I
C than PGRP-LB (Fig. 4).
Based on sequence comparisons (Fig, 2), a similar degree of structural variability probably exists among other PGRP-specific segments as observed between those of human PGRP-I
C and Drosophila PGRP-LB. Thus, individual PGRPs are distinguished by unique N-terminal regions, which may mediate specific interactions with different effector or signaling proteins (29, 30), or with microbial ligands besides PGN, such as LPS (13, 31) and 1,3-
-D-glucan (32).
The Peptidoglycan-binding SiteThe putative PGN-binding cleft of PGRP-I
C is
27 Å long, with a shallow (67 Å) end, flanked by helix
1 and loops
3
1 and
6
2, and a deep (1213 Å) end, flanked by loops
5
6 and
7
3. Although no crystal structure of any PGRP (or T7 lysozyme) in complex with PGN has been reported, it is tempting to speculate that the ligand would be oriented with its bulky N-acetylmuramic acid moiety situated at the shallow end of the binding groove, and the short PGN peptide held in an extended conformation at the deep end.
In contrast to Drosophila PGRP-LB, no zinc ion is visible in the PGN-binding site of human PGRP-I
C (Fig. 5), despite the inclusion of 1 mM Zn2+ in one of the crystallization buffers (see "Experimental Procedures"). This difference is explained by the substitution of zinc-coordinating residues His42 and Cys160 in Drosophila PGRP-LB by Ile207 and Ser324 in human PGRP-I
C. Since zinc is required for the PGN-hydrolyzing amidase activity of Drosophila PGRP-LB and other catalytic PGRPs, the absence of this metal ion in human PGRP-I
C is consistent with its ability to bind, but not hydrolyze, PGNs. Other noncatalytic PGRPs, which constitute the majority of known insect and mammalian PGRPs, also do not preserve the zinc-coordinating residues of Drosophila PGRP-LB (Fig. 2). Topological differences between the human and insect PGRPs are not restricted to this particular region of the binding site cleft, but are distributed along its entire length (not shown). Thus, human PGRP-I
C differs from Drosophila PGRP-LB at 18 of 28 residues lining the groove, suggesting that the two PGRPs may exhibit distinct PGN binding specificities. Indeed, PGRPs have been shown to discriminate among PGNs from different microbes (3, 12), presumably through differential recognition of their variable peptide moieties.

View larger version (36K):
[in this window]
[in a new window]
|
FIG. 5. Comparison of PGN-binding sites of non-catalytic and catalytic PGRPs. A, human PGRP-I C, a non-catalytic PGRP. B, Drosophila PGRP-LB, which has PGN hydrolyzing activity. Side chains of residues that likely contribute to PGN recognition or hydrolysis are shown. Carbon atoms are yellow, nitrogen atoms are dark blue, oxygen atoms are red, and sulfur atoms are green. The zinc ion bound by Drosophila PGRP-LB is purple; one of the four zinc ligands is provided by Glu182 from a symmetry-related molecule (gray).
|
|
PGRP-I
Comprises Tandem PGN-binding DomainsHuman PGRP-I
, and the closely related PGRP-I
, were predicted to be transmembrane proteins, based on the identification of potential membrane-spanning segments in their N-terminal regions (residues 1176) (7, 11). However, re-examination of the N-terminal sequence of human PGRP-I
, in light of the crystal structure of its C-terminal PGN-binding domain, revealed that the N- and C-terminal regions share very high sequence similarity (>40% identity), with strict conservation of four cysteines forming disulfide bonds Cys194Cys238 and Cys214Cys220 in the PGRP-I
C structure (Fig. 2). Sequence alignments of human PGRP-I
, and of mouse PGRP-I
and PGRP-I
, produced similar results. Therefore, mammalian PGRPs of the I class are most likely soluble intra- or extracellular proteins composed of two tandem PGN-binding domains. To test this prediction, we expressed the entire N-terminal region of human PGRP-I
(PGRP-I
N), excluding the putative 18-residue signal peptide, and examined its ability to recognize PGN. Indeed, the individual PGRP-I
N and PGRP-I
C1 (or PGRP-I
C2) domains appeared to bind insoluble PGN from S. aureus equally well (Fig. 6A), though this assay can probably only discriminate between large affinity differences. The existence of PGRPs having multiple PGN-binding domains is not restricted to mammals. For example, sequence analysis (28) of insect PGRPs indicated that Drosophila PGRP-LF, like human PGRP-I
, comprises two tandem PGN-binding domains and is unlikely to be transmembrane.

View larger version (48K):
[in this window]
[in a new window]
|
FIG. 6. PGRP-I contains two PGN-binding domains. A, binding of recombinant PGRP-I C and PGRP-I N to PGN from S. aureus. Proteins (BSA, lane 1; PGRP-I N, lane 2; PGRP-I C, lane 3; HEL, lane 4; Bio-Rad molecular mass markers, lane 5) were incubated with insoluble PGN. Unbound protein was removed by centrifugation and washing; bound protein was analyzed by 20% SDS-PAGE and stained with Coomassie Blue. BSA and HEL served as negative and positive controls, respectively, for PGN binding. B, molecular surface showing the putative PGN-binding cleft of PGRP-I C, oriented vertically. Potential PGN-contacting residues that differ between PGRP-I C and PGRP-I N are green; conserved residues are red. At each position, residues are identified as PGRP-I C/PGRP-I N.
|
|
While the functional role of tandem PGN-binding domains remains to be elucidated, two possibilities may be considered. First, incorporation of two or more PGN-binding domains within a single PGRP might serve to augment avidity by permitting multivalent attachment to the bacterial cell wall. Second, tandem domains may confer multiple PGN-binding specificities to individual PGRPs, thereby expanding the ability of a relatively small number of receptors to detect a broad spectrum of microbial pathogens. In this regard, human PGRP-I
N differs from PGRP-I
C at 21 of 28 potential PGN-contacting residues in the binding site cleft (Fig. 6B), implying that the two domains could possess different fine specificities. Notably, Tyr242, His264, Thr265, and Asn269 form a contiguous patch of conserved residues on the floor of the binding groove at its deep end, which may be the site for accommodation of the peptide chain of PGNs (see above). Moreover, with the exception of Thr265, these residues are >80% conserved in other PGRPs.
PGRP-I
Dimer FormationAlthough we had expected the two crystal forms of PGRP-I
C (Table I) to yield essentially identical structures, this was not the case: whereas in PGRP-I
C1 crystals the PGRP domain exists as a monomer, it forms a domain-swapped dimer in PGRP-I
C2 crystals, with the two monomers related by a 2-fold rotational symmetry axis (Fig. 7A). That is, two PRGP molecules exchange their C-terminal
3 helices, such that each helix makes the same interactions in the dimer as in the monomer, but the interactions are inter-rather than intramolecular. A secondary interface is introduced between the subunits in the domain-swapped dimer through juxtaposition of loop
5
6 and helix
1 (Fig. 7B). Although dimerization appears to partially occlude the PGN-binding sites of both monomers (Fig. 7B), the PGRP-specific segments are fully exposed for possible interactions with other microbial ligands (13, 31, 32), or with host proteins (29, 30). It should also be noted that full-length PGRP-I
includes an N-terminal PGRP domain with PGN binding activity (Fig. 6A). Consequently, dimerization of full-length PGRP-I
, in the manner observed in PGRP-I
C2 crystals (Fig. 7A), would create an assembly having two identical PGN-binding sites at opposite poles.
To form the domain-swapped dimer, the
7
3 hinge loop connecting the C-terminal
-helix of PGRP-I
C to the rest of the structure adopts a more extended conformation than in the monomer, where it folds back on itself (Fig. 7, C and D). During refinement of the PGRP-I
C2 structure, a strong persistent peak of electron density was observed in both 2Fo - Fc and Fo - Fc maps at the center of the hinge loop, surrounded by a cluster of oxygen atoms. Efforts to model this density using various ions (Zn2+, Ca2+, Mg2+, K+, Na+), or water, yielded satisfactory results only for Na+, which was present in the crystallization buffer of dimeric, but not monomeric, PGRP-I
C (see "Experimental Procedures"). The temperature factor of the bound sodium ion (15.5) is comparable to the average B value for the main chain (19.1). The sodium is pentahedrally coordinated by the carbonyl oxygens of Ser317, Val319, and Ile322, and by the hydroxyl oxygen of Ser324; a water molecule provides a fifth sodium ligand (Fig. 7C). All the metal-ligand distances (2.32.5 Å) are within the range seen for sodium ions in high resolution protein structures (33). The bound Na+ could contribute to stabilizing the particular conformation of the hinge loop found in the dimer, as suggested by the fact that the structure of this loop in monomeric PGRP-I
C, which crystallized in the absence of sodium, is incompatible with Na+ coordination (Fig. 7D).
To address whether PGRP-I
C dimerizes in solution, as well as in the crystal, and whether Na+ promotes dimerization, the protein was characterized by size exclusion chromatography. In the absence of sodium, PGRP-I
C behaved as a monomer of 16 kDa, with little or no detectable dimer (Fig. 8A). However, in the presence of 300 mM NaCl, at pH 8, a clear dimer peak of 34 kDa, representing
20% of the total protein, was observed (Fig. 8B). These results were corroborated by analytical ultracentrifugation. In the presence of sodium tartrate (pH 7), the salt used for crystallization, sedimentation equilibrium profiles showed 31% of PGRP-I
C in a dimeric state, with an estimated KD of 130 µM (Fig. 9). Similar results were obtained using NaCl. Thus, sodium-mediated dimerization of this pattern recognition molecule occurs in both crystal and solution states.

View larger version (27K):
[in this window]
[in a new window]
|
FIG. 9. Sedimentation equilibrium analysis of PGRP-I C dimerization. a, absorbance distributions for the sedimentation of PGRP-I C (0.72 mg/ml) in 0.5 M sodium tartrate (pH 7.0) at 4 °C at rotor speeds of 20,000 rpm (circles), 25,000 rpm (triangles), and 30,000 rpm (squares). Only every second data point is shown. Distributions were analyzed as part of a global fit to absorbance data at multiple wavelengths and loading concentrations. Solid lines are the global best-fit distributions using a reversible monomer-dimer model with an equilibrium constant of KD = 130 µM. b, residuals of the fit, with a r.m.s. deviation to the data shown of 0.004 A300. c, for comparison, the residuals of the best-fit distributions are shown if the absence of dimer was assumed, with an otherwise identical model.
|
|
 |
DISCUSSION
|
|---|
Except for members of the PGRP-S class, which are clearly secreted in both insects and mammals (37, 15, 16, 32), most other PGRPs have not been identified experimentally as transmembrane, intracellular, or secreted molecules. Indeed, considerable caution is warranted in classifying individual PGRP-I and PGRP-L receptors as transmembrane or soluble based solely on structure predictions. Accordingly, we demonstrated the absence of a putative transmembrane segment in the N-terminal portion of human PGRP-I
by folding the entire region in aqueous solution into a functional PGN-binding domain. This result, in conjunction with structure-based sequence alignments, indicates that mammalian PGRP-I
and PGRP-I
, and certain insect PGRP-L (e.g. Drosophila PGRP-LF), are soluble intracellular or secreted proteins consisting of tandem PGRP domains. As evidence that mammalian PGRP-L is also soluble, a 64 kDa N-acetylmuramoyl-L-alanine amidase purified from human serum was found to have an N-terminal sequence identical to that predicted for mature PGRP-L (34). Since all mammalian PGRPs contain leader sequences, they should be localized in intracellular vesicles, where they may encounter phagocytosed bacteria, or be secreted upon fusion of these vesicles with the cell membrane. Another pattern recognition molecule, Toll-like receptor 9, was recently shown to require translocation from the endoplasmic reticulum to a CpG DNA-containing lysosomal compartment for ligand binding and signal transduction (35).
Recent evidence indicates that PGNs may not be the sole microbial surface determinants recognized by PGRPs. For example, the unexpected finding that bovine PGRP-S is microbicidal for some organisms that lack PGN (Cryptococcus neoformans), or in which PGN is obscured by LPS (Salmonella typhimurium), suggests that envelope components other than PGN may interact, directly or indirectly, with certain PGRPs (16). In support of this view, Drosophila PGRP-LC is required for activation of the Imd/Relish pathway by LPS, as well as PGN (13, 31). Similarly, recombinant PGRP-S proteins from the large beetle Holotrichia diomphalia were found to specifically bind both 1,3-
-D-glucan and PGN (32). In addition to microbial products, PGRPs interact with yet unidentified host effector molecules responsible for downstream signaling, with other pattern recognition receptors such as Gram-negative-binding protein 1 (29, 30), and with heat shock proteins (19). It seems unlikely that the relatively conserved PGN-binding site of PGRPs can recognize such an array of structurally diverse microbial and host ligands. Rather, it is more probable that the PGRP-specific segment, which we have shown exhibits considerable topological variability (Fig. 4), mediates binding to non-PGN ligands, with different segments forming functionally distinct binding sites. However, direct binding studies will be required to test this hypothesis.
Three-dimensional domain swapping, the process whereby two or more proteins form a dimer or higher order oligomer by exchanging an identical structural element, has now been observed in over 30 crystal structures (36, 37). As the number of domain-swapped proteins continues to rise, the question of biological relevance grows in importance. While some domain-swapped dimers are undoubtedly crystallization artifacts (36, 37), a variety of proteins have been shown to exist as domain-swapped oligomers in vivo, including T7 helicase (38), T4 endonuclease VII (39), coagulation factors IX/X-binding protein (40), and bleomycin resistance protein (41). In addition, domain swapping regulated by ligands has been described for glyoxylase I (42) and the cell cycle regulatory protein suc1 (43, 44), where the equilibrium between monomer and dimer is altered by glutathione and phosphopeptide, respectively, suggesting a role for domain swapping in allostery and signal transduction.
We do not know whether the domain swapping observed in PGRP-I
C (Fig. 7A) has functional significance; however, several possibilities exist. While dimerization appears to block the PGN-binding sites of the PGRP-I
C monomers (Fig. 7B), the PGRP-specific segments remain completely available for the potential recognition of other PAMPs (13, 31, 32) or effector proteins (17, 29, 30). In addition, since full-length PGRP-I
includes an N-terminal PGRP domain with PGN binding activity (Fig. 6A), dimerization of full-length PGRP-I
would generate a molecule with two identical PGN-binding sites at its opposite ends. Such a dimer would be bivalent not only with respect to the PGN-binding site of the N-terminal PGRP domain, but also with respect to each of the two PGRP-specific segments present in full-length PGRP-I
(Fig. 2). Domain swapping may therefore serve to augment the binding avidity of PGRPs, whose monomeric affinities for certain ligands might be weak. Alternatively, dimerization may create multivalent templates for directing the assembly of multiprotein signaling complexes as PGRPs concentrate on the cell walls of invading microbes. Finally, an exchange of
3 helices between different PGRPs would generate heterodimers, whose existence has been postulated to explain the simultaneous requirement of two splice forms of Drosophila PGRP-LC for the response to LPS (13, 31). In this regard, coagulation factors IX/X-binding protein is a domain-swapped heterodimer formed between homologous C-type lectin domains (40).
Sodium-binding sites have been identified in numerous protein structures (33). However, as in the case of domain swapping, the possible role of bound sodium ions in regulating protein function or stability is generally not well understood (45). A notable exception is the allosteric serine protease thrombin (46, 47), whose specificity switches from anticoagulant to procoagulant substrates upon binding Na+, as a result of subtle conformational changes in the primary specificity pocket (48). Here we have shown that sodium ions promote dimerization of PGRP-I
by stabilizing a particular conformation of a flexible hinge loop through coordination of specific loop residues, suggesting a novel role for bound Na+ in altering protein structure. Since PGRP-I
exists in solution in monomer-dimer equilibrium, this finding raises the intriguing possibility that monovalent cations could modulate PGRP function by promoting homo- or heterodimerization of these pattern recognition receptors. Future studies will address this and other functional questions through direct measurements of PGRP binding to its ligands.
 |
FOOTNOTES
|
|---|
The atomic coordinates and structure factors (codes 1SK3
[PDB]
and 1SK4
[PDB]
) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by National Institutes of Health Grant AI47900. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 
|| To whom correspondence should be addressed: Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, 9600 Gudelsky Dr., Rockville, MD 20850. Tel.: 301-738-6243; Fax: 301-738-6255; E-mail: mariuzza{at}carb.nist.gov.
1 The abbreviations used are: PAMP, pathogen-associated molecular pattern; LPS, lipopolysaccharide; PGN, peptidoglycan; PGRP, peptidoglycan recognition protein; PGRP-S, short peptidoglycan recognition protein; PGRP-I, intermediate peptidoglycan recognition protein; PGRP-L, long peptidoglycan recognition protein; BSA, bovine serum albumin; HEL, hen egg lysozyme; R.m.s., root mean-square. 
 |
ACKNOWLEDGMENTS
|
|---|
We thank C. P. Swaminathan for many valuable discussions and critical reading of the manuscript. We are grateful to H. Robinson for x-ray data collection and preliminary processing at the Brookhaven National Synchrotron Laboratory under the Biology PX Mail-in program. We also thank J. Yang for help in primer design and E. J. Sundberg for critical comments on the manuscript.
 |
REFERENCES
|
|---|
- Medzhitov, R., and Janeway, C. A., Jr. (2002) Science 296, 298-300[Abstract/Free Full Text]
- Hoffmann, J. A. (2003) Nature 426, 33-38[CrossRef][Medline]
[Order article via Infotrieve]
- Kang, D., Liu, G., Lundstrom, A., Gelius, E., and Steiner, H. A. (1998) Proc. Natl. Acad. Sci. U. S. A. 95, 10078-10082[Abstract/Free Full Text]
- Ochiai, M., and Ashida, M. (1999) J. Biol. Chem. 274, 11854-11858[Abstract/Free Full Text]
- Liu, C., Gelius, E., Liu, G., Steiner, H., and Dziarski, R. (2000) J. Biol. Chem. 275, 24490-24499[Abstract/Free Full Text]
- Werner, T., Liu, G., Kang, D., Ekengren, S., Steiner, H., and Hultmark, D. (2000) Proc. Natl. Acad. Sci. U. S. A. 97, 13772-13777[Abstract/Free Full Text]
- Liu, C., Xu, Z., Gupta, D., and Dziarski, R. (2001) J. Biol. Chem. 276, 34686-34694[Abstract/Free Full Text]
- Mellroth, P., Karlsson, J., and Steiner, H. (2003) J. Biol. Chem. 278, 7059-7064[Abstract/Free Full Text]
- Wang, Z.-M., Li, X., Cocklin, R. R., Wang, M., Wang, M., Fukase, K., Inamura, S., Kusumoto, S., Gupta, D., and Dziarski, R. (2003) J. Biol. Chem. 277, 49044-49052
- Christophides, G. K., Zdobnov, E., Barillas-Mury, C., Birney, E., Blandin, S., Blass, C., Brey, P. T., Collins, F. H., Danielli, A., Dimopoulos, G., Hetru, C., Hoa, N. T., Hoffmann, J. A., Kanzok, S. M., Letunic, I., Levashina, E. A., Loukeris, T. G., Lycett, G., Meister, S., Michel, K., Moita, L. F., Muller, H. M., Osta, M. A., Paskewitz, S. M., Reichhart, J. M., Rzhetsky, A., Troxler, L., Vernick, K. D., Vlachou, D., Volz, J., von Mering, C., Xu, J., Zheng, L., Bork, P., and Kafatos, F. C. (2002) Science 298, 159-165[Abstract/Free Full Text]
- Dziarski, R. (2004) Mol. Immunol. 40, 877-886[CrossRef][Medline]
[Order article via Infotrieve]
- Michel, T., Reichhart, J. M., Hoffmann, J. A., and Royet, J. (2001) Nature 414, 756-759[CrossRef][Medline]
[Order article via Infotrieve]
- Choe, K. M., Werner, T., Stoven, S., Hultmark, D., and Anderson, K. V. (2002) Science 296, 359-362[Abstract/Free Full Text]
- Gottar, M., Gobert, V., Michel, T., Belvin, M., Duyk, G., Hoffmann, J. A., Ferrandon, D., and Royet, J. (2002) Nature 416, 640-644[CrossRef][Medline]
[Order article via Infotrieve]
- Dziarski, R., Platt, K. A., Gelius, E., Steiner, H., and Gupta, D. (2003) Blood 102, 689-697[Abstract/Free Full Text]
- Tydell, C. C., Yount, N., Tran, D., Yuan, J., and Selsted, M. E. (2002) J. Biol. Chem. 277, 19658-19664[Abstract/Free Full Text]
- Kim, M.-S., Byun, M., and Oh, B.-H. (2003) Nat. Immunol. 4, 787-793[CrossRef][Medline]
[Order article via Infotrieve]
- Gelius, E., Person, C., Karlsson, J., and Steiner, H. (2003) Biochem. Biophys. Res. Comm. 306, 988-994[CrossRef][Medline]
[Order article via Infotrieve]
- Sashenko, L. P., Dukhanina, E. A., Yashin, D. V., Shatalov, Y. V., Romanova, E. A., Korobko, E. V., Demin, A. V., Lukyanova, T. I., Kabanova, O. D., Khaidukov, S. V., Kiselev, S. L., Gabibov, A. G., Gnuchev, N. V., and Georgiev, G. P. (2004) J. Biol. Chem. 279, 2117-2124[Abstract/Free Full Text]
- Cheng, X., Zhang, X., Pflugrath, J. W., and Studier, F. W. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 4034-4038[Abstract/Free Full Text]
- Otwinowski, Z., and Minor, W. (1997) Methods Enzymol. 276, 307-326
- Navaza, J. (1994) Acta Crystallogr. Sect. A 50, 157-163[CrossRef]
- McRee, D. E. (1999) J. Struct. Biol. 125, 156-165[CrossRef][Medline]
[Order article via Infotrieve]
- Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J. S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T., and Warren, G. L. (1998) Acta Crystallogr. Sect. D 54, 905-921[CrossRef][Medline]
[Order article via Infotrieve]
- Vistica, J., Dam, J., Balbo, A., Yikilmaz, E., Mariuzza, R. A., Rouault, T. A., and Schuck, P. (2004) Anal. Biochem. 15, 234-256
- Schuck, P., and Braswell, E. H. (2000) in Current Protocols in Immunology (Coligan, J. E., Kruisbeek, A. M., Margulies, D. H., Shevach, E. M., and Strober, W., eds) pp. 18.8.1-18.8.22, John Wiley & Sons, New York
- Cole, J. L. (1996) Biochemistry 35, 15601-15610[CrossRef][Medline]
[Order article via Infotrieve]
- Schultz, J., Milpetz, F., Bork, P., and Ponting, C. P. (1998) Proc. Natl. Acad. Sci. U. S. A. 95, 5857-5864[Abstract/Free Full Text]
- Gobert, V., Gottar, M., Matskevich, A. A., Rutschmann, S., Royet, J., Belvin, M., Hoffmann, J., and Ferrandon, D. (2003) Science 302, 2126-2130[Abstract/Free Full Text]
- Pili-Floury, S., Leulier, F., Takahashi, K., Saigo, K., Samain, E., Ueda, R., and Lemaitre, B. (2004) J. Biol. Chem. 279, 12848-12853[Abstract/Free Full Text]
- Werner, T., Borge-Renberg, K., Mellroth, P., Steiner, H., and Hultmark, D. (2003) J. Biol. Chem. 278, 26319-26322[Abstract/Free Full Text]
- Lee, M. H., Osaki, T., Lee, J. Y., Baek, M. J., Zhang, R., Park, J. W., Kawabata, S., Soderhall, K., and Lee, B. L. (2004) J. Biol. Chem. 279, 3218-3227[Abstract/Free Full Text]
- Harding, M. M. (2002) Acta Crystallogr. Sect. D 58, 872-874[CrossRef][Medline]
[Order article via Infotrieve]
- De Pauw, P., Neyt, C., Vanderwinkel, E., Wattiez, R., and Falmagne, P. (1995) Protein. Expr. Purif. 6, 371-378[CrossRef][Medline]
[Order article via Infotrieve]
- Latz, E., Schoenmeyer, A., Visintin, A., Fitzgerald, K. A., Monks, B. G., Knetter, C. F., Lien, E., Nielsen, N. J., Espevik, T., and Golenbock, D. T. (2004) Nat. Immunol. 5, 190-198[CrossRef][Medline]
[Order article via Infotrieve]
- Liu, Y., and Eisenberg, D. (2002) Protein Sci. 11, 1285-1299