J. Biol. Chem., Vol. 280, Issue 11, 10636-10645, March 18, 2005
Functional Insights from the Structure of the Multifunctional C345C Domain of C5 of Complement*
From the
Received for publication, November 22, 2004 , and in revised form, December 9, 2004.
The complement protein C5 initiates assembly of the membrane attack complex. This remarkable process results in lysis of target cells and is fundamental to mammalian defense against infection. The 150-amino acid residue domain at the C terminus of C5 (C5-C345C) is pivotal to C5 function. It interacts with enzymes that convert C5 to C5b, the first step in the assembly of the membrane attack complex; it also binds to the membrane attack complex components C6 and C7 with high affinity. Here a recombinant version of this C5-C345C domain is shown to adopt the oligosaccharide/oligonucleotide binding fold, with two helices packed against a five-stranded β-barrel. The structure is compared with those from the netrin-like module family that have a similar fold. Residues critical to the interaction with C5-convertase cluster on a mobile, hydrophobic inter-strand loop that protrudes from the open face of the β-barrel. The opposite, helix-dominated face of C5-C345C carries a pair of exposed hydrophobic side chains adjacent to a striking negatively charged patch, consistent with affinity for positively charged factor I modules in C6 and C7. Modeling of homologous domains from complement proteins C3 and C4, which do not participate in membrane attack complex assembly, suggests that this provisionally identified C6/C7-interacting face is indeed specific to C5.
A complement-mediated response to infection is fundamental to good health, but inappropriate complement activity underlies the symptoms of numerous inflammatory disorders (1). Activation of complement, and the ensuing attack on pathogens, entails a sequence of intermolecular recognition events, enzymatic cleavages, and assemblies of multiprotein complexes. The 30 fluid-phase and membrane-associated proteins participating in the complement system have been well characterized at the sequence level, and their respective roles are broadly understood (2, 3). There is, however, little understanding at atomic resolution of the interplay between the components. In particular, the sequence in which the five soluble, terminal components of complement (C5, C6, C7, C8, and C9) assemble to form the remarkable lipid bilayer-penetrating membrane attack complex (MAC)1 has been known for many years (4). But the network of protein-protein interactions entailed in forming this stable lytic complex and the involvement of specific amino acids remain a mystery. The key to progress in this area will be more three-dimensional structural information.
Assembly of the MAC is initiated by proteolytic cleavage of C5 by the trimeric enzyme, C5 convertase, at the target cell surface to generate C5a and a metastable species, C5b. C5b has the transient ability to interact tightly with C6 (5). The C5bC6 complex subsequently serves as a nucleation site for sequential assembly of C7, C8, and n molecules of C9 to create the MAC. Mature C5 is a heterodimer consisting of
An opportunity to address this lack of structural information arose from the suggestion that the C-terminal
Expression of the segment of C5 corresponding to its C345C domain (14) followed by analysis using CD and NMR confirmed that these amino acid residues fold to form a compact three-dimensional structure (15). Furthermore, C5-C345C, unlike the C345C domain of C3, is able to bind to both C6 and C7 in surface plasmon resonance (SPR)-based assays (14). In further work, C5-C345C was shown to inhibit recruitment of C7 by C5bC6 through an interaction between C5-C345C and the pair of factor I membrane attack complex (FIMAC) domains, also called factor I modules (FIMs), at the C terminus of C7 (16). Thus the C5-C345C domain provides at least part of the interacting surface between C5b and C7 in formation of the MAC. The C345C domain also harbors a region that interacts with the C5 convertase (17), although the cleavage site itself lies some 800 residues away toward the N terminus of the

Although the fold of C5-C345C might be anticipated to resemble the fold of the NTR module from PCOLCE-1 (9), the sequence identity is low (see Fig. 1A) and the disulfide bonding patterns are different: 1-4, 2-5, and 3-6 in the PCOLCE-1 NTR module compared with 1-3, 2-6, and 4-5 in the C3 equivalent (and therefore by inference in the C5 example). The C5-C345C sequence is longer and contains fewer prolines (147 residues including three prolines) compared with the PCOLCE-1 NTR module sequence (119 residues including 11 prolines). An experimentally determined three-dimensional structure of C5-C345C would therefore represent an important advance in understanding the basis, at atomic resolution, for the early steps of MAC assembly.
Here we report the use of solution NMR to solve the structure of C5-C345C. We thus provide the first new structural information for the C3/C4/C5 family of proteins since the structures of the C3d and C4d fragments were solved (18, 19) and, in the case of C5, since the anaphylatoxic C5a fragment structure was determined in 1989 (6). The new structure allows the construction of useful models of the C345C domains from C3 and C4. The positions within the structure of residues previously identified as being functionally critical and the location of surface patches likely to be involved in protein-protein interactions are now revealed.
Protein Preparation. pET15b vectors encoding the amino acid residues of C5 from Ala1512 to the C-terminal residue Cys1658 (both with and without the point mutation F1613A) were constructed as described previously (14). The isotopically enriched recombinant proteins were overexpressed in the Escherichia coli strain Origami (Novagen, Madison, WI) and purified as described previously (14). For NMR studies, 15N- and 15N,13C-labeled protein samples (0.5-1.0 mM) were prepared in buffer containing 20 mM sodium phosphate, 100 mM NaCl, 5 µM EDTA, 0.02% NaN3, pH 6.0, in 95% H2O, 5% D2O.

Binding Studies. Affinities of the recombinant wild-type and F1613A versions of C5-C345C for C6 and C7 were measured using SPR as described previously (14).

NMR Spectroscopy. NMR spectra were acquired on Bruker AVANCE 600- and 800-MHz and Varian INOVA 600- and 800-MHz spectrometers, using 5-mm triple resonance probes equipped with pulsed-field gradients. Spectra were processed using the AZARA package (provided by W. Boucher, University of Cambridge), using maximum entropy processing of F1 and F2 dimensions of the three-dimensional experiments, and resonance assignment was achieved using ANSIG as described previously (15). Distance restraints for the structure calculation were derived from the following three complementary NOE spectroscopy (NOESY) experiments: a 15N-edited NOESY-HSQC and two 13C-edited NOESY-HSQCs, one in H2O buffer and one in D2O buffer. All mixing times were 100 ms. Slowly exchanging amide protons were identified by the detection of 26 NH resonances in a 15N-HSQC spectrum recorded 1 month after exchanging a protein sample into D2O buffer. Hydrogen bond acceptors for most of these slowly exchanging protons were identified using the refined initial structures. Distance restraints corresponding to hydrogen bonds were only introduced following identification of the supporting characteristic NOEs.

15N Relaxation Measurements. 15N T1 and T2 relaxation times were measured by the method of Kay (20).
The pulse sequence was modified according to Grzesiek and Bax (21) to keep the water magnetization on the z axis during the T1 period. Relaxation delays of 43.1, 253.1, 421.1, 589.1, 757.1, 841.1, 925.1, and 1051.1 ms were employed for T1 measurements, and delays of 15.8, 31.6, 63.2, 94.8, 111.7, and 126.5 ms were employed for T2 measurements. The T1 and T2 relaxation times were calculated by nonlinear least squares fitting. In each case, the spectrum corresponding to one of the relaxation delay values was re-collected to allow an estimation of the experimental error of the measured peak intensities. For the 1H-15N HSQC heteronuclear NOE experiment (21), a saturation experiment and a reference experiment were recorded with a relaxation delay of 5 s, of which 3 s was used for 1H saturation in the 1H-saturated experiment.

Structure Calculation. Wherever possible, resonances in the NOESY spectra were assigned unambiguously. Otherwise a set of two or more assignment possibilities was assigned on the basis of their chemical shifts using the Connect program within AZARA. Peak intensities were converted into four distance categories of 0-2.7, 0-3.3, 0-5.0, and 0-6.0 Å. A total of 3544 distance restraints were generated from the three NOESY-HSQCs, of which 2609 were unambiguous and nondegenerate. The NOE-derived distance restraints were used as input for the structure calculations using CNS-Solve (22). At a later stage more distance restraints representing the 26 inferred hydrogen bonds were added to the restraints list. The simulated annealing protocol employed the PARALLHD force field, with the nonbonded energy function of PROLSQ (23) and included active swapping of pro-chiral centers. For the initial structure calculations, the six cysteines were defined as being in the oxidized state.
In the absence of information from experimental disulfide mapping, however, no covalent linkages between sulfur atoms were initially defined in the molecular structure file in order to avoid bias. At a later stage, and based on the initial structure calculations, two disulfide bonds, Cys1514-Cys1588 and Cys1535-Cys1658, were added. There was a lack of NOE-based evidence to support the formation of a disulfide between the remaining pair of cysteines, Cys1636 and Cys1639. This arose, at least in part, from a paucity of assignments for nuclei in this region of the sequence. No covalent linkage was therefore defined between these residues. As the calculations progressed, the ambiguously assigned distance restraints were "filtered" iteratively to eliminate assignment possibilities contributing less than 1% to the total NOE, and redundant restraints (duplicates) were also removed. A total of 100 structures were calculated from which a representative ensemble of 40 structures, with the lowest NOE-derived energies, was selected. The quality of the ensemble of structures was checked with PROCHECK (24). The NOE-derived distance restraints used for the structure calculations and the coordinates of the ensemble of 40 structures of C5-C345C have been deposited in the Protein Data Bank under accession number 1XWE.
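The iterative filtering of ambiguous restraints described above can be illustrated with a short sketch. It assumes that the contribution of each assignment possibility to the total NOE is estimated as r^-6, with r taken from the current structures; this is a common convention for ambiguous distance restraints but is not spelled out in the text. The function name, the restraint labels, and the distances are hypothetical and serve only as an illustration.

```python
def filter_ambiguous(distances, cutoff=0.01):
    """Keep assignment possibilities whose share of the summed NOE is >= cutoff.

    distances: dict mapping a (hypothetical) assignment label to the
    interproton distance in angstroms measured in the current structures.
    The r**-6 weighting is an assumption, not a detail taken from the paper.
    """
    weights = {label: r ** -6 for label, r in distances.items()}
    total = sum(weights.values())
    return {label: r for label, r in distances.items()
            if weights[label] / total >= cutoff}


# Three hypothetical possibilities for one ambiguous NOESY cross-peak.
possibilities = {"HA12-HB40": 3.0, "HA12-HG77": 4.0, "HA12-HD90": 8.0}
kept = filter_ambiguous(possibilities)
# The 8.0 A possibility contributes about 0.2% of the summed r**-6 and is dropped.
```

Because the weighting falls off so steeply with distance, a 1% threshold removes only possibilities that are far apart in the current structures, which is why the filter can be applied repeatedly as the ensemble converges.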
Modeling C345C Domains of C3 and C4. Modeling of the C345C domains of C3 and C4 was undertaken based on the lowest NOE energy NMR-derived structure of C5-C345C using the program Modeller release 7, version 7 (25). The alignments between the target sequences of human C3 and C4 C345C domains and the template structure were based on initial multiple sequence alignments of C3-, C4-, and C5-C345C domain sequences from various organisms from the SwissProt (26, 27) and the GenBank nonredundant databases, using the program MUSCLE (28, 29). The multiple sequence alignment (Fig. 1B) was manually edited to ensure the most plausible alignment of conserved amino acid residues and of secondary structure elements as predicted by PsiPred (30) between the target and template. The three putative disulfide bridges and the longer predicted C-terminal
Recombinant F1613A Mutant Binds C6/C7. The protein fragment, C5-C345C (residues Ala1512 to the C-terminal Cys1658 of human C5), with an N-terminal His tag was overexpressed in the E. coli strain Origami. The use of a bacterial expression system facilitated isotopic enrichment, and the Origami strain was selected because its oxidizing intracellular environment is conducive to formation of disulfide bonds (32). After thrombin cleavage of the His tag, four extra residues (Gly-Ser-His-Met) remained at the N terminus of the C5-C345C sequence. Protein expression levels in rich media were typically 4 mg/liter but only 0.5 mg/liter in Martek 9-labeled media. Yields were improved 4-5-fold using a construct with the point mutation F1613A. To assess any structural perturbations that might be introduced by such a mutation, 15N,1H-HSQC spectra of 15N-labeled wild-type and F1613A C5-C345C samples were compared (data not shown). Nearly all the resonances coincide. Significant chemical shift differences were noted only for those peaks corresponding to residues located close in sequence to the mutation, namely Ile1609-Tyr1617; of these only Asn1612, Phe/Ala1613, and Ser1614 show major differences. This observation indicates that the F1613A mutant of C5-C345C has a near-identical structure to that of the native domain. To assay for functional activity, binding to C6 and C7 was measured (Fig. 2). As may be judged from the SPR-derived binding parameters (Table I), the affinities of the F1613A mutant for both these MAC components are similar or identical to those of the wild-type domain. Given its higher expression levels, the mutant was therefore used in the subsequent structural studies.
The Solution Structure of C5-C345C Is Solved. The 15N- and 15N,13C-labeled samples of C5-C345C yielded high quality NMR spectra, thus permitting the assignment of nearly all of the 15N, 13C, and 1H nuclei (15). Only a few assignments were made for Ser1637, Ser1638, and the four non-native residues at the N terminus because they all gave rise to few detectable resonances. Several assignments for aromatic side chain atoms were missing, mainly due to overlapping signals; these were Tyr1543 (C and H ), Tyr1611 (C and H ), Phe1556 (C and H ), Phe1615 (C , H , C , and H ), Phe1642 (C and H ), and Phe1654 (C , C , and H ). Tyr1541 is unusual in that its Hδ and its Hε nuclei have nondegenerate chemical shifts; a strong chemical exchange peak between the resonances of Hδ1 and Hδ2, and between those of Hε1 and Hε2, indicates restricted rotation of its aromatic side chain (subsequently, the structure reveals that this side chain is indeed well buried within the core of the protein). The Hα atom of Leu1521 exhibits an unusually low chemical shift of 1.58 ppm (cf. an average shift of 4.32 ppm). All three proline residues are in the trans conformation, as evidenced by the differences in the chemical shifts, δ(Cβ) - δ(Cγ), of 4.03, 5.00, and 4.88 ppm for Pro1537, Pro1620, and Pro1631, respectively (δ(Cβ) - δ(Cγ) is 4.51 ± 1.17 ppm for trans and 9.64 ± 1.27 ppm for cis (33)), as well as by strong NOE cross-peaks between the Hδ atoms of the prolines and the Hα of the preceding residues.
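The proline assignment above rests on comparing an observed chemical shift difference with the two reference distributions quoted from Ref. 33. A minimal sketch of that comparison follows, assuming the quoted difference is δ(Cβ) - δ(Cγ) and that each observation is simply assigned to the reference distribution it lies closest to in units of standard deviation. The function name and the decision rule are illustrative assumptions, not the procedure of Ref. 33.

```python
# Reference mean and standard deviation (ppm) of the chemical shift
# difference for trans and cis prolines, as quoted in the text from Ref. 33.
TRANS = (4.51, 1.17)
CIS = (9.64, 1.27)


def classify_proline(shift_difference_ppm):
    """Return 'trans' or 'cis', whichever reference distribution is closer.

    Closeness is measured in standard deviations; this rule is an
    illustrative assumption.
    """
    z_trans = abs(shift_difference_ppm - TRANS[0]) / TRANS[1]
    z_cis = abs(shift_difference_ppm - CIS[0]) / CIS[1]
    return "trans" if z_trans < z_cis else "cis"


# Observed differences reported in the text for the three prolines.
observed = {"Pro1537": 4.03, "Pro1620": 5.00, "Pro1631": 4.88}
calls = {residue: classify_proline(value) for residue, value in observed.items()}
```

All three observed values lie within one standard deviation of the trans mean and more than three standard deviations from the cis mean, so the assignment does not depend on the details of the decision rule.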
Subsequently, a structure calculation was undertaken using a total of 3544 NMR-derived distance restraints as detailed in Table II. Two disulfide bonds (Cys1514-Cys1588 and Cys1535-Cys1658) were added only after NOE-based calculations had established beyond a doubt the proximity and orientation of the contributing cysteine side chains. A third potential disulfide was not invoked because, although the remaining two cysteine residues are close in space, there is insufficient NOE-derived evidence to judge whether their side chains are appropriately juxtaposed. Similarly, distance restraints based on 26 inferred inter-
A total of 40 structures, selected on the basis of lowest NOE-derived energy from 100 calculated ones, converged well in most regions as may be judged from a backbone overlay (Fig. 3A) and the values for r.m.s.d. in Table II. The r.m.s.d. of the Cα coordinates of the 40 selected structures from those of the mean structure are plotted in Fig. 4A as a function of residue number and compared with the distribution of 1H-1H NOEs (Fig. 4B). Significantly fewer than average NOEs are exhibited by two stretches of residues within the sequence (Ile1609-Phe1615 and Thr1635-Cys1639) and by the N-terminal residues Ala1512 and Asp1513. This is reflected in the elevated r.m.s.d. values of their Cα atoms and is also evident from inspection of the overlay in Fig. 3A. In the case of Ser1637 and Ser1638, the aforementioned lack of detectable amide signals would account in part for the dearth of NOEs.
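For readers unfamiliar with the per-residue r.m.s.d. plotted in Fig. 4A, the following sketch shows one way such a profile can be computed. It assumes that the models have already been superimposed and that one coordinate triple per residue (taken here to be the Cα atom) is supplied for each model; the function name and the toy coordinates are hypothetical and are not taken from the deposited ensemble.

```python
import math


def per_residue_rmsd(ensemble):
    """Per-residue r.m.s.d. of each atom position from the ensemble mean.

    ensemble: list of models; each model is a list of (x, y, z) tuples,
    one per residue, in the same residue order. The models are assumed to
    be superimposed already; no fitting is performed here.
    """
    n_models = len(ensemble)
    n_residues = len(ensemble[0])
    profile = []
    for i in range(n_residues):
        # Mean position of residue i over all models.
        mean = [sum(model[i][k] for model in ensemble) / n_models
                for k in range(3)]
        # Mean squared deviation of residue i from that mean position.
        msd = sum(sum((model[i][k] - mean[k]) ** 2 for k in range(3))
                  for model in ensemble) / n_models
        profile.append(math.sqrt(msd))
    return profile


# Toy ensemble: two models, two residues (coordinates in angstroms).
toy = [[(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
       [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)]]
profile = per_residue_rmsd(toy)  # residue 1 is fixed, residue 2 varies
```

Residues that are poorly restrained, such as those with few NOEs, show up as peaks in such a profile.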
Description of the Structure. For the purposes of the description below, and unless stated otherwise, a residue is designated as belonging to an α-helix or β-strand in C5-C345C if it is so defined in the majority of the 40 members of the ensemble according to the Kabsch and Sander (34) criteria, as implemented in MolMol (35). Two views of the fold of the closest-to-the-mean C5-C345C structure are shown in Fig. 3B. The core of the structure is an OB-class fold that is most easily thought of as two orthogonal three-stranded, antiparallel twisted β-sheets composed from strands A^C-B-C and strands A^N-D-E, where the superscripts N and C denote the N- and the C-terminal halves of strand A (Tyr1541-Val1552). There are two adjacent helices as follows: a short one (helix-1, Arg1530-Ala1534) composed of residues from near the N terminus of the module, and a longer and irregular one (helix-2, Leu1643-Leu1655) close to the C terminus of the module (and of the full-length protein). The two helices are tilted with respect to one another but are essentially aligned with, and lie against, the convex face of the A^N-D-E sheet. Strand B (Val1557-Lys1568) extends beyond the A^C-B-C sheet so that its C-terminal part participates in a four-stranded antiparallel sheet B^C-A^N-D-E. In a small proportion of calculated structures, there are two segments to strand E, E1 (Ile1618-Pro1620) and E2 (Trp1626-Tyr1629), interrupted by coil. Strand E1, which is assigned (within MolMol) in only a few structures, forms a small parallel β-sheet with strand C (Glu1579-Lys1584). In all of the C5-C345C structures, there is potential for H-bonds between the CO of Tyr1619 and the NH of Thr1581, and between the NH of residue Tyr1619 and CO of Ile1583, thus completing the hydrogen bond network that forms the barrel-like structure. Strand E2, which appears in all structures, is antiparallel to strand D (Gln1598-Gly1603).
Thus the barrel has a "closed" side made up from the β-strands, and a more "open" side (to the right of the view in the left-hand panel of Fig. 3B) occupied by Tyr1619 and the residues prior to E2.
The N-terminal segment of the domain runs above one end of the barrel, from Cys1514 to the top of helix-1. Cys1514 is disulfide-linked to Cys1588, which is located in the long CD loop; the CD loop crosses over the otherwise open end of the barrel from the A^C-B-C sheet to the A^N-D-E sheet (Fig. 3B). The 15N T1/T2 ratios (Fig. 4C) in some residues of the N-terminal segment (but not in the CD loop) suggest chemical exchange (i.e. microsecond to millisecond time scale conformational fluctuations), whereas in both the N-terminal segment and the CD loop there is also some evidence of rapid (i.e. nanosecond and faster) motion from the heteronuclear NOE plot (Fig. 4D). At the bottom of the short helix-1 the transition to the long strand A contains Cys1535, which is linked to Cys1658, the C-terminal residue of the module. The BC loop caps off the other end of the barrel and corresponds to a dip in the heteronuclear NOE plot consistent with rapid motion, but there is no evidence of chemical exchange among these residues. Following strand D, there is a 14-residue loop that in a few members of the ensemble contains antiparallel
Examination of the ensemble of calculated structures indicates that helix-2 is not a straight, regular

The pattern of disulfide bond formation thus agrees with that predicted on the basis of disulfide mapping in C3 (36). The first Cys (1514) is disulfide-bonded to the third Cys (1588); and the second Cys (1535) is linked to the sixth Cys (1658). The third disulfide, involving the remaining fourth and fifth Cys residues (1636 and 1639), has presumably formed because biochemical analysis suggested that no free sulfhydryl groups are present in C5-C345C, but its presence could not be supported by the NOE data. It is possible that this bond exists only transiently due to the constraints placed by there being only two residues between these two cysteines.

Of 36 residues in C5-C345C that are >95% buried (on average in the ensemble) (see Fig. 1), three are likely to be charged in the solution conditions used. The alkyl chain of Lys1584 is buried, but its amino group is exposed to solvent. However, Arg1530 and Glu1628 are completely buried and proximal, indicating the likelihood of an unusual ion pair or salt bridge connecting helix-1 with strand E of the barrel. Ala1534 of helix-1, a stack of four residues located along one side of the second helical region (Phe1642, Leu1646, Phe1649, and Ile1653), two buried residues from the start of strand A (Ile1539 and Ala1542), two residues from strand D (Leu1600 and Met1602), and Pro1631 from beyond strand E are all deeply buried in a hydrophobic core between the helices and the barrel along with the Arg/Glu salt bridge. Of the remaining deeply buried residues, all contribute to the hydrophobic core of the barrel. Most of the solvent-exposed (>30%) hydrophobic residues lie in the DE loop, whereas Phe1654 and Leu1655 are exposed near the C terminus.
Adjacent to this exposed pair of side chains is a patch of negatively charged side chains (glutamates 1528, 1648, and 1651; aspartates 1647 and 1652) that dominate the electrostatic surface of C5-C345C (see below). This surface (to the left in the left-hand panel of Fig. 3B) is likely to be exposed in full-length C5 because it is distal to the N terminus of the C345C domain.
Comparison with Other Structures. As predicted (8), the lowest NOE energy structure of the C5-C345C domain resembles the N-terminal domains of TIMP-1 (PDB ID 1UEA, chain B) and TIMP-2 (PDB ID 1BR9) (10, 11) (Cα r.m.s.d. over 107 and 106 residues of 2.9 and 3.1 Å, respectively), the NTR domain of PCOLCE-1 (PDB ID 1UAP) (9) (Cα r.m.s.d. over 107 residues = 2.8 Å), and the laminin-binding domain of agrin (PDB ID 1JC7) (12) (Cα r.m.s.d. over 111 residues = 3.5 Å). A comparison of the structures is presented in Fig. 5, and a structure-based alignment of these domains is shown in Fig. 1A. This work therefore confirms that the C345C domain of C5 is an example of an NTR module. For the purposes of further discussion, all four domains represented in Fig. 5 (and the equivalent TIMP-1 domain) will henceforth be referred to as NTR modules.
Many of the buried residues of C5-C345C that lie in strands are conserved or conservatively substituted in the other modules. These are drawn in Fig. 5; examples include the following: the first, second, third, fifth, and seventh residues of strand A; the first, third, fifth, and seventh residues of strand B; residues in positions 2-6 of strand D; and three residues (equivalent to Tyr1619, Leu1621, and Ile1627) in the strand E region. On the other hand, many of the side chains that make up the hydrophobic core between the β-barrel-like (β-) subdomain and the helix-rich (α-) subdomain are not well conserved, consistent with a higher degree of structural divergence in the helical subdomain. The buried partners that comprise the putative salt bridge, Arg1530 and Glu1628 of C5, are replaced by hydrophobic residues in the other domains. Many of the exposed hydrophobic residues of C5-C345C lie in insertions; examples include Leu1523, Pro1537, and Val1573 (which lies in the BC loop and is one of the most exposed residues in the protein) and five residues (including Ala1613, which replaces the wild-type Phe) in the DE protuberance. The conspicuous tandem pair of exposed hydrophobic residues near the C terminus (Phe1654 and Leu1655) is not conserved.
As would be expected from the conservation of buried residues, the
In both the TIMP and PCOLCE-1 NTR modules, two disulfides staple the N-terminal region to the rest of the
TIMP-2 lacks the prominent loop seen in C5 between strands D and E1, but its A and B strands are more extended. The PCOLCE-1 NTR module also lacks the long loop between strands D and E that forms such a prominent feature of C5-C345C. Between strands B and C of the agrin NTR module, there are two turns of
In all the NTR modules there are helical regions, packed against the convex surface of the A-D-E sheet, forming the

Comparison with Other C345C Domains. The C345C domains of C3 (residues 1518-1663) and C4 (residues 1595-1744) are both 26% identical to C5-C345C (Fig. 1B). The Arg/Glu salt bridge is conserved as are many other buried residues. Exceptions include the following: Ala1540 at the start of strand A, which is replaced by an Asp or a Glu in C3 and C4, respectively; Val1545, which is conserved in C4 but replaced with a Thr or a Gly in some examples of C3; Val1557 at the beginning of strand B, which is replaced with Arg in C4 and an Asp in most examples of C3; and Ile1580 in strand C, which is replaced by Arg in C3 and C4. Exposed hydrophobic side chains that are conserved include the following: Pro1537 (in an insertion relative to the non-C3/C4/C5 NTR modules) and Leu1607, which is also a large hydrophobic residue in most examples of C3 in the alignment but is replaced by Ser in rat and mouse C4 (and is not conserved in the non-C3/C4/C5 NTR modules). Finally, the pair of hydrophobic residues toward the bottom of helix-2 are well conserved in C5 and most examples of C3 but not in C4 (nor in the NTR modules of TIMP, PCOLCE-1, and agrin). The most conspicuous examples of exposed hydrophobic residues that are peculiar to C5 lie in the DE protuberance, which has a pronounced hydrophobic character.
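A percentage identity such as the 26% figure quoted above depends on how alignment gaps are counted, and the text does not state which convention was used. The sketch below assumes that identity is computed over alignment columns in which neither sequence has a gap; the function name and the two short aligned strings are hypothetical and are not taken from Fig. 1B.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of identical residues over columns with no gap in either row.

    Both arguments are rows of the same alignment and must have equal length.
    Using gap-free columns as the denominator is an assumed convention.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if a != gap and b != gap]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Hypothetical aligned fragments: six gap-free columns, five of them identical.
identity = percent_identity("ACD-EFGH", "ACNQEF-H")
```

Other conventions, for example dividing by the length of the shorter sequence, give somewhat different values for the same alignment, so the convention should be stated whenever identities are compared between studies.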
A further comparison of the C3, C4, and C5 modules was made on the basis of homology-modeled three-dimensional structures of the C3 and C4 examples (Fig. 6). Obvious structural differences arise from the DE insertion of C5-C345C and the insertions in the sequences of C3 and C4 between the fourth and fifth Cys residues, prior to helix-2. According to the secondary structure prediction program PsiPred (30), helix-2 is strongly predicted to begin well before the fifth Cys of C3 and C4 (this region forms a turn but is not classified as a helix in the majority of the C5-C345C NMR-derived structures) and to extend to the C terminus. In human C3, the beginning of the predicted helix and the preceding region (following strand E2) are rich in negatively charged residues (seven within a 10-residue stretch), and this feature dominates the electrostatic representation of the C3-C345C surface (Fig. 6). By contrast, the equivalent region of the human C5-C345C surface is neutral, whereas in human C4 it is positively charged (Fig. 6). In all three proteins, the middle part of helix-2 is negatively charged, with human C5 having the most charge and human C4 the least. Another feature common to C3, C4, and C5 is that the open face of the
Interpretation of Mutagenesis Data. In previous work, the DE region of C5-C345C had been investigated as a site of possible functional significance on the basis that it is close to an "indel" (indels are evolutionary insertions or deletions of amino acid residues that result in length polymorphisms among members of a protein family). Deletion of the putative insertion Ser1623 and Leu1624 resulted in significant loss of hemolytic activity (40% of wild type, with normal expression levels) in full-length C5 (37). In light of the three-dimensional structure it seems likely that such a deletion, just prior to strand E2, would affect the structural integrity of the β-subdomain or at least disrupt the open side of the barrel and the DE loop. Substitution of Leu1607-Tyr1611 by the sequence DFWGE resulted in loss of all detectable hemolytic activity (37). In this case, the original sequence corresponds to a poorly structured region of the DE loop with no buried side chains, and the substitution would be unlikely to disrupt the structure of the domain. These observations therefore implicate the DE loop in function.
Subsequently, alanine substitutions of residues from Gly1603 to Pro1621 were carried out in full-length C5 (17). Most substitutions had little or no effect on the hemolytic activity of C5 or its susceptibility to proteolytic cleavage. From the structure it can be seen that Tyr1619 is >95% buried within the

Substitution of Ile1609 or Tyr1611 by alanine would not be expected to disrupt any local structure within the DE protuberance because their side chains are exposed, and indeed these mutations had little effect on hemolytic activity or proteolytic susceptibility. Substitution of the exposed Lys1610, on the other hand, produced mutant C5 molecules with both low hemolytic activity and decreased sensitivity to proteolytic activation. This is consistent with the pentapeptide substitution experiment and pinpoints Lys1610 as the functionally critical residue in that peptide. Substitution of Phe1613 and Phe1615 also perturbed hemolytic activity and decreased proteolytic susceptibility to the classical pathway convertase (but not cobra venom factor, which is able to cleave both C3 and C5 and therefore presumably has a different recognition mechanism). Comparison of HSQC spectra for the wild-type and F1613A versions of C5-C345C showed that this mutation has no nonlocal structural effects, and indeed the F1613A mutant was used for structure determination in the current study. Phe1615 is also exposed, and mutation to Ala would be equally unlikely to disrupt structure. Therefore these mutagenesis results identify three exposed side chains (1610, 1613, and 1615) as being specifically involved in an interaction, either with the convertase or within the full-length C5, that is critical for function.
It is striking that these three residues, whose Ala substitutions cause 80-90% loss of C5 activity, are at the tip of the DE extension and that their side chains, as well as those of two of the four residues whose substitutions cause 50% loss of activity (Arg1616 and Tyr1617), are located on the same side of the protuberance. In the absence of a three-dimensional structure for C3, C4, or C5, the physical distance of the C345C module from the cleavage site (some 800 residues in terms of primary structure) is unknown. From the structure of the module, however, it can be seen that the DE loop exposes hydrophobic side chains (including those of the two critical Phe residues) and lies close to the N terminus of the module. Just prior to Cys1514 in the C5 sequence is Cys1509, which is (by extrapolation from disulfide mapping in C3) disulfide-linked to Cys848. Thus the C345C domain is likely to be closely coupled to further structured domains, and therefore the DE extension could be buried in the interface between the C345C domain and the remainder of the C5 protein. In that case mutations of the DE loop might disrupt the arrangement of domains within the full-length protein and exert their functional influence indirectly. Arguing against this, however, is the observation that a peptide extending from Lys1604 to Arg1616 inhibited complement hemolytic activity and activation of C5 by the classical pathway C5 convertase (but not cobra venom factor) (17). Furthermore, the consequences for inhibitory activity of alanine substitution within the peptide reflected the results of alanine-scanning mutagenesis in C5. The peptide studies are therefore more consistent with a direct interaction between the DE extension and the classical pathway convertase.
C5-C345C, but not the equivalent domain from C3, binds reversibly to C6 and C7, with a preference for C7 (14) (and Fig. 2), and the interaction with C7, but not C6, appears to be essential for the nonreversible formation of the MAC.2 The F1613A mutant used in the current structure determination retained the ability to bind C6 and C7 (Table I); indeed, none of several DE loop mutations in C5 influenced binding to C6 (17). Therefore, the C6/C7-binding site of C5-C345C is likely to lie elsewhere. Another set of mutants2 was therefore constructed in which deletions or insertions were made in the N-terminal region, the AB, BC, and CD loops, and the Cys-Ser-Ser-Cys bulge. These changes removed several of the exposed hydrophobic side chains evident in the C5-C345C structure (Fig. 1A). Nonetheless, all the mutants in this set showed full affinity for C6 and C7.2 One region that has not so far been explored by mutagenesis, however, is that made up from the exposed face of the
Conclusions. This work confirms that the multifunctional C-terminal 150 residues of C5 constitute an independently folding unit that is an example of an NTR module. NTR modules are compactly folded and globular, containing a
The atomic coordinates (code 1XWE) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by the Medical Research Council of the UK, the Wellcome Trust, and National Institutes of Health Grant GM29831. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ¶ To whom correspondence should be addressed: Schools of Chemistry and Biological Sciences, Joseph Black Chemistry Bldg., University of Edinburgh, West Mains Road, Edinburgh EH9 3JJ, Scotland, UK. Tel.: 44-131-650-4727; Fax: 44-131-650-7055; E-mail: Paul.Barlow{at}ed.ac.uk.
1 The abbreviations used are: MAC, membrane attack complex; C5-C345C, residues Ala1512 to the C-terminal Cys1658 of human C5; FIMAC, factor I membrane attack complex; OB, oligosaccharide/oligonucleotide binding; NOE, nuclear Overhauser effect; NOESY, NOE spectroscopy; NTR, netrin-like; PCOLCE, type I procollagen C-proteinase enhancer protein; r.m.s.d., root mean square deviation; SPR, surface plasmon resonance; TIMP, tissue inhibitor of metalloproteinase; WT, wild type; PDB, Protein Data Bank.
2 C.-T. Thai and R. T. Ogata, unpublished data.
We thank Dr. M. Rance of the University of Cincinnati, J. Bella and Dr. K. Bromek of the Edinburgh Biomolecular NMR Unit, and Dr. N. Assa-Munt.