|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 32, 23219-23230, August 10, 2007
Crystal Structure of the N-terminal NC4 Domain of Collagen IX, a Zinc Binding Member of the Laminin-Neurexin-Sex Hormone Binding Globulin (LNS) Domain Family*![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() 1
From the
Received for publication, March 23, 2007 , and in revised form, May 22, 2007.
Collagen IX, located on the surface of collagen fibrils, is crucial for cartilage integrity and stability. The N-terminal NC4 domain of the 1(IX) chain is probably important in this because it interacts with various macromolecules such as proteoglycans and cartilage oligomeric matrix protein. At least 17 distinct collagen polypeptides carry an NC4-like unit near their N terminus, but this report, describing the crystal structure of NC4 at 1.8-Å resolution, represents the first atomic level structure for these domains. The structure is similar to previously characterized laminin-neurexin-sex hormone binding globulin (LNS) structures, dominated by an antiparallel -sheet sandwich. In addition, a zinc ion was found in a position similar to that of the metal binding site of other LNS domains. A partial backbone NMR assignment of NC4 was obtained and utilized in NMR titration studies to investigate the zinc binding in solution state and to quantitate the affinity of metal binding. The Kd of 11.5 mM suggests a regulatory rather than a structural role for zinc in solution. NMR titration with a heparin tetrasaccharide revealed the presence of a secondary binding site for heparin on NC4, showing structural and functional conservation with thrombospondin-1, but a markedly reduced affinity for the ligand. Also the overall arrangement of the N and C termini of NC4 resembles most closely the N-terminal domain of thrombospondin-1, distinguishing the two from the majority of the published LNS structures.
Collagens are a family of extracellular matrix proteins comprised of at least 28 distinct types, all characterized by the presence of one or more elongated triple helical regions. This common motif is supplemented with a variety of non-helical domains to add unique properties for molecular or supramolecular assembly, for tissue-specific targeting, or for interactions with other constituents of the matrix (1, 2). An example of such a supplementary element is a module showing similarity to the N-terminal domains of thrombospondins (TSPNs)2 (3). This domain occurs in at least 17 different collagen polypeptides, located N-terminal to the triple helix, but little is known about the detailed structures and physiological functions of these units (2).
One of the TSPNs found in collagen chains is the NC4 domain of collagen IX, formed by about 245 N-terminal residues of the mature
An atomic level structure is currently not available for any of the collagen TSPN units. On the basis of amino acid sequence conservation and the apparent similarities of the secondary structural elements, it was predicted that the TSPN module is homologous with pentraxins and with members of the laminin-neurexin-sex hormone binding globulin (LNS) domain family (18, 19). Three-dimensional structures of LNS modules from the laminin To understand the physiological function of the NC4 domain and its homologues in other collagens, knowledge of the three-dimensional structure is essential. We have previously reported the production and initial characterization of the recombinant NC4 domain of human collagen IX (7). Here we describe the crystal structure of NC4, the first collagen LNS domain solved, and show that NC4 is indeed a metal-coordinating LNS module with binding sites for multiple ligands.
Production of Recombinant NC4 Domain—The recombinant NC4 domain of human collagen IX without the 23-residue signal peptide was produced and purified essentially as described previously (7). The residue numbering used below starts from the first residue of the mature, secreted polypeptide. Instead of ammonium sulfate precipitation, the clarified cell lysate was subjected directly to affinity chromatography on a glutathione-Sepharose column (GE Healthcare) using Factor Xa protease (GE Healthcare) digestion to elute the NC4. After heparin affinity and cation exchange chromatography steps, the sample was further polished by gel filtration on a Superdex 75 column (GE Healthcare), dialyzed into water, and lyophilized. In vitro mutagenesis (Stratagene) was used to create a mutant form of NC4, termed NC4Ndel, lacking the first 17 residues of the mature polypeptide and replacing, for technical reasons, Asn18 and Glu19 with Asp and Leu, respectively. We also separately made the D190A/D192A double mutant. CD spectropolarimetry and two-dimensional NMR spectroscopy were used to verify that the mutations did not cause abnormal folding of NC4.
Selenomethionine incorporation was achieved by inoculating the Escherichia coli RosettaTM(DE3) cells (Novagen) into LB broth and growing at 37 °C to an absorbance at 600 nm of about 0.7. L-Selenomethionine was then added to 60 µg/ml, expression was induced after 10 min with 1 mM isopropyl 1-thio- Crystallography—Crystals of NC4 were grown in sitting drops over a reservoir solution of 100 mM MES, pH 6.5, 10 mM ZnSO4, and 20% monomethyl ether-polyethylene glycol 550. The drops were prepared by mixing 2 µl of the reservoir solution and 2 µl of the protein solution at 20 mg/ml. The crystals belong to space group I4 (a, b = 81.88 Å, c = 71.43 Å) with one molecule per asymmetric unit and a solvent content of 44%. For data collection at –180 °C, crystals were frozen in liquid nitrogen with the well solution containing 10% (v/v) ethylene glycol. Multiwavelength anomalous diffraction (MAD) data on a frozen selenomethionine derivative crystal were collected to 1.8 Å using the BM14U beamline at European Synchrotron Radiation Facility (Grenoble, France) at three wavelengths (Table 1). The data sets were indexed and scaled with the program X-ray Detector Software (33). The Crystallography NMR software (Ref. 34) was used to find the three expected selenium sites. A fourth, strong anomalous scatterer was found, showing an increasing signal from the remote to the selenium K edge inflection point wavelength (0.9797 Å). The scatterer was initially assigned as a Zn2+ ion due to its presence in the crystallization buffer and the fact that it has the K absorption edge at 1.283 Å. The selenium sites together with the Zn2+ were used to estimate the experimental MAD phases at 2.0 Å in Crystallography NMR software (Table 1). The electron density map obtained upon solvent flipping in Crystallography NMR software was used for initial model building.
We built an initial model of residues 19–24 and 30–233 using O (35). Using the inflection point data, this model was subjected to iterative rounds of refinement in Crystallography NMR software (34) and manual rebuilding. Refinement was carried out using energy minimization, simulated annealing, and individual temperature factor (B-factor) refinement. Water molecules were added to peaks above 3.5 in the Fo – Fc difference map if they had suitable hydrogen bonding geometry. The final model, with good stereochemistry (Table 1), consists of 210 residues of which nine side chains display alternative conformations. In addition to the Zn2+, the solvent model includes a sulfate ion, four ethylene glycol, and 209 water molecules. Residues 1–18, 25–29, and 234–245 are not seen in the electron density. PROCHECK (36) was used to assign secondary structure elements and calculate the Ramachandran plot. Of all the non-Gly/non-Pro residues, 86.1% have main chain torsion angles in the most favored regions, and there are no residues in the disallowed regions. Structural comparisons were carried out using DALI (37).
NMR Spectroscopy—For the backbone chemical shift assignment of NC4, HNCACB and HN(CO)CACB spectra were acquired at 40 °C on a Varian Unity INOVA spectrometer operating at 600-MHz 1H frequency and equipped with a cryogenically cooled probe head and an actively shielded z-gradient system. The sample contained To obtain apo-[15N]NC4, the NMR sample was washed free of divalent cations in the presence of 2 M NaCl and 100 mM EDTA on a spinnable concentrator (Millipore) by concentrating 10-fold followed by a similar wash without added EDTA. Additional washing steps with 20 mM BisTris buffer, pH 6.5, were carried out until the calculated concentration of EDTA was 20 µM or less. This procedure gave a two-dimensional 15N HSQC spectrum identical to that from the size exclusion-based method that we adapted from the procedure used for removal of Ca2+ from calerythrin (39).
For NMR titration with divalent cations as ligands, a Varian Unity INOVA 600 MHz spectrometer equipped with an actively shielded z-axis gradient probe head was used to measure a series of two-dimensional 15N HSQC spectra for apo-[15N]NC4 in the absence or presence of cations. For the Zn2+ titration, a series of spectra was recorded in which the concentration of ZnCl2 was gradually increased. Concentrations corresponding to molar ratios of 0:1, 1:1, 5:1, 10:1, 20:1, 40:1, 60:1, and 80:1 of Zn2+ to NC4 were used, and a 1H, 15N HSQC spectrum was acquired at each titration point. Average chemical shift changes were calculated with equation The titration spectra with heparin oligosaccharides were acquired at 600-MHz 1H frequency (40) in 33 mM NaCl, 20 mM BisTris buffer, pH 6.5 at 40 °C, 92.5% H2O, 7.5% D2O. For titration with the heparin tetrasaccharide (Dextra Laboratories, Reading, UK) concentration ratios of 0:1, 3:1, 6:1, 15:1, 24:1, and 36:1 were used, and an averaged dissociation constant was determined as above using data from five residues. To investigate the interaction of naturally occurring retinoids with NC4, solid all-trans-retinoic acid or vitamin D3 (Sigma-Aldrich) was incubated for at least 24 h with [15N]NC4 at 20 °C in the presence of Zn2+ or EDTA (41). Insoluble retinoid was removed by centrifugation, and a two-dimensional 15N HSQC spectrum was measured as above. Alternatively concentrated stock solution of retinoid prepared in deuterated dimethyl sulfoxide was added in excess to a [15N]NC4 NMR sample, keeping the amount of Me2SO below 1%, and a two-dimensional 15N HSQC spectrum was measured (42).
Structure Determination—The three-dimensional crystal structure of NC4 was solved by collecting MAD data from a crystal of selenomethionine-substituted protein. In addition to the three expected selenium sites, a Zn2+ cation was found and used in phasing. The asymmetric unit encompasses a single NC4 domain. The model was refined at 1.8-Å resolution to a crystallographic R factor of 18.8% and an Rfree of 21.4%. Data collection and refinement statistics are summarized in Table 1. Analysis using PROCHECK (36) shows that all non-glycine and non-proline residues are in the most favored or additionally allowed regions of the Ramachandran plot. Except for residues 1–18, 25–29, and 234–245, the main chain is well defined by the electron density.
Structure of NC4—The NC4 structure has a globular lectin-like architecture and contains 14
Although the C-terminal portion of the structure is covered by the lectin-like
A strong anomalous scatterer was found when the MAD phases were estimated. The anomalous signal (Fig. 2) for this scatterer was highest in the selenium K edge inflection point data set, and because 10 mM ZnSO4 was required for crystallization, we assigned the anomalous scatterer as Zn2+. Crystals could not be obtained in the presence of Ca2+ or Mg2+. The Zn2+ binding site is located on an edge of the
NMR Chemical Shift Assignment—The dispersion for the majority of the cross peaks in the 1H, 15N HSQC spectrum of NC4 is good. Unfortunately in the middle of the spectrum there is a cluster of peaks with many of the peaks overlapped and some of very high intensity. This indicates that part of the NC4 main chain contains highly flexible parts. A total of 208 backbone cross-peaks could be distinguished in the spectrum.
Backbone chemical shift assignment of NC4 was pursued with three-dimensional HNCACB and HN(CO)CACB spectra (47, 48). We acquired this pair of spectra from three different samples, a 2H, 13C, 15N triple-labeled sample at pH 4 and 7.5 as well as a 13C, 15N double-labeled sample at pH 6.5. The latter gave the best spectra and highest assignment percentage when measured with a cryoprobe head. A C
NMR spectra for the mutant NC4Ndel lacking the N-terminal, apparently flexible stretch of residues were acquired in an attempt to increase the spectral quality in the central crowded region. There was, however, no significant change in the quality of spectra (results not shown). The overall fold of the domain appeared not to be markedly affected by the deletion. NMR Titration of NC4 with Zn2+ Shows a Low Affinity Interaction—ZnCl2 was titrated into a sample of 15N-labeled NC4. All the chemical shift changes fell into the fast exchange limit on the NMR time scale as monitored by 1H, 15N HSQC. All cross-peak movements were linear, suggesting that the interaction is bimolecular. No additional cross-peaks appeared upon titration. The largest cross-peak movements were observed for residues Thr76, Asp112, Ser113, Glu117, Lys135, Arg166, and Ile189–Phe194, which are located either in the loops coordinating Zn2+ in the crystal structure or in the neighboring, hydrogen-bonded strands. Of these residues, the five that could be traced with the highest confidence in the Zn2+ titration spectra were used in plotting the averaged chemical shifts as a function of Zn2+ concentration to obtain dissociation constants for the individual binding curves (Fig. 3). After averaging, an equilibrium dissociation constant Kd of 11.5 ± 1.1 mM for Zn2+ was obtained. Titration with Ca2+ or Mg2+ did not result in perturbation of chemical shifts even with a 25-fold molar excess of the cation.
NMR Titration with Heparin Oligosaccharides Reveals a Low Affinity Binding Site—NMR titrations of 15N-labeled NC4 with heparin di-, tetra-, or hexasaccharides or longer heparin fragments were performed. When overlaying the titration spectra, linear movements were seen for less than 10% of the cross-peaks upon titration with heparin di- or tetrasaccharide (not shown). Most of these chemical shift perturbations were in one part of the NC4 molecule formed by loops 3– 4 and 13– 14, strand 5, and the neighboring helix 4, suggesting the presence of a previously unidentified, low affinity heparin binding site on NC4. Analysis of the perturbation data obtained with a tetrasaccharide gave a Kd of 1.0 mM (Fig. 4). The primary heparin binding site of NC4 is located within the first eight residues of the domain (7); these are currently of undetermined structure and without a backbone chemical shift assignment. The cross-peaks originating from these residues may be among the few unassigned peaks demonstrating chemical shift perturbation upon titration with the tetrasaccharide, or they may be undetectable under the experimental conditions used. Notably among the assigned residues Val10 is located closest to the N terminus and shows clear perturbation upon heparin addition (data not shown). In addition, when the tetrasaccharide titration was performed at pH 5, large perturbations were seen for four unassigned cross-peaks reaching saturation at a 10-fold lower ligand concentration (data not shown). Titrations with heparin hexasaccharide or larger fragments resulted in precipitation of NC4 and an associated reduction of the spectral intensity and quality even at subequimolar amounts of the ligands. The same effect was seen with highly sulfated heparin analogues such as sucrose octasulfate and myo-inositol hexasulfate (not shown).
NC4 Lacking the N Terminus Has Only Residual Heparin Affinity—To verify our initial mapping of the heparin binding site of NC4 to the N terminus of the domain (7), we created mutant NC4Ndel lacking the first 17 residues of the domain. Analytical heparin affinity chromatography showed that N-terminally truncated NC4 could not bind to the column at a physiological salt concentration. When bound to the column in a buffer containing no added NaCl, NC4Ndel eluted at 0.13 M salt compared with the 0.31 M salt required for elution of wild-type NC4 (Fig. 5). The D190A/D192A double mutant, which cannot bind Zn2+, bound to the heparin column as tightly as wild type, and the presence of excess EDTA had no significant effect on the elution profile of wild-type NC4 (data not shown). This indicates that a Zn2+ cation bound to NC4 is unlikely to affect its interaction with a glycosaminoglycan in vivo.
Comparison of the Structural Model of NC4 with Other Structures—Searching the Protein Data Bank using DALI (37) revealed that NC4 has structural similarities to the large number of lectins and lectin-like domains in the concanavalin A-like lectin and glucanase superfamily (Structural Classification of Proteins; Ref. 49). Despite the modest sequence identity ( 16%), all of the structurally homologous domains share the central lectin fold feature: a pair of antiparallel -sheets curved to form a concave and a convex side. NC4 is structurally closest to the 23.5-kDa TSPN-1 (Ref. 27; Fig. 6A) and can be superimposed with a root mean square deviation of 2.8 Å for 187 C atoms. In addition to TSPN-1 and several glycolytic enzymes, the top hits include serum amyloid P component (28), neurexin 1 (22), calnexin (24), laminin G-like domain (LG5; Ref. 20), agrin G3 domain (26), Gas6 (25), and SHBG (21). These multifunctional protein domains, including TSPN-1, are involved in heparin, steroid ligand, and protein-protein interactions rather than in binding simple carbohydrates. They are referred to collectively as LNS domains (22, 26, 32). Because NC4 may function in both protein-protein interactions and heparin binding (7), we include NC4 as a new member of the LNS domain family.
The significant structural similarity between NC4 and the other LNS domains is mainly in the
NC4 and the metal ion-binding LNS domains have a similar metal binding site. The Ca2+ ions in the laminin LG5 and agrin G3 domains have octahedral coordination with four protein ligands (20, 26). The tetragonal coordination of Zn2+ in NC4 consists of three protein ligands, two from a single molecule and a third one from a symmetry-related molecule of the crystal. In NC4, the long connecting loop between -strands 12 and 13 provides two aspartate side chains, and the hairpin loop between -strands 6 and 7 is hydrogen-bonded to the Zn2+ via a water molecule and the Ser113 side chain (NC4 strand numbering; Figs. 1, 2, and 6C). In laminin LG5 and agrin G3 domains both of the corresponding loops contribute an aspartate side chain to the Ca2+ coordination (Fig. 6C). The laminin LG5 and agrin G3 12- 13 loops and 8- 9 hairpin turns complete the coordination by one main chain carbonyl oxygen each. In SHBG, the Zn2+ ion affecting ligand binding is coordinated by histidines both from the long connecting loop 12- 13 and the hairpin turn 8- 9 and by aspartate from the 6– 7 hairpin loop (51). In a very recent report, the second LNS/LG domain of neurexin 1 (1 _LNS#2; Ref. 23) was shown to have a Ca2+ ion in a similar position and with analogous coordination to laminin LG5 and agrin G3 domains. However, the neurexin 1 _LNS#2 has only three protein ligands due to the lack of counterpart for the aspartate in the 12- 13 loop. The metal ions are up to 13 Å apart when the structures are superimposed (Fig. 6, B and C).
The Central Cavity of NC4 Does Not Incorporate Retinoids—SHBG is known to accommodate various hydrophobic compounds, i.e. steroid hormones, at the central cavity between the
Sequence Alignment ot the FACIT Collagen LNS Domains Suggests Specific Features—A sequence alignment for the collagen LNS units has been reported (52), but it contained data from multiple species and did not include all appropriate FACIT sequences. To facilitate the evaluation of the conserved properties of the FACIT LNS domains, we performed an alignment of the amino acid sequence of NC4 with sequences of other LNS modules of human collagens with ClustalW (Fig. 7). The TSPN-1 sequence was included as the most homologous published structure, and the laminin 2LG5 was included as a representative of other thoroughly characterized LNS modules. The 1 chain of collagen XI was included to represent the LNS modules of collagens V, XI, XV, XVIII, XXIV, and XXVII, which are not members of the FACIT subfamily. The alignment shows that the -strands aligning with the strands 5- 7, 10, 11, 13, and 14 of NC4 present the highest similarity of residues at corresponding positions. The central thirds of the sequences present the best alignment, whereas more gaps are inserted at the N- and C-terminal thirds. As already noted above based on structural superimpositions, the overall arrangement of the N and C termini of NC4 is homologous with that in a recent model of the TSPN-1 (27). In contrast, other LNS modules of proteins outside the collagen superfamily, as represented by the laminin 2LG5 sequence in the alignment (Fig. 7), appear to be devoid of a region corresponding to the N-terminal third of the NC4 domain. At the same time, these proteins, with the exception of TSPN-1, lack the C-terminal cysteine of NC4 and its conserved surroundings. In 2LG5 the cysteine closest to the C terminus is used to link it to the -strand corresponding to the 14 of NC4. Close homologues of 2LG5 thus adopt different N- and C-terminal folds compared with the LNS modules of the FACITs and the TSPN-1. In fact, the Cys21-Cys219 disulfide bond that is used to stabilize the N terminus of NC4 seems to be present in all FACIT collagens with the exception of collagen XXI but absent in all published LNS models and in LNS modules of other collagens (Fig. 7). Although the secondary structure elements of the N-terminal third of NC4 appear to have counterparts in other FACITs, the numerous gaps in the alignment at this region suggest that the N termini of FACITs are likely to show specific differences in their folding.
The fibril-associated collagen IX and other FACIT collagens are proposed to mediate the interaction of a collagen fibril with neighboring fibrils and with other extracellular matrix components. In cartilage, collagen IX covers collagen fibrils so that the NC4 and COL3 domains project away from the fibril body. Several interactions have been suggested, especially for the NC4 domain, reflecting its position and its putative homology to multifunctional LNS domains involved in glycan, proteoglycan, protein, and steroid ligand binding. The functional significance of LNS modules found at the N terminus of numerous collagens is nevertheless mostly a mystery.
To study the NC4 structure-function relationship at the atomic level, we have solved the crystal structure at high resolution. Furthermore having assigned NMR chemical shifts for the majority of residues, we used NMR titration to study the binding of various ligands. The NC4 structure is dominated by two, curved
Many of the LNS domains showing homology to NC4 either are known to have metal ion-dependent ligand binding activities or their crystal structure contains at least one metal ion, usually Ca2+. The Ca2+ ion in the Gas6 G domain pair interface (25) as well as the one in the calnexin convex sheet (24) is structural rather than functional. On the other hand, the Ca2+ ions in the laminin LG5, the agrin G3, and the neurexin 1
The biological function of the NC4 Zn2+ ion remains elusive. Our studies by NMR and affinity chromatography demonstrate that Zn2+ has no effect on the heparin affinity of NC4. However, the NC4 Zn2+ data in this study and the recent agrin G3 (26) and neurexin 1
NC4 binds to COMP in the presence of Zn2+. Initially COMP was shown to bind type I and type II collagens in a Zn2+-dependent manner (59). We, and others, have shown that COMP binds to collagen IX at several locations, including the NC4 domain (6, 7, 60). The high affinity binding is Zn2+-dependent, and the interaction is via the COMP C-terminal domain. The regulatory effect of Zn2+ on collagen interaction is apparently caused by binding of the metal to the COMP C-terminal domain, resulting in a change of the conformation of COMP (6). Unfortunately the atomic level basis of this putative Zn2+ dependence is poorly understood, and thus it remains to be seen whether a Zn2+ ion, possibly in the binding site identified here, has a direct role in the interaction of NC4 with COMP. Considering other possible ligands of NC4,
The FACIT subfamily currently contains collagens IX, XII, XIV, XVI, XIX, XX, XXI, and XXII (2). Within this group, the N terminus of the molecule is formed by an LNS motif in the case of collagens IX, XVI, and XIX, whereas the rest of the members contain additional motifs, namely von Willebrand factor A-domains and fibronectin type III repeats, located N-terminal to the LNS module. The sequence alignment of FACITs performed here (Fig. 7) shows that they are likely to present highly similar folds. The differences appear to be confined mainly to loop regions, whereas the overall arrangement of the domain is probably conserved. The apparently flexible regions corresponding to the loop between Cys21 and the The sequence alignment (Fig. 7) also shows that the residues coordinating the Zn2+ in NC4 show limited conservation in the other FACITs. This does not indicate, however, that they could not bind divalent cations because the metal identity, location, and coordinating groups, often involving backbone carbonyl oxygen atoms, are quite variable in the published LNS structures. The lengths of the loops participating in the Zn2+ coordination in NC4 are apparently well conserved within the FACIT subfamily, suggesting that metal coordination may be a common characteristic. This can only be confirmed by experimental studies, however. Metal binding is apparently a property of the homologous proline/arginine-rich protein domain of the non-FACIT collagen XI (52). Many of the LNS modules are capable of binding the highly sulfated glycosaminoglycan heparin. We have shown previously that NC4 also interacts with heparin with a Kd of 0.6 µM and mapped the major binding site to the extreme N terminus of the domain (7). The recent crystallographic model of TSPN-1 (27) included a synthetic pentameric heparin as a ligand contacted by the side chains of Arg29, Arg42, and Arg77 of TSPN-1. From our sequence alignment (Fig. 7), it is clear that these residues have positively charged counterparts in NC4 at corresponding locations, i.e. Arg53, Lys65, and Arg95. The NMR studies performed here suggest that this region of NC4 does indeed show functional conservation of the heparin binding. This site has an estimated Kd of 1 mM for heparin; this is several orders of magnitude higher than the Kd of 71–80 nM reported for the TSPN-1 (63). Closer examination of the crystal structure of NC4, compared with that of TSPN-1 (27), offers some possible explanations for the dramatic differences in the affinities of the two sites. In NC4, the side chains of Lys65 and Arg95 are not well exposed, and the side chain of Arg53 is oriented away from these two residues so that a proper contact surface for heparin is not formed (data not shown). Of these three basic residues of NC4, Arg53 does not have a backbone NMR assignment, Lys65 shows negligible chemical shift perturbation upon heparin titration, and the cross-peak representing Arg95 disappears upon the first titration step. We, therefore, had to calculate the Kd by monitoring the adjacent residues in the sequence. The surroundings of Lys65 and Arg95 in the electron density map of NC4 have no suspicious density that could originate from a sulfate ion. A SO2–4 is found at the region containing the positively charged residues His218, Arg223, Arg225, and Arg226 of NC4, but no evidence of heparin binding to this site is seen upon NMR titration. Furthermore NC4Ndel lacking the positively charged N terminus does not bind heparin at physiological salt concentration (Fig. 5), thus confirming our earlier mapping of the binding site. Consequently despite the apparent conservation of the heparin binding basic residues of TSPN-1 in NC4, a completely different basic stretch is responsible for binding of the polyanionic glycan in NC4. This offers yet another reminder of the importance of experimental verification when making predictions on the structure-function relationships of proteins on the basis of sequence homology and existing homologous structures.
Within the FACIT subfamily, collagens XXI and XXII also contain a cluster of basic residues at their N terminus (Fig. 7), but their activity in heparin binding is unknown. On the other hand, most FACITs apparently have positively charged side chains at the location corresponding to the TSPN-1 heparin binding site (Fig. 7), but their heparin binding has not been studied experimentally either. A very recent report (64) describes the binding of heparan sulfate to the LNS module of collagen XI
The crystallographic model of the NC4 domain presented here is the first experimentally determined structure available for a collagen LNS module and provides an atomic level basis for understanding the structure-function relationship of NC4. The NMR chemical shift assignment of the backbone amide bonds of NC4 performed here enables detailed analyses of the molecular interactions of NC4 at atomic resolution as exemplified by the results of the Zn2+ and heparin tetrasaccharide titrations. The NC4 crystal structure revealed a Zn2+ in a position similar to that of other metal ion-dependent LNS domains. NMR titration with Zn2+ mapped the same binding site in solution but also revealed its nature of low affinity. Ligand binding in the other metal ion-dependent LNS domains is regulated by low affinity metal binding sites. Similarly the NC4 metal ion may control ligand binding suggesting that this common ligand interaction region is important for NC4 as well. The N-terminal deletion mutant confirmed the major heparin binding site, and NMR titration with a heparin tetrasaccharide revealed a secondary site homologous to TSPN-1. The Zn2+ ion is not crucial for heparin binding, supporting its putative role in modulating NC4 multifunctionality. Finally the NC4 structure reveals new insights into the general properties of the LNS family of extracellular protein domains, and it will serve as a model in the detailed characterization of the respective domains of the other FACIT collagens.
* This work was supported by Academy of Finland Grants 106411 (to T. P.), 1206228 (to I. K.), 70891 (to V.-M. L.), 1105157 and 1111771 (to A. G.), and 106852 (to P. P.) as well as by funding from the Sigrid Juselius Foundation and Biocentrum Helsinki (to A. G.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
The atomic coordinates and structure factors (code 2UUR) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/). 1 To whom correspondence should be addressed: NMR Laboratory, Inst. of Biotechnology, P. O. Box 65, University of Helsinki, FI-00014 Helsinki, Finland. Tel.: 358-9-19159544; Fax: 358-9-19159541; E-mail: Tero.Pihlajamaa{at}helsinki.fi.
2 The abbreviations used are: TSPN, N-terminal domain of thrombospondin; COMP, cartilage oligomeric matrix protein; FACIT, fibril-associated collagen with interrupted triple helices; HSQC, heteronuclear single quantum coherence; LNS, laminin-neurexin-sex hormone binding globulin; MAD, multiwavelength anomalous dispersion; SHBG, sex hormone binding globulin; TSPN-1, N-terminal domain of thrombospondin-1; MES, 4-morpholineethanesulfonic acid; BisTris, 2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol.
We thank Harri Heikkinen for preparation and analyses of mutated NC4 and Anne Hakonen for expert technical assistance. We acknowledge the European Synchrotron Radiation Facility for provision of synchrotron radiation resources, and we thank Dr. Gavin Fox for assistance in using beamline BM14U. We also thank Dr. Tomi Airenne for help with the data collection.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||