|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 281, Issue 9, 5845-5851, March 3, 2006
Solution Structure and Novel Insights into the Determinants of the Receptor Specificity of Human Relaxin-3*![]() ![]() ![]() 1
From the
Received for publication, October 14, 2005 , and in revised form, December 12, 2005.
Relaxin-3 is the most recently discovered member of the relaxin family of peptide hormones. In contrast to relaxin-1 and -2, whose main functions are associated with pregnancy, relaxin-3 is involved in neuropeptide signaling in the brain. Here, we report the solution structure of human relaxin-3, the first structure of a relaxin family member to be solved by NMR methods. Overall, relaxin-3 adopts an insulin-like fold, but the structure differs crucially from the crystal structure of human relaxin-2 near the B-chain terminus. In particular, the B-chain C terminus folds back, allowing TrpB27 to interact with the hydrophobic core. This interaction partly blocks the conserved RXXXRXXI motif identified as a determinant for the interaction with the relaxin receptor LGR7 and may account for the lower affinity of relaxin-3 relative to relaxin for this receptor. This structural feature is likely important for the activation of its endogenous receptor, GPCR135.
Recent developments have caused considerable excitement in the relaxin field, including the discovery of a new member of the relaxin family, relaxin-3 (1), and the identification of several long sought after relaxin receptors (2). Prior to the discovery of the relaxin-3 gene (RLN3), only one relaxin gene had been characterized in most mammals, with the exception of humans and higher primates, in which two separate genes, RLN1 and RLN2, were known (3, 4). The product of the human RLN2 gene, human relaxin-2 (H2 relaxin), is the functional ortholog of the RLN1 gene product from non-primate species, which is termed relaxin. The function of the product of the human RLN1 gene, human relaxin-1 (H1 relaxin), is unknown; and indeed, a native H1 relaxin peptide has not been isolated (5). Hence, throughout this work, "relaxin" will refer to the pregnancy hormones H2 relaxin and non-primate relaxin. Relaxin-3 is the ancestor of the entire relaxin peptide family (5), and RLN3 genes have been identified in all mammalian genomes as well as in the genomes of chicken, frog, and various fish species. In contrast, the RLN1 gene is found only in mammals. Fig. 1 shows a sequence comparison of the product of the human RLN3 gene, human relaxin-3 (H3 relaxin); H2 and H1 relaxins; and relaxin-3 orthologs from other species. Interestingly, the relaxin-3 sequences are highly conserved between species, in contrast to relaxin, which shows considerable sequence variation. Relaxin has long been regarded as a hormone mainly associated with pregnancy. Produced in the corpus luteum and/or placenta of most mammals, it has numerous pregnancy-specific actions, including the remodeling of the reproductive tract and preparation of the mammary apparatus for lactation (6). However, relaxin also has important physiological roles outside of pregnancy. It inhibits collagen biosynthesis and promotes collagen breakdown (7-9) and causes vasodilatation in various tissues (10). Interestingly, the expression pattern of relaxin-3 differs significantly from that of relaxin, suggesting a distinctly different physiological role. The highest expression of relaxin-3 in all species examined to date, including humans, is in the brain. In rats (11) and mice (1), the expression is localized to a specific region of the dorsal tegmental region of the pons called the nucleus incertus. Anatomical studies suggest that this nucleus is involved in a midbrain behavior control network that influences circuits regulating locomotion, attention, and learning processes and that responds to stress-related neuroendocrine signals (12). Mapping studies have demonstrated that relaxin-3-immunoreactive nerve fibers emanating from the nucleus incertus innervate numerous regions of the brain, suggesting that the nucleus incertus utilizes relaxin-3 as a neurotransmitter (13). These data, coupled with the remarkable conservation of the relaxin-3 peptide from fish to humans, suggest that relaxin-3 has highly important, but presently unknown, central actions. Indeed, a very recent study suggests that relaxin-3 is involved in appetite regulation (14), and another study has demonstrated that relaxin-3 mRNA production is increased in response to water-restraint stress (13), suggesting a role in the stress response. Members of the relaxin family are structurally similar to insulin, comprising two peptide chains that are linked by two disulfide bonds, with the A-chain containing a third, intrachain disulfide bond. However, unlike insulin, whose receptor is a tyrosine kinase, members of the relaxin peptide family bind to G-protein-coupled receptors (GPCRs).2 The first receptors identified responding to relaxin stimuli were the leucine-rich repeat-containing GPCRs LGR7 and LGR8 (2), which both have low nanomolar affinity for relaxin and which are capable of mediating the actions of relaxin through a cAMP-dependent pathway (2). Interestingly, relaxin-3 also activates LGR7, albeit with a lower affinity than relaxin, but has a significantly reduced affinity for LGR8 (15). Recent work has identified two additional orphan GPCRs, GPCR135 and GPCR142, which also respond to relaxin-3 activation (16, 17). On the basis of the higher affinity of relaxin-3 for GPCR135 and their coexpression in regions of the brain, it was concluded that relaxin-3 is the endogenous ligand for GPCR135, whereas the primary ligands for LGR8 and GPCR142 are two other members of the relaxin peptide family, INSL3 (insulin-like peptide-3 or Leydig insulin-like peptide) (18) and INSL5 (insulin-like peptide-5) (19), respectively. The cross-reactivity of these hormone-receptor signaling systems has raised questions about which factors are responsible for the selectivity of these ligand-receptor pairings. Moreover, as relaxin-3, GPCR135, and LGR7 are all expressed in the brain, understanding the significance of the interaction of relaxin-3 with LGR7 will be crucial in determining its function.
In this study, we provide a crucial piece of the puzzle by presenting the solution structure of the human form of the most recently discovered member of this hormone family, relaxin-3. The structure reveals a relaxin/insulin fold, but with an unusual conformation of the C-terminal region of the B-chain. The structural data provide insights into the receptor specificities of this new member of the relaxin peptide family.
Peptide SynthesisThe A- and B-chains were assembled as C-terminal peptide acids or amides by Fmoc (N-(9-fluorenyl)methoxycarbonyl) solid-phase synthesis on a hydroxymethylphenoxyacetyl- or 5-(4-aminomethyl-3,5-dimethoxyphenoxy)valeryl-polyethylene glycol/polystyrene support using the following selective cysteine S-protection: Cys A10/A15 trityl; CysA11/B11, acetamidomethyl; CysA24, t-butyl; and CysB22, trityl. Following conventional trifluoroacetic acid cleavage in the presence of scavengers, the intramolecular disulfide bond of the A-chain was formed by aeration, and CysA24 (t-butyl) was displaced with S-pyridinyl by reaction with 2,2'-dipyridyl disulfide in trifluoromethanesulfonic acid. Combination of the peptide with the B-chain occurred by thiolysis, after which the third and final disulfide bond between A11 and B10 was formed by iodolysis. The purity of the peptide was confirmed by MALDI-TOF mass spectrometry (for example, peptide acid: theory, 5500.5; and found, 5499.6) and analytical reversed-phase HPLC. Biological Activity AssaysH3 relaxin acid and amide were tested for their ability to activate LGR7, LGR8, and GPCR135. Stably transfected human embryonic kidney 293T cells expressing human LGR7 or LGR8 (15) and Chinese hamster ovary K1 cells stably expressing human GPCR135 (20) were stimulated for 30 min with various concentrations of H3 relaxin acid or amide. LGR7 and LGR8 were tested in parallel with H2 relaxin and human INSL3, respectively (both at 100 nM), and GPCR135 with forskolin (5 µM) to determine maximum cellular cAMP response. cAMP accumulation was measured in cell lysates as described previously (15).
NMR SpectroscopyPulsed-field gradient NMR diffusion experiments were performed with a two-dimensional sequence using stimulated echo longitudinal encode-decode (21). The lengths of all pulses and delays were held constant, and 32 spectra were acquired with the strength of the diffusion gradient varying between 2 and 95% of its maximum value. The lengths of the diffusion gradient and the stimulated echo were optimized for each sample to give a total decay in the protein signal of
Distance restraints were derived primarily from a 100-ms NOE correlation spectrum recorded at 298 K and 600 MHz. Cross-peaks were assigned and integrated in XEASY and converted to distance restraints using CYANA. Distance restraints for cross-peaks that could not be unambiguously assigned were introduced into structure calculations as ambiguous restraints. Backbone dihedral restraints were inferred from 3JNH-H
Protein structures were analyzed using PROMOTIF and PRO-CHECK and displayed using MOLMOL. Ramachandran analysis showed that 83% of the residues are in the most favored regions, with the remaining in the additionally allowed (15%) and generously allowed (2%). The coordinates representing the solution structure of H3 relaxin and the experimental restraints have been submitted to the Protein Data Bank (Code 2FHW).
Peptide SynthesisH3 relaxin was prepared as both its C-terminal acid and amide forms by solid-phase peptide synthesis of the separate A- and B-chains together with regioselective disulfide bond formation (25). Chemical characterization by reversed-phase HPLC and MALDI-TOF mass spectrometry confirmed the homogeneity of the products. Tryptic mapping followed by MALDI-TOF mass spectrometry showed that the disulfide bonds were in the correct insulin-like pairings. Biological ActivityH3 relaxin acid was tested for its ability to stimulate human LGR7, LGR8, and GPCR135 in parallel with the H3 relaxin amide peptide. As shown in Fig. 2, the activities of the peptides were identical. In LGR7-expressing cells, both peptides stimulated cAMP accumulation to levels similar to those achieved with H2 relaxin (H3 relaxin acid pEC50 = 8.08 ± 0.07 (n = 4) and H3 relaxin amide pEC50 = 8.19 ± 0.078 (n = 3)). In GPCR135-expressing cells, both peptides decreased forskolin-stimulated cAMP accumulation (H3 relaxin acid pEC50 = 9.30 ± 0.082 (n = 3) and H3 relaxin amide pEC50 = 9.25 ± 0.092 (n = 3)). Both peptides were unable to stimulate cAMP accumulation in LGR8-expressing cells at concentrations up to 1 µM (data not shown).
NMR Diffusion MeasurementsIn light of the tendency of members of the insulin superfamily to aggregate, including H2 relaxin and insulin, it was important to establish whether any multimers of H3 relaxin were present under the conditions used for the NMR studies. This was assessed by measuring translational diffusion using pulsed-field gradient experiments with dioxane (hydrodynamic radius of 2.12 Å) as an internal standard. No significant difference in the hydrodynamic radius of H3 relaxin was observed between concentrations of 0.1 and 1 mM (14.2 and 14.4Å,respectively), indicating that there was no concentration-dependent aggregation. On the basis of the equation derived by Wilkins et al. (22) (Rh = (4.75 ± 1.11)N0.29±0.02, where N is the number of residues in the protein and Rh is the hydrodynamic radius in Angstroms), the expected values for a H3 relaxin monomer (51 residues) and a H3 relaxin dimer (102 residues) would be
NMR Assignments and Structure Determination of H3 RelaxinFor the structural analysis of H3 relaxin, extensive two-dimensional NMR spectral data were recorded at 600 and 750 MHz on a sample containing The structural restraints derived from the NMR data and used for structure calculations were based exclusively on data recorded for the native acid form and included interproton distances, backbone and side chain dihedral angles derived from coupling constants, and restraints for hydrogen bonds deduced from amide exchange experiments. Structures were calculated by simulated annealing followed by refinement and energy minimization in explicit solvent (H2O). Distance restraints were derived from intensities of NOE cross-peaks in an NOE correlation spectrum recorded at 298 K with a mixing time of 100 ms. As expected for a 51-amino acid peptide, a number of cross-peaks were ambiguous at the first stage of the assignment due to degeneracy of chemical shifts, in particular in the methyl region. These ambiguities were resolved using an iterative approach in which preliminary structures were used to guide the assignments and by inclusion of ambiguous restraints. Hydrogen bonds were inferred from amide exchange behavior and introduced into the structure calculations once the preliminary structures indicated a suitable acceptor, which in all cases were found to be within elements of secondary structure.
Description of the Three-dimensional Structure of H3 RelaxinA summary of the NMR data, including sequential and medium-range NOEs, coupling constants, hydrogen exchange, and secondary H
Fig. 4 shows a superposition of a family of 20 low energy structures representing the solution structure of H3 relaxin. Overall, the structure is well defined, with the exception of the N terminus of the B-chain, which is disordered and likely flexible in solution. The energetic and structural statistics for the structural family are summarized in Table 1. All structures have good energies and covalent geometry as evident from small deviations from ideal bond lengths and angles and the Ramachandran analysis, which showed that 83% of all non-Gly/Pro residues are in the most favored regions and that the remaining residues, all of which are in the more flexible regions of the peptide, are in the additionally allowed regions. It is clear that H3 relaxin adopts a typical relaxin/insulin fold, with the A-chain containing two -helices (residues A1-A13 and A17-A24) separated by a short -strand (residues A15-A17) and the B-chain containing a second -strand (residues B5-B7) and an -helix (residues B12-B22). The A-chain helices lie parallel to each other, forming a U-shaped arrangement. The B-chain helix is placed across the face of the U, roughly perpendicular to the axes of the A-chain helices. Enclosed between them is a significant hydrophobic core involving the side chains of, LeuA3, LeuA6, IleA20, LeuA23, CysA10, CysA15, LeuB9, CsyB10, PheB14, IleB15, ValB18, IleB19 and TrpB27. The observation that the Trp side chain interacts with the hydrophobic core as evident from a large number of NOEs to CysA24, CysB22, IleB15, ValB18, and IleB19 was surprising because, in the crystal structure of H2 relaxin, this region is disordered, and the Trp extends away from the core, being fully exposed to the solvent. The interaction appears to have become possible because of a shortening of the B-chain helix and a reversal of the peptide backbone due to turns formed by the B23-B26 (GGSR) segment.
Solution Structure of H3 RelaxinIn this study, we have determined the three-dimensional solution structure of H3 relaxin. H3 relaxin is well behaved in solution and adopts an insulin-like fold that is braced by the three disulfide bonds conserved throughout the family and that is characterized by three helical segments and a short double-stranded -sheet. Although the structure is well defined overall, there is evidence for dynamic processes in several regions of the peptide. Most pronounced is the disorder at the N terminus of the B-chain, which is likely due to flexibility in this region. Supporting this suggestion is the observation of cis/trans-isomerization for ProB4, which results in two sets of resonances for the amino acids in this region. A similar isomerization has been observed in insulin (26), but there are no data suggesting that this feature is implicated in the biological function. More interesting is the observation of specific broadening of peaks from residues near the CysA10 -CysA15 and CysA11 -CysB10 disulfide bonds, with the extreme example being CysA10, which is broadened close to the limit of detection, suggesting conformational exchange on a microsecond time scale in this region of the molecule. The most likely explanation for this is a reorientation of the CysA10 -CysA15 disulfide bond conformation as a result of a 1 angle flip of one of the cysteines. Such a process has been observed previously in disulfide-rich peptides, including, for example, bovine pancreatic trypsin inhibitor (27, 28). In the H3 relaxin structural family, equal populations of -60 and -180° 1 conformations of the Cys side chain are observed, supporting the proposal that this disulfide can adopt more than one conformation. Although it is difficult to correlate the two conformations with other differences between the two subpopulations of structures, the proximity of the CysA10-CysA15 disulfide bond to the side chain of PheB14 means that a reorientation of the disulfide could cause a slight shift in the positioning of the aromatic ring. This could result in turn in a large difference in the chemical shift of neighboring protons between the two conformations due to the ring current effects, which would explain the severe broadening observed for the resonances around PheB14, including the CysA10-CysA15 disulfide bond itself.
Amide exchange rates can provide information about the rigidity of a structure in that amide protons involved in strong hydrogen bonds are protected from exchange with the solvent. Among the amides found to be slowly exchanging in H3 relaxin are all amides expected to be involved in hydrogen bonds in the C-terminal
Comparison with the Structures of H2 Relaxin and InsulinStructural studies of members of the insulin superfamily in solution have been limited by the fact that both H2 relaxin and insulin form dimers or higher aggregates in aqueous solution. High resolution crystal structures of the dimer of H2 relaxin (29) and zinc ion-stabilized insulin hexamers (30-32) have been determined and provide a detailed pictured of the molecular core and elements of secondary structure. However, the biologically active forms are the monomers, and the previous structures do not provide a complete picture, as the dimer interface in both cases involves the receptor-binding face, and it is unclear how the formation of the dimer and crystal packing interactions may affect the active site. In particular, there are questions about the conformation of the C-terminal region of the B-chain, which in insulin forms a Fig. 5 shows a comparison of the H3 relaxin NMR structure and the x-ray structures of H2 relaxin and insulin, from which it is clear that the overall fold with the characteristic helical segments (residues A1-A13, A17-A24, and B12-B22) is highly conserved and that the main differences are around the termini of the B-chain. The C-terminal tails of the relaxins are shorter, comprising only five residues in H3 relaxin and six residues in H2 relaxin compared with 11 residues in insulin. Although this prevents the formation of a longer extended segment that can interact both with the B-chain helix and with an additional monomer as in insulin, we have shown here that the tail can still fold back to allow interactions between TrpB27 and the hydrophobic core, mainly IleB19 and ValB18 in H3 relaxin. In contrast, in H2 relaxin, the helix is one turn longer, including three additional helical hydrogen bonds, forcing the tail to extend away from the molecular core. It is interesting to speculate which differences in the primary sequence may be determinants of the conformation of the C-terminal region. The tail is one residue longer in H2 relaxin; but based on the structure of H3 relaxin, an additional residue would not appear to interfere with the fold and is unlikely to prevent such a conformation. The main interaction between the tail and the core in H3 relaxin involves hydrophobic contacts between the core and the TrpB27 side chain; and as this Trp is conserved, similar interactions would be possible in H2 relaxin. One possibility is that these hydrophobic interactions would be destabilized by the substitution of ValB18 in H3 relaxin with GlnB18, which has a more polar side chain. Furthermore, in H2 relaxin, the conformation in which the Trp side chain extends away from the core is stabilized by hydrophobic interactions in a cluster comprising the side chains of TrpB27, MetB24, and MetA2. Neither of these Met residues is conserved in H3 relaxin, which contains Ser and Gly in the equivalent positions; and hence, such an interaction would not be possible. The presence of MetB24 in H2 relaxin would also be expected to have a stabilizing effect on the extended C-terminal helix, in contrast to GlyB24 in H3 relaxin, which, together with its neighboring residues, GlyB23 and SerB25, would be expected to be strongly helix-breaking.
Implications for Receptor BindingThe observation that, although adopting a well defined structure, H3 relaxin is a rather dynamic molecule is interesting, as studies on insulin have indicated that a significant degree of flexibility is a requirement for binding to the insulin receptor. In particular, an insulin mutant in which the fold was stabilized by the addition of a peptide linker between the N terminus of the A-chain and the C terminus of the B-chain was found to be biologically inactive, despite adopting a native fold (35). In contrast, the des-PheB25 and PT insulin mutants have also been studied by NMR and shown to have significantly destabilized structures (34, 36). These conclusions were based on increased amide exchange rates (34); and in the extreme case of PT insulin, the co-solvent trifluoroethanol, known to stabilize helical structures, had to be added to achieve a well defined structure in solution (36). Strikingly, both these molecules have increased biological activity relative to native insulin (33, 37). Based on these observations, a model for insulin binding to its receptor that involves both detachment of the C-terminal part of the B-chain from the rest of the molecule and unfolding of the N-terminal -helix has been proposed. Here, we have shown that, in H3 relaxin, the amide exchange rates are also fast, in particular in the N-terminal helix of the A-chain; and in addition, there is evidence for line broadening as a result of conformational exchange around this region. Although the relaxin and relaxin-3 receptors LGR7 and GPCR135, respectively, are GPCRs and the insulin receptor is a tyrosine kinase receptor, and the interactions are thus likely different, it is possible that flexibility may be needed for structural rearrangement in a similar fashion to bind the receptor. An essential requirement for binding to the relaxin receptor is a conserved RXXXRXX(I/V) motif that is presented on one face of the B-chain helix (38). Both relaxin and relaxin-3 have this motif and have the ability to stimulate the relaxin receptor LGR7, but relaxin-3 has a 50-100-fold lower activity (1, 15). Recent mutational studies on the relaxin receptor LGR7 revealed a binding surface in the pleated sheet region of the ectodomain comprising two crucial Asp-Glu pairs in the right positions for chelating each of the relaxin active-site Arg residues and, in addition, a cluster of hydrophobic residues equally important for relaxin binding assumed to be clustering with the active-site Ile/Val (38). Interestingly, the structure of H3 relaxin reveals that the C terminus of the B-chain interacts with the molecular core, including the important IleB19 of the active site. As shown in Fig. 6, this interaction partly blocks the accessibility of IleB19 and would interfere with the binding to LGR7. This suggests either that a conformational change, like the detachment of the C-terminal tail proposed for insulin, is needed for H3 relaxin to be able to interact with LGR7 or that the interaction may be different, and this can account for the reduced activity of H3 relaxin relative to H2 relaxin in activating LGR7. It is also apparent that PheB20, which is presented on the same face of the B-chain helix, is highly exposed to the solvent. As this residue is fully conserved among the relaxin-3 family members, it is interesting to speculate that it may play an important role in the interaction of relaxin-3 with its receptor GPCR135. An additional important feature of the disposition of the C terminus of the B chain is the interaction of H3 relaxin with LGR8. H2 relaxin has been demonstrated to be a high affinity ligand of LGR8, but H3 relaxin is inactive at this receptor (15). It has been demonstrated previously that the C terminus of the B chain in INSL3 (the endogenous ligand of LGR8), in particular the sequence GGPRW, is essential for the interaction of INSL3 with LGR8 (39). The tryptophan residue is essential for LGR8 activity, and this residue is present in the identical position in H2 relaxin. This tryptophan and the entire motif are highly conserved in H3 relaxin, and it was a surprise when H3 relaxin was demonstrated to have no activity at LGR8. Hence, the interaction of TrpB27 in H3 relaxin with the hydrophobic core of the molecule demonstrated in this study may explain its lack of activity for LGR8. Finally, an unusual but characteristic feature of the relaxin peptide family is that high sequence variability is seen between closely related species, whereas distant species show high homology (5). Furthermore, the relationship between primary sequence of relaxins and their biological activity is puzzling. A number of mutational studies have provided valuable data on the residues crucial for the biological activity of several members of the relaxin family. However, although a motif found to be responsible for receptor binding in one hormone may be conserved in other family members, it does not always confer the same activity. In this study, we have shown that, despite the family's highly conserved overall fold, there are differences in the tertiary structures of relaxins and that these differences, not necessarily caused by sequence differences in the active site, may provide an explanation for such observations.
* This work was supported in part by National Health and Medical Research Council Project Grants 350284 (to J. D. W.) and 300012 (to R. A. D. B. and J. D. W.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 1 Australian Research Council Professorial Fellow. To whom correspondence should be addressed. Tel.: 61-7-3346-2019; Fax: 61-7-3346-2029; E-mail: d.craik{at}imb.uq.edu.au.
2 The abbreviations used are: GPCRs, G-protein-coupled receptors; MALDI-TOF, matrix-assisted laser desorption ionization time-of-flight; HPLC, high pressure liquid chromatography; NOE, nuclear Overhauser effect.
We thank Tania Ferraro and Sharon Layfield (Howard Florey Institute) for technical assistance.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||