|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 280, Issue 32, 29269-29276, August 12, 2005
Crystal Structures of the HIV-1 Inhibitory Cyanobacterial Protein MVL Free and Bound to Man3GlcNAc2STRUCTURAL BASIS FOR SPECIFICITY AND HIGH-AFFINITY BINDING TO THE CORE PENTASACCHARIDE FROM N-LINKED OLIGOMANNOSIDE*![]() ![]() ![]() **
From the
Received for publication, April 27, 2005 , and in revised form, May 25, 2005.
The cyanobacterial protein MVL inhibits HIV-1 envelope-mediated cell fusion at nanomolar concentrations by binding to high mannose N-linked carbohydrate on the surface of the envelope glycoprotein gp120. Although a number of other carbohydrate-binding proteins have been shown to inhibit HIV-1 envelope-mediated cell fusion, the specificity of MVL is unique in that its minimal target comprises the Man (1 6)Man (1 4)GlcNAc (1 4)GlcNAc tetrasaccharide core of oligomannosides. We have solved the crystal structures of MVL free and bound to the pentasaccharide Man3GlcNAc2 at 1.9- and 1.8-Å resolution, respectively. MVL is a homodimer stabilized by an extensive intermolecular interface between monomers. Each monomer contains two structurally homologous domains with high sequence similarity connected by a short five-amino acid residue linker. Intriguingly, a water-filled channel is observed between the two monomers. Residual dipolar coupling measurements indicate that the structure of the MVL dimer in solution is identical to that in the crystal. Man3GlcNAc2 binds to a preformed cleft at the distal end of each domain such that a total of four independent carbohydrate molecules associate with each homodimer. The binding cleft provides shape complementarity, including the presence of a deep hydrophobic hole that accommodates the N-acetyl methyl at the reducing end of the carbohydrate, and specificity arises from 7-8 intermolecular hydrogen bonds. The structures of MVL and the MVL-Man3GlcNAc2 complex further our understanding of the molecular basis of high affinity and specificity in protein-carbohydrate recognition.
HIV1 infection occurs via virus-cell and cell-cell fusion mediated by the two envelope glycoproteins, gp120 and gp41 (1-3). Gp120 binds to CD4 and chemokine receptors, triggering a series of events that leads to the insertion of the fusion peptide of gp41 into the target membrane and subsequent membrane fusion. The surface of gp120 is characterized by an abundance of high mannose N-linked glycosylation sites (4). Post-translational glycosylation of gp120 covers conserved portions of the protein with host-generated carbohydrates that constitute almost half of the molecular mass of gp120 and limit recognition by the immune system, thereby allowing HIV to evade detection (4, 5). The neutralizing antibody 2G12 (6) and the anti-HIV protein cyanovirin (7-10) specifically target high mannose sugars on the surface of gp120. Cyanovirin is a cyanobacterial protein isolated from Nostoc ellipsosporum (11, 12) that inhibits HIV-1 fusion at nanomolar concentrations (7, 11) and is currently undergoing preclinical development as a microbicide, having shown 100% efficacy in a primate model (13). Very recently, another cyanobacterial protein known as MVL, originally isolated from a laboratory culture of Microcystis viridis NIES-102 (14), has also been shown to inhibit HIV-1 fusion at nanomolar concentrations (15). Cyanovirin and MVL are unusual in that they bind oligosaccharides with very high affinity in the absence of multivalent interactions. Cyanovirin specifically recognizes the disaccharide Man (1 2)Man located at the terminal branches of mammalian high mannose oligosaccharides (7-10). In contrast MVL is distinct among oligomannose-binding proteins in that it exhibits fine specificity for the Man (1 6)Man (1 4)GlcNAc (1 4)GlcNAc tetrasaccharide core found in N-linked oligomannosides (15).
MVL is a monodisperse homodimer in solution with four independent carbohydrate binding sites (15). Each subunit comprises two homologous domains, each consisting of 54 residues with 50% sequence identity between them, connected by a five-residue linker. MVL exhibits no significant sequence similarity to any other known protein family. To elucidate the molecular basis of carbohydrate recognition by MVL we have solved the crystal structures of MVL both free and bound to the pentasaccharide Man3GlcNAc2.
Protein Purification and CrystallizationThe expression vector for MVL (15) was transformed into strain BL21(DE3) of Escherichia coli and grown at 37 °C in Luria-Bertani medium. The culture was induced at an A600 nm of 1 with 1.0 mM isopropyl- -D-thiogalactoside for 2-3 h and harvested by centrifugation at 7800 x g for 15 min. The pellet from 1 liter of culture was resuspended in 100 ml of 20 mM NaPO4 (pH 7.0), 50 mM NaCl, and 2 mM EDTA, lysed by three passages through a microfluidizer, and clarified by centrifugation at 10,000 x g for 15 min. The supernatant was subjected to fractionation by ammonium sulfate precipitation with the serial addition of 10 and 15 g of NH4SO4/100 ml of supernatant, incubation on ice for at least 4 h, and centrifugation at 10,000 x g for 15 min. The final pellet (25% w/v NH4SO4) was resuspended in 8 ml of 20 mM NaHPO4 (pH 7.0), 200 mM NaCl, and 2 mM EDTA and loaded onto a Superdex 75 26/60 gel filtration column (Amersham Biosciences) equilibrated with the same buffer. The fractions containing MVL were identified by SDS-PAGE, pooled, diluted 4-fold with 20 mM Tris (pH 7.5), loaded onto a HiLoadTM 10/10 MonoQ anion exchange column (Amersham Biosciences) equilibrated with the same buffer, and eluted with a 0-1 M NaCl gradient. The largest peak containing MVL was pooled and dialyzed against 10 mM Tris (pH 7.5).
Selenomethionine (SeMet)-substituted protein was expressed by transforming the same vector into the B834(DE3) methionine auxotrophic strain and growing the bacteria in M9 minimal medium supplemented with 50 mg/liter SeMet (Sigma-Aldrich). The culture was grown at 37 °C up to an A600 nm Crystals were grown in 0.4 M NH4H2PO4 at 20 °C. For flash-freezing, 40% ethylene glycol was added to the mother liquor. Co-crystallization of MVL and Man3GlcNAc2 (obtained from Glycotec, Rockville, MD) at 1:2.2-3.0 molar ratios yielded suitable crystals under identical conditions. Structure Determination and RefinementMVL and SeMet-MVL crystals of free MVL diffracted to better than 1.9 Å, whereas the MVL-Man3GlcNAc2 co-crystals diffracted to better than 1.8 Å. Data were collected with an in-house x-ray source using a RAXIS IV image plate detector and processed using the HKL2000 software package (16). The structure of free MVL was solved by the single isomorphous replacement and anomalous scattering method (17). 16 of 16 selenium atoms were found, and over 90% of the free MVL molecule was traced using the SOLVE/RESOLVE software (18). The initial model was refined iteratively using the programs O (19) and CNS (crystallography NMR software) (20). The final model of free MVL contains eight MVL monomers (residues 1-113), six (PO4)2-, 13 ethylene glycol, and 886 water molecules. The initial model of the co-crystal was generated by molecular replacement with free MVL using the program AMORE (21, 22). The carbohydrate molecules were traced and the model iteratively refined using O (19) and CNS (20). The final co-crystal model contains eight MVL monomers (residues 1-113), 16 Man3GlcNAc2, 2 (PO4)2-), 26 ethylene glycol, and 643 water molecules. The data collection and refinement statistics are given in Table I. Structure figures were generated with the programs VMD-XPLOR (23), RIBBONS (24), and GRASP (25).
Residual Dipolar CouplingsBackbone NH residual dipolar couplings (26), 1DNH, were measured in solution on a sample of 0.4 mM MVL (dimer) saturated with Man3GlcNAc2 (1.6 mM) in 10 mM sodium phosphate, pH 6.5, by taking the difference in 1JNH couplings between liquid crystalline (5% polyethylene glycol/hexanol with a surfactant to alcohol ratio of 0.96 (27)) and isotropic (water) media. 1JNH couplings were measured using two-dimensional in-phase/antiphase {15N,1H} heteronuclear single quantum coherence experiments at a 1H frequency of 600 MHz on a Bruker DMX600 NMR spectrometer. Best-fitting of the measured 1DNH couplings to the crystal structures of free and complexed MVL was carried out by singular value decomposition using the program SSIA (28). The agreement between observed and calculated 1DNH couplings is expressed as a dipolar coupling R-factor, which scales between 0 and 100%, and is defined as the ratio of the r.m.s. deviation between observed and calculated values to the expected r.m.s. deviation if all of the N-H vectors were randomly distributed (29). The latter is given by , where Da is the magnitude of the axial component of the alignment tensor and the rhombicity (29). The values of DaNH and obtained by singular value decomposition are 7.4 Hz and 0.63, respectively, when best-fitting the dipolar couplings to the coordinates of the free monomer; 7.5 Hz and 0.61, respectively, for the free dimer; 7.2 Hz and 0.63, respectively, for the complexed monomer; and 7.5 Hz and 0.62, respectively, for the complexed dimer.
Crystal Structure of MVLNative MVL crystallized in space group P21 and diffracted to 1.9 Å. A SeMet derivative of MVL crystallized under identical conditions in the same space group and diffracted to 1.85 Å. The initial phases were determined by single isomorphous replacement and anomalous scattering, and the structure was refined with data to a final resolution of 1.9 Å. The final structure was well defined with an R-factor of 17.7% and a free R of 21.6% (Table I). A total of four homodimers of MVL were present in the asymmetric unit, leading to eight unique copies of the MVL monomer in the final structure.
MVL is a 13-kDa polypeptide of 113 amino acids that contains two highly homologous 54-residue domains with 50% sequence identity between them (15). In the following descriptions the secondary structure elements and residue numbers are given for the N-terminal domain (residues 1-54), whereas the equivalent elements and residues in the C-terminal domain (residues 60-113) are provided in parentheses. Each domain contains a three-stranded anti-parallel
MVL forms a homodimer such that the boomerang-shaped monomers interlock, and each domain from one monomer contacts both domains from the second monomer (Fig. 1, B and C). The 1 and 2 helices of the first monomer contact and lie approximately orthogonal to the 2 and 1 helices, respectively, of the second monomer, whereas the first strand of each -sheet ( 1 or 4) contacts the same strand in the second monomer. The two monomers interact extensively resulting in 3930 Å2 of buried accessible surface area with 70% of the buried surface involving nonpolar atoms. The gap volume index (defined as the ratio of gap volume to interface buried accessible surface area) for the interacting surfaces is 1.28, indicative of a tight, complementary interface (30). Three intermolecular hydrogen bonds stabilize the dimer including those between the N- 2 atom of His-29 of one monomer and the hydroxyl group of Ser-87' of the other monomer, the side-chain hydroxyls of Thr-64 and Thr-64', and the hydroxyl group of Thr-64 and the backbone carbonyl oxygen of Asp-65'. At least eight (4 x 2) water molecules form hydrogen bonds that bridge the interface between the two monomers (Fig. 1B); a single water molecule bridges the backbone carbonyl oxygens of Ala-9 and Pro-11 and the side chain of Gln-83'. A similar network is formed between the backbone carbonyl oxygens of Pro-70 and Lys-24' and the side chain of Glu-76, between the backbone carbonyl and amide groups of Asn-6 and the same atoms of Asn-6', and between the backbone carbonyl oxygens of Lys-24 and Ala-68'. Residual dipolar couplings provide long range orientational information in solution, thereby providing a simple and accurate means of ascertaining whether the structure in the crystal reflects that in solution (26). The relative orientations of the N- and C-domains within the monomer and of the two monomers within the dimer are the same in solution and in the crystal state, as evidenced by the excellent agreement between the measured backbone N-H residual dipolar couplings recorded on the MVL-Man3GlcNAc2 complex in a diluted liquid crystalline medium (of polyethylene glycol/hexanol) and those calculated from the crystal coordinates of both free and carbohydrate-bound MVL. The values obtained for the dipolar coupling R-factors (29) are 16.6% for the monomer and 17.3% for the dimer using the coordinates of free MVL, and 16.9% for the monomer and 17.9% for the dimer using the coordinates of MVL complexed to Man3GlcNAc2. These values for the dipolar coupling R-factors are within the range expected for 1.5-2-Å resolution crystal structures (26, 31), indicating that the structures in solution and in the crystal are the same.
Intriguingly, despite the tight intermonomer interaction, water molecules fill a channel that extends from the surface adjacent to the linker region of one monomer to the surface adjacent to the linker region of the second monomer (Fig. 1D). This channel also contains ethylene glycol molecules from the cryopreservative solution such that the N-
A search of the Protein Data Bank using the program DALI (32) reveals that the topology of the individual domains of MVL (a triple-stranded Crystal Structure of MVL-Man3GlcNAc2MVL saturated with Man3GlcNAc2 (1:2.2-3.0 molar ratio) co-crystallized under the same conditions as free MVL. The complex formed crystals in space group P1 that diffracted to 1.8 Å. The initial model was generated by molecular replacement with the structure of free MVL and refined using data to a final resolution of 1.8 Å. Similar to the free MVL crystals, the asymmetric unit contains four MVL dimers. Clear electron density (Fig. 1E) demonstrates the presence of four Man3GlcNAc2 ligands bound to each MVL dimer (for a total of 16 ligands in the asymmetric unit). The final model was well defined by the data with an R-factor of 19.2% and a free R of 22.1% (Table I). The presence of eight unique MVL monomers per asymmetric unit that were refined independently (without non-crystallographic symmetry restraints) provides an opportunity to assess the conformational variability of MVL within the crystal environment (Fig. 2). Interestingly, the variation between monomers is smaller for MVL in the complex than for free MVL; the average r.m.s. deviation to the mean coordinate positions for free MVL is 0.19 Å for backbone atoms and 0.41 Å for all heavy atoms, and the corresponding values for the MVL-Man3GlcNAc2 complex are 0.12 and 0.31 Å, respectively (and 0.32 Å for the carbohydrate heavy atoms). This observation suggests that binding of Man3GlcNAc2 restricts the conformational variability of the MVL protein even though the free structure is already well ordered. In terms of side chains, this phenomenon is particularly evident for Trp-72 in the C-domain and to a lesser extent for the equivalent residue Trp-13 in the N-domain, which both exhibit a sizeable spread among the eight molecules in free MVL but are fixed in a narrow region of conformational space in the complex (Fig. 2). With one exception, MVL undergoes no significant conformational change upon binding carbohydrate, as evidenced by the observation that the r.m.s. difference between the average coordinates of free and complexed MVL (0.09 Å for backbone atoms and 0.18 Å for all atoms) is smaller than the values for the average r.m.s. deviations to the mean coordinates among the dimers from either structure given above. One residue located in the carbohydrate binding pocket, Thr-38, however, does change its rotameric state from g- in the free state (for seven of the eight molecules) to t in the complex (Fig. 2A), thereby permitting the hydroxyl group of Thr-38 to participate in an intermolecular hydrogen bond with the bound carbohydrate (see below).
Structural Basis for Carbohydrate Binding and SpecificityThe bend in the
The reducing GlcNAc residue of Man3GlcNAc2 fits into the deepest portion of the binding cleft, and the carbohydrate chain extends along and molds to the surface of MVL, such that the pentasaccharide buries
The glycosidic bond torsion angles of Man3GlcNAc2 bound to the N- and C-domains of MVL are all located within the preferred conformations observed in crystal structures of isolated oligosaccharides, glycoproteins, and glycan-binding proteins (33). The average values for the glycosidic
A key component of carbohydrate specificity can be attributed to interactions with the reducing GlcNAc1 unit. The acetyl methyl group from GlcNAc1 fits into a deep hole on the surface such that it wedges between and is in van der Waals contact with the side chains of Pro-11 (Pro-70) and Trp-13 (Trp-72) and the methyl groups of Leu-12 (Leu-71) and Thr-39 (Thr-98) (Fig. 3B), whereas the acetyl oxygen atom is hydrogen-bonded to the backbone amide of Ser-43 (Gly-102) (Fig. 3C). The acetyl NH group of GlcNAc1 is hydrogen-bonded to the backbone oxygen atom of Leu-12 (Leu-71), and the O-3 atom of the pyranose ring is hydrogen-bonded to the side-chain hydroxyl group of Thr-39 (Thr-98). The reducing hydroxyl group (O-1) from GlcNAc1 protrudes from the binding site and remains solvent-accessible such that an N-linked Asn would not disrupt binding. The acetyl oxygen and O-6 atoms of the subsequent GlcNAc2 unit are hydrogen-bonded to the backbone amide groups of Asn-15 (Asn-74) and Thr-39 (Thr-98), respectively. In addition the pyranose ring of GlcNAc2 is stacked on top of the benzyl ring of Trp-37 (Trp-96), which lines the base of the binding site. Formation of these hydrogen bonds together with structural complementarity for the N-acetyl groups explains the selectivity for (1 4)-linked GlcNAc present in the Man3GlcNAc2 pentasaccharide.
The binding site also selects for a
Correlation to Binding AffinitiesAs previously noted (15), binding of oligomannosides to MVL is associated with large negative enthalpies that increase with the size of the carbohydrate. An increasingly negative entropy change opposes the favorable enthalpy change, yet does not fully compensate, so that longer oligomannosides bind with increasing affinity (KD of 0.2 µM for Man6GlcNAc2 versus 2.9 µM for Man3GlcNAc2). These observations are consistent with the extensive hydrogen-bonding network discussed above. Binding of the mannose chain creates enthalpically favorable hydrogen bonds but restricts conformational freedom of the carbohydrate residues. Modeling the appropriate mannose residues on the structure of the MVL-Man3GlcNAc2 complex to form Man6GlcNAc2 using glycosidic bond torsion angles in the preferred regions of conformational space (33) and avoiding steric clash suggests the presence of the following additional interactions involving ManB, (1 6)-linked to Man4', and ManC, (1 2)-linked to Man4. ManB (with the , , and glycosidic torsion angles in the g+, t, and g+ conformations, respectively) could form an additional hydrogen bond with the side chain of Gln-19 (Gln-78); ManC (with the and glycosidic torsion angles in the g+ and t conformations, respectively) could form hydrogen bonds with the side chains of Glu-49 (Gln-108) and possibly Asn-6 (Asp-65). These hydrogen bonds would account in part for the -7.7 kcal/mol additional enthalpy change associated with the binding of Man6GlcNAc2 compared with Man3GlcNAc2 (15). Further extension of the branches to Man9GlcNAc2 yields no additional protein-carbohydrate contacts accounting for the observation that the affinities of Man6GlcNAc2 and Man9GlcNAc2 are very similar (15). Hence, the structure reported here accounts for the affinity and selectivity of MVL for oligomannosides.
Inhibition of HIV-1HIV infects cells through specific recognition of CD4 and chemokine (CCR5 or CXCR4) cell surface receptors by the viral glycoprotein gp120 (1-3). Gp120 is extensively glycosylated by high mannose N-linked carbohydrates, which subvert immune recognition by covering the more highly conserved regions of gp120 with oligosaccharides generated by the host (4, 5); hence current research efforts are being directed toward identifying molecules that can bind high mannose oligosaccharides and prevent cellular infection by HIV.
A number of carbohydrate-binding proteins known to bind high mannose oligosaccharides and therefore HIV-1 envelope have been identified and characterized. As recently reviewed by Botos and Wlodawer (34), these include the Ca2+-dependent C-type lectins involved in innate immunity such as mannose-binding protein and the dendritic cell surface receptor DC-SIGN, a variety of plant lectins, as well as several metal-independent carbohydrate-binding proteins that do not fall into established lectin families. Most of these proteins bind their targets with relatively low affinity and typically show specificity for either mannosides (such as mannotriose) or GlcNAc2, structures that are present in N-linked oligosaccharides abundant on gp120 (35). MVL is unique in this regard in that it specifically binds to the Man
Concluding RemarksIn this study we have presented the crystal structures of the cyanobacterial protein MVL, free and in complex with the core pentasaccharide Man3GlcNAc2 at 1.9- and 1.8-Å resolution, respectively. The details of this structure show extensive interaction between two monomers of MVL forming a highly stable, symmetric homodimer with four carbohydrate binding sites. Each binding site establishes a complementary pocket that is essentially preformed to bind the Man
The crystal structure of MVL described here is only the second structure of a cyanobacterial carbohydrate-binding protein solved to date. Moreover, it is the second cyanobacterial protein structure shown to contain two carbohydrate binding sites within a single polypeptide chain, the first being cyanovirin-N, which recognizes terminal Man
The atomic coordinates and structure factors (codes 1ZHQ and 1ZHS) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by the Intramural AIDS Targeted Antiviral Program of the Office of the Director of the National Institutes of Health (to C. A. B. and G. M. C.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. || To whom correspondence may be addressed: Laboratory of Bioorganic Chemistry, Building 8, Room 1A-02, NIDDK, National Institutes of Health, Bethesda, MD 20892-0820. Tel.: 301-594-5187; E-mail: caroleb{at}intra.niddk.nih.gov. ** To whom correspondence may be addressed: Laboratory of Chemical Physics, Building 5, Rm. B1-30I, NIDDK, National Institutes of Health, Bethesda, MD 20892-0520. Tel.: 301-496-0782; Fax: 301-496-0825; E-mail: mariusc{at}intra.niddk.nih.gov.
1 The abbreviations used are: HIV, human immunodeficiency virus; r.m.s., root mean square; SeMet, selenomethionine.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||