|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 279, Issue 29, 30523-30530, July 16, 2004
Characterization of the Conformational State and Flexibility of HIV-1 Glycoprotein gp120 Core Domain*![]() ![]() ![]() ![]() ¶||
From the
Received for publication, April 20, 2004 , and in revised form, May 6, 2004.
gp120 is key to the human immunodeficiency virus type 1 viral cell entry. Knowledge of the detailed conformational states of gp120 is crucial to intervention, yet the unbound form is still resistant to structural characterization probably because of its flexibility. Toward this goal, we performed molecular dynamics simulations on the wild type gp120 core domain extracted from its ternary crystal structure and on a modeled mutant, S375W, that experimentally has a significantly different phenotype from the wild type. Although the mutant retained a bound-like conformation, the wild type drifted to a different conformational state. The wild type strands 2 and 3 of the bridging sheet were very mobile and partially unfolded, and the organization among the inner and outer domains and strands 20 and 21 of the bridging sheet, near the mutation site, was more open than in the bound form, although the overall structure was maintained. These differences were apparently a result of the strengthening of the hydrophobic core in the mutant. This stabilization further explains the experimentally significantly different thermodynamic properties between the wild type and the mutant. Taken together, our results suggest that the free form, although different from the bound state, shares many of the bound structural features. The observed loss of freedom near the binding site, rather than the previously hypothesized more dramatic conformational transition from the unbound to the bound state, appears to be the major contributor to the large entropy cost for the CD4 binding to the wild type.
The human immunodeficiency virus type 1 virus continues to be the main cause for AIDS infection affecting millions of people every year. The persistence of the infection is believed to be related to the "wise" evolution of the structure of the envelope glycoprotein gp120 on the virion surface that can efficiently evade the immune system. Contributing elements include the heavy glycosylation on the surface (1), residue variation (2, 3), oligomerization (4, 5), and conformational alterations (6, 7). The structure of the gp120 core domain revealed in a co-crystal with CD4 and a Fab from the neutralizing antibody 17b (6), along with information derived from sequence alignments (8, 9) of different primary immunodeficiency viruses, shows a conserved core domain (see Fig. 1) with five long variable loops V1V5 interspersed over the surface of the core domain. The molecule has been shown to be able to adopt multiple conformations along the pathway of the infection. Upon binding to CD4, the principal receptor in the course of the infection (10, 11), variable loops V1 and V2 move out of the way allowing transient exposure of the otherwise shielded conserved residues (12). CD4 binding then induces a conformational change that poses the necessary epitopes for its co-receptor association, usually CCR5 (6, 1315), another major receptor of gp120 (1620). Analysis of the crystal structure revealed a peculiar three-dimensional organization within the core domain with a deeply recessed CD4 binding pocket shaped by the inner domain, the outer domain, and the bridging sheet. As Kwong et al. have argued (6), the hydrophobic core that holds together the three subdomains and sits at the bottom of the recessed pocket might be sensitive to a change in the environment because of the unique organization among the interacting residues. The flexibility of the association between the inner and outer domains is thought to be responsible for the low immunogenicity due to the spread of the epitope residues over separate regions (21). This highly flexible nature of the gp120 has necessarily made it difficult to characterize the detailed conformational features of the free form. Yet, such a characterization of the gp120 would be extremely valuable to the understanding of its biology and intervention, particularly the inhibition of the gp120-CD4 association, which is of primary interest yet has met only limited success (2224).
Despite the lack of direct structural determination, a large body of experimental work relating to the CD4-gp120 association, including thermodynamics (25), mutagenesis (2632), and crystallography (21, 3336), has provided considerable insight into the gp120 structure and flexibility. Of particular note is the thermodynamics data (see Table I) (25), which revealed exceptionally large enthalpy and entropy changes, suggesting that the unbound form of the gp120 may be very flexible and significantly different from the CD4-bound state. This was further supported by the fact that the CD4 molecule shows little conformational change upon ligand binding (6, 3335), and multiple binding partners for gp120 consistently indicated similar thermodynamic behavior (7). The similar thermodynamic behavior for both the full protein and the core domain of gp120 (see Table I) going from the free-form to the CD4-complexed state, confirms that it is the core domain that is largely responsible for the observed enthalpy and entropy changes. Accordingly, the unusual entropic cost has been interpreted to be the outcome of a large interdomain flexibility, including a loose contact between the inner and outer domains and the unfolded bridging sheet in the unbound state (6, 21).
Interestingly, a mutant, S375W, designed to fill the space at the bottom of the CD4 binding pocket that is not occupied by the CD4, to alter the stability of gp120 and thus the nature for CD4 binding, displayed very different phenotypes than the wild type (7). Both the enthalpy change and the entropy penalty were significantly reduced for the CD4 binding (Table I). Furthermore, the S375W mutant binds CD4 tighter than the wild type, whereas the affinity to antibodies elicited against the CD4 binding site (CD4BS)1 is almost totally abolished (7) rendering the possible changes in the organizations of the epitopes and the flexibility of the structure. The underlying factors that contribute to the different phenotypes are expected to reside in the structural and flexibility differences between the wild type and the mutant in free and bound forms.
Prompted by the available crystal structure of the wild type core domain in complex with CD4 and an antibody (6), we aimed to characterize the conformations of the gp120 core domain in its free form. We are interested in its structural flexibility, the difference between the free and bound forms, and the effect of the mutation. Toward this goal, we performed molecular dynamics simulations on both the monomers and the CD4-bound complexes for the wild type and the S375W mutant. The detailed structural information from simulations allow us to address several important issues, such as whether there are multiple conformational states in the free form, what the conformational changes are, and how large they are, and how the constellation of the epitope residues changes upon mutation so as to significantly alter the neutralizing interactions. Answers to these questions can not only shed light on the structural biology of the gp120 but further assist in the identification of specific conformations for efficient drug design. Despite the extensive loop truncations in the crystal structure, its biological functions are largely intact, and we expect that information derived from such a reduced model should allow us to capture the structural properties of the core domain. Our results show that whereas the S375W mutant sampled a conformational space that is very close to the CD4-bound state, the wild type preferred a conformation with a very flexible bridging sheet and a relaxed association between the inner and outer domains. Although the findings here are in very good agreement with experimental data in general they further suggest that only the bridging sheet, particularly the
All structural manipulations and molecular dynamics simulations were performed using the CHARMM program (37) with the CHARMM 22 force field (38). The starting structures of the gp120 monomers and the gp120-CD4 complex were extracted from the ternary crystal structure (PDB 1g9m [PDB] ) (6). Those with the S375W mutations were modeled from the wild type crystal structure using CHARMM. The missing loop 4 was also modeled in by first building the corresponding 14-residue peptide (the central 12 residues were missing) and subjecting it to molecular dynamics simulation at a temperature of 300 K in the gas phase with the distance between the C atoms of the terminal residues restrained to the desired target, which corresponds to 24.3 Å in the crystal structure. The resulting conformations with these matched distances were selected and integrated into the core domain by matching the terminal residues of the C of the peptide with the corresponding atoms in the protein. A structure with no van der Waals overlap between loop 4 and the core domain was selected and minimized for 500 steps of the steepest descent algorithm. For the monomer simulations, the gp120 core domain was first solvated in a TIP3P (39) water box with dimensions of 88 x 85 x 71 Å3, with a minimum distance of 10 Å from any edge of the box to any protein atom, resulting in a system size of 52,000 atoms. Minimizations were first performed for 500 steps with the steepest decent algorithm with the gp120 constrained and for additional 500 steps for the whole system. The systems were further pre-equilibrated for 20 ps with the NVT ensemble before the production simulations, which lasted for 3 ns each, with the NPT ensemble at 300 or 325 K. A time step of 2 fs and a force-switched nonbonded cutoff of 12 Å were used in the trajectory production, and structures were saved every 2 ps for analysis. For the complex simulations, the EEF1 implicit solvent model was used instead, and a total of 4-ns duration of the trajectories was generated with a time step of 1 fs and a group-based electrostatic calculation.
Structural Deviation from the Bound StateThe monomer core domain consists of the inner domain, outer domain, and the bridging sheet with its -strands 20 and 21 folded against the inner-outer domain interface and -strands 2 and 3 protruding out (Fig. 1). Such a unique monomer organization taken from the ternary complex structure was expected to undergo conformational changes to restore its unbound free form. Tables II and III summarize the core domain RMSD from all four simulations for the monomers. Over the 3-ns period at 300 K, the structural change was moderate with a C RMSD for the inner and the outer domains together (Table II) of 1.9 and 1.7 Å for the wild type and the mutant, respectively. When a portion of the bridging sheet, 20,21, was added, the RMSD was essentially unchanged (Table II). However, when the whole domain was included, the RMSD climbed to 2.6 Å for the wild type and only slightly changed for the mutant (Table II), indicating a larger movement of the 2,3 motif in the wild type than in the mutant. The RMSDs, however, did not show dramatic structural changes within each subdomain (inner and outer domains, 20,21 and 2,3). The results from the 325 K simulations were very similar to those from the 300 K simulations except that the difference between the wild type and the mutant was much smaller.
The difference in the structural deviations of the bridging sheet between the wild type and the mutant was further highlighted when the structures were aligned against the inner and outer domains (Table III). At both 300 and 325 K, the RMSD difference between the wild type and the mutant is much larger for the 2,3 and 20,21, whereas it is similar for the inner and the outer domains. These results indicate that the S375W mutation appears to stabilize the bridging sheet from drifting away from the inner and outer domains, which in turn shows that the bridging sheet is the most mobile region of the core domain for the wild type.
Analysis of Interdomain MotionsAbove, our results have indicated that the bridging sheet, especially the
The distance between residues 231 and 360 fluctuated between 39.5 and 40.5 Å for the wild type at 300 K (Fig. 2A). Compared with that in the crystal structure (39.5 Å), the expansion was very mild considering the thermal effect on the structure. The mutant behaved similarly. To increase the sampled conformational space, simulations were also performed at elevated temperature (325 K). However, no significant difference was observed for either the wild type or the mutant (Fig. 2A). Thus, it seems that the motions between the domains were quite limited by this measure. The distance between residues 112 and 375 also increased for the wild type at 300 K (Fig. 2B). Starting from 9.4 Å in the bound state, it fluctuated between 10 and 12 Å after 1 ns of the simulation. Such a magnitude of fluctuation is more significant compared with the fluctuation for the distance between residues 231 and 360, indicating that this part of the structure might be more flexible and relatively more open in the free form. Interestingly, this distance for the mutant did not change as much and was consistently shorter than in the wild type (Fig. 2B). Such a difference between the wild type and the mutant was more significant when measured at 325 K (Fig. 2B). These results indicate that the effect of the S375W mutation was the stabilization of the bound-state association between the inner domain and the outer domain, consistent with the assertion based on the experimental observations (7).
To examine the conformational changes of the gp120 observed by molecular dynamics from a different perspective, we performed a normal mode analysis on both the monomer and the gp120-CD4 dimer with a method developed by Keskin et al. (40). This normal mode analysis method uses a coarse-grained model of the system and focuses on the lowest frequency modes. Results from the monomer analysis show that two of the three lowest frequency modes correspond to the
S375W Mutation Reduces the Flexibility of gp120The S375W mutation was designed to fill the partially occupied CD4 binding pocket (7) so that the stability or the conformation of gp120 would be altered. Because the mutation site is at the bottom of the CD4 binding pocket, and the pocket is surrounded by several hydrophobic/aromatic residues, this mutation was likely to perturb these interactions in its vicinity. Fig. 4 shows the constellations of the hydrophobic and aromatic residues near the mutation site in the average structures from the 300 K simulations (Fig. 4, A and B) and snapshots from 325 K simulations (Fig. 4, C and D). Compared with the wild type, the hydrophobic residues near the mutation site in the mutant shifted slightly toward each other within the pocket, resulting in a better packed hydrophobic cluster because of the presence of Trp-375. As a consequence, the distance between residues 112 and 375 was shortened (Figs. 2 and 4). Such a change in the constellation of the hydrophobic residues might have an effect on the stability of this region. To test whether the stability was affected, residuewise root mean square fluctuations were compared between the wild type and the mutant (Fig. 5). Overall, residues that were nearby the hydrophobic cluster displayed lower fluctuations in the mutant than in the wild type (Fig. 5, B and C), whereas residues far from this site either became more flexible or stayed the same in the mutant. Interestingly, 2,3 of the bridging sheet was also stabilized, possibly because of the lower mobility of the helix 1 or of the 20,21 of the bridging sheet. Other structural effects resulting from the stabilization include the better retained helical structure of helix 5, especially at the C-terminal (data not shown).
Of particular note are the residues that were most stabilized upon mutation (Fig. 5, B and C in blue) that were also within 4.5 Å of CD4 in the complex (Fig. 5D). Residues within 4.5 Å of CD4 include 124 (P), 126 (C), 196 (C), 279281 (DNA), 283 (T), 365368 (SGGD), 370371 (EI), 425430 (NMWQKV), 455460 (TRDGGN), 469 (R), 472475 (GGDM), and 477 (D) (Fig. 5D). Most of these residues are from the outer domain and the bridging sheet except for Met-475 and Asp-477. Only three residues from 2,3 of the bridging sheet are within 4.5 Å of CD4, two of which are Cys-126 and -196, which form the disulfide bond (1) and the other is Pro-124, which may also be important in maintaining the structure. 20,21 of the bridging sheet contains the longest continuous sequence of residues (NMWQKV) that were in contact with CD4. The only residue (Val-430) that made hydrophobic contact with CD4 also resides in this part of the bridging sheet. The rest of the residues were mostly charged or polar. Interestingly, three segments are rich in Gly (Fig. 5D, red). For each of the Gly-rich segments, there is an Asp immediately next to the GG motif. Because the interaction between gp120 and CD4 is quite polar with mostly polar and charged residues within the 4.5 Å (6), these Asp residues are important for the interactions. Changes in Asp-368 and Glu-370, for example, will disrupt interactions with many CD4BS antibodies (41). However, to interact favorably with CD4, these residues need to be able to adjust their conformations. The Glys next to them make it possible. The stabilization of these segments upon mutation, however, will hinder the ability of these motifs to adjust their orientations with respect to CD4, resulting in the reduced enthalpy change as well as the entropy cost. Change of Solvent-accessible Surface Areas upon Mutation For the antibodies to bind, the protein needs to assume certain conformations that allow the access to the specific epitopes. The mutagenesis experiment shows that the S375W mutant binds CD4BS antibodies much weaker than the wild type and binds CD4 6-fold better (7). Thus, it is interesting to see whether the effect of the mutation is reflected in the solvent-accessible surface areas of the epitope residues. We calculated the solvent-accessible surface area difference between the wild type and the mutant for both the monomer and the gp120-CD4 complex (Fig. 6, AC). For a direct comparison, the epitope residues for the CD4BS antibodies were mapped onto the structure (21) (Fig. 6D). Interestingly, we observed that most epitope residues for the CD4BS antibody binding became less accessible upon mutation in both the monomer and the complex simulations, with more dramatic changes in the complex simulations (Fig. 6). Mutagenic analysis indicates that many residues important for the binding of the CD4BS antibodies were not exposed on the surface of the CD4-bound gp120 (21). This result shows that part of the mutational effect on antibody binding is the change of the accessibility of the epitope residues. Furthermore, the more dramatic effect on the surface accessibility of these epitope residues when in the complex form, indicates that CD4 binding is also partially responsible for the conformational change that reduced the accessibility of the CD4BS antibody epitopes. Because the mutation did not affect the binding affinity of CD4, this again confirms that CD4 might have interacted with a different set of residues or with different configurations of the residues.
The Source of Enthalpy and Entropy DifferencesThe interesting part of the binding free energy of gp120 and CD4 is the significant difference for the wild type and the mutant (Table I). The entropy cost for the mutant is only about half that of the wild type and the enthalpy gain is 17 kcal/mol less favorable for the mutant. In addition to the large surface area burial and the conformational changes such as the folding of the bridging sheet as suggested earlier (6, 33), the reduction of the flexibility of residues near the mutation site was also responsible for the entropy changes. To compare the enthalpy changes, we calculated the interaction energy between gp120 and CD4, which is roughly proportional to the enthalpy change upon CD4 binding. Surprisingly, the calculated interaction energy was much smaller for the mutant than for the wild type after 1 ns of equilibration (Fig. 7A). The averages for the last 2 ns from the trajectories are -217 ± 16 kcal/mol and -175 ± 11 kcal/mol for the wild type and the mutant, respectively, which is 40 kcal/mol more favorable for the wild type. Thus, our calculated free energy profile was qualitatively in very good agreement with the experimental result.
The calculated difference in interaction energies between the wild type and the mutant is very significant, because the only difference between the wild type and the mutant is the S375W mutation. Further, the mutated residue was not in contact distance with CD4 in either the wild type or the mutant and therefore did not directly contribute to the interaction energy. Therefore, its impact must be on the conformations. A simple measure of the distances between the C of residue Phe-43 of CD4, a residue that pointed to the hydrophobic pocket of the CD4 binding site, and the C of residue 375 of gp120 shows that this distance is quite stable in the mutant, but it fluctuated much more in the wild type (Fig. 7B). The interpretation of this result is that the low flexibility of the mutant at the interface of the inner domain, outer domain, and the bridging sheet, because of the stabilization effect, imposed some rigidity to the molecule. Because the CD4 molecule is known to be quite rigid, the low flexibility of the gp120 conformation will limit the freedom of the binding partners to adjust their conformations to yield optimal interactions, resulting in the less favorable interaction energy. For the same reason, the entropy cost is largely reduced.
Unbound States of gp120 and Its FlexibilityOur simulation results have shown that the wild type and the mutant preferred different conformations. In particular, the mutant assumed a conformation that is more similar to the CD4-bound structure, whereas the wild type is relatively more open with respect to the inner and outer domains, with part of the bridging sheet ( 2,3) partially unfolded. Such an observation is consistent with the previous speculations about the conformations of gp120 in its free form (7).
However, the putative open-state conformation observed in our simulations appears to be not as different from the bound state as was suggested previously (6). For example, it was proposed that the whole bridging sheet might be in an unfolded state in its free form, and the inner and outer domains may have larger flexibility with respect to each other (7). Our results show that the Conformational Features Versus Antibody RecognitionMutagenesis studies show that structural perturbation of the bridging sheet affected the binding of gp120 to the CD4, CCR5, and the CD4i antibodies but not to the CD4BS antibodies (7). This indicates that the CD4BS antibodies recognize conformations of gp120 that are different from those recognized by the CD4 and CD4i antibodies. Consistently, most of the CD4BS epitopes have been mapped onto the inner and outer domain (21). The fact that the S375W mutant did not bind to any of the CD4BS antibodies indicates a conformational alteration in the inner and outer domain region. Our results suggest that these antibodies may prefer a conformation that is more open than that of the crystal structure particularly at the interface of the inner domain, outer domain, and the bridging sheet, because the mutant had a more contracted inner domain to outer domain distance. It is known that the antibodies are usually larger in size than CD4. Thus, the narrowed binding pocket induced by the S375W mutation explains perfectly the low binding affinity of this mutant to the CD4BS antibodies which were elicited against the potentially more open free form of gp120. Implications for Drug DesignAnti-AIDS drugs are still in great need. However, there has been only limited success in the search for potent anti-AIDS agents even with the available structure of the gp120 in a complexed form. One of the reasons is that the prevailing conformation of gp120 in its free form may be very different from the CD4-bound state. Thus, targeting the bound-state structure that is not well populated is less likely to succeed. In addition, chemical agents targeting the CD4-bound form may act in a way similar to CD4 and thus possibly even trigger conformational changes needed for the chemokine receptor binding and consequently enhance the infection. It is known that several CD4 binding inhibitors are actually infection enhancers, and CD4 binding inhibitors are often known to be CCR5-binding activators (42). Thus, targeting the open form would be more efficient and possibly can avoid the CD4-binding inhibitor-infection-enhancing dilemma. With no available experimental structure of the gp120 in free form, a molecular-dynamics-generated open form structure provides another venue for the search for potent gp120 inhibitors. Encouragingly, our results suggest that the free form of gp120, although flexible as is true for many other proteins, assumes a well defined three-dimensional conformation that is targetable relative to the previously believed very mobile structure. In addition, multiple conformations of the backbone and side chains can be easily explored in this regard. Thus, with the use of our simulated models and the consideration of structural flexibility and multiple conformations, more potent and specific inhibitors can be expected from such structure-based efforts.
ConclusionsWe have characterized the conformational features of the gp120 core domain in its free form via molecular dynamics simulations. Although the overall free-state structure is very similar to the bound-state, two specific regions have been identified to be conformationally different. First, when compared with the crystal CD4-bound form, Hence, our results may further suggest the reason that drugs designed toward the gp120-bound state have had limited success. The conformational features of the free-state gp120 model obtained in our simulations should therefore be of great value in efforts toward new inhibitor development.
* This work was supported in whole or in part with federal funds from the NCI, National Institutes of Health, under Contract Number NO1-CO-12400. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. || To whom correspondence should be addressed: NCI-Frederick, Bldg. 469, Rm. 151, Frederick, MD 21702. Tel.: 301-846-5579; Fax: 301-846-5598; E-mail: ruthn{at}ncifcrf.gov.
1 The abbreviations used are: CD4BS, CD4 binding site; RMSD, root mean square distance.
We thank Drs. D. Zanuy, K. Gunasekaran, Chung-Jung Tsai, and Hui-Hsu (Gavin) Tsai for many useful comments and suggestions.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||