|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 280, Issue 12, 11369-11378, March 25, 2005
Folding Regulates Autoprocessing of HIV-1 Protease Precursor*
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Pettit et al. (20) have recently shown, by co-expressing equivalent amounts of substituted Gag-Pol constructs, that the initial cleavage of the HIV-1 Gag-Pol precursor is intramolecular. Moreover, they showed that competitive active site inhibition by the drug retonavir was 10,000-fold less for the protease embedded in the precursor than for the mature free protease (20). Earlier, kinetic studies on the model precursor system MBP-
TF-Protease-
RT showed that the protease maturation takes place in two steps. (
TF and
RT are short native sequences from the transframe protein and the reverse transcriptase, respectively. MBP stands for maltose-binding protein of Escherichia coli containing two native cleavage sites, p6pol/PR at the N terminus and PR/RT at the C terminus.) The first step involves an intramolecular cleavage of the N terminus that is followed by intermolecular cleavage of the C terminus (19, 21). A relatively low Km for peptide substrates representing the p6*-PR (where p6* is TFP+p6pol) cleavage site, compared with that for oligopeptides corresponding to other Gag or Pol cleavage sites (23) supports the view that the N-terminal cleavage is an early event in the proteolytic cascade. The activity of the protease-
RT was found to be nearly equal to that of the mature PR, though its conformational stability was much less than that of PR (19). However a 600-fold decrease in catalytic activity was seen in MBP-
TF-protease-
RT compared with mature PR (19, 21). Thus the flanking N terminus of the protease seems to have important consequences with maturation.
The N-terminal transframe region (TFR) consisting of a conserved N-terminal transframe octapeptide (TFP) and a 4860 amino acid long variable p6pol, with a protease cleavage site at the intersection, does not have any stable secondary or tertiary structure in free solution (24), though some tendency for helix formation has been seen. However, when present with the PR, TFR does seem to act as a regulator for the autoprocessing of the protease (11, 23, 2528, 29). Interaction of recombinant p6* protein with HIV-1 PR was found to specifically inhibit its activity, and the inhibition was dependent on the C-terminal cleavage site residues SFNF in the p6*. In separate experiments with the precursor, these residues blocked the substrate binding cleft in HIV1-PR after N-terminal autoprocessing of the precursor. At the same time it was also observed that the p6* stabilized the dimer, as the relative amount of dimer increased by 12% in its presence (25). Functional characterization of the model precursor
TFP-p6pol-PR (
TFP is a 5-residue variant of TFP) by examination of the mechanism and the pH rate profile of the autocatalytic reaction to produce mature PR shows that full-length TFR with its native cleavage sites is critical for the regulated autoprocessing of Gag-Pol and for optimal catalytic activity (28). The extensive study by Dautin et al. (27) on functional modulations due to N- and C-terminal extensions to PR, using an E. coli genetic assay for proteolytic activity and a bacterial two-hybrid system, shows that the TFR can restore enzymatic activity to a dimerization-deficient HIV protease variant. Experiments with various deletion and addition mutants of PR and its precursors, Gag-Pol, TFR-PR, also give insights into folding and dimerization of PR (29). For example, deletion of the first four residues in PR led to >90% unfolded
PR. Similar destabilization was observed for PR with additional residues in the N terminus (29). Earlier, it has also been shown that removal of the p6pol domain from the Gag-Pol polyprotein leads to a significantly higher rate of processing of the Gag-
Pol precursor (31).
The studies discussed so far are mainly based on enzymatic activity assays for the HIV-1 PR and its precursors using the chromogenic peptide substrate Lys-Ala-Arg-Val-Nle-Phe(p-NO2)-Glu-Ala-Nle-NH2 (19, 21, 32), or immunoblotting assays of the autolytic products. These give very good quantitative as well as qualitative information with regard to the working of the various precursors of HIV-1 protease. However, there are very few reports about the residue level structural characteristics of these precursors, which is crucial to understanding the molecular mechanism of protease maturation (29, 33). Louis et al. (29) have earlier shown, through NMR, how the N-terminal TFR extension to the HIV-1 PR does not allow it to fold even in the presence of DMP323, which is one of the tightest binding inhibitors. Detailed NMR structural characterization of wildtype TFP-p6pol-PR was not possible because of its autolytic property. Hence, in a later study, an active site D25N mutation was introduced, and the HSQC spectra were seen to have many peaks at the same chemical shifts as in the spectra of the folded PRD25N, though, they also had many intense peaks in a narrow region of amide proton chemical shifts (8.08.5 ppm), presumably belonging to the TFR residues. This indicated that the PR region folded properly, although the TFR region could not be characterized because of insufficient dispersion of the peaks (33). It was suggested that the TFR region was largely unstructured.
Thus, all the above studies demonstrate the importance of TFR on the folding and maturation of the protease. However, the mechanistic details at the residue level are still not understood. In this context we present here investigations on a precursor TFP-p6pol-PR-Cnn, where Cnn is a non-native pentapeptide extension at the C terminus of PR. Bacterial expression and MALDI analysis of the precursor show that TFR does not hamper the autoprocessing of the precursor so as to release the PR. Deletion of Cnn enhanced autoprocessing, indicating that the non-native C-terminal extension interferes in the cleavage mechanism. We carried out extensive NMR investigations on the precursor containing an active site mutation D25N, which was stable for several weeks for NMR experiments. We monitored the intrinsic folding propensities of the precursor by studying the graded changes in the dynamic as well as structural characteristics of the equilibrium intermediates, created by use of different concentrations of the chemical denaturant, urea. These results have significant implications for the regulation mechanism of the autoprocessing reaction of HIV-1 protease precursors.
| MATERIALS AND METHODS |
|---|
|
|
|---|
0.8, and then induced for production of the desired proteins using 1 mM isopropyl-1-thio-
-D-galactopyranoside. Uniformly 15N- and 15N/13C-labeled protein samples were prepared by growing bacteria in M9 minimal media supplemented with 1 g liter-1 15NH Cl and 4 g liter-1 [13C]glucose. Protein was purified as described previously (35). MALDI analysis of the protein showed peaks at the expected molecular mass (17.3 kDa). The NMR samples contained 1 mM protein in 50 mM acetate buffer (pH 5.2) containing 5 mM EDTA, 20 mM dithiothreitol, and different concentrations of urea in 90% H2O, 10% D2O. Gel ElectrophoresisThe recombinant protein was induced in BL21(DE3) E. coli bacterial cells as described in the section on protein preparation. Aliquots were taken at two different induction times, 3 and 5 h, and analyzed on 12% SDS-PAGE.
Capillary ElectrophoresisThe purified protein was concentrated to
1 mM and analyzed by neutral capillary electrophoresis on a Beckmann-Coulter capillary electrophoresis system in the presence and absence of the denaturants, urea and guanidine hydrochloride.
Mass SpectroscopyMALDI-TOF mass spectrometry analyses were carried out with Micromass (UK) MALDI-TOF Spec 2E spectrometer equipped with a UV nitrogen laser (337 nm) and a dual microchannel microplate detector. The samples were prepared by mixing 1 µl of protein solution (
20 µM) with 1 µl of freshly prepared matrix solution (10 mg/ml of 2,5-dihydroxybenzoic acid in 3:2 0.1% trifluoroacetic acid/acetonitrile). A total of 1 µl of this mixture was placed on the stainless steel probe plate and allowed to dry at room temperature. The spectra were recorded in the positive reflector linear mode at an accelerated voltage of 20 kV in the range from 4000 to 30,000 Da. For each measurement, the spectra were externally calibrated using myoglobin and trypsinogen.
NMR SpectroscopyAll NMR experiments were performed at 32 °C on a Varian Unity-plus 600 MHz NMR spectrometer equipped with pulse-shaping and pulse-field gradient capabilities. For the HNN spectrum the delays TN, and TC, were both set to 28 ms. 40 complex points were used along the t1 and t2 dimensions. The HN(C)N spectrum was recorded with the same TN and TC parameters, the same number of t1 and t2 points, and the TCC delay was set to 9 ms. TOCSY-HSQC was recorded with a mixing time of 60 ms, 32 complex points along the 15N (t1) dimension and 64 complex points along the 1H (t2) dimension. CBCANH and CBCA(CO)NH were recorded with 40 complex t1 points (15N) and 64 complex t2 points (13C). HNCO was recorded with 40 complex points along t1 and t2. An HSQC was recorded with 256 t1 increments. For the high resolution HSQC data, required for coupling constant measurements, 8192 and 512 complex points were acquired along the t2 and t1 dimensions, respectively. For the relaxation measurements 2048 and 256 complex points were collected along the two dimensions. For R2 measurements, the following Carr-Purcell-Meiboom-Gill (CPMG) delays were used: 10, 30, 50, 90, 130, 150, 190, 230 ms and spectra duplicated at 50 and 150 ms. The R2 values were extracted by fitting the peak intensities to the equation I(t) = B exp(-R2t). The experiments were carried out using the pulse sequences described by Farrow et al. (36).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
11-kDa peak for the PR and a
7-kDa peak for the TFR part; however for the TFR-PR-Cnn we see a peak at
18 kDa corresponding to the precursor. This seems to suggest that the C-terminal extension possibly interacts with the PR region; either it interferes with dimer formation or it blocks the active site as has been observed for the SFNF stretch at the C terminus of the TFR in an earlier study (26).
|
|
Resonance AssignmentsThe TFR-PR-Cnn precursor is 161 residues long, of which the first 57 residues belong to the TFR portion. The next 99 residues, that is, 58156 actually constitute the PR portion. Hence residues 5962, 152156 form the dimerization domain, 8284 form the active site, 138140 form the substrate binding cleft, 100106 form the hinge region, and 109112 constitute the mobile flaps in the PR (35). The final five residues (157161) having the sequence GGSSG, constitute an extension to the PR at the C terminus. Henceforth we will use these numbers for structural discussion.
Conventionally, backbone assignment in proteins has been achieved by a combination of several three-dimensional triple resonance experiments, typically, HNCA, HN(CO)CA, CBCANH, and CBCA(CO)NH (reviewed recently in Ref. 38). These experiments display correlations between HN, 15N, and (C
, Cb) nuclei along the protein backbone. The success of this approach depends critically on the dispersion of the C
, C
chemical shifts, and therefore for unfolded proteins, where this dispersion is very poor, the method has serious limitations. Our methodology of assignment is based on the recently described triple resonance experiments HNN and HN(C)N (39). The most significant feature of these experiments is the observation of different patterns of positive and negative peaks in the (F1, F3) planes depending on the residue types at i-1, i, and i+1 positions. These have been discussed in detail earlier (39, 40); suffice it to say here that glycines and prolines play important roles in this regard, the former because of the absence of the C
, and the latter because of the absence of the amide proton. Triplets containing these residues produce very characteristic patterns in the (F1, F3) planes, which can be termed as fixed points. These provide many starts and check points for the sequential walk, and hence it is less crucial to obtain side chain assignment to validate the backbone assignments. Nevertheless, simultaneous analysis of an 15N resolved TOCSY (41) experiment helps in resolving occasional ambiguities in the connections because of degeneracies of the chemical shifts. This is particularly useful in unfolded proteins, since the side chain chemical shifts are close to their random coil values, and hence the spin systems of the residues can be relatively easily identified.
TFR-PR-Cnn has 20 glycines and 8 prolines, which are well distributed over the length of the polypeptide chain. Thus there are a number of fixed points, well distributed, to enable unambiguous assignments. Fig. 3A shows an illustrative sequential walk through the stretch 153161, and Fig. 3B displays the summary of the connectivities. All the amide and 15N assignments are shown in the 15N HSQC spectrum in Fig. 4.
|
|
, C
assignments while the HNCO provides C' assignments. We also obtained many side chain assignments for the individual residues from TOCSY-HSQC spectra in a straightforward manner, making use of the amide and 15N assignments. All the assignments made in 8 M urea have been listed in Table I of the Supplementary Material. The HSQC spectra at other urea concentrations were very similar to the one at 8 M urea (see below), and thus peak assignments could be readily obtained by simple transfer of assignments.
Residual Structure at 8 M UreaIt is now becoming increasingly evident that the denatured states of proteins are not always entirely random coils, but may contain regions with preferred conformations or propensities for transient structure formations (4447). The regions having propensities for definite structures are the so-called folding cores, which indicate folding initiation sites on initiation of the folding reaction by dilution of the denaturant concentrations. We have probed for the existence of such preferences in the 8 M urea denatured state of TFR-PR by using carbon C
, C' secondary shifts (deviations of chemical shifts from their random coil values); these are believed to be the most diagnostic from the point of view of residual structural characterization (47, 48). Positive secondary shifts for 13C
and 13C' indicate a preference for
,
angles in the helical conformation, while negative secondary shifts indicate a preference for
,
angles in the
-sheet conformation. If a contiguous stretch of 34 residues shows a specific pattern of secondary shifts, that can be taken to indicate the presence of a transient secondary structural propensity in that region of the protein. Now, in any protein the observed chemical shifts are influenced both by neighboring amino acids and local backbone structure. Therefore, it is important to correct these for contributions from the local amino acid sequence (49). In the present analysis, the random coil values were corrected using sequence-dependent correction factors determined for a set of Ac-GGXGG-NH2 peptides in 8 M urea and pH 2.3 (50). For the residues D, E, and H, which are sensitive to pH, the random coil values given by Wishart et al. (51) at pH 5.0, appropriately corrected for the alanine neighbor were used. Deviations in specific chemical shifts were then calculated by subtracting the corrected random coil values from the measured chemical shifts for all the residues in urea-unfolded TFR-PR-Cnn. These secondary shifts are shown in Fig. 5. The data does not seem to indicate the presence of any long stretches of preferred conformations but suggests the presence of many short contiguous regions with trends of
,
preferences.
|
secondary shifts (top panel) are rather small for most residues, but show interesting sequence-dependent variations; about 15 discrete residues show large secondary shifts (>1.5 ppm) which may represent individual preferences in the
,
space. There are many short contiguous stretches showing shifts of more than about 0.3 ppm (identified by the horizontal line in the figure), and these can be considered to be having propensities for transient secondary structure formation. There are also few other contiguous stretches with smaller shifts, which may indicate smaller tendencies to populate the respective
,
angles in the Ramachandran map. The locations of the stretches with good
-helical and
-strand propensities are shown by open and filled boxes, respectively, in the figure. The TFR segment (residues 157) of the protein appears to contain two short
segments and two short
segments. Previous qualitative reports on structural characteristics of TFR-PR (24, 28) suggested that the TFR segment may be largely unfolded in aqueous solutions. Our present observations, however, seem to suggest that there may be at least a few regions of some
,
preferences, in an otherwise largely unstructured polypeptide. The PR segment of the protein contains many
segments and only one
stretch. The location of the
stretch (8587) is certainly not the same as in the native PR where it occurs near the C terminus; this corresponds to the stretch 143151 in the present case. Several of the
stretches, namely, 6266, 7072, 98102, 110114, 134138 belong to the native-type structures (
type) in the dimeric structure of PR (34).
The C' secondary shifts (bottom panel in Fig. 5) corroborate the results from C
secondary shifts to a large extent. In both cases the contiguous stretches with
and
propensities are nearly at the same locations as is also the discrete residues with large secondary shifts. The short displacements of the stretches or a few mismatches may be attributed to the facts that the segments themselves are very short, and the sensitivities of the C' and C
secondary shifts to
,
preferences could be slightly different. Overall, the C' secondary shifts are slightly larger in magnitude compared with the C
secondary shifts (cutoff of 0.4 ppm is used for C' secondary shifts). The stretch at 103111 is significantly longer, and this belongs to the flap segment of the native protease structure (34).
We also measured the HN-H
coupling constants (see below), amide proton temperature coefficients, and 1H-1H nuclear overhauser effects (data not shown), all of which indicate that the protein is devoid of any persistent structure in 8 M urea. The sensitivities of the average coupling constants to the structural preferences are perhaps relatively smaller compared with the secondary chemical shifts. The transverse relaxation rates (R2) (see below) indicated, however, sequence-dependent variations, suggesting possibilities of conformational transitions at certain locations. Thus we conclude that in 8 M urea at pH 5.2 and 32 °C, the polypeptide is largely unstructured but with short pockets of specific secondary structural propensities in a dynamic ensemble.
Equilibrium Intermediates Along the Folding FunnelEquilibrium intermediates created by different denaturant concentrations help to understand the folding transitions along the folding pathway of a protein. Fig. 6 shows the HSQC spectra of the precursor as a function of denaturant concentration. Interestingly, the spectra (Fig. 6, panels ad) at 8, 6, 4, and 2 M do not show any substantial change in the profile of peak dispersions, thus showing that the protein has a high tendency to be unfolded. All the peaks present in the 8 M spectrum are also present in the 6, 4, and 2 M spectra at almost identical positions, barring a few that show small shifts. However, there are some weaker peaks in the spectra at all the denaturant concentrations, which suggest the presence of other conformations that may be partially folded forms. The presence of these peaks indicates that the state identified by the conserved peaks in the spectra would have differences in the dynamic characteristics under the different denaturant conditions. The spectrum (Fig. 6, panel e) in the absence of urea shows very few broad peaks, which is consistent with the aggregation behavior of the protein discussed earlier.
|
coupling constants. The transverse relaxation rates are sensitive to slow motions and conformational transitions occurring on the milli- to- microsecond time scale. In many instances these have provided valuable insights into sequence-dependent motional restrictions and flexibilities in denatured proteins, which in turn provide clues to the folding mechanisms (12, 5257). In an earlier report, Bhavesh et al. (52) have shown the importance of sequence-specific variations in the transverse relaxation rates (R2), as denaturant concentration is decreased, on the folding hierarchy of HIV-1 protease. The changes in the magnitude of R2 values as the denaturant concentration is varied directly reflect on the transient conformational changes along the sequence that may lead to order, and hence native structure development by formation of native contacts or breaking of non-native contacts. Fig. 7a shows the R2 values for the TFR-PR-Cnn as the denaturant concentration is decreased from 8 to 2 M. The R2 values do show sequence-specific variations indicating different degrees of restricted motions along the chain. Bothupwardanddownwardchangesoccur,suggestingsequence-dependent transient changes in the structural preferences. At this juncture we may mention that the absence of data points for some of the residues is caused by the difficulty in quantitation because of nearby weaker peaks, and also, data points having more than 15% fitting error have not been included; in most cases the errors are less than 6%. Fig. 7b shows the changes in the R2 values as we move toward lower denaturant concentration. Negative and positive deviations indicate increase or decrease in R2, respectively, and correspondingly represent increased and decreased conformational transitions, as long as the protein is still largely unfolded and there are no rigid structures. Once the rigid structures are formed, changes in R2 values would be dictated by internal motions only. The deviations in Fig. 7b may be divided into three classes ±(>1.5), ±(1.51.0) and ±(1.00.0). The third class is roughly similar to the errors in the R2 measurements and hence may not be considered as significant. Thus it follows that as we move from 8to6 M urea the residues 912, 16, 5556, 75, 77, 111, 124, 144, 149, 157 show large propensities for conformational transitions followed by residues 18, 67, 80, and 102. The important numbers among these are: 5556 at the N terminus, 157 at the C terminus, both of which are cleavage sites of the TFR-PR, and 102,111 at the flaps of the PR domain. As we move to 4 M, mostly the same regions exhibit variations, but the magnitudes are somewhat reduced, except for the stretch 8284 at the active site, which shows enhancements indicating large conformational transitions. The general reduction may indicate a tendency toward formation of stable contacts. This trend continues as we move to 2 M, where we see a large decrease in the contribution from the conformational transitions at milli- to microsecond time scales. This seems to indicate formation of relatively more rigid contacts. A more detailed characterization would require NOE quantitation and structure calculations, but this is hampered by the tendency of the protein to aggregate and precipitate.
|
) coupling constant, which has the main chain
torsion angle dependence, is an NMR parameter that can be rigorously analyzed to get an insight into the secondary structural elements that define the conformational preferences (30). A value in the order of 35 Hz corresponds to
-helix, 811 Hz corresponds to
-sheet, and 6.07.5 Hz, which essentially is an average of the
,
values corresponds to random coil. It is also observed that the random coil value for any residue is influenced by its N-terminal neighbor and thus two sets of values have been reported for each residue, depending upon whether the N-terminal neighbor belongs to one of the two groups of residues (22) (group I: Phe, Trp, His, Tyr, Ile, Thr, Val, and group II: remaining residues except glycine). Thus, under any given experimental conditions, deviation of the observed coupling constants from the sequence-dependent random coil value, (Jobs - Jrc), which we call as secondary coupling constants analogous to secondary chemical shifts, would throw valuable light on the secondary structural propensities along the polypeptide chain. Negative secondary coupling constants would indicate
-helical propensities and positive secondary coupling constants would indicate
propensities.
Fig. 8a shows the fine structures of the peaks in the HSQC spectra from which the couplings were derived. The measured values range from 5.0 to 9.1 Hz along the sequence in all the cases from 8 to 2 M urea as shown in Fig. 8b, and the average value (indicated by horizontal line) is roughly the same (
6.8 Hz); the estimated error in the measured coupling constants is
1 Hz. The calculated secondary coupling constants are shown in Fig. 8c. As expected, many of the secondary coupling values are zero or close to zero because of random coil characteristics; however there are also few residues that show either positive or negative deviations larger than 1 Hz. Notable among these are the contiguous stretches at 812 in 6 M and at 1925 and 110116 in 4 M data, which may be taken to indicate some conformational transitions. These have been marked on the figure by empty cylinders and all of them correspond to
propensities. Interestingly, those in the PR region are non-native type.
|
|
| CONCLUSIONS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains Supplementary Materials. ![]()
¶ To whom correspondence should be addressed: Dept. of Chemical Sciences, Tata Institute of Fundamental Research, Homi Bhabha Road, Mumbai 400 005, India. Tel.: 91-22-2280-4545, extension: 2488; Fax: 91-22-2280-4610; E-mail: hosur{at}tifr.res.in.
1 The abbreviations used are: HIV, human immunodeficiency virus; RT, reverse transcriptase; PR, HIV-1 protease; MBP, maltose-binding protein; TFR, N-terminal transframe region; AIDS, acquired immuno-deficiency syndrome; MALDI-TOF, matrix-associated laser desorption ionization-time of flight; HSQC, heteronuclear single quantum coherence; TOCSY, total correlated spectroscopy; RT, reverse transcriptase; TFP, transframe octapeptide. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. Leiherer, C. Ludwig, and R. Wagner Uncoupling Human Immunodeficiency Virus Type 1 gag and pol Reading Frames: Role of the Transframe Protein p6* in Viral Replication J. Virol., July 15, 2009; 83(14): 7210 - 7220. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Ludwig, A. Leiherer, and R. Wagner Importance of Protease Cleavage Sites within and Flanking Human Immunodeficiency Virus Type 1 Transframe Protein p6* for Spatiotemporal Regulation of Protease Activation J. Virol., May 1, 2008; 82(9): 4573 - 4584. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Molecular and Cellular Proteomics |
| Journal of Lipid Research | ASBMB Today |