If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Synchrotron SOLEIL, L'Orme des Merisiers, F-91410 Saint Aubin, FranceLaboratoire de Génie Chimique, Université Paul Sabatier, UMR 5503, Toulouse, France
As all the viruses belonging to the Mononegavirales order, the nonsegmented negative-strand RNA genome of respiratory syncytial virus (RSV) is encapsidated by the viral nucleoprotein N. N protein polymerizes along the genomic and anti-genomic RNAs during replication. This requires the maintenance of the neosynthesized N protein in a monomeric and RNA-free form by the viral phosphoprotein P that plays the role of a chaperone protein, forming a soluble N0-P complex. We have previously demonstrated that residues 1–30 of P specifically bind to N0. Here, to isolate a stable N0-P complex suitable for structural studies, we used the N-terminal peptide of P (P40) to purify truncated forms of the N protein. We show that to purify a stable N0-P–like complex, a deletion of the first 30 N-terminal residues of N (NΔ30) is required to impair N oligomerization, whereas the presence of a full-length C-arm of N is required to inhibit RNA binding. We generated structural models of the RSV N0-P with biophysical approaches, including hydrodynamic measurements and small-angle X-ray scattering (SAXS), coupled with biochemical and functional analyses of human RSV (hRSV) NΔ30 mutants. These models suggest a strong structural homology between the hRSV and the human metapneumovirus (hMPV) N0-P complexes. In both complexes, the P40-binding sites on N0 appear to be similar, and the C-arm of N provides a high flexibility and a propensity to interact with the N RNA groove. These findings reveal two potential sites to target on N0-P for the development of RSV antivirals.
). hRSV infects close to 100% of infants within the first 2 years of life. It is the main cause of bronchiolitis in young children, as well as a significant cause of severe respiratory infections in the elderly. The virus belongs to the Mononegavirales order and Pneumoviridae family (
). Like all the viruses belonging to this order, the hRSV genome is a nonsegmented negative-strand RNA. This genome of 15 kb is enwrapped by the nucleoprotein (N), forming a helical nucleocapsid (
). The crystal structures of the Mononegavirales N proteins complexed with RNA show that they all present a similar structural organization with two globular domains (NNTD and NCTD) that form the RNA groove, and N- and C-arms that are involved in N oligomerization (
). The ribonucleoprotein complex constitutes the template for viral transcription and replication by the viral RNA-dependent RNA polymerase (reviewed in Ref.
). During the viral cycle, the constant supply of neosynthesized monomeric and RNA-free N protein (named N0) constitutes a prerequisite during replication to encapsidate neosynthesized single-stranded genomic RNA (−RNA) and anti-genomic RNA (+RNA). By analogy with paramyxoviruses and rhabdoviruses, the hRSV P protein plays the role of a chaperone protein by preventing N from binding to cellular RNAs and N self-oligomerization (
). Invariably, the N-terminal residues of P are sufficient to maintain the N0 form. However, the structure of the RSV N0-P complex still remains to be solved, and those from related viruses revealed a specificity of P binding at the surface of N proteins that highlighted different mechanisms of the chaperone activity of P. For the vesicular stomatitis virus, N0-P complex, the residues 11–35 of P adopt an α-helical conformation upon binding to the surface of N, and this binding domain overlaps both the RNA-binding groove and the binding surface of the Ni+1 N-arm involved in N oligomerization (
). In these complexes, P binding prevents N from self-oligomerization and maintains it in an open conformation that impairs the interaction with RNA. A similar mechanism was found for the N0-P of Ebola and Marburg viruses, although the P peptide does not bind at the top of the NCTD but near the RNA groove, and adopts a specific conformation with two short helices (
). More recently, the structure of the N0-P complex of parainfluenza 5 (PIV5) showed that the binding of the P peptide to the NCTD blocks both the sites involved in the binding of RNA and of the N-arm of Ni+1 required for oligomerization, maintaining N in an open conformation (
). Finally, the 3D structure of the human metapneumovirus (hMPV) N0-P complex, that is phylogenetically the closest virus relative to hRSV, revealed that P binds on the NCTD but also that the C-arm of N presents a specific conformation to block the accessibility to RNA (
For RSV, using an N mutant that does not interact with RNA, we have previously shown that the residues 1–30 of P are involved in the chaperone activity (
). The periodicity of the residues critical for N binding also suggests that residues 13–28 of P would adopt a helical conformation upon binding to N. This hypothesis was recently supported by NMR studies of RSV P that revealed the presence of a transient α-helix from Asp-12 to Ile-24 (
). However, the poor stability of the N mutant did not allow us to perform structural studies of the complex.
In this work, we rationally designed deletions of the N- and C-arms of the RSV N protein to destabilize the oligomeric organization of N-RNA rings to purify a stable N0-P–like complex, when co-expressed with the N-terminal part of P (P40). Using biochemical and biophysical approaches, we obtained structural information about this complex in solution. Our results highlight a strong structural homology between the hMPV and hRSV N0-P complexes.
Results
Purification of a stable RSV N0-P–like complex
We have previously shown that a surrogate of the RSV N0-P complex can be obtained by co-expression of a K170A/R185A double N mutant with the N terminus of P in bacteria (
). However, the poor stability of this complex at high concentration impaired further structural characterization. Previous studies have shown that the deletion of the 12 N-terminal residues of N, that correspond to nearly half of the N-arm of N, was sufficient to purify a monomeric RNA-free N protein (
), the available crystal structures of N0-P complexes of Mononegavirales were obtained by deleting either the entire N-arm or both N- and C-arms of N (
). Based on these data, we decided to generate truncated N proteins deleted of either N-arm or both the N- and C-arms (Fig. 1A) to seek for a stable N0 recombinant protein. Mutant N proteins were co-expressed in Escherichia coli with GST-P40 (40 N-terminal residues of P) and purified on GSH-Sepharose beads by affinity chromatography. Among the four N expression constructs that were tested, only the N protein deleted of both N- and C-arms was not co-purified with GST-P40 (Fig. 1B). To further characterize the purified N-P complexes, the GST tag was cleaved by TEV protease. We first checked the absence of RNA associated with the isolated recombinant proteins by measuring the A260 nm/A280 nm absorption ratio. Whereas these ratios were <1 for N(13–391) and N(31–391), it was close to 2 for N(13–361), showing a strong tendency to associate with nucleic acids for this construct. These observations strongly suggest that the C-arm of N is required to prevent binding of N to RNA during the formation of the N0-P complex.
Figure 1Purification and characterization of truncated N recombinant proteins. A, schematic representation of the primary sequence of N and of the different deletions generated. The subdomains of N are colored: N-arm (blue), NNTD (yellow), NCTD (red), and C-arm (green). Numbers indicate amino acid positions. B, SDS-PAGE analysis of co-purification of GST-P40 and truncated N proteins; the A260 nm/A280 nm ratio indicating the presence or absence of RNA is indicated for each sample down the lanes. C, elution profiles of N(13–391) (gray) and N(31–391) (black) from gel filtration, at 280 nm (solid lines) and 260 nm (dash lines). D, far-UV CD spectra showing the absence of major conformational changes between N-RNA rings (dash lines) and monomeric N(31–391)-P40 (solid lines).
The purified recombinant N(13–391)-P40 and N(31–391)-P40 complexes were then analyzed by size exclusion chromatography. The N(31–391)-P40 complex eluted from the column as a single peak that corresponds to an apparent molecular mass of ~50 kDa (extrapolated from S200 calibration profile), and the A260 nm/A280 nm value <1 confirmed the absence of RNA (Fig. 1C). For the N(13–391)-P40 complex, in addition to the peak of N protein with apparent molecular mass of ~50 kDa and RNA free, a second peak with an apparent molecular mass of ~500 kDa probably associated to RNA (A260 nm/A280 nm >1) corresponding to N-RNA oligomers was obtained (Fig. 1C). These results reveal that deletion of the entire N-arm of N is required to obtain a stable monomeric N0-P complex surrogate. Finally, the secondary structure of the recombinant purified N(31–391)-P40 complex was analyzed by circular dichroism (CD). As shown in Fig. 1D, the far-UV CD spectra of the N0-P complex surrogate was similar to the spectra of the N-RNA rings; both CD traces presented a positive Cotton effect at 190 nm and two negative peaks at 208 and 222 nm, typical of secondary structures with a mainly α-helical content. This result confirms that no major secondary structure change occurs in the N0 variant when compared with the N-RNA state, as previously shown (
Altogether, our data show that the RSV N(31–391) recombinant protein, subsequently named NΔ30, co-purified with the peptide P40, is a functional surrogate of the RSV N0-P complex. The NΔ30-P40 complex was also found to be monomeric upon concentration. Furthermore, our results strongly suggest that the C-arm of N plays a major role in preventing N binding to RNA.
Investigation of the P40-binding site at the surface of the N protein by site-directed mutagenesis
We then tried to identify the binding site of P40 on monomeric N. By analogy with the structure of N0-P complexes of other Mononegavirales, it is expected that P plays a chaperone activity by binding to a surface of N that overlaps with the fixation site of the N-arm of the Ni+1 protomer within the nucleocapsid, thus impairing N oligomerization. The recent structural elucidation of the hMPV N0-P complex revealed that P binding also competes with the C-arm of the Ni−1 protomer involved in oligomerization (Fig. 2A) (
), we identified 20 residues at the surface of N protomer that could be involved in the interaction with P40 (Fig. 2B). Among these residues, 18 residues are implicated in the interaction with the N-arm of Ni+1 protomer, and the two residues Ile-270 and Tyr-353, are involved in the interaction with the C-arm of Ni−1 protomer. We first attempted to determine the impact of alanine substitution of these residues on the polymerase activity using the minigenome assay, as described previously (
). Preventing the formation of an N0-P complex competent for genomic or antigenomic RNA encapsidation would result in a decrease in N-RNA template formation, and therefore of mRNA transcription and expression of the luciferase (Luc) reporter. Although no main impact on N expression was observed (Fig. 2D), alanine substitution of the 5 residues Leu-272, Met-281, Tyr-288, Phe-300, and Phe-320 totally abrogated polymerase activity, and 11 other mutants displayed a reduction of luciferase activity of about or more than 50% compared with WT N protein (Fig. 2C). Only 4 mutations (K81A, I82A, V262A, and Y353A) had no impact on the polymerase activity.
Figure 2Search for the P40-binding site at the surface of N. A, ribbon representation of the hMPV NCTD (green) in interaction with the peptide of P (residues 1–28, purple). The residues of N involved in the interaction with P are colored in black and yellow and the side chains are shown (PBD ID: 5FVD). B, ribbon representation of a part of RSV N protomer (green) from N-RNA 3D crystal structure (PBD ID: 2WJ8) showing binding of the Ni−1 N-arm (in red) and of the Ni+1 C-arm (in blue). Residues involved in Ni−1 N-arm or in Ni+1 C-arm binding, and that could also interact with P40, are colored in black and yellow, respectively, and their side chains are shown. C, effect of N mutations on RSV polymerase activity. BSRT7/5 cells were transfected with plasmids encoding the WT P, M2–1, and L proteins, the pMT/Luc minigenome, and WT or mutant N proteins, together with pCMV–β-gal for transfection standardization. Viral RNA synthesis was quantified by measuring the luciferase activity after cell lysis 24 h after transfection. Each luciferase minigenome activity value was normalized based on β-gal expression and is the average of three independent experiments performed in triplicate. Error bars represent standard deviations calculated based on three independent experiments made in triplicate. D, Western blotting showing the expression of N protein variants in BSRT7/5 cells.
To further investigate the impact of these mutations, we then studied the cellular localization of N. As previously described, co-expression of N and P proteins leads to the formation of inclusion bodies (IBs) similar to the structures observed during RSV infection (
Cytoplasmic inclusions of respiratory syncytial virus-infected cells: formation of inclusion bodies in transfected cells that coexpress the nucleoprotein, the phosphoprotein, and the 22K protein.
Identification of protein regions involved in the interaction of human respiratory syncytial virus phosphoprotein and nucleoprotein: Significance for nucleocapsid assembly and formation of cytoplasmic inclusions.
). No impact on IB morphology was detected upon substitution of residues Lys-81, Ile-82, Val-262, Val-285, Gly-296, and Tyr-353 (not shown and Table 1). On the contrary, a total loss of IB formation was observed when mutating residues Lys-272, Met-281, Tyr-288, Phe-300, and Phe-320 (Fig. 3 and Table 1). More surprisingly, the absence of IBs was also observed in the presence of N substitutions M248A and L258A that respectively displayed 30 and 50% of polymerase activity (Table 1). Finally, a main defect on IB formation (decrease in number and size of IBs compared with the WT) was observed upon mutation of residues Val-224, His-225, Ile-228, Phe-247, Arg-259, Ile-270, and Val-284 (Fig. 3 and Table 1).
Figure 3Impact of N mutations on the formation of cytoplasmic inclusion bodies. N and P proteins were coexpressed in BSRT7/5 cells; cells were then fixed 24 h post transfection and labeled with anti-P (green) and anti-N (red) antibodies, and the distribution of viral proteins was observed by fluorescence microscopy. Nuclei were stained with DAPI. Scale bars, 10 μm.
Taken together, these results show that, among the 20 targeted N residues, 16 are critical for N function within the polymerase complex. However, given the specific localization of targeted residues, the strong effect of mutations on polymerase activity and N localization could be attributed to either a major defect in (i) intrinsic N conformation, (ii) N–N interaction, or (iii) N–P40 interaction.
Identification of N residues involved in P40 binding
We then assessed the potential impact of N mutations on its structure and its capacity to interact with RNA. As previously described, co-expression in bacteria of N with GST-PCT, which corresponds to a fusion protein between the GST and the fragment P(161–241) that interacts with the NNTD, allows purifying N-RNA rings using the GST tag (
). Mutations were thus introduced in the pET-N vector, and the ability of N (WT or mutants) to co-purify with GST-PCT was evaluated by SDS-PAGE and Coomassie Blue staining (Fig. 4A). Among the 18 N variants tested, only 8 (V224A, M248A, L258A, V262A, V285A, G296A, I270A, and Y353A) were co-purified with GST-PCT with similar amounts compared with WT N. The mutations of the 5 residues His-225, Ile-228, Phe-247, Met-281, and Val-284 seemed to alter the interaction with PCT. Finally, the 5 mutants R259A, L272A, Y288A, F300A, and F320A were not pulled down by GST-PCT, suggesting a strong impact of these mutations on the structural integrity of N. These results correlate with those obtained for the functional minigenome assay and IB morphology (Table 1).
Figure 4Impact of N mutations on N-P and N-N interactions. A, full-length N constructs (WT or mutants) were co-expressed with GST-PCT in E. coli and complexes were pulled down using GST tag. B, NΔ30 constructs (WT or mutants) were co-expressed with GST-P40 or GST-N30 in E. coli and complexes were pulled down using GST tag. Co-purification of N (or NΔ30) with GST constructs was analyzed by SDS-PAGE and stained with Coomassie Blue.
We then focused on the mutants that could still interact with GST-PCT and tried to decipher if the targeted residues could be involved in the interaction with either P40 or the N-arm of N. To this end, 11 mutations were introduced in pET-NΔ30 vector. NΔ30 protein (WT or mutant) was co-expressed in bacteria together with either GST-N30 (30 N-terminal residues of N) or GST-P40, and the ability of monomeric N to be pulled down was evaluated by SDS-PAGE and Coomassie Blue staining. As shown in Fig. 4B, the 5 mutations V224A, M248A, L258A, M281A, and V284A induced a loss of interaction between NΔ30 and GST-N30. On the contrary, these mutations poorly affected the interaction with GST-P40, except for L258A and V284A substitutions that seemed to interact less efficiently with GST-P40. More interestingly, the mutation I270A specifically abrogated the interaction with GST-P40 without affecting the interaction with GST-N30 (Fig. 4B). Finally, the Y353A substitution, which had no impact on minigenome activity or on N oligomerization, induced a defect of interaction of NΔ30 with both GST-N30 and GST-P40. We thus cannot exclude a defect of NΔ30 folding for this mutant protein.
Although our results suggest that most of the introduced mutations strongly affect N conformation, we clearly reveal the direct role of N residue Ile-270 in the interaction with P40. The data also suggest a role of N residues Leu-258 and Val-284 in the interaction. These results confirm the strong structural homology between RSV and hMPV N0-P complexes, with the P-binding domain on N0 overlapping both the Ni−1 N-arm– and Ni+1 C-arm–binding sites involved in N oligomerization (Fig. 2A).
Modeling the structure of RSV N0-P complex in solution
To gain information on the structure of the N0-P complex, small angle X-ray scattering (SAXS) and analytical ultracentrifugation (AUC) approaches were then combined to provide information about the overall size, shape, and stoichiometry of this complex in solution (Fig. 5). After purification of the NΔ30-P40 complex, we first analyzed the complex by AUC (Fig. 5A). One main species with a sedimentation coefficient of 3.4 ± 0.15 s and a frictional ratio of 1.3 compatible with a complex form of one N0 and one peptide P40 was detected (Table 2). This single population suggests a really stable interaction between N0 and P peptide as previously measured (
). On the other hand, the molecular weights derived from Guinier analysis of the SAXS data reveal no sample aggregation or radiation damage. In agreement with AUC experiments (Table 2), we found that the NΔ30-P40 complex behaves as a monodisperse distribution of monomers in one to one complex in solution. The estimated molecular mass of the complex derived from the extrapolated intensity I(0) at the origin is consistent with the theoretical value of 45.3 kDa (Table 2 and Table S1). The maximum distance (Dmax) and the radius of gyration (Rg) of the NΔ30-P40 complex derived from the electron pair distance distribution function P(r) were respectively 9.41 nm and 2.74 nm (Table 2 and Fig. 5B). The normalized Kratky plot is characteristic of a multidomain protein with flexible regions (Fig. 5C). To gain further understanding of the structural arrangement of the complex, we used Coral to generate 50 structures of the NΔ30-P40 complex, taking as starting configuration a model based on the X-ray structures of RSV N (PDB ID: 2WJ8) and hMPV N0-P complex (PDB ID: 5FVD), with a random arrangement of the C-terminal end of P40. Briefly, during the modeling process, the P40 peptide and the globular domain (256–359) of N were fixed in position to maintain the contact between the NCTD surface and the P40 peptide based on the crystal structure of the hMPV N0-P complex. During the process, the last 13 amino acids in the C-terminal end of P40, the linker between the two globular domains of N, as well as the C-arm of N were in random conformation, generating a large amount of interdomain and C-arm orientations. The chi2 value, deduced from the difference between the calculated SAXS curves of the models and the experimental data, as well as the calculated theoretical sedimentation values were used to select the best models that were representative of the SAXS curve. Of the 50 models generated by Coral, only 18 were compatible with the AUC data.
Figure 5Structural characterization of RSV NΔ30-P40 in solution. A, analytical ultracentrifugation sedimentation distribution analysis of RSV NΔ30-P40. B, scattering curve of RSV NΔ30-P40 complex (triangle) with fitted curve (plain line) used to generate distance distribution. Inset, distance distribution functions obtained using the program GNOM. C, dimensionless Kratky plot. D, fit of the best Coral model (see F). E, presentation of the five best Coral models of RSV NΔ30-P40 complex superimposed based on the two main possible orientations of the C-arm of N and on the relative position of the NCTD and NNTD domains. Left, the RNA groove access is blocked by the positioning of the N C-arm in the groove and the NCTD and NNTD domains are maintained in an open conformation. Right, the NCTD and NNTD domains are maintained in a close conformation and N C-arm is flexible. The P40 peptide is represented in green and the C terminus of N is indicated. F, ribbon representation of the best structural Coral model of RSV NΔ30-P40 complex. The N protomer (residues 31–382) is represented in blue, the C-arm of N (residues 383–391) in purple, and the P40 peptide in green. Linker regions are represented by spheres.
Of these 18, the best 5 models that were in agreement with both the experimental SAXS data and the AUC hydrodynamic parameters are presented here (Fig. 5E). Interestingly, these selected conformers display two main possible orientations of the C-arm of N depending on the relative position of the NCTD and NNTD domains. In one case (two superimposed models in Fig. 5E, left), the RNA groove access is blocked by the positioning of the N C-arm in the groove and the NCTD and NNTD domains are maintained in an open conformation. In the second case (three superimposed models in Fig. 5F, right), the NCTD and NNTD domains are maintained in a close conformation and N C-arm is flexible. However, the best model presented in Fig. 5, D and F suggests that the RNA groove access is blocked by the positioning of the N C-arm in the groove as already described for hMPV (
). In conclusion, our models support the strong structural similarity between the N0-P complexes of RSV and hMPV, and suggest a high flexibility of the N C-arm.
Mutations in the N C-arm strongly affect the RNA polymerase activity
Based on our results and hMPV structure, we hypothesized that the C-arm of N is a flexible region that impairs interaction between the neosynthesized N0 nucleoprotein and RNA by blocking the access of the RNA-binding groove of N. Hence, in addition to the chaperone role of P, the C-arm of N would also contribute to the transition from the N0-P to the N-RNA bound complexes. It is noteworthy that the N C termini of RSV and other pneumoviruses are characterized by the presence of conserved acidic and hydrophobic amino acid residues (Fig. 6A). Such residues could play a major role by interacting with the positively charged N RNA-groove. We thus investigated the role of these N residues on the activity of the polymerase complex by site-directed mutagenesis using the RSV minigenome assay. The C-terminal acidic (Glu-373, Glu-376, Asp-386, Asp-388, Glu-373+Glu-374, Asp-386+Asp-388) and hydrophobic (Leu-368, Leu-370, and Leu-382) residues of N were substituted by alanine, and the functionality of these N variants within the polymerase complex was assayed. Similarly, the potential phosphorylated residues Ser-366 and Thr-371 were substituted by either alanine or aspartic acid. As shown in Fig. 6B, no major effect on the RNA polymerase activity was observed for S366A, S366D, and L382A variants. However, the E373A, E376A, D386A, D386A/D388A, T371A, and T371D substitutions resulted in an approximatively 50% reduction of luciferase activity compared with WT N. Finally, the E373A/E374A, L368A, and L370A substitutions had the strongest effect, with a reduction of more than 90% of the polymerase activity. It is noteworthy that all N mutants were shown to be expressed in similar amounts in BSRT7 cells compared with WT N protein (Fig. 6C). Altogether, these results show that conserved acidic and hydrophobic residues of the C-arm of N play a major role in the function of N within the polymerase complex.
Figure 6Effect of mutations in the C-arm of N on the RSV polymerase activity. A, sequence alignments of the C-arms of N proteins from viruses belonging to the Pneumoviridae family. UniProt accession codes: NCAP_HRSVA (human RSV); A1DZS3_9MONO (human MPV), A4KZ90_9MONO (avian MPV_C); A7DV59_9MONO (avian MPV_B), NCAP_MPV15 (murine pneumonia virus). Invariant residues are highlighted in white font on a red background. The short α-helix observed in the crystal structure of N (
) is indicated above the sequence by a black rectangle. Triangles under the alignment mark mutated residues: acidic (black), hydrophobic (red), and Ser/Thr (green). B, RSV polymerase activity in the presence of N mutants. BSRT7/5 cells were transfected with plasmids encoding the WT P, M2–1, and L proteins; the pMT/Luc minigenome; and WT or mutant N proteins, together with pCMV–β-gal for transfection standardization. Viral RNA synthesis was quantified by measuring the luciferase activity after cell lysis 24 h after transfection. Each luciferase minigenome activity value was normalized based on β-gal expression and is the average of three independent experiments performed in triplicate. Error bars represent standard deviations calculated based on three independent experiments made in triplicate. C, Western blotting showing the expression of N protein variants in BSRT7/5 cells.
The N C-arm is directly involved in the inhibition of RNA binding
To determine whether the decrease in the polymerase activity observed with the N C-arm Ala mutants was correlated to a defect of N0-P complex conformation, the same mutations were introduced in the plasmid expressing NΔ30 in E. coli, to generate 11 N variants. GST-P40 was coexpressed with the WT or mutant forms of NΔ30 in E. coli and purified on GSH-Sepharose beads. The resulting purified GST-P40-NΔ30 complexes were analyzed by SDS-PAGE. Only the E373A/E374A mutant could not be purified, suggesting that this double mutation has a dramatic impact on the P40-NΔ30 complex formation (Fig. 7A). All of the 10 other N mutant proteins were efficiently purified using GST-P40, showing that these mutations did not affect the P-N interaction. Next, the GST tag was cleaved by TEV protease and the A260 nm/A280 nm ratio of isolated NΔ30-P40 complexes was measured and compared with either N-RNA rings or NΔ30 alone (purified with a C-terminal His tag). As shown in Table 3, the A260 nm/A280 nm ratios of NΔ30 purified alone or in the presence of P40 were 0.87 and 0.63, respectively, confirming that P40 is required to efficiently block the interaction of N with RNA. Although all the NΔ30-P40 complexes displayed an A260 nm/A280 nm ratio <1 and close to 0.6 like WT NΔ30, a slight increase of this ratio was detected for most of the mutants, especially for mutants L368A, and L370A (Table 3), suggesting the presence of some amounts of RNA associated to these complexes.
Figure 7Purification of C-arm mutants of N from E. coli. A, NΔ30 constructs (WT or mutants) were co-expressed with GST-P40 in E. coli and complexes were pulled down using GST tag. Co-purification of WT and mutants of NΔ30 with GST-P40 was analyzed by SDS-PAGE and Coomassie Blue staining. B, surface view of an RSV N monomer showing the position of RNA and of the Trp-260 residue. N and C termini of N are indicated.
Given that N-RNA and N0-P forms of N present similar secondary structure as revealed by CD spectra (Fig. 1D), it is likely that binding of the N C-arm into the RNA-binding groove induces only minor conformational changes. To investigate the impact of mutations in the N C-arm on N folding, we took advantage of the presence of a single tryptophan residue (Trp-260) in N, located close to the N RNA-binding groove (Fig. 7B), because λmax of emission of a tryptophan reflects the polarity of its environment. In a hydrophilic environment, λmax shifts to 400 nm and the intensity, although in an apolar environment, λmax shifts to 300 nm. To validate that the fluorescence of Trp-260 could be used to characterize the local structural change of N close to the RNA-binding groove, we first measured the Trp λmax of N-RNA rings and of NΔ30 purified alone or in the presence of P40. As shown in Table 3, the λmax values of these proteins are 327 nm, 339 nm, and 330 nm, respectively. When interacting with RNA, the Trp-260 residue of N is in an apolar environment. The slight difference of λmax value and intensity detected between N-RNA and NΔ30-P40 suggests that Trp-260 is maintained in a constrained environment within the N0-P complex. This hypothesis is supported by the important shift of λmax value measured for the NΔ30 alone that clearly revealed the polar environment of the Trp. The strong difference between the λmax values measured for NΔ30-P40 and NΔ30 alone could be attributed to (i) the binding of P40 on N and/or (ii) the interaction of the N C-arm close to the RNA groove. We then investigated the impact of mutations in the N C-arm on the λmax of purified recombinant NΔ30-P40 complex. Although no major change in the λmax values was detected for most of the mutants, a slight change in the fluorescence spectra was detected when mutating residues Ser-266 or Thr-271, and a strong switch of the Trp λmax values, up to 337 nm, was detected for mutants L368A and L370A (Table 3). We consider that these changes were significant as they correspond to tendencies over many data points obtained for two independent protein purifications. These variations in Trp fluorescence reveal that Trp-260 is more exposed to the solvent when mutating the residues Leu-368 and Leu-370. Altogether, these data indicate that the residues Leu-368 and Leu-370 are critical for stabilization of the N0-P complex, and could be directly involved in the interaction of the C-arm close to the N-RNA–binding groove, confirming our initial hypothesis.
Discussion
The viral RNA genome and anti-genome of Mononegavirales are coated by the viral N protein at all times. Efficient replication of genomic and antigenomic RNAs depends on the neosynthesis of the N protein that has to be maintained monomeric and RNA-free (N0) to specifically encapsidate these replication products as they are synthesized. It is noteworthy that all these N proteins present a strong propensity to oligomerize and to bind to RNA, and that these two mechanisms are concomitant. As a consequence, both mechanisms have to be impaired at the same time to maintain a pool of N competent for RNA encapsidation. For this, viruses of the Mononegavirales order share a common strategy, which depends on the chaperone activity of the P N-terminal region that binds to N, forming a N0-P complex (
). It is noteworthy that the N proteins of Mononegavirales present a similar architecture and are composed of two globular domains (NNTD and NCTD) linked by a hinge region that forms the RNA groove, and N- and C-arms that are involved in N oligomerization. However, besides strong similarities between N0-P complexes, the mechanisms involved in the inhibition of RNA binding to N differ (
Here, by deleting the N- and C-arms of RSV N, we observed that both the presence of the peptide P(1–40) and the deletion of the N-arm of N (NΔ30) are required to purify a stable monomeric N protein from E. coli. We also noticed that the C-arm of N is directly involved in the inhibition of the interaction of N with RNA (Fig. 1). Based on the crystal structures of RSV N-RNA and hMPV N0-P complexes, we then tried to characterize the surface of N involved in P40 binding. We identified some residues potentially involved in P40 binding, generated N mutants, and used both cellular and biochemical assays to validate the role of these residues. Although most of the mutations resulted in impaired N purification, we identified the residue Ile-270 as directly and specifically involved in the interaction with P40. Our results also revealed a role of N residues Leu-258 and Val-284 in the P40-N interaction.
Structural information on the RSV NΔ30-P40 complex in solution was then obtained by combining SAXS and AUC approaches. Models were generated by fixing in position the P40 peptide at the NCTD surface based on the crystal structure of the hMPV N0-P complex (
). These models show that monomeric N easily accommodates the structure of an N protomer from the N-RNA ring 3D structure, with a high flexibility of the N C-arm and of the linker region between NCTD and NNTD (Fig. 5E). Although some models suggest that NCTD and NNTD domains could be maintained in a close conformation with the N C-arm flexible, the best model suggests that the N C-arm could bind close to the RNA groove to impair its accessibility (Fig. 5F), as described for the N0-P complex of hMPV. To confirm these results, we generated N mutants of the N C-arm and showed that the acidic and hydrophobic residues of the C-arm of N are critical for its activity. More specifically, residues Leu-368 and Leu-370 of N were shown to be directly involved in the inhibition of RNA binding. These results correlate with the observation of Renner et al. (
) reporting semi-conserved LGLT-motif within the CTD-arms of the N proteins of Paramyxoviridae which is followed by a stretch of residues with helical propensity. It is noteworthy that the C-arms of N proteins of the Pneumoviridae family are composed of conserved acidic and hydrophobic residues. Such acidic and hydrophobic motifs are reminiscent of eukaryotic acidic activation domains, in which acidic residues are used to promote long-range electrostatic interactions to attract their basic targets, and then undergo an induced structural transition to an α-helix to enable the contact with the target through hydrophobic residues (
In conclusion, our results point out the strong structural homology between RSV and hMPV N0-P complexes. These viruses present an original mechanism among Mononegavirales to impair RNA binding that involves (i) a specific interaction of the C-arm of N with the RNA groove and (ii) the binding surface of P peptide that overlaps with both interaction sites of neighboring N protomers within N-RNA rings (i.e. of Ni+1 C-arm and Ni−1 N-arm). It is noteworthy that the crystal structure of the monomeric nucleoprotein of influenza virus that belongs to Orthomyxoviruses and which does not present homolog of the P protein, also revealed that the C terminus of the nucleoprotein monomer is bound to the side of the RNA-binding surface (
). This binding allows reducing the positive charge of the RNA groove to impair RNA binding. These observations thus suggest that the members of the negative-strand viruses present conserved mechanisms to control the specificity of viral genome encapsidation. Although the 3D structure of the RSV N0-P still remains to be solved, these structural data pave the way to the rational design of specific protein–protein interaction inhibitors for these viruses that represent main respiratory pathogens because compounds that stabilize or destabilize the monomer may slow down viral infection. We and others already proposed that inhibitory peptides mimicking the P peptide could represent potent new antivirals not only against RSV but also for other Mononegavirales (
). Finally, these structural data revealed that the surface of interaction between the N C-arm and the RNA groove could constitute a second target to develop antivirals against the N0-P complex. In this case, stabilization of the interaction using small molecules could represent an attractive strategy (
All the viral sequences were derived from the hRSV strain Long, ATCC VR-26 (GenBank™ accession no. AY911262.1). N- and/or C-terminal N deletions were made by PCR using Pfu DNA polymerase (Stratagene). PCR products were cloned in the pET28a+ vector to produce truncated N proteins either with no tag or with a C-terminal His6 tag. The pGEX-N30 plasmid coding for the 30 N-terminal residues of N fused to GST was obtained by cloning annealed oligonucleotides at BamHI restriction site of the pGEX4T3 vector. The pGEX-PCT and pGEX-P40 vectors coding respectively for P(161–241) and P(1–40) in fusion with GST were described previously (
). Point mutations were introduced in pET-N by site-directed mutagenesis, using the QuikChange Site-Directed Mutagenesis Kit (Stratagene). Sequence analysis was carried out to check the integrity of all the constructs.
Plasmids for eukaryotic expression of the hRSV N, P, M2–1, and L proteins designated pN, pP, pM2–1, and pL, have been described previously (
The nine C-terminal amino acids of the respiratory syncytial virus protein P are necessary and sufficient for binding to ribonucleoprotein complexes in which six ribonucleotides are contacted per N protein protomer.
The insertion of fluorescent proteins in a variable region of respiratory syncytial virus L polymerase results in fluorescent and functional enzymes but with reduced activities.
). The pm/Luc subgenomic minigenome which encodes the firefly luciferase (Luc) reporter gene under the control of the M/SH gene start sequence was derived from the pm/SH subgenomic replicon (
). Point mutations were introduced in pN by site-directed mutagenesis as described above.
Antibodies
The following primary antibodies were used for immunofluorescence and/or immunoblotting: a mouse monoclonal anti-N protein (Serotec, Oxford, UK), rabbit anti-P and rabbit anti-N antisera (
The insertion of fluorescent proteins in a variable region of respiratory syncytial virus L polymerase results in fluorescent and functional enzymes but with reduced activities.
), and a mouse monoclonal anti–β-tubulin (Sigma-Aldrich). Secondary antibodies directed against mouse and rabbit IgG coupled to HRP were used for immunoblotting (P.A.R.I.S., Compiègne, France), and goat anti-rabbit and anti-mouse antibodies coupled to Alexa Fluor 448 and Alexa Fluor 594 were used for immunofluorescence.
Generation of bovine respiratory syncytial virus (BRSV) from cDNA: BRSV NS2 is not essential for virus replication in tissue culture, and the human RSV leader region acts as a functional BRSV genome promoter.
) were grown in Dulbecco's modified Eagle's medium (Lonza, Cologne, Germany) supplemented with 10% fetal calf serum (FCS), 2 mm glutamine, and antibiotics. Cells were transfected using Lipofectamine 2000 (Invitrogen) as described by the manufacturer.
Minigenome assay
Cells at 90% confluence in 48-well dishes were transfected with a plasmid mixture containing 125 ng of pm/Luc, 125 ng of pN, 125 ng of pP, 62.5 ng of pL, and 31 ng of pM2–1 as well as 31 ng of pRSV–β-gal (Promega) to normalize transfection efficiencies (
). Transfections were done in triplicate, and each independent transfection was performed three times. Cells were harvested 24 h post transfection, then lysed in luciferase lysis buffer (30 mm Tris, pH 7.9, 10 mm MgCl2, 1 mm DTT, 1% Triton X-100, and 15% glycerol). The luciferase activities were determined for each cell lysate with an Infinite 200 Pro (Tecan, Männedorf, Switzerland) and normalized based on β-gal expression.
Fluorescence microscopy
Immunofluorescence microscopy was performed with cells grown on coverslips and previously transfected with pN and pP. At 24 h post transfection, cells were fixed with 4% paraformaldehyde for 30 min. Fixed cells were made permeable and blocked for 15 min with PBS containing 0.1% Triton X-100, 3% bovine serum albumin (BSA). Cells were then successively incubated for 1 h at room temperature with primary and secondary antibody mixtures diluted in PBS containing 3% BSA, 0.05% Tween. Coverslips were mounted with ProLong Gold Antifade reagent containing DAPI (Invitrogen), and observed with a Nikon TE200 microscope equipped with a CoolSNAP ES2 camera (Photometrics, Tucson, AZ). Images were processed by using Meta-Vue (Molecular Devices, San Jose, CA) and ImageJ software.
Expression and purification of recombinant proteins
E. coli BL21 bacteria (DE3) (Novagen, Madison, WI) transformed with pET-N–derived plasmids alone, or together with pGEX-PCT, pGEX-P40, or pGEX-N30 plasmids were grown at 37 °C for 8 h in 100 ml of Luria Bertani (LB) containing kanamycin (50 μg/ml) or ampicillin and kanamycin, respectively. The same volume of LB was then added and protein expression was induced by adding 80 μg/ml isopropyl-β-d-thio-galactoside to the medium. Bacteria were incubated for 15 h at 28 °C and then harvested by centrifugation. For the purification of the recombinant GST fusion proteins, bacterial pellets were re-suspended in lysis buffer (50 mm Tris-HCl, pH 7.8, 60 mm NaCl, 1 mm EDTA, 2 mm DTT, 0.2% Triton X-100, 1 mg/ml lysozyme) supplemented with cOmplete Protease Inhibitor mixture (Roche), incubated for 1 h on ice, sonicated, and centrifuged at 4 °C for 30 min at 10,000 × g. GSH-Sepharose 4B beads (GE Healthcare) were added to clarified supernatants and incubated at 4 °C for 3 h. Beads were then washed two times in lysis buffer and three times in PBS 1×, and stored at 4 °C in an equal volume of PBS. To isolate GST-free P40-NΔ30 complex, beads containing bound complex were incubated with TEV protease for 16 h at 20 °C. For NΔ30-His6 fusion protein purification, bacterial pellets were re-suspended in lysis buffer (20 mm Tris-HCl, pH 8, 500 mm NaCl, 0.1% Triton X-100, 10 mm imidazole, 1 mg/ml lysozyme) supplemented with cOmplete Protease Inhibitor mixture (Roche). After sonication and centrifugation, lysates were incubated 30 min with chelating Sepharose Fast Flow beads charged with Ni2+ (GE Healthcare). Finally, beads were successively washed in the washing buffer (20 mm Tris-HCl, pH 8, 500 mm NaCl) containing increasing concentration of imidazole (25, 50, and 100 mm), and proteins were eluted in the same buffer with 500 mm imidazole. Purified recombinant proteins were loaded onto a Sephacryl S-200 HR 16/30 column (GE Healthcare), eluted in 20 mm Tris-HCl, pH 8.5, 150 mm NaCl and characterized. The presence of RNA was determined by measuring the A260 nm/A280 nm absorption ratio.
Circular dichroism spectroscopy
CD experiments were performed on a J-810 spectropolarimeter (Jasco, Tokyo, Japan) in a thermostated cell holder at 20 °C. Purified recombinant N proteins were dialyzed against 5 mm phosphate, 100 mm NaF, pH 8.5. Far-UV spectra (180–260 nm) were recorded using a bandwidth of 1 nm and an integration time of 1s, in a 0.5 mm path-length quartz cell with proteins at concentrations of 20 μm. Each spectrum was the average of 6 scans, with a scan rate of 100 nm/min. The spectra were corrected by subtracting the signal from the buffer, smoothed using the FFT filter (Jasco Software, Tokyo, Japan), and were treated as described previously (
Does fusion of domains from unrelated proteins affect their folding pathways and the structural changes involved in their function? A case study with the diphtheria toxin T domain.
Fluorescence measurements were performed with an FP-750 spectrofluorimeter (Jasco, Tokyo, Japan) at 20 °C, using a 1-cm path length quartz cell, with proteins at a concentration of 10 μm. The excitation wavelength was fixed at 295 nm, and emission spectra were recorded from 300 to 360 nm at a scan rate of 125 nm.min−1. The maximum emission wavelength (λmax) represents the average of three values obtained from emission spectra. The measurements were performed on two independent purifications of each recombinant protein.
Analytical ultracentrifugation
Sedimentation velocity experiments were carried out at 20 °C in a ProteomeLab XL-I Analytical Ultracentrifuge (Beckman Coulter, Brea, CA) equipped with double-UV and Rayleigh interference detection. The purified protein-peptide complexes (1.1 mg/ml) were centrifuged at 42,000 rpm using an AN60-Ti rotor and 12-mm-thick Epon double sector centerpieces. Absorbance and interference profiles were recorded every 5 min. Buffer viscosity (η = 1.021 cP) and density (ϕ = 1.0050° g.ml−1) at 20 °C were estimated with SEDNTERP 1.09. Partial specific volumes at 20 °C were estimated based on amino acid sequences using SEDNTERP 1.09 software. Data were analyzed with SEDFIT 15.3 using a continuous size distribution c(S) model. Theoretical sedimentations of the complex were generated using HydroPro 10 (
SAXS data were collected on the SWING beamline at Synchrotron SOLEIL (France) using the online HPLC system. SAXS samples were prepared in buffer (Tris 20 mm, pH 8, NaCl 150 mm, glycerol 5%). Protein complexes of NΔ30-P40 complexes (50 μl) at 6 mg.ml−1 were injected into a size exclusion column (Bio SEC-3 300 Agilent column) cooled at 25 °C and eluted directly into the SAXS flow-through capillary cell at a flow rate of 200 μl.min−1. The data were analyzed using FOXTROT and PRIMUS from ATSAS 2.7 (
), from which Guinier was generated. Scattering curves were selected for stable radius of gyration (Rg) at the apex of the elution profile, and the selected curves were averaged and buffer signal was subtracted. From these corrected scattering curves, the pair distribution function was computed using GNOM (
). The starting structure of N promoter was defined from N-terminal end as a succession of three folded domains (30–249; 256–359; 374–382) connected through two linker regions between the first two domains and the last two domains. The P40 peptide was positioned on the NCTD (residues 256–359) based on X-ray structure of hMPV N0-P complex (
) with the C-terminal end of P40 defined as a flexible region of 13 amino acids.
Molecular graphics images were produced using Chimera and PyMOL software programs.
Author contributions
C. E., B. R., P. R., S. B., C.-A. R., J. F., and M. G. data curation; C. E., B. R., P. R., S. B., and M. G. formal analysis; B. R. and M. G. writing-original draft; J.-F. E. and M. G. supervision; J.-F. E. funding acquisition; J.-F. E. and M. G. writing-review and editing; M. G. conceptualization.
Acknowledgments
We thank the staffs of the synchrotron facilities SOLEIL for technical help with the beamlines SWING. We warmly thank Origène Nyanguile and Christina Sizun for skillful discussions and critical review of the manuscript.
Cytoplasmic inclusions of respiratory syncytial virus-infected cells: formation of inclusion bodies in transfected cells that coexpress the nucleoprotein, the phosphoprotein, and the 22K protein.
Identification of protein regions involved in the interaction of human respiratory syncytial virus phosphoprotein and nucleoprotein: Significance for nucleocapsid assembly and formation of cytoplasmic inclusions.
The nine C-terminal amino acids of the respiratory syncytial virus protein P are necessary and sufficient for binding to ribonucleoprotein complexes in which six ribonucleotides are contacted per N protein protomer.
The insertion of fluorescent proteins in a variable region of respiratory syncytial virus L polymerase results in fluorescent and functional enzymes but with reduced activities.
Generation of bovine respiratory syncytial virus (BRSV) from cDNA: BRSV NS2 is not essential for virus replication in tissue culture, and the human RSV leader region acts as a functional BRSV genome promoter.
Does fusion of domains from unrelated proteins affect their folding pathways and the structural changes involved in their function? A case study with the diphtheria toxin T domain.
This work was supported by the French Agence Nationale de la Recherche, specific program ANR Blanc 2013 “Respisyncycell” ANR-13-IVS3-0007. The authors declare that they have no conflicts of interest with the contents of this article.