|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 279, Issue 16, 16638-16645, April 16, 2004
Crystal Structure of Norwalk Virus Polymerase Reveals the Carboxyl Terminus in the Active Site Cleft*![]() ![]()
From the
Received for publication, January 20, 2004 , and in revised form, February 2, 2004.
Norwalk virus is a major cause of acute gastroenteritis for which effective treatments are sorely lacking. To provide a basis for the rational design of novel antiviral agents, the main replication enzyme in Norwalk virus, the virally encoded RNA-dependent RNA polymerase (RdRP), has been expressed in an enzymatically active form, and its structure has been crystallographically determined both in the presence and absence of divalent metal cations. Although the overall fold of the enzyme is similar to that seen previously in the RdRP from rabbit hemorrhagic disease virus, the carboxyl terminus, surprisingly, is located in the active site cleft in five independent copies of the protein in three distinct crystal forms. The location of this carboxyl-terminal segment appears to interfere with the binding of double-stranded RNA in the active site cleft and may play a role in the initiation of RNA synthesis or mediate interactions with accessory replication proteins.
Recent studies have shown that members of the Norovirus genus within the Caliciviridae family are now considered one of the most common causes of outbreaks and sporadic cases of gastroenteritis in individuals of all ages worldwide (1). These pathogens have a positive-strand RNA genome and life cycle that are similar in many respects to a group of evolutionarily related viruses responsible for important human diseases such as polio, hepatitis C, dengue, yellow fever, viral encephalitis (West Nile virus and Japanese encephalitis virus), and severe acute respiratory syndrome. Many of these viruses are threats to public health, because they are highly infectious and can cause serious illnesses, in some cases leading to death. The limitations of vaccination and lack of effective antiviral chemotherapeutics for many of these diseases underline the urgency of understanding positive-stranded RNA viruses at a deeper, molecular level to allow for the development of novel treatments. Replication of the genome in all positive-strand RNA viruses is critically dependent on the activity of a virally encoded RNA-dependent RNA polymerase (RdRP)1 (2). This enzyme is responsible for synthesizing negative-sense RNA, which is complementary to the positive-sense genomic RNA, as well as newly made positive-sense RNA genomes that can be used for the production of viral proteins or packaged into new viral particles. The three-dimensional structures of RdRPs from members of three families of positive-strand RNA viruses (poliovirus (PV) (3) from Picornaviridae, hepatitis C virus (HCV) (4-6) from Flaviviridae, and rabbit hemorrhagic disease virus (RHDV) (7) from Caliciviridae) have previously revealed a similar overall architecture as well as a range of specific adaptations (8).
Here, we report the novel structure of an RdRP from a genogroup II Norwalk virus (NV) isolate, Ast6139/01/Sp, in a metal-free form at 2.17 Å resolution, as well as in a metal-bound form at 2.95 Å resolution. Although the arrangement of secondary structural elements and key catalytic motifs is similar to those seen in the RHDV enzyme as well as other RdRPs from positive-strand RNA viruses, dramatic structural differences, especially in the carboxyl-terminal region, suggest novel functional adaptations important for viral replication. A small metal-dependent conformational change can also be seen when comparing the structure of the metal-free NV RdRP in two crystal forms with the structure of the metal-bound enzyme in a third crystal form. These structures represent the first three-dimensional structures of a Norovirus enzyme and provide the basis for the structure-based design of antiviral therapeutics against an important group of human pathogens.
Construction of the RdRP Expression VectorThe NV RdRP coding region was amplified by PCR using a cDNA clone (GenBankTM accession number AJ583672 [GenBank] ) from the Spanish NV isolate Ast6139/01/Sp and the specific primers NVpol5B ggatccgacagtaagggaacatactg and NVpol3E gaattctcattcgacgccatcttcattc, which also added BamHI and EcoRI restriction enzyme recognition sequences (underlined residues) at both ends of the amplified region. The resulting fragment was first ligated to pGEM-T vector (Promega), and then, after digestion with the restriction enzymes BamHI and EcoRI, the 1.5-kb fragment containing the RdRP gene was purified and inserted into BamHI-EcoRI-digested, alkaline phosphatase-treated pGEX-2T (Amersham Biosciences) to generate the recombinant expression vector pGEX-NV-3D. This recombinant plasmid was designed to produce the NV RdRP as a fusion protein with Schistosoma japonicum glutathione S-transferase at the amino terminus.
Protein Expression and PurificationOvernight cultures of Escherichia coli XL-1 Blue transformed with pGEX-NV-3D were diluted 1:250 in 500 ml of Luria-Bertani medium containing ampicillin (50 µg ml-1). The cultures were incubated at 37 °C to an A600 of 0.5, and isopropyl-thio- Preparation of Synthetic NV Heteropolymeric RNAA cDNA clone containing the NV Ast6139/01/Sp 3' sequence was PCR-amplified using oligonucleotide primers NV5 (taatacgactcactatagtagctcacactggcccg) and NVdT3 (tttttttttttttttttaaagacactaaag). The 1013-bp PCR product containing a minimal T7 RNA polymerase promoter (underlined residues) was phenol-chloroform-extracted, ethanol-precipitated, and dissolved in diethyl pyrocarbonate-treated water prior to its in vitro transcription using the large scale RNA production system Ribomax (Promega).
Enzymatic AssaysRdRP activity was assayed in 50-µl samples containing 4 µg of the purified enzyme, 50 mM HEPES (pH 8.0), 10 µM ATP, 10 µM GTP, 10 µM CTP, 5 µM UTP, 4 mM dithiothreitol, 50 units of ribonuclease inhibitor (Promega), 25 µmol (10 µCi) of [
CrystallizationCrystals were grown at room temperature by the hanging drop vapor diffusion method by mixing 2 µl of NV polymerase (7 mg/ml) and 2 µl of 24% (w/v) polyethylene glycol 8000, 100-200 mM ammonium sulfate, 50 mM Tris-Cl, pH 7.5, 15% (w/v) glycerol, 0.2% (w/v) CHAPS, and 14 mM 2-mercaptoethanol. At least two different crystal forms (space groups C2221 and P1) with similar morphologies grew to a maximum size of Data Collection, Structure Solution, and RefinementData were measured on flash-frozen crystals at 105 K by transferring crystals directly from the mother liquor into a nitrogen gas stream. Data were measured from a MAR 345 detector using radiation from a rotating anode generator (Rigaku RU-H3R and Osmic confocal optics). Data were indexed, integrated, and scaled using the HKL suite of programs (9). Intensities were converted to amplitudes using TRUNCATE (10). The solvent content of the centered orthorhombic crystals was calculated to be 53% (Vm = 2.66) if one copy of NV polymerase was present in the asymmetric unit. The structure of RHDV polymerase (Protein Data Bank code 1KHV [PDB] ), with all side chains and temperature factors retained, was used as the search model for molecular replacement calculations. Molecular replacement calculations were carried out using BEAST (11). The cross-rotation and translation functions both gave a single prominent solution (log-likelihood scores of 15.8 and 25.6, respectively). The crystal packing seemed reasonable, but the R-factor following rigid-body refinement was poor (r = 0.53, resolution = 20-2.8 Å). Nevertheless, several rounds of simulated annealing refinement using the program CNS (12), density modification using RESOLVE (13), and manual model building using Xfit (14) allowed for the construction of a model with improved statistics (r = 0.38, Rfree = 0.43, resolution = 20-2.3 Å). At this point, the triclinic crystal form with two copies in the asymmetric unit was discovered. This crystal yielded a stronger and more isotropic diffraction pattern. Molecular replacement calculations were carried out using the model that had been partially refined in the centered orthorhombic crystal form. Two clear solutions appeared in both the rotation and translation functions, and the initial electron density map was significantly clearer than for the centered orthorhombic crystal form. Averaging and density modification using RESOLVE further improved electron density maps, and the structures of both copies in the asymmetric unit were built using Xfit and refined using REFMAC (15) (translation-libration-screw rotation parameters for rigid bodies, atomic positions, and temperature factors). Following the completion of refinement, the model of the protein was superimposed onto the partially refined model in the centered orthorhombic crystal form, and refinement was completed on this crystal form as well. Molecular replacement calculations using the fully refined model from the triclinic crystal form also gave two very clear solutions for the primitive orthorhombic crystal form grown in the presence of 20 mM magnesium sulfate. This structure was rebuilt and refined as described above. Because of the lower resolution limit of these data, however, loose restraints to maintain noncrystallographic symmetry were applied. The final models in all three crystal forms include residues 6-507, with the five amino-terminal residues and three carboxyl-terminal residues apparently disordered in all five independent copies. 91.8% of the residues lie in the most favored regions of the Ramachandran plot, and no residues are in disallowed regions (regions defined by PROCHECK (16)). Additional checks on hydrogen bonding geometry were performed using WHATCHECK (17). Data quality and refinement statistics are given in Table I. Figures were prepared with Molscript (18), Bobscript (19), Raster3D (20), PyMOL (21), and LIGPLOT (22).
Expression and Enzymatic Activity of NV RdRPThe recombinantly expressed form of NV RdRP used in this work is identical in sequence to the mature form of the enzyme, as expected from studies of proteolytic processing in other NV isolates (23-25), except that serine replaces glycine at the second residue from the amino terminus. The RNA-dependent RNA polymerase activity associated with the purified recombinant protein was assayed using a synthetic RNA template derived from a cDNA of the 3' region of the genome of the Ast6139/01/Sp isolate including a poly(A) tail of 17 residues. The major product was about 2 kb in size, roughly twice the size of the RNA template used. The doubling in size relative to the template suggests a fold-back synthesis reaction similar to that seen previously in RHDV (26) and PV (27-29). Also, as seen in RdRPs from RHDV (30), PV (31), and many other viruses, the polymerase activity was dependent on the presence of divalent cations (in this case Mg2+) in the incubation medium. Overall Structure of NV RdRPThe crystal structure of NV RdRP has been determined in three different crystal forms using the molecular replacement method starting with RHDV RdRP (7) as the search model. Although all three crystal forms were grown under similar conditions with polyethylene glycol 8000 as the precipitant, the molecular packing and space group symmetry differ significantly in the three crystal forms (Table I). Two crystal forms were grown in the absence of divalent metal ions, and the third crystal form was grown in the presence of 20 mM magnesium sulfate. The overall structure of NV RdRP can be described using an analogy to a right hand, which is commonly done for a wide range of nucleic acid polymerases (Fig. 1). In addition to the fingers, palm, and thumb domains common to all polymerases, NV RdRP has an additional amino-terminal domain bridging the fingers and thumb domains, as seen in all other RdRPs of known three-dimensional structure (8). All residues are defined by electron density with the exception of residues 1-5 at the amino terminus and residues 508-510 at the carboxyl terminus. The carboxyl-terminal region is of particular importance, as discussed below, and it is significant to note that the last three residues are disordered in all three crystal forms.
The availability of three crystal forms of NV RdRP reveals five independent copies in distinct crystal packing environments. The largest crystal contact interface in each of the three crystal forms is formed between the "top" faces (when viewed in the standard orientation facing the active site cleft, as in Fig. 1A) of molecules related by a crystallographic (C2221 and P212121 crystal forms) or noncrystallographic (P1 crystal form) 2-fold rotation axis. The buried surface areas of these interfaces are 1525, 1863, and 2187 Å2 for the P1, C2221, and P212121 crystal forms, respectively, indicating an extensive contact surface similar in size to functionally important oligomerization surfaces (32). A common feature of all three interfaces is a patch of hydrophobic residues consisting of Trp-29, Pro-421, Met-430, and Ile-431, residues fairly well conserved in the noroviruses but not well conserved in other caliciviruses. It is important to note that this hydrophobic patch and interface is not found in the RHDV RdRP crystal structure, and this interface differs from the oligomerization interfaces proposed to be of functional importance in HCV (33, 34) and PV (3, 35, 36) RdRPs. Although the functional importance of the resulting NV RdRP dimers is not clear from the structure, the location of this hydrophobic patch near the top of the active site cleft suggests that protein-protein interactions important for RNA binding or RNA synthesis may be mediated by this contact surface.
Structural Comparisons with RHDV RdRP and Conformational VariabilityThe overall fold of the RdRP enzymes from RHDV and NV is quite similar, although there are many regions of local structural variation. 393 C
Three copies of metal-free NV RdRP have been modeled and refined to 2.17 and 2.3 Å resolution in two different crystal forms. The root mean squared difference of 502 C In both copies of magnesium-bound NV RdRP found in the asymmetric unit, a single magnesium ion is coordinated to one of the carboxylate oxygen atoms of Asp-344 and is near but not coordinated to the carboxylate group of Asp-242. This weak coordination scheme appears similar to that seen in the "open" metal-bound form of RHDV RdRP (7), as well as in the Ca2+-bound PV RdRP (3). It should be noted that this form is more similar to metal-free NV RdRP than the "closed" metal-bound form of RHDV RdRP, which adopts the conformation likely appropriate for facilitating the phosphoryl transfer reaction during RNA synthesis (7, 31). It is likely that the open metal-bound forms of NV and RHDV RdRPs represent well populated states of the enzyme when neither NTP nor RNA is bound. The enzyme conformations adopted in the presence of NTP and RNA await determination, but the structures are likely to be similar to the closed metal-bound conformation seen in RHDV RdRP. Structure and Function of the Carboxyl-terminal RegionA particularly striking feature of the NV RdRP structures is the location of the carboxyl terminus in the active site cleft near the catalytic aspartic acid residues. This segment interacts mainly with the wall of the active site cleft formed by the thumb domain, is well exposed to solvent, and likely possesses conformational flexibility (Fig. 2A). The segment, extending from residue 501 to 507, is well defined by electron density but has higher temperature factors than most of the rest of the protein (Fig. 2B; Table I). In particular, residues 508-510 are not sufficiently well ordered to be modeled, although they likely contribute to some of the uninterpreted electron density in the active site cleft near the active site residues (results not shown).
Although structurally very different, the location of this carboxyl-terminal fragment in the active site cleft is reminiscent of the carboxyl-terminal extension domain and "plough" loop in bacteriophage 6 RdRP (44), as well as the carboxyl-terminal segment and -loop insertion in the thumb domain of HCV RdRP (4-6, 45, 46) (Fig. 3). In both the bacteriophage 6 and HCV RdRPs, short polypeptide segments occlude portions of the active site cleft and appear to stabilize short primers for the initiation of RNA synthesis (37, 47-51). It should be noted that the C termini of all crystallized forms of HCV RdRP are artificial truncations terminating the protein at different distances from the transmembrane segment following the polymerase domain, whereas the carboxyl terminus in the NV RdRP structure is the natural terminus predicted from polyprotein processing studies (23-25). Nevertheless, the structural similarities shared among NV, HCV, and bacteriophage 6 RdRPs suggest that the carboxyl-terminal segment in NV RdRP may also play a role in initiating RNA replication.
The location of the carboxyl-terminal sequence in the active site of NV RdRP appears to block the entry of an RNA primer-template duplex (Fig. 4). If the carboxyl-terminal segment containing residues 501-507 is removed from the NV polymerase structure, a segment of A-form RNA duplex can be modeled to fit into the active site cleft by superimposing the core of the palm domain of the human immunodeficiency virus-1 reverse transcriptase·TTP·DNA complex (52) onto corresponding parts of the NV polymerase structure (as reported previously for RHDV polymerase (7)). The presence of the carboxyl-terminal segment primarily interferes with the entry of the primer strand of an RNA duplex into the active site cleft, but it is well positioned to interact with short di- or trinucleotide primers or possibly with hairpin structures forming near the active site (Fig. 4B).
An alignment of the carboxyl-terminal sequences from Norovirus polymerases reveals a high degree of sequence conservation in the carboxyl-terminal region, indicating that a similar structure is found in both genogroups of noroviruses. A lower degree of sequence similarity in this region of RdRPs from other calicivirus genera, as well as the lack of structural similarity between the last 25 residues of the NV and RHDV polymerases, suggests that the structure of the carboxyl terminus differs in polymerases from these viruses.
The three-dimensional structure of NV RdRP reveals an overall architecture common to a wide range of RdRPs, as well as a unique structure in which the carboxyl terminus of the protein lies in the active site cleft. Although the structure of NV RdRP is the first to show the natural carboxyl terminus of a polymerase lying directly in the active site cleft, it is notable that the carboxyl-terminal sequence in the RHDV and PV RdRP crystal structures, as well as the predicted structures of RdRPs from many related caliciviruses and picornaviruses, lies near the front of the active site cleft. The location of the carboxyl-terminal segment in this region of the enzyme may allow this segment of the protein to play a role in the initiation of RNA synthesis even if the structural details of this region may vary in different viruses. The presence of 3 and 15 disordered residues at the carboxyl termini of the NV and RHDV RdRP crystal structures indicates that these residues may adopt a range of conformations in solution. Perhaps these residues only become well ordered when interacting with other components of the replication complex such as NTPs, RNA, and protein cofactors like the VPg protein (53). To understand the role of the carboxyl-terminal sequence in RdRPs from different viruses, it may also be important to consider that differences in the proteolytic processing of the viral polyprotein precursor are also known to give rise to different forms of polymerases at different stages of the viral life cycle. Most notably, a proteinase-polymerase fusion appears to be the main form of active polymerase in vesiviruses (54) and may also be an important form of the polymerase in noroviruses (25) and lagoviruses. A similar proteinase-polymerase fusion with enzymatic activities that are distinct from the free forms of both the proteinase and polymerase is also found in the picornaviruses (55). Although the structure of this proteinase-polymerase fusion is not known, the presence of the proteinase domain is known to affect the enzymatic activity of the polymerase domain, and the carboxyl-terminal segment may also play a role in mediating this interaction. In the vesivirus feline calicivirus, for example, the presence of the proteinase domain in the fusion protein dramatically increases the activity of the polymerase (56). It would be of interest to test whether modifications to the carboxyl-terminal sequence in the RdRPs from feline calicivirus and other viruses affect the enzymatic activity of the proteinase-polymerase fusion protein by disrupting interactions between the proteinase and polymerase domains. Further studies are needed to clarify the roles of carboxyl-terminal segments in RdRPs from caliciviruses and other positive-strand RNA viruses. Understanding the role of this carboxyl-terminal segment in viral replication will likely be important for the design of structure-based therapeutics against noroviruses and other positive-strand RNA viruses involved in human disease. For example, drug compounds that specifically disrupt interactions between the carboxyl-terminal segment and the active site cleft may interfere with viral replication and act as effective antiviral agents.
The atomic coordinates and structure factors (codes 1SH0 [PDB] , 1SH2 [PDB] , and 1SH3 [PDB] ) for the two crystal forms of metal-free polymerase as well as for the Mg2+-bound form have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This research was supported by a senior research fellowship from the Canadian Institutes for Health Research as well as a Medical Scholar Award and Establishment Grant for medical research from the Alberta Heritage Foundation (to K. K.-S. N.). This work was also supported in part by Grants 01/0976 (to F. P.) and 01/3139 (to J. A. B.) from Fondo de Investigacín Sanitaria, Ministerio de Sanidad y Consumo, Spain. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 The abbreviations used are: RdRP, RNA-dependent RNA polymerase; HCV, hepatitis C virus; PV, polio virus; RHDV, rabbit hemorrhagic disease virus; NV, Norwalk virus; CHAPS, 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonic acid.
We thank Isabelle Barrette-Ng for helpful discussions on the manuscript.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||