|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 279, Issue 47, 48576-48587, November 19, 2004
The NS5A Protein of Hepatitis C Virus Is a Zinc Metalloprotein*![]() ![]() ¶ **
From the
Received for publication, July 12, 2004 , and in revised form, August 30, 2004.
The NS5A protein of hepatitis C virus is believed to be an integral part of the viral replicase. Despite extensive investigation, the role of this protein remains elusive. Only limited biochemical characterization of NS5A has been performed, with most research to date involving the myriad of host proteins and signaling cascades that interact with NS5A. The need for better characterization of NS5A is paramount for elucidating the role of this protein in the virus life cycle. Examination of NS5A using bioinformatics tools suggested the protein consisted of three domains and contained an unconventional zinc binding motif within the N-terminal domain. We have developed a method to produce NS5A and performed limited proteolysis to confirm the domain organization model. The zinc content of purified NS5A and the N-terminal domain of NS5A was determined, and each of these proteins was found to coordinate one zinc atom per protein. The predicted zinc binding motif consists of four cysteine residues, conserved among the Hepacivirus and Pestivirus genera, fitting the formula of CX17CXCX20C. Mutation of any of the four cysteine components of this motif reduced NS5A zinc coordination and led to a lethal phenotype for HCV RNA replication, whereas mutation of other potential metal coordination residues in the N-terminal domain of NS5A, but outside the zinc binding motif, had little effect on zinc binding and, aside from one exception, were tolerated for replication. Collectively, these results indicate that NS5A is a zinc metalloprotein and that zinc coordination is likely required for NS5A function in the hepatitis C replicase.
Since the discovery of the hepatitis C virus (HCV)1 as the causative agent of non-A, non-B hepatitis in 1989 (1), understanding the biology of this pathogen has been a major focus of both basic and clinical research. HCV is a significant global public health problem, with estimates placing nearly 3% of the world population, roughly 170 million people, as HCV-infected (2). Despite recent advances, current HCV therapeutics are inadequate for the majority of patients.
HCV is a member of the Flaviviridae family of enveloped, positive-strand RNA viruses and is the sole member of the Hepacivirus genus (3). The HCV genome consists of a The functions of a number of the NS proteins have been at least partially defined. In addition to NS3 aforementioned role in polyprotein processing, its C-terminal domain serves as an RNA helicase activity likely utilized during replication (16). The small NS4A protein serves as a cofactor for both the protease and helicase activities of NS3 (14, 17). NS4B is an integral membrane protein that has a direct role in reorganization of cellular membranes to form the membranous web (18). Additionally, this protein has been proposed to function as a membrane-bound scaffolding upon which the remaining replicase components assemble. The NS5B protein serves as the viral RNA-dependent RNA polymerase (1). The function of NS5A in the replicase is not clear, but this protein appears to be an absolutely required component of the replicase and is required for HCV replication (11, 12). NS5A is a large (5658 kDa), hydrophilic phosphoprotein of unknown function. The ability of adaptive mutations in NS5A to greatly stimulate HCV replication in cell culture, and the association of this protein with other members of the putative replicase suggests NS5A plays an important role in RNA replication (11, 12). The NS5A protein exists in multiple phosphorylation states, designated p56 (basal) and p58 (hyper), based on their migration on SDS-PAGE gels. Considerable effort has been placed in mapping NS5A phosphorylation sites and identifying the enzyme(s) responsible for this activity. Despite this, our knowledge of phosphate acceptor sites is rudimentary and the kinase(s) remains elusive (1924). Although some NS5A adaptive mutations affect the phosphorylation state of NS5A in HCV replicons (11), the link between NS5A phosphorylation and replication is unknown. Additional roles for NS5A in modulating various cell signaling cascades have been proposed, perhaps based on the observed interaction of NS5A with cellular kinase activities, but the relevance of many of these to replication and pathogenesis remains to be demonstrated (see Ref. 25 for review).
Despite the vast number of NS5A-related publications, surprisingly little effort has been devoted to the basic biochemical characterization of this protein. One area of recent attention has been the interaction of NS5A with cellular membranes. NS5A has been shown to be post-translationally associated with endoplasmic reticulum-derived membranes via an N-terminal amphipathic In this study, a combination of bioinformatics analysis, biochemical assays with purified NS5A protein, and replication studies, with the replicon system, were used to characterize NS5A. We have generated a model for the domain organization of NS5A, which was confirmed by limited proteolysis and mass spectrometry. Analysis of the sequence of NS5A has suggested the presence of an unconventional, yet highly conserved, zinc metal ion coordination site within the N-terminal domain, which was evaluated by a combination of reverse genetics in the replicon system and atomic absorption spectroscopy using purified NS5A protein. The data presented herein provide the first model of NS5A domain organization and defines the essential zinc binding properties of NS5A, observations that further our understanding of this enigmatic protein.
Cloning and in Vitro MutagenesisAll mutagenesis was performed using the QuikChange (Stratagene) in vitro mutagenesis system following the manufacturer's standard protocols. Mutations were generated in the shuttle vector pSL1180GIT, which contained the BsrG1-MfeI fragment of the Con1/SG-neo GIT replicon encompassing the NS5A coding sequence and surrounding region. The Con1/SG-neo GIT replicon is identical to the subgenomic replicons described previously (11), with the exception of the presence of two adaptive mutations in NS3 (E1202G, T1280I) and one in NS4A (K1846T), thereby allowing analysis of NS5A mutants on a wild-type NS5A backbone (28). All mutations generated by QuikChange mutagenesis were confirmed by DNA sequencing of the final replicon plasmids. Fragments of interest (e.g. NS5A coding sequence) were then subcloned into the appropriate vectors for protein expression or replicon experiments as described below. For replicon experiments, the BsrG1-MfeI fragment of pSL1180GIT containing the desired mutation was directly cloned back into the parental Con1/Sg-neo GIT replicon plasmid. For protein expression clones, pSL1180GIT DNA containing the desired mutation was used as a template for PCR amplification of the NS5A coding sequence corresponding to amino acids 25448 of NS5A (NS5A- h). PCR primers were designed to create an in-frame 5'-SacII restriction site for fusion of NS5A- h to the ubiquitin coding sequence cloned into the NdeI (ATG start codon) site of pET30b. The 3'-end of the NS5A coding sequence was altered to include a KpnI site for the in-frame fusion with the linker, enterokinase cleavage site, and polyhistidine tag of pET30b. Domain I expression constructs were constructed similarly, with the exception of the introduction of the 3'-KpnI site in a location to generate an NS5A coding sequence terminating at amino acid 215 with a fusion to the linker, enterokinase cleavage site, and polyhistidine tag of pET30b. Clones generated in pET30b using PCR were confirmed by an additional round of DNA sequencing to ensure no PCR-based mutations were present. Numbering of NS5A amino acid residues corresponds to the first serine residues of NS5A as residue number one. Mutations of amino acids in NS5A are designated by; single letter amino acid code of the parental sequence, the residue number in NS5A, and the altered amino acid present in the mutant construct, for example, a change of cysteine 141 of NS5A to a glycine residue would be described as C141G. Conversion of this numbering scheme to the amino acid numbering scheme of the complete 1b genotype HCV polyprotein requires the addition of 1972 to the numbers used in this article. HCV genotype sequences used for sequence alignments in this article are designated by their corresponding GenBankTM data base accession numbers. Cell CultureHuh-7.5 (human hepatoma) cells were maintained in Dulbecco's modified Eagle medium (Invitrogen) supplemented with 10% fetal bovine serum, non-essential amino acids, 100 units/ml penicillin, and 100 µg/ml streptomycin at 37 °C in 5% CO2. Cells were passaged every 3 days at a split ratio of 1:4 after treatment with 0.05% trypsin, 0.02% EDTA. All experiments were performed using cells between passage number 2835 expanded from an original seed stock of the Huh7.5 cell clone described previously (29). In Vitro RNA TranscriptionIn vitro RNA transcription of HCV replicon RNAs was performed as described previously (11). Cesium chloride density gradient purified Con1/SG-neo GIT DNA was linearized with ScaI (New England Biolabs), followed by phenol-chloroform extraction and ethanol precipitation. RNA transcripts were synthesized at 37 °C for 1.5 h in a 50-µl reaction containing 40 mM Tris-HCl (pH 7.9), 10 mM NaCl, 12 mM MgCl2, 2mM spermidine, 10 mM dithiothreitol, 3 mM each rNTP, 100 units of RNAsin (Roche Applied Science), 100 units of T7 RNA polymerase (Epicenter Technologies), and 3 µg of linearized DNA template. Following transcription, 10 units of DNase I (Ambion) was added, and reactions were incubated at 37 °C for 20 min to remove the DNA template. RNA was then purified using the Qiagen RNAeasy kit following the manufacturer's instructions. The optional on-column DNase treatment described for the RNAeasy kit was used to further remove residual DNA template. RNA yield was determined by quantification with a UV spectrophotometer. The quality of RNA was further confirmed by agarose gel electrophoresis. Electroporation of HCV Replicons and G418 SelectionExperiments using HCV replicons were performed essentially as described previously, with minor modifications (11). Briefly, subconfluent Huh-7.5 cells at were trypsinized, washed twice in ice-cold phosphate-buffered saline, and resuspended at a concentration of 2.5 x 107 cells/ml in ice-cold RNase-free phosphate-buffered saline. A 400-µl aliquot of cells was then mixed with 1 µg of HCV replicon RNA and electroporated using a BTX square wavelength electroporator in a 0.2-mm gap cuvette (five 99 ms pulses, 900 volts, 1.1-s time interval). Following a 10-min recovery period at room temperature, cells were diluted in 9.6 ml of complete medium and plated into 100-mm dishes. Cells were plated a 1 x 106, 1 x 105, and 1 x 104 cells at a fixed overall cell density of 1 x 106 cells/plate, with the remaining cell density consisting of cells electroporated with a replication-deficient Con1/SG-neo GIT pol RNA. Selection of replicon-containing colonies was performed 2 days post-electroporation by the addition of complete medium supplemented with 1 mg/ml G418. Selection was carried out for a period of 2 weeks post-electroporation with fresh G418 containing media added to the cells every 4 days. Following selection, cells were fixed with 7% formaldehyde and stained with 1% crystal violet in 50% ethanol. G418 selection was performed for 3 weeks for the data presented in Fig. 5, as one mutant had a small colony phenotype. G418-resistant colonies were then counted as used to calculate transduction efficiency in units of colony forming units (cfu) per microgram of input RNA. All experiments presented are the average of three independent RNA preparations delivered by three independent electroporations.
NS5A- h Expression and PurificationFor the expression of NS5A- h, 1 liter of LB medium supplemented with 30 µg/liter of kanamycin and 25 µg/liter of chloramphenicol was inoculated from an overnight culture of Escherichia coli BL21(de3) containing pET30ubiNS5A- h and the ubiquitin-specific protease pCG1 plasmids such that the cell density was at approximately OD600 of 0.05. Cells were grown at 37 °C and 250 rpm until an OD600 of 0.55 was reached. Cells were then chilled at 4 °C for 30 min, induced with 1 mM isopropyl-1-thio- -D-galactopyranoside, and incubated for 5 h at 25 °C and 250 rpm. Cells were then collected by centrifugation at 6,000 x g for 10 min and resuspended in 20 ml of buffer A (100 mM Tris-HCl (pH 8.0), 150 mM NaCl, 50 mM imidazole, 10 mM 2-mercaptoethanol) per liter. Cells were lysed by three passes through a cold Avestin air emulsifier at 15,000 psi. Following lysis, cell extracts were clarified at 25,000 x g for 30 min at 4 °C. Clarified extracts were then loaded on a 5-ml bed volume HiTrap nickel IMAC column (Amersham Biosciences) equilibrated with buffer A at a flow rate of 2.5 ml/min. Following extensive washing with buffer A and buffer A supplemented with 1 M NaCl, NS5A- h was eluted with a 25-ml linear gradient of buffer A supplemented with 500 mM imidazole. Fractions containing NS5A- h were pooled and exchanged into buffer B (25 mM HEPES (pH 7.4), 25 mM NaCl) using a HiPrep 26/10 desalting column at a flow rate of 8 ml/min (Amersham Biosciences). Desalted protein was then concentrated via Amicon ultracentrifugal concentrators (Millipore). Concentration of purified NS5A- h was determined using the Bio-Rad protein assay. Protein yields were typically between 2 and 5 mg/liter of bacterial culture.
Domain Mapping of NS5A by Limited TrypsinizationFifteen micrograms per sample of purified NS5A-
Production of HCV NS5A-Domain I-
Atomic Absorption AnalysisAtomic absorption experiments were performed by the Trace Metal Core Facility, The Department of Environmental Sciences, Mailman School of Public Health, Columbia University. All experiments were performed according to the standard operating procedures of the core facility. Background zinc level of the protein analysis buffer was performed, and this value was subtracted from all values determined for protein samples. All values reported are the average of three independent measurements of two independently prepared aliquots of each sample. Zinc values determined were based on comparison with a zinc standard calibration curve. All glass and plasticware used for the preparation of NS5A- Computer-aided Comparative Sequence AnalysesAmino acid sequences were derived from the GenPeptides data base maintained by the National Center for Biotechnology Information (NCBI). Sequence alignments were produced with the Gibbs sampler (30) and the Blosum62 scoring inter-residues table (31) using the Macaw workbench (32) and were processed for presentation using GeneDoc (33). The program SEG was used to identify potential low complexity sequences (LCS) within NS5A (34). Secondary structure predictions were performed using the JPRED server (35).
Prediction and Verification of the Domain Organization of HCV NS5AOur investigation of the NS5A protein began with computer analysis of the protein sequence with a goal of identifying secondary structure content, interdomain boundaries, and key sequence motifs. Using a variety of secondary structure prediction programs remarkably similar results for secondary structure composition were obtained. The N-terminal-half of NS5A appeared to contain a large number of predicted -strand secondary structural elements, a small number of -helical structures, and a minimum of random coil sequence. In contrast, the C-terminal-half of the protein was predicted to be primarily random coil structure, with small regions of predicted regular secondary structure, primarily that of -helices, located near the center of the NS5A sequence. Examination of sequence conservation of various regions of NS5A across HCV genotypes indicates that the N-terminal-half of the protein is the most conserved region of NS5A, with lower sequence conservation seen in the C-terminal-half of the protein. These data suggested that the N-terminal-half of NS5A formed a compact globular domain, with the remaining C-terminal-half of the protein having a more variable/unfolded structure. The C-terminal region of NS5A has recently been proposed to be similar to the natively unfolded class of proteins (36). These data collectively suggested that NS5A is a multidomain protein, with a globular N-terminal domain, a largely helical central domain, and a mostly random coil C-terminal domain. Similar conclusions were drawn from examination of a number of HCV genotype NS5A protein sequences. The identification of the interdomain boundaries connecting these regions of NS5A was an important aspect in developing a domain organization model of this protein. Using the program SEG, an algorithm that identifies repetitive low complexity sequences in proteins, two regions of potential LCS were identified in NS5A (34). These regions were designated LCS I (residues 214249) and LCS II (residues 343355) (Fig. 1A). As repetitive, low complexity sequences are often loop regions connecting independent domains of a protein, the identification of these regions allowed the construction of a hypothetical domain organization model for NS5A (Fig. 1A). The model divides NS5A into three domains based on the location of the blocks of low complexity sequence; domain I (amino acids 1213), domain II (amino acids 250342), and domain III (amino acids 356447).
During the development of this NS5A domain organization model, a system for the generation of milligram quantities of purified NS5A from E. coli became available (kindly provided by Dr. Craig Cameron) (37). We modified this system by substituting the NS5A coding region with the Con1 1b genotype NS5A sequence for genetic compatibility with the replicon system. Additional modifications were performed to generate NS5A in a form for amenable to our experiments, including the deletion of the majority of the N-terminal membrane anchor, thereby allowing production of soluble NS5A (NS5A- h) in the absence of any detergent.
With large quantities of relatively pure NS5A-
Domain I of NS5A Is Predicted to Contain a Conserved Zinc Binding SiteIn attempt to identify the most conserved and, consequently, functionally most important amino acid residues, we extended our comparative sequence analysis of NS5A to include the NS5A sequences encoded in the polyproteins of the closely related GB viruses and pestiviruses. Using Gibbs sampler (1) and Blosum62 scoring table (2) in the Macaw-based (3) analysis the sequence conservation in NS5A among these viruses was found to be limited to Initial Analysis of Putative Zinc Binding Site Mutants Using the Replicon SystemThe ability to generate site-directed mutants in the HCV replicase and analyze the effects of these mutants on RNA replication using the replicon system is a powerful method to assess the importance of predicted sequence motifs. The relative ease of generation and analysis of mutants in this system made the replicon an ideal first step in characterizing the importance of the CPC tripeptide including Cys57 and Cys59 of the predicted zinc binding site in HCV replication. CPC2, containing a conserved cysteine residue at the second position of this tripeptide (Cys142) believed to be independent of the metal binding site, serves as a control. Mutations altering the CPC of each tripeptide to CPG were generated and cloned individually into the Con1/SGneo GIT replicon. The Con1/SGneo GIT replicon contains a total of three adaptive mutations, two in NS3 (E1202G, T1280I) and one in NS4A (K1846T), thereby allowing assessment of NS5A mutations in an NS5A background lacking any adaptive mutations and avoiding any potential incompatibility of these directed mutations with any NS5A adaptive changes (28). The second cysteine of each tripeptide was chosen for the initial mutagenesis, as these residues, unlike the first cysteine in the CPC2 sequence, are conserved among all HCV genotypes. Replicon RNAs bearing mutations of CPC (C59G) and CPC2 (C142G) were generated and delivered to the adapted cell line Huh-7.5 by electroporation for the determination of G418 transduction efficiency as described in materials and methods. The data generated from the analysis of the CPC and CPC2 mutants are presented in Fig. 2. The parental GIT replicon yielded a transduction efficiency of 4.98 x 105 cfu/µg (± 0.05 x 105). A GIT replicon substituting three essential active site residues of the NS5B polymerase (pol) served as a negative control, and this RNA had no G418 transduction capability. The CPC site mutation also had no G418 transduction capability, indicating RNAs bearing this mutation do not replicate. Attempts at isolating revertants that rescue this mutation were unsuccessful. The CPC2 site mutation had a G418 transduction efficiency of 2.35 x 105 cfu/µg (± 0.01 x 105), indicating that alteration of this site had a very mild effect on RNA replication. A mutation of CPC2 altering this site to APG (C140A, C141G) was also generated and found to have a transduction efficiency similar to that of the CPG mutation of this sequence, indicating that neither cysteine in CPC2 was essential for RNA replication (data not shown). Although these data indicate CPC is important for HCV replication, and conversely that CPC2 is of little importance, matching the expected results based on the original model, they do little to clarify the issue of whether NS5A is actually a zinc-binding protein. As the CPC mutation could alter other aspects of the function of NS5A in the replicase, and possibly have nothing to do with metal ion coordination, a more direct method for determining the zinc content of the wild-type and mutant NS5A proteins was needed.
NS5A- h Is a Zinc-binding Protein, and Zinc Coordination Is Dependent on the CPC TripeptideOne method for determining the zinc binding capability of a protein is to directly measure the zinc content of a purified preparation of the protein by atomic absorption spectroscopy. The availability of large quantities of NS5A- h protein made atomic absorption spectroscopy an ideal method for evaluating NS5A zinc content. The zinc content of two independent preparations of wild-type NS5A- h, the CPC site mutation (C59G), and the CPC2 site mutation (C142G) was determined using triplicate measurements of each sample. The zinc content of each sample ( 660 µg of protein per measurement) was determined, and following subtraction of background metal contamination from a buffer-only blank, values were converted to a ratio of mol of zinc to mol of protein. Typical raw data of zinc content were quite consistent, with variations of less than 2% between samples representing independent preparations of the same protein. As an example, wild-type NS5A- h raw values for zinc content for 660 µg of protein were; 987, 979, and 978 µg/liter for preparation 1, and 973, 971, and 970 µg/liter for preparation 2. The data of the ratios of mol of zinc to mol of protein determined for wild-type NS5A- h, NS5A- h CPC (C59G), and NS5A- h CPC2 (C142G) are presented graphically in Fig. 3. The values shown are calculated from the average zinc content from the six measurements collected for each protein. The ratios of mol of zinc per mol of protein determined were 0.897 for wild-type NS5A- h, 0.170 for the NS5A- h CPC mutation, and 0.766 for the NS5A- h CPC2 mutation. These data indicate that the wild-type NS5A- h protein is a zinc-binding protein with one zinc atom coordinated per protein molecule. Similarly, the NS5A- h CPC2 site mutant coordinated approximately one zinc atom per protein molecule. The value measured for these proteins, 0.897 and 0.766, were very close to the ideal value of 1 mol of zinc per mol of protein, especially when the purity of the preparations of NS5A protein used for these analyses are considered ( 85%). Additional reasons for this variation from the ideal zinc:protein ratio of one can be imagined if the fact that little is known about the percentage of metal binding competent protein present in our preparations, with improperly folded, aggregated, or otherwise metal coordinating incompetent protein potentially in these samples. The NS5A- h CPC tripeptide mutant had a zinc:protein ratio of 0.170, indicating the point mutation of Cys59 to glycine impaired the zinc binding capability of NS5A. The value of 0.170 observed for the ratio of zinc:protein for the CPC mutant, although clearly lower than that observed for non-mutant NS5A or CPC2 mutant NS5A, was not zero. Perhaps the CPC tripeptide mutant retains some ability to bind zinc, albeit at a much lower occupancy than non-mutant NS5A. Nonetheless, the data indicate the presence of a zinc ion associated with NS5A- h via the CPC tripeptide. Furthermore, when combined with the data from the replicon system (Fig. 2), it is clear that zinc coordination is required for RNA replication. As the NS5A- h CPC mutation analyzed contains a single cysteine to glycine change, an additional three metal ion coordinating residues likely exist in NS5A allowing zinc binding in the classical tetrahedral fashion. Although our original model of the NS5A zinc binding site suggested the importance of Cys39, Cys57, Cys59, and Cys80 in metal ion coordination, no experimental evidence existed to fully validate this model.
As the NS5A proteins used in these experiments contained a C-terminal polyhistidine affinity purification tag, a known metal ion coordination sequence, experiments were repeated using proteins from which this tag was proteolytically removed following purification. The presence of the polyhistidine tag sequence did not alter the zinc content observed for NS5A- h (data not shown). Analysis of purified NS5A proteins with the N-terminal amphipathic -helix did not affect the observed results (data not shown). Although mutant and wild-type NS5A proteins were of similar solubility when expressed in E. coli, the possibility existed that mutation of the CPC site led to a global defect in NS5A folding, thereby eliminating zinc binding by an indirect method, as opposed to a more direct role for this sequence in metal ion coordination. Circular dichroism spectroscopy was used to confirm NS5A- h CPC had similar secondary structure content to that observed for wild-type NS5A- h and possessed no global defect in protein folding (data not shown).
Localizing the NS5A Zinc Binding Site to Domain IThe model of the NS5A metal coordination site (Cys39, Cys57, Cys59, and Cys80) suggested the zinc binding phenotype of NS5A should localize to the domain I region of NS5A. To verify this prediction, preparations of NS5A-Domain I-
The NS5A Zinc Binding Site Is Vital for HCV Replication Based on the observation that the zinc binding site of NS5A is localized to domain I, a search to identify the residues within this domain that were responsible for zinc coordination was conducted. The observation that a mutation that disrupted zinc binding was lethal for RNA replication suggested that site-directed mutagenesis of possible metal coordinating residues and analysis with the replicon system could be used as a method to determine the importance in replication of putative zinc binding site components. The model of the NS5A metal coordination site, although implicating Cys39, Cys57, Cys59, and Cys80 as important targets for mutagenesis, could not eliminate the potential importance of other amino acids for metal binding, and a more conservative mutagenesis approach altering all of the conserved classical zinc coordinating cysteine and histidine residues within domain I of NS5A was undertaken. Potential cysteine and histidine targets for mutagenesis were selected based on conservation of these residues in NS5A sequences among the six major HCV genotypes, as well as the location of these residues in relation to predicted secondary structure elements. The ten sites selected for analysis are listed in Fig. 5. Of these mutants, four (His66, His85, Cys98, and Cys190) had transduction efficiencies close to that of the parental replicon. These mutants were not analyzed further. Two mutants (His158, and His207) were of reduced transduction efficiency compared with the parental replicon. The His158 mutant, in addition to having a lower transduction efficiency, generated smaller G418-resistant colonies than either the parental replicon or the other mutants. The basis of this phenotype was not explored further. Four of the mutations generated were lethal for RNA replication, with no G418-resistant colonies observed. These mutants are highlighted in bold text in Fig. 5, and include Cys39, Cys57, Cys80, and His128. All of the cysteine residues in this set of lethal mutations, like previously characterized Cys59 residue (Fig. 2), are part of the predicted tetrahedral zinc binding site. The basis of the lethal effect of the H128A mutation, not predicted to be a component of the NS5A zinc binding site, is currently being studied. The obtained results are internally consistent with the predicted model of the NS5A zinc binding site and further indicate that the zinc binding is vital for RNA replication. The identification of a total of five replication essential cysteine and histidine residues within domain I, although strongly suggesting the predicted zinc binding site was correct, could not unambiguously identify the residues responsible for the tetrahedral coordination of a zinc atom.
Mutations at the Predicted Metal Coordination Residues Diminish NS5A Zinc ContentTo analyze the NS5A zinc binding site model further, we employed atomic absorption spectroscopy for the characterization of preparations of full-length NS5A-
The data presented herein provide the first domain organizational model for the HCV NS5A protein and represents one of the first views of NS5A as an individual protein and not merely an unknown component of the HCV replicase. The domain model of NS5A has led to the development of a number of protein expression constructs of small soluble domains of NS5A, allowing a more detailed analysis of the function of the various regions of this protein. Particularly, the presence of a conserved zinc binding motif within domain I was evaluated by a combination of genetic and atomic absorption spectroscopy experiments. The data indicate that NS5A is a zinc metalloprotein, coordinating a single zinc atom per molecule of NS5A via four essential cysteine residues within domain I. Although the functional implications of zinc binding have not yet been evaluated, the presence of a novel zinc binding motif in NS5A provides insight to this protein. In the following paragraphs we discuss our recent observations on NS5A in the context of the body of HCV literature to provide a critical interpretation of the data and put forth several important concepts, notably the domain organization of NS5A, the surface accessibility of potentially important sequences, the potential for interdomain contacts, and the importance of a zinc atom in NS5A structure and function. NS5A Domain OrganizationBased on the results of comparative sequence analysis and limited proteolysis of purified protein, we have proposed a domain organization model for HCV NS5A that includes domain I (amino acids 1213), domain II (amino acids 250342), and domain III (amino acids 355447).
The region of NS5A proposed to constitute domain I represents the most conserved portion of this protein among the HCV genotypes. Despite all appearances suggesting a folded, globular protein domain, homology based sequence searches have failed to identify any proteins related to NS5A (with the exception of the related pestivirus NS5A protein). Scanning domain I, and indeed the entire NS5A protein, for motifs that might imply a function for this enigmatic protein have been futile, with the exception of the putative zinc binding site described in this article. At least one definitive function has been attributed to sequences within domain I, that of a membrane anchor (26, 27). Although the bulk of the work described in this article used NS5A proteins lacking the N-terminal membrane anchoring helix, similar results have been observed with constructs that retain this sequence. These data suggest that the N-terminal helix is likely a separate entity from the largely globular, hydrophilic domain I region. The membrane anchor sequence is mentioned in this discussion of domain I because the proximity of this helix to domain I suggests that the N-terminal region of NS5A is likely close to the membrane. Interactions with karyopherin
The proposed domain II region of NS5A is of lower sequence conservation among HCV genotypes than domain I. This region contains a number of predicted secondary structural elements, primarily The region of NS5A hypothesized to comprise domain III is the most variable among HCV genotypes. This region contains large regions predicted to be random coil structures by various secondary structure prediction programs. The ability to generate functional NS5A proteins containing in-frame fusions with other proteins (i.e. green fluorescent protein) within domain III in the replicon system highlights the sequence flexibility of this region (54). This region does contain several regions of sequence that are absolutely conserved among all HCV sequences suggesting some selective evolutionary pressure exists for maintenance of this region. A careful evaluation of the regions of domain III required for HCV RNA replication is warranted. One area of particular interest in evaluating the domain model of NS5A is the proposed surface exposure of the interdomain connector sequences (LCS I and LCS II) in relation to the HCV literature. It is important to note that designating a sequence "low complexity" does not imply that this sequence has no function. Indeed, examination of sequence alignments of various genotypes of HCV shows these low complexity sequence blocks tend to have considerable sequence conservation, thereby suggesting some importance to NS5A function and HCV biology. LCS I overlaps the region where numerous NS5A cell culture replicon adaptive mutations, some capable of increasing RNA replication 10,000-fold, have been described (11). It is enticing to hypothesize that the exposed residues comprising the LCS I region are a loop region connecting domain I to domain II, and that this loop region is involved in interaction with viral and cellular components of the replicase in a regulatory fashion or modulating contacts between the domains of NS5A. Similarly to LCS I, the sequence at LCS II is conserved among HCV isolates, suggesting a selective pressure for the maintenance of this region. Several publications suggest this proline-rich region of NS5A contains a class II polyproline motif like sequence (amino acids 350356) involved in interaction with Grb2 and downstream cellular signaling cascades (5557). This sequence has also been shown to interact with the cellular protein amphiphysin II (58). The demonstration LCS I and LCS II are exposed on the surface of NS5A, and available for potential interactions with viral and cellular factors, is an important observation. These ideas clearly need to be evaluated critically, but the data provide some intriguing hints at NS5A function and interactions.
NS5A Is a Zinc MetalloproteinComparative sequence analysis of NS5A suggested this protein might contain an unusual zinc coordination motif. Initial analysis of mutations of the central CPC tripeptide sequence of the HCV NS5A zinc binding site using the replicon system indicated that the conserved CPC was required for RNA replication. Atomic absorption analysis of purified NS5A-
Analysis of the mapped HCV zinc binding site in relation to the secondary structure predictions of NS5A provides some interesting insights into the possible structure of this region of NS5A. The secondary structure predictions for the region of NS5A encompassing the zinc binding motif indicate the presence of four short
A cartoon schematic of our hypothetic model of this region of NS5A is shown in Fig. 7B. These observations provide the first tertiary contacts, and constraints, demonstrated for NS5A structure. The data generated to date fit the model quite well, and the conservation of the proposed zinc coordinating cysteines and secondary structure elements surrounding these cysteines suggests that these components of the zinc binding domain are essential. The validity of this model is perhaps best supported by the convergence of the original bioinformatics model of the zinc binding site and the results of the subsequent genetic and biochemical analysis. Nonetheless, since this model relies heavily on predicted secondary structures, it may not accurately represent the real structure of NS5A. We cannot formally exclude the possibility that one or more ligands coordinating the zinc ion may be water-bridged interactions, with our lethal cysteine mutants representing some other global folding defect. Preliminary circular dichroism analysis of NS5A with and without mutations in the CPC tripeptide suggested both proteins were folded, but this analysis is yet to be performed on all of the possible zinc-coordinating ligand mutants (data not shown). Involvement of other amino acids, such as glutamate and aspartate residues, which were not analyzed in this study, in the zinc binding activity of NS5A remains a possibility, although this seems unlikely because these residues are not conserved in the NS5A sequence.
An important question that remains to be answered is in what aspect of NS5A function is zinc binding involved. Unfortunately, no function has yet been attributed to the NS5A protein, complicating the evaluation of the importance of zinc binding. The analysis of the role of zinc binding for NS5A function is further complicated by the large number of functions attributed to zinc binding in proteins. Zinc coordination in proteins has been demonstrated to be involved in a wide range of enzymatic activities, nucleic acid interactions, metabolic regulatory functions, protein-protein interactions, and structural fold maintenance (60). Sequence data base and motif data base searches were unsuccessful in identifying proteins that contain a zinc coordination motif similar to that mapped for HCV NS5A. The HCV zinc binding site does not appear to possess the characteristics of any of the documented classes of enzymatic functions associated with zinc-binding proteins (61). Additionally, this site does not appear to be similar to any of the known classes of zinc-dependent nucleic acid interaction motifs, although the idea of NS5A interacting with viral RNA via a zinc motif is attractive (62, 63). The coordination of the NS5A zinc atom by four cysteine residues strongly suggests this is a structural metal ion necessary for NS5A folding or stability (61). Limited CD data suggest that NS5A proteins bearing mutations at the CPC site are of lower thermal stability than nonmutant proteins, although a more detailed analysis of this observation has yet to be performed. A brief review of well-characterized four cysteine structural metal binding sites has also failed to find a motif similar to that observed for NS5A. The majority of these sites fit the general template CXXCX1730CXXC which is inconsistent with the NS5A zinc site, CX17CPCX20C. Structural metal sites tend to have lower sequence conservation and are more variable than zinc sites involving a catalytic function, making sequence based comparisons outside the context of a known protein structure/fold complicated. Focusing on the central CPC portion of the HCV zinc binding site during the search for similar metal binding sites was somewhat more successful. A well-documented CXC motif exists in the HCV NS3 protease domain (62). This NS3 zinc binding site is a structural metal required for protease activity that is similar to the zinc binding site in the poliovirus 2A protease. Unfortunately, this site is also quite unlike the HCV zinc binding site, with a sequence of CXCX35CX3H. The overall spacing and presence of a histidine metal coordinating zinc via a water molecule bridge makes direct comparisons between this site and the NS5A site questionable. The zinc binding site mapped for HCV NS5A is clearly unlike other published zinc coordination motifs, suggesting this site may represent a novel four In summary, we have presented a domain organization model of the HCV NS5A protein based on sequence analysis and limited proteolysis of purified protein. Comparative sequence analysis of NS5A suggested the presence of a zinc ion coordination site, and this site has been verified using a combination of the HCV replicon system and atomic absorption spectroscopy. This zinc binding site appears to be a novel four cysteine motif that is common to the NS5A proteins of the Hepacivirus and Pestivirus genera, thus supporting the relationship of these genera in the Flaviviridae family. The coordination of a zinc ion, combined with secondary structure elements surrounding the coordinating residues, imparts important structural constraints on NS5A. Unfortunately, no function could be attributed to the zinc ion coordinated by NS5A, largely because of the lack of a defined function for this protein. The challenge for future research clearly lies in the determination of a function for NS5A and/or the determination of a high resolution structure of this protein.
* This work was supported in part by Grant 5 R01 CA57973-12 from the National Institutes of Health and the Greenberg Medical Research Institute (to C. M. R.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
¶ Recipient of a Merck Fellow of the Life Sciences Research Foundation. ** To whom correspondence should be addressed. Tel.: 212-327-7046; Fax: 212-327-7048; E-mail: ricec{at}rockefeller.edu.
1 The abbreviations used are: HCV, hepatitis C virus; NTR, non-translated regions; CAPS, 3-(cyclohexylamino)propanesulfonic acid; cfu, colony-forming units; NS, non-structural; TPCK, tosylphenylalanyl chloromethyl ketone; LCS, low complexity sequence; MALDI-TOF, matrix-assisted laser desorption/ionization-time of flight.
2 T. Tellinghuisen, J. Marcotrigiano, and C. M. Rice, unpublished results.
3 H. Fan and C. M. Rice, unpublished results.
We thank Dr. Craig Cameron for the original NS5A bacterial expression system reagents and methods, Vesna Slavkovich for atomic absorption method development, Jack Hietpas for general laboratory support, and Hongqing Fan for NS5A genotype 4a sequencing. Additionally, we wish to thank Dr. Matthew Evans, Dr. Brett Lindenbach, Dr. Shihyun You, Dr. Ivo Lorenz, and Dr. Annick Gauthier for scientific discussions related to this article.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||