JBC

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M401148200 on April 1, 2004

J. Biol. Chem., Vol. 279, Issue 24, 24965-24975, June 11, 2004
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
279/24/24965    most recent
M401148200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ghibaudo, D.
Right arrow Articles by Martin, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ghibaudo, D.
Right arrow Articles by Martin, A.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Characterization of GB Virus B Polyprotein Processing Reveals the Existence of a Novel 13-kDa Protein with Partial Homology to Hepatitis C Virus p7 Protein*

David Ghibaudo{ddagger}§, Lisette Cohen{ddagger}, François Penin¶, and Annette Martin{ddagger}||

From the {ddagger}Unité de Génétique Moléculaire des Virus Respiratoires, CNRS URA 1966, Institut Pasteur, 75724 Paris Cedex 15, France and CNRS UMR 5086, IFR 128 BioSciences, Lyon-Gerland, Institut de Biologie et Chimie des Protéines, 69367 Lyon Cedex 07, France

Received for publication, February 2, 2004 , and in revised form, April 1, 2004.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Although responsible for a major health problem worldwide, hepatitis C virus is difficult to study because of the absence of fully permissive cell cultures or experimental animal models other than the chimpanzee. GB virus B (GBV-B), a closely related hepatotropic virus that infects small New World primates and replicates efficiently in primary hepatocyte cultures, is an attractive surrogate model system. However, little is known about processing of the GBV-B polyprotein. Because an understanding of these events is critical to further development of model GBV-B systems, we characterized signal peptidase processing of the polyprotein segment containing the putative structural proteins. We identified the exact N termini of the mature GBV-B envelope proteins, E1 and E2, and the first nonstructural protein, NS2, by direct amino acid sequencing. Interestingly, these studies document the existence of a previously unrecognized 13-kDa protein (p13) located between E2 and NS2 within the polyprotein. We compared the sequence of the p13 protein to that of hepatitis C virus p7, a small membrane-spanning protein with a similar location in the polyprotein and recently identified ion channel activity. The C-terminal half of p13 shows clear homology with p7, suggesting a common function, but the substantially larger size of p13, with 4 rather than 2 predicted transmembrane segments, indicates a different structural organization and/or additional functions. The identification of p13 in the GBV-B polyprotein provides strong support for the hypothesis that ion channel-forming proteins are essential for the life cycle of flaviviruses, possibly playing a role in virion morphogenesis and/or virus entry into cells.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Hepatitis C is a medically important disease worldwide, with >180 million infected individuals that are potentially at risk for chronic hepatitis, liver cirrhosis, and hepatocellular carcinoma. Hepatitis C virus (HCV)1 is the only member of the Hepacivirus genus within the Flaviviridae family. HCV is an enveloped virus with a single-stranded, positive-sense RNA genome that encodes a single polyprotein precursor of ~3000 amino acid residues. This precursor is co- and post-translationally cleaved by host cellular signal peptidases within the region comprising the structural proteins (C-p7) and by viral proteinases within the nonstructural region (NS2-NS5B) to release 10 major viral proteins (1, 2).

Although subgenomic HCV RNA replicons that undergo autonomous amplification in cell cultures have recently made it possible to study RNA replication at the molecular level (for review, see Ref. 3), the lack of an efficient, fully permissive cell culture system for HCV has severely hindered the study of virion morphogenesis as well as virus entry into susceptible cells. The capsid protein (C) and two envelope glycoproteins (E1 and E2) are believed to be components of the virion. Studies of these structural proteins have for the most part relied on heterologous expression systems, such as recombinant baculoviruses in insect cells (4), or recombinant vaccinia or Sindbis viruses (5) and Semliki Forest virus-derived vectors (6) in eukaryotic cells. The results of these studies support the hypothesis that the HCV glycoproteins are retained in the endoplasmic reticulum. More recently, however, the development of pseudo-typed lentivirus particles bearing HCV envelope glycoproteins has provided new ways of studying virus entry into susceptible cells and generated evidence arguing against the retention of all HCV glycoproteins in the endoplasmic reticulum (79). A small protein of 7 kDa, p7, has been identified in the HCV polyprotein between the E2 glycoprotein and the first nonstructural protein, NS2 (10). Recently, p7 has been suggested to be a membrane-spanning protein localized to the endoplasmic reticulum (11) and to function as an ion channel (1214). Pestiviruses (e.g. bovine viral diarrhea virus), other members of the family Flaviviridae, also encode a hydrophobic p7 protein at the junction between the structural and nonstructural proteins in the polyprotein (15). However, the relevance and functional role of these p7 proteins in the virus life cycle remain largely unknown.

An aim of this study was to determine whether other members of the Flaviviridae family encode such a small hydrophobic protein with potential ion channel activity that may be critical to the virus life cycle. We have, therefore, examined whether a p7-like protein is expressed by GB virus B (GBV-B), the most closely related of all animal viruses to HCV. GBV-B was identified in 1995 in a tamarin (Saguinus species) that developed acute hepatitis after inoculation with serum from the 11th tamarin passage of the original, human GB inoculum (16, 17). GBV-B infection is typically self-limited and associated with acute hepatitis (18, 19), although persistent infection can lead to chronic hepatitis in tamarins (20). The natural host for GBV-B remains unknown since this virus has never been isolated directly from tamarins or humans (21, 22), but the fact that GBV-B appears to be incapable of replicating in chimpanzees argues against a human origin (23). GBV-B has been classified within the Flaviviridae family but not assigned to any genus. However, GBV-B shares ~27–33% nucleotide sequence identity over the entire open reading frame with HCV (2426) and has a very similar genome organization. In addition, the serine protease and helicase, NS3/4A (2729), as well as the RNA-dependent RNA polymerase, NS5B (30), of these viruses share common enzymatic specificities. These molecular characteristics in addition to the fact that this virus replicates robustly within the liver of small, New World primates (tamarins, marmosets) as well as in the primary cultures of hepatocytes from these species make GBV-B an attractive surrogate model for HCV studies.

Despite its obvious relevance to HCV and other flaviviruses, relatively little is known about the structural proteins of GBV-B. To date, there have been no reports of the direct sequencing of GBV-B proteins. Thus, predictions of the sites of polyprotein processing and GBV-B protein boundaries have relied on amino acid sequence alignments with the HCV polyprotein and computer-based algorithms for host signal peptidase recognition within the N-terminal third of the GBV-B polyprotein that contains the putative viral structural proteins. Importantly, published predictions do not report the existence of a polypeptide equivalent to the HCV p7 protein. We, therefore, carried out a detailed characterization of the proteolytic processing of the GBV-B polyprotein segment containing the viral structural proteins. We identified the N termini of the mature GBV-B envelope proteins, E1 and E2, and the nonstructural protein, NS2, by amino acid sequence analysis. These studies convincingly demonstrate that the GBV-B E2 protein is much shorter than HCV E2. Consistently, we document the existence of a heretofore unrecognized protein (p13) located within the GBV-B polyprotein between E2 and NS2 and with a calculated molecular mass of 13.1 kDa and a topology involving four predicted transmembrane segments. The C-terminal part of this novel protein exhibits clear structural homology with the HCV p7 protein, but its substantially larger size suggests a significantly different structural organization and/or additional functions.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Construction of Expression Plasmids—The expression plasmid pTM/C-NS2 contains sequences encoding the predicted C-E1-E2-NS2 polypeptide precursor of GBV-B downstream of the encephalomyocarditis virus (EMCV) internal ribosome entry site (IRES) and the T7 RNA polymerase promoter. To construct it, GBV-B cDNA sequences were amplified by PCR from pGBV-B/2, a plasmid that contains infectious, genomic-length GBV-B cDNA (GenBankTM accession number AY243572 [GenBank] ; Ref. 20). For this PCR reaction, we used a 3' primer complementary to the terminal codons of the GBV-B NS2 sequence with a 3' extension containing an engineered BamHI restriction site and a 5' primer corresponding to the first codons of the GBV-B core sequence with a 5' extension containing a BsaI restriction site followed by 2 nucleotides and an ATG codon. After restriction with BsaI, generating protruding ends compatible with NcoI-restricted extremities, and BamHI, the PCR fragment was cloned into pTM1 (31) at the NcoI and BamHI sites, giving rise to plasmid pTM/C-NS2 in which the first codon of GBV-B core is directly fused to the ATG codon of the EMCV IRES.

pTM/C-NS2-V5 was constructed from a derivative of pTM/C-NS2 containing an engineered, unique AgeI restriction site overlapping the last codon of NS2, upstream of the existing BamHI site (pTM/C-NS2-AgeI-FLAG). A pair of complementary 54-nucleotide-long, 5'-phosphorylated oligonucleotides containing two glycine codons followed by the V5 tag sequence and a stop codon and designed to create protruding ends compatible with an AgeI site at the 5' end and BamHI site at the 3' end were hybridized together and cloned into pTM/C-NS2-AgeI-FLAG at the AgeI and BamHI sites. The resulting plasmid, pTM/C-NS2-V5, thus encodes a GBV-B C-E1-E2-NS2 precursor that is C-terminally fused to the V5 tag. pTM/C-NS2-V5 was used as a template to introduce substitutions at nucleotides 2656 (A -> G) and 2666 (C 3 A) of GBV-B cDNA, altering selected codons specifying Ile (ATA) and Leu (CTG), respectively, to Met (ATG) within the NS2 coding sequence. For this purpose, a PCR-based fusion strategy was used to produce an FseI-HpaI fragment (nucleotides 2547–2749 of the GBV-B cDNA) containing the indicated mutations that was inserted into pTM/C-NS2-V5 to generate pTM/C-NS2-V5-mut5,9.

To construct the plasmid pTM/C-E2701-V5, a pair of complementary, 5'-phosphorylated oligonucleotides containing two glycine codons followed by the V5 tag sequence and a stop codon and designed to create protruding ends compatible with an FseI site at the 5' end and a BamHI site at the 3' end were hybridized together and cloned into pTM/C-NS2 at the FseI (nucleotide 2547 of GBV-B cDNA) and BamHI sites. This resulted in a cDNA encoding GBV-B amino acid residues 1–701 fused in-frame at the C terminus to the V5 tag. Plasmid pTM/C-E2673-V5 was similarly constructed by inserting the same oligonucleotide pair into pTM/C-E2673-FseI-FLAG, which contains an engineered, unique FseI restriction site immediately downstream of nucleotide 2463 of GBV-B cDNA followed by a peptide sequence and a BamHI site. The resulting plasmid, thus, encodes GBV-B amino acid residues 1–673 fused at the C terminus to the V5 tag. The sequence of amplified fragments was verified in all plasmids.

Plasmids pTM/C-E2701-VP1 and pTM/C-E2673-VP1 were designed to encode GBV-B amino acid residues 1–673 or 1–701 fused at the C terminus to a truncated form of the poliovirus VP1 capsid protein for reporter purposes. To construct these plasmids, a cDNA segment representing nucleotides 2480–3241 of poliovirus was amplified by PCR using pTM/*2A2B* as a template (plasmid described in Ref. 32) and primers hybridizing to 5' and 3' VP1 sequences. The 5' primer also contained a 5' extension consisting of an engineered FseI site followed by a Gly-Ser-codon hinge, and the 3' primer contained a 3' extension consisting of a stop codon followed by a BamHI site. After restriction with FseI and BamHI, this PCR fragment was inserted into plasmids C-E2701-V5 and C-E2673-V5, respectively, at the corresponding sites.

Cell Culture and Viruses—Human 143B thymidine kinase-deficient cells were used for the isolation of recombinant vaccinia viruses and monkey kidney cells (CV1) for propagation of these viruses as well as for expression experiments. These cell lines were maintained in Dulbecco's modified Eagle's medium (Invitrogen) supplemented with 5% fetal calf serum (Invitrogen), penicillin (50 units/ml), and streptomycin (50 µg/ml). vTF7–3, a recombinant vaccinia virus expressing T7 DNA-dependent RNA polymerase (33), was kindly provided by B. Moss (National Institutes of Health, Bethesda, MD).

Generation of Recombinant Vaccinia Viruses and Expression Assays in CV1 Cells—Recombinant vaccinia viruses vv-C-E2701-V5 and vv-C-E2673-V5 were generated by homologous recombination in CV1 cells infected with the temperature-sensitive ts7 strain of vaccinia virus and co-transfected with pTM/C-E2701-V5 or pTM/C-E2673-V5, respectively, as well as wild-type Copenhagen vaccinia virus DNA, essentially as described previously (34). Recombinant viruses were plaque-purified twice on 143B thymidine kinase-deficient cells under selective conditions. Vaccinia virus vv-TM1 was similarly derived from plasmid pTM1, which contains no cDNA insert. Large scale stocks of recombinant vaccinia viruses were produced on CV1 cells.

Expression of GBV-B proteins was accomplished by co-infection of CV1 cells with either of the recombinant vaccinia viruses and vTF7–3, each at a multiplicity of infection of 10 plaque-forming units per cell. Cytoplasmic extracts were prepared at various times post-infection by lysis of cells on ice in 0.2 ml of 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 100 mM NaCl, and 1% Nonidet P-40 containing 10 µl of Protease Inhibitor Cocktail (Sigma). Samples were mixed with Laemmli buffer, and polypeptides were separated by SDS-PAGE, then transferred to a polyvinylidene difluoride (PVDF) membrane (Amersham Biosciences) and detected by immunoblotting with an anti-V5 monoclonal antibody diluted 1:10,000 (Invitrogen) using enhanced chemiluminescence as a visualization method (ECL Plus, Amersham Biosciences) as detailed previously (35).

In Vitro Transcription—pGBV-B/2 plasmid DNA was linearized at restriction sites as indicated in the text ("Results" section, Figs. 1 and 4). All pTM1-derived plasmid DNAs were linearized at the unique BamHI restriction site. Linearized DNAs were purified by phenolchloroform extractions then ethanol-precipitated before transcription with T7 RNA polymerase for 4 h at 37 °C using either the RiboMAX kit (Promega) or MEGAscript kit (Ambion). RNAs were phenol-chloroformextracted, ethanol-precipitated, and quantified by electrophoresis in 1% agarose gels.



View larger version (41K):
[in this window]
[in a new window]
 
FIG. 1.
In vitro proteolytic processing of the GBV-B envelope glycoproteins. The genome of GBV-B is schematically represented in panel A with the 5'- and 3'-nontranslated regions (NTR) framing a single open reading frame coding for a polyprotein that is subsequently cleaved to generate the indicated proteins. B, in vitro translation reactions in rabbit reticulocyte lysates in the presence of [35S]Met were programmed with either RNA transcripts prepared from ClaI-linearized pGBV-B cDNA (see panel A) or no RNA (M). Reactions were incubated for 4 h at 30 °C in the presence (+) or absence (–) of canine pancreatic microsomal membranes as indicated at the top of the gel. C, aliquots of translation reactions that were programmed with RNA transcripts prepared from AflIII-linearized pGBV-B cDNA (see panel A) and incubated in the presence of membranes were further incubated with peptide N-glycosidase F (PNGase F) for 3.5 h at 37 °C where indicated (+) to obtain deglycosylated proteins (referred to as dg). The resulting GBV-B polypeptides were separated by SDS-14% PAGE and are identified at the right of the gels with respect to molecular mass standards indicated to the left (B and C).

 



View larger version (45K):
[in this window]
[in a new window]
 
FIG. 4.
Mapping of the C terminus of the GBV-B E2 glycoprotein. A, the region spanning the predicted C terminus of GBV-B E2 and the N terminus of NS2 is schematically represented by thick boxes. Dipeptide positions indicated by arrows on top of the E2-NS2 boxes correspond to confirmed or SignalP-predicted ("?") signal peptidase cleavage sites. Numbers in parentheses below the E2 box correspond to amino acid positions within the GBV-B polyprotein and indicate the C-terminal amino acids of each C-E1-E2 polypeptide precursor encoded by RNAs transcribed from cDNA templates linearized with the indicated restriction enzymes. B, each RNA, transcribed in vitro using T7 RNA polymerase, was translated for 4 h in rabbit reticulocyte lysates in the presence of [35S]Met and canine pancreatic microsomal membranes. C, aliquots of translation products were incubated with peptide N-glycosidase F (PNGase F) to obtain deglycosylated proteins (referred to as dg). In both panels B and C, the predicted C-terminal amino acid of each polypeptide precursor is indicated on top of the gels. The resulting processed polypeptides were separated by SDS-10% PAGE and identified on the right of the gels with respect to molecular mass standards as indicated on the left. The electrophoretic mobilities of the truncated E2 polypeptides, as predicted whether glycosylated (B) or after deglycosylation (C), were compared with the full-length E2 protein, which was similarly produced from a polypeptide precursor extending into the N-terminal part of NS3 (amino acids 1–991, AflIII). A mock translation reaction (M) carried out in parallel in the absence of RNA is also shown.

 
Analytical in Vitro Translation and Processing Assays—In vitro transcribed RNAs (200 ng) were used to program translation reactions in FlexiTM rabbit reticulocyte lysates (Promega) in the presence of [35S]Met (Amersham Biosciences) according to the manufacturer's instructions with or without the addition of canine pancreatic microsomal membranes (Promega). Ten-microliter reactions were incubated at 30 °C for 4 h. One-tenth of each translation reaction was mixed either with Laemmli buffer or with an 1% SDS, 2% {beta}-mercaptoethanol loading buffer containing 9 M urea when indicated in the text (in the "Results" section, Fig. 3) before separation of GBV-B polypeptides by SDS-PAGE.



View larger version (32K):
[in this window]
[in a new window]
 
FIG. 3.
Analysis of the GBV-B NS2 polypeptide. As schematically represented in panel A, plasmid pTM/C-NS2-V5 contains the sequence coding for the GBV-B C-E1-E2-NS2 polypeptide precursor fused at its C terminus to a V5 tag downstream of the EMCV IRES and the promoter for T7 RNA polymerase (P-T7). The region spanning the predicted C terminus of GBV-B E2 and the N terminus of NS2 is schematically expanded at the bottom of the panel. Dipeptide positions within the GBV-B polyprotein indicated by arrows on top of the E2-NS2 boxes correspond to SignalP-predicted (question marks) signal peptidase cleavage sites. B, in vitro translation reactions in rabbit reticulocyte lysates in the presence of [35S]Met were programmed with either RNA transcripts prepared from BamHI-linearized pTM/C-NS2-V5 cDNA or no RNA (M). Translation reactions were incubated for 4 h in the presence (+) or absence (–) of canine pancreatic microsomal membranes as indicated at the top. Products were mixed with a 1% SDS, 2% {beta}-mercaptoethanol loading buffer containing 9 M urea before separation by SDS-12% PAGE. Precursors and processed polypeptides are identified on the right of the gel with respect to molecular weight standards indicated on the left. An aliquot of the in vitro translated products shown in lane 2 was immunoprecipitated with a monoclonal anti-V5 antibody and loaded in lane 3, as indicated on top of the gel (+; Anti-V5 IPP). Lane 3 was subjected to a lengthier exposure for autoradiography than the lanes loaded with crude translated products.

 
For deglycosylation of glycoproteins, one-fifth of the translation reaction was denatured and treated with 1000 units of peptide N-glycosidase F (New England Biolabs) according to the manufacturer's instructions for 3.5 h at 37 °C. Half of the sample was subsequently mixed with Laemmli buffer and analyzed by SDS-PAGE.

Immunoprecipitations—For immunoprecipitation of in vitro translated polypeptides, 6 µl of the translation reaction was diluted into 0.5 ml of radioimmune precipitation assay buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 1 mM EDTA, 0.5% Nonidet P-40) containing 0.1% SDS and incubated overnight at 4 °C with protein A-Sepharose (Amersham Biosciences) and either 1 µg of anti-V5 monoclonal antibody (Invitrogen) or 0.5 µl of rabbit polyclonal anti-poliovirus VP1 antibodies (a gift of B. Blondel, Institut Pasteur, Paris). Immune complexes were eluted from protein A-Sepharose with Laemmli buffer and separated by SDS-PAGE.

N-terminal Amino Acid Sequence Analysis—RNA transcripts derived from BamHI-linearized recombinant pTM plasmids were translated in rabbit reticulocyte lysates in the presence of canine pancreatic microsomal membranes as described above but in preparative 30–50-µl reactions and in the presence of appropriate [3H]-labeled amino acids or [35S]Met (Amersham Biosciences) depending on the protein analyzed. The resulting processed polypeptides were separated by SDS-PAGE either directly after translation or after immunoprecipitation of the translated products with the anti-V5 antibody as described above in the case of pTM/C-NS2-V5-mut5,9. Radiolabeled proteins were transferred to a PVDF membrane, excised from the membrane after identification of their location by autoradiography (Eastman Kodak Co. BioMax MR films) and subjected to automated Edman degradation on an Applied Biosystem 494 liquid-phase sequencer. The isotope content of fractions collected after each cycle of degradation was determined by liquid scintillation counting.

Sequence Analyses and Structural Predictions—Amino acid sequence homologies between GBV-B p13 and HCV p7 and within p13 were searched with the FASTA homology search program (36) using all reported HCV sequences from the GenBankTM/EMBL databases (HCV data base, hepatitis.ibcp.fr). Multiple sequence alignments were carried out with the ClustalW program (37). Internal amino acid sequence similarities within p13 were searched using the Local Similarity Program SIM (38). Peptidase cleavage sites were predicted using the SignalP program (39). Various methods were combined for the prediction of transmembrane sequences: PHDhtm (40), TMHMM (41), DAS (42), and TopPred2 (43). Theoretical {alpha}-helix models of the predicted transmembrane segments were constructed using SwissPdbViewer program (44), and their space-filling representation was generated with Rasmol 2.7 (45). These models were manually positioned in a phospholipid bilayer that was built from the coordinates of phospholipids reported in the 1BCC [PDB] entry of the Protein Data Bank (46).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Characterization of GBV-B Glycoproteins—Nothing is known of the boundaries of the GBV-B structural proteins other than predictions made from amino acid sequence alignments with the HCV polyprotein and computer-based algorithms for host signal peptidase recognition sequences. It is important, however, to gain an experimentally based understanding of the proteolytic processing of the GBV-B structural protein precursor if GBV-B is to serve as a useful model for HCV, particularly as related to the critical issues of particle assembly and viral entry into cells. Thus, to characterize the GBV-B envelope glycoproteins, we expressed these proteins in a cell-free translation system programmed with RNA transcribed from the plasmid pGBV-B/2 that contains an infectious genome-length GBV-B cDNA (Fig. 1A; Ref. 20). RNA was transcribed in vitro from ClaI-linearized pGBV-B/2 cDNA and translated in rabbit reticulocyte lysates in the presence of [35S]Met and in the absence or presence of canine pancreatic microsomal membranes as a source of cellular signal peptidases. The viral polypeptide encoded by this RNA spanned the C-NS4A sequence and extended into the predicted N-terminal domain of NS4B (see Fig. 1A). As shown in Fig. 1B, a high molecular mass protein product of ~180 kDa was detected in the absence of membranes (lane 2) that is likely to correspond to the full-length precursor encoded by the RNA (C-NS4B{Delta}). This protein was not produced in a control translation reaction carried out in the absence of GBV-B RNA (Fig. 1B, lane 1). An additional product of ~55 kDa was observed in the absence of membranes (Fig. 1B, lane 2). This is likely to be identical to an irrelevant product produced in other reactions (see lanes 1–3), although it cannot be ruled out that a co-migrating product resulted from aberrant initiation or premature termination of translation within the precursor sequence.

In the presence of membranes, the GBV-B precursor was fully processed, producing three polypeptides with apparent molecular masses of 31, 44, and 66 kDa, respectively (Fig. 1B, lane 3). The 66-kDa polypeptide was identified as NS3, whereas the 31- and 44-kDa proteins were assumed to represent the envelope glycoproteins, E1 and E2, respectively. Neither NS4A-4B{Delta} nor the core protein, C, was expected to be visualized, since the former has a very small size (7.4 kDa) and the latter lacks internal methionine residues in its sequence. Several minor, diffuse bands with molecular masses in the range of 20–28 kDa were also observed (Fig. 1B, lane 3). These products may represent incompletely glycosylated forms of E1 or the NS2 protein, which has a predicted molecular mass of ~23 kDa. However, NS2 could not be conclusively identified under those conditions. A comparison of lanes 2 and 3 clearly shows that the C-NS4B{Delta} precursor was not proteolytically processed in the absence of membranes. This indicates that the GBV-B proteases contained within this segment of the polyprotein (putatively NS2–3, and NS3/4A) are unable to direct cleavage at the NS2/NS3 or NS3/4A sites in the absence of prior cleavage of the structural protein precursor by signal peptidases. This suggests a fundamental difference from HCV in which NS2-3- and NS3/4A-directed cleavages have been shown to occur under similar conditions (our data and Ref. 47).

The predicted GBV-B E1 sequence contains 192 amino acid residues (residues 157–348 of the polyprotein, Ref. 25) and 3 putative N-linked glycosylation sites. Although somewhat greater than expected, the observed molecular mass of this protein (31 kDa) is consistent with this size prediction and the use of all three predicted glycosylation sites. In contrast, with 384 predicted amino acids (residues 349–732 of the polyprotein, Ref. 25), E2 would be expected to exhibit a 42-kDa molecular mass in the absence of glycosylation. Because there are 6 potential N-linked glycosylation sites in this predicted sequence, the observed molecular mass of 44 kDa suggests that the actual E2 sequence might be shorter or that E2 might be only partially glycosylated. To test this hypothesis, we treated the in vitro translation products with peptide N-glycosidase F. After removal of N-linked sugars, the electrophoretic mobilities of E1 and E2 shifted to 22 and 31 kDa, respectively (Fig. 1C, compare lanes 1 and 2). These results are consistent with the predicted sequence for E1, but they suggest that E2 is actually much shorter than predicted.

Determination of N-terminal Sequences of GBV-B Envelope Proteins—To map the boundaries of both E1 and E2 proteins, we used a heterologous expression system in which viral polypeptides were translated in vitro after internal initiation on the EMCV IRES. cDNA sequences coding for GBV-B C-E1-E2-NS2 were cloned into the plasmid pTM1 downstream of the EMCV IRES and the T7 RNA polymerase promoter (see "Experimental Procedures"), resulting in plasmid pTM/C-NS2. Such a strategy was expected to give higher yields of polypeptides than when expression was driven from the GBV-B IRES while also facilitating future generation of the corresponding recombinant vaccinia viruses. For N-terminal sequencing of proteins, RNA transcribed from the BamHI-linearized pTM/C-NS2 cDNA was translated in rabbit reticulocyte lysates in the presence of canine pancreatic microsomal membranes and either of two different tritiated amino acids, selected with respect to their occurrence and position in the predicted 10 N-terminal amino acids of the protein of interest.

For the E1 protein, labeling was with [3H]Val and [3H]Thr. Processed, in vitro translated products were separated by SDS-PAGE and transferred to a PVDF membrane. Radiolabeled E1 proteins were excised from the membranes and subjected to 10 cycles of Edman degradation. The isotope content of fractions collected after each cycle was determined. [3H]Val was clearly recovered at the third cycle, and [3H]Thr was recovered at the fourth cycle, a profile that was reproduced in two separate experiments (Supplemental Fig. 1A). These data map the N terminus of E1 to the Ala residue located at position 157 of the GBV-B polyprotein. This is in good agreement with the SignalP program (39), which predicts the most likely cleavage in this region to be at residues 156/157.

A similar analysis of [3H]Pro- and [3H]Val-labeled E2 proteins revealed Pro residues at positions 2 and 6 and a Val residue at position 5 of the protein, with marked increases in isotope recovery in these fractions that were reproduced in separate experiments (Fig. 2). This succession of amino acids is consistent with the N terminus of E2 being located at the Asn residue at position 350 of the polyprotein. Therefore, these results map the E1/E2 junction to residues 349/350, 1 amino acid residue downstream of the previously predicted site (25). However, this 349/350 site also conforms to predicted signal peptidase cleavage requirements and was also identified as a likely cleavage site by the SignalP program.



View larger version (19K):
[in this window]
[in a new window]
 
FIG. 2.
N-terminal amino acid sequencing of the E2 glycoprotein. RNA transcripts derived from BamHI-linearized pTM/C-NS2 cDNA were translated in rabbit reticulocyte lysates in the presence of canine pancreatic microsomal membranes and either [3H]Val or [3H]Pro. The polypeptides derived from processing of the encoded C-E1-E2-NS2 precursor were separated by SDS-PAGE and transferred to a PVDF membrane. [3H]Val- and [3H]Pro-labeled E2 proteins were excised from the PVDF membrane after autoradiography and subjected to eight cycles of Edman degradation. The isotope content of fractions collected after each cycle for [3H]Val E2 (black squares, solid line) and [3H]Pro E2 (gray circles, dashed line) was determined. The cycles at which the recovery of labeled amino acids (in bold characters and framed) peaked reproducibly are indicated by the arrows. The deduced polypeptide sequence, derived from an examination of the GBV-B polyprotein sequence in the region of the cleavage, is shown at the bottom of the graph. Numbering of the N-terminal amino acid indicates its position within the GBV-B polyprotein.

 
Characterization of the Downstream, NS2 Polypeptide—Having determined the N terminus of E2 by sequencing, we examined the predicted C-terminal sequence of E2 with the SignalP program to identify potential cleavage sites. Beside the previously predicted E2/NS2 cleavage site at residues 732/733, two possible sites of cleavage were identified at positions 613/614 and 681/682 (Fig. 3A). Because nothing is known about the actual molecular mass of the GBV-B NS2 protein, we undertook to determine whether the N terminus of NS2 was located at residue 733 as predicted or at one of the other upstream putative cleavage sites.

Much like the HCV NS2 protein, the predicted sequence of GBV-B NS2 is mostly hydrophobic, with several putative transmembrane domains. To facilitate detection of this polypeptide, we constructed a pTM/C-NS2 derivative coding for a C-terminal fusion of a V5 tag to NS2, including three glycine residues as a flexible hinge between the protein domains (Fig. 3A; see "Experimental Procedures"). In vitro translations were carried out with or without the addition of microsomal membranes, and aliquots of the resulting products were immunoprecipitated with a monoclonal anti-V5 antibody. The resulting [35S]Met-labeled products were diluted in a loading buffer containing 9 M urea in an effort to avoid aggregation of the membrane-spanning polypeptide and separated by SDS-PAGE. A precursor polypeptide of ~100 kDa, corresponding to the full-length C-NS2-V5 precursor, was observed in the absence of membranes (Fig. 3B, lane 1). Upon the addition of membranes, the precursor protein was processed into E1 and E2 as well as a polypeptide with an apparent molecular mass of ~22 kDa (Fig. 3B, lane 2). This latter product was the only one to be subsequently immunoprecipitated with an anti-V5 monoclonal antibody (Fig. 3B, lane 3) and was, therefore, identified as NS2-V5. The slightly different electrophoretic mobility of this polypeptide in lane 2 as compared with lane 3 is likely due to differences in total protein loading and the presence of rabbit reticulocyte lysate proteins in the crude translation reaction. This result confirms that a cellular signal peptidase is responsible for the release of NS2, as is the case for HCV.

Preliminary data obtained by sequencing an in vitro translated, [3H]Leu-labeled NS2-V5 polypeptide supported the initial prediction for the NS2 N terminus (Phe733) but were not definitive due to very low amounts of product recovered from the membranes (data not shown). To enhance the radioactive labeling of the product, we substituted codons 737 (ATA, Ile) and 741 (CTG, Leu), putatively located at positions 5 and 9 of NS2 in pTM/C-NS2-V5, respectively, with ATG (Met) codons, thus generating the plasmid pTM/C-NS2-V5-mut5,9. These substitutions are conservative in terms of the hydrophobicity of the side chains; positions 5 and 9 were chosen in an effort to reduce the potential impact of substitutions on the efficiency of cleavage by cellular signal peptidase. The in vitro translated NS2-V5-mut5,9 polypeptide, synthesized in the presence of [35S]Met, was immunoprecipitated with anti-V5 antibodies and subjected to 10 cycles of Edman degradation. Radiolabeled Met residues were recovered in fractions collected after the fifth and, to a lesser extent, ninth cycles (Supplemental Fig. 1B), a result that was confirmed in a repeat experiment. These data demonstrated that the N-terminal residue of NS2-V5 is the Phe residue at position 733, a result that is in agreement with the SignalP prediction.

The E2 Glycoprotein Is Smaller Than Predicted—To map the approximate position of the C terminus of the envelope glycoprotein E2, we expressed proteins from pGBV-B/2 RNA transcripts that were terminated at various sites within the predicted E2-encoded sequence (Fig. 4A). RNAs transcribed in vitro from BsaAI-, EciI-, BssHII-, BstEII-, SmlI-, PvuII-, or FseI-linearized cDNA templates were subsequently translated in rabbit reticulocyte lysates in the presence of microsomal membranes and [35S]Met. These RNAs generated various C-terminal-truncated polypeptide precursors that we refer to with respect to the position of the C-terminal amino acid within the GBV-B polyprotein. To produce a marker for the mature E2 protein, we used an AflIII-linearized cDNA template that resulted in synthesis of a C-NS3{Delta} precursor spanning amino acid residues 1–991 of the polyprotein. All of these precursor polypeptides gave rise to the expected, fully processed E1 protein (Fig. 4B). A comparison of the electrophoretic mobilities of the glycosylated forms of E2 released from the various C-terminal-truncated precursors revealed that several products of various intensities and molecular masses, all much lower than those of full-length E2, were generated upon processing of the 1–531 (BsaAI) and 1–563 (EciI) precursors (Fig. 4B, compare lanes 1–2 with lane 8). These truncations were further shown to alter the E2 glycosylation pattern, although they do not remove any predicted glycosylation sites. In contrast, the 1–637 (SmlI), 1–678 (PvuII), and 1–699 (FseI) products, when processed, produced glycosylated polypeptides that co-migrated with mature E2 (Fig. 5B, compare lanes 5–7 with lane 8). The 1–610 (BssHII) and 1–614 (BstEII) precursors released glycosylated proteins that migrated slightly faster than the mature E2 (Fig. 4B, compare lanes 3–4 with lane 8).



View larger version (25K):
[in this window]
[in a new window]
 
FIG. 5.
Evidence for the existence of a small polypeptide between GBV-B E2 and NS2. A, at the top is shown a schematic representation of the region corresponding to the predicted C terminus of GBV-B E2 and the N terminus of NS2. Indicated below are the C termini of the GBV-B structural precursors spanning amino acid residues 1–673 or 1–701, each fused at its C terminus to the V5 tag and expressed by recombinant vaccinia viruses vv-C-E2673-V5 or vv-C-E2701-V5. The location of the two SignalP-predicted signal peptidase cleavage sites ("?") in the predicted E2 C terminus is indicated by arrows on top of the boxes. B, CV1 cells were co-infected with vTF7–3 and the indicated recombinant vaccinia virus or with a recombinant vaccinia virus expressing no heterologous sequence (vv-TM1). V5-containing polypeptides present in cytoplasmic extracts, prepared at the indicated times post-infection (hours (Hrs) post-infection (p.i.)), were separated by SDS-16% PAGE and identified by immunoblot using an anti-V5 monoclonal antibody. Positions of molecular mass standards and of the small pX673-V5 and pX701-V5 polypeptides are indicated.

 
To eliminate any potential interference from differential N-glycosylation patterns in determining the molecular mass of these polypeptides, we deglycosylated the in vitro translated products by incubation with peptide N-glycosidase F. It was not possible to discriminate between the apparent molecular masses of the E2 protein released from any precursor extending to amino acid 610 or beyond (Fig. 4C, lanes 3–6), whereas the 1–531 and 1–563 precursors clearly produced truncated forms of E2 (Fig. 4C, lanes 1–2). Altogether, these results indicate that the C terminus of E2 is located between amino acids 610 and 637 (BssHII and SmlI templates, respectively) and that the precursors that extend downstream of these residues contain an efficiently processed cleavage site located between E2 and the downstream protein. Importantly, these data demonstrate clearly that E2 is substantially shorter in length than originally predicted.

Demonstration of the Existence of a GBV-B Polypeptide between E2 and NS2—Next, we focused on the processing of the viral polyprotein in the region between E2 and NS2. As indicated above, the SignalP program suggests two possible cleavage sites in the 610–732 amino acid region, one at positions 681/682 and a second one, less likely, at positions 613/614. We selected an expression strategy based on the use of recombinant pTM plasmids permitting both in vitro translation as well as recombinant vaccinia virus-driven expression in eukaryotic cells and involving C-terminal fusion of structural precursors to a V5 tag. This C-terminal tag was placed either at positions 673 or 701 within the polyprotein, each downstream with respect to one of the two putative cleavage sites at positions 613/614 and 681/682. Two plasmids, pTM/C-E2673-V5 and pTM/C-E2701-V5, were thus generated, providing expression of precursor proteins spanning amino acids 1–673 or 1–701 of the GBV-B polyprotein, respectively, each fused at its C terminus to a Gly-Gly-Gly hinge followed by the V5 peptide.

We expressed these V5 fusion proteins in eukaryotic cells using recombinant vaccinia viruses generated by homologous recombination between pTM/C-E2673-V5 and pTM/C-E2701-V5 DNAs and the vaccinia virus genome, generating vv-C-E2673-V5 or vv-C-E2701-V5, respectively (see "Experimental Procedures"; Fig. 5A). CV1 cells were co-infected with vTF7–3 (expressing T7 RNA polymerase) and either vv-C-E2673-V5 or vv-C-E2701-V5. V5-tagged proteins present in cytoplasmic extracts prepared at various times (7–17 h) post-infection were detected by immunoblotting with anti-V5 antibodies. A protein of ~8.5 kDa (pX701-V5) was detected in vv-C-E2701-V5-infected cells as well as another minor species ~1-kDa smaller (Fig. 5B, left panel). Protein expression was optimal at the times shown in this figure, 9 and 11 h post-infection. Two V5 fusion proteins (pX673-V5) were observed in vv-C-E2673-V5 infected cells, with apparent molecular masses of ~6.5 and ~7.5 kDa, respectively (Fig. 5B, right panel). With each precursor, the second, smaller polypeptide species never became predominant with increasing time and even was not observed at some later time points. Although the precise molecular mass of very hydrophobic proteins may be difficult to estimate and although the difference in mass that was apparent between the pX701-V5 and pX673-V5 products is somewhat less than expected, the size of the products shown in Fig. 5B is most consistent with cleavage occurring between residues 613 and 614. The co-existence of two forms of each resulting polypeptide might reflect the use of an alternate, non-predicted cleavage site or possible post-translational modification of the polypeptides (see "Discussion"). Taken together, the results of these experiments demonstrate the existence of a previously unrecognized polypeptide located within the polyprotein between E2 and NS2.

Determination of the N-terminal Sequence of the Newly Identified Polypeptide—The expression levels and small sizes of the pX673-V5 and pX701-V5 products did not permit sufficient recovery for N-terminal sequencing of metabolically labeled proteins. Thus, for sequencing purposes, we fused a larger reporter sequence at the C terminus of the C-E2701 and C-E2673 precursors, allowing the use of in vitro translation as an expression system. pTM/C-E2673-VP1 and pTM/C-E2701-VP1 were constructed by replacing the V5 tag sequence in pTM/C-E2673-V5 and pTM/C-E2701-V5, respectively, with a sequence encoding a truncated form of the poliovirus VP1 capsid protein (aa 1–254) as a reporter protein of ~28 kDa preceded by a "Gly-Ser-Gly" flexible hinge (Fig. 6A).



View larger version (50K):
[in this window]
[in a new window]
 
FIG. 6.
Expression of the truncated forms of the GBV-B pX protein fused at the C terminus to a poliovirus reporter protein. A, constructs made in the pTM1 background and designed to express GBV-B structural precursors spanning amino acid residues 1–673 or 1–701 are schematically represented (pTM/C-E2673-VP1 or pTM/C-E2701-VP1). Each precursor is fused at its C terminus to poliovirus VP1 sequence as a reporter protein (a truncated form of the VP1 capsid protein of poliovirus, VP1{Delta}). The locations of the two SignalP-predicted ("?") signal peptidase cleavage sites in the predicted E2 C terminus are indicated by arrows. B, in vitro translation reactions in rabbit reticulocyte lysates were programmed with either RNA transcribed from the indicated constructs or no RNA (M) in the presence of [35S]Met and canine pancreatic microsomal membranes. Precursors and processed polypeptides were separated by SDS-12% PAGE and are identified at the right of the gels with respect to molecular mass standards, indicated to the left. Where indicated at the top of the gel (+; Anti-VP1 IPP), aliquots of in vitro translated products were immunoprecipitated with polyclonal antibodies directed to poliovirus VP1 protein. The lanes within the gel are separated by lines to indicate that those loaded with immunoprecipitated polypeptides were subjected to a lengthier exposure for autoradiography than those loaded with crude translated products.

 
In vitro translation reactions programmed with RNA transcripts derived from these constructs and containing microsomal membranes generated the E1 and E2 proteins as well as a doublet of proteins with molecular masses of ~40.5 and 42 kDa with pTM/C-E2673-VP1 and 42 and 43 kDa with pTM/C-E2701-VP1, respectively (Fig. 6B, lanes 1 and 3). Both the larger and smaller polypeptide species produced from each transcript were immunoprecipitated with anti-VP1 antibodies (Fig. 6B, lanes 2 and 4). These results are similar to those obtained with V5-tagged expression products with respect to the presence of a doublet and the differences in the electrophoretic mobilities of pX701-VP1 and pX673-VP1. Taken together, these data provide strong support for a cleavage event between residues 613 and 614.

To confirm this, a [3H]Leu-labeled pX673-VP1 protein doublet was transferred to PVDF membranes and subjected together to N-terminal Edman degradation, as described above. Radiolabeled Leu residues were recovered at cycles 3 and 7 (Fig. 7A), compatible with the SignalP prediction of a cleavage occurring between Gly613 and Tyr614 residues of the polyprotein. We then focused on pX701-VP1, separately recovering the large and small [3H]Leu- or [3H]Pro-labeled proteins from the corresponding doublet and subjecting each separately to 5 cycles of Edman degradation. Recovery of a Pro residue in the second cycle and a Leu residue in the third cycle from each species clearly demonstrated that the two proteins in the pX701-VP1 doublet have identical N termini corresponding to Tyr614 in the polyprotein (Fig. 7B and data not shown). This N-terminal boundary was identical to that found in pX673-VP1. The data, thus, demonstrate that the doublet bands observed with both the pX701-VP1 and pX673-VP1 products do not reflect the existence of an alternative N-terminal cleavage site. Because they appear to be similar in sequence, the different electrophoretic mobilities of these doublet bands might arise from differences in their conformation or, more likely, a post-translational modification (see "Discussion").



View larger version (13K):
[in this window]
[in a new window]
 
FIG. 7.
N-terminal amino acid sequencing of GBV-B pX protein. RNA transcripts derived from BamHI-linearized pTM/C-E2673-VP1 (A) or pTM/C-E2701-VP1 (B) cDNAs were translated in rabbit reticulocyte lysates in the presence of canine pancreatic microsomal membranes and either [3H]Leu or [3H]Pro. A, the [3H]Leu pX673-VP1 protein doublet (see Fig. 6) was transferred to a PVDF membrane, excised, and subjected as a mixture to eight cycles of Edman degradation. B, the two protein bands forming the [3H]Leu (black squares, solid line)- or [3H]Pro (gray circles, dashed line)-labeled pX701-VP1 doublet (see Fig. 6) were separated, and the more rapidly migrating protein species was subjected to five cycles of Edman degradation. The cycles at which labeled amino acids (in bold characters and framed) were specifically recovered are shown by arrows. The deduced polypeptide sequence is shown at the bottom. Numbering of the N-terminal amino acid indicates its position within the GBV-B polyprotein.

 
These data argue strongly for the existence of a novel GBV-B protein that is 119 amino acid residues in length, located between the E2 and NS2 proteins and extending from the Tyr residue at position 614 to the Ala residue at position 732 of the polyprotein. This polypeptide has a calculated molecular mass of 13.1 kDa and is, thus, referred to hereafter as p13. A pair of polypeptides of ~36–37 kDa was also observed at a low abundance among the in vitro translated products from reactions programmed with pTM/C-E2701-VP1 RNA; these were immunoprecipitated by anti-VP1 antibodies (pY701-VP1, lanes 3–4, Fig. 6B). We suspect that they may arise from an alternative cleavage event occurring at the putative 681/682 site, but the very low level of expression of these polypeptides did not permit their direct amino acid sequencing. This putative alternative cleavage event could not be observed with V5-tagged expression products (Fig. 5), since it would have given rise to products that were too small to detect. To determine whether the C-terminal truncation of p13 at amino acid 701 (Fig. 6) might have biased putative usage of the 681/682 cleavage site, we also expressed the full-length p13 sequence (amino acids 614–732) as a fusion with poliovirus VP1. We found that the N terminus of the major VP1-reactive processing product was located at Tyr614 (data not shown), in clear agreement with the existence of p13 in GBV-B.

Sequence Analysis and Structural Predictions of the Newly Identified p13 Protein—Examination of the p13 sequence revealed that it contains four long hydrophobic stretches. Regardless of the method used (PHDhtm, TMHMM, DAS, or Top-Pred2, see "Experimental Procedures"), these hydrophobic stretches were predicted to be transmembrane (TM) helices and were, thus, denoted TM1-TM4 (gray boxes, Fig. 8). Because the p13 protein is located between the structural and nonstructural proteins in the GBV-B polyprotein, it was expected to share potential homologies with the p7 protein of HCV (10), a 63-amino acid-long protein containing two transmembrane segments that is similarly positioned within the HCV polyprotein (11). Although much lengthier, a comparison of the p13 sequence with that of p7 showed that the two C-terminal transmembrane segments (TM3-TM4) of GBV-B p13 possess rather clear similarities with the two transmembrane segments (TM1-TM2) of HCV p7 (Fig. 8A). In addition, the p13 TM3 and TM4 segments are connected with a short, basic segment that includes several Arg residues, similar to the short sequence located between the TM1 and TM2 segments of p7 (Fig. 8A). The C-terminal part of p13 including TM4 displays structural features typical of signal peptides (Fig. 8B), indicating that it is likely to represent the signal for the reinitialization of translocation of the downstream NS2 protein. These predictions suggest that the topology of the C-terminal part of p13 is similar to that of TM1-TM2 in p7. Interestingly, we also found strong internal sequence similarities between TM4 and TM2 of GBV-B p13, with 42% amino acid residue identities (Fig. 8B). The segments that share strong similarities end at residues 681 and 732, respectively, which correspond to the predicted and confirmed signal peptidase cleavage sites. The TM2 segment also exhibits structural features that are characteristic of a signal peptide (Fig. 8B) and is, thus, likely to act as a signal for the reinitialization of translocation of the C-terminal part of p13. From these analyses a putative topology can be proposed for GBV-B p13 protein wherein the N and C termini are oriented toward the ER lumen, with the short segments that connect the four transmembrane helices oriented alternatively toward the cytosol and the ER lumen (Fig. 8C).



View larger version (66K):
[in this window]
[in a new window]
 
FIG. 8.
Predicted structure of the newly identified GBV-B p13 protein. A, alignment of the amino acid sequences of the GBV-B p13 and HCV p7 proteins; the p13 amino acid residues (single-letter code) are numbered with respect to their position within the GBV-B polyprotein (GenBankTM accession number AY243572 [GenBank] ), whereas HCV p7 residues are numbered according to their position within this protein (EMBL accession number D63821 [GenBank] corresponds to the closest HCV p7 sequence to p13). Boxed amino acids correspond to residues predicted to be involved in transmembrane segments, as deduced from various prediction methods (see "Experimental Procedures"). Identical, strongly conservative, and weakly conservative amino acid pairs are indicated by stars, colons, and dots, respectively, according to ClustalW convention (37). B, an alignment of the sequences of the amino and carboxyl halves of the GBV-B p13 protein demonstrates high frequency of internal sequence identity (42%), suggesting that p13 may have been formed by gene duplication. Note that each segment exhibits structural features that are characteristic of signal peptides (57), consisting of an N-terminal region (n-domain) encompassing 1–3 positively charged residues (Arg), a hydrophobic core region (h-domain) forming an {alpha}-helix, and a more polar, flexible region (c-domain) containing a signal peptidase cleavage site; residues at positions –1 and –3 relative to the cleavage site are small neutral residues (Ala) and form the recognition site for signal peptidases (58), whereas {alpha}-helix-destabilizing residues (Pro, Gly) are present at position –6 and/or in the middle of the h-domain. C, putative topology of GBV-B p13. Several amino acid residues located within the loops connecting the four transmembrane segments are indicated by their positions within GBV-B polyprotein. Note that the conserved Arg-X-Arg residues present between TM1/TM2 and TM3/TM4 are positioned on the cytosolic side of the membrane. ER, endoplasmic reticulum. D, theoretical {alpha}-helical models of the predicted transmembrane segments TM2 and TM4 and their putative location within a phospholipid bilayer were generated as described under "Experimental Procedures." Residues identical in both TMs (indicated by stars in panel B) are represented in gray or black, depending on their location within the helix. Note that residues in gray form similar twisted grooves in both the TM2 and TM4 helices. The polar heads and aliphatic tails of phospholipids are in light gray and very light gray, respectively. Residues His725 and Trp715, possibly contributing to ion channel function, are indicated in TM4.

 
To better evaluate the relevance of the amino acid identities present in the sequences of the TM2 and TM4 segments of p13, we constructed theoretical {alpha}-helical molecular models for both TMs (Fig. 8D). Examination of the spatial arrangement of the side chains of the amino acids residues that are common to both the TM2 and TM4 helices revealed that most of them (residues are in gray in Fig. 8D) are localized on one side of each helix and form a twisted groove extending throughout the helix. The other identical Arg and Leu residues shared by TM2 and TM4 are located on the opposite face of these helices (Arg656, Leu660, and Leu670 in TM2; Arg707, Leu711, and Leu721 in TM4; residues are in black in Fig. 8D). These observations provide circumstantial evidence that TM2 and TM4 share a similar structure, raising the hypothesis that GBV-B p13 might be derived from a gene duplication. Two amino acid residues in TM4, His725 and Trp715 (Fig. 8D), may have a functional role in ion channel activity (see "Discussion"). The fact that these residues have no homologs in TM2 suggests that TM2 and TM4 may possess different functions.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Although its natural host is unknown, GBV-B replicates within the livers of small New World primates, generally causing an acute hepatitis upon infection. In addition to its host tropism and pathogenic characteristics, GBV-B appears to share many molecular characteristics with HCV. In the present work, we studied the proteolytic processing of the GBV-B polyprotein and experimentally determined each of the sites of proteolytic cleavage within the GBV-B structural protein precursor by N-terminal sequencing of the fully mature proteins. We report for the first time a preliminary characterization of GBV-B envelope proteins. We identify several features that are common to both GBV-B and HCV but also point to potentially important differences in these viruses.

The C/E1 junction was found to match the initial prediction by Muerhoff et al. (25) at amino acid residues Gly156/Ala157 of the polyprotein. In contrast, we found that the E1/E2 junction is actually located one amino acid downstream of the initial prediction (25), at the dipeptide Gly349/Asn350. Therefore, with 193 residues, the size of GBV-B E1 protein is very close to that of the HCV E1 protein (192 residues in a genotype 1 strain of HCV, Ref. 48). In addition, our data suggest that all three putative N-linked glycosylation sites are most probably used (Fig. 1). This also brings the molecular mass of the GBV-B E1 glycoprotein close to that of HCV E1, in which 4 N-glycosylation sites are used (49).

In contrast, our analysis of the C-terminal domain of the GBV-B structural precursor revealed previously unreported features that diverge substantially from the original predictions. We demonstrated that the location of the junction between the structural proteins and the first nonstructural protein NS2 conforms to the prediction and that this junction (Ala732/Phe733) is cleaved by a cellular signal peptidase, as in HCV (Fig. 3). Unexpectedly, we found that E2 is considerably smaller than originally predicted, since we demonstrated the efficient cleavage of the polyprotein at Gly613/Tyr614 by N-terminal sequencing of the downstream cleavage product (Figs. 6, 7). GBV-B E2, thus, consists of only 264 residues, resulting in a clear size difference compared with HCV E2 that contains 363 residues (e.g. genotype 1, Refs. 10 and 48). In addition, whereas our data indicate that most if not all of the six putative N-linked glycosylation sites of GBV-B E2 are utilized (Figs. 1 and 4), the HCV E2 is considerably more heavily glycosylated with 11 putative N-glycosylation sites (HCV genotype 1a). Despite these differences, structure predictions indicate the presence of a putative transmembrane domain at the actual C terminus of GBV-B E2 which is similar to that observed in HCV (50).

Consistent with the demonstration that cleavage events take place at dipeptides Gly613/Tyr614 and Ala732/Phe733, we documented the existence of a previously unrecognized GBV-B protein, p13, composed of 119 amino acid residues spanning amino acid residues 614–732 of the GBV-B polyprotein and with a calculated molecular mass of 13.1 kDa. Whether C-terminally fused to a small 17-amino acid peptide tag or to a larger, ~28-kDa reporter protein, the GBV-B 614–673 (pX673-V5, pX673-VP1) and 614–701 (pX701-V5, pX701-VP1) polypeptides migrated as protein doublets with apparent molecular masses differing by ~1–1.5 kDa in SDS-PAGE (Figs. 5, 6). These two species of each fusion protein appear to have identical sequences, since they were immunoprecipitated by antibodies to the reporter peptide sequence fused at their C termini while at the same time containing identical N termini (Fig. 7). We suspect that they may represent different conformations of the protein that co-exist under the conditions of SDS-PAGE or, more likely, two forms of the same protein differing by a post-translational modification. Such a modification would be incomplete in either of the expression systems we used, rabbit reticulocyte lysates programmed for translation with synthetic RNA or eukaryotic cells infected with recombinant vaccinia viruses (Figs. 5, 6). However, we observed no kinetic relationship in the relative proportions of these two forms. One intriguing possibility is that there might be S-acylation of the Cys residues located within or at the connection between the putative transmembrane regions of the protein (at positions 648, 649, 653) in some molecules. Interestingly, such S-acylation is frequent in membrane-spanning proteins and has been found in the 6K protein of Alphavirus (51) and the M2 protein of influenza virus (52), two viral membrane proteins members of a group of small proteins known as viroporins (53). However, further experiments will be required to refute or confirm this hypothesis.

Given its position within the viral polyprotein, this newly discovered p13 protein was expected to share homologies with the p7 protein of HCV (10), a protein that has not been intensively investigated until recently. HCV p7 is composed of 63 amino acids and has been suggested to be a membrane-spanning protein located in the endoplasmic reticulum and containing two transmembrane helices (11). Like HCV p7, prediction analyses of the topology of p13 revealed that it is a polytopic membrane protein that is likely to have its N and C termini oriented toward the ER lumen environment (Fig. 8). In contrast, with 119 amino acids and four predicted transmembrane helices, GBV-B p13 appears to have a structurally different organization than that of HCV p7. We demonstrated, however, that the TM3-TM4 segments of GBV-B p13 share clear homologies with the two transmembrane segments of HCV p7 (Fig. 8). This suggests the possibility that the p13 segment spanning the TM3-TM4 domains might be functionally equivalent to the p7 protein of HCV. Such a hypothesis would be consistent with the prediction of a putative signal peptidase cleavage site at Ala681/Gln682, immediately upstream of TM3 within GBV-B p13. However, our experimental data indicate that such a cleavage occurs with very low efficiency, if at all, regardless of whether p13 is C-terminally fused to a short or a long tag sequence or whether it is expressed as a truncated product (see Fig. 5, 6) or a full-length product (data not shown). Moreover, we have no indication that the faint additional products that we observed in processing reactions (pY701-VP1, Fig. 6) have an N terminus located at residue Gln682. Regardless of whether GBV-B p13 is further processed into two proteins or remains intact, the functional role of the N-terminal half of p13 remains to be determined.

The homologies evident in the sequences and structures of the TM2 and TM4 transmembrane helices of p13 (Fig. 8) are remarkable. The nature of the residues, particularly those with short side chains (Gly, Ala, Pro), that are present within identical grooves in TM2 and TM4 suggests that the corresponding helical faces might be involved in specific helix-helix interactions with a common partner, either p13 itself or another viral or cellular protein that is essential for virus replication. Three recent publications suggest that the HCV p7 protein may function as an ion channel (1214). For it to have such activity, HCV p7 would need to multimerize, and p7 was indeed observed to form putative hexameric ring structures in membranes (12). By analogy with the p7 protein, p13 oligomerization might occur through intra- and/or intermolecular interactions involving the clearly homologous grooves in the TM2 and TM4 domains. Typically, a trimer of p13 could theoretically reproduce the arrangement of transmembrane helices present in a hexamer of p7. However, the presence of the His725 and Trp715 residues in TM4, which may have a functional role in ion channel activity but have no homologs in TM2, suggests that the amino and carboxyl halves of p13 may possess different functions. Although additional experimental data will be needed to determine the oligomeric state and the three-dimensional structure of p13, our analyses strongly suggest that p13, at least through its C-terminal domain, is likely to share ion channel activity with HCV p7.

The p7 ion channel activity has been reported to be inhibited by amantadine (12), a drug that is used in the treatment of influenza A virus infections and known to act by inhibiting the ion channel activity of the M2 viral protein (54). Therefore, HCV p7 is presumed to be a member of the viroporin family. Our data with GBV-B strengthen the hypothesis that numerous members of the Flaviviridae family, including Pestiviruses (e.g. bovine viral diarrhea virus, Ref. 55), HCV, and GBV-B) encode such small membrane-spanning proteins. These proteins are, therefore, likely to have a key role in the Flaviviruses life cycle and may through their ion channel activity contribute to virion maturation, release, and/or entry into cells. In a reverse genetics study of bovine viral diarrhea virus, it was shown that a large in-frame deletion of p7 does not affect RNA replication but prevents the formation of infectious virions (15). The role of HCV p7 in the virus life cycle is unclear at present, but a genome-length HCV RNA in which the p7 sequence had been deleted was not infectious upon intrahepatic inoculation in a chimpanzee (56). A subsequent analysis of genome-length molecular clones of HCV containing chimeric intertypic p7 sequences revealed that there was a genotype-specific p7 requirement localized to the N- and/or C-terminal regions of this polypeptide, suggesting that it may interact with other proteins encoded by HCV in a genotype-specific manner (56). It would be interesting to determine whether part or all of the GBV-B p13 sequence could be replaced by HCV p7 sequence without ablating GBV-B infectivity in tamarins. In addition to providing information on the structure and function of this novel polypeptide, such a chimeric GBV-B/HCV virus would be a valuable tool to evaluate the potential of antiviral candidates targeting HCV p7 sequences in vivo in small primates. GBV-B, as a surrogate model, also offers invaluable opportunities to look at the role of these small membrane-spanning proteins in the viral life cycle in primary cultures of hepatocytes and in a small primate animal model.


    FOOTNOTES
 
* This work was supported in part by the French Ministère de la Recherche, Programme de Recherche Fondamentale en Microbiologie, Maladies Infectieuses et Parasitaires (PRFMMIP, "Emergence jeune équipe") and by the INSERM Grant 1A133C/CNRS (Réseau National: Hépatites Virales). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

The on-line version of this article (available at http://www.jbc.org) contains Supplemental Fig. 1. Back

§ Supported by a French Ministère de la Recherche fellowship. Back

|| To whom correspondence should be addressed: Unité de Génétique Moléculaire des Virus Respiratoires, CNRS URA 1966, Institut Pasteur, 25 rue du Dr. Roux, 75724 Paris Cedex 15, France. Tel.: 33-1-40-61-33-60; Fax: 33-1-40-61-30-45; E-mail: annettem{at}pasteur.fr.

1 The abbreviations used are: HCV, hepatitis C virus; GBV-B, GB virus B; EMCV, encephalomyocarditis virus; IRES, internal ribosome entry site; PVDF, polyvinylidene difluoride; TM, transmembrane. Back


    ACKNOWLEDGMENTS
 
We thank Danièle Bénichou for the construction of plasmid pTM/C-NS2, Jacques d'Alayer for carrying out protein sequencing reactions, and Bruno Blondel for providing poliovirus anti-VP1 antibodies. We are grateful to Stanley M. Lemon for critical reading of this manuscript and to Sylvie van der Werf for continuous support.



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Lindenbach, B. D., and Rice, C. M. (2001) in Fields Virology (Knipe, D. M., and Howley, P. M., eds) Vol. 1, 4th Ed., pp. 991–1041, Lippincott-Raven Publishers, Philadelphia, PA
  2. Penin, F., Dubuisson, J., Rey, F., Moradpour, D., and Pawlotsky, J. (2004) Hepatology 39, 5–19[CrossRef][Medline] [Order article via Infotrieve]
  3. Pietschmann, T., and Bartenschlager, R. (2003) Clin. Liver Dis. 7, 23–43[CrossRef][Medline] [Order article via Infotrieve]
  4. Baumert, T. F., Ito, S.,