J. Biol. Chem., Vol. 282, Issue 32, 23418-23426, August 10, 2007
From the Departments of Molecular Biology and ¶Chemistry, Princeton University, Princeton, New Jersey 08544 and the Departments of Biochemistry and Pharmacology, University of Texas Southwestern Medical Center, Dallas, Texas 75390
Received for publication, May 4, 2007, and in revised form, June 7, 2007.
| ABSTRACT |
|---|
| INTRODUCTION |
|---|
The COG complex belongs to a group of multisubunit protein assemblies commonly termed "tethering complexes" (14–17). Tethering complexes are thought to act upstream of SNAREs, mediating the initial attachment of intracellular trafficking vesicles to their membrane targets. Many, if not all, tethering complexes are also Rab effectors. One model for the molecular function of tethering complexes is that they act as protein interaction hubs, orchestrating the sequential actions of Rabs and SNAREs (and potentially other proteins) during the process of vesicle docking and fusion (4).
A hurdle in critically evaluating such models is a dearth of structural information. Recently, structures of several subunits of the exocyst complex have been determined (18–23). These results are of particular relevance to COG because detectable sequence homology has been reported between regions of some exocyst and COG subunits (17, 24), although the possibility that this homology represents convergent evolution has been raised recently (25). To begin to investigate the structure of COG, we have initially focused on the Cog2p (Sec35p) subunit. The choice of Cog2p was based on several considerations. First, cog2 mutants display severe phenotypes in yeast and Chinese hamster ovary cells (1, 26). Second, a pool of soluble Cog2p exists in yeast cytosol, suggesting that free Cog2p is likely well folded (Ref. 27; but see also Ref. 24). Third, Cog2p is relatively modest in size; at 30 kDa, it is the smallest of the yeast subunits (although its mammalian ortholog is much larger at 83 kDa). Fourth, initial attempts to overproduce Cog2p for the production of antibodies (27) revealed that recombinant Cog2p was largely soluble in Escherichia coli, boding well for structural studies.
Here we report the structure of a fragment constituting a major portion of yeast Cog2p (residues 61–262), determined using multidimensional NMR. Residues 61–108, which are important for solubility in vitro and function in vivo, populate helical conformations in this Cog2p fragment but do not appear to adopt a fixed tertiary structure. The remainder of the fragment (residues 109–262) forms a six-helix bundle. The fold bears a general resemblance to exocyst subunit domains, strengthening the hypothesis that helical bundle domains are a common structural unit from which both COG and exocyst complexes are constructed (22, 28).
| EXPERIMENTAL PROCEDURES |
|---|
0.6–0.8 before adding isopropyl-β-D-thiogalactopyranoside to a final concentration of 0.5 mM. Cells were grown an additional 12–16 h at 23 °C and then harvested by centrifugation and resuspended in buffer 300 (300 mM NaCl, 20 mM Tris-HCl (pH 8.0), 2 mM dithiothreitol, 4 mM EDTA) supplemented with 1 mM phenylmethylsulfonyl fluoride. The resuspended cells were lysed using an EmulsiFlex homogenizer (Avestin); the resulting lysates were cleared by centrifugation at 24,000 × g and applied to glutathione-agarose resin (Sigma). After washing the immobilized fusion proteins sequentially with buffer 300, buffer 500 (containing 500 mM NaCl), and buffer 150 (containing 150 mM NaCl), the Cog2p moiety was released by thrombin cleavage. Thrombin was removed from the eluate using benzamidine-agarose affinity chromatography (GE Healthcare). Final purification was accomplished using size exclusion chromatography (Superdex 75 or Superose 12; GE Healthcare) in buffer 300. For isotopic labeling, cells were instead grown in M9 minimal media with 15NH4Cl and/or uniformly 13C-labeled glucose (Cambridge Isotope Laboratories) as the sole source of nitrogen and/or carbon (29). Cultures in M9 media were grown as above but were harvested 4 h after induction.
Circular Dichroism—Circular dichroism (CD) experiments were carried out using 7–15 µM protein in 1 mM potassium phosphate (pH 7.0), 100 mM KCl, 250 µM β-mercaptoethanol. Protein concentration was determined using a ninhydrin assay (30). Spectra were collected in a 0.1-cm path length quartz cuvette using Aviv 62DS or Jasco 810 CD spectropolarimeters. All wavelength scans were collected at 4 °C with 1-s averaging times and represent the average of three scans.
Generation of Yeast Expression Constructs—Plasmids were based on pSV15 (27), which contains the entire COG2 gene along with ~500 bp of genomic flanking sequence at both the 5' and 3' ends, in a pRS415 background (31, 32). Each deletion was created by PCR, using 5' primers designed to loop out a region corresponding to residues 2–60 or 2–96. The 5' primers included a HindIII site upstream of the start codon, whereas the 3' primer included a BamHI site in the 3'-flanking region. Yeast cells were co-transformed with an excess of the resulting PCR product together with pSV15 that had been digested with BsgI and AvrII to remove sequences corresponding to Cog2p residues 57–262 and a portion of the 3'-flanking region. Transformants capable of growing on SC -Leu were screened by PCR for the desired deletions, the presence of which was subsequently confirmed by DNA sequencing.
Haploid Growth Curves—Colonies from single germinated spores were grown in 5 ml of rich media (yeast extract/peptone/dextrose) overnight at 30 °C. From these cultures, 2.5 A600 units were transferred to 25 ml of fresh media in a 250-ml Erlenmeyer flask, for an initial A600 of ~0.1, and the optical density was monitored during a further 12 h of growth at 30 °C.
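The inoculation arithmetic above, and the kind of growth-rate comparison such curves support, can be sketched as follows. This is an illustrative Python sketch only: the function names are hypothetical, the doubling-time formula assumes purely exponential growth between two readings, and the example readings are invented rather than taken from the study.

```python
import math

def initial_a600(a600_units: float, volume_ml: float) -> float:
    """Starting optical density after diluting a given number of A600 units
    (A600 x ml) into a final culture volume."""
    return a600_units / volume_ml

def doubling_time_h(a600_start: float, a600_end: float, elapsed_h: float) -> float:
    """Doubling time in hours, assuming exponential growth between two readings
    (an assumption of this sketch, not a statement about the original analysis)."""
    return elapsed_h * math.log(2) / math.log(a600_end / a600_start)

# 2.5 A600 units diluted into 25 ml, as described in the text
start_od = initial_a600(2.5, 25.0)

# Hypothetical readings: an 8-fold increase over 4.5 h is three doublings
example_td = doubling_time_h(0.1, 0.8, 4.5)
```

Comparing doubling times computed this way for two strains gives a fold difference in growth rate; the readings used here are placeholders.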
NMR Sample Preparation—Purified Cog2p fragments were exchanged into 3 mM Tris-HCl (pH 7.0), 10 mM NaCl, 1 mM dithiothreitol, 0.5 mM EDTA, 5% D2O, and 0.02% sodium azide using a NAP5 column (GE Healthcare). The exchanged proteins were concentrated to ~1 mM using pre-rinsed UltraFree 4 centrifugal concentrators (Millipore).
Data Collection and Processing—All data were acquired for Cog2-(61–262) using Varian Inova spectrometers. Preliminary NMR experiments were performed at Princeton University using a 600-MHz instrument. All of the spectra used for structure determination were acquired at the Environmental and Molecular Sciences Laboratory at Pacific Northwest National Laboratory. 13C,1H HSQC, HCCH-TOCSY, (H)C(CO)NH-TOCSY, H(CCO)NH-TOCSY, HNCO, and HSQC spectra were collected on a 600-MHz instrument; CBCA(CO)NH, HNCACB, HNCO, and HSQC spectra were collected on a 750-MHz instrument; and 13C-edited and 15N-edited NOESY spectra were collected on an 800-MHz instrument. The spectral widths and number of complex points in the F3, F2, and F1 dimensions, with the number of scans per free induction decay indicated in parentheses, were: aliphatic 13C,1H HSQC, 8000 x 21128.7 Hz, 1024 x 256 (8); aromatic 13C,1H HSQC, 8000 x 4800.2 Hz, 1024 x 256 (8); HCCH-TOCSY, 8000 x 8000 x 12073.6 Hz, 1024 x 124 x 32 (8); (H)C(CO)NH-TOCSY, 8000 x 12073.7 x 2007 Hz, 1024 x 64 x 32 (32); H(C)(CO)NH-TOCSY, 8000 x 4501.3 x 2007.1 Hz, 1024 x 80 x 32 (32); HNCO, 10500.1 x 2262.3 x 2279.2 Hz, 1024 x 100 x 64 (8); CBCA(CO)NH, 10500.1 x 15078.6 x 2279.2 Hz, 1024 x 108 x 64 (32); HNCACB, 10500.1 x 15078.6 x 2279.2 Hz, 1024 x 108 x 64 (32); HSQC, 10500 x 2279.2 Hz, 1024 x 128 (8); 13C,1H NOESY-HSQC (with carbon carrier in the aliphatic region), 10999.6 x 9599.2 x 4199.9 Hz, 1024 x 256 x 64 (8); 13C,1H NOESY-HSQC (with carbon carrier in the aromatic region), 10999.6 x 9599.2 x 4499.9 Hz, 1024 x 256 x 64 (8); 1H,15N NOESY-HSQC, 10999.6 x 9600 x 2431.5 Hz, 1024 x 256 x 100 (8). Standard Protein-Pack pulse sequences were used for all experiments. Preliminary spectra were acquired at 25 °C, and spectra used for structure determination were acquired at 35 °C. Spectra were processed using NMRPipe (33) and analyzed using NMRView (34).
Data Analysis and Structure Calculation—Backbone and most side-chain resonances were assigned using gradient-enhanced HNCO, HNCACB, CBCA(CO)NH, (H)C(CO)NH-TOCSY, HCCH-TOCSY, and 13C,1H HSQC-NOESY spectra and standard assignment procedures (35–37). Aromatic side chain resonance assignments required homonuclear two-dimensional TOCSY, NOESY, and double quantum-filtered COSY experiments and a 13C,1H NOESY-HSQC experiment acquired with the carbon carrier in the center of the aromatic region. A 13C,1H HSQC experiment performed on 10% 13C-labeled Cog2-(61–262) was obtained to determine stereospecific assignments of valine and leucine methyl groups.
ϕ and ψ torsion angle restraints were predicted by the program TALOS (38) based on backbone chemical shifts. Dihedral angle predictions were restrained to 1.5 times the S.D. observed in the TALOS data base, with a minimum of 22.5°. H-bond restraints were assumed for regions of the protein exhibiting strongly helical chemical shift indices, specifically 108–128, 132–150, 159–176, 185–207, 215–242, 249–258. Structures calculated without these restraints were of similar energy and fold. Structures were calculated using CNS (39) and evaluated using CNS, AQUA, and ProcheckNMR (40). A total of 1200 structures was calculated, and the 20 structures with the lowest NOE energy were selected. Structure figures were generated using PyMOL (41).
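The restraint-width rule and the ensemble selection described above can be illustrated with a short Python sketch. It is not the authors' pipeline (which used TALOS and CNS): the function names are hypothetical, and whether the stated tolerance is a half-width or a full width around the predicted angle is not specified in the text, so it is treated here simply as "the tolerance".

```python
def dihedral_tolerance_deg(talos_sd_deg: float,
                           scale: float = 1.5,
                           floor_deg: float = 22.5) -> float:
    """Tolerance applied to a predicted dihedral angle: 1.5 x the database
    standard deviation, but never less than 22.5 degrees (values from the text)."""
    return max(scale * talos_sd_deg, floor_deg)

def lowest_energy_indices(noe_energies, n_keep=20):
    """Indices of the n_keep structures with the lowest NOE energy,
    ordered from lowest to highest energy."""
    ranked = sorted(range(len(noe_energies)), key=lambda i: noe_energies[i])
    return ranked[:n_keep]

# Hypothetical standard deviations (degrees) for two predicted angles
narrow = dihedral_tolerance_deg(10.0)   # floor applies
wide = dihedral_tolerance_deg(20.0)     # 1.5 x S.D. applies
```

Applied to a list of 1200 energies, `lowest_energy_indices(energies, 20)` would return the members of a 20-structure ensemble chosen by the criterion given in the text.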
Alignment and Structure Comparison—BLAST searching revealed three Cog2p homologs with E < 10⁻¹⁷, all of them from other fungi. The next best score was E = 0.06; iterative searching using Psi-BLAST was needed to detect more distant homologs including human Cog2p (23% sequence identity over 111 residues). ClustalW (42) was used to align S. cerevisiae Cog2p with the three other fungal homologs, Candida glabrata (40% identity over 253 residues), Ashbya gossypii (29% identity over 254 residues), and Kluyveromyces lactis (31% identity over 243 residues). The alignment figure was produced using Alscript (43). Buried residues were defined as those residues with <15% of their side chains exposed to solvent, as calculated using WHAT IF (44).
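As an illustration of how figures such as "40% identity over 253 residues" are typically derived from a pairwise alignment, and of the 15% exposure cutoff for buried residues, a minimal Python sketch follows. The function names, the convention of counting only columns where both sequences have a residue, and the toy inputs are assumptions of this sketch; the study itself relied on ClustalW and WHAT IF.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-"):
    """Return (percent identity, number of compared columns) for two aligned
    sequences of equal length, skipping columns that contain a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            identical += 1
    pct = 100.0 * identical / compared if compared else 0.0
    return pct, compared

def buried_residues(side_chain_exposure_pct: dict, cutoff_pct: float = 15.0):
    """Residue numbers whose side-chain solvent exposure is below the cutoff."""
    return sorted(r for r, e in side_chain_exposure_pct.items() if e < cutoff_pct)

# Toy alignment and exposure values, for illustration only
toy_identity = percent_identity("ACD-F", "ACE-F")
toy_buried = buried_residues({110: 3.0, 111: 42.0, 112: 14.9})
```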
The previously determined exocyst subunit structures were divided into domains according to the description of each structure in the corresponding original report (18–23). Pairwise Z scores and root mean square deviations for each domain comparison were then calculated using DaliLite (45).
| RESULTS |
|---|
85%) α-helical (Fig. 1A). Our efforts to produce diffraction quality crystals of the full-length protein were, however, unsuccessful. To identify large fragments that might constitute more favorable targets for structural analysis, we subjected recombinant Cog2p to limited proteolysis using a battery of nonspecific proteases. Two cleavage products appeared, based on their electrophoretic mobility, to be produced consistently by several of the proteases. These species, identified by N-terminal sequencing and mass spectrometry, differed only at their N termini: Cog2-(56–262) and Cog2-(97–262). Next, we overproduced each of these Cog2p fragments as a recombinant protein in E. coli. Both fragments, like the full-length protein, were highly α-helical (Fig. 1A).
While scaling up production of Cog2-(56–262), we observed that it precipitated at concentrations greater than ~1 mg/ml. A more soluble variant was produced by eliminating five predominantly hydrophobic residues (His-Tyr-Leu-Pro-Leu) to generate Cog2-(61–262), and this variant was overexpressed and purified to >95% homogeneity. Its CD spectrum was indistinguishable from that of Cog2-(56–262) (Fig. 1A). Importantly, Cog2-(61–262) remained soluble and monomeric at concentrations in excess of ~20 mg/ml (1 mM), as judged by gel filtration and dynamic light scattering (data not shown). Because of its excellent solution properties, Cog2-(61–262) became the subject of most of the subsequent studies described here. This fragment contains 77% of the full-length Cog2p subunit, including almost all of the "conserved amphipathic helical region" (residues 60–125) identified by Whyte and Munro (17, 24) near the N terminus of several COG, exocyst, and Golgi-associated retrograde protein (GARP) subunits.
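The two numerical statements in this paragraph (20 mg/ml corresponding to roughly 1 mM, and the fragment covering 77% of the subunit) can be checked with a small Python sketch. The function names are hypothetical; the 23.4-kDa mass and the 262-residue length are the values quoted elsewhere in the article.

```python
def mg_per_ml_to_mM(conc_mg_per_ml: float, mass_kda: float) -> float:
    """Convert a mass concentration to millimolar.
    mg/ml is g/L and kDa is kg/mol, so g/L divided by kg/mol gives mmol/L."""
    return conc_mg_per_ml / mass_kda

def fragment_fraction(first_res: int, last_res: int, full_length: int) -> float:
    """Fraction of the full-length protein covered by an inclusive residue range."""
    return (last_res - first_res + 1) / full_length

conc_mM = mg_per_ml_to_mM(20.0, 23.4)       # about 0.85 mM, i.e. roughly 1 mM
coverage = fragment_fraction(61, 262, 262)  # 202 of 262 residues
```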
Although Cog2-(61–262) formed large crystals, they did not diffract beyond 8 Å resolution, precluding x-ray structure determination. At 23.4 kDa, however, Cog2-(61–262) presented a potential target for structure determination by multidimensional NMR. The dispersion and relatively uniform intensity of cross peaks in 1H,15N HSQC spectra confirmed that Cog2-(61–262) is folded and stable (Fig. 2). The size and high α-helicity of the protein gave rise to severe spectral overlap. Nonetheless, by using three-dimensional spectra to resolve ambiguities, it proved possible to make backbone resonance assignments for 182 non-proline residues (91% completeness).
Comparison of the Cα chemical shifts observed for Cog2-(61–262) with those of a random coil revealed six unambiguously α-helical regions: amino acids 107–127, 133–153, 158–177, 184–209, 217–243, and 249–257 (Fig. 3). The extent of α-helical structure is in agreement with the CD spectra (Fig. 1A). The chemical shift data, moreover, appear generally consistent with the proposal by Whyte and Munro (24) that the conserved amphipathic helical region has two helices (Cog2p residues 60–82 and 92–125) separated by an extended loop.
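The comparison of observed and random-coil chemical shifts described above amounts to computing a secondary shift for each residue and looking for runs of consistently positive values. The Python sketch below illustrates that idea only. The 0.7-ppm threshold and the minimum run length of four residues are assumptions borrowed from common chemical shift index practice, not values stated in this article, and the input numbers are invented.

```python
def secondary_shift(observed_ppm: float, random_coil_ppm: float) -> float:
    """Secondary chemical shift: observed value minus the random-coil reference."""
    return observed_ppm - random_coil_ppm

def helical_runs(shifts, threshold_ppm=0.7, min_len=4):
    """Find runs of consecutive residues whose secondary shift is at or above
    threshold_ppm. `shifts` is a list of (residue_number, secondary_shift_ppm)
    in sequence order with contiguous numbering. Returns (first, last) tuples."""
    runs = []
    start = last = None
    for resnum, delta in shifts:
        if delta >= threshold_ppm:
            if start is None:
                start = resnum
            last = resnum
        else:
            if start is not None and last - start + 1 >= min_len:
                runs.append((start, last))
            start = None
    # Close a run that extends to the end of the sequence
    if start is not None and last - start + 1 >= min_len:
        runs.append((start, last))
    return runs

# Invented secondary shifts for seven consecutive residues
toy = [(1, 0.1), (2, 1.0), (3, 1.2), (4, 0.9), (5, 1.1), (6, 0.0), (7, 2.0)]
toy_runs = helical_runs(toy)
```

With real data, the residue ranges returned would be compared against helix boundaries such as those listed in the text.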
α-helices (Fig. 4). Data base analysis using the program Dali (46) revealed that the most highly homologous structure (Z = 9.1) among proteins currently listed in the Protein Data Bank is domain I of the pore-forming Bacillus thuringiensis toxin Cry1A(a) (47). Of more likely functional relevance is the observation that four exocyst subunits for which structures are available are found among the top 100 Dali hits: Sec6 (Z = 7.3), Exo84 (Z = 6.0), Exo70 (Z = 5.7), and Sec15 (Z = 5.6). Other helical bundle protein families also figure prominently among the proteins with the 100 highest Dali scores. These families include SNAREs (Tlg1p, syntaxin-1A, Sso1p, Vam3p), guanine nucleotide exchange factors (Tiam1, son of sevenless protein, leukemia-associated RhoGEF, intersectin, collybistin II, Dbs, and Vav), and nuclear import/export proteins (importin α, importin β, Crm1/exportin 1, and karyopherin β2/transportin).
Because of the potential functional homology between the COG and exocyst complexes, we compared the known subunit structures domain by domain. A matrix of Dali scores (Fig. 5A) reveals that Cog2-(109–262) resembles many of the exocyst subunit domains as closely as they resemble one another. Because helical bundles are a common fold, however, this resemblance does not by itself establish a definitive connection between COG and exocyst complexes.
Cog2p primary amino acid sequences are highly divergent across species. Nonetheless, alignment of Cog2p with three other fungal Cog2p sequences (pairwise sequence identity with S. cerevisiae Cog2 29–41%; see "Experimental Procedures") revealed 16 residues that are strictly conserved across all four sequences and 32 more that are similar (Fig. 4C). The majority of the conserved residues present in the NMR structure are buried (black triangles in Fig. 4C), suggesting that they play a largely structural role. In particular, no region of the protein surface displays a significant clustering of conserved residues. Close examination of the protein surface does, however, reveal two distinctive features: a broad acidic stripe across one end of the bundle (Fig. 5B) and a hydrophobic groove formed by the C-terminal portion of Cog2p (Fig. 5C). These features constitute potential protein-protein interaction surfaces.
| DISCUSSION |
|---|
immunoglobulin-like fold bearing no resemblance to the α-helical bundles observed for the other exocyst subunits and Cog2p. In the majority of cases, however, the N-terminal regions appear recalcitrant to structural characterization. It is possible that they are poorly ordered in the absence of the other subunits of the complex. Alternatively, like some SNARE proteins, they might be "natively unfolded."
The six-helix bundle structure of Cog2-(109–262) does not have striking conserved surface features to guide functional experiments. Indeed, the majority of the amino acid residues conserved among fungal Cog2 subunits are buried, suggesting that they play a role in maintaining structure and stability. A large acidic patch is evident on the surface of the protein (Fig. 5B); however, the residues composing it are not particularly well conserved (Fig. 4C). A second potentially important feature of the Cog2p structure is a groove, formed largely by the fourth and fifth α-helices, where a number of hydrophobic residues are at least partially exposed (Fig. 5C). The majority of the residues contributing to this groove are hydrophobic in all of the aligned sequences. However, genetic evidence suggests that this groove cannot be required for the essential function of Cog2p. A temperature-sensitive mutation in Cog2p (sec35-1) results in the conversion of Tyr-195 to a stop codon (Fig. 4C), removing virtually all of the residues that contribute to the hydrophobic groove. Nonetheless, the sec35-1 strain displays no growth defect at temperatures of 21–30 °C (27). Overall, therefore, we were unable to identify conserved surface features on Cog2-(109–262) that are essential for its function. Although we cannot rule out that the surface of Cog2p has evolved in conjunction with its functional partners, compromising our ability to detect protein-protein interaction sites through the identification of conserved surfaces, it appears likely that this domain plays a fundamentally structural role in the COG complex.
Our functional studies suggest that essential regions of Cog2p are located more N-terminally. Whyte and Munro (24) identified a weakly conserved amphipathic helical region within Cog2p (residues 60–125) and several other subunits of the COG, exocyst, and GARP complexes. Our results are consistent with the prediction that residues 60–82 and 92–125 form helices, although the apparent lack of fixed tertiary structure for residues 61–108 means that we are reliant on chemical shift data alone (Fig. 3) to make helix assignments within this region. In vivo, deleting residues 1–60 compromised but did not abolish function, slowing growth ~3-fold (Fig. 1B), whereas deleting residues 1–97 was lethal. The deleted residues may be important for interaction with another COG subunit and, therefore, for the structural integrity of the complex. Alternatively, or in addition, this region may be important for the interaction between COG and another protein with which it functionally collaborates (e.g. a COPI subunit, a Rab protein, or a SNARE). In either case, the intrinsic helicity of the region suggests that it may retain a helical structure in its complexed state. It is interesting to note that both SNAREs and Rab proteins generally recognize helical regions in their functional partners.
Previous reports (18–23) had demonstrated that four different subunits of the exocyst complex all contain helical modules, resulting in extended structures or rods (28). The six-helix bundle of Cog2-(109–262) resembles these helical modules (Fig. 5A), but the presence of only a single module makes the architectural similarity with the exocyst somewhat uncertain. It is worth noting in this regard that the Cog2 subunit is frequently much larger, especially in higher eukaryotes but also in some other fungi, than it is in S. cerevisiae. By analogy with exocyst subunits, it is possible that orthologs of Cog2p contain two (or more) helical bundles. Overall, although further structural investigations are clearly essential, the finding that yeast Cog2p, and presumably its orthologs in higher eukaryotes, contain one or more exocyst-like helical bundles provides an indication of architectural similarity between COG and exocyst complexes that complements indications of functional similarity.
The transport protein particle (TRAPP) complexes I and II are multisubunit tethering complexes essential for trafficking to the Golgi apparatus (52–54). The TRAPP complexes lack any similarity to the COG or exocyst complexes. Several TRAPP subunits do, however, bear a structural resemblance to the "longin" domains found at the N terminus of vesicle (v-)SNAREs including Ykt6p, Sec22b, and Nyv1p (55, 56). This observation led to the suggestion that TRAPP may play a role in SNARE assembly or function. Cog2-(109–262), by contrast, bears a structural resemblance to target membrane (t-)SNAREs with N-terminal helical bundle regulatory domains. Indeed, three of the top eight scores in a Cog2-(109–262) Dali search were achieved by SNAREs (syntaxin 1A; Protein Data Bank (PDB) code 1dn1) or their isolated N-terminal domains (Tlg1p, PDB code 2c5i, and syntaxin-12, PDB code 2dnx). Perhaps during the sequence of events that accompany SNARE assembly and function, helical bundle tethering complexes displace or interact with helical bundle SNARE regulatory domains.
It is intriguing to speculate that exocyst/COG helical bundle subunits are tailored to interact with SNAREs that contain helical bundle domains, whereas TRAPP complex longin domain subunits are tailored to interact with SNAREs that contain longin domains. This hypothesis would explain the observation that tethering protein complexes and SNARE regulatory domains both appear to fall into two main classes. Future efforts to test these and other models will be crucial in understanding the mechanistic basis for tethering complex function.
In conclusion, we have taken an initial step in elucidating the structure, and, ultimately, the molecular function, of the COG complex by determining the NMR structure of the folded core of the subunit, Cog2p. At 23 kDa, this core fragment of a single subunit represents a small part of the entire complex (515 kDa, assuming one copy of each subunit (24)). Nonetheless, its structure reveals an unanticipated similarity with each of the recently determined structures of individual exocyst complex subunits. Our results add new support to the hypothesis that the exocyst and COG complexes share some general structural, and possibly functional, features.
| FOOTNOTES |
|---|
Experimental NMR chemical shift and restraint data have been deposited in the Biological Magnetic Resonance Data Bank (www.bmrb.wisc.edu) with the accession number 15290.
* This work was supported by a National Science Foundation Minority Postdoctoral Fellowship (to L. F. C.), by an American Heart Association grant-in-aid (to F. M. H.), and by National Institutes of Health Grants GM071574 (to F. M. H.) and NS37200 (to J. R.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 Current address: Department of Biological Sciences, Columbia University, New York, NY 10027.
2 To whom correspondence should be addressed. Tel.: 609-258-4982; Fax: 609-258-6730; E-mail: hughson{at}princeton.edu.
3 The abbreviations used are: COG, conserved oligomeric Golgi; SNARE, soluble N-ethylmaleimide-sensitive factor attachment protein receptor; TOCSY, two-dimensional total correlation spectroscopy; HSQC, heteronuclear single quantum correlation; TRAPP, transport protein particle; NOE, nuclear Overhauser effect.
4 The original genome annotation misplaced the start codon 39 bases upstream, with the consequence that earlier literature refers to a 275-residue protein with shifted residue numbering.
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|