![]()
|
|
||||||||
J. Biol. Chem., Vol. 279, Issue 41, 42881-42888, October 8, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||








From the
Department of Biological Chemistry, The Weizmann Institute of Science, Rehovot 76100, Israel, the
Department of Biotechnology and Food Engineering, and Institute of Catalysis Science and Technology, Technion, Israel Institute of Technology, Haifa 32000, Israel, the ¶Bioénergétique et Ingéniérie des Protéines, Centre National de la Recherche Scientifique, IBSM, 13402 Marseille, France, the ||Université de Provence, 3 Place Victor Hugo, 13331 Marseille, France, the **Zephyr ProteomiX, Kiryat-Shemona 11013, Israel, and the 
Department of Molecular Microbiology and Biotechnology, Tel-Aviv University, Ramat Aviv 69978, Israel
Received for publication, July 1, 2004 , and in revised form, July 23, 2004.
| ABSTRACT |
|---|
|
|
|---|
20% capacity of the wild-type interaction. These residues represent but 3 of the 16 contact residues that participate in the cohesin-dockerin interaction. This approach allowed us to differentiate, in a structure-independent fashion, between residues critical for interspecies recognition and binding residues per se. | INTRODUCTION |
|---|
|
|
|---|
Crystal structures of cohesins from Clostridium thermocellum and from Clostridium cellulolyticum were first reported, from which the overall shape and topology were determined (810). On the basis of various analyses of the cohesin surface (comparative bioinformatics, homology modeling, electrostatic potential), the approximate location of the dockerin-binding surface was proposed (8). The corresponding face of the proposed binding surface comprised contiguous
-strands 8, 3, 6, and 5 of the all-
jelly-roll structure. The general fold is representative of an important type of stable protein fold that characterizes different families of protein modules, including carbohydrate-binding modules and certain carbohydrases (1113). On the dockerin counterpart, a series of putative recognition determinants were examined by site-directed mutagenesis along with secondary structure predictions of the dockerin fold (7, 14, 15). The subsequent solution structure of the dockerin molecule corroborated the structure predictions, and a similar set of residues was proposed to participate in the binding interaction with the cohesin molecule (16, 17). Bioinformatics-based site-directed mutagenesis studies of the cohesin module (1820) failed to clarify the cohesin residues that play a definitive role in dockerin binding.
The very high affinity of both cohesins (KD < 10-10 M) to their matching dockerins (15, 2123) suggests that numerous intermodular interactions occur between the cohesin and dockerin. On the other hand, we reasoned that not all of the interactions would be necessary for interspecies specificity. We therefore embarked upon an alternative approach, designed to explicitly provide insight into the distinctive binding characteristics of the cohesin module, with the intention of identifying residues that confer species specificity. The approach used in this work involved progressive swapping of corresponding homologous stretches of the cohesin modules from C. thermocellum and C. cellulolyticum. The resultant chimeric cohesins were then screened using an enzyme-linked affinity assay and complementary protein microarray to provide insight into the residue(s) involved in the binding specificity as opposed to general binding. The progressive nature of this approach stems from the linkage between the experimentally determined specificity of the interaction and the identity of the homologous segments. With each successive generation of homologue swapping, the reactive segments were constricted until only three selected residues of the C. cellulolyticum cohesin were mutated, resulting in a viable cohesin mutant that preferentially recognized the rival dockerin. During the course of this study, a crystal structure for the cohesin-dockerin heterodimer was elucidated from the C. thermocellum cellulosome (24). The putative specificity residues implicated in the present study represent but 3 of the 16 contact residues that participate in complex formation. The results of the present study may allow us to differentiate between residues critical for interspecies recognition and binding residues per se.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Construction of Overexpression PlasmidsThe dockerin modules of celA of C. cellulolyticum (DocC) and celS of C. thermocellum (DocT) were amplified using the primers DA-F/DA-R and DS-F/DS-R, respectively (the sequences of the primers are listed in Table SI, see Supplementary Material). Both PCR products were digested with KpnI and BamHI and cloned separately into the KpnI/BamHI-linearized plasmid pMalc2e (New England Biolabs), which resulted in the plasmids, pMalc2e-docC and pMalc2e-docT, respectively.
In order to prepare an expression vector for production of CBM fusion proteins, the gene encoding for the C. thermocellum CBM3 module (4) was first inserted upstream of the multiple cloning site of pET28a. The cbm3 gene was amplified using the primers CM-F/CM-R (Table SI, see Supplementary Material). The desired PCR fragment was cleaved using BspHI and NcoI and ligated into an NcoI-linearized pET28a plasmid (Novagen, Madison, WI). The clone with cbm3 in the correct orientation was identified by sequencing and named plasmid pET28a-cbm3.
The genes encoding for the C. cellulolyticum and C. thermocellum cohesin modules were amplified using primers Fc/Rc and Ft/Rt, respectively (Table SI, see Supplementary Material). The PCR fragments were cut with NcoI and XhoI and ligated into a NcoI/XhoI-linearized pET28a-cbm3 plasmid, resulting in plasmids pSW-cohC and pSW-cohT, respectively.
Swapping was performed using overlap-extension PCR. The genes coding for the resultant chimeric cohesins of the four consecutive rounds were prepared using appropriate primers as listed in Table SI (see Supplementary Material). For each chimera, two (and in some cases three) successive PCR reactions were required as described in Table SII (see Supplementary Material). The PCR fragments were cleaved and ligated, as described above, into a linearized pET28a-cbm3 plasmid.
Expression and Purification of Recombinant ProteinsMBP-DocC and MBP-DocT were expressed and purified according to the pMALTM Protein Fusion and Purification System: Instruction Manual of New England Biolabs. In this experiment, expression was induced using 0.5 mM isopropyl-1-thio-
-D-galactopyranoside and the TBS column buffer consisted of 137 mM NaCl, 2.7 mM KCl, 25 mM Tris-HCl, pH 7.4. After elution, the proteins were divided into aliquots and stored at -20 °C.
The expression and purification of CBM-cohesin fusion proteins was accomplished as described previously (4) with minor modifications. Induction was initiated with 0.5 mM isopropyl-1-thio-
-D-galactopyranoside. The harvested cells were resuspended in phosphate-buffered saline (137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4, 2 mM KH2PO4, pH 7.4). The fusion proteins, bound to the microcrystalline cellulose, were washed three times with phosphate-buffered saline containing 1 M NaCl and three times with phosphate-buffered saline in the absence of NaCl. Elution and neutralization were done accordingly, and the proteins were stored in 50% (v/v) glycerol at -20 °C. E. coli BL21 pLysS(
DE3) was used as a host for the expression of CBM-cohesin fusion proteins and the MBP-dockerin fusion proteins.
ELISANunc-ImmunoTM 96 MicroWellTM plates (Nalge Nunc International, Roskilde, Denmark)) were coated by adding 5 µg/ml MBP-DocC and MBP-DocT in 0.1 M Na2CO3 (pH 9) and were incubated overnight at 4 °C. Subsequent steps were carried out at room temperature. Blocking was done by incubating the wells with TBSCB (TBS + 10 mM CaCl2 + 2% bovine serum albumin) for 1 h. CBM-cohesin was then added at different concentrations (1100 ng) in TBSCB (100 µl/well) and incubated for 1 h. The wells were washed three times with TBSCT (TBS + 10 mM CaCl2 + 0.05% Tween 20). Monoclonal anti-His tag antibody (Serotec) was added (1:1000) in TBSCB (100 µl/well) and incubated for 1 h. The wells were washed again as above, and peroxidase-conjugated goat anti-mouse IgG (Jackson Immunoresearch Laboratories, West Grove, PA) was added (1:5000) in TBSC (100 µl/well). The plates were incubated for 1 h and washed again. Detection was achieved by adding 100 µl/well of commercially prepared TMB reagent (3,3',5,5'-tetramethylbenzidine; Dako, Glostrup, Denmark). The reaction was terminated by adding 1 M H2SO4 (50 µl/well), and the absorbance was measured at 450 nm using a tunable VERSAmax microplate reader (Molecular Devices, Sunnyvale, CA).
Protein MicroarrayProtein microarray was performed according to Ofir et al.2 Briefly, the CBM-cohesin chimeras were diluted in TBS to a final concentration of 50 and 100 µg/ml, arrayed into a 384 well plate (Genetix, New Milton, Hampshire, UK) and then printed onto a cellulose slide (Zephyr ProteomiX, Kiryat Shmonah, Israel) using a Biorobotics TAS arrayer (Genomic Solutions, Huntingdon, UK) equipped with 16 0.2-mm solid pins. MBP-DocC was labeled with Cy3 mono-reactive dye, and MBP-DocT and polyclonal rabbit anti-CBM were labeled with Cy5 mono-reactive dye (Amersham Biosciences, Freiburg, Germany) as recommended by the manufacturer. Unconjugated dye was removed using a Slide-A-Lyzer Mini Dialysis Unit (Pierce). The slides were blocked in TBSCB for 1 h at room temperature. The following step was carried out in the dark at room temperature. The slides were incubated with both of the fluorescence-labeled probes, MBP-DocC (2.5 µg/ml) together with MBP-DocT (2.5 µg/ml), or the fluorescence-labeled rabbit
-CBM (1:400) in 4 ml of TBSCB for 1 h. The cells were then washed four times with TBSCT for 5 min. The slides were dried and scanned using a Scanarray 4000 (PerkinElmer Life Sciences).
| RESULTS |
|---|
|
|
|---|
First Generation SwappingSix chimeric proteins were constructed using overlap-extension PCR. The conserved amino acid pairs highlighted in black in Fig. 1 were used as sites to divide the proteins into three. Four of the six chimeras (Sw1-1 to Sw1-4) bind selectively to the C. thermocellum dockerin with little or no cross-reactivity with the C. cellulolyticum dockerin (Fig. 2B). Sw1-5 is the only chimera of the first round that binds preferentially to the C. cellulolyticum dockerin. In contrast, Sw1-6 exhibits very little, if any, reactivity with either of the two dockerins.
|
|
Second Generation SwappingIn this round, the first and last third of the cohesin molecule were each split into two parts, and the various combinations were prepared. As shown in Fig. 3A, two chimeric proteins were based on Sw1-1, eight chimeric proteins were based on Sw1-2 and another eight were based on Sw1-5. The amino acid pairs highlighted in gray in Fig. 1 were used to divide the first and last third into two.
|
Sw1-2 was also used as a template to determine the role(s) of smaller portions of the identified specificity regions (Fig. 3A, b). Of the eight chimeric proteins based on this template, only the constructs containing both the second sixth and the last sixth (Sw2-4, Sw2-8, and Sw2-10) showed significant levels of binding to the C. thermocellum dockerin (Fig. 3B, b).
The binding affinity of the eight chimeric proteins based on Sw1-5 (Fig. 3A, c) further strengthened the contention that the second and last sixth are involved in the affinity for the C. thermocellum dockerin. Only constructs containing one or both of these sixths (Sw2-12, Sw2-16, and 2-18) exhibit this preference (Fig. 3B, c). Interestingly, Sw2-15 and Sw2-17 contain one of the two sixths, but do not comply with this rule. Again, as described above for chimera Sw1-6, the presence of incompatible segments derived from the rival species may serve to prevent conversion of specificity.
Sw2-11, Sw2-13, and Sw2-14 exhibit low levels of binding for the C. cellulolyticum dockerin (Fig. 3B, c). As above, these results can be explained on the basis of the inferior affinity characteristics for the cohesin-dockerin interaction of C. cellulolyticum versus those of C. thermocellum (21, 22). Despite the relatively modest affinity observed for these chimeras, the findings also support the premise that in C. cellulolyticum, the relevant region resides within the second and last sixth of the protein.
Third Generation SwappingIn the previous round, the replacement of complementary short stretches of the C. cellulolyticum cohesin on a background of C. thermocellum failed to generate significant affinity toward the C. cellulolyticum dockerin. On the basis of these results, we restricted our subsequent efforts toward unidirectional conversion of affinity from C. cellulolyticum to C. thermocellum. Toward this end, the second and last sixth of the two cohesins were split in three with overlapping regions as depicted in Fig. 4. Fig. 5A illustrates the combinations of cohesin segments used to uncover the successive trail of specificity between the two species; three chimeric proteins were based on Sw2-2, six others on Sw2-10, and another three were based on Sw2-4.
|
|
Fourth Generation SwappingAt this point in our studies, the structure of the cohesin-dockerin heterodimer from C. thermocellum was brought to our attention,3 and we were able to benefit from the consequent knowledge of the overall complement of contact residues. Prior to this stage, the selection of segments was based solely on the aforementioned criteria and/or as a response to the experimental results.
Examination of the crystal structure of the C. thermocellum cohesin-dockerin complex (24) revealed only three dockerin-binding contact residues that were located within the central segments of the cohesin chimeras, as designated in the third generation of swapping. These include Ala-36, Asn-37 of the
3 strand and Glu-131 of the loop separating
-strands 8 and 9 (Fig. 4). In addition, a gap occurs relative to the cohesin of C. cellulolyticum (Fig. 4). The gap and the suspected recognition residues were thus swapped either alone or combined in the present round. The designated residues in C. cellulolyticum were modified by site-directed mutagenesis to the corresponding residues in C. thermocellum. As shown in Fig. 6A, four chimeric proteins were based on Sw3-5, one chimeric protein was based on Sw3-6, three chimeric proteins were based on Sw3-11 and another four were chimeric proteins distinguished by alterations in single residues.
|
In the three chimeric proteins based on Sw3-11, the first third of C. thermocellum was maintained to support the mutations in the second segment. In Sw4-6, TMS (residues 125127 of the C. cellulolyticum cohesin) were swapped with DLV; in Sw4-7, SKI (residues 127129) were swapped with VEQ, and in Sw4-8 Lys-128 with Glu. The results show that at the C-terminal portion of the C. thermocellum cohesin, Glu-131 is a critical determinant for dockerin recognition (Fig. 6B, c). Interestingly, the single mutation exhibited an elevated interaction with the C. thermocellum dockerin, compared with that of Sw3-11, which comprised a greater portion of the C. thermocellum sequence.
All of the final four chimeric proteins included the critical K128E mutation, together with combinations of the
M31, G35A, and T36N mutations. The results (Fig. 6B, d) demonstrate that by changing only three residues of the C. cellulolyticum cohesin (G35A, T36N, and K128E), the binding preference of recognition was converted to that of the C. thermocellum dockerin instead of its own. Approximately 20% of the wild-type binding activity (compared with that of the C. thermocellum cohesin-dockerin interaction) was observed for the cross-species binding, and about 5% of the original wild-type binding remained for the C. cellulolyticum dockerin. If the T36N was lacking, the resultant C. cellulolyticum mutant showed preference for its own dockerin; in the absence of the G35A mutation, only low levels of activity were observed with negligible, if any, preference for the cross-species interaction.
Simultaneous Analysis of the Four Rounds of Swapping Using Protein MicroarrayIn order to validate the above-described results obtained with the ELISA assay, we established a protein microarray assay, based on selective binding of the CBM component of the chimeras to a cellulose-containing surface.2 The latter protocol also allowed us to determine simultaneously the specificity of all the cohesin-containing chimeras toward both species of dockerin probe on a single microarray. Besides the simultaneous detection of binding to both dockerin-containing probes, this approach was complementary in format to the ELISA assay, in that the cohesin-containing chimeras were now the immobilized modules in contrast to immobilization on ELISA plates of the dockerin-containing MBP fusion proteins.
In this protocol, all of the chimeras were printed on a cellulose slide, which was incubated simultaneously with Cy3-labeled (red) MBP-DocC and Cy5-labeled (green) MBP-DocT (Fig. 7). We confirmed that the chimeras were printed in equivalent quantities by incubating a chip from the same batch with Cy5-labeled anti-CBM antibodies (data not shown).
|
| DISCUSSION |
|---|
|
|
|---|
Previous research has determined several residues of the dockerin domain involved in species specificity (7, 14, 15, 31). While there was no prior knowledge of the link between protein sequence and function of the dockerin module, the relatively small size and the conserved patterns within the dockerin sequences enabled a rational mutagenesis approach based on comparative bioinformatics. In this context, we converted the specificity of one dockerin species to that of the other (i.e. from C. thermocellum to C. cellulolyticum) by exchanging the residues in suspect (14, 15). The concept behind this approach is that successful and exclusive conversion from one species to the other would intrinsically preclude any ambiguity regarding improper folding or nonspecific interaction.
In the case of the cohesin module, application of a similar rational-based approach failed to determine the residues responsible for species-specificity (1820). In this case, a clear pattern could not be observed as obtained for the alignments of the dockerin domains (data not shown). Various alternative approaches were thus considered for gaining general insight into the binding characteristics of the cohesin module, including alanine-scanning mutagenesis (32), random point mutagenesis and DNA shuffling (33). In addition to practical obstacles, the resultant cohesin mutants would be subjected to screening on the basis of selective binding to dockerins of divergent specificity. Such screening, however, would likely be inadequate for our purposes, since this approach would select only for the strongest binders to a given dockerin. Desired chimeras, containing minimal numbers of specificity-determining residues, are not necessarily the strongest or exclusive binders.
In view of the above considerations, we explored a different strategy involving progressive homologue swapping, which is a derivative of homologue scanning as described previously (32). The strategy we pursued, however, involved progressive refinement of the regions of interest. The relevant activity (binding to the divergent dockerins) was assessed following consecutive rounds of swapping. Consequently, the approach is independent of preconceived notions and becomes more rational with each round of swapping. In fact, prior knowledge of the structure of the protein is not entirely necessary, although in the later rounds of swapping knowledge of the structure enabled us to focus more intelligently on salient regions or residues. Moreover, this approach precludes the necessity of screening large libraries of mutants; in contrast to conventional DNA shuffling, there is no need for high sequence identity between parent strains. Furthermore, this approach allows us to address the property of differential recognition. Although sequence homology between the cohesin modules of C. thermocellum and C. cellulolyticum is relatively low (32% identity), the structure of the cohesins is well preserved, due to the common fold and conservation of key hydrophobic protein core residues. Thus, progressive homologue swapping of the cohesin molecule should result in viable chimeric species.
The progression of the common denominators that appeared responsible for the observed activity is shown schematically in Fig. 8. After three rounds of swapping, we narrowed down the region of interest into two segments of eight and twelve amino acids of the
150 amino acids, located mainly on one side of the cohesin and arranged primarily on the
3 strand and the loop bridging
-strands 8 and 9 (Fig. 4). The analysis of the C. thermocellum cohesin-dockerin structure revealed that, of the 16 known contact residues (24), only two hydrogen-bonding residues (Asn-37 and Glu-131) and one hydrophobic contact residue (Ala-36) of the cohesin were situated within these two segments (Fig. 9 and Table I). In fact, Asn-37 and Glu-131 both form direct side-chain-to-side-chain hydrogen bonds with the major dockerin-based residues (Ser-45 and Thr-46, respectively), which were previously implicated in species specificity (7, 14). The involvement of the designated cohesin residues in the interspecies recognition is thus strongly supported by the cross-species binding to the C. thermocellum dockerin of the C. cellulolyticum cohesin, in which only the three residues are mutated (G35A, T36N, and K128E). The latter three residues may constitute a minimal number of residues that would effect cross-species interaction. This finding corroborates one of our original assumptions in this work that the two species of cohesin are structurally analogous and can serve as templates for homologue swapping and mutagenesis experiments. It is extraordinary that the additional contact residues appear not to be decisive for species specificity, despite the general lack of conservation, between the two species of cohesin, in the relevant regions of the molecular surface.
|
|
|
Ultimately, we hope to determine the precise elements that control the specificity of interaction between cohesins and dockerins in general. Such an accomplishment would have potential biotechnological value, since a graded range of affinities among affinity partners would serve in various types of applications. The cohesin and dockerin domains are particularly appropriate for the production of complementary hybrid proteins, since both operate naturally as functional modules in parent protein pairs. Further investigation into the biorecognition properties of the cohesin-dockerin interaction in general is currently in progress.
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains Supplementary Material. ![]()

To whom correspondence should be addressed: Dept. of Biological Chemistry, The Weizmann Institute of Science, Rehovot 76100 Israel. Tel.: 972-8-934-2373; Fax: 972-8-946-8256; E-mail: ed.bayer{at}weizmann.ac.il.
1 The abbreviations used are: CBM, carbohydrate binding module; ELISA, enzyme-linked immunosorbent assay; MBP, maltose-binding protein; PDB, Protein Data Bank. ![]()
2 K. Ofir, Y. Berdichevsky, I. Benhar, R. Azriel-Rosenfeld, R. Lamed, Y. Barak, E. A. Bayer, and E. Morag, manuscript in preparation. ![]()
3 H. J. Gilbert, personal communication. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. J. Adams, G. Pal, Z. Jia, and S. P. Smith Mechanism of bacterial cell-surface attachment revealed by the structure of cellulosomal type II cohesin-dockerin complex PNAS, January 10, 2006; 103(2): 305 - 310. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||