JBC

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M304750200 on June 4, 2003

J. Biol. Chem., Vol. 278, Issue 34, 31988-31997, August 22, 2003
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
278/34/31988    most recent
M304750200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Foreman, P. K.
Right arrow Articles by Ward, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Foreman, P. K.
Right arrow Articles by Ward, M.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Transcriptional Regulation of Biomass-degrading Enzymes in the Filamentous Fungus Trichoderma reesei*,

Pamela K. Foreman {ddagger}, Doug Brown §, Lydia Dankmeyer, Ralph Dean §, Stephen Diener §, Nigel S. Dunn-Coleman, Frits Goedegebuur, Thomas D. Houfek §, George J. England, Aaron S. Kelley, Hendrik J. Meerman, Thomas Mitchell §, Colin Mitchinson, Heather A. Olivares, Pauline J. M. Teunissen, Jian Yao and Michael Ward

From the Genencor International, Inc., Palo Alto, California 94304 and the §Fungal Genomics Laboratory, North Carolina State University, Raleigh, North Carolina 27695-7251

Received for publication, May 7, 2003 , and in revised form, June 3, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The filamentous fungus Trichoderma reesei produces and secretes profuse quantities of enzymes that act synergistically to degrade cellulase and related biomass components. We partially sequenced over 5100 random T. reesei cDNA clones. Among the sequences whose predicted gene products had significant similarity to known proteins, 12 were identified that encode previously unknown enzymes that likely function in biomass degradation. Microarrays were used to query the expression levels of each of the sequences under different conditions known to induce cellulolytic enzyme synthesis. Most of the genes encoding known and putative biomass-degrading enzymes were transcriptionally co-regulated. Moreover, despite the fact that several of these enzymes are not thought to degrade cellulase directly, they were coordinately overexpressed in a cellulase overproducing strain. A variety of additional sequences whose function could not be ascribed using the limited sequence available displayed analogous behavior and may also play a role in biomass degradation or in the synthesis of biomass-degrading enzymes. Sequences exhibiting additional regulatory patterns were observed that might reflect roles in regulation of cellulase biosynthesis. However, genes whose products are involved in protein processing and secretion were not highly regulated during cellulase induction.



View larger version (50K):
[in this window]
[in a new window]
 
FIG. 2.
Genome-wide regulation of gene expression by sophorose. A, samples and microarrays are as described in Fig. 1B. The indicated genes were clustered according to the log ratio: expression with sophorose relative to glycerol alone in RLP-37 (column 1), expression with sophorose relative to glycerol alone in QM6a (column 2), and the relative expression levels in RLP-37 relative to QM6a when both strains were grown in the presence of sophorose (column 3). The mean log ratios among replicate microarrays from triplicate cultures were determined and are displayed according to the color bar below. B, clustered data from the entire data set. Columns are as in A. C, the data shown in Fig. 2B was filtered to obtain the sets of ESTs with a log ratio of 0.3 or more in each of the experiments. The sets were compared to identify ESTs that were common and distinct among the sets. Numbers refer to the number of ESTs in the set. Superscripts are used to identify the sets, the full content of which are available in the Supplemental Material. D, the data shown in Fig. 2B were filtered to obtain the sets of ESTs displaying a log ratio of –0.3 or less in each of the experiments. The sets were compared as in C.

 


View larger version (17K):
[in this window]
[in a new window]
 
FIG. 4.
Regulation of ESTs encoding putative components of the protein processing and secretion apparatus. A, list of ESTs containing ORFs with significant sequence similarity to S. cerevisiae gene products involved in protein processing and secretion was compiled. This list was compared with the sets of genes induced with log ratio 0.3 or more by lactose (as in Fig. 3) or by sophorose (as in Fig. 2) in QM6a. B, a similar analysis was performed for the sets of genes induced by 2-fold or more in RLP-37.

 

    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Saprophytic microorganisms produce and secrete a variety of hydrolytic enzymes, including proteases, amylases, cellulases, and hemicellulases. These enzymes degrade organic biological substrates, providing nutrients for growth and contributing to carbon recycling in nature. Recently, a great deal of attention has focused on cellulases and hemicellulases produced by these organisms because of their potential to be produced industrially and used in degradation of biomass for a number applications, most notably biofuel production (17).

The principal component of biomass is cellulose. It consists of polymers of {beta}-1,4-linked glucose residues that are organized into higher order fibrillar structures (8). In addition to cellulose, biomass contains large quantities of hemicelluloses, heteropolysaccharides composed of two or more monosaccharides such as D-xylose, L-arabinose, D-mannose, D-glucose, D-galactose, and 4-O-methyl-D-glucuronic acid (1).

Among the most prolific producers of biomass-degrading enzymes is the filamentous fungus Trichoderma reesei. The cellulase activity produced by T. reesei is composed of a complement of endoglucanases (EGI/Cel7B, EGII/Cel5A, EGIII/Cel12A, EGIV/Cel61A, and EGV/Cel45A) and exoglucanases (the cellobiohydrolases, CBHI/Cel7A, and CBHII/Cel6A) that act synergistically to break down cellulose to cellobiose (glycosyl {beta}-1,4-glucose) (914). Two {beta}-glucosidases (BGLI/Cel3A and BGLII/Cel1A) have been identified that are implicated in hydrolyzing cellobiose to glucose (15, 16). An additional protein, swollenin (encoded by the gene swo1), has been described that disrupts crystalline cellulose structures, presumably making polysaccharides more accessible to hydrolysis (17). The four most abundant components of T. reesei cellulase CBHI/Cel7A, CBHII/Cel6A, EGI/Cel7B, and EGII/Cel5A together constitute greater than 50% of the protein produced by the cell under inducing conditions and can be secreted in excess of 40 g/liter (18).

The regulation of cellulolytic enzyme expression in T. reesei is complex and only partially understood. Transcription of the major components of cellulase (CBHI/Cel7A, CBHII/Cel6A, EGI/Cel7B, EGII/Cel5A, EGIII/Cel12A, EGIV/Cel61A, and EGV/Cel45A) is induced not only by cellulose but also by a variety of disaccharides including lactose, cellobiose, and sophorose (glycosyl {beta}-1,2-glucose) (13, 19, 20). Induction by these molecules is antagonized by the presence of the preferred carbon sources, glucose and fructose. Sophorose is by far the most potent inducer of cellulase expression (21). However, it is unclear whether this high potency is an innate characteristic of the molecule or whether other disaccharides are less effective because they are more readily cleaved by cellular glucosidases. The balance between intact disaccharides and inhibitory monosaccharide cleavage products might then influence the transcriptional state of these genes. In accordance with this notion both lactose and cellobiose fail to fully induce cellulase gene expression when present at high concentrations.

In addition to enzymes with cellulolytic activity, a number of enzymes have been identified in T. reesei that degrade hemicellulose (17, 2231). These enzymes include four xylanases (Xyn1, Xyn2, Xyn3, and Xyn4) and mannanase (Man1), which cleave the xylan and mannan main chains of hemicellulose. Acetyl xylan esterase (Axe1), {alpha}-glucuronidase (Glr1), and arabinofuranosidase (Abf1) digest side chains containing acetyl, methylglucuronic acid, and arabinose moieties, respectively. Additionally, enzymes that digest oligosaccharides derived from hemicellulose have been identified. These are {beta}-xylosidase (Bxl1) and three {alpha}-galactosidases (Agl1, Agl2, and Agl3).

Among the hemicellulases that have been studied, all except arabinofuranosidase are expressed at a substantially higher level when T. reesei is grown in medium containing cellulose than when it is grown in medium containing non-inducing carbon sources such as sorbitol (28, 32). Additionally, expression of many of the genes encoding these enzymes, with the notable exception of man1, is induced by xylans. Sugars such as sophorose, arabitol, xylobiose, cellobiose, and galactose also induce expression to varying extents, particularly of enzymes that degrade substrates related to these sugars (32). The molecular mechanisms by which T. reesei senses the composition of the extracellular milieu and modulates the expression of these enzymes are unknown. Moreover, although expression of some of these genes is modified by certain substrates, it is unclear to what extent each gene has a unique regulatory apparatus and to what extent expression of these genes are coupled among themselves and with the cellulases via sharing of regulatory pathways. The mechanisms by which the cellulase and hemicellulase genes are regulated are likely to influence the ecological niches that T. reesei occupies and are of interest in the commercial production of these enzymes.

The very large quantity of biomass-degrading enzymes synthesized by T. reesei requires a significant investment of cellular resources. Evidence suggests that a primary means by which the cell manages these demands is to regulate transcription of the genes encoding these enzymes according to the availability of different carbon sources. However, the degree to which gene products involved in other cellular processes, such as secretion, must be regulated to accommodate the substantial burden of cellulase biosynthesis has not been systematically investigated.

In this study, aspects of these questions are addressed by determining the transcriptional effects of cellulase inducers on a genomic scale. Over 5100 cDNAs from T. reesei were partially or fully sequenced. Twelve cDNAs encoding new enzymes with putative roles in biomass degradation were discovered. Microarrays were used to examine the regulation of these and previously identified genes encoding biomass-degrading enzymes in the context of the extensive repertoire of newly identified genes. The results presented shed light on the coregulation of cellulases and hemicellulases and the mechanisms by which the cell copes with synthesizing very large quantities of these secreted enzymes.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Media, Strains, and Culture Growth Conditions—T. reesei strains used in this study were obtained from the American Type Culture Collection. Liquid minimal medium was as described previously (20), except that 100 mM piperazine-N, N-bis(3-propanesulfonic acid) (Calbiochem) was included to maintain the pH at 5.5. Vogel's medium was described by Davis and DeSerres (33). YEG medium contains 0.5% yeast extract (Difco), 2% glucose.

For Northern blot analysis, ~1 x 108 spores were inoculated into 50 ml of minimal medium supplemented with 5% glucose and grown at 30 °C for 24 h. Mycelia were collected by centrifugation, washed in carbon-free medium, and resuspended to an optical density of ~0.3 in 50 ml of minimal medium supplemented with 5% glucose, 2% avicel (FMC), 2% glycerol, or 2% glycerol containing 1 mM sophorose (Sigma). Cultures were grown in flasks with vigorous agitation for 20 h. For microarray analysis, inoculation was performed as above. Cultures were grown in triplicate overnight. Mycelia were collected by centrifugation and washed with minimal medium containing 2% glycerol. They were then resuspended to an OD of 0.15–0.2 and grown for 9 h (OD 0.5). The cultures were then divided in half, and 0.01 volume of 100 mM sophorose was added to 1 flask of each pair immediately and 10 h later. The cultures were grown an additional 2 h to an OD ~2–4.

cDNA Library and Sequencing—T. reesei strain QM6a mycelia were grown in baffled flasks at 30 °C for 24 h in YEG medium with vigorous aeration. 5 ml of this culture was added to 50 ml of the following media and grown under the following conditions: Vogel's liquid medium, 2% avicel, 3 and 6 days; Vogel's liquid medium, 2% Solka floc (International Fiber Corp., North Tonawanda, NY), 3 and 6 days; Vogel's liquid medium, 2% wheat bran (Skidmore Sales and Distributing Co., Inc., West Chester, OH), 3 and 6 days; Vogel's liquid medium, 2% beet pulp (D&D Ingredients Distributors, Inc., Delphos, OH), 6 days; Vogel's liquid medium, 2% glucose, 24 h; Vogel's liquid medium, 2% lactose, 24 h; Vogel's liquid medium, 2% xylose, 24 h; Vogel's liquid medium, 2% fructose, 24 h; Vogel's liquid medium, 2% maltose, 24 h; Vogel's liquid medium, no carbon source added, 24 h; Vogel's liquid medium, no nitrogen source added, 24 h; Vogel's liquid medium, 2% phosphoric acid swollen cellulose, 3 days; YEG medium, 42 °C, 1.5 h; YEG medium, 20 mM dithiothreitol, 1.5 h; YEG, room temperature, closed container with no agitation (anoxia), 1.5 h; solid state, 15 g of wheat bran, 1 g of proflo, 1 g of solkafloc, 30 ml of water, 6 and 7 days; solid state, 15 g of beet pulp, 1 g of proflo, 1 g of solkafloc, 30 ml of water, 9 days.

RNA was prepared from the mycelia by grinding under liquid nitrogen with a mortar and pestle and extracting using Trizol reagent (Invitrogen) according to manufacturer's instructions. cDNA libraries were constructed by Invitrogen in the vector pREP3Y, which is a derivative of pREP3X (33) containing additional restriction sites at the multiple cloning site. ESTs1 were generated by sequencing cDNA clones from the 5' end. Template DNA was extracted in a 96-well format using a modified alkaline lysis protocol. Sequencing reactions were performed following standard Big Dye (Applied Biosystems) protocols for a 0.25x reaction. Cycle sequencing was performed over 35 cycles (96 °C for 10 s; 50 °C for 5 s; and 60 °C for 4 min) in an Applied Biosystems GenAmp 9700 thermocycler. DyeEx 96-well plates (Qiagen) were used for dyeterminator removal. Samples were sequenced using an ABI 3700 capillary sequencer (Applied Biosystems).

Fermentation—Duplicate fermentations were run for each of the strains displayed in Fig. 3. 0.8 liters of minimal media containing 5% glucose was inoculated with 1.5 ml of frozen spore suspension. After 48 h, each culture was transferred to 6.2 liters of the same media in a 14-liter Biolafitte fermenter. The fermenter was run at 25 °C, 750 rpm, and 8 standard liters per min of air flow. One hour after the initial glucose was exhausted, a 25% (w/w) lactose feed was started and was fed in a carbon-limiting fashion to prevent lactose accumulation. The concentrations of glucose and of lactose were monitored using a glucose oxidase assay kit or a glucose hexokinase assay kit with {beta}-galactosidase added to cleave lactose, respectively (Instrumentation Laboratory Co., Lexington, MA). Samples were obtained at regular intervals to monitor the progress of the fermentation. Samples obtained before (20–35 g/liter glucose) and just after glucose exhaustion, and 24 and 48 h after the lactose feeding commenced were used for microarray analysis.



View larger version (42K):
[in this window]
[in a new window]
 
FIG. 3.
Lactose-mediated induction of biomass-degrading enzymes. RLP-37 and QM6a mycelia were grown initially in glucose-containing medium. One hour after the glucose had been completely utilized, cultures were fed lactose at a rate that prevented accumulation in the medium. Samples were obtained during the glucose feed, during carbon deprivation, and 24 and 48 h after commencement of the lactose feeding. Microarrays were used to determine the expression levels at each of the times, as indicated above the columns, relative to expression at carbon deprivation. Data obtained for the biomass-degrading enzymes was clustered according to the relative expression levels observed across all conditions in both strains. Color codes indicate log ratios of expression under the indicated conditions relative to expression at carbon deprivation. B, RNA derived from cultures induced for 48 h in lactose was directly compared with RNA derived from the sophorose-induced cultures described in Fig. 2. Color codes indicating log ratios of expression in lactose medium relative to expression in sophorose medium. Color scale is as in Fig. 2

 

Isolation of RNAs, Labeling, and Hybridization—Mycelia were harvested by filtration through miracloth (Northern blots and fermentation samples) or by vacuum filtration through Whatman No. 1 paper and were quick frozen in liquid nitrogen. For Northern blotting, RNA was prepared as described above for construction of cDNA libraries. Polyadenylated RNA was selected 2 times using Oligotex (Qiagen). Blotting was performed using a NorthernMax-Gly Kit (Ambion). 32P-Labeled probes were generated using a DECAprime Kit (Ambion). Hybridization was performed using ULTRAhyb Ultrasensitive Hybridization Buffer (Ambion).

In all of the microarray experiments performed, the relative expression levels were determined between two RNA samples derived from growth under two different conditions. RNA was prepared using a FastRNA (Red) Kit (Qbiogene). Two aliquots of total RNA were taken from each sample. One aliquot was labeled with cyanine 3-CTP, and the other aliquot was labeled with cyanine 5-CTP (PerkinElmer Life Sciences) using a fluorescent linear amplification kit (Agilent Technologies). 250 ng of cyanine 5-labeled RNA derived from mycelia grown under one condition of interest were combined with 250 ng of cyanine 3-labeled RNA derived from mycelia grown under a second condition of interest, and the pooled RNAs were hybridized to the microarrays using an in situ hybridization kit (Agilent Technologies). The relative fluorescent intensities of cyanine 5- and cyanine 3-labeled species bound to each probe of the microarray were determined using an Agilent Technologies microarray scanner and software. The log of the ratio (log ratio) of the two fluorescent species bound to each of the probes reflects the relative expression levels of the cognate genes in the two samples (34, 35). To avoid possible bias resulting from differences in incorporation of the dyes, duplicate microarrays were hybridized in which reciprocal dye combinations were used. Thus, for example, in sophorose induction experiments, cyanine 5-labeled RNA derived from sophorose-induced cultures was combined with cyanine 3-labeled RNA from uninduced cultures; and in replicate microarrays, cyanine 3-labeled RNA derived from sophorose-induced cultures was combined with cyanine 5-labeled RNA from uninduced cultures. A total of 6 microarrays was performed for each condition in the sophorose induction experiments (duplicate reciprocally labeled arrays for triplicate cultures). In the lactose induction experiments 4 microarrays were performed for each time point (duplicate reciprocally labeled arrays for duplicate fermenters). For all figures the mean values across replicates are presented.

Microarray Design—60-mer oligonucleotides corresponding to the assembled ESTs were designed and synthesized in situ by Agilent Technologies.

Bioinformatics and Data Analysis—Sequence chromatograms were assigned base quality values by Phred (version 0.990722, www.phrap.org). Sequences containing >100 bases with Phred quality values >=20 or an average base quality >=12 were retained. High quality sequences were assembled using the PhredPhrap (version 0.990329) script provided by Consed (version 11.0) (36). Contigs were virtually translated in 6 reading frames and were annotated using the BioSCOUT system from LION Biosciences as in Andrade et al. (37). Briefly, biasdb (37) and Seq (38) were used to identify compositionally biased regions and regions of low compositional complexity, respectively. For each sequence, homology search results using FASTA (39) and BLAST (40) were pooled, and a lexical analysis procedure was applied to the descriptive field of the set of homologs (37). Annotations were classified as direct, clear, tentative, marginal, or unknown according to the following criteria: 10<–70, 1010 to 1070, 104 to 1010, 0.1 to 104 and >0.1 for BLAST; and >500, >130, >90, and >0 for FASTA, respectively. Selected sequences were further analyzed by aligning them with similar genes using ClustalW (clustalw.genome.ad.jp/). All tools were used with default parameters. Secretion signal sequence prediction was performed according to Ref. 41. Glycoside hydrolase families and carbohydrate-binding module families are were assigned as in Ref. 42. To identify genes involved in protein processing and secretion, the sequences of gene products of interest from other organisms were compared by BLAST to the translated ESTs. Microarray data were quantified using Feature Extraction software (Agilent Technologies). The data were visualized and analyzed using Genespring version 4.2 (Silicon Genetics). Clustering was performed using a standard correlation and Genespring default settings.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Identification of New Genes Encoding Hydrolytic Enzymes— The inducible expression of the very abundant cellulolytic and hemicellulolytic enzymes requires the coordination of a variety of cellular processes. To identify T. reesei genes that participate in these processes and to identify new enzymes that might play a role in biomass utilization, we sequenced the 5' ends of 18,000 random cDNAs from mycelia grown on a wide variety of carbon sources and conditions. The sequences of individual sequence reads were compared, and overlapping segments were assembled to form 2101 contigs consisting of two or more 5' reads. 3030 individual reads did not have significant sequence overlap with any other reads in the data set.

The predicted coding regions of the EST set were compared with all publicly available sequence data bases. Twelve new sequences were identified encoding proteins with significant similarity to known enzymes whose substrates are commonly found in biomass. Full-length sequences corresponding to these gene products were determined. The genes encoding these sequences along with relevant previously identified enzymes from T. reesei are listed in Table I.


View this table:
[in this window]
[in a new window]
 
TABLE I
Characterized and predicted biomass degrading activities and their genes in T. reesei

 

Among the newly identified sequences were 3 new predicted endoglucanases Cel74a, Cel61b, and Cel5b. The expected amino acid sequences of these gene products correspond to glycoside hydrolase (GH) families 74, 61, and 5, respectively, based on alignments with other members of these families (43). All of these sequences contain a signal sequence, suggesting that they are secreted proteins. Cel74a is the first member of GH family 74 to be found in T. reesei, and it includes a C-terminal carbohydrate-binding domain of the CBM1 family (42). Unlike the previously known T. reesei member of the GH61 family (EGIV/Cel61a), Cel61b has no carbohydrate-binding motif. Cel5B has a C-terminal hydrophobic region and a consensus sequence suggesting that it may be modified by a glycosylphosphatidylinositol anchor attached to serine 411 (44). This gene product may therefore represent a membrane-bound endoglucanase.

Five new putative {beta}-glucosidase sequences were also discovered. As indicated in Table I, they fall into GH families 1 and 3. Cel3b and Cel3e contain presumed signal sequences suggesting that they encode secreted enzymes.

Two predicted hemicellulolytic enzymes including an acetyl xylan esterase (Axe2) and an arabinofuranosidase (Abf2) were also found. Like Cel5b, Axe2 is predicted to contain a glycosylphosphatidylinositol membrane anchor. The glycoside hydrolase and carbohydrate esterase families corresponding to each of these sequences are listed in Table I. Sequence alignments for the predicted gene products with representative members of glycoside hydrolase families are available in the Supplemental Material. Two additional sequences designated Cip1 and Cip2 were found and each contain a fungal-type cellulose binding domain and a signal sequence. The predicted amino acid sequences of these genes do not show compelling sequence similarity to any of the currently defined GH families. However, Cip1 is very similar to an unassigned putative secreted hydrolase from Streptomyces coelicolor (45) (TrEMBL accession number O69962), and Cip2 is highly similar to a multidomain esterase from Ruminococcus flavefaciens that contains an acetyl xylan esterase domain outside the region of sequence similarity with Cip2 (46) (GenBankTM accession number AJ238716 [GenBank] ).

Regulation of the Endoglucanases and the {beta}-Glucosidases— The previously identified endoglucanases are induced during growth on media containing cellulose, sophorose, or lactose (13, 19, 20). To determine whether the newly discovered endoglucanases are similarly regulated, we examined mRNA levels for each of these gene products by Northern blot. Two different strains were used: QM6a, a wild type isolate of T. reesei, and RL-P37, a strain that has been selected for improved production of cellulolytic enzymes (47). Mycelia from each of these strains were grown in flasks in minimal media containing glucose, crystalline cellulose (avicel), or glycerol as the sole carbon source, or glycerol supplemented with 1 mM sophorose. As shown in Fig. 1A, with the exceptions of egl4/cel61a, cel61b, and cel5b, the endoglucanases were regulated very similarly to one another. Induction by sophorose resulted in much higher levels of expression than did growth on cellulose over the time period examined. In addition, expression of these genes was substantially higher in the strain RL-P37 than it was in QM6a. The regulation of egl4/cel61a differed from the other endoglucanases because while it was also more abundant in RL-P37 than it was in QM6a, it appeared to be comparably induced by growth in cellulose and by growth in sophorose. Equivalent expression in cellulose and sophorose was also observed for cel61b in RL-P37 but not QM6a. In contrast, cel5b, the putative membrane-bound endoglucanase, displayed much less dramatic regulation. It was clearly expressed on both glucose and glycerol and induced only a small amount by either cellulose or sophorose. Moreover, cel5b expression may have been somewhat higher during growth on cellulose than during growth on media containing sophorose, particularly in QM6a.



View larger version (70K):
[in this window]
[in a new window]
 
FIG. 1.
Induction of endoglucanases and {beta}-glucosidases. A, cultures of QM6a and RLP-37 were grown in glucose (lanes A), cellulose (lanes B), glycerol (lanes C), or glycerol supplemented with sophorose (lanes D). mRNA from each of the cultures was analyzed by Northern blot. The top set of bands for each gene was probed with labeled cDNAs as indicated. The lower set of bands for each gene was probed with an actin probe to correct for loading differences and differences in exposure times required to visualize the bands. B, QM6a and RLP-37 were grown in glycerol-containing medium or glycerol medium supplemented with 1 mM sophorose. Microarrays were used to determine the relative expression of each of the indicated genes when cells were grown in the presence of sophorose compared with when they were grown in the absence of sophorose. Height of bars indicates the log of the ratio of expression in sophorose compared with expression in glycerol alone for RLP-37 (shaded bars) and QM6a (solid bars). C, using the same RNAs described in B, microarrays were used to determine the relative expression of the indicated genes in RLP-37 compared with QM6a. Height of bars indicates the log ratio of expression in RLP-37 relative to QM6a when both strains were grown with sophorose (shaded bars) or when both strains were grown in glycerol alone (solid bars). Error bars indicate S.D. across replicate arrays using reciprocally labeled mRNA derived from triplicate cultures.

 

The regulation of {beta}-glucosidases was examined in an analogous fashion. The GH family 1 {beta}-glucosidases were induced similarly to the endoglucanases. In contrast to the endoglucanases, however, cel1b was only slightly more abundant in RL-P37 than it was in QM6a. Among the family 3 {beta}-glucosidases, bgl1/cel3a was also regulated similarly to the endoglucanases in RL-P37 but displayed little induction in QM6a. cel3c and cel3d were low in abundance and may have been somewhat more highly induced by cellulose than by sophorose in QM6a. However, in RL-P37, sophorose induction of cel3c and cel3d was more apparent than cellulose-mediated induction. cel3b and cel3e displayed particularly interesting regulation. In QM6a, the expression levels of these genes were higher when the cultures were grown on glycerol than when they were grown on glucose. As a consequence, although both cellulose and sophorose induced expression, the magnitude of the induction appears greater when comparing cellulose-grown cultures to glucose-grown cultures than when comparing glycerol-grown cultures in the presence and absence of sophorose. Surprisingly, in RL-P37 cellulose-mediated induction of cel3b was apparent although sophorose appeared to repress expression below the basal level seen when mycelia are grown on glycerol alone. Thus, although the selective pressures employed to improve cellulase production in RL-P37 appear to have generally resulted in increased inducibility and abundance of the endoglucanases, the regulation of the {beta}-glucosidases seems to have been altered in complex and unexpected ways.

Genome Scale Measurement of Transcriptional Regulation Using Microarrays—To measure the expression levels of all of the ESTs sequenced, microarrays were constructed. Sixty-bp oligonucleotide probes containing unique sequences from within each of the ESTs were designed to query the abundance of their corresponding mRNAs. The oligonucleotide probes were synthesized and arrayed as described in Ref. 34. In all of the experiments performed, the microarrays were used to determine the relative expression levels between two different samples. mRNAs comprising the samples of interest were labeled with Cy5 and with Cy3 fluorescent dyes. Reciprocally labeled pairs of samples were combined and co-hybridized to the arrays. The log of the ratio (log ratio) of the two fluorescent species bound to each of the probes reflects the relative expression levels of the cognate genes in the two samples (34, 35).

To validate the performance of these arrays in measuring T. reesei gene expression, the genes shown in Fig. 1A were analyzed using the microarrays. The magnitude of sophorose induction of the genes was determined by labeling mRNAs from cultures of RL-P37 and QM6a grown in glycerol or glycerol supplemented with sophorose. Samples from sophorose-grown cultures were directly compared with samples of the same strain grown in glycerol. The log ratios of expression in sophorose-grown cultures relative to expression in glycerol-grown cultures are indicated in Fig. 1B. In a second experiment, levels of expression were directly compared across the two strains in the presence and absence of sophorose. mRNA samples from RL-P37 cultures were directly compared with samples from QM6a cultures grown under identical conditions. Fig. 1C shows the log ratios of the expression levels of each of the genes in RL-P37 relative to the expression levels in QM6a. Overall, the log ratios observed using the microarrays are in very good agreement with the differences in the magnitudes of expression observed using Northern blots. However, particularly for some of the lower abundance mRNAs (e.g. cel3e and egl5/cel45a), some discrepancies are observed. These discrepancies arise primarily from variability among cultures (note that the cultures used for the microarray experiments were grown under slightly different conditions than for the Northern blots). Additionally, it was unclear whether the log ratios observed precisely and quantitatively mirror the abundance differences seen by Northern blot over the full dynamic range of expression levels. Therefore, for clarity throughout this paper, we refer to log ratios rather than fold changes in expression.

Analysis of the data obtained in these experiments was extended to include all of the genes encoding putative biomass-degrading enzymes listed in Table I. Fig. 2A shows the results obtained. The data were clustered across three measurements as follows: the magnitude of sophorose induction in RL-P37, the magnitude of sophorose induction in QM6a, and a comparison of the relative levels of expression between RL-P37 and QM6a in sophorose-induced cultures. Clustering across this group of experiments highlights not only the regulation of the genes in the two strains but also the patterns of expression that may have been co-selected with enhanced cellulase production in RL-P37. The clustogram has three primary branches.

One primary branch contains a large cluster of genes that were moderately to highly sophorose-induced in RL-P37. They were also expressed at a significantly higher level in RL-P37 than they were in QM6a. This cluster contains all of the known and putative endoglucanases and exoglucanases, and all of the known and presumed {beta}-glucosidases except cel3b. Interestingly, a variety of hemicellulases, as well as cip1, cip2, and swo1 also fall into this cluster. Most of the genes in this large cluster were induced to some extent in both strains but were a great deal more induced in RL-P37 than they were in QM6a. However, a subset of genes in this group, including the xylanases xyn2, xyn3, and xyn4, was not induced by sophorose in QM6a. Two smaller branches within this cluster showed somewhat different patterns from the rest of the genes. One subbranch contained cel5b and bgl2/cel1a, which were induced approximately equivalently in the two strains and were expressed more highly in RL-P37 than in QM6a. A second subbranch contains bga1 and cel3e, which were both induced as much or more in RL-P37 than in QM6a, but were not more highly expressed in RL-P37 than in QM6a.

The second primary branch of the clustogram consisted of genes that were either not induced or were slightly repressed by sophorose in RL-P37. Interestingly, one of these, cel3b, was highly induced in QM6a. The final primary branch consisted of the single gene abf2, whose expression was unaffected by sophorose.

The pattern of transcriptional regulation in response to sophorose of the complete EST data set is shown in Fig. 2B. The expression profiles observed for the two strains are substantially different. In light of the fact that RL-P37 was derived from QM6a and was selected for its increased capacity for cellulase production, it is likely that many of the differences observed between the two strains influence cellulase synthesis. The full data sets corresponding to Fig. 2, A and B, are available in the Supplemental Material.

To explore further the selective pressures that may have brought about increased cellulase production in RL-P37, we examined ESTs for which the induction or repression in response to sophorose resulted in an absolute level of expression that is either higher or lower in RL-P37 than in QM6a, respectively. In Fig. 2C, ESTs were identified for which the log ratios observed for sophorose induction were higher than 0.3 in QM6a. This set of ESTs was compared with the set that displayed sophorose induction log ratios greater than 0.3 in RL-P37. Eighteen ESTs were induced in common in the two strains. The sets of ESTs that were induced by sophorose in each of the strains were then compared with the set of ESTs that is expressed at a higher level in RL-P37 than in QM6a. Among the 18 ESTs that are induced in common in the two strains, 9 are expressed more highly (log ratio >0.3) in RL-P37 than in QM6a. These ESTs include bgl2/cel1a, cip1, bxl1, a putative aminopeptidase, a putative lactose permease, and four ESTs encoding proteins whose function could not be predicted based on the sequence information available. The majority of the genes tabulated in Table I fall into set e (see Supplemental Material) in Fig. 2B because, although they may be induced in QM6a, the log ratios reflecting the magnitude of their induction were less than 0.3.

A similar analysis was performed for ESTs whose expression was repressed in response to sophorose. Interestingly, there was a set of 4 ESTs that was repressed by sophorose in both strains and whose expression levels were reduced in RL-P37 relative to QM6a. These ESTs may therefore encode proteins whose expression at a relatively high level is detrimental to cellulase production. The identities of the ESTs in each of the sets indicated in Fig. 2, C and D, and their log ratios are available in the Supplemental Material.

Lactose-mediated Induction of Genes Encoding Biomass-degrading Enzymes—Most of the biomass-degrading enzymes listed in Table I are induced by sophorose. These enzymes may be induced via a common sensing and transduction pathway or may be induced via separate pathways. To explore further commonalities in their regulation, the effects of a second inducer, lactose, were examined.

Because lactose inhibits cellulase expression when present at high concentration, mycelia were grown in a fermenter where limited lactose can be supplied at a controlled rate. Initially, cultures were grown in excess glucose to allow rapid cell division and accumulation of cell mass. The cultures were allowed to deplete the medium of glucose. After an hour of carbon deprivation, cultures were fed lactose at a fixed rate that prevented accumulation in the medium. Under these conditions, the cells only undergo significant growth and division during the period when excess glucose is present. Samples of RL-P37 and QM6a mycelia were harvested from the fermenters during growth in glucose, an hour after the glucose had been depleted (carbon deprivation stage) and at 24 and 48 h after lactose feeding commenced. RNA was prepared from each of the samples, and microarray analysis was performed. The expression levels of the biomass-degrading enzymes listed in Table I in lactose and in glucose was determined relative to expression during carbon deprivation (Fig. 3A).

The majority of the genes that were induced by sophorose were also induced by lactose. However, four genes that were induced by sophorose were not induced by lactose under these conditions. bga1 was induced equivalently by sophorose in both RL-P37 and in QM6a, but its expression appeared unaffected by either glucose or lactose. axe2 was slightly sophorose-induced in QM6a but was slightly repressed by lactose, particularly in QM6a. cel3c was induced similarly in both strains by sophorose and was repressed, again particularly in QM6a, by glucose and lactose. Finally, cel3b was induced by sophorose in QM6a but not in RL-P37. Its expression was significantly repressed in both strains by both glucose and lactose.

Several of the genes that were induced by sophorose displayed higher levels of expression, in both glucose and in lactose, than during carbon deprivation. Among these, most are expressed more highly in lactose than in glucose, indicating that they are to some degree preferentially expressed in lactose. Others, such as agl3, abf2, and cel3d, were equivalently expressed in the two carbon sources indicating that their expression may be more sensitive to the availability of a carbon source than to its identity. Nevertheless, the relative induction of these genes is higher in RL-P37 than it is in QM6a. The {beta}-glucosidases cel1b and cel3e are also sophorose-induced and more highly expressed in RL-P37 than in QM6a, yet appear to be preferentially induced by excess glucose in this experimental regimen.

A number of the genes examined were expressed more highly after 1 h of carbon deprivation than they were during the period where excess glucose was present in the medium. Interestingly, more genes displayed this glucose repression (or induction by carbon deprivation) in RL-P37 than in QM6a.

In this experiment, the kinetics of lactose induction appeared to be faster in RL-P37 than in QM6a. Moreover, several genes (e.g. man1, glr1, xyn3, egl1/cel7b, swo1, cbh2/cel6a, egl5/cel45a, cel12a, and xyn4) that showed little or no sophorose induction in QM6a showed clear induction in this strain at the 48-h time point in the lactose-fed fermenters. Because the experiments described above queried only the magnitude of induction by sophorose and lactose, it was unclear whether the absolute expression levels of these genes differed upon induction with the different sugars. To address this question, an experiment was performed in which mRNA from the sophorose-induced cultures was directly compared with mRNA from the fermenter cultures that had been induced for 48 h with lactose. The results of this experiment are shown in Fig. 3B. Remarkably, the final expression level of each of the genes is virtually identical (within log ratio 0.2) in response to the two inducing sugars for both QM6a and RL-P37. Thus despite differences in the experimental regimen and the kinetics of induction, the net expression level of each the genes examined was the same upon induction with either sophorose or with lactose.

Regulation of Genes Involved in Protein Processing and Secretion—Many of the genes encoding biomass-degrading enzymes are induced by lactose and by sophorose more than 10-fold, and some are induced as much as 100-fold. Moreover, the products encoded by these genes accumulate to more than 50% of the total cellular protein (18). The processing and secretion of these gene products are expected to pose a substantial challenge to the secretory pathway of the cell. To ask whether induction of these enzymes produces coordinate changes in the secretory apparatus, a list of 109 T. reesei ESTs and previously identified genes containing open reading frames with significant sequence similarity to known gene products involved in protein glycosylation, folding, secretion, the unfolded protein response, and endoplasmic reticulum-associated protein degradation was compiled. The regulation of these ESTs was examined during cellulase induction.

Fig. 4 shows how this list compares to the sets of genes induced by lactose and by sophorose in QM6a and in RL-P37. Neither of the strains exhibited extensive induction of the identified ESTs. In QM6a, the only ESTs in this set that were induced with log ratio greater than 0.3 by lactose or by sophorose were a gene potentially involved in N-linked glycosylation (annotated as an N-glycosyltransferase), a possible peptidylprolyl cis-trans-isomerase, and an EST (tric039xp19) that contains a region of similarity to Saccharomyces cerevisiae Sec7p. These genes were selectively induced by lactose or by sophorose, respectively, and their log ratios were only slightly greater than 0.3. In RL-P37, a single previously identified gene, encoding protein-disulfide isomerase (48), was induced substantially by both lactose and by sophorose. Twelve genes that participate in a variety of processes including folding, glycosylation, and vesicle transport were induced preferentially by sophorose in RL-P37. An additional gene, encoding Ire1, a transmembrane kinase/nuclease involved in signaling the accumulation of unfolded proteins in the endoplasmic reticulum (49), was induced reproducibly, but with log ratio less than 0.3 by both lactose and by sophorose. The complete data set corresponding to Fig. 4 is available in the Supplemental Material.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Despite the industrial significance of the filamentous fungus T. reesei, few of its genes have been identified and sequenced. In this study, we used an EST-based approach to obtain partial sequence from 18,000 random mRNAs expressed under a wide variety of conditions. These sequences assembled into 5131 contigs indicating that sequence was sampled from nearly half of the 11,000–14,000 genes (50) estimated to reside in a fungal genome of this size. The remaining genes presumably are expressed at such low abundance under the conditions used that they were not sampled in this effort. Despite the fact that the sequences obtained represent only a single pass from the 5' end of cDNAs, we found that ORFs within nearly 50% of the contigs showed clear sequence similarity to known coding sequences from other organisms. Chambergo et al. (51) have also sequenced ESTs comprising ~1151 contigs from T. reesei. We did not attempt to assemble the current sequencing set with the set described by this group because they derive from different strains.

Often genes of related function display commonalities in their regulation. Previous studies (19, 20, 32) have shown that the known cellulase components and some hemicellulases are coordinately induced by certain substrates. The large magnitude in induction of genes encoding cellulase components and the toll on cellular resources used in synthesizing them suggests that the tendency toward tight coordinate regulation is likely to hold true for other genes whose products are involved in biomass degradation or in facilitating their synthesis. We used oligonucleotide-based microarrays to determine the comprehensive expression patterns of the EST set under conditions known to influence cellulase production in T. reesei.

Comparisons of the sets of genes that were induced by lactose to those that were induced by sophorose in these experiments should be interpreted in light of the experimental regimens used. To measure sophorose induction, expression levels observed in sophorose-grown cultures were compared with expression levels when glycerol was used as the sole carbon source. In contrast, expression levels during the limited lactose feed in a fermenter were normalized to expression levels during carbon deprivation. It is likely that these considerations are particularly relevant to the regulation of the {beta}-glucosidases, which as a class showed profound differences in their regulation by sophorose and by lactose (discussed further below). However, when a direct comparison was made between the expression levels observed after sophorose induction to those seen after lactose induction (Fig. 3B), each of the genes encoding the putative biomass-degrading enzymes listed in Table I were expressed virtually equivalently in both RL-P37 and QM6a. Thus, both induction schemes appear to ultimately result in the same net levels of expression and stoichiometry (at the mRNA level) of these enzymes.

Among the cDNAs that were newly identified in this study were three, designated cel74a, cel61b, and cel5b, that appear to encode previously unknown endoglucanases. With the exception of cel5b, all of the endoglucanases were similarly regulated by cellulose, sophorose, and lactose. These results are in agreement with results published for the previously known endoglucanases (20). The endoglucanases were also more dramatically induced and more highly expressed in the strain RL-P37 than they were in QM6a.

Two additional putative glycolytic enzymes, cip1 and cip2, were identified that do not fit into any of the currently defined classes of GHs. The regulation of cip1 among strains with varying cellulase-producing capabilities and across a variety of conditions is indistinguishable from the endoglucanases and particularly the cellobiohydrolase cbh1/cel7a (Figs. 2 and 3).2 Similarly, cip2 has a pattern of expression in common with these genes, particularly in RL-P37. The coregulation of these genes with canonically regulated cellulase components and the fact that they contain distinct cellulose-binding modules furthers the notion that cip1 and cip2 encode previously unrecognized activities with potential roles in biomass degradation.

Five new {beta}-glucosidase genes were identified in the EST set bringing the total number of described T. reesei {beta}-glucosidases to 7. Although it is known that {beta}-glucosidase activity is required for complete digestion of carbohydrates to monosaccharides, the precise functions and specificities of the enzymes encoded by these genes is not known. Deciphering the roles of the {beta}-glucosidases is further complicated by the fact that they have been shown to be capable of transglycosylation, thereby converting sugars that are poor inducers of cellulase gene transcription into more potent inducers (reviewed in Ref. 52). Barnett et al. (15) have shown and Takashima et al. (16) have suggested roles for bgl1/cel3a and bgl2/cel1a, respectively, in cellulose saccharification, although BGLII/Cel1a was later shown to be an intracellular rather than a secreted protein (53). In accordance with this function, bgl1/cel3a and bgl2/cel1a expression were well correlated with the induction, by both sophorose and lactose, of other genes central to cellulose degradation. Interestingly, however, these genes differed in their response to glucose deprivation. The expression of bgl1/cel3a was higher when glucose was present in excess than when glucose was depleted from the medium. bgl2/cel1a expression, in contrast, was dramatically lower in excess glucose.

cel1b, cel3d, and cel3e appeared to be more highly induced by glucose than by lactose. One interpretation of these results is that these {beta}-glucosidases are less directly relevant to cellulose degradation than other enzymes of this class. However, it is interesting to note that these genes are sophorose-inducible and are present at a higher level in RL-P37 than in QM6a suggesting that their regulation may be tied in some way to cellulase production.

Several of the hemicellulase-encoding genes (axe2, xyn1, abf1, abf2, agl1, agl2, and agl3) appear to be differentially induced by lactose and by sophorose in either QM6a or in RL-P37 or both. Most notably, abf1, abf2, agl1, and agl2 were substantially more induced in RL-P37 during growth on lactose than they were by growth in sophorose. Previously published results (32) reported that expression of a subset of the currently examined hemicellulases, with the exception of bxl1, were undetectable in cultures where lactose is used as a carbon source. We attribute the discrepancies with the results reported in this study to several factors. The T. reesei strain used in the previous study differed from the ones used in this study, although like RL-P37 it is derived from QM6a. In addition, in contrast to the experiment described in this paper where lactose was fed via a fermenter to minimize accumulation, the previous study used flask cultures in which excess lactose was present. It is also likely that the sensitivity of detection of mRNAs using microarrays by methods used in this study exceed the sensitivity of Northern blots used in previous analyses.2

It has been speculated that in nature low levels of constitutively expressed cellulases, perhaps in conjunction with a {beta}-glucosidase, must exist to initiate cellulose digestion and generate smaller inducing molecules (5456). These molecules, such as cellobiose and potentially sophorose (generated by trans-glycosylation), would then mediate the induction of the full complement of cellulase-encoding genes. Several of the genes encoding biomass-degrading enzymes displayed regulatory patterns consistent with the possibility that they play a role in primary induction of cellulase expression. cel5b mRNA was moderately abundant and was constitutively expressed in glycerol, glucose, sophorose, and lactose. However, its expression was induced somewhat by cellulose. In addition, in contrast to the other endoglucanases it was only slightly more abundant in RL-P37 than in QM6a. It is therefore probably not a key component of the set of enzymes selected in RL-P37 to digest bulk cellulose. These characteristics, along with the fact that Cel5b may be membrane-anchored via a glycosylphosphatidylinositol anchor, make it an interesting candidate for this role. By analogy, Axe2, which is also predicted to contain a glycosylphosphatidylinositol anchor, may be involved in primary induction of some hemicellulases. In addition, several other genes were moderately to highly increased in abundance when cultures were deprived of a carbon source (Fig. 3B, genes that appear "repressed by glucose"). Margolles-Clark et al. (32) have described similar results for a subset of these genes. Elevated expression of these genes in response to starvation may indicate that they play a role in "foraging" for complex carbon sources in nature or in induction of other genes encoding biomass-degrading enzymes. In this hypothetical role, it is unclear why the complement of genes that is induced by starvation differs between RL-P37 and QM6a. One possibility is that because the screens used in developing RL-P37 included a requirement for cellulase production in the presence of 2-deoxyglucose, glucose-mediated regulation of many of the genes has been altered. Alternatively, regulation in response to glucose may be, in some way, mechanistically tied to induction and cellulase overexpression.

A number of studies have shown that endoplasmic reticulum chaperones and foldases are up-regulated by endoplasmic reticulum stress or secretion of foreign proteins in filamentous fungi (reviewed in Ref. 57). A list of T. reesei ESTs was compiled that contains ORFs with significant sequence similarities to known gene products involved in various aspects of protein secretion. Surprisingly few genes from this list were induced by sophorose or by lactose in either QM6a or in RL-P37 (Fig. 4). Saloheimo et al. (48) showed that in T. reesei the protein-disulfide isomerase gene pdi1 is minimally induced by sophorose and induced by 8–10-fold, compared with growth on glucose, during growth on cellulose-containing medium, or after carbon depletion. In our experiments, pdi1 was not induced in QM6a but was the sole gene among secretion-related genes that was induced (log ratio >0.3) by growth in both sophorose and lactose in RL-P37. It is likely that the relative induction level of pdi1 is a function of the quantity of total secreted protein being produced and that the slight differences observed between the two studies reflect strain differences and differences in the growth regimens employed.

Sophorose is the most potent inducer of T. reesei cellulases known. The large number of genes whose expression is altered by sophorose indicates that its direct and indirect effects on the cell are broad. Moreover, the effect of sophorose on expression of numerous genes appears to differ between the two strains examined. RL-P37 was derived from QM6a by mutation and screening for increased cellulase production in the presence of glycerol and subsequently in 2-deoxyglucose (47). Under the fermentation conditions used in this study, cellulase production in RL-P37 was 10-fold higher than in QM6a. Therefore, one might expect that many of the genes whose expression both differs between these two strains, and is modulated by sophorose, encode gene products involved in cellulase production.

mRNAs corresponding to a large number of ESTs were expressed less abundantly when cells were grown in sophorose than when they were grown in glycerol alone. It is possible that part of the role of sophorose in stimulating production of biomass-degrading enzymes is to directly or indirectly ameliorate the effects of a negative regulator. Such an effect could occur at many levels including transcription, sugar transport, signal transduction, or secretion. In Fig. 2D, we examined the set of ESTs whose expression was decreased in the presence of sophorose. Sets d and e in Fig. 2D (see Supplemental Material for the data sets) contain ESTs that are repressed in RL-P37 cultures grown with sophorose and that are expressed at a lower level in RL-P37 than in QM6a. The strong correlation between reduced expression of these genes in response to sophorose and enhanced production of biomass-degrading enzymes may indicate that their expression is detrimental to cellulase production.

Analogously, a set of genes was identified that was both induced by sophorose and increased in expression in RL-P37 relative to QM6a (Fig. 2C, sets d–f, see Supplemental Material). Among this set are the previously identified glycosyl hydrolase-encoding genes known to be important in cellulase degradation. The co-selection in RL-P37 of some of the newly discovered genes in Table I with the known genes encoding cellulase components strengthens the notion that they may have roles in biomass degradation. It is remarkable that among this group are genes that do not have a known function in cellulose degradation but rather are presumed to act on other substrates (e.g. hemicellulose). These genes would not be expected to have a direct benefit in the selection for increased cellulose production that was applied in deriving RL-P37, yet they appear to have been co-selected for enhanced sophorose induction and overexpression. Thus, it seems that a mutation(s) that improved cellulase production concurrently improved the inducible expression of these ancillary genes. These results suggest that this set of genes may have significant regulatory points of convergence across the spectrum of cellular processes involved in carbon sensing, signal transduction, and transcriptional regulation. By the same reasoning, at least some of the other products encoded by genes in Fig. 2C, sets d–f, are likely to be important either in biomass degradation or in the synthesis of enzymes required for this process. Because of the limited sequence information available, we could not definitively annotate many of these ESTs. However, it is interesting to note that among the ESTs in this set are ones containing ORFs with similarity to several sugar transporters, constituents of the secretory apparatus, and to chitinase. These findings and further characterization of the genes will likely have significant implications for the design of industrial processes for commercial production of biomass-degrading enzymes.


    FOOTNOTES
 
The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EBI Data Bank with accession number(s) CB895227 [GenBank] –CB909713.

* This work was supported in part by a subcontract from the Office of Biomass Program of the Department of Energy Office of Energy Efficiency and Renewable Energy. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

The on-line version of this article (available at http://www.jbc.org) contains sequence alignments for the predicted gene products and data sets for Figs. 2 and 4. Back

{ddagger} To whom correspondence should be addressed: Genencor International, 925 Page Mill Rd., Palo Alto, CA 94304. Tel.: 650-846-7635; Fax: 650-621-7817; E-mail: Pforeman{at}genencor.com.

1 The abbreviations used are: EST, expressed sequence tag; GH, glycoside hydrolase; ORF, open reading frame. Back

2 P. Foreman, unpublished observations. Back



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Biely, P., and Tenkanen, M. (1998) in Trichoderma and Gliocladium: Enzymes, Biological Control and Commercial Applications (Harman, G. E., and Kubicek, C. P., eds) Vol. 2, pp. 25–47, Taylor & Francis Ltd., London
  2. Dove, A. (2000) Nat. Biotechnol. 18, 490[CrossRef][Medline] [Order article via Infotrieve]
  3. Gerngross, T. U. (1999) Nat. Biotechnol. 17, 541–544[CrossRef][Medline] [Order article via Infotrieve]
  4. Himmel, M. E., Adney, W. S., Baker, J. O., Elander, R., McMillan, J. D., Nieves, R. A., Sheehan, J., Thomas, S. R., Vinzant, T. B., and Zhang, M. (1997) in Fuels and Chemicals from Biomass (Woodward, J., and Saha, B., eds) pp. 2–45, American Chemical Society, Washington, D. C.
  5. Mielenz, J. R. (2001) Curr. Opin. Microbiol. 4, 324–329[CrossRef][Medline] [Order article via Infotrieve]
  6. Sheehan, J., and Himmel, M. (1999) Biotechnol. Prog. 15, 817–827[CrossRef][Medline] [Order article via Infotrieve]
  7. Wyman, C. E. (2001) Appl. Biochem. Biotechnol. 91, 5–21
  8. Bayer, E. A., Chanzy, H., Lamed, R., and Shoham, Y. (1998) Curr. Opin. Struct. Biol. 8, 548–557[CrossRef][Medline] [Order article via Infotrieve]
  9. Okada, H., Tada, K., Sekiya, T., Yokoyama, K., Takahashi, A., Tohda, H., Kumagai, H., and Morikawa, Y. (1998) Appl. Environ. Microbiol. 64, 555–563[Abstract/Free Full Text]
  10. Saloheimo, M., Lehtovaara, P., Penttila, M., Teeri, T. T., Stahlberg, J., Johansson, G., Pettersson, G., Claeyssens, M., Tomme, P., and Knowles, J. K. (1988) Gene (Amst.) 63, 11–22[CrossRef][Medline] [Order article via Infotrieve]
  11. Teeri, T. T., Lehtovaara, P., Kauppinen, S., Salovuori, I., and Knowles, J. (1987) Gene (Amst.) 51, 43–52[CrossRef][Medline] [Order article via Infotrieve]
  12. Ward, M., Wu, S., Dauberman, J., Weiss, G., Larenas, E., Bower, B., Rey, M., Clarkson, K., and Bott, R. (1993) in Biochemistry and Genetics of Cellulose Degradation (Aubert, J.-P., Beguin, P., and Millet, J., eds) pp. 53–70, Academic Press, New York
  13. Saloheimo, M., Nakari-Setala, T., Tenkanen, M., and Penttila, M. (1997) Eur. J. Biochem. 249, 584–591[Medline] [Order article via Infotrieve]
  14. Saloheimo, A., Henrissat, B., Hoffren, A. M., Teleman, O., and Penttila, M. (1994) Mol. Microbiol. 13, 219–228[Medline] [Order article via Infotrieve]
  15. Barnett, C. C., Berka, R. M., and Fowler, T. (1991) Bio/Technology 9, 562–567[CrossRef][Medline] [Order article via Infotrieve]
  16. Takashima, S., Nakamura, A., Hidaka, M., Masaki, H., and Uozumi, T. (1999) J. Biochem. (Tokyo) 125, 728–736[Abstract/Free Full Text]
  17. Saloheimo, M., Paloheimo, M., Hakola, S., Pere, J., Swanson, B., Nyyssonen, E., Bhatia, A., Ward, M., and Penttila, M. (2002) Eur. J. Biochem. 269, 4202–4211[Medline] [Order article via Infotrieve]
  18. Durand, H., Baron, M., Calmels, T., and Tiraby, G. (1998) FEMS Symp. 43, 135–152
  19. Nogawa, M., Goto, M., Okada, H., and Morikawa, Y. (2001) Curr. Genet. 38, 329–334[CrossRef][Medline] [Order article via Infotrieve]
  20. Ilmen, M., Saloheimo, A., Onnela, M. L., and Penttila, M. E. (1997) Appl. Environ. Microbiol. 63, 1298–1306[Abstract]
  21. Mandels, M., Parrish, F. W., and Reese, E. T. (1962) J. Bacteriol. 83, 400–408[Abstract/Free Full Text]
  22. Margolles-Clark, E., Tenkanen, M., Nakari-Setala, T., and Penttila, M. (1996) Appl. Environ. Microbiol. 62, 3840–3846[Abstract]
  23. Margolles-Clark, E., Tenkanen, M., Luonteri, E., and Penttila, M. (1996) Eur. J. Biochem. 240, 104–111[Medline] [Order article via Infotrieve]
  24. Margolles-Clark, E., Saloheimo, M., Siika-aho, M., and Penttila, M. (1996) Gene (Amst.) 172, 171–172[CrossRef][Medline] [Order article via Infotrieve]
  25. Margolles-Clark, E., Tenkanen, M., Soderlund, H., and Penttila, M. (1996) Eur. J. Biochem. 237, 553–560[Medline] [Order article via Infotrieve]
  26. Mach, R. L., Strauss, J., Zeilinger, S., Schindler, M., and Kubicek, C. P. (1996) Mol. Microbiol. 21, 1273–1281[CrossRef][Medline] [Order article via Infotrieve]
  27. Zeilinger, S., Mach, R. L., Schindler, M., Herzog, P., and Kubicek, C. P. (1996) J. Biol. Chem. 271, 25624–25629[Abstract/Free Full Text]
  28. Xu, J., Takakuwa, N., Nogawa, M., Okada, H., and Morikawa, Y. (1998) Appl. Microbiol. Biotechnol. 49, 718–724[CrossRef][Medline] [Order article via Infotrieve]
  29. Den Haan, R., and Van Zyl, W. H. (2001) Appl. Microbiol. Biotechnol. 57, 521–527[CrossRef][Medline] [Order article via Infotrieve]
  30. Rose, S. H., and van Zyl, W. H. (2002) Appl. Microbiol. Biotechnol. 58, 461–468[CrossRef][Medline] [Order article via Infotrieve]
  31. Torronen, A., Mach, R. L., Messner, R., Gonzalez, R., Kalkkinen, N., Harkki, A., and Kubicek, C. P. (1992) Bio/Technology 10, 1461–1465