![]()
|
|
||||||||
J. Biol. Chem., Vol. 282, Issue 26, 18929-18936, June 29, 2007
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





1
From the
Department of Chemical and Biomolecular Engineering and
Biomedical Engineering Interdepartmental Program, UCLA, Los Angeles, California 90095
Received for publication, February 16, 2007 , and in revised form, March 16, 2007.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
-bacterium Rhodopseudomonas palustris (8), and we propose a novel method for enhancing expression of foreign transcripts, altering E. coli ribosomal protein S1 by directed evolution.
A number of studies have examined the ability of E. coli to express high GC DNA. One in vitro study directly linked inefficient translation to the GC content from various organisms (9). Others have discussed implications of secondary structure and its ability to regulate and control the strength of protein expression (10, 11). The destruction of stable stem-loops by site-directed mutagenesis in the region of translation initiation of high GC genes has been shown to enhance protein expression (12). Another method introduced a short leader ORF preceding a high GC gene to improve its expression, and suggested that it is beneficial to include an AT-rich translation termination region in the leader ORF that overlaps with the start codon of the high GC target gene (13). Finally, replacing GC-rich codons from one bacterium with those more commonly used by E. coli enabled a 10-fold increase in expression of a recombinant protein for crystallization purposes (14).
To improve expression from high GC heterologous mRNAs in E. coli, it is important to consider features involved in translation. Upstream of the start codon is the 5'-untranslated region (UTR), which contains the Shine-Dalgarno (SD) sequence that is able to base pair with a complementary sequence on the 3' end of the 16S rRNA (15). However, the SD interaction was found to be nonessential for correct initiation (16, 17), indicating that prokaryotic ribosomes have other capacities for start site selection. Ribosomal protein S1, encoded by rpsA, has been shown to play critical roles in translation initiation and elongation (18, 19), and it has been shown to promote transcriptional cycling in vitro (20). S1 interacts with mRNA upstream of the SD sequence (21), and translation efficiency has been positively correlated to the presence of AU-rich sequences at this position of the UTR (22, 23). The requirement for S1 in translating mRNAs with a weak or no SD sequence has also been detailed (24). S1 is an essential protein required for the translation of most transcripts within E. coli (25), and it contains six repeats of the oligonucleotide-binding (OB) fold, contained in other RNA-binding proteins, which raise the nonspecific affinity of the ribosome to single-stranded RNA (26). In this regard, ribosomal protein S1 is an attractive target for improving the expression of GC-rich genes that cannot be efficiently translated in E. coli. We therefore employed directed evolution methods to S1 to generate S1 variants that can recognize GC-rich sequences.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
-D-galactopyranoside and isopropyl
-D-thiogalactopyranoside were purchased from Sigma. Plasmid ConstructsTo create plasmid pJRB320, E. coli rpsA under the control of its native promoter was amplified from the genome using KOD Hot Start Polymerase (Novagen). The resulting PCR fragment was cloned pZS*24MCS-1, a low copy plasmid (27). A mutation -2 bp upstream of the rpsA ATG start codon installed a unique BspHI restriction site upstream of the coding sequence. All constructs described herein were confirmed by DNA sequencing (MCLab).
To generate pTB120, part of the lactose repressor gene lacI and the Ptac promoter was removed from pJF118EH (28) and replaced with the E. coli rrnB terminator. The truncated lacZ gene missing its first 9 amino acids from pRS414 (29) was inserted downstream from the terminator. This plasmid allows for protein fusions to the BamHI site at the start of lacZ.To generate pTB130, the CAT gene was amplified from pACYC184 (New England Biolabs) by changing the start codon to CAG (30). The PCR product was inserted into pTB120 replacing the truncated lacZ gene to generate pTB130, which allows for protein fusions to CAT. Protein fusions listed in Table 2 were inserted into pTB120, and fusions to the CAT gene used for selection were inserted into pTB130. pJRB202 was created to study transcriptional fusions to lacZ. The plasmid is derived from pJF118EH (28) and contains an rrnB terminator followed by the PlacO1 promoter and UTR upstream of the complete lacZ gene. The PlacO1 promoter can be removed using XmaI and EcoRI restriction sites and replaced with the promoter and UTR sequence from another gene, maintaining 27 bp of sequence from expression vector UTR upstream of the ATG start codon, including the SD region. Transcriptional fusions listed in Table 2 were cloned into this vector, and the pJRB202 UTR is listed in supplemental Table 2.
|
-Galactosidase Measurements
-Galactosidase assays were performed in triplicate for each sample using flat bottom 96-well polypropylene plates (Grenier). 20 µl of overnight culture was diluted in 280 µl of Z-Buffer, and subsequently 4 µl of 0.1% SDS and 8 µl of chloroform were added. Plates were covered with aluminum seal film (Corning Glass) and shaken at 1400 rpm for 90 s in a plate incubator/shaker (Eppendorf). After centrifugation, 50 µl of 4 mg/ml o-nitrophenyl-
-D-galactopyranoside (dissolved in Z-buffer) was added to each well, and the
-galactosidase activity was measured in a plate reader (Bio-Tek) at 420 nm for
5 min at 30 °C. Slopes from the linear portion of the curve were obtained using KC Junior software. Activity was determined as described previously (31).
badD UTR ComparisonTo compare various features of the BadD-LacZ protein fusion, we used either the 47 bp preceding the badD gene (Rpal UTR) or the 47 bp of the PlacO1 expression vector (E. coli UTR). Additionally, we compared either the first 19 native R. palustris badD codons fused to lacZ or the same amino acid sequence optimized for the most commonly used E. coli codons (32). Finally, to test the differences in the SD sequence between the R. palustris and PlacO1 expression vector UTRs, we replaced the 14 bp preceding the ATG start from the badD UTR with the corresponding sequence from the PlacO1 expression vector (Ec RBS). Constructs were generated by annealing complementary oligonucleotides with restriction site overhangs in 1x annealing buffer (10 mM Tris, pH 8.0, 50 mM NaCl, 1 mM EDTA) and slowly cooling to room temperature from 95 °C. Annealed oligonucleotides were then diluted in ligation buffer (Roche Applied Science) and incubated with T4 polynucleotide kinase (New England Biolabs) and were ligated into the appropriate vector generated from pJRB40xba. To test the expression from these clones, constructs were transformed to BW25113-Z1, and two independent colonies per construct were grown overnight in LB plus 0.2% glucose. The following day, 15 µl were used to inoculate 3 ml of LB plus 0.2% glucose with 1 mM isopropyl
-D-thiogalactopyranoside. Cells were grown at 37 °C for 4 h until the A600 was 0.4 to 0.9, and cells were then placed on ice and used in a
-galactosidase assay as described above.
Error-prone PCRMutagenic PCR was performed using the Gene Morph II mutagenesis kit (Stratagene). 350 ng of plasmid pJRB320 or mutant variant was used as template, and the reaction was performed according to the manufacturer's protocol to obtain medium mutagenesis rates of
20-fold amplification of the target sequence.
Cloning and Mutant Library GenerationThe mutant PCR product was purified using agarose gel electrophoresis and QIAQuick PCR purification kit (Qiagen) and digested with appropriate restriction enzymes (New England Biolabs). Remaining parent template was removed using DpnI enzyme, and the purified digested product was ligated into an open vector of pJRB320 using Rapid DNA ligation kit (Roche Applied Science). 10 µl of the ligation reaction was used to transform 100 µl of XL10 Gold Ultracompetent cells (Stratagene) according to the manufacturer's protocol. The resultant library size was calculated (
8,00012,000 colonies), and clones were washed from each plate using 2.8 ml of LB medium and a sterile hockey stick. The washed library was vortexed miniprepped using QIAprep Spin Miniprep columns (Qiagen) and eluted using 1/10 concentration elution buffer. The library was subsequently concentrated under vacuum (Labconco).
Mutant Selection and ScreeningApproximately 125 ng of both rpsA library plasmid and selection plasmid containing the R. palustris DNA sequence fused to CAT (derived from pTB130) were electroporated into E. coli BW25113. Cells were plated onto LB containing appropriate antibiotics and variable Cm concentrations to restrict background growth of the library. The transformation efficiency was estimated using LB plates lacking Cm, with a target of
10,000 clones tested per selection plate. Approximately 2024 potential positive clones were identified 48 h post-transformation. Cultures were miniprepped (Qiagen) and eluted in 1/10 elution buffer and dried to
5 µl. 2 µl of the potential positive plasmid preparation were digested using NotI (New England Biolabs), which cuts the CAT fusion selection plasmid without digesting the rpsA mutant plasmid. The reaction was heat-inactivated, and 2 µl was used to transform BW25113 with addition of 1 µl of the reporter plasmid (
100 ng/µl) containing the R. palustris protein fusion to lacZ.
-Galactosidase assays were performed on these cells as stated above. The best new mutant identified by
-galactosidase assay was miniprepped and transformed into E. coli Top 10 (Invitrogen) using kanamycin selection to remove the reporter plasmid (confirmed by ampicillin susceptibility), and template was isolated for subsequent rounds of mutagenesis and DNA sequencing. Two independent colonies of each positive mutant with the appropriate protein fusion were tested in triplicate using the
-galactosidase assay, and the results reported here are the average of these data.
Minimum Inhibitory Concentration (MIC) AssayOvernight 3 ml of LB cultures containing wild type or mutant rpsA and BadR-CAT protein fusion were diluted 20-fold in fresh LB medium and grown to an A600 of 2.0. Cells were chilled on ice and then diluted in LB to a concentration of 107 cells/ml, and 1 µl of this stock was then spotted onto plates with increasing Cm concentrations ranging from 5 to 45 µg/µl. Increments of 1 µg/µl were used below 10 µg/µl, then increments of 2.5 µg/µl were used to 25 µg/µl, and finally 5 µg/µl increases in antibiotic concentration were used for the remaining plates. The MIC reported is the lowest concentration of Cm that restricted growth of a given construct after 16 h at 37 °C.
Time Course Study and Quantitative PCR60 ml of LB cultures were inoculated (1%) with overnight BW25113 transformed with the BadR-LacZ protein fusion along with either wild type, 4R, or 5R evolved rpsA constructs and grown in baffled Erlenmeyer shake flasks at 37 °C and at 200 rpm. Replicate cultures of each clone were performed. Samples were removed and chilled on ice prior to measuring the A600 or
-galactosidase activity. Samples removed during mid-exponential phase (A600 = 0.490.75) were pelleted and frozen in a dry ice/ethanol bath and stored at -80 °C until RNA extraction by RNeasy mini kit (Qiagen) following the manufacturer's protocol. Samples were diluted in 1x DNase I buffer (Ambion) and treated with 0.5 µl of RNase OUT (Invitrogen) and 0.8 µl of DNase I (139 units/µl; Invitrogen). Samples were purified using acid/phenol extraction and ethanol precipitation. First strand cDNA synthesis was performed according to protocol (Invitrogen) with 150 ng of random hexamers and 150 ng of total E. coli RNA and 1 µl of SSIII reverse transcriptase. Reactions were diluted 3-fold with water upon completion. Real time PCR was performed with gene-specific PCR primers designed using MyPROBES software (33) for lacZ and chaA, a housekeeping gene involved in potassium, sodium, and calcium ion regulation within the cell. The reactions were conducted on a Smart Cycler (Cepheid) with 2 µl of cDNA added per 25 µl of real time PCR sample using the QuantiTect SYBR green PCR kit (Qiagen). A four-step program consisting of denaturation, annealing, extension, and data acquisition was used, and melt curves verified that only one species of DNA product was amplified using the primer pairs. Calibration curves were performed for each primer pair using genomic DNA template amounts over 4 orders of magnitude, and curves with linearity R2 values of at least 0.99 were used to determine the amount of lacZ transcript relative to the chaA control.
|
| RESULTS |
|---|
|
|
|---|
3.6-fold when placed upstream of a strong SD (22). Optimizing the beginning coding sequence from R. palustris to E. coli codons enables an
2-fold increase in expression with the constructs containing the PlacO1 expression vector UTR. Finally, the effect of switching the 14 bp preceding the ATG start site from the R. palustris UTR with the corresponding region containing the SD sequence from the PlacO1 expression construct allows for a very small amount of
-galactosidase expression, showing that the presence of a strong RBS alone is not enough to rescue the translational deficiency of R. palustris transcripts in E. coli.
|
To further improve our selection of rpsA variants with enhanced activity toward high GC templates, we applied a
-galactosidase screen to our selection method. Multiple positive clones were chosen after selection on Cm plates, and the plasmids harboring the mutant rpsA was then employed in a screen using a fusion of the R. palustris gene sequence to lacZ. Positive clones were screened in parallel for
-galactosidase activity alongside the parent clone to identify the mutant with the highest activity toward its target gene. This mutant was then subjected to a subsequent round of mutagenesis, selection, and screening to continue improving translation efficiency of the R. palustris DNA sequence. Five rounds of selection and screening were performed, resulting in a mutant (5R) that enabled 12.0-fold higher
-galactosidase expression than the wild type S1 (Fig. 2a). To further investigate the ability to enhance expression of a different R. palustris gene, we used the fourth generation rpsA mutant (4R), evolved toward badR, as a starting point for increasing expression of the CrtI protein fusion. This 4R clone exhibited 2-fold higher activity toward the crtI construct than the wild type S1. After three rounds of selection, a mutant (3C) was isolated, which increased activity from the LacZ fusion by 8.0-fold (Fig. 2b). We attempted a fourth round of selection toward the crtI construct, but we were unable to isolate further enhanced variants, suggesting that the protein may be saturated for mutations that enhance CrtI-LacZ expression or that it had reached a local maximum because of previously accumulated mutations.
To remove the possibility that selected variants are specialized to translate only one DNA sequence, we adopted a strategy to evolve rpsA against multiple gene targets (Fig. 2c). We used an S1 mutant (4R) as the starting point for evolution toward the BadD protein fusion for one round of selection and screening. This clone (5D) was then selected and screened four additional rounds toward a protein fusion of BadK, the final gene in the badHK operon, including the four genes upstream of badK expressed from their native promoter and UTR (34).
Sequence Analysis of the Selected VariantsAfter each round of selection and screening, we sequenced the highest activity clone to determine newly acquired mutations. On average, we observed 13 new coding mutations per round of directed evolution (Fig. 2). The 5th round mutant evolved toward badR accumulated 10 mutations, of which 5 mutations map to the 3rd and 4th repeats of the S1 OB fold domain, whereas other mutations occurred in the 1st, 5th, or 6th domains. After three rounds of evolution toward the CrtI protein fusion, an additional four unique mutations were accumulated, three of which map to the 3rd and 4th repeats of the OB fold domain. The final mutant evolved towards multiple R. palustris constructs (9K) collected an additional 10 amino acid mutations unique from the parent clone (4R). Four mutations were lost during the creation of S1 mutant 9K, including a Q261H mutation occurring in the first round of evolution toward badK, which was again mutated in the 3rd round of selection toward this construct (H261L). Additionally, three mutations (A471V, Y477D, and A530T) occurring in the 6th repeat of the S1 domain were lost during the second round of mutation toward badK in which a frameshift mutation occurred at 1,351 bp, creating an S1 variant missing the final repeat of its S1 domain.
The Truncated S1 Mutant Improves Expression from Its RNA TargetOne S1 clone (7K) that was selected in the 2nd round of selection toward the badK operon contained a frameshift mutation (Fig. 2c). To establish whether this frameshift mutation was truly beneficial, we repaired the mutation adding on the 101 amino acid tail from the parent clone of 7K to both the 7K and 9K mutants from the 2nd and 4th round of selection toward the badK construct. In both cases the expression from the BadK-LacZ fusion was higher when expressed with the mutant rpsA containing the frameshift compared with the corresponding mutant with the repaired C terminus, indicating that this mutation was beneficial for enhanced expression from the BadK operon fusion.
-Galactosidase Rate and mRNA Levels Indicate S1 Mutants Enhance Both Transcript Abundance and Protein ExpressionTo further understand the ability of rpsA mutants to enhance expression from various R. palustris DNA sequences, we monitored
-galactosidase expression over time for the wild type S1 and two mutants isolated in the evolution toward badR (Fig. 3). Although the rpsA mutants display somewhat reduced growth rates (exponential growth doubling times of 44.6 and 49.3 min for 4R and 5R versus 38.4 min for wild type), the mutant S1 constructs allow greater expression from the foreign DNA throughout all stages of growth. We also measured transcript levels by quantitative PCR. The results (Table 1) show that there is approximately a 2.8- and 5.6-fold difference in lacZ transcript abundance levels between cells expressing the two mutants versus the wild type rpsA. These results are consistent with previous work that showed a positive correlation between enhanced translation because of changes in the UTR and increased lacZ mRNA stability (23).
|
7-fold and also improved the translation efficiency of a number of E. coli constructs, indicating that the mutations accumulated during evolution toward R. palustris DNA allow for varying improvements in translation efficiency from a variety of DNA sequences.
|
| DISCUSSION |
|---|
|
|
|---|
|
In the comparison of BadD protein fusions with either UTRs from E. coli or R. palustris, and native or optimized codon sequence, we show that in this case codon usage of the beginning ORF sequence is unable to account for the low expression of R. palustris genes in E. coli (Fig. 1). This was also the case when we co-expressed a plasmid containing seven rare tRNAs alongside multiple R. palustris protein fusions (data not shown). The 2-fold increase in expression between the construct with native R. palustris badD codons and those optimized for expression in E. coli is potentially because of the reduction in GC content of this stretch of DNA from 64.9 to 61.4%. The R. palustris UTR for badD has a particularly high GC content (83% in the 47 bp preceding the start codon), whereas the UTR from the PlacO1 construct we used as a comparison has 46.8% GC content. It is possible that the high GC content of the R. palustris genome presents a significant challenge to E. coli when initiating translation from its mRNAs.
Although the directed evolution of ribosomal protein S1 improves the translational efficiency of expression from these foreign DNA sequences, we recognize that other proteins and perhaps also the RNA core of the ribosome are likely to influence the expression of foreign DNA in the cell. Other proteins such as translation initiation factors IF1, IF2, and IF3 certainly play a role in translation initiation (11, 39, 40), and we cannot rule out the possibility of accessory proteins in E. coli or from a donor organism that may be used to stabilize mRNAs or prepare them for efficient translation. We have considered that many high GC bacteria have optimal growth rates near 30 °C, yet they have evolved to express their high GC mRNAs at temperatures that would allow for extensive formation of secondary structures within transcripts.
The inability of E. coli to efficiently translate high GC mRNAs has evolutionary implications pertaining to horizontal gene transfer. Recently, an example of genetic silencing of foreign DNA was detailed in Salmonella where the H-NS protein was shown to transcriptionally repress genes with lower GC content than that of the genome average (41). The authors showed that a low GC gene integrated to the genome of Salmonella was expressed >15-fold higher in an hns mutant compared with the wild type strain, and propose that the bacterium has evolved a mechanism to defend against the expression of foreign DNA. In this case, the necessity to guard against invading DNA is explicitly directed toward the transcription of low GC DNA as a means of distinguishing "self" from "foreign." Our data show that impediments to the expression of GC-rich DNA in E. coli are likely to reside at the translational level, obviating the need for transcriptional repression.
A group recently reported the stable cloning of the Synechocystis PCC6803 genome into Bacillus subtilis (6); however, the resultant strain had growth limitations and was unable to stably maintain the Synechocystis ribosomal RNA operon, possibly because of the increased expression of proteins toxic to the cell. Their work highlights both the growing ease at which large clusters of DNA can be transferred among bacteria and the difficulty of expressing DNA from distantly related species using unaltered transcription and translation signals. R. palustris exhibits an array of metabolically desirable reactions such as photosynthesis, nitrogen fixation, carbon fixation, and aromatic degradation, which are often clustered in discrete chromosomal loci (8). The directed evolution of rpsA for enhancing the expression of GC-rich DNA in E. coli is a tool that when combined with other techniques will enable the transfer of new metabolic phenotypes from GC-rich organisms, allowing hybrid functionalities for biotechnological applications.
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains supplemental Tables 1 and 2. ![]()
1 To whom correspondence should be addressed: 5531 Boelter Hall, 420 Westwood Plaza, Los Angeles, CA 90095. Tel.: 310-825-1656; Fax: 310-206-4107; E-mail: liaoj{at}ucla.edu.
2 The abbreviations used are: RBS, ribosome-binding site; Km, kanamycin; Cm, chloramphenicol; SD, Shine-Dalgarno; MIC, minimum inhibitory concentration; OB fold, oligonucleotide-binding fold; ORF, open reading frame; PNPase, polynucleotide phosphorylase; CAT, chloramphenicol acetyltransferase; UTR, untranslated region. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. Aliprandi, C. Sizun, J. Perez, F. Mareuil, S. Caputo, J.-L. Leroy, B. Odaert, S. Laalami, M. Uzan, and F. Bontems S1 Ribosomal Protein Functions in Translation Initiation and Ribonuclease RegB Activation Are Mediated by Similar RNA-Protein Interactions: AN NMR AND SAXS ANALYSIS J. Biol. Chem., May 9, 2008; 283(19): 13289 - 13301. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |