|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 280, Issue 32, 29073-29079, August 12, 2005
The Crystal Structure of Mlc, a Global Regulator of Sugar Metabolism in Escherichia coli*![]() From the Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
Received for publication, April 18, 2005 , and in revised form, May 31, 2005.
Mlc from Escherichia coli is a transcriptional repressor controlling the expression of a number of genes encoding enzymes of the phosphotransferase system (PTS), including ptsG and manXYZ, the specific enzyme II for glucose and mannose PTS transporters. In addition, Mlc controls the transcription of malT, the gene of the global activator of the mal regulon. The inactivation of Mlc as a repressor is mediated by binding to an actively transporting PtsG (EIICBGlc). Here we report the crystal structure of Mlc at 2.7 Å resolution representing the first described structure of an ROK (repressors, open reading frames, and kinases) family protein. Mlc forms stable dimers thus explaining its binding affinity to palindromic operator sites. The N-terminal helix-turn-helix domain of Mlc is stabilized by the amphipathic C-terminal helix implicated earlier in EIICBGlc binding. Furthermore, the structure revealed a metal-binding site within the cysteine-rich ROK consensus motif that coordinates a structurally important zinc ion. A strongly reduced repressor activity was observed when two of the zinc-coordinating cysteine residues were exchanged against serine or alanine, demonstrating the role of zinc in Mlc-mediated repressor function. The structures of a putative fructokinase from Bacillus subtilis, the glucokinase from Escherichia coli, and a glucomannokinase from Arthrobacter sp. showed high structural homology to the ROK family part of Mlc.
Mlc (makes large colonies) has been discovered as a regulator protein curbing the utilization of glucose in Escherichia coli (1, 2). Mlc, acting as a transcriptional repressor, controls the expression of malT, encoding the central transcriptional activator of the maltose system (3). In addition, Mlc controls the expression of two operons encoding PTS1-dependent transporters for glucose ptsG (4, 5) and mannose manXYZ (6) as well as the genes encoding the general components of the PTS (79). In contrast to the classical mode of repressor inactivation by a cognate inducer, Mlc is inactivated by the sequestrating interaction with the actively transporting glucose transporter, the EIICBGlc protein of the PTS (1012). The interaction occurs at the EIIBGlc domain of the transporter encompassing a critical cysteine residue (Cys-421). This cysteine residue is phosphorylated in the resting transporter and becomes readily dephosphorylated during glucose transport by the transfer of the phosphoryl group onto the incoming glucose. Mlc binds only to the dephosphorylated form of EIIBGlc (13). The membrane-bound state of EIIBGlc is essential for Mlc inactivation. Soluble EIIBGlc, even though able to interact with Mlc (12, 13), does not prevent Mlc from binding to its operator regions and from its repressing activity. However, EIIBGlc attached to the membrane by any lipophilic anchor, even unrelated to EIICBGlc, binds Mlc in a fashion that prevents binding to the operator regions (13). This indicates that Mlc, when it is in close contact with the membrane, alters its conformation to suppress operator binding.
As judged by its amino acid sequence, Mlc belongs to the ROK family (repressors, open reading frames, and kinases) (14, 15) of transcriptional regulators encompassing xylose repressors, sugar kinases, and transcriptional regulators with the widely conserved CXCGXXGCXE motif (consensus sequence 2). They also harbor another consensus motif (consensus sequence 1) consisting of 28 amino acid residues, located 9 residues upstream from consensus sequence 2 (15). The DNA-binding motif of Mlc consists of a typical helix-turn-helix motif at its N terminus, and the protein behaves in dilute buffer solution as tetramer of a polypeptide of 44.3 kDa (12, 13). The removal of the 18 C-terminal residues leads to dimer formation, to the loss of EIICBGlc binding, as well as to the loss of operator interaction (13). Thus, most surprisingly, the C terminus, which is far from the helix-turn-helix motif in the primary sequence, must be involved directly or indirectly, possibly via a large conformational change in EIICBGlc binding as well as in operator recognition and subsequent repression. Regarding its unusual mechanism of derepression, it was of interest to elucidate the crystal structure of this novel transcriptional regulator. Here we report the three-dimensional structure of dimeric Mlc R52H at 2.7 Å resolution.
Structure Determination and RefinementMlc was cloned, expressed, purified, and crystallized as described previously (16). Three data sets were collected on a selenomethionine-labeled Mlc crystal at the Swiss Light Source SLS Villigen (CH) beamline X06SA. The crystals of space group C2 with unit cell parameters of a = 235.95 Å, b = 74.71 Å, c = 154.95 Å, = 129.15° diffracted to a maximum resolution of 2.7 Å. The raw data were reduced using XDS (17). Because of the radiation sensitivity of the crystals, only the peak and inflection data sets were used for structure solution and refinement (Table I). The selenium substructure was determined with SHELXD (18), and phases were calculated with SHARP (19) using the data sets collected at the peak and the inflection wavelength. Density modification was done in RESOLVE (20), resulting in interpretable electron density maps up to 3 Å resolution. Model building was done manually with the programs "O" (21) and COOT (22). Refinement of the model was done with the program REFMAC5 (23). To use the best data for refinement, the peak and the inflection data sets were merged resulting in a better overall quality of the data (Table I). Refinement statistics and quality indicators of the resulting Mlc model are listed in Table II.
SpectroscopyThe zinc content of the Mlc protein was determined by atomic absorption spectroscopy using a Varian AA240. UV-visible spectra were obtained with a Lambda 16 spectrophotometer (PerkinElmer Life Sciences). EPR spectra were recorded at 10 K on a Bruker Elexsys 500 spectrometer equipped with an ER049 X microwave bridge and an ESR 900 helium cryostat. The sample concentrations for the UV-visible and EPR measurements were 15 mg/ml Mlc. Site-directed MutagenesisSingle point mutations were carried out using the QuickChange multikit from Stratagene according to the manufacturer's protocol using the plasmid template pQE60mlc (16) and the following phosphorylated oligonucleotide primers: 5'-cca gta tca cta aaa ttg tcc gtg aga tgc tgc aag c-3' for constructing wild type Mlc; 5'-ccg tat ggg aaa cgc gct tat gcc ggg aat cac ggc tgc-3' for the C257A/C259A double mutant labeled "AYA," and 5'-ccg tat ggg aaa cgc tct tat tcc ggg aat cac ggc tgc-3' for the C257S/C259S double mutant labeled "SYS." Final vector products were analyzed by sequencing (GATC, Germany).
ChemicalsChemicals were purchased from Fluka unless otherwise stated. Programs Used for Structural and Sequence AnalysesThe program DSSP (Definition of Secondary Structure of Proteins) (26) was used for buried surface calculations and SIM (27) for sequence alignments. The quality of the resulting Mlc model was checked with the program PROCHECK (28). PDB Accession CodeThe coordinate data sets of the structure of SeMet-Mlc R52H is available in the Protein Data Bank (29) with the accession code 1Z6R [PDB] .
Structure of the Mlc MonomerThe structure of Mlc represents the first described structure of an ROK family member. An Mlc molecule shown in Fig. 1 consists of three domains as follows: (a) a helix-turn-helix (HTH) domain (30, 31) from amino acid residues 1 to 81 + 395 to 406 (domain 1, green); (b) a smaller / -domain from residues 82 to 194 + 381 to 394 (domain 2, yellow); and (c) a larger / -domain from residues 195 to 380 (domain 3, blue). The final Mlc model contains 382 of 406 residues. Two segments in domain 1, residues 111 and 6476, were structurally disordered in all four molecules within the asymmetric unit and were therefore not included in the final model. The structurally disordered region 6476 of the HTH domain is known to be very flexible from HTH motifs described previously. It adopts its destined conformation, the so-called hinge helix, only upon binding to the operator DNA (3234). Domains 2 and 3 are common to all ROK family members (14, 15) (see Fig. 1, yellow and blue). Both / -domains (domains 2 and 3) consist of a central -sheet flanked by a pair of -helices on one side and a single -helix on the other side. Between domains 2 and 3 the polypeptide chain switches twice, so that domain 3 is formed by a continuous polypeptide, whereas the fold of domain 2 is completed by the returning C terminus from domain 3, packing as a C-terminal helix against the -sheet of domain 2 (bright yellow in Fig. 1). The interface between domains 2 and 3 is mainly formed by the two single -helices flanking the -sheet in each domain. However, the packing of both domains toward each other is not very tight, allowing the domains to adopt different conformations with respect to each other. In addition to being part of domain 2, the C-terminal helix bends and is also part of domain 1 (bright-green in Fig. 1), thereby stabilizing the orientation of the HTH domain (domain 1) with respect to domain 2. This stabilization might be the reason why the HTH part of domain 1 is structurally ordered, whereas the connecting segment, including the hinge helix in domain 1, is not. The three domains of an Mlc monomer behave as rigid groups, but the pairwise arrangement of the domains is different in the four molecules (AD) within the asymmetric unit. The differences in the domain orientations were analyzed with the program DYNDOM (35) using domain 3 of the Mlc molecule A as the reference. Several rotational axes do not coincide in all possible domain pairs of the four Mlc molecules. Domains 1 and 2 are rotated as single units in molecules B and C by 18 and 5°, respectively. In molecule D, however, domain 2 is rotated by 22°, and domain 1, with respect to domain 2, is rotated by 12° around another axis at the same time. In addition, the comparison of molecule A with molecule B shows that residues 244270 of domain 3, harboring parts of the two ROK motifs, are able to rotate separately by 14° with respect to the rest of domain 3.
The ROK Signature Forms a Zinc-binding SiteDomain 3 contains consensus motifs 1 and 2 that characterize the ROK family members (14, 15). Both motifs are highlighted in Fig. 1 as red and orange ribbons. Consensus motif 1 (Fig. 1, red) forms part of the central -sheet in domain 3, leading into a loop followed by a short 310 helix that ends with the invariant histidine His-247. Nine residues downstream, consensus motif 1 is followed by consensus motif 2 (Fig. 1, orange), starting with the conserved cysteine residues Cys-257 and Cys-259, followed by the conserved cysteine residue Cys-264. The structural explanation for the conservation of these residues is the tetrahedral coordination of a zinc ion by the four residues His-247, Cys-257, Cys-259, and Cys-264 (see Fig. 1, highlighted as a gray sphere). The presence of the zinc ion (0.9 ± 0.1 zinc per protein) was confirmed by atom absorption spectroscopy, EPR, and UV-visible spectroscopy.
The Mlc Dimer and Its Binding to DNAThe four molecules within the asymmetric unit are arranged as two homodimers AB (chains A and B) and CD (chains C and D). Fig. 2A shows AB perpendicular to the 2-fold axis, and Fig. 2B shows AB along the 2-fold axis of the dimer. The dimerization occurs via domain 3 (Fig. 2B, blue) of each monomer, burying a surface area of 1378 Å2 (in the case of AB) and 1393 Å2 (in the case of CD). In the CD dimer there is an additional contact of 303 Å2 between the HTH domains resulting from the above-mentioned Mlc flexibility and the crystal packing, but it is apparently not relevant for the dimer formation. Both dimers found in the asymmetric unit show different conformations. Although the superposition of chains A and C shows almost identical molecules, the superposition of chains B and D reveals two conformationally distinguishable dimers (see superposition in Fig. 3A). The conformational flexibility results in different distances between the two recognition helices in each dimer. Fig. 3, BD, shows the isolated HTH domains of both dimers (AB in yellow and CD in blue). In dimer AB, the distance between the recognition helices is A Tetramer of MlcEarlier studies using size exclusion chromatography indicated that Mlc forms tetramers in vitro (12, 13). In order to determine whether the biochemically described tetramer is present in the Mlc crystals as well, we investigated all intermolecular contacts of the two Mlc dimers (AB and CD) within possible asymmetric units. The most symmetric arrangement relates both Mlc dimers by a pseudo 2-fold axis via domains 3. The single contacts are relatively weak with only 600 Å2 between chains A and D and 636 Å2 between chains B and C. Both contacts taken alone are not significant for a stable multimer formation (36). On the other hand, the sum of both contacts in a dimer of dimers with 1236 Å2 could be relevant for a tetrameric structure. Nevertheless, we do not consider the crystallographic tetramer to be physiologically relevant.
Comparison of Mlc with Related StructuresA similarity search using the DALI server (37) revealed three bacterial kinases having the same structure as the ROK part of Mlc. In Table III the alignment lengths and the r.m.s. deviations of the identical C-
Most surprisingly, four of the five residues directly involved in glucose binding in Ec-GlcK and As-GMK are identical in Mlc and Bs-FrcK (Asp-195, Glu-244, His-247, and Glu266; Mlc numbering). The fifth residue, an asparagine in Ec-GlcK and in As-GMK is a histidine (His-194) in Mlc and a threonine in Bs-FrcK. However, despite the high structural similarity of the corresponding region in Mlc to the binding site for glucose in Ec-GlcK and As-GMK, Mlc does not bind glucose or glucose 6-phosphate as measured by the ammonium sulfate precipitation technique (40). The same technique readily revealed glucose binding of Ec-GlcK (data not shown). This suggested that the fifth position (His-194 in Mlc) either discriminates between different sugars or between sugar binding and non-binding. However, altering His-194 to Asn did not result in glucose binding or glucokinase activity.2
The structural homology between the monomeric forms of Mlc, Bs-FrcK, Ec-GlcK, and As-GMK indicated similar quaternary structures. Our Mlc structure clearly shows two dimers within the asymmetric unit. In Table III the contact interfaces within the crystal packings are listed, showing the same 2-fold symmetry as the Mlc dimer. According to their buried surfaces, Bs-FrcK, Ec-GlcK, and As-GMK could be able to form stable homodimers of similar architecture as well (see Table III). Structural Localization of Mlc MutantsAll mutations in Mlc characterized so far are shown in Fig. 4, highlighted by different colors. The mutant R52H has been accidentally selected on plates containing Luria Bertani (LB) medium during the cloning step for the structural analysis (16). Although the mutation is located in the recognition helix of the HTH domain, the protein still showed full repression of a ptsG-lacZ fusion (Fig. 5, gray histograms). In the presence of glucose, both the wild type Mlc and the R52H mutant show derepression of Mlc regulated genes (Fig. 5, black histograms) demonstrating that the R52H mutation neither affects repression nor induction. The latter is equivalent to the ability of Mlc to be bound by EIICBGlc. To study the role of the bound zinc ion in more detail, we constructed two double mutants by changing Cys-257 and Cys-259 into alanine or serine, respectively, resulting in Mlc C257A/C259A and Mlc C257S/C259S. Both mutant proteins showed only residual ability to repress the ptsG-lacZ fusion, pointing to a structural role of the zinc ion necessary for DNA binding (Fig. 5, gray histograms).
Seitz et al. (13) found that C-terminal deletions of Mlc influence its ability to tetramerize in vitro as well as its activity as a transcriptional repressor and its capacity to bind EIICBGlc in vitro. Although the deletion of the last nine residues (Mlc Furthermore, four point mutations of Mlc have been characterized by Tanaka et al. (41). Two of these mutants, H86R and I34V, are impaired in EIICBGlc binding (see Fig. 4). This observation indicates that Mlc might bind to unphosphorylated EIICBGlc with both the HTH domain and domain 2. On the other hand, Mlc mutants G211R and P294S show raised expression levels, but they neither influenced repression nor binding to EIICBGlc (41).
The Structure of MlcOverall, the Mlc molecule consists of three domains. Only domain 3, which contains both ROK consensus motifs, is composed of one continuous polypeptide, whereas domains 1 and 2 are completed by the back folding of the C terminus of the molecule. This structural arrangement results in a defined orientation of domain 1 with respect to domain 2. The asymmetric unit of the Mlc crystals contains four molecules that are clearly arranged as two dimers with a buried surface of 1400 Å2 in each dimer. The dimer formation between the two Mlc monomers occurs only via domain 3. Both domains 3 seem to form a stable scaffold with domains 1 and 2 flexibly attached to them. The hinge between domains 2 and 3 allows the movement of both domains 1 in an Mlc dimer with respect to each other. In this way the Mlc dimer is able to adopt different conformations. We conclude that the dimer contact is of biological relevance for three reasons. 1) The buried surface within the dimer is much larger than expected for an artificial crystal contact. 2) Mlc needs to be a dimer with the recognition helices being in proximity to bind to palindromic DNA. 3) The structurally very similar molecules Bs-FrcK and Ec-GlcK apparently form dimers of similar architecture. Furthermore, the structural similarity to bacterial sugar kinases suggests that Mlc represents a former kinase reused as a transcriptional repressor by the fusion of an HTH domain at its N terminus. What Determines the ROK Family Proteins?Mlc represents the first structure of an ROK family protein described to date. Two non-overlapping consensus motifs characteristic for ROK family proteins have been described by sequence comparison (15). Based on the Mlc structure, these two consensus motifs can be merged into a single one forming a zinc-binding site. From structural and sequence data, two similar zinc-binding motifs, GHX911CXCGX2G(C/H)XE and GHX1117CX2HX2CXE, can be distinguished in ROK family proteins (metal-binding residues highlighted in boldface). Mlc and most of the published protein sequences of ROK family proteins contain the first zinc-binding motif, whereas the second one is found only in a minority of the investigated sequences. The underlined residues are in the same position in the ROK proteins Mlc, Bs-FrcK, and in the non-ROK proteins Ec-GlcK and As-GMK. These two residues are involved in glucose binding in Ec-GlcK and As-GMK and may serve as sugar-binding residues in ROK proteins as well. Apparently, ROK proteins need a structural zinc ion to keep these two residues in place, whereas non-ROK sugar kinases without a metal-binding motif found another way to stabilize those residues, e.g. by a helical structure as found in Ec-GlcK. Structure-based sequence alignments of the four structures Mlc, Bs-FrcK, Ec-GlcK, and As-GMK, show that these molecules resemble each other much more than expected by pure sequence comparison analysis (data not shown). The fold of both group II sugar kinases (Protein Families Data base (42) accession number PF02685) and group III (ROK) sugar kinases (Protein Families Data base accession number PF00480) is basically the same. Thus, we believe that both types of sugar kinases as well as repressors, with an HTH domain fused to the N terminus, evolved from a common ancestor in two lineages, one with a structural zinc ion and the other one without. Interaction of Mlc with DNAThe dimeric structure of Mlc explains its ability to bind to single palindromic operator sites. The adaptation of the Mlc dimer to its operator sites is not achieved by flexible HTH domains but by the hinge within the ROK part of the molecule, with domains 1 and 2 moving as one rigid group with respect to domain 3. From band shift assays and DNase digestion experiments, it is known that only ptsG has two operator sites upstream of the coding region, whereas all others have only one (43). Mlc has not been observed to form a DNA loop known from experiments with other tetrameric repressors such as LacI or NagC (32, 44, 45), suggesting that Mlc only needs to be a dimer for DNA binding even though it has always been found as a tetramer in dilute buffer solutions in vitro (12, 13).
Seitz et al. (13) demonstrated that the deletion mutant Mlc Mutations within the zinc-binding motif (C257A/C259A called AYA and C257S/C259S called SYS) dramatically impair the ability of Mlc to repress its operator site (Fig. 5, gray histograms). The mutant proteins were not degraded, indicating structural stability. Although the zinc-binding motif is located in domain 3 without direct contact to domains 1 or 2, the coordination of the zinc ion apparently plays an important structural role for the correct orientation of the HTH domain.
The Problem of TetramerizationNative Mlc forms tetramers in vitro (12, 13). The two Mlc dimers in our crystals can be arranged as a tetramer; however, mutational data argue against the tetramer configuration in the crystals. Seitz et al. (13) demonstrated that the Mlc
Interaction of Mlc with EIICBGlcSo far nothing is known about the stoichiometry of an Mlc-EIICBGlc complex. It is unclear whether Mlc binds to the EIICBGlc complex as a tetramer or as a dimer. The Mlc A Model for the Function of Mlc in Controlled Gene ExpressionWe propose the exposure of the amphipathic helix at the very C terminus of Mlc to be the underlying mechanism by which Mlc switches from its active state (being a transcriptional repressor bound to the operator sites of Mlc-regulated genes) to its inactive state (being sequestered by binding to the EIICBGlc transporter). Thus, in the repressor mode, the HTH motif in the dimeric Mlc (see Fig. 2A) is stabilized by the C-terminal amphipathic helix with respect to domain 2. The hinge region between domains 2 and 3 helps the two recognition helices to achieve the correct distance for effective interaction of an Mlc dimer with the major groove of the palindromic operator sites (see Fig. 3). In its inactive mode, the HTH motif rotates, forming the EIICBGlc-binding site and at the same time exposing the C-terminal amphipathic helix, which we propose to be the basis for tetramerization. There has to be an equilibrium between the two states of Mlc. The finding that Mlc forms a tetramer in dilute buffer solutions at pH 7.5 indicates that Mlc is mainly in its tetrameric form ready to be bound by dephosphorylated EIICBGlc when EIICBGlc is transporting glucose. Our hypothesis that tetrameric Mlc binds to dephosphorylated EIICBGlc is consistent with the high number of EIICBGlc molecules present in the bacterial cell versus the very small number of operator sites, the potential targets of the dimeric Mlc. Apparently, the low proportion of dimeric Mlc is sufficient to shut down transcription in vivo. The effective dimer concentration of Mlc drops (by reforming tetramers) only when the bulk of Mlc tetramers is removed by binding to dephosphorylated, glucose-transporting EIICBGlc, thus allowing the release of Mlc from its specific operator sites. Note Added in ProofRecently, the coordinates of the Mlc homolog from vibrio cholerae have been deposited at the PDB (accession code 1Z05 [PDB] ) by Minasov, G., Brunzelle, J. S., Shuvalova, L., Collart, F. R., Anderson, W. F., Midwest Center for Structural Genomics (2005). The structure is very similar to that of Mlc from E. coli, including the dimeric structure discussed here.
The atomic coordinates and structure factors (code 1Z6R) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by the Deutsche Forschungsgemeinschaft. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 The abbreviations used are: PTS, phosphotransferase system; Ec-GlcK, glucokinase from E. coli; Bs-FrcK, putative fructokinase from B. subtilis; As-GMK, inorganic polyphosphate/ATP-glucomannokinase Arthrobacter sp.; ROK, repressors open reading frames and kinases; HTH, helix-turn-helix; r.m.s., root mean square; PDB, protein data bank; wt, wild type; MES, 4-morpholineethanesulfonic acid.
2 M. Erhard, unpublished results.
We thank Günter Fritz for the metal analysis, the staff at the synchrotron beamline X06SA at the Swiss Light Source in Villigen/Switzerland for their support, and Jacqueline Plumbridge from the Institut de Biologic Physico-chimigue in Paris/France for helpful discussions.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||