|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 52, 37717-37729, December 28, 2007
The Structure of a Full-length Response Regulator from Mycobacterium tuberculosis in a Stabilized Three-dimensional Domain-swapped, Activated State*![]() 1![]() ![]() ![]() ![]() ![]() 2
From the
Received for publication, June 20, 2007 , and in revised form, September 26, 2007.
The full-length, two-domain response regulator RegX3 from Mycobacterium tuberculosis is a dimer stabilized by three-dimensional domain swapping. Dimerization is known to occur in the OmpR/PhoB subfamily of response regulators upon activation but has previously only been structurally characterized for isolated receiver domains. The RegX3 dimer has a bipartite intermolecular interface, which buries 2357 Å2 per monomer. The two parts of the interface are between the two receiver domains (dimerization interface) and between a composite receiver domain and the effector domain of the second molecule (interdomain interface). The structure provides support for the importance of threonine and tyrosine residues in the signal transduction mechanism. These residues occur in an active-like conformation stabilized by lanthanum ions. In solution, RegX3 exists as both a monomer and a dimer in a concentration-dependent equilibrium. The dimer in solution differs from the active form observed in the crystal, resembling instead the model of the inactive full-length response regulator PhoB.
Tuberculosis is a global threat to human health, with one-third of the world's population infected with the causative agent of tuberculosis, Mycobacterium tuberculosis. With the emergence of multidrug-resistant strains of M. tuberculosis, novel therapeutic agents are urgently required. This will be best achieved by developing a better understanding of the molecular biology of this pathogen. In this context, signaling systems play an important role in the continued survival of the organism and are central to the way M. tuberculosis responds to environmental stress, especially that generated by the host immune system.
The predominant signaling system for adaptive gene expression changes in bacteria is called a two-component system (TCS).3 TCS are found in eubacteria, archaea, and eukarya; however, they are rare in eukarya. As the name implies, the system typically consists of two proteins: a sensor histidine kinase (SK) and a response regulator (RR) (1). SKs are usually membrane-anchored proteins with a characteristic core consisting of a histidine-containing dimerization domain and a catalytic domain with ATPase activity. In response to extracellular conditions, and in some cases intracellular conditions, the SK autophosphorylates at a histidine residue within the dimerization domain. This autophosphorylation occurs upon dimerization through a phosphate transfer from the catalytic domain (annotated as a HATPase_c domain by the SMART server (2, 3)) of the adjacent protomer to the conserved histidine in the dimerization domain (HisKA domain). The activated SK may then function as a phosphate donor to a universally conserved aspartic acid residue in the RR. RRs typically consist of two domains, a receiver and an effector domain, although a large family of
Some TCS appear to be essential for the survival of bacteria, and to date homologous TCS have not been identified in the animal kingdom. TCS are therefore considered attractive targets for the development of novel agents that could alter the response of the bacterium to its detriment. More than 180 different TCS have been identified in bacteria (5); however, the number varies enormously between species. M. tuberculosis contains 30 genes encoding TCS proteins (6), 12 complete TCS, and several orphan TCS proteins possibly belonging to the SK and RR families (6, 7). Of these 12 complete TCS, all but five have been found deleted or exist as pseudogenes in the closely related intracellular pathogen Mycobacterium leprae (8). The TCS SenX3-RegX3 is one of the five conserved TCS, emphasizing its importance for the survival of mycobacteria (not just the virulent strains) in hostile environments. Previous studies carried out on this TCS have revealed that both SenX3 and RegX3 communicate in the normal fashion, with His167 of SenX3 being the active histidine residue that is phosphorylated and Asp52 being the corresponding phosphorylation target in RegX3, the cognate RR (9). The production of M. tuberculosis strains with a compromised senX3-regX3 operon (through the removal of the 3' end of the senX3 gene, the intergenic repeats, and the 5' end of the regX3 gene) has shown that an intact SenX3-RegX3 TCS is required for a progressive infection (10). Microarray experiments comparing M. tuberculosis strains with and without the senX3-regX3 operon revealed 50 possible regulon members (10). One operon in particular, Rv0096-Rv0101, appears to be directly, negatively regulated with expression found to be 5-17-fold increased in the absence of the SenX3-RegX3 TCS (10). Further experiments using M. tuberculosis strains with a compromised senX3-regX3 operon displayed significant attenuation in active and resting macrophages as well as in immunocompromised and immunocompetent mice, thus demonstrating that the senX3-regX3 operon is involved in the virulence of M. tuberculosis (10). Recently, experiments in Mycobacterium smegmatis revealed that the SenX3-RegX3 system responds to limiting environmental phosphate concentrations by inducing the phoA and pstS genes (11). In E. coli, these genes are regulated by another TCS, PhoR-PhoB (12, 13). Through DNA-binding experiments, it was shown that phosphorylated RegX3 directly regulates phoA and pstS, in M. smegmatis, by binding to their promoter regions. Previously, RegX3 has only been demonstrated to have DNA-binding properties with the promoter region of senX3 (9). A putative "pho-box" for RR binding has been derived for M. smegmatis consisting of an inverted repeat (GTGAAC) separated by seven nonconserved nucleotides (11).
Most RRs are multidomain proteins. All RRs contain structurally similar receiver domains with a characteristic β1- Three-dimensional domain swapping is becoming increasingly more evident in the Protein Data Bank as more structures are deposited. Three-dimensional domain swapping describes the formation of a symmetry interaction between two molecules via the exchange of identical regions. The region that is swapped can be a single element of secondary structure or an entire globular domain (30). Although many proteins display this phenomenon, the functional and physiological relevance is often not understood. Some potential advantages attained through domain swapping include higher local concentrations of active sites, larger binding surfaces, new active sites at subunit interfaces, the possibility of allosteric control, and an economic way to produce a large protein interaction network (31). Here we present the structure of the full-length RR, RegX3, stabilized in what we believe to be an active form that exhibits three-dimensional domain swapping. As such, it would represent the first full-length RR to be captured in an activated dimeric state and to display three-dimensional domain swapping. We also present an analysis of the solution structure of RegX3 in an inactive form as both a monomer and dimer.
Cloning and Expression—The gene sequence encoding RegX3 was cloned by PCR from genomic Mycobacterium tuberculosis (H37Rv strain) DNA. Flanking DNA primers 5'-TATACCATGGCAACCAGTGTGTTGATTGTGGAGG-3' and 5'-CAGAAGCTTATTACTAGCCCTCGAGTTTGTAGCCCAGCCCGCGC-3' were engineered to provide melting temperatures of 65 °C with extensions encoding NcoI and HindIII (New England Biolabs, Frankfurt, Germany) restriction sites. The corresponding PCR product was ligated with T4 DNA ligase (New England Biolabs, Frankfurt, Germany) into a pETM11 (EMBL) expression vector with an N-terminal His6 tag. The forward primers inserted a GCA codon immediately following the start codon to preserve the reading frame. Following sequence confirmation, the plasmid was transformed into Rosetta(DE3)pLysS cells. Cells were induced with 1 mM isopropyl 1-thio-β-D-galactopyranoside at A600 = 0.7 and grown for 5 h at 25 °C. The cells were harvested by centrifugation and lysed by sonication in a buffer containing 50 mM Tris·HCl, pH 8.0, 200 mM NaCl, 5 mM β-mercaptoethanol (buffer A), and two tablets of protease inhibitor mixture (Roche Applied Science). The lysate was spun down and loaded onto a 5-ml nickel-nitrilotriacetic acid affinity column (Qiagen) equilibrated in buffer A. The protein was eluted with a 100-ml linear gradient of 10-250 mM imidazole in buffer A. The fractions containing RegX3 were pooled and diluted to a final imidazole concentration of 50 mM. The protein was then mixed with (His-tagged) tobacco etch virus protease in a 1:20 ratio and incubated at 4 °C overnight in order to cleave the His6 tag. The digested protein was applied to the nickel-nitrilotriacetic acid affinity column previously equilibrated in buffer A containing 50 mM imidazole. The flow-through fractions were collected, concentrated to a final volume of 5 ml, and further purified on a HiLoad 16/60 Superdex 75 column (Amersham Biosciences) equilibrated with 20 mM Tris·HCl, pH 8.0, 100 mM NaCl, and 2 mM dithiothreitol (buffer B). The peak fractions were concentrated using a 10-kDa cut-off Centricon (VIVASPIN) to a final concentration of 10 mg·ml-1 as measured by Bradford's assay (Bio-Rad) (32). Crystallization, Data Collection, and Processing—Crystallization of RegX3 was carried out at 19 °C using the hanging drop vapor diffusion method. The initial conditions (0.2 M calcium acetate, 18% polyethylene glycol 8000, 0.1 M sodium cacodylate at pH 6.0) were identified using the Crystal Screen from Hampton Research.
Following optimization, the best crystals were obtained after 5 days in a mixture containing 2 µl of protein solution with 2 µl of reservoir buffer. The reservoir buffer consisted of 0.2 M calcium acetate, 2-4% polyethylene glycol 4000, and 0.1 M sodium cacodylate at pH 6.5. These crystals diffracted to a maximum resolution of 6.0 Å. To obtain better quality crystals, it was necessary to mix the protein drop with LaCl3 to obtain a final concentration in the crystallization drop of 0.01 M La3+. For data collection, the crystals were transferred to a drop-containing reservoir solution mixed with 2-methyl-2,4-pentendiol to a final concentration of 20% and then flash-cooled to 100 K in a cold nitrogen gas stream. Two data sets were collected: one at The data were indexed and integrated using DENZO (33) and scaled using SCALEPACK (33). The precision-indicating merging R factor was calculated using the program RMERGE (34). Intensities were converted to structure amplitudes using the program TRUNCATE within the CCP4 program suite (35). The crystals belong to the space group P4212 with one molecule in the asymmetric unit. Data collection details are summarized in Table 1.
Structure Determination—The structure was solved to a resolution of 2.6 Å in the P4212 space group using the single anomalous diffraction protocol of AUTO-RICKSHAW, an automated crystal structure determination platform (36). Within this procedure, the structure factors were calculated from the measured intensities by employing TRUNCATE, and the five lanthanide positions were located using SHELXD (37). The positions of these ions were refined using the program MLPHARE (35), and the resultant phases were subjected to density modification using DM (38). Although the maximum resolution was 2.6 Å, the solvent content (63%) was relatively high, and model building was performed using the program ARP/wARP (39). The resultant model contained 184 residues of 228. In the last cycle of the ARP/wARP, 144 residues were docked with the correct sequence. The model was completed with manual building using COOT (40) alternating with additional REFMAC5 (41, 42) refinement cycles. The refinement cycles included a bulk solvent correction and anisotropic scaling and used the entire molecule as a single rigid body for TLS refinement. The resultant refined coordinates were then used to refine against the 2.03 Å data set using REFMAC5. The overall geometric quality of the model was assessed using the program PROCHECK (43). The refined coordinates and the structure factors for RegX3 have been deposited in the Protein Data Bank (Protein Data Bank code 2OQR). Data collection and refinement statistics are shown in Table 1. The superimpositions and r.m.s. deviation calculations were carried out using the programs COOT and SSM (44). All molecular images were generated using PYMOL (45).
Small Angle Scattering—Synchrotron x-ray scattering data from solutions of RegX3 were collected at the X33 EMBL beamline using an MAR345 image plate detector (46). The scattering patterns were measured with a 3-min exposure time for multiple solute concentrations ranging from 2.7 to 15.3 mg·ml-1. No indication of radiation damage was detected. This was checked by the comparison of duplicate scattering patterns from 2-min exposures. Using a sample-detector distance of 2.7 m, the momentum transfer range, 0.009 < s < 0.5 Å-1, was covered (where s = 4
The data were processed using standard procedures by the program package PRIMUS (47). The forward scattering I(0) and the radii of gyration Rg were evaluated using the Guinier approximation (48), assuming that at very small angles (s < 1.3/Rg), the intensity is represented as I(s) = I(0)exp(-(sRg)2/3). The maximum particle dimensions (Dmax) and the interatomic distance distribution function p(r) were computed using the program GNOM (49). The molecular mass (MM) of the solute at various concentrations was evaluated by a comparison of the forward scattering with that from a reference solution of bovine serum albumin (MM = 66 kDa). Particle shape at low resolution was reconstructed ab initio using the program DAMMIN (50), which represents the particle as a collection of M >> 1 densely packed beads inside a sphere with the diameter Dmax. Each bead belongs either to the particle or to the solvent, and the shape is described by a binary string of length M. Starting from a random string, simulated annealing is employed to search for a compact model that fits the experimental data Iexp(s) to minimize the discrepancy
(sj) is the experimental error of Iexp(sj) at the momentum transfer sj. The program GASBOR (51) was also used to create ab initio models consisting of dummy residues instead of beads. Simulated annealing is employed to construct a model with protein-like distribution of beads providing the best fit to the experimental data. For the ab initio and rigid body analyses, multiple runs were performed to verify the stability of the solution, and typical three-dimensional reconstructions are presented below. The scattering patterns from atomic models were computed using the program CRYSOL (52). Rigid body modeling was performed by the global refinement program SASREF (53). The full-length RR, DrrD, was used as a template and was broken into two domains: the receiver (residues 1-118) and the effector (residues 119-217). Based upon the dimerization interface reported for the inactive PhoB structure (Protein Data Bank code 1B00), a model was generated that represented the optimal spatial configuration of the domains against the scattering data recorded at the highest protein concentration (15.3 mg·ml-1). The program OLIGOMER (47) was used to calculate the ratio of monomeric and dimeric species present in solutes at intermediate concentrations (between 4.8 and 11.3 mg·ml-1). The ratios were determined based upon the scattering intensities of monomeric DrrD (Protein Data Bank code 1KGS [PDB] ) calculated by CRYSOL and the model of the dimeric protein constructed by SASREF.
The structure of the full-length RegX3 RR was determined using experimental phases obtained from single wavelength ( = 1.5 Å) anomalous dispersion data from a single crystal co-crystallized in the tetragonal space group P4212 with lanthanum chloride. Phasing information from the five lanthanum ions in the asymmetric unit was used to calculate the initial experimental electron density map. Following manual building of the initial model, the structure was refined to 2.6 Å. The resulting coordinates were then used to refine against 2.0 Å data collected at = 0.813 Å from a second crystal co-crystallized with lanthanum chloride. The final model was refined with REFMAC5 to a final Rwork of 0.178 and Rfree of 0.213 (Table 1). All residues were visible except for the initial methionine, the final glycine, and the first two residues prior to the methionine, which were introduced during cloning. The resulting electron density map shows strong electron density for the five La3+ ions.
Overall Structure—The overall structure, shown in Fig. 1, consists of two domains: the receiver domain, which at first sight has the expected secondary structural elements although in a totally new spatial arrangement, and a wHTH DNA binding domain characteristic of the OmpR/PhoB RR subfamily (14-16). The structure contains five lanthanum ions that were identified and distinguished from other possible metal ions by their anomalous signal. Of the five ions, one occupies the active site, two interact between the domain-swapped receiver domain and the effector domain, and the remaining two stabilize the crystal lattice (see below). The largest contact area between molecules is mediated by the
A typical receiver domain has an /β topology that consists of a central five-stranded β-sheet (β2-β1-β3-β4-β5), which forms a hydrophobic core surrounded by two helices ( 1 and 5) on one side and three ( 2- 4) on the other side. The monomer of RegX3 in the asymmetric unit has an apparently incomplete receiver domain with only a four-stranded parallel β-sheet (β2-β1-β3-β4) and three -helices, of which 1 is positioned on one side of the β-sheet and 2- 3 are positioned on the other side (Fig. 1). The linker region connecting the two domains is made up of 4 and β5, which, as stated above, form a strong dimerization interface. The three-dimensional domain-swapped element, namely the incomplete receiver domain (β1, 1, β2, 2, β3, 3, and β4) from molecule 2 of the crystallographic dimer, interacts with 4, β5, and 5 from molecule 1. Strand β5 of molecule 1 aligns parallel to the domain-swapped, four-stranded β-sheet of molecule 2, forming a composite five-stranded β-sheet. Helix 4 of molecule 1 lies just outside of β5 on one side of the composite β-sheet with 1 and 2 from the domain-swapped element, whereas 5 from molecule 1 lies on the other side of the composite β-sheet (Fig. 2). The new composite receiver domain is now complete, with three -helices on the other side of a five-stranded β-sheet. When the composite receiver domain is superimposed with other previously characterized receiver domains from the OmpR/PhoB family (Fig. 3A), such as the active PhoB receiver domain (29), the inactive PhoB receiver domain (28), DrrD (22), DrrB (23), PrrA (21), and MtrA (20), the secondary structures occupy similar positions with an overall r.m.s. deviation between C atoms ranging from 1.45 to 2.19 Å (Table 2). Not surprisingly, the region of highest variance is the dimerization interface composed of 4β5 5.
The effector domain of RegX3 has a typical wHTH fold, consisting of a four-stranded antiparallel β-sheet followed by a three-helix bundle and a C-terminal β-hairpin (Fig. 2). The DNA recognition helix ( 8) is fully exposed to the solvent, as is also the case for the OmpR/PhoB family members DrrB (23) and DrrD (22) but unlike the other structurally characterized full-length family members PrrA (21) and MtrA (20).
The Receiver Domain—The active site of all RRs is in a crevice in the receiver domain. In the case of RegX3, this is formed by the β5
The signature switch residues known to be crucial in signal transduction by the rearrangements resulting from phosphorylation are serine/threonine and phenylalanine/tyrosine located in the β4 4-loop and in β5, respectively (54, 55). In inactivated RRs the tyrosine or phenylalanine occupy positions facing away from the active site in gauche- (OH group points outward from the domain, exposed to solvent) and gauche+ (OH group points toward the C terminus of 4) positions. This is believed to be due to the outward orientation of the threonine/serine residue on the β4 4-loop. In RegX3, these residues are Thr79 and Tyr98, respectively. Thr79 is from the three-dimensional domain-swapped element, whereas Tyr98, like Lys101, is not. In the RegX3 structure, Thr79 is orientated toward the active site in a position that would coordinate phosphate, if it were there, whereas Tyr98 adopts an inward trans-position, occupying the space freed by the inward rotation of Thr79 (Fig. 5). La-1 is coordinated by the carbonyl oxygen atom of Tyr98. This La3+ ion is positioned between 4 and β5. It also displays nonadentate coordination, interacting directly with both carboxylate oxygen atoms of Glu86, a single carboxylate oxygen atom from Asp97, and the carbonyl oxygen atom from Tyr98, the switch residue. Five water molecules complete the coordination (Fig. 4A). The coordination of the carbonyl oxygen atom of Tyr98 forces the aromatic ring to be orientated inward in the active state position.
Dimerization Interface—The total buried surface area created as a result of dimerization is 2357 Å2/monomer shared equally between two distinct subinterfaces. The first, which we refer to as the dimerization interface, buries 1169 Å2/monomer and is formed between the two composite receiver domains. It is centered on the helices that pack about the crystallographic two-fold axis (Fig. 6A). The second interface is formed between the composite receiver domain (consisting of the N-terminal, three-dimensional domain-swapped element, 4, β5, and 5) and the effector domain (Fig. 6B). The dimerization interface contains four salt bridges per monomer with a forked salt bridge within the active site between E8 (β1 1-loop) and D52 (C terminus of β3) from the domain-swapped element of molecule 2 and Lys101 ( 4β5-loop) from molecule 1. A salt bridge is also formed between Asp96 ( 4β5-loop) from molecule 2 and Asp68 ( 3β4-loop) from molecule 1. The extended five-stranded β-sheet created by the three-dimensional domain swapping is stabilized by an extensive network of hydrogen bonds, which further strengthen the interface. Asp68 (C terminus of 3) from the domain-swapped fragment of molecule 2, already participating in a salt bridge, is further stabilized by Gly94 ( 4β4-loop) from molecule 1. Ile76 (β4), also from the domain-swapped element, stabilizes Asp96 (β5) from molecule 1 by forming two hydrogen bonds. One hydrogen bond is formed between the carbonyl oxygen atom of Ile76 and the main-chain nitrogen atom of Asp96. The other bond is formed between the main-chain nitrogen atom of Ile76 and one of the carboxylate oxygen atoms of Asp96 that is already involved in a salt bridge. Two hydrogen bonds are formed between both the carbonyl oxygen atom and main-chain nitrogen atom of Val78 (β4) from molecule 2 with Asp97 and Val99 (β5) from molecule 1, respectively. The last two hydrogen bonds are formed between the main-chain nitrogen atom and carbonyl oxygen atom of Ala80 (β4) from molecule 2 and the carbonyl oxygen atom of Val99 (β5) and the main-chain nitrogen atom of Lys101 ( 4β5-loop) from molecule 1, respectively. The last salt bridge involved in the dimerization interface is formed between Asp97 (β5) and Arg111 ( 5) both from molecule 1, ensuring the two secondary structure elements maintain close contact as they complete the composite receiver domain. The hydrophobic surface formed by Leu108, Ile109, Ile112, Val115, and Leu116 of 5 protects an otherwise exposed hydrophobic surface of the receiver domain formed by 1 and the central β-sheets. In other RRs, this hydrophobic surface is protected by both 4 and 5; however, in RegX3, 4 acts as a linker packing against the 4 of the dimeric partner. The two helices pack against each other on a hydrophobic interface made by Ile85, Val88, Val89, and Leu93. The N-terminal residues of each 4 helix play important roles in completing the active site of their own molecule.
Interdomain Interface—We suggest that the model of RegX3 provides the first high resolution structural data of the interdomain contacts occurring between the regulatory and the effector domains for a member of the OmpR/PhoB subfamily in an activated state. The interaction is formed by 5 from the composite receiver domain with the β7β8-loop and C terminus of 7 from the effector domain. This interface is the second subinterface of the three-dimensional domain-swapped dimer and buries a total surface area of 1188 Å2/monomer. This is significantly larger than that in previously characterized full-length RRs, PrrA (820 Å2), MtrA (738 Å2), DrrB (751 Å2), and DrrD (245 Å2).
The RegX3 interdomain interface is most similar to the interface of DrrD in that it involves the face of
Another contact point between the two domains occurs at the position of La-2, which lies between
The Effector Domain—The wHTH fold contains the structural and sequence elements responsible for DNA recognition. OmpR and PhoB are the founding members of this family. The isolated structures of the OmpR (15) and PhoB (18) effector domains complexed with DNA as well as the uncomplexed but full-length structures of DrrB (22), DrrD (23), and PrrA (21) have all been described in the literature. The RR subfamily differs from other wHTH non-RR proteins in that they have an N-terminal, four-stranded, antiparallel β-sheet platform and a large loop between the positioning helix ( A single residue in the effector domain (Asn148) is found in the disallowed region of the Ramachandran plot (Table 1). This residue is located at a β-hairpin on the β8β9-loop and is involved in main-chain hydrogen bonding but in a region with poor electron density.
Small Angle X-ray Scattering—We believe that the crystal structure of RegX3 reflects an activated RR dimer generated by the presence of lanthanum ions rather than through phosphorylation. Small angle x-ray scattering (SAXS) was used to analyze the protein in solution in the inactive state without the presence of metal ions. The addition of even minimal amounts of lanthanum to the protein solution led to a precipitation of the solute within minutes, and, moreover, the buffer containing La3+ yielded an extremely strong background. These factors made SAXS measurements on lanthanum-containing proteins impossible. RegX3 was therefore analyzed in lanthanum-free solutions at five different concentrations ranging from 2.7 to 15.3 mg·ml-1 (see Fig. 9A). The overall parameters of the solutes (Rg, MM, and Dmax) were determined for each measurement and showed a clear tendency to increase with concentration (see Fig. 9B). Based upon a comparison of the experimental MM with that determined from the scattering curves and an increasing Rg, it is evident that inactivated RegX3 exists as both a monomer and a dimer in solution. A monomer was observed at the lowest concentration (Rg = 25.6 ± 0.5 Å, MM = 22.2 ± 2 kDa, Dmax = 82.0 ± 5 Å) with a dimer evident at the highest concentration (Rg = 34.8 ± 0.5 Å, MM = 47 ± 5 kDa, Dmax = 135 ± 10 Å). Indeed, the latter MM value matches that of dimeric RegX3 (49.7 kDa). Ab initio models of the dimer were constructed from the scattering curve of RegX3 at the highest concentration measured using different programs (DAMMIN and GASBOR) with and without two-fold symmetry imposed. The overall shape in solution, derived by GAS-BOR with two-fold symmetry applied, appears linear in a "trans-shaped" dimer (Fig. 8B) compared with the "cis-shaped" crystal dimer (Fig. 8A). The elongated trans-shape is strikingly similar to the proposed model of the inactive full-length PhoB structure (29), where the positioning of the effector domains in opposite directions provides an additional means of inhibition. We then used, as prior knowledge, the inactive PhoB dimerization interface contacts and the full-length RR DrrD for rigid body modeling against the 15.3 mg·ml-1 RegX3 SAXS data. Several runs of the program SASREF, assuming two-fold symmetry, produced consistent results and a good fit (
The crystal dimer has two-fold symmetry, which is inconsistent with binding to both parts of a DNA direct repeat, but could conceivably bind to both elements of a DNA inverted repeat. This agrees with recent work on the SenX3-RegX3 system in M. smegmatis (11). Note that the crystal dimer fails to provide a reasonable fit to the SAXS data ( = 9.6) (Fig. 9A).
The scattering data from 4.8 mg ml-1 solutions of RegX3 yield a molecular mass of 22.2 ± 2.2 kDa, consistent with a monomer (24.8 kDa). Moreover, the scattering curve at this concentration agrees well (
The crystallization of multidomain RRs has proven difficult, probably due to conformation variability. Based on information from the full-length structures that have been determined, it is evident that (at least) two distinct structural subclasses exist. These subclasses are based upon the accessibility of the DNA recognition helix. The structures of DrrD, DrrB, and now RegX3 all exhibit accessible recognition helices and hence belong to the "open" structural subclass (Fig. 7A). PrrA and MtrA, on the other hand, are very compact structures with inaccessible recognition helices and therefore belong to the "closed" subclass (Fig. 7B). To date, all of the known full-length RRs have been structurally characterized in an inactive state, and the only structural information about the activation mechanism has come from isolated activated receiver domains. In this paper, we have presented the first full-length structure of a response regulator that not only supports the dimerization hypothesis proposed by Toro-Roman et al. (56) but also exhibits three-dimensional domain swapping. Although crystals were not grown in the presence of any activating substances, such as BeF-3, we believe, based upon the position of a number of important residues surrounding the active site, the length of helix 4, and the existence of a dimer, that the structure we observe represents RegX3 driven to an active state by the presence of lanthanum ions. Three-dimensional Domain Swapping—The formation of a three-dimensional domain-swapped dimer requires the breaking of many interactions within the monomer in order for the domains to be swapped. In the case of RegX3, it also requires the partial unfolding of the receiver domain, which must be a process with a relatively high energy of activation. This leads to speculation as to what conditions would be conducive to the formation of a three-dimensional domain-swapped dimer. The majority of three-dimensional domain-swapped dimers in the past have been formed in vitro at high protein concentrations under environmental conditions that favor unfolding. These conditions exist in many crystallization experiments and are a likely reason for the observation of this and other three-dimensional domain-swapped dimers. In RegX3, the partial unfolding may be due to the LaCl3 present in the crystallization conditions, which is subsequently stabilized in the crystal lattice. The in vivo conditions that lead to three-dimensional domain swapping are still largely unknown. RRs are DNA-binding proteins, and it is possible that the domain-swapped dimer represents a form able to recognize alternative regulons. Three-dimensional domain swapping has previously been observed in the RR Spo0A (Protein Data Bank code 1DZ3) (57), albeit only for the isolated receiver domain. The result of the domain swapping, however, was a nonfunctional receiver domain with an incomplete active site. In contrast, RegX3 appears to be structurally complete. This observation is based upon the formation of a complete composite receiver domain, a complete active site with all of the residues necessary to stabilize a phosphate group, the formation of the dimer on a highly conserved interface that is strongly linked to dimer formation, and the presentation of the DNA recognition helices in an orientation capable of interacting with DNA.
As a result of the three-dimensional domain swapping, two new functional units are created, each consisting of a partly domain-swapped, composite receiver domain and an effector domain. The buried surface area per monomer consists of two parts of roughly equal size and is much greater than that seen in other full-length RRs or in isolated receiver domains. One interface region, the dimerization interface, consists of the elements
Dimerization Interface—Although three-dimensional domain swapping upon activation may not occur in other OmpR/PhoB subfamily members, the fact that the same structural elements (
Multiple sequence alignments of members of the OmpR/PhoB subfamily as well as other RR families reveal that the key residues involved in the
The largest variation seen when aligning the composite receiver domain with other characterized receiver domains is in the orientation of
Helix
This helix in all receiver domain structures, apart from DrrD and RegX3, is positioned parallel to the central five-stranded β-sheets. It is proposed that the helix packs alongside
In RegX3, the role of Switch Residues—The key switch residues in RegX3, as mentioned previously, are Thr78 and Tyr98. Although structures of receiver domains alone in an activated state exist, such as those of PhoB (29), ArcA (4), KdpE (56), TorR (56), and MicA (60), RegX3 represents the first full-length RR structure from the OmpR/PhoB family in what we believe represents an activated state. RegX3 is forced into an active state through its interaction with one of the La3+ ions. La-1 is coordinated by the carbonyl oxygen atom of Tyr98, twisting the side chain of the residue into the active state position. The repositioning of Tyr98 forces Thr78 to shift inward also into an active state position. This mechanism of a geometric switch is in the reverse order to the normally proposed activation mechanism, where the threonine residue shifts in response to phosphorylation followed by a shift in the orientation of the tyrosine residue.
In other active receiver domains, it has been noted that the tyrosine, in its active position, coordinates through the side chain's hydroxyl group to a carbonyl oxygen atom of an arginine or lysine residue. This coordination is proposed to stabilize the active site (29). In RegX3, Tyr98 is oriented so that the side-chain hydroxyl group points toward the carbonyl oxygen atom of Arg81 from the dimeric partner, located toward the N terminus of
In the inactivated full-length RR structures, PrrA, MtrA, and DrrB, the switch tyrosine is oriented away from the receiver domain in an inactive state and forms hydrogen bonds with the effector domain. In the "closed" models (PrrA and MtrA), these bonds occur with the loop region between
Interdomain Interface—All of the current structures of inactive full-length OmpR/PhoB members exhibit different inter-domain interfaces formed by some of the secondary structure elements
The most noticeable difference between RegX3 and the other "open" inactive RRs is the orientation of the effector domain. Although all of them have an accessible The Inactive Dimer—Inactive dimers of OmpR/PhoB response regulators have been reported in the past. Although their function remains largely unknown, it is assumed that they serve an inhibitory function. The dimer in solution was detected by SAXS at a high concentration (15.3 mg·ml-1), suggesting that it is a low affinity dimer. Although this concentration may appear unrealistic inside a cell, it may be influenced through intracellular macromolecular crowding. As suggested by Bachhawat et al. (29), the inactive dimer may play a role in regulation. Initially, RegX3 exists as an inactive monomer. When activated, it autoregulates itself, and expression increases until there is an abundance of RegX3 in vivo, whereupon it dimerizes, allowing other regulatory functions. It is possible that when the stress signal (possibly low environmental phosphate concentration (11)) that the SenX3-RegX3 system is responding to subsides and the active kinase is no longer present, the active RegX3 dimers, which are at high concentration, switch to an inactive dimeric form, thus providing a quick attenuation of the response. The formation of an inactive dimer may inhibit the RR activities in many ways, such as disruption of DNA binding by orientating the effector domains in opposite directions. The dimer may also serve to disrupt or enhance the interaction interface with the SK, or the new dimer may facilitate rapid dephosphorylation by further exposing the active site containing the unstable phosphoaspartate. We also tried to fit the experimental scattering data from the dimeric species by keeping the three-dimensional domain-swapped interface and allowing for the rigid body movements of the peripheral portions of the effector domain. This modeling was able to give extended trans-shapes, which could almost fit the experimental data as well as the inactive PhoB-like dimer shown in Fig. 8B. The latter dimer, however, seems more likely, since the monomer dimer equilibrium is reversible. Conclusions—Based upon the position of the switch residues and the dimerization interface, we believe that the RegX3 structure represents the RR in an active state. The cis-dimer seen in the crystal structure is the first full-length OmpR/PhoB member RR to be crystallized as a dimer in an apparently active state. The RR dimer formed by three-dimensional domain swapping retains a complete active site as is necessary for phosphorylation. This structure provides the first detailed analysis of the interdomain interface for an active dimeric RR. However, SAXS measurements show that the orientation of the effector domains may not be representative of the dimer in vivo in an inactive state. The SAXS studies revealed that the inactive protein exists as a monomer and a dimer in solution. The inactive trans-dimer formed in solution provides the first structural evidence for a previously hypothesized regulatory mechanism. A structural comparison of the active and inactive RegX3 bound to DNA would provide answers regarding conformational changes of the effector domain and interdomain interface.
The atomic coordinates and structure factors (code 2OQR) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported by the Bundesministerium für Bildung und Forschung (German Science Ministry)-funded X-MTB consortium. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 Present address: Argonne National Laboratory, 9700 S. Cass Ave., Argonne, IL 60439. 2 To whom correspondence should be addressed. Tel.: 49-40-89901129; Fax: 49-40-89902149; E-mail: tucker{at}embl-hamburg.de.
3 The abbreviations used are: TCS, two-component system(s); SK, sensor kinase; RR, response regulator; SAXS, small angle X-ray scattering; MM, molecular mass; wHTH, winged-helix-turn-helix; r.m.s., root mean square.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||