JBC

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M509010200 on October 6, 2005

J. Biol. Chem., Vol. 280, Issue 50, 41530-41536, December 16, 2005
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
280/50/41530    most recent
M509010200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Doucleff, M.
Right arrow Articles by Wemmer, D. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Doucleff, M.
Right arrow Articles by Wemmer, D. E.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

The C-terminal RpoN Domain of {sigma}54 Forms an Unpredicted Helix-Turn-Helix Motif Similar to Domains of {sigma}70*{boxs}

Michaeleen Doucleff{ddagger}§, Lawrence T. Malak{ddagger}, Jeffrey G. Pelton{ddagger}§, and David E. Wemmer{ddagger}§1

From the {ddagger}Physical Biosciences Division, Lawrence Berkeley National Laboratory and the Departments of §Chemistry and Molecular and Cell Biology, University of California, Berkeley, California 94720

Received for publication, August 16, 2005 , and in revised form, October 3, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The "{sigma}" subunit of prokaryotic RNA polymerase allows gene-specific transcription initiation. Two {sigma} families have been identified, {sigma}70 and {sigma}54, which use distinct mechanisms to initiate transcription and share no detectable sequence homology. Although the {sigma}70-type factors have been well characterized structurally by x-ray crystallography, no high resolution structural information is available for the {sigma}54-type factors. Here we present the NMR-derived structure of the C-terminal domain of {sigma}54 from Aquifex aeolicus. This domain (Thr-323 to Gly-389), which contains the highly conserved RpoN box sequence, consists of a poorly structured N-terminal tail followed by a three-helix bundle, which is surprisingly similar to domains of the {sigma}70-type proteins. Residues of the RpoN box, which have previously been shown to be critical for DNA binding, form the second helix of an unpredicted helix-turn-helix motif. The homology of this structure with other DNA-binding proteins, combined with previous biochemical data, suggests how the C-terminal domain of {sigma}54 binds to DNA.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Transcription, the synthesis of RNA from double-stranded DNA, is a fundamental process in all forms of life. The primary protein complex catalyzing transcription is composed of five subunits, {alpha}2{beta}'{beta}{omega}, called "core RNA polymerase" (RNAP).2 Core RNAP is fully competent to synthesize RNA from DNA. However, the initiation of transcription at specific DNA sequences requires additional protein(s). In bacteria, these additional proteins are called the "{sigma} factors" (1, 2). The {sigma} factor facilitates transcription at specific DNA sequences by 1) binding to the core RNAP to form the {sigma}-RNAP holoenzyme, 2) recognizing and binding to a specific DNA sequence next to the transcription start site, called the promoter element, and 3) opening the double-stranded DNA to initiate transcription.

Based on protein sequence homology, {sigma} factors can be grouped into two classes, {sigma}70 and {sigma}54, which were named by the molecular weights of the first members identified (reviewed in Ref. 3). The {sigma}70-type factors are the most abundant in bacteria (reviewed in Refs. 4, 5). They include the primary {sigma} factors, such as {sigma}70 from Escherichia coli and {sigma}A from Gram-positive bacteria, which regulate transcription for most genes required for normal exponential growth. Other members of the {sigma}70 family ({sigma}28, {sigma}32, etc.) regulate the transcription of more specialized genes that are required to respond to environmental changes. For example, {sigma}28 from Salmonella typhimurium controls expression of genes required for flagellar assembly (6).

The {sigma}54-type factor, which is also called {sigma}N and encoded by the rpoN gene, has no detectable sequence homology to {sigma}70-type factors. Its occurrence is widespread among bacteria, but there is usually only one {sigma}54 gene present in a particular organism (7). {sigma}54 regulates gene transcription for proteins required for many important cellular functions, such as nitrogen metabolism, development, phage shock responses, and pathogenicity. For example, {sigma}54 has been shown to be required for mammalian infection by Borrelia burgdorferi, the agent of Lyme disease (8).

The most striking difference between {sigma}54 and {sigma}70 factors is how they regulate transcription initiation. Once {sigma}70-RNAP holoenzyme binds to the promoter of a gene (at a conserved sequence -10 and -35 base pairs upstream of the transcription start site), it can open DNA and transcription can begin spontaneously. In contrast, when {sigma}54-RNAP holoenzyme binds to a promoter (at a consensus sequence -12 and -24 base pairs upstream of the transcription start site), it remains in the closed, inactive state. To open the double-stranded DNA and initiate transcription, {sigma}54-RNAP holoenzyme requires interaction with an activator protein (reviewed in Ref. 9), which binds ~150 base pairs upstream of the promoter (10). Once stimulated by a physiological signal, such as phosphorylation or ligand binding, the activator uses the energy of ATP hydrolysis (via a conserved AAA+-ATPase domain) to remodel the {sigma}54-RNAP promoter complex into an active conformation that is capable of initiating transcription (7).

The {sigma}70-type factors have been well characterized structurally (11-14). X-ray crystal structures of individual domains of {sigma}70 and {sigma}70-RNAP holoenzyme show that {sigma}70-type factors are composed of four helical domains ({sigma}1, {sigma}2, {sigma}3, {sigma}4) connected by flexible linkers (15, 16). These four domains are also functionally distinct: {sigma}1 is involved in regulating the kinetics of transcription initiation; {sigma}2 binds to the -10 promoter element and is essential for melting the DNA; {sigma}3 stabilizes the open complex formation by binding the extended -10 element; and {sigma}4 binds specifically to the -35 promoter element (reviewed in Ref. 17).

In contrast to {sigma}70, {sigma}54 is divided into three regions based on function (18-20). Region 1 (E. coli-(1-55)) interacts with the upstream activator protein to control promoter melting; region 2 (E. coli-(70-304)) contains the minimal RNAP-binding domain (E. coli-(70-180) and a part that enhances DNA binding affinity (E. coli-(180-304)); region 3 (E. coli-(329-477)) recognizes and binds the consensus promoter elements. This C-terminal part of the protein contains a region that cross-links to DNA (E. coli-(329-346)), a predicted helix-turn-helix motif (E. coli-(366-386); Fig. 1), and a highly conserved sequence (E. coli-(454-463); Fig. 1) called the RpoN box. The RpoN box (ARRTVAKYRE) is the signature sequence for {sigma}54 proteins and is critical for DNA binding (21).



View larger version (54K):
[in this window]
[in a new window]
 
FIGURE 1.
Amino acid sequence alignment for the C-terminal end of the {sigma}54 protein from eleven different species. Residues conserved in all proteins are highlighted in purple, residues conserved in ≥50% of the species are highlighted in blue, and chemically similar residues are highlighted in magenta. The consensus sequence is shown at the bottom. The highly conserved RpoN box motif is framed in black. The positions of the five PCR primers (three N-terminal primers and two C-terminal primers) used to create six unique protein constructs of this region are shown as black arrows. Only one of the six fragments expressed in E. coli with high levels of soluble protein. This fragment, shown with ** above the primers, was used for further structural studies.

 
Although information about the overall shape of {sigma}54 has come from scattering and electron microscopy data (22, 23), no high resolution structural information is available for {sigma}54. The lack of sequence homology between {sigma}54 and {sigma}70 prevents homology modeling of {sigma}54 domains based on the structures of {sigma}70 domains. Therefore, to understand the mechanism of {sigma}54-dependent transcription initiation, high resolution structures of domains are required.

Here we present the NMR-derived structure of a C-terminal fragment of {sigma}54 (residues Thr-323 to Gly-389) from the hyperthermophilic bacterium, Aquifex aeolicus. This domain, which includes the signature RpoN box motif, consists of a poorly structured N-terminal tail followed by an unpredicted helix-turn-helix motif. Surprisingly, this domain is structurally similar to the {sigma}3 and {sigma}4 domains of {sigma}70-type proteins, despite their low sequence homology. Structural homology with other DNA-binding proteins suggests how this protein may bind the -24 promoter element. Because of the high sequence conservation of this region among {sigma}54 proteins (Fig. 1), it is likely that this domain has a similar fold in all species, including the most highly studied {sigma}54 proteins from E. coli and Klebsiella pneumonia.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Sequence Analyses{sigma}54 protein sequences from a variety of species were downloaded from the National Center for Biotechnology Information web site (www.ncbi.nlm.nih.gov/) and then aligned using the ClustalW (24) web server at the European Bioinformatics Institute (www.ebi.ac.uk/clustalw/). Secondary structure prediction for the A. aeolicus {sigma}54 protein sequence was performed using the neural network Jnet (25) via the Jpred web server (www.compbio.dundee.ac.uk/~www-jpred/) (26).

Plasmid Construction—Six unique constructs of the C-terminal region of {sigma}54 from A. aeolicus were cloned by PCR with primers (Operon) containing NdeI and BamHI restriction sites at the 5' and 3'-ends, respectively. The fragments were amplified from a plasmid containing the full-length {sigma}54 gene (courteously provided by Gongyi Zhang from the National Jewish Medical Center, Denver, CO). The amplified fragments were digested with NdeI and BamHI, purified by gel electrophoresis, and ligated into the pET21a expression vector.

Protein Preparation{sigma}54 C protein was expressed using E. coli BL21 (DE3) with Rosetta.pLysS plasmid. Cells were first grown at 37 °C in two liters of Luria-Bertani medium. At A600 nm ~0.6, cells were exchanged (27) into one liter of M9 minimal medium containing 1 g of 15N-NH4Cl and/or 2 g of 13C6-glucose (Cambridge Isotope Laboratory). Four hours after induction with isopropyl-1-thio-{beta}-D-galactopyranoside, cells were harvested by centrifugation and resuspended into 50 mM Hepes (pH 6.9), 50 mM NaCl, 1 mM EDTA, and 0.1 mM phenylmethylsulfonyl fluoride. {sigma}54 C protein was purified by ion exchange chromatography (heparin; Amersham Biosciences) and by size exclusion chromatography (Superdex 75; Amersham Biosciences). For NMR spectroscopy, {sigma}54C protein was concentrated to 0.8 mM in 50 mM Hepes (pH 6.9), 250 mM NaCl, 1 mM EDTA, 10% 2H2O, and 1x Complete Protease Inhibitor mixture (Roche Applied Science).

Resonance Assignments—NMR data were collected at 298 K on a Bruker DRX 600 MHz. All data were processed with NMRPipe (28) and analyzed with NMRView (29). Small amounts of proteolysis were detected by new peaks in the 15N-HSQC spectrum after 5 days at 298 K. Therefore, a fresh protein sample was used for each three-dimensional spectrum.



View larger version (28K):
[in this window]
[in a new window]
 
FIGURE 2.
A, stereoview of the backbone (N, CA, C') of 20 superimposed NMR structures with the lowest restraint violations for the C-terminal fragment (Thr-323 to Gly-389) of A. aeolicus {sigma}54. Only the well structured region, Ala-335 to Gly-389, (E. coli-(407-466)) is shown. Helix 3 that is created by the RpoN box is in the front in dark blue. B, protein sequence alignment of the A. aeolicus and E. coli {sigma}54C fragment. Conserved residues are highlighted in blue; the highly conserved RpoN box is underlined in red. {alpha}-Helix secondary structure is indicated above in red. Below the sequence alignment (from top to bottom) are the chemical shift index analysis of the CA atoms; chemical shift index analysis of the HA atoms; schematic representation of the sequential i, i+1 NOE connectivities involving NH atoms; schematic representation of the medium-range i, i+3 NOE connectivities involving HA, HN atoms, respectively; and relative peak intensities in the 1H-15N HSQC spectrum. For the NOE plots, the bar is on the residue containing atom i and the height of the bar is inversely proportional to the maximum distance between the atoms.

 
Backbone assignments were obtained using the three-dimensional triple resonance experiments HNCA, CBCA(CO)NH, and 15N-NOESY-HSQC. Side chain assignments were obtained from the three-dimensional experiments 15N-TOCSY-HSQC, C(CO)NH, and HCCH-TOCSY, as well as the two-dimensional experiment proton DQF-COSY for the aromatic proton assignments.

Distance Restraints—Backbone dihedral restraints were obtained from the backbone chemical shifts (HN, HA, CA, and CB) using the program TALOS (30). 3JHNHA-coupling constants were obtained from an HNHA spectrum (31). Nuclear Overhauser effect (NOE) distance constraints were derived by manual assignment of cross-peaks in a three-dimensional 15N-NOESY-HSQC and 13C-NOESY-HSQC, both with 100-ms mixing times. Hydrogen bond restraints were defined from slowly exchanging amide protons. The protein was exchanged into 95% D2O using an Amicon Ultra centrifugal concentrator (5-kDa cutoff; Millipore). Amide protons with strong cross-peak intensities in the 15N-HSQC after 3 h at 298 K were identified as significantly protected. Stereo-specific assignments of the methyl groups of valine and leucine residues were obtained from a 13C-HMQC spectrum of a 10% 13C-labeled protein sample as described in Ref. 32. Only residues with totally unambiguous methyl assignments were used for stereo-specific assignments.

Structure Calculations—Structure calculations were performed with the program DYANA (33). Initial structures were calculated using only unambiguous NOESY peak assignments. During the later stages of assignment (backbone r.m.s.d. <1.5 for residues 21-67), the structure with the lowest energy was used to filter possible assignments based on distances. The program MOLMOL (34) was used to analyze molecules throughout the assignment process. Structure statistics, including PROCHECK analysis (35), are summarized in TABLE ONE.


View this table:
[in this window]
[in a new window]
 
TABLE ONE
Experimental restraints and structural statistics for the C-terminal domain of {sigma}54

 
Miscellaneous—All figures were made with Pymol (www.pymol.org). The sequence alignments were plotted with TexShade (36).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Sequence Analysis and Protein Construct Design—To help identify a subdomain of the {sigma}54 protein suitable for structure determination by NMR, the A. aeolicus {sigma}54 protein sequence was aligned with 38 other {sigma}54 proteins using the program ClustalW (24). This alignment, which includes {sigma}54 proteins from 11 of the most studied species, is shown in Fig. 1. Visual examination of the sequence alignment indicated that the C-terminal end (~288-398 in A. aeolicus and 360-477 in E. coli) was the most conserved region. This fragment of {sigma}54 has been shown to be sufficient for binding to double-stranded DNA (19).

Using the secondary structure prediction of the A. aeolicus {sigma}54 protein as a guide, six different constructs of this region were cloned and expressed in E. coli. Constructs were created using PCR with three different N-terminal primers and two different C-terminal primers, shown in Fig. 1. Only one of the six fragments expressed in E. coli with high levels of soluble protein. This fragment, shown with ** above the primers in Figs. 1 and 2B, starts at Thr-323, just upstream of a pair of conserved phenylalanines (329, 330) and ends at Gly-389, just downstream of the highly conserved RpoN box motif (377-386; boxed in Fig. 1). For this part of {sigma}54, the A. aeolicus protein sequence is 36% identical to the E. coli protein sequence. Here, we present the solution structure of this 67-residue (7.9 kDa) fragment, which we refer to as "{sigma}54C." As shown in Fig. 1, this fragment does not include the region predicted to form a helix-turn-helix motif, residues 292-313 in A. aeolicus and 366-386 in E. coli (gray box above the sequence alignment in Fig. 1).

Structure Determination—The {sigma}54C fragment gave an excellent 1H-15N HSQC spectrum with highly dispersed cross-peaks (supplemental Fig. S1), indicating that the majority of the protein fragment was stably folded. The 1H-15N HSQC spectrum contained cross-peaks for all residues except for the first three N-terminal residues (Thr-323, Tyr-324, Ser-325) and the single proline (Pro-359). 1H, 15N, and 13C resonance assignments for the {sigma}54C domain were made using standard three-dimensional NMR techniques (37). In total, the chemical shifts for 97% of the backbone atoms (N, NH, CA, HA) and 94% of side chain atoms (aliphatic/aromatic C and H) were assigned.

Even before we started calculating structures, three properties of the NMR data indicated that the N-terminal portion of {sigma}54C from Leu-326 to Thr-339 was poorly structured and dynamic on a time scale of micro-seconds to seconds. First, in all two- and three-dimensional spectra, diagonal and cross-peak intensities for atoms in this N-terminal region were significantly weaker than for atoms in other parts of the protein. For example, in the 1H-15N HSQC the average relative peak height for residues Leu-326 to Thr-339 is 24% less than the average intensity for residues Gln-340 to Gly-389 (Fig. 2B, last row). Second, chemical shift index (38) for this region indicated little secondary structure beyond Leu-338 (Fig. 2B, first two rows). Third, only a small number of inter-residue NOE cross-peaks were present in the three-dimensional 13C-NOESY-HSQC for this region of the protein.

Nine hundred fifty-six unique distance restraints (TABLE ONE) were obtained from the three-dimensional 15N-NOESY-HSQC and 13C-NOESY-HSQC spectra. An additional 108 {varphi}/{phi} torsion angle restraints were calculated from HN, HA, CA, and CB chemical shift values (30) and from 3JHNHA-coupling constants (31) (TABLE ONE). These 1064 restraints (16 restraints/residue) were used to calculate 150 structures. The 20 structures with the lowest energy were chosen to represent the structure of the C-terminal domain of the {sigma}54 protein from A. aeolicus (Fig. 2A).

Structure of the C-terminal Domain of {sigma}54—For the well structured region (Gln-340 to Leu-388) of {sigma}54C, the superposition of the final 20 structures has an r.m.s. deviation from the mean structure of 0.56 ± 0.07 Å for the backbone atoms and 1.24 ± 0.09 Å for all heavy atoms. Overall these structures have good statistics and geometry, as summarized in TABLE ONE. This C-terminal portion of {sigma}54C folds into a compact domain consisting of three {alpha}-helices (Fig. 2). Helix 1 (Gln-340 to Asn-353) is followed by a long loop, which starts at the conserved Glu-354 and ends at the conserved Ser-361. Helix 2 (Asp-362 to Leu-369) and helix 3 (Arg-379 to Leu-388), which includes the highly conserved RpoN box sequence, form a helix-turn-helix motif that is commonly found in many DNA-binding proteins (reviewed in Ref. 39).

A color-coded image representing the electrostatic surface of {sigma}54C (residues Gln-340 to Leu-388) shows that the molecule has a striking polarity. It has a positively charged face primarily composed of the loop in the helix-turn-helix motif (loop 2) and the top of helix 3 (Fig. 3C, right). The other side of the molecule is negatively charged, due to negative residues in loop 1 and on helix 1 (supplemental Fig. S2).

A search for proteins structurally homologous to {sigma}54C (residues Gln-340 to Leu-388) using the DALI server identified many DNA-binding proteins. Residues 77 to 127 of the human PAX6 homeodomain (PDB accession number 6PAX; Ref. 40) had the highest Z score of 5.2 (r.m.s.d 2.1 Å). The N-terminal DNA-binding domain of the leucine-responsive regulatory protein, LrpA, from Pyrococcus furiosus (PDB accession number 1I1G [PDB] ; Ref. 41) had the second highest Z score of 4.7 (r.m.s.d. 1.9 Å). The DALI results are summarized in supplemental Table S1 (52-54).

Interestingly, the {sigma}54C domain has structural homology to both {sigma}3 (regions 3.0-3.1) and {sigma}4 (regions 4.1-4.2) of {sigma}70-type factors. {sigma}3 of {sigma}A (residues 279 to 326) from Thermus aquaticus (PDB accession number 1KU2 [PDB] ; Ref. 15) had the tenth highest Z score of 4.1. {sigma}4 of {sigma}28 (residues 186-224) from A. aeolicus (PDB accession number 1RP3; Ref. 14) had a lower Z score of 3.8 but aligns slightly better with the {sigma}54C structure (r.m.s.d. 1.9 Å) than the {sigma}3 fragment of {sigma}A (r.m.s. deviation of 2.5 Å). Structural alignments with {sigma}4 and {sigma}3 are shown in Fig. 3B, left and right, respectively.

As described above, the N-terminal segment from Ser-325 to Thr-339 is poorly structured and probably dynamic on a microsecond-to-second time scale. This type of dynamics causes line broadening, thus making less structural information available for this region. The poor chemical shift dispersion for backbone and side chain atoms (Fig. 2B, top two rows) and the low number of medium- and long-range NOE cross-peaks available made it possible to assign only ten long-range NOE cross-peaks available for this region. These cross-peaks indicate that Ala-335 and Leu-338 interact with the N-terminal end of helix 1 primarily via Leu-343 and with the loop between helices 2 and 3 primarily via Phe-374. For example, the aliphatic hydrogens on Leu-338 interact with the amide and aliphatic hydrogens on Leu-343 and with the aromatic hydrogens on Phe-374.



View larger version (48K):
[in this window]
[in a new window]
 
FIGURE 3.
A, ribbon diagram of the {sigma}54C domain (residues Ala-335 to Gly-389). Side chains for residues that were previously shown to be near the -24 promoter element (50) and/or to decrease significantly DNA binding when mutated (48) are represented as sticks and labeled (except for Tyr-384, which is hidden behind Lys-383). All of these residues are on helix 3 (dark blue) created by the highly conserved RpoN box, except for Met-343 (Arg-421 in E. coli), which is on helix 1. B, superposition of the CA backbones of the {sigma}54C fragment (blue) with the {sigma}4 domain of the {sigma}70-type protein {sigma}28 (magenta; residues 175-234; PDB accession number 1RP3) (left) and the {sigma}3 domain of the {sigma}70-type protein {sigma}A (green; residues 279-326; PDB accession number 1KU2 [PDB] ) (right). In both superpositions, the beginning and the end of the RpoN box on {sigma}54 are shown as blue spheres. C, surface electrostatic potentials (blue, positive; red, negative) of {sigma}54C domain (left), {sigma}4 domain of the {sigma}70-type protein {sigma}28 (middle), and {sigma}3 domain of the {sigma}70-type protein {sigma}A (right). Highly conserved residues are labeled on {sigma}54C. In all images, the proteins are oriented as in panel A).

 
Structural Role for Conserved Residues—The hydrophobic core connecting the three helices is composed of highly conserved residues. For example, Ile-347 in helix 1 makes van der Waal interactions with Ile-365 in helix 2 and Val-381 in helix 3. All three of these residues are isoleucine, leucine, or valine in all 38 {sigma}54 protein sequences examined (11 of these sequences are shown in Fig. 1). Other residues making crucial van der Waals interactions between the helices are Leu-343, Met-344, and Ile-350 in helix 1, Ala-366, Ile-368, and Leu-369 in helix 2, and Tyr-384 and Leu-388 in helix 3.

Many of the residues creating the charge on the protein surface are conserved among {sigma}54 proteins. The highly conserved RpoN box (Ala-378 to Glu-386) forms the majority of helix 3, the second helix in the helix-turn-helix motif. This helix is amphipathic, with the hydrophobic face packing against the hydrophobic core described above. The hydrophilic side is surface exposed. This helix is primarily positively charged due to the side chains Arg-378, Arg-379, Lys-383, and Arg-385, but the C-terminal part of the helix has negative charge due to the side chain of Glu-386 (Fig. 3C, right).


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
When fragments of {sigma}70 were crystallized, they were found to be small helical domains (15). The subsequent structure of {sigma}70 (regions 2-4) in complex with core RNA polymerase showed that these domains are modular, connected by flexible linkers, which interact with other parts of polymerase and DNA (16, 42). Sequence analysis and biochemical experiments suggest that {sigma}54 may also be comprised of such domains (43, 44). Here we present the first structure of a {sigma}54 domain, the C-terminal segment of the protein, which is critical for DNA binding and contains the signature "RpoN box" sequence (Fig. 1).

Residues Gln-340 to Leu-388 of A. aeolicus {sigma}54 form a compact, three-helix domain that includes a helix-turn-helix motif. A search of the Protein Data Bank using the DALI server found that this region of {sigma}54 is structurally similar to many DNA-binding domains (supplemental Table S2). Of the ten structures with the highest Z scores, seven are known to interact with DNA. For two of these domains, the C-terminal part of the PAX6 homeodomain (PDB accession number 6PAX; Ref. 40) and the N-terminal domain of the methicillin repressor protein (PDB accession number 1SAX; Ref. 45), the structures of the protein-DNA complexes are known (40, 46). These structures show that the proteins interact primarily with the major groove of DNA via the second helix of the helix-turn-helix motif, termed the "recognition helix." The equivalent helix in the {sigma}54C domain is created by residues of the highly conserved RpoN box motif (helix 3; Figs. 2B and 3A). Thus, residues in the RpoN box are structurally poised to interact with the major groove of the {sigma}54 promoter element.

Consistent with interpreting structure of the {sigma}54C domain as a DNA-binding domain, previous footprinting and gel shift assays have shown that the region near the RpoN box is critical for {sigma}54 promoter recognition and DNA binding (19, 47, 48). Mutations in the RpoN box sequence are far more damaging to DNA binding than mutations in other parts of E. coli {sigma}54 (21, 48). All five mutations made in the RpoN box sequence had reduced binding to both duplex and fork junction promoter DNA, with four mutant proteins (K. pneumoniae R455A, R456A, Y461A, and R462A) showing ≤20% DNA binding of wild-type {sigma}54 protein. In addition, these four mutant proteins had dramatically reduced transcription activity in vitro with ≤30% the activity of the wild-type protein. Thus, these residues on the RpoN box are important for DNA binding and {sigma}54-dependent transcription. The corresponding residues in A. aeolicus {sigma}54 are shown as sticks in Fig. 3A. Four of these residues (Arg-378, Arg-379, Lys-383, and Arg-385) lie on the hydrophilic face of the recognition helix, suggesting that this face of the helix interacts directly with DNA, as discussed in more detail later in this section.

Additional support for the role of this domain in DNA binding comes from the work of Cannon et al. (47). They found that removing the last 53 residues of K. pneumoniae {sigma}54 (Lys-425 to Val-477; A. aeolicus Lys-346 to Ile-398) eliminated DNase I and 1,10-phenanthroline-copper footprints of {sigma}54 and reduced {sigma}54 holoenzyme binding to 5-bromouracil substituted DNA by 85%. This truncation cuts the protein in the middle of helix 1 of the {sigma}54 C domain, removing loop 2 and the helix-turn-helix motif.

Overall, the C-terminal domain of {sigma}54 is more structurally similar to domain {sigma}3 than to domain {sigma}4 of {sigma}70. {sigma}3 and the {sigma}54C are about the same size (55 and 59 residues, respectively), and all three helices are about the same length (Fig. 3B, right). On the other hand, {sigma}4 has an extra helix at the N-terminal end, and the recognition helix is twice as long as the corresponding helix in the {sigma}54C domain (Fig. 3B, left). In terms of surface charge distribution, however, the {sigma}54C is more similar to {sigma}4 than to {sigma}3 (Fig. 3C). Both {sigma}4 and {sigma}54C have a positively charged surface created by side chains on the loop and the recognition helix of the helix-turn-helix motif (Fig. 3C, left and middle). For {sigma}4, this surface interacts specifically with the major groove of the -35 promoter element (13, 15). For {sigma}54C, this surface is created by the residues of the conserved RpoN box, which probably interact with the -24 promoter element.

Previous biochemical studies suggest that the C-terminal domain of {sigma}54 is more functionally analogous to the {sigma}4 domain than to the {sigma}3 domain of {sigma}70-type factors. {sigma}3 of {sigma}70 proteins interacts with the extended -10 promoter element near the site of melting (42, 49), whereas the major role of {sigma}4 of {sigma}70 proteins is to bind to the -35 promoter element (13, 15). By tethering a DNA cleavage reagent, FeBABE (p-bromoacetamidobenzyl-EDTA-Fe), to charged residues in the RpoN box of K. pneumoniae {sigma}54, Burrows et al. (50) showed that residues in the RpoN box are near the -24 promoter element, not the -12 promoter element where melting occurs. For example, with the FeBABE tethered to residue E463C (A. aeolicus Glu-386), strong cleavage was seen between the -30 and -25 nucleotides of the template DNA strand.

The N-terminal end of {sigma}54C domain, residues Ser-325 to Thr-339, is not well ordered and probably dynamic on the microsecond-to-second time scale. These residues could be a part of a linker region that connects two independently folded domains of {sigma}54, as seen with {sigma}70-type factors (15, 16). Previous limited proteolysis experiments on the full-length {sigma}54 did not identify this segment as proteolytically sensitive (51). Thus, additional proteolysis studies and NMR analyses on larger fragments of the A. aeolicus {sigma}54 are required to determine whether this region (Ser-325 to Thr-339) is still poorly structured in the presence of the other domains and RNAP.

The C-terminal domain of {sigma}54 presented in this report provides an interesting structural connection between {sigma}54 and {sigma}70. This structural similarity of {sigma}54C to domains of {sigma}70-type sigma factors is surprising (Fig. 3B) because these regions have poor sequence homology. Residues Thr-323 to Gly-389 of {sigma}54C have 14% identity, 25% similarity with residues 186-326 of region 4 of {sigma}A from T. aquaticus and 15% identity, 26% similarity with residues 279-326 of region 3 of {sigma}28 from A. aeolicus. A low resolution structure of the full-length {sigma}54 protein suggests that other domains of {sigma}54 may also be more similar to domains of {sigma}70 than previously predicted by only sequence homology (22). High resolution structures of these other domains are needed to confirm this hypothesis.


    FOOTNOTES
 
The atomic coordinates and structure factors (code 2AHQ) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

* This work was supported by National Institutes of Health Grant GM62163 (to D. E. W.) and by a University of California fellowship (to M. D.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

{boxs} The on-line version of this article (available at http://www.jbc.org) contains supplemental Figs. S1 and S2 and Table S1. Back

1 To whom correspondence should be addressed: Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, CA 94720. Tel.: 510-486-4318; Fax: 510-486-6059; E-mail: DEWemmer{at}lbl.gov.

2 The abbreviations used are: RNAP, RNA polymerase; NOE, nuclear Overhauser effect; NOESY, NOE spectroscopy; TOCSY, total correlation spectroscopy; HSQC, hetero-nuclear single quantum coherence spectroscopy; PDB, Protein Data Bank; r.m.s.d., root mean square deviation. Back


    ACKNOWLEDGMENTS
 
We thank Prof. Sydney Kustu for advice and Steven M. Damo for computer assistance.



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Burgess, R. R., Travers, A. A., Dunn, J. J., and Bautz, E. K. (1969) Nature 221, 43-46[CrossRef][Medline] [Order article via Infotrieve]
  2. Hirschman, J., Wong, P. K., Sei, K., Keener, J., and Kustu, S. (1985) Proc. Natl. Acad. Sci. U. S. A. 82, 7525-7529[Abstract/Free Full Text]
  3. Wosten, M. M. (1998) FEMS Microbiol. Rev. 22, 127-150[CrossRef][Medline] [Order article via Infotrieve]
  4. Lonetto, M., Gribskov, M., and Gross, C. A. (1992) J. Bacteriol. 174, 3843-3849[Free Full Text]
  5. Paget, M. S., and Helmann, J. D. (2003) Genome Biol. 4, 203[CrossRef][Medline] [Order article via Infotrieve]
  6. Chilcott, G. S., and Hughes, K. T. (2000) Microbiol. Mol. Biol. Rev. 64, 694-708[Abstract/Free Full Text]
  7. Buck, M., Gallegos, M. T., Studholme, D. J., Guo, Y., and Gralla, J. D. (2000) J. Bacteriol. 182, 4129-4136[Free Full Text]
  8. Fisher, M. A., Grimm, D., Henion, A. K., Elias, A. F., Stewart, P. E., Rosa, P. A., and Gherardini, F. C. (2005) Proc. Natl. Acad. Sci. U. S. A. 102, 5162-5167[Abstract/Free Full Text]
  9. Xu, H., and Hoover, T. R. (2001) Curr. Opin. Microbiol. 4, 138-144[CrossRef][Medline] [Order article via Infotrieve]
  10. Wedel, A., Weiss, D. S., Popham, D., Droge, P., and Kustu, S. (1990) Science 248, 486-490[Abstract/Free Full Text]
  11. Malhotra, A., Severinova, E., and Darst, S. A. (1996) Cell 87, 127-136[CrossRef][Medline] [Order article via Infotrieve]
  12. Campbell, E. A., Tupy, J. L., Gruber, T. M., Wang, S., Sharp, M. M., Gross, C. A., and Darst, S. A. (2003) Mol. Cell 11, 1067-1078[CrossRef][Medline] [Order article via Infotrieve]
  13. Jain, D., Nickels, B. E., Sun, L., Hochschild, A., and Darst, S. A. (2004) Mol. Cell 13, 45-53[CrossRef][Medline] [Order article via Infotrieve]
  14. Sorenson, M. K., Ray, S. S., and Darst, S. A. (2004) Mol. Cell 14, 127-138[CrossRef][Medline] [Order article via Infotrieve]
  15. Campbell, E. A., Muzzin, O., Chlenov, M., Sun, J. L., Olson, C. A., Weinman, O., Trester- Zedlitz, M. L., and Darst, S. A. (2002) Mol. Cell 9, 527-539[CrossRef][Medline] [Order article via Infotrieve]
  16. Murakami, K. S., Masuda, S., and Darst, S. A. (2002) Science 296, 1280-1284[Abstract/Free Full Text]
  17. Murakami, K. S., and Darst, S. A. (2003) Curr. Opin. Struct. Biol. 13, 31-39[CrossRef][Medline] [Order article via Infotrieve]
  18. Wong, C., Tintut, Y., and Gralla, J. D. (1994) J. Mol. Biol. 236, 81-90[CrossRef][Medline] [Order article via Infotrieve]
  19. Cannon, W. V., Chaney, M. K., Wang, X., and Buck, M. (1997) Proc. Natl. Acad. Sci. U. S. A. 94, 5006-5011[Abstract/Free Full Text]
  20. Bordes, P., Wigneshweraraj, S. R., Schumacher, J., Zhang, X., Chaney, M., and Buck, M. (2003) Proc. Natl. Acad. Sci. U. S. A. 100, 2278-2283[Abstract/Free Full Text]
  21. Taylor, M., Butler, R., Chambers, S., Casimiro, M., Badii, F., and Merrick, M. (1996) Mol. Microbiol. 22, 1045-1054[CrossRef][Medline] [Order article via Infotrieve]
  22. Svergun, D. I., Malfois, M., Koch, M. H., Wigneshweraraj, S. R., and Buck, M. (2000) J. Biol. Chem. 275, 4210-4214[Abstract/Free Full Text]
  23. Rappas, M., Schumacher, J., Beuron, F., Niwa, H., Bordes, P., Wigneshweraraj, S., Keetch, C. A., Robinson, C. V., Buck, M., and Zhang, X. (2005) Science 307, 1972-1975[Abstract/Free Full Text]
  24. Thompson, J. D., Higgins, D. G., and Gibson, T. J. (1994) Nucleic Acids Res. 22, 4673-4680[Abstract/Free Full Text]
  25. Cuff, J. A., and Barton, G. J. (2000) Proteins 40, 502-511[CrossRef][Medline] [Order article via Infotrieve]
  26. Cuff, J. A., Clamp, M. E., Siddiqui, A. S., Finlay, M., and Barton, G. J. (1998) Bioinformatics 14, 892-893[Abstract/Free Full Text]
  27. Marley, J., Lu, M., and Bracken, C. (2001) J. Biomol. NMR 20, 71-75[CrossRef][Medline] [Order article via Infotrieve]
  28. Delaglio, F., Grzesiek, S., Vuister, G. W., Zhu, G., Pfeifer, J., and Bax, A. (1995) J. Biomol. NMR 6, 277-293[Medline] [Order article via Infotrieve]
  29. Johnson, B. A., and Blevins, B. A. (1994) J. Biomol. NMR 4
  30. Cornilescu, G., Delaglio, F., and Bax, A. (1999) J. Biomol. NMR 13, 289-302[CrossRef][Medline] [Order article via Infotrieve]
  31. Kuboniwa, H., Grzesiek, S., Delaglio, F., and Bax, A. (1994) J. Biomol. NMR 4, 871-878[CrossRef][Medline] [Order article via Infotrieve]
  32. Neri, D., Szyperski, T., Otting, G., Senn, H., and Wuthrich, K. (1989) Biochemistry 28, 7510-7516[CrossRef][Medline] [Order article via Infotrieve]
  33. Guntert, P., Mumenthaler, C., and Wuthrich, K. (1997) J. Mol. Biol. 273, 283-298[CrossRef][Medline] [Order article via Infotrieve]
  34. Koradi, R., Billeter, M., and Wuthrich, K. (1996) J. Mol. Graph 14, 51-55, 29-32
  35. Laskowski, R. A., MacArthur, M. W., Moss, D. S., and Thornton, J. M. (1993) J. Appl. Crystallogr. 26, 283-291[CrossRef]
  36. Beitz, E. (2000) Bioinformatics 16, 135-139[Abstract/Free Full Text]
  37. Cavanagh, J., Wayne, J. F., Palmer, A. G., and Nicholas, J. S. (1996) Protein NMR Spectroscopy: Principles and Practice, Academic Press, San Diego, CA
  38. Spera, S., and Bax, A. (1991) J. Am. Chem. Soc. 113, 5490-5492[CrossRef]
  39. Wintjens, R., and Rooman, M. (1996) J. Mol. Biol. 262, 294-313[CrossRef][Medline] [Order article via Infotrieve]
  40. Xu, H. E., Rould, M. A., Xu, W., Epstein, J. A., Maas, R. L., and Pabo, C. O. (1999) Genes Dev. 13, 1263-1275[Abstract/Free Full Text]
  41. Leonard, P. M., Smits, S. H., Sedelnikova, S. E., Brinkman, A. B., de Vos, W. M., van der Oost, J., Rice, D. W., and Rafferty, J. B. (2001) EMBO J. 20, 990-997[CrossRef][Medline] [Order article via Infotrieve]
  42. Murakami, K. S., Masuda, S., Campbell, E. A., Muzzin, O., and Darst, S. A. (2002) Science 296, 1285-1290[Abstract/Free Full Text]
  43. Sasse-Dwight, S., and Gralla, J. D. (1990) Cell 62, 945-954[CrossRef][Medline] [Order article via Infotrieve]
  44. Missailidis, S., Cannon, W. V., Drake, A., Wang, X. Y., and Buck, M. (1997) Mol. Microbiol. 24, 653-664[CrossRef][Medline] [Order article via Infotrieve]
  45. Garcia-Castellanos, R., Marrero, A., Mallorqui-Fernandez, G., Potempa, J., Coll, M., and Gomis-Ruth, F. X. (2003) J. Biol. Chem. 278, 39897-39905[Abstract/Free Full Text]
  46. Garcia-Castellanos, R., Mallorqui-Fernandez, G., Marrero, A., Potempa, J., Coll, M., and Gomis-Ruth, F. X. (2004) J. Biol. Chem. 279, 17888-17896[Abstract/Free Full Text]
  47. Cannon, W., Missailidis, S., Smith, C., Cottier, A., Austin, S., Moore, M., and Buck, M. (1995) J. Mol. Biol. 248, 781-803[CrossRef][Medline] [Order article via Infotrieve]
  48. Wang, L., and Gralla, J. D. (2001) J. Biol. Chem. 276, 8979-8986[Abstract/Free Full Text]
  49. Voskuil, M. I., and Chambliss, G. H. (2002) J. Mol. Biol. 322, 521-532[CrossRef][Medline] [Order article via Infotrieve]
  50. Burrows, P. C., Severinov, K., Ishihama, A., Buck, M., and Wigneshweraraj, S. R. (2003) J. Biol. Chem. 278, 29728-29743[Abstract/Free Full Text]
  51. Cannon, W., Claverie-Martin, F., Austin, S., and Buck, M. (1994) Mol. Microbiol. 11, 227-236[Medline] [Order article via Infotrieve]
  52. Pohl, E., Haller, J. C., Mijovilovich, A., Meyer-Klaucke, W., Garman, E., and Vasil, M. L. (2003) Mol. Microbiol. 47, 903-915[CrossRef][Medline] [Order article via Infotrieve]
  53. Mer, G., Bochkarev, A., Gupta, R., Bochkareva, E., Frappier, L., Ingles, C. J., Edwards, A. M., and Chazin, W. J. (2000) Cell 103, 449-456[CrossRef][Medline] [Order article via Infotrieve]
  54. Nichols, M. D., DeAngelis, K., Keck, J. L., and Berger, J. M. (1999) EMBO J. 18, 6177-6188[CrossRef][Medline] [Order article via Infotrieve]

Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
J. Biol. Chem.Home page
S. Poggio, A. Osorio, G. Dreyfus, and L. Camarena
Transcriptional Specificity of RpoN1 and RpoN2 Involves Differential Recognition of the Promoter Sequences and Specific Interaction with the Cognate Activator Proteins
J. Biol. Chem., September 15, 2006; 281(37): 27205 - 27215.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
280/50/41530    most recent
M509010200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Doucleff, M.
Right arrow Articles by Wemmer, D. E.