![]()
|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 25, 17400-17409, June 23, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




2
From the
Institute of Molecular Biology, Academia Sinica, Taipei 115,
Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu 300, and ¶Department of Biochemistry, National Cheng Kung University, College of Medicine, Tainan 701, Taiwan
Received for publication, January 17, 2006 , and in revised form, April 12, 2006.
| ABSTRACT |
|---|
|
|
|---|
-helix in place of a typical wing 2 in a member of this family alters the orientation of the C-terminal basic residues (RKRRPR) when binding to DNA outside the core sequence. These results provide a new insight into how the DNA binding specificities of winged helix/forkhead proteins may be regulated by their less conserved regions. | INTRODUCTION |
|---|
|
|
|---|
100 evolutionarily well conserved amino acids essential for DNA recognition (5). Several three-dimensional structures of this DNA binding domain (DBD), including HNF-3
(FoxA3), Genesis (Foxd3), AFX (FOXO4), FREAC11 (FOXC2), and ILF-1 (FOXK1), have been determined using x-ray crystallography or NMR spectroscopy (6-11). Most of these structures exhibit three
-helices, three
-strands, and two wing-like loops (12). The so-called winged helix DNA-binding motif of these proteins is named after the three-dimensional structure of the HNF-3
-DNA complex in which two wing-like loops (wing 1 and wing 2) and helix 3 are involved in protein-DNA interactions (13).
For many DNA-binding protein families, the differences in DNA binding specificity between family members are determined by the contact residues on recognition elements; the substitution of a DNA contact residue within family members leads to different binding specificity (14). However, transcription factors containing the winged helix-binding motif are exceptions (15). The DNA binding domains of Fox proteins exhibit a remarkable degree of sequence homology in their DNA binding region but a notable variability in their DNA recognition specificity (12). To date, more than 200 winged helix/forkhead proteins have been identified, but only two forkhead protein-DNA complexes have been reported (7, 13). Based on these two three-dimensional structures, it has been proposed that the DNA recognition helix (helix 3) is most important in DNA binding specificity. In addition, amino acid variations in the wing regions contribute to the differences in the DNA binding specificity of these proteins (12). Because of the limited number of three-dimensional structures available for these winged helix protein-DNA complexes, however, we have little concrete evidence about the diverse DNA-binding properties of these proteins (12).
Interleukin enhancer binding factors (ILFs) are transcription factors that bind purine-rich regulatory motifs in the human T-cell leukemia virus-long terminal region and the interleukin-2 (IL-2) promoter (16, 17). Previous studies suggest that ILFs bind to the regulatory sequences of the IL-2 promoter and regulate gene expression (17). DNA selection experiments identified a 6-bp motif, 5'-TAAACA-3', as the optimal sequence for ILF binding, but this sequence varies slightly from the target gene (17). Three ILFs of 655 (ILF-1), 609 (ILF-2), and 323 (ILF-3) amino acids have been reported (16-20). There are protein sequence homologies between ILF-1 and ILF-2, including a region for potential ubiquitin-mediated degradation, a nuclear localization site, an N-glycosylation motif, and a DNA binding domain. The DNA binding domains of ILF-1 and ILF-2 are designated as FOXK1a and FOXK1b, respectively (5), and are between residues 251 and 348 of the proteins. They share 35 to 89% similarity with other known members of the winged helix/fork-head family. Hence, ILF-1 and ILF-2 are classified as forkhead family members.
In a previous study (8), we determined the three-dimensional structure of the DNA binding domains of ILF-1 (ILF-DBD) by using multidimensional NMR spectroscopy and affirmed that ILF-1 is a new member of the winged helix/forkhead family. ILF-DBD, however, has a C-terminal
-helix instead of the wing 2 of typical winged helix/fork-head proteins. This C-terminal
-helix may affect the regulation or specificity of DNA binding. In this study, we determined the three-dimensional structure of ILF-DBD complexed with a 16-bp DNA duplex and Mg2+ ions using x-ray crystallography. We have combined structural and biochemical evidence to provide several important insights into the DNA binding specificity of the winged helix/forkhead family proteins.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
CrystallographyILF-DBD was mixed with the oligonucleotides at a 2:1 molar ratio in buffer containing 20 mM HEPES, pH 7.0, 20 mM MgCl2, and 200 mM NaCl. A protein concentration of 5 mg ml-1 was required for protein-DNA complex crystallization. Both native and SeMet-labeled crystals were grown at 4 °C using the hanging drop vapor diffusion method; the complex was mixed with an equal volume of reservoir solution containing 100 mM HEPES, pH 7.2, 200 mM NH4SO4, 20% PEG3350. The crystals belonged to space group P6122 with cell dimensions of a = b = 58.7 Å, and c = 324.9 Å, and diffract to 2.4 and 3.7 Å for native and SeMet-labeled crystals, respectively. The native and multiwavelength anomalous diffraction (MAD) data were collected at the Raxis-IV++ imaging plate using a synchrotron radiation x-ray source at Beamline 17B2 of the National Synchrotron Radiation Research Center in Taiwan. A single crystal was soaked in a cryoprotectant solution for 20 min before it was frozen in liquid nitrogen. The cryoprotectant solution contained the same components as the reservoir solution plus 25% glycerol. The structure of the ILF-DBD-DNA complex was determined using MAD phasing applied to the SeMet analog. The MAD data were collected at three wavelengths at 0.9798 (peak), 0.9800 (inflection), and 0.9721 Å (remote) under cryogenic conditions. All data were processed with the HKL2000 package (21).
SOLVE (22) was used to locate selenium sites and to generate the initial MAD phases at 3.7 Å. The initial phases were further improved using RESOLVE (23). XtalView (24) was used to examine electron density maps and molecular models. The native data set (2.4 Å) was used for further refinement by energy minimization and simulated annealing using CNS (25). Water molecules were added with a waterpick routine in the CNS program. The current model has an R factor of 21.5% for all reflections above 2 s between 30.0 and 2.4 Å resolution and an Rfree of 26.0%, using 8% randomly distributed reflections. The Ramachandran plot has no violation of accepted backbone torsion angles. The helical parameters of DNA were analyzed using the CURVE (26) program. The PyMol (27) and MolMol (28) programs were used to generate the figures. The data collection and refinement statistics are shown in Table 1.
|
Electrophoretic Mobility Shift AssayBinding reactions were performed at 25 °C in a total volume of 10 µl in 25 mM HEPES and 1 mM MgCl2, pH 7.0. DNA substrates used in this experiment were 25 µM. After adding 5 µl of the sample loading dye containing 89 mM Tris borate, 5% glycerol, and 0.01% bromphenol blue, the resulting complexes were resolved at 4 °C on a native 6% polyacrylamide gel in TBE buffer (89 mM Tris borate, 1 mM EDTA, pH 8.3) and were visualized using 0.5 µg/ml ethidium bromide.
|
| RESULTS |
|---|
|
|
|---|
The structure of the complex had two ILF-DBD molecules that bound to the opposite surfaces of the major groove of the DNA helix in a head-to-tail orientation (Fig. 2B). There was no direct contact between these two protein molecules. These two ILF-DBDs interacted with DNA in a similar but not identical manner. Surprisingly, the arrangement was quite different from that of the HNF-3
-DNA complex (13). The geometry of the DNA duplex was canonical B-DNA but with a few kinks. Briefly, the overall folding of the ILF-DBD consisted of four
-helices, three
-strands, one type I turn between the H2 and the H3 regions, and one wing (wing 1) (Fig. 2B). The architecture was similar to that of the DNA binding domain in other winged helix/forkhead proteins except for the H2-H3 turn and the C-terminal region (8). The ILF-DBD did not have the C-terminal wing extension found in the winged helix domain of HNF-3
. The typical wing 2 structure (residues 84-91) of the canonical winged helix/forkhead proteins had been replaced by an
-helix (H4) in ILF-DBD. This
-helix lie antiparallel to and was stabilized by H1 through many hydrophobic interactions and hydrogen bonds. The residues following helix 4 (92-98) formed a coil structure. Upon complex formation, the recognition helix (H3) docked into the major groove roughly perpendicular to the DNA axis and bound extensively with the core sequence 5'-TAAACA-3'. Wing 1 and the basic residues at the C terminus of ILF-DBD interacted with the minor grooves of the 3'- and 5'-flanking regions of the core sequence, respectively. The turn in the H2-H3 loop region bound with the phosphate backbone of the DNA duplex.
The structures of the two crystallographically independent domains were superimposed with a root mean square deviation of 0.53 Å for C-
atoms of secondary structural elements and showed no distinct differences (Fig. 2C). Compared with the NMR structure of the same ILF-DBD solved in the absence of DNA, the root mean square deviation of C-
atoms was 1.57 Å (Fig. 2C). This fact revealed some slight structural variations upon DNA binding, especially in wing 1, the H2-H3 loop, and the C-terminal region of the protein. In the absence of DNA, these regions were highly disordered in solution (8).
DNA Conformation in the ILF-DBD-DNA ComplexThe overall conformation of the DNA in the complex was the general B-form DNA and was bent
19° toward ILF-DBD1. The majority of bending occurred near the major groove, which was bound by the recognition helix H3 of ILF-DBD1. In the major groove of the core sequence, the base steps 5/6, 6/7, and 9/10 were kinked in this region and had slightly higher roll angles of 8.25°, 5.7°, and 10.1°, respectively. The bending in this region may enable wing 1 of ILF-DBD1 to approach T12 and A13, both on the minor groove 3' to the core sequence. Lys63, Ser75, and Trp77 of wing 1 interacted with the phosphate groups of the DNA and further stabilized the bending (Fig. 3). The major groove was slightly widened (
1-2 Å wider than the canonical B-DNA) at points where the two recognition helices of ILF-DBD1 (6 - 8 bp) and ILF-DBD2 (11-12 bp) were inserted. The minor groove was also slightly enlarged in the core sequence region. The protein-phosphate interactions were localized in two phosphate backbones that formed the major groove of the core sequence. The propeller twist angle for the base pair T5-A5' in the ILF-DBD-DNA complex was 1°, whereas the corresponding base pair in the HNF-3
-DNA complex was heavily propeller-twisted. Furthermore, the tilt was more negative at the TCAACC nucleotide base step in the HNF-3
-DNA complex than that of TAAACA in the ILF-DBD-DNA complex. The helical twist per base pair varied from 26.66 to 38.83°. The DNA was 0.6% shorter than canonical B-DNA with the same number of nucleotides.
|
In ILF-DBD1, H3 inserted into the major groove and interacted with three backbone phosphates and six bases of the DNA duplex (Figs. 3 and 4A). Asn49, Ser50, Arg52, His53, and Ser56 of H3 played a central role in DNA recognition. A7 was recognized by Asn49 through two direct hydrogen bonds. Asn49 also recognized T8' via a water-mediated hydrogen bond. Ser50 bound with base A6 using two water-mediated hydrogen bonds. Arg52 interacted with T10' and T8' through van der Waals contacts and contributed to the specificity for G9' with a direct and a water-mediated hydrogen bond (Fig. 4A). The side chain of His53 protruded into the major groove and recognized bases T5 and T6' through van der Waals force and a direct hydrogen bond, respectively (Fig. 4A). T7' was bound by Ser56, which is located at the C-terminal end of the H3, through a hydrophobic interaction. In addition to base recognition, Lys45, Asn54, and Ser56 bound with the phosphate groups of the DNA backbone to further stabilize the complex structure. The backbone phosphate groups of T8' and T11' formed a direct hydrogen bond with Ser56 and Lys45, respectively. However, the phosphate group of G4 interacted with Asn54 via a water molecule.
Ser50, Arg52, and His53 of ILF-DBD2 (Fig. 4B) recognized a specific DNA base in a manner similar to that of ILF-DBD1. Ser50 interacted with A11 indirectly through water-mediated hydrogen bonds. The side chain of Arg52 formed two direct hydrogen bonds with G14'. In addition, Arg52 interacted with both methyl groups of T13' and T15' through van der Waals contacts. The base T11' had a direct hydrogen bond with the side chain of His53. In ILF-DBD2, Ser56 interacted only with the backbone phosphate of T13'. We also noted that Lys45 and Asn49 of ILF-DBD2 had a different conformation from that observed in ILF-DBD1 (Fig. 4, A and B). Instead of interacting with the phosphate group as Lys45 did in ILF-DBD1, Lys45 in ILF-DBD2 recognized T15' through one water-mediated hydrogen bond. Interestingly, Asn49 in ILF-DBD2 did not participate in DNA recognition. The differences in DNA interactions between ILF-DBD1 and ILF-DBD2 might have been due to the DNA sequence used in this study. ILF-DBD1 showed more base pair specificity to its DNA recognition site than that of ILF-DBD2 (Fig. 3).
|
complex. In addition, the amide group of Ser75 hydrogen-bonded with the DNA phosphate. At the stem of wing 1, the amine group of Lys63 and the amide group of Trp77 interacted with the phosphate groups of T8' and G9', respectively (Figs. 3 and 5A). The N-terminal residues (Ser67-Gln68-Glu69-Glu70) of the wing 1 were positioned away from the DNA and did not make contact with the nucleic acids.
C-terminal Basic Residues Interacted with the Minor Groove of DNA The most interesting finding from the current study was that the basic residues (RKRRPR) following helix 4 in the C terminus of ILF-DBD appeared to be important for DNA recognition. The side chain of Arg98 in ILF-DBD1 pointed toward the DNA minor groove, and its two amine groups interacted with the O-2 atoms of nucleotides T2 and T3 through hydrogen bonds and bridging water molecules (Fig. 5, B and C). These two nucleotides are upstream from the core sequence. The side chain of Arg95 interacted with the phosphate group of G4 (Fig. 5B). In contrast, in the HNF-3
/DNA structure, Arg98 was at wing 2 and interacted with the G4 of the major groove through a water-mediated hydrogen-bonding network (Fig. 5C). Arg94 of HNF-3
formed a bidentate interaction with T1. However, Lys94 did not interact with the DNA in the ILF-DBD-DNA complex. These observations mean that ILF-1 and HNF-3
recognized the region upstream of the core sequence quite differently. Thus, the C-terminal region of the DNA binding domain of winged helix/forkhead proteins might contribute to DNA recognition specificity. We present this in greater detail under "Discussion."
Mg 2+-binding SiteThe electron density map of the ILF-DBD/DNA structure showed a metal-binding site at the C terminus of H3 in both ILF-DBD1 and ILF-DBD2. We assumed that it was a magnesium ion because that metal was added to crystallize the complex. The Mg2+ ion was coordinated square-bipyramidally with the main chain carbonyl oxygen of Leu71, Asn74, Phe77, and Ser72, as well as with two water molecules with bond distances ranging from 2.4 to 3.2 Å. Furthermore, the Mg2+ ion also interacted with the phosphate groups of DNA through a water molecule. Similar metal ion-binding sites were found in the HNF-3
/DNA and IRF-2/DNA structures. They were assumed to be a magnesium ion and a potassium ion, respectively (13, 29). The metal ion in the ILF-DBD-DNA complex may have the same function as that in the HNF-3
-DNA complex, where it neutralizes the helix dipole of the H3 C-terminal cap.
|
|
(Fig. 4, A and C). To investigate the DNA binding activity of these residues, we constructed alanine substitution mutants and assayed them by using EMSA. As shown in Fig. 6A, the K3A, K45A, S50A, and K73A mutations led to a decrease in DNA complex formation to 40, 20, 70, and 25%, respectively. We found no band shift for the R52A mutant, which implied that no protein-DNA complex formed. We suggest that the hydrophobic force and the hydrogen bonds formed by Arg52 to the last three bases of the core sequence (TAAACA) might serve as a main step for initiating or promoting the binding of ILF-DBD to DNA. Minimum Length of DNA Fragment Required for ILF-DBD Binding Determined by Electrophoretic Mobility Shift AssayOur crystal structure revealed that two ILF-DBD molecules bound to a 16-bp DNA duplex (Fig. 2B). We tested oligonucleotides 11-16 bp long (S11-S16) by using EMSA to determine the minimum binding site size of the DNA fragment required for ILF-DBD binding. Mixing ILF-DBD and S14, S15, or S16 in equal mole ratios yielded significant band shifts (Fig. 6B), which implied protein-DNA complexes had been formed. At a 2:1 ILF-DBD to DNA ratio, an extra band appeared. This slower mobility band probably represents the binding of 2 ILF-DBD molecules to a DNA duplex. With a 3:1 protein to DNA ratio, this slower mobility band appeared mainly on the gel. There was, however, no band shift for the binding of ILF-DBD to S11, S12, and S13 DNA duplexes, which meant that binding was largely abolished when the size of the oligonucleotide was less than 13 bp. We had similar results when we tested other 13-bp DNA duplexes that contained only the sequence that interacts with ILF-DBD1; this sequence was flanked by some G/Cs that acted as clamps (data not shown). These results ruled out the possibility that shorter nucleotides with high A/T content had formed unstable duplexes. We thus determined that the minimal size of oligonucleotide that permits efficient protein binding was 14 bp.
|
| DISCUSSION |
|---|
|
|
|---|
-DNA (13) and Genesis-DNA (30) have been reported. HNF-3
binds to DNA specifically as a monomer. However, the length of the DNA (13-mer) used in that study was too short to show interactions between wing 1 and the DNA. A longer DNA duplex was used for the NMR solution study of the Genesis-DNA complex (30), but the structure was calculated using a straight B-form DNA template. Prior to this study, little was known about how winged helix/forkhead proteins recognize diverse DNA sequence adjacent to the core sequence. The structural analysis in this study demonstrated that winged helix/forkhead proteins recognized DNA not only with the recognition helix (H3) but also from less conserved regions as discussed below.
DNA Core Sequence Recognition by the Recognition Helix H3The DNA binding domains of ILF, FREAC11, HNF-3
, and Genesis recognized the core sequences TAAACA, GTAAACA, GTCAATA, and AAAATAAC, respectively (7, 9, 13, 17). Although the HNF-3
-DNA complex structure provided the first molecular understanding about interactions between the protein and the core sequence, only Asn49 and His53 of the recognition helix interacted with G4, T5, and A7 (13) (Figs. 3 and 4C). In contrast to HNF-3
, we found important hydrogen bonds to T6' (His53), A7 (Asn49), and G9' (Arg52) and hydrophobic contacts with T5 (His53), T8' (Ser56, Arg52), and T10' (Arg52) in the ILF-DBD/DNA structure (Fig. 3). The contact patterns of the major groove by recognition helices between these two proteins were different. This result was especially surprising, because the amino acid sequences of the recognition helix in these two proteins are highly conserved. In addition, the position and orientation of the recognition helices of these two proteins were similar in the complexes. The reason for the difference was probably that the DNA sequence (GTCAACC) used in the HNF-3
complex study differed from that of the core sequence (GTAAACA) in this study. Different DNA sequence may cause different DNA bending or change the network of water molecules located within the interface between the recognition helix and DNA.
In the ILF-DBD/DNA structure, we found that the base pair A6-T6' had an unusual geometry with a zero propeller. This zero propeller was not observed at the equivalent position (C6-G6') in the HNF-3
-DNA complex, suggesting that this geometry might have facilitated the hydrogen bonding of His53 of the ILF-DBD to T6'. In addition, DNA-binding site selection studies (17, 31-34) showed that most forkhead proteins prefer cytosine and adenine at positions 9 and 10 of the core sequence, respectively. A thymine at position 9 is tolerated by most forkhead proteins but to a lesser extent than is a cytosine. In our complex structure, Arg52 recognized the base pairs C9-G9' and A10-T10' through hydrogen bonding and van der Waals contact, respectively. However, this important recognition did not occur in the HNF-3
-DNA complex. Comparing the 5'-T10'pG9'-3' step of the ILF-DBD/DNA structure with that of the 5'-T "pG"-3' step of the B-type DNA showed that T10' in the DNA-bound structure was displaced into the major groove and approached the side chain of Arg52. Additionally, our structure showed that the DNA was kinked toward ILF-DBD1 with a high roll angle of
10° at this step. These observations suggested that the recognition of T10' by Arg52 was significant. Substituting alanine for Arg52 dramatically reduced ILF-DBD binding to the DNA (Fig. 6A), which was consistent with another positional equivalent mutation of R127H in FOXC1 that led to a significant disruption in the DNA binding affinity of the protein (35). Thus, we hypothesize that Arg52 is essential for DNA binding and is important for core sequence recognition.
H2-H3 Loop Region May Regulate the ILF-DNA InteractionThe DNA binding specificity of winged helix/forkhead proteins is influenced by residues located in the H2-H3 loop region (9, 32, 34). These residues have variable sequences. The H2-H3 region is formed by a short helix, a random-coil segment, and a 310 helix in Genesis, ILF-DBD/DNA, and HNF-3
/DNA, respectively. It has been proposed (15) that the these structures may regulate the relative presentation of helix 3, thereby leading to different binding specificities.
Although the ILF-DBD and HNF-3
share a high degree of sequence identity within the DNA recognition helix, H3, these two proteins have slightly different DNA core sequence specificities. The core sequences are 5'-TAAACA-3' for the ILF-DBD and 5'-GTCAACA-3' for HNF-3
. The H2-H3 region of these two proteins has a segment of highly conserved residues ((Tyr37/Phe37)-Pro-Tyr-Tyr-Arg41). However, the residues before and after this segment are very diverse (Fig. 1). Even though the orientation of H3 in these two proteins is not different, we detected obvious structural differences in the H2-H3 loop region, namely residues that are closed to H3 in particular. Thus, it is possible that residues 42-46 might have an important effect on the DNA binding specificity of winged helix/forkhead proteins.
In the ILF-DNA complex, the side chain of Lys45 from ILF-DBD1 protrudes upward to form a salt bridge with the phosphate group of T11' (Fig. 4A), and the side chain of Lys45 from ILF-DBD2 forms a direct contact with the base T15' (Fig. 4B). These interactions further bend the DNA toward the recognition helices (H3) and form specific base recognitions between Arg52 of DBD1 and G9' as well as Arg52 of DBD2 and G14' (Fig. 4, A and B). Although Arg52 is conserved in HNF-3
, it is not recognized in the HNF-3
-DNA complex. In contrast to ILF, the side chain of Arg46 in HNF-3
protrudes in an opposite direction and forms an ionic interaction with the phosphate group of C6 (Fig. 4C). It is consistent with the reported structures of the DNA-protein complexes that the neutralization of charge by lysines or arginines will influence the bending of DNA (36, 37).
It is noteworthy that the orientations of Arg52 in ILF-DBD and Arg46 in HNF-3
may cause a difference in the electrostatic potentials on the protein surfaces. Consequently, the electrostatic interaction of these two proteins with the DNA phosphate backbone may be affected, and the presentation of H3 to the major groove of the DNA may be modulated. Furthermore, in study of FREAC1-7 (32), the specificity for 5'-GTAAATA-3' versus 5'-GTAAACA-3' is determined by a short stretch of amino acid residues at the junction between the H2-H3 turn and the first three residues of H3. Therefore, we hypothesize that these residues cause the slight variations in the core sequence binding properties between the ILF-DBD and HNF-3
.
The Role of Wing 1 in the Winged Helix/Forkhead ProteinsBecause a short DNA sequence (13 bp) was used in the HNF-3
complex (13), the interaction between wing 1 and DNA was not observed. A structural comparison between the DNA-bound and free forms of ILF-DBD reveals that wing 1 undergoes a major structural rearrangement upon DNA binding. Although wing 1 is mostly disordered in the absence of DNA (8), it is stabilized by protein binding to the minor groove of the DNA through direct hydrogen bonds. In the ILF-DBD-DNA complex structure, Lys73 of wing 1 interacts with the bases at the minor groove of the 3'-flanking core sequence. The K73A mutant significantly loses the DNA binding ability. These results indicate that wing 1 of the ILF-DBD is important for DNA binding and may play a role in DNA recognition. In addition, wing 1 and DNA interactions may stabilize a DNA bending toward the protein and help H3 recognize the core sequence. Therefore, the ILF-DBD-DNA complex structure may provide a model to show how winged helix/forkhead proteins use wing 1 to bind with DNA in the minor groove.
Sequence alignment shows that ILF has a negatively charged Glu70 in wing 1 (Fig. 1). Interestingly, the corresponding residue is a positively charged lysine residue in HNF-3
or Freac11, and a polar asparagine residue in Genesis. This difference may also contribute to DNA binding in other wing helix/forkhead proteins. However, this hypothesis needs to be confirmed by further biochemical studies.
Implications of DNA Binding Specificity Recognized by C-terminal Basic ResiduesA sequence comparison of the DNA binding domain of winged helix/forked proteins shows that the C terminus is one of the most diverse regions (Fig. 1). DNA binding specificities of winged helix/forkhead proteins are strongly influenced by the C-terminal part of the DNA binding domain (32). Instead of the typical wing structure of the canonical winged helix/forkhead proteins, the corresponding region (residue 84-91) in ILF had an
-helical structure.
Protein truncation and gel retardation studies (8) showed that deletion of the C-terminal six residues (residues 93-98) seriously reduced the DNA binding ability, which suggests that this region is important for protein-DNA interaction. In the present study, we found that helix 4 of the ILF-DBD altered the orientation of the C-terminal basic residues (93RKRRPR98) and made contacts to the minor groove that were different from those in the HNF-3
-DNA complex (Fig. 5C). This indicated that the C-terminal region of the DNA binding domain in the winged helix/forkhead family mediated the DNA binding specificities of 5'-flanking core sequences.
Cooperative Binding at the Two DNA SitesThis study provides the first view of cooperative binding of the ILF-DBD to DNA that contains the core sequence. The cooperativity of the ILF-DBD can arise through DNA conformability in the absence of strong protein-protein interactions. The structural evidence presented here showed that there were two ILF-DBDs per DNA duplex, whereas the biochemical data suggested cooperative binding. EMSA experiments showed that DNA at least 14 bp long was required for the two ILF-DBD molecules to bind optimally (Fig. 6B). It is of particular interest that when the second ILF-DBD molecule bound to the ILF-DBD-DNA complex, it induced DNA distortion that narrowed the major groove by bending the DNA helix 19° toward the protein. This DNA structure deformation by the second ILF-DBD binding seemed essential for stabilizing the ILF-DBD-DNA complex and might therefore explain why the ability of the ILF-DBD to bind to the 11-, 12-, and 13-bp-long DNA sequences was severely impaired (Fig. 6B).
The DNA binding domains of transcription factors are frequently sufficient to mediate cooperative binding at composite regulatory sites. It was shown previously that the DNA core recognition sequence is essential but not sufficient for the binding of winged helix/forkhead proteins (10). In the case of ILF-DBD, the structural and functional analysis of the complex revealed that the second ILF-DBD bound to the ILF-DBD-DNA complex was likely a key step for forming a stable complex. The unique DNA binding mode found in ILF may relate to the fact that one ILF-DBD molecule binds to DNA with low affinity, and two ILF-DBD molecules bind to DNA with high affinity. Consequently, the second ILF-DBD molecule bound to DNA may serve as a regulatory element for transcription of the human T-cell leukemia virus-long terminal region or interleukin-2. In this case, the release of the second ILF-DBD molecule from the complex would destabilize the binding of the first ILF-DBD molecule and dissociate it from the core sequence. The transcription would therefore stop. This hypothesis has yet to be shown in vivo.
Insights into the Diverse Binding Specificities of Winged Helix ProteinsThe winged helix/forkhead proteins shared a common fold with diverse DNA binding modes (12). In most cases, use of the recognition helix to recognize bases in the major groove is conserved. The extra structural elements, such as the wings and the H2-H3 loop, provide additional contacts with the DNA backbone. In the case of heat shock transcription factor (HSF), its wing 1 mediates dimerization rather than contacting the DNA (38). Although the nonconserved elements make less contact with DNA, they are important for regulating how the recognition helix makes specific base contacts with DNA. Domain swapping and mutagenesis studies on winged helix proteins revealed that the binding specificity of winged helix proteins depends on the H2-H3 loop (6, 15, 34). The structure of the FOXK1a-DNA complex provides the first evidence that the H2-H3 loop is in contact with DNA and that the wing 1 and the C-terminal tail also make specific base contacts. This study provides a new insight into how the nonconserved regions of winged helix/forkhead proteins may be regulate their DNA binding specificities.
| FOOTNOTES |
|---|
* This work was supported by Academia Sinica and the National Science Council Grant NSC94-2311-B-001-015 (to C.-D. H.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
1 To whom correspondence should be addressed. Tel.: 886-6-235-3535 (ext. 5515); Fax: 886-6-274-1694; E-mail: wjcnmr{at}mail.ncku.edu.tw. 2 To whom correspondence should be addressed. Tel.: 886-2-2788-2743; Fax: 886-2-2782-6085; E-mail: hsiao{at}gate.sinica.edu.tw.
3 The abbreviations used are: Fox, forkhead box; ILF, interleukin enhancer binding factor; IL-2, interleukin-2; ILF-DBD, DNA binding domain of ILF; EMSA, electrophoretic mobility shift assay; SeMet, selenomethionine; MAD, multiwavelength anomalous diffraction. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
B. A. Benayoun, S. Caburet, A. Dipietromaria, M. Bailly-Bechet, F. Batista, M. Fellous, D. Vaiman, and R. A. Veitia The identification and characterization of a FOXL2 response element provides insights into the pathogenesis of mutant alleles Hum. Mol. Genet., October 15, 2008; 17(20): 3118 - 3127. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.-L. Tsai, Y.-J. Sun, C.-Y. Huang, J.-Y. Yang, M.-C. Hung, and C.-D. Hsiao Crystal structure of the human FOXO3a-DBD/DNA complex suggests the effects of post-translational modification Nucleic Acids Res., November 29, 2007; 35(20): 6984 - 6994. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||