Structural Insights into c-Myc-interacting Zinc Finger Protein-1 (Miz-1) Delineate Domains Required for DNA Scanning and Sequence-specific Binding*

c-Myc-interacting zinc finger protein-1 (Miz-1) is a poly-Cys2His2 zinc finger (ZF) transcriptional regulator of many cell cycle genes. A Miz-1 DNA sequence consensus has recently been identified and has also unveiled Miz-1 functions in other cellular processes, underscoring its importance in the cell. Miz-1 contains 13 ZFs, but it is unknown why Miz-1 has so many ZFs and whether they recognize and bind DNA sequences in a typical fashion. Here, we used NMR to deduce the role of Miz-1 ZFs 1–4 in detecting the Miz-1 consensus sequence and preventing nonspecific DNA binding. In the construct containing the first 4 ZFs, we observed that ZFs 3 and 4 form an unusual compact and stable structure that restricts their motions. Disruption of this compact structure by an electrostatically mismatched A86K mutation profoundly affected the DNA binding properties of the WT construct. On the one hand, Miz1–4WT was found to bind the Miz-1 DNA consensus sequence weakly and through ZFs 1–3 only. On the other hand, the four ZFs in the structurally destabilized Miz1–4A86K mutant bound to the DNA consensus with a 30-fold increase in affinity (100 nm). The formation of such a thermodynamically stable but nonspecific complex is expected to slow down the rate of DNA scanning by Miz-1 during the search for its consensus sequence. Interestingly, we found that the motif stabilizing the compact structure between ZFs 3 and 4 is conserved and enriched in other long poly-ZF proteins. As discussed in detail, our findings support a general role of compact inter-ZF structures in minimizing the formation of off-target DNA complexes.

Bric-a-brac)-POZ (poxvirus and zinc finger) (POZ) 3 domain at its N terminus followed by 13 ZFs. It was first identified as a direct interactor of the oncogenic protein c-Myc by yeast twohybrid screening (1). Miz-1 is an activator of cell cycle regulator genes, such as the cyclin-dependent kinase inhibitors p15 INK4 (p15), p21 CIP1 (p21), and p57 KIP2 (p57) (2)(3)(4)(5). Miz-1 activates the transcription of those genes by recruiting different co-activators, such as the histone acetyltransferase p300 and the nucleophosmin (3,6). Moreover, it was shown that in response to TGF-␤, Miz-1 forms a complex with the Smad 2/3/4 proteins to activate the expression of p15. This interaction was shown to involve a region located within the first four ZFs of Miz-1 and the MH1 domain of Smad 3 (7). Whereas the role of Miz-1 in cell cycle regulation is well established, recent studies have underlined its implication in other cell cycle-independent processes, such as autophagy, endocytosis, vesicular trafficking, inflammation and DNA repair (8 -11). Moreover, some cytoplasmic functions of Miz-1, such as its implication in the regulation of the Wnt pathway, are emerging, revealing the multifunctional nature of this transcription factor (12).
c-Myc can directly bind Miz-1 and repress the expression of p15, p21, and p57, presumably by abolishing the interaction between Miz-1 and its co-activators (2,3,6). On the one hand, numerous studies carried out in different cancer cell lines have shown that the transcriptional repression of Miz-1 by c-Myc is relevant in many different stages of the carcinogenesis process (13). Interestingly, on the other hand, another recent study showed that high levels of Miz-1 cause the repression of c-Myc transcriptional transactivation (14). These results have led to the interesting hypothesis that the c-Myc/Miz-1 balance can dictate cell fate by controlling the transcription of gene networks (15).
The specific DNA binding of Miz-1 is mediated by its ZF motifs (16). ZFs are small domains of ϳ30 amino acids that share the following consensus sequence: (F/Y)XCX 2-5 CX 3 (F/ Y)X 5 LX 2 HX 3-4 H. These motifs are among the most abundant DNA-binding domains in eukaryotes and possess a ␤␤␣ fold that is stabilized by the coordination of a Zn(II) atom by two conserved Cys and two conserved His side chains (17). Classically, the recognition between ZFs and DNA is mediated by specific hydrogen bonds between side chains of residues Ϫ1, 2, 3, and 6 (position relative to the beginning of the ␣-helix) and DNA bases in the Hoogsteen edge (17). Typically, series of ZFs are connected by highly conserved TGEKP sequence linkers and bind in sets of two or more in the major groove of the DNA with each ZF contacting three DNA bases. These linkers are important to allow for consecutive motifs to optimally fit in the major groove of DNA (18,19). Variations in the consensus sequence can significantly reduce binding affinity (18,19). Although not infallible, recognition codes and statistical potentials have been derived to predict the DNA sequences bound by ZF tandems from the nature of the residues at positions Ϫ1, 2, 3, and 6 of their recognition helix (17,20,21). Miz-1-specific DNA binding has been mainly associated with regions of the proximal and core promoters of p15 and p21 (2). Although one of the regions of the p15 promoter bound by Miz-1 contains an initiator element (INR), no other specific or consensus sequences have emerged until recently. Indeed, two groups recently and independently unveiled similar Miz-1 consensus DNA sequences by genome-wide ChIP-sequencing and Bind-n-Seq methods, respectively (8,16). However, like many other transcription factors, Miz-1 is found to bind elsewhere on the genome, especially when it is stabilized or overexpressed. In fact, in cancer cells, Miz-1 is found to co-localize with c-Myc at many transcription start sites where INR sequences are present. Most interestingly, Miz-1 can inhibit c-Myc transactivation of genes involved in tumorigenic programs (14,22). Despite such a critical role in the normal and oncogenic biology of the cell, it is not known whether all of the ZFs or only subsets bind and recognize the Miz-1 consensus sequence, the INR element, or other nonspecific DNA sequences.
In this context, we have initiated the determination of the solution structure and DNA binding specificity of different constructs of Miz-1 ZF tandems. To date, the structures of Miz-1 ZFs 5-10 have been reported (23,24). Whereas the ZFs 7-10 possess stable Cys2His2 folds and the usual interdomain dynamics, ZF 6 has been shown to be undergoing conformational exchange on the microsecond to millisecond time scale. Moreover, we showed that the non-canonical DTDKE linker that connects ZFs 5 and 6 is rather unlikely for classical DNA binding because it will cause electrostatic repulsions with the DNA backbone according to the classical ZF recognition mode (24).
Here, we continue with the structural and dynamic characterization as well as the DNA binding properties of Miz-1 with a construct consisting of Miz-1 ZFs 1-4 (Miz1-4). Although the 3D structure of ZFs 2-4 of Miz1-4 present the typical ␤␤␣ fold, we were not able to determine the structure of the ZF 1 because this motif also undergoes conformational exchange on the microsecond to millisecond time scale. Also, unexpectedly, we discovered that ZFs 3 and 4 make many interdomain contacts, allowing them to form a compact structure that maintains them in a stable orientation unlikely to provide them with typical DNA binding properties.
As can be rationalized from the dynamic and structural characterization of Miz1-4, the construct fails to bind specifically different Miz-1 DNA targets. The motif involved in the stable interdomain interactions between ZFs 3 and 4 was mutated and led to the disruption of the compact structure. This impacted drastically the binding of the Miz1-4 construct to the DNA consensus sequence. Whereas the DNA binding of the WT Miz1-4 construct is mediated by only ZFs 1-3, reintroducing conformational freedom to ZF 4 led to the formation of a complex with all four ZFs bound. Whereas this could suggest a role in the recognition of the consensus by Miz1-4, we show that the actual Miz-1 consensus sequence can be predicted using the identity of the residues in the recognition helices of ZFs 7-12, hence suggesting that recognition of the consensus is accomplished by the C-terminal ZFs. Thus, we instead propose that Miz-1 ZFs 1-3 are involved in scanning DNA and that the compact structure prevents the formation of stable but nonspecific and off-target complexes. In accordance with this view, we find that the motif leading to the formation of the compact structure between ZFs 3 and 4 is conserved and enriched in long poly-ZF (containing 14 Ϯ 6 ZFs).

Results
Folding of Miz1-4 and Resonance Assignments-Circular dichroism was used to characterize and monitor the folding of the Miz1-4 construct. Folding of ZFs is pH-and zinc-dependent because conserved Cys side chains need to be in their deprotonated form to coordinate the Zn(II) atom. At pH 6.5, the addition of Zn(II) led to a transition from a CD spectrum typical of a random coil to an ␣-helix-like spectrum with a plateau reached at 4 molar eq of Zn(II), suggesting that under these conditions, the four ZFs of Miz1-4 are well folded (Fig. 1B). Moreover, at 5 eq of Zn(II), increasing the pH from 6.5 to 7.5 did not lead to any change of the Miz1-4 CD spectrum, suggesting that the protein is already optimally folded at pH 6.5. Based on this CD characterization, all NMR experiments were carried out at pH 6.5 and with 5 eq of Zn(II). As anticipated from the CD study, the cross-peaks on the 1 H-15 N HSQC spectrum of Miz1-4 are well dispersed, indicating the presence of stable tertiary structures (Fig. 1C). The data set recorded for Miz1-4 allowed for the assignment of 96 of 107 (90%) of the 1 H N , 96 of 112 (86%) of the 15 N, 91 of 112 (81%) of the 13 CЈ, 102 of 112 (91%) of the 13 C ␣ , and 94 of 103 (91%) of the 13 C ␤ . Interestingly, many cross-peaks of ZF 1 are very broad (weak) or broadened beyond detection, preventing us from assigning 9 of its 23 residues. This observation for ZF 1 suggests that this motif could experience some microsecond to millisecond motions (discussed below). Therefore, excluding the ZF 1, 84 of 85 (99%) of 1 H N , 84 of 89 (94%) of 15 N, 79 of 89 (89%) of 13 CЈ, 88 of 89 (99%) of 13 C ␣ , and 81 of 82 (99%) of 13 C ␤ were assigned for backbone atom resonances of residues 24 -112. Secondary chemical shift (⌬␦(C ␣ Ϫ C ␤ )) and DANGLE secondary structure prediction obtained for Miz1-4 are shown in Fig. 1D. The four expected ␣-helices of Miz1-4 ZFs are predicted by DANGLE (based on and angles) and present positive ⌬␦(C ␣ Ϫ C ␤ ) values. On the other hand, even if ␤-strands are not all predicted by DANGLE, mostly negative ⌬␦(C ␣ Ϫ C ␤ ) values are observed for these secondary structures. Altogether, CD and NMR data show that the four ZFs of Miz1-4 are folded in the selected conditions.
Miz-1 ZF 1 Undergoes Microsecond to Millisecond Conformational Exchange-Conformational exchange has previously been detected by our group for the ZF 6 of Miz-1 (24). It is FIGURE 1. Folding and secondary structure content of Miz1-4. A, alignment of the primary structures of the 13 ZFs of Miz-1. B, far-UV CD spectra of Miz1-4 at different pH and Zn(II) concentrations, demonstrating that the secondary structure content is optimal at pH 6.5 and 4 eq of Zn(II). C, 1 H-15 N HSQC of Miz1-4. Many resonances of ZF 1 (residues 4 -26) are weak or broadened beyond detection. D, the expected secondary structures for the consensus ZF motifs and the secondary structures determined from the chemical shifts of the backbone atoms and the program DANGLE are shown at the top. Secondary chemical shift values for the C ␣ and C ␤ (⌬␦ (C ␣ Ϫ C ␤ )) along with NOE connectivities are displayed and support the presence of the expected secondary structures for Miz1-4 ZFs.

Structure and DNA Binding of Miz-1 Zinc Fingers 1-4
FEBRUARY 24, 2017 • VOLUME 292 • NUMBER 8 worth noting that both ZF 1 and 6 contain a His at position 1 of the motif instead of a conserved Phe or Tyr (Fig. 1A). To verify whether the weak signal of Miz-1 ZF 1 in the 1 H-15 N HSQC is caused by conformational exchange, we recorded a set of CPMG relaxation dispersion experiments. One can notice that R 2,eff values of many ZF 1 amides decrease as the applied refocusing pulse frequency ( CPMG ) is increased ( Fig. 2A). This observation clearly indicates that ZF 1 undergoes some conformational exchange on the microsecond to millisecond time scale. The program NESSY was used to fit the dispersion curves according to the protocol described by Bieri and Gooley (25). All of the residues presenting a relaxation dispersion were best fit by the Carver-Richard equation, which is valid for all time scales (slow to fast conformational exchange limits) and considering a two-state model (26). Fig. 2B shows all of the residues manifesting relaxation dispersion curves. Interestingly, conformational exchange at the C terminus of the ␣-helix of ZFs 1, 2, and 4 is also observed. Although such motions at the end of ZF ␣-helices have been suggested to be frequent for C2H2 ZFs (24), cases of conformational exchange affecting the core of the ␣-helix and the first ␤-strand are rather unusual. It is worth mentioning that resonances of nine consecutive amides invisible on the 1 H-15 N HSQC, located in the second ␤-strand and in the beginning of the ␣-helix, do not reappear even at the highest CPMG used. This suggests the presence of conformational exchange, but on a faster time scale (i.e. toward the microsecond limit not covered by CPMG but in the range of the rotating frame relaxation dispersion experiments (T 1 )) (27). Such a widespread exchange broadening for ZF 1 supports the notion that a His at position 1 promotes motions that affect the local environment of almost all of the amides of the motif.
Solution Structure of Miz-1 ZF 2 and ZF 3-4 -Because of the ZF 1 extensive conformational exchange, we were not able to solve its 3D structure. However, we were able to solve the structures of ZFs 2, 3, and 4 and to characterize their linkers. Tandems of ZFs connected by classical TGEKP linkers are relatively independent from one another but are quasi-ordered in the absence of DNA (28,29). Indeed, these linkers are moderately flexible, with 15 N-{ 1 H} NOE values significantly lower than for the ZF core motif (usually ϳ0.5 for the linkers and 0.65 for the motifs). In accordance, no long range NOE involving the TGEKP linkers and consecutive ZFs is usually detected in the absence of DNA (23). However, extensive medium and long range NOEs involving residues from the non-canonical SGEAR linker and ZF 3-4 are observed (Fig. 3). This clearly indicates that they are not independent from one another. On the other hand, no long range NOE was observed for the non-canonical linker between ZFs 2 and 3. For these reasons, we decided to calculate the structure of the ZF 2 alone and ZFs 3 and 4 together.
The NOE restraints, dihedral angles, and hydrogen bonds used for structure calculations of each ZF are summarized in Table 1. The superposition of the 20 lowest energy conformers onto their geometric averages demonstrates that all three ZFs are well defined, with backbone RMSDs of 0.42 Ϯ 0.07, 0.27 Ϯ 0.07, and 0.19 Ϯ 0.05 Å for ZFs 2, 3, and 4, respectively (Fig. 4A). Moreover, their structures reveal that they all present the classical ␤␤␣ topology. The highest backbone RMSD observed for ZF 2 is most probably due to the presence of a Cys at position 4 of the ␣-helix (Cys 47 ; Fig. 4B) instead of a conserved Leu in the motif hydrophobic core. Indeed, relatively few long range NOEs were detected for this motif. This can be caused by a less stable tertiary structure and/or by the fact that a Cys side chain has a lesser density of 1 H and hence generated less NOE. The 15 N-{ 1 H} NOEs recorded for Miz1-4 argue in favor of this last hypothesis and suggest that the amides of ZF 2 are as rigid as those of ZF 3-4 on the picosecond to nanosecond time scale (see below).
The superposition of the 20 lowest energy conformers for ZFs 3 and 4 together shows that they form a well defined compact structure with a backbone RMSD of 0.92 Ϯ 0.36 Å (Fig.  4C). Based on an alignment of 2435 human ZFs realized by Schmidt and Durrett (30), the Ala 86 , Arg 87 , and Leu 96 have low probabilities to be at their specific position (0, 1.6, and 1.5%, respectively, according to this database). Whereas the Ala 86 and the Leu 96 participate to the formation of a hydrophobic core between ZFs 3 and 4 with the aliphatic portions of the Lys 79 and Lys 80 side chains, the Arg 87 engages solvent-exposed electrostatic interactions with the conserved Glu 85 of the linker (Fig.  4C). Typically, in the DNA-bound state, the conserved Glu forms electrostatic interactions with a conserved basic residue at the position of Lys 80 . Interestingly, the non-conserved Arg 87 appears to compensate for this interaction in the free state. One can also notice that the conserved Lys instead of the Ala 86 in the linker would cause electrostatic repulsion with the conserved basic residue at position of Lys 80 and destabilize the compact structure. This observation provides an explanation for why this fold is not observed for ZF tandems connected by classical TGEKP linkers. Finally, the non-conserved Ala 86 , Arg 87 , and Leu 96 stand as the key residues allowing the formation of a cryptic hydrophobic core stabilizing ZFs 3 and 4 in this stable inter-ZF structure. . This suggests that the three linkers undergo more frequent local motions than the four ZFs. Hence, the compact structure adopted by ZFs 3 and 4 does not seem to contribute to rigidify the linker that connects these motifs as compared with what is normally observed for FIGURE 3. Stacked bar chart of the NOE used for the calculation of the Miz1-4 structure. Intraresidue, sequential, medium range, and long range NOE are colored in light gray, gray, dark gray, and black, respectively. Note the many long range NOEs involve residues from the linker between ZFs 3 and 4 (linkers are highlighted in light gray).  (24,29). For macromolecules, 15 N-T 1 and T 2 , respectively, increase and decrease as their effective correlation time ( m ) increases. Hence, the smaller T 1 /T 2 ratios observed for Miz1-4 indicate that it tumbles faster than the other two constructs. As described in detail in the supplemental material (rotational diffusion analysis of Miz1-4) and depicted in Fig. 6, this is mainly caused by the compact structure, which orients the N-H vectors of ZF 4 perpendicular to the largest component of Miz1-4 rotational diffusion tensor (D par ). This leads to a faster effective tumbling for the N-H vectors of this ZF compared with the others. This will result in a decrease in T 1 and an increase in T 2 for those backbone amides with the net result of decreasing the average T 1 /T 2 ratio (Fig. 5). The details of the simulations presented in Fig. 6 can be found in the supplemental material, and a summary of the simulated parameters is given in Table 2.
The Mutation Ala 86 3 Lys Destabilizes the Compact Structure Adopted by Miz-1 ZFs 3 and 4 -As described above, we hypothesized that in the compact structure fold, the conserved Lys usually present in TGEKP linkers should cause electrostatic repulsions with the conserved basic residue at the position of Miz1-4 Lys 80 and destabilize it. Hence, to verify this assertion and to destabilize the compact fold, we mutated the Ala 86 to a Lys and prepared the Miz1-4 A86K mutant. Interestingly, whereas most of the amide cross-peaks on the mutant 1 H-15 N HSQC spectrum are at chemical shifts similar to those in the wild type (Fig. 7A), some peaks, mostly residues located in the linker between ZFs 3 and 4 and at the interface of those ZFs, are significantly perturbed (Fig. 7, B and C). This suggests that these amides experience changes in their chemical environment either because of the Ala to Lys substitution or because the Lys side chain prevents the formation of the compact structure as hypothesized. Accordingly, the inspection of a 15 N-edited NOESY-HSQC spectrum recorded with the mutant reveals that many NOEs characteristic of the ZF 3-4 compact structure are lost. For instance, we noted that NOEs detected in Miz1-4 between Arg 87 HN and Leu 97 H␦*, Phe 97 HN, and Thr 98 H␣ (Fig. 7C) are absent in the spectrum of the mutant.
To further validate the loss of the compact structure for the mutant, we measured its amide 15 N spin T 1 and T 2 (supplemental Fig. S1). As described in the supplemental material, if it is disrupted, ZF 4 should realign with the other ZF and slow down the overall tumbling of the tandem. Accordingly, the T 1 and T 2 values of the mutant are systematically higher and lower, respectively, leading to T 1 /T 2 values that are systematically lower with an average of 7.26 Ϯ 2.03. Such a ratio corresponds to an effective m of 7.9 ns (Fig. 7D), a value larger than wild type Miz1-4 (6.9 ns) and more consistent with those reported for tandems of four ZFs devoid of stable compact structure (e.g. 8.4 ns for MTF-1 ZFs 1-4) (29). Using the T 1 /T 2 values of the same residues as those used with the wild type construct, we characterized the rotational diffusion analysis of ZFs 2-4 of the mutant. The parameters obtained from the analysis are presented in Table 3. As expected, the rotational diffusion of the three ZFs is significantly best simulated using the axially symmetric model (at the 90% confidence interval) with their ␣-helices preferentially aligned with D par (Fig. 7E). Collectively, these results further validate that ZFs 3 and 4 are no longer perpendicular to each other and that the compact structure is not stable in the Miz1-4 A86K mutant. Moreover, the D iso values of the three ZFs have significantly decreased in the mutant compared with the wild type, in agreement with a more anisotropic overall rotational diffusion for this protein construct. In fact, the D iso values calculated for Miz1-4 A86K ZFs 2, 3, and 4 (1.99, 1.96, and 2.48 ϫ 10 7 s Ϫ1 , respectively) are similar to those calculated by Potter et al. (29) for ZFs 2, 3, and 4 of a construct containing the first four ZFs of MTF-1. Altogether, these results clearly show that the mutation of Ala 86 to a Lys prevents the formation of the compact structure adopted by Miz-1 ZF 3-4 and validates the role of the non-canonical SGEAR linker in the formation of such a structure.
Role of Miz-1 ZF 3-4 Compact Structure on the DNA Binding-The structure alignment of the ZF 4 of Miz1-4 with the second ZF of Zif268 bound classically to its target DNA is shown in Fig. 8A (31). One can notice that the compact structure adopted by Miz-1 ZF 3-4 is unlikely to favor the classical DNA binding of the four ZFs. Indeed, if Miz-1 ZF 4 was to bind to the major groove of DNA, ZF 3 would reorient ZFs 1 and 2 away from DNA. Conversely, despite the conformational exchange in ZF 1 and the fact that ZFs 2 and 3 mostly bear hydrophobic residues in their recognition helices (Figs. 4A and 8A), if ZFs 1-3 were to bind DNA, they would also orient ZF 4 away from DNA.
Fluorescence anisotropy was used to determine the apparent DNA binding affinity of Miz1-4 to the sequences of the p15 promoter identified by Seoane et al. (2) to be engaged by Miz-1 (region Ϫ155 to Ϫ140 and Ϫ2 to ϩ14 that contains an INR element) and to the consensus sequence recently identified by Wolf et al. (8) (supplemental Fig. S2). As shown in Fig. 8B, the Miz1-4 construct binds to all of the sequences tested, including two unrelated and nonspecific sequences, with a similar low apparent affinity in the micromolar range. Reported affinities for series of three ZFs to nonspecific DNA sequences normally lie in the low micromolar range, whereas their specific binding rather lies in the nanomolar to picomolar range (17,(32)(33)(34). Hence, our data clearly demonstrate that Miz-1 ZFs 1-4 bind DNA with an affinity and a specificity that are not sufficient to promote specific molecular recognition of the consensus DNA or the p15 core promoter regions by this transcription factor. To identify which ZFs are involved in the weak and nonspecific binding, we recorded 1 H-15 N HSQC spectra of Miz1-4 in the presence of 1 molar eq of the consensus DNA ( Fig. 8C and supplemental Fig. S3). Strikingly, all of the resonances of the ZFs 1, 2, and 3 become invisible upon DNA addition, whereas most of the amide cross-peaks of the ZF 4, even if significantly less intense, are still visible and at similar chemical shifts. This result clearly supports the notion that ZFs 1-3 are more affected by the presence of DNA than ZF 4 and that they are involved in nonspecific and weak interactions. Indeed, a weak binding of DNA by ZFs 1-3 could lead to chemical exchange (and extreme line broadening) on the microsecond to millisecond time scale through sliding and/or hopping mechanisms, as already described for some proteins in complexes with nonspecific DNA (35,36). However, we ran relaxation dispersion experiments on the complex and were not able to recover any of the cross-peak intensities (data not shown). When a 15 N amide is experiencing a conformational exchange (T 2 decrease by R ex Ϫ1 ), it leads to line broadening. However, the evolution of the chemical exchange process leading to the line broadening can be refocused during the CPMG pulse train in the course of a relaxation dispersion experiment. The extent of refocusing can be such that the contribution of the R ex Ϫ1 can be completely removed at high field strength or high refocusing frequencies with the result that vanishing peak intensities of broadened resonances can be recovered (37). Hence, the fact that we could not recover the cross-peak intensities of ZFs 1-3 argues against the presence of an active conformational exchange process during the CPMG experiments on the microsecond to millisecond time scale as reported by others (40,41). However, it is possible that a conformational exchange, caused by the scanning of DNA, occurs on a faster time scale than the one covered by the  a For the isotropic model, the correlation time ( m ) is given by 1/6D iso , where D iso is the isotropic diffusion coefficient. b is an error function, given the sum of the squared difference between the experimental and calculated T 1 /T 2 values divided by the estimated experimental error. See "Experimental Procedures." c For the axially symmetric model, the effective correlation time ( m,eff ) is given by 1/6D iso , where D iso is equal to (2D perp ϩ D par )/3. d This column displays the number of T 1 /T 2 values that were used to calculate the rotational diffusion coefficients of the different ZFs. e Results of the F statistics analysis provided by the R2_R1_Diffusion program. f Simulations of ZF 3-4 T 1 /T 2 values were realized, keeping ZF 3 in the same molecular and diffusional reference frame as the one obtained from the analysis of ZF 3 alone without considering wobbling motions for the ZF 4 (S 2 ϭ 0 and e ϭ 0 ns). g NA, not applicable. h Simulations of ZF 3-4 T 1 /T 2 values were realized, keeping ZF 3 in the same molecular and diffusional reference frame as the one obtained from the analysis of ZF 3 alone considering wobbling motions for the ZF 4 (S 2 ϭ 0.75 and e ϭ 1.7 ns). See "Experimental Procedures." i The F parameter was calculated as described by Gagné et al. (63).

JOURNAL OF BIOLOGICAL CHEMISTRY 3331
CPMG experiment. Alternately, our results can be explained by the fact that ZFs 1-3 bind at multiple and non-interchanging sites on the consensus sequence (Fig. 8C). Indeed, based on the recognition code proposed by the group of Pabo (17) and on statistical potentials (21), four different sites could be bound by Miz-1 ZFs 1-3 with six or seven predicted contacts considering residues at positions Ϫ1, 3, and 6 of the ZF recognition helices (Fig. 8C). It is noteworthy that these four sites would allow the Arg at position 6 of the ZF 1 to contact a guanine, which is generally one of the strongest and more stringent interactions found for specific ZF DNA binding (17). Assuming that all of these sites are bound with comparable affinities by the construct, it is likely that the backbone amides of ZFs 1-3 experience different static chemical shifts in the different complexes. It should be noted that although the apparent binding constant lies in the micromolar range, at the concentration used in the experiments, all of the protein constructs are expected to be bound to DNA. Coupled to the increase of Miz1-4 apparent molecular weight in the complex, this could lead to the disappearance of the cross-peaks. However, because ZF 4 does not contact DNA, it resides in a more chemically isotropic environment independently of where ZFs 1-3 bind on DNA. This can explain the fact that the intensity of its cross-peaks is reduced because of the increase in the molecular weight but not decreased beyond detection by different chemical environments. But, once again, we cannot rule out the contribution of a conformational exchange caused by a sliding of Miz1-4 on DNA on a faster time scale than the one covered by CPMG experiments.
We show on Fig. 8D the 1 H-15 N HSQC of Miz1-4 A86K in the presence of 1 eq of the consensus DNA. One can notice that the amide resonances of the four ZFs are present on the spectrum and that most of them are perturbed by the presence of DNA ( Fig. 8D and supplemental Fig. S3). This strongly suggests that the disruption of the compact structure allows for the four ZFs to engage DNA to form one well defined and stable complex. To  further validate this, we have measured the apparent affinity of the Miz1-4 A86K for the consensus DNA (Fig. 8B). Accordingly, we found that the affinity of the mutant for the consensus is increased by 30-fold with an apparent K d of 0.11 M. As shown in Fig. 8D, only two positions on the consensus can satisfy the strong and specific Arg-guanine interactions defined by the Arg

Structure and DNA Binding of Miz-1 Zinc Fingers 1-4
FEBRUARY 24, 2017 • VOLUME 292 • NUMBER 8 at position 6 of the ␣-helices of ZF 1 and 4 (Fig. 8D). However, according to the prediction tools (17,21), site 1 depicted in Fig.  8D would allow for four additional favorable and observed contacts compared with binding site 2, suggesting that it is probably the site preferentially bound by Miz1-4 A86K . Although the structure of the complex would need to be solved to make conclusions about the exact nature of this complex, the results presented here confirm that the compact structure does not allow for the recognition of the consensus and leads to weak DNA binding, whereas the reestablishment of the freedom of ZF 4 leads to much stronger DNA binding. Fig. 8A, we reasoned that Miz1-4 could also be involved in protein-protein interactions. Indeed, the three Ala and Leu residues (Ala 45 , Ala 46 , and Ala 49 and Leu 71 , Leu 74 , and Leu 77 ) present on Miz-1 ZFs 2 and 3 at positions normally involved in DNA binding, could constitute a suitable hydrophobic surface for protein-protein interactions if solvent-exposed. In a series of co-immunoprecipitation experiments, Seoane et al. (7) reported strong evidence for the implication of Miz-1 ZFs 1-4 in the formation of a protein complex containing Miz-1 and Smad 2/3/4 in response to the activation of the TGF-␤ pathway. Moreover, the authors performed GST pull-down experiments and observed a direct interaction between purified Miz-1 and the Smad 3 MH1 (7). For the sake of completeness in our quest to characterize the structural biology of Miz1-4, we cloned, expressed, and purified the MH1 domain of Smad 3 (residues 1-145) and tried to validate the interaction at the atomic level. The CD spectrum and the thermal denaturation of the Smad 3 MH1 obtained are similar to what have already been published for the folded protein (supplemental Fig. S4A) (38). To detect the interaction between Miz1-4 and the Smad 3 MH1, 1 H-15 N HSQC spectra of 15 Nlabeled Miz1-4 were recorded in the absence or in the presence of 1 eq of unlabeled Smad 3 MH1. Surprisingly, no resonance from Miz1-4 was perturbed by the addition of the Smad 3 MH1 domain, suggesting that there is no interaction between the two folded domains (supplemental Fig. S4B). No protein precipitation was observed upon the addition of the MH1 domain, and both proteins were intact and in the right ratio as shown by the SDS-polyacrylamide gel presented in supplemental Fig. S4B. Moreover, no interaction was observed between these proteins by CD and by fluorescence anisotropy (supplemental Fig. S4, C-E). Our results demonstrate that there is no direct interaction between Miz-1 ZFs 1-4 and the Smad 3 MH1 domain. Because the folding of both the MH1 domain of Smad 3 and ZFs 1-4 of Miz-1 depend on the coordination of zinc atoms, we hypothesize that the utilization of EDTA and the absence of a reducing agent in the buffer used by Seoane et al. (7) in their pull-down experiments could have led to the formation of nonspecific intermolecular disulfide bonds between Smad 3 MH1 and Miz-1 in vitro. More investigation will be necessary to identify the protein(s) that bridges Miz-1 to Smad 2/3/4 through its first four ZF motifs.

Miz1-4 Does Not Interact Directly with the MH1 of Smad 3-Based on the model presented in
Toward the Understanding of Miz-1-specific DNA Binding to Its Consensus Sequence-Our results suggest that Miz1-4 is most likely not involved in the formation of the native and specific complex between Miz-1 and its cognate sequence. We can also cast doubt on the possibility that ZFs 5 and 6 participate in Miz-1 DNA binding. Indeed, we have previously shown that the Asp and Glu residues in the DTDKE linker between these ZFs would clash with the DNA phosphates. In addition, as shown here for ZF 1, ZF 6 also undergoes extensive conformational exchange, indicating that its binding-competent conformation is probably not always present (24), and this will further contribute to weaken DNA binding. On the other hand, Miz-1 ZFs 7-10 all present stable canonical ZF structures and no unusual dynamic properties or compact folds that could prevent them from binding DNA classically (23,24). Indeed, these ZFs are linked by classical or quasi-classical (S/T)GEKP linkers (Fig.  1A). To identify potential Miz-1 ZF series that could recognize Miz-1 target DNA consensus sequence derived by Wolf et al. (8), we have used the code proposed by the group of Pabo (17). It is important to remember that the second conserved region of the sequence identified by Wolf et al. (8) (Fig. 9B) has a high homology with the two consensus sequences recently identified by Barrilleaux et al. (16). Strikingly, the best fit obtained involves ZFs 7-12 with 12 interactions unambiguously predicted by the code. Remarkably, the most conserved regions of the consensus are matched to the strongest and most specific canonical side chain-DNA base interactions (i.e. involving either an Asp with a C or an Arg or a Lys with a G) (Fig. 9B). In addition, we have used the approach proposed by Persikov et al. (21), based on statistical potentials derived from a database of structural and thermodynamic data, to predict the DNA sequence bound by Miz-1 ZFs 7-12 (Fig. 9C). This approach also predicts almost perfectly (one mismatch) the second and third most conserved sequences of the consensus. Also according to the recognition code of Pabo, we found that the least conserved region of the consensus, although almost identical to the second, could alternately be bound by ZFs 7 and 8 (Fig. 9D). Perhaps this degenerated cluster of binding sites is important to favor a high local concentration as a step into the mechanism of specific recognition. It is worth mentioning that two other recognition modes for Miz-1 ZFs along the consensus DNA sequence fit well with nine interactions predicted by the code in both cases (supplemental Fig. S5B). However, because those recognition modes involve ZFs 1-6 and the matches do not correlate well with the conserved regions, we believe that these binding modes, although less stable, could serve in the DNA scanning. In this regard, Zandarashvili et al. (34,36) recently published studies demonstrating the importance of the presence of a lower affinity DNA binding ZF for optimal target search efficiency by a protein containing three ZFs. Analogously, we propose that ZFs 1-6 serve a similar purpose for the efficient recognition of the consensus by subsets of ZFs 7-12.

Discussion
Miz-1 is a transcription factor that contains 12 consecutive ZFs. Despite its crucial role in many decisive aspects of cell biology, the precise identity and role of its ZFs in the recognition of cognate DNA sequences remain unknown. In this study, we report the structural and dynamic characterization of ZFs 1-4 of Miz-1 by solution state NMR as well as their DNA binding properties. Our structural and dynamic analysis revealed that ZF 1 undergoes peculiar conformational exchange and revealed the existence of an unusual compact fold involving ZFs 3 and 4. Similar conformational exchange has been reported by us for ZF 6 (24), and those two ZFs represent, to the best of our knowledge, the only reported cases so far. As discussed below, the compact structure we have unveiled has rarely been discussed or seen so far, but it could be conserved in other poly-ZF proteins yet to be studied.
Our results and previous work from our laboratory enable us to propose that the first six ZFs of Miz-1 are involved in DNA scanning in the search for specific DNA binding through ZFs 7-12 (Fig. 9). Indeed, ZFs 1-4 bind to nonspecific DNA with the same marginal affinity as cognate DNA sequences (p15 core promoter (2) and the newly identified consensus (8)). Moreover, the unusual flexibility of the ZF 1 and the compact fold of ZFs 3 and 4 coupled with already published evidence on ZFs 5 and 6 render canonical DNA binding improbable for those ZFs. Interestingly, this region of Miz-1, more specifically ZFs 1-4, has been reported to bind to the MH1 domain of Smad 3 and to play a role in the TGF-␤-dependent activation of p15 (7). Unfortunately, using biophysical approaches and the purified constructs, we were unable to validate a direct interaction. More investigations will be necessary to identify the protein(s) that bridges Miz-1 to the Smad proteins.
Of more general interest, although specific DNA recognition by poly-ZFs generally involves series of 3-5 ZFs, the number of ZFs per poly-ZF protein has increased throughout evolution with an average of eight ZFs and an upper limit of around 40 ZFs for humans (17,30). This observation suggests that these ubiquitous motifs are implicated in biological functions other than DNA binding. Accordingly, there are an increasing number of reported cases of ZFs involved in protein and RNA interactions (39,40). Some groups have even suggested that the potential of ZFs for protein interactions is likely to be greater than for DNA interaction (41). However, as discussed elsewhere, the structural and dynamic characteristics dictating the functional role of ZFs are poorly understood (40). Tandems of ZFs are generally separated by highly conserved TGEKP sequence linkers that have been shown to be important for optimal DNA binding (18,19,42,43). Accordingly, different groups have proposed that the nature of ZF tandem linkers could enable the prediction of their functions (18,19,44). Although many studies have addressed the functional impact of linker variations on DNA binding by ZFs, little is known about the structural impact of such variations. Here we report that the SGEAR linker between Miz-1 ZFs 3 and 4 participates in the formation of a compact structure that maintains those ZFs in an orientation unlikely for typical DNA binding. To assess the recurrence of similar compact structures potentially adopted by ZF tandems among poly-ZF proteins, we searched for other reported cases in the literature and tried to find rules that could allow identification of them.
To the best of our knowledge, there is only one other reported case of consecutive ZFs separated by a five-residue linker that adopt a compact structure in the absence of DNA. Indeed, the linker between ZFs 5 and 6 of MBP-1 is involved in interfinger residue contacts that maintain them in a conformation different from what is usually observed for classical DNA binding (PDB entry 1BBO, supplemental Fig. S6A) (45). Interestingly, although MBP-1 ZF 5-6 and Miz1-4 ZF 3-4 conformations are quite different, in both cases, the presence of a long side chain amino acid at the position of Miz1-4 Leu 96 (Lys 40 for MBP-1) instead of a conserved small residue is involved (probability of 74.41% for an Ala, Ser, or Gly at that position according to Schmidt and Durrett (30)). In both cases, this residue participates in the formation of a hydrophobic core at the finger interface with a non-conserved hydrophobic residue in the linker (MBP-1 Val 27 and Miz-1 Ala 86 ) (supplemental Fig. S6, B and C) and a residue from the preceding ZF. The observation of the structure of INSM1 ZF 4 -5, deposited in the PDB under the code 2D9H (unpublished), shows that these motifs also adopt a compact structure unlikely to be fit for classical DNA binding (supplemental Fig. S6A). Incidentally, these ZFs do not participate in the specific DNA binding of INSM1 (46). Strikingly, the residues of INSM1 involved in hydrophobic interaction that maintain INSM1 ZFs 4 and 5 in a fixed orientation (Leu 48 , Val 56 , and Thr 65 ) are at the same position as the one responsible for the compact structure adopted by Miz1-4 ZFs 3-4 (Lys 80 , Ala 86 , and Leu 96 ) (supplemental Fig. S6C). However, it should be noted that those ZFs are connected by a shorter linker (4 residues instead of 5) that could also contribute to fix their orientation. Another ZF tandem linked by a non-classical fourresidue linker (KKIK) with restrained ZF flexibility was recently identified for ZFAT ZFs 4 and 5 (PDB entry 2RV7) (47). Once again, a hydrophobic residue of the linker (ZFAT Ile 63 ) is involved in hydrophobic interactions that stabilize the orientation of those ZFs. However, in this case, the linker hydrophobic residue does not interact with the residue present at the position of Miz1-4 Leu 96 but instead interacts with the ZF 4 Tyr 41 (supplemental Fig. S6E). Interestingly, this residue also has a low probability of 0.53% to be at its specific position based on the Schmidt and Durrett alignment (30). This last example suggests that many different scenarios can cause the restriction of ZF orientations. Finally, the inspection of the structure of TFIIIA bound to RNA shows that its ZFs 4 and 5 adopt a nonclassical orientation necessary for the specific RNA binding of this poly-ZF (48,49). Strikingly, once again, hydrophobic interactions between a long side chain residue at the position of Miz1-4 Leu 96 (Arg 145 ) and two hydrophobic residues at the position of Miz1-4 Lys 80 and Ala 86 (TFIIIA Phe 127 and Leu 133 ) are observed in this unusual fold (supplemental Fig. S6F). Interestingly, the particular orientation of ZF 4 -5 is also observed in the crystal structure of TFIIIA bound to DNA and prevents the ZF 4 from contacting DNA bases (50). In this case, the ZF 4 is acting as a spacer element between the ZFs 1-3 and 5 that mediate the specific DNA recognition (supplemental Fig. S6F). This example of TFIIIA ZF 4 -5 illustrates that a better understanding of ZF tandem structural biology could help to predict their functions.
Based on the structural information described above, we ran a PHI-BLAST search against the Swiss-Prot databank to identify ZF tandems that could form compact structures similar to the one observed for Miz-1 ZFs 3 and 4, MBP-1 ZFs 5 and 6, INSM1 ZFs 4 and 5, and TFIIIA ZFs 4 and 5. We found 170 ZF tandems containing a hydrophobic residue at position 3 or 4 of the linker and a non-conserved hydrophobic residue (all except Ala) or a long side chain residue (Arg or Lys) at the position of Miz1-4 Leu 96 (the PHI-BLAST pattern used is shown in supplemental Fig. S7). Interestingly, an average of 14 Ϯ 6 ZFs per poly-ZF are observed for the 66 human proteins identified in the BLAST. This suggests an enrichment of unusual tandem fold for poly-ZFs containing more than the average ZFs in human (average of eight ZFs per poly-ZF in humans). In another PHI-BLAST, we identified 37 proteins in the data bank that contain long hydrophobic side chains at the positions of ZFAT Tyr 42 and Ile 63 . It is worth mentioning that the residue numbering used here is the one provided in the different PDB files.
The above observations suggest that compact structures similar to the one observed between Miz-1 ZFs 3 and 4 are probably not uncommon among poly-ZFs and may possess a conserved role. Based on our results, one possible role could be to limit the number of consecutive ZFs binding in a canonical fashion and consequently fine tune the DNA affinity of poly-ZFs proteins, allowing for an optimal DNA scanning speed in the search for specific sequences. Indeed, it can be anticipated that too many ZFs binding in a concerted fashion to nonspecific DNA could lead to the formation of stable nonspecific complexes that slow down or even halt the DNA scanning process. Another role for ZF tandem compact structures could be to prevent DNA binding of ZF subsets so they can serve as platforms to engage other proteins or RNA.
To conclude, the results presented in this study not only contribute to deepen our knowledge of Miz-1 structural biology, but also provide key elements essential to our understanding of Miz-1-specific DNA binding. The structural, dynamic, and functional results presented here for Miz-1 ZFs 1-4 together with previous structural studies provide strong evidence for the implication of Miz-1 ZFs 1-6 in the scanning of DNA and ZFs 7-12 in specific DNA binding. Moreover, the analysis of the unusual compact structure adopted by Miz-1 ZFs 3 and 4 suggests that other ZF tandems could form such structures and contributes to expanding our understanding of these ubiquitous and versatile motifs.

Preparation of the Miz1-4 and Miz1-4 A86K Constructs-
The cDNA of ZFs 1-4 of Miz-1 (Miz1-4, residues 304 -414) was generated by PCR from the complete cDNA of Miz-1 using primers F (5Ј-CAT ATG GTC ATC CAC AAG TGC GAG GAC TGT GG-3Ј) and R (5Ј-GGA TCC CTA GCC GCT GTG CAC CAG CTG GTG-3Ј). The PCR product was inserted into pDrive (Qiagen), digested by NdeI and BamHI, and inserted in pET-3a (Novagen) by the same restriction sites. The construct was transformed in BL21 star (DE3) competent cells (Invitrogen). Bacteria were grown in M9 medium containing 15 NH 4 Cl and [ 13 C]glucose to an A 600 nm of 0.6, were induced for 15 h at 37°C with 0.5 mM isopropyl 1-thio-␤-D-galactopyranoside, and were harvested by centrifugation. The cell pellet was resuspended in a lysis buffer (700 mM NaCl, 50 mM KH 2 PO 4 (pH 4.5)), frozen at Ϫ80°C for at least 1 h, thawed in hot water, and then sonicated. The lysate was treated with 100 mM DTT and DNase I for 1 h at 37°C and was centrifuged at 17,500 relative centrifugal force for 30 min. The soluble fraction was diluted 5 times in buffer A (0.12 M citric acid-Na 2 HPO 4 (pH 3), 8 M urea) and purified by FPLC with a HiTrap SP HP column (GE Healthcare). After an extensive wash with buffer B (0.12 M citric acid-Na 2 HPO 4 (pH 3)), Miz1-4 was eluted by a gradient of NaCl. Ultracentrifugation (Amicon Ultra-15, Millipore) was used to concentrate the purified protein in H 2 O containing 0.1% TFA. The protein was finally purified by HPLC with a Discovery BIO Wide Pore C18 column (Supelco) preconditioned with H 2 O-0.1% TFA and was eluted by a gradient of acetonitrile. The fractions containing purified Miz1-4 were lyophilized and kept at Ϫ20°C until resuspension in the desired buffer. The mutant Miz1-4 A86K was purified according to the same protocol.
CD Spectropolarimetry-CD measurements were realized with a Jasco J-810 spectropolarimeter equipped with a Jasco Peltier-type thermostat. The CD spectra were recorded with a 1-mm path length quartz cell at 20°C and were averaged over 10 scans with a wavelength step of 0.2 nm. The spectra were smoothed using Spectra Manager (JASCO Corp.). Miz1-4 spectra were recorded at 25 M in a solution containing 10 mM acetic acid, 10 mM cacodylate, 2 mM TCEP, and the indicated ZnCl 2 concentrations. The Smad 3 MH1 spectrum was recorded at 15 M in a buffer containing 10 mM Tris (pH 8), 50 mM KCl, and 2.5 mM ␤-mercaptoethanol. Data were converted from CD signal to mean residue ellipticity using the equation, [] ϭ (␦ ⅐ MRW)/(10 ⅐ c ⅐ l), where ␦ is the ellipticity in degrees, MRW is the mean residue weight, c is the concentration of the sample (g/ml), and l is the path length (cm). Thermal denaturation of Smad 3 MH1 was realized by following the CD signal at 222 nm as a function of the temperature at a rate of 1°C/min. NMR Spectroscopy-All NMR experiments were recorded at 25°C on a Varian (Agilent) Unity INOVA operating at a 1 H frequency of 600 MHz equipped with an indirect detection H/C/N room temperature probe with a z axis pulsed field gradient capacity. Samples were prepared at a final concentration of 0.7-1.0 mM Miz1-4 in the NMR buffer (10 mM acetic acid, 10 mM BisTris (pH 6.5), 50 mM KCl, 2 mM TCEP, 5 eq of ZnCl 2 , and 10% D 2 O). Miz1-4 backbone and side chain assignments were obtained from standard triple resonance experiments. 2D 1 H-15 N HSQC and 3D HNCO, HNCACB, and CBCA(CO)NH spectra were used for the 1 H, 15 N, and 13 C assignments of the protein backbone. Side chain 1 H and 13 C assignments were obtained using 2D 1 H-13 C HSQC and 3D H(CCCO)NH, (H)CC(CO)NH, and HCCH-TOCSY spectra. The 1 H and 13 C resonance assignments of the aromatic rings of Phe, Tyr, and His were realized using 3D aliphatic and aromatic 13 C-edited NOESY-HSQC and 15 N-edited NOESY-HSQC. 3D NOESY spectra were recorded with mixing times of 50 and 150 ms. All of the pulse sequences used were taken from the Biopack repertoire.
NMR 1 H-15 N HSQC experiments run to verify the interaction between Miz1-4 (or Miz1-4 A86K ) and the Miz-1 consensus DNA have been recorded with a 250 M concentration of both the protein and the DNA (see Fig. 8B for the DNA sequence) at 25°C in a solution containing 20 mM BisTris (pH 7), 150 mM KCl, 1.25 mM ZnCl 2 , 2 mM TCEP, and 10% D 2 O.
Structure Calculations-All NOEs were assigned manually and converted into distance restraints using CcpNmr Analysis (51). The program DANGLE was used to obtain and dihedral angles based on the backbone and 13 C ␤ chemical shift values (52). Structures were calculated using the program ARIA version 2.2 in conjunction with CNS (53,54). Calculations for Miz1-4 ZF 2 and ZFs 3 and 4 were first carried out without a zinc atom and zinc coordination restraints. For all three ZFs, the conserved cysteines and histidines were positioned correctly to allow coordination of zinc. In the following calculations, a Zn(II) ion was added, and zinc coordination distance restraints were specified to ARIA (2.3 Å for Zn(II)-S ␥ and 2.0 Å for Zn(II)-N⑀2). The 20 lowest energy conformers of 300 for the final iteration of the calculation were refined in water and submitted to PROCHECK_NMR for initial validation of the structural quality of our models. The final structure ensembles of ZF 2 (residues 30 -58) and ZFs 3 and 4 (residues 58 -112) were deposited in the PDB under identification codes 2N25 and 2N26, respectively. A full report of the structural quality assessment can be found in the PDB with the access codes. NMR resonance assignments for Miz1-4 were deposited in the BMRB under accession number 25587. 15 N Spin Relaxation-15 N-T 1 , T 2 , and 15 N-{ 1 H} NOE experiments were recorded using previously described pulse sequences available in the Biopack repertoire (55). Delays of 0, 10, 30, 90, 120, 150, 250, 350, 450, 550, 650, 800, and 1000 ms and of 0, 10,30,50,70,90,110,130,150,170,190,210, and 250 ms were used to obtain T 1 and T 2 , respectively. 15 N-{ 1 H} NOE measurements were done by comparing 1 H-15 N HSQC spectra with and without a 6-s proton saturation. 15 N backbone CMPG relaxation dispersion profiles were acquired in a constant time (60 ms) and interleaved manner using a modified version of the pulse sequence from the Biopack repertoire based on the work of Palmer et al. (56) and Mulder et al. (57). The field strengths ( cpmg ϭ 1/4 cpmg ) used were 28 43 Hz were repeated twice to estimate the extent of the experimental error. CPMG data (R 2,eff ) were fitted with the program NESSY according to the protocol described by Bieri and Gooley (25). R 2,eff values were calculated using the equation, R 2,eff ( cpmg ) ϭ 1/T⅐ln⅐(I( cpmg )/I 0 ), where T is the total and constant duration of the CPMG period (60 ms), and I( cpmg ) and I 0 are the resonance intensities of the 15 N and 1 H N correlations in the presence and absence of a refocusing pulse, respectively.
Rotational Diffusion Analysis-The programs pdbinertia and R2R1_diffusion (A.G. Palmer) were used to characterize the rotational diffusion of the different ZFs, as described elsewhere by Tjandra et al. (58). Briefly, for the isotropic case, m (or 1/6D iso ) was determined by calculating T 1 /T 2 and comparing it with the experimental value to optimize with R2_R1_diffusion the following error function, where is the experimental uncertainty. As described elsewhere (59), 15 N-T 1 /T 2 ratios are independent of S 2 (order parameter of the backbone amide vector) and e (internal correlation time of the backbone amide vector) for rigid amides (high S 2 values) and fast internal picosecond motions ( e ). For the axially symmetric diffusion case, the molecular reference frames of the PDB files of the geometric averages of ZF 2 (2N25), ZF 3, ZF 4, and ZF 3-4 (2N26) were rotated into their inertia frames with the program pdbinertia. Then a local m was determined by optimizing the 2 function. The local m is a function of cosine angles ␣ (the angle between the amide vectors and the unique axis of the inertia tensor), D per , and D par (the rotational diffusion coefficients perpendicular and parallel to the unique axis of the inertia and diffusion tensors). Hence, during the minimization of the error function, it is in fact D per , D par , and the unique axis of the diffusion tensor that are optimized. An effective correlation time ( m,eff ) is also obtained and is given by 1/6D iso , where D iso is equal to (2D per ϩ D par )/3 with an anisotropy or D ratio given by D par /D per . Finally, an F statistics test (F) is run within R2R1_diffusion. Confidence in the isotropic or the axially symmetric diffusion model was evaluated by comparing the F values with the critical F values determined by the R2R1_diffusion program at the 90% confidence interval following 500 Monte Carlo simulations, as described elsewhere (60). A large F value indicates that the improvement in 2 by using more parameters (axially symmetric versus isotropic) is statistically significant or not obtained by chance. As suggested elsewhere (61), the data (residues) used for the calculations were filtered to exclude residues with T 1 /T 2 ratios higher than the average plus 1.5 S.D. values (considering ZFs individually) and those with the lowest 15 N-{ 1 H} NOE. The former residues are likely to undergo conformational exchange and have T 2 values artificially lowered (or T 1 /T 2 ratios artificially increased) by R ex Ϫ1 , and the latter are likely to be unfolded (low order parameter) and undergo concurrent motions in addition to rotational diffusion. Moreover, residues presenting resonance overlap or having T 1 or T 2 values with Ͼ10% uncertainty were removed to use only high quality data for the analysis. A total of 9, 11, 10, and 21 residues were retained for ZF 2, ZF 3, ZF 4, and ZF 3-4, respectively. T 1 /T 2 values of the selected amides presenting uncertainties lower than 5% (generally comprised between 0.5 and 2.5%) were set to 5% in the calculation to account for the inherent deviations of individual 15 N chemical shift anisotropy (29). Once the calculations were completed, all of the conformers of the ensemble of structures deposited in the PDB were aligned onto their corresponding average structures in the final diffusional reference frame, and ␣-angles were extracted for all of the conformers. Using the diffusion parameters estimated, the ␣-angles and T 1 /T 2 ratios were back-calculated with an in-house written program using the S 2e spectral density function (with S 2 ϭ 1 and e ϭ 0) (62) with an axially symmetric diffusion (58) for all of the conformers of the ensemble of structures deposited in the PDB to estimate uncertainties on the calculated ␣ and T 1 /T 2 ratios. To simulate the effect of slow nanosecond motions of ZF 4 relative to ZF 3 on its T 1 /T 2 ratios, we used its ␣ values, D per and D iso , determined as described above and minimized the difference between the calculated and experimental ratio by manually changing S 2 and e (in the nanosecond time scale) with our in-house written program. Note that we neglected the effect of picosecond motions in this simulation.
Fluorescence Anisotropy-Oligonucleotides were purchase from IDT. The different lyophilized fluorescein-dT-labeled oligonucleotides were resuspended in water at 100 M and were mixed to a 1:1 ratio with the non-labeled complementary strand. The mixtures were then incubated at 95°C and slowly cooled down to room temperature to form the duplex. The double-stranded probes were diluted to 15 nM in a solution containing 100 mM Tris (pH 7.5), 50 mM KCl, 1 mM TCEP, and 500 M ZnCl 2 and were added to a 1-cm path length quartz cell. Data were recorded on a HITACHI F-2000 spectrofluorimeter mounted in the L-shape configuration with the excitation and emission wavelengths set to 490 and 520 nm, respectively, and both slits set to 10 nm. Miz1-4 protein was gradually added from a stock at 450 M. An equilibration of 5 min was allowed before the acquisition of each point. The anisotropy was calculated according to the following equation, where r is the anisotropy, I ʈ is the fluorescence intensity when the polarizers are paralleli and I Ќ is the fluorescence intensity when they are perpendicular.
The dissociation constants were obtained assuming a twostate binding using the following classical equation, where ⌬r obs is the observed anisotropy change at a particular protein/DNA ratio, ⌬r max is the maximum anisotropy change at binding saturation, R is the protein/DNA ratio, and [DNA] 0 is the total concentration of the doublestranded oligonucleotide.