The Human RecQ4 Helicase Contains a Functional RecQ C-terminal Region (RQC) That Is Essential for Activity*

RecQ helicases are essential in the maintenance of genome stability. Five paralogues (RecQ1, Bloom, Werner, RecQ4, and RecQ5) are found in human cells, with distinct but overlapping roles. Mutations in human RecQ4 give rise to three distinct genetic disorders (Rothmund-Thomson, RAPADILINO, and Baller-Gerold syndromes), characterized by genetic instability, growth deficiency, and predisposition to cancer. Previous studies suggested that RecQ4 was unique because it did not seem to contain a RecQ C-terminal region (RQC) found in the other RecQ paralogues; such a region consists of a zinc domain and a winged helix domain and plays an important role in enzyme activity. However, our recent bioinformatic analysis identified in RecQ4 a putative RQC. To experimentally confirm this hypothesis, we report the purification and characterization of the catalytic core of human RecQ4. Inductively coupled plasma-atomic emission spectrometry detected the unusual presence of two zinc clusters within the zinc domain, consistent with the bioinformatic prediction. Analysis of site-directed mutants, targeting key RQC residues (putative zinc ligands and the aromatic residue predicted to be at the tip of the winged helix β-hairpin), showed a decrease in DNA binding, unwinding, and annealing, as expected for a functional RQC domain. Low resolution structural information obtained by small angle X-ray scattering data suggests that RecQ4 interacts with DNA in a manner similar to RecQ1, whereas the winged helix domain may assume alternative conformations, as seen in the bacterial enzymes. These combined results experimentally confirm the presence of a functional RQC domain in human RecQ4.

RecQ helicases belong to a ubiquitous family of DNA unwinding enzymes that are essential to maintain the genome stability by acting at the interface of DNA replication, recombination, and repair. RecQ helicases have been proposed to be involved in various cellular activities including stabilization and repair of damaged DNA replication fork, telomere mainte-nance, base excision repair, homologous recombination, and DNA damage checkpoint signaling (1)(2)(3). Whereas all cellular organisms possess at least one RecQ helicase, humans have five different paralogues called RecQ1, BLM, WRN, RecQ4, and RecQ5. They share a similarity in their helicase domain but also have some additional domains and unique features. Genetic mutations in three paralogues (BLM, WRN, and RecQ4) result in rare genetic disorders, namely Bloom, Werner, and Rothmund-Thomson, RAPADILINO, and Baller-Gerold syndromes. In addition to the common characteristics of genetic instability, short stature, and predisposition to cancer, the disorders have specific clinical symptoms unique to each mutation, suggesting that the five RecQ helicases have distinct, as well as partially overlapping roles within the cell (4 -6). Although RecQ4-deficient Rothmund-Thomson syndrome patients have an elevated risk for osteosarcoma, an increased level of expression of RecQ4 has been reported in human sporadic osteosarcoma, prostate tumor, and breast tumor samples (7)(8)(9)(10).
The human RecQ4 helicase encodes 1208 amino acid residues. Despite the importance of human RecQ4 in DNA replication and repair and its association to three autosomal recessive disorders and cancer, some of the structural and biochemical aspects of the protein are still hazy. In addition to the catalytic core, RecQ4 comprises an N-terminal domain that has some homology with the yeast DNA replication factor Sld2 and includes a zinc knuckle (11). A puzzling feature of RecQ4 was the lack of a RQC 2 domain, which is an essential part of the catalytic core of all the RecQ helicases that have been characterized. This appeared consistent with initial reports (12,13), suggesting that RecQ4 had no helicase activity. However, several laboratories (14 -16) have clearly demonstrated that the recombinant protein is an active DNA helicase, and the presence of a RQC domain has been suggested by a more detailed bioinformatic analysis (17). The putative RQC domain comprises a zinc-binding domain and a winged helix (WH) domain (see Fig. 1, A and B). In most of the RecQ helicases, an aromatic residue present at the tip of a ␤-hairpin in WH domain has been reported to be involved in DNA binding and unwinding (18 -22).
Here we report the biochemical and structural characterization of the catalytic core of human RecQ4 helicase. A number of site-directed mutants, mostly targeting conserved residues within the putative RQC domain, were purified and characterized. Their effect on the catalytic activities of the protein confirms the bioinformatic analysis and the key role of the RQC domain in DNA binding, annealing, and unwinding. Numerous missense/nonsense mutations of Rothmund-Thomson, Baller-Gerold, and RAPADILINO syndromes map onto this region. Our biochemical and structural analysis of the RQC domain provides an essential framework to rationalize the patient mutations. Low resolution structural information suggests that RecQ4 interacts with DNA in a manner similar to RecQ1, whereas the WH domain may assume alternative conformations.

Results and Discussion
Expression and Purification of Recombinant hRecQ4-HelRQC-We produced in bacterial cells a deletion mutant of hRecQ4 corresponding to the helicase core plus putative RQC domain (hRecQ4-HelRQC; Fig. 1A) as a fusion protein with a N-terminal His-SUMO tag. The tag was cleaved with the SUMO protease, and the protein was purified to homogeneity (Fig. 1C). Size exclusion chromatography indicates that the protein is a monomer in solution (Fig. 1D).
The canonical RQC domain found in all or most of the RecQ proteins contains a zinc domain; a bioinformatic analysis carried out on RecQ4 (17) suggested the presence of up to eight putative zinc ligands (seven Cys and one His) and therefore the possibility of two zinc binding sites (Fig. 1A). We have therefore used inductively coupled plasma-atomic emission spectrometry (ICP-AES) to measure the amount of zinc in the purified protein; because the cysteine residue can also bind iron and a subset of helicases contain iron-sulfur clusters, an analysis of the iron content was also carried out. The analysis showed the presence of two zinc atoms for each molecule of protein, whereas no iron was detected, confirming that RecQ4 proteins have an anomalous RQC zinc domain with two rather than one Zn 2ϩ ion (Table 1).
Biochemical Characterization of the hRecQ4-HelRQC-The recombinant protein was characterized in terms of DNA binding, ATP hydrolysis, strand annealing, and helicase activity. To test for DNA binding, we used EMSAs; a fluorescent DNA probe (ssDNA, blunt dsDNA, and fork DNA) was incubated with increasing protein concentrations (0 -3 M), and the mixture was analyzed by polyacrylamide gel electrophoresis to monitor the formation of the protein-DNA complex ( Fig. 2A). The protein shows the highest affinity toward a fork DNA probe, compared with ssDNA and blunt dsDNA (Fig. 2C).
At higher protein concentration, two bands appear in the EMSA ( Fig. 2A), suggesting the presence of complexes with different stoichiometries. These results could be explained either by a second RecQ4 helicase binding to the DNA substrate or by the formation of a 2:2 protein-DNA complex, as seen in the crystal structure of RecQ1 (20). Although we cannot rule showing the Sld2 homologous region at the N terminus (blue) followed by a zinc knuckle (cyan), a helicase core (red), and RQC domain (consisting of a zinc-binding domain (bright green) and a winged helix domain (dark green)). Enlarged at the bottom of the panel is the catalytic core of RecQ4 (helicase and RQC domain) with the positions of the residues mutated in this work. B, the crystal structure of RecQ1 (PDB code 2WWY) with the same color code; critical residues are highlighted. C, SDS-PAGE gel of purified proteins. M, molecular markers; m1, C853A/C855A; m2, C897A; m3, C925A; m4, Cs945A; m5, C949A; m6, F1077A; m7, K508A. D, size exclusion chromatography indicates that the recombinant protein is a monomer. The calibration peaks correspond to protein markers: peak 1, ferritin, 440 kDa; peak 2, aldolase, 158 kDa; peak 3, ovalbumin, 44 kDa; peak 4, ribonuclease A, 13.7 kDa; peak 5, aprotinin, 6.5 kDa. E, heat denaturation profiles of RecQ4 variants. The curves depicting the percentage of unfolded proteins (positive y quadrant, 0 -100) and the Ϫ(dRFU)/dT curves are plotted together (mostly in the negative y quadrant). The experiment suggests that all the mutants are folded and stable.
out that under certain conditions RecQ4 may dimerize, in our hand the truncated protein always behaved as a monomer in size exclusion chromatography; moreover, SAXS data on the protein alone and the protein-DNA complex are consistent with a monomeric form (see below). A previously published biochemical study on the Drosophila RecQ4 also suggests the predominant presence of monomeric form in solution (14). Because the EMSA forked substrate includes 22 base pairs plus two 15-nucleotide-long tails and the protein at high concentrations does bind to ssDNA (Fig. 2C) and shows helicase activity toward blunt substrates, we suggest that the higher molecular weight band on Fig. 2A may be explained as a second monomer assembling on the substrate. We need to stress that the DNA binding experiment was carried out in the presence of a large excess of protein. We have performed further DNA binding assays at higher DNA concentrations (100 nM) by diluting 10 nM of fluorescent DNA with unlabeled substrate (supplemental Fig. S1). A significant decrease in the higher band was observed, validating the explanation discussed above. It has to be stressed that in the SAXS experiment, the protein:DNA molar ratio was 1:1. No evidence of cooperativity was observed (supplemental Fig. S2).
The ATPase activity was evaluated by measuring the inorganic phosphate released during the hydrolysis, in the presence of ssDNA (F3), blunt dsDNA (B3:B4), and fork DNA (F3:F4) substrates to stimulate the ATP hydrolysis. Fork DNA stimulated the activity significantly better than the other substrates (Fig. 2E).
The helicase activity of the protein was assessed by a FRETbased method, using both fork DNA and blunt dsDNA as substrates. The result shows that fork DNA is significantly preferred (Fig. 2D), consistent with the binding affinities and the stimulatory effects on ATP hydrolysis. However, a lower level of activity was observed also with the blunt dsDNA substrate. Although in general blunt duplexes are poor substrates for RecQ helicases, unwinding of blunt dsDNA has been occasionally reported for bacterial and yeast enzymes (23,24). Indeed the crystal structure of the Werner WH domain with DNA shows the ␤-hairpin aromatic residue wedging into a blunt substrate (18), partially opening the double helix, providing a structural insight into the unwinding of a blunt substrate.
The strand annealing activity of the protein was analyzed using a FRET-based assay, by monitoring the decrease in fluorescence when two complementary strands carrying a fluorophore and a quencher come together. The protein showed a robust annealing activity in a range of concentrations comparable with the helicase activity (Fig. 2I). The result shows that the two strands were annealed to 80% by 160 nM protein in 30 min.
To confirm that the activities observed are indeed due to the hRecQ4-HelRQC recombinant protein and not to a contaminant, we generated a site-directed mutant where the key lysine residue within the Walker A motif was mutated to alanine (K508A, mutant m7), thus affecting the ATP binding site. The protein was expressed and purified in a similar way to the wild type ( Fig. 1C) and exhibited a similar behavior when subjected to heat denaturation experiments (Fig. 1E), confirming that the overall fold has not been affected. As expected, the K508A mutant behaved similarly to the wild type in DNA binding experiments (Fig. 2B), but both the ATPase and the helicase activity were negligible (Fig. 2, E and G). The annealing activity was unaffected (Fig. 2I), consistent with the notion that annealing does not requires ATP binding or hydrolysis.
Characterization of RQC Mutants Confirms the Results of the Bioinformatic Analysis and the Role of the RQC Domain in the Helicase Activity-To validate the presence of the RQC domain predicted by in silico analysis (17), several putative functional residues were mutated to alanine. Within the zinc domain, the presence of two insertions makes it difficult to uniquely identify in RecQ4 the residues corresponding to the four cysteines that have been shown to act as zinc ligands in other RecQ proteins. Two cysteines are absolutely conserved in RecQ4 (Cys 925 and Cys 945 ) with the other residue (Cys 949 ) only partially conserved. However, other three potentially conserved zinc ligands are detected in an insertion (Cys 853 , Cys 855 , and Cys 897 ). We have therefore generated a number of mutants (C853A/C855A, C897A, C925A, C945A, and C949A) to verify their role in zinc binding and enzymatic activity. Previous biochemical and structural studies carried out on eukaryotic RecQ proteins have highlighted the key role of an aromatic residue located at the tip of a ␤-hairpin of the winged helix domain, stacking on top of the nucleic acid bases at the ss/dsDNA junction (22); based on the proposed sequence alignment, in RecQ4 this position is occupied by Phe 1077 ; an additional mutant was therefore produced, with this residue mutated to alanine.
All the corresponding proteins were expressed and purified similarly to the wild-type protein (Fig. 1C). To evaluate the folding and stability of the purified proteins, heat denaturation experiments were performed using SYPRO Orange fluorescent dye, whose fluorescence emission intensity increases when it binds to the hydrophobic patches of the thermally unfolded protein (ThermoFluor assay). This property was used to determine the melting temperatures of the purified proteins by monitoring their heat denaturation curves. The wild type and mutant proteins show the same heat denaturation profiles (Fig.  1E), and their melting temperatures were nearly identical ( Table 2), suggesting a comparable stability of the wild type and mutants and indicating that the mutations did not affect the 3D structure of the proteins.
When metal binding analysis using ICP-AES was carried out on the mutant proteins, whereas the wild-type protein, the WH ␤-hairpin mutant (F1077A) and the Walker A mutant (K508A) showed the presence of Zn 2ϩ with a 2:1 molar ratio (2 zinc ions per protein molecule), all the cysteine mutants bound only 1 Zn 2ϩ per protein molecule (Table 1). These results suggests   toward DNA at low nucleic acid concentrations (Fig. 2B), confirming the essential role of the RQC domain in substrate recognition. On the contrary, the Walker A mutant (mut7) behaved similarly to the wild type protein, as expected (Fig. 2B). When DNA binding was carried out at higher DNA concentration (100 nM; Fig. 2F), the binding was impaired but still measurable.
To see the effect of the mutations on ATP hydrolysis, ATPase assays were performed on the mutant proteins using fork DNA substrate as a stimulant. All the RQC mutants behaved similarly to the wild-type protein, with variation well within the statistical error, when in the presence of excess DNA (Fig. 2E). This is a further confirmation that the RQC mutants are properly folded and able to display a catalytic activity. There is an apparent discrepancy between the fact that mutation in the RQC impairs DNA affinity (Fig. 2B) and the fact that in the ATPase assay fork DNA stimulates the ATP hydrolysis activity in the mutants. However, the DNA binding experiments in Fig. 2B are carried out at low concentration of substrate (10 nM), to better appreciate the differences among the mutants, whereas an excess of DNA substrate (500 nM) was used in the ATPase assay. When the ATPase assay was carried out at 100 nM DNA (rather than 500 nM; supplemental Fig. S3), a partial ATPase activity can be observed, consistent with the partial DNA binding abilities of the mutants (Fig. 2F), thus explaining the stimulatory effect of high concentration of DNA on the ATPase activity. Nucleic acid unwinding was negligible for all the RQC mutants at 10 nM substrate (Fig. 2G). When helicase activity is measured at higher substrate concentration (100 nM; Fig. 2H), partial activity can be observed for the RQC mutants. This confirms the importance of the zinc domain and the key role of Phe 1077 in the WH ␤-hairpin for the enzyme activity; furthermore it suggests that the main role of the RQC domain is to interact with nucleic acids. Annealing was also abolished for all the RQC mutants (Fig. 2H).
These results experimentally confirm the presence of a fully functional RQC domain and its essential role in the DNA binding and thus enzyme activity of RecQ4 proteins. The results obtained for the mutant involving the aromatic residue at the end of the WH ␤-hairpin (F1077A) not only validate the accuracy of the bioinformatic analysis (17) but also confirm its essential role in opening up the dsDNA, as seen in other RecQ helicases. There is some discrepancy as to the role of the ␤-hairpin in the annealing activity: whereas mutation of the aromatic residue of RecQ4 does abolish annealing (Fig. 2H), the same mutation in RecQ1 does not (20). However, the annealing activity in RecQ1 appears to be subjected to a complex regulation, requiring the full-length protein and the formation of a tetrameric assembly, whereas the monomeric truncated form of RecQ4 here described exhibits annealing activity.
Even more interesting are the results on the cysteine mutants. The role of the zinc domain and the conserved cysteines has not been clearly defined. When two of the Escherichia coli RecQ zinc ligands are mutated to Asn, the DNA binding and helicase activity is abolished, whereas ATPase activity is retained (25); similarly, mutations of the zinc ligands in BLM result in either insoluble or biochemically compromised protein, where DNA binding, ATPase, and helicase activ-ities are compromised (26). A more recent paper suggests that mutations in three of the cysteine residues that coordinate zinc affect ATP binding, DNA binding, ATPase, and helicase activity but not strand annealing (27); furthermore the same three cysteine mutants exhibit enhanced thermal denaturation. Our results overlap only in part with previous work, showing that the cysteine mutants in RecQ4 have the same thermal stability as the wild type, do not impair the ATPase activity (in excess of stimulatory DNA), but affect nucleic acid binding, annealing, and helicase activity (Fig. 2), consistent with the role of the cysteines in the E. coli protein (25). Moreover, the RecQ4 zinc domain is unusual in having two independent zinc clusters, and we find that they are both critical for the helicase activity.
Mapping the missense/nonsense mutations of Rothmund-Thomson, Baller-Gerold, and RAPADILINO syndromes onto the protein sequence (28) reveals a number of single amino acid mutants (including L926R, L927R, R1021TW, I1051V, and S1079I), as well as truncations (Arg 826 * and Arg 1072 *) located in the putative RQC domain, pointing to an important role of the RQC in the regulation of the enzyme activity and in the development of these rare diseases.
SAXS Data Reveal the Architecture of the RecQ4-DNA Complex-To carry out a low resolution structural characterization of the protein, SAXS data were collected in the presence and absence of a fork DNA substrate. Experimental scattering curves for both samples (Fig. 3, A and B) show good signal to noise ratios, with resolutions of ϳ25-30 Å. Guinier's approximation showed no particle aggregation or radiation damage (Fig. 3C). The main structural parameters calculated from the experimental scattering curves show a bigger volume of the envelope of protein-DNA complex compared with that of protein alone ( Table 3). The plot of the Pr function extrapolated using GNOM shows a more elongated shape in the presence of nucleic acid, with a D max ϭ 130 Å with respect to D max ϭ 100 Å for the protein alone (Fig. 3D).
The molecular envelope obtained in the presence of the nucleic acid substrate (Fig. 3E) shows an elongated extension that can be fitted using the crystal structure of RecQ1-DNA complex. The "extended" DNA configuration does indeed match that seen in the RecQ1 complex, whereas the crystal structures of the DNA complexes of Bloom or a bacterial RecQ helicase show a less protruding nucleic acid substrate, binding at a different orientation. Some "empty" space in the envelope most likely corresponds to the presence of two insertions (with a total of 90 amino acid residues) within the zinc domain, including the second zinc cluster.
In the absence of nucleic acid substrate, the envelope obtained (Fig. 3F) is more globular and lacks details, so that it is less easy to fit a molecular model. A possible explanation is that, in the absence of DNA, the WH domain (that has been shown to be flexible in other RecQ helicases, assuming two configurations that are almost 90°apart) does not assume a well defined position, and the protein in solution is a mixture of multiple states (Fig. 3G).
In both cases (protein alone and protein-DNA complex) the radius of gyration and SAXS envelope are consistent with a monomeric protein; no evidence of the presence of a dimer was detected. This confirms the size exclusion chromatography data.
In conclusion, we report a comprehensive biochemical and structural analysis of the catalytic core of the human RecQ4 helicase, and in particular we focus on the architecture and function of the RQC domain. We experimentally validate a previous bioinformatic analysis (17) and confirm the prediction of two, rather than one, zinc clusters within the zinc domain. Mutagenesis of key residues within the RQC domain demonstrates that this region is essential for the enzymatic activities and suggests that its main function is in nucleic acid interaction. Low resolution SAXS data show that RecQ4 binds DNA similarly to RecQ1, whereas the WH domain may be flexible in the absence of substrates. These results provide a useful frame-work to understand the physiological role of RecQ4 and the disease-causing mutations.

Experimental Procedures
Cloning and Site-directed Mutagenesis-The nucleotide sequence encoding the helicase and RQC domain (encompassing amino acids 445-1112) was PCR-amplified from a plasmid containing the full-length protein (16), using the Platinum Pfx DNA polymerase (Invitrogen) and following primers: 5Ј-ATT-GAGGCTCACAGAGAACAGATTGGTGGTAGCCTGGA-CCCCACCGTG-3Ј and 5Ј-TTTGCGCCGAATAAATACCT-AAGCTTGTCTTCATTCCTCTTCCTCAAAGTAGCG-3Ј. The amplified fragment was subcloned into pET-SUMO/CAT vector (Invitrogen) through a restriction free cloning method (29) for protein expression as a fusion with a N-terminal six-histidine-SUMO tag cleavable by SUMO protease.
Site-directed mutagenesis was carried out to mutate the conserved cysteines in the putative zinc-binding domain (mutant m1, C853A/C855A; mutant m2, C897A; mutant m3, C925a; mutant m4, C945A; and mutant m5, C949A), the conserved  (HRD). C, Guinier's fits with experimental R g values for HR and HRD in the allowed region (q Ͻ 1.3 R g ). D, pair distance distribution function (PDDF) for HR and HRD calculated in the allowed region (q Ͼ ⌸/D max ). E, final model reconstructed from the scattering curve for the protein-DNA complex (HRD). The crystal structure of the RecQ1-DNA complex (PDB code 2WWY) was fitted onto the SAXS envelope. A long protrusion roughly corresponds to the location of DNA in the model. Some unaccounted density could correspond to two long insertions (90 residues in total) within the zinc domain. F, final model reconstructed from the scattering curve for the protein alone. In the absence of DNA, the envelope is more globular and lacks details, so that it is less easy to fit the RecQ1 model. G, to check for the possibility that the WH domain assumes an alternative conformation, the orientation of the WH seen in RecQ1 (dark green) has been overlapped with an alternative conformation (light green) based on the E. coli structure (PDB code 1OYY), suggesting that the protein in solution is a mixture of multiple states. Protein Expression and Purification-The wild-type and mutant proteins were expressed in Rosetta 2 (DE3) cells, using the autoinduction method (30), grown at 17°C for 48 h, and harvested by centrifugation. The cells were lysed by sonication in 50 mM Hepes, pH 8.0, 0.5 M NaCl, 1 mM tris-(2-carboxyethyl) phosphine hydrochloride, 5% glycerol (v/v), 10 mM imidazole, 1 tablet of Complete EDTA-free protease inhibitor mixture (Roche), and DNase I (Sigma). Proteins were purified by nickel affinity chromatography, with a 2 M NaCl washing step to eliminate DNA contamination, followed by tag cleavage with SUMO protease, and an additional step of metal affinity to eliminate the His-SUMO tag and the His-tagged protease. The protein purification was further optimize by heparine chromatography and size exclusion chromatography in 20 mM Tris, pH 8.0, 250 mM NaCl, 5% glycerol (v/v), and 5 mM ␤-mercaptoethanol on a Superdex-200 column. Protein concentration was determined by measuring the absorbance at 280 nm, and protein purity was analyzed on SDS-PAGE. Metal binding analysis was carried out by ICP-AES. Purifications of all the mutant proteins were performed following the same protocol.
ThermoFluor Assay-To determine the protein stability and to compare the various mutants, a heat denaturation analysis was carried out using a CFX96 Touch real time quantitative PCR system (Bio-Rad). Protein stability measurements were performed in a buffer containing 20 mM Tris, pH 8.0, 250 mM NaCl, 5% glycerol (v/v), and 5 mM ␤-mercaptoethanol. The 25 l of reaction mix contained 0.5 M protein and SYPRO Orange (Invitrogen) at 20ϫ concentration. Heat denaturing curves were observed within the temperature range of 20 -80°C, at a ramp rate of 1.8°C/min, collecting data every 10 s. The data obtained was normalized using GraphPad Prism software.
Oligonucleotide Preparation-All the oligonucleotides (unlabeled; 5Ј 6FAM-labeled; 3Ј BHQ1-labeled) were chemically synthesized and purified by reverse phase high pressure liquid chromatography (Sigma-Aldrich). Each oligonucleotide was then resuspended in Tris-EDTA buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA). The oligonucleotide sequences used in this work are reported in Table 4. For the helicase assay the dsDNA substrates (F1:F2 and B1:B2) were prepared with the fluorescent strand and the complementary strand at a 1:2 M ratio in 10 mM Tris, pH 7.5, 50 mM NaCl, 1 mM EDTA by briefly heating at 95°C, followed by slow cooling to room temperature. The substrates for the ATPase assay and EMSA were prepared as above.
Electrophoretic Mobility Shift Assay-The fluorescent DNA binding assay was performed in 20 mM Tris-HCl, pH 7.5, 5 mM MgCl 2 , 50 mM KCl, 8 mM DTT, 0.1 mg/ml BSA, and 5% glycerol v/v with 10 nM substrates (F1, F1:F4 and B1:B4; Table 4) and increasing concentrations (0 -3 M) of the purified proteins in 20 l of reaction volume, incubated at room temperature for 30 min. The reaction mixture was then loaded on a 6% non-denaturing polyacrylamide gel (v/v) and run at 4°C in Tris/borate/ EDTA buffer. Fluorescent labeled nucleic acid fragments were detected by fluorescent scanner (ImageQuant; GE Healthcare), and quantification of protein-nucleic acid complexes was performed with ImageQuant image analysis software (GE Healthcare). To measure DNA binding in the presence of 100 nM fork DNA substrate, 10 nM labeled DNA (F1:F4) was mixed with 90 nM unlabeled substrate (F3:F4).
ATPase Assays-The Malachite Green assay kit (Cayman) was used to determine the ATPase activity by measuring the inorganic phosphate released during ATP hydrolysis (31,32). The method is based on the change in absorbance at A 620 of the complex formed between malachite green (MG) molybdate and free orthophosphate under acidic conditions (molybdophosphoric acid complex). The assay was carried out in 50-l reaction volumes, containing 20 mM Tris-HCl, pH 7.5, 5 mM MgCl 2 , 50 mM KCl, 8 mM DTT, 0.1 mg/ml BSA, and 5% glycerol (v/v) with 500 nM purified protein, 3 mM ATP, and 100 or 500 nM of nucleic acid (F3, F3:F4, and B3:B4; Table 4). The ATPase reactions were incubated at 37°C for 30 min. After incubation, 5 l of MG acidic solution was added and incubated at room temperature for 10 min, followed by addition of 15 l of MG molybdenum reagent and incubation at room temperature for further 5 min. The absorption at 620 nm was then measured using a Lambda25 UV/VIS Spectrometer (PerkinElmer Life Sciences). The concentrations of inorganic phosphate were determined by comparison with a standard curve obtained in the presence of known phosphate concentrations.
Helicase Assays-The helicase activity was based on FRET, by using a double-stranded substrate (Table 4) with a fluorophore (6FAM) on one strand and a quencher (BHQ1) on the other, and by monitoring the fluorence emitted when the two strands are separated by the helicase action (33). The assay was performed in 20 mM Tris-HCl, pH 7.5, 5 mM MgCl 2 , 50 mM KCl, 8 mM DTT, 0.1 mg/ml BSA, and 5% glycerol (v/v) with 10 or 100 nM dsDNA substrate (F1:F2 and B1:B2), 3 mM ATP, and 125 nM capture strand (to prevent reannealing) in 30 l of reaction volume. The unwinding reaction was started by incubating increasing concentrations (0 -160 nM) of the purified proteins in the reaction mixture at 37°C for 30 min. The fluorescence intensity was recorded using the Infinite F200 PRO microplate reader (TECAN). To have a measurement corresponding to 100% unwinding, the reaction was incubated at 95°C. Each assay was done in triplicate. The percentage of unwinding was calculated and plotted using GraphPad Prism software.
Strand Annealing Assays-A FRET-based strand annealing assay was carried out by monitoring the decrease in fluorescence when the two complementary oligonucleotides (one labeled with a fluorophore and the other with a quencher) anneals. The assay was performed in 20 mM Tris-HCl, pH 7.5, 5 mM MgCl 2 , 50 mM KCl, 8 mM DTT, 0.1 mg/ml BSA, and 5%

RecQ4 Helicase Contains a Functional RQC Domain
glycerol with 10 nM 6FAM-labeled oligonucleotide (F1) and 25 nM BHQ1-labeled oligonucleotide (F2) in 30 l of reaction volume. The reaction was started by incubating increasing concentrations (0 -160 nM) of the purified proteins in the reaction mixture at 37°C for 30 min. The fluorescence intensity was recorded using the Infinite F200 PRO microplate reader (TECAN). To have a measurement corresponding to 100% annealing, the fluorescence of the annealed substrate (F1:F2) was measured. Each assay was done in triplicate. The percentage of annealing was calculated and plotted using GraphPad Prism software.
Small Angle X-ray Scattering-The Synchrotron scattering data were collected at the SWING Beamline (34) at Soleil both from the protein alone and in complex with DNA. For the latter an equimolar amount of fork DNA (F3:F4) was added to the protein and incubated at room temperature for 60 min. The protein and protein-DNA complex were then purified using size exclusion chromatography. Samples corresponding to 0.5, 1.5, and 3 mg/ml were automatically loaded using a sample changer thermostatted at 4°C; data were collected at a wavelength of 1.54 Å, and the distance between the sample and the PCCD170170 (AVIEX) detector was set to 4.78 m, q was in the range from 0.012 to 0.412 Å Ϫ1 . After calibration, all data were averaged, normalized, and background-subtracted using the Soleil software Foxtrot. Absolute scale was determined using the reference curve collected from BSA at 5 mg/ml. Guinier's approximation was used to check for particle aggregation and radiation damage in the sample. Structural parameters were calculated from the experimental curves using Primus (35). The maximum length of the model (D max ) was estimated using GNOM (36) looking at the best P(r) function shape. Final SAXS envelopes were obtained by averaging 20 ab initio DAM-MIF (37) models with 20 ab initio DAMMIN (38) models using SUPCOMB (39) and DAMAVER (40).