The R882H DNMT3A hotspot mutation stabilizes the formation of large DNMT3A oligomers with low DNA methyltransferase activity

DNMT3A (DNA methyltransferase 3A) is a de novo DNA methyltransferase responsible for establishing CpG methylation patterns within the genome. DNMT3A activity is essential for normal development, while its dysfunction has been linked to developmental disorders and cancer. DNMT3A is frequently mutated in myeloid malignancies with the majority of mutations occurring at Arg882, where R882H mutations are most frequent. The R882H mutation causes a reduction in DNA methyltransferase

_____________________________________________________________________________________ ABSTRACT DNMT3A (DNA methyltransferase 3A) is a de novo DNA methyltransferase responsible for establishing CpG methylation patterns within the genome. DNMT3A activity is essential for normal development, while its dysfunction has been linked to developmental disorders and cancer. DNMT3A is frequently mutated in myeloid malignancies with the majority of mutations occurring at Arg882, where R882H mutations are most frequent. The R882H mutation causes a reduction in DNA methyltransferase activity and hypomethylation at differentially methylated regions (DMRs) within the genome, ultimately preventing hematopoetic stem cell differentiation and leading to leukemogenesis. Although the means by which the R882H DNMT3A mutation reduces enzymatic activity has been the subject of several studies, the precise mechanism by which this occurs has been elusive. Herein, we demonstrate that in the context of the full length DNMT3A protein, the R882H mutation stabilizes the formation of large oligomeric DNMT3A species to reduce the overall DNA methyltransferase activity of the mutant protein as well as the WT:R882H complex in a dominant negative manner. This shift in the DNMT3A oligomeric equilibrium and resulting reduced enzymatic activity can be partially rescued in the presence of oligomer disrupting DNMT3L, as well as DNMT3A point mutations along the oligomer forming interface of the catalytic domain. In addition to modulating the oligomeric state of DNMT3A, the R882H mutation also leads to a DNA binding defect, which may further reduce enzymatic activity. These findings provide a mechanistic explanation for the observed loss of DNMT3A activity associated with the R882H hotspot mutation in cancer.

_______________________________________
In humans, DNA methylation patterns are established and maintained through the coordinated efforts of four genes in the DNA methyltransferase (DNMT) family: DNMT1, DNMT3A, DNMT3B and DNMT3L (1). DNMT1 is a maintenance DNA methyltransferase with a strong substrate preference for hemimethylated DNA. DNMT1 copies CpG methylation marks from the parental strand to the daughter strand during DNA replication to ensure that methylation patterns are maintained after cell division (1). DNMT3A and DNMT3B are de novo DNMTs, required for establishing methylation patterns during embryonic development, but can also contribute to maintenance of DNA methylation (2,3). Although these two de novo DNMT isoforms are highly similar and have some overlapping functions, they have also been shown to play unique roles in biology (2,4). This is highlighted by the observation that DNMT3B KO mice are embryonic lethal, while DNMT3A KO mice die several weeks after birth (2). DNMT3L is catalytically inactive, lacking key amino acids for DNA binding and methyltransferase activity (5)(6)(7). DNMT3L serves as a stabilizing and stimulating co-factor for the DNMT3A and DNMT3B methyltransferases during embryonic development, but is not normally expressed in differentiated cell types (8)(9)(10)(11)(12). DNMT3L KO mice are viable, but fail to reproduce due to deficient germline DNA methylation, likely mediated by DNMT3A (13).
DNMT3A is one of the most frequently mutated genes in hematopoietic cancers, with mutations observed in as many as 20% of AML patients (14,15). The most frequent DNMT3A mutation is the R882H hotspot mutation (Fig.  1A), which is associated with a lower overall probability of survival of AML patients (16). Malignant cells from cancer patients carrying this mutation display hypomethylation patterns at differentially methylated regions (DMRs), though total genomic CpG methylation is nearly unchanged relative to WT cells (17,18). In engineered mouse models, loss of DNMT3A function similarly causes hypomethylation at DMRs, which is associated with a failure of hematopoietic stem cell (HSC) differentiation, expansion of the HSC compartment and leukemogenesis (4,(19)(20)(21)(22)(23)(24)(25)(26). The R882H mutation has been proposed to have dominant negative activity (18,27), explaining why R882H is always found to be heterozygous in leukemia patients (28). While the R882H DNMT3A mutation has been the subject of multiple biochemical studies, the mechanism by which this mutation reduces DNA methyltransferase activity is unclear. DNMT3A has been shown to form different sized multimeric complexes, where tetramers are likely the most active species (18,29,30). The R882H mutation is located to the C-terminal catalytic domain of DNMT3A (Fig. 1A), near the interface of two DNMT3A monomers in the tetrameric complex (Fig. 1B). In prior studies, the R882H mutation has been proposed to prevent the formation of the active DNMT3A tetramer, with the isolated catalytic domain only forming dimers with low enzymatic activity in solution (18,31,32), but this has never been confirmed with the full length protein. Although the dominant negative behavior of the R882H mutation has been demonstrated in both mouse models (27) and in human cell lines upon overexpression of the mutant protein (18,26), a recent biochemical study refuted these findings, claiming the purified mutant protein does not have dominant negative activity (33).
In addition to tetramers, DNMT3A can form multimeric complexes of significantly larger sizes (30). This is facilitated by the methyltransferase domain, which contains two distinct surfaces each capable of binding to another catalytic domain monomer (34). The catalytic domain can therefore drive oligomerization of the DNMT3A protein (Fig. 1C). The two oligomer forming interfaces of the catalytic domain are structurally different (34). One interface (referred to as the RD interface) is comprised mostly of a series of electrostatic interactions and hydrogen bonds, with R885 and D876 making key interactions in the symmetrical binding event (34). Binding at the second interface (the FF interface) is driven by a central core of hydrophobic interactions, with F732 making a key hydrophobic stacking interaction (34). Single point mutations at R885, D876 and F732 have been shown to disrupt DNMT3A oligomer formation to some extent (30,31). DNMT3A has never been crystalized on its own, likely due to its propensity to oligomerize. Crystal structures of DNMT3A in complex of DNMT3L have been solved (34)(35)(36), where DNMT3L serves as an oligomer disrupting cofactor. The FF interface is conserved between DNMT3A and DNMT3L, whereas the RD interface is not. DNMT3L can therefore bind to DNMT3A at the FF interface only. This forces the formation of a DNMT3L-DNMT3A-DNMT3A-DNMT3L tetramer (Fig. 1, B and C). DNMT3L has been shown to activate DNMT3A methyltransferase activity in both biochemical and cellular assays, though the mechanism is unclear (8-10, 32, 37, 38). In addition to the catalytic domain, DNMT3A contains an N-terminal unstructured region and two histone recognition domains, the PWWP and ADD domains (Fig. 1A). The PWWP domain specifically recognizes trimethylated histone H3K36 and is important for recruiting DNMT3A to heterochromotin (39,40). The ADD domain binds to unmodified histone H3K4, which regulates DNA binding and the methyltransferase activity of DNMT3A (35,41).
In this work, we analyzed the solution behavior and biochemical properties of the full length DNMT3A WT and mutant proteins. We find that the R882H hotspot mutation stabilizes the formation of large oligomeric DNMT3A species, which are intrinsically less active than smaller tetramers. When mixed with WT DNMT3A protein, the mutant protein shifts the equilibrium of WT:R882H complexes to larger less active oligomers, contributing to the dominant negative behavior of the mutant. DNMT3L disrupts the formation of large oligomers to activate DNMT3A and partially rescue the activity of the R882H mutant. In addition to influencing the multimeric state of DNMT3A, R882H also causes a defect in DNA binding, which is not rescued in the presence of DNMT3L. Together these observations provide a mechanistic explanation for the observed dominant negative loss of DNMT3A function in leukemia patients carrying the R882H mutation.

Results
The goal of our work was to interrogate the mechanism by which the R882H mutation impairs DNMT3A function. Prior to working with the mutant protein, we investigated whether the N-terminally truncated WT catalytic domain would be a suitable replacement for the WT full length protein, which has lower expression yields in E. coli. Previous studies have demonstrated that DNMT3A can form very large oligomeric species, as well as smaller complexes around the size of a tetramer (18,29,30). We confirmed this observation by size exclusion chromatography with purified full length and catalytic domain DNMT3A (Fig. S1). For both proteins, a peak is observed near the void volume consistent with some fraction of DNMT3A existing as a very large oligomer, while a second peak elutes around the expected elution volume of a tetramer ( Fig. 2A). It has been proposed that the formation of very large oligomers enhances the DNA methyltransferase activity of the isolated catalytic domain (42). To verify that full length and catalytic domain behave similarly in this regard, we measured the DNA methyltransferase activity of the full length and catalytic domain proteins at varying concentrations in a radiometric assay, measuring the transfer of a tritium labeled methyl group from S-adenosyl methionine (SAM) to a DNA substrate. At low concentrations, the oligomer equilibrium would be expected to shift to smaller species, whereas at higher protein concentrations, the equilibrium should favor larger multimers. Here we find that the full length protein and catalytic domain behave very differently. While the catalytic domain demonstrates a linear relationship between enzymatic activity and protein concentration, the full length protein does not (Fig. 2B). Instead, the specific activity of the full length protein decreases at increasing concentrations (Fig. 2C). This demonstrates that while the activity of the catalytic domain is unaffected by oligomeric state in this experiment, the enzymatic activity of the full length protein is very sensitive, with larger oligomers being less active than smaller species. Given that the DNMT3A catalytic domain does not behave the same as the full length protein in our biochemical assays, we decided to use only the biologically relevant full length protein for the remainder of our work.
We next turned our attention to the mutant protein. Full length WT and R882H DNMT3A were purified from E. coli (Fig. S1), then injected on a Superose 6 size exclusion chromatography column. While WT DNMT3A eluted as two major peaks as described above, the mutant protein eluted predominantly near the void volume (Fig. 3A). This could be explained by two possible mechanisms, either the mutant protein prep was entirely aggregated or R882H shifts the equilibrium of ordered DNMT3A oligomers to higher order multimers. To distinguish between these two possibilities we added a stoichiometric amount of purified DNMT3L (Fig. S1) to the R882H protein in attempt to disrupt any high order mutant oligomer and ran the DNMT3A:DNMT3L complex over a size exclusion column. The R882H:DNMT3L complex mostly runs as a lower molecular weight species, with the major peak eluting at the expected volume of a tetramer, while DNMT3L injected alone eluted as a monomer as expected (Fig. 3A). This demonstrates that the observed high molecular weight species of the mutant is not misfolded/aggregated protein, but rather a very large, ordered oligomer that can be broken down into smaller complexes in the presence of DNMT3L. To verify this was not a biochemical artifact, we expressed Myc-tagged WT DNMT3A and FLAG-tagged R882H DNMT3A in HEK293 cells, lysed the cells by mechanical disruption and ran the lysate over a Superose 6 size exclusion column. The amount of WT or mutant DNMT3A in each fraction was determined by western blot. We find that Myc-WT DNMT3A elutes in five fractions, with the majority of protein eluting around the expected mass of a tetramer (Fig. S2, A and B). Consistent with the observed oligomeric state of the mutant recombinant protein, FLAG-R882H DNMT3A from mammalian cell lysate elutes in only four fractions, with most of the protein eluting near the void volume of the size exclusion column (Fig. S2, A and B).
To demonstrate that WT and R882H DNMT3A can form oligomers in live cells, we expressed full length YFP-tagged DNMT3A in NIH-3T3 cells to look for any obvious changes in sub-nuclear localization. It has been shown that WT DNMT3A localizes to heterochromatin regions (visualized as a punctate pattern in the nuclei of NIH-3T3 cells), which is highly dependent on the ability of DNMT3A to form oligomers (30,(43)(44)(45). After transient transfection, we clearly see that both WT and R882H (as well as R882C) form a similar strong punctate pattern indicative of oligomer formation, which is reduced upon co-expression of DNMT3L (Fig. 3, B and C). As a control, we see that expression of known oligomer disrupting mutants F732A and R885A alone have a reduced number of nuclear puncta relative to WT, as previously reported (30) (Fig. 3, B and C). Together, these data demonstrate that R882H stabilizes full length DNMT3A oligomer formation in a purified system using recombinant protein, in mammalian cell lysates and in live cells expressing tagged DNMT3A protein.
Consistent with prior studies and the observed loss of DNMT3A function in leukemia patients carrying the R882H mutation (18,31,46), we find that R882H displays reduced enzymatic activity relative to WT, with a roughly 20-fold reduction in apparent kcat ( Figure S3). Previous reports have proposed that the R882H mutation has dominant negative behavior in both mouse models and human cell lines overexpressing the mutant (18,26,27), however this was recently challenged in another study using mostly the purified catalytic domain in a reconstituted biochemical assay (33). In our hands, the addition of varying amounts of full length R882H to a fixed amount (100 nM) of full length WT DNMT3A caused a marked and dose dependent reduction in methyltransferase activity, consistent with dominant negative behavior (Fig. 4A). In contrast, adding additional WT DNMT3A to 100 nM WT protein caused an increase in activity (Fig. 4A). Notably, the decrease in specific activity occurred more drastically with the addition of mutant than WT DNMT3A (Fig. 4B). Given that R882H causes a shift in the DNMT3A oligomer equilibrium towards larger complexes and we have shown that large full length oligomers have lower enzymatic activity relative to smaller complexes, this raised the possibility that the R882H mutation exerts its dominant negative function by shifting the distribution of R882H:WT DNMT3A complexes to higher order oligomers. Indeed, injection of a mixture of R882H:WT recombinant DNMT3A protein at a 1:1 ratio on a size exclusion column reveals the complex exists mostly as a large oligomer (Fig. 3A), similar to mutant alone. A comparable shift in the oligomeric distribution of DNMT3A was observed in cell lysates after co-expression of Myc-WT and FLAG-R882H DNMT3A in HEK293 cells (Fig. S2B), where the dominant negative behavior of the mutant was also confirmed (Fig.  S2C). The addition of oligomer disrupting DNMT3L was able to increase the activity of the WT:R882H complex in a biochemical assay to levels greater than WT alone at a one-to-one 3Ato-3L ratio (Fig. 4C). Interestingly, while the addition of DNMT3L to WT DNMT3A alone was capable of activating the WT protein 4.5-fold, DNMT3L activated R882H >10-fold under the same conditions (Fig. 4D). This would be expected if the primary mechanism of DNMT3L activation is through oligomer disruption to form a more active tetrameric species. Relative to WT, R882H DNMT3A has a larger proportion of protein existing as a large oligomer with low activity, therefore it should be stimulated to a greater extent by oligomer disruption than WT.
Next, we determined if DNMT3A could be activated through oligomer disruption in the absence of DNMT3L by generating point mutations along the oligomer forming RD interface of the catalytic domain. R885 and D876 form electrostatic interactions with the opposite monomer to stabilize the RD interface. Mutation of these amino acids to alanine has been shown to prevent oligomer formation in NIH-3T3 cells as demonstrated and discussed above. Full length R885A and D876A DNMT3A were purified from E. coli (Fig. S1) and injected on a size exclusion column to examine their oligomeric state. In contrast to WT and R882H, both interface disrupting mutants demonstrated a significant shift from large oligomers to smaller species, with the predominant peaks eluting around the expected volume of dimers and monomers (Fig. 5A). This shift in equilibrium was not complete, as some fraction of R885A and D876A DNMT3A still eluted as a high molecular weight species. Both R885A and D876A had reduced enzymatic activity relative to WT, displaying comparable activity to R882H (Fig.  5B), confirming that DNMT3A dimers and monomers have low enzymatic activity (29,31). Given that these interface mutants shift the equilibrium of oligomer to smaller species, we would expect that DNMT3L would have reduced ability to activate these proteins relative to WT or R882H DNMT3A. Indeed, we find that DNMT3L is incapable of activating R885A and only able to activate D876A a modest 2-fold, compared to DNMT3L's ability to activate WT 4.5 fold and R882H >10-fold (Fig. S4).
We would expect that after mixing these interface disrupting mutants with WT or R882H DNMT3A, the resulting complex would have higher activity than WT, R882H or interface mutant DNMT3A alone. This is because the oligomer disrupting mutations would break down higher order WT or R882H oligomers into smaller, more active complexes. Upon mixing RD interface mutations R885A or D876A with WT protein at a one-to-one ratio, we see that the total activity of the resulting complex is greater than the sum of WT plus R885A or D876A alone (Fig. 5B). Upon titrating R885A or D876A into WT DNMT3A, we find that ratios of mutant to WT DNMT3A greater than one-to-one continue to activate the WT protein (Fig. 5C). This is in contrast to DNMT3L, which achieves maximal activation at a one-to-one DNMT3A-to-DNMT3L ratio (Fig. 5D), indicative of a very tight binding interaction between DNMT3A and DNMT3L (8). As expected, R885A and D876A also activate R882H in a synergistic manner (Fig. 5E). The weaker total activation of WT and R882H by the interface disrupting mutations compared to DNMT3L can be explained by two observations. One, the interface disrupting mutations do not fully break down DNMT3A oligomers into smaller species, as a some fraction of these two RD interface mutants run as large multimers by size exclusion chromatography (Fig. 5A). Two, the DNMT3A-DNMT3L interaction is higher affinity than the DNMT3A-DNMT3A interaction. Therefore, DNMT3L is fully capable of breaking up DNMT3A oligomers at stoichiometric concentrations, while the interface mutations are not.
It is important to note that relative to WT, full rescue of R882H DNMT3A activity with DNMT3L could not be achieved (Fig. 6A). Although the addition of DNMT3L could activate R882H >10-fold, the absolute activity of the R882H DNMT3A:DNMT3L complex remained suppressed relative to the WT DNMT3A:DNMT3L complex, remaining about 5-fold lower (Fig. 6A). This observation suggests that the R882H point mutation is interfering with DNMT3A methyltransferase activity by some mechanism in addition to shifting the equilibrium of DNMT3A oligomers to higher order, less active multimers. Given that the mutation is close to the DNA binding surface of DNMT3A (Fig. 6B), we next investigated whether R882 is involved in DNA binding. In an equilibrium DNA binding assay, measuring the change in fluorescence polarization of a 30 base pair FAM labeled double stranded DNA oligo upon DNMT3A binding, we find that R882H has a roughly twofold elevated Kd relative to WT (Fig. 6C). The off rate of R882H for DNA is also apparently faster than WT DNMT3A. WT or R882H was preincubated with FAM labeled DNA and equilibrium achieved prior to the addition of a 1000-fold excess of unlabeled DNA. Although the off-rate of both proteins was too fast to accurately measure with available laboratory instrumentation, the off-rate of WT appears to be significantly slower than R882H (Fig. 6D). While all R882H DNMT3A had dissociated from labeled DNA by the time measurements could be made, at least 40% of WT protein remained bound at this early time point prior to fully dissociating within 5 minutes (Fig. 6D). In these same experiments, DNMT3L has no effect on equilibrium binding or apparent residence time for both WT and R882H (Fig. 6, C and D). This is consistent with an earlier study, showing no difference in DNMT3A DNA binding affinity in the presence and absence of DNMT3L (38). During the course of this work, a WT DNMT3A crystal structure was published with DNA bound (36). We and others note that R882 appears to be making a contact with the phosphate backbone of the bound DNA (Fig. 6B). In the same structure, DNMT3L does not make any contacts with DNA. This provides the explanation for the observed DNA binding defect of the R882H mutation.

Discussion
In this study, we establish the mechanism by which the R882H DNMT3A hot spot mutation exerts its dominant negative behavior, dramatically reducing the DNA methyltransferase activity of both mutant and WT DNMT3A. The R882H mutation impairs DNMT3A function by two mechanisms. One, R882H stabilizes the formation of large DNMT3A oligomers, which are intrinsically less active than smaller species. Two, the R882H mutations removes a key binding interaction with substrate DNA. We observe that the R882H mutation shifts the equilibrium of both mutant alone and a oneto-one mixture of R882H:WT DNMT3A oligomer from smaller tetramers to very large multimers by size exclusion chromatography. These large multimers of the full length DNMT3A protein are less active than smaller complexes, as determined by simply measuring the total DNA methyltransferase activity at varying concentrations of enzyme. Upon the addition of DNMT3L, the large DNMT3A oligomers are broken down into tetramers, which is accompanied by an increase in methyltransferase activity (Fig. 7). This new model of DNMT3A regulation by oligomer formation and R882H dysfunction is supported by the use of point mutations along the oligomerization interfaces of the catalytic domain. We show that the RD mutants, R885A and D876A, both form mostly dimers and monomers with very low catalytic activity in solution. When forming a complex with WT or R882H DNMT3A, these mutations synergistically enhance the DNA methyltransferase activity of the resulting complex. This reveals that even in the absence of DNMT3L, oligomer disruption alone enhances the activity of DNMT3A. While DNMT3L clearly prevents oligomer formation to stimulate DNMT3A activity, it is possible that DNMT3L also stimulates DNMT3A activity through additional mechanisms. In prior studies, there has been some speculation that DNMT3L itself binds DNA and that this may enhance DNMT3A activity, however it is now clear from the DNMT3A-DNMT3L-DNA crystal structure that DNMT3L does not make any interactions with substrate DNA during catalysis (36). Our data is in agreement with this observation. In our hands, DNMT3L does not significantly influence full length WT or R882H DNMT3A DNA binding.
It is clear from the recent WT DNMT3A-DNA co-crystal structure that DNMT3A makes many contacts with DNA (36). There are multiple key interactions between DNMT3A and the DNA phosphate backbone, major and minor grooves and with the flipped base in the enzyme active site. Mutation of many of these amino acids causes a sharp reduction in enzymatic activity, with similar total activity to the R882H hotspot mutation (36). Mutation of several of these amino acids even leads to complete ablation of enzymatic activity (36). Yet, only R882 mutations are selected for at a high frequency in hematopoietic cancers. This suggests that disruption of DNMT3A DNA binding alone is insufficient for driving leukemogenesis. It is therefore tempting to speculate that the R882H mutation provides an advantage to cancer cells primarily through higher order oligomer stabilization rather than disruption of DNA binding.
Previous studies of the R882H mutation in the context of the isolated catalytic domain suggested that this mutation disrupts the RD interface, preventing the formation of active DNMT3A tetramers (29,31). We note that in the DNMT3A crystal structures there are many amino acids along the RD interface that are clearly stabilizing DNMT3A-DNMT3A binding (34)(35)(36). Mutation of a number of these amino acids have been shown to disrupt this binding interface, both in this study and in others (29)(30)(31). In cancer, mutations of these RD interface stabilizing amino acids are observed occasionally, but are not selected for anywhere near the same frequency as R882 mutations. This is likely because heterozygous mutation of these amino acids would be activating to the WT allele rather than inhibitory, as we show here for the first time. While the R885A and D876A mutations at the RD interface have very low enzymatic activity on their own, we clearly show that upon mixing with WT or R882H, the total DNA methyltransferase activity of the resulting DNMT3A complex is synergistically increased. The previously proposed model of R882H mediated disruption of DNMT3A complexes is therefore not compatible with the observed loss of function in cancer patients or the numerous cellular and in vivo models developed to study this mutation.
Here, we clearly demonstrate that the full length protein behaves very differently than the isolated catalytic domain. While the DNA methyltransferase activity of the catalytic domain appears to be unaffected by protein concentration and oligomer distribution, the full length protein exhibits substantially reduced enzymatic activity as the oligomer equilibrium shifts from smaller to larger multimers. Given that the R882H mutation influences oligomer distribution, previous studies utilizing only the isolated catalytic domain may have missed key aspects of the mutant behavior. This may be why the dominant negative function the R882H mutation observed in vivo and confirmed here in reconstituted biochemical assays for the full length DNMT3A protein was recently challenged (33). Clearly, the DNMT3A N-terminal unstructured region, PWWP and/or ADD domains are playing an important role here. The mechanism by which this occurs is unclear and will require further investigation. Structural and functional studies have clearly demonstrated that the ADD domain plays a key role in regulating DNMT3A methyltransferase activity and substrate binding (35,41). The PWWP domain binds to DNA, causing substrate inhibition of DNMT3A enzymatic activity under some conditions (47). Both the PWWP and ADD domains bind histone ligands, which has been shown to be important for both DNMT3A recruitment to heterochromatin regions and activation of enzymatic activity (35,40,48). It is also possible that histone tail binding to the ADD or PWWP domains influences the oligomeric state and activity of DNMT3A. The N-terminal unstructured region has unknown function, but likely plays an important regulatory role given that the DNMT3A2 isoform missing this region is selectively expressed during embryonic development but not in differentiated cell types (45).
In summary, our work provides insight into the molecular mechanisms by which the R882H hotspot mutation suppresses the methyltransferase activity of the WT:R882H DNMT3A complex. This dominant negative behavior of the R882H mutant clearly depends on the ability of the mutant protein to form multimeric complexes with the WT allele. This raises the possibility that small molecules targeting the WT:R882H binding interface could disrupt this interaction and break down large oligomers into smaller complexes to restore the DNA methyltransferase activity of DNMT3A. This may have a therapeutic benefit for a large population of leukemia patients harboring the R882H hotspot mutation who currently have few options for treatment.

Protein purification
Both full length and catalytic domain (612-912) human DNMT3A were cloned into the pET28b vector and expressed in E. coli with an Nterminal His tag. E. coli cultures were grown in LB medium to an OD of 0.8 at 37 ºC prior to cooling the temperature to 16 ºC for overnight induction with 1 mM IPTG. Cells were pelleted by centrifugation and resuspended in lysis buffer (50 mM Tris pH 8.0, 300 mM NaCl, 1 mM TCEP, 0.1% Triton X-100 and 10 mM imidazole) prior to being lysed on ice by sonication (60% amplitude, three times 30 seconds). Cell debris was cleared by centrifugation at 30,000 rpm for 30 min. The supernatant from 2 L of culture was then diluted to a final volume of 350 mL in lysis buffer prior to purification on an AKTA FPLC with a 5 mL His trap. Dilution of the lysate was found to be important for recovering DNMT3A protein from the lysate. When concentrated, the DNMT3A protein in the lysate does not readily bind the Ni column, likely due to the formation of large DNMT3A oligomers. This is especially true for the full length R882H protein.
The His trap column was washed and DNMT3A protein was eluted in a step gradient of imidazole (50 mM, 100 mM, 200 mM, 500 mM) in column buffer (50 mM Tris pH 8.0, 300 mM NaCl and 1 mM TCEP). Pure elution fractions were combined and dialyzed over night at 4 ºC in dialysis buffer (50 mM Tris pH 8.0, 300 mM NaCl and 1 mM TCEP). After dialysis, DNMT3A protein was concentrated to between 0.5 and 1 mg/mL using a 100 MWCO centrifugation filter, then frozen and stored at -80 ºC. DNMT3A and DNMT3L proteins were subject to size exclusion chromatography analysis on a Superose 6 10/300 SEC column. The column was pre equilibrated with column buffer (50 mM Tris pH 8.0, 150 mM NaCl and 1 mM TCEP). 500 ug of protein was injected in column buffer with a flow rate of 0.5 mL/min. Components of each elution peak were confirmed by SDS-PAGE and coomassie staining.
Full length human DNMT3L was expressed in E. coli, in the pET28b vector. E. coli cultures were grown in LB medium to an OD of 0.8 at 37 ºC prior to cooling the temperature to 16 ºC for overnight induction with 1 mM IPTG. Cells from 12 L of E. coli culture were pelleted by centrifugation and resuspended in 100 mL lysis buffer (50 mM Tris pH 8.0, 300 mM NaCl, 1 mM TCEP, 0.1% Triton X-100 and 10 mM imidazole) prior to being lysed on ice by sonication (60% amplitude, three times 30 seconds). Cell debris was cleared by centrifugation at 30,000 rpm for 30 min. The supernatant was loaded directly onto a His trap column using an AKTA FPLC. The column was washed and DNMT3L protein eluted in a step gradient of imidazole (50 mM, 100 mM and 250 mM) in column buffer (50 mM Tris pH 8.0, 300 mM NaCl and 1 mM TCEP). Pure fractions were dialyzed overnight at 4 ºC in 50 mM Tris pH 8.0, 300 mM NaCl and 1 mM TCEP, then concentrated to 3 mg/mL and stored at -80 ºC.

Methyltransferase assay
DNMT3A protein was diluted into assay buffer (50 mM Tris pH 8.0, 20 mM NaCl and 1 mM TCEP) at varying concentrations prior to the addition of DNA and 3 H-SAM (specific activity 18 Ci/mmol, Perkin Elmer) to initiate the reaction. For experiments with mixed WT and mutant DNMT3A or DNMT3L, the protein was preincubated in assay buffer for 2 hrs (unless otherwise indicated) prior to initiating the reaction. The final reaction volume was 50 µL. Product was produced linearly with time up to 5 hrs. Reaction times for each experiment were 2 hrs unless otherwise indicated. The reactions were stopped with 1 mM cold SAM. The 50 µL reaction was then diluted into 300 uL assay buffer containing 40 µL DEAE resin. The solution was mixed by rotation for 30 min in a spin cup filter (Pierce™ Spin Cups -Paper Filter #69700 Life Tech) for the DNA to bind to the DEAE resin. The resin was collected and washed in wash buffer (50 mM Tris pH 8.0, 20 mM NaCl and 1 mM TCEP) three times by centrifugation in the spin cup. DEAE resin was then resuspended in 200 µL water and transferred to 4 mL scintillation fluid and counted on a Micro Beta 2 scintillation counter.

Mammalian cell expression and cell lysate analysis
N-terminally Myc-tagged full length WT DNMT3A and N-terminally FLAG-tagged full length R882H DNMT3A were subcloned into the pcDNA3.1 vector and transiently expressed in adherent HEK293 cells. Cells were plated in a 10-cm dish at 50% confluency one day prior to transfection. Cells were transfected with Fugene transfection reagent (Promega), then scraped, washed in PBS and pelleted 24 hrs later. Cell pellets were resuspended in 800 µL lysis buffer (25 mM Tris pH 7.8, 150 mM NaCl and 1 mM TCEP) prior to lysis by Dounce homogenization on ice. The lysate was centrifuged for 10 min at 14000 rpm. 400 uL of the soluble fraction of the cell lysate was then run on a Superose 6 size exclusion column in 25 mM Tris pH 7.8, 150 mM NaCl and 1 mM TCEP. Two mL fractions were collected and DNMT3A protein was detected by western blot (LICOR) using anti-Myc (9E10) or anti-FLAG (M2) antibodies. GAPDH was detected using an anti-GAPDH antibody (Cell Signaling, Cat. # 5174S).
DNMT3A DNA methyltransferase activity in each fraction was determined using a slightly modified protocol described above. 25 uL of each fraction containing DNMT3A protein was mixed with 3 H-SAM (3 µM) and DNA (dIdC, 10 µM bp) substrates. The reaction was allowed to proceed for 4 hours at room temperature prior to stopping the reaction with 1 mM unlabeled SAM. The reaction mixture was then incubated with DEAE resin and the amount of radiolabeled DNA measured as described above.

DNA binding assays
In an equilibrium binding experiment, FAM labeled 30 bp dsDNA (5'-FAM-TCGCTGTCGTACGTCGCGTCTGTGAGTCGA-3') was diluted in assay buffer (50 mM Tris pH 8.0, 20 mM NaCl and 1 mM TCEP) to a final concentration of 10 nM prior to the addition of varying concentrations of DNMT3A or DNMT3A/DNMT3L complex. The mixture was allowed to reach equilibrium over 30 min at room temperature. Fluorescence polarization was measured on an M1000 Tecan plate reader in a 384-well plate (Corning Cat #3820) in a final volume of 15 µL. To measure the relative residence time of WT and R882H DNMT3A for DNA, 10 nM FAM-labeled 30 bp dsDNA was incubated with 100 nM DNMT3A for 30 min at room temperature to reach equilibrium. 10 µM unlabeled dsDNA was then added and dissociation of the DNMT3A:FAM-DNA complex was measured continuously on an M1000 Tecan plate reader as described above.

Subnuclear localization
NIH-3T3 cells were plated at a density of 2E4 cells/well in a 96-well plate in DMEM with 10% FBS. The following day, cells were washed with PBS, then transfected with 200 ng plasmid of YFP-DNMT3A with or without 200 ng CFP-DNMT3L using Lipofectamine 3000. 24 hrs after transfection cells were washed and fixed with paraformaldehyde prior to being permeabilized with 0.1% Triton X-100 in PBS and DAPI staining. Cells were imaged using GE IN Cell 2000 high content screening system with a 40x objective lens. The number of nuclear puncta in YFP or YFP/CFP positive cells were counted and normalized to nuclear area.  A, Histogram of DNMT3A mutations observed in cancer from the TCGA cohort. The R882H hotspot mutation is located to the C-terminal catalytic domain. B, Crystal structure of the DNMT3L -DNMT3A -DNMT3A -DNMT3L complex (PDB 2QRV). R882 is located to the RD interface between two DNMT3A catalytic domains. C, The DNMT3A catalytic domain mediates the formation of large oligomers through self-association along two binding surfaces, called the RD and FF interfaces. DNMT3L prevents the formation of large DNMT3A oligomers, instead forming a tetramer comprised of two DNMT3A and two DNMT3L molecules.

Figure 2. Comparison of full length and catalytic domain oligomeric state and activity. A,
Chromatogram of purified full length and catalytic domain DNMT3A injected on a Superose 6 size exclusion column. 500 µg of each protein was injected. B, Total activity of DNMT3A full length and catalytic domain proteins at the indicated concentrations measured using a radioactive endpoint assay. C, Specific activity of full length and catalytic domain proteins at indicated concentrations. Specific activity at the lowest DNMT3A concentration was normalized to 1 for full length and catalytic domain respectively. Error bars represent SD, n = 3. and DNMT3L proteins. Mixed complexes were pre-incubated for 2 hrs at room temperature prior to injection on a Superose 6 column. Peak components were confirmed by SDS-PAGE analysis. 500 µg of each protein was injected. B, Quantification of nuclear puncta in NIH-3T3 cells after transfection with YFP-DNMT3A with or without co-transfection of CFP-DNMT3L. Error bars represent SD. F732A and R885A have a statistically significant lower number of nuclear puncta compared to WT, P < 0.05 (*).
There no do difference in number of mutant DNMT3A nuclear puncta compared to WT upon DNMT3L co-expression. C, Representative images from B. Scale bars, 5 µm. Mixtures of WT and mutant protein were pre-incubated at room temperature for 2 hrs. Error bars represent SD, n = 4. B, DNA methyltransferase activity from A normalized to total DNMT3A present. Error bars represent SD, n = 4. C, 100 nM WT DNMT3A activity in the presence of an equal concentration of R882H after a 2 hr pre-incubation. Activity of the WT (100 nM), R882H (100 nM) and DNMT3L (200 nM) complex (a one-to-one DNMT3A-to-DNMT3L) ratio after a 2 hr pre-incubation. Error bars represent SD, n = 4. D, Activity of 100 nM WT DNMT3A alone and in the presence of 100 nM DNMT3L. Activity of 100 nM R882H DNMT3A alone and the presence of 100 nM DNMT3L. WT and R882H DNMT3A activity alone is normalized to 1. Error bars represent SD, n = 3. Figure 5. Characterization of R885A and D876A oligomer disrupting mutants and their effect on WT and R882H DNMT3A activity. A, Chromatograms of R885A and D876A injected on a Superpose 6 size exclusion column. B, Total activity of 100 nM WT, R882H, R885A and D876A alone compared to the activity of 100 nM WT protein mixed with an equal amount of the indicated DNMT3A protein after a 2 hr pre-incubation. Error bars represent SD, n = 6. C, Activity of 100 nM WT DNMT3A titrated with R885A or D876A after a 2 hr pre-incubation. Activity of R885A and D876A alone at the indicated concentrations is shown for comparison. The reaction was stopped after 90 min. Error bars represent SD, n = 2. D, Activity of 100 nM WT DNMT3A in the presence of varying concentrations of DNMT3L after a 2 hr preincubation. Error bars represent SD, n = 3. E, Activity of 200 nM R882H DNMT3A in presence of an equal amount of R885A or D876A after a 2 hr pre-incubation. The reaction was stopped after 4 hrs. Error bars represent SD, n = 3.  Here, large oligomeric DNMT3A species are significantly less active than smaller complexes. The R882H hotspot mutation found in leukemia exerts its dominant negative behavior in part by binding WT DNMT3A and shifting the distribution of DNMT3A complexes from smaller, more active tetramers, to larger oligomers with low activity. DNMT3L activates DNMT3A by breaking up large oligomers into smaller complexes.