A versatile mouse model of epitope-tagged histone H3.3 to study epigenome dynamics

The variant histone H3.3 is incorporated into the genome in a transcription-dependent manner. This histone is thus thought to play a role in epigenetic regulation. However, our understanding of how H3.3 controls gene expression and epigenome landscape has remained incomplete. This is partly because precise localization of H3.3 in the genome has been difficult to decipher particularly for cells in vivo. To circumvent this difficulty, we generated knockin mice, by homologous recombination, to replace both of the two H3.3 loci (H3f3a and H3f3b) with the hemagglutinin-tagged H3.3 cDNA cassette, which also contained a GFP gene. We show here that the hemagglutinin-tagged H3.3 and GFP are expressed in the majority of cells in all adult tissues tested. ChIP-seq data, combined with RNA-seq, revealed a striking correlation between the level of transcripts and that of H3.3 accumulation in expressed genes. Finally, we demonstrate that H3.3 deposition is markedly enhanced upon stimulation by interferon on interferon-stimulated genes, highlighting transcription-coupled H3.3 dynamics. Together, these H3.3 knockin mice serve as a useful experimental model to study epigenome regulation in development and in various adult cells in vivo.

The variant histone H3.3 is incorporated into the genome in a transcription-dependent manner. This histone is thus thought to play a role in epigenetic regulation. However, our understanding of how H3.3 controls gene expression and epigenome landscape has remained incomplete. This is partly because precise localization of H3.3 in the genome has been difficult to decipher particularly for cells in vivo. To circumvent this difficulty, we generated knockin mice, by homologous recombination, to replace both of the two H3.3 loci (H3f3a and H3f3b) with the hemagglutinin-tagged H3.3 cDNA cassette, which also contained a GFP gene. We show here that the hemagglutinin-tagged H3. 3

and GFP are expressed in the majority of cells in all adult tissues tested. ChIP-seq data, combined with RNA-seq, revealed a striking correlation between the level of transcripts and that of H3.3 accumulation in expressed genes. Finally, we demonstrate that H3.3 deposition is markedly enhanced upon stimulation by interferon on interferon-stimulated genes, highlighting transcription-coupled H3.3 dynamics. Together, these H3.3 knockin mice serve as a useful experimental model to study epigenome regulation in development and in various adult cells in vivo.
H3.3 is a conserved variant histone distinct from the canonical histone H3 (H3.1 and H3.2). Although it differs only in four or five amino acids from the canonical H3, H3.3 has an activity and mode of action different from H3.1 and H3.2 (1)(2)(3). Unlike the canonical histone H3, which is synthesized and deposited into nucleosomes during DNA replication, H3.3 is expressed throughout cell cycle and is incorporated into chromatin along with transcription. Its deposition is in some cases induced by transcriptional activation triggered by external signals, such as interferons, and linked to transcription elongation (4 -7). Once deposited, H3.3 remains for a long time, although with varying turnover kinetics (8). For these reasons, H3.3 is implicated to play a role in epigenetic regulation (9). Although its deposition has been generally associated with expressed genes, H3.3 is deposited in bivalent genes in ES cells, as well as heterochromatin and telomeres (10 -12). The H3.3 specific histone chaperon, HIRA mediates H3.3 deposition in expressed genes, bivalent genes, and damaged DNA sites (10,11,13,14). ATRX and DAXX, on the other hand, direct H3.3 incorporation into heterochromatin and telomeres (11,(15)(16)(17)(18).
H3.3 is encoded by two independent loci in both humans and mice (H3f3a and H3f3b), which produce a protein with the identical amino acid sequences. Targeted knockout of H3f3b shows the requirement of this gene in early embryogenesis and sperm development (19,20). Conditional knockout of both H3f3a and Hf3b also results in defective embryonic development and abnormal cell growth (21). Furthermore, mutations in the H3F3A gene and genes in the ATRX pathway have been linked to pediatric brain cancer (22)(23)(24)(25). Recently a chromatin reader, ZMYND11, that recognizes H3.3 methylated at lysine 36 has been identified (26,27). These lines of studies demonstrate the fundamental role of H3.3 in the life of mammals.
Despite the well-acknowledged importance, our understanding of H3.3 has been hampered considerably, because of the difficulties in distinguishing H3.3 from the canonical histone H3. Thus, regulated expression of endogenous H3.3 proteins has remained elusive, including regulated distribution of H3.3 in the genome. Further, it is largely unknown how H3.3 responds to physiological changes in the cells. Exogenous introduction of a tagged H3.3, although useful in some cases, does not substitute for the endogenous H3.3 genes. It is of note that Wen et al. (30) reported construction of H3f3b-HA 4 mice using a zinc finger genome-editing method. However, there is no mouse model available for the H3.3f3a gene. Because of the lack of a suitable in vivo models, our knowledge on the status and the role of H3.3 in terminally differentiated cells and small populations of specialized cells is very scarce in mammals.
To study the activity of endogenous H3.3 in vivo, we generated, by homologous recombination, knockin mice that express HA-tagged H3.3 from the H3f3a and H3f3b loci. We report here that H3f3a-HA and H3f3b-HA proteins are expressed broadly in all adult tissues examined and mostly at similar levels, including various cell types in the immune system. ChIP-seq analysis revealed that H3.3 is enriched in expressed genes in a manner closely recapitulating levels of mRNA expression, its localization found from the promoter, the transcription start site (TSS) to the gene body, and the transcription end site (TES). Finally, we demonstrate that interferon stimulation causes rapid H3.3 incorporation within interferon-stimulated genes, highlighting a dynamic nature of H3.3 deposition. Together, these knockin mice would serve as a valuable model for studying the mode and mechanisms of H3.3 activity in cells in vivo, in heath and disease.

Construction of knockin mice expressing HA-tagged histone H3.3
With the aim of developing new mouse strains expressing tagged H3.3 from the endogenous loci, we replaced part of the H3f3a and H3f3b genes with an HA-tagged H3.3 cDNA cassette by homologous recombination. Fig. 1A and Fig. S1A schematically depict the strategy used. The exon 2 containing the ATG through exon 3 of the H3f3a and H3f3b loci were replaced by the cassette containing H3.3 cDNA fused to the HA and FLAG tag (hereafter H3f3A-HA and H3f3b-HA). An IRES-EGFP was inserted at the 3Ј end, enabling independent monitoring of free EGFP expression from each locus. In addition, loxP sites were placed flanking the cassettes for potential conditional deletion. At the 3Ј end of EGFP cDNA, drug selection marker, neomycinor hygromycin-resistant gene flanked by FRT was placed in the H3f3a and H3fb targeting vector, respectively. The drug makers were later removed by crossing with FLP mice. The inserted HA-FLAG tag sequence is shown in Fig. S1B. Mice with germ line transmission were genotyped by PCR (Fig. 1A, lower panel, and Fig. S1C). Thus, the recombination at either locus did not alter the regulatory regions, promoters, 5Ј-and 3Ј-UTRs and introns (except a sequence between exon 2 and exon 3). Heterozygous mice were born as expected and grew with a comparable weight increase with WT mice. In addition, both males and females were fertile. Because the replacement of a single allele was sufficient to detect HA-tagged H3.3 and GFP expression, all experiments in this work were carried out with heterozygous mice. The number of homozygous H3f3a-HA and H3f3b-HA pups appeared somewhat lower than expected. In addition, the number of double knockin mice carrying both H3f3a-HA and H3f3b-HA pups appeared lower. The precise ratios of homozygous mice and double knockin mice await results of additional, mating experiments.
In Fig. 1B, expression of WT or HA-tagged H3.3 mRNA was compared by qRT-PCR. H3f3a-HA and H3f3b-HA mRNA were expressed at varying levels in all adult tissues examined, closely paralleling that of WT mRNA for both loci. The expression was generally higher in the reproductive organs and tissues of the immune system but lower in liver, heart, and kidney. Expression of HA-tagged H3.3 protein was detected in various adult tissues by immunoblot analysis using anti-HA antibody ( Fig. 1C). H3.3A-HA and H3.3B-HA both migrated between 14-and 24-kDa molecular masses as expected and found in all tissues at similar levels.

H3.3A-HA and H3.3B-HA are expressed in adult tissues: immunohistochemical analysis
To examine detailed expression of HA-tagged H3.3, we performed immunohistochemistry (IHC) for adult tissues. spleen, thymus, liver, kidney, brain, testis, and ovary. Our purpose was to assess whether H3.3 expression varies among tissues and among cells within a given tissue. In Fig. 2 (A-D), 5-m sections from indicated organs from H3f3a-HA mice were stained with anti-HA antibody (right panel). Matching sections were stained with control IgG, counterstained with hematoxylin to localize the nuclei (left panel). Images acquired at low and high magnifications are shown. In spleen, most lymphocytes, residing in the white pulp and red pulp displayed robust HA stain. Similarly, most cells in thymus, both in medulla and cortex were positive for HA signals. In liver, HA stain was found widely in hepatocytes and Kupffer cells. Likewise, in kidney most cells, including those in the glomerular tubules were stained with anti-HA antibody. In the brain, again, neurons clustered in the hippocampus area exhibited strong HA signals, showing exclusive localization in the nuclei. In testis, spermatocytes and round spermatids were stained along with non-germ line Leydig cells and granulosa. In ovary, HA stain was found mostly in the nuclei of theca cells and oocytes but more sparsely in granulosa cells. Essentially, the same results were obtained from IHC analyses from the H3f3b-HA mice (Fig. S2). Thus, H3.3-HA is expressed broadly in most, if not all types of cells, in all adult tissues tested.

H3.3-driven GFP is expressed in most of immune cell types: flow cytometry analysis
To assess H3.3 expression in different cell types in further detail, we opted for monitoring GFP signals by flow cytometry. H3.3 expression was tested for T lymphocytes (CD3 ϩ ), B lymphocytes, and myeloid cells from thymus, spleen, and bone marrow. Fig. 3 shows flow cytometry profiles of GFP-positive cells with CD3, B220, and CD11b markers. The cells from both H3f3a-HA and H3f3b-HA mice were tested, with cells from WT mice serving as a negative control for GFP. A large majority of cells in spleen (T cells, B lymphocytes, and myeloid cells) were GFP-positive in both H3f3a-HA and H3f3b-HA mice. Similarly, most of the thymic T cells (CD4 ϩ and CD8 ϩ ) expressed GFP. Likewise, most of B cells and myeloid cells residing in bone marrow were positive for GFP. The histograms in Fig. 3 (right panels) showed that GFP from H3f3a-HA and H3f3b-HA mice are expressed at comparable levels in each cell type in these tissues. Furthermore, in all cases GFP signals exhibited a narrow monophasic peak, indicating that H3.3 expression levels are largely uniform within each cell type. These results were reproducibly observed with samples from additional four mice aged from 6 to 10 weeks. We also found that the majority of macrophages, dendritic cells, and NK cells in spleen expressed GFP signals (Fig. S3).

Epitope-tagged H3.3 mouse model Genic deposition of H3.3 recapitulates the levels of transcript expression: ChIP-seq and RNA-seq analyses
H3.3 deposition has traditionally been investigated by using exogenously introduced epitope-tagged H3.3 as a surrogate vector. The use of exogenous H3.3 vector has been accredited, because exogenous H3.3 was shown to be incorporated into nucleosomes and deposited in chromatin. Nevertheless, the use of exogenous vector has a potential caveat, in that regulated expression of endogenous H3.3 is not easily reproduced by an exogenous H3.3. The use of the knockin mice would obviate this caveat, presumably allowing more accurate assessment of H3.3 deposition. We thus examined genome-wide distribution of H3.3 in mouse embryonic fibroblasts (MEFs) established from H3f3b-HA embryos. Confocal image of H3f3b-HA MEFs showed that H3.3-HA immunoreactivity was found predominantly in the "euchromatic" regions of the nucleus (Fig. 4A). On the other hand HA signals were generally low in 4Ј,6Ј-diamino-2-phenylindole-dense "heterochromatic" regions, consistent Below are the schematic structure of targeting vector and the 5Ј and 3Ј site arms. The targeting vectors and H3.3 cDNA fused to FLAG-HA, free EGFP, and drug selection markers. Bottom panels, identification homozygous knockin (ha/ha), heterozygous (ha/Wt), and WT (Wt/Wt) pups. Quantitative PCR-based genotyping was performed with tail DNA using primers in Fig. S1C. B, expression of endogenous H3.3 and knockin H3.3-HA mRNA in heterozygous knockin mouse. qRT-PCR detection of mRNA from the H3f3a (ha/wt) (left panel) and H3f3b (ha/wt) loci (right panel) in indicated adult tissues. H3f3a-HA and H3f3b-HA are transcripts from knockin loci, and H3f3a-Endo and H3f3b-Endo are transcripts from endogenous loci. Expression levels were normalized to Gapdh. The values represent the averages of three independent analysis using tissues from 8-week-old mice. C, immunoblot detection of H3.3-HA proteins, from knockin mice. 500 ng of histones prepared from 8-week-old heterozygous mice (H3f3a-HA on the left and H3f3b-HA on the right) were tested by immunoblot using anti-HA or anti-H2B antibody.

Epitope-tagged H3.3 mouse model
with the results with exogenous H3.3 (5,6). No HA stain was found in WT MEFs. ChIP-seq analysis was performed for growing MEFs using anti-HA antibody. We found a total of ϳ40,700 H3.3-HA peaks, which were localized to three genomic regions, genic (blue), intergenic (red), and regulatory/promoter (green and other colors) (Fig. 4B). Nearly 60% of H3.3 signals were found in the genic regions (TSS, exons, and introns). Goldberg et al. (11) have shown that H3.3 is enriched in telomere repeats in mouse embryonic stem cells. We thus tested whether H3.3 is enriched in telomeres in MEFs as well. We found that H3.3 signals were enriched in the consensus telomere repeats TTAGGG n and closely related telomeric repeats TTAGG n and Epitope-tagged H3.3 mouse model TTAGGC n . As shown in Fig. 4C, H3.3 signals were significantly enriched in telomere repeats. On the other hand, H3.3 signals on other repeat sequences, LINE, SINE, LTR, SAT, or simple repeats were near or below the background. These results closely resemble those of Goldberg et al. (11) and confirm H3.3 localization on telomeres. To correlate H3.3 deposition with transcription, we performed RNA-seq for the MEFs. In Fig. 4D, RNA-seq peak signals found on all annotated mouse genes were fractioned into 10 bins according to increasing transcripts levels (x axis), where the number of genes in each bin is shown on the y axis. Bin 1 represented 7,158 silent genes, and bin 10 denoted most highly expressed genes. Majority of expressed genes were in bin 5 to bin 7, peaking at bin 6. We next plotted average H3.3 deposition for genes in each bin (Fig. 4E). Strikingly, the amount of H3.3 deposition closely paralleled that of mRNA levels, i.e. the higher the RNA expression, the higher the H3.3 deposition and vice versa. It is interesting to note that for genes expressed at moderate levels (bins 3-6), H3.3 deposition peaked at/near the TSS, with fewer H3.3 signals over the gene body. This pattern of H3.3 accumulation has been reported as a general feature of H3.3 accumulation in many studies (11,13,28,29). However, for genes expressed at higher levels (bins 7-10), H3.3 signals increased more significantly over the gene body, with the greatest rise near or at the TES and beyond. H3.3 signals were virtually absent in silent genes (bin 1). Fig. 4F provides IGV screenshots of H3.3 deposition and RNA peaks for a low (bin 3: Vegfa) and high expressed genes (bin 10: Vim). Heat maps in Fig. S4A show H3.3 distribution on individual genes in each bin. Fig. 4 (F-I) shows average H3.3 deposition patterns over all genes, constitutively expressed genes and silent genes, respectively. Detailed heat maps for constitutive and silent genes Fig. 4 (H-I) are shown in Fig. S4 (B-D).

Interferon (IFN) stimulation triggers rapid H3.3 accumulation in IFN-stimulated genes (ISGs)
We previously showed that IFN stimulation causes marked accumulation of H3.3 in a number of genes transcriptionally activated by IFN. In these studies, we monitored the deposition of virally transduced H3.3-HA in fibroblasts (5,6). To substantiate these observations for endogenous H3.3 and at a genomewide level, ChIP-seq was performed for MEFs expressing H3f3b-HA stimulated by IFN␤ for 6 h, in parallel with RNAseq. We found ϳ49,000 H3.3 peaks in IFN-treated cells, a significant increase over untreated cells, which had 40,700 peaks (Fig. 5A). This increase was presumably due to increased incorporation of the existing H3.3 into the chromatin rather than increased H3.3 expression (see H3f3a and H3f3b RNA peaks in Fig. S5A). Nevertheless, global pattern of H3.3 distribution in the genome was not substantially different between untreated and IFN-treated samples (pie chart in Fig. 5A). The Venn diagram in Fig. 5B shows that a large majority of H3.3 peaks were common in untreated and IFN-treated cells. RNA-seq found 932 genes up-regulated after IFN treatment (Ͼ1.5-fold change (FC) and FDR of 0.1), which were designated as ISGs (Fig. 5C and Table S1). Gene Ontology (GO) analysis showed enrichment in the type I IFN signaling, anti-viral, host defense, cytokine categories, as expected (Fig. S5B), whereas 699 genes were down-regulated by IFN treatment (Fig. 5C). Fig. 5D depicts H3.3 accumulation on ISGs, where H3.3 signals are plotted on three groups of ISGs, according to the degree of mRNA induction. The data showed that H3.3 deposition increased greatly in all ISGs. Interestingly, ISGs with the greatest induction (log 2 FC Ͼ 4) showed the greatest increase in H3.3 deposition, those with intermediate induction (log 2 FC 2-4) showed intermediate increase, and those with the least induction (log 2 FC 1.5-2) showed the least increase in H3.3 accumulation. Conversely, H3.3 deposition was unchanged for genes down-regulated by IFN treatment both in amounts and patterns (Fig. 5E). The heat map in Fig. S6 depicts the rank order of H3.3-HA peak intensity on ISGs. Fig. 5 (E-H) provides snapshots of H3.3 deposition in typical ISGs, Ifit1 and Irf7; a down-regulated gene, Scd1; and a constitutively expressed Cdkn2 gene, supplemented with RNA peaks. Together, these data illustrate that H3.3 interacts with chromatin in a remarkably dynamic way and changes its localization in response to external signaling. Moreover, a direct correlation between the levels of ISG induction and that of H3.3 deposition is highly analogous with the correlation found for constitutively expressed genes in Fig. 4D, reinforcing the notion that H3.3 deposition is directly linked to transcriptional activity.

Discussion
We generated knockin mice in which the two H3.3 loci, H3f3a and H3f3b, were replaced with HA-tagged H3.3 by homologous recombination. Recently, Wen et al. (30) reported construction of H3f3b-HA mice using a zinc finger genomeediting method. We show here that our knockin mice allow sensitive detection of H3.3-HA protein expression in a variety of cells in vivo. These mice also proved useful for studying detailed, genome-wide distribution of H3.3. For these experiments, samples from heterozygous mice provided sufficient signals. Our knockin model is devoid of certain caveats inherent to experimental models that utilize exogenous H3.3, because introduced H3.3 does not readily recapitulate regulated expression of endogenous H3.3, and some cells are resistant to exogenous vector expression. IHC and flow cytometry analyses demonstrated that H3.f3a-HA and H3f3b-HA were expressed at comparable levels in most, if not all cells in various adult tissues, including brain and immune tissues. These tissues are rich in terminally differentiated, postmitotic cells, where our knowledge on the histone H3.3 is still scarce.
The use of anti-HA antibody to detect H3.3 proved particularly valuable for ChIP-seq experiments, because high-intensity H.3.3 peaks were observed with excellent resolution. This is noteworthy, given that commercially available anti-H3.3 antibodies are sometimes variable in quality. ChIP-seq analyses, combined with RNA-seq revealed a striking correlation between the amounts of H3.3 deposition and those of tran- Figure 3. Flow cytometry analysis of immune cells expressing EGFP from H3f3a-HA or H3f3b-HA mice. Cells from spleen, thymus, or bone marrow from 9-week-old H3f3a-HA or H3f3b-HA mice were stained with antibodies for B cells (B220), T cells (CD3 and CD4), or myeloid cells (Cd11b). Cells with these markers (y axis) were tested for EGFP signals (x axis). Histograms in the right panels depict EGFP mean fluorescent intensity.

Epitope-tagged H3.3 mouse model
scripts. Although the TSS-dominant H3.3 accumulation was found for the genes expressed moderately, genes expressed at higher levels showed greater H3.3 deposition over the gene body with even higher deposition toward the TES. This TESbiased pattern of deposition was also found in the majority of ISGs, where the levels of H3.3 deposition again correlated with the levels of transcriptional activation. These observations point to a complex mechanism of H3.3 accumulation that is presumably linked to transcription elongation and rate of RNA polymerase II passage through the gene body. In conclusion, Epitope-tagged H3.3 mouse model these H3.3 knockin mice would serve as a useful model to study various aspects of H3.3 activity.

Construction of H3f3a and H3f3b knockin mice
All the animal studies were performed as per the approved protocols of NICHD Animal Care and Use Committee (Animal Study Proposal (ASP) 14-044). The cDNA cassette human H3.3 cDNA fused to HA and FLAG tag and IRES-driven EGFP cassette were cloned into the targeting vectors TAM1 and TAM2 for H3f3a and H3fb3, respectively, along with the neomycin resistance gene or hygromycin resistance gene (Fig. 1A). Mouse ES cells (129/Sv ESC) replacing H3f3a and H3fb loci with the targeting vector were generated by homologous recombination. The complete sequence information for TAM1 and TAM2 vectors is available in the Mouse Genome Database. Isolation of ES clones, injection into pseudo pregnant C57BL/6 mice, identification of chimeric mice, and construction of mice with germ line transmission, as well as removal of the drug resistance genes by FRT recombination were performed by GenOway. TAM1 and TAM2 heterozygous mice were propagated in this laboratory and genotyped by specific primers Fig.  S1C. Heterozygous mice were bred with C57BL/6 mice for at least six generations. For genotyping, genomic DNA was extracted from adult mouse tails by using DirectPCR Lysis Reagent (Viagen) supplemented with 1 mg/ml of proteinase K (Ambion) by incubating at 56°C overnight and heat inactivating the proteinase K at 85°C for 45 min. The lysate was spun down, and 2 l of supernatant was used for PCR genotyping. A homozygous H3f3a knockin mice carried a 941-bp amplicon, WT mice had a 611-bp amplicon, and heterozygous mice displayed both bands. A homozygous H3f3b knockin mice had a 750-bp amplicon, WT mice had a 600-bp piece, and heterozygous mice carried both bands (Fig. 1A). The H3f3a and H3f3b mouse are available under the following accession numbers from Mouse Genome Informatics: H3f3aϽtm1.1KozaϾ (accession number 6195526) and H3f3bϽtm1.1KozaϾ (accession number 6195527).

Mouse embryonic fibroblasts
Tissues from embryos of heterozygous H3f3a-HA and H3f3b-HA mice were collected after removal of heads, tails, limbs, and internal organs and minced, trypsinized for 20 min at 37°C, and washed. Single cells were cultured in Dulbecco's modified Eagle's medium (Mediatech) supplemented with 10% fetal bovine serum (Atlanta Biologicals) and 100 units/ml penicillin/streptomycin and split at 1:2 ratios when freshly confluent. MEFs were passaged 15 consecutive times, passing a growth crisis, and attained morphologically homogeneous cells.

Immunohistochemistry
Tissues from indicated adult organs were fixed in 10% formaldehyde (pH 7.0) overnight and embedded in paraffin. Tissues were sectioned at a 5-m thickness, deparaffinized on slides. The slides were permeabilized with 2% Triton X-100 for 30 min and incubated with anti-HA (Abcam, ab9010) antibody diluted in TBS buffer containing 1% BSA at room temperature for 120 min, followed by blocking with 10% normal goat serum. The slides were then washed with 0.025% Triton X-100 in TBS and incubated with horseradish peroxidase (HRP)-conjugated secondary antibodies for 30 min at room temperature. The slides were developed using diamino benzidine substrate followed by two counterstaining with hematoxylin. The slides were imaged on Zeiss Axioplan 2 microscope.

Histone preparation and immunoblot analysis
Tissues and organs were homogenized in Dounce homogenizer in homogenization buffer (50 mM Tris-HCl, pH 7.5, 25 mM KCl, 250 mM sucrose, 2 mM sodium butyrate, 1 mM sodium orthovanadate, 0.5 mM phenylmethylsulfonyl fluoride, 1ϫ protease inhibitor mixture) and centrifuged. The pellets were incubated in Triton extraction buffer (PBS containing 0.5% Triton X-100 (v/v), 2 mM phenylmethylsulfonyl fluoride, 0.02% (w/v) The enrichments were calculated using the RepEnrich2 software (36) (https://github.com/nerettilab/RepEnrich2). (Please note that the JBC is not responsible for the long-term archiving and maintenance of this site or any other third party hosted site.) The fold enrichment on the y axis represents the number of reads in the HA sample over the input ChIP. The dotted line shows H3.3 signals that are more than 1-fold and above. The inset shows three different subclasses of telomeric repeat sequences and their fold enrichment over input samples. ERV, endogenous retrovirus. D, RNA-seq analysis. All annotated mouse genes were divided into 10 bins according to the transcript levels (log2, RPKM), and each bin contained equal number of genes (ϳ3000 genes) with the exception of B1 and B2, which had ϳ7,000 genes and plotted as histogram. The y axis (counts) represents how many times mRNA expression occurred within the specified range of RPKM in each bin (Table S1 (Binned genes)). E, ChIP-seq analysis. Average H3.3 accumulation was plotted for genes in 10 bins for the genic regions (TSS, gene body, TES Ϯ 5 kb). Numbers on the y axis represent reads per million. F, IGV screenshots for ChIP-seq peaks (blue) and RNA peaks (red). G-I, average H3.3 distribution patterns for all genes (G), constitutively expressed genes (H), and silent genes (I) are shown. NaN 3 ), centrifuged, and resuspended in 0.2 N HCl at a density of 4 ϫ 10 7 nuclei/ml, and extraction was continued overnight at 4°C. Extracts were centrifuged at 6,500 ϫ g for 10 min at 4°C to remove debris and stored at Ϫ20 C. Protein contents in the supernatants were measured by the Bradford assay. 0.5 g of extracted proteins was separated on NuPAGE gels (4 -12%) (Invitrogen) and transferred to a polyvinylidene difluoride membrane Immobilon-P (Millipore). The membrane was incubated with antibodies for HA (Abcam ab9110) and histone H2B (Abcam ab1790) at 1:2000 dilution followed by HRP-conjugated donkey anti-rabbit IgG at 1:5000 dilutions (Amersham Biosciences). The signal was visualized by the SuperSignal West Pico kit (Pierce) according to the manufacturer's protocols.

qRT-PCR
Total RNA was extracted with TRIzol (Life Technologies), treated with DNase I (Promega), and reverse-transcribed with SuperScript TM II reverse transcriptase (Invitrogen) and mRNA levels for indicated genes was detected as described using primers listed in Table S3 (6). mRNA levels were normalized by Gapdh. Quantitative differences in H3f3a and H3f3b in various tissues were calculated using the 2 Ϫ⌬⌬CT method relative to Gapdh (31).

RNA-seq
RNA-seq library was prepared with Illumina stranded mRNA-seq kit, following the manufacturer's instructions. Total RNA was extracted from MEFs using RNeasy columns (Qiagen). RNA samples (500 ng) were used for library construction. Pooled libraries were sequenced by Illumina HiSeq 2500 sequencer. The RNA-seq Fastq files were aligned to the UCSC mm9 genome using TopHat2 (32) and Bowtie2 (33). Read counts per gene were generated using HTSeq-count, and differential expression analysis was performed using edgeR (34). Normalized bigWig files for IGV genome browser tracks were generated using deepTools (35).

ChIP and ChIP-seq
Quantitative ChIP was performed for immunoprecipitated chromatin from MEFs as described (6). ChIP-seq libraries were prepared using 10 -50 ng of immunoprecipitated DNA using a NEBNext Ultra TM DNA library prep kit for Illumina (Cat E7370) following a standard protocol, normalized by concentration and sequenced in Illumina Next-Seq 550 following standard protocol. The quality of the reads was assessed by FastQC (v.0.9.5). The ChIP-seq reads were mapped onto the Mus musculus (mm9) genome using bowtie/2-2.2.5 and Samtools/1.2. To directly compare different ChIP-seq data, files were binned into 10-bp bins and then converted to sequence depth normalized to bigWig files using deepTools (v2.5.0). The peak calling was performed using both SICER/v1.1 (window size, 200; genome size, 0.787; gap size, 600; FDR, Ͻ0.01) and HOMER peak calling programs (Table S2). We performed visual inspection of the data using a local installation of the IGV genome browser view (Broad Institute). The peak annotation was performed using both CEAS-cistrome package and Homer software tools. The average metagene profiles of constitutive, silent, and induced genes was plotted using ngsplot/2.6.1 tool. We obtained repetitive element annotation version mm9-Jul 2007 -RepeatMasker open-3.2.8 -Repeat Library 20090604 from the repeatmasker site (http://www.repeatmasker.org/ species/mm.html) site. 5 We used RepEnrich2 pipeline available at https://github.com/nerettilab/RepEnrich2; M (36) and obtained count data for various repetitive elements including telomeres. 5 Input and H3.3-HA ChIP fastq files were aligned to the mm9 mouse annotation to obtain bam files using the default bowtie2 settings (bowtie2 -q -threads ϭ $(( SLURM_CPUS_PER_TASK -4)) -x genome -U ../Sample.fastq.gz samtools view -Sb -Sample. bam). RepEnrich2_subset.py (python ../RepEnrich2_subset.py Sample.bam 30 Sample-Name -pairedend FALSE) was run on the resulting bam files to output uniquely mapping and multimapping reads and finally RepEnrich2.py (python/RepEnrich2.py mm9_ repeatmasker.txt Sample_Output_Folder-Input/Sample-Name setup_folder../Sample.gz Sample_unique.bam -cpus 16 -pairedend FALSE) was run to obtain count files for various classes of repeat regions. The fold enrichments and log2 cpm were calculated using exactTest available on edgeR Bioconductor package after normalizing for the number of uniquely mapped reads.