The putative RNA helicase DDX1 associates with the nuclear RNA exosome and modulates RNA/DNA hybrids (R-loops)

The RNA exosome is a ribonuclease complex that mediates both RNA processing and degradation. This complex is evolutionarily conserved, ubiquitously expressed, and required for fundamental cellular functions, including rRNA processing. The RNA exosome plays roles in regulating gene expression and protecting the genome, including modulating the accumulation of RNA-DNA hybrids (R-loops). The function of the RNA exosome is facilitated by cofactors, such as the RNA helicase MTR4, which binds/remodels RNAs. Recently, missense mutations in RNA exosome subunit genes have been linked to neurological diseases. One possibility to explain why missense mutations in genes encoding RNA exosome subunits lead to neurological diseases is that the complex may interact with cell- or tissue-specific cofactors that are impacted by these changes. To begin addressing this question, we performed immunoprecipitation of the RNA exosome subunit, EXOSC3, in a neuronal cell line (N2A), followed by proteomic analyses to identify novel interactors. We identified the putative RNA helicase, DDX1, as an interactor. DDX1 plays roles in double-strand break repair, rRNA processing, and R-loop modulation. To explore the functional connections between EXOSC3 and DDX1, we examined the interaction following double-strand breaks and analyzed changes in R-loops in N2A cells depleted for EXOSC3 or DDX1 by DNA/RNA immunoprecipitation followed by sequencing. We find that EXOSC3 interaction with DDX1 is decreased in the presence of DNA damage and that loss of EXOSC3 or DDX1 alters R-loops. These results suggest EXOSC3 and DDX1 interact during events of cellular homeostasis and potentially suppress unscrupulous expression of genes promoting neuronal projection.

The RNA exosome is a ribonuclease complex that mediates both RNA processing and degradation.This complex is evolutionarily conserved, ubiquitously expressed, and required for fundamental cellular functions, including rRNA processing.The RNA exosome plays roles in regulating gene expression and protecting the genome, including modulating the accumulation of RNA-DNA hybrids (R-loops).The function of the RNA exosome is facilitated by cofactors, such as the RNA helicase MTR4, which binds/remodels RNAs.Recently, missense mutations in RNA exosome subunit genes have been linked to neurological diseases.One possibility to explain why missense mutations in genes encoding RNA exosome subunits lead to neurological diseases is that the complex may interact with cell-or tissue-specific cofactors that are impacted by these changes.To begin addressing this question, we performed immunoprecipitation of the RNA exosome subunit, EXOSC3, in a neuronal cell line (N2A), followed by proteomic analyses to identify novel interactors.We identified the putative RNA helicase, DDX1, as an interactor.DDX1 plays roles in doublestrand break repair, rRNA processing, and R-loop modulation.To explore the functional connections between EXOSC3 and DDX1, we examined the interaction following doublestrand breaks and analyzed changes in R-loops in N2A cells depleted for EXOSC3 or DDX1 by DNA/RNA immunoprecipitation followed by sequencing.We find that EXOSC3 interaction with DDX1 is decreased in the presence of DNA damage and that loss of EXOSC3 or DDX1 alters R-loops.These results suggest EXOSC3 and DDX1 interact during events of cellular homeostasis and potentially suppress unscrupulous expression of genes promoting neuronal projection.
The RNA exosome complex processes/degrades multiple classes of RNAs in both the nucleus and the cytoplasm.The best-defined role of the RNA exosome is the processing of precursor rRNA to mature rRNA for the production of ribosomes (1,2,(20)(21)(22).Other RNAs in the nucleus/nucleolus targeted by the RNA exosome include RNA from RNA/DNA hybrids, commonly known as R-loops, promoter upstream transcripts, snRNAs, small nucleolar RNAs (snoRNAs), and other noncoding RNAs (ncRNAs), such as transcription start site-associated antisense transcripts (xTSS-RNA) (3,(23)(24)(25)(26)(27)(28).In the cytoplasm, the RNA exosome targets aberrant transcripts for degradation, mRNAs for regulatory turnover, and viral RNAs as a cellular immune response (26,(29)(30)(31)(32)(33).The RNA exosome has also been implicated in DNA double-strand break repair by homologous recombination, potentially targeting aberrant transcripts produced upon DNA damage (34).The transcripts that are targeted in response to DNA damage are often within R-loop structures, which occur naturally during transcription.An accumulation of R-loops may have deleterious effects, leading to double-strand breaks and genomic instability (35).Studies suggest that the RNA exosome is poised to degrade released RNA after R-loops are unwound by RNA/DNA helicases (36).
The RNA exosome plays a critical role in cells by degrading and/or processing many transcripts in different cellular compartments.Thus, missense mutations in genes encoding structural subunits of the complex are linked to several human diseases, termed exosomopathies (37,38).The first link between the RNA exosome complex and disease described a patient with a missense mutation in EXOSC3, which causes the neurological disease pontocerebellar hypoplasia type 1B (39).This autosomal recessive disease is characterized by severe atrophy and progressive hypoplasia of the pons and cerebellum (40)(41)(42).Since the initial report of mutations in EXOSC3, more mutations have been identified and described in EXOSC1, EXOSC2, EXOSC5, EXOSC8, and EXOSC9 (16,17,(43)(44)(45)(46).All patients, with the exception of individuals with mutations in EXOSC2, suffer from cerebellar atrophy, at least to some extent.Patients with mutations in EXOSC2 present with a syndromic condition that consists of short stature, hearing loss, retinitis pigmentosa, distinctive facies, and mild intellectual disability (44,47).Why mutations in genes encoding structural subunits of the RNA exosome impact the cerebellum is not at all clear.
A number of protein cofactors associate with the RNA exosome to confer RNA target specificity.Several RNA exosome cofactors that were originally identified and characterized in budding yeast are conserved in human (4,(48)(49)(50)(51)(52)(53).For example, the RNA exosome requires helicases for proper RNA processing as RNAs with significant secondary structure are unable to enter the central channel of the hexameric ring of the RNA exosome (5).A nuclear helicase, termed MTR4 (alternatively named SKIV2L2), interacts with the RNA exosome and facilitates processing of RNA in the nucleus (7,8).A well-characterized cytoplasmic helicase, termed SKIV2L in humans or Ski2 in budding yeast, directly interacts with the RNA exosome and assists in cytoplasmic rRNA processing (49,51).The RNA exosome also interacts with nuclear scaffolding proteins, such as M-phase phosphoprotein 6 (alternatively named MPP6), and other associated ribonucleases, such as EXOSC10 (human) or Rrp6 (budding yeast) (7,8).One potential hypothesis to explain how single amino acid changes in structural subunits of the RNA exosome cause disease is that modest changes in the subunits alter interactions of the complex with cofactors required to target and subsequently process or degrade specific RNAs.As the RNA exosome is essential for fundamental processes, such as the production of mature ribosomes, a complete loss of function in patients seems unlikely.Why the majority of missense mutations identified in patients with exosomopathies cause clinical consequences most notable within regions of the brain remains unclear.
In this study, we examined the interactome of the RNA exosome in a neuronal cell line.An unbiased mass spectrometry approach identified a number of candidate binding partners.We identified the putative RNA helicase DDX1 as a protein that interacts with EXOSC3 and explored the shared functions.We found that DDX1 interacts with EXOSC3 in the nucleus that the interaction is DNA damage-sensitive and that the depletion of EXOSC3 or DDX1 results in significant changes in R-loops.Together, these findings suggest a novel aspect of RNA exosome function and regulation that is required for gene expression control.
Figure 1.RNA exosome subunits coimmunoprecipitate with tagged EXOSC3.A, the RNA exosome is a conserved exo/endoribonuclease complex that comprises ten subunits.Nine of the ten subunits are structural and termed exosome components.EXOSC1, EXOSC2, and EXOSC3 make up the cap and EXOSC4, EXOSC5, EXOSC6, EXOSC7, EXOSC8, and EXOSC9 make up a barrel-shaped core.In this graphic, EXOSC3 (navy) and EXOSC9 (green) are highlighted.EXOSC6, EXOSC7, and EXOSC8 are positioned behind subunits EXOSC4, EXOSC5, and EXOSC9, and consequently are not visible.The catalytic subunit, DIS3 or DIS3L, sits at the base of the complex [PDB 6H25 (6)].B, EXOSC9 core subunit coprecipitates with myc-EXOSC3 from murine neuronal N2A cell line.Cells were transfected with a plasmid encoding Vector control or myc-EXOSC3, followed by immunoprecipitation using anti-myc magnetic beads.Input for Vector control and myc-EXOSC3 was probed by an anti-myc antibody and a band corresponding to the molecular weight is detected in the input but not Vector control for myc-EXOSC3.Input for Vector control and myc-EXOSC3 is probed by an anti-EXOSC9 antibody and a band at the corresponding molecular weight is present in both lanes.Bound for Vector control and myc-EXOSC3 is probed by an anti-myc and an anti-EXOSC9 antibody, and a band corresponding to the molecular weight is detected in the bound fraction for myc-EXOSC3 but not for Vector control.Stain-free blot indicates the loading of total protein in the input.Immunoprecipitation of the myc-tagged EXOSC3 copurifies with the endogenous EXOSC9 subunit.C, eluates of the bound myc-EXOSC3 immunoprecipitation were analyzed by LC-tandem mass spectrometry.A table shows all RNA exosome subunits detected, listing the peptidespectrum matches (PSM) and the peptide numbers for each subunit.Vector IP serves as a control.IP, immunoprecipitation.

Proteomics reveal a suite of EXOSC3 interactors
To identify RNA exosome-interacting proteins in a neuronal cell line, we transiently transfected a plasmid encoding myc-tagged EXOSC3 into a murine neuroblastoma cell line (N2A) and purified coprecipitated proteins using antimyc magnetic beads as described in Experimental procedures.Immunoblots shown in Figure 1B confirm that myc-EXOSC3 is enriched in the bound fraction compared with the vector control.The core subunit EXOSC9 of the RNA exosome coprecipitates with myc-EXOSC3, suggesting myc-tagged EXOSC3 associates with the RNA exosome complex.We then subjected the immunoprecipitates to LC-MS/MS as described in Experimental procedures.The table in Figure 1C shows that all subunits of the RNA exosome complex were detected in the myc-EXOSC3 immunoprecipitation as compared to samples from the control.
The RNA exosome-interacting proteins identified by mass spectrometry were analyzed using the Panther Gene Ontology (GO) program and organized by protein class (Fig. 2A).A complete list of the interacting proteins is provided in Table S1.We excluded all proteins for which the peptide spectra matches (PSMs) log 2 ratio was less than or equal to zero.We examined 955 proteins for which the PSM equated to greater than zero.The largest category of the GO protein classes is "translation" containing 120 proteins and the second largest is "nucleic acid metabolism and binding" with 114 proteins.Within the latter category, all the RNA exosome subunits and some known cofactors, including MTR4 (also known as SKIV2L2) and MPP6, are present.Figure 2B shows cofactors (green) and potential RNA exosome-interacting candidates selected for further analysis (blue).The PSM and peptide numbers are low for even well-established cofactors, and therefore we used literary analysis and a higher PSM and peptide number to inform decisions to explore specific interactors.For this analysis, we opted to focus on nucleic acid metabolism/binding instead of translation because several cytoplasmic interactions between the RNA exosome and cofactors have been well characterized (49,50) and interactions between the RNA exosome and ribosome subunits have been defined (51,52).

Putative helicase DDX1 interacts with the RNA exosome in the nucleus
While we analyzed a number of candidates, we focused on the putative RNA helicase DDX1 for several reasons.RNA Figure 2. Novel EXOSC3/RNA exosome interactors identified using LC-tandem mass spectrometry.A, pie chart that organizes proteins that coprecipitate with myc-EXSOSC3 from murine neuronal N2A cell line by protein class.Peptide-spectrum matches (PSM) were analyzed using a log 2 ratio, so that any result above 0 indicates binding in myc-EXOSC3 IP over vector IP.Results that are less than or equal to 0 were excluded from the analyses.Proteins that contained a PSM beyond the log 2 cut-off of 0 were analyzed by Panther gene ontology terms.The number of proteins within a class is indicated inside the pie slices.B, the table (left) shows selected RNA exosome interactors detected and lists the PSM and Peptide number for each protein.Vector IP serves as the control.The gene list corresponding to the proteins is provided as Table S1.Note that MPP6 is the protein name and MPHOSPH6 is the gene name.MTR4 is the most common name for this helicase but the name appears as SKIV2L2 in Table S1.A short list of nucleic acid metabolism/binding proteins (right) is provided.The RNA exosome subunits (bold, navy) reside in the nucleic acid metabolism/binding subcategory, together with known RNA exosome cofactors (green).Candidates investigated as novel RNA exosome interactors are listed in blue, including the putative RNA helicase, DDX1.IP, immunoprecipitation; MMP6, M-phase phosphoprotein 6.
helicases play critical roles in various aspects of RNA metabolism, including RNA degradation and processing (54).Additionally, multiple conserved RNA exosome cofactors are helicases (7,51).Based on sequence homology, DDX1 is a putative helicase containing a conserved DEAD amino acid sequence motif shared by nucleic acid-unwinding, ATPbinding, DEAD-box proteins (54)(55)(56).DDX1 differs from other members of the DEAD-box family, as it includes an Nterminal SPRY protein interacting domain upstream of two helicase domains, between the phosphate-binding P-loop and the ssDNA binding Ia motifs (Fig. 3A) (54).The DDX1 protein is implicated in rRNA processing (57), R-loop formation (58), and double-strand break repair (59,60).
Initial studies employed epitope-tagged, transiently transfected EXOSC3.To test interactions with endogenous EXOSC3, we raised a rabbit polyclonal antibody against mouse EXOSC3.To test the specificity of this newly generated antibody, we depleted cells of EXOSC3 by transfecting N2A cells with two independent siRNA oligonucleotides that target EXOSC3 and performed immunoblotting (Fig. S1A).In total N2A cell lysate, the antibody detects a prominent band at the predicted size of EXOSC3 (calculated molecular weight of 29.5 kDa), which is specifically enriched by immunoprecipitation (Fig. S1B).This band is significantly decreased when cells are treated with either siRNA-targeting EXOSC3 as compared to the scramble control, providing evidence for the specificity of the antibody generated.
Initial attempts to validate putative RNA exosome-interacting proteins identified by mass spectrometry in whole-cell lysate were unsuccessful.Thus, we considered the fact that many RNA exosome cofactors localize to specific cellular compartments (4), and we examined interactions with endogenous EXOSC3 using cellular fractionation as described in Experimental procedures.Fractionation was confirmed via immunoblotting for the cytoplasmic marker HSP90 (61) and the nuclear protein B23 (62) (Fig. S1C).A band corresponding to the B23 is enriched in the nuclear fraction and absent in the cytoplasmic fraction, indicating efficient nuclear isolation.However, a band corresponding to HSP90 in the nuclear fraction suggests some cytoplasmic adulteration.We immunoprecipitated endogenous EXOSC3 from both the nuclear and cytoplasmic fractions, then used SDS-PAGE and immunoblotting to assess copurification.DDX1 is detected in the input of both the nuclear and cytoplasmic lysates, consistent with reported localization (Fig. 3B) (63,64).However, DDX1 is present in the bound fraction for only the nucleus and not the cytoplasm.DDX1 was not detected in any of the bound fractions for control immunoglobulin G (IgG) samples.EXOSC9, a core component of the RNA exosome complex is detected in both the cytoplasm and nucleus inputs as well as the bound fractions, consistent with the fact that the RNA exosome complex is present in both compartments.EXOSC3 is enriched in the bound fractions in both the cytoplasm and nucleus, and not in control IgG (Ctrl IgG).These data suggest a compartmentspecific interaction between EXOSC3 and DDX1 in the nuclear fraction.To assess whether the interaction detected between EXOSC3 and DDX1 is RNA-dependent, we treated the nuclear N2A cell lysate with RNase A. A urea-PAGE gel confirmed that RNase A degraded total RNA from the lysate (data not shown).We found that the treatment with RNase A increases the interaction between EXOSC3 and DDX1 with no detectable effect on the steady-state level of either protein (Fig. 3C).This experiment suggests the interaction is not RNAdependent and the significant increase in the interaction with RNase A treatment has been suggested to indicate a protein-protein interaction (65).

The interaction between EXOSC3 and DDX1 decreases in response to DNA damage
As both the RNA exosome and DDX1 play roles in DNA damage repair (34,59,60), we tested a potential model where the RNA exosome and DDX1 could cooperate in a cellular response to DNA damage.In such a model, DNA damage might be expected to enhance the interaction between EXOSC3 and DDX1.For this experiment, we treated N2A cells with camptothecin (CPT), a topoisomerase inhibitor that induces double-strand breaks (66), or control PBS as described in Experimental procedures.To confirm DNA damage induced by treatment with CPT, we probed for γH2AX, a classic marker of double-strand breaks (67).As shown in Figure 4A, following treatment with CPT, the γH2AX signal increases markedly compared with the PBS control.We used the EXOSC3 antibody to immunoprecipitate endogenous EXOSC3 from the nuclear fraction of cells treated with CPT and probed the immunoprecipitate for DDX1 (Fig. 4B).As shown in Figure 4C, the interaction between EXOSC3 and DDX1 is significantly reduced following treatment with CPT, without significantly altering the steady state protein levels (pvalue > 0.15, data not shown).In contrast, no change was detected in the interaction between EXOSC3 and EXOSC9.There was no change to the compartmentalized localization of the interaction between EXOSC3 and DDX1, following treatment with CPT (Fig. S2A).To assess whether another method for inducing double-strand breaks also decreases the interaction between EXOSC3 and DDX1, we treated cells with the DNA-damaging agent bleomycin (68).As shown in Fig. S2B, similar to treatment with CPT, induction of double-strand breaks with bleomycin decreases the interaction between EXOSC3 and DDX1.Taken together, these results do not support a model, where the RNA exosome and DDX1 cooperate in DNA damage response at least for these double-strand break-inducing agents.

Depletion of EXOSC3 or DDX1 results in rRNA processing defects
To explore potential shared functions of the RNA exosome and DDX1, we optimized conditions to siRNA deplete each protein from N2A cells.We transfected N2A cells with siRNA scramble control (scramble) or EXOSC3 siRNA, then performed an immunoblot to confirm depletion (Fig. 5A).Knockdown was quantified across the three biological replicates (Fig. 5B).Similarly, N2A cells were transfected with DDX1 siRNA, and depletion was confirmed by immunoblot (Fig. 5C).The knockdown was quantified across the three biological replicates (Fig. 5D).Thus, we were able to substantially deplete each protein to less than 15% remaining (Fig. 5, B and D), providing a model to explore and compare the consequences of loss of each of these proteins.
The RNA exosome has a well-defined role in rRNA processing and maturation (1,2,21,22,69).DEAD-box helicases such as DDX1 also play a critical role in RNA processing and genome stability (65,70).A previous study that analyzed DDX1 KO mouse embryonic stem cells employed a pulsechase experiment utilizing [ 3 H]-uridine-labeled samples and showed an accumulation of precursor 28S rRNA and mature 18S rRNA, suggesting a role for DDX1 in rRNA processing (57).We performed a detailed analysis to explore rRNA processing in cells depleted of either EXOSC3 or DDX1.Using Northern blotting to detect specific rRNA precursors, we examined which rRNA species are affected upon loss of either EXOSC3 or DDX1. Figure 6A depicts the steps of murine rRNA processing from the early precursor 47S to mature rRNA.rRNA processing begins with the 47S precursor, which generates several downstream precursors, including 32S and 12S (71,72).The 47S precursor also produces the 18S, 5.8S, and 28S mature rRNAs, as well as the internal transcribed spacers 1 and 2 (ITS1 and ITS2), and the 5 0 and 3 0 external transcribed spacers (5 0 ETS and 3 0 ETS).To capture these precursors, we used probes specific to the ITS2 sequence.Figure 6B shows that the steady-state levels of the 12S precursor increases when EXOSC3 is depleted compared to the scramble control, consistent with the most well-defined role of the RNA exosome in 3 0 trimming to produce mature 5.8S (1, 2, 22, 29, 69).In contrast, when cells are depleted of DDX1, there is a decrease in the steady-state levels of the 12S precursor compared to the scramble control.When using probes to detect 5.8S rRNA in the same samples, we detect a decrease in 5.8S level after depletion of EXOSC3 but no significant change when DDX1 is depleted.The 7SL signal recognition particle transcript, which is not a target of the RNA exosome (73), is used as a loading control.The Northern blot data are quantified for all analyses in Figure 6C and normalized to the loading control.The data from Figure 6, B and C are also summarized within Figure 6A, as indicated by the up and down arrows to denote statistically significant increases or decreases in these RNA species.Although both EXOSC3 and DDX1 clearly have an impact on rRNA processing or maturation, the roles in this process appear to be independent of each other.

R-loops are globally reduced upon depletion of EXOSC3 or DDX1
Multiple studies have linked DDX1 to R-loops, which are three-strand nucleic acid structures comprised of an RNA-DNA hybrid and a single strand of DNA (58,70,74).DDX1 has been reported to coprecipitate with R-loops and promote R-loop formation by unwinding complex RNA (58,70,74).Studies have also linked the RNA exosome to R-loop regulation in murine B-cells (75).To understand the impact EXOSC3 and DDX1 may have on R-loops, we performed DNA/RNA- immunoprecipitation followed by high-throughput sequencing (DRIP-seq) in cells depleted of either EXOSC3 or DDX1.In cells depleted of EXOSC3, we identified 722 significantly increased R-loop regions and 935 decreased R-loop regions (Fig. 7A, left, n = 3, false discovery rate [FDR] < 0.05).In cells depleted of DDX1, 638 increased R-loop regions and 1058 decreased R-loop regions are identified (Fig. 7A, right, n = 3, FDR < 0.05).We then compared the R-loop regions that showed statistically significant changes in cells depleted of EXOSC3 or DDX1.We found that 140 out of 722 increased R-loop regions (19%) in EXOSC3-depleted cells are also increased in DDX1-depleted cells (Fig. 7B, left, Chi-squared test, p-value < 0.0005) and 425 out of 935 decreased R-loop regions (45%) in EXOSC3 knockdown cells are also decreased in DDX1 knockdown cells (Fig. 7B, right, Chi-squared test, p value < 0.0005).These results suggest that the two proteins may cooperate to alter a common set of R-loops.
EXOSC3-and DDX1-depleted cells share common R-loop regions that increased or decreased.We employed Panther GO analysis on these R-loops (Fig. 7C).The R-loop regions that are increased in both EXOSC3 and DDX1 knockdowns (n = 140) are enriched in the categories of protein folding, histone modifications, stress responses, and RNA metabolic processes when grouped by fold enrichment.The R-loop regions that are decreased in both EXOSC3 and DDX1 knockdown conditions (n = 425) are also enriched in the RNA metabolism category but additionally in categories including axon guidance and neuronal development.
In Figure 7D, we grouped both increased and decreased R-loop regions by RNA transcript class present within the region corresponding to the altered R-loops.Both EXOSC3 and DDX1 depletion affects R-loops within genes encoding different classes of RNA.We grouped all changed R-loop regions, both increased and decreased, after EXOSC3 depletion and found that the largest category of RNA significantly affected is protein coding (n = 1496).We then excluded protein-coding genes from the analysis to allow a clearer view of the noncoding transcripts and identified 80 ncRNAs, 55 microRNAs, nine snoRNAs, eight pseudoRNAs, six long noncoding RNAs (lncRNAs), one scRNA, and one telomerase RNA within the altered R-loop regions.An analysis following DDX1 depletion revealed similar results.We excluded 1509 Rloop regions that mapped to protein-coding genes and identified 95 ncRNAs, 61 microRNAs, 13 snoRNAs, 12 pseu-doRNAs, and five lncRNAs.ncRNAs are those transcripts that do not currently have a more specific distinction.
In parallel with DRIP-seq, we employed RNA-seq after rRNA depletion to identify transcripts altered by depletion of EXOSC3 or DDX1.The pipeline employed for this analysis focused on coding regions, so data presented represent changes in mRNA transcripts.We identified 1757 significantly increased transcripts and 2192 decreased transcripts in EXOSC3 knockdown cells (Fig. 8A, left, n = 3, FDR < 0.05).In DDX1 knockdown cells, 734 increased transcripts and 968 decreased transcripts are identified (Fig. 8A, right, n = 3, FDR < 0.05).We then compared the transcripts that showed statistically significant changes in EXOSC3-and DDX1depleted cells.We found that 322 out of 1757 increased mRNA transcripts (18%) in siEXOSC3 cells are also increased in siDDX1 cells (Fig. 8B, left, Chi-squared test, p-value < 0.0005) and 599 out of 2192 decreased mRNA transcripts (27%) in siEXOSC3 cells are also decreased in siDDX1 cells (Fig. 8B, right, Chi-squared test, p-value < 0.0005), suggesting a coordination between EXOSC3 and DDX1 in commonly regulating a critical set of genes.
With both DRIP-seq and RNA-seq datasets in hand, we created a pipeline to compare results and filter the DRIP regions through the differential mRNA sequencing data to focus on the overlapping similarity between R-loops and changes in transcript level at those specific R-loop regions (Fig. 9A).We applied the pipeline to generate a heatmap to illustrate the DRIP regions and mRNA transcripts that changed upon depletion of either EXOSC3 or DDX1 (Fig. 9B).There are 466 R-loop regions that overlap in the RNA-seq data for both depletions.Of these overlapping Rloop regions, 103 are increased, and 363 regions are decreased.Many of the decreased regions mapped to genes that are involved in RNA metabolism, RNA regulation, translation processes, and neuronal development.We employed the integrative genomics viewer (IGV), which enables the visualization of these regions (Fig. 9C).To illustrate the effect on specific loci, we focused on two genes that contained R-loop regions significantly changed within transcripts that are also significantly changed and which fell under the biological processes aforementioned: Ints6 and Celf4.The red marker above the gene (orange) in the IGVs of Ints6 denotes the R-loop region, indicating a reduction in gene expression at these loci.In the three panels on the right, Celf4 R-loop regions are marked with a red line above the gene (orange).The two left IGV panels corresponding to Celf4 show increased R-loop and transcript regions.In the far right IGV panel corresponding to Celf4, the R-loop regions are  Filtering DRIP reads through RNA-seq revealed genes that are simultaneously affected by depletions of EXOSC3 or DDX1.A, a graphical representation of the pipeline used to focus on specific genes using the analysis funnel.N2A cells siRNA depleted of either EXOSC3 or DDX1 were subjected to both DRIP-and RNA-seq.We filtered results from the DRIP-seq through the RNA-seq reads, using only mRNA transcripts.We were then able to produce heatmaps and examine transcriptomic regions using an integrated genomics viewer (IGV).B, using the pipeline described, we created heatmaps showing the landscape of increased and decreased R-loop regions upon depletion of either EXOSC3 (blue) or DDX1 (green).C, the IGV images of Ints6 and Celf4.The chromosome is displayed at the top of the window.The span lists the number of bases currently displayed.The tick marks indicate the chromosome locations.The red line marks the regions in which R-loops are significantly changed.The top track displays the Mus musculus reference genome (NCBI37/ mm9) in orange.The following three tracks display the R-loop regions of interest corresponding to scramble, siEXOSC3, and siDDX1, respectively.The middle three tracks display the RNase H-treated R-loop regions of interest corresponding to scramble, siEXOSC3, and siDDX1, respectively.The last three tracks display the transcript regions of interest corresponding to scramble, siEXOSC3, and siDDX1, respectively.Celf4 exhibited three changed regions in the genes and is displayed by separate panels.D, quantification of Ints6 and Celf4 transcripts in scramble, siEXOSC3, and siDDX1 by fragment per kilo million reads (FPKM).DNA/RNA-immunoprecipitation followed by sequencing.
significance (Fig. 9D).Altogether, these data indicate that depletion of either EXOSC3 or DDX1 results in changes in Rloop regions that do not necessarily correspond to a similar change in transcript levels for that same gene.

Discussion
In this study, we used a proteomic approach to identify RNA exosome-associated proteins in a neuronal cell line and identified an interaction between EXOSC3 and the putative RNA helicase DDX1.Although each protein is present in both the nucleus and the cytoplasm, the interaction was only detected in the nuclear fraction.To explore possible shared functions between EXOSC3 (RNA exosome) and DDX1 in which each of these proteins are implicated, we examined the EXOSC3-DDX1 interaction upon DNA damage and effects on rRNA processing and on R-loop distribution upon depletion of EXOSC3 or DDX1.Our results suggest that EXOSC3 and DDX1 participate in shared regulation of R-loops and the transcripts produced within the genomic regions that form those R-loops.Taken together, these data define a potential mechanism by which an interaction between the RNA exosome complex and an RNA helicase could modulate R-loops.
The RNA exosome complex interacts with a number of proteins, termed cofactors, to confer specificity for different RNA targets (53).Many studies, including a number of elegant structures (5-8, 10, 49, 76-78), show that the RNA exosome consists of a core set of subunits and cofactors that are present in one-to-one stoichiometry.The core RNA exosome then interacts with a number of different proteins to facilitate degradation or processing of many different RNA targets.These protein-protein binding events may be transient as the RNA exosome interacts with multiple cofactors at shared binding sites that have been revealed by biochemical and structural studies (7).Indeed, only a single peptide was identified in the mass spectrometry data in this study for the wellcharacterized RNA exosome cofactors, MTR4 (alternatively named SKIV2L2) and MPP6.These results provide evidence that the interactions between the RNA exosome and cofactors are likely to be dynamic and transient.
Beyond the previously characterized RNA exosome cofactors, this study identified several candidate RNA exosomeinteracting proteins, which are located in different cellular compartments.For example, DDX1 is present in both the nucleus and the cytoplasm (64), while the Pumilio proteins, PUM1 and PUM2, are reported to control RNA stability exclusively in the cytoplasm (79).Thus, further studies could explore whether the interactions identified in whole-cell lysate occur preferentially in one cellular compartment or another.Although DDX1 is readily detected in both the nucleus and the cytoplasm (Fig. 3B) (64), robust interaction between EXOSC3 and DDX1 was only detected in the nuclear lysate.This compartment-specific interaction could mean that the RNA exosome and DDX1 interact in the context of chromatin.However, treatment with DNase I did not substantially decrease the EXOSC3-DDX1 association (Fig. S1D).One possibility as to why the EXOSC3-DDX1 interaction is compartment-specific is that posttranslational modifications (PTMs) could modulate binding.Studies in Schizosaccharomyces pombe have revealed PTMs in Dis3, Mtr4, Rrp40 (EXOSC3), Rrp43 (EXOSC8), and Rrp46 (EXOSC5), though only PTM mimetics in Dis3 and Mtr4 impacted RNA processing (80).No studies have explored whether PTMs regulate the function of DDX1.Such studies in the future could provide insight into how the RNA exosome complex dynamically interacts with so many different cofactors to target a large number of distinct RNAs.
Though we have identified and characterized the interaction between the RNA exosome cap subunit EXOSC3 and DDX1, we have not yet explored whether this interaction is direct or indirect.As EXOSC3 is a component of the RNA exosome complex, DDX1 may interact with EXOSC3 or with other RNA exosome subunits.Alternatively, as this interaction was identified through copurification, the interaction could be indirect and mediated by RNA exosome cofactors or other proteins.While we detect all RNA exosome subunits copurifying with EXOSC3, these experiments do not demonstrate that DDX1 interacts directly with any RNA exosome subunit beyond EXOSC3.The interaction between EXOSC3 and DDX1 increases significantly upon RNase treatment, particularly RNase A (Figs. 3C and S2C), a phenomenon suggested to occur for protein-protein interactions (65).One possible explanation for why the EXOSC3-DDX1 interaction increases upon digestion of RNA may be that RNA is bound between the two proteins and potentially interfering or competing for the same binding site.In one conceivable model, DDX1 could unwind RNA for degradation/processing by the RNA exosome and the removal of RNA increases interaction with EXOSC3.A previous study reported a similar observation when examining the interaction between hnRNPK and DDX1 (65).Thus, future studies will be required to further characterize the interaction to test whether DDX1 could function as an RNA exosome cofactor like multiple other helicases.
A logical model to explain the interaction between the RNA exosome and DDX1 was that these factors could cooperate in response to DNA damage.The RNA exosome and DDX1 have both been implicated in double-strand break repair by homologous recombination and nonhomologous end joining (23,34,58,59).If these factors worked together to respond to DNA damage, we speculated that inducing DNA damage would increase this interaction.In contrast to this prediction, we found that double-strand breaks induced by treatment with the topoisomerase inhibitor CPT significantly reduced the interaction (Fig. 4B).This result suggests that as DDX1 is recruited to sites of DNA damage (59,60), the interaction with the RNA exosome is lost.This finding led us to consider the possibility that the RNA exosome and DDX1 could share a function in cellular homeostasis in the absence of DNA damage.Alternatively, another model that cannot yet be eliminated is that the RNA exosome and DDX1 could cooperate to respond to specific types of DNA damage, which have not yet been tested.
In addition to DNA damage response, both the RNA exosome and DDX1 have been implicated in rRNA processing.
The best-defined role for the RNA exosome is 3 0 trimming to produce mature 5.8S RNA (1,2,22,69), while DDX1 has been implicated in the accumulation of a number of rRNA species (57).Analysis of rRNA processing in cells depleted of either EXOSC3 or DDX1 revealed that loss of these proteins alter some shared rRNA species, albeit in distinct manners (Fig. 6).For example, the mouse precursor of mature 5.8S rRNA, 12S rRNA, accumulates in cells depleted of EXOSC3 with a concomitant decrease in mature 5.8S.In contrast, depletion of DDX1 led to a decrease in the level of the 12S rRNA precursor.While these results show that some of the same rRNAs are impacted by the loss of either EXOSC3 or DDX1, the impact is different.Though prior studies in human and yeast have elucidated rRNA processing steps upon impairment of the RNA exosome or DDX1, the analyses here provide novel insight into the specific precursors are affected in mice.
Helicases belonging to the DEAD-and DExH-box families such as DDX1 and MTR4 play important roles in RNA processing beyond rRNA.R-loops are also a common target of these helicases.Though R-loops are necessary for cellular maintenance, these structures can pose a threat to the genome if they accumulate (35,81).DEAD/DExH-helicases are critical for resolving and regulating R-loops as they unwind the nucleic acid structures for subsequent degradation by ribonucleases.For example, DDX1 has been reported to unwind G-quadruplex structures that can stabilize R-loops during transcription (58).MTR4, a well-characterized nuclear cofactor, unwinds R-loops and degrades RNA in complex with the RNA exosome (82).Despite data suggesting that both DDX1 and MTR4 could help resolve R-loops, a number of differences between the helicases exist.For example, the SPRY protein binding domain in the N-terminus upstream of two helicase domains (Fig. 3A) is unique to DDX1.The SPRY domain in DDX1 is inserted between a phosphate-binding Ploop motif and an ssDNA binding Ia motif, separating the motifs by 240 residues, instead of the usual 20 to 40 residues seen in other DEAD-box proteins (54).Examination of the SPRY domain of DDX1 compared with the protein-protein interface of MTR4 and the RNA exosome may reveal surface sites of interaction and would potentially explain why a SPRY domain lies upstream of two helicase domains within the DDX1 protein.Structures indicate that MTR4 interfaces with MPP6, which tethers the helicase to the RNA exosome cap (7,8).This interface could be a shared docking site for these helicases.Further studies on whether DDX1 and MTR4 may have some common functions would also shed light on the cellular roles of these proteins.
Many reports link either the RNA exosome or DDX1 to the modulation of R-loops (34,58,74,75,81).However, no reports to date directly compare the RNA exosome and DDX1 in R-loop modulation.One previous study analyzed a wellcharacterized R-loop region within the BAMBI gene by mass spectrometry and provided a detailed list of the proteins that coimmunoprecipitated with R-loops in this region (74).DDX1, EXOSC3, and several other subunits of the RNA exosome were detected in the association with this BAMBI R-loop.Consistent with this previous study, the DRIP-seq performed here identified R-loop regions in the BAMBI gene significantly altered in cells depleted of either EXOSC3 or DDX1 (Fig. S4).These findings support a model where, at least for the BAMBI R-loop region, both EXOSC3 and DDX1 are colocated and may contribute to R-loop modulation.
We discovered several genes for which R-loop regions are significantly altered in the same manner by depletion of either EXOSC3 or DDX1.We coupled the genome-wide DRIP-seq analysis with RNA-seq analysis to focus on regions where changes in R-loops are associated with changes in transcript levels.This analysis revealed several common findings.While depletion of either EXOSC3 or DDX1 caused both some increases and some decreases in the number of R-loops detected, data showed larger numbers of decreased R-loop regions as compared to increased R-loop regions (Fig. 7A).Considering these shared changes, GO analysis revealed that the genes located in regions that show increased R-loops map to genes implicated in a variety of cellular responses to stress, which could reflect a requirement for proper RNA exosome and/or DDX1 function to support normal cell physiology.Strikingly, the only GO term enriched more than 5-fold among regions with decreased R-loops is "anterior/posterior axon guidance" (fold enrichment > 35).This finding could suggest that these regions of the genome are less accessible and perhaps less actively transcribed when either EXOSC3 or DDX1 is depleted, suggesting a possible link to the neurological disorders that are caused by missense mutations in genes encoding structural subunits of the RNA exosome.
To define potential shared transcript targets of EXOSC3 and DDX1, we also performed RNA-seq analysis using a pipeline that distinguished protein-coding transcripts from other RNAs.This analysis revealed that more transcripts show a significant change (FDR < 0.05) in levels upon depletion of EXOSC3 (3949 transcripts) as compared to DDX1 (1702 transcripts).Notably, the percentage of transcripts increased (56% for siEXOSC3; 57% for siDDX1) and decreased (44% for siEXOSC3; 43% for siDDX1) are similar in both cases; in fact, decreased transcripts slightly outnumber increased transcripts for both depletions.Interestingly, the data reveal a large number of shared transcripts that are regulated, suggesting a shared role in modulating a subset of transcripts or a shared cellular response at the transcript level.To integrate the DRIPseq and RNA-seq and provide insight into how changes in Rloops could correlate with altered gene expression within the region of the genome where the R-loop is located, we also narrowed the analysis to only consider regions where both R-loops and transcript levels were changed when either EXOSC3 or DDX1 was depleted with a focus on the shared changes.This analysis provided interesting sets of data to consider.One notable point is that the overall patterns of Rloop and transcript changes displayed in the heatmaps (Fig. 9B) appear similar for both EXOSC3 and DDX1 depletions with regions of increased R-loops sharing more increased transcripts and conversely for regions of decreased R-loops.To delve into the data in more detail, we identified altered R-loops linked to RNA metabolism, DNA repair, and/ or neurodevelopment.Two examples from this analysis are Ints6 and Celf4.Ints6 encodes for a protein that is a noncatalytic member of the Integrator complex, a protein complex involved in the processing of snRNAs, resolving lncRNAs, and metabolizing mRNAs (83,84).The integrator complex, containing INTS6, is involved in transcription termination of lncRNAs that are synthesized at sites of double-strand breaks, also known as damage-induced long ncRNAs (83,85).Celf4 encodes for an RNA-binding protein that regulates mRNA stability and binds 3 0 UTRs (86).Celf4 has also been linked to neurodevelopmental disorders and expression is enriched in the central nervous system (87,88).These transcripts may be regulated by the RNA exosome and DDX1 in neurons and dysregulation could contribute to the neurological pathology that occurs in many exosomopathy patients.These two examples represent just a subset of the many altered genomic regions and transcripts that could contribute to cellular dysfunction.
The overall goal of this study was to identify RNA exosomeinteracting proteins present in a neuronal cell line with the underlying hypothesis that neuronal-specific interactions could be lost in exosomopathies, which show primarily neuronal pathologies despite the ubiquitous expression of the RNA exosome.While the identification of an interaction with an additional RNA helicase is exciting, DDX1, like the RNA exosome, is ubiquitously expressed.DDX1 has not been definitively linked to any monogenic disease, but there is a report within the undiagnosed disease network of an individual with a missense mutation (p.Thr280Arg), which would fall within the catalytic domain of DDX1.This individual is reported to suffer from seizures and developmental regression, which could suggest that DDX1, like the RNA exosome, is required for normal function in some regions of the brain.In summary, we identified and characterized a new interacting partner of the RNA exosome, and studies thus far suggest that DDX1 could interact with the RNA exosome to modulate R-loop accumulation at some loci and to regulate transcript levels of some shared transcripts.Taken together, this study suggests the interaction between the RNA exosome and RNA helicase DDX1 may be required for neuronal-specific gene regulation.

Experimental procedures
Cell culture, siRNA transfections, and compound treatments A mouse neuroblastoma cell line, Neuro2A (N2A) (American Type Culture Collection) was maintained in a humidified incubator with 5% CO 2 at 37 C in Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum, 100 units/ml penicillin G and 100 mg/ml streptomycin antibiotics (1%).Individual siRNAs were transfected into cultured cells using Lipofectamine 2000 (Invitrogen) and Opti-MEM (Thermo Fisher Scientific) according to the manufacturer's protocol in antibiotic-free media.The authenticity of the cell line was validated using short tandem repeat profiling.In addition, the cell line was tested for mycoplasma, and cells were free from mycoplasma contamination for all experiments.Three 60 mm plates were seeded for each experimental condition.Each 60 mm plate was further seeded into three wells in 6-well plates.The 6-well plates were further processed for immunoblotting, DRIP-seq, or RNA-seq.Therefore, each biological replicate for all conditions was subjected to the same analysis.The siRNAs employed were: scramble negative control (Integrated DNA Technologies; 51-01-14-04), EXOSC3 siRNA 1 (IDT; mm.Ri.Exosc3.13.1), and EXOSC3 siRNA 2 (IDT; mm.Ri.Exosc3.13.5).If required, compounds were added to the media: CPT (5 μM, 1 h; Combi-Blocks ST-5997) and bleomycin (25 μg/ml, 3 days; Cayman Chemical 13877).

Protein purification and antibody production
The Exosc3 open reading frame encoding the mouse EXOSC3 protein was cloned into pGEX-6P-2 plasmid (GE Healthcare Life Sciences [now Cytiva]) to create an N terminally GST-tagged EXOSC3 construct.Recombinant GST-EXOSC3 was expressed in Escherichia coli Rosetta 2 (DE3).The GST fusion protein was purified by affinity chromatography on GSH-Sepharose (GE Healthcare Life Sciences [now Cytiva]) in 300 mM NaCl, 50 mM Hepes/NaOH (pH 7.5), 5% glycerol and 2 mM beta-mercapto-ethanol.The fusion protein was eluted by addition of 30 mM reduced GSH.The GST tag was removed using PreScission protease (GE Healthcare Life Sciences [now Cytiva]), and the EXOSC3 protein was further purified by collecting the flowthrough of a second affinity chromatography on GSH-Sepharose resin.The untagged protein was further purified on a Superdex200 size-exclusion chromatography column equilibrated in 150 mM NaCl, 30 mM Hepes/NaOH (pH 7.5), 5% glycerol.Expression and purification of both recombinant GST-EXOSC3 and untagged EXOSC3 were confirmed by SDS-PAGE, followed by Coomassie staining.The purified untagged protein was used as an immunogen to raise rabbit polyclonal antibodies by Josman, LLC.Sera containing anti-EXOSC3 antibodies was collected 21 days after immunization and used directly for immunoblotting, immunoprecipitation, and immunofluorescence.

Cell fractionation
N2A cells were collected, spun down, and gently resuspended in cold fractionation buffer (10 mM Tris-HCl pH 7.4, 10 mM NaCl, 0.5% v/v IGEPAL CA-630 (Sigma)) supplemented with Pierce protease inhibitor mini tablet (Thermo Fisher Scientific A32955) for 10 min on ice.Cell lysates were centrifuged at 1000g for 5 min, and the supernatant/cytoplasmic fraction was collected.The pellet was washed once with fractionation buffer and centrifuged at 1000g for 5 min.The resulting pellet was collected as the nuclear fraction.Protein samples were resuspended in immunoprecipitation (IP) buffer (50 mM Tris-HCl pH 7.4, 100 mM NaCl, 0.5% IGEPAL CA-630 (Sigma)) supplemented with Pierce protease inhibitor mini tablet (Thermo Fisher Scientific A32955) and prepared for immunoblotting and immunoprecipitation.

Immunoprecipitation
Mass spectrometry sample preparation N2A cells were transfected with pcDNA3 plasmids containing 2× myc-EXOSC3 or pcDNA3 empty vector (Addgene) using Lipofectamine 2000 (Thermo Fisher Scientific) and Opti-MEM (Thermo Fisher Scientific) as recommended by the manufacturer.Cell pellets were resuspended and lysed in 20 mM Hepes pH 7.2, 100 mM NaCl, and 0.2% Triton X-100.Cell lysate was cleared with centrifugation for 10 min at 10,000 RPM at 4 C. Protein concentrations were determined using a standard Pierce bicinchoninic acid assay (Thermo Fisher Scientific, 23225).Total lysate (30 μg) was collected for input.For each sample, 20 μl Pierce Anti-c-Myc magnetic beads (Thermo Fisher Scientific #88842) were added to 600 μl to 700 μl of lysate and incubated at room temperature for 1 h while tumbling end over end.Beads were washed 3× and submitted for LC-MS/MS.

Endogenous IP sample preparation
Nuclear and cytoplasmic protein lysates were prepared as described previously in cell fractionation, followed by suspending in IP buffer (50 mM Tris-HCl pH 7.4, 100 mM NaCl, 0.5% IGEPAL CA-630 [Sigma]) with one Pierce protease inhibitor mini tablet (Thermo Fisher Scientific A32955).Lysates were passed through a 27-gauge syringe five times and sonicated for five pulses three times.Lysates were centrifuged at 14,000g for 15 min at 4 C and protein concentrations were determined using a standard Pierce bicinchoninic acid assay (Thermo Fisher Scientific, 23225).Lysate (30 μg) was collected for input.Dynabeads Protein G magnetic beads (Invitrogen; 10003D) were suspended in 200 to 400 μl PBS with Tween and incubated with rotation for 30 min with nonspecific rabbit IgG isotype control (Invitrogen, 31235) or an equal volume of EXOSC3 antibody (50 μg/ml) at room temperature.Clarified lysates were added to washed bead-antibody complexes.If required, enzymes were added to final lysates and washed bead-anybody complexes: RNase A (0.2 μg/μl; Invitrogen 12091-039), RNase T1 (5 U; Worthington Biochemical LS01435), RNase H (8 U; Invitrogen 18021-014), DNase I (5 U; Invitrogen 18068015), benzonase (1:10,000; Santa Cruz sc202391).The samples were incubated at 4 C overnight while tumbling end over end.After incubation, the beads were magnetized and washed three times with cold IP buffer.The protein complexes were eluted with reducing sample buffer (250 mM Tris-HCl pH 7.4, 500 mM DTT, 10% SDS, 0.5% bromophenol blue, and 50% glycerol) and prepared for immunoblotting.

Sample preparation
Digestion was performed on beads using a digestion buffer containing 50 mM NH 4 HCO 3 .The beads were then treated with 1 mM DTT at 25 C for 30 min, followed by addition of 5 mM iodoacetamide (IAA) at 25 C for 30 min in the dark.Lysyl endopeptidase (Wako) was added to the mixture at a 1:50 (w/w) enzyme to protein ratio and digestion proceeded at 25 C overnight.Samples were further digested overnight with 1:50 (w/ w) trypsin (Promega) at 25 C. Resulting peptides were desalted with a Sep-Pak C18 column (Waters) and dried under vacuum.

LC-MS/MS analysis
Dried peptide samples were resuspended in 15 μl of loading buffer (0.1% formic acid, 0.03% trifluoroacetic acid, 1% acetonitrile) and 3 μl was loaded and separated on a selfpacked C18 (1.9 μm Dr Maisch) fused silica column (25 cm × 75 μM internal diameter; New Objective) by a Dionex Ultimate 3000 RSLCNano and monitored on a Fusion mass spectrometer (Thermo Fisher Scientific).Elution was performed over a 120-min gradient at a rate of 250 nl/min with buffer B ranging from 3% to 35% (buffer A: 0.1% formic acid in water, buffer B: 0.1% formic acid in acetonitrile).The mass spectrometer cycle was programmed to collect at the top speed for 3 s cycles.The mass spectrometry scans (300-1500 m/z range, 200,000 automatic gain control, 50 ms maximum ion time) were collected at a resolution of 120,000 at m/z 200 in profile mode and the high-energy collisional dissociation tandem mass spectrometry (MS/MS) spectra (1.5 m/z isolation width, 0.5 m/z offset, 30% collision energy, 10,000 automatic gain control target, 35 ms maximum ion time) were detected in the ion trap.Dynamic exclusion was set to exclude previous sequenced precursor ions for 20 s within a 10-ppm window.Precursor ions with +1 and +8 or higher charge states were excluded from sequencing.

Data analysis
Spectra were searched using Proteome Discoverer 2.1 against mouse UniProt database (53,289 target sequences, downloaded in April 2015).Searching parameters included fully tryptic restriction and a parent ion mass tolerance (±20 ppm).Product ion tolerance was 0.6 Da.Methionine oxidation (+15.99492Da), asparagine and glutamine deamidation (+0.98480), and protein N-terminal acetylation (+42.03670) were variable modifications (up to three allowed per peptide); cysteine was assigned a fixed carbamidomethyl modification (+57.021465Da).Percolator was used to filter the peptide spectrum matches to a FDR of 1%.

Immunofluorescence
N2A cells were fixed on coverslips with 4% paraformaldehyde (Electron Microscopy Sciences) for 20 min at room temperature and permeabilized with 0.5% Triton X-100 (Sigma) for 10 min at room temperature.Coverslips were then blocked with 5% bovine serum albumin for 60 min at room temperature and incubated overnight at 4 C with primary antibody γH2AX (rabbit monoclonal; Cell Signaling 9718T, 1:400).Texas red species-specific secondary antibodies (Jackson ImmunoResearch Laboratories Inc) were used at a 1:200 dilution.Images were obtained using an Olympus BX60 inverted brightfield microscope (Olympus Life Sciences) with a 100× objective and recorded with a Moment CMOS camera (Teledyne Photometrics).Fluorescence intensity in green, blue, and red channels were measured using an automated program pipeline using Ocular v2.0 advanced scientific camera control.

Total RNA preparation
Total RNA was isolated from N2A cells using the TRIzol reagent (Invitrogen) according to the manufacturer's protocol.RNA quality was measured by 1% agarose gel electrophoresis (Fig. S3A) and by high sensitivity (HS) DNA assay (Agilent) (Fig. S3B).

Northern blot
Total RNAs from three biological replicates were separated on a 1% agarose/formaldehyde gel and transferred by capillarity overnight in 20× saline sodium citrate (3 M NaCl, 300 mM sodium citrate pH 7) onto a nylon membrane (Cytiva).Membranes were probed as indicated.Bands were quantified using Image Lab software (Bio-Rad).Probe sequences were purchased from IDT and are listed in Table S2.

DRIP and sequencing
Genomic DNA (gDNA) was extracted from 4 million N2A cells and fragmented with restriction enzymes (BsrGI, EcoRI, HindIII, SspI, XbaI, 30 U/each, NEB) following a published protocol (89) and subjected to phenol-chloroform extraction.The immunoprecipitated DNA-RNA hybrids were quantified by Qubit (ThermoFisher Scientific #Q32854).DNA-RNA hybrids (10 ng) were treated with 5 U of RNase H for 1 h and then were sonicated into small fragments (100-500 bp) using a Covaris Focused-Ultrasonicator Me220.Fragmented DNA was blunted, 5 0 phosphorylated and 3 0 A-tailed by NEBNext Ultra II End Prep Enzyme Mix following the manufacturer's instruction (New England Biolabs #E7645L).An adaptor was ligated, and USER enzyme was used for U excision to yield adaptor-ligated double-strand DNA.The DNA was then PCRamplified using barcoded PCR primers (New England Biolabs #E7335L, #E7500L).After AMPure XP bead purification (Beckman Coulter #A63881) and Qubit quantification, the libraries were sent to Admera Health for quality analysis and sequencing with Illumina with a read length configuration of 150 PE.

Library preparation and high-throughput sequencing
For the rRNA-depleted RNA-seq library, sample quality was assessed by Bioanalyzer 2100 Eukaryote Total RNA Pico (Agilent Technologies), and samples were quantified by Qubit RNA HS assay (Thermo Fisher Scientific #Q32852).rRNA depletion was performed with Ribo-zero rRNA Removal Kit (Illumina Inc), followed by NEBNext Ultra II Nondirectional RNA Library Prep Kit for Illumina per manufacturer's recommendation.Library concentration was measured by quantitative polymerase chain reaction, and library quality was evaluated by Tapestation HS D1000 ScreenTapes (Agilent Technologies).Equimolar pooling of libraries was performed based on quantitative polymerase chain reaction values.Libraries were sequenced on a HiSeq with a read length configuration of 150 PE, targeting 80 million total reads per sample (40 million in each direction).

Mass spectrometry
Proteins exhibiting a log 2 ratio greater than zero PSM were analyzed using Panther Classification system (v17.0)and those exhibiting a ratio less than zero PSM were excluded.Of the 1141 proteins that met these criteria, 751 were organized into the functional classification system protein class.The pie chart was created in Microsoft Excel.
DNA/RNA-immunoprecipitation followed by sequencing Fold enrichment of the modules was determined using Panther Classification System (v17.0) (https://doi.org/10.5281/zenodo.6799722).The total set of increased R-loop peaks (140) and decreased R-loop peaks (425) for both siDDX1 and siEXOSC3 datasets were set as the client text box input.A Fisher's exact test was used to annotate the dataset into biological processes.We used a FDR and p value cut-off <0.05 and focused on biological processes pertaining to RNA metabolism and neuron development.Fold enrichment was graphed using Microsoft Excel.

Figure 3 .
Figure 3. DDX1 coimmunoprecipitates with EXOSC3.A, a graphical representation of the domain structure of human putative DEAD-box helicase DDX1.The SPRY protein-interacting domain in DDX1 is located between a phosphate-binding P-loop motif and an ssDNA binding Ia motif, separating the motifs by 240 residues instead of the usual 20 to 40 residues seen in other DEAD-box proteins (54).The catalytic ATP-binding helicase and C-terminal helicase domains lie downstream of the SPRY domain.B, DDX1 coimmunoprecipitates with EXOSC3 in the nuclear, but not cytoplasmic, fraction of N2A cell lysate.EXOSC3 was immunoprecipitated from the cytoplasmic or nuclear fraction, followed by immunoblotting using EXOSC3, EXOSC9, and DDX1 antibodies.The input, unbound and bound fractions from the EXOSC3 and control nonspecific rabbit IgG (Ctrl IgG) immunoprecipitation are shown.EXOSC9 serves as a representative of the copurified RNA exosome subunits.Stain-free blot serves as the loading control.C, immunoprecipitation of EXOSC3 from the nuclear fraction was performed with RNase A treatment.EXOSC3 antibody described previously is used in the No treatment and +RNase A immunoprecipitation. Nonspecific rabbit IgG (Ctrl IgG) was used as a control.EXOSC3 and DDX1 were analyzed for the input and bound fractions (IP: EXOSC3).The IgG light-chain band is visible (asterisk) in the bound fractions probed with EXOSC3, just below the EXOSC3 band.Stain-free blot serves as a loading control for the input.The experiment was performed in biological replicates (n = 5) and bands in the bound fractions were quantified relative to the No treatment control.The values below the lanes correspond to the amount of protein quantified from the bands.The asterisk (*) below the values indicate the p value <0.05.IgG, immunoglobulin G; IP, immunoprecipitation.

Figure 4 .
Figure 4.The interaction between EXOSC3 and DDX1 is sensitive to DNA damage.A, N2A cells were treated with camptothecin (CPT) or PBS (control), fixed, and analyzed by immunofluorescence using an antibody that detects the DNA damage marker, γH2AX.B, the input and immunoprecipitated samples from nuclear fractions (bound) treated with either CPT or PBS (control) for both EXOSC3 and control IgG (Ctrl IgG) are shown.DDX1, EXOSC9, and EXOSC3 are detected.C, the immunoprecipitation experiment in Figure 4B was performed in biological triplicate and DDX1 bands in EXOSC3 bound fractions were quantified.Statistical significance was calculated by a student's t test.Asterisk (*) represents p value <0.05.IB, immunoblot; IgG, immunoglobulin G; IP, immunoprecipitation.

Figure 5 .
Figure 5. EXOSC3 and DDX1 are robustly depleted by siRNA-mediated knockdown in N2A cells.A, N2A cells were transfected with scramble control or EXOSC3 siRNA (siEXOSC3) in biological triplicate.The steady-state level of EXOSC3 was assessed by immunoblotting.The asterisk (*) indicates a nonspecific band.Stain-free blot serves as a loading control.B, quantification of immunoblot in (A) shows that EXOSC3 is depleted to 13.1%.The percent depletion of EXOSC3 was determined by quantifying the immunoblot in (A) relative to scramble and averaging the values.This result is significant across three biological replicates.C, N2A cells were transfected with scramble or DDX1 siRNA (siDDX1) in biological triplicate.The steady-state level of DDX1 was assessed by immunoblotting.Stain-free blot serves as a loading control.D, quantification of immunoblot in (C) shows that DDX1 is depleted to 12.4%.The percent depletion of DDX1 was determined by quantifying the immunoblot in (C) relative to scramble and averaging the values.This result is significant across three biological replicates.The statistical analyses for (C and D) were calculated using a student's t test.Asterisks (****) represent a p value <0.0001 and (***) represent a p value <0.001.

Figure 6 .
Figure 6.Depletion of EXOSC3 or DDX1 results in misprocessing of rRNA precursors.A, a graphical schematic of murine rRNA processing, adapted from Henras et al., 2015 (71) that includes up and down arrows to summarize the results obtained in this study.B, Northern blots of total RNA from N2A cells depleted of EXOSC3 or DDX1 using rRNA probes show that levels of 12S rRNA precursors and 5.8S rRNA are altered.EXOSC3 or DDX1 was depleted from cells by siRNA knockdown and total RNA was isolated for Northern blotting.An ITS2 probe was used to detect 32S and 12S rRNA precursors.Additionally, we employed a probe specific for 5.8S rRNA.The 7SL transcript serves as a loading control.Lanes 1 to 3 indicate the scramble control; lanes 4 to 6 indicate siEXOSC3; lanes 7 to 8 indicate siDDX1.C, Northern blots from Figure 6B were quantified relative to 7SL in biological triplicates.An asterisk (*) indicates a significant difference using a p value cut-off of < 0.05.

Figure 7 .
Figure 7. DRIP-seq reveals that depletion of EXOSC3 or DDX1 alters R-loop regions.DNA/RNA immunoprecipitation followed by sequencing (DRIP-seq) was performed on N2A cells depleted of EXOSC3 or DDX1 by siRNA.A, the volcano plots show the number of R-loop regions that statistically increase and decrease in either EXOSC3 or DDX1 depletions compared to scramble control.The plot is graphed using a log 2 fold change across a -log 10 false discovery rate (FDR).The R-loop regions that did not achieve the FDR cut-off of <0.05 are indicated in black and fall under the horizontal lines.The numbers on the left of 0 on the x-axis indicate significantly decreased R-loop regions and the numbers on the right of 0 indicate significantly increased R-loop regions.B, the statistically increased and decreased R-loop regions in cells siRNA depleted of EXOSC3 or DDX1 identified in the DRIP-seq dataset are compared using a Venn diagram.The dark blue circles indicate the number of increased (n = 722) or decreased (n = 935) R-loop regions upon siRNA-mediated EXOSC3 depletion.The yellow circles represent the number of increased (n = 638) or decreased (n = 1058) R-loop regions that were affected upon siRNA-mediated DDX1 depletion.The overlap in green indicates the number of increased (n = 140) or decreased (n = 425) R-loop regions that siEXOSC3 or siDDX1 have in common (p value < 2.2 × 10 −16 ).C, the increased and decreased R-loop regions from both siEXOSC3 and siDDX1 samples were analyzed using Panther gene ontology terms and categorized by biological process.These analyses are statistically significant with a cut-off at p value <0.05.D, the classes of RNA that were affected upon depletion of either EXOSC3 or DDX1 are organized into a pie chart.Protein-coding genes were excluded.The numbers to the right of the class of RNA are the number of R-loop regions that correspond to that class.

Figure 8 .
Figure 8. RNA-seq shows that depletion of EXOSC3 or DDX1 results in more shared decreased transcripts than shared increased transcripts.A, RNAseq was performed on N2A cells siRNA depleted of EXOSC3 or DDX1.The volcano plots show the number of mRNA transcripts that increased and decreased in either siEXOSC3 or siDDX1 compared to scramble control.The plot is graphed using a log 2 fold change across a -log 10 false discovery rate (FDR).The differential transcripts that did not met the FDR cut-off <0.05 are indicated in black and fall under the horizontal lines.The numbers on the left of 0 of the xaxis indicate significantly decreased differential transcripts and the numbers on the right of 0 indicate significantly increased differential transcripts.B, the increased and decreased transcripts are compared using a Venn diagram.The dark blue circles indicate the number of increased (n = 1757) or decreased (n = 2192) mRNA transcripts upon EXOSC3 depletion.The yellow circles represent the number of increased (n = 734) or decreased (n = 968) mRNA transcripts that are affected upon DDX1 depletion.The overlap (p-value < 2.2 × 10 -16 ) represented in green indicates the number of increased (n = 322) or decreased (n = 599) transcripts that are common to depletion of both EXOSC3 and DDX1.

Figure 9 .
Figure 9. Filtering DRIP reads through RNA-seq revealed genes that are simultaneously affected by depletions of EXOSC3 or DDX1.A, a graphical representation of the pipeline used to focus on specific genes using the analysis funnel.N2A cells siRNA depleted of either EXOSC3 or DDX1 were subjected to both DRIP-and RNA-seq.We filtered results from the DRIP-seq through the RNA-seq reads, using only mRNA transcripts.We were then able to produce heatmaps and examine transcriptomic regions using an integrated genomics viewer (IGV).B, using the pipeline described, we created heatmaps showing the landscape of increased and decreased R-loop regions upon depletion of either EXOSC3 (blue) or DDX1 (green).C, the IGV images of Ints6 and Celf4.The chromosome is displayed at the top of the window.The span lists the number of bases currently displayed.The tick marks indicate the chromosome locations.The red line marks the regions in which R-loops are significantly changed.The top track displays the Mus musculus reference genome (NCBI37/ mm9) in orange.The following three tracks display the R-loop regions of interest corresponding to scramble, siEXOSC3, and siDDX1, respectively.The middle three tracks display the RNase H-treated R-loop regions of interest corresponding to scramble, siEXOSC3, and siDDX1, respectively.The last three tracks display the transcript regions of interest corresponding to scramble, siEXOSC3, and siDDX1, respectively.Celf4 exhibited three changed regions in the genes and is displayed by separate panels.D, quantification of Ints6 and Celf4 transcripts in scramble, siEXOSC3, and siDDX1 by fragment per kilo million reads (FPKM).DNA/RNA-immunoprecipitation followed by sequencing.