The pre-mRNA splicing and transcription factor Tat-SF1 is a functional partner of the spliceosome SF3b1 subunit via a U2AF homology motif interface

The transcription elongation and pre-mRNA splicing factor Tat-SF1 associates with the U2 small nuclear ribonucleoprotein (snRNP) of the spliceosome. However, the direct binding partner and underlying interactions mediating the Tat-SF1–U2 snRNP association remain unknown. Here, we identified SF3b1 as a Tat-SF1–interacting subunit of the U2 snRNP. Our 1.1 Å resolution crystal structure revealed that Tat-SF1 contains a U2AF homology motif (UHM) protein–protein interaction module. We demonstrated that Tat-SF1 preferentially and directly binds the SF3b1 subunit compared with other U2AF ligand motif (ULM)-containing splicing factors, and further established that SF3b1 association depends on the integrity of the Tat-SF1 UHM. We next compared the Tat-SF1–binding affinities for each of the five known SF3b1 ULMs and then determined the structures of representative high- and low-affinity SF3b1 ULM complexes with the Tat-SF1 UHM at 1.9 Å and 2.1 Å resolutions, respectively. These structures revealed a canonical UHM–ULM interface, comprising a Tat-SF1 binding pocket for a ULM tryptophan (SF3b1 Trp338) and electrostatic interactions with a basic ULM tail. Importantly, we found that SF3b1 regulates Tat-SF1 levels and that these two factors influence expression of overlapping representative transcripts, consistent with a functional partnership of Tat-SF1 and SF3b1. Altogether, these results define a new molecular interface of the Tat-SF1–U2 snRNP complex for gene regulation.

The transcription elongation and pre-mRNA splicing factor Tat-SF1 associates with the U2 small nuclear ribonucleoprotein (snRNP) of the spliceosome. However, the direct binding partner and underlying interactions mediating the Tat-SF1-U2 snRNP association remain unknown. Here, we identified SF3b1 as a Tat-SF1-interacting subunit of the U2 snRNP. Our 1.1 Å resolution crystal structure revealed that Tat-SF1 contains a U2AF homology motif (UHM) protein-protein interaction module. We demonstrated that Tat-SF1 preferentially and directly binds the SF3b1 subunit compared with other U2AF ligand motif (ULM)-containing splicing factors, and further established that SF3b1 association depends on the integrity of the Tat-SF1 UHM. We next compared the Tat-SF1-binding affinities for each of the five known SF3b1 ULMs and then determined the structures of representative high-and low-affinity SF3b1 ULM complexes with the Tat-SF1 UHM at 1.9 Å and 2.1 Å resolutions, respectively. These structures revealed a canonical UHM-ULM interface, comprising a Tat-SF1 binding pocket for a ULM tryptophan (SF3b1 Trp 338 ) and electrostatic interactions with a basic ULM tail. Importantly, we found that SF3b1 regulates Tat-SF1 levels and that these two factors influence expression of overlapping representative transcripts, consistent with a functional partnership of Tat-SF1 and SF3b1. Altogether, these results define a new molecular interface of the Tat-SF1-U2 snRNP complex for gene regulation.
Tat-SF1 co-immunoprecipitates with the U2 small nuclear ribonucleoprotein particle (17)(18)(19)(20), which anneals with the pre-mRNA branch point site in the early steps of spliceosome assembly. Yet, the direct interaction partner(s) of Tat-SF1 in the human spliceosome remains unknown. Clues arise from experimental observations coupled with the primary sequences of Tat-SF1 and the U2 snRNP components. Primary sequence analysis suggests that Tat-SF1 contains two modular domains, an RNA recognition motif (RRM) and a U2AF homology motif (UHM) (Fig. 1A). The UHM is a protein-interaction module with specialized features for recognizing short "U2AF ligand motifs" (ULM) of target proteins (21). Among U2 snRNP subunits, SF3b1 singularly comprises ULMs that could mediate Tat-SF1 UHM association; namely, five motifs near the N-terminal region of the SF3b1 protein (Fig. 1, B and C).
Here, we explore the hypothesis that a UHM-ULM interface couples Tat-SF1 to the SF3b1 subunit of the U2 snRNP. We determined a 1.1 Å resolution structure that details the Tat-SF1 UHM-fold. We show that Tat-SF1 directly and specifically recognizes the ULM region of SF3b1, and determine high resolution structures of Tat-SF1 UHM complexes comparing complexes with high-and low-affinity SF3b1 ULMs. The Tat-SF1-SF3b1 ULM interactions are important for association of the proteins in cell extracts. We demonstrate that SF3b1 regulates Tat-SF1 expression levels, and that these splicing factors both regulate representative transcripts. Altogether these results elucidate a functional UHM-ULM-mediated partnership of Tat-SF1 with the SF3b1 subunit of the U2 snRNP spliceosome particle.

Tat-SF1 contains a U2AF homology motif
To confirm that Tat-SF1 contains a UHM and detail its fold, we determined the crystal structure of this region at 1.1 Å resolution (Fig. 2, Table 1). Analysis using the DALI 3D comparison server (22) shows that the structure most closely matches established members of the UHM family, including CAPER␣ (PDB ID 4OZ1, Z-score 13.3) and U2AF 65 (PDB ID 4Z2X, Z-score 13.2). The core ␤␣␤␤␣␤-fold also resembles an RRM (e.g. the U1A prototype, PDB ID 3K0J, Z-score 12.8), which shares a common topology and appears to be an ancestor of the UHM. A 3 10 helix at the N terminus of the Tat-SF1 UHM is stabilized by crystal packing, yet may have relevance in the context of the full-length protein-fold.
The canonical RNA interface of the RRM family involves nucleobase stacking with aromatic residues in two ribonucleoprotein consensus motifs (RNP1 and RNP2) of the ␤-sheet surface. In all prior UHM structures (23)(24)(25)(26)(27)(28), the RNP motifs are degenerate and masked by C-terminal extensions of the corefold. Accordingly, a C-terminal tail of the Tat-SF1 UHM (namely Trp 347 and Tyr 353 ) packs against the putative RNP surface (Val 314 and Ile 267 ). These key RNP residues have diverged from the typical aromatic residues of RRMs, consistent with specialized roles for the Tat-SF1 UHM in protein rather than RNA interactions.
GST-Tat-SF1 retains a substantial fraction of the input SF3b1 ULM and has little detectable association with (P)SF1 (Fig.  3, A and B). Although GST-U2AF 65 ⌬RS retains a slight fraction of untagged Tat-SF1, this interaction is likely to be nonspecific considering the nanomolar affinity of the constitutive U2AF 65 -2AF 35 heterodimer (29). Instead, the amount of Residue ranges of constructs used for crystallization (structure) or isothermal titration calorimetry (ITC) are given in parentheses. p14, p14a binding site. C, the aligned sequences of five ULM repeats from human SF3b1 are shown in the inset below. The boundaries of synthetic peptides (in parentheses and underlined) terminate at the second TP motif, which for ULM1/ULM2 and ULM2/ULM3 also corresponds to the start of the next ULM. The central tryptophan (yellow) is preceded by characteristic basic (R/K) amino acids (green). Interactions between the C-terminal Tat-SF1 UHM extension and degenerate RNP residues are shown in surface and stick representation. The UHM is colored in a rainbow gradient from blue to red at the N and C termini.

UHM-mediated partnership of Tat-SF1 and SF3b1
Tat-SF1 retained by GST-SF3b1 ULM is significant (Fig. 3C). This result demonstrates that Tat-SF1 specifically and directly associates with the ULM region of SF3b1.

Tat-SF1 preferentially binds the fifth SF3b1 ULM
To quantitatively characterize Tat-SF1-SF3b1 interactions, we used isothermal titration calorimetry (ITC) to measure the binding affinities of Tat-SF1 for SF3b1 ULM and each of the individual SF3b1 ULMs (Fig. 4, Table 2). Our Tat-SF1 construct for ITC experiments included residues 1-360, which supports snRNP co-immunoprecipitation (19) and avoids precipitation of the UHM fragment during titration. Unlike CAPER␣ (23) but similar to U2AF 65 (31), the Tat-SF1-SF3b1 ULM isotherms show no evidence of cooperative association. We chose boundaries of peptides corresponding to the SF3b1 ULMs to minimize overlap among consecutive motifs (Fig. 1B). Tat-SF1 detectably binds three of the five documented SF3b1 ULMs (ULM1, ULM4, and ULM5). The binding affinity of Tat-SF1 for ULM5 is Ͼ20-fold greater than for ULM4 and slightly greater than ULM1 (4-fold). No heats were detected during titrations of Tat-SF1 with ULM2 or ULM3. The apparent binding affinity of Tat-SF1 for the entire SF3b1 ULM region agrees with the average values of the highest affinity ULM5 and ULM1 sites ( Table 2). We conclude that Tat-SF1 is capable of recognizing three local SF3b1 ULM sites, yet has lower affinity for ULM4 compared with either ULM5 or ULM1.

Structures of Tat-SF1 bound to high and low affinity SF3b1 ULMs
To identify and compare the interactions of the Tat-SF1 UHM with high versus low affinity SF3b1 ULMs, we determined the crystal structures of its complexes with ULM4 and ULM5 at 1.9 Å and 2.1 Å resolutions (Fig. 5, Table 1). Seven residues (residues 291-297) of a 17-residue ULM4 peptide were observed bound to a single Tat-SF1 UHM in the crystallographic asymmetric unit (Fig. 5A). Fewer ULM5 residues are evident (four ordered residues) and one of the two Tat-SF1 UHM copies in the asymmetric unit lacks bound ligand (Fig.  5B). This difference is most likely due to the distinct packing environments of the crystal structures.
The core-fold of the apo-Tat-SF1 UHM is similar to its SF3b1 ULM-bound complexes (r.m.s.d. n 0.35-0.45 Å between 86 matching C␣ atoms) (Fig. 5C). However, the empty ULM-binding site of the apo-Tat-SF1 structure is filled by a phenylalanine (Phe 337 ) donated by the so-called "RXF" loop of the UHM (21). In the Tat-SF1-ULM complexes, this Phe 337 side chain rearranges to enclose the consensus ULM tryptophan.
Tat-SF1 displays characteristic features of UHM-type interactions with the SF3b1 ULMs (Fig. 6, A and B). The ULM tryptophan (ULM4 Trp 293 or ULM5 Trp 338 ) inserts between the ␣-helices of the Tat-SF1 UHM. There, the ULM tryptophan is sandwiched between a salt bridge (Tat-SF1 Glu 294 -Arg 335 ) and

UHM-mediated partnership of Tat-SF1 and SF3b1
a T-shaped aromatic interaction (39) with the Phe 337 side chain. The Tat-SF1-bound ULM4 and ULM5 peptides both adopt a U-shape, which is shared by the two other known structures of SF3b1 ULMs bound to UHMs (ULM5 bound to CAPER␣ (23) or SPF45 (26)). In contrast, the SF3b1 ULM trajectories differ from that of the SF1 ULM, which binds to the U2AF 65 UHM in a distinct, linear conformation that is facilitated by a C-terminal coiled-coil extension of the SF1 ULM (24,25). At the C terminus of SF3b1 ULM4, a threonine-proline (TP) motif packs against an exposed Tat-SF1 tryptophan (Trp 336 ) at the X position of the UHM RXF loop (Fig. 6A). Although the surrounding crystal contacts prevent an analogous interaction of Tat-SF1 Trp 336 with the ULM5 TP motif, the SF3b1 ULM5 and ULM4 share identical C-terminal sequences (Fig.  1C). As such, ULM5 is likely to contact Tat-SF1 Trp 336 in a solution environment. The SF3b1 ULM region contains Ͼ20 TP motifs near the ULMs. These SF3b1 sites are highly phosphorylated (phospho.ELM (40)), which could provide means to regulate SF3b1 association with UHM-containing proteins.
Positively-charged residues at the N terminus of each ULM orient toward Glu 294 , Glu 289 , Asp 290 , and Glu 286 in an acidic ␣-helix of the Tat-SF1 UHM (Fig. 6, A and B). Although several N-terminal ULM residues are disordered in the electron den-sity, an ion pair between a conserved ULM arginine (ULM4 Arg 292 or ULM5 Arg 337 ) and Tat-SF1 Glu 286 is well-defined. Considering that the Tat-SF1-interacting residues of the SF3b1 ULM3, ULM4, and ULM5 C termini are identical, the different number of basic residues in the N-terminal tails are likely to dominate the range of binding affinities. For example, the different Tat-SF1 binding energies of ULM4 versus ULM5 (⌬⌬G 1.9 kcal mol Ϫ1 ) are consistent with the loss of an ion pair between the acidic Tat-SF1 ␣-helix and SF3b1 ULM5 Lys 333 (41). A prior study showed that basic residues substituted in the normally acidic U2AF 65 ␣-helix eliminated detectable binding to the SF1 ULM in pulldown assays (30). To investigate the importance of corresponding charge-charge interactions for the Tat-SF1-ULM complex, we introduced analogous Tat-SF1 E286K and D290K mutations, and found that these substitutions abolished detectable binding to SF3b1 ULM5 by ITC (Fig. 6C).

Integrity of the Tat-SF1 UHM is critical for coimmunoprecipitation with SF3b1
We evaluated the contribution of UHM-ULM interactions to the association of full-length Tat-SF1 and SF3b1 in human cell extracts (Fig. 7). Plasmids expressing Tat-SF1 variants or N-terminally FLAG-tagged SF3b1 were co-transfected into HEK293T cells. The associated protein complexes were immunoprecipitated using Tat-SF1-specific agarose beads. We found that wildtype (WT) Tat-SF1 retained significant levels of FLAG-SF3b1 in lysates from transfected but not mock-transfected cells. This result confirms the existence of a Tat-SF1-SF3b1 complex, in agreement with prior observations in distinct contexts (17)(18)(19)(20). We next compared FLAG-SF3b1 coimmunoprecipitates of a Tat-SF1 variant with E286K/F337A mutations, which prevent detectable binding of Tat-SF1 to SF3b1 ULM5 by ITC (Fig. 6D). The mutant Tat-SF1 variant lacked detectable retention of FLAG-SF3b1, which supports the importance of an intact UHM for association of Tat-SF1 with SF3b1 in human cell extracts.

Tat-SF1 expression is sensitive to SF3b1 levels
The association of Tat-SF1 with SF3b1 suggested that its expression level would be mutually regulated to ensure formation of a functional macromolecular complex. To test this hypothesis, we used siRNAs to reduce Tat-SF1 or SF3b1 expression levels in HEK293T cells, and evaluated changes in expression of the other factor by RT-qPCR and immunoblotting (Fig. 8, A-D). Tat-SF1 knockdown had little effect on SF3b1 levels, consistent with prior observations (18). Nevertheless, SF3b1 knockdown significantly reduced Tat-SF1 expression levels. We conclude that whereas SF3b1 serves Tat-SF1independent functions as a major spliceosome subunit (42), a feedback mechanism modulates Tat-SF1 levels in response to SF3b1.

Tat-SF1 and SF3b1 regulate overlapping gene transcripts
To further explore the functional outcomes of a Tat-SF1 partnership with SF3b1, we evaluated whether SF3b1 regulates expression of Tat-SF1 responsive genes. We focused on three representative transcripts (CCND1, SIN3B, and LMNB1) that

UHM-mediated partnership of Tat-SF1 and SF3b1
have documented expression changes in response to Tat-SF1 levels (16). As expected, levels of these transcripts decreased following siRNA-mediated reduction of Tat-SF1 (Fig. 8, E-G).
Notably, SF3b1 knockdown also decreased the levels of all three Tat-SF1-sensitive transcripts (by 3-5-fold). These effects are likely to be direct, because SF3b1 expression is only subtly affected by Tat-SF1 levels (Fig. 8, B and C) (18). Despite numerous studies (e.g. Refs. 14, 16, and 18), to our knowledge neither we nor others have succeeded in rescuing gene expression by reintroducing Tat-SF1, which suggests that Tat-SF1 activities are delicately balanced by localization and/or post-translational modifications. To reinforce the functional convergence of Tat-SF1 and SF3b1, we compared an additional transcript (DUSP11) that is known to be sensitive to an SF3b1-targeted small molecule, spliceostatin-A (43). Both SF3b1 and Tat-SF1 knockdown decreased levels of the DUSP11 transcript (Fig.  8H).
As a preliminary check for alternative splicing, we compared the sensitivity of a distinct splice site from LMNB1 to Tat-SF1 and SF3b1 knockdown (Fig. S1). We observed similar changes in transcript levels when probing either LMNB1 splice site. Consistently, Tat-SF1 reduction typically leads to intron retention rather than alternatively splicing (17,18), and most intronretained transcripts would be removed by quality control pathways (44). As such, our experiments lack discrimination between altered transcription and splicing. Regardless, the significant impact of SF3b1 knockdown for all Tat-SF1-sensitive transcripts supports altered pre-mRNA splicing, considering the well-established roles of SF3b1 in the spliceosome (42).

UHM-mediated partnership of Tat-SF1 and SF3b1
Altogether, our results establish convergent roles for Tat-SF1 and SF3b1 in human gene expression.

Discussion
Here, we identify direct and specific coupling of the Tat-SF1 transcriptional and pre-mRNA splicing factor with the SF3b1 subunit of the U2 snRNP. Underscoring the importance of the Tat-SF1-SF3b1 complex, associated snRNPs are required for Tat-SF1 to stimulate transcription (19). Following clues from structural homology, we pinpoint a critical UHM-ULM interface for Tat-SF1-SF3b1 association in cell extracts or purified proteins. Our high resolution structures reveal that the Tat-SF1 UHM exhibits a characteristic RRM-like topology with an extended C terminus that masks degenerate ribonucleoprotein motifs. The Tat-SF1 UHM encloses a central tryptophan of the SF3b1 ULM between two ␣-helices, which is typically essential for UHM-ULM binding (e.g. Refs. 29 -31 and 36). Tat-SF1 and SF3b1 association further depends on electrostatic interactions between an acidic UHM ␣-helix and a basic ULM tail.
The Tat-SF1 UHM-SF3b1 ULM structures expand panoptic views of this protein interaction motif (21). A recurrent theme of tryptophan-mediated and charge-charge interactions underscores the question of how cognate UHM-ULM pairs are determined. Although we show that the Tat-SF1 UHM is required for detectable SF3b1 association, it remains likely that auxiliary interfaces mediated by distinct protein or RNA regions could enhance the binding specificity and affinity of the Tat-SF1-SF3b1 complex in a physiological context. The binding affinities of the Tat-SF1-SF3b1 protein domains are micromolar (Table 2), consistent with other SF3b1 ULM-UHM pro-tein complexes (21). Such relatively weak interactions could be important for dynamic associations (45), yet do not rule out enhancement by transient interfaces during spliceosome assembly. For example, the UHM-mediated association of Tat-SF1 with the SF3b1 ULMs may synergize with the U2 snRNA, which Tat-SF1 and SF3b1 bind in the context of the intact U2 snRNP. Indeed, an RRM mutation of the yeast homologue Cus2 prevents its U2 snRNA association and rescue of U2 snRNA misfolding (9,19).
Macromolecular complexes such as spliceosome particles often employ feedback mechanisms to coordinate subunit expression (e.g. Refs. 46 -48). Our finding that Tat-SF1 expression decreases following SF3b1 depletion (Fig. 8, A and D) supports functional coordination of Tat-SF1 and SF3b1. Conversely, SF3b1 levels are relatively insensitive to reduced Tat-SF1, which agrees with SF3b1 ULM associations with a variety of alternative splicing factors (23,26,27,31). The SF3b1-dependent Tat-SF1 expression is analogous to a previously characterized precedent, the U2AF heterodimer (46). U2AF 65 knockdown reduces U2AF 35 protein levels, whereas reduced U2AF 35 levels are insensitive to U2AF 65 . Functionally, in vitro splicing can be reconstituted by the U2AF 65 subunit alone (49 -51), and moreover, Schizosaccharomyces pombe introns that are affected by U2AF 65 depletion are affected by U2AF 35 depletion (52). Accordingly, we here find that the levels of representative downstream transcripts respond to both Tat-SF1 and SF3b1 reductions. These effects are likely to be specific, because SF3b1 (43,53,54) and Tat-SF1 (16,17,55) knockdowns alter expression and splicing of only a subset of genes. We suggest that a feedback mechanism promotes the formation of Tat-SF1 complexes with SF3b1.
Altogether, we have identified SF3b1 as a directly interacting partner of Tat-SF1, and detailed a requisite ULM-UHM interface of this functional complex at atomic resolution. Beyond our study of Tat-SF1, the SF3b1 ULM-binding preferences have been comprehensively determined for one other UHM-containing protein, U2AF 65 (31). Both Tat-SF1 and U2AF 65 bind the SF3b1 ULM region with similar apparent affinities (approximate K D values 2 M/3 M for Tat-SF1/U2AF 65 binding to SF3b1 ULM ), and both show a subtle binding preference for ULM5 among the SF3b1 ULMs. Immunoaffinity purification followed by MS has shown that Tat-SF1 associates with U2 snRNP particles, but is absent from assembled, "A"-stage spliceosomes (20). In contrast, U2AF 65 is an essential splicing factor that stabilizes U2 snRNP association with pre-mRNA splice sites (56,57), and is detected only in the A-and later "B"-stages of spliceosome assembly (20). The similar SF3b1 binding preferences of Tat-SF1 and U2AF 65 , coupled with sequential Tat-SF1 and U2AF 65 functions during transcription and spliceosome assembly, call for future studies to explore whether Tat-SF1 serves as a checkpoint factor for handoff of functional U2 snRNP to U2AF 65 during spliceosome assembly.

UHM-mediated partnership of Tat-SF1 and SF3b1
NCBI Refseq NP_009210), or SF1 (residues 1-255 of NCBI Ref-Seq NP_004621) were expressed in Escherichia coli as GST fusions using either pGEX-6p or an in-house pGEX-6p variant encoding a tobacco etch virus protease site. The splicing factor proteins were purified by GST affinity followed by anion exchange or heparin affinity chromatography. For crystallization experiments, the Tat-SF1 UHM was separated from aggregates by size exclusion chromatography in 50 mM NaCl, 15 mM HEPES, pH 7.4, 0.2 mM tris(2-carboxyethyl)phosphine. Purified SF1 proteins were phosphorylated by UHMK1 as described (25) and then separated from UHMK1 by size exclusion chromatography. Synthetic peptides corresponding to SF3b1 ULMs (with residue ranges shown in Fig. 1) were purchased at 98% purity (Biomatik or Bio-Synthesis, Inc.).

GST pulldown assays
The GST fusion "bait" protein (100 g) was incubated for 1 h on pre-packaged GSH-agarose spin columns with 0.2-ml bed volume (Thermo Scientific Pierce TM ) that were equilibrated three times with binding buffer (100 mM NaCl, 25 mM HEPES,

UHM-mediated partnership of Tat-SF1 and SF3b1
pH 7.4, 3% glycerol, 1 mM EDTA, 10 mM DTT), and supplemented with protease inhibitor. The immobilized GST fusions were incubated with purified protein (500 g, 1 mg/ml) in binding buffer for 45 min at 4°C. The resin was then washed five times with binding buffer. Retained proteins were eluted by heating in Laemmli loading buffer supplemented with 10 mM GSH, and analyzed by SDS-PAGE with Coomassie Blue staining.

Isothermal titration calorimetry
Protein samples were dialyzed into 25 mM HEPES, pH 7.4, 50 mM NaCl, 0.2 mM tris(2-carboxyethyl)phosphine, and peptides were diluted into Ͼ200-fold greater volume of dialysis buffer, prior to ITC. The thermodynamics of Tat-SF1 titrated with SF3b1 ULM or ULM peptides were measured using a VP-ITC and fit using Origin 7.0 (MicroCal, LLC). Control titrations of ULM peptides into buffer were linear and showed negligible heats of dilution, and were subtracted to correct the relatively low affinity isotherms of ULM1 and ULM4. Other isotherms were corrected for the heats of dilution using an average of three data points at the saturated plateau of the titration. The apparent stoichiometries are consistent with 1:1 binding except SF3b1 ULM , which is underdetermined and appears to bind between two and five Tat-SF1 molecules considering 2 values of different fits.

Crystallization and structure determination
For crystallization of the Tat-SF1 UHM-SF3b155 ULM4 and ULM5 complexes, 0.1 mM Tat-SF1 UHM was mixed with a 1.2-fold molar excess of SF3b155 ULM and concentrated in a 2-kDa MWCO Vivaspin Hydrosart (Sartorius) concentrator. The Tat-SF1 UHM and its ULM complexes were crystallized by vapor diffusion using the JCSGϩ TM screen (Molecular Dimensions) at 277 K. The apo-Tat-SF1 UHM crystallization condi- A and B, immunoblots demonstrating Tat-SF1 or SF3b1 knockdown following transfection of HEK293T cells with the indicated Stealth TM siRNAs or a low G/C control siRNA ("low GC"). Whole cell lysates were analyzed by SDS-PAGE and immunoblotted for either: A, Tat-SF1 or B, SF3b1. H2B is encoded by an intronless gene and hence an appropriate loading control. C-H, relative levels of TAT-SF1, SF3B1, or the indicated transcripts were determined by RT-qPCR of total RNA isolated from HEK 293T cells following transfection with the indicated siRNAs. Amplification of the LMNB1 exon 5/6 junction is shown. The LMNB1 exon 7/8 junction gave similar results as shown in Fig. S1. Primer locations and sequences are given in Table S1. Unpaired, two-tailed t tests with Welch's correction: *, p Ͻ 0.05; **, p Ͻ 0.005; ***, p Ͻ 0.0005; ****, p Ͻ 0.0001.
The apo-Tat-SF1 UHM structure was determined with an ensemble search model comprising the U2AF 65 , SPF45, and CAPER␣ UHM structures generated using Ensembler (61). A molecular replacement solution using Phaser (61) via the Phenix interface (62) (TF Z-score 5.8, LLG 304) was rebuilt in Coot (63), and refined using Phenix (62). Alternative strategies to determine the apo-Tat-SF1 UHM structure were unsuccessful, including direct methods at 1.13 Å resolution and molecular replacement searches with either individual UHM crystal structures stripped of ligand coordinates (U2AF 65 , PDB ID 4FXW; SPF45, PDB ID 2PEH; CAPER␣, PDB ID 4OZ0) or an unpublished apo-NMR structure (PDB ID 2DIT). The ULMbound structures were determined by molecular replacement starting from the apo-Tat-SF1 UHM structure. Crystallographic data and refinement statistics are given in Table 1.

Cell culture and transfections
HEK 293T cells (ATCC CRL-3216 TM ) were cultured in Gibco TM minimum essential media supplemented with 10% fetal calf serum (Atlanta Biologicals) and 2 mM L-glutamine and maintained at 37°C in a humidified atmosphere containing 5% CO 2 . Cells were transfected using JetPrime (Polyplus-transfection) as instructed by the manufacturer.

Co-immunoprecipitation
HEK 293T cells were transfected with plasmids encoding human Tat-SF1 (modified from Origene #SC114990), N-terminally FLAG-tagged SF3b1 (modified from a gift of Benjamin Ebert (64)), or empty control vector (pCMV). After 24 h, cells were harvested and lysed in a IP buffer (50 mM Tris, pH 8.0, 75 mM NaCl, 5% glycerol, 10 mM CaCl 2 , cOmplete TM EDTA-free protease inhibitor (Sigma), 0.5 mM DTT) ϩ 0.5% Triton X-100. Lysates were clarified by centrifugation and a fraction of each sample was set aside as input controls. The remaining lysates were diluted 1:5 with IP buffer ϩ 0.1% Triton X-100, then incubated for 1 h on a rocking platform at 4°C protein G-Sepharose (GE Healthcare number 17061801) pre-bound to Tat-SF1-specific antibody (Santa Cruz Biotechnology number sc-514351). Beads were collected by centrifugation and washed extensively with IP buffer. Final washes included RNase A (Sigma).

Immunoblotting
The immunoprecipitated complexes on beads were resuspended in SDS-PAGE loading buffer. For total protein analysis, cells were lysed in a buffer containing 50 mM Tris, pH 8.0, 10 mM EDTA, 1% SDS, 1 mM DTT, and protease inhibitors. Proteins were separated by SDS-PAGE, transferred to a polyvi-nylidene difluoride membrane, and immunoblotted with antibodies specific for Tat-SF1 (Santa Cruz Biotechnology number sc-514351), FLAG (Cell Signaling Technology number 2368), SF3b1 (MBL number D221-3), or Histone H2B (Cell Signaling Technology number 12354). Secondary antibodies included anti-mouse IgG horseradish peroxidase or anti-rabbit IgG horseradish peroxidase (GE Healthcare numbers NA931 or NA934). Chemiluminescent signal from SuperSignal WestPico chemiluminescent substrate (ThermoFisher Scientific) was detected on a Chemidoc TM Touch Imaging System (Bio-Rad).

Quantitative-reverse transcriptase real time PCR
Cells were harvested 3 days following transfection with Stealth TM siRNA duplexes (ThermoFisher Scientific) specific for HTATSF1 (assay IDs: HSS120650, HSS120651), SF3B1 (assay IDs: HSS146413, HSS146415), or a negative control (12935200). Total RNA was isolated and DNase I-treated using the RNeasy Kit (Qiagen). The cDNAs were synthesized by Moloney murine leukemia virus RT with random primers (Invitrogen), and analyzed by quantitative PCR with SYBR TM Green in triplicate on a Bio-Rad CFX thermal cycler. The RT-qPCR products were quantified by the relative standard curve method and normalized to the level of 18S rRNA. Results in Figs. 7 and 8 represent at least three biological replicates. Primers are listed in Table S1.