Crystal structure of a Y-box binding protein 1 (YB-1)–RNA complex reveals key features and residues interacting with RNA

The Y-box binding protein 1 (YB-1) is a member of the cold shock domain (CSD) protein family and is recognized as an oncogenic factor in several solid tumors. By binding to RNA, YB-1 participates in several steps of posttranscriptional regulation of gene expression, including mRNA splicing, stability, and translation; microRNA processing; and stress granule assembly. However, the mechanisms in YB-1–mediated regulation of RNAs are unclear. Previously, we used both systematic evolution of ligands by exponential enrichment (SELEX) and individual-nucleotide resolution UV cross-linking and immunoprecipitation coupled RNA-Seq (iCLIP-Seq) analyses, which defined the RNA-binding consensus sequence of YB-1 as CA(U/C)C. We also reported that through binding to its core motif CAUC in primary transcripts, YB-1 regulates the alternative splicing of a CD44 variable exon and the biogenesis of miR-29b-2 during both Drosha and Dicer steps. To elucidate the molecular basis of the YB-1–RNA interactions, we report high-resolution crystal structures of the YB-1 CSD in complex with different RNA oligos at 1.7 Å resolution. The structure revealed that CSD interacts with RNA mainly through π–π stacking interactions assembled by four highly conserved aromatic residues. Interestingly, YB-1 CSD forms a homodimer in solution, and we observed that two residues, Tyr-99 and Asp-105, at the dimer interface are important for YB-1 CSD dimerization. Substituting these two residues with Ala reduced CSD's RNA-binding activity and abrogated the splicing activation of YB-1 targets. The YB-1 CSD–RNA structures presented here at atomic resolution provide mechanistic insights into gene expression regulated by CSD-containing proteins.

The Y-box binding protein 1 (YB-1) is a nucleic acid-binding protein that belongs to the highly conserved cold shock domain (CSD) 3 -containing protein family found in bacteria, plants, and animals (1,2). It was originally identified by its ability to bind Y-box sequence 5Ј-CTGATTGGCCAA-3Ј (an inverted CCAAT cassette) in the promoter of the major histocompatibility complex class II gene (3). Subsequently, YB-1 was characterized as a major component of the messenger ribonucleoprotein particles in the cytoplasm that interacts with mRNAs (4). Accumulating evidence has suggested that YB-1 participates in almost all DNA-and RNA-related processes in cells, from DNA replication and repair to the transcription, splicing, stability, and translation of mRNA through its DNA/RNA-binding activities and interactions with other proteins (1,2,(5)(6)(7)(8)(9)(10)(11)(12). However, the underlying regulatory mechanisms remain elusive.
The biological functions of YB-1 have been extensively studied in diseases, mainly including malignant tumors (13). It has been shown that transgenic mice overexpressing YB-1 developed invasive mammary carcinomas (14). Numerous reports suggest that the up-regulation of YB-1 and its nuclear localization are indicators of poor prognosis in a variety of cancers (15,16). YB-1 plays crucial roles in the control of multiple cellular processes, including cell proliferation, survival, metastasis, and drug resistance. It regulates the transcription of genes involved in drug resistance and the control of cell proliferation and progression, including MDR1 (17), CCNA and CCNB1 (18), EGFR (19), PIK3C (20), MET (21), and CD44 (22). YB-1 has also been shown to act as a critical regulator of translation during the epithelial-mesenchymal transition and metastasis of epithelial malignances by activating the translation of Snail1, HIF1A, and other developmentally regulated factors (23,24). In addition, cro ARTICLE YB-1 was found to be actively secreted, and serum YB-1 levels may be potentially utilized as a biomarker for cancer progression (25)(26)(27)(28). Recent studies reported a role of YB-1 in sorting miRNAs and other small noncoding RNA species into exosomes (29,30).
The YB-1 protein consists of a variable N-terminal Ala/Prorich domain (A/P), a central CSD, and a C-terminal domain (CTD), which is composed of basic and aromatic regions (Fig.  1A). The RNA-binding specificity of an RNA-binding protein is important for understanding its cellular functions and molecular mechanisms as well as for searching for its downstream targets. Early studies based on the Xenopus Y-box protein, FRGY2, suggest that the CSD in Y-box proteins is responsible for its specific interaction with RNA, whereas the Ala/Pro-rich domain and C-terminal domain stabilize its binding to RNA in a sequence-independent manner (31)(32)(33). The CSD of YB-1 adopts a classical OB-fold consisting of five anti-parallel ␤-strands to form a ␤-barrel (34). This ␤-barrel structure usually exists in nucleic acid-binding proteins carrying the ribonucleoprotein particle domain-1 and -2 (RNP-1 and RNP-2). In the previously reported CSD structures, CSDs only show limited sequence specificity, such as bacterial CspB and Lin28 (35,36). However, how YB-1 CSD specifically recognizes its RNAbinding motif remains elusive.
In our previous studies, we showed that YB-1 specifically binds to a CA(U/C)C consensus sequence using the systematic evolution of ligands by exponential enrichment (SELEX) method (37). Mutational analyses showed that the second adenosine (A 2 ) and the fourth cytidine (C 4 ) in the CA(U/C)C consensus are critical for YB-1 recognition. YB-1 specifically recognizes the CAUC core motif in both CD44 variable exon v5 and its upstream polypyrimidine region, which activates exon v5 inclusion by recruiting U2AF65 to the weak 3Ј splice site. Later, using the iCLIP-Seq method, we mapped the in vivo YB-1-binding sites at a genome-wide level and found that YB-1 preferentially recognizes a UYAUC motif, which closely resembles the binding motif determined by in vitro SELEX (38). In a recent study, the CAUC motif was also identified as a YB-1-binding motif in a fused cell line by photoactivatable ribonucleoside-enhanced cross-linking and immunoprecipitation (39). We also showed that YB-1 is up-regulated in glioblastoma and promotes cell proliferation by blocking the processing of miR-29b-2 via specific binding to the CAUC motif in the terminal loop region of pri-/pre-miR-29b-2 (38). These results demonstrate that the specific recognition of the YB-1-binding motif is crucial for its function.
To provide the structural basis for YB-1-RNA interactions, in this study, we resolved the high-resolution crystal structures of YB-1 CSD in complex with RNA-binding consensus sequence defined previously by using SELEX method (37). We also found that YB-1 CSD forms a homodimer both in crystal and in solution. Its dimerization is important for its RNA binding and function in splicing regulation. Taken together, our findings provide structural details and novel features of YB-1-RNA interactions.

The CSD of YB-1 preferentially binds a single-stranded RNA oligomer
The CSD is located in the middle of YB-1 protein, spanning residues 50 -130 (Fig. 1A). To investigate the binding affinities of YB-1 CSD to single-stranded or dsRNA oligos, we performed fluorescence polarization (FP) assays to measure the dissociation constants (K D ). RNA oligomers used in the FP assays were designed to eliminate the secondary structures, including hairpin structure or more than four nucleotides of self-complementary sequence. In detail, RNA oligomers all contain a fluorescein amidite (FAM) group and six additional nucleotides (5Ј-gccaau-3Ј) at the 5Ј-end and two nucleotides (5Ј-ua-3Ј) at the 3Ј-end. The tetranucleotides in all sequence contexts are located in the middle. For convenience, we refer to 5Ј-FAM-gccaauC 1 A 2 U 3 C 4 ua-3Ј, which is derived from the high-affinity CA(U/C)C motif, as the WT sequence (f-rWT). The first cytidine shown as a capital letter in the sequence is position ϩ1. The YB-1 CSD binds to f-rWT, with a K D value of 1.26 M (Fig.  1B). However, no binding was detected for a dsRNA oligo, which was generated by annealing f-rWT with its complementary sequence. Therefore, our results indicate that the YB-1 CSD binds to single-stranded RNA oligomers rather than duplex RNAs.

Overall structure of the YB-1 CSD in complex with a hexamer RNA
We then performed crystallization trials with the YB-1 CSD and a hexamer RNA, which is composed of the central CAUC motif (CAUC: 5Ј-U 0 C 1 A 2 U 3 C 4 U 5 -3Ј). We were able to grow crystals of the complex of YB-1 CSD-CAUC that diffracted to 1.7 Å in a space group of P6 2 ( Table 1). The structure was solved by molecular replacement using the crystal structure of the Bacillus subtilis CspB CSD as a search model (PDB code 3PF5). The asymmetric unit is composed of one CSD bound to one RNA oligomer. The overall structure of the YB-1 CSD is a typical OB-fold, which is a closed ␤-barrel formed by five antiparallel ␤-strands (␤1-␤5) (Fig. 1C). The CAUC RNA oligomer bound on YB-1 CSD in an arc shape. U 1 and U 6 at each end of the RNA oligomer are missing in the electron density map; however, the CAUC consensus tetranucleotides are clearly visible (Fig. 1D). The CAUC RNA oligomer bound onto the surface of YB-1 CSD, which is formed by ␤1-␤3 with an interaction interface of 532.7 Å 2 (Fig. 1E), indicating the formation of a stable complex of YB-1 CSD with the CAUC RNA oligo. Moreover, the hydrophobic and partially basic surface of YB-1 CSD facilitated RNA binding.

The YB-1 CSD interacts with RNA mainly throughstacking interactions, which are crucial for activating CD44 exon v5 splicing
When bound to the YB-1 CSD, the CAUC RNA forms four stacking pairs with the pairing alignments of C 1 ⅐His-87, A 2 ⅐Phe-85, U 3 ⅐Phe-74, and C 4 ⅐Trp-65 ( Fig. 2A). Sequence alignment showed that these four aromatic residues are highly conserved in cold shock proteins across species (Fig. 2B). To test the importance of these specific residues for the binding of the Crystal structure of YB-1-RNA complex CAUC RNA, we substituted the four residues His-87, Phe-85, Phe-74, and Trp-65 with Ala residues within the YB-1 CSD (H87A, F85A, F74A, and W65A, respectively). Each mutant abolished the binding to the RNA oligomer, suggesting that these residues are essential for RNA binding by the CSD of YB-1 (Fig. 2C).
We previously reported that YB-1 is necessary for the activation of CD44 exon v5 splicing by binding to the CAUC elements in both exon v5 and its upstream intron (37). To test the impact of these key residues on splicing, we co-transfected full-length constructs expressing full-length WT YB-1 and its mutants carrying W65A, F74A, F85A, and H87A substitutions together with the CD44 exon v5 minigene and analyzed exon v5 splicing by RT-PCR (Fig. 2, D and E). Each mutant exhibited a decrease in exon v5 inclusion compared with WT YB-1, whereas W65A or F74A showed stronger effects than F85A or H87A. The double mutation F85A/H87A resulted in a similar effect compared with each single mutation. Mutation of all four residues resulted in a significant decrease. Together, these results indicate that Trp-65 and Phe-74 play a major role in stimulating exon v5 splicing by YB-1.

Sequence-specific recognition of the CAUC motif by YB-1 CSD
Structural analysis of the YB-1 CSD-CAUC complex revealed the underlying structure-based intermolecular interactions observed in the crystal structure of the complex, where a combination of specific hydrogen bonding, stacking, and hydrophobic interactions contributes to the anchoring of the RNA oligomer to the YB-1 CSD. The four nucleotides adopt anti conformations except for the A 2 nucleotide, which is in syn conformation ( Fig. 2A).
Besides thestacking interactions, C 1 forms a network of intermolecular hydrogen bonds with Thr-89 and His-87 (Fig.  3A). The backbone of C 1 is further stabilized by a water-mediated hydrogen bond with His-87, and its 2Ј-hydroxyl group on the sugar pucker forms hydrogen bonds with Lys-118. Lys-118 is involved in multiple interactions with the RNA oligomer. The side chain of Lys-118 not only forms a salt bridge and a hydrogen bond with the phosphate group of A 2 (Fig. 3A), but stacks over A 2 to buttress the cation--sandwich structure. Moreover, N 6 of A 2 forms a hydrogen bond with the main chain of Lys-118. N 7 of A 2 comes to contact with Phe-85 via a water- Crystal structure of YB-1-RNA complex bridged hydrogen bond. In addition, the orientation of A 2 is further stabilized by its sugar phosphate backbone. A hydrogen bond between N 3 and O 4Ј of A 2 was observed. Furthermore, in the crystal structure of YB-1 CSD-CAUC complex, N 3 and O 2 of U 3 form hydrogen bonds with the side chains of Asp-83 and Lys-64, respectively (Fig. 3B). A conserved salt bridge between Asp-83 and Lys-64 is formed to stabilize these two residues. Residues Trp-65, Asn-67, Asn-70, Tyr-72, and Asp-105 form key intermolecular contacts with C 4 . Additionally, the base of C 4 stacks on Trp-65 and contacts Trp-65, Asn-67, and Asp-105 via direct or water-mediated hydrogen bonds. The backbone sugar pucker of C 4 is shaped by Asn-70 and Tyr-72 by two hydrogen bond interactions.

Impact of YB-1 CSD mutants on in vitro binding affinity
To assess these interactions, site-specific mutagenesis as well as FP assays were performed (Fig. 3C). T89A, which is predicted to disrupt the hydrogen bond interactions with C 1 , decreased the binding affinity by about 4.3-fold, with a K D of 5.39 M. Lys-118, which plays multiple roles in the recognition of A 2 , completely abolished the binding when mutated to Ala (Fig.  3C). Residues Asp-83 and Lys-64 directly interact with U 3 . Mutation of Asp-83 to Ala exhibits a K D of 3.10 M, reflecting a 2.5-fold reduction in binding affinity compared with the WT, which is in contrast to the mutation of Lys-64, which dramatically decreased the binding; thus, Lys-64 plays a major role in the recognition of U 3 rather than Asp-83 (Fig. 3C). In addition, N67A, which disrupts the hydrogen bond between Asn-67 and the O 2 of C 4 , decreases the binding affinity by a factor of 4 (K D ϭ 5.07 M). In addition, there is a unique interaction between YB-1 CSD and the sugar-phosphate backbone of CAUC. The amide group of Asn-70 interacts with the 2Ј-hy-droxyl group of the sugar ring of C 4 via a hydrogen bond. Asn-70 only exists in YB-1 CSD and not in bacterial Csp protein or mLin28 CSD (Fig. 2C). Mutation of Asn-70 to Ala decreased the binding by about 1.7-fold, with a K D of 2.19 Ϯ 0.25 M (Fig.  3C). Taken together, these residues play important roles in the specific recognition of the CAUC RNA oligomer by the YB-1 CSD.

Impact of complementary substitutions of the CAUC RNA oligo on binding to YB-1 CSD
To further investigate the molecular basis underlying sequence-specific recognition, we mutated each nucleotide on the CAUC motif to its complementary sequence and assessed the binding affinities. Mutation of C 1 to its complementary nucleotide guanidine reduced the binding affinity by 1.8-fold, with a K D of 2.26 Ϯ 0.06 M (Fig. 4A). Similarly, the dissociation constant decreased by a factor of 2.2 (K D ϭ 2.77 M) for A2U, a factor of 1.1 (K D ϭ 1.34 M) for U3A, and a factor of 3.0 (K D ϭ 3.77 M) for C4G mutants (Fig. 4A), indicating that YB-1 CSD preferentially binds to the CAUC motif over its complementary mutants.
Next, we co-crystallized and determined the structures of YB-1 CSD in complex with RNA oligos carrying complementary mutations, including C1G (5Ј-U 0 G 1 A 2 U 3 C 4 U 5 -3Ј), A2U (5Ј-U 0 C 1 U 2 U 3 C 4 U 5 -3Ј), U3A (5Ј-U 0 C 1 A 2 A 3 C 4 U 5 -3Ј), and C4G (5Ј-U 0 C 1 A 2 U 3 G 4 U 5 -3Ј). Except for C1G, crystallization of YB-1 CSD with the other three mutant RNA oligomers was successful (Table 1), which allows us to obtain efficient information on the detailed interactions by comparing these protein-RNA complex structures. The binding mode of the four RNA oligos to YB-1 CSD is quite similar, with only a small conformational change at each nucleobase (Fig. 4, B-F).

Crystal structure of YB-1-RNA complex
Hydrogen bond interactions are important for the sequencespecific recognition of RNA. In the crystal structures of the YB-1 CSD and the complementary mutant RNA complexes, the hydrogen bond networks had changed. In the crystal structure of the YB-1 CSD-A2U complex, U 2 could not form hydrogen bonds with the main chain of Lys-118 and Ala-120 due to the lack of hydrogen bond donors (amine groups at the corresponding position in the CAUC complex) (Fig. 4G). The intra- Crystal structure of YB-1-RNA complex molecular interactions between A 2 and its sugar phosphate backbone are also missing in the A2U complex structures. Similarly, in the crystal structure of YB-1 CSD-U3A, U 3 is mutated to its complementary adenine. Thus, the amine group of Lys-64 is unable to form a hydrogen bond with C 4 of A 3 (Fig. 4H). Moreover, N 1 of A 3 could not form a hydrogen bond with Asp-83 due to the lack of hydrogen donors on N 1 . According to the previous SELEX results, the third nucleotide in the core CAUC motif is flexible, and U 3 could be cytosine. Mutation of U 3 to adenine only shows mild impact on the binding affinity, suggesting the tolerance of different nucleotides in this position (Fig. 4I). Moreover, no hydrogen bond was observed between G 4 and Asn-67 in the structure of YB-1 CSD-C4G complex, which is in contrast to the CAUC complex, where a hydrogen bond is formed between O 2 of C 4 and Asn-67. These findings further indicate that the YB-1 CSD has preference over the CAUC sequence.

YB-1 CSD forms a homodimer independent of RNA
Many RNA-binding proteins have a tendency to aggregate under low-salt conditions, with crowding agent, or upon binding to RNAs, which has been implicated in the aberrant formation of RNA granules in some diseases (40,41). Full-length YB-1 can form a homomultimer, corresponding to ϳ440 kDa, in a concentration-dependent manner (42). Moreover, the elution volume for CSD was smaller than expected on the sizeexclusion chromatography. Therefore, it may be concluded that full-length YB-1, including the CSD, tends to oligomerize in solution. In the crystal structures of YB-1 CSD and RNA complex, there is one molecule in an asymmetric unit. However, we observed intermolecular hydrogen bonds between symmetry-related YB-1 CSD molecules. YB-1 CSD exhibits as a homodimer (A-B) with a 2-fold axis. The area of the dimeric interface is 489.0 Å 2 (Fig. 5A). The carboxylate group of Asp-105 on molecule A interacts with the corresponding Asp-105Ј on molecule B, and this interaction is further stabilized by the main-chain amide group of Phe-66 and Phe-66Ј on each molecule (Fig. 5B). Moreover, Tyr-99 on molecule A forms a hydrogen bond with Glu-107Ј on molecule B. To exclude the possibility that dimerization of YB-1 CSD is due to crystal packing, we performed dynamic light scattering assays to monitor the molecular weight of YB-1 CSD in solution. A double mutant (DM) in which Tyr-99 and Asp-105 are substituted with Ala (Y99A/D105A) was generated to disrupt the intermolecular interactions. The result showed that the WT YB-1 CSD exhibited higher molecular weight than expected (Fig. 5C). As the protein concentration increased, no significant change in molecular weight was observed, which was similar to what was observed in a previous study, that YB-1 CSD behaves as a single peak on size-exclusion chromatography independent of protein concentration. Moreover, the elution profile of YB-1 CSD was earlier than expected, indicating that YB-1 CSD forms a stable multimer. In contrast, the DM exhibited a lower molecular weight that was approximately half of that

Crystal structure of YB-1-RNA complex
of the WT, supporting the notion that YB-1 CSD is likely to dimerize in solution. Interestingly, the oligomerization state was not affected by RNA binding because both the WT and the DM showed a similar behavior in the dynamic light scattering assay in the presence or absence of RNA oligos. Moreover, we performed size-exclusion chromatography to check the oligomerization state of YB-1 CSD (Fig. 5D). WT YB-1 CSD was eluted in a volume earlier than the DM, thereby validating that YB-1 CSD exists as a dimer.

Abrogation of CSD dimerization leads to a reduction in RNA binding and splicing activation
To test the homodimer formation of YB-1 proteins in vivo, we performed co-immunoprecipitation (co-IP) experiments. The constructs expressing FLAG-tagged and HA-tagged fulllength YB-1 were co-transfected into HEK293 cells. We observed that YB-1 is capable of interacting with itself in the presence of DNase and RNase (Fig. 5E, lane 1). Both mutants

Crystal structure of YB-1-RNA complex
containing only one residue mutation (Y99A or D105A) still formed the homodimer (Fig. 5E, lanes 2 and 3); however, YB-1 DM (Y99A/D105A) exhibited a dramatic decrease in the interactions (Fig. 5E, lane 4). These data indicate that YB-1 CSD is oligomerized in solution, and residues Tyr-99 and Asp-105 are important for YB-1 oligomerization both in vitro and in vivo.
To address whether the function of YB-1 DM is different from that of the WT protein, we first conducted a gel shift experiment to test the RNA-binding activity of full-length YB-1 WT and DM using an RNA substrate derived from CD44 exon v5 as reported previously (37). Compared to WT YB-1, the binding of YB-1 DM to RNA reduced (Fig. 5F). Next, to exclude the influence of endogenous YB-1, we performed in vivo splicing analysis of MDA-MB-231 cells after stable knockdown of YB-1. Consistent with previous results, knockdown of YB-1 inhibited exon v5 splicing. Expression of an shRNA-resistant YB-1 construct almost completely restored exon v5 inclusion, whereas the double mutant of YB-1 only partially restored exon v5 inclusion. Taken together, these results strongly suggest that the dimerization of the CSD is critical for YB-1-RNA interaction and its function in splicing regulation.

Discussion
In this study, we determined the crystal structures of YB-1 CSD in complex with its core binding motif CAUC or several mutant RNA oligos at atomic resolution. A careful comparison of these structures revealed the detailed interactions between YB-1 CSD and RNA, thereby providing the mechanistic insights into the recognition of RNA by YB-1 in a sequence-dependent manner. We further show that YB-1 CSD forms a homodimer both in crystal and in solution. Mutations that disrupt the dimeric interface fail to form the homodimer of the CSD and result in a decrease in activating the inclusion of an alternative exon.
Comparison of structures of other members of the cold shock protein family (36,(43)(44)(45) indicated that YB-1 CSD is structurally similar to the CSD of mammalian Lin28 proteins (mouse Lin28a CSD, PDB code 3TS2; human Lin28a CSD, PDB code 5UDZ), which binds to pre-let-7 family RNAs (Fig. 6, A-C). Unlike bacterial Csp proteins, which preferentially bind to polythymidine DNA, the CSDs of YB-1 and hLin28 proteins favor binding RNAs. By comparing our structure with mammalian Lin28 -RNA complex structures, the 3Ј-end of the RNA oligos exhibits different orientations. In our structures, the nucleobase of C 4 is stacking over the six-membered ring of the Trp-65 (Fig. 3B). The corresponding nucleotide takes different conformations either in the hLin28 CSD-RNA complex or in the mLin28 CSD-RNA complex (Fig. 6, D and E), suggesting a more flexible binding of the fourth nucleotide to Lin28 CSDs. This may be due to the hydrogen bond between Asn-70 and the 2Ј-OH of the sugar pucker of C 4 (Fig. 3B). Asn-70 is conserved in YB-1 proteins across species; however, it differs in Lin28 family proteins as well as in bacterial Csp proteins (Fig. 2B). In hLin28, the residue at the corresponding position is methionine, which is unable to form a hydrogen bond with the 2Ј-hydroxyl group. Moreover, the CSDs of YB-1 and hLin28 proteins function differently. Both proteins have multiple RNA-binding domains that play roles (37,45). However, the sequence-specific binding of Lin28 proteins to single-stranded RNA is medi-ated by the zinc-finger domain, and the CSD only shows limited sequence specificity (45,46). In contrast, in YB-1, the CSD plays the central role in sequence-specific recognition (37).
In this work, we also found that the YB-1 CSD forms a homodimer both in crystal and in solution. We further showed that the dimerization influences the RNA-binding ability of the CSD. Dimerization of the RNA-binding protein has been reported previously, such as proteins containing the KH and STAR domains. For example, T-STAR and Sam68 function in alternative splicing through the dimerization of the KH domain, thus looping out regions to skip exons for pre-mRNA splicing (47). The RNAbinding element is therefore located on the opposite side of the KH dimer in an anti-parallel manner. In our structures, however, the two YB-1 dimer-bound RNA strands whose 3Ј termini are directed toward one another were in a loop conformation (Fig.  5A). This orientation seems to exclude the possibility that longer RNA strands bind onto the surface of the YB-1 dimer in a geometry resembling the crystal structure. However, a longer RNA strand may interact with the other molecule of YB-1 CSD, although we did not observe a direct interaction between them in the crystal structure. It is likely that the CSD dimerization facilitates the multimerization of the YB-1 protein and its RNA-binding ability. The full-length YB-1 protein was previously reported to form multimers on selected mRNAs, which leads to the inhibition of the translation of some mRNAs and disassembly of stress granules in neurons by competing with TDP-43 and FUS (40). In our study, we showed that the dimerization of YB-1 CSD synergizes the binding affinity to RNAs and regulates splicing in vivo. Therefore, we speculated that the homodimer formation of YB-1 CSD may contribute to the selection of target mRNAs and initiate aggregation onto RNAs. YB-1 is an oncogenic protein and highly expressed in various malignant tumors, making it a potential therapeutic target. Here, we showed that disruption of the CSD dimer interface impairs the alternative splicing of CD44. The newly identified dimerization interface of YB-1 CSD is a potential target site for novel drugs.

Plasmid construction
A DNA fragment encoding the cold shock domain of human YB-1 (residues 50 -130) was cloned into the pET28-SMT3 vector between the BamHI and SalI sites, which contains an N-terminal Ulp1-cleavable His 6 -SUMO tag. YB-1 mutants were generated by two-step PCR method and cloned into p3ϫFlag-CMV10-YB-1 plasmid (48) or pcDNA3-FLAG-YB-1. All constructs were sequence-verified.

Protein expression and purification
His 6 -SUMO-tagged recombinant proteins were expressed in Escherichia coli BL21 (DE3) under kanamycin selection. Cells were grown to an A 600 of 0.6, and protein expression was induced with 0.2 mM isopropyl-␤-D-thiogalactopyranoside overnight at 18°C. The cells were collected by centrifugation at 4,000 rpm for 15 min and were lysed by a French Press (JNBIO) at 4°C. His 6 -SUMO-tagged proteins were purified by affinity chromatography using a HisTrap HP column (GE Healthcare), followed by the cleavage of the His 6 -SUMO tag using Ulp1 protease. The proteins were further purified by ion-exchange chro-Crystal structure of YB-1-RNA complex matography using a HiTrap S column (GE Healthcare) and eluted in a gradient of NaCl. Size-exclusion chromatography was performed for the final purification of WT and mutant YB-1 CSD using Superdex G75 Hiload 16/60 column (GE Healthcare). Fractions containing target proteins were pooled and concentrated to 10 mg/ml in a buffer containing 10 mM Tris⅐Cl, pH 8.0, 100 mM NaCl, and 1 mM DTT.

Crystallization and structure determination
YB-1 CSD was mixed with RNA oligonucleotides (Bioneer) in a 1:2 ratio and incubated on ice for 1 h before crystallization. Crystals were grown at 17°C by the hanging drop vapor diffusion method with the reservoir solution (0.1 M Hepes-HCl, pH 7.5, 2.0 M ammonium sulfate, and 5% isopropyl alcohol). Crystals of YB-1 CSD in complex with different RNA oligos were flash-frozen in liquid nitrogen. Diffraction data were collected at beamline BL17U1/ BL19U1 of the Shanghai Synchrotron Radiation Facility (SSRF). The data set was processed using the HKL-2000 or HKL-3000 program (49). The structure was solved by molecular replacement using the structure of B. subtilis CspB (PDB code 3PF5) as a search model. Model building and structure refinement were carried out using COOT (50) and PHENIX (51). Buried surface area was calculated by PISA (52). All the of structural graphics were generated using PyMOL version 1.8 (Schrödinger, LLC, New York). The statistics of data collection and refinement were summarized in Table  1. The atomic coordinates and associated structure factors have been deposited in the Protein Data Bank under accession codes 5YTV, 5YTS, 5YTX, and 5YTT.

Fluorescence polarization assay
WT or mutant YB-1 CSD proteins were diluted using a buffer containing 20 mM Tris-HCl, pH 8.0, 100 mM NaCl, 1 mM DTT, 5% glycerol to a series of concentrations, including 400, 200, 100, 50, 25, 12.5, 6.25, 3.125, 1.563, 0.781, and 0.391 M. The FAM-labeled RNA oligonucleotides were diluted in the same buffer to a final concentration of 1 nM. The final volume of each well in a 96-well plate (Corning) was brought to 100 l with the same buffer and incubated at room temperature in the dark for 30 min. The fluorescence polarization values were measured at an excitation wavelength of 485 nm and an emission wavelength of 528 nm using the Synergy Neo Multi-Mode Reader (Bio-Tek) at 25°C. The normalized fluorescence change was quantitated using GEN 5, and the dissociation constant (K D ) values were fitted and analyzed using the Dose Response function in OriginPro version 8.5.

Dynamic light scattering
All the samples used in dynamic light scattering experiments were centrifuged at 14,000 rpm for 10 min to remove large aggregates. The experiments were performed to measure average size of micelles using DynaPro Nanosta (Wyatt) operating at a wavelength of 658 nm at 4°C. WT and mutant YB-1 CSD proteins were diluted in a buffer containing 10 mM Tris-HCl, pH 8.0, 100 mM NaCl, 0.1 mM DTT to a series of concentrations of 1.25, 2.5, 5, and 10 mg/ml. Each measurement was repeated three times to ensure reproducibility. The data were analyzed with OriginPro version 8.5.

Analytical size-exclusion chromatography
Analytical size-exclusion chromatography was performed for WT and mutant YB-1 CSD proteins using a Superdex 75 Increase 10/300 column (GE Healthcare) in a buffer containing 10 mM Tris-HCl, pH 8.0, 100 mM NaCl, and 1 mM DTT at a flow rate of 0.5 ml/min.

Crystal structure of YB-1-RNA complex
Gel shift assay 130 fmol of 32 P-labeled RNAs were incubated at 30°C under standard splicing conditions with different molar excess of YB-1 proteins. A 5-l aliquot was removed and transferred to a new tube containing 1 l of tRNA (2 mg/ml). RNA-protein complexes were fractionated on a 5% native polyacrylamide gel.

Co-IP
48 h after transfection, HEK293 cells were harvested and lysed in co-IP lysis buffer (50 mM Tris-HCl, pH 7.5, 150 mM KCl, 0.5% Nonidet P-40, 1 mM phenylmethylsulfonyl fluoride, 2 g/ml RNase A) at 4°C for 15 min with rotating. The lysate was centrifuged at 13,000 rpm for 10 min at 4°C. The supernatant was collected and incubated with 10 l of anti-FLAG M2 beads (Sigma) at 4°C for 3 h with rotating. The beads were subsequently washed three times with 1 ml of co-IP washing buffer (50 mM Tris-HCl, pH 7.5, 500 mM KCl, 0.1% Nonidet P-40, 1 mM phenylmethylsulfonyl fluoride). The bound material was fractionated by SDS-PAGE followed by blotting with anti-HA antibody (Roche Applied Science).

In vivo splicing
The day before transfection, 3 ϫ 10 5 MDA-MB-231 cells were seeded into a 3.5-cm culture dish. YB-1 expression plasmids and CD44 exon v5 minigene, pET-v5, were transfected using Lipofectamine 2000 (Invitrogen) according to the manufacturer's instructions. 48 h after cell transfection, total RNAs were isolated using TRIzol (Invitrogen). Total RNA (2.5 g) was annealed to oligo(dT) 18 and reverse-transcribed by Moloney murine leukemia virus (Promega) according to the manufacturer's instructions. The resulting first-strand cDNA was further amplified by PCR, using primers pET-for and pET-rev (37).