JBC Oz Biosciences

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Louie, D. F.
Right arrow Articles by Ahn, N. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Louie, D. F.
Right arrow Articles by Ahn, N. G.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Volume 271, Number 45, Issue of November 8, 1996 pp. 28189-28198
©1996 by The American Society for Biochemistry and Molecular Biology, Inc.

Mass Spectrometric Analysis of 40 S Ribosomal Proteins from Rat-1 Fibroblasts*

(Received for publication, May 23, 1996, and in revised form, August 8, 1996)

Donna F. Louie Dagger §, Katheryn A. Resing §, Timothy S. Lewis § and Natalie G. Ahn Dagger §par

From the Dagger  Howard Hughes Medical Institute and the § Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado 80309

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
Acknowledgments
REFERENCES


ABSTRACT

Although sequences of most mammalian ribosomal proteins are available, little is known about the post-translational processing of ribosomal proteins. To examine their post-translational modifications, 40 S subunit proteins purified from Rat-1 fibroblasts and their peptides were analyzed by liquid chromatography coupled with electrospray mass spectrometry. Of 41 proteins observed, 36 corresponded to the 32 rat 40 S ribosomal proteins with known sequences (S3, S5, S7, and S24 presented in two forms). The observed masses of S4, S6-S8, S13, S15a, S16, S17, S19, S27a, S29, and S30 matched those predicted. Sa, S3a, S5, S11, S15, S18, S20, S21, S24, S26-S28, and an S7 variant showed changes in mass that were consistent with N-terminal demethionylation and/or acetylation (S5 and S27 also appeared to be internally formylated and acetylated, respectively). S23 appeared to be internally hydroxylated or methylated. S2, S3, S9, S10, S12, S14, and S25 showed changes in mass inconsistent with known covalent modifications (+220, -75, +86, +56, -100, -117, and -103 Da, respectively), possibly representing novel post-translational modifications or allelic sequence variation. Five unidentified proteins (12,084, 13,706, 13,741, 13,884, and 34,987 Da) were observed; for one, a sequence tag (PPGPPP), absent in any known ribosomal proteins, was determined, suggesting that it is a previously undescribed ribosome-associated protein. This study establishes a powerful method to rapidly analyze protein components of large biological complexes and their covalent modifications.


INTRODUCTION

Ribosomes are complex structures consisting of large numbers of proteins that interact with rRNA to form a functional protein-synthesizing entity. Much progress has been made in identifying the components of prokaryotic ribosomes and in understanding their structure and function in protein synthesis (reviewed in Refs. 1 and 2). However, less information is available for the more complex eukaryotic ribosomes. Previous studies have identified several covalent modifications of eukaryotic rRNA, demonstrating their importance in the enzymology and stability of ribosomes (reviewed in Refs. 2 and 3). Less is known about the covalent modifications of eukaryotic ribosomal proteins, although protein and cDNA sequence information is now available for nearly all of the yeast, rat, and human ribosomal proteins (reviewed in Refs. 4, 5, 6). Nevertheless, protein co- or post-translational modifications may have effects on the activity, stability, assembly, or localization of eukaryotic ribosomes.

Mass spectrometry can be used to identify and characterize protein covalent modifications (reviewed in Refs. 7 and 8) without prior knowledge of their chemistry, providing an unbiased approach that is not achievable using techniques such as metabolic radiolabeling or chemical derivatization. Electrospray ionization mass spectrometry (9) is a particularly sensitive means of obtaining masses of proteins up to 100,000 Da with uncertainties of ~0.01% from small amounts of sample (nano- to micrograms) (reviewed in Refs. 10 and 11). When electrospray ionization mass spectrometry is coupled with reversed-phase HPLC1 (LC/MS), components within mixtures can be analyzed by mass as they elute from columns, allowing rapid analysis of proteins or peptides. Tandem mass spectrometry (MS/MS), in which peptide ions are fragmented by collision-induced dissociation (CID), provides amino acid sequence information, enabling identification of modified residues (12). CID can be performed on all components in a sample by raising the orifice voltage (13) or on a single ion selected in the first quadrupole, activated by argon in the second quadrupole, and analyzed in the third quadrupole (14). Multisubunit complexes can also be analyzed by mass spectrometry, allowing identification of the protein components and their covalent modifications. Only two examples of this type of analysis have been reported previously, examining subunits of cytochrome bc1 (15) and eukaryotic nucleosomes (16). In each case, protein components (<11) were first extracted and purified from these complexes prior to mass spectral analysis. In principle, the purification can be eliminated by the use of LC/MS analysis, saving time and avoiding loss by handling. Furthermore, the high sensitivity of the method enables analysis of limiting amounts of material, such as samples purified from cultured mammalian cells.

In this study, we report new information about the protein composition of 40 S ribosomal subunits and the nature of co- or post-translational modifications of ribosomal proteins. Our study demonstrates a novel application of LC/MS and LC/MS/MS to rapidly identify the protein constituents and protein modifications of a large biological complex purified from cultured mammalian cells.


EXPERIMENTAL PROCEDURES

Cell Culture

Rat-1 fibroblasts were grown at 37 °C in Dulbecco's minimal essential medium (Life Technologies, Inc.) in the presence of 10% fetal bovine serum (Sigma) and 100 µg/ml each streptomycin and penicillin. Cells were seeded at 3 × 105 cells/dish in 30 150-cm2 culture dishes and allowed to grow to confluence for ribosome purification.

Preparation of 40 S Ribosomal Proteins

40 S ribosomes were purified by a procedure modified from Terao and Ogata (17). All procedures were performed at 4 °C unless otherwise stated. Cells were rinsed twice with 5 ml of phosphate-buffered saline and twice with 5 ml of 0.25 M sucrose, 25 mM KCl, 5 mM MgCl2, 1 mM benzamidine (Sigma), 20 µg/ml each leupeptin and aprotinin (U. S. Biochemical Corp.), 5 µg/ml pepstatin (U. S. Biochemical Corp.), 30 µM 3,4-dichloroisocoumarin (Sigma), and 50 mM Tris-HCl, pH 7.6. Cells were drained, scraped, and then homogenized with a tight-fitting glass homogenizer. The homogenate was centrifuged at 27,000 × g for 10 min, and the resulting supernatant was centrifuged at 350,000 × g for 100 min. Pellets were resuspended in 1.4 ml of 0.15 M sucrose, 10 mM MgCl2, 25 mM KCl, and 35 mM Tris-HCl, pH 7.6, to which was added 0.4 ml of 2.5 M KCl and 10 mM MgCl2 and 0.2 ml of 10% (w/v) sodium deoxycholate. Approximately 0.5 ml of the suspension was layered onto 2.5 ml of 0.3 M sucrose, 10 mM MgCl2, 0.6 M KCl, and 35 mM Tris-HCl, pH 7.6, and centrifuged at 350,000 × g for 2 h. The resulting pellets were resuspended in 0.7-1.0 ml of 0.25 M sucrose, 5 mM MgCl2, 50 mM KCl, 10 mM NaHCO3, and 50 mM Tris-HCl, pH 7.6, and centrifuged at 27,000 × g for 10 min, yielding a supernatant enriched in 80 S ribosomes.

80 S ribosomes were dissociated into 40 S and 60 S subunits by addition of puromycin (Sigma) to 0.1 mg/ml and incubation for 10 min at 37 °C, followed by addition of beta -mercaptoethanol to 5 mM, KCl to 0.7 M, and MgCl2 to 1.25 mM and incubation for 15 min at 37 °C. To purify ribosomal subunits, the dissociated ribosomes (0.5 ml) were layered onto 15-30% linear sucrose gradients (11.5 ml) containing 0.3 M KCl, 3 mM MgCl2, 20 mM Tris-HCl, pH 7.6, and 0.5 mM dithiothreitol, and the gradients were centrifuged at 125,000 × g for 13 h in a Beckman SW 41 Ti rotor. Gradients were eluted at 0.5 ml/min in 0.5-ml fractions and analyzed by 15% SDS-polyacrylamide gel electrophoresis visualized by silver staining (18). Fractions containing 40 S subunits were pooled and diluted 2-fold with 0.6 M KCl, 6 mM MgCl2, 40 mM Tris-HCl, pH 7.6, and 1 mM dithiothreitol, and subunits were pelleted in 2-3-ml volume at 490,000 × g for 4 h. The final pellets were stored at -80 °C until use.

To dissociate 40 S ribosomal proteins from rRNA, each pellet was resuspended in 100 µl of 6 M guanidine HCl, pH 5.0, followed by addition of 200 µl of 10% acetic acid and incubation on ice for at least 5 min. Precipitated rRNA was removed by centrifugation at 10,000 rpm for 2 min at room temperature. Supernatants were used for direct LC/MS analyses as well as for partial protein purification and peptide generation for further mass spectral analyses.

Protein Purification and Peptide Generation for Mass Spectral Analyses

40 S ribosomal proteins (rRNA-free) were partially resolved on either a Brownlee C4 microbore (Applied Biosystems, Inc., 2.1 mm × 3 cm) or a POROS R120 PEEK reversed-phase column (PerSeptive Systems; 2.1 mm × 3 cm). Columns were equilibrated in 0.1% trifluoroacetic acid, and proteins were eluted with a gradient in 0-80% acetonitrile and 0.08% trifluoroacetic acid at 1% acetonitrile/min and 200 µl/min. Protein elution was monitored at 215 nm by UV absorbance detection (Applied Biosystems, Inc., Model 785A; 2.4-µl flow cell), and fractions were collected manually. In some cases, post-column splitting was used to direct 10-15% of the HPLC effluent to the mass spectrometer and the rest to the UV absorbance detector for manual fraction collection. Acetonitrile/trifluoroacetic acid in the fractions was removed by lyophilization, and mass spectral analyses of either intact or proteolyzed proteins were performed after solubilization with 20 µl of 8 M urea and 80 µl of 0.1 M Tris-HCl, pH 8. Proteolysis was carried out with 5% (w/w) endoproteinase Lys-C (Wako Bioproducts) for 3-4 h at 37 °C. Prior to mass spectral analysis, samples were acidified by addition of formic acid to a final concentration of 1% (v/v).

Mass Spectral Analyses

Mass determinations of proteins or peptides were performed as described (19, 20, 21) using an HPLC (Applied Biosystems, Inc., Model 140B) directly coupled to a Perkin-Elmer Sciex API-III triple quadrupole mass spectrometer equipped with a nebulization-assisted electrospray source and a high pressure collision cell. Protein components of 40 S subunits were separated on fused silica capillary columns (500 µm × 20 cm) packed with POROS R120 resin. Following column equilibration in 0.1% formic acid, proteins were loaded and eluted with a gradient in 0-80% acetonitrile and 0.1% formic acid at 1% acetonitrile/min and 20 µl/min. Peptides generated after proteolysis were analyzed in a similar manner, except that Vydac C18 resin (Hewlett-Packard Co.) buffered with 0.05% trifluoroacetic acid was used; peptides were eluted with a 0-40% acetonitrile gradient at 2% acetonitrile/min. Mass spectra for proteins were obtained by scanning from 350 to 1800 Da/e with 0.13-Da/e step size and 0.5-ms dwell time. The ionspray needle was held at a potential of 4.8 kV with a nebulizing air flow rate of 0.6 liter/min and an orifice voltage of 75-80 V. Conditions for peptide analysis were identical except that spectra were obtained by scanning from 50 to 1600 Da/e with 0.2-Da/e step size, and the ionspray needle was decreased to 4.5 kV; to obtain CID at the orifice, the orifice voltage was increased to 95 V.

MS/MS was performed as proteins or peptides eluted from the capillary column into the mass spectrometer (LC/MS/MS) (19, 20, 21, 22). Individual ions were selected for CID in the first quadrupole and then accelerated to a kinetic energy of 16-18 eV and collisionally activated with argon at a thickness of 3.5-4.2 × 1014 atoms/cm3 in the high pressure collision cell of the second quadrupole. MS/MS data were collected in the third quadrupole by scanning from 50 to 1600 Da/e with 0.15-Da/e step size and 1.0-ms dwell time. Fragment ions produced were usually ``b'' and ``y'' types; in some cases, doubly charged fragment ions were observed (referred to as b2+ and y2+, corresponding to (b + 1H)2+ and (y + 1H)2+, respectively, in the nomenclature of Biemann (8)). LC/MS and LC/MS/MS data were analyzed using software provided by Perkin-Elmer Sciex. Figures were prepared by importing data into Sigmaplot (Jandel Scientific), Photoshop (Adobe), or Canvas (Deneba).


RESULTS

Mass Determinations of 40 S Ribosomal Proteins

Ribosomes were purified from Rat-1 fibroblasts and separated into 60 S and 40 S subunits. 40 S ribosomal proteins were dissociated from 18 S rRNA and analyzed by LC/MS. Fig. 1 displays the mass/charge ratio (m/z) of the ribosomal proteins versus HPLC elution. Each protein is observed as a harmonic series of coeluting ions with varying m/z values. For example, in scans 328-330 (Fig. 1, arrow), a discrete series is observed that is clearly separated from neighboring series. Several scans were summed to produce an ion spectrum; for example, Fig. 2A illustrates an m/z series from MH99+ to MH2020+. Protein masses were then derived by deconvolution of the m/z series; for example, Fig. 2B shows the deconvolution of the data in Fig. 2A, indicating a mass of 14,709 Da. Even in situations where proteins coeluted (for example, scans 195-215), each protein produced a separate ion series that deconvoluted into distinguishable masses. Masses were accepted as valid ribosomal components if they were observed in each of three independent 40 S ribosomal protein preparations. All proteins selected by this criterion were observed with signal intensities >5-fold above background. Experimental errors were typically <0.01% of total protein mass, as measured by standard deviations of independent determinations represented by each set of m/z values as well as by standard deviations of mass determinations from three independent preparations of 40 S ribosomal proteins. Observed masses did not vary when ribosomes were prepared in the presence versus absence of protease inhibitors, indicating that proteolysis after cell disruption was not significant.


Fig. 1. On-line coupled LC/MS analysis of 40 S ribosomal proteins. Mass spectral data were collected as proteins eluted from a capillary POROS R120 HPLC column and converted digitally to gray scale, where depth of color reflects ion intensities (see ``Experimental Procedures''). The m/z values are plotted against elution time, measured by mass spectral scan number and minutes. Each set of discrete bands eluting together represents an ion series of a single protein, as described under ``Results.'' The arrow indicates the ion series of the protein discussed in Fig. 2.
[View Larger Version of this Image (64K GIF file)]



Fig. 2. Analysis of protein mass. A, an example of a mass spectrum was derived by summing several scans of LC/MS (scans 328-330 from Fig. 1). Peaks represent individual charged forms (m/z values indicated above) and correspond to MH99+ to MH2020+ (from right to left; the smallest peak that is not labeled is MH2020+ = 736.1 Da/e). Protein mass can be calculated from each peak using the formula (M + z)/z = observed mass of ion, where M is protein mass and z is charge. Protein masses in Table I are determined from these calculations. B, the deconvoluted mass spectrum of the m/z series in A indicates a major protein of 14,708.7 Da and two minor proteins of 14,740.8 and 14,766.0 Da; these minor peaks most likely represent an oxidation product of the major protein and a contaminating adduct (+57 Da) variably found in our experiments, respectively. The areas under these peaks are semiquantitative approximations of the relative amounts of each form.
[View Larger Version of this Image (20K GIF file)]


LC/MS analyses of the 40 S ribosomal proteins yielded 41 protein masses. Other investigators have identified 33 rat 40 S ribosomal proteins, and amino acid and/or cDNA sequence information is available for 32 of these proteins (reviewed in Ref. 4). Comparison of our observed masses with masses predicted from amino acid sequences showed that, within experimental error, 12 of the 41 proteins had observed masses identical to the predicted masses of S4, S6, S7, S8, S13, S15a, S16, S17, S19, S27a, S29, and S30 (Table I). Of these 12 proteins, two (S6 and S7) retained their initiator Met as indicated by amino acid sequencing (23, 24), and S27a and S30 were observed without their respective N-terminal ubiquitin and ubiquitin-like fusions (25, 26). (Proteins that have the initiator Met removed after translation, as demonstrated by previous studies, are noted by Footnote j in Table I.)

Table I.

Identification of the 40 S ribosomal proteins by mass and by recovery of endoproteinase Lys-C peptides, comparing with protein and peptide masses predicted from primary sequences


Elution timea Nameb Observed massc Predicted massd Mass differencee Residues and masses of modifications in identified peptidesf No. of observed peptidesg Coverageh

min Da Da Da %
39.5 Sa 32,734.1  ± 1.3 32,824.1  -90.0 1-11 (-89.8), 12-17, 18-40, 43-52, 53-57, 58-89 [1-17 (-88.6)] 6 /12 29.5 (295)
36.2 S2 31,451.0  ± 1.6 31,231.4 219.6 1-54 (+219.6), 59-65, 66-71, 109-114, 115-125, 126-142, 146-159, 160-173, 213-238, 239-246, 247-257, 258-263, 264-275, 278-293 [1-58 (+219.5), 75-89, 90-108, 247-263] 14 /25 84.6 (293)
34.8 S3 26,585.4  ± 0.7 26,660.3  -74.9 11-18, 63-75, 76-90, 133-141, 152-185, 188-197, 215-227 [149-185, 186-197, 202-214, 228-243] 7 /20 56.0 (243)
34.8i S3 26,298.2  ± 1.0 26,660.3  -362.1 228-243 (-288.0)
32.0 S3a 29,815.3  ± 1.4 29,945.0  -129.7 21-27, 29-34, 47-56, 57-63, 64-83, 86-94, 153-166, 188-195, 223-227, 228-240 [20-27, 21-28, 241-264] 10 /39 47.3 (264)
29.2 S4 29,468.6  ± 2.9 29,466.6j +2.0 10-15, 16-21, 22-36, 53-61, 75-93, 94-105, 106-119, 128-133, 134-154, 155-167, 168-173, 211-229, 233-241, 242-253 [16-36, 230-241] 14 /28 64.9 (262)
37.9 S5 22,817.2  ± 0.8 22,878.4  -61.2 1-18 (-89.8), 19-22, 23-42, 183-191, 194-201 [19-42, 86-94, 193-201] 5 /16 33.8 (204)
37.9k S5 22,948.0  ± 0.6 22,878.4 +69.6 1-18 (+42.0)
20.3 S6 28,682.6  ± 1.6 28,680.7 +1.9 15-23, 24-30, 31-46, 47-58, 59-64, 65-79, 80-93, 120-136, 144-149, 176-188, 189-195, 212-215, 224-230, 231-243, 244-249 [150-164, 168-172, 189-196, 202-211] 15 /36 73.5 (249)
30.4 S7 22,127.6  ± 1.6 22,126.9 +0.7 1-7, 38-49, 50-58, 59-70, 75-85, 91-103, 116-142, 143-147, 148-155, 156-160, 170-178, 184-194 [1-10, 11-37, 91-107] 12 /25 84.0 (194)
31.0l S7 22,169.0  ± 0.9 22,126.9 +42.1 1-7 (+42.1)
23.2 S8 24,073.3  ± 0.2 24,074.0j  -0.7 1-9, 26-36, 37-52, 54-74, 75-93, 98-123, 128-138, 157-169, 170-192, 193-200 [24-36, 37-53, 94-123, 128-139, 201-205] 10 /31 82.1 (207)
29.9 S9 22,460.7  ± 0.9 22,374.4j +86.3 11-21, 22-29, 30-39, 40-46, 47-51, 55-65, 66-90, 93-100, 101-115, 116-120, 139-154, 180-193 [30-46, 47-54, 91-100, 179-193] 12 /20 73.1 (193)
25.3 S10 18,972.8  ± 1.2 18,915.8 +57.0 1-5, 7-17, 18-24, 26-31, 32-38, 39-47, 48-53, 54-59, 108-138 [1-6, 18-25, 139-165 (+55.9)] 9 /14 70.9 (165)
22.4 S11 18,341.5  ± 0.6 18,430.8  -89.3 1-12 (-89.1), 13-20, 22-30, 33-38, 39-45, 49-58, 82-98, 99-104, 137-144 [49-59, 154-158] 9 /22 56.3 (158)
33.0 S12 14,425.3  ± 0.8 14,525.0  -99.7 1-23, 24-40, 122-129 [94-102] 3 /16 43.2
(132)
26.5 S13 17,091.2  ± 0.7 17,091.1j +0.1 1-8, 9-26, 27-33, 34-38, 43-69, 70-75, 78-92, 94-99, 112-129, 132-139, 140-150 [27-38, 42-69, 78-93, 94-106, 130-139, 132-150] 11 /19 93.3 (150)
26.5 S14 16,141.6  ± 0.2 16,258.7  -117.1 64-86, 87-96, 97-106, 107-125, 126-143, 144-151 [62-86] 6 /13 58.3 (151)
24.3 S15 16,950.2  ± 0.9 17,040.2m  -90.0 1-7 (-89.0), 10-14, 15-52, 53-58, 66-72, 73-77, 78-100, 101-108, 128-145 [59-64, 65-72, 66-77, 101-124] 9 /16 96.6 (145)
33.8 S15a 14,708.2  ± 0.2 14,708.6j  -0.4 1-11, 12-18, 22-31, 32-42, 43-59, 60-70, 84-87, 88-118, 124-129 [71-87] 9 /12 93.8 (129)
26.5 S16 16,314.1  ± 0.1 16,314.4j  -0.3 26-32, 33-49, 50-59, 60-72, 73-89, 90-97, 131-142 [4-16, 17-32, 98-105, 130-142] 7 /17 79.3 (145)
28.0 S17 15,378.6  ± 0.3 15,377.9j +0.7 11-18, 19-31, 59-71, 72-102, 107-128, 129-134 [36-48] 6 /14 79.1 (134)
28.0 S18 17,629.5  ± 0.3 17,718.7  -89.2 1-8 (-88.9), 9-25, 26-34, 35-47, 48-54, 55-78, 79-88, 95-106, 107-115, 117-137, 138-150, [107-116, 138-151] 11 /16 95.4 (152)
24.3 S19 15,954.2  ± 0.5 15,954.3j  -0.1 1-6, 7-22, 29-37, 43-76, 77-96, 97-110, 122-142, [7-23, 27-37, 41-76] 7 /16 86.8 (144)
25.3 S20 13,283.1  ± 0.1 13,372.8  -89.7 9-30, 52-59, 60-67, 76-86, 87-99 [50-59] 6 /14 53.8 (119)
25.3 S21 9,168.8  ± 0.6 9,127.4 +41.4 1-16 (+41.5), 17-27, 28-51, 75-81 4/6 69.9
(83)
20.3 S23 15,692.5  ± 0.6 15,676.6j +15.9 12-20, 29-36, 37-47, 48-53, 54-59, 60-67 (+16.1), 108-120, 124-134 [124-136] 8 /21 52.1 (142)
20.3 S24 15,464.8  ± 0.2 15,423.2 +41.6 1-11 (+42.4), 12-21, 22-32, 33-37, 38-43, 50-68, 69-83, 89-99 [89-100, 101-105] 8/24 70.7 (133)
20.3n S24 15,111.3  ± 0.3 15,423.2  -311.9
23.2 S25 13,639.0  ± 0.3 13,742.2  -103.2 44-52, 61-66, 67-78, 79-94, 103-114, 115-125 [21-29] 6 /28 60.0 (125)
20.3 S26 12,885.1  ± 0.3 13,015.5  -130.4 5-12, 39-66, 67-70, 71-82, 83-93, 94-113 [4-12, 5-13, 13-32, 38-66] 6 /14 92.2 (115)
Elution timea Nameb Observed massc Predicted massd Mass differencee Residues and masses of modifications in identified peptidesf No. of observed peptidesg Coverageh

min Da Da Da %
20.3 S27 9,388.2  ± 0.2 9,477.0  -89.0 1-15 (-131.4), 23-36, 71-82 [1-16 (-130.8)] 3/11 51.2 (84)
19.1 S27a 9,403.3  ± 0.4 9,404.1o  -0.8 8-13, 24-28 [7-13, 8-14] 2/18 16.3
(80)
21.6 S28 7,883.0  ± 0.4 7,841.1 +41.9 1-10 (+58.2), 11-16, 48-69 3/4 55.1
(69)
19.6 S29 6,545.2  ± 0.1 6,545.8j  -0.6 1-12, 13-32, 48-53 [48-55] 3/5 72.7
(55)
11.7 S30 6,647.7  ± 0.1 6,647.9p  -0.2 2-11, 34-51 [1-11, 34-52] 2/14 50.8
(59)
25.3 Sbq 12,084.3  ± 0.7
37.5 Sc 13,706.1  ± 0.5
33.0 Sd 13,741.1  ± 1.0
33.0 Se 13,883.5  ± 0.7
37.5 Sf 34,987.3  ± 1.7

a  Elution time from Fig. 1.
b  Nomenclature of Wool et al. (4).
c  Average observed mass and standard deviation obtained from three independent 40 S preparations.
d  Predicted mass calculated from translated cDNA sequence obtained from Swiss-Prot, PIR, or GenBank data bases (All include the initiator Met, except where indicated).
e  Mass difference calculated as observed mass minus predicted mass.
f  Peptides identified by LC/MS and high orifice voltage CID sequencing are indicated by residue numbers. Peptides identified by mass alone are indicated in light-face Roman type. Peptides identified by three or more fragment ions by CID are indicated in bold-face italic type. Peptides identified by less than three fragment ions by CID are indicated in light face italic type; in these cases, fragment ions were consistent with fragmentation N-terminal to Pro. A bold-face number in parentheses indicates the observed mass difference from the predicted mass for that peptide. Brackets indicate an incomplete proteolytic product. In all cases, observed masses were within 1 Da of predicted masses. Monoiotopic masses were used for peptides <1500 Da and average masses for larger peptides.
g  Numerator and denominator represent the number of observed and predicted peptides, respectively, from a complete endoproteinase LysC digestion (not including incomplete proteolytic products).
h  Percent coverage of residues observed from LC/MS. The total number of residues in each protein is indicated in parentheses.
i  A protein coeluting with S3 that appears to be a variant form of S3 (see ``Results'').
j  N-terminal Met is removed after translation for S4 (43), S8 (44), S9 (45) S13 (46), S15a (47), S16 (48), S17 (49), S19 (50), S23 (51), and S29 (52).
k  A protein coeluting with S5 that appears to be a methionylated variant of S5 (see ``Results'').
l  A major protein eluting after S7 that appears to be an acetylated form of S7 (see ``Results'').
m  Previously shown to be acetylated at the N-terminus after removal of the initiator Met (29).
n  A protein coeluting with S24 that appears to be a proteolyzed form of S24 (see ``Results'').
o  Mass without ubiquitin N-terminal fusion domain, predicted from the cDNA sequence (25).
p  Mass without ubiquitin-like protein fusion domain, predicted from the cDNA sequence (26).
q  Proteins unidentifiable with any of the known ribosomal proteins.

Identification of 40 S Ribosomal Proteins

To verify these assignments and to identify the remaining 29 proteins, ribosomal proteins were partially purified, digested with endoproteinase Lys-C, and analyzed by LC/MS. An example of peptide mapping by LC/MS is shown in Fig. 3A for a protein with a mass of 31,451 Da. Peptide masses observed by LC/MS of the endoproteinase Lys-C digests were compared with predicted peptide masses for each of the ribosomal proteins. In the example shown, 16 observed peptide masses matched, within experimental error, to peptide masses predicted from the known amino acid sequence of ribosomal protein S2, establishing the identity of this protein (Table II and Fig. 3B). In general, four categories of peptides were often not found in peptide maps: (i) small hydrophilic peptides that might not bind to the Vydac C18 resin, (ii) large hydrophobic peptides that might not release from the resin, (iii) cysteine-containing peptides that might form intramolecular or intermolecular disulfide bridges, and (iv) covalently modified peptides (e.g. peptides 1 and 1/2 from S2; see below). In many cases, peptide identities were confirmed by high orifice voltage CID, which yielded sequence information (13). For example, peptide 22 from S2 generated fragment ions with masses of 446.4, 685.2, 759.4, 872.2, 1035.4, and 1122.8 Da/e (Fig. 3, arrow at scan 526), representing the observed y3 and y5-9 ions for this peptide, respectively (Fig. 4). (CID sequencing was also observed for peptides 4, 14, 22, 24, and 25 (Table II and Fig. 3B).)


Fig. 3. LC/MS analysis of an endoproteinase Lys-C digest of S2. A, S2 was partially purified by reversed-phase HPLC, lyophilized, and digested with endoproteinase Lys-C, and peptides were analyzed by LC/MS (see ``Experimental Procedures''). The m/z values are plotted against elution as in Fig. 1, except that most peptides are represented by only one parent ion. In some cases, peptide ions were fragmented by high orifice voltage CID; these appear as lines of less intense fragment ions trailing below more intense parent ions, e.g. peptide 22 from S2 (arrow), located at scans 526-529 and discussed in the legend to Fig. 4. The N-terminal peptides 1 and 1/2, located at scans 431-434 (asterisks and arrowheads, respectively), showed a modification of +220 Da as discussed under ``Discussion.'' B, the amino acid sequence of S2 is shown with peptide coverage. Slashes indicate the expected cleavages by endoproteinase Lys-C. The observed S2 peptides are underlined, and the N-terminal peptides observed with mass differences of +220 Da are italicized. Fragment ions generated by high voltage CID of the endoproteinase Lys-C peptides are indicated by lfloor for y ions and rceil for b ions.
[View Larger Version of this Image (63K GIF file)]


Table II.

Identification of endoproteinase Lys-C peptides from S2 by mass


Peptide No.a Elution timeb Residue No.c Predicted massd Observed m/ze Observed massf

min Da Da/e Da
1 22.5 1 -54 4963.5 1037.6 (5+), 1296.8 (4+) 5183.1
1/2g 22.5 1 -58 5406.9 938.8 (6+), 1126.2 (5+), 1407.6 (4+) 5626.4
3h 20.6 59 -65 871.5 872.6 (1+) 871.6
4i 16.7 66 -71 684.5 685.6 (1+) 684.6
6/7g 23.8 75 -89 1793.2 897.4 (2+) 1792.8
8/9g 36.4 90 -108 2153.5 1077.8 (2+) 2153.6
10h 15.1 109 -114 714.4 715.6 (1+) 714.6
11 14.2 115 -125 1348.5 675.0 (2+) 1348.0
12 25.3 126 -142 1717.0 859.4 (2+) 1716.8
14i 24.7 146 -159 1425.7 713.6 (2+) 1425.2
15 23.3 160 -173 1644.9 823.4 (2+) 1644.8
20j 29.9 213 -238 2724.2 908.6 (3+), 1361.4 (2+) 2721.8
21 17.8 239 -246 851.4 852.4 (1+) 851.4
22i 27.4 247 -257 1386.6 1387.6 (1+) 1386.6
22/23g 29.9 247 -263 2092.4 1047.0 (2+) 2092.0
23 16.0 258 -263 723.4 724.4 (1+) 723.4
24i 24.1 264 -275 1463.6 1463.8 (1+), 732.6 (2+) 1463.0
25i 16.8 276 -293 1924.1 962.8 (2+) 1923.6

a  Peptides generated from a digestion of S2 by endoproteinase Lys-C, numbered from the N terminus.
b  Elution time from the data in Fig. 3.
c  Residues are numbered as described by Suzuki et al. (53).
d  Calculated using monoisotopic masses for peptides <1300 Da and average masses for peptides >1300 Da.
e  Peptide ions observed by LC/MS. The charge of the ion is indicated in parentheses.
f  Peptide masses calculated from the observed m/z values.
g  Incomplete proteolytic product.
h  Confirmed by CID data with less than three fragmented ions; the CID ions were consistent with fragmentation N-terminal to Pro.
i  Confirmed by CID data with three or more fragmented ions.
j  This peptide contains two cysteines. The mass difference of -2.0 Da from the predicted mass likely represents a recovery of an intramolecular disulfide form.


Fig. 4. Peptide sequencing by high orifice voltage CID. A high orifice voltage CID mass spectrum of peptide 22 from S2 was obtained by summing scans 526-529 in Fig. 3. The sequence of this peptide is shown with peptide cleavages (lfloor ) that give rise to y5, y6, y7, and y9 (658.2, 759.4, 872.2, and 1122.8 Da/e, respectively). Two ions (dashed lfloor ) were observed with intensities less than five times background (y3 = 446.4 Da/e and y8 = 1035.4 Da/e). The predicted y3 and y5-9 ions were 446.3, 658.4, 759.4, 872.5, 1035.6, and 1122.6 Da/e, respectively. The MH+ for this peptide is 1387.6 Da/e (predicted MH+ = 1387.6 Da/e).
[View Larger Version of this Image (17K GIF file)]


These methods confirmed the identities of all 32 of the known 40 S ribosomal proteins, allowing mass assignments to be made. This analysis is summarized in Table I, indicating the differences in observed versus predicted mass for each protein (third and fourth columns, respectively). The residues in each protein that were identified by peptide mapping (Table I, sixth column) include those that were recognized by mass alone (Roman type in the sixth column) and those for which high orifice voltage CID sequencing was obtained (italic type in the sixth column). The percentage of total residues accounted for by the identified peptides (``coverage'') ranged from 16 to 97%, with the majority >50%, and the number of peptides observed for each protein ranged from 2 to 15 (seventh and eighth columns). Peptide mapping confirmed the identities of 11 proteins with observed masses similar to predicted masses (see below for the twelfth protein, S27a); N-terminal peptide masses were observed for S7, S8, S13, S15a, S19, S29, and S30, confirming the presence or absence of the initiator Met.

Covalent Modifications of 40 S Ribosomal Proteins

Of the remaining 29 protein masses, 24 corresponded to known ribosomal proteins, deduced by peptide mapping and sequencing (Table I). Our following predictions about the modifications of these proteins are based on the comparison of observed mass differences with the masses of known covalent amino acid modifications (27). In most cases, modifications could be localized within the protein amino acid sequence by identification and partial sequencing of modified peptide(s). Mass differences were accountable by known covalent modifications in 13 cases (Sa, S3a, S5, S11, S15, S18, S20, S21, S23, S24, S26, S27, and S28), leaving seven in which mass differences were clearly apparent, but corresponded to no known protein covalent modifications (S2, S3, S9, S10, S12, S14, and S25). In each of four cases (S3, S5, S7, and S24), two protein forms were observed by LC/MS.

Mass Differences Consistent with Known Covalent Modifications

S3a and S26 showed mass differences (-130 ± 1 Da) consistent with demethionylation (-131 Da). Most likely, demethionylation would occur at the N terminus, although the N-terminal peptides (MAVGK, S3a; MTK, S26) were not observed due to their small size and hydrophilicity. S21, S24, and S28 showed mass differences consistent with acetylation (+42 Da). Partial sequencing of the N-terminal peptide from S21 indicated that an acetyl group on this protein was located within the first four residues, in agreement with a previous report (28) of a blocked N terminus (MQNDAGEFVDLYVPRK: predicted MH22++(42/2) = 963.1 Da/e, observed = 962.8 Da/e; confirmed by fragment ions corresponding to b4+42, b5+42, and b7+42 with observed m/z = 531.2, 602.2, and 788.3 Da/e, respectively). Partial sequencing of the S24 N-terminal peptide indicated that the +42-Da modification was located within the first four residues (MNDTVTIRTRK: predicted MH1++42/MH22++(42/2) = 1376.7/688.9 Da/e, observed = 1376.8/689.2 Da/e; observed fragment ions: b4+42, b5+42, and y8 = 504.0, 603.0, and 975.4 Da/e, respectively).

Interestingly, S24 coeluted with a protein of mass 311 Da smaller than the predicted S24 mass or 354 Da smaller than the observed S24 mass. This protein could be a variant of acetylated S24, accounted for by proteolytic removal of the three C-terminal residues on S24 (PKE, predicted = 354 Da). The ratio of intensities of the unproteolyzed versus the putative proteolyzed form was 1.4:1 (data not shown), indicating that the proteolyzed form is minor. The ratio was not significantly altered by protease inhibitors added during ribosome preparation, suggesting that the modified form was not produced during purification.

S28 also differed from its predicted mass by +42 Da. However, the N-terminal peptide from S28, identified by high orifice voltage CID, showed a mass difference of +58 Da rather than +42 Da; the +58-Da modification was located within the first six residues (MDTSRVQPIK: predicted MH1++58/MH22++(58/2) = 1232.6/616.8 Da/e, observed = 1232.6/617.0 Da/e; observed fragment ions: b6+58, b7+58, and y5 = 749.8, 876.2, and 585.0 Da/e, respectively). Because the intact protein differed by +42 Da from the predicted mass, we ascribe the additional 16 Da to oxidation of the acetylated initiator Met during peptide preparation.

Finally, S23 was observed with a mass 16 Da greater than predicted. A peptide observed with this mass difference corresponded to residues 61-68 of S23 (QPNSAIRK: predicted MH1++16 = 929.5 Da/e, observed = 929.6 Da/e). A mass difference of +16 Da is consistent with oxidation, hydroxylation, or methylation. Based on the peptide sequence, hydroxylation (of Lys or Pro) or methylation (of Lys) is more likely because oxidation generally occurs on Met. It is also possible that the observed mass difference might result from an amino acid substitution of Cys for Ser, Glu for Ile, Ile for Pro, Leu for Pro, or Ser for Ala.

Although S7 was among those proteins observed with exact predicted mass, a modified form that was 42 Da larger eluted 0.6 min later than the unmodified form. Peptide mapping of a mixture containing both proteins yielded two forms of the N terminus differing by 42 Da, demonstrating that S7 exists in acetylated and unacetylated forms (MFSSSAK: acetylated form, predicted MH1++42 = 799.3 Da/e, observed = 799.4 Da/e; unacetylated form, predicted MH1+ = 757.3 Da/e, observed = 757.4 Da/e). The ratio of intensities of the acetylated versus unacetylated forms was 17.7:1 (data not shown), indicating that the acetylated form predominates in 40 S subunits.

Six proteins (Sa, S11, S15, S18, S20, and S27) showed mass differences consistent with combined demethionylation and acetylation (-90 Da). In four cases (Sa, S11, S15, and S18), N-terminal demethionylation and acetylation were confirmed by identification of correspondingly modified N-terminal peptides. Partial sequencing by high orifice voltage CID indicated that the modifications occurred within the first six residues of Sa, the first four residues of S11, the first four residues of S15, and the first five residues of S18 (for Sa, MSGGLDVLQMK: predicted MH1+-90 = 1089.4 Da/e, observed = 1089.6 Da/e; observed fragment ions: b6-90, b7-90, and b8-90 = 472.2, 571.2, and 684.8 Da/e, respectively; for S11, MADIQTERAYQK: predicted MH1+-90/MH22+-(90/2) = 1363.7/682.4 Da/e, observed = 1364.6/683.0 Da/e; observed fragment ions: b4-90, b8-90, b9-90, and b10-90 = 342.4, 856.6, 927.8, and 1090.4 Da/e, respectively; for S15, MAEVEQK: predicted MH1+-90 = 744.4 Da/e, observed = 745.4 Da/e; observed fragment ions: b4-90, b5-90, b6-90, and b7-90 = 342.2, 471.2, 600.2, and 728.6 Da/e, respectively; for S18, MSLVIPEK: predicted MH1+-90 = 826.5 Da/e, observed = 827.6 Da/e; observed fragment ions: b5-90 and b6-90 = 455.2 and 551.8 Da/e, respectively). Removal of the initiator Met and acetylation of the succeeding Ala residue of S15 have been previously documented by Edman degradation and mass spectrometry of the modified N terminus (29). In the case of S20, no modified peptide was observed; most likely, the modification occurs at the N-terminal peptide (MAFK), which would not bind to the Vydac C18 resin. In the case of S27, the N-terminal peptide was found to be demethionylated (-131 Da), but not acetylated (MPLARDLLHPSLEEEK: predicted MH22+-(131/2) = 874.6 Da/e, observed = 874.4 Da/e). The N-terminal peptide was identified by high orifice voltage CID with observed b6-9 ions of 553.4, 667.0, 779.4, and 916.4 Da/e that were 131 Da smaller than the corresponding predicted ions of 684.4, 797.4, 910.5, and 1047.6 Da/e. An incomplete proteolytic product, peptide 1/2, also supported this identification (MPLARDLLHPSLEEEKK: predicted MH22+-(131/2) = 938.7 Da/e, observed = 938.8 Da/e). This left a mass difference of +42 Da unaccounted for, indicating that an acetylated residue may be present elsewhere on the protein.

S5 also displays N-terminal demethionylation and acetylation, as its N-terminal peptide was 90 Da smaller than predicted (MTEWETATPAVAETPDIK: predicted MH22+-(90/2) = 951.1 Da/e, observed = 951.2 Da/e). In this case, peptide sequencing was carried out by LC/MS/MS, and the modifications were located within the first three residues of S5 (Fig. 5). However, the total mass observed still differed from that predicted by +29 Da after accounting for the -90-Da N-terminal modification; therefore, an additional residue modified by +29 Da is expected. This mass difference would be consistent with formylation of a free amine, such as Lys, or might result from amino acid substitution of Lys or Gln for Val. Although an internal peptide with a +29-Da mass difference was not found, this model is supported by the detection of a second form of S5 with a mass 70 Da greater than predicted. The N-terminal peptide from the second form is 42 Da greater than predicted, indicating that it contains the initiator Met and is acetylated (predicted MH22++(42/2) = 1017.1 Da/e, observed = 1016.6 Da/e). Thus, two acetylated forms of S5 are present, with and without the initiator Met (ratio of intensities = 1:4.2; data not shown), both of which contain a residue additionally modified by +29 Da.


Fig. 5. LC/MS/MS sequencing of the N-terminal peptide from S5. The S5 peptide digest was applied to a Vydac C18 capillary column, and a single ion was selected on-line MS/MS by collisional activation with argon. The parent ion was 951.6 Da/e (MH22+ of the N-terminal peptide). The sequence of this peptide is shown with cleavages that generate the observed b ions (rceil ) and y ions (lfloor ), which are labeled in the spectrum. Dashed lfloor ions were observed with intensities less than five times background (not labeled). The dehydrated ions are indicated as singly or doubly charged ions, differing by -18 or -9, respectively. The observed b3-8 and b11-14 ions were 273.2, 459.4, 588.2, 689.2, 760.2, 861.4, 1128.7, 1199.3, 1328.9, and 1429.7 Da/e, respectively. All were 90 Da smaller (indicated by the asterisks) than the corresponding predicted b ions: 362.1, 548.2, 677.3, 778.3, 849.4, 950.4, 1217.6, 1288.6, 1417.6, and 1518.7 Da/e. The observed y1, y2, y4-8, and y10-14 ions were 146.8, 259.9, 472.3, 573.2, 702.2, 773.5, 872.6, 1040.8, 1141.6, 1212.7, 1313.9, and 1443.4 Da/e, respectively. The corresponding predicted y ions were 147.1, 260.2, 472.3, 573.3, 702.4, 773.4, 872.5, 1040.6, 1141.6, 1212.7, 1313.7, and 1442.7 Da/e.
[View Larger Version of this Image (27K GIF file)]


Mass Differences Unaccounted for by Known Covalent Modifications

Seven ribosomal proteins showed differences between observed versus predicted masses that were unaccounted for by known covalent modifications, raising the possibility that ribosomal proteins contain previously undescribed post-translational modifications. For example, S2 was observed with a mass 220 Da greater than predicted. Peptide mapping revealed N-terminal peptides 1 (Fig. 3A, asterisk) and 1/2 (arrowhead), which were each 220 Da greater than predicted, indicating that the modification is located near the N terminus (peptide 1, residues 1-54: predicted MH44++(220/4)/MH55++(220/5) = 1296.9/1037.7 Da/e, observed = 1296.8/1037.6 Da/e; peptide 1/2, residues 1-58: predicted MH44++(220/4)/MH55++(220/5)/MH66++(220/6) = 1407.7/1126.4/938.8 Da/e, observed = 1407.6/1126.2/938.8 Da/e).

S10 and S12 were observed with mass differences of +57 and -100 Da, respectively. Their N-terminal peptides were observed with masses that agreed with those predicted (for S10, residues 1-5 (MLMPK) and residues 1-6 (MLMPKK): predicted MH1+ = 619.3 and 747.4 Da/e, respectively, and observed = 619.2 and 747.4 Da/e, respectively; for S12, MAEEGIAAGGVMDVNTALQEVLK: predicted MH22+/MH33+ = 1174.4/783.2 Da/e, observed = 1174.6/783.8 Da/e). This indicates that the sites of modification are located elsewhere within these proteins. For S10, a C-terminal peptide with an observed mass 56 ± 1 Da greater than predicted was found (KAEAGAGSATEFQFRGGFGRGRGQPPQ: predicted MH22++(56/2)/MH33++(56/3) = 1411.5/941.3 Da/e, observed = 1411.6/941.2 Da/e). This C-terminal sequence contains RGGF and GR repeats, also seen within the modified N-terminal sequence of S2 (Fig. 3B). An internal modified peptide from S12 was not identifiable by peptide mapping.

S3, S9, S14, and S25 were observed with masses that differed from those predicted by -75, +86, -117, and -103 Da, respectively. In these cases, neither N-terminal nor modified internal peptides could be identified by peptide mapping, revealing little information as to whether these represent individual modifications, combinations of more than one modification, or allelic variations in sequence. S3 coeluted with a protein 362 Da smaller than the predicted and 287 Da smaller than the observed S3 mass. Peptide mapping of this mixture revealed a peptide consistent with the C terminus of S3 with a mass 288.0 ± 1 Da smaller than predicted (GGKPEPPAMPQPVPTA: predicted MH1+-288 = 1286.8 Da/e, observed = 1286.8 Da/e). Thus, the coeluting protein appears to be a form of S3 with an additional modification of -288 Da. This mass change is not easily explained by C-terminal proteolysis, suggesting that the modification may include amino acid residue changes. The ratio of intensities of the larger versus smaller forms of S3 was 2.4:1 (data not shown).

Identification of S27a by LC/MS/MS Sequencing of an Intact Protein

Although peptide mapping and sequencing information allowed us to unequivocally identify 31 of the 32 known rat 40 S ribosomal proteins, the situation was complicated for S27a because peptide mapping revealed no CID data and low coverage (16%). Four peptide ions from endoproteinase Lys-C digestion were observed that agreed with predicted masses (residues 7-13 (KSYTTPK) and residues 8-14 (SYTTPKK): predicted MH1+ = 696.3 Da/e, observed = 696.2 Da/e; residues 8-13 (SYTTPK): predicted MH1+ = 543.4 Da/e, observed = 543.4 Da/e; residues 24-28 (LAVLK): predicted MH1+ = 824.4 Da/e, observed = 824.4 Da/e). However, these data were insufficient to unequivocally identify the protein as S27a. The observed mass and elution time of this protein (9403 Da, 19.1 min) were similar to those of S27 (9477 Da, 20.3 min), suggesting that the protein might be a modified form of S27. An HPLC fraction containing the candidate protein with a mass of 9403 Da was subjected to LC/MS/MS, selecting the MH1515+ (672.6 Da/e) ion for fragmentation. Fragmentation resulted in partial CID at the C terminus, TYCFNKPEDK (Fig. 6). The partial sequencing was sufficient to identify the 9403-Da protein as S27a.


Fig. 6. Sequencing of a full-length ribosomal protein by LC/MS/MS. S27a was subjected to CID by LC/MS/MS, choosing the MH1515+ (672.6 Da/e) ion for fragmentation. The C-terminal sequence of S27a is shown with observed y ions (lfloor ), which are also labeled in the spectrum. Dashed lfloor ions were observed with intensities less than five times background. The y ions in S27a were y2, y72+, and y4, corresponding to observed ions at 262.1, 439.0, and 488.0 Da/e and predicted ions at 262.1, 439.2, and 488.2 Da/e, respectively. Amino acid residues indicated in brackets indicate observed immonium ions, which are consistent with the amino acid composition of this peptide.
[View Larger Version of this Image (21K GIF file)]


Unidentified Proteins in 40 S Ribosomal Subunits

Five of the 41 proteins observed in our LC/MS analysis could not be identified as any of the known 40 S ribosomal proteins. Based on their association with 40 S subunits in each of three independent preparations, we named them Sb, Sc, Sd, Se, and Sf, with masses of 12,084, 13,706, 13,741, 13,884, and 34,987 Da, respectively. Total ion intensities of these proteins were measured by integration of deconvoluted mass spectra (as in Fig. 2A) and were compared with those of nearby proteins. The intensity of Sb was 40-100% of S20; Sd and Se 30-70% of S12; and Sc and Sf 5-50% of Sa. Sc and Sf eluted late from HPLC, suggesting that they are hydrophobic; the lower intensities may be due to less efficient ionization of hydrophobic proteins or to lower recovery from HPLC. The direct comparison of summed scans of Sa, Sc, and Sf showed that, despite the lower total ion intensities, individual ions from Sc were actually more intense than those from Sa and Sf (Fig. 7A). In four of the five cases (Sb, Sd, Se, and Sf), no information beyond total mass was obtained. Further studies are needed to determine if these represent modified forms of the known ribosomal proteins.


Fig. 7. Mass spectrum and LC/MS/MS sequencing of the unknown ribosomal protein Sc. A, the mass spectrum of Sc is shown along with the mass spectra of the coeluting proteins Sa and Sf. The ion series of the unknown Sc and the coeluting Sa are indicated in the ion spectrum. The ions observed for Sc were MH1212+ to MH1616+ (1143.1, 1055.4, 980.0, 914.7, and 857.5 Da/e, respectively), and those for Sa were MH1919+ to MH2929+ (1724.2, 1637.8, 1559.8, 1488.9, 1424.2, 1364.9, 1310.4, 1260.1, 1213.6, 1170.2, and 1129.3 Da/e, respectively). Ions from Sf were weaker and not labeled (MH2121+ to MH3232+: 1667.2, 1591.1, 1522.0, 1458.6, 1400.4, 1346.7, 1296.7, 1250.8, 1207.5, 1167.2, 1129.4, and 1094.5 Da/e, respectively). B, an HPLC fraction containing Sc and Sa was subjected to LC/MS/MS, choosing the MH1414+ (980.0 Da/e) of Sc for fragmentation. Observed ions are discussed under ``Results.''
[View Larger Version of this Image (31K GIF file)]


Sufficient sequence information was obtained for Sc (13,706 Da) to confirm that it is not a modified form of any known ribosomal protein. LC/MS/MS was performed on an HPLC fraction containing a mixture of Sa and Sc, selecting the Sc MH1414+ (980.0 Da/e) ion for fragmentation. The ion series of Sc differed sufficiently from that of Sa to exclude any possibility of fragmentation from neighboring ions. From the fragmentation pattern, a sequence tag of six residues (PPGPPP) was identified that was repeated within two ion series. (Observed ions in the first series were 440.0, 537.1, 595.1, 692.2, 789.2, and 886.3 Da/e, and those in the second series were 344.9, 442.1, 539.2, 596.9, 694.1, and 791.2 Da/e (Fig. 7B).) These ion series differed in mass by 2 Da; speculatively, a Val residue, which is 2 Da larger than Pro, might reside near this sequence. The sequence PPGPPP or PPPGPP was absent in any of the known 40 S or 60 S ribosomal proteins.


DISCUSSION

In this study, we present a novel application of mass spectrometry to analyze the protein components of rat 40 S ribosomal subunits from Rat-1 fibroblasts. Sufficient information was provided by LC/MS peptide mapping and LC/MS/MS sequencing to assign 36 of the 41 observed masses to the 32 40 S ribosomal proteins for which sequence information is available. Of the 32 ribosomal proteins, only 12 showed observed masses that matched those predicted from amino acid sequences (S4, S6-S8, S13, S15a, S16, S17, S19, S27a, S29, and S30). Twenty showed observed masses that differed from those predicted, indicating that the majority of 40 S ribosomal proteins are co- or post-translationally modified (Sa, S2, S3, S3a, S5, S9-S12, S14, S15, S18, S20, S21, and S23-S28). Four ribosomal proteins were represented by more than one mass, indicating heterogeneity in processing (S3, S5, S7, and S24). After accounting for 36 observed proteins, five proteins remained unidentified (referred to as Sb, Sc, Sd, Se, and Sf), of which Sc appears to represent a unique ribosomal or ribosome-associated protein.

The differences between predicted versus observed protein masses most likely reflect covalent modifications of amino acids, although variations in amino acid sequence from data base information cannot be excluded. In most cases, reasonable predictions could be made about the chemistry of the modifications. The majority of observed mass differences were accounted for by acetylation and/or removal of the initiator Met. These reactions are catalyzed co-translationally by Met aminopeptidases and Nalpha -acetyltransferases (30, 31, 32). Four proteins (S7, S21, S24, and S28) showed mass differences consistent with N-terminal acetylation in the presence of the initiator Met. S24 and S28 have N-terminal sequences (MNDT and MDTS, respectively) with Asn and Asp as penultimate residues, which are among the residues most susceptible to acetylation at the initiator Met (Asn, Asp, and Glu) (33, 34). S7 and S21 have N-terminal sequences (MFSS and MQND, respectively) that are not typically acetylated (34); however, Boissel et al. (34) have demonstrated that acetylation sometimes occurs at the initiator Met with penultimate Phe and Gln residues. Nineteen 40 S ribosomal proteins were demethionylated at the first position. Eleven of these confirmed previous studies by Edman degradation (S4, S8, S9, S13, S15, S15a, S16, S17, S19, S23, and S29); our results suggest eight more examples (Sa, S3a, S5, S11, S18, S20, S26, and S27). Six of the demethionylated proteins showed masses consistent with further N-terminal acetylation (Sa, S5, S11, S15, S18, and S20). After removal of the initiator Met, the N-terminal residues are Ala or Ser in Sa, S11, S15, S18, and S20 and Thr in S5. These results agree with previous studies showing that proteins with Gly, Ala, Ser, or Thr as the penultimate residue are usually demethionylated and acetylated (33, 34) and that the majority of acetylated proteins are modified on either Ser or Ala (35). Removal of the initiator Met often destabilizes proteins, enhancing their susceptibility to degradation by the ubiquitin pathway (36). Acetylation in the presence or absence of Met prolongs the half-lives of proteins, preventing their degradation (30, 36).

Other proteins showed mass differences that could be accounted for by (i) proteolyzed forms of S3 (-288 Da) and S24 (-354 Da), (ii) internal acetylation of S27 (+42 Da), (iii) formylation of S5 (+29 Da), and (iv) methylation or hydroxylation of S23 (+16 Da). The modifications of the remaining proteins (S2, S3, S9, S10, S12, S14, and S25) are likely to be more complex. For example, ribosomal proteins S2 and S10 showed mass differences of +220 and +56 Da, respectively. These mass differences correspond to no known naturally occurring modifications, although they might reflect combinations of several modifications, with or without amino acid substitutions. The +220-Da modification of S2 was located within the first 54 residues. Masses of known modifications similar but not identical to this value include myristoylation (+210 Da) or biotinylation (+226 Da). It is possible that the mass difference is actually 178 Da if the N terminus is acetylated, 351 Da if it is demethionylated, or 309 Da if it is demethionylated and acetylated. Known modifications with masses similar to these estimates are glucuronylation (176 Da), 4-oxyglycosylation (177 Da), and N-glycolneuraminylation (307 Da). In S10, the +56-Da modification was located within the C-terminal peptide. This mass difference does not correlate to known protein modifications, but could be accounted for by an extra Gly (+57 Da) incorporated in the protein sequence or an amino acid substitution of Ile or Leu for Gly. Further studies are needed to identify the chemistry of these modifications.

Both S2 and S10 are modified within sequences that contain RGGF and GR repeats, which are found within several nucleolar proteins (37, 38, 39, 40). Wool et al. (4) previously noted these repeats within S2, suggesting that they may direct nucleolar localization of S2 or mediate S2 binding to nascent rRNA. It is tempting to speculate that the modifications of S2 and S10 somehow influence S2 and S10 translocation or rRNA binding, thus regulating 40 S ribosomal protein assembly. It is also possible that the modification of S2 regulates binding or translation of mRNA, given that S2 has RNA binding potential and is likely to be localized near the ribosome active site, based on the homology between eukaryotic S2 and Escherichia coli S5 (2) and the immune electron microscopy of rat 40 S subunits (41).

Five proteins (Sb, Sc, Sd, Se, and Sf) were observed with masses that did not correspond to any known ribosomal protein. S1 is the only 40 S ribosomal protein for which no published cDNA sequence is available. Sf (34,987 Da) is most likely identical to S1, based on estimates of S1 mass by two-dimensional SDS-polyacrylamide gel electrophoresis (42). LC/MS/MS sequencing of Sc (13,706 Da) revealed a sequence tag of six residues: PPGPPP or PPPGPP. This sequence is absent in all of the known ribosomal proteins. Sc also appears to have greater hydrophobicity than the other ribosomal proteins based on elution time and mass. Hydrophobic proteins are known to ionize inefficiently, which might explain the low intensity observed for Sc (and Sf). Nevertheless, ions from Sc (Fig. 7A) showed intensities exceeding those of the coeluting protein Sa, indicating that Sc is relatively abundant. The reproducible appearance of Sc in three separate preparations of 40 S ribosomal proteins raises the possibility that Sc might represent a previously unidentified protein associated with 40 S ribosomal subunits.

The direct coupling of HPLC with mass spectrometry simplified the analyses of ribosomal complexes by eliminating the purification of individual proteins. This allowed us t