JBC Oz Biosciences

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wassenhove-McCarthy, D. J.
Right arrow Articles by McCarthy, K. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wassenhove-McCarthy, D. J.
Right arrow Articles by McCarthy, K. J.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

J Biol Chem, Vol. 274, Issue 35, 25004-25017, August 27, 1999


Molecular Characterization of a Novel Basement Membrane-associated Proteoglycan, Leprecan*

Deborah J. Wassenhove-McCarthyDagger and Kevin J. McCarthyDagger §

From the Departments of Dagger  Pathology and § Cell Biology and Anatomy, School of Medicine, Louisiana State University Medical Center, Shreveport, Louisiana 71130

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

A monoclonal antibody was used in early studies to identify a novel chondroitin sulfate proteoglycan, secreted by L-2 cells, the core protein of which was approximately 100 kDa. To characterize this proteoglycan core protein at the molecular level, an L-2 cell cDNA library was probed by expression screening and solution hybridization. Northern blot analysis assigned transcript size to approximately 3.1 kilobases and, after contig assembly, the coding region of the mRNA corresponded to 2.18 kilobases. Immunoassays were performed to confirm the identity of this sequence, using a polyclonal antibody raised against an expressed fusion protein encoded by sequence representing the carboxyl half of the molecule. The antibody recognized the core protein in Western blots after prior digestion of the intact proteoglycan with chondroitinase ABC. Immunostaining tissue sections with the same antibody localized the proteoglycan to basement membranes, and expression of the entire sequence in Chinese hamster ovary K-1 cells showed that the protein encoded by the sequence secreted as a chondroitin sulfate proteoglycan. The core protein not only has motifs permitting glycosylation as a proteoglycan, but also possesses the endoplasmic reticulum retrieval signal, KDEL, which suggests that, in addition to its role as a basement membrane component, it may also participate in the secretory pathway of cells.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

The focus of our laboratory has been the study of the cell and molecular biology of one group of basement membrane molecules, basement membrane proteoglycans. These molecules have been traditionally divided into two families based on the glycosaminoglycan chains they bear; i.e. heparan sulfate proteoglycans (HSPG)1 or chondroitin sulfate proteoglycans (CSPG). The most well recognized and characterized member of the HSPG group is perlecan, originally isolated from the Engelbreth-Holm-Swarm tumor (1, 2). Other proteoglycan core proteins bearing heparan sulfate (HS) chains have been identified and characterized, using both biochemical and immunological approaches, as potentially being gene products unique from perlecan (3, 4). However, with the exceptions of perlecan (5-7) and, most recently, agrin (8), none have been characterized at the molecular level.

The cell and molecular biology of CSPG present in basement membranes has not been as thoroughly investigated as their HSPG counterparts. Previous work by ourselves (9, 10) and from the laboratories of other investigators (11-13) demonstrated the possibility of the existence of several unique gene products for which core proteins bear chondroitin (CS)/dermatan sulfate (DS) glycosaminoglycan (GAG) chains. Little was known about the core protein structure of this species until the molecular characterization of one member of this group, bamacan, by Wu and Couchman (14). However, although the bamacan core protein is presently thought to be a "full time" CSPG, the core protein of perlecan also qualifies as inclusion into the basement membrane CSPG family, since it has been shown to be secreted in certain tissues and cell lines in culture as a proteoglycan core protein with CS/DS chains (15-17).

However, there is some evidence that highlight the fact that basement membranes might, in fact, contain other CSPG proteoglycans for which core proteins might differ from those reported above. In earlier reports, using core protein-specific monoclonal antibodies to BM-CSPG/bamacan, we showed that BM-CSPG/bamacan was a component of the mesangial matrix and not present in the walls of capillaries within the glomerulus (10, 18). However, two studies using monoclonal antibodies directed against chondroitin carbohydrate-specific epitopes have shown specific immunostaining the wall of the glomerular capillary for the presence of CS chains (13, 19). Moreover, a recent report by Schittny et al. (20) demonstrated the presence of a heterodimeric proteoglycan bearing DS chains in some but not all basement membranes. These findings imply the possibility that CS/DS-bearing proteoglycans, other than BM-CSPG/bamacan, might also exist in basement membranes, or at the very least, within the glomerular capillary basement membrane. The present report fulfills in part this premise, describing the molecular characterization of a novel proteoglycan, leprecan (leucine proline-enriched proteoglycan), the protein core of which bears CS chains and which can be immunolocalized to basement membranes.

    EXPERIMENTAL PROCEDURES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Methods

Isolation of Proteoglycans-- L-2 cells, a rat parietal yolk sac tumor cell line (a generous gift from Dr. Ulla Wewer, Institute of Pathology and Anatomy, University of Copenhagen, Copenhagen, Denmark), were grown to confluence in roller bottle culture in Dulbecco's modified Eagle's medium (Mediatech) supplemented with 4% fetal bovine serum (HyClone Laboratories, Logan, UT), 4 mM glutamine, and penicillin/streptomycin. The culture medium was changed weekly and the cultures maintained in 10% CO2 at 37 °C. The total proteoglycan pool from the conditioned culture medium was isolated using a combination of ion exchange chromatography and CsCl density gradient ultracentrifugation as described previously (10). Briefly, 2 liters of conditioned medium were concentrated under positive pressure at 4 °C in an ultrafiltration cell (100,000 Mr cut-off, Amicon, Beverly, MA) to a final volume of 250 ml. Guanidine hydrochloride (Sigma) was added to a final concentration of 4 M, and solid CsCl (Fisher) was added to give a final starting density of 1.3 g/ml. 4% CHAPS (Sigma), 10 mM Na2EDTA, 1 mM dithioerythritol (DTE), and the protease inhibitors 0.2 mM PMSF and 5 mM benzamidine were added to the solution to result in the final concentrations listed above. The solution was aliquoted into individual ultracentrifuge tubes and subjected to ultracentrifugation at 40,000 rpm, for 40 h at 10 °C using a 50.2TI rotor (Beckman Instruments) in a RC-80 ultracentrifuge (Sorvall). Subsequently, each tube was divided into five fractions, the density of each fraction measured, and fractions of approximately the same density pooled. SDS-PAGE analysis of the pooled fractions indicated that the two most dense fractions 4 (1.35 g/ml, low buoyant density proteoglycan) and 5 (1.42 g/ml, HD-PG) contained proteoglycans. These were subsequently dialyzed into 8 M urea, 0.15 M NaCl, 10 mM Na2 EDTA, 1 mM DTE, 50 mM Tris-HCl (pH 8.0) with the above protease inhibitors. Afterward, each fraction was further purified by ion exchange chromatography, initially loading each at a rate of 1 ml/min onto a 2.5 × 10-cm column of DEAE Trisacryl (IBF Biotechnics, Villeneuve-la-Garrenne, France) equilibrated in 8 M urea, 0.15 M NaCl, 0.5% Tween 20, 10 mM Na2EDTA, 1 mM DTE, 50 mM Tris-HCl (pH 8.0) with protease inhibitors (as above). After loading, the column was rinsed with 250 ml of the same buffer, then 250 ml of 8 M urea, 0.15 M NaCl, 10 mM Na2EDTA, 0.5% Tween 20, 1 mM DTE, 50 mM sodium acetate (pH 4.2) with protease inhibitors. Afterward, the column was eluted with a NaCl gradient (0.15-2.0 M in the same base buffer as the latter column rinse buffer) monitored by an in-line conductivity meter, and the eluent collected in 2-ml fractions. The elution of proteoglycans in the gradient was confirmed by SDS-PAGE. A subsequent Western blot immunoassay using a polyclonal antibody, R44, which recognizes the carbohydrate "stubs" remaining on proteoglycan core proteins after prior digestion with chondroitinase ABC (EC 4.2.2.4, Seikagaku America) (10), was used to confirm the presence of CSPGs within the respective fractions.

Characterization of Proteoglycans-- In order to identify HSPGs and CSPGs in the pool, 50-µl samples of the HD-PG pool were digested overnight either with heparitinase/heparinase (EC 4.2.2.8 and 4.2.2.7, respectively) in 100 mM sodium acetate (pH 7.0), 1 mM calcium acetate, 10 units of leupeptin, 1 mM benzamidine, or with chondroitinase ABC in 50 mM Tris, 30 mM sodium acetate (pH 8.0), 20 mM EDTA, 10 mM n-ethylmaleimide, 0.2 mM PMSF. Afterward, both untreated samples and digested samples were characterized by SDS-PAGE using a 4-15% resolving gel. The separated proteoglycans were subsequently visualized by silver staining after electrophoresis.

Monoclonal and Polyclonal Antibody Production-- An aliquot of the HD-PG pool was used as an antigen for the production of monoclonal antibodies as described previously (10). Tissue culture supernatants from the resultant hybridomas were screened by Western blot immunoassay with the HD-PG pool. Those clones for which supernatants were positive in the screening procedure were further subcloned by limiting dilution, expanded to high density, and injected into pristane-primed BALB/c mice for the production of ascites fluid. The determination of mAb subclass was done using a commercially available isotyping kit (Amersham Pharmacia Biotech). The monoclonal antibody, Mo27, was subsequently characterized and used during the course of this study. The identity of the epitope recognized by Mo27 on proteoglycan core proteins present in the HD-PG preparation is not certain, but the data (see below) indicate that it is shared by three of the core proteins present within the preparation (180, 130, and 100 kDa).

For the production of polyclonal antibodies against the 100-kDa proteoglycan core protein, an aliquot of the HD-PG pool was fractionated using gel filtration chromatography (CL-4B, 0.5 × 120 cm) in denaturing conditions (50 mM Tris, pH 8.0, 0.35 M NaCl, 0.1% SDS, 0.02% sodium azide, 0.2 mM PMSF, 5 mM NEM, 10 mM EDTA), the elution of the target proteoglycan from the column identified in the collected fractions using the monoclonal antibody, Mo27, in a Western blot assay (data not shown). A polyclonal antiserum (Rb4127) was raised against the isolated native proteoglycan in rabbits using biweekly immunization protocol. 500 µg of antigen suspended in an equal volume of Freund's complete adjuvant (Sigma) was used in the initial immunization, followed by a second immunization after 2 weeks using a comparable amount of antigen suspended in Freund's incomplete adjuvant (Sigma). The resultant polyclonal antibody was a polyvalent antibody, recognizing the 180-, 130-, and 100-kDa proteoglycan core proteins present in the HD-PG preparation.

A polyclonal antibody (Rb2096) directed against a fusion protein (see below) representing the carboxyl half of the molecule was developed using the same immunization protocol.

cDNA Library Screening-- A cDNA library developed from the mRNA of rat parietal endoderm cells (oligo(dT)- and random-primed) in the Uni-ZAP XR vector was purchased from Stratagene (La Jolla, CA) and screened as an expression library according to the developer's instructions using the polyclonal antiserum Rb4127. Clones found positive after a third round of preliminary screening were subcloned into appropriate host bacteria. To identify putative clones for further development, all positive clones from the third round of screening were initially sequenced (21) using both a -20 universal primer and a forward primer (5'-ATTAACCTCACTAAAG-3') with BST polymerase (Bio-Rad). Extension of sequence information was managed using a primer walking strategy. The obtained sequence information from all clones was compared for homology to a data base constructed from known proteoglycan sequences using the data base search and analysis routine in MacVector (Oxford Molecular Group, Campbell, CA). Contigs from DNA sequencing were aligned using AssemblyAlign (Oxford Molecular Group).

To obtain a full-length cDNA, an aliquot of the L-2 cDNA library was amplified and mass excised according to the manufacturer's (Stratagene) directions. This DNA was subsequently used in a solution hybridization screening strategy (Genetrapper cDNA Positive Selection SystemTM, Life Technologies, Inc.) according to the developer's instructions. The capture oligonucleotide (5'-TCCTGGCACTCGTCATCA-3') used in this procedure was obtained from Oligos Etc, Inc. (Wilsonville, OR). After completion of the cDNA capture protocol, the captured single-stranded DNA was converted to double-stranded DNA using a different oligonucleotide (5'-TCCTTGATACACACGAGGGATTCG-3') as a repair primer, according to the developer's instructions. The repaired double-stranded DNA were transformed by electroporation into ElectroMAX DH10B cells (Life Technologies, Inc.) using a Gene Pulser apparatus (Bio-Rad) under the following conditions: 0.8 kV in 0.1-cm gap chamber at settings of 200 ohms and 25 microfarads. The cells were then plated on LB/ampicillin plates (100 µg/ml) and incubated overnight at 37 °C. The resultant plates contained four dominant colonies that were selected and grown overnight at 37 °C in 5 ml of Terrific Broth supplemented with 100 µg/ml ampicillin. Plasmids were purified using a Wizard Miniprep DNA system (Promega), and the identity of the isolated clones was confirmed by DNA sequencing. One clone, SJ3, was found to encode the full length of the leprecan molecule.

Northern Blotting

Probe Production-- An 850-base pair fragment of clone P2 was excised from P2 cDNA by digestion with BamHI. For probe development, 250 ng of the isolated probe DNA was biotinylated using a NEBlot phototope kit according to the developer's directions (New England Biolabs Inc., Beverly, MA). The specificity and sensitivity of the probe was tested by dot blot assay prior to use on in the Northern blotting protocol.

Isolation of RNA and Northern Blotting-- Total RNA was isolated from L2 cells using TRIzolTM LS reagent (Life Technologies, Inc.). Total RNA (18.75 µg/lane) was electrophoresed at 5 V/cm on a 1.0% agarose, 2.2 M formaldehyde gel. Afterward, the gel was rinsed five times with diethylpyrocarbonate-treated water and the RNA was transferred to a nylon membrane (NytranTM, Schleicher & Schuell) following the manufacturer's protocol for neutral transfer using a downward transfer system (TurboblotterTM, Schleicher & Schuell) overnight in 20× SSC. The next day, membrane was rinsed in 5× SSC for 20 min and the lanes cut into individual strips and cross-linked with UV irradiation (Gene Linker, Bio-Rad) for total delivery of 125 mJ of energy. One strip was stained with 0.02% methylene blue, 0.3 M sodium acetate, pH 5.5 to visualize the 28 and 18 S rRNA bands. The adjacent lane was incubated in prehybridization solution (6× SSC, 5× Denhardt's reagent, 0.5% SDS, 100 µg/ml salmon sperm DNA) at 65 °C for 1.0 h. Afterward, the biotinylated P2 probe was added to the prehybridization solution and allowed to hybridize overnight at 65 °C. Afterward, the membrane was rinsed as follows: 2× SSC, 0.1% SDS at room temperature twice for 5 min each; 0.2× SSC, 0.1% SDS at 25 °C twice for 10 min; 0.2× SSC, 0.1% SDS at 58 °C twice for 15 min; 2× SSC at 25 °C for 10 min. Detection of the biotinylated probe was done using a NEBlot photodetection kit (New England Biolabs, Inc., Beverly, MA) following the manufacturer's protocol with minor modifications; the blocking and rinsing times were increased to 1.0 h and 30 min, respectively. The membrane was exposed to BioMax MR film (Eastman Kodak Corp., Rochester, NY) and subsequently developed in an automatic film processor. The experiment was repeated using duplicate strips from the same blot.

Expression of Bacterial Fusion Proteins

The sequence encoded by clone P2, which represents the carboxyl half of the core protein, was expressed as a fusion protein by subcloning a KpnI and SacI restriction fragment of the sequence into a commercially available fusion protein expression system (QiaexpressTM, Qiagen) that incorporates a 6×His tag 5' to the polylinker region allowing affinity purification of the fusion protein product on a nickel-nitrilotriacetic acid resin. Competent M15 (pREP4) cells (Qiagen) were transformed with the appropriate vector according to the manufacturer's protocol. Expression of fusion proteins in liquid cultures was induced by isopropyl-1-thio-beta -D-galactopyranoside according to the developer's instructions. Afterward, the cells were harvested, centrifuged, and the cell pellet stored at -70 °C until processing. For purification of fusion proteins, cell pellets were solubilized in 8 M urea, 0.1 M sodium phosphate, 0.01 M Tris-HCl, pH 8.0. Purification of the fusion protein (P2E) was performed using a low pressure chromatography system (Econo-SystemTM; Bio-Rad) to load, rinse, and elute the expressed protein. The protein was loaded at a flow rate of 0.25 ml/min onto a 0.5 × 0.5-cm nickel-nitrilotriacetic acid resin affinity column equilibrated in 8 M urea, 0.1 M sodium phosphate, 0.01 M Tris-HCl, pH 8.0, rinsed with 8 M urea, 0.1 M sodium phosphate, 0.01 M Tris-HCl, pH 5.9, and eluted in 1-ml fractions with 8 M urea, 0.1 M sodium phosphate, 0.01 M Tris-HCl, pH 4.5. The fractions containing the P2E protein, as measured by absorbance at 280 nm, were subsequently characterized by 10-20% SDS-PAGE and Western immunoblot assays using polyclonal antibody Rb4127. The fractions for which proteins were immunoreactive with Rb4127 on immunoblot assay were pooled, dialyzed extensively against PBS, concentrated to a 2 mg/ml concentration, and used to immunize a rabbit to develop a polyclonal antiserum directed against the fusion protein (Rb2096, see above).

Transfection and Expression in Mammalian Cells

Four T-75 flasks were seeded with equal numbers of CHO-K1 cells (American Type Culture Collection, Rockville, MD) in Ham's F-12 medium (Mediatech, Inc., Herndon, VA) supplemented with 10% fetal bovine serum (HyClone Laboratories) incubated with 10% CO2, at 37 °C 24 h prior to transfection. Each flask represented one of the following treatment groups: 1) control, no transfection; 2) sham transfection, i.e. the cells exposed to transfection agent only; 3) inappropriate transfection, i.e. cells transfected with the control plasmid vector pZeoSVLacZ (Invitrogen, San Diego, CA); 4) cells transfected the pZeoSV plasmid bearing the SJ3 cDNA, representing the full-length coding region for the 100-kDa protein. Protocol for transfections followed that provided by the manufacturer (Life Technologies, Inc.) adjusting all volumes and ratios of the components in accordance to the relative surface area of the flasks. Conditioned culture medium from each flask was harvested 24 h and again at 48 h following the start of the transfection, the medium placed in 50-ml tubes and clarified by centrifugation at 3000 rpm for 10 min, and the cleared supernatants aspirated to new tubes. For morphology studies, a similar transfection protocol was used for CHO-K1 cells seeded in six-well plates, adjusting all volumes and component ratios to the relative surface area of the wells.

Immunoprecipitation of CHO-K1 Cell Culture Supernatants

For immunoprecipitation assay, affinity-purified Rb2096 antiserum was biotinylated using the EZ-link sulfo-NHS biotinylation kit (Pierce) according to the manufacturers' protocol. The biotinylated antibody was subsequently tested for sensitivity in dot blot assays and also for specificity via Western immunoblot analysis using the P2E fusion protein.

Cell culture medium (1.75 ml) representing each transfection condition (normal, sham transfection with LipofectAMINE, lacZ transfection, and SJ-3 transfection) was precleared in microcentrifuge tubes with 50 µl of protein A plus resin (Pierce Chemical Co,) with gentle shaking for 2.0 h at room temperature. Afterward, the supernatants were clarified by centrifugation at 2000 × g for 2.0 min, and 1.70 ml of each aspirated and dispensed into fresh tubes. 100 µl of affinity-purified polyclonal antibody Rb2096 (2.2 mg IgG/ml) was added to each tube and the immunoprecipitation carried out overnight at 4 °C with gentle agitation. The next morning 50 µl of protein A resin/buffer slurry was added to each of the tubes and the supernatants incubated an additional 3.5 h with gentle agitation. The supernatants were subsequently clarified by centrifugation at 2000 × g for 2.0 min, the resin pellets removed from the tubes and rinsed with TTBS (20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 0.5% Tween 20) for a total of seven 2-min rinses, followed by a final rinse step with 500 µl of sterile water. The supernatant was aspirated, and 25 µl of chondroitinase buffer and 2 µl of chondroitinase ABC (Seikagaku America) enzyme were added directly to the suspension. The samples were incubated overnight at 37 °C. Subsequently, the resin was rinsed twice with 500 µl of sterile water and the proteins eluted from the resin by incubating at 100 °C with two 25-µl aliquots of DTE mixture for 5 min each. After each incubation, the resin was spun at 2000 × g for 2.0 min and the sample aspirated and placed in a clean tube. The samples were then separated by SDS-PAGE using a 4-15% gradient resolving gel (Bio-Rad) and then electrophoretically transferred onto nitrocellulose (Schleicher & Schuell). Afterward, the blot was rinsed in TBS (20 mM Tris-HCl, pH 7.5, 150 mM NaCl) and then blocked for 1.0 h at room temp. in SuperblockTM blocking buffer (Pierce). Afterward, the blot was rinsed three times for 2 min each in TBS and incubated with biotinylated antibody Rb2096 at 1:1000 dilution in TBS buffer. The blot was then rinsed twice for 5 min each with TTBS, three times for 10 minutes each in immunoblot wash buffer (0.0125% Nonidet P-40, 0.0125% SDS, 0.0312% deoxycholic acid in TBS, pH 7.5), and twice for 5 min each with TTBS. The blot was then incubated for 30 min at room temperature with neutravidin conjugated to horseradish peroxidase (Pierce) at 1:10,000 dilution in TBS. Following incubation, the blot was then rinsed twice for 5 min in TTBS, twice for 10 min in immunoblot wash buffer, twice for 5 min in TTBS, and five times for 2 min each in TBS. The blot was developed via chemiluminescence using the Biomax chemiluminescent detection system (Eastman Kodak Corp.).

Immunodetection of KDEL

Immunoblots of electrophoretically transferred 4-15% SDS-PAGE gels of secreted proteoglycan (as described above) and cell layer preparations isolated during several stages of the study were performed as described previously (10). For the determination of the presence of the KDEL signal in native leprecan, cell layer extracts were prepared as follows. Confluent monolayers of L-2 cells were rinsed three times for 10 min each with PBS followed by extraction with 4 M urea, 250 mM Tris (pH 8.0), 150 mM sodium acetate, 100 mM Na2EDTA, 100 mM n-ethylmaleimide, 2 mM PMSF. After extraction, 10-µl aliquots of cell extract were diluted with water to 50 µl, and some aliquots subjected to overnight treatment with chondroitinase ABC. Both cell layer aliquots and secreted proteoglycan (± chondroitinase ABC digestion) were electrophoretically separated on a 4-15% gradient gel and subsequently transferred to nitrocellulose for immunoblotting. The resultant blots were probed with either Rb2096 or with a monoclonal antibody directed against the sequence SEKDEL (StressGen, Victoria, British Columbia, Canada). Detection of binding of primary antibodies was done using species- or isotype-specific secondary antibodies (Jackson Immunoresearch) conjugated to alkaline phosphatase, followed by color development with an alkaline phosphatase color development kit (Bio-Rad).

Cell and Tissue Immunohistochemistry

CHO-K1 cells in six-well plates were fixed for 10 min in Clark's solution (22), rinsed three times in PBS, and the wells blocked for 20 min with 1% bovine serum albumin in PBS. Afterward the wells were rinsed three times for 10 min each with PBS, and incubated with the primary antiserum Rb2096 (1/100 dilution in PBS) for 45 min. The wells were subsequently rinsed three times for 10 min each with PBS and then incubated with secondary antibody, goat anti-rabbit IgG conjugated with fluorescein isothiocyanate (1/100 dilution in PBS, Jackson Immunoresearch, Malvern, PA) for 45 min. The wells were rinsed three times for 10 min each with PBS, mounting medium applied, and then coverslipped.

Frozen sections from rat tissues were immunostained with polyclonal antiserum Rb2096 using single label immunohistochemistry protocols described previously (10, 18).

The resultant plate wells or slides were imaged using a Zeiss Axiovert-35 inverted microscope equipped with epifluorescent illumination. Images were digitized using a Photometrics SenSys camera (Photometrics Ltd., Tuscon, AZ) and ported to a Macintosh PowerPC 9500 hosting the image processing software IPLabSpectrum (Scanalytics Corp., Fairfax, VA).

    RESULTS
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

L-2 Cells Produce Multiple Proteoglycans-- The total pool of HD-PG was isolated using a protocol combining cesium chloride density gradient ultracentrifugation and ion exchange chromatography under denaturing conditions. SDS-PAGE of aliquots of that pool indicated the possibility of the existence of four or more species of proteoglycan within the pool (data not shown). Treatment of aliquots of that pool with chondroitinase ABC demonstrated that at least eight core proteins were resolvable after separation with SDS-PAGE, with the majority of the protein cores migrating within the range of 100-200 kDa. Digestion of parallel aliquots of the HD-PG pool with heparitinase/heparinase revealed no clarification of proteoglycan core proteins within the SDS-PAGE gel (data not shown), suggesting that the majority of the proteoglycans within the pool were chondroitin/dermatan sulfate proteoglycans in nature. Western immunoblotting of another aliquot of the HD-PG pool digested with chondroitinase ABC with the polyclonal antiserum R44 provided additional evidence that the core proteins within the pool bore CS/DS GAG chains (Fig. 1A).


View larger version (77K):
[in this window]
[in a new window]
 
Fig. 1.   Panel A is a photograph of a Western blot immunoassay of chondroitinase ABC-treated total HD-PG after SDS-PAGE on a 4-15% gel and transfer to nitrocellulose. In this case the blot was probed with the R44 antiserum, a polyclonal antibody recognizing remnant carbohydrate "stubs." Within the region of the blot spanning 200-100 kDa, there are at least six proteoglycan core proteins that bear CS/DS side chains. Note that there are also several smaller core proteins (~35-50 kDa) identified by this antibody in this preparation. Panel B is a Western blot immunoassay of total HD-PG electrophoresed on a 4-15% gel and transferred to nitrocellulose. Lanes 1 and 3 represent intact proteoglycan, lanes 2 and 4 represent the HD-PG sample treated with chondroitinase ABC and with sequential treatment with chondroitinase ABC then heparitinase/heparinase, respectively. The blot was probed with Mo27, which recognizes material that runs as a smear at 200 kDa (lane 1) that, when treated with the enzyme chondroitinase ABC resolves into several lower molecular mass bands (180, 130, and 100 kDa). c'abc, chondroitinase ABC; +hep, c'abc followed by heparitinase/heparinase; unt, untreated.

An aliquot of HD-PG pool was used as an antigen to immunize mice for the development of a panel of monoclonal antibodies. Screening of the resultant hybridoma tissue culture supernatants using immunoblot assay yielded one clone, Mo27, for which immunoreactivity against proteoglycans was characterized by Western immunoblot assay. In those assays, Mo27 recognized a "ladder" of core proteins 180, 130, and 100 kDa that were resolved after treatment of the HD-PG pool with chondroitinase ABC (Fig. 1B, lane 2). This ladder is sometimes present in GAG-lyase-treated preparations where the GAG chains on the proteoglycan core protein were not entirely cleaved by the lyase. However, prolonged digestion with chondroitinase ABC failed to resolve the three core proteins into a single band (data not shown). Treatment of another aliquot of the HD-PG pool with heparinase/heparitinase failed to change the position of the higher molecular weight protein cores on Western blots. This indicated that presence of the higher molecular weight core proteins recognized by Mo27 on the immunoblot was not due to substitution with HS rather than CS chains. In order to exclude the possibility that the higher molecular weight core proteins might be hybrid proteoglycans, a third aliquot of HD-PG pool was treated in a sequential manner with chondroitinase ABC followed by heparinase/heparitinase. This treatment protocol resolved the same pattern of core proteins on Western blots in a manner similar to that seen with treatment by chondroitinase ABC alone, indicating that L-2 cells export all three core proteins as a CS-substituted core protein (Fig. 1B, lane 4). The fact that neither alternative digestion protocol was unable to resolve the core protein ladder into a single band could be indicative that Mo27 recognizes a shared epitope present on all three core proteins in the HD-PG pool.

In order to maximize the potential to detect either the entire proteoglycan core protein or partially expressed protein core in a cDNA expression library, an aliquot of the HD-PG pool was fractionated over a CL-4B column. Mo27 was used in a Western blot immunoassay to identify those fractions containing the proteoglycan population of interest. Using the contents of those fractions as an immunogen, the polyclonal antiserum Rb4127 was developed against the intact native proteoglycans of the HD-PG pool. The resultant antiserum was characterized in a manner similar to that described above. The antiserum recognized the same pattern of core proteins recognized by Mo27 (Fig. 2A, lanes 1 and 2) and is considered to be a polyvalent antiserum, the activity of which is directed against several CSPG core proteins present in the fraction.


View larger version (36K):
[in this window]
[in a new window]
 
Fig. 2.   Panel A is a Western immunoblot of total HD-PG electrophoresed on a 4-15% gradient gel as either the intact, native PG (lanes 1 and 3) or after treatment with chondroitinase ABC (lanes 2 and 4). The blot was probed with a polyclonal antiserum, Rb4127, that was directed against the native proteoglycan species in the HD-PG fractions (lanes 1 and 2) or with Rb2096, the polyclonal antiserum directed against the P2 fusion protein (lanes 3 and 4). Both antisera recognize the 100-kDa core protein; however, Rb2096 is not immunoreactive with the higher molecular weight core proteins present in the HD-PG fractions. Abbreviations are defined in legend for Fig. 1. Fig. 2B is a Western immunoblot of a 10-20% gradient gel over which the P2E fusion protein had been electrophoresed. The blot was probed with Rb4127, which readily recognized the fusion protein. Un, no induction of expression; Ind, induced expression. Fig. 2C is a Northern blot of total RNA isolated from L-2 cells that was probed with cDNA probe derived from clone P-2. Adjacent to the blot is a parallel strip of membrane that was stained with methylene blue to visualize the 28 and 18 S rRNA. The resultant band detected by the P2 probe migrated to a position within the gel to yield an estimated size of 3.1 kb for the mRNA encoding the protein.

cDNA Library Screening-- From the expression screening protocol using Rb4127 antiserum as a probe, a total of 10 unique clones were isolated, each clone sequenced for approximately 0.5 kb from each end, and the cDNA sequence information obtained from each clone was compared with a proteoglycan cDNA data base to detect potential homology to sequence information of known proteoglycans. None of the 10 clones isolated had overlap with the other. However, one of these clones from the Rb4127 screening panel, P2, had a stop codon and polyadenylation signal at the 3' end, indicating that the cDNA encoded for the carboxyl terminus of a novel protein. Moreover, proteoglycan data base comparison showed P2 to have homology over a limited region with several members of the small leucine-rich repeat family of proteoglycans (SLRP). Translation of the sequence encoded by the cDNA revealed the presence of at least one optimal acceptor site for the addition of GAG chains to the core protein of the molecule (see below). Extension of sequence information was performed using a solution hybridization protocol, using a biotinylated oligonucleotide (5'-TCCTGGCACTCGTCATCA-3') to hybridize to complementary single-stranded cDNA, and the hybridized cDNAs isolated by magnetic separation. One clone, SJ3, contained the sequence found in clone P2 in its entirety, and upon DNA sequencing was found to represent the entire cDNA of the molecule (see below).

Expression of PG Fusion Protein-- In order to further verify that clone P2 encoded the sequence of a proteoglycan core protein, the entire length of P2 cDNA was expressed as a fusion protein (P2E) subsequently purified by affinity chromatography. An aliquot of the P2E fusion protein was characterized by SDS-PAGE (10-20% gel). Coomassie staining indicated that the size of the P2E was approximately 25 kDa. Subsequent Western blot immunoassays of P2E using Rb4127 as a probe provided partial confirmation that the isolated cDNA encoded the carboxyl terminus of a proteoglycan core protein (Fig. 2B). However, probing an identical blot with Mo27 failed to demonstrate recognition of the fusion protein (data not shown).

In order to provide further evidence that the fusion protein represented a portion of the proteoglycan core protein, a polyclonal antiserum, Rb2096, directed against the fusion protein, was developed. The resultant antiserum recognized the 100-kDa proteoglycan core protein resolvable after prior digestion with chondroitinase ABC in the HD-PG fraction (Fig. 2A, lane 4). This 100-kDa protein was also recognized by Mo27 (Fig. 1B, lanes 2 and 4) and Rb4127 (Fig. 2A, lane 2). However, Rb2096 did not recognize the 180- and 130-kDa proteoglycan core proteins in the same preparation as did Mo27 and Rb4127.

Features of Leprecan-- Northern blot analysis of total RNA extracted from L-2 cells gave an estimated 3.1-kb size for the message (Fig. 2C). The isolated SJ3 cDNA contained the entire open reading frame (ORF) for leprecan (Fig. 3) was 2.541 kb in size; the size of the ORF for leprecan in this cDNA was 2.183 kb in length. The remainder of the sequence consisted of an 0.325-kb untranslated region and a poly A tail. During library construction, the EcoRI and XhoI sites in the polylinker region of the UniZap XRTM vector were used for the directional insertion of cDNAs. Because of this, we believe that the 5'-untranslated region is truncated, due to the presence of an EcoRI site 13 bases 5' to the putative translational start site. The deduced amino acid sequence for the leprecan core protein is 727 amino acids in length, having an estimated Mr of 82,384 and an estimated pI of 4.84. The translational start site (23) for the protein leads into a presumptive hydrophobic signal peptide corresponding to the first 14 amino acids of the deduced sequence. A putative signal peptidase cleavage site (24) was found starting at Ala12. An RGD cell attachment motif was found at Arg38, suggesting that this region serves as a binding site for the integrin family of cell adhesion receptors. Within the deduced sequence, three sites for potential GAG attachment were found (Ser182, Ser488, and Ser645). Two (Ser182 and Ser488) of the three sites were flanked on one side by acidic clusters, indicating that they might serve as a site of the addition of either CS or HS chains (25). Also within the sequence were four N-glycosylation sites at Asn308, Asn450, Asn459, and Asn532. Fifteen cysteine residues were found in the sequence (positions 65, 69, 111, 115, 234, 238, 270, 274, 315, 475, 560, 583, 592, 637, and 662), the majority (n = 8) of which occurred as pairs of cysteines separated by three amino acids (CXXXC). Because of the extensive disulfide bonding that occurs within the molecule, in SDS-PAGE the leprecan core protein migrates as a protein with an apparent mass of approx 80 kDa under non-reducing conditions (data not shown) and at/or slightly greater than 100 kDa with reduction. An endoplasmic reticulum retrieval signal, KDEL, was found in proper context as the terminal four amino acids of the core protein, suggesting the possibility that this molecule might also be recycled in the ER-Golgi pathway and may have an intracellular function. (26). The predicted secondary structure of leprecan shows the molecule to consist of a series of short alpha  helices interrupted at irregular intervals by beta  sheets and turns (Fig. 4). Sequence homology searches showed that the amino-terminal region of leprecan has homology (34%) with the protein CASP (27), a protein associated with the cartilage extracellular matrix (Fig. 5), and related proteins (see below). Comparison with cDNA information in the GenBankTM EST data base shows that this molecule is potentially highly conserved in both human and murine tissues (Fig. 6).



View larger version (131K):
[in this window]
[in a new window]
 
Fig. 3.   cDNA sequence of the leprecan coding region with the annotated amino acid sequence directly subjacent. The heavy underscore denotes the extent of the cDNA present in clone P2, the clone isolated in the first round of screening an L-2 expression cDNA library with Rb4127. The translation start site and polyadenylation signal are denoted with appropriate markers, and asterisk (*) denotes the stop codon at the end of the ORF, which is immediately preceded by the ER targeting signal, KDEL. The three putative glycosaminoglycan acceptor sites (SG) are boxed, and the four N-glycosylation sites are underscored.


View larger version (36K):
[in this window]
[in a new window]
 
Fig. 4.   Panel A is a plot of the predicted secondary structure of leprecan, using the Chou-Fasman secondary structure algorithm (top), the Robinson-Garnier secondary structure algorithm (middle), or a combined plot (bottom) of regions of consensus between the two algorithms. The consensus strand predicts the leprecan sequence to contain at least 15 small alpha  helical regions interupted at intervals by either beta  sheets or turns. Panel B shows a schematic diagram of leprecan that is aligned to the upper secondary structure plots shown in A. The three glycosaminoglycan acceptor sites, the N-glycosylation sites, and the KDEL site predicted by primary structure analysis all occur outside of the alpha  helical regions of leprecan.



View larger version (146K):
[in this window]
[in a new window]
 
Fig. 5.   Alignment of the deduced amino acid sequence of leprecan with those of CASP, SC65 (rat), No55 (human), and Unknown Protein B (human). The presence of the CXXXC clusters and their respective enumeration are denoted in bold text above the aligned sequences. The three SG acceptor sites present in leprecan are also demarcated in a similar manner. The areas within the dark gray boxes denote areas of sequence identity; the lighter gray areas denote areas of conservative amino acid substitution. Leprecan and Unknown Protein B (U47926) exactly match two out of three SG acceptor sites (sites 1 and 3); site 2 in the latter is offset from the leprecan site by two amino acids in the aligned sequence. Both molecules share the latter two sets of CXXXC clusters.


View larger version (26K):
[in this window]
[in a new window]
 
Fig. 6.   Alignment of the cDNA sequence of leprecan with sequence information present in the GenBankTM EST data base. The accession numbers and direction of the EST clones is given to the right of each aligned segment. The box at right of the figure shows in tabular format the EST clone number, the species the cDNA clone was derived from, and the percentage of homology at the nucleic acid level each clone has to leprecan.

The Leprecan cDNA (SJ3) Is a Full-length Transcript Encoding a CSPG-- Further studies were done to substantiate the fact that a viable translation initiation sequence was present in the ORF of SJ3 cDNA and that the protein encoded by the SJ3 cDNA would be exported into the extracellular milieu as a proteoglycan. For these studies, CHO-K1 cells were transiently transfected with the entire SJ3 cDNA construct that had been subcloned into the pZeo-SV vector. To use the pZeo-SV vector for protein expression studies, the inserted cDNA must have a start codon, stop codon, and polyadenylation signal present within its sequence. In parallel wells, non-transfected cells or CHO-K1 cells transfected with either pZeoSVLacZ or pZeo-SV served as controls for the immunoprecipitation experiments. Twenty-four hours after transfection, the medium from each cell condition was harvested, the cell layers fixed, and processed for immunocytochemistry (Fig. 7) using Rb2096 as a probe. Immunofluorescence microscopy showed that no leprecan-associated immunostaining was present in native CHO cells (Fig. 7A), sham-transfected CHO cells (data not shown), or lacZ-transfected cells (Fig. 7B). Leprecan-associated immunofluorescence was seen in cells transfected with SJ3 cDNA (Fig. 7C and background), the pattern of immunolocalization for this molecule resembled ER and Golgi staining.


View larger version (127K):
[in this window]
[in a new window]
 
Fig. 7.   Series of micrographs representing the results of transfection studies in CHO-K1 cells. The micrographs in the inset represent native CHO-K1 cells (A), or CHO K-1 cells transfected with either the lacZ gene (B) or the SJ3 construct (C), which represents the entire coding region of leprecan. The cultures were immunostained with the polyclonal antiserum Rb2096. Both native CHO K1 cells (A) and lacZ-transfected cells (B) were negative for immunostaining, whereas the CHO K1 cells transfected with the SJ3 construct were positive. The larger micrograph is a higher magnification of an SJ3-transfected culture, the cells within that culture showing varying degrees of immunoreactivity for the SJ3 protein as well as the limited localization of the protein to structures resembling ER and Golgi in appearance. In both the transfected cells and in native L-2 cells immunostained for leprecan, nuclear/nucleolar immunostaining was not noted.

The secreted proteoglycan was immunoprecipitated from equal volumes of medium from the cell transfection groups with Rb2096. Because the intact leprecan core proteoglycan runs as a smear within the upper confines of the resolving gel, in order to resolve the CS substituted core proteins, the bound, immunoprecipitated proteoglycan was digested with chondroitinase ABC, run on a 4-15% SDS-PAGE under reducing conditions, and electrophoretically transferred to nitrocellulose. The blot was subsequently probed with biotinylated affinity-purified antibodies derived from the same antiserum (Fig. 8) and detected with avidin-horseradish peroxidase. Leprecan could not be immunoprecipitated from conditioned medium taken from cultures of either sham-transfected cells (lane 1) or cells transfected with lacZ (lane 2). Rb2096 was able to immunoprecipitate leprecan (lane 3) from the medium of CHO-K1 cells transfected with SJ3 cDNA, as evidenced by the dense, broad band present on the gel at 100 kDa. Densitometry measurements of areas of even exposure within the corresponding lanes showed that CHO cells transfected with the SJ3 insert gave a signal strength 2.42 and 3.70 times greater than that of lacZ and sham transfectants, respectively.


View larger version (80K):
[in this window]
[in a new window]
 
Fig. 8.   Lumigraph of a chemiluminescence-based Western blot immunoassay of a chondroitinase ABC-treated material electrophoresed on a 4-15% SDS-PAGE gel, isolated by immunoprecipitation from CH0-K1 cell-conditioned medium using Rb2096. The medium was derived from cells that were either sham-transfected with LipofectAMINE (lane 1), transfected with lacZ (lane 2), or transfected with the entire SJ3 construct (lane 3). The boxes directly adjacent to the lane demarcate the extent of the area within the lane that was used for quantification (see text). The measurements were taken in each lane in a region of continuous density immediately to the left of each box. Rb2096 immunoprecipitated material in the conditioned medium from the cells expressing the SJ3 construct that was sensitive to chondroitinase ABC treatment that migrated with an apparent size of approximately 100 kDa. This material was not seen in medium from either the native cells (data not shown), cells sham-transfected, or cells transfected with lacZ, nor was it resolved in lumigraphs of similar experiments where the immunoprecipitated proteoglycan was not treated with chondroitinase ABC prior to electrophoresis (data not shown).

Leprecan Is Secreted as a Chondroitin Sulfate Proteoglycan by L-2 Cells-- The information from immunoblot assays of secreted proteoglycans isolated from L-2 conditioned medium and the pattern of tissue immunostaining (see below) with the Rb2096 polyclonal antiserum provided evidence that the leprecan core protein should be secreted by cells as an extracellular CSPG. However, the presence of the KDEL motif at the COOH terminus of the deduced sequence also implied that this molecule should be retained in the ER-Golgi circuit (26). To determine if the intracellular and secreted forms of leprecan did contain the KDEL recognition signal, an immunoblot assay on duplicate membranes was performed using the polyclonal antiserum Rb2096 and a monoclonal antibody recognizing the KDEL sequence (Fig. 9). The anti-KDEL monoclonal antibody recognized a band approximately 100 kDa that was resolved by SDS-PAGE after prior treatment of a total L-2 PG pool with chondroitinase ABC. A band that migrated in an identical manner in SDS-PAGE was also recognized by Rb2096, the polyclonal antiserum directed against the P2 fusion protein. These results indicate that the native leprecan core protein secreted by L-2 cells possesses the KDEL motif at the COOH terminus.


View larger version (47K):
[in this window]
[in a new window]
 
Fig. 9.   Western blot of a 4-15% SDS-PAGE through which secreted L-2 PG (lanes 1, 2, 5, and 6) and PG extracted from confluent L-2 cell layers (lanes 3, 4, 7, and 8) were electrophoresed. One half of the blot was probed with Rb2096 (lanes 1-4), which was directed against the carboxyl half of leprecan; the other half of the blot (lanes 5-8) was probed with a monoclonal antibody directed against the KDEL sequence, the ER targeting signal present on the carboxyl terminus of leprecan (see "Experimental Procedures"). The material present in lanes 1, 3, 5, and 7 was intact proteoglycan (unt) whereas the material run in lanes 2, 4, 6, and 8 was proteoglycan treated with chondroitinase ABC (c'abc) prior to electrophoresis. The intact proteoglycan is barely evident on the resolving gel, showing mainly as a faint smear within lanes 1, 3, 5, and 7. With prior treatment with chondroitinase ABC, a band migrating to approximately 100 kDa is detected in the secreted PG (lanes 2 and 6) and in the proteoglycan extracted from the cell layer (lanes 4 and 8) using both antibodies. This provides evidence that leprecan, which contains a KDEL targeting signal, is secreted by L-2 cells as a proteoglycan bearing CS glycosaminoglycan chains.

Leprecan Is a Basement Membrane Proteoglycan-- A survey of various tissues including cardiac muscle, skeletal muscle, central nervous system (cerebral cortex and cerebellum), intestinal tract, trachea, ear, skin, liver, and kidney was done (Fig. 10). A consistent finding was that, in every tissue examined, leprecan antibodies immunostained the basement membranes of the vasculature and smooth muscle associated with each organ.


View larger version (158K):
[in this window]
[in a new window]
 
Fig. 10.   These micrographs show the patterns of leprecan immunostaining of frozen sections of the renal glomerulus (A), renal tubules and peritubular capillaries (B), hyaline cartilage from trachea (C), and skeletal muscle (D). In the glomerulus (A), the glomerular basement membrane (arrow), Bowman's capsule, and mesangial matrix are stained with this antiserum. Antiserum also immunostained the basement membranes of renal tubules and the peritubular capillaries (arrow). In cartilage (C), leprecan immunostained the pericellular material surrounding individual chondrocytes (arrow) in lacuna but not the cartilage matrix. In skeletal muscle (D), the basement membranes of the microvessels (arrow) in between the skeletal muscle are stained but the skeletal muscle basement membranes are negative for leprecan.

In the kidney, Rb2096 immunostained the glomerular basement membrane, mesangial matrix, and Bowman's capsule of the nephron (Fig. 10A). In the renal parenchyma leprecan antibodies stained the basement membranes of tubules and blood vessels (Fig. 10B). Because of the homology of leprecan with CASP (27), sections of trachea and ear were also immunostained (Fig. 10C). Leprecan immunoreactivity was found around the perimeter of resident chondrocytes in lacunae (arrows) similar to that reported for perlecan (28) but not in the cartilage matrix proper as reported for CASP (27). Leprecan weakly immunostained the basement membranes around smooth muscle in the stomach and small intestine but did not immunostain the basement membranes of the intestinal crypts, villi, or the gastric glands. (data not shown). In liver, leprecan immunostained the basement membranes of large vasculature as well as small components of the hepatic sinusoidal tree (data not shown). Consistent with its distribution in the above tissues, leprecan was found in the vasculature of the brain, cardiac muscle, and skeletal muscle (Fig. 10D).

    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Our results show that we have characterized a novel proteoglycan core protein, the tissue distribution of which in our initial survey studies appears to be localized to some, but not all, basement membrane matrices. Although this particular sequence was identified in our early screening studies by remote homology to fibromodulin and lumican, two members of the small leucine-rich repeat proteoglycan (SLRP) family, the deduced amino acid sequence is far removed enough from the SLRP's to show this protein to be unique in primary structure from all other known proteoglycan core proteins. Moreover, alignment of the cDNA sequence of leprecan with those of other proteoglycans resident to the basement membrane, including perlecan (6, 7), agrin (8), or bamacan (14), shows that the leprecan core protein is unrelated to the known proteoglycans, and is a member of a separate family of molecules that have yet to be characterized in detail (see below).

The deduced amino acid sequence information suggests that this core protein has the potential to be heavily glycosylated, having a total of four putative N-glycosylation sites (Asn308, Asn450, Asn459, and Asn532) and a total of three SG sequences, which might serve to function as acceptor sites for the addition of the GAG chains. Although the immunoblot data demonstrate that this molecule is secreted as a CSPG, the number and size of the attached GAG chains is presently not known. However, two of the three SG motifs are associated with flanking acidic residues, which are thought to promote the addition of GAGs (25). The first site, at Ser182, is associated with a downstream acidic sequence, E186ED. The second site (Ser488) has the motif SGDG, which is similar to the glycosaminoglycan site found in the perlecan core protein (7). This site is flanked on the amino-terminal side by the sequence D472DEC, which might also permit this particular site to serve as an acceptor site for a heparan sulfate GAG chain. The third site at Ser645 might serve as an acceptor site for GAG chains but is not immediately flanked on either side by clusters of acidic residues (25), suggesting that this site might serve for the exclusive attachment for CS chains. Thus, leprecan could possibly be secreted by cells as a proteoglycan bearing either CS or HS chains, or possibly a hybrid molecule. In the case of the native core protein secreted by L-2 cells, immunoblot assays addressed this possibility, showing the core protein to be susceptible to chondroitinase ABC and insensitive to treatment with heparanase/heparitinase (data not shown). Sequential treatment with both GAG degrading enzymes showed no shift in the electrophoretic mobility of the core protein, indicating that L-2 cells did not secrete leprecan as a hybrid proteoglycan (Fig. 1B). However, recognizing the fact that the perlecan core protein can exist as either a CS- or HS-substituted proteoglycan (17), one must acknowledge the potential for tissue-specific patterns of GAG substitution for leprecan.

Of particular interest is that the carboxyl terminus of this proteoglycan bears the ER retrieval signal, KDEL, implying that this molecule might participate in some way in ER processes, possibly interacting with other secreted matrix molecules. This observation is corroborated in part by the pattern of immunolocalization of leprecan synthesized by transiently transfected CHO K1 cells, showing that the construct is readily detectable in the ER and Golgi (Fig. 7). Immunostaining of L2 cells in culture for leprecan also shows that cells handle the native protein in a similar manner (data not shown). Immunoblots of SDS-PAGE gels of whole L2 cell homogenates and secreted proteoglycan from tissue culture supernatant bulk preparations with a monoclonal antibody directed against the KDEL sequence demonstrates that the KDEL retrieval signal is retained in both the intracellular and in the secreted proteoglycan (Fig. 9). KDEL is known to function as an ER retrieval signal that potentially limits the localization of proteins bearing this sequence to the ER and Golgi circuit (26). However, several recent reports describe other KDEL-tagged molecules as exiting the secretory/sorting pathway, being exported by the cell as cell surface molecules (29-31) or as a soluble secreted protein (32). Our data show that leprecan follows that trend, for L-2 cells secrete the native leprecan protein into the cell medium and CHO K1 cells transiently transfected with the leprecan sequence secrete leprecan as a CS substituted proteoglycan. This finding implies a mechanism by which retention in the ER is bypassed to allow the core protein to escape the ER-Golgi-ER circuit for further processing. Several mechanisms have been postulated on how ER retention of KDEL proteins may be defeated including subunit dimerization (33) or posttranslational cleavage of the KDEL domain (34). Others have postulated that the KDEL signal is "leaky" (32), delaying transit of the protein through the secretory pathway but allowing eventual export of the KDEL containing protein. In the case of leprecan, two out of the three mechanisms are possible, but have yet to be investigated in detail. As mentioned above, our studies show that the KDEL targeting signal is retained on the secreted leprecan core protein, thus proteolytic cleavage is unlikely. Dimerization or aggregation into larger order aggregates is a possibility considering the numerous cysteine residues (n = 15) present in the deduced sequence. The latter possibility might also be occurring in the case of leprecan, where the export of leprecan by the cell is deliberately delayed due to the KDEL signal. Site-directed mutagenesis studies are currently in progress to test the significance of the loss of the KDEL signal in the secretion of leprecan.

The deduced amino acid sequence has little homology between the leprecan sequence when compared to most of the known proteoglycan core protein sequences by BLAST algorithms. Our data base search of GenBankTM showed that the amino-terminal region of the proteoglycan leprecan has homology with a recently described glycoprotein present in the extracellular matrix of cartilage, called CASP (EMBL accession no. X97607)/Dualin (Swiss Protein data base accession no. Q90830) (27) (Fig. 5). Although CASP was shown to be highly regulated during morphogenesis and development, the function of CASP/Dualin in cartilage is yet unknown. Interestingly, both leprecan and CASP/Dualin have, in turn, sequence similarities with a nucleolar protein, No55 (35) (Swiss Protein accession no. Q92791), a synaptonemal protein SC65 (36) (EMBL accession no. X65454), and with an unknown protein B (37) (GenBankTM accession no. U47926). Moreover, besides apparent familial similarities (see below), scanning the GenBankTM EST data base reveals that in part the human and murine forms of leprecan are highly conserved (Fig. 6).

The leprecan cDNA sequence information indicates that the encoded core protein is substantially larger in size (estimated Mr 82,384) than CASP, No55, and SC65, primarily due to the longer carboxyl "tail" of leprecan (Fig. 5). Leprecan shows areas of similarity with the sequence encoded by U47926, especially obvious in the carboxyl tail region of both molecules where the other members of the family do not overlap. In its amino-terminal region, leprecan also differs from three members of this family in the location of the putative glycosaminoglycan acceptor site. That acceptor site, Ser182, is not conserved in CASP, No55, or SC65 but is present in U47926. However, a different SG motif is present at a site in No55, and SC65 (Fig. 5) far downstream from the putative GAG acceptor site found in leprecan. The third GAG acceptor site is conserved between leprecan and U47926 at Ser645, while the second leprecan acceptor site Ser488 is offset in U47926 by two amino acids. What is not known at this time is whether these SG motifs serve as GAG acceptor sites for the other members of the family.

Comparison of the homologous regions of leprecan with the former molecules shows that seven cysteine residues present in regions of sequence similarity are highly conserved among the proteins, with the CXXXC motif being conserved in three of these proteins. The fourth CXXXC match is shared among leprecan, SC65, No55, and U47926, but is missing in CASP. Conservation of these cysteines is indicative that disulfide bonding is important for the secondary structure of these molecules. Although the CXXXC motifs in the homologous regions among the family members is consistent, what does vary among members in the family is the distance between one CXXXC motif and the next, with leprecan having the greatest distance between CXXXC 1 and 2. The distance between CXXXC 2 and 3 for all family members with the exception of leprecan is identical. U47926 has the greatest distance between CXXXC 3 and 4, followed by leprecan; SC65 and No55 have identical distances. Because of these minor differences, it is anticipated that the secondary structure between members of the family will differ slightly. However, at this point the significance of this pattern of organization is presently unknown and the extent of actual intra- versus interpeptide bonding has yet to be ascertained; those experiments are currently in progress.

Analysis of the secondary structure of leprecan (Fig. 4A) shows leprecan to consist of 15 short (18-25 amino acids long) alpha  helical regions with beta  sheets and turns interspersed between the helical regions. Given this secondary structure, we believe that leprecan must be globular in configuration rather than a long, extended configuration. This prediction can be collaborated in part by its differential mobility in SDS-PAGE, based on whether or not the molecule is reduced prior to electrophoresis (data not shown). Within the predicted secondary structure of leprecan, the glycosaminoglycan binding sites, the N-glycosylation sites, and the KDEL retrieval signal all lie outside the boundary of the alpha  helical regions where they would be readily accessible. Three of the four CXXXC motifs (CXXXC 1-3) present in leprecan begin at the end of an alpha  helical region and terminate immediately outside of the predicted alpha  helix. The last CXXXC domain spans what is predicted as a beta  turn in the secondary structure. It may be that the CXXXC motif (1-3) serves to lock the secondary structure of the respective alpha  helices into a specific conformation.

Leprecan's secondary structure is unlike the basement membrane CSPG bamacan (14), the secondary structure of which is conferred by the presence of two distinct regions of closely spaced alpha  helices. Several of these alpha  helices are between 50 and 100 amino acids in length before they are terminated. Because of the extensive alpha  helices present in bamacan, its secondary structure has been predicted to consist of two coiled-coil domains separated by a short flexible hinge region (14). Perlecan, a basement membrane proteoglycan, has small alpha  helices (10-20 amino acids) within its sequence. However, these are in minority compared with the numerous beta  sheets and turns predicted to occur in perlecan. The predicted secondary structure of agrin, the other heparan sulfate proteoglycan present in basement membranes, resembles that of perlecan in that the beta  sheets and turns far outnumber the small alpha  helices present in the predicted secondary structure.

The immunohistochemistry studies in this report show that leprecan is associated with the vascular tree, in the media/adventitia of vessels. Leprecan is present in the glomerulus, resident in both the glomerular basement membrane and mesangial matrix (Fig. 10). Its presence in the glomerular capillary wall of the glomerulus is unlike what we had previously described for BM-CSPG/bamacan (10, 18). In those studies we reported that BM-CSPG/bamacan was present only in the mesangial matrix of the glomerulus and not present in the glomerular capillary wall. However, two studies that used monoclonal antibodies directed against carbohydrate epitopes (13, 19), present on CSPGs after prior digestion with chondroitinase ABC, were able to demonstrate the presence of a CSPG in the glomerular capillary wall. Because of the data in this present report, it is possible that leprecan, not BM-CSPG/bamacan, was being detected in the glomerular basement membrane. In some respects, the pattern of distribution for leprecan parallels that reported for two members of the original panel of monoclonal antibodies (mAbs 5A3 and 4D5) directed against the proteoglycan BM-CSPG (10, 18), that is now named bamacan (14, 17). In the earlier report we showed that mAb 5A3 immunostained the microvasculature of skeletal muscle and weakly, if at all, immunostained the basement membranes of skeletal muscle. The pattern for leprecan immunostaining is similar in that it stains vascular basement membranes and not basement membranes surrounding skeletal muscle. mAb 4D5 and Rb2096 also show parallel patterns of immunostaining in cartilage. These patterns of immunostaining early in our studies suggested to us that the cDNA that we had isolated was, in fact, BM-CSPG. However, comparing both cDNA sequences by BLAST analysis, the deduced amino acid sequences for both molecules by Clustal W analysis, and the secondary structures by Chou-Fasman analysis shows leprecan and bamacan to be completely different.

    ACKNOWLEDGEMENTS

We thank Mark Lauer and Cynthia Goldsmith for their assistance with experiments in this report.

    FOOTNOTES

* This work was supported by Grant 1-RO1-DK48055 from NIDDK, National Institutes of Health and by a grant from the Kirkman and Elizabeth O'Neal Research Fund of the Greater Birmingham Foundation.The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EMBL Data Bank with accession number(s) AF087433.

To whom correspondence should be addressed: Dept. of Pathology, School of Medicine, Louisiana State University Medical Center, 1501 Kings Highway, Shreveport, LA 71130. Tel.: 318-675-4509; Fax: 318-675-5889; E-mail: kmccar2@lsumc.edu.

    ABBREVIATIONS

The abbreviations used are: HSPG, heparan sulfate proteoglycan; CSPG, chondroitin sulfate proteoglycan; HS, heparan sulfate; CS, chondroitin sulfate; DS, dermatan sulfate; GAG, glycosaminoglycan; HD-PG, high buoyant density proteoglycan; TBS, Tris-buffered saline; TTBS, Tris-buffered saline with Tween 20; ER, endoplasmic reticulum; SLRP, small leucine-rich repeat family of proteoglycans; kb, kilobase(s); BM-CSPG, basement membrane-specific chondroitin sulfate proteoglycan; DTE, dithioerythritol; PMSF, phenylmethylsulfonyl fluoride; CHAPS, 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonic acid; contig, group of overlapping clones; ORF, open reading frame; EST, expressed sequence tag; mAb, monoclonal antibody; PAGE, polyacrylamide gel electrophoresis; PBS, phosphate-buffered saline; CHO, Chinese hamster ovary.

    REFERENCES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

1. Hassell, J. R., Robey, P. G., Barrach, H. J., Wilczek, J., Rennard, S. I., and Martin, G. R. (1980) Proc. Natl. Acad. Sci. U. S. A. 77, 4494-4498[Abstract/Free Full Text]
2. Hassell, J. R., Leyshon, W. C., Ledbetter, S. R., Tyree, B., Suzuki, S., Kato, M., Kimata, K., and Kleinman, H. K. (1985) J. Biol. Chem. 260, 8098-8105[Abstract/Free Full Text]
3. Hagen, S. G., Michael, A. F., and Butkowski, R. J. (1993) J. Biol. Chem. 268, 7261-7269[Abstract/Free Full Text]
4. Soroka, C. J., and Farquhar, M. G. (1991) J. Cell Biol. 113, 1231-1241[Abstract/Free Full Text]
5. Kallunki, P., and Tryggvason, K. (1992) J. Cell Biol. 116, 559-571[Abstract/Free Full Text]
6. Murdoch, A. D., Dodge, G. R., Cohen, I., Tuan, R. S., and Iozzo, R. V. (1992) J. Biol. Chem. 267, 8544-8557[Abstract/Free Full Text]
7. Noonan, D. M., Fulle, A., Valente, P., Cai, S., Horigan, E., Sasaki, M., Yamada, Y., and Has