JBC

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Bennett, E. P.
Right arrow Articles by Clausen, H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bennett, E. P.
Right arrow Articles by Clausen, H.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Volume 271, Number 29, Issue of July 19, 1996 pp. 17006-17012
©1996 by The American Society for Biochemistry and Molecular Biology, Inc.

cDNA Cloning and Expression of a Novel Human UDP-N-acetyl-alpha -D-galactosamine
POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE, GalNAc-T3*

(Received for publication, July 5, 1995, and in revised form, April 15, 1996)

Eric Paul Bennett , Helle Hassan and Henrik Clausen Dagger

From the Faculty of Health Sciences, School of Dentistry, DK-2200 Copenhagen, Denmark

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
Acknowledgment
REFERENCES


ABSTRACT

The glycosylation of serine and threonine residues during mucin-type O-linked protein glycosylation is carried out by a family of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferases (GalNAc-transferase). Previously two members, GalNAc-T1 and -T2, have been isolated and the genes cloned and characterized. Here we report the cDNA cloning and expression of a novel GalNAc-transferase termed GalNAc-T3. The gene was isolated and cloned based on the identification of a GalNAc-transferase motif (61 amino acids) that is shared between GalNAc-T1 and -T2 as well as a homologous Caenorhabditis elegans gene. The cDNA sequence has a 633-amino acid coding region indicating a protein of 72.5 kDa with a type II domain structure. The overall amino acid sequence similarity with GalNAc-T1 and -T2 is approximately 45%; 12 cysteine residues that are shared between GalNAc-T1 and -T2 are also found in GalNAc-T3. GalNAc-T3 was expressed as a soluble protein without the hydrophobic transmembrane domain in insect cells using a Baculo-virus vector, and the expressed GalNAc-transferase activity showed substrate specificity different from that previously reported for GalNAc-T1 and -T2. Northern analysis of human organs revealed a very restricted expression pattern of GalNAc-T3.


INTRODUCTION

The glycosylation of serine and threonine residues during mucin-type O-linked protein glycosylation is carried out by a family of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferases (GalNAc-transferase)1 (EC). Two distinct human GalNAc-transferase genes, GalNAc-T1 and -T2, have been cloned and characterized to date (1, 2, 3).2 Analysis of the acceptor substrate specificity of GalNAc-T2 has revealed substrates that this transferase does not utilize (3, 4). In the present study we have analyzed the acceptor substrate specificity of GalNAc-T1 and found that neither GalNAc-T1 nor -T2 utilize all substrates identified, thus suggesting the existence of additional GalNAc-transferases. The existence of additional GalNAc-transferases has also been suggested by Hagen et al. (5) by analysis of sequence similarities of expressed sequence tag clones with those of GalNAc-T1 and -T2. O-Glycosylation in yeast has similarly been shown by Tanner and colleagues (6, 7, 8) to be initiated by at least four mannosyltransferases.

Families of glycosyltransferases with related acceptor and/or donor substrate specificities may be encoded by homologous genes showing segments of sequence similarity (9, 10). Initially, no sequence similarities were found between enzymes having the same donor substrate specificity (11), but as more enzymes have been cloned, several families of homologous glycosyltransferase genes have been identified. Livingston and Paulson (12) originally identified a sialyltransferase motif in a segment of 55 amino acids that has now been found to be conserved within all identified members of the sialyltransferase family (13). Similarly, sequence similarities are also found within alpha 3/4-fucosyltransferases (10, 14), alpha 2-fucosyltransferases (15), beta 6-N-acetylglucosaminyltransferases (16), beta 4-N-acetylgalactosaminyltransferases (17), the histo-blood group A/B transferases, and an alpha 3-galactosyltransferase (18).

The human GalNAc-transferases T1 and T2 share a segment of 61 amino acids with 82% sequence similarity, and this segment is also found in a deduced homologue, ZK688.8 (see Fig. 1), which has been observed to exhibit GalNAc-transferase activity (5). In the present study we have utilized this potential GalNAc-transferase motif to develop a PCR strategy that identified two novel cDNAs with sequence similarity. Here we report the cloning of cDNA containing the complete coding sequence of one of these and show by expression that the gene encoded a GalNAc-transferase with an acceptor substrate specificity partly different from GalNAc-T1 and -T2. Northern analysis showed that the expression of GalNAc-T3 is highly tissue-restricted in contrast to GalNAc-T1 and -T2.


Fig. 1. GalNAc-transferase motif. Multiple sequence alignment of human GalNAc-T1 (accession number X85018[GenBank]), GalNAc-T2 (accession number X85019[GenBank]), and C. elegans gene (accession number L16621[GenBank]) revealed a 61-amino acid motif. Sequences given are compared with the human GalNAc-T1 (3) where an asterisk indicates an identical amino acid or base. The location and sequence of primers (EBHC100/EBHC106) used in the RT-PCR cloning strategy are indicated. The sequence of the GalNAc-tranferase motif of the isolated TE3 DNA shown to represent GalNAc-T3 is also shown.
[View Larger Version of this Image (29K GIF file)]


EXPERIMENTAL PROCEDURES

Identification of cDNA Homologous to GalNAc-T1 and -T2 by RT-PCR and Restriction Enzyme Analysis

Multiple sequence alignment analysis (DNASIS, Hitachi) of GalNAc-T1 and -T2 was applied to identify areas with highest degree of sequence similarity. Based upon a 61-amino acid segment shared by GalNAc-T1 and -T2 as well as a more recently reported sequence derived from a homologous Caenorhabditis elegans gene (5), a pair of sense and anti-sense primers (EBHC100, 5'-TGGGGAGGAGARAACCTAGA-3', and EBHC106, 5'-ATTCATCCATCCATACTTCT-3', respectively, was used in RT-PCR amplifications of poly(A+) RNA from several sources (see Figs. 1 and 2). The mRNA from human organs (liver, brain, and submaxillary gland) were obtained from Clontech, and mRNA from human cancer cell lines (MKN45, Colo205, and WI38) was prepared as reported previously (3). A restriction enzyme search identified a common BstNI site within the expected 196-bp RT-PCR product of GalNAc-T1 and -T2, which would produce two fragments of 128 and 68 bp. Novel DNA fragments representing putative additional GalNAc-transferases were identified by RT-PCR with EBHC100/EBHC106 primers. Six mRNA templates were analyzed after BstNI digestion in 2% agarose gels. Following reverse transcription using the EBHC106 primer, PCR was performed for 35 cycles of 95 °C for 45 s, 53 °C for 15 s, and 72 °C for 15 s using Taq polymerase on a model 480 thermocycler (Perkin Elmer). Products were verified by Southern blotting and hybridization with oligonucleotide probes specific for GalNAc-T1 or -T2 (EBHC112, 5'-CTTTGGAAATTGTTACATGCTCA-3', and EBHC45, 5'-TGGTGGCAGCCTGGGAGATCA-3', respectively).


Fig. 2. BstNI restriction analysis of EBHC100/EBHC106 RT-PCR products generated from organs and cell lines. PhiX174 markers (194 and 118 bp) are indicated to the left, and the predicted RT-PCR product of EBHC100/EBHC106 (196 bp) as well as the larger product of a BstNI cleavage hereof (128 bp) are indicated to the right.
[View Larger Version of this Image (76K GIF file)]

The 196-bp products from RT-PCR of MKN45 mRNA that were resistant to BstNI cleavage were isolated using the prep-A-gene kit (Bio-Rad) and cloned into the pT7T3U19 vector (Pharmacia Biotech Inc.). Plasmids from 40 individual clones were purified using Qiagen-tip 20 column (Qiagen), and the clones were sequenced. Two sets of sequences differing from GalNAc-T1 and -T2 but exhibiting a high degree of similarity were identified, and sequence information from one set of identical clones designated TE3 was used for the isolation of 5' and 3' sequences outside the GalNAc-transferase motif.

Cloning and Sequencing of GalNAc-T3 by Rapid cDNA Library Screening

Rapid library screening was performed by diluting 1 × 106 pfu of human salivary gland lambda gt11 library (Clontech) into 40 sublibraries (designated numbers 1-40), each possessing approximately 2.5 × 104 pfu. All sublibraries were subjected to phage amplification (approximately 40-fold) by liquid culture phage amplification (19), giving a sublibrary titer of 1 × 106 pfu. Phage amplification was performed in 1 ml of LB MgSO4 maltose medium in a shaking incubator at 37 °C for 5 h. After amplification, 20 µl of chloroform was added to each sublibrary, cellular debris was pelleted, and the phage supernatants were titrated and used in subsequent screenings. All 40 sublibraries were screened to identify TE3 possessing phage clones. One µl of each sublibrary (approximately 104-105 pfu) was lysed in a 10 µl of volume in the presence of 0.45% Nonidet P-40 and Tween 20, 100 µg/µl proteinase K at 56 °C for 30 min. Proteinase K was heat-inactivated by boiling for 15 min, and 2 µl of phage lysate was amplified by PCR using primers EBHC100 and EBHC204 at 0.5 µM using 40 cycles of 95 °C for 45 s, 55 °C for 5 s, and 72 °C for 30 s. Thirteen sublibraries found to contain TE3 lambda gt11 clones were further assayed by PCR using EBHC202 (5'-GCGGATCCGCAGCAAAAGCCCTCATAGCTTT-3') or EBHC204 (5'-GCGGATCCTCTAGCAATCACCTGAGTGCC-3') primers combined with the lambda gt11 vector primers to estimate lengths of cDNA inserts for selection of sublibraries with most 3' or 5' sequences. Amplifications were performed for 35 cycles of 95 °C for 45 s, 55 °C for 1 s, and 72 °C for 2 min.

Two sublibraries generated 3' PCR products (EBHC204/lambda gt11 vector) of approximately 1000 bp, and two sublibraries generated TE3 5' PCR products of approximately 1200 bp. PCR products were subcloned into pT7T3U19 and sequenced. These PCR products were used to probe and isolate lambda cDNA clones from the corresponding sublibraries. Both strands of the subcloned lambda cDNAs were sequenced (20) using internal primers spaced 3-400 bp apart. Partly overlapping sequence data from lambda cDNA clones were utilized to derive the complete coding sequence.

lambda cDNA Isolation

25,000 pfu from sublibraries were plated on 15-cm LB agar plates by standard procedures (19), and plaques were transferred to HYBOND N+ (Amersham Corp.) nylon membranes and hybridized to random primed [alpha -32P]dCTP-labeled PCR probes. Hybridization was performed at 42 °C in the presence of 6 × SSPE, 5 × Denhart's, 0.5% SDS, and 50% formamide. Plaque lifts were washed 5 × at 42 °C with 2 × SSC, 0.1% SDS, once with 0.5 × SSC, 0.1% SDS, and once at 55 °C with 0.1 × SSC, 0.1% SDS in a mini-hybridization oven (Hybaid).

Expression of GalNAc-T3 in Sf9 Cells

A partial cDNA sequence of the putative GalNAc-T3 gene an RT-PCR product (pAcGP67-GalNAc-T3-sol) using primers EBHC219 (5'-AGCGGATCCTCAACGATGGAAAGGAACATG-3') and EBHC215 (5'-AGCGGATCCAGGAACACTTAATCATTTTGGC-3') with BamHI restriction sites introduced was produced and cloned (see Fig. 3). The PCR product was designed to yield a putative soluble form of the GalNAc-T3 protein with an NH2-terminal end positioned immediately COOH-terminal to the potential transmembrane domain and including the entire sequence expected to contain the catalytic domain. The PCR product was cloned into a BamHI site of the expression vector pAcGP67 (Pharmingen), and the expression construct was sequenced to verify the sequence and correct insertion into the cloning site. Control constructs included pAcGP67-GalNAc-T2-sol prepared as described previously (3), pAcGP67-GalNAc-T1-sol prepared similarly by RT-PCR with human submaxillary gland mRNA and designed to mimic the originally identified amino terminus of the soluble bovine GalNAc-transferase protein (1), and pAcGP67-O2-sol containing the histo-blood group O2 cDNA and prepared as described previously for the blood group A cDNA (21). Co-transfection of Sf9 cells with pAcGP67-constructs and Baculo-GoldTM DNA was performed according to the manufacturer's description. Briefly, 0.5 µg of construct was mixed with 0.05 µg of Baculo-Gold DNA and co-transfected in Sf9 cells in 24-well plates. 96 h post-transfection recombinant virus was amplified in 6-well plates at dilutions of 1:10 and 1:50. Titer of amplified virus was estimated by titration in 24-well plates with monitoring of enzyme activities. Transferase assays were performed on supernatants of Sf9 cells in 6-well plates infected with virus at titer 1:1000 to 1:5000 representing end point dilutions giving optimal enzyme activities.


Fig. 3. Schematic representation of cDNA clones used to construct the nucleotide sequence of GalNAc-T3.
[View Larger Version of this Image (5K GIF file)]

Analysis of Polypeptide GalNAc-transferase Activity

GalNAc-transferase activity was measured in standard reaction mixtures containing 25 mM Tris (pH 7.4), 5 mM MnCl2, 0.25% Triton X-100, 50 µM UDP-[14C]GalNAc (4000 cpm/nmol), 5 mM 2-mercaptoethanol, 250 µM peptide, and 10 µl of culture supernatant after incubation at 37 °C for 20 min. Controls included untransfected Sf9 cell culture medium and a construct pAcGP67-O2-sol of the enzymatically nonfunctional histo-blood group O2 gene (22). Acceptor peptides (Muc2, PTTPISTTTMVTPTPTPTC; Muc5c, CTTSTTSAPTTSTTSAPTTS; and HIV-V3, CIRIQRGPGRAFVTIGKIGNM) were obtained from Carlbiotech (Copenhagen) and Neosystems (Strassburg), and quality was ascertained by amino acid analysis and mass spectrometry. Glycosylated product was quantified by scintillation counting after Dowex-1 chromatography. All combinations of enzyme sources and peptides were evaluated at least once by C-18 reverse phase chromatography (PC3.2/3 or uRPC C2/C18 SC2.1/10 Smart System, Pharmacia) and scintillation counting of peptide peak fractions to confirm incorporation of [14C]GalNAc into the acceptor peptide (4).

Northern Hybridization

Multiple tissue Northern blots (MTN I, MTN II, and fetal MTN) were obtained from Clontech. The GalNAc-T1 probe, TEB1, was prepared by RT-PCR and contained nucleotides 1-1132. The GalNAc-T2 probe, TEB2, was prepared as described previously (3) and contained nucleotides 331-1268. The GalNAc-T3 probe, TEB3, contained sequences 307-1902 as shown in Fig. 4. Probes were random prime labeled using [alpha -32P]dCTP (Amersham Corp.) and oligo labeling kit (Pharmacia). Blots were probed sequentially with GalNAc-T1, -T2, and -T3 probes using the same conditions as were used for plaque lift hybridizations. Blots were probed, stripped, and reprobed as recommended by Clontech.


Fig. 4. Nucleotide sequence and predicted amino acid sequence of the combined cDNA clones. The amino acid sequence is shown in single-letter codes. The hydrophobic segment representing the putative transmembrane domain is underlined with a double line (Kyte & Doolittle, window of 8) (32), and adjacent charged amino acid underlined with a dotted line. Four potential N-linked glycosylation sites are indicated by asterisks. Location of primers used for RT-PCR preparation of the expression construct are indicated by single underlining. The GalNAc-transferase motif and primers EBHC100/EBHC106 are indicated in bold type. Indicated by bold underlined type is a potential polyadenylation signal.
[View Larger Version of this Image (54K GIF file)]


RESULTS

Identification of DNA Homologous to GalNAc-T1 and -T2

A set of primers (EBHC100/EBHC106) corresponding to sequences flanking a putative GalNAc-transferase motif (Fig. 1) were used in RT-PCR reactions with mRNA from a variety of human organs and cell lines. A single DNA fragment of approximately 196 bp corresponding to that predicted for GalNAc-T1 and -T2 was amplified from all templates (Fig. 2). Hybridization with oligonucleotides probes specific for GalNAc-T1 and -T2 served as controls for the identities of the products observed. A restriction enzyme (BstNI) that selectively cut the products of both GalNAc-T1 and -T2 was used to detect potentially novel DNA from homologous genes. As seen in Fig. 2, RNA from several organs and cell lines yielded RT-PCR products that were not cleaved by BstNI, indicating the presence of a novel DNA fragment. The BstNI uncleaved RT-PCR product from the gastric carcinoma cell line MKN45 was subcloned and sequenced. Forty independent clones were sequenced, and of these eight clones contained sequences homologous to but different from GalNAc-T1 and -T2. Six independent clones had a novel sequence designated TE3, and two clones had a novel sequence designated TE4. The DNA sequence of TE3 was clearly similar to GalNAc-T1 and -T2 with a sequence similarity of approximately 80%. The deduced amino acid sequence containing the putative GalNAc-transferase motif is presented in Fig. 1.

Cloning of Human GalNAc-T3 Using the TE3 DNA Sequence

Cloning and sequencing of the complete coding sequence of GalNAc-T3 was achieved by PCR screening of 40 sublibraries from a human salivary gland lambda gt11 library, which yielded two sublibraries (number 8 and number 1) containing long 3' and 5' sequences outside the TE3 probe area. This strategy facilitated identification of lambda cDNA clones with long 5' and 3' inserts and allowed us to compare multiple 5' and 3' sequences obtained within the isolated cDNA clones to identify and avoid intron containing sequences. Two PCR products of 1000 bp from sublibrary 8 and two PCR products from sublibrary 1 of 1200 bp were selected, subcloned, and sequenced. The sequences of these PCR products exhibited similarity to the sequences of GalNAc-T1 and -T2. One lambda cDNA clone from each sublibrary was isolated, and inserts were subcloned and sequenced. The sequences found in the PCR products were identical to the corresponding sequences in the selected lambda cDNA clones. The 3' cDNA clone 8.3' possessed a 3-kb insert with a single 900-bp open reading frame followed by multiple stop codons and a consensus polyadenylation box (Fig. 3). The 5' end of the insert of clone 8.3' apparently contained an intron sequence, and this has been confirmed by sequence comparison of several RT-PCR and cDNA clones as well as a genomic clone.3 One 5' cDNA clone 1.5' possessed a 1300-bp open reading frame but was not considered to contain the complete coding sequence, because it lacked a putative hydrophobic transmembrane region. A second screen using an antisense primer EBHC211 (5'-ACCGGATCCAGTGTTTAGCTTCCCCACG) (5' region of clone 1.5') yielded another 5' clone, 12.5', which contained additional 550 bp of 5' sequence including a potential transmembrane region. As shown in Fig. 4, the combined sequences of the selected cDNA clones contained an 1902-bp open reading frame. Multiple alignment analysis (DNASIS, Hitachi) of human GalNAc-T1, -T2, and -T3 and the C. elegans ZK.688.8 deduced protein presented in Fig. 5 demonstrated high sequence similarity in the COOH-terminal region (amino acids 82-559, GalNAc-T1; amino acids 102-571, GalNAc-T2; amino acids 148-633, GalNAc-T3; and amino acids 133-612, ZK.688.8) with conservation of cysteine residues. The NH2-terminal region show no sequence similarity and vary considerably in length with GalNAc-T3 having the longest sequence between the putative transmembrane region and the putative catalytic domain.


Fig. 5. Multiple sequence alignment (DNASIS, Hitachi) of human GalNAc-T1 (T1), GalNAc-T2 (T2), GalNAc-T3 (T3), and C. elegans (Ce) predicted protein ZK.688.8. Introduced gaps are shown as hyphens, and aligned residues identical to GalNAc-T1 are indicated by colons. Hydrophobic sequences representing putative transmembrane domains are indicated by underlining (Kyte & Doolittle, window of 8) (32). Cysteine residues are indicated by bold underlining, and aligned cysteine residues between sequences are indicated by a solid diamond.
[View Larger Version of this Image (33K GIF file)]

Expression of GalNAc-T3

Expression of the pAcGP67-GalNAc-T3-sol construct in Sf9 cells resulted in a 20-100-fold increase in GalNAc-transferase activity in culture medium of infected cells compared with uninfected controls or cells infected with the histo-blood group O2 gene (Table I). GalNAc-transferase activity with the Muc2 acceptor substrate peptide was increased 20-fold, and activity with the HIV-V3 peptide was increased nearly 100-fold. In contrast, expression of GalNAc-T1 and -T2 constructs only increased the GalNAc-transferase activity toward Muc2 and Muc5C peptide substrates.

Table I.

Expression of GalNAc-T3 in Sf9 cells


Constructs Specific activitya
Muc2 Muc5 HIV-V3

millunits/ml
pAcGP67-GalNAc-T3-sol 2.05 0.77 0.97
pAcGP67-GalNAc-T1-sol 1.98 1.24 0.03
pAcGP67-GalNAc-T2-sol 1.20 0.66 0.02
pAcGP67-O2-sol 0.08 0.02 0.01
Uninfected cells 0.12 0.04 0.01

a  One unit of enzymes is defined as the amount of enzyme that transfers 1 µmol GalNAc in 1 min using the standard reaction mixture as described under ``Experimental Procedures'' with 50 µg of peptide as acceptor substrate.

Background levels of GalNAc-transferase activity in uninfected cell medium was higher than in control infected cell medium, probably as a result of the production and release of endogenous Sf9 GalNAc-transferase due to the larger number of cells in uninfected cultures. Furthermore, background enzyme activity varied significantly among different acceptor substrate peptides. The peptide Muc2 yielded the highest background and HIV-V3 peptide yielded the lowest activity. In an early attempt to express functional pAcGP67-GalNAc-T3, constructs were made that were truncated either at the 5' end or 3' end (data not shown). Interestingly, constructs lacking the 14 COOH-terminal or 55 NH2-terminal amino acids were completely inactive, indicating that both the stem region and the COOH-terminal region are important for maintaining a catalytically active protein, a feature also found for the alpha 3-galactosyltransferase (14).

Northern Blot Analysis of Human Organs

Northern blots with mRNA from 16 human adult and 5 fetal organs were probed with GalNAc-T1, -T2, and -T3 (Fig. 6). Similar to previous results using the multiple tissue Northern blot, MTN I, GalNAc-T1 hybridized to two mRNAs of approximately 3.4 and 4.1 kb (1), whereas GalNAc-T2 hybridized to a 4.5-kb mRNA. Variable amounts of a smaller 2-3-kb mRNA were also detected with this probe (3). Hybrization of these probes to multiple tissue Northern blots, MTN II and fetal MTN, resulted in slightly different estimated mRNA sizes for all GalNAc-Ts. This discrepancy is probably due to differences in the parameters of gel electrophoresis and the marker positions assigned by the supplier. GalNAc-T3 hybridized to a 3.6-kb mRNA (estimated from MTN I) highly expressed in pancreas and testis, which was weakly expressed in kidney, prostate ovary, intestine, and colon. A very low level of GalNAc-T3 mRNA was also detected in adult placenta and lung as well as fetal lung and kidney. In adult spleen GalNAc-T3 hybridized to a larger 4.2-kb mRNA (estimated from MTN II).


Fig. 6. Northern blot analysis of human tissues. Multiple human Northern blots (MTN I, MTN II, and fetal MTN) from Clontech as labeled was probed with 32P-labeled GalNAc-T1 probe, TEB1 (T1), GalNAc-T2 probe, TEB2 (T2), and GalNAc-T3 probe, TEB3 (T3), respectively.
[View Larger Version of this Image (77K GIF file)]


DISCUSSION

This study presents data on the cloning, sequencing, and expression of a third member of a growing family of polypeptide GalNAc-transferases. A putative GalNAc-transferase motif of 61 amino acids that is highly conserved in sequence among GalNAc-T1, GalNAc-T2, and a C. elegans homologue was used to search for potential additional members of the polypeptide GalNAc-transferase family. The screening strategy included an RT-PCR strategy similar to that reported for the sialyltransferase family (12, 23) followed by restriction enzyme analysis as a selection procedure. This method allowed us to eliminate or reduce ``background'' for the two known GalNAc-transferases, GalNAc-T1 and GalNAc-T2, and clearly distinguish novel RT-PCR products of the same size as those for GalNAc-T1 and -T2. Two novel DNA fragments were identified and sequenced. The present study presents data about one of these.

The novel cDNA was shown by expression in insect cells to have polypeptide GalNAc-transferase activity (Table I); it therefore may be classified as GalNAc-T3. The GalNAc-T3 gene encodes a protein with a predicted type II transmembrane domain structure similar to GalNAc-T1 and -T2 as well as all other glycosyltransferases characterized thus far (10). The GalNAc-T3 protein shows an overall amino acid sequence similarity of approximately 45% to either GalNAc-T1 or -T2, which is similar to the sequence similarity between GalNAc-T1 and -T2 (3). The lowest degree of sequence similarity is found in the amino-terminal region, including the transmembrane domain as well as the putative stem region. GalNAc-T3 is more than 50 amino acids longer than GalNAc-T1 or -T2 in the putative stem region. More than 80% of the COOH-terminal sequence of GalNAc-T1, -T2, and -T3 can be aligned by sequence similarity including the GalNAc-transferase motif, a number of minor segments of sequence similarity, and most of the cysteine residues (Fig. 5). Despite the relative low overall amino acid sequence similarity between GalNAc-T1, -T2, and -T3, 12 cysteine residues that are evenly spaced within the major part of the proteins are conserved. These may be involved in intramolecular disulfide bonding, or they may be directly involved in the catalytic activity of the enzymes. The significance of conserved cysteine residues was originally noted by Drickamer (24) within the sialyltransferase family. The functional importance of cysteine residues involved in intramolecular disulfide bonding as well as possibly the catalytic site of the beta 4-galactosyltransferase was recently demonstrated (25). The number of conserved cysteine residues in the polypeptide GalNAc-transferases far exceeds the number of cysteine residues reported in other glycosyltransferases to date (10). Interestingly, it appears that in vitro measurable GalNAc-transferase activity is increased by the presence of reducing agents (3, 26).

GalNAc-T3 was found to have a different acceptor substrate specificity than GalNAc-T1 and -T2. Among a panel of acceptor substrate peptides (Table I), GalNAc-T3 was found to glycosylate a peptide derived from the HIV envelope glycoprotein gp120, which did not serve as substrate for GalNAc-T1 or -T2 (4, Table I). This peptide was identified as an acceptor substrate during analysis of enzyme activity in total extracts of various cell lines and organs (4, 27). GalNAc-T3 also catalyzed glycosylation of mucin-type acceptor sequences such as Muc2 and Muc5, which can also be glycosylated by GalNAc-T1 and -T2. In a previous study we found that the enzyme activity that mediated glycosylation of the HIV peptide also utilized the Muc2 substrate by cross-competitive glycosylation (4). This finding is consistent with the substrate specificity reported here for GalNAc-T3, suggesting that GalNAc-T3 may represent this particular enzyme; however, additional enzymes may also show related specificities. Detailed analysis of individual GalNAc-transferases with a large panel of peptides and structural confirmation of the specific acceptor sites utilized for GalNAc-glycosylation will be necessary to fully understand the specificity of the individual members of the enzyme family.

The first step of mucin-type O-glycosylation is mediated by at least three and probably more GalNAc-transferases. The data presented here clearly show that GalNAc-T3 exhibits a different acceptor substrate specificity than GalNAc-T1 and -T2, using short synthetic peptides with no or little predicted secondary structure. The finding that the three GalNAc-transferases share mucin-type acceptor substrates such as the Muc2 and Muc5 peptide sequences may indicate overlap in specificity, but further structural studies of the products formed to identify the sites utilized by each enzyme on these peptides with multiple serine and threonine residues are needed to clarify this. It is clear that in vivo models displaying differential expression of GalNAc-transferases are needed to evaluate the contribution of a given GalNAc-transferase.

In contrast to GalNAc-T1 and -T2, expression of GalNAc-T3 appears to be highly regulated and mainly found in pancreas and testis; weak expression is found in a few other organs including placenta. Interestingly, approximately 200 bp covering part of the GalNAc-transferase motif of GalNAc-T3 were recently sequenced from a pancreatic expressed sequence Tag library (EMBL accession number T11328[GenBank]). The lack of GalNAc-T3 expression in human liver correlates with the finding that organ extracts from human liver lacked GalNAc-transferase activity utilizing the HIV peptide, whereas expression of GalNAc-T3 mRNA in human placenta is in agreement with GalNAc-transferase activity using the HIV peptide in extracts of placenta (4). One interpretation of these data is that differential expression of different GalNAc-transferases can result in O-glycosylation of distinct sites on a given protein. The biological significance of this is unclear. There are a few studies on O-glycosylation sites, and these are limited to analysis of the functional activity or stability of a protein with or without a single O-glycosylation site (28), because assignments of O-glycosylation sites are difficult to perform (29).

The results presented here suggest that cell-, organ-, and species-specific differences in the position of O-glycosylation may occur as a result of differential expression of polypeptide GalNAc-transferases. In searching for potential motifs of O-glycosylation by analyzing serine and threonine residues carrying O-glycans in glycoproteins, one may need to consider the GalNAc-transferase repertoire of the cell of origin (30, 31). The existence of a transferase family of unknown size possibly exceeding three members displaying differential acceptor substrate specificity and cell/organ distribution suggests that mucin-type O-glycosylation is a much more defined and controlled process than previously recognized. In this respect, previous studies aimed at identifying consensus sequence motifs for O-glycosylation may not have identified such because of the unknown level of complexity. The data reported here suggest that O-glycosylation in terms of sites is less random than previously suggested and that more defined acceptor substrate peptide sequences may be recognized for each of the individual GalNAc-transferases. With the individual GalNAc-transferases expressed as recombinant proteins, it may be possible to determine primary peptide sequence motifs for the individual enzymes, which could be useful for predicting O-glycosylation in vivo by a given cell type.


FOOTNOTES

*   This work was supported by the Danish Medical Research Council, the Danish Natural Science Research Council, the Lundbeck Foundation, Ib Henriksens Foundation, the Gangsted Foundation, the Novo Nordisk Foundation, and the Ingeborg Roikjer's Foundation. The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EMBL Data Bank with accession number(s) X92689[GenBank].


Dagger    To whom the correspondence should be addressed: School of Dentistry, Nørre Alle 20, DK-2200 Copenhagen N, Denmark. Tel.: 45-3532-6835; Fax: 45-3532-6505.
1   The abbreviations used are: GalNAc-transferase, UDP-N-acetyl-alpha -D-galactosamine:polypeptide N-acetylgalactosaminyltransferase; bp, base pair(s); kb, kilobase(s); PCR, polymerase chain reaction; RT, reverse transcriptase; pfu, plaque-forming units; MTN, multiple tissue Northern blots; HIV, human immunodeficiency virus.
2   GalNAc-T1 is a GalNAc-transferase originally cloned by Homa et al. (1); GalNAc-T2 is a GalNAc-transferase originally cloned by White et al. (3); and GalNAc-T3 is the GalNAc-transferase cloned in the present paper.
3   E. P. Bennett and H. Clausen, unpublished observations.

Acknowledgment

We are grateful to Dr Michael A. Hollingsworth for helpful advice during the project.


REFERENCES

  1. Homa, F. L., Hollanders, T., Lehman, D. J., Thomsen, D. R., Elhammer, Å. P. (1993) J. Biol. Chem. 268, 12609-12616 [Abstract/Free Full Text]
  2. Hagen, F. K., Van Wuyckhuyse, B., Tabak, L. A. (1993) J. Biol. Chem. 268, 18960-18965 [Abstract/Free Full Text]
  3. White, T., Bennett, E. P., Takio, K., Sørensen, T., Bonding, N., Clausen, H. (1995) J. Biol. Chem. 270, 24156-24165 [Abstract/Free Full Text]
  4. Sørensen, T., White, T., Wandall, H. H., Kristensen, A. K., Roepstorff, P., Clausen, H. (1995) J. Biol. Chem. 270, 24166-24173 [Abstract/Free Full Text]
  5. Hagen, F. K., Gregoire, C. A., Tabak, L. A. (1995) Glycoconj. J. 12, 901-909 [CrossRef][Medline] [Order article via Infotrieve]
  6. Immervoll, T., Gentzsch, M., Tanner, W. (1995) Yeast 11, 1345-1351 [CrossRef][Medline] [Order article via Infotrieve]
  7. Strahl-Bolsinger, S., Immervoll, T., Deutzmann, R., Tanner, W. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 8164-8168 [Abstract/Free Full Text]
  8. Lussier, M., Gentzsch, M., Sdieu, A-M., Bussey, H., Tanner, W. (1995) J. Biol. Chem. 270, 2770-2775 [Abstract/Free Full Text]
  9. Schachter, H. (1994) Molecular Glycobiology (Fukuda, M., Hindsgaul, O., eds) , p. 88, IRL Press at Oxford University Press, Oxford
  10. Kleene, R., Berger, E. G. (1993) Biochim. Biophys. Acta 1154, 283-325 [Medline] [Order article via Infotrieve]
  11. Paulson, J. C., Colley, K. J. (1989) J. Biol. Chem. 264, 17615-17618 [Free Full Text]
  12. Livingston, B. D., Paulson, J. C. (1993) J. Biol. Chem. 268, 11504-11507 [Abstract/Free Full Text]
  13. Nara, K., Watanabe, Y., Maruyama, K., Kasahara, K., Nagai, Y., Sanai, Y. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 7952-7956 [Abstract/Free Full Text]
  14. Henion, T. R., Macher, B. A., Anaraki, F., Galili, U. (1994) Glycobiology 4, 193-201 [Abstract/Free Full Text]
  15. Kelly, R. J., Ernst, L. K., Larsen, R. D., Bryant, J. G., Robinson, J. S., Lowe, J. B. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 5843-5847 [Abstract/Free Full Text]
  16. Bierhuizen, M. F. A., Mattei, M-G., Fukuda, M. (1993) Genes Dev. 7, 468-478 [Abstract/Free Full Text]
  17. Smith, P. L., Lowe, J. B. (1994) J. Biol. Chem. 269, 15162-15171 [Abstract/Free Full Text]
  18. Joziasse, D. H., Shaper, N. L., Kim, D., Van den Eijnden, D. H., Shaper, J. H. (1992) J. Biol. Chem. 267, 5534-5541 [Abstract/Free Full Text]
  19. Sambrook, J., Fritsch, E. F., Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  20. Sanger, F., Nicklen, S., Coulson, A. R. (1977) Proc. Natl. Acad. Sci. U. S. A. 74, 5463-5467 [Abstract/Free Full Text]
  21. Bennett, E. P., Steffensen, R., Clausen, H., Weghuis, D. O., van Kessel, D. G. (1995) Biochem. Biophys. Res. Commun. 206, 318-325 [CrossRef][Medline] [Order article via Infotrieve]
  22. Grunnet, N., Steffensen, R., Bennett, E. P., Clausen, H. (1994) Vox Sang. 67, 210-215 [Medline] [Order article via Infotrieve]
  23. Datta, A. K., Paulson, J. C. (1995) J. Biol. Chem. 270, 1497-1500 [Abstract/Free Full Text]
  24. Drickamer, K. (1993) Glycobiology 3, 2-3 [Free Full Text]
  25. Wang, Y., Wong, S. S., Fukuda, M. N., Zu, H., Lio, Z., Tang, Q., Appert, H. E. (1994) Biochem. Biophys. Res. Commun. 204, 701-709 [CrossRef][Medline] [Order article via Infotrieve]
  26. Wang, Y., Abernethy, J. L., Eckhardt, A. E., Hill, R. L. (1992) J. Biol. Chem. 267, 12709-12716 [Abstract/Free Full Text]
  27. Clausen, H., Pallesen, T., White, T., Wandall, H., Hansen, J-E. S. (1994) Complex Carbohydrates in Drug Research. Structural and Functional Aspects (Bock, K., Clausen, H., eds) , p. 297, Munksgaard, Copenhagen
  28. Parekh, R. B. (1994) Adv. Drug Del. Rev. 13, 251-266
  29. Gooley, A. A., Williams, K. L. (1994) Glycobiology 4, 413-417 [Free Full Text]
  30. Elhammer, Å. P., Poorman, R. A., Brown, E., Maggiora, L. L., Hoogerheide, J. G., Kozdy, F. J. (1993) J. Biol. Chem. 268, 10029-10038 [Abstract/Free Full Text]
  31. Hansen, J. E., Lund, O., Rapacki, K., Clausen, H., Mosekilde, E., Nielsen, J. O., Hansen, J.-E. (1994) Protein Structure by Distance Analysis (Bohr, H., Brunak, S., eds) , p. 247, IOS Press, Amsterdam
  32. Kyte, J., Doolittle, R. F. (1982) J. Mol. Biol. 157, 105-132 [CrossRef][Medline] [Order article via Infotrieve]

©1996 by The American Society for Biochemistry and Molecular Biology, Inc.

Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
K. Amano, Y. Chiba, Y. Kasahara, Y. Kato, M. K. Kaneko, A. Kuno, H. Ito, K. Kobayashi, J. Hirabayashi, Y. Jigami, et al.
Engineering of mucin-type human glycoproteins in yeast cells
PNAS, March 4, 2008; 105(9): 3232 - 3237.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
S. Wopereis, S. Grunewald, K. M.L.C. Huijben, E. Morava, R. Mollicone, B. G.M. van Engelen, D. J. Lefeber, and R. A. Wevers
Transferrin and Apolipoprotein C-III Isofocusing Are Complementary in the Diagnosis of N- and O-Glycan Biosynthesis Defects
Clin. Chem., February 1, 2007; 53(2): 180 - 187.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Endocrinol. Metab.Home page
H. J. Garringer, C. Fisher, T. E. Larsson, S. I. Davis, D. L. Koller, M. J. Cullen, M. S. Draman, N. Conlon, A. Jain, N. S. Fedarko, et al.
The Role of Mutant UDP-N-Acetyl-{alpha}-D-Galactosamine-Polypeptide N-Acetylgalactosaminyltransferase 3 in Regulating Serum Intact Fibroblast Growth Factor 23 and Matrix Extracellular Phosphoglycoprotein in Heritable Tumoral Calcinosis
J. Clin. Endocrinol. Metab., October 1, 2006; 91(10): 4037 - 4042.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
J. Egelund, B. L. Petersen, M. S. Motawia, I. Damager, A. Faik, C. E. Olsen, T. Ishii, H. Clausen, P. Ulvskov, and N. Geshi
Arabidopsis thaliana RGXT1 and RGXT2 Encode Golgi-Localized (1,3)-{alpha}-D-Xylosyltransferases Involved in the Synthesis of Pectic Rhamnogalacturonan-II
PLANT CELL, October 1, 2006; 18(10): 2593 - 2607.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. Kato, C. Jeanneau, M. A. Tarp, A. Benet-Pages, B. Lorenz-Depiereux, E. P. Bennett, U. Mandel, T. M. Strom, and H. Clausen
Polypeptide GalNAc-transferase T3 and Familial Tumoral Calcinosis: SECRETION OF FIBROBLAST GROWTH FACTOR 23 REQUIRES O-GLYCOSYLATION
J. Biol. Chem., July 7, 2006; 281(27): 18370 - 18377.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Pathol.Home page
M F Campagnoli, A Pucci, E Garelli, A Carando, C Defilippi, R Lala, G Ingrosso, I Dianzani, M Forni, and U Ramenghi
Familial tumoral calcinosis and testicular microlithiasis associated with a new mutation of GALNT3 in a white family.
J. Clin. Pathol., April 1, 2006; 59(4): 440 - 442.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
S. Wopereis, D. J. Lefeber, E. Morava, and R. A. Wevers
Mechanisms in Protein O-Glycan Biosynthesis and Clinical and Molecular Aspects of Protein O-Glycan Biosynthesis Defects: A Review
Clin. Chem., April 1, 2006; 52(4): 574 - 600.
[Abstract] [Full Text] [PDF]


Home page
J. Histochem. Cytochem.Home page
N. Berois, D. Mazal, L. Ubillos, F. Trajtenberg, A. Nicolas, X. Sastre-Garau, H. Magdelenat, and E. Osinaga
UDP-N-Acetyl-D-Galactosamine: Polypeptide N-Acetylgalactosaminyltransferase-6 as a New Immunohistochemical Breast Cancer Marker
J. Histochem. Cytochem., March 1, 2006; 54(3): 317 - 328.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
E. M. Comelli, S. R. Head, T. Gilmartin, T. Whisenant, S. M. Haslam, S. J. North, N.-K. Wong, T. Kudo, H. Narimatsu, J. D. Esko, et al.
A focused microarray approach to functional glycomics: transcriptional regulation of the glycome
Glycobiology, February 1, 2006; 16(2): 117 - 131.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Endocrinol. Metab.Home page
T. Larsson, X. Yu, S. I. Davis, M. S. Draman, S. D. Mooney, M. J. Cullen, and K. E. White
A Novel Recessive Mutation in Fibroblast Growth Factor-23 Causes Familial Tumoral Calcinosis
J. Clin. Endocrinol. Metab., April 1, 2005; 90(4): 2424 - 2427.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
F. J. Olson, M. Backstrom, H. Karlsson, J. Burchell, and G. C. Hansson
A MUC1 tandem repeat reporter protein produced in CHO-K1 cells has sialylated core 1 O-glycans and becomes more densely glycosylated if coexpressed with polypeptide-GalNAc-T4 transferase
Glycobiology, February 1, 2005; 15(2): 177 - 191.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
N. T. Marcos, S. Pinho, C. Grandela, A. Cruz, B. Samyn-Petit, A. Harduin-Lepers, R. Almeida, F. Silva, V. Morais, J. Costa, et al.
Role of the Human ST6GalNAc-I and ST6GalNAc-II in the Synthesis of the Cancer-Associated Sialyl-Tn Antigen
Cancer Res., October 1, 2004; 64(19): 7050 - 7057.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
K. Mardberg, K. Nystrom, M. A. Tarp, E. Trybala, H. Clausen, T. Bergstrom, and S. Olofsson
Basic amino acids as modulators of an O-linked glycosylation signal of the herpes simplex virus type 1 glycoprotein gC: functional roles in viral infectivity
Glycobiology, July 1, 2004; 14(7): 571 - 581.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
N. Miyahara, J. Shoda, T. Kawamoto, M. Furukawa, T. Ueda, T. Todoroki, N. Tanaka, K. Matsuo, Y. Yamada, K. Kohno, et al.
Expression of UDP-N-Acetyl-{alpha}-D-Galactosamine-Polypeptide N-Acetylgalactosaminyltransferase Isozyme 3 in the Subserosal Layer Correlates with Postsurgical Survival of Pathological Tumor Stage 2 Carcinoma of the Gallbladder
Clin. Cancer Res., March 15, 2004; 10(6): 2090 - 2099.
[Abstract] [Full Text] [PDF]


Home page
DevelopmentHome page
H. Strutt, J. Mundy, K. Hofstra, and D. Strutt
Cleavage and secretion is not required for Four-jointed function in Drosophila patterning
Development, February 15, 2004; 131(4): 881 - 890.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
L. Kinarsky, G. Suryanarayanan, O. Prakash, H. Paulsen, H. Clausen, F.-G. Hanisch, M. A. Hollingsworth, and S. Sherman
Conformational studies on the MUC1 tandem repeat glycopeptides: implication for the enzymatic O-glycosylation of the mucin protein core
Glycobiology, December 1, 2003; 13(12): 929 - 939.
[Abstract] [Full Text] [PDF]


Home page
J BiochemHome page
Y. Kouzuma, Y. Suzuki, M. Nakano, K. Matsuyama, S. Tojo, M. Kimura, T. Yamasaki, H. Aoyagi, and T. Hatakeyama
Characterization of Functional Domains of the Hemolytic Lectin CEL-III from the Marine Invertebrate Cucumaria echinata
J. Biochem., September 1, 2003; 134(3): 395 - 402.
[Abstract] [Full Text] [PDF]


Home page
J. Histochem. Cytochem.Home page
N. T. Marcos, A. Cruz, F. Silva, R. Almeida, L. David, U. Mandel, H. Clausen, S. von Mensdorff-Pouilly, and C. A. Reis
Polypeptide GalNAc-transferases, ST6GalNAc-transferase I, and ST3Gal-transferase I Expression in Gastric Carcinoma Cell Lines
J. Histochem. Cytochem., June 1, 2003; 51(6): 761 - 771.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
H. Iwasaki, Y. Zhang, K. Tachibana, M. Gotoh, N. Kikuchi, Y.-D. Kwon, A. Togayachi, T. Kudo, T. Kubota, and H. Narimatsu
Initiation of O-Glycan Synthesis in IgA1 Hinge Region Is Determined by a Single Enzyme, UDP-N-Acetyl-alpha -D-galactosamine:Polypeptide N-Acetylgalactosaminyltransferase 2
J. Biol. Chem., February 14, 2003; 278(8): 5613 - 5621.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
Y. Zhang, H. Iwasaki, H. Wang, T. Kudo, T. B. Kalka, T. Hennet, T. Kubota, L. Cheng, N. Inaba, M. Gotoh, et al.
Cloning and Characterization of a New Human UDP-N-Acetyl-alpha -D-galactosamine:Polypeptide N-Acetylgalactosaminyltransferase, Designated pp-GalNAc-T13, That Is Specifically Expressed in Neurons and Synthesizes GalNAc alpha -Serine/Threonine Antigen
J. Biol. Chem., January 3, 2003; 278(1): 573 - 584.
[Abstract] [Full Text] [PDF]


Home page
IOVSHome page
P. Argueso, A. Tisdale, U. Mandel, E. Letko, C. S. Foster, and I. K. Gipson
The Cell-Layer- and Cell-Type-Specific Distribution of GalNAc-Transferases in the Ocular Surface Epithelia Is Altered during Keratinization
Invest. Ophthalmol. Vis. Sci., January 1, 2003; 44(1): 86 - 92.
[Abstract] [Full Text] [PDF]


Home page