![]()
|
|
||||||||
(Received for publication, July 5, 1995, and in revised form, April 15, 1996)
From the Faculty of Health Sciences, School of Dentistry,
DK-2200 Copenhagen, Denmark
The glycosylation of serine and threonine
residues during mucin-type O-linked protein
glycosylation is carried out by a family of
UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferases
(GalNAc-transferase). Previously two members, GalNAc-T1 and -T2, have
been isolated and the genes cloned and characterized. Here we report
the cDNA cloning and expression of a novel GalNAc-transferase
termed GalNAc-T3. The gene was isolated and cloned based on the
identification of a GalNAc-transferase motif (61 amino acids) that is
shared between GalNAc-T1 and -T2 as well as a homologous
Caenorhabditis elegans gene. The cDNA sequence has a
633-amino acid coding region indicating a protein of 72.5 kDa with a
type II domain structure. The overall amino acid sequence similarity
with GalNAc-T1 and -T2 is approximately 45%; 12 cysteine residues that
are shared between GalNAc-T1 and -T2 are also found in GalNAc-T3.
GalNAc-T3 was expressed as a soluble protein without the hydrophobic
transmembrane domain in insect cells using a Baculo-virus vector, and
the expressed GalNAc-transferase activity showed substrate specificity
different from that previously reported for GalNAc-T1 and -T2. Northern
analysis of human organs revealed a very restricted expression pattern
of GalNAc-T3.
The glycosylation of serine and threonine residues during
mucin-type O-linked protein glycosylation is carried out by
a family of UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferases
(GalNAc-transferase)1 (EC). Two
distinct human GalNAc-transferase genes, GalNAc-T1 and -T2, have been
cloned and characterized to date (1, 2, 3).2
Analysis of the acceptor substrate specificity of GalNAc-T2 has
revealed substrates that this transferase does not utilize (3, 4). In
the present study we have analyzed the acceptor substrate specificity
of GalNAc-T1 and found that neither GalNAc-T1 nor -T2 utilize all
substrates identified, thus suggesting the existence of additional
GalNAc-transferases. The existence of additional GalNAc-transferases
has also been suggested by Hagen et al. (5) by analysis of
sequence similarities of expressed sequence tag clones with those of
GalNAc-T1 and -T2. O-Glycosylation in yeast has similarly
been shown by Tanner and colleagues (6, 7, 8) to be initiated by at least
four mannosyltransferases.
Families of glycosyltransferases with related acceptor and/or donor
substrate specificities may be encoded by homologous genes showing
segments of sequence similarity (9, 10). Initially, no sequence
similarities were found between enzymes having the same donor substrate
specificity (11), but as more enzymes have been cloned, several
families of homologous glycosyltransferase genes have been identified.
Livingston and Paulson (12) originally identified a sialyltransferase
motif in a segment of 55 amino acids that has now been found to be
conserved within all identified members of the sialyltransferase family
(13). Similarly, sequence similarities are also found within
The human GalNAc-transferases T1 and T2 share a segment of 61 amino
acids with 82% sequence similarity, and this segment is also found in
a deduced homologue, ZK688.8 (see Fig. 1), which has been observed to
exhibit GalNAc-transferase activity (5). In the present study we have
utilized this potential GalNAc-transferase motif to develop a PCR
strategy that identified two novel cDNAs with sequence similarity.
Here we report the cloning of cDNA containing the complete coding
sequence of one of these and show by expression that the gene encoded a
GalNAc-transferase with an acceptor substrate specificity partly
different from GalNAc-T1 and -T2. Northern analysis showed that the
expression of GalNAc-T3 is highly tissue-restricted in contrast to
GalNAc-T1 and -T2.
Multiple sequence
alignment analysis (DNASIS, Hitachi) of GalNAc-T1 and -T2 was applied
to identify areas with highest degree of sequence similarity. Based
upon a 61-amino acid segment shared by GalNAc-T1 and -T2 as well as a
more recently reported sequence derived from a homologous
Caenorhabditis elegans gene (5), a pair of sense and
anti-sense primers (EBHC100, 5
The 196-bp products from RT-PCR of MKN45 mRNA that were resistant
to BstNI cleavage were isolated using the prep-A-gene kit
(Bio-Rad) and cloned into the pT7T3U19 vector (Pharmacia Biotech Inc.).
Plasmids from 40 individual clones were purified using Qiagen-tip 20 column (Qiagen), and the clones were sequenced. Two sets of sequences
differing from GalNAc-T1 and -T2 but exhibiting a high degree of
similarity were identified, and sequence information from one set of
identical clones designated TE3 was used for the isolation of 5 Rapid library screening was performed by diluting 1 × 106 pfu of human salivary gland Two sublibraries generated 3 25,000 pfu from sublibraries were
plated on 15-cm LB agar plates by standard procedures (19), and plaques
were transferred to HYBOND N+ (Amersham Corp.) nylon
membranes and hybridized to random primed
[ A partial cDNA
sequence of the putative GalNAc-T3 gene an RT-PCR product
(pAcGP67-GalNAc-T3-sol) using primers EBHC219
(5
GalNAc-transferase activity was measured in standard
reaction mixtures containing 25 mM Tris (pH 7.4), 5 mM MnCl2, 0.25% Triton X-100, 50 µM UDP-[14C]GalNAc (4000 cpm/nmol), 5 mM 2-mercaptoethanol, 250 µM peptide, and 10 µl of culture supernatant after incubation at 37 °C for 20 min.
Controls included untransfected Sf9 cell culture medium and a construct
pAcGP67-O2-sol of the enzymatically nonfunctional
histo-blood group O2 gene (22). Acceptor peptides (Muc2,
PTTPISTTTMVTPTPTPTC; Muc5c, CTTSTTSAPTTSTTSAPTTS; and HIV-V3,
CIRIQRGPGRAFVTIGKIGNM) were obtained from Carlbiotech (Copenhagen)
and Neosystems (Strassburg), and quality was ascertained by amino acid
analysis and mass spectrometry. Glycosylated product was quantified by
scintillation counting after Dowex-1 chromatography. All combinations
of enzyme sources and peptides were evaluated at least once by C-18
reverse phase chromatography (PC3.2/3 or uRPC C2/C18 SC2.1/10 Smart
System, Pharmacia) and scintillation counting of peptide peak fractions
to confirm incorporation of [14C]GalNAc into the acceptor
peptide (4).
Multiple tissue Northern blots (MTN
I, MTN II, and fetal MTN) were obtained from Clontech. The GalNAc-T1
probe, TEB1, was prepared by RT-PCR and contained nucleotides 1-1132.
The GalNAc-T2 probe, TEB2, was prepared as described previously (3) and
contained nucleotides 331-1268. The GalNAc-T3 probe, TEB3, contained
sequences 307-1902 as shown in Fig. 4. Probes were random prime labeled
using [
A set
of primers (EBHC100/EBHC106) corresponding to sequences flanking a
putative GalNAc-transferase motif (Fig. 1) were used in
RT-PCR reactions with mRNA from a variety of human organs and cell
lines. A single DNA fragment of approximately 196 bp corresponding to
that predicted for GalNAc-T1 and -T2 was amplified from all templates
(Fig. 2). Hybridization with oligonucleotides probes
specific for GalNAc-T1 and -T2 served as controls for the identities of
the products observed. A restriction enzyme (BstNI) that
selectively cut the products of both GalNAc-T1 and -T2 was used to
detect potentially novel DNA from homologous genes. As seen in Fig. 2,
RNA from several organs and cell lines yielded RT-PCR products that
were not cleaved by BstNI, indicating the presence of a
novel DNA fragment. The BstNI uncleaved RT-PCR product from
the gastric carcinoma cell line MKN45 was subcloned and sequenced.
Forty independent clones were sequenced, and of these eight clones
contained sequences homologous to but different from GalNAc-T1 and -T2.
Six independent clones had a novel sequence designated TE3, and two
clones had a novel sequence designated TE4. The DNA sequence of TE3 was
clearly similar to GalNAc-T1 and -T2 with a sequence similarity of
approximately 80%. The deduced amino acid sequence containing the
putative GalNAc-transferase motif is presented in Fig. 1.
Cloning
and sequencing of the complete coding sequence of GalNAc-T3 was
achieved by PCR screening of 40 sublibraries from a human salivary
gland
Expression of the
pAcGP67-GalNAc-T3-sol construct in Sf9 cells resulted in a 20-100-fold
increase in GalNAc-transferase activity in culture medium of infected
cells compared with uninfected controls or cells infected with the
histo-blood group O2 gene (Table I).
GalNAc-transferase activity with the Muc2 acceptor substrate peptide
was increased 20-fold, and activity with the HIV-V3 peptide was
increased nearly 100-fold. In contrast, expression of GalNAc-T1 and -T2
constructs only increased the GalNAc-transferase activity toward Muc2
and Muc5C peptide substrates.
Expression of GalNAc-T3 in Sf9 cells
Volume 271, Number 29,
Issue of July 19, 1996
pp. 17006-17012
©1996 by The American Society for Biochemistry and Molecular Biology, Inc.
-D-galactosamine
POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE,
GalNAc-T3*

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
Acknowledgment
REFERENCES
3/4-fucosyltransferases (10, 14),
2-fucosyltransferases (15),
6-N-acetylglucosaminyltransferases (16),
4-N-acetylgalactosaminyltransferases (17), the
histo-blood group A/B transferases, and an
3-galactosyltransferase
(18).
Fig. 1.
GalNAc-transferase motif. Multiple
sequence alignment of human GalNAc-T1 (accession number X85018[GenBank]),
GalNAc-T2 (accession number X85019[GenBank]), and C. elegans gene
(accession number L16621[GenBank]) revealed a 61-amino acid motif. Sequences
given are compared with the human GalNAc-T1 (3) where an
asterisk indicates an identical amino acid or base. The
location and sequence of primers (EBHC100/EBHC106) used in the RT-PCR
cloning strategy are indicated. The sequence of the GalNAc-tranferase
motif of the isolated TE3 DNA shown to represent GalNAc-T3 is also
shown.
Identification of cDNA Homologous to GalNAc-T1 and -T2 by
RT-PCR and Restriction Enzyme Analysis
-TGGGGAGGAGARAACCTAGA-3
, and EBHC106,
5
-ATTCATCCATCCATACTTCT-3
, respectively, was used in RT-PCR
amplifications of poly(A+) RNA from several sources (see
Figs. 1 and 2). The mRNA from human organs (liver, brain, and
submaxillary gland) were obtained from Clontech, and mRNA from
human cancer cell lines (MKN45, Colo205, and WI38) was prepared as
reported previously (3). A restriction enzyme search identified a
common BstNI site within the expected 196-bp RT-PCR product
of GalNAc-T1 and -T2, which would produce two fragments of 128 and 68 bp. Novel DNA fragments representing putative additional
GalNAc-transferases were identified by RT-PCR with EBHC100/EBHC106
primers. Six mRNA templates were analyzed after BstNI
digestion in 2% agarose gels. Following reverse transcription using
the EBHC106 primer, PCR was performed for 35 cycles of 95 °C for
45 s, 53 °C for 15 s, and 72 °C for 15 s using
Taq polymerase on a model 480 thermocycler (Perkin Elmer).
Products were verified by Southern blotting and hybridization with
oligonucleotide probes specific for GalNAc-T1 or -T2 (EBHC112,
5
-CTTTGGAAATTGTTACATGCTCA-3
, and EBHC45,
5
-TGGTGGCAGCCTGGGAGATCA-3
, respectively).
Fig. 2.
BstNI restriction analysis of
EBHC100/EBHC106 RT-PCR products generated from organs and cell
lines. PhiX174 markers (194 and 118 bp) are indicated to the
left, and the predicted RT-PCR product of EBHC100/EBHC106
(196 bp) as well as the larger product of a BstNI cleavage
hereof (128 bp) are indicated to the right.
and 3
sequences outside the GalNAc-transferase motif.
gt11 library (Clontech)
into 40 sublibraries (designated numbers 1-40), each possessing
approximately 2.5 × 104 pfu. All sublibraries were
subjected to phage amplification (approximately 40-fold) by liquid
culture phage amplification (19), giving a sublibrary titer of 1 × 106 pfu. Phage amplification was performed in 1 ml of LB
MgSO4 maltose medium in a shaking incubator at 37 °C for
5 h. After amplification, 20 µl of chloroform was added to each
sublibrary, cellular debris was pelleted, and the phage supernatants
were titrated and used in subsequent screenings. All 40 sublibraries
were screened to identify TE3 possessing phage clones. One µl of each
sublibrary (approximately 104-105 pfu) was
lysed in a 10 µl of volume in the presence of 0.45% Nonidet P-40 and
Tween 20, 100 µg/µl proteinase K at 56 °C for 30 min. Proteinase
K was heat-inactivated by boiling for 15 min, and 2 µl of phage
lysate was amplified by PCR using primers EBHC100 and EBHC204 at 0.5 µM using 40 cycles of 95 °C for 45 s, 55 °C
for 5 s, and 72 °C for 30 s. Thirteen sublibraries found
to contain TE3
gt11 clones were further assayed by PCR using EBHC202
(5
-GCGGATCCGCAGCAAAAGCCCTCATAGCTTT-3
) or EBHC204
(5
-GCGGATCCTCTAGCAATCACCTGAGTGCC-3
) primers combined with the
gt11
vector primers to estimate lengths of cDNA inserts for selection of
sublibraries with most 3
or 5
sequences. Amplifications were
performed for 35 cycles of 95 °C for 45 s, 55 °C for 1 s, and 72 °C for 2 min.
PCR products (EBHC204/
gt11 vector) of
approximately 1000 bp, and two sublibraries generated TE3 5
PCR
products of approximately 1200 bp. PCR products were subcloned into
pT7T3U19 and sequenced. These PCR products were used to probe and
isolate
cDNA clones from the corresponding sublibraries. Both
strands of the subcloned
cDNAs were sequenced (20) using
internal primers spaced 3-400 bp apart. Partly overlapping sequence
data from
cDNA clones were utilized to derive the complete
coding sequence.
cDNA Isolation
-32P]dCTP-labeled PCR probes. Hybridization was
performed at 42 °C in the presence of 6 × SSPE, 5 × Denhart's,
0.5% SDS, and 50% formamide. Plaque lifts were washed 5 × at
42 °C with 2 × SSC, 0.1% SDS, once with 0.5 × SSC, 0.1% SDS, and
once at 55 °C with 0.1 × SSC, 0.1% SDS in a mini-hybridization
oven (Hybaid).
-AGCGGATCCTCAACGATGGAAAGGAACATG-3
) and EBHC215
(5
-AGCGGATCCAGGAACACTTAATCATTTTGGC-3
) with BamHI
restriction sites introduced was produced and cloned (see Fig. 3). The
PCR product was designed to yield a putative soluble form of the
GalNAc-T3 protein with an NH2-terminal end positioned
immediately COOH-terminal to the potential transmembrane domain and
including the entire sequence expected to contain the catalytic domain.
The PCR product was cloned into a BamHI site of the
expression vector pAcGP67 (Pharmingen), and the expression construct
was sequenced to verify the sequence and correct insertion into the
cloning site. Control constructs included pAcGP67-GalNAc-T2-sol
prepared as described previously (3), pAcGP67-GalNAc-T1-sol prepared
similarly by RT-PCR with human submaxillary gland mRNA and designed
to mimic the originally identified amino terminus of the soluble bovine
GalNAc-transferase protein (1), and pAcGP67-O2-sol
containing the histo-blood group O2 cDNA and prepared
as described previously for the blood group A cDNA (21).
Co-transfection of Sf9 cells with pAcGP67-constructs and Baculo-GoldTM
DNA was performed according to the manufacturer's description.
Briefly, 0.5 µg of construct was mixed with 0.05 µg of Baculo-Gold
DNA and co-transfected in Sf9 cells in 24-well plates. 96 h
post-transfection recombinant virus was amplified in 6-well plates at
dilutions of 1:10 and 1:50. Titer of amplified virus was estimated by
titration in 24-well plates with monitoring of enzyme activities.
Transferase assays were performed on supernatants of Sf9 cells in
6-well plates infected with virus at titer 1:1000 to 1:5000
representing end point dilutions giving optimal enzyme activities.
Fig. 3.
Schematic representation of cDNA clones
used to construct the nucleotide sequence of GalNAc-T3.
-32P]dCTP (Amersham Corp.) and oligo labeling
kit (Pharmacia). Blots were probed sequentially with GalNAc-T1, -T2,
and -T3 probes using the same conditions as were used for plaque lift
hybridizations. Blots were probed, stripped, and reprobed as
recommended by Clontech.
Fig. 4.
Nucleotide sequence and predicted amino acid
sequence of the combined cDNA clones. The amino acid sequence
is shown in single-letter codes. The hydrophobic segment representing
the putative transmembrane domain is underlined with a
double line (Kyte & Doolittle, window of 8) (32), and
adjacent charged amino acid underlined with a dotted
line. Four potential N-linked glycosylation sites are
indicated by asterisks. Location of primers used for RT-PCR
preparation of the expression construct are indicated by single
underlining. The GalNAc-transferase motif and primers
EBHC100/EBHC106 are indicated in bold type. Indicated by
bold underlined type is a potential polyadenylation
signal.
Identification of DNA Homologous to GalNAc-T1 and -T2
gt11 library, which yielded two sublibraries (number 8 and
number 1) containing long 3
and 5
sequences outside the TE3 probe
area. This strategy facilitated identification of
cDNA clones
with long 5
and 3
inserts and allowed us to compare multiple 5
and
3
sequences obtained within the isolated cDNA clones to identify
and avoid intron containing sequences. Two PCR products of 1000 bp from
sublibrary 8 and two PCR products from sublibrary 1 of 1200 bp were
selected, subcloned, and sequenced. The sequences of these PCR products
exhibited similarity to the sequences of GalNAc-T1 and -T2. One
cDNA clone from each sublibrary was isolated, and inserts were
subcloned and sequenced. The sequences found in the PCR products were
identical to the corresponding sequences in the selected
cDNA
clones. The 3
cDNA clone 8.3
possessed a 3-kb insert with a
single 900-bp open reading frame followed by multiple stop codons and a
consensus polyadenylation box (Fig. 3). The 5
end of
the insert of clone 8.3
apparently contained an intron sequence, and
this has been confirmed by sequence comparison of several RT-PCR and
cDNA clones as well as a genomic
clone.3 One 5
cDNA clone 1.5
possessed a 1300-bp open reading frame but was not considered to
contain the complete coding sequence, because it lacked a putative
hydrophobic transmembrane region. A second screen using an antisense
primer EBHC211 (5
-ACCGGATCCAGTGTTTAGCTTCCCCACG) (5
region of clone
1.5
) yielded another 5
clone, 12.5
, which contained additional 550 bp of 5
sequence including a potential transmembrane region. As shown
in Fig. 4, the combined sequences of the selected
cDNA clones contained an 1902-bp open reading frame. Multiple
alignment analysis (DNASIS, Hitachi) of human GalNAc-T1, -T2, and -T3
and the C. elegans ZK.688.8 deduced protein presented in
Fig. 5 demonstrated high sequence similarity in the
COOH-terminal region (amino acids 82-559, GalNAc-T1; amino acids
102-571, GalNAc-T2; amino acids 148-633, GalNAc-T3; and amino acids
133-612, ZK.688.8) with conservation of cysteine residues. The
NH2-terminal region show no sequence similarity and vary
considerably in length with GalNAc-T3 having the longest sequence
between the putative transmembrane region and the putative catalytic
domain.
Fig. 5.
Multiple sequence alignment (DNASIS, Hitachi)
of human GalNAc-T1 (T1), GalNAc-T2 (T2),
GalNAc-T3 (T3), and C. elegans (Ce)
predicted protein ZK.688.8. Introduced gaps are shown as
hyphens, and aligned residues identical to GalNAc-T1 are
indicated by colons. Hydrophobic sequences representing
putative transmembrane domains are indicated by underlining
(Kyte & Doolittle, window of 8) (32). Cysteine residues are indicated
by bold underlining, and aligned cysteine residues between
sequences are indicated by a solid diamond.
Constructs
Specific activitya
Muc2
Muc5
HIV-V3
millunits/ml
pAcGP67-GalNAc-T3-sol
2.05
0.77
0.97
pAcGP67-GalNAc-T1-sol
1.98
1.24
0.03
pAcGP67-GalNAc-T2-sol
1.20
0.66
0.02
pAcGP67-O2-sol
0.08
0.02
0.01
Uninfected
cells
0.12
0.04
0.01
a
One unit of enzymes is defined as the amount of enzyme
that transfers 1 µmol GalNAc in 1 min using the standard reaction
mixture as described under ``Experimental Procedures'' with 50 µg
of peptide as acceptor substrate.
Background levels of GalNAc-transferase activity in uninfected cell
medium was higher than in control infected cell medium, probably as a
result of the production and release of endogenous Sf9
GalNAc-transferase due to the larger number of cells in uninfected
cultures. Furthermore, background enzyme activity varied significantly
among different acceptor substrate peptides. The peptide Muc2 yielded
the highest background and HIV-V3 peptide yielded the lowest activity.
In an early attempt to express functional pAcGP67-GalNAc-T3, constructs
were made that were truncated either at the 5
end or 3
end (data not
shown). Interestingly, constructs lacking the 14 COOH-terminal or 55 NH2-terminal amino acids were completely inactive,
indicating that both the stem region and the COOH-terminal region are
important for maintaining a catalytically active protein, a feature
also found for the
3-galactosyltransferase (14).
Northern blots with
mRNA from 16 human adult and 5 fetal organs were probed with
GalNAc-T1, -T2, and -T3 (Fig. 6). Similar to previous
results using the multiple tissue Northern blot, MTN I, GalNAc-T1
hybridized to two mRNAs of approximately 3.4 and 4.1 kb (1),
whereas GalNAc-T2 hybridized to a 4.5-kb mRNA. Variable amounts of
a smaller 2-3-kb mRNA were also detected with this probe (3).
Hybrization of these probes to multiple tissue Northern blots, MTN II
and fetal MTN, resulted in slightly different estimated mRNA sizes
for all GalNAc-Ts. This discrepancy is probably due to differences in
the parameters of gel electrophoresis and the marker positions assigned
by the supplier. GalNAc-T3 hybridized to a 3.6-kb mRNA (estimated
from MTN I) highly expressed in pancreas and testis, which was weakly
expressed in kidney, prostate ovary, intestine, and colon. A very low
level of GalNAc-T3 mRNA was also detected in adult placenta and
lung as well as fetal lung and kidney. In adult spleen GalNAc-T3
hybridized to a larger 4.2-kb mRNA (estimated from MTN II).
This study presents data on the cloning, sequencing, and expression of a third member of a growing family of polypeptide GalNAc-transferases. A putative GalNAc-transferase motif of 61 amino acids that is highly conserved in sequence among GalNAc-T1, GalNAc-T2, and a C. elegans homologue was used to search for potential additional members of the polypeptide GalNAc-transferase family. The screening strategy included an RT-PCR strategy similar to that reported for the sialyltransferase family (12, 23) followed by restriction enzyme analysis as a selection procedure. This method allowed us to eliminate or reduce ``background'' for the two known GalNAc-transferases, GalNAc-T1 and GalNAc-T2, and clearly distinguish novel RT-PCR products of the same size as those for GalNAc-T1 and -T2. Two novel DNA fragments were identified and sequenced. The present study presents data about one of these.
The novel cDNA was shown by expression in insect cells to have
polypeptide GalNAc-transferase activity (Table I); it therefore may be
classified as GalNAc-T3. The GalNAc-T3 gene encodes a protein with a
predicted type II transmembrane domain structure similar to GalNAc-T1
and -T2 as well as all other glycosyltransferases characterized thus
far (10). The GalNAc-T3 protein shows an overall amino acid sequence
similarity of approximately 45% to either GalNAc-T1 or -T2, which is
similar to the sequence similarity between GalNAc-T1 and -T2 (3). The
lowest degree of sequence similarity is found in the amino-terminal
region, including the transmembrane domain as well as the putative stem
region. GalNAc-T3 is more than 50 amino acids longer than GalNAc-T1 or
-T2 in the putative stem region. More than 80% of the COOH-terminal
sequence of GalNAc-T1, -T2, and -T3 can be aligned by sequence
similarity including the GalNAc-transferase motif, a number of minor
segments of sequence similarity, and most of the cysteine residues
(Fig. 5). Despite the relative low overall amino acid sequence
similarity between GalNAc-T1, -T2, and -T3, 12 cysteine residues that
are evenly spaced within the major part of the proteins are conserved.
These may be involved in intramolecular disulfide bonding, or they may
be directly involved in the catalytic activity of the enzymes. The
significance of conserved cysteine residues was originally noted by
Drickamer (24) within the sialyltransferase family. The functional
importance of cysteine residues involved in intramolecular disulfide
bonding as well as possibly the catalytic site of the
4-galactosyltransferase was recently demonstrated (25). The number
of conserved cysteine residues in the polypeptide GalNAc-transferases
far exceeds the number of cysteine residues reported in other
glycosyltransferases to date (10). Interestingly, it appears that
in vitro measurable GalNAc-transferase activity is increased
by the presence of reducing agents (3, 26).
GalNAc-T3 was found to have a different acceptor substrate specificity than GalNAc-T1 and -T2. Among a panel of acceptor substrate peptides (Table I), GalNAc-T3 was found to glycosylate a peptide derived from the HIV envelope glycoprotein gp120, which did not serve as substrate for GalNAc-T1 or -T2 (4, Table I). This peptide was identified as an acceptor substrate during analysis of enzyme activity in total extracts of various cell lines and organs (4, 27). GalNAc-T3 also catalyzed glycosylation of mucin-type acceptor sequences such as Muc2 and Muc5, which can also be glycosylated by GalNAc-T1 and -T2. In a previous study we found that the enzyme activity that mediated glycosylation of the HIV peptide also utilized the Muc2 substrate by cross-competitive glycosylation (4). This finding is consistent with the substrate specificity reported here for GalNAc-T3, suggesting that GalNAc-T3 may represent this particular enzyme; however, additional enzymes may also show related specificities. Detailed analysis of individual GalNAc-transferases with a large panel of peptides and structural confirmation of the specific acceptor sites utilized for GalNAc-glycosylation will be necessary to fully understand the specificity of the individual members of the enzyme family.
The first step of mucin-type O-glycosylation is mediated by at least three and probably more GalNAc-transferases. The data presented here clearly show that GalNAc-T3 exhibits a different acceptor substrate specificity than GalNAc-T1 and -T2, using short synthetic peptides with no or little predicted secondary structure. The finding that the three GalNAc-transferases share mucin-type acceptor substrates such as the Muc2 and Muc5 peptide sequences may indicate overlap in specificity, but further structural studies of the products formed to identify the sites utilized by each enzyme on these peptides with multiple serine and threonine residues are needed to clarify this. It is clear that in vivo models displaying differential expression of GalNAc-transferases are needed to evaluate the contribution of a given GalNAc-transferase.
In contrast to GalNAc-T1 and -T2, expression of GalNAc-T3 appears to be highly regulated and mainly found in pancreas and testis; weak expression is found in a few other organs including placenta. Interestingly, approximately 200 bp covering part of the GalNAc-transferase motif of GalNAc-T3 were recently sequenced from a pancreatic expressed sequence Tag library (EMBL accession number T11328[GenBank]). The lack of GalNAc-T3 expression in human liver correlates with the finding that organ extracts from human liver lacked GalNAc-transferase activity utilizing the HIV peptide, whereas expression of GalNAc-T3 mRNA in human placenta is in agreement with GalNAc-transferase activity using the HIV peptide in extracts of placenta (4). One interpretation of these data is that differential expression of different GalNAc-transferases can result in O-glycosylation of distinct sites on a given protein. The biological significance of this is unclear. There are a few studies on O-glycosylation sites, and these are limited to analysis of the functional activity or stability of a protein with or without a single O-glycosylation site (28), because assignments of O-glycosylation sites are difficult to perform (29).
The results presented here suggest that cell-, organ-, and species-specific differences in the position of O-glycosylation may occur as a result of differential expression of polypeptide GalNAc-transferases. In searching for potential motifs of O-glycosylation by analyzing serine and threonine residues carrying O-glycans in glycoproteins, one may need to consider the GalNAc-transferase repertoire of the cell of origin (30, 31). The existence of a transferase family of unknown size possibly exceeding three members displaying differential acceptor substrate specificity and cell/organ distribution suggests that mucin-type O-glycosylation is a much more defined and controlled process than previously recognized. In this respect, previous studies aimed at identifying consensus sequence motifs for O-glycosylation may not have identified such because of the unknown level of complexity. The data reported here suggest that O-glycosylation in terms of sites is less random than previously suggested and that more defined acceptor substrate peptide sequences may be recognized for each of the individual GalNAc-transferases. With the individual GalNAc-transferases expressed as recombinant proteins, it may be possible to determine primary peptide sequence motifs for the individual enzymes, which could be useful for predicting O-glycosylation in vivo by a given cell type.
The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EMBL Data Bank with accession number(s) X92689[GenBank].
To whom the correspondence should be addressed: School of
Dentistry, Nørre Alle 20, DK-2200 Copenhagen N, Denmark. Tel.:
45-3532-6835; Fax: 45-3532-6505.
-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase; bp, base pair(s); kb,
kilobase(s); PCR, polymerase chain reaction; RT, reverse transcriptase;
pfu, plaque-forming units; MTN, multiple tissue Northern blots; HIV,
human immunodeficiency virus.
We are grateful to Dr Michael A. Hollingsworth for helpful advice during the project.
This article has been cited by other articles:
![]() |
K. Amano, Y. Chiba, Y. Kasahara, Y. Kato, M. K. Kaneko, A. Kuno, H. Ito, K. Kobayashi, J. Hirabayashi, Y. Jigami, et al. Engineering of mucin-type human glycoproteins in yeast cells PNAS, March 4, 2008; 105(9): 3232 - 3237. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wopereis, S. Grunewald, K. M.L.C. Huijben, E. Morava, R. Mollicone, B. G.M. van Engelen, D. J. Lefeber, and R. A. Wevers Transferrin and Apolipoprotein C-III Isofocusing Are Complementary in the Diagnosis of N- and O-Glycan Biosynthesis Defects Clin. Chem., February 1, 2007; 53(2): 180 - 187. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Garringer, C. Fisher, T. E. Larsson, S. I. Davis, D. L. Koller, M. J. Cullen, M. S. Draman, N. Conlon, A. Jain, N. S. Fedarko, et al. The Role of Mutant UDP-N-Acetyl-{alpha}-D-Galactosamine-Polypeptide N-Acetylgalactosaminyltransferase 3 in Regulating Serum Intact Fibroblast Growth Factor 23 and Matrix Extracellular Phosphoglycoprotein in Heritable Tumoral Calcinosis J. Clin. Endocrinol. Metab., October 1, 2006; 91(10): 4037 - 4042. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Egelund, B. L. Petersen, M. S. Motawia, I. Damager, A. Faik, C. E. Olsen, T. Ishii, H. Clausen, P. Ulvskov, and N. Geshi Arabidopsis thaliana RGXT1 and RGXT2 Encode Golgi-Localized (1,3)-{alpha}-D-Xylosyltransferases Involved in the Synthesis of Pectic Rhamnogalacturonan-II PLANT CELL, October 1, 2006; 18(10): 2593 - 2607. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Kato, C. Jeanneau, M. A. Tarp, A. Benet-Pages, B. Lorenz-Depiereux, E. P. Bennett, U. Mandel, T. M. Strom, and H. Clausen Polypeptide GalNAc-transferase T3 and Familial Tumoral Calcinosis: SECRETION OF FIBROBLAST GROWTH FACTOR 23 REQUIRES O-GLYCOSYLATION J. Biol. Chem., July 7, 2006; 281(27): 18370 - 18377. [Abstract] [Full Text] [PDF] |
||||
![]() |
M F Campagnoli, A Pucci, E Garelli, A Carando, C Defilippi, R Lala, G Ingrosso, I Dianzani, M Forni, and U Ramenghi Familial tumoral calcinosis and testicular microlithiasis associated with a new mutation of GALNT3 in a white family. J. Clin. Pathol., April 1, 2006; 59(4): 440 - 442. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wopereis, D. J. Lefeber, E. Morava, and R. A. Wevers Mechanisms in Protein O-Glycan Biosynthesis and Clinical and Molecular Aspects of Protein O-Glycan Biosynthesis Defects: A Review Clin. Chem., April 1, 2006; 52(4): 574 - 600. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Berois, D. Mazal, L. Ubillos, F. Trajtenberg, A. Nicolas, X. Sastre-Garau, H. Magdelenat, and E. Osinaga UDP-N-Acetyl-D-Galactosamine: Polypeptide N-Acetylgalactosaminyltransferase-6 as a New Immunohistochemical Breast Cancer Marker J. Histochem. Cytochem., March 1, 2006; 54(3): 317 - 328. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Comelli, S. R. Head, T. Gilmartin, T. Whisenant, S. M. Haslam, S. J. North, N.-K. Wong, T. Kudo, H. Narimatsu, J. D. Esko, et al. A focused microarray approach to functional glycomics: transcriptional regulation of the glycome Glycobiology, February 1, 2006; 16(2): 117 - 131. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Larsson, X. Yu, S. I. Davis, M. S. Draman, S. D. Mooney, M. J. Cullen, and K. E. White A Novel Recessive Mutation in Fibroblast Growth Factor-23 Causes Familial Tumoral Calcinosis J. Clin. Endocrinol. Metab., April 1, 2005; 90(4): 2424 - 2427. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. J. Olson, M. Backstrom, H. Karlsson, J. Burchell, and G. C. Hansson A MUC1 tandem repeat reporter protein produced in CHO-K1 cells has sialylated core 1 O-glycans and becomes more densely glycosylated if coexpressed with polypeptide-GalNAc-T4 transferase Glycobiology, February 1, 2005; 15(2): 177 - 191. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. T. Marcos, S. Pinho, C. Grandela, A. Cruz, B. Samyn-Petit, A. Harduin-Lepers, R. Almeida, F. Silva, V. Morais, J. Costa, et al. Role of the Human ST6GalNAc-I and ST6GalNAc-II in the Synthesis of the Cancer-Associated Sialyl-Tn Antigen Cancer Res., October 1, 2004; 64(19): 7050 - 7057. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Mardberg, K. Nystrom, M. A. Tarp, E. Trybala, H. Clausen, T. Bergstrom, and S. Olofsson Basic amino acids as modulators of an O-linked glycosylation signal of the herpes simplex virus type 1 glycoprotein gC: functional roles in viral infectivity Glycobiology, July 1, 2004; 14(7): 571 - 581. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Miyahara, J. Shoda, T. Kawamoto, M. Furukawa, T. Ueda, T. Todoroki, N. Tanaka, K. Matsuo, Y. Yamada, K. Kohno, et al. Expression of UDP-N-Acetyl-{alpha}-D-Galactosamine-Polypeptide N-Acetylgalactosaminyltransferase Isozyme 3 in the Subserosal Layer Correlates with Postsurgical Survival of Pathological Tumor Stage 2 Carcinoma of the Gallbladder Clin. Cancer Res., March 15, 2004; 10(6): 2090 - 2099. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Strutt, J. Mundy, K. Hofstra, and D. Strutt Cleavage and secretion is not required for Four-jointed function in Drosophila patterning Development, February 15, 2004; 131(4): 881 - 890. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Kinarsky, G. Suryanarayanan, O. Prakash, H. Paulsen, H. Clausen, F.-G. Hanisch, M. A. Hollingsworth, and S. Sherman Conformational studies on the MUC1 tandem repeat glycopeptides: implication for the enzymatic O-glycosylation of the mucin protein core Glycobiology, December 1, 2003; 13(12): 929 - 939. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Kouzuma, Y. Suzuki, M. Nakano, K. Matsuyama, S. Tojo, M. Kimura, T. Yamasaki, H. Aoyagi, and T. Hatakeyama Characterization of Functional Domains of the Hemolytic Lectin CEL-III from the Marine Invertebrate Cucumaria echinata J. Biochem., September 1, 2003; 134(3): 395 - 402. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. T. Marcos, A. Cruz, F. Silva, R. Almeida, L. David, U. Mandel, H. Clausen, S. von Mensdorff-Pouilly, and C. A. Reis Polypeptide GalNAc-transferases, ST6GalNAc-transferase I, and ST3Gal-transferase I Expression in Gastric Carcinoma Cell Lines J. Histochem. Cytochem., June 1, 2003; 51(6): 761 - 771. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Iwasaki, Y. Zhang, K. Tachibana, M. Gotoh, N. Kikuchi, Y.-D. Kwon, A. Togayachi, T. Kudo, T. Kubota, and H. Narimatsu Initiation of O-Glycan Synthesis in IgA1 Hinge Region Is Determined by a Single Enzyme, UDP-N-Acetyl-alpha -D-galactosamine:Polypeptide N-Acetylgalactosaminyltransferase 2 J. Biol. Chem., February 14, 2003; 278(8): 5613 - 5621. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhang, H. Iwasaki, H. Wang, T. Kudo, T. B. Kalka, T. Hennet, T. Kubota, L. Cheng, N. Inaba, M. Gotoh, et al. Cloning and Characterization of a New Human UDP-N-Acetyl-alpha -D-galactosamine:Polypeptide N-Acetylgalactosaminyltransferase, Designated pp-GalNAc-T13, That Is Specifically Expressed in Neurons and Synthesizes GalNAc alpha -Serine/Threonine Antigen J. Biol. Chem., January 3, 2003; 278(1): 573 - 584. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Argueso, A. Tisdale, U. Mandel, E. Letko, C. S. Foster, and I. K. Gipson The Cell-Layer- and Cell-Type-Specific Distribution of GalNAc-Transferases in the Ocular Surface Epithelia Is Altered during Keratinization Invest. Ophthalmol. Vis. Sci., January 1, 2003; 44(1): 86 - 92. [Abstract] [Full Text] [PDF] |
||||
![]() |
|