JBC Advanced Peptides, Inc.

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kurima, K.
Right arrow Articles by Schwartz, N. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kurima, K.
Right arrow Articles by Schwartz, N. B.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

J Biol Chem, Vol. 274, Issue 47, 33306-33312, November 19, 1999


Genomic Organization of the Mouse and Human Genes Encoding the ATP Sulfurylase/Adenosine 5'-Phosphosulfate Kinase Isoform SK2*

Kiyoto KurimaDagger , Bhawani Singh§, and Nancy B. SchwartzDagger parallel

From the Departments of Dagger  Pediatrics, § Chemistry, and  Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES

Mammalian ATP sulfurylase/adenosine 5'-phosphosulfate (APS) kinase consists of kinase and sulfurylase domains, and catalyzes two sequential reactions to synthesize the universal sulfate donor, phosphoadenosine phosphosulfate (PAPS). In simpler organisms, the ATP sulfurylase and APS kinase reactions are catalyzed by separate enzymes encoded by two or three genes, suggesting that a fusion of separate genes during the course of evolution generated the bifunctional enzyme. We have characterized the genomic structure of the PAPS synthetase SK2 isoform genes for mouse (MSK2) and human (HSK2) and analyzed the possible fusion region. The MSK2 and HSK2 genes exhibit a common structure of 13 exons, including a 15-nucleotide alternatively spliced exon 8. Enzyme activities of several bacterially expressed exon assemblages showed exons 1-6 encode APS kinase, while exons 6-13 encode ATP sulfurylase. The MSK2 construct without the exon 6-encoded peptide showed no kinase or sulfurylase activity, demonstrating that exon 6 encodes sequences required for both activities. Exon 1 and its 5'-flanking sequence are highly divergent between the two species, and intron 1 of the HSK2 gene contains a region similar to the MSK2 promoter sequence, suggesting that it may be the remnant of a now-superceded regulatory region. The HSK2 promoter contains a GC-rich region, not present in the mouse promoter, and has few transcription factor binding sites in common with MSK2. These differences in the two promoter regions suggest that species-specific mechanisms regulate expression of the SK2 isoform.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES

Biosynthesis of the universal sulfate donor, phosphoadenosine phosphosulfate (PAPS),1 requires two sequential enzymatic reactions: the transfer of sulfate to ATP to form APS catalyzed by ATP sulfurylase (EC 2.7.7.4), followed by transfer of phosphate to APS yielding PAPS catalyzed by APS kinase (EC 2.7.1.25) (1). The ATP sulfurylase and APS kinase reactions are catalyzed by separate proteins in organisms such as bacteria (2-4), fungi (5, 6), yeast (7, 8), and plants (9-11). In contrast, the catalytic units for the ATP sulfurylase and APS kinase reactions are fused in animals; cDNAs encoding a single bifunctional protein have been isolated in the marine worm Urechis caupo (12), in Drosophila melanogaster (13), and in several mammalian species (14-17). Recombinant individual ATP sulfurylase or APS kinase domains of a mouse PAPS synthetase retain the ability to catalyze the respective reactions (18), implying that the ancestral PAPS synthetase gene was formed by fusion of separate genes encoding the independent functional units.

Variations in the structural organization of monofunctional and bifunctional PAPS synthetase genes in simpler organisms and animals, respectively, likely reflect differing histories of gene fusion/duplication events in evolution. For instance, the sulfate activation operon in Escherichia coli encodes ATP sulfurylase (cysD and cysN), followed by APS kinase (cysC) (3). In a symbiotic nitrogen-fixing bacterium, Rhizobium meliloti, there are only two genes; nodP is homologous to cysD, and encodes a subunit of ATP sulfurylase, while nodQ is homologous to cysN and cysC, encoding both a subunit of ATP sulfurylase and APS kinase (4). In a filamentous fungus, Penicillium chrysogenum, the ATP sulfurylase gene encodes the ATP sulfurylase domain plus a nonfunctional APS kinase-like domain at the C terminus (5), reminiscent of the cysN-cysC fusion in R. meliloti nodQ. A functional APS kinase is encoded by a separate gene in P. chrysogenum as well. In plants and yeast, the functional catalytic units are encoded by two separate genes with no combination of domains. In the fused PAPS synthetases, the structural orientation of the two functional domains is the reverse of the nodQ arrangement with APS kinase N-terminal of the ATP sulfurylase, suggesting an unrelated origin of gene fusion. Our previous activity and stability studies with rearranged recombinant enzymes showed that reversing the order of the domain organization resulted in either diminished activity or a thermally unstable enzyme (18). Such a structural or functional advantage could be a possible selective factor for gene fusion in a specific order. Thus far, the complete genomic structure of the fused PAPS synthetase genes has not been determined; further, this additional information is needed to gain insights into the mechanism of the gene fusion that brought two separate genes into a single gene encoding a bifunctional PAPS synthetase.

The PAPS synthesis pathway plays an important role in normal cartilage and skeletal development in mammals as demonstrated by the recent identification of PAPS synthetase isoforms (14, 16) and of mutations in one member of the PAPS synthetase gene family, SK2, in murine brachymorphism and human spondyloepimetaphysial dysplasia (16, 17). Brachymorphism is characterized by a dome-shaped skull, short thick tail, and shortened but not widened limbs (19), and is associated with limited PAPS availability due to a reduction of PAPS synthetase activity (20-22). Interestingly, severely reduced PAPS synthetase activity was found in brachymorphic cartilage and liver, while no reduction was observed in brachymorphic brain and skin (23). Spondyloepimetaphysial dysplasia (Pakistani type) is characterized by short and bowed lower limbs, enlarged knee joints, and early onset of degenerative joint disease in the hands and knees (24). The presence of two PAPS synthetase genes and the tissue-specific defects in PAPS synthesis in mammalian mutants implicate coordinated mechanisms that control the expression of the PAPS synthetase genes.

To gain better insights into the origin and regulation of the PAPS synthetase gene, we determined the mouse and human genomic structures of one PAPS synthetase gene family member, SK2, whose importance in development is highlighted by both human and murine growth disorders due to mutations in this gene. Recombinant monofunctional ATP sulfurylase and APS kinase proteins and a fused protein with an internal deletion were produced to determine the distribution of exons for each catalytic domain. We also analyzed the 5'-flanking region sequences of the genes for potential transcription binding sites to guide future functional expression studies.

    MATERIALS AND METHODS
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES

Analysis of the Mouse and Human SK2 Genomic Structure-- A mouse C57BL/6 genomic BAC library (GenomeSystem, St. Louis, MO) was screened with a 32P-labeled mouse SK2 cDNA fragment following the manufacturer's instructions. A human genomic BAC library, release I (GenomeSystem) was screened by PCR with a set of human SK2 primers derived from cDNA sequence (accession no. AF173365 reported in the present study). Two mouse BAC clones, BACSK2#1 and BACSK2#5, and one human BAC clone, BAChSK2, were isolated and various plasmid subclones were prepared. Direct sequencing and PCR amplification with cDNA-derived primers were performed to determine the genomic structure, comprising exon-intron boundaries and the intron sizes. Search of publicly available human genomic sequences identified two BAC clone, CIT-HSP-306M19 (Caltech Genome Research Laboratory) and AC006191 (Human Chromosome 10 Sequencing Group, Sanger Center), both containing exons 1 through 9 of the HSK2 and at least 8 kb from the upstream 5'-flanking region of the gene. The sequence data obtained from BAChSK2, CIT-HSP-306M19, and AC006191 were used to determine the complete human genomic structure. The TESS (Transcription Element Search Software) program was used to search the sequence for high quality matches to a data base of position-weighted nucleotide distribution matrices.

Alternatively Spliced Mouse and Human SK2 Open Reading Frame cDNA-- Reverse transcription-PCR was performed on total RNA extracted from 3-day-old C57BL mouse cartilage. The first strand was synthesized by SuperScript II Reverse Tanscriptase (Life Technologies, Inc.) using an antisense primer (SK5, 5'-GCAATTGGATACAGAGCAGC-3') complementary to the 3'-untranslated region of SK2 mRNA. The complete open reading frame cDNA was amplified with a 5' end sense primer containing a NdeI site (SK30, 5'-AGAGAGTTCCATATGTCTGCAAATTCCAAAATGAACCATAAAAGAGACCAGC-3') and a 3' end antisense primer containing a XhoI site (SK31, 5'-GAGAGAGATTCGAGCTAGTTGGTCTTCTCCAGAGACCTGTAGTAATCTGTCAACAC-3') using Expand polymerase (Roche Molecular Biochemicals). Reverse transcription-PCR total cartilage RNA yielded a single band approximately 1.9 kb in length. The fragment was digested with NdeI and XhoI and cloned into appropriate restriction sites in a bacterial expression vector, pET-15b (Novagen). Sequence was determined using the dRhodamine terminator cycle sequencing kit (Perkin-Elmer) and ABI model 377 DNA sequencer (Applied Biosystems). The sequence was compared with previously reported mouse SK2 cDNA (accession no. AF052453) prepared from liver RNA.

In order to isolate a human SK2 cDNA, the deduced amino acid sequence of mouse SK2 (accession no. AF052453) was used as a BLAST query sequence against the human EST data base maintained by the National Center for Biotechnology Information at the National Library of Medicine. Nucleotide sequences from several EST clones suspected to contain human SK2 cDNA were used to generate oligonucleotide primers. The 5' end sequence of a human SK2 cDNA was obtained by the 5'-inverse PCR method (25) using human fetal liver poly(A)+ RNA (CLONTECH). The complete open reading frame for a liver HSK2 cDNA was amplified from Marathon-ready human fetal liver cDNA (CLONTECH) with a 5' end sense primer containing a NdeI site (HSK2-16, 5'-AGAGAGTTCCATATGTCGGGGATCAAGAAGC-3') and a 3' end antisense primer containing a BglII site (HSK2-4, 5'-GAGAGAGATCTCGAGTTAGTTCTTCTCCAGGGACCTGTAATAATCTG-3') using Expand polymerase (Roche Molecular Biochemicals). The sequence was compared with a previously reported HSK2 cDNA (accession no. AF091242) prepared from cartilage RNA (17).

Expressed Mouse SK2 Constructs and Enzyme Assays-- The original SK2 construct in a pET-15b bacterial expression vector was prepared as described (16) and used as the template DNA to prepare other constructs by PCR. MSK2 is the original SK2 (accession no. AF052453) prepared from liver RNA, while MSK2Delta 8 is an alternatively spliced SK2 prepared from cartilage RNA, lacking the 15-nt/5-amino acid insert (amino acid residues 290-294) encoded by exon 8. Two APS kinase constructs were prepared. The functional APS kinase domain construct, APSK21-5.5 (residues 1-227) corresponds to the mouse SK1 APS kinase domain previously shown to catalyze the APS kinase reaction (18), and is encoded by exon 1 through the first 72 bp of exon 6. Another APS kinase construct, APSK21-5 (residues 1-214), is encoded by exons 1-5 only. Two ATP sulfurylase constructs were prepared. ATPS26-13 (residues 215-621) is an ATP sulfurylase domain protein encoded by exons 6-13, while ATPS27-13 (residues 253-621) is another ATP sulfurylase protein encoded by exons 7-13. MSK2Delta 6 (residues 1-214 and 253-621) is a MSK2 deletion construct lacking the sequence encoded by exon 6 (residues 215-252).

E. coli strain BL21(DE3) was transformed by the heat-shock method with cDNA constructs cloned in the pET-15b vector (Novagen). Isopropyl-1-thio-beta -D-galactopyranoside (1 mM) was added to an overnight bacterial culture and incubated for an additional 4 h. The bacterial cell culture was centrifuged at 9,000 × g, and the pellets were resuspended into IMAC 5 buffer (5 mM imidazole, 50 mM Tris, pH 7.9). The expressed protein was extracted by cell sonication followed by centrifugation. MSK2, MSK2Delta 8, APSK21-5.5, APSK21-5, and ATPS26-13 proteins were soluble and present in the supernatant, whereas MSK2Delta 6, and ATPS27-13 were insoluble and present in the bacterial pellet as inclusion bodies. To purify the soluble expressed protein, the supernatant was loaded onto a His·Bind resin column (Novagen) followed by two washes (30 ml) with IMAC30 buffer to remove bacterial contaminant proteins. APSK21-5 protein eluted in IMAC30 buffer wash, while other proteins including APSK21-5.5 were eluted with IMAC400 buffer (4-6 ml). Insoluble proteins, MSK2Delta 6 and ATPS27-13, were extracted with 6 M urea in IMAC5 buffer. Both His·Bind-purified and urea-extracted enzymes were dialyzed into phosphate buffer (25 mM NaH2PO4-K2HPO4, pH 7.8, 1 mM dithiothreitol, 1 mM EDTA) overnight and diluted with phosphate buffer to 20 µg/ml for enzymatic assays. ATP sulfurylase was assayed in the reverse direction of ATP formation as described (26); standard assay contained 50 mM NaH2PO4-K2HPO4, pH 7.8, 12 mM MgCl2, 0.5 mM dithiothreitol, 5 mM NaF, 0.2 mM Na4P2O (containing 6.7 mCi of 32P), 0.1 mM APS, and 50 µl of enzyme preparation. APS kinase was assayed as described (27); standard assay contained 80 nM [35S]APS, 0.5 mM ATP, pH 7.0, 5 mM MgCl2 10 mM ammonium sulfate, and 12 µl of enzyme, brought up to 25 µl with buffer A (25 mM NaH2PO4-K2HPO4, pH 7.8, 1 mM dithiothreitol, 1 mM EDTA, and 10% glycerol). The overall reaction with ATP and SO42- as substrate and measuring the production of APS and PAPS was assayed as described (28); standard overall assay contained 0.4 mM [35S]H2SO4, 10 mM ATP, 20 mM MgCl2, 22 mM Tris-HCl, pH 8.0, and 15 µl of enzyme preparation in a total volume of 25 µl.

    RESULTS
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES

Genomic Structure of the MSK2 and HSK2 Genes-- The genomic structure of one PAPS synthetase gene family member, SK2, was determined both in mouse and human. A mouse BAC clone, BACSK2#1, contained exons 2-13, while another BAC clone, BACSK2#5, contained the mouse exon 1 and 5'-flanking regions. One human BAC clone, BAChSK2, contained the whole gene. Two additional BAC clones, CIT-HSP-306M19 (Caltech Genome Research Laboratory) and AC006191 (Human Chromosome 10 Sequencing Group, Sanger Center) contained exons 1-9 of the HSK2 gene. The sequence data obtained from these five clones were used to determine the whole genomic structure for the MSK2 and HSK2 genes.

The MSK2 and HSK2 genes have identical exon organizations, consisting of 13 exons (Fig. 1A) with a 15-nt alternatively spliced exon 8 (Fig. 2; see below). The minimum sizes of the genes are 40 and 85 kb for the MSK2 and HSK2 genes, respectively. Nucleotide sequences of the intron/exon boundaries show that all introns in the MSK2 and HSK2 genes begin with a 5'-GT dinucleotide and conclude with 3'-AG termini (Table I). The size of exons and phases of splicing are also conserved between the species, with the exception of exon 1. The deduced amino acid sequences at the N termini encoded by exon 1, as well as the nucleotide sequences of the 5'-flanking regions, are significantly different between MSK2 and HSK2 (Fig. 4). By scanning the intron 1 sequence of the HSK2 gene, we identified a 800-bp sequence 25 kb downstream from exon 1, which is similar to the MSK2 5'-flanking region sequence (68% identity). However, this region does not appear to encode a peptide corresponding to the MSK2 N-terminal sequence, or to contain transcription factor binding motifs as found in the MSK2 promoter region, and thus may be the remnant of a former promoter.


View larger version (27K):
[in this window]
[in a new window]
 
Fig. 1.   Structure of the mouse and human SK2 genes. A, exons are represented as closed boxes and introns are represented as the lines connecting the boxes in the gene structure. Distribution of exons within the corresponding domains of SK2 protein is shown below. A black box represents the location of the alternatively spliced exon 8 region in the ATP sulfurylase domain (stippled). Exon and intron sizes are shown in Table I. B, comparison with the C. elegans SK gene. Exons are represented as open boxes with coding sequence nucleotide number indicated, and introns are represented as the lines connecting the boxes with intron base pairs. A black box represents the unique insert in exon 4 corresponding to the exon 8 insert found in mammalian SK2. Striped boxes indicate the interdomain regions in both molecules.


View larger version (14K):
[in this window]
[in a new window]
 
Fig. 2.   Sequence around the alternative splice site in SK2 and SK2Delta 8 in mouse and human. Amino acid sequences are shown in uppercase and nucleotide sequence in lowercase. Alternatively spliced 15-nt sequences encoded by each exon 8 are underlined. Amino acid sequences added by the splicing are italicized.

                              
View this table:
[in this window]
[in a new window]
 
Table I
Exon-intron structure of the mouse and human SK2 gene
Intron sequence is shown in lowercase and exon sequence in uppercase. The highly conserved gt...ag motif is shown in boldface.

Comparison of the exon structures to the functional domains of PAPS synthetases reveals that the APS kinase domain is encoded by exon 1 through exon 6, base 39, and the ATP sulfurylase domain is encoded by exon 6, base 40, through exon 13. The intervening sequence present in the functional APS kinase domain is encoded by exon 5, base 48 through exon 6, base 39. An alternatively spliced 15-nt sequence comprises one exon, designated as exon 8 (Fig. 2).

Exon distribution of several functional motifs was also analyzed. The GXXGXGK motif essential for ATP binding (P-loop) (29) is found in the APS-kinase domain (50GLSGAGK56). The first base of the G50 codon is encoded by exon 2, and the rest of the P-loop codons reside in exon 3, forming a phase I splicing. Our previous study on mutational analysis of the P-loop in mouse SK1 showed that a mutation of glycine 50 to alanine did not affect any of the enzyme activities (30), indicating that the functionally essential residues of the P-loop are solely encoded by exon 3. Another functional unit, a PAPS-dependent enzyme motif, KAXAXXXXFTG, in mouse SK2 is found as 166KRARAGEIKG175FTG178 in the kinase portion. The first base of the G175 codon is encoded by exon 4, and the rest of the motif is encoded by exon 5, forming a phase I splicing. The FISP motif (31), HXXH motif (phosphodiester cleavage) (32) and PP-loop (IVGRDPAG, pyrophosphate binding) (33) motif are each encoded entirely by exons 3, 11, and 12, respectively. Functionally essential residues in each of these form motifs have been identified.2

Alternatively Spliced SK2 Form Isolated Mouse and Human-- In previous studies of mouse and human SK2 cDNAs (16, 17), the deduced amino acid sequences showed high sequence identity (93%). However, HSK2 lacks five amino acids in the ATP-sulfurylase domain, and the first 9 or 11 N-terminal residues are dissimilar between MSK2 and HSK2. Amplification of MSK2 partial cDNA generated from cartilage RNA identified a MSK2 cDNA variant, referred to as MSK2Delta 8 (Fig. 2). This cartilage cDNA lacks 15 nt at positions 869-883, predicting a deletion of the five amino acids, GVVPR, from residues 290-294, in the ATP sulfurylase domain. Analysis of the genomic sequence shows that this 5-amino acid segment is encoded by an exon with typical flanking donor and acceptor sequences (Table I). PCR analysis of MSK2 gene expression showed that MSK2Delta 8 is the major form in cartilage, while the original MSK2 is the major form in liver (data not shown). We have prepared liver HSK2 cDNA, and the sequence was compared with HSK2 cDNA isolated from a cartilage cDNA library (accession no. AF091242). The liver HSK2 cDNA is equivalent to MSK2 and contains the 15-nt exon 8 sequence, while the cartilage HSK2 protein is equivalent to MSK2Delta 8, lacking the 5-amino acid sequence, GMALP from residue 289 to 293, encoded by exon 8. Both MSK2 and MSK2Delta 8 proteins were bacterially expressed, and the ability of these proteins to catalyze the APS kinase, ATP sulfurylase, and overall PAPS synthetase reactions in vitro was compared. No significant difference between the two proteins was observed in any of the three assays (see below).

Recombinant Enzyme Assays-- In order to determine the distribution of exons encoding each enzyme catalytic unit, several expression constructs with different exon compositions were prepared, and the expressed proteins subjected to enzyme assays (Fig. 3). Three assays were used to assess expression levels for the recombinants as well as the functionality of each mutant: the reverse sulfurylase, the forward kinase, and the overall reaction, which measures APS and PAPS simultaneously. The full-length proteins MSK2 and MSK2Delta 8 were soluble proteins, and after purification and dialysis were assayed for the three activities (see "Materials and Methods" for details). MSK2 exhibited normal sulfurylase (2.35 µmol of ATP/min·mg), kinase (227.9 pmol of PAPS/min·mg), and overall (13.6 nmol of PAPS/min·mg) activities. Alternatively spliced isoform MSK2Delta 8 also exhibited all three activities comparably to MSK2. The APS kinase constructs APSK21-5.5 and APSK21-5 were both soluble in IMAC5 buffer, but APSK21-5 exhibited decreased binding to His·Bind resin. APSK21-5 also failed to exhibit kinase activity, whereas an APS kinase construct with C-terminal amino acids encoded by exon 6 (APSK21-5.5) showed APS kinase activity comparable to MSK2. This result shows that the first 13 residues of the linker sequence encoded by exon 6 are required for the APS kinase activity. The ATP sulfurylase construct ATPS2 6-13 was soluble, whereas ATPS27-13 was found in the bacterial pellet and had to be solubilized with M urea followed by dialysis into phosphate buffer. When tested for sulfurylase activity, ATPS26-13 showed normal sulfurylase activity while ATPS27-13 was inactive. To ensure that the urea treatment did not cause irreversible denaturation, ATPS26-13 was also subjected to urea treatment and reequilibration. Urea-solubilized ATPS26-13 was initially inactive, but after dialysis into phosphate buffer, activity comparable to the non-urea-treated enzyme preparations was restored. The MSK2Delta 6 construct was designed in order to test whether additional sequences at the C terminus of APSK21-5 and N terminus of ATPS27-13 would increase enzyme solubility and restore the kinase and sulfurylase activities, respectively. MSK2Delta 6 was insoluble, and following urea extraction and dialysis into phosphate buffer, the recombinant protein was still unable to catalyze ATP sulfurylase or APS kinase reactions. This demonstrates a requirement for exon 6 sequences similar to that observed for the separate domain constructs.


View larger version (21K):
[in this window]
[in a new window]
 
Fig. 3.   Schematic diagram of bacterially expressed mouse SK2 constructs and enzyme activities. The APS kinase domain is depicted as a white box, the ATP sulfurylase domain as a stippled box, and the 5-amino acid insert as a black box. The striped boxes represent the interdomain region. The intact sulfurylase/kinase (MSK21-13) is encoded by exons 1-13, while MSK2Delta 8 lacks the 5 amino acid residues encoded by exon 8. APSK21-5.5 includes the APS-kinase domain and the first 13 amino acids encoded by exon 6, while APSK21-5 is an APS kinase domain encoded by exon 1-5 only. ATPS26-13 is an ATP sulfurylase domain encoded by exon 6-13, while ATPS27-13 lacks the region encoded by exon 6. MSK2Delta 6 is MSK2 without the exon 6-encoded region. The results of in vitro enzyme assays for each construct are shown at the left. Y represents the presence of the activity, while N represents the lack of the activity. An asterisk (*) represents expressed protein that failed to be released into the reaction buffer and had to be solubilized as described.

Sequence Analysis of the 5'-Flanking Region of the MSK2 and HSK2 Genes-- The MSK2 (1 kb) and HSK2 (2+ kb) 5'-flanking sequences were examined for the presence of promoter elements. The TESS (Transcription Element Search Software) program was used to search the sequence for high quality matches to a data base of position-weighted nucleotide distribution matrices. Analysis of 1 kb of the MSK2 gene immediately flanking the translation start site detected potential binding motifs for a variety of transcription factors (Fig. 4A). These include response elements for progesterone receptor, EBP-1, NF-kappa B, MyoD, PU.1, CREB, Pit-1a, CCAAT-binding protein, NF-Y, Evi-1, TIN-1, and GRalpha beta . With regard to the HSK2 gene, the 5'-flanking sequences of the HSK2 gene differ significantly from those of the MSK2 gene (Fig. 4B). The HSK2 proximal region (2000 bp) has a subregion with high content of G + C (73% in the 500-bp proximal region) and at least nine potential Sp-1 binding sites, present in many constitutive gene promoters. Although not an exhaustive list, potential binding sites in the HSK2 include NF-E2, LyF-1, MEF-2, Pit-1a, AP-1, TFE3-S, USF, CAC-binding protein, NFAT-1, and CCAAT-binding protein. Only a few detected motifs (Pit-1a, CCAAT-binding protein) were common to both MSK2 and HSK2 5'-flanking regions.


View larger version (52K):
[in this window]
[in a new window]
 
Fig. 4.   Nucleotide sequence from the mouse (A) and human (B) SK2 5'-flanking regions and first exons. Translated exon 1 sequences are shown in bold, and the first methionine codon ATG is assigned +1. TATA-box sequences are bold and italicized. Underlined cis-elements delineate binding sites for the transcription factors indicated.


    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES

A number of eukaryotic multifunctional enzymes have been shown to consist of several monofunctional catalytic domains, each of which are encoded by a single gene in simpler organisms (18, 34-38). When expressed as monofunctional domains, each domain often retains its activity, suggesting that the genes encoding each catalytic unit were fused during the course of evolution (18, 39, 40). It is also common that these present-day multidomain enzymes have interdomain segments that are not homologous to any protein sequences and display no clear boundary of those domains (14, 36, 37, 41, 42). In addition, structure comparisons, as well as phylogenetic analysis of multidomain proteins in different organisms, show a striking variety of gene fusion events. In the case of PAPS synthetases, the APS kinase domain is located at the N terminus and ATP sulfurylase at the C terminus linked by a short non-homologous intervening sequence, and the boundary of the two functional catalytic units has been established (18). The structural organization of genes encoding APS kinase and APS sulfurylase in E. coli and fungi suggests the simple gene fusion of adjacent genes, while the reverse domain structure in the present-day bifunctional PAPS synthetases implicates a historically unrelated gene fusion event. Elucidation and analysis of the genomic structures of the mammalian and Caenorhabditis elegans PAPS synthetase genes provide a possible mechanism for how these bifunctional enzymes have evolved.

The MSK2 and HSK2 genes are structurally very similar, consisting of 13 exons with some variation in intron sizes. Exon sizes and intron phases are also conserved between the species, with the exception of exon 1. The exact sizes of the genes have not been determined, but the minimum sizes are 45 and 80 kb for the MSK2 and HSK2 genes, respectively. We have identified two splicing variants in mouse and human SK2 cDNAs, and the alternatively spliced 15 nt was identified as an exon in both genomes (Figs. 1 and 2). The SK2Delta 8 proteins lack 5 amino acids, GVVPR in MSK2 and GMALP in HSK2 in the ATP sulfurylase domain. A similar 15-nt/5-amino acid insertion, MDGSY, is found in D. melanogaster, and a larger 19-22-amino acid insertion in C. elegans PAPS synthetase at the same location. Consistent occurrence of a splice variant at this position among the different species implies some functional significance; however, no differences in the ability of MSK2 and MSK2Delta 8 to catalyze ATP sulfurylase, APS kinase, or PAPS synthetase overall reactions were detected. Additionally, there is no apparent motif associated with the inserted sequences; therefore, the insert may not influence enzyme function directly.

Relative to the exon distribution, the APS kinase domain is encoded by exons 1-6, while the ATP sulfurylase domain is encoded by exon 6-13. Based on the recombinant monofunctional constructs of the mouse SK1 protein, the first 236 amino acid residues constitute a functional APS kinase domain, while the ATP sulfurylase domain consists of residues 237-624 (18). The functional boundary between the APS kinase and ATP sulfurylase is thus located in exon 6 of the MSK2 gene. In fact, exon 6 encodes only the last 13 C-terminal amino acid residues of the APS kinase. Initially, because of the location of the functional boundary relative to the location of the exon 5/6 boundary and the phase 0 exon insertion of intron 5, we suspected that the exon 5/6 boundary might be the ancestral fusion site where the two separate genes were fused. However, the bacterially expressed protein encoded by exons 1-5, APSK21-5, failed to exhibit APS kinase activity, and APS kinase activity was restored only when an additional 13 amino acid residues encoded by exon 6 were added to the APSK21-5.5 protein. The failure of the MSK2Delta 6 protein to catalyze both APS sulfurylase and APS kinase reactions further supports the contention that the exon 5/6 boundary does not reflect the ancestral fusion site.

In fact, examination of the C. elegans PAPS synthetase gene, which consists of only six exons, with the large exon 4 encoding a part of both the APS kinase and ATP sulfurylase domains, further suggests random insertion of introns. Since there is no correspondence in the location of the introns between the C. elegans PAPS synthetase gene and the mammalian PAPS synthetase gene (Fig. 1B), it is plausible that introns were inserted randomly and independently into an intron-less ancient fused PAPS synthetase gene in different organisms. Alternatively, pre-existing introns could have been lost as well. Locations of introns are not necessarily coincident with the boundaries of catalytic units in other bi- or multifunctional enzymes (43, 44). Introns appear to be able to be located proximal to the functional boundary as seen in PAPS synthetases, or distal as seen in the UMP synthetase gene (43). Although introns appear to have been introduced randomly, it is interesting that most of the highly conserved functional motifs that this complicated bifunctional enzyme contains, including FISP, phosphodiester (HXXH) motif, and PP-loop, are all contained within single exons. An exception is Gly50 of the P-loop, which is encoded by exon 2 while the rest of the motif is encoded by exon 3. However, we have shown that Gly50 is not essential for kinase activity, and therefore may not be part of this common motif in PAPS synthetases.

Another interesting finding from the genomic sequence data for the two species is the lack of sequence similarity in exon 1 and its 5'-flanking regions. For instance, nine amino acid residues, MSGIKKQKT, are encoded by HSK2 exon 1, while 11 residues, MSANSKMNHKR, are encoded by MSK2 exon 1. The 5'-flanking region of the HSK2 gene is rich in GC sequence, characteristic of a housekeeping-type promoter with multiple Sp-1 binding sites, while the MSK2 gene contains several potential recognition sites for tissue-specific transcription factors, such as progesterone receptor, MyoD, PU.1, GC1, Pit-1a, Evi-1, TIN-1, and GRalpha beta . Even though the significance of these transcription factor binding motifs must await functional analysis, the highly heterologous promoter sequences of the two orthologous genes implies different origins and modes of regulation of the promoter. A stretch of 800 bp in intron 1 of the HSK2 gene is 68% identical to the proximal promoter region and exon 1 of the MSK2 gene but lacks a translatable or spliceable exon, suggesting it is a former exon/promoter, which was once an alternative or sole promoter. It is not unusual to have more than one promoter in many genes, and some promoters are preferentially used in a tissue-specific manner. We still cannot preclude possible SK2 transcript variants differing at the 5' end sequence due to the usage of alternative promoters in either species. It is possible that the mammalian SK2 gene had two promoters, and that the HSK2 gene used the upstream promoter, while the MSK2 used the downstream promoter as the major promoter, creating dissimilar N-terminal SK2 so far identified in human and mouse. Alternatively, the ancestral mammalian gene may have had only one promoter, and an additional or new promoter was acquired by the primate line. A bacterially expressed mouse SK1 construct starting from the second methionine codon, which is equivalent to SK2 without the exon 1-encoded sequence, showed equivalent enzyme activity to the first methionine SK1 construct (30). Thus, it is likely that the sequence encoded by exon 1 is catalytically nonessential, allowing divergence of the region. The dissimilarity of the promoter sequence of the orthologous mouse and human genes also suggests different spatio-temporal expression patterns of the gene in the two species. Although no data are currently available for the SK1 isoform, the failure of SK1 to fully complement the SK2 mutation in the brachymorphic mouse (16) and human spondyloepimetaphysial dysplasia (17) suggests differential roles for the members of this gene family as well. Analysis of expression patterns of the PAPS synthetase genes and functional analysis of its promoters will provide the specific roles of each PAPS synthetase in each species. In sum, we have provided the first report on the genomic structure, intron/exon mapping, alternative spliced forms, correlation between functional protein domains and genomic organization, and initial promoter analysis for the PAPS synthetase gene family.

    ACKNOWLEDGEMENTS

We thank Mary Lou Spach for maintenance of the mouse colony, Glenn Burrell for manuscript preparation, and James Mensch for manuscript review.

    FOOTNOTES

* This work was supported by Grants HD-17332 and AR-19622 from the National Institutes of Health, a grant from the Mitzutani Foundation (to N. B. S.), and a Markey fellowship (to B. S.).The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EMBL Data Bank with accession number(s) AF173365 (human SK2, alternatively spliced isoform); AF173361, AF173362, AF173363, and AF173364 (human SK2 partial genomic sequences); and AF172857, AF172858, AF172859, AF172860, AF172861, AF172862, AF172863, AF172864, AF172865, and AF172866 (mouse SK2 genomic sequences).

parallel To whom correspondence should be addressed: Dept. of Pediatrics, University of Chicago, MC 5058, 5825 S. Maryland Ave., Chicago, IL 60637. Tel.: 773-702-6426; Fax: 773-702-9234; E-mail: n-schwartz@uchicago.edu.

2 Deyrup, A. T., Singh, B., Krishnan, S., Lyle, S., and Schwartz, N. B. (1999) J. Biol. Chem. 274, 28929-28936.

    ABBREVIATIONS

The abbreviations used are: PAPS, phosphoadenosine phosphosulfate; APS, adenosine 5'-phosphosulfate; nt, nucleotide(s); kb, kilobase pair(s); bp, base pair(s); PCR, polymerase chain reaction.

    REFERENCES
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES

1. Schwartz, N. B., Lyle, S., Ozeran, J. D., Li, H., Deyrup, A., and Westley, J. (1998) Chem. Biol. Int. 109, 143-151[CrossRef][Medline] [Order article via Infotrieve]
2. Leyh, T. S., Taylor, J. C., and Markham, G. D. (1988) J. Biol. Chem. 263, 2409-2416[Abstract/Free Full Text]
3. Leyh, T. S., Vogt, T. F., and Suo, Y. (1992) J. Biol. Chem. 267, 10405-10410[Abstract/Free Full Text]
4. Schwedock, J. S., Liu, C., Leyh, T. S., and Long, S. R. (1994) J. Bacteriol. 176, 7055-7064[Abstract/Free Full Text]
5. Foster, B. A., Thomas, S. M., Mahr, J. A., Renosto, F., Patel, H. C., and Segel, I. H. (1994) J. Biol. Chem. 269, 19777-19786[Abstract/Free Full Text]
6. Borges-Walmsley, M. I., Turner, G., Bailey, A. M., Brown, J., Lehmbeck, J., and Clausen, I. G. (1995) Mol. Gen. Genet. 247, 423-429[CrossRef][Medline] [Order article via Infotrieve]
7. Cherest, H., Kerjan, P., and Surdin-Kerjan, Y. (1987) Mol. Gen. Genet. 210, 307-313[CrossRef][Medline] [Order article via Infotrieve]
8. Korch, C., Mountain, H. A., and Bystrom, A. S. (1991) Mol. Gen. Genet. 229, 96-108[Medline] [Order article via Infotrieve]
9. Leustek, T., Murillo, M., and Cervantes, M. (1994) Plant Physiol. 105, 897-902[Abstract]
10. Jain, A., and Leustek, T. (1994) Plant Physiol. 105, 771-772[CrossRef][Medline] [Order article via Infotrieve]
11. Arz, H. E., Gisselmann, G., Schiffmann, S., and Schwenn, J. D. (1994) Biochim. Biophys. Acta 1218, 447-452[Medline] [Order article via Infotrieve]
12. Rosenthal, E., and Leustek, T. (1995) Gene (Amst.) 165, 243-248[CrossRef][Medline] [Order article via Infotrieve]
13. Jullien, D., Crozatier, M., and Kas, E. (1997) Mech. Dev. 68, 179-186[CrossRef][Medline] [Order article via Infotrieve]
14. Li, H., Deyrup, A., Mensch, J. R., Jr., Domowicz, M., Konstantinidis, A. K., and Schwartz, N. B. (1995) J. Biol. Chem. 270, 29453-29459[Abstract/Free Full Text]
15. Venkatachalam, K. V., Akita, H., and Strott, C. A. (1998) J. Biol. Chem. 273, 19311-19320[Abstract/Free Full Text]
16. Kurima, K., Warman, M. L., Krishnan, S., Domowicz, M., Krueger, R. C. J., Deyrup, A., and Schwartz, N. B. (1998) Proc. Natl. Acad. Sci. U. S. A. 95, 8681-8685[Abstract/Free Full Text]
17. ul Haque, M. F., King, L. M., Krakow, D., Cantor, R. M., Rusiniak, M. E., Swank, R. T., Superti-Furga, A., Haque, S., Abbas, H., Ahmad, W., Ahmad, M., and Cohn, D. H. (1998) Nat. Genet. 20, 157-162[CrossRef][Medline] [Order article via Infotrieve]
18. Deyrup, A. T., Krishnan, S., Singh, B., and Schwartz, N. B. (1999) J. Biol. Chem. 274, 10751-10757[Abstract/Free Full Text]
19. Lane, P. W., and Dickie, M. M. (1968) J. Hered. 65, 297-300
20. Schwartz, N. B., Ostrowski, V., Brown, K. S., and Pratt, R. (1978) Biochem. Biophys. Res. Commun. 82, 173-178[CrossRef][Medline] [Order article via Infotrieve]
21. Sugahara, K., and Schwartz, N. B. (1979) Proc. Natl. Acad. Sci. U. S. A. 76, 6615-6618[Abstract/Free Full Text]
22. Sugahara, K., and Schwartz, N. B. (1982) Arch. Biochem. Biophys. 214, 589-601[CrossRef][Medline] [Order article via Infotrieve]
23. Sugahara, K., and Schwartz, N. B. (1982) Arch. Biochem. Biophys. 214, 601-609
24. Ahmad, M., Haque, M. F., Ahmad, W., Abbas, H., Haque, S., Krakow, D., Rimoin, D. L., Lachman, R. S., and Cohn, D. H. (1998) Am. J. Med. Genet. 78, 468-473[CrossRef][Medline] [Order article via Infotrieve]
25. Zeiner, M., and Gehring, U. (1994) BioTechniques 17, 1051-1053
26. Lyle, S., Geller, D. H., Ng, K., Stanzak, J., Westley, J., and Schwartz, N. B. (1994) Biochem. J. 301, 355-359
27. Lyle, S., Stanzak, J., Westley, J., and Schwartz, N. B. (1995) Biochemistry 34, 940-945[CrossRef][Medline] [Order article via Infotrieve]
28. Lyle, S., Stanzack, J., Ng, K., and Schwartz, N. B. (1994) Biochemistry 33, 5920-5925[CrossRef][Medline] [Order article via Infotrieve]
29. Smith, C., and Rayment, I. (1996) Biophys. J. 70, 1590-1602[Abstract/Free Full Text]
30. Deyrup, A. T., Krishnan, S., Cockburn, B. N., and Schwartz, N. B. (1998) J. Biol. Chem. 273, 9450-9456[Abstract/Free Full Text]
31. MacRae, I. J., Rose, A. B., and Segel, I. H. (1998) J. Biol. Chem. 273, 28583-28589[Abstract/Free Full Text]
32. Venkatachalam, K. V., Fuda, H., Koonin, E. V., and Strott, C. A. (1999) J. Biol. Chem. 274, 2601-2604[Abstract/Free Full Text]
33. Bork, P., and Koonin, E. V. (1994) Proteins 20, 347-355[CrossRef][Medline] [Order article via Infotrieve]
34. Chen, Z., Dixon, J. E., and Zalkin, H. (1990) Proc. Natl. Acad. Sci. U. S. A. 87, 3097-3101[Abstract/Free Full Text]
35. Duncan, K., Edwards, R. M., and Coggins, J. R. (1987) Biochem. J. 248, 375-386
36. Witkowski, A., Rangan, V. S., Randhawa, Z. I., Amy, C. M., and Smith, S. (1991) Eur. J. Biochem. 198, 571-579[Medline] [Order article via Infotrieve]
37. Sakata, J., Abe, Y., and Uyeda, K. (1991) J. Biol. Chem. 266, 15764-16770[Abstract/Free Full Text]
38. Faure, M., Camonis, J. H., and Jacquet, M. (1989) Eur. J. Biochem. 179, 345-358[Medline] [Order article via Infotrieve]
39. Lin, T., and Suttle, D. P. (1993) Somat. Cell Mol. Genet. 19, 193-202[CrossRef][Medline] [Order article via Infotrieve]
40. Beaucamp, N., Hofmann, A., Kellerer, B., and Jaenicke, R. (1997) Protein Sci. 6, 2159-2165[Abstract]
41. Crawford, I. P., Clarke, M., van Cleemput, M., and Yanofsky, C. (1987) J. Biol. Chem. 262, 239-244[Abstract/Free Full Text]
42. Lin, T., and Suttle, D. P. (1995) Somat. Cell Mol. Genet. 21, 161-175[CrossRef][Medline] [Order article via Infotrieve]
43. Suchi, M., Mizuno, H., Kawai, Y., Tsuboi, T., Sumi, S., Okajima, K., Hodgson, M. E., Ogawa, H., and Wada, Y. (1997) Am. J. Hum. Genet. 60, 525-539[Medline] [Order article via Infotrieve]
44. Amy, C. M., Williams-Ahlf, B., Naggert, J., and Smith, S. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 1105-1108[Abstract/Free Full Text]


Copyright © 1999 by The American Society for Biochemistry and Molecular Biology, Inc.
Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
EndocrinologyHome page
M. Yamashita, E. Glasgow, B.-J. Zhang, K. Kusano, and H. Gainer
Identification of Cell-Specific Messenger Ribonucleic Acids in Oxytocinergic and Vasopressinergic Magnocellular Neurons in Rat Supraoptic Nucleus by Single-Cell Differential Hybridization
Endocrinology, November 1, 2002; 143(11): 4464 - 4476.
[Abstract] [Full Text] [PDF]


Home page
Endocr. Rev.Home page
C. A. Strott
Sulfonation and Molecular Action
Endocr. Rev., October 1, 2002; 23(5): 703 - 732.
[Abstract] [Full Text] [PDF]


Home page
GlycobiologyHome page
N. B. Schwartz and M. Domowicz
Chondrodysplasias due to proteoglycan defects
Glycobiology, April 1, 2002; 12(4): 57R - 68R.
[Abstract] [Full Text] [PDF]


Home page
Drug Metab. Dispos.Home page
Z.-H. Xu, T. C. Wood, A. A. Adjei, and R. M. Weinshilboum
Human 3'-Phosphoadenosine 5'-Phosphosulfate Synthetase: Radiochemical Enzymatic Assay, Biochemical Properties, and Hepatic Variation
Drug Metab. Dispos., February 1, 2001; 29(2): 172 - 178.
[Abstract] [Full Text]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kurima, K.
Right arrow Articles by Schwartz, N. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kurima, K.
Right arrow Articles by Schwartz, N. B.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea