|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 278, Issue 44, 43178-43187, October 31, 2003
Human Mitochondrial C1-Tetrahydrofolate SynthaseGENE STRUCTURE, TISSUE DISTRIBUTION OF THE MRNA, AND IMMUNOLOCALIZATION IN CHINESE HAMSTER OVARY CELLS*![]() ![]() ![]() ![]() ||
From the
Received for publication, April 24, 2003 , and in revised form, August 20, 2003.
C1-tetrahydrofolate (THF) synthase is a trifunctional enzyme found in eukaryotes that contains the activities 10-formyl-THF synthetase, 5,10-methenyl-THF cyclohydrolase, and 5,10-methylene-THF dehydrogenase. The cytoplasmic isozyme of C1-THF synthase is well characterized in a number of mammals, including humans; but a mitochondrial isozyme has been previously identified only in the yeast Saccharomyces. Here, we report the identification and characterization of the human gene encoding a functional mitochondrial C1-THF synthase. The gene spans 236 kilobase pairs on chromosome 6 and consists of 28 exons plus one alternative exon. The gene encodes a protein of 978 amino acids, including an N-terminal mitochondrial targeting sequence. The mitochondrial isozyme is 61% identical to the human cytoplasmic isozyme. Expression of the gene was detected in most human tissues, but transcripts were highest in placenta, thymus, and brain. Two mRNAs were detected, a 3.6-kb transcript and a 1.1-kb transcript, and both transcripts were observed in varying ratios in each tissue. The shorter transcript results from an alternative splicing event, where exon 7 is spliced to exon 8a instead of exon 8. Exon 8a is derived from an exonized Alu sequence, sharing no homology with exon 8 of the long transcript, and encodes just 15 amino acids followed by a stop codon and a polyadenylation signal. This short transcript potentially encodes a bifunctional enzyme lacking 10-formyl-THF synthetase activity. Both transcripts initiate at the same 5'-site, 107 nucleotides up-stream of the ATG start codon. The full-length (2934 bp) cDNA fused to a C-terminal V5 epitope tag was expressed in Chinese hamster ovary cells. Immunoblots of subfractionated cells revealed a 107-kDa protein only in the mitochondrial fractions of these cells, confirming the mitochondrial localization of the protein. Yeast cells expressing the full-length human cDNA exhibited elevated 10-formyl-THF synthetase activity, confirming its identification as the human mitochondrial C1-THF synthase.
C1-tetrahydrofolate (THF)1 synthase is a trifunctional enzyme found in eukaryotes that contains the activities 10-formyl-THF synthetase (EC 6.3.4.3 [EC] ), 5,10-methenyl-THF cyclohydrolase (EC 3.5.4.9 [EC] ), and 5,10-methylene-THF dehydrogenase (EC 1.5.1.5 [EC] ) (Fig. 1, reactions 13). These activities, along with serine hydroxymethyltransferase (Fig. 1, reaction 4), are central to the interconversion of the one-carbon units carried by the biologically active form of folic acid, THF. The activated one-carbon units are used in a variety of cellular processes, including de novo purine and thymidylate synthesis, serine and glycine interconversion, methionine biosynthesis, and protein synthesis in mitochondria and chloroplasts.
In eukaryotic cells, the mitochondrial and cytosolic compartments each contain a parallel set of one-carbon unit-interconverting enzymes (1). For example, in the yeast Saccharomyces cerevisiae, mitochondrial and cytoplasmic isozymes of C1-THF synthase (encoded by the nuclear genes MIS1 and ADE3, respectively) have been purified and characterized (2, 3). Both isozymes exist as homodimers of 100-kDa subunits. Each subunit consists of a C-terminal 10-formyl-THF synthetase domain of All three activities of C1-THF synthase are found in mammalian mitochondria as well (10, 11). Our studies with intact rat liver mitochondria and mitochondrial extracts demonstrated the ability of these organelles to oxidize carbon 3 of serine to formate by a folate-dependent pathway (Fig. 1, reactions 14) (11). However, the existence, structure, and function of the folate-interconverting activities of C1-THF synthase in mammalian mitochondria have been controversial. MacKenzie and co-workers (12, 13) characterized a bifunctional NAD-dependent 5,10-methylene-THF dehydrogenase/5,10-methenyl-THF cyclohydrolase, originally isolated from ascites tumor cells. This bifunctional enzyme lacks the large C-terminal domain catalyzing the 10-formyl-THF synthetase activity and thus is unable to produce formate. This enzyme was shown to be a nuclear encoded mitochondrial protein (14, 15), detectable only in transformed mammalian cells and embryonic or non-differentiated tissues (12). Among adult differentiated tissues, NAD-dependent 5,10-methylene-THF dehydrogenase activity is detectable only in rat adrenal tissue (16), although the mRNA encoding this enzyme is present at low levels in all tissues examined (17). MacKenzie and co-workers (18, 19) have argued that mammalian mitochondria lack a C1-THF synthase and that the bifunctional NAD-dependent dehydrogenase/cyclohydrolase is the mammalian homolog of the trifunctional mitochondrial enzyme. Here, we report the identification and characterization of the human gene encoding a functional mitochondrial C1-THF synthase. We show that it is expressed widely in adult human tissues and that the full-length cDNA encodes a protein that localizes to mitochondria when expressed in Chinese hamster ovary (CHO) cells. These data confirm the existence of C1-THF synthase in mammalian mitochondria, completing the folate-interconverting pathway shown in Fig. 1.
MaterialsAll chemicals were of the highest available commercial quality. Difco media components were obtained from VWR (West Chester, PA). Restriction enzymes, shrimp alkaline phosphatase, calf intestinal alkaline phosphatase, and T4 DNA ligase were purchased from Invitrogen. Primers for PCR and sequencing were made by IDT (Coralville, IA). [ -32P]dATP (3000 Ci/mmol) was purchased from PerkinElmer Life Sciences.
Construction of Full-length cDNAA partial cDNA clone (DKFZp586G1517) constructed by the German Genome Project (RZPD German Research Center for Genome Research) (20) was identified in the GenBankTM/EBI Data Bank (accession number AL117452
[GenBank]
) by a BLAST search using the cDNA sequence of the human cytoplasmic C1-THF synthase (21). This cDNA contains 390 nucleotides (nt) of 3'-noncoding sequence and a poly(A) tail, but lacks a start codon, indicating that it is truncated at the 5'-end. The truncated cDNA clone was obtained from RZPD, and its sequence was confirmed by the DNA Analysis Facility of the University of Texas (Austin, TX). The Human Genome Database contains the entire gene corresponding to this cDNA and predicts an additional 5'-exon that encodes 60 additional N-terminal amino acids. The missing 5'-exon (exon 1) (see Fig. 4) was PCR-amplified from a genomic P1 artificial chromosome clone (dJ44A20) obtained from the Sanger Centre (Cambridge, UK). The PCR-amplified product was gel-purified using a QIAGEN gel extraction kit and subcloned into the pGEM-T Easy vector (Promega, Madison WI), and its sequence was verified. It was necessary to use MasterAmp Tfl DNA polymerase (Epicentre Technologies Corp., Madison, WI) in the PCR due to the high GC content of exon 1 (see "Results"). The partial cDNA clone and the exon 1 clone were then used as templates in a splice overlap extension (SOE)-PCR (22) to produce the full-length cDNA. The exon 1 fragment (230 bp) was amplified using Tfl polymerase and primers TOPO5' (5'-CACCATGGGCACGCGTCTGCCGCTC-3', with the ATG start codon underlined) and humitoSOE3' (5'-CTTCTCTGACGATGGAGTCCCG-3'). The 2719-bp cDNA fragment was PCR-amplified using Pfu polymerase and primers GS5'SOE (5'-GGGACTCCATCGTCAGAGAAG-3') and TOPO3' (5'-GAACAAGCCTTTAACTTGTTCTGTTTC-3'). Primer TOPO3' is complementary to the last nine codons of the open reading frame before the stop codon. Both products were gel-purified using the QIAGEN gel extraction kit. The 230- and 2719-bp PCR products served as templates in the SOE-PCR using primers TOPO5' and TOPO3' and Tfl polymerase. The full-length cDNA product (2934 bp) was gel-purified and cloned into the mammalian expression vector pcDNA3.1D/V5-His-TOPO (Invitrogen) using directional TOPO cloning according to the manufacturer's instructions. The TOPO cloning reaction was transformed into One-Shot chemically competent Escherichia coli (Invitrogen) by chemical transformation, and positive colonies were selected on YT (0.5% yeast extract, 0.8% Tryptone, and 0.5% NaCl) plates containing 50 µg/ml ampicillin. The colonies were screened by PCR with a vector primer and a gene-specific primer, and positive plasmids were prepared using a QIAGEN miniplasmid preparation kit. Sequence analysis revealed a base substitution in the full-length clone compared with the original cDNA and genomic sequences, presumably incorporated during the PCRs. (Tfl polymerase, which was chosen due to the high GC content of exon 1, lacks a 3'
CHO Cell TransfectionCHO cells (1.5 x 105) were plated on 35-mm diameter dishes and cultured in -minimal Eagle's medium supplemented with 10% (v/v) fetal bovine serum. Duplicate plates were then transfected with 2 µg of pcDNA3.1-humito/plate using the LipofectAMINE 2000 reagent method (Invitrogen). After transfection, cells were cultured for an additional 48 h in regular medium before a G418-containing selective medium (0.8 mg/ml) was applied. The selective medium was applied for 1 week until antibiotic-resistant colonies developed. Resistant colonies were picked, replated, cultured, and collected. Preparation of Cell Homogenates and Subcellular FractionsTransfected cells were cultured in two 150-cm2 T-flasks to yield 12 x 108 cells. The monolayer was rinsed with phosphate-buffered saline (4 x 5 ml) at 4 °C and then incubated with phosphate-buffered saline containing 10 mM EDTA (10 ml) at room temperature until the cells detached (510 min). The flasks were tapped gently to dislodge the cells, and the cells were transferred to a 50-ml plastic conical tube. Cells were pelleted by centrifugation at 300 x g for 5 min at room temperature, and the cell pellet was washed with 15 ml of homogenization solution (HMS; 250 mM sucrose and 1 mM EDTA (pH 6.9)) at 4 °C. The cell pellet was resuspended in HMS (2 ml) at 4 °C, transferred to a Kontes nitrogen cavitation device, and exposed to a pressure of 36 p.s.i. for 30 min at 4 °C. The suspension of disrupted cells was collected into a 3-ml conical ground-glass Duall tissue grinder and further disrupted with four strokes of the homogenizer (23). Nuclei and unbroken cells were sedimented by centrifugation at 900 x g for 6 min. The supernatant was removed carefully, transferred to another centrifuge tube, and stored on ice. The pellet was resuspended in HMS (1 ml) and further dispersed by four strokes in the grinder. After centrifugation at 900 x g for 6 min, the supernatant was combined with the first supernatant and stored on ice. The pellet was washed with HMS (3 x 1 ml), and the final viscous pellet (nuclear fraction) was resuspended in HMS (1 ml). The combined supernatants were centrifuged at 900 x g for 5 min, and any pellet was discarded. The volume of the supernatant (total post-nuclear supernatant fraction) was increased to 5 ml by the addition of HMS. The post-nuclear supernatant was centrifuged at 10,000 x g for 15 min, and the pellet was stored on ice. The supernatant was recentrifuged at 10,000 x g for 15 min to give a final supernatant (cytosolic fraction). The second pellet was combined with the first, washed with HMS (2 ml), and resuspended in HMS (1 ml) to give the mitochondrial fraction. Glutamate dehydrogenase activity (24) was used as a mitochondrial marker, and lactate dehydrogenase activity (25) was used as a cytoplasmic marker. ImmunoblottingThe protein concentration of the cytosolic and mitochondrial fractions was determined using the Bradford assay (26) with bovine serum albumin as a standard. Eighty µg of cytosolic and mitochondrial protein from transfected and untransfected CHO cells were fractionated on a 7.5% SDS-polyacrylamide gel for 50 min at 180 V. One-half of the gel was stained, and the proteins on the other half were transferred onto a nitrocellulose membrane (Midwest Scientific, Valley Park, MO) by electroblotting for 90 min at 250 mA. The membrane was then washed with distilled water (3 x 5 min each) and blocked in 2% dry milk in Tris-buffered saline (TBS; 10 mM Tris base and 0.15 M NaCl (pH 8.0)) for 1 h at room temperature. The blocked membrane was incubated with mouse anti-V5 primary antibody (1:1000 dilution; Invitrogen) diluted in TBS and 1% dry milk for 1 h at room temperature. The membrane was then washed with TBS containing 0.0025% Tween 20 (TBST; 3 x 5 min each) and incubated with goat anti-mouse secondary antibody (1:2000 dilution; Zymed Laboratories Inc., San Francisco, CA) for 1 h at room temperature. The membrane was finally washed with TBST and TBS (2 x 5 min each) and rinsed with water before visualizing the bands. Reacting bands were visualized by enhanced chemiluminescence detection (ECL, Amersham Bioscience). Expression in Yeast and Enzyme AssaysThe full-length human cDNA was subcloned from pcDNA3.1-humito into the BamHI and XhoI sites of the yeast expression vector pVT103U (27). In the resulting construct, pVT-humito, the entire human mitochondrial C1-THF synthase open reading frame, including the mitochondrial presequence, is expressed from the ADH promoter of the vector. Yeast strain DAY3 (ser1 ura3-52 trp1 leu2 ade3-130) (28) was transformed with pVT-humito or empty pVT103U vector using a lithium acetate method (29) modified as described.2 Cells were grown in synthetic minimal medium, and extracts were prepared and assayed for NAD+- and NADP+-dependent methylene-THF dehydrogenase activity as described (30). 10-Formyl-THF synthetase activity was determined according to Kirksey and Appling (31).
Northern AnalysisA FirstChoice Northern human blot I kit was obtained from Ambion Inc. (Austin, TX), with poly(A)+ mRNA from the following adult human tissues: brain, placenta, skeletal muscle, heart, kidney, pancreas, liver, lung, spleen, and thymus. Probes were synthesized by asymmetric PCR using reagents supplied in the kit and [ A probe was also synthesized for detection of the human cytoplasmic C1-THF synthase. The plasmid pUC13/HS230 (obtained from Dr. R. E. MacKenzie, McGill University), which contains a 230-bp fragment near the 3'-end of the human cytoplasmic C1-THF synthase cDNA (21), was linearized by digestion with SacI. A linear PCR amplification method (following the kit manufacturer's instructions) was used to synthesize the probe. The antisense primer used was 5'-GTAAAACGACGGCCAGT-3', which is complementary to the vector sequences flanking the insert. The membrane was subjected to a 1-h prehybridization at 42 °C with Ultrahyb ultrasensitive hybridization buffer (Ambion Inc.). The probe was added at 106 cpm/ml of hybridization buffer and allowed to hybridize at 42 °C overnight in a roller bottle. The membrane was then washed twice with NorthernMax low stringency wash solution (equivalent to 2x SSC; Ambion Inc.) for 10 min at 42 °C and twice with NorthernMax high stringency wash solution (equivalent to 0.1 x SSC) for 30 min at 42 °C. The membrane was exposed to a storage phosphor screen (Amersham Biosciences) for 48 h and imaged using an Amersham Biosciences 445 SI PhosphorImager. The same blot was stripped and reconstituted for hybridization with each probe according to the kit manufacturer's instructions. Transcript MappingThe 5'- and 3'-ends of the transcripts were mapped by RNA ligase-mediated rapid amplification of cDNA ends using the FirstChoice RLM-RACE kit from Ambion Inc. Human placental total RNA (Ambion Inc.) was used to map the 5'-end of the transcript. Nested antisense primers specific to the cDNA were designed for use with the two nested 5'-RACE primers provided in the kit (see Fig. 6). The cDNA-specific inner primer (GSI2, 5'-CGCCTCGAGACGGCTGGTTCTCAGGGGACAC-3', with the XhoI site underlined) was complementary to nt 9 to 30 in the 5'-untranslated region. The cDNA-specific outer primer (GSO2, 5'-AGCGCGACAGGGCACACGGAG-3') was complementary to nt +93 to +73. The 5'-RACE inner primer and the cDNA-specific inner primer had BamHI and XhoI sites, respectively, at their 5'-ends to facilitate cloning.
For mapping the 3'-end of the 1.1-kb transcript, first-strand cDNA was synthesized from human placental total RNA using the supplied 3'-RACE adapter. Nested sense primers specific to the cDNA were designed for use with the two nested 3'-RACE primers provided in the kit. The cDNA-specific inner primer (3'-RACE GSI, 5'-CGCCTCGAGGAACTTGTTTAGCAACAAAGTCCT-3', with the XhoI site underlined) was equivalent to nt +485 to +508. The cDNA-specific outer primer (3'-RACE GSO, 5'-CGCCTCGAGCTCCCTCCAGATAGCAGTGAA-3') was equivalent to nt +390 to +410. The 3'-RACE inner primer and the cDNA-specific inner primer had BamHI and XhoI sites, respectively, at their 5'-ends to facilitate cloning. PCR fragments generated in the "inner" PCRs of both 5'- and 3'-RACE were gel-purified, digested with BamHI and XhoI, and ligated separately into BamHI/XhoI-digested pBluescript II KS(+) vector (Stratagene, La Jolla, CA). The ligation reactions were transformed into chemically competent XL1-Blue cells (Stratagene), and positive colonies were selected on YT/ampicillin plates. Colonies were screened by PCR using T7 reverse (5'-GTAATACGACTCACTATAGGGC-3') and T3 forward (5'-AATTAACCCTCACTAAAGGG-3') vector primers, and plasmids were prepared for sequence analysis. This 1.1-kb cDNA has been submitted to the GenBankTM/EBI Data Bank under accession number AY374131 [GenBank] .
cDNA Identification and CloningA cDNA encoding an open reading frame with high similarity to human cytoplasmic C1-THF synthase was cloned from human uterine RNA by the German Genome Project (RZPD; GenBankTM/EBI accession number AL117452 [GenBank] ). The homology extends the length of the proteins, suggesting that the cDNA encodes another trifunctional C1-THF synthase (Fig. 2). This cDNA encodes 917 amino acids plus 390 nt of 3'-noncoding sequence and a poly(A) tail, but lacks a start codon, suggesting that it is truncated at the 5'-end. Blasting this sequence against the Human Genome Database (NCBI Protein Database) revealed the corresponding gene on chromosome 6 at 6q25.2. This gene spans 236 kilobase pairs and encodes the entire cDNA sequence in 27 exons plus an additional 5'-exon that encodes 60 additional N-terminal amino acids. The predicted initiator codon sits within a near-perfect expanded Kozak consensus sequence (32). The first half of this N-terminal extension has the characteristics of a mitochondrial leader sequence, including the potential to form a positively charged amphipathic -helix. Truncation of the original cDNA clone was due to the presence of a NotI site near the 3'-end of the first exon; NotI was used in the cDNA cloning procedure (20). Subsequently, the RIKEN Mouse Gene Encyclopedia Project (33) identified a full-length mouse cDNA (ID22289) that predicts a protein with 88% identity to the human protein, including the N-terminal extension (Fig. 2). The mouse cDNA lacks the NotI site that caused truncation of the human cDNA. These data suggest that the gene on human chromosome 6 encodes a mitochondrial C1-THF synthase.
Attempts to construct a full-length cDNA by RACE using human uterine RNA were unsuccessful, probably due to the extremely high GC content (>80%) of the first exon. Instead, a genomic P1 artificial chromosome clone (dJ44A20, Sanger Centre) was used to PCR-amplify the 5'-exon. This was then spliced to the remaining cDNA by SOE-PCR to construct a full-length cDNA encoding the human protein (GenBankTM/EBI accession number AY374130 [GenBank] ).
CHO Cell Expression and Subcellular LocalizationTo determine whether the protein encoded by this cDNA is, in fact, mitochondrial, we expressed the cDNA in CHO cells. The full-length cDNA was cloned into the mammalian expression vector pcDNA3.1D/V5-His-TOPO. This construct fused the 14-amino acid V5 epitope and a His6 tag to the C terminus of the 2934-bp coding region. Expression of the insert in mammalian cells is driven by the cytomegalovirus promoter. The resulting plasmid, pcDNA3.1-humito, was transfected into CHO cells, and G418-resistant colonies were selected and grown. The cytosolic and mitochondrial fractions from transfected and untransfected (control) CHO cells were isolated as described under "Experimental Procedures." Each fraction was assayed for the mitochondrial marker enzyme glutamate dehydrogenase and the cytoplasmic marker enzyme lactate dehydrogenase. Glutamate dehydrogenase activity ranged from 68 to 95 µmol/min/mg of protein in the mitochondrial fractions, compared with 2.44 µmol/min/mg of protein in the cytoplasmic fractions. The lactate dehydrogenase activity of the mitochondrial fraction was only one-seventh that of the cytoplasmic fraction. These subcellular fractions were then subjected to SDS-PAGE and immunoblotting using antibodies against the V5 epitope (Fig. 3). A clear signal at
Expression in YeastThe full-length human mitochondrial C1-THF synthase cDNA, including the 62-codon N-terminal extension, was subcloned into a yeast expression vector (pVT103U) and transformed into an ade3 deletion strain (DAY3). Disruption of the ADE3 gene, which encodes the cytoplasmic C1-THF synthase, results in yeast cells with very low 10-formyl-THF synthetase and 5,10-methylene-THF dehydrogenase activities; the residual activity is due to the mitochondrial isozyme (34). DAY3 cells transformed with pVT-humito overexpressed 10-formyl-THF synthetase activity
Gene StructureThe human gene encoding C1-THF synthase spans 236 kilobase pairs on chromosome 6 (Fig. 4). The coding sequence consists of 28 exons and is interrupted by 27 introns ranging from 89 to 55,350 bp in length. The start codon is present in the first exon, and the 5'-end of exon 1 extends 107 bp upstream of the ATG start codon (see "Transcript Mapping" below). The stop codon is present in exon 27, and exon 28 encodes 360 nt of 3'-untranslated region, including a polyadenylation signal (AATAAA). Exon 1 is very GC-rich (>80% GC), containing a CpG island and a NotI restriction enzyme site (GCGGCCGC). The existence of this NotI site prevented the cloning of a full-length cDNA because NotI linkers were used in the cloning procedure (20). All of the intron/exon splice sites follow the GT/AG rule (35), except after the terminal exons, 8a and 28 (Table I). A scan of the 5'-flanking sequences by the TESS web server3 using the TRANSFAC Version 4.0 Database predicts numerous potential transcription factor-binding sites, including Sp1, retinoic acid receptor-
Northern AnalysisA Northern blot membrane prebound with human poly(A) RNA from several tissues was obtained from Ambion Inc. A 304-bp 5'-end probe spanning nt 215518 of the human mitochondrial C1-THF synthase cDNA revealed two bands: one at
To determine the relationship of the 3.6- and 1.1-kb transcripts, a 465-bp probe was synthesized that ended just before the stop codon. This 3'-probe detected only the 3.6-kb transcript (Fig. 5B), suggesting that the 1.1-kb transcript represents just the 5'-end of the cDNA. We also compared the tissue distribution of the mitochondrial C1-THF synthase transcript with that of the cytoplasmic isozyme. Using a 230-bp probe from the 3'-end of the cytoplasmic C1-THF synthase cDNA (21), a 3.3-kb transcript was observed (Fig. 5C). The tissue distribution of this transcript differed from that of the mitochondrial isozyme, being highest in liver, kidney, and skeletal muscle. Thus, the human mitochondrial and cytoplasmic C1-THF synthase isozymes are encoded by distinct transcripts that do not cross-hybridize under these probe and wash conditions. Transcript MappingA 5'-RACE experiment was done to determine the transcriptional start site(s). 5'-RACE was performed as described under "Experimental Procedures" using 10 µg of human placental total RNA for first-strand cDNA synthesis by reverse transcription. This was followed by a first round of PCR (outer PCR), which gave no detectable specific product. Two µl of the outer PCR product were used in a second round of PCR with nested primers (inner PCR), yielding a specific product of <300 bp. The final PCR product was gelpurified and subcloned. Nine colonies were screened by PCR, and all of them gave a product of between 220 and 298 bp. Three of the nine clones were sequenced, and all of them exhibited the same 5'-end 107 bp upstream of the ATG start codon (Fig. 6). These results suggest that the majority of the transcripts from this gene initiate at or near position 107, and it appears that both the 3.6- and 1.1-kb transcripts initiate from this site. Alternative SplicingA 3'-RACE experiment was performed to determine the 3'-end of the short 1.1-kb transcript observed on Northern blots (Fig. 5A). One µg of human placental total RNA was used for first-strand cDNA synthesis. This was followed by a first round of PCR (outer PCR), which gave no detectable specific RACE product. One µl of the outer PCR product was used in a second round of PCR with nested primers (inner PCR). Four distinct PCR products of 500, 350, 200, and 100 bp were detectable on a 2% agarose gel. (A smear at the top of the gel was also observed, produced from the full-length transcript.) Based on the 1.1-kb length of the short transcript and the position of the inner primer, the 500- and 350-bp RACE products were gel-purified and cloned separately. Six of the clones were sequenced to determine the 3'-extent of the clones. All of these clones represented the short transcript, in which exon 7 is spliced to a previously unrecognized exon, termed exon 8a, which sits in the intron between exons 7 and 8 (Fig. 7A). Exon 8a appears to be 139 bp long, although in one clone, the 3'-end extended 162 bp. It contains a stop codon after 45 nucleotides and a polyadenylation signal near its 3'-end. Thus, the 3.6- and 1.1-kb transcripts share the first seven exons and then diverge at exon 8/8a. The 1.1-kb transcript would be translated into a 275-amino acid protein in which the first 260 amino acids are identical to the full-length protein, followed by 15 unrelated amino acids (GenBankTM/EBI accession number AY374131 [GenBank] ) (Fig. 7, B and C).
An additional variation was observed upon sequencing the 3'-RACE clones. Several of the clones contained an extra codon at position +643, at the junction between exons 6 and 7 (Fig. 8). This extra valine codon appears to arise from variation in the 3'-splice acceptor site during the splicing of exon 6 to exon 7. The 5'-splice site has the GT consensus sequence as the first 2 nt of the intron. The 3'-splice site has two AG consensus dinucleotides at the 3'-end of the intron. If the first AG dinucleotide is used, exon 7 contains 3 additional nt; if the second is used, these 3 nt are not present in exon 7.
The experiments described here confirm that humans express a mitochondrial C1-THF synthase, with properties very similar to those of the cytoplasmic homologs previously characterized. The full-length human cDNA encodes a protein of 978 amino acids, including an N-terminal mitochondrial targeting sequence. When the full-length cDNA was expressed in CHO cells, the targeting sequence directed the protein exclusively to mitochondria (Fig. 3). Alignment of the deduced amino acid sequence with the human cytoplasmic C1-THF synthase (935 residues) reveals a 62-residue N-terminal extension in the putative mitochondrial protein (Fig. 2). PSORT II analysis4 predicts a mitochondrial targeting sequence with a cleavage site between residues 31 and 32. The next 31 residues, before alignment with the cytoplasmic protein begins, include an unusual run of 9 consecutive glycines and several basic residues. A very similar N-terminal extension is predicted for the mouse protein (Fig. 2). Excluding this N-terminal extension, homology to the human cytoplasmic C1-THF synthase is quite high (61% identity), and the putative mitochondrial protein appears to possess the same domain structure. In the cytoplasmic protein, the N-terminal dehydrogenase/cyclohydrolase domain is 300 residues, and the C-terminal synthetase domain is 700 residues (9). The two human proteins share 31% identity in the dehydrogenase/cyclohydrolase domains and 73% identity in the synthetase domains, including conserved active-site residues and the 10-formyl-THF-binding site in the synthetase domain (31). However, the putative mitochondrial proteins from human and mouse lack 12 amino acids near the junction between the two domains (position 318) (Fig. 2).
Expression of the full-length cDNA in yeast revealed elevated 10-formyl-THF synthetase activity, further supporting its identification as the human mitochondrial C1-THF synthase. We were unable to detect increased 5,10-methylene-THF dehydrogenase activity in these cells using either NADP+ or NAD+ as cofactor. Is the human mitochondrial enzyme multifunctional like its yeast counterpart? Given the low identity between the human cytoplasmic and mitochondrial isozymes in the dehydrogenase/cyclohydrolase domain (31%), it is conceivable that the mitochondrial protein has lost these activities. However, other members of this family have diverged as much or more (e.g. yeast Mtd1p (30)) and still retain 5,10-methylene-THF dehydrogenase activity. Another possibility is that the dehydrogenase activity of the human enzyme is simply below detection in crude yeast extracts. Depending on the species, the dehydrogenase activity of these trifunctional enzymes is only one-half to one-tenth that of the synthetase activity (29). Finally, the construct we expressed in yeast contained the entire 62-amino acid N-terminal extension. The 10-formyl-THF synthetase activity was found in the soluble cytoplasmic fraction, but not in the mitochondrial fraction (data not shown), suggesting that the presequence was not processed. If it is retained, this extension could interfere with the dehydrogenase/cyclohydrolase activities contained in the N-terminal domain of the protein while leaving the C-terminal synthetase domain unaffected. These questions will have to await purification of the recombinant enzyme. Expression of the gene was detected in most human tissues, but transcripts were highest in placenta, thymus, and brain. Expression was low in liver and skeletal muscle and barely detectable in heart. A mouse cDNA has also been identified that predicts a protein with 88% identity to the human protein, including the N-terminal extension, suggesting that this mitochondrial C1-THF synthase will be found in all mammals. The human gene encoding this enzyme has several interesting features. The gene is large, spanning 236 kilobase pairs on chromosome 6 at 6q25.2. The gene contains 29 exons (Table I), including the alternative exon 8a found in the intron between exons 7 and 8 (Fig. 4). This same intron/exon structure is observed for the mouse homolog found on mouse chromosome 10, except that the alternative exon 8a is absent in the mouse gene. Moreover, the genes for the cytoplasmic C1-THF synthase from rat, mouse, and human have all been shown to contain 28 exons, with introns in nearly identical positions (36, 37). This suggests that an ancestral C1-THF synthase gene arose before the divergence of the human and rodent lineages, >75 million years ago (38), and genes encoding the mitochondrial and cytoplasmic isozymes are probably related by a gene duplication event.
The full-length 3.6-kb transcript is encoded in 28 exons. A shorter, 1.1-kb transcript is produced by an alternative splicing event, in which exon 7 is spliced to exon 8a instead of exon 8 (Fig. 7). This transcript encodes a 275-amino acid protein in which the first 260 amino acids are identical to the full-length protein, followed by 15 amino acids not found in any other C1-THF synthase. The first 11 amino acids of these 15 terminal amino acids are also found, with one mismatch, near the C terminus of isoform 2 of the human
Assuming the short transcript is translated in vivo, it is unlikely that the resulting protein would retain 5,10-methylene-THF dehydrogenase or 5,10-methenyl-THF cyclohydrolase activity. Modeling the human mitochondrial protein sequence onto the x-ray structure of the dehydrogenase/cyclohydrolase domain of the human cytoplasmic C1-THF synthase (42) reveals that exons 8 and 9, which are missing in the short transcript, encode the major portion of the Rossman fold of the NADP-binding site and a critical Using RNA from human placenta, a single 5'-transcriptional start site at position 107 was identified by 5'-RACE (Fig. 6). It appears that both the 3.6- and 1.1-kb transcripts initiate from this site because only a single 5'-end was identified. A BLAST search of the Human EST Database with the 5'-end of the human cDNA revealed >100 entries. Four ESTs extended beyond position 107 (Fig. 6). BG481636 [GenBank] (position 276) and BE735249 [GenBank] (position 268) were isolated from choriocarcinoma mRNA; BQ062382 [GenBank] (position 119); and BQ055629 [GenBank] (position 118) were isolated from a lymphoma cell line. Thus, it appears there may be some heterogeneity in the 5'-transcriptional start site, depending on the tissue or cell type.
One additional splicing variation was discovered. Some transcripts contained an extra codon at position +643, at the junction between exons 6 and 7 (Fig. 8). This valine codon appears to arise from alternative usage of AG splice acceptor sites separated by 1 nt. This type of variation in splice site selection has been seen in several other mammalian genes, including human prothymosin-
Based on the x-ray structure of the dehydrogenase/cyclohydrolase domain of the human cytoplasmic C1-THF synthase (42), the extra valine is predicted to reside on the exposed loop between The tissue distribution of the mitochondrial C1-THF synthase is quite different from that of the cytoplasmic isozyme (Fig. 5). Whereas the cytoplasmic transcript is most abundant in liver and kidney, the transcripts for the mitochondrial isozyme are relatively low in those tissues, but highest in placenta, followed by thymus, spleen, brain, and lung. The low expression of the mitochondrial isozyme in liver probably contributed to our earlier difficulties in purifying the protein from liver mitochondria. Although the ratio of the two transcripts varies somewhat from tissue to tissue, both are present in every tissue assayed, even heart (Fig. 5). The short transcript is significantly reduced in brain. Future work will be directed toward understanding the metabolic role of the mitochondrial isozyme and how that role relates to the observed tissue distribution. The discovery of the human gene for this mitochondrial C1-THF synthase confirms our model for the compartmentation of folate-mediated one-carbon metabolism in mammalian cells (Fig. 1). Based on the well documented existence of a mitochondrial C1-THF synthase in yeast (3, 45), we proposed that mammalian mitochondria also contain this trifunctional enzyme (10). All three activities of C1-THF synthase are found in mammalian mitochondria (10). More importantly, intact rat liver mitochondria and mitochondrial extracts were shown to oxidize carbon 3 of serine to formate by the folate-dependent pathway outlined in Fig. 1 (mitochondrial reactions 14) (11). However, all our attempts to purify these activities from rat liver mitochondria were unsuccessful. During this same time period, MacKenzie and co-workers (12, 13) characterized a mammalian bifunctional NAD-dependent 5,10-methylene-THF dehydrogenase/5,10-methenyl-THF cyclohydrolase, originally isolated from ascites tumor cells. This bifunctional enzyme lacks the large C-terminal domain catalyzing the 10-formyl-THF synthetase activity and thus is unable to produce formate. When this enzyme was shown to be localized in mitochondria (14, 15), MacKenzie and co-workers (18, 19) proposed that mammalian mitochondria lack a trifunctional C1-THF synthase and that this bifunctional NAD-dependent dehydrogenase/cyclohydrolase is the mammalian homolog of the trifunctional mitochondrial enzyme. There are, however, several problems with this proposal. First, the bifunctional enzyme is detectable mainly in transformed mammalian cells and embryonic or non-differentiated tissues (12). Among adult differentiated tissues, NAD-dependent 5,10-methylene-THF dehydrogenase activity is detectable only in rat adrenal tissue, but not adult liver (16). Second, the 5,10-methylene-THF dehydrogenase activity we detected in rat liver mitochondria is dependent on NADP+, not NAD+ (11). Finally, adult rat liver mitochondria are capable of producing formate by the folate-dependent pathway (11), and formate production requires the 10-formyl-THF synthetase activity (Fig. 1, reaction 1) that is missing from the bifunctional enzyme. Clearly, only a trifunctional C1-THF synthase, with an NADP-dependent 5,10-methylene-THF dehydrogenase activity, is consistent with the biochemical data. Mitochondrial C1-THF synthase probably supports several metabolic processes in mammalian mitochondria. Folate-mediated one-carbon metabolism is involved in the synthesis of formyl-methionyl-tRNA for mitochondrial protein synthesis (Fig. 1, reaction 8) (46, 47) and the oxidation of choline methyl groups via dimethylglycine dehydrogenase and sarcosine dehydrogenase (48). Mitochondrial C1-THF synthase may also play an important role in homocysteine metabolism. Recent studies of patients with nonketotic hyperglycinemia reveal a connection between the mitochondrially localized glycine cleavage system (GCS) (Fig. 1, reaction 5) and homocysteine metabolism. Nonketotic hyperglycinemia is an autosomal recessive brain disease caused by defects in subunits of the GCS, resulting in elevated glycine levels (49). Loss of GCS activity might be expected to cause, in addition to elevated glycine, a deficiency of mitochondrial one-carbon units. Consistent with this hypothesis, two recent studies report mild elevations of homocysteine in the plasma and cerebrospinal fluid of nonketotic hyperglycinemia patients (50, 51). Furthermore, Randak et al. (51) found that the mildly elevated plasma homocysteine levels could be reduced in their three patients by treatment with the one-carbon donor 5-formyl-THF (folinic acid, leucovorin). This observation provides strong evidence that the homocysteine elevations are due to a defect in homocysteine remethylation resulting from a deficiency of one-carbon units. Examination of Fig. 1 suggests two ways in which a loss of mitochondrial GCS activity could cause a deficiency of cytoplasmic one-carbon units. First, as suggested by Van Hove et al. (50), cells lacking a functional GCS might increase the transport of serine into mitochondria for metabolism by serine hydroxymethyltransferase to compensate for the deficiency of mitochondrial 5,10-methylene-THF. This could, in turn, cause a deficiency of serine, and thus one-carbon units, in the cytoplasm. A second possibility is that formate production is defective in mitochondria from nonketotic hyperglycinemia patients. As we showed both in vitro with rat liver mitochondria (10, 11), and in vivo with yeast (28, 52), mitochondrial 5,10-methylene-THF is rapidly converted to formate and transported to the cytosol, where it is activated to 10-formyl-THF via cytoplasmic 10-formyl-THF synthetase (Fig. 1, mitochondrial reactions 3, 2, and 1 and cytoplasmic reaction 1). The 10-formyl-THF is then reduced by cytoplasmic reactions to the 5-methyl-THF required for homocysteine remethylation. Consistent with this explanation is the observation that GCS activity is stimulated by glucagon (53), and glucagon lowers plasma homocysteine in rats (54), presumably by increasing the mitochondrial production of formate. An elegant stable isotope study in humans (55) provides further support for the role of mitochondrial one-carbon units in the remethylation of homocysteine. Gregory et al. (55) showed that both cytoplasmic and mitochondrial one-carbon units end up in the methyl group of methionine following infusion of deuterated serine. This result strongly supports mitochondrial formate production as a significant contributor to cytoplasmic one-carbon units in vivo in mammals and places the mitochondrial C1-THF synthase in the center of this pathway.
The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EBI Data Bank with accession number(s) AY374130 [GenBank] and AY374131 [GenBank] .
* This work was supported by National Institutes of Health Grants DK61428 (to D. R. A.) and DK42033 (to B. S.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
|| To whom correspondence should be addressed: Dept. of Chemistry and Biochemistry, University of Texas, 1 University Station A5300, Austin, TX 78712-0165. Tel.: 512-471-5842; Fax: 512-471-5849; E-mail: dappling{at}mail.utexas.edu.
1 The abbreviations used are: THF, tetrahydrofolate; CHO, Chinese hamster ovary; nt, nucleotide(s); SOE, splice overlap extension; HMS, homogenization solution; TBS, Tris-buffered saline; RACE, rapid amplification of cDNA ends; EST, expressed sequence tag; GCS, glycine cleavage system.
2 Details are available upon request from the corresponding author.
3 Available at www.cbil.upenn.edu/tess.
4 Available at psort.nibb.ac.jp/form2.html.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||