![]()
|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 11, 7605-7613, March 17, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

1
1
¶
¶**2
From the
Structural Biology Program,
Institute of Biochemistry, and ||Faculty of Life Science, National Yang-Ming University, Taipei 11221, Taiwan, ¶Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan, and **Department of Medical Research and Education, Taipei Veterans General Hospital, Taipei 11217, Taiwan
Received for publication, September 19, 2005 , and in revised form, November 3, 2005.
| ABSTRACT |
|---|
|
|
|---|
2400 Å2/monomer. The N-terminal deaminase domain belongs to the cytidine deaminase superfamily. A structure-based sequence alignment of a variety of nucleotide deaminases reveals not only the unique signatures in each family member for gene annotation but also putative substrate-interacting residues for RNA-editing deaminases. The strong structural conservation between the C-terminal reductase domain and the pharmaceutically important dihydrofolate reductase suggests that the two reductases involved in the riboflavin and folate biosyntheses evolved from a single ancestral gene. Together with the binding of the essential cofactors, zinc ion and NADPH, the structural comparison assists substrate modeling into the active-site cavities allowing identification of specific substrate recognition. Finally, the present structure reveals that the deaminase and the reductase are separate functional domains and that domain fusion is crucial for the enzyme activities through formation of a stable tetrameric structure. | INTRODUCTION |
|---|
|
|
|---|
During riboflavin biosynthesis (1), GTP cyclohydrolase II first catalyzes the hydrolytic C-8 release of GTP to yield formate and pyrophosphate as side products. The product, 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate (compound 1) is converted into 5-amino-6-ribitylamino-2,4(1H,3H)-pyrimidinedione 5'-phosphate (compound 4) by deamination of the pyrimidine ring and NAD(P)H-dependent reduction of the ribose (Fig. 1). The deamination and reduction steps have been shown to proceed in the opposite order in yeast and Escherichia coli (2, 3). Most eubacteria contain a bifunctional protein; for instance Bacillus subtilis RibG (BsRibG)3 is composed of an N-terminal deaminase domain (D domain) and a C-terminal reductase domain (R domain) (4). In contrast, in fungi, plants and most archaea, these two enzymes are separate (5-7). In yeast, Rib7 is the corresponding reductase, whereas the C-terminal domain of Rib2 is responsible for deamination, in addition to N-terminal pseudouridine synthase activity.
The D domain of bacterial RibG was expected to belong to the cytidine deaminase (CDA) superfamily because of the two conserved signatures, H(C)XE and PCX8-9C. The CDA superfamily consists of the mononucleotide deaminases involved in nucleotide metabolism, and the RNA/DNA-editing deaminases involved in gene diversity and in antivirus defense (8, 9). The RNA/DNA-editing deaminases include A- to -I tRNA-specific adenosine deaminases (TADs), A- to -I adenosine deaminases acting on RNA, and C- to -U cytidine deaminases acting on RNA/DNA. These deaminases catalyze the hydrolytic deamination of cytosine, guanine, and adenine moieties and several of their therapeutically useful analogues. The available member structures reveal a virtually identical zinc-assisted deamination mechanism with the consensus histidine and cysteines acting as the zinc ligands, whereas the glutamate serves as a proton shuttle (10-14). However, the question of how nature has evolved the CDA fold into these various deaminases requires additional study.
Furthermore, the structural fold of the R domain was predicted by 3D-PSSM (15) to be similar to dihydrofolate reductase (DHFR). DHFR catalyzes the NADPH-utilizing reduction of dihydrofolate to tetrahydrofolate. Many DHFR inhibitors, such as methotrexate, pyrimethamine, and trimethoprim, have long been used clinically in the treatment of cancer, rheumatoid arthritis, malaria, and bacterial and fungal infection (16). Therefore, the R domain may become an important target for new drug design. To gain structural insights into the inhibitor design, substrate specificity, and evolution, we have solved the BsRibG structure at 2.41-Å resolution.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
-D-thiogalactopyranoside. Cells were grown for another 5 h at 37°C before harvest. Cell pellets were resuspended in 40 ml of cold buffer A (20 mM Tris-HCl, 200 mM NaCl, pH 7.5) and lysed by French press. After removal of cellular debris by centrifugation at 39,000 x g at 4 °C for 30 min, the supernatant was applied to a 5-ml nickel-nitrilotriacetic acid column (Ni-NTA, Qiagen). The resin was washed with 40 mM imidazole in buffer A, and the protein was eluted with buffer A containing 500 mM imidazole. The protein fractions were collected, dialyzed against buffer B (20 mM Tris-HCl, pH 8.5), and loaded onto a SepharoseTM Q column (Amersham Biosciences). After washing with 230 mM NaCl in buffer B, the BsRibG was eluted with 260 mM NaCl and dialyzed against 20 mM HEPES (pH 7.5) and 5 mM dithiothreitol.
|
Protein CrystallizationThe initial crystallization screening was performed with screening kits using the hanging-drop vapor diffusion method at 22 °C. The hanging drops were mixtures of 2 µl of reservoir solution and 2 µl of protein solution. Protein crystals could be obtained under several reservoir solutions, and the best crystals were grown in 26.6% polyethylene glycol 400, 190 mM MgCl2, 5% glycerol, and 95 mM HEPES (pH 7.5) with a protein solution of 20-25 mg/ml. Crystals appeared and reached their final dimensions in 1 week at 15 °C. The NADPH derivative was prepared by soaking crystals for 4 days in reservoir solution containing 10 mM NADPH. X-ray diffraction data were collected and processed at beamlines BL12B2 at SPring-8 (Harima, Japan) and NW12 at the Photon Factory (Tsukuba, Japan). The crystals belong to the P212121 space group with 1 tetramer/asymmetric unit.
Structure DeterminationThe BsRibG structure was determined using single-wavelength anomalous dispersion (SAD) of the endogenous zinc ion. Four Zn2+ positions were identified and refined with SOLVE (18). The initial phase was improved by direct-method phasing refinement using OASIS (19) before density modification. The initial electron density map showed an interpretable density of
70% of the tetramer. About 60% of the initial tetrameric model was automatically built using RESOLVE (18). Because the structural fold was predicted to be similar to yeast cytosine deaminase (yCD) and Thermotoga maritima DHFR (TmDHFR) (10, 20), both structures were used to assist in the manual building of the atomic model using TURBO-FRODO (21), and refinement was carried out against the native data to 2.41-Å resolution using crystallography NMR software (22). The statistics for data collection and refinement are summarized in Table 1. About 88% of the residues are in the most favored regions of the Ramachandran plot, with the remaining in the additional allowed regions.
|
|
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
-sheet (
1-
5) with
1 running antiparallel to the others. The
-sheet is sandwiched by two helices (
A and
E) on one side and by three helices (
B,
C, and
D) on the other side. The C-terminal helix
F extends away from the D domain and connects to the R domain. The R domain (residues 146-359) is composed of a large nine-stranded
-sheet (
A-
H and
D') with the C-terminal strand
H running antiparallel to the others. The
-sheet is flanked by five
-helices (
B,
C,
D',
E, and
F). The secondary structure elements of the R domain are numbered as for DHFRs (28).
The four subunits in the crystal asymmetric unit did not show significant differences between each individual domain except for several loops (root-mean-square deviations (r.m.s.d.) of 0.38-0.46 and 0.59-0.73 Å for the backbones of residues 2-139 and residues 146-359, respectively). However, the relative orientations between the D and R domains are slightly different, resulting in a weak noncrystallographic symmetry. Molecules A and B interact with each other through their D domains with a buried surface area of
650 Å2 per D domain (the D interface) (Fig. 4A), whereas molecule A makes extensive contacts with molecule C through their respective R domains with a buried area of
1750 Å2 per R domain (the R interface) (Fig. 4B). There are no contacts between molecules A and D in the tetramer.
The D interface is made up mainly of the N-terminal two helices
A and
B, the
2 strand, and the connected loops (residues 4-19 and 35-61). There are 14 direct hydrogen bonds between the protein atoms across the interface and hydrophobic patches formed by Leu8, Leu12, Ile36, Met39, Leu43, and Met57. The anti-parallel disposition of the two pseudodyad-related
A helices also contributes to the interface by dipole-dipole interactions. Interestingly, the side chains of His56 from the two D domains stack very well, with a distance of 3.3-3.4 Å between the aromatic rings. The R interface mainly is made up of: the two large loops, L
A-
B (residues 156-169) and L
F-
G (residues 314-338); and the C-terminal residues 339-358. There are 18 direct hydrogen bonds between the protein atoms across the interface, and extensive hydrophobic patches are by Try169 formed, Pro314, Lys315, Leu316, Ile317, Leu325, Phe331, Met334, Val337, Leu339, Leu340, Phe342, Ile345, Ile352, and Leu354.
|
/
/
structure, in which five
-strands (
1-
5) and three helices (
A-
C) correspond closely, whereas the remainder varies across the different deaminases (Fig. 5A). The main chain atoms of the 65-70 structurally equivalent residues are overlaid with an r.m.s.d. of 0.75 to 1.35 Å and with 8-24% sequence identity. These structural elements in the CDA fold form a conservative hydrophobic core, which is also preserved in the AICAR transformylase domain, implying that the hydrophobic core has been highly conserved throughout evolution.
The active site of the D domain contains one tightly bound endogenous zinc ion, for which the anomalous data provided sufficient phase information for structure determination. The zinc ion is tetrahedrally coordinated by His49 N
1 (2.0 Å), Cys74 S
(2.4 Å), Cys83 S
(2.3 Å), and a water molecule (2.0 Å).The zinc-bound water molecule interacts with Glu51 O
2 (2.5 Å). The active-site architecture resembles those of the CDA members, which share a similar zinc-assisted deamination mechanism with a virtually identical interaction network between the common moiety of the pyrimidine ring of the substrate, the zinc ion, the zinc-bound water molecule, the zinc ligands, and the base glutamate (Fig. 5B). In addition, the active-site cavities of these deaminases are mainly made up of the C-terminal tail and the loops connecting the
A-
1,
2-
B,
3-
C, and
4-
D (Fig. 5C).
Based on the structural comparison of the active-site cavities, the substrate of the D domain was modeled into the active site through superposition of the nucleobase rings because of the highly conserved interaction networks surrounding the target amino group (Fig. 5B). The model was then subjected to energy minimization with crystallography NMR software. Simulation of the complex structure suggested that the nucleophilic OH-2 group of the pyrimidine ring coordinates to the catalytic zinc ion and interacts with Glu51O
2 and Cys74 N. The NH-3 group hydrogen bonds with Glu51 O
1, the O-4 atom with Ala50 N and His42 N
, and the NH2-5 with Asn23 O
1 and His42 N
1. The two hydroxyl groups of the ribose have close contacts with the side chains of Asp101 and Asn103. The phosphate moiety forms salt bridges with His76 and Lys79, located in the unique insertion between the two zinc ligand cysteines, and these interactions are essential for the deamination activity because the enzyme cannot utilize the dephosphorylated form as substrate (3). These predicted substrate-binding residues are all highly conserved in the eubacterial RibGs. However, the fungal deaminases such as yeast Rib2 (yRib2) apparently contain different substrate-interacting residues because of their distinct substrate, which has an open ribityl group instead of a cyclic ribose (Figs. 1 and 5C).
|
B-
D and surrounding loops for oligomerization. BsGD, yCD, and AaTADA display similar dimeric structures (10, 11, 14). The swapping of the C-terminal segment in BsGD causes additional dimeric contacts including the
5 strand and helices
A and
E. CDAs form tetramers in which one subunit interacts with the other three subunits (12). The hexameric T4dCMPD contains two types of intersubunit interfaces (13). In contrast, the D interface of BsRibG is very distinct from the others and possesses the fewest contacts, with a total buried area of
1300 Å2 (Fig. 4A). In addition to the loops and helices
A and
B, RibG includes the
2 strand in the D interface. The distinct intersubunit orientation in the D interface also separates the active sites away from each other with an inter-zinc distance of 30 Å. Notably, the shortest inter-zinc distance in the other five deaminases is about 14-15 Å.
|
4 strand is quite diverse and may make a major contribution to the structural plasticity and functional diversity among the CDA members (Fig. 5A). For instance, the mononucleobase deaminase yCD and the tRNA-editing deaminase AaTADA unexpectedly superimposed very well, with an r.m.s.d. of 1.08 Å for 115 C
atoms with 22% sequence identity. The AaTADA structure was solved by molecular replacement using yCD as a search model (14). The major structural difference around the active-site cavity is the C-terminal helix. In yCD as well as BsGD, genetic changes to alter the substrate specificity are through an introduction of substrate recognition residues at the C-terminal tail (Asp155 in yCD and Tyr156 in BsGD), which then forms a "flap," capping and hence narrowing the opening of the active-site cavity upon substrate binding to limit the pocket size for the nucleobase (10, 11). In contrast, the C-terminal tail in AaTADA, as well as in CDAs, dCMPDs, and the D domain of BsRibG, swings away to enlarge the active-site cavity for their larger substrates (12-14). In addition to the diverse C-terminal segment, each family member has some unique substrate recognition residues; these are discussed below in detail.
Structure-based Sequence Alignment of the CDA MembersIn combination with a sequence-structure analysis, a structure-based sequence alignment of the CDA members was constructed (Fig. 5C). Multiple sequence alignments reveal that there are unique member signatures that are useful for gene annotation. The unique substrate recognition residues for the cytosine deaminases are WXXDI at the C terminus, and those for the guanosine deaminases are FDD between helices
D1 and
D2 and a tyrosine at the C terminus. The CDAs also have unique substrate-binding residues, namely FXV at the N terminus of the
1 strand and NXE at the C terminus of the
2 strand. The absence of the
D helix between the
4 and
5 strands causes both strands to become antiparallel and the C terminus to be in the opposite direction. dCMPDs possess GYNG on the
2 strand and a diverse extra 20-60-residue insertion between the
2 strand and the
B helix. Interestingly, most phosphate-interacting residues are located in this insertion and are not conserved across dCMPDs. RibGs have NPXVG at the N terminus of the
1 strand and an extra 8-9-residue loop between the two zinc-ligating cysteines (PCX8-9C). The extra loop is highly conserved in RibGs and is predicted to interact with the phosphate moiety of the substrate.
Even though the CDA members display substantial sequence diversity (15-25% sequence identity), our structure-sequence analysis suggests that comparative modeling is feasible. For instance, bacterial TADAs and eukaryotic TAD2s share a unique signature, EXPVG, at the N terminus of the
1 strand, in which the glutamate is also conserved in TAD1. Our comparative modeling suggests that this glutamate residue may serve as an adenine recognition residue for these A- to -I tRNA deaminases.
|
D helix (Fig. 5C), and hence the
4 and
5 strands are parallel and the direction of the C terminus is opposite from that of CDAs. In addition, both AID and APOBEC1 are too short to accommodate two CDA folds in one protein molecule. We have modeled residues 1-160 of human AID into a CDA fold consisting of the five
-strands surrounded by helices
A-
E. APOBECs contain more conserved aromatic residues than other members with a unique signature TWY(F)XSWSPCX2C around the
3 strand.
|
1 strand, and Arg112 to at the C terminus of the
4 strand may interact with the phosphate group of the edited cytidylate, with respect to corresponding Ser21 and Tyr153 the in T4dCMPD. Phe151 in the
E helix may form close contacts with the substrate. The conserved FFX3R motif in the
E helix in bacterial TADAs has been shown to play a critical role in A34- to -I34 deamination (33). In contrast, Trp80, Leu106, and Met139, located in the
3,
4, and
5 strands, respectively, may be responsible for construction of the conserved hydrophobic core. The truncated C147X mutant could not be expressed in 293T cells (32), implying that this mutant might not form a stable CDA fold. The 181X and 190X mutants retain deamination activity but are defective for class switch recombination and normal nuclear distribution, indicating the functional roles of the extra C-terminal segment in AID. Similarly, Ser3 and Lys10 at the N-terminal
A helix are expected to be exposed at the protein surface and might be important for nuclear location signaling or for interaction with associate proteins.
NADPH Binding in the R DomainNADPH was identified by its strong electron density and is firmly embedded at the active site of the R domain (Fig. 6A). The enzyme did not show significant structural changes upon NADPH binding, with an r.m.s.d. of 0.4 Å for all of the backbone atoms. The cofactor is bound in an extended conformation with extensive interactions with the protein. The adenine N-3 and N-6 atoms interact with Thr221 and Ser298, respectively (Fig. 6B). The phosphate group at the ribose O-3' makes close contacts with Thr221. These interactions seem not to contribute significantly to the cofactor binding, because NADH as well as NADPH can serve as the coenzyme (4). The pyrophosphate moiety interacts with a constellation of backbone NH groups from Gly194, Thr195, Gly292, Ser293, Ala294, and Val295 and the side chain of Thr195. These residues are located at the N termini of helices
C and
E, and the helix dipoles may partially neutralize the negative charges of the phosphate groups. The nicotiamide amide group complementarily interacts with the amide backbone of Ala153, whereas the nicotinamide ribose OH-2' and OH-3' contact with Asp199 and Gly165, respectively.
Structural Conservation and Divergence between BsRibG and DHFRsA structural homology search by DALI (34) revealed that the R domain displays significant structural similarity to DHFRs (Fig. 7A). Despite low sequence identity, which is as little as 20%, all the DHFR structures exhibit a virtually identical fold consisting of a central eight-stranded
-sheet (designated
A-
H) and four flanking helices (
B,
C,
E, and
F) (28). The R domain of BsRibG contains an extra 25-residue insertion between the
D strand and the
E helix, forming the
D' helix and the
D' strand. This insertion is quite a distance away from the active site and is diverse across the bacterial and fungal reductases.
TmDHFR is the most stable DHFR isolated thus far. The most prominent feature of TmDHFR is its highly stable dimer, which has been shown to make a major contribution to the high intrinsic stability (20). The R-R structure of BsRibG is similar to the dimeric TmDHFR structure. Both TmDHFR and BsRibG utilize similar regions for the subunit association. The contact areas of the R interface are larger than those of TmDHFR,
3500 Å2 versus
3050 Å2, perhaps due to a larger L
F-
G loop in BsRibG. Structural superposition of RibG and TmDHFR reveals an r.m.s.d. of 1.5 Å for 120 C
atoms with 25% sequence identity. The strong conservation of the tertiary structures suggests that the two reductases involved in the riboflavin and folate biosyntheses are descended from a single ancestral gene and there by define a new superfamily.
DHFRs and BsRibG share four conserved regions for NADPH binding, motifs A-D (Fig. 7B). In motif A, the two hydrogen bonds between the amide groups of the nicotinamide ring and an alanine residue at the C terminus of the
A strand (Ala153 in BsRibG) are strictly conserved. Both proteins use motifs B and D for the pyrophosphate binding, and motif C for the phosphate group at the adenosine O-3'. The interactions between DHFRs and the 3'-phosphate group are so extensive via 3-4 residues that the enzyme has a strict NADPH dependence. Both DHFR and RibG utilize many main chain atoms for NADPH binding, and hence the corresponding sequences of the NADPH-interacting residues have diverged during evolution, but their spatial positions have remained convergent in the four regions. A conserved cis peptide bond occurs at the two consecutive glycine residues in motif D. In contrast, both enzymes form diverse interactions with the remaining areas including the adenine ring and the riboses.
The enzymatic mechanism of DHFR has been chemically and structurally studied in detail (28, 35). A complete catalytic scheme through five kinetically observable intermediates has been proposed. Similarly, RibG is expected to catalyze the reduction of a cyclic ribosyl into an open ribityl group by hydride transfer from the C-4 atom of the nicotinamide ring of NAD(P)H to the C-1' of the ribose with concomitant protonation of O-5'. The binding architecture of the nicotinamide ring is virtually identical in DHFR and RibG. Thus, based on the structural comparison, the substrate of the R domain was modeled into the active site with subsequent energy minimization. Simulation of the complex structure suggested that the O-2 atom of the pyrimidine ring interacts with Lys151 N
and the NH-3 and O-4 with Thr172 O
1. Thr195 and Asp199 make close contacts with the two hydroxyl groups of the ribose. Arg176, Arg183, and Arg206 form salt bridges with the phosphate group, interactions that are essential for the reduction activity because the enzyme cannot utilize the dephosphorylated form as substrate (3). These substrate-binding residues are highly conserved in the reductases and are located in spatial positions similar to those in DHFRs (Fig. 7B). Lys151 is highly conserved in most eubacteria and plants but not in fungi and archaea, implying that this residue distinguishes the O-2 atom from the NH2-2 group of the pyrimidine moiety, which is the only difference in the substrates (Fig. 1). A highly conserved glutamate in motif D (Glu290 in BsRibG) may assist in proton transfer during the hydride transfer.
Fusion of Two Enzyme Domains in RibGThere is no evidence for any dependence of the active sites within the RibG tetramer. The active sites of the D and R domains within the same molecule are
40 Å apart if measured from the zinc ion to the nicotiamide C-4 atom. The openings of the D and R active sites point in the opposite directions. Unlike the bifunctional enzyme of DHFR and thymidylate synthase (36), the present BsRibG structure does not show a channel for substrate transport from the D domain to the R domain for the sequential reaction. In addition, the active sites of the dimeric TmDHFR have been shown to be independent (20). Unlike the monomeric DHFRs, TmDHFR does not show significant structural changes on binding of NADPH and the inhibitor methotrexate, because the active-site loops (L
A-
B and L
F-
G) in the monomeric enzymes are rearranged on the dimeric interfaces. Similarly, NADPH binding does not induce any significant conformational changes in BsRibG, and the corresponding loops are also involved in the R interface.
Previous deletion mutants have demonstrated that the N-terminal 147 residues and the C-terminal 248 residues of BsRibG were sufficient for their respective enzyme activities (4). However, these truncated proteins could not be isolated because of poor stability. The present structure reveals that the D and R domains make only a few contacts, including four hydrogen bonds by Glu21, Gln135, His139 Gln187, Lys212, Gln306 and Lys357 and small hydrophobic contacts by Lys136, Phe140, Tyr147, and Met285.If the
F helix (residues 135-143) is considered as an interdomain linker, the D and R domain is even less interactive. Therefore, even though the two enzyme domains can fold independently, the domain fusion is crucial for the enzyme activities through formation of a stable tetrameric structure.
| FOOTNOTES |
|---|
* This study was supported by National Science Council Grant NSC94-2311-B010-017. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
1 These authors contributed equally to this work. ![]()
2 To whom correspondence should be addressed: Faculty of Life Science, National Yang-Ming University, Taipei, Taiwan 11221. Tel.: 886-2-2826-7278; Fax: 886-2-2820-2449; E-mail: shliaw{at}ym.edu.tw.
3 The abbreviations used are: BsRibG, B. subtilis RibG; BsGD, B. subtilis guanine deaminase; CDA, cytidine deaminase; yCD, yeast cytosine deaminase; T4dCMPD, T4 bacteriophage dCMP deaminase; TAD, tRNA-specific adenosine deaminase; AaTADA, A. aeolicus tRNA-specific adenosine deaminase; DHFR, dihydrofolate reductase; TmDHFR, T. maritima dihydrofolate reductase; AID, activation-induced deaminase; APOBEC, apolipoprotein B mRNA-editing catalytic subunit; r.m.s.d., root-mean-square deviation; AICAR, aminoimidazole-4-carboxamide ribonucleotide. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
T. Toyomasu, M. Tsukahara, A. Kaneko, R. Niida, W. Mitsuhashi, T. Dairi, N. Kato, and T. Sassa Fusicoccins are biosynthesized by an unusual chimera diterpene synthase in fungi PNAS, February 27, 2007; 104(9): 3084 - 3088. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Molecular and Cellular Proteomics |
| Journal of Lipid Research | ASBMB Today |