|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 281, Issue 11, 7605-7613, March 17, 2006
Crystal Structure of a Bifunctional Deaminase and Reductase from Bacillus subtilis Involved in Riboflavin Biosynthesis*![]() 1![]() 1 ¶ ¶**2
From the
Received for publication, September 19, 2005 , and in revised form, November 3, 2005.
Bacterial RibG is an attractive candidate for development of antimicrobial drugs because of its involvement in the riboflavin biosynthesis. The crystal structure of Bacillus subtilis RibG at 2.41-Å resolution displayed a tetrameric ring-like structure with an extensive interface of 2400 Å2/monomer. The N-terminal deaminase domain belongs to the cytidine deaminase superfamily. A structure-based sequence alignment of a variety of nucleotide deaminases reveals not only the unique signatures in each family member for gene annotation but also putative substrate-interacting residues for RNA-editing deaminases. The strong structural conservation between the C-terminal reductase domain and the pharmaceutically important dihydrofolate reductase suggests that the two reductases involved in the riboflavin and folate biosyntheses evolved from a single ancestral gene. Together with the binding of the essential cofactors, zinc ion and NADPH, the structural comparison assists substrate modeling into the active-site cavities allowing identification of specific substrate recognition. Finally, the present structure reveals that the deaminase and the reductase are separate functional domains and that domain fusion is crucial for the enzyme activities through formation of a stable tetrameric structure.
Flavin coenzymes are ubiquitous in all organisms because of their involvements in central metabolic pathways. Plants and many microorganisms obtain the precursor riboflavin by biosynthesis, whereas animals depend on nutritional sources. Numerous pathogenic microorganisms are unable to take up flavins from the environment and hence are absolutely dependent on their endogenous production. Therefore, the enzymes involved in riboflavin biosynthesis have the potential to become attractive candidates for the design of new defenses against antibiotic-resistant pathogens. During riboflavin biosynthesis (1), GTP cyclohydrolase II first catalyzes the hydrolytic C-8 release of GTP to yield formate and pyrophosphate as side products. The product, 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate (compound 1) is converted into 5-amino-6-ribitylamino-2,4(1H,3H)-pyrimidinedione 5'-phosphate (compound 4) by deamination of the pyrimidine ring and NAD(P)H-dependent reduction of the ribose (Fig. 1). The deamination and reduction steps have been shown to proceed in the opposite order in yeast and Escherichia coli (2, 3). Most eubacteria contain a bifunctional protein; for instance Bacillus subtilis RibG (BsRibG)3 is composed of an N-terminal deaminase domain (D domain) and a C-terminal reductase domain (R domain) (4). In contrast, in fungi, plants and most archaea, these two enzymes are separate (5-7). In yeast, Rib7 is the corresponding reductase, whereas the C-terminal domain of Rib2 is responsible for deamination, in addition to N-terminal pseudouridine synthase activity.
The D domain of bacterial RibG was expected to belong to the cytidine deaminase (CDA) superfamily because of the two conserved signatures, H(C)XE and PCX8-9C. The CDA superfamily consists of the mononucleotide deaminases involved in nucleotide metabolism, and the RNA/DNA-editing deaminases involved in gene diversity and in antivirus defense (8, 9). The RNA/DNA-editing deaminases include A- to -I tRNA-specific adenosine deaminases (TADs), A- to -I adenosine deaminases acting on RNA, and C- to -U cytidine deaminases acting on RNA/DNA. These deaminases catalyze the hydrolytic deamination of cytosine, guanine, and adenine moieties and several of their therapeutically useful analogues. The available member structures reveal a virtually identical zinc-assisted deamination mechanism with the consensus histidine and cysteines acting as the zinc ligands, whereas the glutamate serves as a proton shuttle (10-14). However, the question of how nature has evolved the CDA fold into these various deaminases requires additional study. Furthermore, the structural fold of the R domain was predicted by 3D-PSSM (15) to be similar to dihydrofolate reductase (DHFR). DHFR catalyzes the NADPH-utilizing reduction of dihydrofolate to tetrahydrofolate. Many DHFR inhibitors, such as methotrexate, pyrimethamine, and trimethoprim, have long been used clinically in the treatment of cancer, rheumatoid arthritis, malaria, and bacterial and fungal infection (16). Therefore, the R domain may become an important target for new drug design. To gain structural insights into the inhibitor design, substrate specificity, and evolution, we have solved the BsRibG structure at 2.41-Å resolution.
Protein PreparationThe His-tagged BsRibG was expressed using the pQE30 vector (Qiagen) in E. coli BL21 pLysS. The recombinant protein contains 12 additional vector residues (MRGSH6GS) at the N terminus. One-liter Luria broth (LB) cultures were grown at 37 °C to an A600 of 0.8 and induced with the addition of 1 mM isopropyl -D-thiogalactopyranoside. Cells were grown for another 5 h at 37°C before harvest. Cell pellets were resuspended in 40 ml of cold buffer A (20 mM Tris-HCl, 200 mM NaCl, pH 7.5) and lysed by French press. After removal of cellular debris by centrifugation at 39,000 x g at 4 °C for 30 min, the supernatant was applied to a 5-ml nickel-nitrilotriacetic acid column (Ni-NTA, Qiagen). The resin was washed with 40 mM imidazole in buffer A, and the protein was eluted with buffer A containing 500 mM imidazole. The protein fractions were collected, dialyzed against buffer B (20 mM Tris-HCl, pH 8.5), and loaded onto a SepharoseTM Q column (Amersham Biosciences). After washing with 230 mM NaCl in buffer B, the BsRibG was eluted with 260 mM NaCl and dialyzed against 20 mM HEPES (pH 7.5) and 5 mM dithiothreitol.
Protein CharacterizationThe enzyme activity assay was carried out as described previously (4). The substrate was prepared by GTP hydrolysis using the recombinant E. coli GTP cyclohydrolase II. The molecular mass in solution was estimated by a Beckman-Coulter XL-A analytical Ultracentrifuge with an An60Ti rotor. Sedimentation velocity was performed at 20 °C and 40,000 rpm with standard double sector centerpieces. The UV absorption of the cells was scanned every 5 min for 2 h, and the data were analyzed using the SedFit program (17). Protein CrystallizationThe initial crystallization screening was performed with screening kits using the hanging-drop vapor diffusion method at 22 °C. The hanging drops were mixtures of 2 µl of reservoir solution and 2 µl of protein solution. Protein crystals could be obtained under several reservoir solutions, and the best crystals were grown in 26.6% polyethylene glycol 400, 190 mM MgCl2, 5% glycerol, and 95 mM HEPES (pH 7.5) with a protein solution of 20-25 mg/ml. Crystals appeared and reached their final dimensions in 1 week at 15 °C. The NADPH derivative was prepared by soaking crystals for 4 days in reservoir solution containing 10 mM NADPH. X-ray diffraction data were collected and processed at beamlines BL12B2 at SPring-8 (Harima, Japan) and NW12 at the Photon Factory (Tsukuba, Japan). The crystals belong to the P212121 space group with 1 tetramer/asymmetric unit.
Structure DeterminationThe BsRibG structure was determined using single-wavelength anomalous dispersion (SAD) of the endogenous zinc ion. Four Zn2+ positions were identified and refined with SOLVE (18). The initial phase was improved by direct-method phasing refinement using OASIS (19) before density modification. The initial electron density map showed an interpretable density of
Sequence AlignmentTo identify the unique signatures for each family member for gene annotation, sequence similarity searches were conducted by PSI-BLAST (23), and multiple sequence alignment of the homologous sequences was performed by ClustalW (24). Because of a conservative hydrophobic core and the consensus HXE and PCX2-9C, a structural-based sequence alignment between the CDA members was feasible and was carried out by manual editing according to the available structures. The conserved residues in each family member were mapped onto the known structures to reveal their potential involvement in structural integrity or the enzyme catalysis. Figs. 3, 4, A and B, and 6A were generated by MolScript (25) and Raster3D (26), Fig. 5A by BobScript (25), and Fig. 5B by LigPlot (27).
Overall StructureAnalytical ultracentrifugation experiments clearly demonstrated that BsRibG exists as a tetramer in solution as well as in crystal form, where the enzyme forms a tetrameric ring-like structure (Figs. 2 and 3a). The current tetrameric model contains residues 1-359 for each subunit. Each monomer is composed of two separate functional domains (Fig. 3b). The N-terminal D domain (residues 1-143) consists of a central five-stranded -sheet ( 1- 5) with 1 running antiparallel to the others. The -sheet is sandwiched by two helices ( A and E) on one side and by three helices ( B, C, and D) on the other side. The C-terminal helix F extends away from the D domain and connects to the R domain. The R domain (residues 146-359) is composed of a large nine-stranded -sheet ( A- H and D') with the C-terminal strand H running antiparallel to the others. The -sheet is flanked by five -helices ( B, C, D', E, and F). The secondary structure elements of the R domain are numbered as for DHFRs (28).
The four subunits in the crystal asymmetric unit did not show significant differences between each individual domain except for several loops (root-mean-square deviations (r.m.s.d.) of 0.38-0.46 and 0.59-0.73 Å for the backbones of residues 2-139 and residues 146-359, respectively). However, the relative orientations between the D and R domains are slightly different, resulting in a weak noncrystallographic symmetry. Molecules A and B interact with each other through their D domains with a buried surface area of
The D interface is made up mainly of the N-terminal two helices
Structural Conservation in the CDA SuperfamilyAs expected, the D domain displays high structural homology to the available structures of the CDA members including yCD, B. subtilis guanine deaminse (BsGD), CDAs, T4 bacteriophage dCMP deaminse (T4dCMPD), and Aquifex aeolicus TADA (AaTADA), and subdomains 2 and 4 of the chicken AICAR transformylase domain (10-14, 29). The TADAs in prokaryotes and the TAD2/TAD3 heterodimers in eukaryotes are responsible for the I34 alteration at the wobble position of the tRNA anticodon, whereas TAD1 creates the unique 1-methyl-I37 in eukaryotic tRNAAla (8). Detailed structural comparisons reveal a common three-layer / / structure, in which five -strands ( 1- 5) and three helices ( A- C) correspond closely, whereas the remainder varies across the different deaminases (Fig. 5A). The main chain atoms of the 65-70 structurally equivalent residues are overlaid with an r.m.s.d. of 0.75 to 1.35 Å and with 8-24% sequence identity. These structural elements in the CDA fold form a conservative hydrophobic core, which is also preserved in the AICAR transformylase domain, implying that the hydrophobic core has been highly conserved throughout evolution.
The active site of the D domain contains one tightly bound endogenous zinc ion, for which the anomalous data provided sufficient phase information for structure determination. The zinc ion is tetrahedrally coordinated by His49 N
Based on the structural comparison of the active-site cavities, the substrate of the D domain was modeled into the active site through superposition of the nucleobase rings because of the highly conserved interaction networks surrounding the target amino group (Fig. 5B). The model was then subjected to energy minimization with crystallography NMR software. Simulation of the complex structure suggested that the nucleophilic OH-2 group of the pyrimidine ring coordinates to the catalytic zinc ion and interacts with Glu51O
Structural Divergence in the CDA SuperfamilyThe CDA members exist as an oligomer. All of the available member structures except for BsRibG utilize helices B- D and surrounding loops for oligomerization. BsGD, yCD, and AaTADA display similar dimeric structures (10, 11, 14). The swapping of the C-terminal segment in BsGD causes additional dimeric contacts including the 5 strand and helices A and E. CDAs form tetramers in which one subunit interacts with the other three subunits (12). The hexameric T4dCMPD contains two types of intersubunit interfaces (13). In contrast, the D interface of BsRibG is very distinct from the others and possesses the fewest contacts, with a total buried area of 1300 Å2 (Fig. 4A). In addition to the loops and helices A and B, RibG includes the 2 strand in the D interface. The distinct intersubunit orientation in the D interface also separates the active sites away from each other with an inter-zinc distance of 30 Å. Notably, the shortest inter-zinc distance in the other five deaminases is about 14-15 Å.
The C-terminal segment beyond the 4 strand is quite diverse and may make a major contribution to the structural plasticity and functional diversity among the CDA members (Fig. 5A). For instance, the mononucleobase deaminase yCD and the tRNA-editing deaminase AaTADA unexpectedly superimposed very well, with an r.m.s.d. of 1.08 Å for 115 C atoms with 22% sequence identity. The AaTADA structure was solved by molecular replacement using yCD as a search model (14). The major structural difference around the active-site cavity is the C-terminal helix. In yCD as well as BsGD, genetic changes to alter the substrate specificity are through an introduction of substrate recognition residues at the C-terminal tail (Asp155 in yCD and Tyr156 in BsGD), which then forms a "flap," capping and hence narrowing the opening of the active-site cavity upon substrate binding to limit the pocket size for the nucleobase (10, 11). In contrast, the C-terminal tail in AaTADA, as well as in CDAs, dCMPDs, and the D domain of BsRibG, swings away to enlarge the active-site cavity for their larger substrates (12-14). In addition to the diverse C-terminal segment, each family member has some unique substrate recognition residues; these are discussed below in detail.
Structure-based Sequence Alignment of the CDA MembersIn combination with a sequence-structure analysis, a structure-based sequence alignment of the CDA members was constructed (Fig. 5C). Multiple sequence alignments reveal that there are unique member signatures that are useful for gene annotation. The unique substrate recognition residues for the cytosine deaminases are WXXDI at the C terminus, and those for the guanosine deaminases are FDD between helices
Even though the CDA members display substantial sequence diversity (15-25% sequence identity), our structure-sequence analysis suggests that comparative modeling is feasible. For instance, bacterial TADAs and eukaryotic TAD2s share a unique signature, EXPVG, at the N terminus of the
Possible Effects of the AID Point Mutants in Hyper-IgM PatientsTo date, apolipoprotein B mRNA-editing catalytic subunit 1 (APOBEC1) and its sequence homologues, activation-induced deaminase (AID), APOBEC2, and the tandem repeats APOBEC3A to -3H, are the only identified C- to -U RNA/DNA-editing deaminases in humans (9). AID is an essential B cell-specific factor required for antibody maturation, whereas several APOBECs are involved in defense against a broad range of retroviruses. The AID and APOBEC1 structures have been modeled with an N-terminal catalytic domain, a linker, and a C-terminal pseudocatalytic domain based on the CDA structures (30, 31). However, both proteins seem to contain the D helix (Fig. 5C), and hence the 4 and 5 strands are parallel and the direction of the C terminus is opposite from that of CDAs. In addition, both AID and APOBEC1 are too short to accommodate two CDA folds in one protein molecule. We have modeled residues 1-160 of human AID into a CDA fold consisting of the five -strands surrounded by helices A- E. APOBECs contain more conserved aromatic residues than other members with a unique signature TWY(F)XSWSPCX2C around the 3 strand.
Most residues involved in loss-of-function point mutants of AID in patients with hyper-IgM syndrome type 2 (32) are highly conserved in APOBECs (Fig. 5C). His56 and Cys87 ligate the zinc ion. Arg24, prior the 1 strand, and Arg112 to at the C terminus of the 4 strand may interact with the phosphate group of the edited cytidylate, with respect to corresponding Ser21 and Tyr153 the in T4dCMPD. Phe151 in the E helix may form close contacts with the substrate. The conserved FFX3R motif in the E helix in bacterial TADAs has been shown to play a critical role in A34- to -I34 deamination (33). In contrast, Trp80, Leu106, and Met139, located in the 3, 4, and 5 strands, respectively, may be responsible for construction of the conserved hydrophobic core. The truncated C147X mutant could not be expressed in 293T cells (32), implying that this mutant might not form a stable CDA fold. The 181X and 190X mutants retain deamination activity but are defective for class switch recombination and normal nuclear distribution, indicating the functional roles of the extra C-terminal segment in AID. Similarly, Ser3 and Lys10 at the N-terminal A helix are expected to be exposed at the protein surface and might be important for nuclear location signaling or for interaction with associate proteins.
NADPH Binding in the R DomainNADPH was identified by its strong electron density and is firmly embedded at the active site of the R domain (Fig. 6A). The enzyme did not show significant structural changes upon NADPH binding, with an r.m.s.d. of 0.4 Å for all of the backbone atoms. The cofactor is bound in an extended conformation with extensive interactions with the protein. The adenine N-3 and N-6 atoms interact with Thr221 and Ser298, respectively (Fig. 6B). The phosphate group at the ribose O-3' makes close contacts with Thr221. These interactions seem not to contribute significantly to the cofactor binding, because NADH as well as NADPH can serve as the coenzyme (4). The pyrophosphate moiety interacts with a constellation of backbone NH groups from Gly194, Thr195, Gly292, Ser293, Ala294, and Val295 and the side chain of Thr195. These residues are located at the N termini of helices
Structural Conservation and Divergence between BsRibG and DHFRsA structural homology search by DALI (34) revealed that the R domain displays significant structural similarity to DHFRs (Fig. 7A). Despite low sequence identity, which is as little as 20%, all the DHFR structures exhibit a virtually identical fold consisting of a central eight-stranded
TmDHFR is the most stable DHFR isolated thus far. The most prominent feature of TmDHFR is its highly stable dimer, which has been shown to make a major contribution to the high intrinsic stability (20). The R-R structure of BsRibG is similar to the dimeric TmDHFR structure. Both TmDHFR and BsRibG utilize similar regions for the subunit association. The contact areas of the R interface are larger than those of TmDHFR,
DHFRs and BsRibG share four conserved regions for NADPH binding, motifs A-D (Fig. 7B). In motif A, the two hydrogen bonds between the amide groups of the nicotinamide ring and an alanine residue at the C terminus of the
The enzymatic mechanism of DHFR has been chemically and structurally studied in detail (28, 35). A complete catalytic scheme through five kinetically observable intermediates has been proposed. Similarly, RibG is expected to catalyze the reduction of a cyclic ribosyl into an open ribityl group by hydride transfer from the C-4 atom of the nicotinamide ring of NAD(P)H to the C-1' of the ribose with concomitant protonation of O-5'. The binding architecture of the nicotinamide ring is virtually identical in DHFR and RibG. Thus, based on the structural comparison, the substrate of the R domain was modeled into the active site with subsequent energy minimization. Simulation of the complex structure suggested that the O-2 atom of the pyrimidine ring interacts with Lys151 N
Fusion of Two Enzyme Domains in RibGThere is no evidence for any dependence of the active sites within the RibG tetramer. The active sites of the D and R domains within the same molecule are
Previous deletion mutants have demonstrated that the N-terminal 147 residues and the C-terminal 248 residues of BsRibG were sufficient for their respective enzyme activities (4). However, these truncated proteins could not be isolated because of poor stability. The present structure reveals that the D and R domains make only a few contacts, including four hydrogen bonds by Glu21, Gln135, His139 Gln187, Lys212, Gln306 and Lys357 and small hydrophobic contacts by Lys136, Phe140, Tyr147, and Met285.If the
The atomic coordinates and structure factors (codes 2B3Z and 2D5N) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This study was supported by National Science Council Grant NSC94-2311-B010-017. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 These authors contributed equally to this work. 2 To whom correspondence should be addressed: Faculty of Life Science, National Yang-Ming University, Taipei, Taiwan 11221. Tel.: 886-2-2826-7278; Fax: 886-2-2820-2449; E-mail: shliaw{at}ym.edu.tw.
3 The abbreviations used are: BsRibG, B. subtilis RibG; BsGD, B. subtilis guanine deaminase; CDA, cytidine deaminase; yCD, yeast cytosine deaminase; T4dCMPD, T4 bacteriophage dCMP deaminase; TAD, tRNA-specific adenosine deaminase; AaTADA, A. aeolicus tRNA-specific adenosine deaminase; DHFR, dihydrofolate reductase; TmDHFR, T. maritima dihydrofolate reductase; AID, activation-induced deaminase; APOBEC, apolipoprotein B mRNA-editing catalytic subunit; r.m.s.d., root-mean-square deviation; AICAR, aminoimidazole-4-carboxamide ribonucleotide.
We thank Dr. R. Kirby for critical reading of the manuscript and Dr. H. Chang for performing the ultracentrifuge experiments.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||