|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 280, Issue 6, 4289-4298, February 11, 2005
Specificity Changes in the Evolution of Type II Restriction EndonucleasesA BIOCHEMICAL AND BIOINFORMATIC ANALYSIS OF RESTRICTION ENZYMES THAT RECOGNIZE UNRELATED SEQUENCES*![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
From the
Received for publication, August 6, 2004 , and in revised form, November 22, 2004.
How restriction enzymes with their different specificities and mode of cleavage evolved has been a long standing question in evolutionary biology. We have recently shown that several Type II restriction endonucleases, namely SsoII ( CCNGG), PspGI ( CCWGG), Eco-RII ( CCWGG), NgoMIV (G CCGGC), and Cfr10I (R CCGGY), which recognize similar DNA sequences (as indicated, where the downward arrows denote cleavage position), share limited sequence similarity over an interrupted stretch of 70 amino acid residues with MboI, a Type II restriction endonuclease from Moraxella bovis (Pingoud, V., Conzelmann, C., Kinzebach, S., Sudina, A., Metelev, V., Kubareva, E., Bujnicki, J. M., Lurz, R., Luder, G., Xu, S. Y., and Pingoud, A. (2003) J. Mol. Biol. 329, 913929). Nevertheless, MboI has a dissimilar DNA specificity ( GATC) compared with these enzymes. In this study, we characterize MboI in detail to determine whether it utilizes a mechanism of DNA recognition similar to SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I. Mutational analyses and photocross-linking experiments demonstrate that MboI exploits the stretch of 70 amino acids for DNA recognition and cleavage. It is therefore likely that MboI shares a common evolutionary origin with SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I. This is the first example of a relatively close evolutionary link between Type II restriction enzymes of widely different specificities.
Restriction enzymes are components of restriction-modification systems that protect bacteria and archaea from invading foreign DNA (13). Four different types of restriction enzymes are known (4) that differ in quaternary structure, cofactor dependence, and mode of DNA recognition and cleavage. Type II enzymes recognize short, typically palindromic sequences and cleave the DNA within or in close proximity to the recognition site (57). More than 3570 different Type II restriction enzymes with 240 unique specificities have been biochemically characterized (8). Remarkably, little sequence similarity has been observed between the more than 200 Type II restriction enzymes that have been sequenced to date. The few exceptions consist of isoschizomers that cleave the same sequence at the same position, e.g. EcoRI and RsrI (G AATTC) (9), MthTI and NgoPII (GG CC) (10), XmaI and CfrI (C CCGGG) (11), and Cfr10I and Bse634I (R CCGGY) (12). Still, most isoschizomers do not share significant sequence similarity. Limited sequence similarity has also been observed in some cases among restriction enzymes that recognize partially related sequences, e.g. EcoRI (G AATTC) and MunI (C AATTG) (13) and SsoII ( CCWGG) and PspGI ( CCNGG) (14). Thus, prior to 1995 the prevailing view was that restriction enzymes are not evolutionarily related (15, 16). This notion changed as additional crystal structures of restriction enzymes became available, clearly demonstrating these proteins to have a highly similar structural core that consists of a four-stranded -sheet flanked by -helices and that harbors the characteristic PD... (D/E)XK motif active site (1719). Furthermore, a statistical analysis revealed a significant correlation between the amino acid sequences ("genotype") of restriction enzymes and their recognition sequences and mode of cleavage ("phenotype"); these findings were interpreted as evidence for an evolutionary relationship among Type II restriction endonucleases (20). There is little doubt now that Type II restriction enzymes of the PD... (D/E)XK superfamily evolved via divergent evolution (21), a process that was accelerated by frequent exchange of restriction-modification systems through horizontal gene transfer among bacteria and archaea (22). Nevertheless, only slowly do we begin to understand the evolution of type II restriction enzymes.
As mentioned above, amino acid sequence similarities among Type II restriction enzymes appear to be restricted to enzymes that recognize the same or similar DNA sequences and that cleave the DNA at the same position. The paradigm is the correlation first detected for SsoII (
SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I belong to the superfamily of PD... (D/E)XK enzymes but have a variant configuration of the active site in which the second carboxylate (D/E) is replaced by serine (SsoII, NgoMIV, or Cfr10I), threonine (PspGI), or alanine (EcoRII). The catalytic role of the second carboxylate in these enzymes is assumed by a glutamate that is 15 or 16 residues downstream, as shown for Cfr10I by mutational replacement (34). That these five enzymes (and their close relatives, see above) belong to different families (Type IIP, Type IIE, and Type IIF) is interpreted as their having a common ancestor, presumably a homodimeric enzyme that already had the variant active site and that acquisition of an effector domain (Type IIE) or generation of a dimer-dimer interface (Type IIF) is a more recent event in evolution (23). Although this evolutionary relationship is well supported by a previous structure-function relationship study of SsoII and PspGI, the suggestion that other Type II restriction enzymes that recognize unrelated sequences, such as GATC, G GYRCC, and GG CC (25), are evolutionarily related to SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I has not yet been explored experimentally.
Here, we present a detailed biochemical analysis of MboI, a Type II restriction enzyme from Moraxella bovis that recognizes and cleaves the sequence
Site-directed Mutagenesis of MboI VariantsSite-directed mutagenesis of the MboI gene was performed using a PCR-based technique (38). The mutant genes were sequenced and found to contain only the desired mutation. Protein Expression and PurificationHis6-tagged MboI and its variants were expressed in Escherichia coli BL21 and purified to near homogeneity using nickel-nitrilotriacetic acid-agarose. Fractions containing pure MboI were collected and dialyzed against buffer D (10 mM Tris-HCl, 0.1 mM EDTA, 100 mM NaCl, 50% (v/v) glycerol, pH 8.0). DNA Binding AssayMboI protein dimer (0.5 nM to 10 µM) was mixed with 5 nM 32P-labeled 188-bp PCR product (containing one MboI recognition site) in binding buffer (10 mM Tris-HCl, 100 mM NaCl, 5 mM CaCl2, 100 µg/ml bovine serum albumin, 10% (v/v) glycerol, pH 8.5) supplemented with 0.1 µg of poly(dI-dC) (Amersham Biosciences). After incubation for 15 min at ambient temperature, the samples were applied to a 6% polyacrylamide gel, electrophoresed for 3.5 h at 80 V (10 V/cm) in 20 mM Tris acetate, 5 mM CaCl2, pH 8.5, and the gels subjected to autoradiography using an instant imager (Packard Instrument Co.). Binding constants were determined by a linear least-square fit of the data to a simple A + B = C binding model. Gel-shift experiments to compare DNA binding by MboI to a substrate containing two recognition sites were done with a 5 nM 32P-labeled 352-bp PCR product. DNA Cleavage AssayDNA cleavage activity of wild-type MboI and its variants was determined by incubating protein with 32P-labeled 188-bp PCR product in cleavage buffer (50 mM Tris-HCl, 100 mM NaCl, 10 mM MgCl2, 100 µg/ml bovine serum albumin, pH 8.0) at 37 °C for different times depending on the activity of the variant. Protein and DNA concentrations are indicated in Table I. Experiments designed to compare the rates of cleavage by wild-type MboI of one- versus two-site substrates were performed with two different PCR products containing one (188 bp) or two recognition sites (352 bp), respectively, in the absence or presence of a specific double-stranded oligodeoxyribonucleotide d(CATACGAGGATCCATTCGCT) (US20/LS20). Reaction conditions are given in the legend to Fig. 4. Reaction products were analyzed by electrophoresis on 10% polyacrylamide gels, which were subsequently subjected to autoradiography using an instant imager (Packard Instrument Co.).
Determination of the Mg2+- and Mn2+ Concentration Dependence Mg2+ (0.0550 mM) and Mn2+ (0.0052.5 mM) concentration dependence of DNA cleavage by MboI was measured with cleavage buffer (10 mM Tris-HCl, 1 mM dithiothreitol, 100 µg/ml bovine serum albumin, pH 7.5) supplemented with 0150 mM NaCl to maintain a constant ionic strength. Cleavage was measured with an enzyme concentration of 0.5 nM (Mg2+ dependence) or 1 nM (Mn2+ dependence) MboI dimer and a substrate concentration of 37 nM 188-bp PCR product containing a single cleavage site. Cleavage reactions were performed for 15 (Mg2+ dependence) or 30 (Mn2+ dependence) min at 37 °C. Determination of the Salt Concentration DependenceSalt concentration dependence of DNA cleavage by MboI was measured in buffers (10 mM Tris-HCl, 10 mM MgCl2, 1 mM dithiothreitol, 100 µg/ml bovine serum albumin, pH 8.0) containing varying concentrations of NaCl (0200 mM). Cleavage was measured at an enzyme concentration of 1.25 nM MboI dimer and a substrate concentration of 25 nM 188-bp PCR product containing a single cleavage site. Cleavage reactions were performed for 4 min at 37 °C. Determination of the pH DependenceA triple buffer containing 25 mM MES, 25 mM sodium acetate, and 50 mM Tris-HCl (39) was prepared at various pH values. This buffer was chosen to maintain constant ionic strength across the pH range examined. The cleavage rate was measured within the range pH 4.510 with an enzyme concentration of 2.5 nM MboI dimer and a substrate concentration of 25 nM 188-bp PCR product containing a single cleavage site. Cleavage reactions were performed for 4 min at 37 °C. Gel FiltrationGel filtration experiments were performed at room temperature on a Merck-Hitachi high-performance liquid chromatography system using a Superdex 75 column (Amersham Biosciences) equilibrated with 10 mM Tris-HCl, pH 8.5, 500 mM NaCl, 5 mM CaCl2. Wild type MboI was diluted in the same buffer, which was loaded onto the column, and chromatography was carried out at a flow rate of 0.5 ml/min. Elution was monitored by absorbance at 280 nm. The molecular mass was determined by interpolation, using a calibration curve of proteins of known molecular mass (molecular weight marker kit, Sigma).
Transmission Electron MicroscopyFor electron microscopy, DNA fragments of 188 and 352 bp in length, containing one or two recognition sites for MboI, respectively, were produced by PCR and purified. In a reaction volume of 20 µl, 30 ng of DNA was incubated with 30 or 60 ng of MboI (the molar ratio of recognition sites to MboI monomers was Photocross-linkingFor analytical scale photocross-linking, 10 µM MboI dimer was preincubated with 10 µM double-stranded eicosadeoxyribonucleotide monosubstituted with 5-IdU at various positions in the recognition site (underlined) or the two flanking positions, namely 5'-CATACGAGGATCCATTCGCT-3' (Thermo Hybaid) in buffer P (10 mM Tris-HCl, 100 mM NaCl, 5 mM CaCl2, pH 8.0) for 10 min at ambient temperature in a volume of 25 µl. Photocross-linking was carried out with a 40-milliwatt helium/cadmium laser emitting at 325 nm (Laser 2000). The total irradiation time was typically 45 min, or 090 min in kinetic experiments. Samples of 5 µl were taken before and after cross-linking and analyzed by electrophoresis on a 15% SDS-polyacrylamide gel. Gels were silver-stained, and radioactive bands were visualized by autoradiography with intensifying screens or using an imager. For preparative isolation of a cross-linked MboI-oligodeoxyribonucleotide complex (see below), 20 µM MboI dimer was preincubated with 20 µM double-stranded dodecadeoxyribonucleotide monosubstituted with 5-IdU at the A of the recognition sequence, namely 5'-CGAGGATCCATT-3' (US12-A-1/LS12, Biomers) in buffer P in a volume of 100 µl. A small amount of the oligodeoxyribonucleotide was radioactively 5'-end-labeled with 32P-phosphate. Digestion of the Cross-linked MboI-dodecadeoxyribonucleotide ComplexCross-linked complex (125 µl and 2.5 nmol) of MboI and dodecadeoxyribonucleotide US12-A-1/LS12 was incubated at 37 °C overnight with 15.5 µg of trypsin (sequencing grade, Roche Applied Science) or a combination of 9.5 µg of trypsin and 9.5 µg of chymotrypsin (sequencing grade, Roche Applied Science) or with 12.5 µg of proteinase K (Fermentas). Aliquots were analyzed by electrophoresis on a 15% polyacrylamide gel containing 0.5x TBE (89 mM Tris base, 89 mM boric acid, 2 mM EDTA, pH 8.3) and 2 M urea (41). After complete digestion, samples were precipitated in the presence of 1/10 volume of 4 M LiCl and 3 volumes of ethanol, dissolved in 0.1 M acetic acid, and loaded onto Fe3+-IMAC columns prepared from nickel-nitrilotriacetic acid spin columns (Qiagen) by Ni2+/Fe3+ exchange according to a previous study (42). Only unreacted oligodeoxyribonucleotides and peptide-DNA hetero-conjugates were retained on the IMAC column when washed as described before (43). The retained DNA-containing components were eluted with 2 x 600 µl of H2O, pH 10.5 (adjusted with NH3), and evaporated in a SpeedVac. The dried material was dissolved in 40 µl of H2O. Mass SpectrometryHydrogen fluoride (HF) treatment: selective cleavage of phosphodiester linkages was performed according to a previous study (44) using a modified protocol (45). Briefly, dried samples were incubated with 50 µl of 48% HF (Merck) at 4 °C overnight. The reagent was removed under a stream of nitrogen, and dried samples were dissolved in 50 µl of methanol and dried again; this procedure was repeated once. MALDI-MS and MS/MS spectra were recorded using an Ultraflex time-of-flight mass spectrometer (Bruker) equipped with a LIFT-MS/MS facility. Samples were prepared on an AnchorChipTM 600/384 target plate. For oligodeoxyribonucleotides and oligodeoxyribonucleotide-peptide heteroconjugates, a saturated solution of 3-hydroxypicolinic acid (Sigma-Aldrich) in acetonitrile/water (45:65, v/v), containing 10% diammoniumhydrogen citrate was used as matrix. Matrix solution (0.5 µl) was applied to the anchor chip and allowed to dry. Then 0.51 µl of analyte was added, and the chip was dried again. HF-treated cross-linked complexes were analyzed as described in detail previously (46). The MALDI mass spectra obtained were externally calibrated using either an oligodeoxyribonucleotide calibration standard or peptide calibration standard II (both from Bruker). Fragment ion analysis in the tandem time-of-flight (TOF-TOF) mode was performed as described (47). Protein Structure PredictionSecondary structure prediction and protein-fold recognition was carried out via the GeneSilico metaserver gateway (genesilico.pl/meta/ (48)). The reported alignments between the MboI sequence and structures of PD... (D/E)XK nucleases were used as starting points for homology modeling using the "Frankenstein's monster" approach, comprising cycles of model building, evaluation/scoring, realignment in poorly scored regions, and merging of best scoring fragments (49).
The present study was carried out with His6-tagged MboI (wild type and mutants) isolated from an overproducing E. coli strain and purified to near homogeneity by nickel-nitrilotriacetic acid-agarose.
Analysis of DNA Cleavage by MboIOptimum DNA cleavage conditions, a prerequisite for the analysis of structure-function relationships, were determined for MboI. Experiments carried out to assess the Mg2+/Mn2+, ionic strength, and pH dependence of DNA cleavage by MboI show that this enzyme, like many Type II restriction endonucleases, has an Mg2+ (Mn2+) optimum between 1 and 10 mM (0.1 and 1 mM, corrected for ionic strength), requires Classification of MboI as a Type IIP EnzymeThe presumptive distant relatives of MboI are Type IIP (SsoII and PspGI), Type IIE (EcoRII), and Type IIF (NgoMIV and Cfr10I) restriction enzymes. To ascertain the subtype of MboI, four types of experiments were carried out: (i) gel filtration, to determine the quaternary structure of MboI; (ii) electron microscopy, to establish whether MboI binds two recognition sites simultaneously; (iii) gel electrophoretic mobility shift experiments, to observe if two recognition sites on one DNA molecule are bound cooperatively by MboI and; and (iv) DNA cleavage experiments, to decide whether the rate of cleavage of a substrate having one site can be increased by addition of an oligodeoxyribonucleotide substrate (activation in trans).
Gel filtration runs with MboI and marker proteins show that MboI is eluted between bovine serum albumin and ovalbumin (data not shown). The apparent molecular mass of
Mutational Analysis of Residues in MboI Considered Important for DNA Binding and CleavageBased on the alignment shown in Fig. 1, we selected 15 amino acids for a mutational analysis, which included DNA binding and cleavage experiments. The amino acid residues selected fall into three groups: (i) residues 138149 (R-box), whose counterparts in NgoMIV are involved in DNA base and backbone contacts on the 3'-side of the recognition sequence; (ii) residues 186201, comprising the PD... (D/E)XK motif; and (iii) residues 203215, comprising a region that in NgoMIV is responsible for some of the nucleobase contacts. The results of the mutational analysis are summarized in Table I.
Asn-138, conserved among the close MboI relatives (which all recognize MboI has an unusual PD... (D/E)XK motif, in which Pro is replaced by Phe and Lys by Asn. The FD... EXN sequence is, however, fully conserved among the MboI relatives. To demonstrate that this sequence is indeed representing the catalytic center, Asp-186, Glu-199, and Asn-201 were substituted by Ala: the resulting variants are catalytically inactive, but show unaltered DNA binding affinity, which is not unusual for Ala substitutions of the PD... (D/E)XK residues of Type II restriction enzymes (compare with the "Discussion"). Interestingly, the N201K variant is catalytically inactive and impaired in DNA binding. Replacing a nearby Tyr (Tyr-203) by Ala leads to a variant with similar properties to the N201K variant. Based on the alignment with NgoMIV, which uses Asp-193 and Arg-194 for recognition of the central 4 bp of its 6-bp recognition sequence, we have replaced in MboI the corresponding residues Ser-208 and Lys-209, which are conserved among the close relatives of MboI, as well as three adjacent residues, which are partially conserved among the close MboI relatives and have functional groups that could be used for base recognition (e.g. Asn-211, Glu-212, and Arg-215). The S208A and K209A variants are catalytically inactive and show a strong reduction in DNA binding affinity, as expected for residues involved in base recognition. Intriguingly, the K209R variant is not compromised in DNA binding, but is almost inactive in DNA cleavage, suggesting that Lys-209 is required for DNA binding (a function that can be taken over by Arg) and coupling of recognition to catalysis (a function that requires a precise interaction, which cannot be taken over by Arg). The N211A variant has near wild-type properties, whereas the E212Q as well as the R215A and R215K variants bind to DNA with wild-type affinity but are almost inactive in DNA cleavage.
Photocross-linking of MboI with 5-IdU-substituted OligodeoxyribonucleotidesA photocross-linking approach utilizing 5-iododeoxyuridine (5-IdU)-substituted oligodeoxyribonucleotides was used to determine whether the DNA target site GATC is in close contact to amino acid residues of MboI that are suggested by the alignment with EcoRII, SsoII, PspGI, and NgoMIV (Fig. 1) to be involved in DNA binding and cleavage. For this purpose, six double-stranded eicosadeoxyribonucleotides with a central GATC site and monosubstituted in one strand with 5-IdU were synthesized. The substitutions were localized in the recognition site and the two adjacent positions. These oligodeoxyribonucleotides were tested for cleavage by MboI. It was found that oligodeoxyribonucleotides with 5-IdU substitutions at the positions G, A, and C of the GATC recognition sequence cannot be cleaved by MboI but were all demonstrated to bind to MboI in the presence of Ca2+ (41). Results of the photocross-linking reactions are shown in Fig. 5A. Only the oligodeoxyribonucleotide that in one strand had the 5-IdU substitution at the adenine in the GATC recognition site showed a detectable cross-link on a silver-stained gel. A time course of the photocross-linking reaction with a dodecadeoxyribonucleotide with a GATC site, in which in one strand the alanine was substituted by 5-IdU, indicated that
To identify the amino acid in MboI involved in the cross-link with the oligodeoxyribonucleotide, preparative scale photocross-linking was carried out with a double-stranded dodecadeoxyribonucleotide having a central GATC site in which the adenine was substituted in one strand by 5-IdU. After photocross-linking, trypsin, chymotrypsin plus trypsin, or proteinase K was added to aliquots of the irradiation mixture. The progress of proteolysis was analyzed by electrophoresis on denaturing gels containing 2 M urea (41). Peptide-oligodeoxyribonucleotide heteroconjugates were isolated from the proteolytic digestion mixture by Fe3+-IMAC and analyzed by MALDI-TOF mass spectrometry (Fig. 6). The mass differences between the heteroconjugates and the unreacted DNA, corrected for the loss of hydrogen iodide during the reaction, allowed us to calculate the masses of the peptide portions. The trypsin digestion gave a fragment with a molecular weight of 2379.6, which was assigned to the peptide TYVIETNYYNSGGSKLNEVAR, having a theoretical average molecular weight of 2379.59. The corresponding results with the chymotrypsin plus trypsin and proteinase K digestions were 1231.9 (observed molecular weight)/NSGGSKLNEVAR/1232.34 (theoretical molecular weight) and 574.8 (observed molecular weight)/GGSKLN/575.64 (theoretical mass), respectively (Fig. 6A). These results allowed narrowing down the cross-link site to Ser-208, Lys-209, and Asn-211, which are reasonable candidates for an attack by a uridyl radical produced through homolytic cleavage of the IC bond in iodouracil. To determine the exact site of cross-linking, the peptide-oligodeoxyribonucleotide heteroconjugates were treated with HF to cleave all phosphodiester bonds in the peptide-oligodeoxyribonucleotide heteroconjugates. The MALDI-TOF analyses of the resulting dU-modified peptides are shown in Fig. 6B. Observed masses closely match the theoretical masses. Fig. 6C shows the result of the MS/MS analysis of the tryptic peptide-dU heteroconjugate, in which Lys-209 was identified as the amino acid residue cross-linked to the 5-IdU-substituted oligodeoxyribonucleotide.
We produced the K209A variant to confirm that Lys-209 is the cross-linking site in MboI and tried to cross-link this variant with the 5-IdU-substituted oligodeoxyribonucleotide. As shown in Fig. 5B, K209A cannot be cross-linked to this oligodeoxyribonucleotide. The variant K209R, however, which has wild type binding affinity (despite the fact that it is almost inactive in DNA cleavage, see above), is able to form a cross-link with the oligodeoxyribonucleotide in which the deoxyadenosine had been replaced by 5-IdU, at a reduced yield compared with wild-type MboI. To facilitate experimental data interpretation in the sequence-structure-function context, we constructed a theoretical model of the MboI catalytic domain (residues 137280) in complex with target DNA. The homology model of the monomer is based on the fold recognition analysis (see "Experimental Procedures"). The mutual orientation of the two MboI monomers as well as the coordinates of the DNA molecule and the Mg2+ ions was derived from superposition onto subunits A and B in the NgoMIV co-crystal structure. The GATC site was generated by "mutating" bases in the original NgoMIV target (CCGG) and optimizing their geometry using HyperChem 7.1 (Hypercube, Inc.). The final model of the MboI dimer-GATC sequence complex (available from genesilico.pl/iamb/models/R.MboI/R.MboI.model.pdb) was obtained after removing steric clashes between residues from both monomers and the DNA by selecting different rotamers of the respective side chains (Figs. 7 and 8).
The PD... (D/E)XK superfamily of Type II restriction endonucleases is divided into an EcoRI and EcoRV branch (17). This division is based on structural (i.e. dimerization scheme and topology of secondary structure elements) and functional differences: enzymes that belong to the EcoRI branch (BamHI, BglII, Bse634I, BsoBI, Cfr10I, EcoRI, EcoRII, FokI, MunI, and NgoMIV) usually approach the DNA from the major groove side (17), recognize the DNA mainly via an -helix and a loop ( -class (52)) and in general produce 5'-staggered ends, whereas enzymes of the EcoRV branch (BglI, EcoRV, HincII, NaeI, and PvuII) usually approach the DNA from the minor groove side (17), use a -strand and a -like turn for DNA recognition ( -class (52)), and in general produce blunt or 3'-staggered ends. Therefore, the two branches were termed " " and " ," respectively (53). Structural variations within the EcoRI ( ) and EcoRV ( ) branch were used for a cladistic analysis of the members of the two branches using methods based on protein structure rather than amino acid sequence (54). This analysis allows inferences regarding evolutionary distances and construction of an evolutionary tree of Type II restriction enzymes of known three-dimensional structure (21).
In the absence of detailed structural information, predictions regarding phylogenetic relationships among restriction endonucleases can be made on the basis of amino acid sequence similarities. However, with few exceptions concerning isoschizomers, restriction enzymes do not show sufficient sequence similarities to infer phylogenetic relationships without additional structural and/or functional information. Based on sequence alignments, we previously proposed that SsoII, PspGI, and EcoRII belong to the EcoRI branch and are structurally related to Bse634I, Cfr10I, and NgoMIV (23, 25). This hypothesis was supported by a detailed experimental analysis and homology modeling, which demonstrated amino acid residues of SsoII and PspGI that align with functionally important residues in Bse634I, Cfr10I, and NgoMIV to be indeed essential for DNA binding and catalysis by SsoII and PspGI. Our suggestion to include EcoRII in this comparison has been validated by the recently published crystal structure analysis of EcoRII (33). These enzymes recognize similar sequences (e.g. SsoII In the present study, we followed the approach we developed to demonstrate an evolutionary relationship between SsoII, PspGI, and EcoRII on one side and Bse634I, Cfr10I, and NgoMIV on the other (23, 25). This approach consists of detailed biochemical analyses of the restriction enzyme, mutational analyses of amino acids assumed to be of functional importance, identification of amino acids involved in DNA binding by cross-linking, and homology modeling of the protein-DNA complex. Our results show that MboI is a typical Type II restriction enzyme with a divalent cation requirement, pH, and ionic strength dependence similar to that of PspGI (25). Like PspGI and SsoII, but unlike EcoRII (Type IIE) and Bse634I, and Cfr10I and NgoMIV (Type IIF), it is a Type IIP enzyme, because it interacts with DNA as a homodimer and does not require binding to a second recognition site for efficient DNA cleavage.
The mutational analysis shows that amino acid residues from the R-box (Table I), well conserved among the close MboI relatives, among them DpnII (Fig. 1), are essential for DNA binding and cleavage. The corresponding alanine variants of MboI are catalytically inactive and bind to DNA by approximately three orders of magnitude less than the wild type enzyme. A similar result was obtained with SsoII and PspGI, suggesting that the arginine residues of the R-box play an important role for these enzymes (23, 25). This is emphasized by the finding that also the lysine for arginine substitution of Arg-140 and Arg-143 in MboI is deleterious for DNA binding and cleavage. The amino acids of the R-box are not conserved among Bse634I, Cfr10I, and NgoMIV, but rather are replaced by hydrophilic residues. The co-crystal structure of an NgoMIV-DNA product complex shows that amino acids corresponding to R-box residues are involved in contacts to the region at the 3'-end of the recognition sequence, e.g. Gln-63 (which aligns with Asn-142 in MboI) is involved in a hydrogen bond to the N4 of the outer cytosine (underlined) of the G Six residues C-terminal of the R-box is a glutamic acid that is conserved among the restriction enzymes under consideration in this study; in MboI, however, it is replaced by a serine (Fig. 1). In NgoMIV, this glutamic acid (Glu-70) is within <0.4 nm of Asp-140, the first carboxylate of the variant PD... (D/E)XK motif. The equivalent residue in MboI is Ser-149. It cannot be substituted by glutamic acid without interfering with catalysis, presumably because the active site of MboI has been fine-tuned in the framework of the entire structure. The catalytic motif in MboI is a variation of the classic PD... (D/E)XK motif (PDX4655KX13E (55)), inasmuch as the conserved lysine is substituted by asparagine. Notably, it is another variation than that found in EcoRII, SsoII, PspGI, NgoMIV, Cfr10I, and Bse634I. The mutational analysis clearly shows that Asp-186, Glu-199, and Asn-201 are essential for catalysis by MboI; furthermore, Asn-201 cannot be replaced by lysine, although this replacement would have reconstituted the classic PD... (D/E)XK motif. Replacements of the active site carboxylates by alanine do not affect the affinity of the MboI variants for their DNA substrate. This is in line with results obtained for other restriction enzymes (56, 57). In NgoMIV, base-specific contacts to the two inner base pairs are made by amino acids 26 residues downstream from the catalytic lysine (Lys-187), among them Thr-189, Asp-193, and Arg-194; equivalent to Tyr-203, Ser-208, and Lys-209, respectively, in MboI. Replacing these residues by alanine led to inactive mutants, which in addition show a greatly decreased affinity for their DNA substrate, as observed for the R-box variants and as expected for residues involved in recognition. The replacement of Lys-209 by arginine did not compromise DNA binding but is deleterious for DNA cleavage activity. This indicates that the arginine for lysine substitution interferes with the coupling of recognition and catalysis.
Asn-197 in NgoMIV makes a backbone contact to the 5'-phosphate after cleavage of the scissile bond. The corresponding residue in MboI is Glu-212; its replacement by glutamine (as in EcoRII and Cfr10I) renders the variant almost inactive, without affecting the affinity for its substrate. A similar effect is observed for the R215A variant. Based on homology with NgoMIV, it is assumed that this region of MboI is involved in backbone interactions, presumably at the 5'-end of the
Taken together, these results allowed us to model the structure of MboI. Fig. 7 shows a secondary structure prediction for the catalytic domain of MboI in comparison with the secondary structure of EcoRII and NgoMIV, deduced from the crystal structures (31, 33). The active site residues are indicated for all three enzymes, whereas the residues involved in DNA binding are only indicated for MboI (based on the mutational analysis) and NgoMIV (based on the co-crystal structure). Fig. 8 shows the stretch of
In two previous reports (23, 25) we showed that the Type II restriction endonucleases SsoII and PspGI share significant sequence similarities with EcoRII, NgoMIV, Cfr10I, and Bse634I. It was argued that these enzymes, which recognize similar sequences and have a similar catalytic center (a variant PD... (D/E)XK motif), are evolutionarily closely related, despite belonging to different sub-types. We have now extended this study to MboI, which recognizes a very different sequence, but makes use of the same structural elements to recognize and cleave DNA using a catalytic center that constitutes another variant of the PD... (D/E)XK motif. Having confirmed the functional significance of the sequence similarities between MboI, SsoII, PspGI, EcoRII, NgoMIV, Cfr10I, and Bse634I, we conclude that these enzymes are evolutionarily related (Fig. 9). They belong to the EcoRI branch of Type II restriction enzymes that use an -helix and its preceding loop for DNA recognition. Because MboI has only a slight variation of the PD... (D/E)XK (K being substituted by N) motif, we infer that MboI, a Type IIP enzyme, resembles the common ancestor more than other enzymes that have a larger variation of the catalytic motif. In these enzymes, the second carboxylate of the catalytic motif, usually found on the -strand preceding the recognition helix, is recruited from this helix. Two isoschizomers of MboI, namely Sau3AI and MutH (a nuclease involved in mismatch repair, but related to Type II restriction enzymes), in contrast to MboI belong to the EcoRV branch; they are related to each other but not to MboI. Furthermore, whereas MutH is a monomer that nicks DNA, Sau3AI is a monomer that dimerizes on the DNA and makes a double strand cut, activated by binding to a second recognition site (like a Type IIE enzyme) (40, 58). MboI and Sau3AI/MutH are therefore examples of independent invention of the same functionality (specificity toward GATC), i.e. a bona fide case of functional convergence of two proteins that shared a common ancestor, but underwent extreme divergence. Finally, it appears from our study that the ... ![]() ![]() unit that characterizes the stretch of 70 amino acids of sequence similarity has been used in a subset of enzymes of the EcoRI branch to evolve widely different specificities. It will be interesting to observe whether restriction endonucleases recognizing G GYRCC, such as BanI, and GG CC, such as FnuDI, which share sequence similarity with MboI (25, 59), indeed belong to the EcoRI branch of restriction endonucleases and to determine which elements they use for specific sequence recognition.
* This work was supported by the Deutsche Forschungsgemeinschaft (Grants Pi 122/13-4 and SFB535,Z1), the Fonds der Chemischen Industrie, the Deutscher Akademischer Austauschdienst (International Quality Network Biochemistry of Nucleic Acids), Grant 07.03.066 from the State Program Universities of Russia, and Grant 3P04A001124 from the State Committee for Scientific Research in Poland. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
This report is dedicated to Prof. Dr. G. Maass on the occasion of his 70th birthday.
1 The abbreviations used are: 5-IdU, 5-iododeoxyuridine; MES, 4-morpholineethanesulfonic acid; IMAC, immobilized metal-affinity chromatography; HF, hydrogen fluoride; MALDI, matrix-assisted laser desorption ionization; MS, mass spectrometry; TOF, time-of-flight.
We thank Drs. Liqing Chen and Hwai-Chen Guo for communicating results prior to publication, Dr. Rudolf Geyer for valuable suggestions, Dr. Dagmar Niemeyer for helpful discussions, Dr. George Silva for critical reading of the manuscript, and Nadine Thome for expert technical assistance.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||