![]()
|
|
||||||||
J. Biol. Chem., Vol. 280, Issue 6, 4289-4298, February 11, 2005
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||









From the
Institut für Biochemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 58, D-35392 Giessen, Germany, the ¶A. N. Belozersky Institute of Physico-chemical Biology and Chemistry Department, Moscow State University, Moscow 119899, Russia, the ||Biochemisches Institut, Justus-Liebig-Universität, Friedrichstrasse 24, D-35392 Giessen, Germany, the **Bioinformatics Laboratory, International Institute of Molecular and Cell Biology, Trojdena 4, 02-108 Warsaw, Poland, the 
Max-Planck-Institut für Molekulare Genetik, Ihnestrasse 63-73, D-14195 Berlin, Germany, and ¶¶New England Biolabs, Beverly, Massachusetts 01915
Received for publication, August 6, 2004 , and in revised form, November 22, 2004.
| ABSTRACT |
|---|
|
|
|---|
CCNGG), PspGI (
CCWGG), Eco-RII (
CCWGG), NgoMIV (G
CCGGC), and Cfr10I (R
CCGGY), which recognize similar DNA sequences (as indicated, where the downward arrows denote cleavage position), share limited sequence similarity over an interrupted stretch of
70 amino acid residues with MboI, a Type II restriction endonuclease from Moraxella bovis (Pingoud, V., Conzelmann, C., Kinzebach, S., Sudina, A., Metelev, V., Kubareva, E., Bujnicki, J. M., Lurz, R., Luder, G., Xu, S. Y., and Pingoud, A. (2003) J. Mol. Biol. 329, 913929). Nevertheless, MboI has a dissimilar DNA specificity (
GATC) compared with these enzymes. In this study, we characterize MboI in detail to determine whether it utilizes a mechanism of DNA recognition similar to SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I. Mutational analyses and photocross-linking experiments demonstrate that MboI exploits the stretch of
70 amino acids for DNA recognition and cleavage. It is therefore likely that MboI shares a common evolutionary origin with SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I. This is the first example of a relatively close evolutionary link between Type II restriction enzymes of widely different specificities. | INTRODUCTION |
|---|
|
|
|---|
240 unique specificities have been biochemically characterized (8). Remarkably, little sequence similarity has been observed between the more than 200 Type II restriction enzymes that have been sequenced to date. The few exceptions consist of isoschizomers that cleave the same sequence at the same position, e.g. EcoRI and RsrI (G
AATTC) (9), MthTI and NgoPII (GG
CC) (10), XmaI and CfrI (C
CCGGG) (11), and Cfr10I and Bse634I (R
CCGGY) (12). Still, most isoschizomers do not share significant sequence similarity. Limited sequence similarity has also been observed in some cases among restriction enzymes that recognize partially related sequences, e.g. EcoRI (G
AATTC) and MunI (C
AATTG) (13) and SsoII (
CCWGG) and PspGI (
CCNGG) (14). Thus, prior to 1995 the prevailing view was that restriction enzymes are not evolutionarily related (15, 16). This notion changed as additional crystal structures of restriction enzymes became available, clearly demonstrating these proteins to have a highly similar structural core that consists of a four-stranded
-sheet flanked by
-helices and that harbors the characteristic PD... (D/E)XK motif active site (1719). Furthermore, a statistical analysis revealed a significant correlation between the amino acid sequences ("genotype") of restriction enzymes and their recognition sequences and mode of cleavage ("phenotype"); these findings were interpreted as evidence for an evolutionary relationship among Type II restriction endonucleases (20). There is little doubt now that Type II restriction enzymes of the PD... (D/E)XK superfamily evolved via divergent evolution (21), a process that was accelerated by frequent exchange of restriction-modification systems through horizontal gene transfer among bacteria and archaea (22). Nevertheless, only slowly do we begin to understand the evolution of type II restriction enzymes.
As mentioned above, amino acid sequence similarities among Type II restriction enzymes appear to be restricted to enzymes that recognize the same or similar DNA sequences and that cleave the DNA at the same position. The paradigm is the correlation first detected for SsoII (
CCNGG) and PspGI (
CCWGG) (14) that was subsequently verified and extended to include EcoRII (
CCWGG), NgoMIV (G
CCGGC), and Cfr10I (R
CCGGY) (2325). It is noteworthy that this group of related enzymes and their close relatives (SsoII: Kpn2kI/Ecl18kI/StyD4I/SenPI; Cfr10I: Bse634I/BsrFI) comprises: (i) Type IIP enzymes such as SsoII and PspGI, homodimeric enzymes that recognize palindromic sequences; (ii) Type IIE enzymes such as EcoRII, homodimeric enzymes that need two recognition sequences for DNA cleavage, one sequence being cleaved and the other serving as an effector (2629); and (iii) Type IIF enzymes such as NgoMIV and Cfr10I, homotetrameric enzymes that cleave, in a concerted manner, DNA with two recognition sites (26, 27, 29, 30). These enzymes share sequence similarities over a stretch of
70 amino acid residues, interrupted by two gaps of variable length (Fig. 1). This region is part of the conserved core of the PD... (D/E)XK superfamily of Type II restriction endonucleases. In NgoMIV, for which a co-crystal structure is available (31), this region encompasses helices 3 and 7 and
-strands 2 and 3 that are involved in DNA recognition and cleavage. The involvement of this region in DNA binding and cleavage by Cfr10I and EcoRII is also apparent from crystal structures (32, 33). For SsoII and PspGI, mutational analysis and protein-DNA cross-linking determined this region to be likewise important (23, 25).
|
GATC, G
GYRCC, and GG
CC (25), are evolutionarily related to SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I has not yet been explored experimentally.
Here, we present a detailed biochemical analysis of MboI, a Type II restriction enzyme from Moraxella bovis that recognizes and cleaves the sequence
GATC (35, 36). MboI shares sequence similarity with EcoRII, NgoMIV, Cfr10I, SsoII, and PspGI over precisely the region in these enzymes involved in DNA binding and cleavage. However, as evident in Fig. 1, unlike these enzymes, MboI does not have the variant PD... (D/E)XK motif but rather another variation, namely FD... ETN, related to the catalytic motif of BglII, i.e. ID.. .EVQ (37). Our results demonstrate that MboI, like SsoII and PspGI, is a Type IIP enzyme: a homodimer, both in the absence and presence of DNA, that cleaves DNA with one site as efficiently as DNA with two sites. Extensive mutational analysis reveals amino acid residues that appear conserved in the alignment with SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I are critical for DNA binding and/or catalysis by MboI. Lys-209 in MboI, suggested by the alignment with Arg-194 in NgoMIV to make a base-specific contact, forms a zero-length cross-link with 5-iododeoxyuridine (5-IdU)1 in an oligodeoxyribonucleotide carrying a GATC recognition site in which the A is substituted by 5-IdU. We conclude from these results that MboI shares a common structure and evolutionary origin with SsoII, PspGI, EcoRII, NgoMIV, and Cfr10I. This is the first example of a close evolutionary link between Type II restriction enzymes of completely different specificities.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Protein Expression and PurificationHis6-tagged MboI and its variants were expressed in Escherichia coli BL21 and purified to near homogeneity using nickel-nitrilotriacetic acid-agarose. Fractions containing pure MboI were collected and dialyzed against buffer D (10 mM Tris-HCl, 0.1 mM EDTA, 100 mM NaCl, 50% (v/v) glycerol, pH 8.0).
DNA Binding AssayMboI protein dimer (0.5 nM to 10 µM) was mixed with 5 nM 32P-labeled 188-bp PCR product (containing one MboI recognition site) in binding buffer (10 mM Tris-HCl, 100 mM NaCl, 5 mM CaCl2, 100 µg/ml bovine serum albumin, 10% (v/v) glycerol, pH 8.5) supplemented with 0.1 µg of poly(dI-dC) (Amersham Biosciences). After incubation for 15 min at ambient temperature, the samples were applied to a 6% polyacrylamide gel, electrophoresed for 3.5 h at 80 V (10 V/cm) in 20 mM Tris acetate, 5 mM CaCl2, pH 8.5, and the gels subjected to autoradiography using an instant imager (Packard Instrument Co.). Binding constants were determined by a linear least-square fit of the data to a simple A + B = C binding model. Gel-shift experiments to compare DNA binding by MboI to a substrate containing two recognition sites were done with a 5 nM 32P-labeled 352-bp PCR product.
DNA Cleavage AssayDNA cleavage activity of wild-type MboI and its variants was determined by incubating protein with 32P-labeled 188-bp PCR product in cleavage buffer (50 mM Tris-HCl, 100 mM NaCl, 10 mM MgCl2, 100 µg/ml bovine serum albumin, pH 8.0) at 37 °C for different times depending on the activity of the variant. Protein and DNA concentrations are indicated in Table I. Experiments designed to compare the rates of cleavage by wild-type MboI of one- versus two-site substrates were performed with two different PCR products containing one (188 bp) or two recognition sites (352 bp), respectively, in the absence or presence of a specific double-stranded oligodeoxyribonucleotide d(CATACGAGGATCCATTCGCT) (US20/LS20). Reaction conditions are given in the legend to Fig. 4. Reaction products were analyzed by electrophoresis on 10% polyacrylamide gels, which were subsequently subjected to autoradiography using an instant imager (Packard Instrument Co.).
|
|
Determination of the Salt Concentration DependenceSalt concentration dependence of DNA cleavage by MboI was measured in buffers (10 mM Tris-HCl, 10 mM MgCl2, 1 mM dithiothreitol, 100 µg/ml bovine serum albumin, pH 8.0) containing varying concentrations of NaCl (0200 mM). Cleavage was measured at an enzyme concentration of 1.25 nM MboI dimer and a substrate concentration of 25 nM 188-bp PCR product containing a single cleavage site. Cleavage reactions were performed for 4 min at 37 °C.
Determination of the pH DependenceA triple buffer containing 25 mM MES, 25 mM sodium acetate, and 50 mM Tris-HCl (39) was prepared at various pH values. This buffer was chosen to maintain constant ionic strength across the pH range examined. The cleavage rate was measured within the range pH 4.510 with an enzyme concentration of 2.5 nM MboI dimer and a substrate concentration of 25 nM 188-bp PCR product containing a single cleavage site. Cleavage reactions were performed for 4 min at 37 °C.
Gel FiltrationGel filtration experiments were performed at room temperature on a Merck-Hitachi high-performance liquid chromatography system using a Superdex 75 column (Amersham Biosciences) equilibrated with 10 mM Tris-HCl, pH 8.5, 500 mM NaCl, 5 mM CaCl2. Wild type MboI was diluted in the same buffer, which was loaded onto the column, and chromatography was carried out at a flow rate of 0.5 ml/min. Elution was monitored by absorbance at 280 nm. The molecular mass was determined by interpolation, using a calibration curve of proteins of known molecular mass (molecular weight marker kit, Sigma).
Transmission Electron MicroscopyFor electron microscopy, DNA fragments of 188 and 352 bp in length, containing one or two recognition sites for MboI, respectively, were produced by PCR and purified. In a reaction volume of 20 µl, 30 ng of DNA was incubated with 30 or 60 ng of MboI (the molar ratio of recognition sites to MboI monomers was
3.5 or 7, respectively) in 10 mM Tris-HCl, pH 8.5, 100 mM NaCl, 5 mM CaCl2 for 15 min at 23 °C, fixed for 10 min with 0.2% (w/v) glutaraldehyde, and prepared by adsorption to mica as described previously (40).
Photocross-linkingFor analytical scale photocross-linking, 10 µM MboI dimer was preincubated with 10 µM double-stranded eicosadeoxyribonucleotide monosubstituted with 5-IdU at various positions in the recognition site (underlined) or the two flanking positions, namely 5'-CATACGAGGATCCATTCGCT-3' (Thermo Hybaid) in buffer P (10 mM Tris-HCl, 100 mM NaCl, 5 mM CaCl2, pH 8.0) for 10 min at ambient temperature in a volume of 25 µl. Photocross-linking was carried out with a 40-milliwatt helium/cadmium laser emitting at 325 nm (Laser 2000). The total irradiation time was typically 45 min, or 090 min in kinetic experiments. Samples of 5 µl were taken before and after cross-linking and analyzed by electrophoresis on a 15% SDS-polyacrylamide gel. Gels were silver-stained, and radioactive bands were visualized by autoradiography with intensifying screens or using an imager. For preparative isolation of a cross-linked MboI-oligodeoxyribonucleotide complex (see below), 20 µM MboI dimer was preincubated with 20 µM double-stranded dodecadeoxyribonucleotide monosubstituted with 5-IdU at the A of the recognition sequence, namely 5'-CGAGGATCCATT-3' (US12-A-1/LS12, Biomers) in buffer P in a volume of 100 µl. A small amount of the oligodeoxyribonucleotide was radioactively 5'-end-labeled with 32P-phosphate.
Digestion of the Cross-linked MboI-dodecadeoxyribonucleotide ComplexCross-linked complex (125 µl and 2.5 nmol) of MboI and dodecadeoxyribonucleotide US12-A-1/LS12 was incubated at 37 °C overnight with 15.5 µg of trypsin (sequencing grade, Roche Applied Science) or a combination of 9.5 µg of trypsin and 9.5 µg of chymotrypsin (sequencing grade, Roche Applied Science) or with 12.5 µg of proteinase K (Fermentas). Aliquots were analyzed by electrophoresis on a 15% polyacrylamide gel containing 0.5x TBE (89 mM Tris base, 89 mM boric acid, 2 mM EDTA, pH 8.3) and 2 M urea (41). After complete digestion, samples were precipitated in the presence of 1/10 volume of 4 M LiCl and 3 volumes of ethanol, dissolved in 0.1 M acetic acid, and loaded onto Fe3+-IMAC columns prepared from nickel-nitrilotriacetic acid spin columns (Qiagen) by Ni2+/Fe3+ exchange according to a previous study (42). Only unreacted oligodeoxyribonucleotides and peptide-DNA hetero-conjugates were retained on the IMAC column when washed as described before (43). The retained DNA-containing components were eluted with 2 x 600 µl of H2O, pH 10.5 (adjusted with NH3), and evaporated in a SpeedVac. The dried material was dissolved in 40 µl of H2O.
Mass SpectrometryHydrogen fluoride (HF) treatment: selective cleavage of phosphodiester linkages was performed according to a previous study (44) using a modified protocol (45). Briefly, dried samples were incubated with 50 µl of 48% HF (Merck) at 4 °C overnight. The reagent was removed under a stream of nitrogen, and dried samples were dissolved in 50 µl of methanol and dried again; this procedure was repeated once.
MALDI-MS and MS/MS spectra were recorded using an Ultraflex time-of-flight mass spectrometer (Bruker) equipped with a LIFT-MS/MS facility. Samples were prepared on an AnchorChipTM 600/384 target plate. For oligodeoxyribonucleotides and oligodeoxyribonucleotide-peptide heteroconjugates, a saturated solution of 3-hydroxypicolinic acid (Sigma-Aldrich) in acetonitrile/water (45:65, v/v), containing 10% diammoniumhydrogen citrate was used as matrix. Matrix solution (0.5 µl) was applied to the anchor chip and allowed to dry. Then 0.51 µl of analyte was added, and the chip was dried again. HF-treated cross-linked complexes were analyzed as described in detail previously (46). The MALDI mass spectra obtained were externally calibrated using either an oligodeoxyribonucleotide calibration standard or peptide calibration standard II (both from Bruker). Fragment ion analysis in the tandem time-of-flight (TOF-TOF) mode was performed as described (47).
Protein Structure PredictionSecondary structure prediction and protein-fold recognition was carried out via the GeneSilico metaserver gateway (genesilico.pl/meta/ (48)). The reported alignments between the MboI sequence and structures of PD... (D/E)XK nucleases were used as starting points for homology modeling using the "Frankenstein's monster" approach, comprising cycles of model building, evaluation/scoring, realignment in poorly scored regions, and merging of best scoring fragments (49).
| RESULTS |
|---|
|
|
|---|
Analysis of DNA Cleavage by MboIOptimum DNA cleavage conditions, a prerequisite for the analysis of structure-function relationships, were determined for MboI. Experiments carried out to assess the Mg2+/Mn2+, ionic strength, and pH dependence of DNA cleavage by MboI show that this enzyme, like many Type II restriction endonucleases, has an Mg2+ (Mn2+) optimum between 1 and 10 mM (0.1 and 1 mM, corrected for ionic strength), requires
100 mM salt for optimal activity, and shows an increase in cleavage activity with pH in the range between 5 and 10 (data not shown).
Classification of MboI as a Type IIP EnzymeThe presumptive distant relatives of MboI are Type IIP (SsoII and PspGI), Type IIE (EcoRII), and Type IIF (NgoMIV and Cfr10I) restriction enzymes. To ascertain the subtype of MboI, four types of experiments were carried out: (i) gel filtration, to determine the quaternary structure of MboI; (ii) electron microscopy, to establish whether MboI binds two recognition sites simultaneously; (iii) gel electrophoretic mobility shift experiments, to observe if two recognition sites on one DNA molecule are bound cooperatively by MboI and; and (iv) DNA cleavage experiments, to decide whether the rate of cleavage of a substrate having one site can be increased by addition of an oligodeoxyribonucleotide substrate (activation in trans).
Gel filtration runs with MboI and marker proteins show that MboI is eluted between bovine serum albumin and ovalbumin (data not shown). The apparent molecular mass of
60 kDa indicates that MboI is a homodimer of the 32-kDa subunit, excluding it being a homotetrameric Type IIF enzyme. Electron microscopy experiments show that MboI does not bind to two recognition sites simultaneously but rather that on DNA with two MboI sites, one site is occupied preferentially (Fig. 2). Gel electrophoretic mobility shift experiments confirm that MboI does not bind cooperatively to two sites on DNA (Fig. 3). These results already suggest that MboI most likely is neither a Type IIF nor a Type IIE enzyme, which normally bind two sites simultaneously. This was confirmed by DNA cleavage experiments in the presence of increasing amounts of an oligodeoxyribonucleotide substrate (Fig. 4) that clearly demonstrate there is no activation in trans, as would have been expected for a Type IIE enzyme (50, 51). Furthermore, MboI does not cleave a substrate with two sites more readily than a substrate with one site (data not shown). Taken together, these data allow classifying MboI as an orthodox Type IIP enzyme, i.e. a homodimer that binds to and cleaves one site at a time and does not require allosteric activation by an effector site.
|
|
Asn-138, conserved among the close MboI relatives (which all recognize
GATC) and also present in EcoRII, can be replaced by Ala without affecting DNA binding and cleavage significantly. In contrast, Arg-140 and Arg-143, which define the R-box and are conserved in the MboI relatives as well as in EcoRII, SsoII, and PspGI, are very critical for DNA binding and cleavage. Their replacement by Ala but also by Lys leads to catalytically inactive (Ala) or almost inactive (Lys) variants that have a 1000-fold lower affinity for their DNA substrate. Asn-142, a residue within the R-box and conserved among the close MboI relatives but corresponding to Ser in EcoRII, SsoII, and PspGI, appears to be as important as its neighboring Arg residues; intriguingly, it cannot be substituted by Ser. In contrast, Asp-146, a few amino acids C-terminal of the R-box, and a residue that is not conserved among the MboI relatives, can be replaced by Lys, the corresponding residue in EcoRII, SsoII, and PspGI; the D146K variant has wild type binding affinity and cleavage activity. Ser-149, which aligns with Glu in the close MboI relatives as well as EcoRII, SsoII, and PspGI, can be substituted by Glu without impairment of the DNA binding affinity, but with a 100-fold reduction in DNA cleavage activity.
MboI has an unusual PD... (D/E)XK motif, in which Pro is replaced by Phe and Lys by Asn. The FD... EXN sequence is, however, fully conserved among the MboI relatives. To demonstrate that this sequence is indeed representing the catalytic center, Asp-186, Glu-199, and Asn-201 were substituted by Ala: the resulting variants are catalytically inactive, but show unaltered DNA binding affinity, which is not unusual for Ala substitutions of the PD... (D/E)XK residues of Type II restriction enzymes (compare with the "Discussion"). Interestingly, the N201K variant is catalytically inactive and impaired in DNA binding. Replacing a nearby Tyr (Tyr-203) by Ala leads to a variant with similar properties to the N201K variant.
Based on the alignment with NgoMIV, which uses Asp-193 and Arg-194 for recognition of the central 4 bp of its 6-bp recognition sequence, we have replaced in MboI the corresponding residues Ser-208 and Lys-209, which are conserved among the close relatives of MboI, as well as three adjacent residues, which are partially conserved among the close MboI relatives and have functional groups that could be used for base recognition (e.g. Asn-211, Glu-212, and Arg-215). The S208A and K209A variants are catalytically inactive and show a strong reduction in DNA binding affinity, as expected for residues involved in base recognition. Intriguingly, the K209R variant is not compromised in DNA binding, but is almost inactive in DNA cleavage, suggesting that Lys-209 is required for DNA binding (a function that can be taken over by Arg) and coupling of recognition to catalysis (a function that requires a precise interaction, which cannot be taken over by Arg). The N211A variant has near wild-type properties, whereas the E212Q as well as the R215A and R215K variants bind to DNA with wild-type affinity but are almost inactive in DNA cleavage.
Photocross-linking of MboI with 5-IdU-substituted OligodeoxyribonucleotidesA photocross-linking approach utilizing 5-iododeoxyuridine (5-IdU)-substituted oligodeoxyribonucleotides was used to determine whether the DNA target site GATC is in close contact to amino acid residues of MboI that are suggested by the alignment with EcoRII, SsoII, PspGI, and NgoMIV (Fig. 1) to be involved in DNA binding and cleavage. For this purpose, six double-stranded eicosadeoxyribonucleotides with a central GATC site and monosubstituted in one strand with 5-IdU were synthesized. The substitutions were localized in the recognition site and the two adjacent positions. These oligodeoxyribonucleotides were tested for cleavage by MboI. It was found that oligodeoxyribonucleotides with 5-IdU substitutions at the positions G, A, and C of the GATC recognition sequence cannot be cleaved by MboI but were all demonstrated to bind to MboI in the presence of Ca2+ (41). Results of the photocross-linking reactions are shown in Fig. 5A. Only the oligodeoxyribonucleotide that in one strand had the 5-IdU substitution at the adenine in the GATC recognition site showed a detectable cross-link on a silver-stained gel. A time course of the photocross-linking reaction with a dodecadeoxyribonucleotide with a GATC site, in which in one strand the alanine was substituted by 5-IdU, indicated that
3060 min of irradiation under the given conditions was sufficient to obtain the maximum yield of 15% (data not shown).
|
|
To facilitate experimental data interpretation in the sequence-structure-function context, we constructed a theoretical model of the MboI catalytic domain (residues 137280) in complex with target DNA. The homology model of the monomer is based on the fold recognition analysis (see "Experimental Procedures"). The mutual orientation of the two MboI monomers as well as the coordinates of the DNA molecule and the Mg2+ ions was derived from superposition onto subunits A and B in the NgoMIV co-crystal structure. The GATC site was generated by "mutating" bases in the original NgoMIV target (CCGG) and optimizing their geometry using HyperChem 7.1 (Hypercube, Inc.). The final model of the MboI dimer-GATC sequence complex (available from genesilico.pl/iamb/models/R.MboI/R.MboI.model.pdb) was obtained after removing steric clashes between residues from both monomers and the DNA by selecting different rotamers of the respective side chains (Figs. 7 and 8).
|
|
| DISCUSSION |
|---|
|
|
|---|
-helix and a loop (
-class (52)) and in general produce 5'-staggered ends, whereas enzymes of the EcoRV branch (BglI, EcoRV, HincII, NaeI, and PvuII) usually approach the DNA from the minor groove side (17), use a
-strand and a
-like turn for DNA recognition (
-class (52)), and in general produce blunt or 3'-staggered ends. Therefore, the two branches were termed "
" and "
," respectively (53). Structural variations within the EcoRI (
) and EcoRV (
) branch were used for a cladistic analysis of the members of the two branches using methods based on protein structure rather than amino acid sequence (54). This analysis allows inferences regarding evolutionary distances and construction of an evolutionary tree of Type II restriction enzymes of known three-dimensional structure (21).
In the absence of detailed structural information, predictions regarding phylogenetic relationships among restriction endonucleases can be made on the basis of amino acid sequence similarities. However, with few exceptions concerning isoschizomers, restriction enzymes do not show sufficient sequence similarities to infer phylogenetic relationships without additional structural and/or functional information. Based on sequence alignments, we previously proposed that SsoII, PspGI, and EcoRII belong to the EcoRI branch and are structurally related to Bse634I, Cfr10I, and NgoMIV (23, 25). This hypothesis was supported by a detailed experimental analysis and homology modeling, which demonstrated amino acid residues of SsoII and PspGI that align with functionally important residues in Bse634I, Cfr10I, and NgoMIV to be indeed essential for DNA binding and catalysis by SsoII and PspGI. Our suggestion to include EcoRII in this comparison has been validated by the recently published crystal structure analysis of EcoRII (33). These enzymes recognize similar sequences (e.g. SsoII
CCNGG, EcoRII
CCWGG, and NgoMIV G
CCGGC) but belong to different subtypes (Type IIP, IIE, and IIF, respectively). This has been interpreted as the enzymes having diverged early in evolution, presumably from a Type IIP enzyme that recognized
CCXGG or X
CCGGX. It is intriguing to note that the sequence similarity over an interrupted stretch of
70 amino acid residues can be extended to enzymes such as MboI (
GATC) that recognize unrelated sequences (25). Showing that MboI uses secondary structure elements composed of these
70 amino acid residues for DNA binding and cleavage would be the first example of an evolutionary relationship between two subfamilies of restriction endonucleases with widely differing specificities within the EcoRI branch.
In the present study, we followed the approach we developed to demonstrate an evolutionary relationship between SsoII, PspGI, and EcoRII on one side and Bse634I, Cfr10I, and NgoMIV on the other (23, 25). This approach consists of detailed biochemical analyses of the restriction enzyme, mutational analyses of amino acids assumed to be of functional importance, identification of amino acids involved in DNA binding by cross-linking, and homology modeling of the protein-DNA complex. Our results show that MboI is a typical Type II restriction enzyme with a divalent cation requirement, pH, and ionic strength dependence similar to that of PspGI (25). Like PspGI and SsoII, but unlike EcoRII (Type IIE) and Bse634I, and Cfr10I and NgoMIV (Type IIF), it is a Type IIP enzyme, because it interacts with DNA as a homodimer and does not require binding to a second recognition site for efficient DNA cleavage.
The mutational analysis shows that amino acid residues from the R-box (Table I), well conserved among the close MboI relatives, among them DpnII (Fig. 1), are essential for DNA binding and cleavage. The corresponding alanine variants of MboI are catalytically inactive and bind to DNA by approximately three orders of magnitude less than the wild type enzyme. A similar result was obtained with SsoII and PspGI, suggesting that the arginine residues of the R-box play an important role for these enzymes (23, 25). This is emphasized by the finding that also the lysine for arginine substitution of Arg-140 and Arg-143 in MboI is deleterious for DNA binding and cleavage. The amino acids of the R-box are not conserved among Bse634I, Cfr10I, and NgoMIV, but rather are replaced by hydrophilic residues. The co-crystal structure of an NgoMIV-DNA product complex shows that amino acids corresponding to R-box residues are involved in contacts to the region at the 3'-end of the recognition sequence, e.g. Gln-63 (which aligns with Asn-142 in MboI) is involved in a hydrogen bond to the N4 of the outer cytosine (underlined) of the G
CCGGC recognition sequence. This suggests that the R-box residues of MboI could be involved in contacts to the 3'-end and/or the flanking region of the
GATC recognition sequence (which is shorter than the G
CCGGC recognition sequence of NgoMIV).
Six residues C-terminal of the R-box is a glutamic acid that is conserved among the restriction enzymes under consideration in this study; in MboI, however, it is replaced by a serine (Fig. 1). In NgoMIV, this glutamic acid (Glu-70) is within <0.4 nm of Asp-140, the first carboxylate of the variant PD... (D/E)XK motif. The equivalent residue in MboI is Ser-149. It cannot be substituted by glutamic acid without interfering with catalysis, presumably because the active site of MboI has been fine-tuned in the framework of the entire structure. The catalytic motif in MboI is a variation of the classic PD... (D/E)XK motif (PDX4655KX13E (55)), inasmuch as the conserved lysine is substituted by asparagine. Notably, it is another variation than that found in EcoRII, SsoII, PspGI, NgoMIV, Cfr10I, and Bse634I. The mutational analysis clearly shows that Asp-186, Glu-199, and Asn-201 are essential for catalysis by MboI; furthermore, Asn-201 cannot be replaced by lysine, although this replacement would have reconstituted the classic PD... (D/E)XK motif. Replacements of the active site carboxylates by alanine do not affect the affinity of the MboI variants for their DNA substrate. This is in line with results obtained for other restriction enzymes (56, 57).
In NgoMIV, base-specific contacts to the two inner base pairs are made by amino acids 26 residues downstream from the catalytic lysine (Lys-187), among them Thr-189, Asp-193, and Arg-194; equivalent to Tyr-203, Ser-208, and Lys-209, respectively, in MboI. Replacing these residues by alanine led to inactive mutants, which in addition show a greatly decreased affinity for their DNA substrate, as observed for the R-box variants and as expected for residues involved in recognition. The replacement of Lys-209 by arginine did not compromise DNA binding but is deleterious for DNA cleavage activity. This indicates that the arginine for lysine substitution interferes with the coupling of recognition and catalysis.
Asn-197 in NgoMIV makes a backbone contact to the 5'-phosphate after cleavage of the scissile bond. The corresponding residue in MboI is Glu-212; its replacement by glutamine (as in EcoRII and Cfr10I) renders the variant almost inactive, without affecting the affinity for its substrate. A similar effect is observed for the R215A variant. Based on homology with NgoMIV, it is assumed that this region of MboI is involved in backbone interactions, presumably at the 5'-end of the
GATC recognition site or in buttressing interactions for the catalytic amino acid residues. Although the results of the mutational analysis strongly support our conclusions regarding the involvement of the stretch of
70 amino acids in DNA binding and cleavage by MboI, one would like to have direct evidence for this region being in contact with DNA. Our photocross-link experiments show that Lys-209 makes a zero-length cross-link to the second base of the
GATC recognition sequence.
Taken together, these results allowed us to model the structure of MboI. Fig. 7 shows a secondary structure prediction for the catalytic domain of MboI in comparison with the secondary structure of EcoRII and NgoMIV, deduced from the crystal structures (31, 33). The active site residues are indicated for all three enzymes, whereas the residues involved in DNA binding are only indicated for MboI (based on the mutational analysis) and NgoMIV (based on the co-crystal structure). Fig. 8 shows the stretch of
70 amino acids of the alignment (Fig. 1) projected onto the co-crystal structure of NgoMIV and the structural model of MboI (with the DNA adapted from the NgoMIV-product complex: TGCGGATCCGC). The two
-helices and two
-strands that make up the active site of these enzymes are indicated. As shown in the MboI model, Lys-209 (corresponding to Arg-194 in NgoMIV) is suitably positioned to cross-link with the second base of the MboI
GATC recognition sequence (reminiscent of NgoMIV Arg-194, which forms hydrogen bonds to the central CG base pairs of its G
CCGGC recognition sequence).
| CONCLUSIONS |
|---|
|
|
|---|
-helix and its preceding loop for DNA recognition. Because MboI has only a slight variation of the PD... (D/E)XK (K being substituted by N) motif, we infer that MboI, a Type IIP enzyme, resembles the common ancestor more than other enzymes that have a larger variation of the catalytic motif. In these enzymes, the second carboxylate of the catalytic motif, usually found on the
-strand preceding the recognition helix, is recruited from this helix. Two isoschizomers of MboI, namely Sau3AI and MutH (a nuclease involved in mismatch repair, but related to Type II restriction enzymes), in contrast to MboI belong to the EcoRV branch; they are related to each other but not to MboI. Furthermore, whereas MutH is a monomer that nicks DNA, Sau3AI is a monomer that dimerizes on the DNA and makes a double strand cut, activated by binding to a second recognition site (like a Type IIE enzyme) (40, 58). MboI and Sau3AI/MutH are therefore examples of independent invention of the same functionality (specificity toward GATC), i.e. a bona fide case of functional convergence of two proteins that shared a common ancestor, but underwent extreme divergence. Finally, it appears from our study that the
... 

unit that characterizes the stretch of
70 amino acids of sequence similarity has been used in a subset of enzymes of the EcoRI branch to evolve widely different specificities. It will be interesting to observe whether restriction endonucleases recognizing G
GYRCC, such as BanI, and GG
CC, such as FnuDI, which share sequence similarity with MboI (25, 59), indeed belong to the EcoRI branch of restriction endonucleases and to determine which elements they use for specific sequence recognition.
|
| FOOTNOTES |
|---|
This report is dedicated to Prof. Dr. G. Maass on the occasion of his 70th birthday. ![]()

An EMBO/Howard Hughes Medical Institute Young Investigator and a Recipient of the fellowship for young scientists from the Foundation for Polish Science. ![]()
To whom correspondence should be addressed: Tel.: 49-641-99-35402; Fax.: 49-641-99-35409; E-mail: vera.pingoud{at}chemie.bio.uni-giessen.de.
1 The abbreviations used are: 5-IdU, 5-iododeoxyuridine; MES, 4-morpholineethanesulfonic acid; IMAC, immobilized metal-affinity chromatography; HF, hydrogen fluoride; MALDI, matrix-assisted laser desorption ionization; MS, mass spectrometry; TOF, time-of-flight. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|