If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
To whom correspondence may be addressed: Dept. of Biochemistry, 868A Robinson Research Bldg., 2200 Pierce Ave., Nashville, TN 37232-0146. Tel.: 615-343-8070; Fax: 615-322-7122
To whom correspondence may be addressed: Dept. of Biochemistry, Vanderbilt University School of Medicine, 638B Robinson Research Bldg., 2200 Pierce Ave., Nashville, TN 37232-0146. Tel.: 615-322-2261; Fax: 615-322-4349
The most common lesion in DNA is an abasic site resulting from glycolytic cleavage of a base. In a number of cellular studies, abasic sites preferentially code for dATP insertion (the “A rule”). In some cases frameshifts are also common. X-ray structures with abasic sites in oligonucleotides have been reported for several microbial and human DNA polymerases (pols), e.g. Dpo4, RB69, KlenTaq, yeast pol ι, human (h) pol ι, and human pol β. We reported previously that hpol η is a major pol involved in abasic site bypass (Choi, J.-Y., Lim, S., Kim, E. J., Jo, A., and Guengerich, F. P. (2010 J. Mol. Biol. 404, 34–44). hpol η inserted all four dNTPs in steady-state and pre-steady-state assays, preferentially inserting A and G. In LC-MS analysis of primer-template pairs, A and G were inserted but little C or T was inserted. Frameshifts were observed when an appropriate pyrimidine was positioned 5′ to the abasic site in the template. In x-ray structures of hpol η with a non-hydrolyzable analog of dATP or dGTP opposite an abasic site, H-bonding was observed between the phosphate 5′ to the abasic site and water H-bonded to N1 and N6 of A and N1 and O6 of G nucleoside triphosphate analogs, offering an explanation for what appears to be a “purine rule.” A structure was also obtained for an A inserted and bonded in the primer opposite the abasic site, but it did not pair with a 5′ T in the template. We conclude that hpol η, a major copying enzyme with abasic sites, follows a purine rule, which can also lead to frameshifts. The phenomenon can be explained with H-bonds.
Coding properties of poly(deoxycytidylic acid) templates containing uracil or apyrimidinic sites: in vitro modulation of mutagenesis by deoxyribonucleic acid repair enzymes.
). However, not all DNA polymerases stringently follow the A rule. Human (h) pol β and Sulfolobus solfataricus Dpo4 have been reported to prefer to produce −1 frameshift deletions to a greater extent than A insertion (
Abasic translesion synthesis by DNA polymerase β violates the “A-rule.” Novel types of nucleotide incorporation by human DNA polymerase β at an abasic lesion in different sequence contexts.
), but this would not explain why different pols vary. A number of x-ray structures of pols with oligonucleotides with AP sites have been published, including S. solfataricus Dpo4 (
), which is consistent with slow rates of incorporation.
One of the issues is which pol(s) is most relevant in considering the bypass of AP sites. Although hpol η has been reported to be inefficient in the bypass of such sites (
The relative roles in vivo of Saccharomyces cerevisiae pol η, pol ζ, Rev1 protein and pol32 in the bypass and mutation induction of an abasic site, T-T (6-4) photoadduct and T-T cis-syn cyclobutane dimer.
) reported that hpol η was highly miscoding and that the majority of bypass events were insertions of A.
We analyzed hpol η in terms of its miscoding properties, in light of the evidence that it may be an important factor in mutagenesis at AP sites. We found a proclivity of hpol η to insert both purines, A and G, opposite AP sites, which is reminiscent of an earlier reported behavior of hpol β (
). A structural basis was identified using x-ray crystallography, i.e. water-mediated hydrogen bonding between the 5′-phosphate of the AP site and the Watson-Crick face of purines. When an appropriate pyrimidine is present in the template 5′ to the AP position, extensive −1 frameshifts occur.
EXPERIMENTAL PROCEDURES
Materials
Deoxyribonucleoside triphosphates (dNTPs), T4 polynucleotide kinase, and uracil DNA glycosylase were purchased from New England Biolabs (Ipswich, MA). A mixture of four dNTPs was purchased from Invitrogen. All nonhydrolyzable dNMPNPPs were obtained from Jena Bioscience (Jena, Germany). [γ-32P]ATP (specific activity 3000 Ci/mmol) was purchased from PerkinElmer Life Sciences. Biospin 6 columns were purchased from Bio-Rad. All unmodified oligonucleotides (HPLC purification) were obtained from Integrated DNA Technologies (Coralville, IA). Oligonucleotides containing an AP site or an AP analog (tetrahydrofuran (THF)) were synthesized by TriLink Biotechnologies (San Diego). The catalytic core (amino acids 1–432) of hpol η was expressed in Escherichia coli and purified as described previously (
). The oligonucleotides used in this study are listed in Table 1. Prior to kinetic assays, time course experiments were run to optimize reaction conditions. All polymerase reactions were carried out in 50 mm Tris-HCl buffer (pH 7.5) containing 5 mm MgCl2, 10 mm dithiothreitol (DTT), 100 mm KCl, 5% glycerol (v/v), and 100 μg/ml bovine serum albumin (BSA). The 5′-6-carboxyfluorescein-labeled primer-template (18-/23-mer) duplex (5 μm) was extended using 4–40 nm concentrations of hpol η in the presence of various concentrations of a single dNTP (0–1 mm) at 37 °C for 5–20 min. Reactions were terminated with a quench solution containing 20 mm EDTA, 95% formamide (v/v), bromphenol blue, and xylene cyanol. Products were separated on 18% (w/v) polyacrylamide gels containing 7.5 m urea. Gels were scanned by a Typhoon Scanner (GE Healthcare) and analyzed by fluorescence intensity using ImageJ software (National Institutes of Health). The values of kcat and Km were determined using GraphPad Prism (La Jolla, CA).
Rapid quench experiments were performed using a model RQF-3 KinTek Quench flow apparatus (KinTek, Austin, TX). The 18-mer primer was 5′-end-labeled using [γ-32P]ATP and T4 polynucleotide kinase and annealed to a 23-mer template. Reactions were initiated by rapidly mixing 32P-labeled primer-template/polymerase mixtures with an equal volume of dNTP-Mg2+ complex at 37 °C. The final concentrations of the reactants were as follows: 25 nm hpol η, 50 nm32P-labeled primer-template complex, 0.5 mm dNTP. Other reaction conditions were the same as described for steady-state kinetic assays. Reactions were quenched with 0.5 m EDTA at reaction times varying from 5 ms to 5 s. Products were separated using 18% (w/v) polyacrylamide gels. Bands on the gels were visualized using a phosphorimaging system (Bio-Rad, Molecular Imager FX) and Quantity One software as previously described (
DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4: analysis and crystal structures of multiple base pair substitution and frameshift products with the adduct 1,N2-ethenoguanine.
). Pre-steady-state points were fit (GraphPad Prism) to burst Equation 1,
(Eq. 1)
where A indicates the burst amplitude, kp is the first order rate, and kss is steady-state rate of nucleotide incorporation.
LC-MS/MS Analysis of Full-length Extended Products
A 5′-6-carboxyfluorescein-labeled primer containing a deoxyuridine (U) residue, 5′-CGG GCT CGT AAG CGT CUT-3′, was used to generate shorter chains after the cleavage by uracil DNA glycosylase and piperidine treatment (
DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4: analysis and crystal structures of multiple base pair substitution and frameshift products with the adduct 1,N2-ethenoguanine.
Replication past the N5-methyl-formamidopyrimidine lesion of deoxyguanosine by DNA polymerases and an improved procedure for sequence analysis of in vitro bypass products by mass spectrometry.
). Reaction conditions were similar to those used in steady-state kinetic assays except that the final concentrations were as follows: 3 μm hpol η and 25 μm primer-template duplex, in a total volume of 80 μl. The primer was extended in the presence of all four dNTPs (1 mm each) for 2–24 h at 37 °C. Reactions were terminated by removal of excess dNTP and Mg2+ using a spin column. The extent of reactions was monitored by electrophoresis prior to LC-MS/MS analysis. The reaction mixture was treated with 50 units of uracil DNA glycosylase and 0.25 m hot piperidine following a previous protocol (
DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4: analysis and crystal structures of multiple base pair substitution and frameshift products with the adduct 1,N2-ethenoguanine.
). The cleavage solution was lyophilized and reconstituted in 60 μl of H2O.
LC-MS/MS analysis was performed on an Acquity ultraperformance liquid chromatography (UPLC) system (Waters Associates) coupled to a Thermo Finnigan LTQ mass spectrometer (Thermo Scientific, San Jose, CA) operating in an ESI negative mode. Samples were separated on an Acquity UPLC BEH octadecylsilane (C18) column (1.7 μm, 2.1 mm × 100 mm) at a flow rate of 0.3 ml/min. The column temperature was maintained at 50 °C. Eluent A contained 10 mm NH4CH3CO2 in 98% H2O, 2% CH3CN (v/v), and eluent B consisted of 10 mm NH4CH3CO2 in 90% CH3CN, 10% H2O (v/v). A gradient program was run as follows: 0–3-min linear gradient from 0 to 3% B; 3–5-min linear gradient to 20% B; 5–6-min linear gradient to 100% B, held at 100% B for 2 min; 8–10 min from 100% B to 0% B, held at 0% B for 3 min. MS data were acquired using Xcalibur 2.1 software (Thermo). ESI settings were as follows: source voltage 4 kV; source current 100 A; capillary voltage −49 V; capillary temperature 350 °C; tube lens voltage −90 V. The most abundant species (−2 or −3 charged) were fragmented by collision-induced dissociation (CID) with a normalized collision energy of 35%. An activation Q setting of 0.25 and activation time of 30 ms were used. Oligonucleotide sequences were identified by comparing the observed CID spectra and theoretical spectra of candidate oligonucleotide sequences, which were calculated by software Mongo Oligo Calculator 2.0 from the University of Utah. The relative yields of various DNA extension products were based on their respective peak areas of the most abundant ions in extracted ion chromatograms (EICs). Some products with the same retention times and the same molecular weights were quantified by comparing the amounts of their specific CID fragments.
Crystallizations
Primer and template sequences employed in the crystallization experiments are listed in Table 1. DNA templates containing an AP site (THF-based mimic) were purchased from TriLink, and unmodified 8-mer primers were purchased from Integrated DNA Technologies. Template and primer strands were mixed in a 1:1 molar ratio and annealed in the presence of 10 mm sodium HEPES buffer (pH 8.0), 0.1 mm EDTA, and 50 mm NaCl by heating for 10 min at 85 °C followed by slow cooling to room temperature. Prior to crystallization, the DNA duplex was mixed with the protein in a 1.2:1 molar ratio in the presence of 50 mm Tris-HCl (pH 7.5) containing 450 mm KCl and 3 mm DTT. Following addition of 5 μl of 100 mm MgCl2 or CaCl2, the complex was concentrated to a final concentration of ∼2–3 mg/ml by ultrafiltration. Either non-hydrolyzable nucleoside triphosphates (dNMPNPP) or dCTP was then added to form the ternary complexes. Crystallization experiments were performed by the hanging drop vapor diffusion technique at 18 °C using a sparse matrix screen (Hampton Research, Aliso Viejo, CA) (
). One μl of the complex solution was mixed with 1 μl of reservoir solution and equilibrated against 500-μl reservoir wells. Crystals appeared in droplets containing 0.1 m sodium MES (pH 5.5), containing 5 mm MgCl2 or 5 mm CaCl2, and 24–28% (w/v) PEG 2000 monomethyl ether within 1 day and grew to their maximum size within a week.
X-ray Diffraction Data Collection, Structural Determination, and Refinement
Crystals were mounted in nylon loops, cryo-protected in reservoir solution containing 25% glycerol (v/v), and frozen in liquid nitrogen. Diffraction data were collected on the 21-ID-D or 21-ID-F beamline of the Life Sciences Collaborative Access Team (LS-CAT) at the Advanced Photon Source, Argonne National Laboratory (Argonne, IL). All data were integrated and scaled with the program HKL2000 (
), respectively. Selected crystal data, data collection, and refinement parameters are listed in Table 10 (see below). Illustrations were prepared with the program UCSF Chimera (
As a result of instability of natural AP sites, most crystal structure studies and kinetic assays have been carried out using a THF analog instead of true AP sites (Fig. 1) (
). In this study, experiments were performed using four DNA oligonucleotides containing the stable AP analog THF as well as two oligonucleotides containing a natural AP site (oligonucleotide sequences shown in Table 1). These oligonucleotides have similar sequences except for the neighboring bases at the 5′ side of AP sites. Natural AP sites and the THF analog demonstrated similar steady-state kinetic properties (Table 2). In this study, undamaged G was used as the control template base because G is the most commonly lost base due to AP site formation in cells (
). Steady-state kinetic results indicated that hpol η can insert one base opposite an AP site, although the bypass efficiency is much lower than that opposite a normal base.
FIGURE 1.Pre-steady-state kinetics of incorporation of individual dNTPs opposite G or an AP site.A, dATP incorporation opposite AP site; B, dGTP incorporation opposite AP site; C, dCTP incorporation opposite AP site; D, dTTP incorporation opposite AP site; E, dCTP incorporation opposite G. hpol η (25 nm)-catalyzed incorporation of dNTP (0.5 mm) was opposite 50 nm 18/23-mer template DNA (18-mer, 5′-CGG GCT CGT AAG CGT CAT-3′; 23-mer, 3′-GCC CGA GCA TTC GCA GTA XTA CT-5′, where X is either an AP site or G) at 37 °C. Reactions were quenched with 0.5 m EDTA at reaction times varying from 5 ms to 5 s. Products were analyzed by PAGE and phosphorimaging. All experiments were performed in duplicate. Data points are shown as means ± S.D. and were fit to the equation y = A(1 − e−kpt, where kp is the first order rate for the first catalytic cycle; kss is the steady-state rate, and A is the burst amplitude (GraphPad Prism).
Many previous studies have reported that nucleotide incorporation of DNA pols follows the A rule opposite AP sites. Some DNA pols also obey a 5′-frameshift rule (
). However, in this study, hpol η showed preferential incorporation of both dATP and dGTP in the different sequence contexts, with a dNTP selectivity of 0.68 to 1 for A and 0.59 to 1 for G (Table 2). With the templates containing an A or G 5′ of an AP site, the insertion of dATP and dGTP was still favored over dCTP and dTTP.
Pre-steady-state kinetics of dNTP incorporation opposite an AP site was also examined using a natural AP site-containing template with a neighboring base T (Table 3). Interestingly, the values of the burst rate kp and the amplitude indicated the preference of dATP insertion opposite the lesion. The higher catalytic efficiency of the first turnover when inserting dATP may be a result from the stabilization of the reaction when the incoming dATP is complementary to the downstream template base (T) of an AP site. Pre-steady-state kinetic analysis showed substoichiometric bursts, 7–28%, with damaged templates and 50% burst amplitude with a normal G-contained template. The similar results were also observed in our prior work on hpol η and 8-oxoG (
Explanation of pre-steady-state kinetics and decreased burst amplitude of HIV-1 reverse transcriptase at sites of modified DNA bases with an additional, nonproductive enzyme-DNA-nucleotide complex.
Although steady-state kinetic studies can gauge the efficiency of nucleotide incorporation catalyzed by a pol, LC-MS/MS methods we developed previously can reveal the ability of enzyme to read past and extend beyond a lesion (Fig. 2). LC-MS/MS experiments were performed following previous procedures (
DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4: analysis and crystal structures of multiple base pair substitution and frameshift products with the adduct 1,N2-ethenoguanine.
Replication past the N5-methyl-formamidopyrimidine lesion of deoxyguanosine by DNA polymerases and an improved procedure for sequence analysis of in vitro bypass products by mass spectrometry.
) using a uracil-containing primer. The sequences of extension products resulting from replication of six DNA templates with different downstream sequences and their yields were analyzed by LC-MS/MS (Tables 4–TABLE 6, TABLE 7, TABLE 8, TABLE 9). As shown in TABLE 4, TABLE 5, when templates contained a T 5′ of either an AP site or the THF analog, the most abundant products were −1 frameshift products, with yields of 68–81%, followed by 17–30% of the products containing G opposite the lesions. Minor products (<5%) corresponded to T incorporation opposite lesions. With templates containing a C 5′ of the AP (or THF) site, similar yields of −1 frameshift products were obtained (TABLE 6, TABLE 7), and A incorporation accounted for 26–32%. Only traces of products containing G or C opposite the lesion were obtained. When the T (or C) at the 5′ side of THF was replaced with an A (or G), preferential incorporation of A and G catalyzed by hpol η was still observed, but there were almost no frameshift products observed (TABLE 8, TABLE 9). These results indicate that the mechanism of AP site bypass follows a “purine rule” and is partially sequence-dependent. Frameshifts were favored when the incoming dNTP was complementary to the template base downstream of an abasic site.
FIGURE 2.LC-MS analysis of products of extension of primer opposite AP sites in six template contexts by hpol η in the presence of all four dNTPs.A, gel images of primer extension past AP sites by hpol η. B, EIC and CID spectrum for m/z 1106.7 corresponding to sequence 5′-pTATGATG-3′ extended opposite template 23-AP-T-mer. C, EIC and CID spectrum for m/z 954.6 corresponding to sequences 5′-pTGTGAA-3′ and 5′-pTAGTGA-3′ extended opposite template 23-AP-C-mer. D, EIC and CID spectrum for m/z 1106.7 corresponding to sequence 5′-pTATGATG-3′ extended opposite template 23-THF-T-mer. E, EIC and CID spectrum for m/z 954.6 corresponding to sequences 5′-pTGTGAA-3′ and 5′-pTAGTGA-3′ extended opposite template 23-THF-C-mer. F, EIC and CID spectrum for m/z 942.1 corresponding to sequence 5′-pTATTGA-3′ extended opposite template 23-THF-A-mer. G, EIC and CID spectrum for m/z 1091.2 corresponding to sequence 5′-pTACTGAA-3′ extended opposite template 23-THF-G-mer. Template sequences used in this experiment are presented in Table 1. The primer sequence was 5′-CGG GCT CGT AAG CGT CUT-3′. The sequences of which EICs and CID are shown here are the most abundant species generated from different templates. DNA polymerase reactions contained 3 μm hpol η, 25 μm DNA duplex, and 1 mm dNTPs and were incubated at 37 °C for 2–24 h. LC-MS/MS analysis was performed on an Waters Acquity UPLC system coupled to a Thermo Finnigan LTQ mass spectrometer operating in an ESI negative mode. Samples were separated on an Acquity UPLC BEH C18 column (1.7 μm, 2.1 mm × 100 mm) using a binary eluent (A, 10 mm NH4CH3CO2 in 98% H2O, 2% CH3CN (v/v), and B 10 mm NH4CH3CO2 in 90% CH3CN, 10%H2O (v/v)).
Crystal Structures of hpol η Tertiary Complexes at the Insertion Stage Opposite an AP Site
We determined structures of two hpol η complexes with either incoming dAMPNPP or dGMPNPP opposite the AP site (THF analog) at resolutions of 2.15 and 1.81 Å, respectively (Table 10 and Fig. 3, A and D). At the active site of both complexes, the base of the incoming nucleotide triphosphate is positioned opposite the AP residue (Fig. 3). In the complex with dAMPNPP, two water molecules mediate contacts between adenine (N1 and N6, Fig. 3, B and C) and the AP phosphate group in the major groove. In the minor groove, two more water molecules link N3(A) to O4′ of the 2′-deoxyribose from residue A5 and to the side chain of Gln-38 (Fig. 3C). In the complex with dGMPNPP, a single water bridges N1 and O6 of G and the AP phosphate (Fig. 3, E and F). The contact in the minor groove involving N3 to Gln-38 is direct, but the distance to O4′ of A5 is too long for H-bond formation (Fig. 3F). Projections roughly along the normal to the nucleobase of the incoming nucleotide triphosphate demonstrate that the base portions of purine nucleotides cannot just establish water-mediated interactions with a portion of the AP residue but also form more optimal stacking interactions with the adjacent nucleotide (A) from the template strand (Fig. 3, C and F). In comparison, the overlap between incoming A or incoming G and the thymine moiety from the 3′-terminal primer residue appears to be more limited. Superimposition of the active sites from the insertion complexes with dAMPNPP and dGMPNPP indicates that guanine has shifted toward the minor groove (Fig. 3, G and H). This shift goes along with an extended conformation of the Arg-61 side chain that interacts with the Hoogsteen edge (O6 and N7) of G (Fig. 3F). In the dAMPNPP complex, Arg-61 displays two alternative, curled conformations, whereby the guanidino moiety exhibits partial stacking on the nucleobase of the incoming nucleoside triphosphate in addition to interactions with the α- and β-phosphates (Fig. 3C).
FIGURE 3.Active site configurations in the ternary hpol η insertion-step complexes with either dAMPNPP or dGMPNPP opposite the AP site.A, quality of the final Fourier 2Fo − Fc sum electron density (1σ threshold) in the active site region of the complex with dAMPNPP. B, view into the major groove, and C, rotated by ∼90° around the horizontal axis and looking roughly along the normal to the nucleobase plane of the incoming dAMPNPP. The polymerase is shown as a schematic, and the DNA template (yellow)-primer (orange) duplex and selected hpol η side chains are shown in stick form (e.g. residues Gln-38 and Arg-61 from the finger domain; carbon atoms colored in gray). Carbon atoms of the AP residue and dAMPNPP are highlighted in green, and oxygen, nitrogen, and phosphorus atoms are colored in red, blue, and orange, respectively. Selected water molecules are cyan spheres; hydrogen bonds are thin solid lines, and Mg2+ ions are drawn as light green spheres. The side chain of Arg-61 displays two alternative conformations. D, quality of the final sum electron density for the complex with dGMPNPP and views of the active site from the major groove (E) and rotated by ∼90° around the horizontal and looking roughly normal to the nucleobase plane of the incoming dGMPNPP (F). Carbon atoms of the AP residue and dGMPNPP are highlighted in purple, and oxygen, nitrogen, and phosphorus atoms are colored in red, blue, and orange, respectively. Overlay of the active sites in the hpol η ternary complexes with either incoming dAMPNPP or dGMPNPP, viewed into the major groove (G), and rotated by ∼90° around the horizontal axis and looking roughly along the normal to the nucleobase plane of the incoming dNMPNPPs (H). Carbon atoms (G) of AP residue and dNMPNPP (the entire residues, H), water molecules, and Mg2+ ions as well as carbon atoms of amino acids Asn-38 and Arg-61 in the complexes with incoming A and G are highlighted in green and purple, respectively, and the template A and primer T of the adjacent base pair are colored in yellow and orange, respectively. The coordination of Mg2+ ions is indicated by thin solid lines in G. I, schematics of the bypass configuration encountered in crystal structures of complexes with incoming dATP or dGTP (left, implying insertion of the base opposite the AP site) and as established by LC-MS/MS (right, resulting in extensive frameshifting when the residue 5′ to AP is T (incoming dATP) or C (incoming dGTP)). See also Tables 4–TABLE 6, TABLE 7.
In both structures, T3 (which is located 5′ to the AP residue) is lodged outside the active site (Fig. 3, B and E). The crystal structure of the dGMPNPP complex is representative of the minor product seen in the LC-MS/MS analysis. Thus, it demonstrates that G is incorporated opposite the AP residue (TABLE 4, TABLE 5 and Fig. 3I, left panel) when the template nucleotide 5′ to AP is T (similar for A when the template nucleotide 5′ to AP is C (TABLE 6, TABLE 7)). However, the crystal structures only represent an intermediate stage in regard to the −1 frameshift products observed by LC-MS/MS, with A pairing with T 5′ to AP (or G pairing with C 5′ to AP; TABLE 4, TABLE 5, TABLE 6, TABLE 7, Fig. 3I, right panel). In order for the −1 frameshift products to form, T3 (or C in the case of incoming G) has to rotate around relative to the orientation seen in the two insertion complexes (Figs. 3, B and E, and 5) and swing into the active site to form a Watson-Crick base pair with dATP (or C with dGTP).
FIGURE 5.Conformational changes in the DNA template strand containing an AP site (THF analog) between the hpol η insertion and extension stages. Superimposition is shown of the hpol η insertion complex, with incoming dATP opposite the AP site (AP4, light blue carbon atoms and ribbon), and the hpol η extension complex, with incoming dCTP opposite G4, preceded by A opposite the AP site (AP5, beige carbon atoms and ribbon). Atoms of AP residues are shown in ball and stick mode; selected nucleotides and phosphorus atoms are labeled, and the latter are highlighted in purple in the insertion and in black in the extension complex. Water molecules are smaller spheres colored in cyan, except for two waters in green that mediate an interaction between dATP and the AP phosphate in the insertion complex. The superimposition reveals a significant movement of the T3 and AP4 residues into the major groove during insertion compared with the conformation of the template strand in the extension complex (orange arrow). Superimposition was performed using the MatchMaker option in UCSF Chimera.
Crystal Structure of an hpol η Extension Stage, Tertiary Complex Following an AP Site
We determined the structure of a post-insertion complex of hpol η with dA opposite the AP site (using the THF analog), followed by template G paired with incoming dCTP at 2.85 Å resolution (Table 4 and Fig. 4A). The configuration with A positioned opposite AP and an adjacent Watson-Crick G:C pair represents the major product of the in vitro bypass reaction of a template sequence 5′-G(AP)-3′ as established by LC-MS/MS (Table 9). Unlike the case of the insertion complexes, where the incoming purine base and AP are bridged by water molecules (Fig. 3, B, C, E, and F), the water molecule H-bonded to N1 of A in the extension complex is too far removed from the AP 5′- and 3′-phosphate groups to be able to link the moieties (Fig. 4, B and C). The looser spacing is a consequence of the orientation of the template guanine 5′ to the AP site (G4, Fig. 4B) that is accommodated inside the active site, resulting in the phosphates of both G4 and AP being pushed to the periphery of the template-primer duplex (Fig. 4C). With G4 providing a stacking platform, the next residue, T3, is also directed into the active site (Fig. 4B), rather than being swung outwards as in the case of the insertion complexes, where the AP residue is unable to engage in stacking (Figs. 3, B and E, and 5). The structural data at the extension stage together with the in vitro bypass assays as analyzed by LC-MS/MS (TABLE 8, TABLE 9) confirm the proclivity by hpol η for insertion of A and G opposite an AP site. A is preferred over G, independent of whether the template nucleotide 5′ to AP is A (Table 8) or G (Table 9). By comparison, −1 frameshifting is negligible (5%, Table 9), apparently because incoming dCTP aligning with G 5′ to AP is no match for purines positioned opposite AP and stabilized by a network of water molecules (Fig. 3, B and E).
FIGURE 4.Active site configuration in the ternary hpol η extension-step complex with primer dA opposite the AP site followed by dCTP opposite template dG.A, quality of the final Fourier 2Fo − Fc sum electron density (1σ threshold) in the active site region. B, view into the major groove; C, rotated by ∼90° around the horizontal axis and looking roughly along the normal to the nucleobase plane of the incoming dCTP. The polymerase is shown as a schematic, and the DNA template (yellow)-primer (orange) duplex and selected hpol η side chains are shown in stick form (e.g. residues Gln-38 and Arg-61 from the finger domain; carbon atoms colored in gray). Carbon atoms of the AP residue and dA are highlighted in green; carbon atoms of the following dG/dCTP pair are highlighted in brown, and oxygen, nitrogen, and phosphorus atoms are colored in red, blue, and orange, respectively. Selected water molecules are cyan spheres, hydrogen bonds are thin solid lines, and a Ca2+ ion is drawn as a light green sphere.
An investigation of translesion synthesis across AP sites (THF analog) catalyzed by human B-family pols α and δ and the human Y-family pols η, ι, κ, and Rev1 revealed various abilities of the individual pols to deal with this ubiquitous lesion (
). Both hpol α and δ/PCNA favored insertion of A opposite the AP site, but hpol α was severely blocked after insertion, whereas δ/PCNA were proficient at insertion and extension. Among Y-family pols, pol η was the most efficient in terms of both insertion and next-base extension. hpol κ extended from a nucleotide inserted opposite AP (C or A), but it was much less efficient in doing so than either hpols η or δ/PCNA. Rev1 preferentially inserted C opposite AP, but extension was blocked, similar to hpol ι, which was found to insert T, G, and A when encountering an AP site but then stopped (
). Overall, the results of this study were consistent with hpol η alone possessing the ability to efficiently bypass an AP site. To further characterize hpol η function and activity, we used steady-state and pre-steady-state kinetic tools in combination with LC-MS/MS assays to investigate hpol η bypass synthesis opposite AP sites. To better understand the structural basis for the observed preferences by hpol η, we determined crystal structures of ternary hpol η-DNA-dNTP (dNMPNPP analogs) complexes with AP-containing template strands (THF analogs), trapped either at the insertion or extension stages.
The kinetic analysis, both at the steady-state and pre-steady-state levels, attests to the preferred insertion by hpol η of purine nucleoside triphosphates opposite an AP site. This preference is maintained irrespective of whether the template strand contains a real AP site or the THF analog. A slight preference for A insertion in the pre-steady-state assays (Table 3) is likely a consequence of the sequence context in that a T 5′ to the AP site may result in a −1 frameshift and pairing of the incoming dATP with template T instead of opposite the AP residue. This interpretation is supported by the LC-MS/MS data demonstrating that templates with either a 5′-TpAP-3′ or a 5′-CpAP-3′ step result in −1 frameshifting, formation of a T:A or C:G pair, respectively, and an orphaned AP residue. Although the proclivity by hpol η to insert A opposite an AP site is slightly pronounced relative to G, this polymerase does not obey the so-called A rule (
Coding properties of poly(deoxycytidylic acid) templates containing uracil or apyrimidinic sites: in vitro modulation of mutagenesis by deoxyribonucleic acid repair enzymes.
). Instead, its bypass preference opposite AP sites is perhaps more appropriately termed a purine rule. With the in vitro bypass experiments augmented by LC-MS/MS analysis providing a clear picture as to the outcome of hpol η synthesis past an AP site, we turned to structural investigations to gain insight into the underlying basis for the preferred insertion of A and G.
Crystal structures of hpol η ternary complexes trapped at the insertion stage reveal that dATP and dGTP are positioned opposite the AP site (THF moiety) (Fig. 3). In both complexes, O3′ of the 3′-terminal T of the primer strand is optimally positioned for attack at the α-phosphate group of incoming nucleotide triphosphate (O3′…Pα distances of 3.32 Å and 3.21 Å in the dAMPNPP and dGMPNPP complexes, respectively, and nearly stretched O3′…(O)3P-O angles in both cases). The T upstream from the AP residue in the complexes is unstacked from the template strand (Figs. 3, B and E, and 5). The phosphate group linking T and the AP moiety resides in the major groove and is shifted by 4.5 Å relative to the position of the corresponding phosphate in the extension complex (5′-P of dG; Fig. 5, orange arrow). Unlike the template strand at that site in the insertion complexes, the primer-template duplex in the extension complex adopts a conformation that resembles a standard B-form geometry. Thus, the orientation of the AP 5′-phosphate in the insertion complexes is highly unusual and unlikely to be encountered in DNA duplexes devoid of mismatches or unpaired nucleotides. However, such orientations of phosphates are more commonly encountered in RNA, e.g. in loop regions (
), where the phosphate is linked to a purine base from the opposite strand by a water molecule. Indeed, the location of the 5′-phosphate of the AP residue in the major groove allows it to get into relatively close proximity of the Watson-Crick edge of the incoming nucleotide base. Thus, water molecules mediate interactions between adenine (N1 and N6) and guanine (N1 and O6) and the AP 5′-phosphate group (FIGURE 3., FIGURE 5.). These stabilizing contacts most likely form the underlying basis of the purine rule, because they cannot be established by the nucleobase portions of incoming pyrimidine nucleotide triphosphates. The water bridges between incoming purines and the AP phosphate group are probably more important in terms of the favorable incorporation of A and G than stacking interactions between purine moieties and the 3′-terminal primer nucleotides. However, stacking interactions between A and G of the incoming nucleotides and the purine (A) 3′-adjacent to the AP site (Fig. 3, C and F) may also contribute to the preference for insertion of purines opposite an AP residue by hpol η.
Water-mediated “pairing” between A or G and the AP phosphate, as seen in the two insertion complexes, is only partially representative of the ultimate outcome of AP bypass by hpol η. Thus, the result of bypass by this pol is dependent on the identity of the template nucleotide 5′-adjacent to the AP moiety. The state trapped in the crystal structures of insertion complexes does reflect to a large extent the outcome of bypass in the case of incoming dATP, with G, A, or C (or dGTP, with G, A, or T) 5′-adjacent to the AP site. Thus, A is accommodated opposite AP, and the next step might then involve e.g. insertion of dCTP paired in a Watson-Crick mode with template G (as seen in the extension complex, Fig. 4). However, as the LC-MS/MS data attest (Tables 4–7), the main product of the bypass reaction catalyzed by hpol η with incoming dATP opposite an AP site with 5′-adjacent template T (Fig. 3, B and C) is a −1 frameshift, which results in formation of an A:T pair and leaves the AP site unopposed by a nucleotide from the extended primer strand (similar to the case of incoming dGTP opposite an AP site with 5′-adjacent template C). Conformational changes leading to the frameshift and starting from the geometries of the primer-template duplexes trapped in the crystal structures of the insertion complexes entail the rotation out of the major groove of the AP residue and the 5′-adjacent T toward a more standard B-form backbone geometry as seen in the extension complex (see Fig. 5 for orientation).
TABLE 7LC-MS analysis of products of hpol η copying template 23-THF-C-mer (C 5′ of THF site)
Comparisons between the translesion synthesis activities of hpol η and the Dpo4 pol from S. solfataricus reveal similar preferences in some cases, although the mechanisms of bypass may deviate. For example, both pols replicate efficiently and quite accurately past the 8-oxoG lesion (C/A ratios of 19:1 and 4:1 for Dpo4 and hpol η, respectively) but use amino acids from different domains (Arg-332, Dpo4 little finger versus Arg-61, hpol η finger) to mediate correct bypass (
). Both pols efficiently catalyze replication of DNA past an AP lesion that results in frameshifting (−1 and +1 frameshifts with Dpo4 and −1 frameshifts with hpol η). However, crystal structures of Dpo4 complexes containing AP sites demonstrated that the AP residue is extrahelical, with the template nucleotide on the 5′ side of the lesion directing the incoming nucleoside triphosphate and thus leading to −1 frameshifts (
) (“5′ rule”). Unlike the case of the hpol η insertion complexes described here, the base pairs bracketing the bulged AP moiety remain stacked seamlessly. In another configuration of the Dpo4-DNA-dNTP complex, triggering +1 frameshifts, the AP residue is accommodated in the solvent-exposed minor groove. But none of the Dpo4 complexes exhibit the DNA geometry seen in the insertion complexes of hpol η, with water molecules mediating contacts between A or G and the AP 5′-phosphate group that gives rise to the purine rule. Therefore, pols sculpt lesioned DNA in various ways to guarantee efficient insertion and extension reactions that result in more or less error-free replication. In the case of hpol η, the crystal structure of the extension complex is consistent with efficient catalysis of the extension step in that the 3′-oxygen of the terminal primer nucleotide is positioned 3.9 Å from Pα of dCTP and with a near in-line orientation of O3′ and the scissile Pα–O(Pβ) bond. Overall, the structural and functional data presented here reveal a unique strategy by hpol η for efficiently bypassing AP sites (purine rule) that adds to the repertoire (e.g. A rule and 5′ rule) used by DNA pols to cope with this common lesion.
Acknowledgments
We are grateful to K. Trisler for assistance in the preparation of this manuscript. Vanderbilt University is a member institution of the Life Sciences Collaborative Access Team at Sector 21 of the Advanced Photon Source, Argonne, IL. Use of the Advanced Photon Source at Argonne National Laboratory was supported by the United States Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract DE-AC02-06CH11357.
REFERENCES
Friedberg E.C.
Walker G.C.
Siede W.
Wood R.D.
Schultz R.A.
Ellenberger T.
DNA Repair and Mutagenesis.
2nd Ed. American Society for Microbiology,
Washington, D. C2006
Coding properties of poly(deoxycytidylic acid) templates containing uracil or apyrimidinic sites: in vitro modulation of mutagenesis by deoxyribonucleic acid repair enzymes.
Abasic translesion synthesis by DNA polymerase β violates the “A-rule.” Novel types of nucleotide incorporation by human DNA polymerase β at an abasic lesion in different sequence contexts.
The relative roles in vivo of Saccharomyces cerevisiae pol η, pol ζ, Rev1 protein and pol32 in the bypass and mutation induction of an abasic site, T-T (6-4) photoadduct and T-T cis-syn cyclobutane dimer.
DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4: analysis and crystal structures of multiple base pair substitution and frameshift products with the adduct 1,N2-ethenoguanine.
Replication past the N5-methyl-formamidopyrimidine lesion of deoxyguanosine by DNA polymerases and an improved procedure for sequence analysis of in vitro bypass products by mass spectrometry.
Explanation of pre-steady-state kinetics and decreased burst amplitude of HIV-1 reverse transcriptase at sites of modified DNA bases with an additional, nonproductive enzyme-DNA-nucleotide complex.