![]()
|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 4, 1943-1955, January 27, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
From the Department of Microbiology, School of Medicine, University of Washington, Seattle, Washington 98195
Received for publication, September 26, 2005 , and in revised form, November 11, 2005.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
RNase H contributes to reverse transcription in three distinct ways (reviewed in Refs. 1, 7, and 8). First, RNase H carries out degradation of the RNA genome both during and after minus-strand DNA synthesis to facilitate plus-strand DNA synthesis and strand transfers. Second, RNase H creates the polypurine tract (PPT) primer from the viral genome. And third, RNase H removes the PPT and tRNA primers used to prime plus-strand and minus-stand DNA synthesis, respectively. Given the vital roles of RNase H in retroviral replication, it is important to understand how RNase H recognizes the RNA/DNA hybrids created during reverse transcription as substrates for cleavage. Extensive studies have shown that both generation of the PPT primer and removal of the PPT and tRNA primers require specific sequences (919). By contrast, fewer studies have examined general degradation by RNase H (2023), and less is known about the determinants that might influence RNase H specificity for this type of degradation during retroviral replication.
RNase H has both polymerization-dependent and polymerization-independent modes of cleavage. The polymerization-dependent mode accompanies RNA-dependent DNA synthesis, and the nascent DNA 3' primer terminus positions RNase H cleavages. Such cleavages occur 1720 nucleotides away on the RNA template strand but are not strictly coupled to DNA synthesis (20, 2426). This mode of cleavage likely functions at pause sites during minus-strand synthesis, when the 3' terminus of the nascent DNA is recessed on the genomic template. However, the polymerization rate of reverse transcriptase is faster than the rate of RNase H cleavage (27), and the polymerization-dependent mode of RNase H cleavage does not completely degrade the template RNA (20, 28). Thus significant amounts of RNA can remain annealed to the minus-strand DNA (22), and removal of these fragments by the polymerization-independent RNase H activity is likely necessary for efficient plus-strand DNA synthesis.
Polymerization-independent RNase H cleavages have been attributed to at least three distinct mechanisms, which differ in how reverse transcriptase associates with the hybrid substrate (reviewed in Refs. 1, 7, and 8). First, reverse transcriptase can bind the RNA strand of a hybrid without positioning by either a DNA 3'-end or an RNA 5'-end, resulting in internal cleavages by the RNase H (23). Second, the polymerase domain can bind a recessed DNA 3' primer terminus to facilitate DNA 3'-end-directed cleavages that occur
1720 nucleotides back on the RNA strand of a hybrid, similar to what happens at pause sites during polymerization. Third, the polymerase domain can associate with a recessed RNA 5'-end, and RNase H can carry out 5'-end-directed cleavages that occur
1520 nucleotides downstream on the RNA strand. While the distance from the recessed RNA 5'-end has been characterized as the primary determinant for this mechanism (29), these cleavages have been observed as close as 1215 nucleotides (30, 31) and as far as 21 nucleotides (29, 32) from the RNA 5'-end. Notably, this range of distances is much broader than the more fixed distance of 1719 nucleotides that separates the polymerase and RNase H active sites as determined by crystallography studies or footprinting of reverse transcriptase with substrates (3336).
A limited number of studies has suggested the possibility that sequence might influence 5'-end-directed cleavages (24, 29, 30), but this hypothesis has not been directly tested before. We recently reported that sequence specificity is an important determinant of internal RNase H cleavages and that an RNA 5'-end at a nick does not promote 5'-end-directed RNase H cleavages (23). These results prompted us to ask three fundamental questions concerning RNA 5'-end-directed cleavages by retroviral RNases H. First, does sequence influence 5'-end-directed cleavages? Second, what distances from the RNA 5'-end are acceptable for 5'-end-directed cleavage sites? Third, how large of a gap is required between the 3' terminus of an upstream RNA and the 5'-end of a downstream RNA to allow efficient 5'-end-directed cleavages in the downstream RNA? Our findings offer new insights into the mechanisms of RNA 5'-end-directed cleavages for M-MuLV and HIV-1 reverse transcriptases, and the role of RNase H in retroviral replication.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Preparation of RNAsPreparation of RNAs Md1-Md10, PPT20, PPT62, and NPPT has been described previously (23). To prepare R46 RNA, a 46-mer DNA oligonucleotide (5'-CAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTCGAGACG-3') was annealed to a 54-mer DNA oligonucleotide (5'-AATTCGTCTCGAAGCTAGCTTGCCAAACCTACAGGTGGGGTCTTTCATTGAGCT-3'), and this duplex was introduced into EcoRI- and SacI-linearized pGEM9Zf() (Promega). The resulting plasmid was linearized with BsmBI and transcribed in vitro as described (23). All other RNAs were prepared by in vitro transcription of linearized plasmids as follows. To make 35-mer R7Z1, pGEM7Zf(+) (Promega) was linearized with XbaI. To make 28-mer R9Z1, pGEM9Zf() was linearized with XbaI. To make 32-mer RpKS1, pBluescript KSII(+) (Stratagene) was linearized with NotI. To make 30-mer R3Z1, pGEM3Zf() (Promega) was linearized with BamHI. Because RNA R3Z1 is identical in sequence to the 5'-end of a 41-nucleotide RNA that has been used extensively to map 5'-end-directed cleavages (29, 3843), we have used the cleavage sites for RNA R3Z1 as mapped in our own experiments.
To generate additional RNAs with different sequences, some pGEM plasmids were digested with single-cutting restriction enzymes downstream of the T7 promoter (as indicated below), the overhangs were filled in or removed by T4 DNA polymerase treatment, the blunt ends were ligated using T4 DNA ligase, and the resulting plasmids were transformed into XL1-Blue cells (Stratagene). The sequences of these plasmids were confirmed by DNA sequencing. For T4 DNA polymerase treatment, a digested plasmid was incubated in a 20-µl reaction containing 1 mM dNTPs and 3 units of T4 DNA polymerase for 5 min at 37 °C and then for 15 min at 12 °C. To make 25-mer R3Z2, pGEM3Zf() was digested by EcoRI and BamHI, and the resulting plasmid was linearized with HindIII prior to in vitro transcription. To make 27-mer R7Z2, pGEM7Zf(+) was digested by ApaI, treated with T4 DNA polymerase, ligated, digested with AatII, treated with T4 DNA polymerase, and ligated, and then the resulting plasmid was linearized with XbaI. To make 32-mer R7Z3, pGEM7Zf(+) was digested by ApaI and XbaI, and the resulting plasmid was linearized with Acc65I. To make 30-mer R7Z4, pGEM7Zf(+) was digested by ApaI and EcoRI, and the resulting plasmid was linearized with BstBI. To make 30-mer R7Z5, pGEM7Zf(+) was digested by ApaI and SmaI, and the resulting plasmid was linearized with HindIII.
5'-End Labeling, 3'-End Labeling, and 5'-End Phosphorylation5'-End labeling or 3'-end labeling of RNAs were carried out as described previously (23, 37). Because cleavage by RNase H produces a 3'-hydroxyl group and a 5'-phosphate group, an unlabeled phosphate group was added to the 5'-ends of 3'-end-labeled RNAs as previously described (23).
Preparation of Hybrid SubstratesTo prepare the substrate without upstream RNA (No Upstream substrate), RNAs were annealed to the appropriate template DNAs at an RNA:DNA molar ratio of 1:2, and to prepare the Nick or Gap substrates, upstream PPT20 RNA was additionally included at a 3-fold molar excess over the first RNA. Annealings were carried out in 10 mM Tris-HCl, pH 8.0, and 200 mM KCl at 90 °C for 3 min followed by cooling to room temperature.
For the No Upstream substrates, the RNA 5'-end was recessed by 22 nucleotides from the DNA 3'-end and the RNA 3'-end extended two bases beyond the DNA 5'-end. Hybrids containing RNAs Md1, Md2, Md4, Md6, Md7, Md9, and Md10 were annealed to DNA strands D49, D49
1, D49
3, D49
5, D49
6, D49
8, and D49
9, respectively (23). Hybrids containing RNAs R3Z1, R3Z2, R7Z1, R7Z2, R7Z3, R7Z4, R7Z5, R9Z1, R46, and RpKS2 were similarly positioned on DNA strands containing the appropriate complementary sequences. RNA NPPT was annealed to D52N (23), and PPT62 was annealed to D+27temp4 (23).
For the Nick substrate containing upstream PPT20 and downstream Md1, the template strand was D49 (5'-CCAAACCTACAGGTGGGGTCTTTCATTT[]CCCCCCTTTTTCTGGAGACTAA-3'). The brackets in the sequence of D49 indicate the position that nucleotides were added to create the template strands for the corresponding Gap substrates, which are D49(1b) (5'...[T]...3'), D49(2b) (5'...[GT]...3'), D49(3b) (5'... [AGT]... 3'), D49(4b) (5'... [CAGT]... 3'), and D49(5b) (5'... [ACAGT]...3').
For the Nick substrate containing upstream PPT20 and downstream Md10, the template strand was D49
9 (5'-GCTAGCTTGCCAAACCTACAGGTGGGG[]CCCCCCTTTTTCTGGAGACTAA-3'). The brackets in the sequence of D49
9 indicate the position that nucleotides were added to create the template strands for the corresponding Gap substrates, which are D49
9(1b) (5'... [T]... 3'), D49
9(2b) (5'... [GT]... 3'), D49
9(3b) (5'... [AGT]... 3'), D49
9(4b) (5'... [CAGT]...3'), and D49
9(5b) (5'... [ACAGT]...3').
Cleavage Analysis of Hybrid SubstratesThree units (0.60.8 pmol) of M-MuLV reverse transcriptase or 0.5 unit (0.2 pmol) of HIV-1 reverse transcriptase were incubated with 0.2 pmol (10 nM final concentration) of a hybrid substrate in 20-µl reactions containing 50 mM Tris-HCl, pH 8.0, 50 mM KCl, 6 mM MgCl2, and 5 mM dithiothreitol at 37 °C for the times indicated. Aliquots were added to formamide stop mix (95% formamide, 20 mM EDTA) and electrophoresed in denaturing 20% polyacrylamide gels (National Diagnostics). Products were visualized by PhosphorImager analysis (Amersham Biosciences).
Statistical AnalysisFor each position of the aligned sequences, the chi-square one-dimensional test was used to determine the deviation from the random distribution of bases (44). The expected frequency for each base was determined by summing the nucleotide frequencies of the first 25 nucleotides from the 5'-end of each RNA used for HIV-1 (Fig. 5) and for M-MuLV (Fig. 6). For HIV-1, the expected frequencies were A = 0.233, C = 0.243, G = 0.319, and U = 0.205. For M-MuLV, the expected frequencies were A = 0.220, C = 0.251, G = 0.326, and U = 0.203.
Quantitative Analysis of RNase H Cleavage ProductsBand intensities of cleavage products were determined using ImageQuanT (Amersham Biosciences). To compare the amount of product resulting from cleavage at the same sequence in different RNAs (for example, counting from the RNA 5'-end, site F is cleaved in Md1 between the 16th and 17th nucleotides, and in Md2 between the 15th and 16th nucleotides), the area integration function was used to quantitate the relative abundance of individual bands present in each sample. This analysis normalized the amount of cleavage product for a specific site as a percentage of the total cleavage products for each hybrid substrate. Quantitations using data from the 15-s and 1-min time points were found to be essentially identical, and the results from 1-min time points are presented graphically in Fig. 4.
|
| RESULTS |
|---|
|
|
|---|
For RNase H cleavage assays, each RNA was 5'- or 3'-end-labeled and annealed to a template DNA to generate a hybrid with a recessed RNA 5'-end. Hybrid substrates were incubated with HIV-1 or M-MuLV reverse transcriptase in time-course assays; the products representing the initial 5'-end-directed cleavage events were found in the 15-s and 1-min time points when most of the initial substrate remained uncleaved. These initial cleavage sites were mapped by comparing the mobilities of cleavage products to size ladders generated with nuclease P1 (data not shown). Depending upon whether an RNA was 5'- or 3'-end-labeled, the longest products represented the cleavages that occur the furthest from or the closest to the RNA 5'-end, respectively. Importantly, the location and extent of cleavage for each site were determined by using the data obtained for both 5'- and 3'-end-labeled RNAs. To facilitate comparison of the same sites in different substrates, the cleavage sites generating the observed products have been named A through I, as indicated in the relevant figures and as described previously (23).
When substrates containing 5'-end-labeled RNAs Md1 through Md10 were treated with HIV-1 reverse transcriptase, it was immediately apparent that the cleavage products were not of uniform size (Fig. 2A). Instead, the products had varied lengths indicating that cleavages occurred at different distances from the RNA 5'-ends. The expected products resulting from cleavage at the same sites in the different substrates (labeled "EI") are connected by lines in Fig. 2. Substrate Md1 was predominantly cleaved at sites E, F, and G (Fig. 2A, lanes 15). Counting from the RNA 5'-end, site E is between nucleotides 13 and 14, site F is between nucleotides 16 and 17, and site G, which represented the most distal 5'-end-directed cleavage site in Md1, falls between nucleotides 19 and 20. Cleavage of substrate Md2 was very similar to that of substrate Md1 except that all of the cleavage products were one nucleotide smaller (lanes 610). Importantly, site G remained the most distal 5'-end-directed cleavage, even though this site was now located between the 18th and 19th nucleotides from the Md2 RNA 5'-end, and, unlike Md1, no cleavage occurred between the 19th and 20th nucleotides. Although site H was not cleaved in Md1 or Md2, this site was detectably cleaved between the 19th and 20th nucleotides in substrate Md4 (lanes 1115) and significantly cleaved between the 17th and 18th nucleotides in substrate Md6 (lanes 1620). No cleavage was observed at site I in substrates Md1 through Md7 (lanes 125), but this site was cleaved when located between the 19th and 20th nucleotides in substrate Md9 and between the 18th and 19th nucleotides in substrate Md10 (lanes 2535).
Assays with 3'-end-labeled RNAs and HIV-1 reverse transcriptase were used to confirm that the 5'-end-directed cleavages sites closest to the RNA 5'-ends in the above analysis represented the initial cleavage products and were not the result of secondary cleavages (Fig. 2B). In general, the locations were the same as for the 5'-end-labeled substrates, but the relative abundance of products sometimes varied because strong cleavage sites close to the labeled end diminished the detection of more distal sites. Site E was cleaved well in substrate Md1 at the earliest time points but was only weakly cleaved in subsequent substrates (lanes 120). Site F was a strong cleavage site when located as close as between the 13th and 14th nucleotides from the RNA 5'-end (Md1Md4, lanes 115) but was not cleaved efficiently when located closer than between the 11th and 12th nucleotides (Md6Md10, lanes 1535). Site G became the strongest cleavage site close to the RNA 5'-end when located between the 14th and 15th nucleotides in substrate Md6 (lanes 1620) and between the 13th and 14th nucleotides in substrate Md7 (lanes 2125). However, this site was not cleaved when moved two bases closer to the RNA 5'-end in substrate Md9 (lanes 2630). In substrates Md9 and Md10, the cleavage site recognized closest to the RNA 5'-end was site H (lanes 2630). Site I was not cleaved until located between the 19th and 20th nucleotides in Md9 or the 18th and 19th nucleotides in Md10 (lanes 2635).
|
|
The short fragment resulting from cleavage between the 8th and 9th nucleotides at site D observed with the 5'-end-labeled substrate (Fig. 3A, lane 2) could have been a secondary cleavage product. However, at least some of this product was generated independent of other cleavages, because the corresponding cleavage product was also observed using the 3'-end-labeled substrate Md1 (Fig. 3B, lanes 15). Limited but detectable independent cleavages also occurred between the 8th and 9th nucleotides of two other substrates, at site E in substrate Md6 and at site F in substrate Md9 (lanes 1620 and 2630, respectively). The relevance of these 5'-proximal cleavages to 5'-end-directed cleavage is considered under "Discussion."
|
First, the extent of cleavage at each site was evaluated in all substrates. For HIV-1 RNase H, cleavage at site F appeared to have a bell-shaped pattern, with the highest cleavage occurring with substrate Md4 (Fig. 4A). The pattern was similar for site G, but cleavage was most abundant in substrates Md6 and Md7. The pattern for cleavage of site H was also similar, with the highest cleavage in substrates Md9 and Md10. For M-MuLV RNase H, the amount of cleavage at each site also appeared as non-overlapping, bell-shaped patterns distributed in substrates Md1 through Md10 (Fig. 4B). Cleavage of site F was maximal in substrate Md2, cleavage of site G was highest in substrates Md4 and Md6, and cleavage of site H was greatest in substrates Md6 and Md7. Interestingly, the greatest amount of cleavage for a particular site occurred in different substrates for M-MuLV and HIV-1 RNases H. For example, site F was cleaved to the greatest extent in substrate Md2 for M-MuLV RNase H and in substrate Md4 for HIV-1 RNase H (Fig. 4AB).
Next, we determined how the extent of cleavage at sites F, G, and H was affected by the distance from the RNA 5'-end. For HIV-1 RNase H (Fig. 4C), the plots of distance versus cleavage at each site overlapped, indicating that the overall effects of distance on the extent of cleavage were very similar. A plot of the equivalent data for M-MuLV RNase H also showed that distance from the RNA 5'-end similarly influenced the extent of cleavage at sites F, G, and H (Fig. 4D).
Preferred Nucleotides Flank RNA 5'-End-directed Cleavage SitesWe next asked if preferred nucleotides could be identified near 5'-end-directed cleavage sites, because we recently showed that preferred nucleotides are found near internal cleavage sites recognized by the M-MuLV and HIV-1 RNases H (23). This analysis required the mapping of 5'-end-directed cleavage sites on a variety of different hybrid substrates containing RNAs with recessed 5'-ends. Three RNAs derived from viral sequences were used: PPT62, containing sequence from the M-MuLV genome (PPT62 (23)), and RNAs Md1 and Md10. To generate additional sequence diversity, several RNAs with unique sequences were made using in vitro transcription plasmids (see "Experimental Procedures"). Finally, for HIV-1 reverse transcriptase, we have included 5'-end-directed cleavage sites reported previously by other investigators in studies where the exact positions of these sites were mapped (21, 25, 29, 3843, 45).
To generate hybrid substrates with recessed 5'-RNA ends, RNAs were 5'- or 3'-end-labeled and annealed to DNA templates. These substrates were used in RNase H cleavage assays as shown above with HIV-1 or M-MuLV reverse transcriptase, and cleavage sites were mapped to the nucleotide level (data not shown). The relative extent of cleavage at the various sites in each substrate was classified as strong, medium, or weak. As an example using substrate containing RNA Md1 and HIV-1 reverse transcriptase (Fig. 2, A and B, lanes 15), sites E and F were classified as strong, and site G was classified as medium. Using substrate Md1 with M-MuLV reverse transcriptase (Fig. 3, A and B, lanes 15), site E was classified as medium and site F was classified as strong. For both M-MuLV and HIV-1, cleavages between sites E and F represent weak sites. In our analysis, only strong and medium sites were considered. For the mapped cleavage sites observed in prior studies with HIV-1 (21, 25, 29, 3843, 45), all of the identified RNase H cleavage sites were used.
To compare the sequences surrounding RNA 5'-end-directed cleavage sites, the cleavage was defined to occur between nucleotides 1 and +1, and the flanking sequences were aligned. These cleavage sites are presented for HIV-1 in Fig. 5 and for M-MuLV in Fig. 6. To statistically determine whether any base preferences correlated with the cleavage sites, the nucleotides from positions 10 to +4 were tabulated. The frequency of bases in the first 25 nucleotides of sequence beginning from each RNA 5'-end was calculated to determine the expected distribution of nucleotides. For a given cleavage site, the significance of any deviations from random nucleotide frequencies was determined by comparing the base distribution at each position with the expected distribution using the chi-square method ("Experimental Procedures"). The resulting chi-square values were plotted against the nucleotide positions (Fig. 7).
For both HIV-1 and M-MuLV, two nucleotide positions had p values <0.01 (
2 values > 11.34) and were considered strong deviations from random. Position +1 showed a strong preference for A or U, tolerated C, and strongly disfavored G. Position 2 had a strong preference for G or C and especially disfavored U. In addition, the 4 position for HIV-1 also had a p value <0.01 and disfavored A. Unlike the nucleotide preferences seen for internal cleavage sites (23), strong preferences for sequences further upstream were not observed (see "Discussion").
Accessibility to a 5'-End Affects RNA 5'-End-directed CleavagesWe recently showed that the 5'-end of an RNA at a nick does not allow 5'-end-directed cleavages (23). To determine the gap size required for 5'-end recognition, the RNase H cleavage pattern was compared using substrates in which the same RNA was placed at different distances downstream from PPT20, an RNase H-resistant RNA containing nucleotides 20 to 1 of the M-MuLV PPT sequence (Fig. 1 (37)). The distances between PPT20 and the downstream RNAs were either a nick or a gap of 15 bases. For M-MuLV reverse transcriptase, 5'-end-labeled Md1 was used as the downstream RNA. In the No Upstream substrate, cleavages occurred predominantly at sites D, E, and F in Md1 (Fig. 8, lanes 15 and 3640). Addition of upstream PPT20 to create a nick reduced 5'-end-directed cleavages at these sites and promoted internal cleavages at sites closer to the nick, such as sites A and B (lanes 610). This cleavage pattern was identical to that observed using a continuous RNA containing the same sequence but lacking the nick (23); together these data first suggested that a nick was insufficient to direct 5'-end-directed cleavages. Introduction of a 1- or 2-base gap between the upstream and downstream RNAs decreased cleavage at the A and B sites and increased cleavage at the E and F sites (Fig. 8, compare lanes 1120 with lanes 610). The cleavage pattern with a 3-base gap most closely matched the No Upstream substrate (compare lanes 2125 with 15 and 3640). Interestingly, the highest extent of 5'-end-directed cleavages was observed in substrates with a gap size of 4 or 5 bases (lanes 2635). Experiments performed using 3'-end-labeled Md1 confirmed these results (data not shown).
To test how a gap influences 5'-end-directed cleavages by HIV-1 reverse transcriptase, 5'-end-labeled Md10 was used as a downstream RNA (Fig. 9). In the No Upstream substrate, the initial cleavages in Md10 occurred at sites H and I (lanes 15 and 3640). Addition of upstream PPT20 decreased cleavage at sites H and I and increased internal cleavages at site E among others (lanes 610). For substrate with a 1-base gap, cleavages closer to the RNA 5'-end such as site E decreased while cleavages at sites H and I slightly increased (lanes 1115). In the 2-base and 3-base gap substrates, RNase H cleavages were most similar to those in substrate lacking upstream RNA (compare lanes 1625 with lanes 15 and 3640). Experiments performed with 3'-end-labeled Md10 and HIV-1 reverse transcriptase generated comparable results (data not shown).
RNase H assays were also carried out with M-MuLV or HIV-1 reverse transcriptase and substrates containing downstream Md10 or Md1, respectively (date not shown). In both cases and similar to the data shown in Figs. 8 and 9, the pattern of RNase H cleavages in substrates with a 2- or 3-base gap most clearly matched the No Upstream substrate.
| DISCUSSION |
|---|
|
|
|---|
Previous work has shown that reverse transcriptase does not efficiently utilize the RNA 5'-end at a nick to position 5'-end-directed cleavages (23). In this study, we addressed how much space between the 3'- and 5'-ends of RNA is required to render the RNA 5'-end accessible for directing cleavage in the downstream RNA. By increasing the distance between an upstream RNA 3'-end and a downstream RNA 5'-end in single nucleotide increments, we found that gaps of 2 or 3 bases permit 5'-end-directed cleavages at a level comparable to that observed for a recessed, free RNA 5'-end. It seems likely that the binding of reverse transcriptase to a recessed RNA 5'-end in a hybrid involves recognition of the discontinuity in the substrate structure at the junction between the single strand and the duplex. A similar discontinuity is offered by a typical primer-template, where the recessed end is a DNA 3' terminus rather than an RNA 5'-end. Thus, perhaps the primer-template binding cleft and in particular the primer grip region (34) of the polymerase domain positions the recessed RNA 5'-end of a hybrid for 5'-end-directed cleavage by recognizing some of the same structural features presented by a primer terminus. In substrates containing a nick or a 1-base gap, the RNA 5'-end is obscured and RNase H cleavages are limited to the internal model of cleavage. A substrate with a gap of 2 or more bases offers a sufficient distance between the upstream and downstream RNAs to allow the access and recognition required for the 5'-end-directed mode of cleavage and, additionally, may offer contact sites in the upstream RNA that facilitate enzyme binding. During general degradation of the genome, it seems likely that gaps are important to promote 5'-end-directed cleavages.
|
|
|
18 nucleotides from the RNA 5'-end and are followed by independent, secondary cleavages that are 8 to 9 nucleotides from the RNA 5'-end and occur at a slower rate. Although our findings demonstrate that sequence is an important determinant of 5'-end-directed cleavages, it remains to be investigated how sequence influences secondary cleavages. By mapping multiple 5'-end-directed cleavage sites in hybrids containing RNAs with different sequences, we observed a statistically significant bias for specific nucleotides at positions flanking these sites. For both HIV-1 and M-MuLV RNases H, A or C, but not G, was preferred at position +1, and C or G, but not U, was preferred at position 2. In addition, HIV-1 RNase H disfavored A at position 4. Precisely how the preferred nucleotides facilitate RNase H cleavage is unknown, but it has been proposed that the structures associated with some hybrid sequences might affect cleavage specificity (35). Several features influenced by sequence are the base composition, the trajectory of the helical axis, and the width of the major and minor grooves (35, 47, 48). Although it is possible that the preferred nucleotides flanking a cleavage site reflect interactions in the DNA strand instead of or in addition to the RNA strand, the co-crystal structure of HIV-1 reverse transcriptase and a PPT-containing RNA/DNA hybrid reveals potential hydrogen-bonding contacts between the +1 RNA base and Arg-448, and between the 2 RNA base and Gln-475 (35) that might contribute to the observed sequence preferences. Our observation that an A is disfavored at the 4 position does not correspond to any base contacts in the co-crystal structure, but there are phosphate contacts from 4 to 9 in the DNA strand and the RNase H primer grip region of the HIV-1 enzyme (35). Notably, the 4 position in the co-crystal structure falls within an unusual structural deformation consisting of a region of unpaired bases that may be PPT-specific and consequently may not reflect the binding situation for hybrids containing other sequences. Importantly, our data suggest that hybrid substrates containing the preferred nucleotides at the most critical positions of 2 and +1 may facilitate the generation of co-crystals of reverse transcriptase with bound substrate in future crystallographic analyses.
Very recently, the first co-crystal structure of a prokaryotic RNase H with the hybrid substrate positioned in the enzyme active site has been reported (49). The authors note that four residues (Asn-77, Asn-106, Gln-134, and Asn-105) donate hydrogen bonds to bases close to the scissile phosphate, and one residue (Arg-195) has a sequence-specific contact with the DNA strand at the +5 G residue. This latter residue is homologous to Arg-557 in HIV-1 reverse transcriptase, but thus far our sequence analyses of internal and 5'-end-directed cleavage sites have not suggested a preference for C in the RNA strand at the +5 position (Ref. 23 and data not shown).
Both the 5'-end-directed and internal modes of RNase H cleavage prefer similar nucleotides at positions +1 and 2 for M-MuLV and positions +1, 2, and 4 for HIV-1 (this work and Ref. 23). Internal cleavages also exhibit nucleotide preferences further upstream from the cleavage site (at positions 6 and 11 for M-MuLV and positions 7, 12, and 14 for HIV-1 (23), but no equivalent positions were identified from the statistical analyses of 5'-end-directed cleavage sites. The absence of obvious nucleotide preferences further upstream of 5'-end-directed cleavage sites may derive from the nature of the upstream sequences found in many of our hybrid substrates, where the first 810 nucleotides of RNA sequence are dictated by the in vitro transcription vectors. Although these sequences may not offer the optimal nucleotides that would otherwise be preferred for 5'-end-directed cleavage, the specificity determinant contributed by binding the RNA 5'-end may substitute or compensate for additional upstream preferences. This consideration suggests that preferred nucleotide positions located most proximal to the cleavage site are the most important in determining the positioning for both modes of RNase H cleavage.
As would be predicted from the overlap between the preferred nucleotide specificity of 5'-end-directed and internal cleavage sites, it appears that the 5'-end-directed sites are a subset of the possible internal sites that are only cleaved when they meet certain distance constraints. In support of this prediction, all of the 5'-end-directed cleavage sites we have observed thus far are also observed as internal cleavage sites on longer RNA/DNA hybrids that do not require a nucleic acid end for positioning (Ref. 23 and data not shown). For example, the predominant 5'-end-directed sites in substrates Md1 through Md10 are also recognized as internal cleavage sites in longer RNAs containing the sequence +1 through +29 beyond the M-MuLV PPT origin (Figs. 1 and 2) (23). As indicated in Fig. 11A, only some of the possible internal cleavage sites in the +1 to +29 sequence (indicated by arrows above the upper sequence) are recognized as 5'-end-directed sites in substrates Md1 and Md10 (indicated by vertical dashed lines). Importantly, the internal sites located close to the 5'- or 3'-ends of substrates Md1 or Md10 are not cleaved efficiently by the RNA 5'-end-directed mechanism because of the restrictions imposed by the distance from the RNA 5'-end.
|
|
|
|
5% of all bands, cleavages were optimal when the sites were located between the 13th and 18th nucleotides for HIV-1, and between the 13th and 17th nucleotides for M-MuLV in these substrates. Because these experiments used substrates with labeled RNA 5'-ends, these data were biased somewhat against the more distal cleavages in the cleavage window. This cleavage window model accounts for the range of distances observed in RNA 5'-end-directed cleavages assays. Although cleavage sites are often 1519 nucleotides from the RNA 5'-end (for example, see Refs. 25, 27, 29, and 5053), cleavages from 10 to 21 nucleotides from the RNA 5'-end have been reported in this study and others (2932). These cleavage distances are broader than the 17 or 18 nucleotides that separate the polymerase and RNase H active sites in HIV-1 reverse transcriptase, based upon co-crystal structures of the enzyme with duplex DNA or RNA/DNA hybrids in which a DNA 3'-end fits in the polymerase active site (3335). It is possible that contacts between the polymerase domain and a DNA 3'-primer terminus more tightly anchor the RNase H domain on the substrate, whereas an RNA 5'-end allows more flexibility in positioning. This would predict that the cleavage window for DNA 3'-end-directed cleavages is smaller than that for RNA 5'-end-directed cleavages, and experiments are underway to test this possibility. It is also possible that the co-crystal structures reflect the distance between the active sites when the enzyme initially binds the substrate, and that, in the case of RNA 5'-end-directed cleavages, the enzyme can slide on the substrate after the initial contacts with the RNA 5'-end are released or adjusted.
Secondary RNase H cleavages have been proposed to occur independently of primary, 5'-end-directed cleavages and result from an initial binding and more extensive sliding of reverse transcriptase on its substrate (3840). For M-MuLV, independent cleavages in this report were observed as close as between the 8th and 9th nucleotides from the RNA 5'-end (for example, site D in Md1). At this time, we cannot conclude whether these cleavages result from a secondary cleavage or a 5'-end-directed mechanism. It may be that the definition of 5'-end-directed versus secondary cleavages is difficult to distinguish at sites closer to the RNA 5'-end, and more experiments are required to determine the nature of these cleavages.
After minus-strand synthesis, extensive degradation of the RNA genome is required to facilitate plus-strand synthesis and strand transfers, and much of this general degradation appears to proceed by the polymerization-independent mode of RNase H activity (20, 22, 25, 28). The remaining RNA template that requires further degradation will likely contain nicks generated during polymerization of the minus-strand. Because the RNA 5'-end at a nick does not efficiently promote 5'-end-directed RNase H cleavages, this form of RNase H activity would be restricted to initially act at gaps, but whether gaps of sufficient size are generated during minus-strand synthesis remains to be determined. Internal cleavages do not require positioning by an RNA or DNA end, so at least initially the polymerization-independent activity of RNase H can carry out internal cleavages. If recognition sites are located closely together and cleaved by the internal mode, this would create gaps that are sufficient for 5'-end-directed cleavages. Because 5'-end-directed cleavages are kinetically favored over internal cleavages (54), the multitude of possible internal cleavage sites combined with robust 5'-end-directed cleavages would together facilitate rapid and thorough degradation of the retroviral genome.
| FOOTNOTES |
|---|
1 To whom correspondence should be addressed: Dept. of Microbiology, Box 357242, School of Medicine, University of Washington, Seattle, WA 98195-7242. Tel.: 206-543-8574; Fax: 206-543-8297; E-mail: champoux{at}u.washington.edu.
2 The abbreviations used are: HIV-1, human immunodeficiency virus, type 1; M-MuLV, Moloney murine leukemia virus; PPT, polypurine tract; ribonuclease H, RNase H. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|