Conformational Diversity Versus Nucleic Acid Triplex
Stability, a Combinatorial Study*
Eloy
Bernal-Méndez and
Christian J.
Leumann
From the Departement für Chemie und Biochemie,
Universität Bern, Freiestrasse 3, CH-3012 Bern, Switzerland
Received for publication, June 21, 2001
 |
ABSTRACT |
The stability of a triple helix
formed between a DNA duplex and an incoming oligonucleotide strand
strongly depends on the solvent conditions and on intrinsic chemical
and conformational factors. Attempts to increase triple helix stability
in the past included chemical modification of the backbone, sugar ring,
and bases in the third strand. However, the predictive power of such modifications is still rather poor. We therefore developed a method that allows for rapid screening of conformationally diverse third strand oligonucleotides for triplex stability in the parallel pairing
motif to a given DNA double helix sequence. Combinatorial libraries of
oligonucleotides of the requisite (fixed) base composition and length
that vary in their sugar unit (ribose or deoxyribose) at each position
were generated. After affinity chromatography against their
corresponding immobilized DNA target duplex, utilizing a temperature
gradient as the selection criterion, the oligonucleotides forming the
most stable triple helices were selected and characterized by
physicochemical methods. Thus, a series of oligonucleotides were
identified that allowed us to define basic rules for triple helix
stability in this conformationally diverse system. It was found that
ribocytidines in the third strand increase triplex stability relative
to deoxyribocytidines independently of the neighboring bases and
position along the strand. However, remarkable sequence-dependent differences in stability were
found for (deoxy)thymidines and uridines.
 |
INTRODUCTION |
Triple-stranded DNA and RNA structures were
first discovered in the late fifties by Felsenfeld and Rich (1, 2). In
1986, two independent research groups demonstrated that triplex-forming oligonucleotides (TFOs)1 can
be used to specifically recognize a given sequence in a DNA double
helix (3, 4), and thus paved the way for their potential use as
therapeutics in the antigene strategy and as tools in molecular biology. Polypyrimidine TFOs bind specifically to complementary poly(Pu)·poly(Py) double-helix sequences by formation of Hoogsteen base pairs between thymines or protonated cytosines in the TFO and
adenines or guanines, respectively, in the poly(Pu) strand of the DNA
double helix. The molecular recognition process is highly sensitive to
base mismatches, so that a single duplex site can be targeted within
megabase DNA (5-7).
It has been shown previously that triplexes not only form within the
pure DNA or RNA backbone context but also within mixed RNA and DNA
strands, although with distinct differences in stability within a given
sequence context (8). Recent analyses by NMR and FT-IR have shown that
D:DD and R:DD triple helices (where DD stands for the DNA double helix,
and D: or R: for the DNA or RNA TFO, respectively) have heterogeneous
backbone conformations, leading to energetically compromised
conformations for certain ribo- and deoxyribonucleotides in the three
strands (9-15). Thermodynamic studies have shown that R:DD triple
helices have higher thermal stability than D:DD ones (8, 15-18),
whereas similar free energies and equilibrium constants have been found
by isothermal titration calorimetry and EDTA cleavage (19, 20). Results
in our laboratory and others, with third strand sequences containing a
variable number of nucleotides with modified bases or backbones, show
thus far nonunderstood differences in affinity as a function of the target sequence and the position of the modified nucleosides in the
chain (21-23). Given these data, the question arises whether, upon
binding with its target double helix, a TFO containing an intrinsically
heterogeneous backbone conformation, allowing for a tailor-made
structural fit to the target, could increase the triplex stability
compared with a TFO with a homogeneous backbone.
With this in mind, we developed a general method based on a
combinatorial approach. The method is based on the synthesis of a
combinatorial library of TFOs containing either ribo- or
deoxyribonucleosides with the requisite base at each position in the
chain. Both types of nucleoside units intrinsically prefer different
sugar conformations (3'-endo for ribonucleosides and 2'-endo for
deoxyribonucleotides) and thus give rise to a large variety of backbone
conformations in the corresponding TFOs. After affinity chromatography
on the immobilized target double helix, utilizing a temperature
gradient as the selection criterion, and subsequent deconvolution of
the thus obtained fractions by chemical and analytical means, the molecular features of TFOs leading to enhanced triple helix stability were defined. Based on these features, single oligonucleotides were
designed, synthesized, and characterized for proof of principle. The
results obtained validate our combinatorial approach, showing distinct
effects of the nature of the sugar on triplex stability, depending on
the position within the TFO chain. This method lends itself for use in
the assay of oligonucleotides containing new, chemically modified
nucleoside analogues, as it does not rely on an enzymatic amplification
and in vivo deconvolution step typically used in DNA and RNA
selection protocols.
 |
MATERIALS AND METHODS |
Synthesis of Oligonucleotides and TFO Libraries--
Single
oligonucleotides were synthesized on an Amersham Pharmacia Biotech
Gene-Assembler special DNA synthesizer using standard
-cyanoethyl
phosphoramidite chemistry and the manufacturer's protocols (24). dU,
dT, dC, rU, rC, spacer-C18, and 5'-amino-modifier-C12 phosphoramidites
were from Glen Research. The TFOs were synthesized on a 1.0-µmol
scale. For the introduction of ribonucleotides, the coupling time was
prolonged to 6 min. After final detritylation, the oligomers were
removed from the solid support and deprotected with 25%
NH4OH:ethanol (3:1), at 55 °C overnight. No ethanol was added for the deprotection of all-DNA oligonucleotides. After filtration and drying, deprotection of the 2'-OH groups, where necessary, was achieved with 1 M tetrabutyl ammonium
fluoride in tetrahydrofuran (24 h at room temperature; Refs. 25
and 26). The TFOs were purified by IE-HPLC followed by IP-RP-HPLC under the conditions indicated below, and characterized by IE-HPLC and IP-RP-HPLC, ESI-MS, UV-visible spectrophotometry, and gel
electrophoresis (PAGE). The hairpin DNA double helix (dh) was
synthesized on a 10-µmol scale. The dh was deprotected, purified, and
characterized as indicated above for the TFOs. Names and composition
are as indicated in the corresponding figures. The triple helices are named as the corresponding TFO, with the addendum th. For the synthesis
of the TFO library M, the "split and combine" technique was used.
First, two 10-µmol columns filled with dC-CPG phosphoramidite solid
support were placed in the DNA-synthesizer, one dC residue was coupled
to one of them, and one rC residue to the other. At the end of the
coupling cycle (before detritylation), the columns were removed and the
resins mixed together and separated again in two equal parts. New
cycles were performed in the same way, always adding a ribonucleotide
in one column and the corresponding deoxyribonucleotide in the other
(rC and dC, or rU and dT, respectively). Removal of the solid support
and nucleotide deprotection were achieved as indicated above. Trityl-on
purification was performed with the aim to facilitate the separation of
all the members of the library from the non-tritylated, truncated
sequences. The crude, tritylated product was purified by IP-RP-HPLC,
detritylated with 80% acetic acid (30 min, room temperature),
lyophilized, and redissolved in deionized water. Thus, a library of
211 oligonucleotides was obtained.
Affinity Chromatography--
The hairpin DNA double helix
containing the 5'-aminolinker unit (dh-NH2) was coupled to
the gel matrix (NHS-activated Sepharose 4 Fast Flow, Amersham Pharmacia
Biotech) in 0.1 M NaCl, 0.1 M NaHCO3 buffer, pH 8, for 2 h at room temperature. The
derivatized matrix (10 ml) was then packed into a chromatographic
column (XK16, Amersham Pharmacia Biotech), and the remaining
non-reacted functional groups of the matrix were deactivated with
ethanolamine. The column was attached to a HPLC system
(ÄKTAbasic, Amersham Pharmacia Biotech). The total amount of dh
immobilized into the column was estimated to be 1 µmol. The column
was stored at 4 °C, using 20% ethanol as storage solution. The
affinity chromatography experiments were performed in the following
way. 0.42 µmol of the TFO library M in 1 ml of elution buffer (0.1 M NaCl, 1 mM EDTA, 10 mM
citrate/phosphate, pH 6.0) were loaded onto the affinity chromatography
(AC) column and allowed to hybridize for 1 h at 5 °C. The
fraction of the library containing the non-binders and the failure
sequences was then eluted with 15-20 ml of elution buffer at 1 ml/min
and 5 °C. Then, the temperature was increased by 10 °C, the
system left to stabilize for 15 min, and the next fraction was eluted.
This cycle was repeated four times, up to 45 °C, after which the
column was washed with pH 7.0 buffer to ensure that all TFO were
eluted. The five recovered fractions were lyophilized, redissolved in water, and desalted through Sephadex G-25 resin (Amersham Pharmacia Biotech). The fractions were characterized in the same manner as the
other oligonucleotides.
Mass Spectrometry--
The samples were used directly after
IP-RP purification or desalted by dialysis for 1.5 h against
deionized water on a nitrocellulose filter (Millipore), then
concentrated to 50-100 pmol/µl, and analyzed on a VG Platform
electrospray mass spectrometer (Micromass).
Partial Alkaline Digestion--
The oligonucleotides, TFO
libraries, or fractions were 5' end-labeled with 32P using
[
-32P]ATP (Hartmann), T4 polynucleotide kinase
(Amersham Pharmacia Biotech), and standard protocols. After
lyophilization, they were dissolved in 50 mM
NaHCO3, 0.5 mM EDTA, pH 9.3 buffer and
incubated at 95 °C for 5 min. The pH was then lowered with 1 M HCl, and a new incubation performed (2 min at 37 °C)
in order to break remaining 2'-3'-cyclophosphate ends (27). The results
were analyzed by PAGE under denaturing conditions (24% polyacrylamide,
8 M urea). Reading and quantification of the bands was done
on a Molecular Dynamics PhosphorImager using ImageQuant software
(version 3.3) for data processing. Results are an average of at least
three independent experiments.
UV Melting Experiments--
UV-visible absorbance measurements
were performed using a Cary 3E UV-visible spectrophotometer (Varian).
The molar extinction coefficient of each oligonucleotide (or AC
fraction) was calculated using nearest neighbor approximation (28). 2 nmol of the corresponding oligonucleotides were lyophilized to dryness
and redissolved in 1 ml of a 0.1 M NaCl, 1 mM
EDTA, 10 mM citrate/phosphate buffer (pH 4.4-6.8). To
ensure the correct formation of the triple helix, the resulting
solution was heated to 85 °C and allowed to slowly cool down to room
temperature, followed by storage at 4 °C overnight. The pH of the
solution was verified using a microelectrode. Prior to thermal
experiments, the UV-visible spectra of the samples (210-350 nm) were
measured at around 1 °C. Absorbance versus temperature profiles were measured at two wavelengths (260 and 300 nm), using a
linear gradient of 0.5 °C/min and a heating-cooling-heating cycle
between 0 and 90 °C. Data were analyzed using Origin50 software.
HPLC--
Liquid chromatography experiments were done on an
ÄKTAbasic F System with UNICORN software (Amersham Pharmacia
Biotech), using a DEAE column (Nucleogen 60-7) for IE and a C18 column
(Nucleosil 100-5) for IP-RP chromatography, both from Macherey-Nagel.
Eluents were as follows. For IE, eluent was 10 mM
KH2PO4/K2HPO4, pH 6.5, 20% CH3CN, with 1 M KCl in eluent B. For
IP-RP, eluent was 10 mM triethylamineacetic acid, pH 7.5, in H2O for eluent A and in 80% CH3CN for
eluent B. The gradient was as indicated for each experiment.
 |
RESULTS |
The TFO base sequence chosen in this investigation (see Fig.
1A) was designed as to contain
a T-rich part, a part of equal distribution of T and C, and a C-rich
part, allowing for maximum sequence variability in the TFO and in the
corresponding target double helix. Its length was chosen in order to
stay within the limits of its detectability by UV melting measurements
(see Tm values in Fig. 1A), and to
comply with the permissive temperature range of the affinity
chromatography matrix.

View larger version (41K):
[in this window]
[in a new window]
|
Fig. 1.
A, sequences and characterization of the
double helix coupled to the amino linker, the TFO library M, and the d
and r11d control TFOs. B, flow chart of the selection
process.
|
|
The selection process was designed as indicated in the flow chart (Fig.
1B). Synthesis of the TFO library and the immobilized dh, as
well as the hybridization and selection strategies, are detailed under
"Materials and Methods."
First Round of Selection
A first round of selection with the TFO library M, containing the
complete set of 2048 individual oligonucleotides, was performed utilizing a temperature gradient as indicated under "Materials and
Methods." The results are shown in Fig.
2. Five fractions (M05 to M45) at five
different temperatures, containing individuals with increasing affinity
to the double helix, were isolated and subsequently deconvoluted by a
series of analytical steps explained in detail below.

View larger version (20K):
[in this window]
[in a new window]
|
Fig. 2.
Affinity chromatography profile of M, with
the names of the recovered fractions. Solid
line, absorbance at 260 nm; dashed
line, absorbance at 280 nm; gray line,
elution temperature.
|
|
UV Melting Experiments--
These experiments were performed in
order to verify that oligonucleotides were eluted exclusively as a
function of the thermal stability of the corresponding triple helices.
As expected, hysteresis was observed between heating and cooling
profiles due to slow kinetics of triplex formation (29-32). The
melting temperatures for third strand dissociation
(Tm) were determined at the maximum of the first
derivative of the heating curves. They are taken as indicative values
for the stability of the triple helices, and not as thermodynamic
parameters. The observed Tm of the triple helices confirmed the non-binding of the M05 fraction and the stability
order of the triple helices formed by the other fractions under the
conditions applied (Table I).
View this table:
[in this window]
[in a new window]
|
Table I
Relative size (%) of the AC fractions from the TFO library M
Table shows characterization by thermal stability of the triple helix
(Tm), 260 to 280 nm absorbance ratio, and retention
time (tr) in IE-HPLC (15-65% B buffer gradient in
25 min).
|
|
ESI-MS--
The differences in mass between ribo- and
deoxyribonucleotides originated by the presence or absence of the 2'-O
and of the 5-Me group in the case of rU/dT. These differences led to a
mass distribution in the library M as indicated in Fig.
3A, where each mass represents
from only 1 (in the corners of the table) to 200 (in the center)
individual TFOs. The ribocytidine content of the TFOs can thus be
easily quantified by the increase of n × 16 mass units
(n = 0-5) for each new rC, with respect to the all-DNA
TFO d. By contrast, the presence of a rU instead of a dT in the
oligonucleotide leads to an increase of only two mass units (+ 16 from
2'-O,
14 from 5-Me). Due to the fact that the spectrum is complicated
by the presence of isotopic peaks that overlap with those corresponding to different rU contents in each group of constant rC, it was not
possible to resolve the families of constant rC into their different
rU-containing subfamilies, so that we could only estimate the rU/dT
ratio by comparing the maximum values of the peaks obtained for each
family. The mass spectrum of M is depicted in Fig. 3B together with the calculated one (Fig. 3B,
inset). The detection of all signals near to the expected
relative intensities assesses the quality of the library. This mass
spectrum was thus used as a control for the study of the corresponding
fractions.

View larger version (51K):
[in this window]
[in a new window]
|
Fig. 3.
A, masses of the TFOs included in M and
theoretical number of isomers in each case (in italics).
B, calculated (inset) and experimental mass
spectra of M. C, quantification of the mass spectra of the
AC fractions of M: , M; , M05; , M15; , M25; , M35; ,
M45. D, estimation of the number of species in the rC
families of each AC fraction.
|
|
The rC content in each fraction was defined from the quantification of
the signals corresponding to different rC families, and compared with
the mass spectrum of the whole library M. It clearly emerges (Fig.
3C) that there is a preference for more rC over dC units
when moving toward the more stable AC fractions. The first eluting
fractions have a slightly lower rC content than the average, while M35
contains a higher rC content. The AC fraction containing the strongest
binding TFOs is populated by rC-rich species, suggesting that the
increase in triplex stability goes parallel with the increase in the
number of rC residues. In order to correct for the different sizes of
the rC families and the different sizes of the fractions, each mass
signal was normalized, and the values obtained for the same mass peak
in different fractions were compared in order to obtain a rough idea on
the number of species that they represent. For this calculation, the
assumptions were made that every TFO of the library is only present in
one fraction, and that each TFO represents an equal part of the
library. The values obtained are shown in the table in Fig.
3D. It can be seen that the proportion of each rC family,
which is present in the M45 fraction, doubles at each single increase
in rC content, corroborating that individuals with more rC units have a
higher probability of being found in the more stable TFO fractions.
The average rU content in each rC family was estimated from the maximum
value of the corresponding mass peak. We found values ranking from 2.6 to 4.6, with higher rU content in the more retained AC fractions for
every rC family. These results indicate that the triple helix
stabilization brought about by rU does not depend on the presence of rC
or dC, as the values are similar for all rC families in the same
fraction. Another conclusion is that the presence of rU is not a
general guarantee for high stability, as fraction M45 contains an
average of only 4.1 rU of 6 possible.
Partial Alkaline Digestion (PAD)--
At basic pH (pH > 9),
ribooligonucleotides undergo strand cleavage by 2',3'-cyclophosphate
formation. Deoxyribonucleotides are stable under these conditions.
Statistical control of this degradation process allows the reaction to
take place at a maximum of one nucleotide per chain, and the 5' end
32P-labeled fragments can then be detected and analyzed by
PAGE. The presence of ribonucleotides at each position along the TFO can thus be evaluated. Applied to the library M and its fractions, an
average ribonucleoside distribution is obtained at each position in the
chain. An example of this experiment is shown in Fig.
4. For the analysis of these data, the
signal intensities were corrected to a normalized total reactivity for
each fraction and related to those of the whole library M. This
method is complementary to analysis by MS since it allows detection
of the positions at which ribonucleotides can be found, whereas MS
reveals the total number of ribonucleosides in the TFOs of the
fraction.

View larger version (36K):
[in this window]
[in a new window]
|
Fig. 4.
Partial alkaline digestion of M and its AC
fractions. The numbers beside the
lines indicate the nucleotide at the 3' end of the remaining
fragment. d and r are the control DNA and RNA
TFOs, respectively.
|
|
In the stronger binding AC fraction M45, we find a requirement for
ribonucleotides in the central part (positions 4-9) and a preference
for dT over rU in positions 1 and 2. In positions 10 and 11, similar
reactivities were found for all fractions, indicating a lack of
preference for these positions. This explains the fact that the average
rU content found for M45 by ESI-MS is only 4.1, and that many
individuals with one or two dC are present in this fraction. It emerges
as a general rule that sequence-dependent differences in
triplex stability are more encountered for dT versus rU than
for dC versus rC replacements. In the low binding fraction M05, the ribonucleotides are evenly scattered over all positions, and
an anomalous migration is found for the fragments containing the
nucleotides 8 and 9, which is not reflected in the migration of shorter
and longer fragments, as well as in the mass spectra. A possible
explanation for this and the poor triple-helix formation propensity by
the members of this fraction is that a 2' to 3' phosphate migration of
a part of the library had occurred during synthesis or deprotection.
UV-visible Absorbance--
The UV-visible spectrum (210-350 nm)
of RNA and DNA of the same sequence and length differ in the position
of the absorption peak, because uridine has a near-UV maximum at 260 nm, rather than the 268 nm for deoxythymidine (33). Therefore, we used the A260/A280 ratio in
order to estimate the rU/dT contents of the fractions. As shown in
Table I, only little differences were found, with the higher rU/dT
ratio occurring in the first and last fractions. These results indicate
independently that a continuously increasing rU/dT ratio in the TFO
goes not in parallel with increased stability of the triple helix.
HPLC--
In IE-HPLC, the whole TFO library M eluted with
intermediate retention times compared with the pure TFOs r11d and d
(Table I). As expected, M45 and M35 have shorter retention times,
indicating a higher ribonucleotide content. During IP-RP-HPLC, average
retention times followed the sequence M45 < M35 < M05
M
M25
M15, clearly indicating the presence of more
ribonucleotides in the strong binding fractions. The intermediate
position of the fraction M05 can again be explained by the fact that it
mainly contains truncated 11-mers and sequences originating from
phosphodiester isomerization. The chromatograms of M35 and M45 show
better resolved peaks, which is in agreement with the presence of fewer species.
Design of Sublibrary P and Second Round of Selection
The following conclusions could be drawn from the results
obtained. (i) An "RNA core" is necessary for high triple helix
stability, (ii) ribonucleotides at the 3' end of the TFO have a
negative effect on stability, and (iii) TFOs with higher rC content
show higher triplex stability in a sequence-independent manner. Taking this into account, we designed and synthesized the sublibrary P
corresponding to the all-rC family of M. This library contains only 64 individuals, with all C in the ribo-form (except the 3'-terminal unit),
and dT and rU as variables. With this focused library, a second round
of selection was performed.
The TFO sublibrary P (Fig. 5) was
prepared much in the same way as described for M. The sublibrary was
again assayed for triplex formation by affinity chromatography on the
immobilized dh, as before. The resulting elution profile
(Fig. 6), as expected, showed the higher
populated fractions eluting at higher temperatures (P35 and P45). The
small peak arising at 90-ml elution volume is an artifact produced by
the change of eluent. The thermal denaturation UV profiles of the
collected fractions (Fig. 7) again verify
the success of the selection process. UV-visible absorption spectra were measured for all fractions. The highest
A260/A280 ratios were
found for P45 and P35, indicating a higher content of rU (Fig. 6). The
fractions were further characterized as before.

View larger version (11K):
[in this window]
[in a new window]
|
Fig. 5.
Sequence and characterization of the TFO
library P and the single TFOs used as controls. Symbols are as in
Fig. 1. a, melting temperature of the triple
helix; conditions were as indicated under "Materials and Methods"
(pH 5.7).
|
|

View larger version (34K):
[in this window]
[in a new window]
|
Fig. 6.
Top, affinity chromatography of the TFO
library P, with the names of the recovered fractions. Solid
line, absorbance at 260 nm; dashed
line, absorbance at 280 nm; gray line,
elution temperature. Bottom, relative size (%) of each AC
fraction and characterization by 260-280 nm absorbance ratio and mass
spectrometry.
|
|

View larger version (22K):
[in this window]
[in a new window]
|
Fig. 7.
Melting profile and
Tm values of the library P and its AC
fractions, at pH 5.7. , P; , P05; , P15; , P25; ,
P35; , P45. nd, not detected.
|
|
ESI-MS--
Seven different masses are possible for the TFOs of P,
ranking between 3589 for the one having six rU (r11d) and 3577 for the
one with six dT (r6d; see Fig. 5, 6). This small difference in mass did
not allow us to obtain resolved mass peaks by ESI-MS. Therefore, only
an average rU/dT ratio could be obtained from the mass spectra of each
fraction. The obtained values are shown in Fig. 5. P05 and P15 are
around the average value of the library. P25 has the lowest mass of the
P fractions, with an average of 2.6 rU per TFO, still not far from the
3 rU average of the library. P45 shows the highest mass, with a
corresponding average of 3.8 rU per TFO. This value fits well with the
results obtained with M45, the most stable TFO fraction from the
library M, showing a preference for rU at positions 3, 4, 6, and 9, and
for dT at positions 1 and 2. In summary, a maximum difference of 1.2 average rU per TFO has been found, between P25 and P45, which is in
agreement with the results obtained from the previous selection step
with the library M.
Partial Alkaline Digestion--
The results of these experiments
clearly show a stronger reactivity of P45 at positions 6 and 9, indicating, as previously shown for the M library, that an RNA core is
required in the TFO for stronger triple helix stability (Fig.
8). Another feature of P45 is a low
reactivity at positions 1 and 2. This means that dT is preferred over
rU at these positions of the TFO. This is the opposite behavior with
respect to positions 6 and 9. The P25 and P35 fractions also show
interesting PAD profiles. At positions 6 and 9, their reactivities are
average, being a little higher in the case of P35 compared with P25.
Focusing on positions 1-4, distinctly different reactivities are
found. P25 has the highest rU content of all the fractions at positions
1 and 2 and the lowest (together with P35) at position 3. P35 has the
lowest signals at positions 3 and 4. A logical explanation fitting with
these reactivities is that TFOs with two dT at positions 6 and 9 mainly belong to the fraction P25, with also some of them in P15. TFOs with
only one dT at position 6 or 9 belong to fraction P35, with some, that
carry rU at positions 1 and 2, falling into P25. The TFOs having the
RNA core (rU in central positions 6 and 9) are in the more stable TFO
fraction P45, except for a few individuals with mainly rU in positions
1 and 2, which go to P35. These results confirm the slightly
destabilizing effect of rU at positions 1 and 2, and the opposite
effect for positions 3 and 4. Looking at fractions P05 and P15, we can
observe a regular reactivity all along the nucleotides, with the same
anomalous migrating product between positions 8 and 9 that was present
in the fraction M05 of the first TFO library M (Figs. 3 and 6),
indicating again partial 3' to 2' phosphodiester isomerization during
synthesis and deprotection of the library P.

View larger version (86K):
[in this window]
[in a new window]
|
Fig. 8.
PAD of P and its AC fractions. The
numbers beside the lines indicate the
nucleotide at the 3' end of the remaining fragment.
|
|
IP-RP-HPLC--
The reduced number of individuals present in the
sublibrary P, as compared with M (26 versus
211), facilitates the resolution of the different AC
fractions by HPLC. The average retention times of P and its fractions
followed the sequence P45 < P35 < (P
P05
P15) < P25 (Fig. 9). This is in
agreement with the results of ESI-MS showing the same order for the
average rU content of the fractions. P35 counts for some 35 individuals
of the library, giving rise to ~20 peaks and shoulders. In the case
of P45, the ~16 individuals that are contained are almost resolved by
LC.

View larger version (23K):
[in this window]
[in a new window]
|
Fig. 9.
IP-RP-HPLC chromatograms of the library P and
its AC fractions. Linear gradient of 10-20% B buffer in 30 min
was used. Abscissa, elution time (min); ordinate,
absorption at 260 nm.
|
|
The large range of retention times of the TFOs in P45 allowed us to
separate this fraction into five well resolved subfractions (Fig.
10, top). The subfraction 1 corresponds to r11d, the only TFO of this library to have six rU
residues, which gives it a unique retention time. The other
subfractions were assayed by PAD (Fig. 10, bottom), and the
results confirmed our expectations. No differences were found between
the subfractions in the reactivity at positions 6 and 9, confirming
that all products within P45 have an RNA core. The sum of the
reactivities at positions 1-4 point to one missing rU-unit per
subfraction when going from 2 to 5, confirming that the "peak
families" are mostly separated by ribonucleotide content. 2 contains
at least two products of the four possible that carry only one dT at
positions 1 to 4. 3 contains individuals with two dT (6 possibilities),
with positions 1 and 2 being favored for the presence of dT. The
reactivity of 4 is consistent with the presence of three dT, with high
probabilities for positions 1 and 2. Finally, the two peaks in the
chromatogram corresponding to 5 represent the TFO bearing four dT in
positions 1 to 4, plus the one containing three dT in positions 1-3
and one rU in position 4. These conclusions are in agreement with UV-visible, ESI-MS, and PAD, which foresaw an average of three to four
rU units in P45.

View larger version (60K):
[in this window]
[in a new window]
|
Fig. 10.
Top, fractionation of P45 by HPLC.
Bottom, PAD experiment on P45 and its subfractions
(subfraction 1 not included; see "Design of Sublibrary P and
Second Round of Selection."
|
|
Single TFO Synthesis and Validation of the Combinatorial
Approach
In order to validate these results and conclusions, we undertook
the synthesis of representative individuals of the library P. The
synthesized TFOs are described in Fig.
11. r11d and r6d are the maximum and
minimum ribonucleotide-containing TFOs included in P. T12
and T34 and 4T allow for the verification of the sequence dependent effects of the rU
dT substitution at positions 1-4 observed during the selection process. T6 and
T9 are expected to be members of P35, and T69
should be a representative of the P25 fraction. dU6 was
synthesized in order to assess the effect of the 5-Me group of thymines
on the triple helix stability. The TFOs were synthesized, purified, and
characterized (Fig. 11) as described previously. PAD was performed in
order to verify the reactivity of the mixed sequences. The expected
patterns were observed, with digestion signals only present at
ribonucleotide positions (data not shown).

View larger version (31K):
[in this window]
[in a new window]
|
Fig. 11.
Sequence, name, and characterization of all
of the TFOs synthesized separately. Symbols are as in Fig. 1.
a, melting temperature of the triple helix;
conditions were as indicated under "Materials and Methods" (pH
5.7).
|
|
Thermal stabilities of the corresponding triple helices were measured.
As shown in Fig. 11, the conclusions from the study of the
combinatorial libraries are fully confirmed. The
Tm values of the TFOs fit within the values of
the fractions to which they were allocated. In particular, the
Tm of P25th is similar to that of r6dth. The
conclusion that this fraction is mainly composed by TFOs completely
lacking the RNA core is therefore verified. The
Tm of P35 is just 2.7-3.0° lower than the
Tm of T6 and T9, which
also fits with our conclusions. The Tm of P45 is
similar to that of 4T, lower than that for T12 and r11d,
but higher than that for T34. All these TFOs have, as
expected for the members of the P45 fraction, an RNA core and
compositional diversity at positions 1-4.
The single TFOs were characterized separately by their retention times
in IP-RP-HPLC and then were co-injected (Fig.
12). The retention times are, as shown
previously, mainly a function of the ribonucleotide content of the
oligonucleotide. Some isomers carrying dT nucleotides in the same
number but at different positions within the TFO, as, e.g.,
for T12, T34, and T69, are also
resolved. By comparison to the chromatograms of the P fractions (Fig.
9), the major part of the peaks, or groups of peaks, could thus be assigned with high reliability.
 |
DISCUSSION |
Oligonucleotide triple helices composed of DNA or RNA third
strands dramatically vary in stability, and are structurally
heterogeneous in backbone conformation (9-16). No general rules are
yet available to predict conformational preferences for any given TFO
to its specific DNA duplex target, rendering the design of high
affinity TFOs difficult. Thus, a combinatorial method allowing for a
conformational screen of TFOs, as the one presented here, is an
appropriate method for defining the rules that govern high triple helix
stability. When compared with the commonly used methods for selection
and amplification (34, 35), the method described here shows some distinct advantages. (i) No enzymes are required at any step of the
selection cycles, so that not only chimeric ribo- and
deoxyribooligonucleotides, but a wide variety of sugar or base-modified
ribo- and deoxyribooligonucleotide analogues can in principle be used.
(ii) The approach is target-driven. This property is of considerable
importance when there is need to design a strong binding TFO to a
predefined DNA double helix sequence. (iii) The selection process can
be performed using a variety of physical selection criteria, as,
e.g., pH or ionic strength, in addition to temperature. This
can be important when studying the effect of substitutes of protonated
cytosines in the TFO (see Ref. 17 and reference therein), or in the
context of non-anionic backbones (see Refs. 6, 36, and 37 for review). Applied to the model triple helix investigated here, the following conclusions can be drawn.
Conformational Diversity and Triple Helix Stability--
Certainly
one of the most important conclusions of the present study is that the
introduction of conformational variability in the backbone by using
concomitantly deoxy- and ribonucleotides in the TFO clearly does not
lead to substantially enhanced binding affinity to a DNA target
compared with an all RNA TFO. However, there exist subtle sequence
effects, which are more pronounced for rU/dT than for rC/dC pairs.
Looking at the results in detail we find an increase of 4.3 °C
(0.9°/nucleotide) by exchanging five dC to rC units (d
r6d). All
ribocytidines seem to contribute equally to stability of the triple
helix irrespective of their sequence context. This is not true,
however, for dT to rU substitutions. Here, considerable
sequence-dependent effects on stability were observed. For
positions 6 and 9, in which the U(T) residues are flanked by protonated
cytosines, differences in Tm rank from +6.4 to
+6.7 °/modification, in favor for the ribonucleosides. Surprisingly, this is a higher effect on thermal stability for one single
substitution as compared with changing all the (non-terminal) dC to rC
units. Both protonated rC and dC of the TFOs are expected to be
preferentially in N-type conformation, due to the anomeric effect (38,
39), whereas rU and dT are preferentially in N-type and S-type
conformation, respectively. Thus, changing dC to rC in the TFO is not
expected to significantly alter the overall conformation of the TFO,
whereas changing dT to rU introduces conformational inhomogeneity. The fact that C/T alternating sequence tracts in particular are sensitive to dT/rU exchanges shows that introduction of local conformational inhomogeneity in this base sequence context creates a dramatic destabilizing effect. In a T(U)-rich sequence context, the differences between deoxy versus ribo-substitutions are much less
pronounced to almost inexistent. Residues at positions 3 and 4 still
exert a non-negligible effect of 2.3°/modification (probably due in large part to positions 4, as it is neighboring a cytosine). In contrast, an exchange of rU by dT at positions 1 and 2 leads to a
slight increase in stability by 0.5°/modification. Even given that
5'- and 3'-terminal deoxynucleosides as in r11d and T12 may stabilize the helix more efficiently than ribonucleosides due to more
efficient stacking and/or more favorable solvation, it becomes clear
that within homo(A)-homo(T) tracts of a target duplex, there is no
significant destabilization arising from conformational inhomogeneity
in the third strand. This is rather surprising but might have its
origin in the special structural features of the poly(dA)-poly(dT)
helix (44).
Contribution of the 5-Me Group of Thymine to Triple Helix
Stability--
The introduction of a 5-Me group on deoxycytidine or
deoxyuridine has been shown to stabilize the double and triple helices, probably via an entropic effect (40-43), but other studies carried out
by isothermal titration calorimetry (20) or with cytosine analogues (21) failed to show any effect. In order to verify the extent
of this contribution in the system investigated, we synthesized the TFO
dU6, which can directly be compared with T6 and
r11d. The stability of dU6th was found to be half-way
between those of T6th and r11dth, indicating that the 5-Me
group of the thymine at position 6 has a nearly similar negative
effect on the thermal stability of the triple helix as has the absence
of the 2'-OH group when compared with r11d. Such a negative effect, which is opposite to what is found for cytidine and 5-methylcytidine, has never been described before. It cannot be excluded that the 5-Me
group of dT within an overall A-like conformation of such a chimeric
DNA/RNA leads to unfavorable steric interactions.
In this model study, we exploited differences in chemical and physical
properties (chemical reactivity, mass, hydrophobicity, absorption
spectra) in order to differentiate between ribo- and deoxyribonucleotides in TFO libraries and fractions. In particular, IP-RP-HPLC coupled with mass spectrometry and PAD have revealed as
versatile analytical tools that enable separation, constitutional identification, and positional assignment of modifications within a set
of TFOs. The results obtained here highlight the power of combinatorial
methods for understanding and improving molecular recognition in
interacting systems, where rational prediction (yet) falls short due to
the complexity of the system.
 |
ACKNOWLEDGEMENTS |
We thank Drs. J. Schaller and S. Schürch for help with the MS measurements.
 |
FOOTNOTES |
*
This work was supported by the Swiss National Science
Foundation and Novartis AG, Basel.The costs of publication of this
article were defrayed in part by the
payment of page charges. The article must therefore be hereby marked
"advertisement" in
accordance with 18 U.S.C. Section
1734 solely to indicate this fact.
To whom correspondence should be addressed. Tel.: 41-31-6314355;
Fax: 41-31-6313422; E-mail: leumann@ioc.unibe.ch.
Published, JBC Papers in Press, July 5, 2001, DOI 10.1074/jbc.M105794200
 |
ABBREVIATIONS |
The abbreviations used are:
TFO, triplex-forming oligonucleotide;
IE, ion exchange;
IP-RP, ion pairing
reverse phase;
HPLC, high performance liquid chromatography;
ESI, electrospray;
MS, mass spectrometry;
AC, affinity chromatography;
rC, ribocytidine;
rU, ribouridine;
dC, deoxyribocytidine;
dT, deoxyribothymidine;
dU, deoxyuridine;
PAD, partial alkaline digestion;
PAGE, polyacrylamide gel electrophoresis.
 |
REFERENCES |
| 1.
|
Felsenfeld, G.,
and Rich, A.
(1957)
Biochim. Biophys. Acta
26,
457-468
|
| 2.
|
Felsenfeld, G.,
Davis, D. R.,
and Rich, A.
(1957)
J. Am. Chem. Soc.
79,
2023-2024
|
| 3.
|
LeDoan, T.,
Perrouault, L.,
Praseuth, D.,
Habhoub, N.,
Decout, J. L.,
Thuong, N. T.,
Lhomme, J.,
and Hélène, C.
(1987)
Nucleic Acids Res.
15,
7749-7761
|
| 4.
|
Moser, H. E.,
and Dervan, P. B.
(1987)
Science
238,
645-650
|
| 5.
|
Radhakrishnan, I.,
and Patel, D. J.
(1994)
Biochemistry
33,
11405-11416
|
| 6.
|
Sun, J. S.,
and Hélène, C.
(1993)
Curr. Opin. Struct. Biol.
3,
345-356
|
| 7.
|
Vasquez, K. M.,
and Wilson, J. H.
(1998)
Trends Biochem. Sci.
23,
4-9
|
| 8.
|
Roberts, R. W.,
and Crothers, D. M.
(1992)
Science
258,
1463-1465
|
| 9.
|
Liquier, J.,
Coffinier, P.,
Firon, M.,
and Taillander, E. J.
(1991)
Biomol. Struct. Dyn.
9,
437-445
|
| 10.
|
Macaya, R. F.,
Schultze, P.,
and Feigon, J.
(1992)
J. Am. Chem. Soc.
114,
781-783
|
| 11.
|
Liquier, J.,
Taillander, E.,
Klinck, R.,
Guittet, E.,
Gouyette, C.,
and Huynh-Dinh, T.
(1995)
Nucleic Acids Res.
23,
1722-1728
|
| 12.
|
Bartley, J. P.,
Brown, T.,
and Lane, A. N.
(1997)
Biochemistry
36,
14502-14511
|
| 13.
|
Tarköy, M.,
Phipps, A. K.,
Schultze, P.,
and Feigon, J.
(1998)
Biochemistry
37,
5810-5819
|
| 14.
|
Gotfredsen, C. H.,
Schultze, P.,
and Feigon, J.
(1998)
J. Am. Chem. Soc.
120,
4281-4289
|
| 15.
|
Asensio, J. L.,
Carr, R.,
Brown, T.,
and Lane, A. N.
(1999)
J. Am. Chem. Soc.
121,
11063-11070
|
| 16.
|
Shimizu, M.,
Konishi, A.,
Shimada, Y.,
Inoue, H.,
and Ohtsuka, E.
(1992)
FEBS Lett.
302,
155-158
|
| 17.
|
Escudé, C.,
François, J. C.,
Sun, J. S.,
Ott, G.,
Sprinzl, M.,
Garestier, T.,
and Hélène, C.
(1993)
Nucleic Acids Res.
21,
5547-5553
|
| 18.
|
Noronha, A.,
and Damha, M.
(1998)
Nucleic Acids Res.
26,
2665-2671
|
| 19.
|
Han, H.,
and Dervan, P. B.
(1993)
Proc. Natl. Acad. Sci. U. S. A.
90,
3806-3810
|
| 20.
|
Torigoe, H.,
Shimizume, R.,
Sarai, A.,
and Shindo, H.
(1999)
Biochemistry
38,
14653-14659
|
| 21.
|
Hildbrand, S.,
Blaser, A.,
Parel, S. P.,
and Leumann, C. J.
(1997)
J. Am. Chem. Soc.
119,
5499-5511
|
| 22.
|
Imanishi, T.,
and Obika, S. J.
(1999)
Synth. Org. Chem. Jpn.
57,
77-88
|
| 23.
|
Prakash, T. P.,
Manoharan, M.,
Fraser, A. S.,
Kawasaki, A. M.,
Lesnik, E. A.,
and Owens, S. R.
(2000)
Tetrahedron Let.
41,
4855-4859
|
| 24.
| Amersham Pharmacia Biotech. Users Manual 56-1111-56: Gene
Assembler Special/4 Primers, Amersham Pharmacia Biotech,
Piscataway, NJ
|
| 25.
|
Usman, N.,
Ogilvie, K. K.,
Jiang, M. Y.,
and Cedergren, R. J.
(1987)
J. Am. Chem. Soc.
109,
7845-7854
|
| 26.
|
Scaringe, S. A.,
Francklyn, C.,
and Usman, N.
(1990)
Nucleic Acids Res.
18,
5433-5441
|
| 27.
|
Beier, H.,
and Gross, H. J.
(1991)
in
Essential Molecular Biology: A Practical Approach
(Brown, T. A., ed)
, pp. 221-236, IRL-Oxford University Press, New York
|
| 28.
|
Puglisi, J. D.,
and Tinoco, I., Jr.
(1989)
Methods Enzymol.
180,
304-325
|
| 29.
|
Maher, L. J., III,
Dervan, P. B.,
and Wolf, B. J.
(1990)
Biochemistry
29,
8820-8826
|
| 30.
|
Rougée, M.,
Faucon, B.,
Barcelo, F.,
Giovannangeli, T.,
Garestier, T.,
and Hélène, C.
(1992)
Biochemistry
31,
9269-9278
|
| 31.
|
Shindo, H.,
Torigoe, H.,
and Sarai, A.
(1993)
Biochemistry
32,
8963-8969
|
| 32.
|
Xodo, L. E.
(1995)
Eur. J. Biochem.
228,
918-926
|
| 33.
|
Cantor, C. R.,
and Schimmel, P. R.
(1980)
Biophys. Chem.
2,
382-383
|
| 34.
|
Pei, D.,
Ulrich, H. D.,
and Schultz, P. G.
(1991)
Science
253,
1408-1411
|
| 35.
|
Handerbol, P.,
and van Dyke, M. W.
(1995)
Proc. Natl. Acad. Sci. U. S. A.
93,
2811-2816
|
| 36.
|
Nielsen, P. E.
(1995)
Annu. Rev. Biophys. Biomol. Struct.
24,
167-183
|
| 37.
|
Sun, J. S.,
Garestier, T.,
and Hélène, C.
(1996)
Curr. Opin. Struct. Biol.
6,
327-333
|
| 38.
|
Plavec, J.,
Tong, W. M.,
and Chattopadhyaya, J.
(1993)
J. Am. Chem. Soc.
115,
9734-9746
|
| 39.
|
Thibaudeau, C.,
Plavec, J.,
and Chattopadhyaya, J.
(1996)
J. Org. Chem.
61,
266-286
|
| 40.
|
Povsic, T. J.,
and Dervan, P. B.
(1989)
J. Am. Chem. Soc.
111,
3059-3061
|
| 41.
|
Plum, G. E.,
Park, Y. W.,
Singleton, S. F.,
Dervan, P. B.,
and Breslauer, K. J.
(1990)
Proc. Natl. Acad. Sci. U. S. A.
87,
9436-9440
|
| 42.
|
Xodo, L. E.,
Manzini, G.,
Quadrifoglio, F.,
van der Marel, G. A.,
and van Boom, J. H.
(1991)
Nucleic Acids Res.
19,
5625-5631
|
| 43.
|
Leitner, D.,
Schröder, W.,
and Weisz, K.
(2000)
Biochemistry
39,
5886-5892
|
| 44.
|
Sanger, W.
(1984)
Principles of Nucleic Acid Structure
, p. 271, Springer Verlag, New York
|
Copyright © 2001 by The American Society for Biochemistry and Molecular Biology, Inc.