From the Department of Biochemistry, University of Iowa,
Iowa City, Iowa 52240
Starvation inhibits and refeeding stimulates
transcription of the malic enzyme gene in chick liver. DNA between
320 and +72 base pairs (bp) is DNase I-hypersensitive in hepatic
nuclei from fed but not starved chicks (Ma, X. J., and Goodridge, A. G. (1992) Nucleic Acids Res. 20, 4997-5002). A polypyrimidine/polypurine
(PPY/PPU) tract lies within the DNase I-hypersensitive region. In
hepatocytes transiently transfected with plasmids containing
triiodothyronine response elements and a minimal promoter from the
malic enzyme gene linked to the chloramphenicol acetyltransferase gene,
deletion of the PPY/PPU tract inhibited chloramphenicol
acetyltransferase activity by about 90% with or without
triiodothyronine. Fine mapping of S1 nuclease-sensitive sites suggests
that the PPY/PPU tract can assume different isoforms of non-B-DNA, some
of which may be triplex structures. The PPY/PPU tract contains specific
binding sites for single- and double-stranded DNA binding proteins and,
with 8 bp 3
of the tract, can function as a promoter. A
(CT)7 repeat binds single-stranded DNA-binding protein and
is essential for promoter activity. Two C-rich elements bind
single-stranded DNA-binding proteins and may mediate inhibition of
promoter function. The single- and double-stranded DNA-binding proteins
that interact with the PPY/PPU tract may regulate transcription of the
malic enzyme gene.
INTRODUCTION
Transcription of eukaryotic genes is regulated by the interactions
of specific DNA sequence elements, their cognate transcription factors,
and the general transcription machinery. Genes that encode proteins
have two types of DNA elements (1, 2, 3). One element involves DNA
sequences that are specific to certain genes. Binding of transcription
factors to these elements and the subsequent interaction of those
factors with other parts of transcription machinery regulate gene
expression tissue-specifically and in response to exogenous agents such
as hormones. The other sequences are promoter elements common to most
genes; they provide a site for assembly of the transcription initiation
complex, identify the start site for transcription, and endow the gene
with a basal rate of expression. The most common type of promoter in
genes transcribed by RNA polymerase II contains TATA-box elements. In
promoters with TATA-boxes, binding of the general transcription factor
TFIID to the TATA-box represents the first step in transcription
initiation. There is, however, a growing number of promoters that lack
TATA-box elements (4, 5, 6, 7). These promoters are usually GC-rich, have
multiple transcription start sites, and use initiator sequences to
identify where the initiation complex should bind and initiate
transcription. How transcription is initiated and regulated on
promoters that lack TATA-boxes is not clear.
Regulation of promoter activity likely involves changes in conformation
of the trans-acting proteins when they bind to their specific sequence
elements and/or when a ligand, such as a hormone or another protein,
binds to a DNA-bound factor. Binding of a trans-acting protein to its
specific sequence elements also may cause the DNA to undergo changes in
conformation (8). Protein-induced changes in the secondary structure of
promoter DNA may regulate transcription. For example, binding of
protein to DNA may stimulate or stabilize the formation of a
functionally significant non-B-DNA structure. Alternatively, productive
binding of a regulatory protein to DNA may require prior formation of a
particular non-B-DNA structure. One of the earliest reports to suggest
a role for non-B-DNA in gene regulation showed that in actively
transcribed chicken globin genes, DNA in the chromatin of the promoter
region was sensitive to S1 nuclease treatment (9). Active transcription
was correlated with the formation of an unusual DNA structure with a
single-stranded region.
One of the non-B-DNA structures that has drawn a great deal of
attention in the last 10 years is the triple helix (10, 11). DNA
triplexes are formed by polypyrimidine/polypurine
(PPY/PPU)1 sequences. In this structure, a
DNA strand (donor strand) from one half of the sequence folds into the
major groove of the other half-duplex, forming Hoogsteen base pairs and
leaving the other strand in a single-stranded state. A triplex can be
either H-DNA if the pyrimidine strand is the donor, or H*-DNA if the
purine strand is the donor. A number of different triplex isoforms may
form along a long PPY/PPU sequence (12). Formation of specific
isoforms, or stabilization of those structures once formed, may
contribute to the regulation of transcription (13).
There are several reasons to believe that PPY/PPU tracts may have
important functional roles. On a statistical basis, PPY/PPU sequences
are overrepresented in eukaryotic genomes (14, 15, 16). Second,
triplex-forming PPY/PPU sequences have been discovered in the promoters
of several genes that lack TATA-boxes; the protooncogenes hEGFR,
c-ets-2, and c-Ki-ras are examples (6, 17, 18).
Deletion of the PPY/PPU sequences from these promoters decreases their
promoter activities in transient transfection assays. Third, when the
triplex-forming sequence of the c-myc gene is connected to a
heterologous promoter, it enhances expression of a linked reporter
gene. In a series of mutant PPY/PPU sequences, the degree of
transcription enhancement correlated with ability of the sequence to
adopt triplex structure (19). Finally, naturally occurring point
mutations in the triplex-forming sequence of the promoter of human
-globin gene cause abnormalities in regulation of transcription of
the genes of the hemoglobin locus (20, 21). Point mutations that
decrease the potential of the sequence to form triplex cause persistent
expression of
-globin in adults. Despite the evidence cited above,
the involvement of these unusual DNA structures in regulation of gene
expression remains controversial. Furthermore, the mechanisms by which
triplexes and related structures might regulate function remain
unclear.
Malic enzyme (L-malate:NADP+ oxidoreductase
(decarboxylating), EC) catalyzes the conversion of malate to
pyruvate and CO2. At the same time, it generates NADPH,
much of which is used for synthesis of fatty acids. Transcription of
the malic enzyme gene in liver is regulated by nutritional state and
endocrine status in vivo and by insulin, glucagon,
triiodothyronine (T3), glucocorticoids, and unesterified fatty acids in
hepatocytes in culture (22). The transition from a low transcription
rate in the starved state to a 40-fold higher transcription rate in the
refed state is accompanied by a profound change in chromatin structure
in the region from
320 to +72 bp, hypersensitive to DNase I and
certain restriction enzymes in livers from fed chicks and resistant in
starved ones. The maximum increase in DNase I hypersensitivity occurs
within 6 h, the same time required for the transcription rate to
reach its highest level. Sensitivity to DNase I and transcription rate
are rapidly and simultaneously decreased when food is removed from fed
chicks (23, 24).
The 5
-flanking region (5.8 kb) of the chicken malic enzyme gene has
been cloned, and the nucleotide sequence of 4.4 kb has been
determined.2 The promoter of the malic
enzyme gene is GC-rich, does not contain a TATA-box, and has multiple
transcription start sites. Like many promoters that lack a TATA-box
(11), the promoter of the malic enzyme gene contains a PPY/PPU tract;
it is located between
134 and
86 bp. This PPY/PPU tract is S1
nuclease-sensitive in the supercoiled state (25) and is a potential
candidate for involvement in the nutritional regulation of chromatin
structure and transcription rate. Here, we report an analysis of the
structure and function of this PPY/PPU tract. To our surprise, the
PPY/PPU tract plus an 8-bp 3
extension can function as an independent
promoter. To our knowledge, this is the first report that a PPY/PPU
tract with the potential to form triplex structures can be utilized to
direct transcription initiation in a promoter lacking a TATA-box.
EXPERIMENTAL PROCEDURES
Construction of Reporter Plasmids
Routine subcloning was
performed by standard methods (26, 27). Our initial reporter plasmid,
p[ME-5800/+31]CAT was made by inserting 5.8 kb of 5
-flanking DNA
plus 31 bp of 5
-untranslated sequence of the chicken malic enzyme gene
upstream of the bacterial reporter gene, chloramphenicol
acetyltransferase (CAT) in the promoterless construct,
pKS-CAT.2 pBH147CAT (Fig. 1) was made by inserting a
BstXI/HindIII fragment from pME[
5800/+31]CAT
upstream of a minimal promoter for malic enzyme linked to CAT
(pME[
147/+31]CAT). pBH147
PPYCAT is a mutated version in which
the sequence between
134 and
90 bp (all but 4 bp of the PPY/PPU
tract) was deleted using a polymerase chain reaction-based strategy
(28).
Fig. 1.
Functional importance of the PPY/PPU tract.
Panel A, sequence of the proximal promoter region of the
malic enzyme gene. The sequence is numbered from the major
transcription start site in the endogenous gene. The PPY/PPU sequence
between
134 and
86 bp is underlined; the 9-nucleotide
repeats are boxed; the major transcription start site is
indicated by an arrow. Panel B, graphic
representation of the wild-type reporter plasmid, pBH147CAT,
(pME[(
4135/
3471)ME
147/+31]CAT) used in the transient
transfection assay. pBH147CAT contains the
BstXI-HindIII fragment of the 5
-flanking region
of the malic enzyme gene joined to the minimal promoter of the
malic enzyme gene, all of which is linked to the CAT reporter
gene. The BstXI-HindIII fragment contains T3
response elements. In pBH147
PPYCAT
(pME[(
4135/
3471)ME
147/+31]
-134/-90CAT), the PPY/PPU
tract is deleted. Panel C, functional analysis of two
reporter plasmids in a transient transfection assay. Hepatocytes were
prepared as described under ``Experimental Procedures'' and incubated
in Waymouth medium 705/1 containing corticosterone (1 µM)
and insulin (50 nM). At about 20 h of incubation, the
medium was changed, and the cells were transfected with supercoiled
reporter plasmids using LipofectAceTM as described under
``Experimental Procedures.'' Two days after plating the cells, the
transfection medium was removed, and hepatocytes were treated with or
without T3 (1.6 µM) in the medium described above. After
an additional 48-h incubation, hepatocytes were harvested, and CAT
activity,
-galactosidase activity, and protein were measured. The
results were initially expressed as percentage of
[14C]chloramphenicol converted to acetylated
chloramphenicol per microgram of soluble protein and then corrected for
differences in transfection efficiency by dividing by
-galactosidase
activity of the same extract (A420 units per
microgram of protein). Relative CAT activities were then calculated by
setting the corrected CAT activity for T3-treated hepatocytes
transfected with pBH147CAT to 100 and adjusting all other activities
proportionately. The results are the means ± S.E. of seven
experiments, each one using an independently isolated batch of
hepatocytes. CAT and
-galactosidase activities of extracts from
T3-treated hepatocytes transfected with pBH147CAT were 0.54 ± 0.17 (mean ± S.E., n = 7) percent conversion/15
h/µg of protein and 32 ± 7 × 10
4 (mean ± S.E., n = 7) A420
units/min/µg of protein, respectively. Three or more independent
preparations of each plasmid were used.
To construct pME[
3903/
3703]TKCAT, a 201-bp malic enzyme fragment
from
3903 to
3703 bp was amplified by polymerase chain reaction and
inserted into the reporter plasmid pBLCAT2 (29) between the
SphI and BamHI sites. The promoterless construct,
pME[
3903/
3703]
TKCAT was made by removing the thymidine kinase
(TK) promoter from pME[-3903/
3703]TKCAT by digestion with
BamHI and BglII, followed by blunt-end ligation.
DNA fragments containing various parts of the PPY/PPU tract were made
by annealing two synthetic oligonucleotides and inserting the
double-stranded DNA into pME[
3903/
3703]
TKCAT in place of the
TK promoter. pME([
3903/
3703]ME-147/+31)CAT was constructed by
replacing the TK promoter of pME[
3903/
3703]TKCAT with malic
enzyme promoter sequence between
147 and +31 bp. The sequence of each
reporter plasmid was confirmed by sequencing using the dideoxy chain
termination method. DNA fragments used in constructing the reporter
plasmids and in other experiments are named by designating the 5
and
3
ends of each fragment relative to the major transcription start site
of the endogenous malic enzyme gene. For deletion constructs, a
``
'' precedes the deleted part of the DNA.
Transient Transfections
This procedure was carried out
essentially as described (30). Briefly, hepatocytes were prepared from
19-day-old embryos of white Leghorn chickens, plated at a high density
(22 µl of packed cells/35-mm plate), and incubated in Waymouth medium
705/1 supplemented with streptomycin (100 µg/ml), penicillin G (60 µg/ml), corticosterone (1 µM), and insulin (Eli Lilly,
Indianapolis, IN) (50 nM). At about 20 h of
incubation, the medium was changed, and the cells were transfected with
supercoiled reporter plasmids using LipofectAceTM. In
addition to the components named above, each plate contained 40 µg of
LipofectAce, 1.3 µg of pBH147CAT or an equimolar amount of another
reporter plasmid, 0.5 µg of pCMV
-GAL, and sufficient
pBluescriptKS(+) to make the total amount of transfected DNA 4.5 µg/plate. On day 2 (day 0 = day cells were prepared), the
transfection medium was removed, and hepatocytes were treated with or
without T3 (1.6 µM) in the medium described above. After
a 48-h incubation, hepatocytes in duplicate plates were harvested into
one tube and suspended in 200 µl of 0.1 M Tris-HCl, pH
7.8, 1 mM EDTA, 1 mM DTT, 10 µg/ml trypsin
inhibitor, and 0.174 mg/ml phenylmethylsulfonyl fluoride and lysed by
three cycles of freezing-thawing.
Freshly prepared cell extracts were used to measure protein and
-galactosidase activity (26, 31). Before measuring CAT activity,
extracts were heated to 60 °C for 30 min, and precipitated protein
was removed by centrifugation. Samples of heat-stable extract
containing the equivalent of 5-50 µg of unheated soluble protein
were incubated for 15 h at 37 °C in 0.1 M Tris-HCl,
pH 7.8, 1 mM EDTA, 12 µM
14C-chloramphenicol, 2 mM acetyl-CoA (150 µl
total volume). The products of the reaction were extracted with ethyl
acetate and separated by thin layer chromatography. Radioactivity in
substrate and products was visualized by autoradiography and measured
by liquid scintillation spectrometry or measured by direct
autoradiography using an InstantImagerTM (Packard
Instrument Co., Meriden, CT). The CAT activity was expressed initially
as the percentage of substrate converted to acetylated product per
microgram of unheated soluble protein and then normalized as described
in the figure legends.
Low Resolution Mapping of S1 Nuclease Sensitivity
pBH147CAT
or pBH147
PPYCAT (20 µg) were digested with 1 or 10 units of S1
nuclease at 37 °C for 15 min in a 100-µl reaction containing 30 mM sodium acetate, pH 4.6, 30 mM NaCl, 1 mM ZnCl2. The reaction was stopped by phenol
extraction, and the DNA was precipitated. DNA was then digested with
BamHI. Some DNA was treated with BamHI before
digesting with S1 nuclease. The resulting DNA fragments (about 1 µg)
were separated by size on 1% agarose gels. Gels were stained with
ethidium bromide, and the DNA fragments were visualized with UV
light.
High Resolution Mapping of S1 Nuclease and P1 Nuclease
Sensitivity
Supercoiled pME637 (
413 and +224 bp in the
PstI site of plasmid pIBI31) (20 µg) was partially
digested with 0, 0.05, or 0.2 units of S1 nuclease for 5 min at
37 °C in the S1 nuclease buffer described above or with 1 unit of P1
nuclease for 0, 1, or 4 min at 37 °C in a buffer containing 25 mM Tris-HCl, pH 7.0, 50 mM NaCl, 5 mM ZnCl2. Reactions were stopped by adding 0.5 M EDTA (1 µl) and extracting with phenol. DNA was
precipitated with ethanol and used for the following manipulations. To
detect nicks on the top strand, DNA (1 µg) was incubated with 10 pmol
of biotinylated oligonucleotide
5
-+136CTACCTTGATGAGGTGCGGGTC+115-3
at
95 °C for 5 min and then at 45 °C for 15 min in 19 µl of 10 mM KCl, 20 mM Tris-HCl, pH 8.8, 10 mM (NH4)2SO4, 2 mM MgSO4, 0.1% Triton X-100 and 100 µM each of dATP, dTTP, dGTP, and dCTP. To detect nicks on
the bottom stand, DNA was incubated with biotinylated oligonucleotide
5
-
367CACAAAAATAAGCGTGAGGAGGCAGG
342-3
at
95 °C for 5 min and then at 50 °C for 15 min.
Primer extension reactions containing 1 µl (2 units) of Vent
(exo
) DNA polymerase were incubated for 10 min at
76 °C and stopped by chilling on ice. The reaction mixture was then
added to 50 µl of 5 mM Tris-HCl, pH 8.5, 2.5 M NaCl, 0.5 mM EDTA containing 200 µg of
streptavidin-coated paramagnetic beads (Dynabeads M-280 streptavidin)
that had been pretreated as described (32). Binding was carried out for
30 min at room temperature. DNA bound to the beads was separated from
unbound DNA on a magnetic stand, and the unbound DNA was discarded. The
DNA template strand and the extended immobilized strand were denatured
in 50 µl of 150 mM NaOH at 50 °C for 5 min. The
extended immobilized strand was then separated from the template strand
by centrifugation or on a magnetic stand, and the extended strand was
discarded. The supernatant solution containing the templates was
neutralized with 50 µl of 150 mM HCl, and 10 µl of 1 M Tris-HCl, pH 7.4. Yeast tRNA (5 µg) was added to the
mixture, and the nucleic acids were precipitated. The DNA-RNA pellet
was dissolved in 36 µl of 10 mM KCl, 20 mM
Tris-HCl, pH 8.8, 10 mM
(NH4)2SO4, 2 mM
MgSO4, and 0.1% Triton X-100 and 100 µM each
of dATP, dTTP, dGTP, and dCTP; 18 µl of this mixture were then added
to 1 µl of 32P-labeled primer (1 pmol; top strand,
5
-
11CCTGCGGGAGCGGAGGCG
28-3
; bottom
strand, 5
-
194CGGTGGGTGACTCAGCG
178-3
). The
mixture was incubated at 95 °C for 1 min, then at 45 °C (bottom
strand) or 50 °C (top strand) for 15 min. Vent (exo
)
DNA polymerase (2 units) was added, and the reaction was incubated for
10 min at 76 °C. The reaction was stopped by chilling on ice. After
precipitating the DNA with ethanol, the extension products were
analyzed on 6% polyacrylamide, 8 M urea sequencing
gels.
DNase I Footprint of Single-stranded DNA
A PstI
fragment (
413 and +224 bp) was subcloned into plasmid M13mp19 at the
PstI site, and a clone containing the top strand of the
malic enzyme sequence was isolated and named pM13ME
413/+224(+). The
StuI-PstI fragment (
236 and + 224 bp) was
subcloned into M13mp19 at the PstI and SmaI
sites. The resulting plasmid, pM13ME
236/+224(
), contains the bottom
strand of the malic enzyme sequence. To prepare the top strand probe,
32P-labeled universal sequencing primer,
5
-GTAAAACGACGGCCAGT-3
, was extended on single-stranded
pM13ME
236/+224(
) using the Klenow fragment of Escherichia
coli DNA polymerase. The extension product was digested with
AvaII. To prepare the bottom strand probe,
32P-labeled oligonucleotide,
5
-
11CCTGCGGGAGCGGAGGCG
28-3
, was extended
on pM13ME
431/+224(+), and the extension product was digested with
StuI. Single-stranded probes were isolated after
electrophoresis through alkaline-agarose gels (27). About 100,000 cpm
of probe was preincubated with various amounts of nuclear protein in a
50-µl binding reaction containing 10 mM HEPES, pH 7.4, 50 mM KCl, 0.5 mM EDTA, 0.5 mM DTT,
0.1 µg/µl poly(dI-dC), 0.01% Nonidet P-40, 4% glycerol, 0.04 µg/µl bovine serum albumin. After a 1-h preincubation on ice, 5 µl of 100 mM Tris-HCl, pH 7.4, 500 mM
MgCl2, RQ-1 RNase-free DNase I (1 unit) was added. After 15 min in ice water, the reaction was stopped by 0.5 M EDTA (5 µl), followed by phenol extraction and precipitation with ethanol.
The resulting DNA fragments were separated by size on 6%
polyacrylamide, 8 M urea sequencing gels.
Footprinting of Double-stranded DNA
DNA fragments
containing the malic enzyme promoter were excised from plasmid
pME
413/+224 by digestion with either KpnI and
HindIII or AvaI and PstI. The top
strand was labeled at the HindIII site of the
KpnI/HindIII fragment; the bottom strand was
labeled at the AvaI site of the
AvaI/PstI fragment. About 0.5 ng of DNA probe was
incubated with 70 µg of nuclear proteins for 40 min on ice in 100 µl of 10 mM Tris-HCl, pH 7.4, 50 mM NaCl, 30 mM KCl, 0.5 mM EDTA, 1 mM DTT, 5%
glycerol, 0.03 µg/µl poly(dI-dC). DNase I and 5 mM
MgCl2 were then added, and the incubation continued for
another 10 min on ice. Digestion was stopped with 10 mM
EDTA, followed by phenol-chloroform extraction. The resulting DNA
fragments were precipitated with ethanol and separated by size on 5%
polyacrylamide, 8 M urea sequencing gels.
Gel Electrophoretic Mobility Shift Assay
Single-stranded
probes were labeled using T4 polynucleotide kinase. Double-stranded
probes were labeled using the Klenow fragment of E. coli DNA
polymerase. 32P-labeled probe (0.01 pmol) was mixed with
competitor DNA and then incubated with about 0.1 (single-stranded DNA
as probe) or 3 (double-stranded DNA as probe) µg of nuclear protein
in 20 µl of 10 mM HEPES, pH 7.4, 50 mM KCl,
0.5 mM EDTA, 0.5 mM DTT, 0.1 µg/µl
poly(dI-dC), 0.01% Nonidet P-40, 4% glycerol, 0.04 µg/µl bovine
serum albumin. The binding reaction was performed on ice for 45 min.
DNA and DNA-protein complexes were resolved on 6% nondenaturing
polyacrylamide gels.
Nuclear Extracts
Nuclear extracts were prepared from livers
of 2-week-old chicks. To prepare nuclear extracts from the livers of
intact chicks, two livers were homogenized in a Dounce homogenizer in
40 ml of 10 mM Tris-HCl, pH 7.4, 15 mM NaCl, 60 mM KCl, 0.15 mM spermine, 0.5 mM
spermidine, 2 mM EDTA, 0.1 mM EGTA, 0.2%
Nonidet P-40, 5% sucrose. The homogenate (20 ml) was layered over 25 ml of the same buffer with 10% sucrose. The nuclei were pelleted by
centrifugation for 15 min at 1700 × g and washed with
20 ml of 10 mM Tris-HCl, pH 7.4, 15 mM NaCl, 50 mM KCl, 0.15 mM spermine, and 0.5 mM spermidine. After a second 15-min centrifugation at
700 × g, the pelleted nuclei were resuspended in 20 mM HEPES, pH 7.4, 25% glycerol, 1.5 mM
MgCl2, 20 mM KCl, 0.5 mM EDTA, 1 mM phenylmethylsulfonyl fluoride, 0.5 mM DTT.
To extract nuclear protein, the KCl concentration was increased to 300 mM, and the mixture was incubated at 4 °C for 30 min
(27). Insoluble material was removed by centrifugation at 25,000 × g for 30 min. Nuclear extracts were dialyzed extensively
against 20 mM HEPES, pH 7.4, 100 mM KCl, 0.1 mM EDTA, 10% glycerol, 1 mM DTT and stored at
80 °C.
Oligonucleotides
Unmodified oligonucleotides were
synthesized and purified by the DNA Core Facility of the University of
Iowa. Oligonucleotides biotinylated at their 5
-ends were purchased
from Genosys Biotechnologies, Inc. (The Woodlands, TX). For the gel
electrophoretic mobility shift assay, double-stranded oligonucleotides
were made by annealing two complementary oligonucleotides (purified on
8% polyacrylamide gels and collected using an Elutrap device
(Schleicher & Schuell)).
Materials
Restriction enzymes were purchased from New
England Biolabs (Beverly, MA), except for BamHI, which was
obtained from Boehringer Mannheim, and were used in the buffers
provided by the manufacturers. Other enzymes were purchased from the
indicated sources: RQ1 RNase-free DNase (Promega, Madison, WI); S1
nuclease (Sigma); Vent (exo
) DNA polymerase, T4 DNA
ligase, and calf intestinal alkaline phosphatase (New England Biolabs);
Klenow fragment of E. coli DNA polymerase and Taq
DNA polymerase (Boehringer Mannheim); T4 polynucleotide kinase (New
England Biolabs). DNA sequencing kits were purchased from Bio-Rad.
Streptavidin-coated paramagnetic beads, Dynabeads M-280, were purchased
from Dynal (Lake Success, NY). Nucleotides and poly(dI-dC) were
purchased from Pharmacia Biotech Inc. Radiolabeled nucleotides were
purchased from Amersham Corp.
D-Threo-[dichloroacetyl-1,2-14C]-chloramphenicol
was from DuPont NEN. LipofectACETM and Waymouth medium MD
705/1 were obtained from Life Technologies, Inc. Corticosterone and
3,5,3
-L-triiodothyronine were purchased from Sigma.
Competent E. coli DH5
cells used for subcloning were
purchased from Life Technologies, Inc. Agarose was from Eastman Kodak
Co. All other chemicals were of reagent grade or the best quality
commercially available.
pIBI31 was obtained from Kodak. Bluescript KS+ was from
Stratagene (La Jolla, CA). Bruno Luckow and Gunter Schutz (German
Cancer Research Center, Heidelberg, Germany) provided pBLCAT2 (29).
pCMV-
GAL (33) was obtained from Richard Maurer (University of Iowa).
Supercoiled plasmid DNA was extracted from E. coli cultures
using alkaline lysis and purified by CsCl gradient centrifugation
(26).
Statistical Analysis
Where appropriate, statistical
significance of differences between pairs of means was determined by
the Wilcoxon signed rank test (34). Standard errors of the mean are
provided to indicate the degree of variability in the data.
RESULTS
Functional Role of PPY/PPU Tract
PPY/PPU tracts play
important functional roles in several promoters (6, 17, 18). We tested
the functional significance of the PPY/PPU sequence in the malic enzyme
promoter by comparing transcription from a promoter containing the
wild-type sequence (pBH147CAT) with that in a construct lacking all but
4 bp of the PPY/PPU tract (pBH147
PPYCAT). pBH147CAT contains a T3
response unit and a minimal promoter from the chicken malic enzyme gene
linked to a CAT reporter gene (Fig.
1B).2
In hepatocytes in culture transfected with the wild-type and deletion
constructs, T3 caused 8- and 20-fold stimulations of CAT activity,
respectively (Fig. 1C). In cells transfected with
pBH147
PPYCAT, however, T3-induced expression of CAT was 80% less
than that in cells transfected with pBH147CAT; the decrease was about
90% in cells not treated with T3. These results indicate that the
PPY/PPU tract is not required for T3 responsiveness but plays a major
role in determining both basal and T3-induced rates of transcription
from the malic enzyme promoter.
Sensitivity of the PPY/PPU Tract to Single-strand-specific
Nucleases
Analysis of the nucleotide sequence of the PPY/PPU
tract suggested that it might form triplex structures. Formation of an
intramolecular triplex structure leaves a loop of DNA in the
single-stranded form and requires the DNA to be in a supercoiled state
(8, 10, 11). We therefore examined S1 nuclease sensitivity in the DNA
of pBH147CAT. When supercoiled plasmid was treated with S1 nuclease and
then digested with BamHI, DNA fragments of 1.7 and 3.9 kb
were generated (Fig. 2). The intensities of both bands
increased as the amount of S1 nuclease in the reaction was increased.
These results indicated the presence of a single-stranded region 1.7 kb
from the BamHI site. When the PPY/PPU sequence at 1.7 kb
from BamHI site was deleted (pBH147
PPYCAT), neither the
1.7 nor the 3.9 kb band was detected, indicating that the PPY/PPU tract
was necessary for S1 sensitivity. When supercoiled plasmid was
linearized by BamHI before digestion with S1, the 1.7- and
3.9-kb fragments were not observed. Thus, formation of the
single-stranded region was dependent on supercoiling. This PPY/PPU
tract also conferred supercoiling-dependent sensitivity to
S1 nuclease when subcloned into other plasmid vectors (results not
shown).
Fig. 2.
Low resolution mapping of S1 nuclease
sensitivity. Supercoiled plasmid DNA (untreated, lanes
1 and 6) was treated with increasing amounts of S1
nuclease before (lanes 2, 3, 4,
7, 8, and 9) or after digestion with
BamHI (lanes 5 and 10). The
outer unnumbered lanes are molecular size markers (1 kb
ladder, Kodak). The two lower bands in lanes 3 and 4 are 3.9- and 1.7-kb fragments.
In an effort to determine the nature of non-B-DNA involved in the S1
sensitivity described above, we next determined which strand was
cleaved by the single-strand-specific nucleases S1 and P1 and which
specific bases were cleaved. For this purpose we modified standard
primer extension methodology. In this modification, supercoiled plasmid
DNA is partially digested by a single-strand-specific endonuclease to
generate a population of nicked but not linearized DNAs. The positions
of the nicks were then detected by primer extension. Two steps of
primer extension were employed. The first primer extension used a
sufficiently large amount of unlabeled biotinylated primer to compete
effectively for the template. The resulting extension products were
isolated from other DNA in solution using streptavidin-coated
paramagnetic beads. The DNA template strand and the immobilized
extension strand were then denatured under alkaline conditions and
separated by centrifugation or on a magnetic stand. The isolated
template strand was used for a second primer extension with a different
32P-labeled primer. The isolation and use of the
single-stranded template DNA increased the efficiency for primer
extension from a small amount of 32P-labeled primer. In
addition, formation of triplex structures can block the passage of DNA
polymerase (35). By using the procedure just described and performing
the primer extension at a high temperature we avoided artifacts
associated with such blockages. Prior to the first primer extension,
the plasmid is digested by restriction enzymes at both sides of the
region of interest. This provides a full-length extension product that
serves as an internal control in reactions without nuclease.
When supercoiled plasmid containing the malic enzyme sequence between
413 and +224 bp was incubated with nuclease, two kinds of cleavages
were observed (Fig. 3A). In one population,
the intensities of the signals increased with increasing nuclease
concentration, and the cleavages were specific to samples treated with
nuclease. The other population of cleavages was nonspecific and
occurred in both nuclease-treated and untreated samples. They probably
represent premature termination by DNA polymerase. On the top DNA
strand, S1 nuclease detected a major single-stranded region between
104 and
91 bp; this region of the PPY/PPU tract contains a
(CT)7 repeat. S1 cleavages were not detected on the
opposite strand of the same region. The asymmetrical distribution of
the single-stranded region suggests the DNA formed a triplex structure
in which one DNA strand was left in the single-stranded state while the
other strand was folded into the major groove of the double-stranded
DNA, where it was protected from enzymatic attack. DNA of this region
has the potential to form two triplex isoforms that, together, can
account for the observed digestion pattern (Fig. 3C,
structures a and b). The unpaired dinucleotides
on the bottom strands of these structures may not be accessible to
nuclease.
Fig. 3.
High resolution mapping of S1 nuclease and P1
nuclease sensitivity. Panel A, supercoiled pME637 (
413 and
+224 bp in pIBI31, 20 µg) was partially digested with 0 (lanes
1 and 7), 0.05 (lanes 2 and 8),
or 0.2 (lanes 3 and 9) units of S1 nuclease for 5 min at 37 °C or with 1 unit of P1 nuclease for 0 (lanes 4 and 10), 1 (lanes 5 and 11), or 4 (lanes 6 and 12) min at 37 °C as described
under ``Experimental Procedures.'' Lanes marked
A, G, C, and T are
sequencing reactions using the same primer used for the
nuclease-treated DNAs. Sequences of the complementary strands are
indicated so that they correspond to the strands being mapped.
Numbers indicate the nucleotide sequence with respect to the
major start site of transcription. Panel B, nucleotide
sequence of the PPY/PPU region, indicating major cleavages by S1
nuclease (*) and P1 nuclease (!) and minor cleavages by S1 nuclease (+)
and P1 nuclease (·). Panel C, possible non-B-DNA
structures consistent with the results shown in panels A and
B.
In addition to the major single-stranded region in the region of the
(CT)7 repeat, a minor S1 nuclease-sensitive region was
detected in the upstream part of the PPY/PPU tract between
115 and
128 bp. Again, no cleavage was observed on the bottom strand. DNA of
this region has the potential to form two triplex isoforms that,
together, can account for the observed digestion pattern (Fig.
3C, structures d and e). Cleavage in
this region was less extensive than that in the (CT)7
repeat. In this population of plasmid DNA molecules, isoforms
d and e may not be as abundant as isoforms a
and b.
We also used another single-strand-specific endonuclease, P1 nuclease,
to probe DNA structure in the region of the PPY/PPU tract. P1 nuclease
activity has a broader pH optimum, permitting assessment of structures
present at neutral pH. When supercoiled plasmid DNA was incubated with
P1 nuclease at pH 7.0 in the presence of 50 mM NaCl and 5 mM ZnCl2, the pattern of cleavages was
different from that observed with S1 nuclease at pH 4.6 (Fig.
3A). On the bottom strand, we observed a set of discrete
cleavages corresponding to the adenines in the (GA)7
repeat. P1 nuclease cleaves preferentially after adenines (36). This
result thus suggests that the purine strand was in the single-stranded
state and that only linkages after adenines were cleaved by P1 nuclease
under these conditions. Cleavage of the top strand in this region was
less extensive than that of the bottom strand. One interpretation of
this result is the presence of triplex structures that utilize
pyrimidines as the third strand, i.e. the H-DNA counterparts
of structures a and b (Fig. 3C).
Alternatively, the lack of cleavage may reflect the specificity of P1
nuclease, and a substantial fraction of DNA in this region may be in
the melted state (Fig. 3C, structure c).
As observed with S1 nuclease, the PPY sequence upstream of the
(CT)7 repeat was sensitive to P1 treatment while the
opposite strand was not; even the bond after the adenine was not cut by
P1 nuclease. This result suggests that only the PPY strand was in the
single-stranded state. This may reflect the presence of triplex
structures similar to those detected after treatment with S1 nuclease
except that the base triad ``CG*A+'' at
113 and
125
bp in the Watson-Crick duplex did not form (Fig. 3C).
Some nuclease P1 sensitivity was observed further downstream,
especially at
62 bp between two cytidines. A substantial cluster of
S1 nuclease-sensitive bonds also was observed on the bottom strand
between
78 and
58 bp. Both nucleases also caused cleavage in this
region on the top strand. The results suggest that the DNA is melted in
this C-rich region, but the mechanism is obscure.
Nuclear Proteins Bind to Double-stranded DNA in the PPY/PPU
Tract
The finding that sequences in the PPY/PPU tract could adopt
non-B-DNA structures and, in particular, single-stranded regions, led
us to examine nuclear extracts for proteins that would bind
specifically to double- or single-stranded regions of the PPY/PPU
tract. A number of possible DNase I footprints were detected on
double-stranded DNA between
413 and +224 bp (Fig. 4).
The clearest of these was located in the PPY/PPU tract between about
130 and
100 bp on the top strand and between about
125 and
100
bp on the bottom strand (Fig. 4). We used the gel electrophoretic
mobility shift assay to characterize this binding activity further
(Fig. 5). One major DNA-protein complex was formed when
nuclear protein was incubated with a double-stranded probe that spanned
134 to
103 bp. Unlabeled probe at 100-fold molar excess completely
eliminated binding to the labeled probe, suggesting that the binding
was sequence-specific. Neither the purine nor the pyrimidine strand
between
135 and
86 bp was able to compete for the binding at a
1000-fold molar excess. Thus, this nuclear protein(s) binds
specifically to double-stranded DNA and, unlike some PPY/PPU-binding
proteins (37), has no affinity for single-stranded DNA.
Fig. 4.
Footprinting of double-stranded DNA. DNA
fragments containing the malic enzyme promoter were excised from
pME
413/+224 by digestion with either
KpnI/HindIII or AvaI/PstI.
The top strand was labeled at the HindIII site of the
KpnI/HindIII fragment; the bottom strand was
labeled at the AvaI site of the
AvaI/PstI fragment. About 0.5 ng of DNA probe was
incubated with 70 µg of nuclear proteins for 40 min on ice. DNase I
and 5 mM MgCl2 were then added, and the
incubation continued for another 10 min on ice. The resulting DNA
fragments were separated by size on 5% polyacrylamide, 8 M
urea sequencing gels. Nuclear extracts were prepared from the livers of
2-week-old chicks; other details are given under ``Experimental
Procedures.'' Numbers indicate the nucleotide sequence with
respect to the major start site of transcription.
Fig. 5.
Gel electrophoretic mobility shift assay of
binding of nuclear protein to double-stranded PPY/PPU tract.
Double-stranded probe (
134 and
103 bp) was labeled using the Klenow
fragment of E. coli DNA polymerase. 32P-Labeled
probe (0.01 pmol) was mixed with competitor DNA at the indicated molar
ratios and then incubated with 3 µg of nuclear protein for 45 min on
ice. DNA and DNA-protein complexes were resolved on 6% nondenaturing
polyacrylamide gels. Nuclear extracts were prepared from the livers of
2-week-old chicks as described under ``Experimental Procedures.''
ssPU, single-stranded PPU tract (
135 and
86 bp);
ssPY, single-stranded PPY tract (
135 and
86 bp);
ds(dG)30, double-stranded
(dG)30·(dC)30.
Because this protein bound to the (dC)·(dG)-rich part of a nuclease
S1-sensitive region of the PPY/PPU tract, we were concerned that it
might be the erythrocyte-specific poly(dG·dC)-binding protein, BGP1,
or a related protein. BGP1 binds to an S1 nuclease-sensitive
poly(dG·dC) sequence in the promoter region of the chicken
-globin
gene and has been implicated in the regulation of expression of the
-globin gene (38, 39). Even though expression of BGP1 is restricted
to erythrocytes, our nuclear extracts could contain this protein
because both liver and isolated hepatocytes may contain small amounts
of erythrocytes. The minimal binding site for BGP1 is
(dG)7·(dC)7; we therefore used
(dG)30·(dC)30 to compete for the binding of
nuclear proteins to the double-stranded DNA fragment from
134 to
103 bp. No competition was detected, even when the double-stranded
competitor oligonucleotide was used at 5000-fold molar excess. This
result indicates that the protein is a sequence-specific
PPY/PPU-binding protein different from BGP1. The other minor bands on
this autoradiograph (Fig. 5) may be due to degradation of the protein
in the major complex or nonspecific binding proteins.
We also used the gel electrophoretic mobility assay to assess binding
of nuclear proteins to other regions of the PPY/PPU tract. We did not
detect specific binding to oligonucleotides containing double-stranded
DNA fragments spanning
117 to
90 bp or
105 to
78 bp (data not
shown). We have not yet analyzed binding to regions downstream of the
PPY/PPU tract using this approach.
Nuclear Proteins Bind to Single-stranded DNA from the PPY/PPU
Tract
The potential for the PPY/PPU tract to form non-B-DNA
structures prompted us to search for proteins that would bind
specifically to single-stranded regions in this structure. We first
performed a DNase I-footprint assay using single-stranded DNA as a
probe. On the top (PPY) strand, nuclear proteins protected a broad
region between about
142 and
84 bp (Fig. 6). The
protection increased when more nuclear extract was used. At the highest
level of nuclear protein, additional protected sites were noted both
flanking the PPY/PPU tract and further 5
and 3
. One of these appears
to map to a run of seven C nucleotides interrupted by one A just
downstream from the PPY/PPU tract at
76 to
68 bp. This site also
showed weak S1 nuclease sensitivity (Fig. 3A). The other
sites may represent protein binding to short runs of pyrimidines, which
are poorer binding sites and/or do not have the potential to form
single-stranded DNA in vivo.
Fig. 6.
DNase I footprint of single-stranded DNA of
malic enzyme promoter. pM13ME
413/+224(+) and
pM13ME
236/+224(
) were used to prepare top and bottom strand probes,
respectively, as indicated under ``Experimental Procedures.'' About
100,000 cpm of probe (0.2 pmol) was preincubated with increasing
amounts of nuclear protein (lanes 2-5 and
8-11). After a 1-h preincubation on ice, 1 unit of
RNase-free DNase I was added. After 15 min in ice water, the reaction
was stopped, and the resulting DNA fragments were separated by size on
6% polyacrylamide, 8 M urea sequencing gels. Nuclear
extracts were prepared from the livers of 2-week-old chicks as
described under ``Experimental Procedures.'' Products of reactions
that did not contain DNase I or nuclear protein are in lanes
1 and 7. Products of reactions that contained 4 µg of
nuclear protein but no DNase I are in lanes 6 and
12. The amount of reaction mixture added to each lane was
adjusted so that the signal for the full-length probe was about the
same intensity. Lanes marked A, G,
C, and T contained the products of sequencing
reactions using the same primer used for the treated DNAs.
Numbers indicate the nucleotide sequence with respect to the
major start site of transcription.
On the bottom (PPU) strand, the region between
124 and
90 bp was
not susceptible to cleavage by DNase I in the absence of nuclear
protein. Nevertheless, even the minor cleavages, especially at the 5
end of this region, were not protected by nuclear extract. Indeed, the
highest levels of extract caused increased cleavage, probably due to
endogenous nuclease in the extract. Lack of binding of nuclear proteins
to the PPU strand was confirmed by gel electrophoretic mobility shift
assays (data not shown).
Nuclear proteins that bind to the PPY strand were characterized using
the gel electrophoretic mobility shift assay. Three major binding
activities, A1, A2, and B, were identified when a single-stranded
oligonucleotide spanning nucleotides
119 to
90 was used as probe
(Fig. 7A). The specificities of these binding
activities were examined by using different synthetic oligonucleotides
to compete for binding. Complexes A1 and A2 were not competed by
d(A)30, d(G)30, or d(T)30, but were
competed by d(C)30 almost as efficiently as by unlabeled
probe. These results suggest that proteins in complexes A1 and A2
specifically recognized a C-rich sequence. The C-rich region in this
probe corresponds to the single-stranded loop in triplex
structure d (Fig. 3C). We therefore tested an
oligonucleotide corresponding to nucleotides
118 to
107 as a
competitor for binding of proteins in complexes A1 and A2 to the longer
probe. Fragment
118/
107 competed for the binding but with
substantially lower affinity than that of unlabeled probe. This result
suggested that proteins in complexes A1 and A2 recognize and can bind
to the single-stranded loop in the putative triplex structure
d. The observation that fragment
118/
107 binds less well than
fragment
119/
90 may reflect a nonspecific requirement for
additional bases at the 3
-end of
118/
107.
Fig. 7.
Gel electrophoretic mobility shift assay of
binding of nuclear proteins to single-stranded PPY sequences.
Single-stranded probes were labeled using T4 polynucleotide kinase.
32P-Labeled probe (0.01 pmol) was mixed with competitor DNA
at the indicated molar ratios and then incubated with about 0.1 µg of
nuclear protein as indicated under ``Experimental Procedures.'' The
binding reaction was performed on ice for 45 min. DNA and DNA-protein
complexes were resolved on 6% nondenaturing polyacrylamide gels.
Nuclear extracts were prepared from the livers of 2-week-old chicks as
described under ``Experimental Procedures.'' Panel A,
single-stranded oligonucleotide corresponding to PPY sequence between
119 and
90 bp was the probe. Panel B, single-stranded
oligonucleotide corresponding to PPY sequence between
135 and
115
bp was the probe.
The binding specificities of A1 and A2 were examined further with other
oligonucleotides. d(CT)10 and fragment
99/
86 were poor
competitors. The proteins in complexes A1 and A2 appeared to have some
affinity for double-stranded oligonucleotides because
d(CT)·d(GA)10 was modestly effective as a competitor.
This turned out to be an artifact caused by dissociation of the
double-stranded DNA in the binding reaction (see below). In sum, these
results suggest that the proteins in complexes A1 and A2 are
sequence-specific single-stranded DNA binding proteins that recognize
poly(dC) but may prefer that the C-run be interrupted by a T (or other
nucleotide).
The binding activity of protein in complex B also was
sequence-specific; binding was not competed by d(A)30,
d(G)30, d(T)30, or d(C)30 but was
competed by d(CT)10. This result suggested that the protein
in complex B may bind to the (CT)7 repeat of the probe. An
oligonucleotide corresponding to nucleotides
99 to
86, which
contains only four CT repeats, failed to compete for binding. Binding
of nuclear protein also was determined using a single-stranded fragment
spanning
101 to
81 as probe. Binding of the protein in complex B
was detected (results not shown). This fragment contains five CT
repeats and may represent the minimum binding site.
The ability of the protein in complex B to bind to double-stranded
(CT)10·(GA)10 also was tested; this protein
was competed by (CT)10·(GA)10, although not
as efficiently as by unlabeled probe. As described above for the A1 and
A2 complexes, however, this result was probably due to an artifact
caused by dissociation of double-stranded DNA. We labeled this
double-stranded oligonucleotide with T4 polynucleotide kinase and
incubated it at the same concentrations and under the same conditions
used in the competition reactions. Based on the subsequent analysis of
the DNA in a nondenaturing polyacrylamide gel, about 25% of the total
mass of DNA had dissociated into single-stranded oligonucleotides. We
also tested the binding activity of a double-stranded oligonucleotide
spanning
117 to
90 bp; this is the same region spanned by the
single-stranded probe,
119/
90. No binding activity was detected
with the double-stranded probe (data not shown). We conclude that the
protein in complex B is single-strand-specific and probably recognizes
the CT repeat.
DNase I footprinting of single-stranded DNA exhibited a broad region of
protection on the PPY strand that extended upstream to about nucleotide
144. We therefore tested binding of nuclear protein to the region
upstream of
119 bp using the single-stranded PPY tract from
nucleotide
135 to
115 as probe (Fig. 7B). Like the
protein(s) in complexes A1 and A2, the protein that formed a complex on
the 5
part of the polypyrimidine tract did not compete with
d(A)30, d(G)30, or d(T)30 but did
compete efficiently with d(C)30. In fact,
d(C)30 competed more efficiently than unlabeled probe.
Binding to this fragment also was competed by the 3
single-stranded
PPY fragment,
119 to
90. These results suggest that the binding
properties of the protein in complex C were similar to those in
complexes A1 and A2. Like the proteins in complexes A1 and A2, those in
complex C competed for
118/
107 substantially less well than for
unlabeled probe and practically not at all for d(CT)10. On
the other hand, the protein in complex C had a lower affinity for its
own probe than for fragment
119/
90. This may have been due to a
slight difference in nucleotide sequence, the run of Cs in
135/
115
has one less C and a differently positioned T. The similarity of
binding properties suggests that the protein in complex C is the same
as that in complex A1 or A2. Only one complex formed on fragment
135/
115, suggesting that A1 and A2 contain different proteins, only
one of which is the same as that in complex C.
The binding of protein in complex B to single-stranded probe was
enhanced when binding of protein in complexes A1 and A2 was reduced by
competition (Fig. 7A), suggesting that the proteins in A1
and A2 may compete with those in B in the following way: when complexes
A1 and/or A2 are formed, the oligonucleotide is no longer available for
binding of protein to B. We tested this hypothesis by changing the
ratio of nuclear protein to labeled probe in the binding reaction. When
the ratio of nuclear proteins to probe was highest, all probe was
shifted into complexes at A1 and A2. The proteins in complexes A1 and
A2 likely have higher affinities for probe than that in complex B or
are present at higher concentrations in the nuclear extract than the
one that binds to B. As the amount of protein in the extract was
reduced, complex B began to form. This result suggests that, although
the proteins in complexes A1 and A2 and that in complex B recognize
different sequences, the sites may overlap or that due to interactions,
formation of complexes A1 and A2 may preclude formation of complex B. To determine if binding of protein in complex B to its site can exclude
the binding of proteins in complex A, we started four binding reactions
with a ratio of nuclear protein to probe that permitted binding of
protein in complex B to the probe (Fig. 8). After a
30-min preincubation, increasing amounts of nuclear protein were added
to each binding reaction, and the incubation continued for another 30 min. Binding of protein in complex B was displaced by those in A1 and
A2 when sufficient nuclear protein was added to the preincubated
samples. This result suggests that when probe is limiting, only
proteins in complexes A1 and A2 will be bound.
Fig. 8.
Relative affinity of nuclear proteins for the
single-stranded probe corresponding to
119 to
90 bp. Lanes
1-4, 0.01 pmol of probe was incubated with 4 µg (lane
1), 0.8 µg (lane 2), 0.25 µg (lane 3),
or 0.08 µg (lane 4) of nuclear proteins for 30 min. 2 µl
of nuclear extract dialysis buffer was then added to each binding
reaction. Incubation continued for another 30 min before binding
reactions were loaded on gel. Lanes 5-8, 0.01 pmol of probe
was preincubated with 0.25 µg of nuclear proteins for 30 min. 0 (lane 5), 0.25 (lane 6), 0.8 (lane 7),
or 4 µg (lane 8) of nuclear proteins in 2 µl of dialysis
buffer was added to the binding reaction. Incubation was allowed for
another 30 min before binding reactions were loaded on gel.
The broad region of protection of single-stranded polypyrimidine tract
in the DNA footprint assay likely results from the binding of proteins
in complexes A1 and/or A2 to the two C-rich sites. Protein in complex B
also may bind to the (CT)7 region in the footprint assay,
depending on the DNA/protein ratio. Whether or not the protein in
complex B bound to the CT repeat in the footprint assay would not be
discernible, however, because the region of the CT repeat was resistant
to DNase I in the absence of added nuclear protein (Fig. 6).
Promoter Activity of the PPY/PPU Tract
The detection of
nuclear proteins that bind specifically to single-stranded DNAs that
may be single-stranded in triple-helical structures suggests that these
unusual structures may play a role in the function of the PPY/PPU
tract. What is the role of the PPY/PPU tract in regulating
transcription of the malic enzyme gene, and how could the protein-DNA
interactions that we have observed contribute to that function?
The transcription of the endogenous malic enzyme gene has its major
start site at +1 bp, with several minor start sites further upstream,
including ones at
74 and
89 bp.2 Messenger RNA
transcribed from p[ME
5800/+31]CAT in transfected hepatocytes also
has a start site at +1. Additional upstream start sites also are
detected, but they are similar in intensity to that at +1 rather than
minor as in the endogenous gene. In contrast, mRNA transcribed from
transfected pBH147CAT has major start sites of about equal intensity at
74 and
89 bp and little or no initiation from +1 bp. These results
and the inhibition of T3-induced and basal transcription caused by
deletion of the PPY tract from pBH147CAT suggested the possibility that
the PPY tract might serve as a promoter.
To test this hypothesis, we constructed several reporter plasmids in
which the TK promoter of pME[
3903/
3703]TKCAT was replaced by PPY
tract plus 8 bp of 3
-flanking DNA or variants thereof with different
deletions (Fig. 9A). The malic enzyme
fragment,
3903/
3703 bp, contains a T3 response unit composed of one
major and several weak T3 response elements.2 Promoter
activities of the resulting reporter plasmids were tested in transient
transfection assays in T3-treated chick embryo hepatocytes in culture.
The
3903/
3703 fragment itself lacked promoter activity (Fig.
9B). When the PPY tract plus 8 bp of 3
-flanking DNA were
inserted into the plasmid to form
pME[(
3903/
3703)ME
135/
78]CAT, the level of promoter activity
was comparable with that for hepatocytes transfected with pBH147CAT
(data not shown) and almost half that of
pME[(
3903/
3703)ME
147/+31]CAT. The latter contains the entire
minimal malic enzyme promoter from
147 to +31 bp. These results
indicate that the fragment from
135 to
78 was capable of acting as
an independent promoter.
Fig. 9.
Promoter activity of various sequences from
the PPY/PPU-encompassing region. Panel A, schematic
representation of test constructs. Promoterless reporter
pME(
3903/
3703)CAT was made by excising the TK promoter from
pME(
3903/
3703)TKCAT. The latter construct was made by inserting a
201-bp fragment of the malic enzyme gene (
3903 to
3703 bp) upstream
of the TK promoter in pBLCAT2. ME
3903/
3703 contains T3 response
elements and is designated a T3 response unit. Fragments of the PPY/PPU
region were ligated into pME(
3903/
3703)
TKCAT as indicated in the
figure. Panel B, transient transfection assay. Hepatocytes
were prepared and incubated as outlined in the legend to Fig. 1 and
``Experimental Procedures'' except that all plates contained T3.
Relative CAT activity was determined as described in the legend to Fig.
1 except that it was normalized to activity with
pME[(
3903/
3703)ME
135/
78]CAT. The results are expressed as the
means ± S.E. of 12 experiments. CAT and
-galactosidase
activities of the hepatocytes incubated with
pME[(
3903/
3703)ME
135/
78]CAT were 3.3 ± 0.5 (mean ± S.E., n = 20) percent conversion/15 h/µg of
protein and 37 ± 6 × 10
4 (mean ± S.E.,
n = 20) A420 units/min/µg of
protein, respectively. Two different preparations of each plasmid were
used in these experiments.
When the orientation of the PPY tract was reversed
(pME[(
3903/
3703)ME
78/
135]CAT), promoter activity was lost
(Fig. 9B). Thus, the potential for formation of an unusual
DNA structure was not sufficient for promoter activity; correct
orientation also was necessary. The nucleotide sequence between
88
and
80 bp is repeated exactly between
18 and
10 bp, suggesting a
possible role in transcription initiation. When most of the upstream
repeat was deleted from pME[(
3903/
3703)ME
135/
78]CAT, while
keeping the PPY/PPU tract intact, promoter activity was lost,
indicating that the PPY/PPU tract alone is not sufficient for promoter
activity. Deletion of the PPY/PPU tract (
134/
90) from
pME[(
3903/
3703)ME
135/
78]CAT also caused essentially complete
loss of promoter activity. Thus, one copy of the 9-bp repeat is
necessary but not sufficient for promoter activity. Deletion of the
(CT)7 repeat
(pME[(
3903/
3703)ME
135/
78
105/
92]CAT) also caused loss
of promoter activity but not to the same extent as when the entire
PPY/PPU tract or the 9-bp repeat was deleted. Finally, deletion of the
upstream part of the PPY/PPU tract between
135 and
106 bp caused a
64% increase in promoter activity.
In sum, these results suggest that both the CT repeats and the 9-bp
repeat are necessary for promoter activity and together are sufficient
for promoter activity. We have not detected a protein-binding activity
specific for the 9-bp repeat, but both the essential CT repeat and the
upstream part of the PPY/PPU tract specifically bind single- or
double-stranded proteins. Interestingly, the
(CT)7·(GA)7 region remained S1
nuclease-sensitive in pME[(
3903/
3703)ME
105/
78]CAT (data not
shown).
DISCUSSION
Several types of non-B-DNA structures play important roles in
various cellular events (8). One non-B-DNA structure is the triplex
structure. When the triple helix was first described, the requirement
for supercoiling and acidic conditions raised questions about its
physiological role in regulating the functions of DNA. However,
accumulating evidence suggests that triplex structures can form at
neutral pH, in a process favored by divalent metal ions and polyamines
(11, 40, 41). Furthermore, chromatin structure and other local
protein-DNA interactions result in a high degree of supercoiling in
intact cells (42, 43, 44). Recent work has expanded the repertoire of
sequences that can form the third strand of a triplex by demonstrating
an increased tolerance for certain mismatches in Hoogsteen base pairing
of H*-DNA (11, 45), further increasing the likelihood that triplexes
will form under physiological conditions.
The transition from B-DNA to triplex is proposed to start with melting
of the central region of a PPY/PPU tract (40, 46, 47), followed by the
bending of the region and formation of Hoogsteen base pairs. The
abundance and the stability of a particular form of triplex structure,
therefore, is determined largely by the energy required to melt the
central region and the length of Hoogsteen base-paired region. In the
PPY/PPU sequence of the malic enzyme gene, the sequence upstream of
(CT)7 is C-rich and more difficult to melt than the
downstream part. This may explain why the triplex structures formed by
the upstream region are less abundant than those formed in the
downstream region at acid pH (Fig. 3). The (CT)7 sequence,
although easier to open up, is not as long as other CT repeat sequences
that easily form stable triplex structures. The formation of triplex
structures a and b (Fig. 3C) by the
(CT)7-centered region may have been facilitated by
protonation of adenines in the acidic S1 digestion buffer. The
nonorthodox base triad CG*A+ has been observed as a major
component in H*-DNA formed under the acidic conditions (45). At neutral
pH, lack of protonation of A nucleotides introduces four continuous
mismatches in structure a and reduces the outer stretch of
Hoogsteen base pairs in structure b to three. The outer four
base pairs are separated from the remaining legitimate Hoogsteen base
pairs by two mismatches. Thus, the loss of one base pair in
structure b and the four mismatches in structure
a should have a substantial destabilizing effect on these triplex
structures. Consistent with this reasoning, our results did not suggest
formation of triplex structures by the (CT)7-centered
region at neutral pH. The non-B-DNA structure that we detected in this
region is likely the melted form.
In the 5
-flanking DNA of the malic enzyme gene from +31 to
419 bp,
the A(CT)7TT sequence in the PPY/PPU tract is the only
17-bp region that has an (A + T) content greater than 50%; the rest of
this region of the malic enzyme gene is very GC-rich. From +132 to
249 bp, the region is 70-80% GC; from
249 to
449, it is 67%
GC.2 Under the stress of supercoiling, the CT region may
open more readily than its neighboring sequences; the next step would
be formation of triplex structures, although the short length of the CT
repeat may preclude formation of a stable triplex structure. The
formation of transient triplex structures might shift the equilibrium
from B-DNA toward the melted structure, trapping the region in a
dynamic non-B-DNA state.
The ability of the d(CT)7 repeat to adopt
supercoiling-dependent non-B-DNA structures may explain why
this region is required for promoter activity in transfected
pME[(
3903/
3703)ME-135/
78]CAT. One of the crucial steps in
transcription initiation is melting of double-stranded DNA in the
promoter region. Conditions that promote melting not only stimulate
transcription initiation but also reduce or eliminate the requirements
for several general transcription factors (48, 49, 50). In another study,
the initiator-binding protein, YY1, supported transcription initiation
in the absence of TATA-binding protein on a supercoiled promoter but
not on a linear promoter (51), indicating that sequence-specific
changes in DNA structure caused by supercoiling may be involved in
TATA-binding protein-independent transcription.
We propose the following model for promoter function of the region
between
135 and
78 bp. Three sequence elements are important. The
first element, 5
-
88CCCGCAGGA
80-3
, is
essential for the promoter activity and may identify the transcription
start site. The second indispensable region is the (CT)7
repeat. The third important sequence is upstream of the CT repeat,
contains two C-rich elements, and in transfections has a negative
influence on promoter activity. The stress of supercoiling should cause
the PPY/PPU tract to melt and adopt one of several possible non-B-DNA
conformations. We suggest that a melted structure formed by the
(CT)7 repeat represents the active promoter conformation.
Binding of the (CT)n-specific protein to the top strand may be
sufficient to maintain the open conformation. The open structure of the
CT repeat may facilitate entry of RNA polymerase into the template.
Alternatively, interaction of the (CT)n-specific protein with
other factors may facilitate transcription initiation.
The sequence upstream of the CT repeat may serve as a regulatory unit.
Formation of triplex structure by the upstream region should release
supercoiling stress and may prevent formation of an open conformation
within the CT repeat. Binding of the poly(dC)-specific protein to
single-stranded regions of the triplex structures might stabilize the
triplex structure and its negative activity. Binding of the
poly(dC)-specific protein would not only lock the sequence in the
inactive triplex conformation but also might prevent the binding of the
(CT)n-specific protein (Fig. 8) even if the CT repeat did
undergo some melting. The negative effect of the upstream PPY/PPU
sequence, however, might be suppressed by the double-stranded DNA
binding protein because it would prevent formation of the triplex
structure, permitting the energy of supercoiling to melt the CT repeat
and facilitate transcription initiation.
In our model, we propose that two single-stranded DNA binding proteins
are important in regulating promoter activity. Single-stranded DNA
binding proteins with binding specificities similar to those we
describe have been identified in several mammalian organisms (37,
52, 53, 54, 55). Numerous reports suggest that single-stranded DNA binding
proteins are involved in transcription regulation. For example, the
levels of several single-stranded DNA-binding proteins correlate
with the level of expression of certain genes (56, 57). Two
single-stranded DNA-binding proteins have been cloned and contain
domains characteristic of some transcription factors (55, 56). One of
them, CNBP, stimulates expression of a reporter gene driven by a
promoter and DNA fragment containing CNBP binding sites (58). In
addition to proteins isolated by virtue of their ability to bind to
single-stranded DNA, some transcription factors have comparable or
higher affinity for one strand of their recognition sequence than for
the double-stranded form (59, 60). In the case of estrogen receptor,
for example, the affinity for noncoding strand is 60-fold higher than
that for the double-stranded sequence. Furthermore, some well known DNA
cis-elements are sensitive to S1 nuclease and may provide binding sites
for specific single-stranded DNA-binding proteins (61, 62). Finally, a
single-stranded DNA-binding protein, SSB, activates a promoter by
stabilizing a DNA hairpin structure, providing evidence that the
interplay between the secondary structure of DNA and binding of a
single-stranded DNA binding protein is involved in the promoter
function (63). All of these results support the hypothesis that
single-stranded DNA-binding proteins play important roles in
transcription and its regulation.
What is the role of the PPY/PPU in transcription of the endogenous
malic enzyme gene? It has its major start sites at +1, with only minor
transcription initiation immediately downstream of the PPY/PPU
sequence. This result suggests that, normally, the endogenous gene may
not use the PPY/PPU tract as a promoter. In the reporter plasmid,
pME[
5800/+31]CAT, use of start sites just downstream of the PPY/PPU
tract is essentially equivalent to that at +1. In the reporter plasmid,
pBH147CAT, both major start sites are just downstream from the PPY/PPU
tract. Furthermore, transcription is greatly curtailed in the absence
of the PPY/PPU tract. The increased usage of start sites at about +1
(pME[
5800/+31]CAT versus pBH147CAT) suggests that
sequence elements upstream of
147 bp may play a role in start site
selection. In any event, our results suggest that a PPY/PPU tract can
define a cryptic site for transcription initiation in certain promoters
and suggests a mechanism by which transcription could be initiated and
regulated in promoters lacking a TATA box.
One important difference between the endogenous malic enzyme gene and
transiently transfected malic enzyme genes is likely to be the lack of
assembly of the DNA into normal chromatin on the transiently
transfected genes. Proper chromatin structure may add constraints that
prevent this PPY/PPU tract from acting as a promoter in the endogenous
gene. The converse also may be true; changes in DNA structure may
regulate chromatin structure. For example, a PPU tract in the promoter
region of the chicken adult
-globin gene is involved in regulation
of nucleosome structure (38). Furthermore, it has been proposed that
nucleosomes cannot assemble on parts of the DNA that are in a triplex
structure (8). The transition from a low transcription rate in the
starved state to a 40-fold higher transcription rate in the refed state
is accompanied by a profound change in chromatin structure in the
region from
320 to
72 bp; chromatin in this region is
hypersensitive to DNase I in livers from fed chicks and resistant to
DNase I in starved chicks. The transition from DNase I sensitivity to
DNase I resistance occurs within 4 h and occurs with the same
kinetics as the increase in transcription rate (23). The changes in
chromatin structure do not appear to be caused by the altered
transcription rate because this region is hypersensitive to DNase I in
hepatocytes in culture whether the rate of transcription is high or low
(64). We speculate that the PPY/PPU-binding proteins that we have
described play a role in regulating chromatin structure. If so, they
may play important roles in mediating the effects of feeding and
starvation on transcription of the malic enzyme gene.