![]()
|
|
||||||||
J. Biol. Chem., Vol. 275, Issue 48, 38067-38072, December 1, 2000
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
From the Department of Life Sciences (Chemistry), Graduate School
of Arts and Sciences, The University of Tokyo, Komaba, Meguro,
Tokyo 153-8902, Japan
Received for publication, March 9, 2000, and in revised form, July 18, 2000
Sp1 is one of the well documented transcription
factors, but the whole structure of human Sp1 has not been determined
yet. In the present study, we isolated several cDNAs representing
two forms of human Sp1 mRNA with different 5'-terminal structures in HepG2 cells. Isolation of a genomic clone established that one of
the cDNAs represents the mRNA having consecutive alignment of
exons, which allowed deducing the complete amino acid sequence for
human Sp1. Another cDNA clone had a surprising structure that possessed an alignment of exons 3-2-3. Both reverse
transcriptase-polymerase chain reaction and RNase protection assays
confirmed accumulation of the two forms of Sp1 mRNA in HepG2 cells.
Because Southern blot analysis suggested that exon 3 is of a single
copy in the genome, the cDNA clone having the duplicated sequences
for exon 3 appeared to reflect the trans-splicing between
pre-mRNAs of human Sp1.
Transcription factor Sp1 was initially identified as a protein
that bound to multiple GGGCGG sequences (GC boxes) in the SV40 early promoter (1). Subsequent studies have shown that Sp1 also
interacts with GC boxes in the promoters of cellular and other viral
genes and activates expression of those genes (2, 3). Although Sp1 had
been regarded as a ubiquitous transcription factor that regulates
transcription from TATA-less promoters of housekeeping genes, recent
studies have suggested that Sp1 may be also involved in specific gene
activation through modulation of its abundance and phosphorylation and
glycosylation states in response to a variety of signals (4-8).
Likewise, our preliminary studies concerning gene expression responsive
to insulin suggested that synthesis and/or degradation of Sp1 protein
might be regulated by insulin. Accordingly, we started structural study
of human Sp1 mRNA to enquire any account for that apparent insulin
effect in its mRNA structure. Despite a large number of reports
concerning the function of Sp1, the complete structure of human Sp1
protein has not been established yet. The reported cDNA clones for
human Sp1 still lack the N-terminal and the upstream noncoding regions (9, 10), although a DNA-binding domain having three zinc finger motifs
and four transcriptional activation domains, termed domains A, B, C,
and D (11), have been identified in the partial Sp1 structure. We
report here accumulation of two forms of human Sp1 mRNA in HepG2
cells and the evidence that one form of the products was generated by
homotypic trans-splicing. The complete structure of human
Sp1 protein was also deduced from the cDNA sequence.
5' Rapid Amplification of cDNA Ends and Reverse
Transcriptase-Polymerase Chain Reaction--
Total RNA was extracted
from HepG2 cells using ISOGEN (Nippongene, Toyama, Japan) according to
instruction of the manufacturer. The single strand cDNA for 5'
RACE1 was prepared by
in vitro synthesis of cDNA with avian myeloblastosis virus reverse transcriptase XL (Takara Shuzo, Tokyo, Japan) using total
RNA (5 µg) and the primer RT (5'-TCTGTTCCTTTG-3') and digestion of
the template RNA with RNase H. When nucleotide positions were numbered
relative to the transcription start site that was identified in this
study, the primer RT corresponded to the positions from 2197 to 2186. The same nucleotide numbering was adopted throughout this paper. 5'
RACE was carried out using a 5' Full RACE Core Set (Takara Shuzo). The
first PCR was performed using the single strand cDNAs concatenated
by T4 RNA ligase and primers S1 (5'-GCTGGCAGATCATCTCTTCC-3', positions
2144-2163) and A1 (5'-ACCCTGTGAAAGTTGTGTGG-3', positions 2136-2117)
through a 25 cycle-amplification (94 °C for 30 s, 52 °C for
30 s, and 72 °C for 4 min). Then, a nested PCR was applied to
the first PCR products under the same condition using primers S2
(5'-GGATCCTCTGGGGCTACCCCTAC-3', positions 2164-2183) and
A2 (5'-GAATTCTGTGAGGTCAAGCTCACCTG-3', positions
2116-2096). Each primer contained both the sequence for a proper
segment in Sp1 gene and the sequence (underlined) for creation of a
restriction site. Each product of the nested PCR was cloned into pUC
vector for DNA sequencing.
For detection of the Sp1 mRNA with the exon 3-2-3 alignment by
RT-PCR, a 25-cycle amplification (94 °C for 30 s, 52 °C for 30 s, and 72 °C for 50 s) was applied to the single strand
cDNA that was used for 5' RACE and appropriate pairs of the
following primers; primer T1 (5'-CAAACAATCACCTTAGCCCC-3', positions
3265-3284), primer T2 (5'-AATGCAGGGTGTTTCCTTGG-3', positions
3285-3304), primer T3 (5'-CAAGATCACTCCATGGATGA-3', positions
1541-1560), primer T4 (5'-ATGAAATGACAGCTGTGGTG-3', positions
1557-1576), primer T5 (5'-CTGTGAGGTCAAGCTCACCT-3', positions
2116-2097), and primer T6 (5'-ATGATCTGCCAGCCATTGGC-3', positions
2156-2137).
For detection of the Sp1 mRNA with the exon 2-3-2 alignment by
RT-PCR, a single strand cDNA was prepared by reverse transcription from poly(A)+ RNA (1 µg) of HepG2 cells using the primer
R8 (5'-TGCCCGCAGGTGAGAGGTCTTG-3'); this primer was synthesized
referring to the cDNA sequence in the downstream of exon 3. The
first PCR was carried out using the primer X2
(5'-GTTCGCTTGCCTCGTCAGCG-3', positions 81-100), primer T3 and primer
2R-1 (5'-AAGGCACCACCACCATTACC-3', positions 1635-1616). The nested PCR
was accomplished through a 25-cycle amplification (94 °C for 30 s and 72 °C for 2 min) using the following primers; primer R14
(5'-TTCATCCATGGAGTGATCTTGGTCTGG-3', positions 1561-1539 and positions
3502-3499), primer R15 (5'-TTGAGCTTGTCCCTCAGCTGCCAC-3', positions
147-170), primer R16 (5'-TCCATGGATGAAATGACAGCTGTGGTG-3', positions
1550-1576), and primer R17 (5'-CTGGCAGCAACTTGCAGCAGAATTG-3', positions
2017-2041).
Isolation of a Genomic Clone--
Human genomic DNA was prepared
from HepG2 cells according to a standard protocol (12), and completely
digested with XbaI. Then a size-fractionated pool of the DNA
fragments was ligated with the phage vector Primer Extension Analysis--
The primer PE1
(5'-ATGGTGGCAGCTGAGGGACAAG-3', positions 173-152) was end-labeled with
[ RNase Protection Assay--
To construct template plasmids for
in vitro synthesis of riboprobes, DNA fragments were
amplified by PCR with two sets of primers. The primers XA1
(5'-AATAAGCTTGTTCGCTTGCCTCGTCAGCG-3', positions 81-100)
and XA2 (5'-TTATCTAGAAAGGCACCACCACCATTACC-3', positions
1636-1616) were used with the template of the 0.41-kb 5' RACE product,
and primers BA1 (5'-AATAAGCTTTCACACCCATTGCCTCAG-3', 3332-3349) and BA2 (5'-TAATCTAGAATTGCCCCCATTATTGCC-3',
1615-1598) were used with the 1.6-kb 5' RACE product. These primers
contained additional sequences (underlined) for creation of
XbaI or HindIII site at each end. Each amplified
DNA fragment was inserted between XbaI and
HindIII sites of pBluescript KS. The resulting plasmids were
linearized by digestion with HindIII, and antisense
riboprobes were synthesized from these T7 promoter-containing plasmids
in the presence of [ Genomic Southern Blot Analysis--
Genomic DNA (2 µg) from
HepG2 cells was digested completely with a restriction enzyme
(BamHI, EcoRI, PstI, or
XbaI), electrophoresed on a 0.7% agarose gel, and
transferred onto a Hybond-N membrane (Amersham Pharmacia Biotech). The
DNA on the membrane was allowed to hybridize with the
32P-labeled DNA fragment (positions 3225-3502) that
corresponded to exon 3 of the human Sp1 gene and washed under stringent
conditions (0.2 × SSPE plus 0.1% SDS, 15 min at 65 °C, two times).
Two Forms of Human Sp1 cDNA with Different 5'-Terminal
Regions--
To obtain a human Sp1 cDNA clone containing the
5'-terminal region, we employed a 5' RACE method using total RNA from
HepG2 cells. The primers were designed to anneal specifically to the sequence in a 5'-terminal region of human Sp1 on the basis of the data
registered in GenBankTM (accession number J03133). As shown
in Fig. 1A, three kinds of DNA
fragments with respective sizes of 0.34, 0.41, and 1.6 kb were mainly
amplified. The sequence analysis revealed that all the products indeed
possessed in common a known sequence in Sp1 gene; however, these
products were classified into two types based on the sequence upstream
of this sequence. The products of 0.41 and 0.34 kb had a new and
identical sequence in the immediate upstream of the known sequence,
although the 0.41-kb product contained the further upstream sequence of
71 bp long (Fig. 1B). By contrast, the 1.6-kb product had an
unexpected structure, in which another established sequence in the
downstream region of Sp1 gene was also linked upstream of the common
sequence (Fig. 1B).
Because we obtained different cDNA clones for the 5'-terminal
region of Sp1 mRNA, we analyzed the Sp1 gene in the human genome to
elucidate the mechanism of the generation of these differences. In
genomic Southern analysis with XbaI digestion, a single band of 14 kb was detected using the probe obtained from the 0.41-kb product
(data not shown); thus we constructed a genomic library from the
XbaI digest to obtain the genomic clone containing the 5'
region of human Sp1 gene. The screening of this library with the same
probe yielded a single positive clone, which was named
To confirm whether the 5'-terminal end of the 0.41-kb product
represented the 5' terminus of Sp1 mRNA, we next performed primer extension analysis with poly(A)+ RNA isolated from HepG2
cells. This experiment showed only a single band representing the
product extended 56-bp from the 5' end of the 0.41-kb product (Fig.
2B). There is no consensus
sequence for the splice acceptor site in the genomic sequence preceding the one corresponding to the 5' end region of 0.41-kb product. We
assume, therefore, the position 56 bp upstream from the 5' end of the
0.41-kb product as the transcription start site of human Sp1 gene (Fig.
2A). The genomic structure up to 266 bp upstream of this
putative transcription start site did not contain any TATA box-like
sequence but four possible GC boxes at positions starting from
Comparison of sequences of the genomic and cDNA clones revealed
exon-intron boundaries in the 5'-terminal region of Sp1 gene (Fig. 1,
B and C). The Establishment of the Presence of the Sp1 mRNA with the Exon
3-2-3 Alignment in HepG2 Cells--
To confirm whether the Sp1
mRNA represented by the 1.6-kb product is naturally produced or
not, RT-PCR analysis was performed with total RNA from HepG2 cells and
several sets of the primers (Fig.
4A). When the mRNA
corresponding to the 1.6-kb product is indeed present in HepG2 cells,
the 500-bp product is expected to be amplified in the RT-PCR with
primers T1 and T6 followed by the nested PCR with primers T2 and T5. As
the positive control, a nested PCR was also performed using primers T3
and T6 followed by primers T4 and T5 to amplify the 264-bp fragment.
These expected segments were all amplified in these PCRs (Fig.
4B, lanes 1 and 2), whereas no
amplifications were observed in the negative control PCR with primers
T6 and T5 (lane 3). We also confirmed that these amplified
products had the expected sequences. To provide further the direct
evidence for the occurrence of two forms for the Sp1 mRNA and to
estimate their accumulation levels, we next carried out RNase
protection assays with two antisense riboprobes (Fig. 5A). The riboprobe 1 that was
synthesized using the 1.6-kb product as the template has the sequence
complementary to the exon 3-2 boundary region covering 171 nt in exon 3 and 77 nt in exon 2. Thus, three sizes of protected fragments were
expected when riboprobe 1 was used; hybridization of this probe to the
mRNA corresponding to the 1.6-kb product should produce a 248-nt
fragment from the exon 3-2 junction-spanning regions and a 171-nt
fragment from the 3'-terminal region of exon 3 immediately preceding
exon 4, and hybridization to the other mRNA gives rise to a 77-nt
fragment from the 5'-terminal regions of exon 2 following exon 1. Such bands were clearly observed with total RNA from HepG2 cells in a
dose-dependent manner (Fig. 5B, lanes
2-4), whereas no band was observed with the yeast RNA that served
as a negative control (lane 5). The result with the
riboprobe 2 also confirmed the presence of the two Sp1 mRNAs. The
riboprobe 2 had the complementary sequence to the exon 1-2 boundary
region comprising 98 nt in exon 1 and 97 nt in exon 2 and gave rise to
two bands whose signal intensities were also dependent on the amount of
total RNA used (lanes 7-9). The fragment with 195 nt
corresponded to the full-protected product from the normal form of the
transcript, whereas the fragment with 97 nt corresponded to the
expected part of exon 2 in the mRNA related to the 1.6-kb product.
Together, these results directly demonstrate the presence of two forms
of Sp1 mRNAs with different 5'-terminal structures in HepG2 cells.
Furthermore, the signal intensity of each protected fragment also
suggested a significant level of accumulation of either form of Sp1
mRNA in HepG2 cells.
The Sp1 mRNA with the Exon 3-2-3 Alignment Is Produced by
trans-Splicing--
Because genomic rearrangement can cause exon
duplication in mRNA (13), we examined whether or not exon 3 of Sp1
gene is duplicated in the genome of HepG2 cells. Genomic Southern blot analysis was performed with genomic DNA digests with various
restriction enzymes using a DNA fragment from exon 3 as a probe. As
shown in Fig. 6, a single band was
detected in each lane (lanes 4-7). In addition, the signal
intensities of these bands were almost the same as that of the control
band for a single copy (lane 3). This estimation was further
validated by a parallel Southern analysis for an established single
copy gene, p53 gene, applied to the same DNA digests (data not shown).
These results suggested that exon 3 exists as a single copy in the
genome. Thus, not the genomic duplication but an RNA editing mechanism,
i.e. formation of circular RNA or trans-splicing,
appeared to give rise to the Sp1 mRNA with the exon 3-2-3 alignment. Because circular RNAs lack poly(A) tails per se,
we next performed RNase protection assay with poly(A)+-rich
RNA and poly(A)
We also investigated the structure upstream of the 3-2-3 alignment of
the trans-spliced Sp1 mRNA by RT-PCR. To examine whether exons 1 and 2 are located in the upstream of the 3-2-3 alignment, first
PCRs were carried out with primers X2 and 2R-1 or primers T3 and 2R-1
(Fig. 8A). Then the nested
PCRs were done using the primers R15, R16 or R17, and R14, which were
designed to anneal specifically to exon 1, exon 2, or exon 3, and the
exon 3-2 junction sequence, respectively (Fig. 8A). When the
first PCR product with T3 and 2R-1 primers was used as a template, the
amplified products were observed by the nested PCRs (Fig.
8B, lanes 2). DNA sequencing of these products
established their expected structure. In contrast, no product was
observed by the nested PCR when the first PCR product with X2 and 2R-1
primers was used as a template (lanes 1). The specificity of
the primers used were verified in the negative control PCRs using an
EcoRI-XbaI fragment (Fig. 1C) of a
genomic DNA clone (lanes G) or the cDNA clone containing
the exon 1-2-3-4 alignment (lanes C) as a template, and the
positive control PCR using the recombinant clone having the exon
1-2-3-2-3 alignment as a template (lanes R). Taken together,
the trans-spliced mRNA appeared to have exon 2 but not
exon 1 in its 5' region.
Here, we cloned human Sp1 cDNAs that represent two forms of
mRNAs with different structures in the 5'-terminal region. The results of RT-PCR and RNase protection assay confirmed that two Sp1
mRNAs are really present and accumulated in HepG2 cells. One of
them is generated through a well studied cis-splicing
process, and the other with the exon 2-3-2-3 alignment is by
trans-splicing. Consistently, we detected heterogeneous RNA
species in Northern analysis of RNA from HepG2 cells using a Sp1
cDNA fragment (positions 2765-3295) as a probe; the main band was
approximately 8.2 kb, whereas the other two were minor but still marked
representing smaller mRNAs (data not shown). The main band probably
corresponds to the main bands previously reported (4, 9, 14) for the human Sp1. Other distinct bands we observed correspond to smaller RNAs
whose occurrences may depend on cell lines and/or tissues, because the
smaller species also seem to be detected in MKN-28 (4) but not in HeLa
cells (9). Although we determined the structure of the 5'-terminal
region of human Sp1 mRNA in this study, the sequence for the 3'
noncoding region remains undetermined. We currently suspend, therefore,
the identification of those multiple bands in the Northern analysis
until accomplishment of the whole structure for Sp1 mRNA.
trans-Splicing is an RNA editing mechanism that produces
mature mRNA from separate pre-mRNAs. In trypanosoma, nematodes
and some other lower organisms, spliced leader RNA, which is similar to
spliceosomal U small nuclear RNAs, was ligated at the 5' ends of
diverse nuclear mRNAs (15-17). Another type of
trans-splicing has been discovered in plant mitochondria and
chloroplasts. In this trans-splicing, formation of group II
intron-like structures by base pairing between complementary segments
of introns in separate pre-mRNAs seems to be essential (17-19). In
mammalian cells, trans-splicing was first demonstrated
in vivo and in vitro using artificial RNA substrates (20, 21), and spliced leader RNA and actin-1 pre-mRNA from Caenorhabditis elegans (22). Subsequently, a few
examples of trans-splicing as a natural event have also been
found in mammalian cells (23-25). These trans-splicings
occur between different pre-mRNAs. On the other hand, very recent
studies unveiled trans-splicing between identical
pre-mRNAs, namely homotypic trans-splicing, in mammalian
cells in the expressions of the rat carnitine octanoyltransferase gene
(26), the rat SA gene (27), and the rat voltage-gated sodium channel
gene (28). Our present finding with the human Sp1 gene adds another
distinct example to the homotypic trans-splicing in
mammalian cells, suggesting this type of trans-splicing
might be rather a general mechanism for regulation of phenotype
expression in mammalian cells.
Based on our findings in this study, we propose the model of the
homotypic trans-splicing (Fig.
9). In this model, we present the
trans-spliced Sp1 mRNA that lacks exon 1. The reason
that we did not detect the trans-spliced Sp1 mRNA having
exon 1 is obscure. However, the presence of alternative transcription
start sites in intron 1 can be a candidate account, because we observed multiple products in the primer extension analysis with the exon 2-specific primer (data not shown). It has been proposed that trans-splicing process in mammals proceeds in spliceosome
complexes and through partial base pairing between two precursor
mRNAs (29). By the survey of complementarity between a segment in
the upstream of exon 2 and one in the downstream of exon 3, we found
two sets of complementary sequences (Fig. 9). We also found an exonic
splicing enhancer (ESE)-like sequence (GAGGAGGAGGG, positions
1680-1690) in exon 2 of the Sp1 gene (Fig. 9). The ESEs, which are
known to be involved in a weak splice site selection in alternative splicing with cooperation of serine/arginine-rich splicing factors (SR
proteins), are usually purine-rich sequences in the exons downstream of
a regulated 3' splice site (30-32). Recently, it has been also
demonstrated that ESE and SR proteins are important for
trans-splicing by an in vitro assay system (33,
34). Furthermore, two putative ESE elements were also reported in the
carnitine octanoyltransferase gene (26). Thus, the above-mentioned
complementary sequences and the ESE-like sequence are possibly involved
in the case of Sp1 mRNA maturation as well.
The biological significance of trans-splicing for Sp1
remains elusive. Because exon 3 in Sp1 mRNA mainly encodes the
transcriptional activation domains A and B (11), the product of the Sp1
mRNA with duplicated exon 3, if translated, has doubled
transcriptional activation domains. Although we suggested that the
trans-spliced form of human Sp1 mRNA lacked the first
exon, this possibility still remains because of the presence of the
second methionine in exon 2 (Fig. 3), which might serve as a
translational start site. Such a product may show stronger ability for
transactivation, because synergistic activation among activation
domains, named superactivation, was demonstrated; an added
transactivation domain elevated the ability of a truncated Sp1 having
domains for DNA binding and transactivation (35, 36). In this case, the
trans-splicing will result in a positive regulation. In
other cases, this trans-splicing might contribute to a
negative regulation by producing nonfunctional mRNA and reduces the
functional Sp1 mRNA. Although the result of RNase protection assays
suggests that the Sp1 mRNA with the exon 2-3-2-3 alignment has a
poly(A) tail, this possibility also remains. It is already proposed
that trans-splicing is a novel mechanism for regulating the
cellular events because the trans-spliced rat SA mRNA is
tissue-specific, and trans-splicing of pre-mRNA of rat
voltage-gated sodium channel is regulated by a nerve growth factor (27,
28).
Finally, our study also established the complete amino acid sequence of
human Sp1. A partial DNA sequence of 313 bp that covers only the
N-terminal coding region of Sp1 recently appeared in the data base
(accession number AJ272134), and this sequence perfectly matched to our
sequence. Although the newly identified amino acid sequence is not so
large, this information is valuable because a region close to the N
terminus of human Sp1 is critical for susceptibility to
proteasome-dependent degradation (37). In addition, the
three different sizes of mRNAs of mouse Sp1 were observed during
spermatogenesis, one of which encoded a Sp1 lacking the 7 amino acid
residues of N terminus (38). Therefore, the complete structure of human
Sp1 identified in this study will be also useful for further
investigation concerning the regulation of Sp1 function.
*
The costs of publication of this
article were defrayed in part by the
payment of page charges. The article
must therefore be hereby marked
"advertisement" in
accordance with 18 U.S.C. Section
1734 solely to indicate this fact.
Published, JBC Papers in Press, September 5, 2000, DOI 10.1074/jbc.M002010200
The abbreviations used are:
RACE, rapid
amplification of cDNA ends;
RT, reverse transcriptase;
PCR, polymerase chain reaction;
bp, base pair(s);
kb, kilobase pair(s);
nt, nucleotide(s);
ESE, exonic splicing enhancer.
Heterogeneous Sp1 mRNAs in Human HepG2 Cells Include a
Product of Homotypic trans-Splicing*
, and
![]()
ABSTRACT
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES
![]()
INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES
![]()
EXPERIMENTAL PROCEDURES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES
DASH II (Stratagene, La
Jolla, CA) to construct a human genomic library. This library was
screened by a plaque hybridization technique using the 120-bp DNA
fragment of human Sp1 cDNA (positions 56-175) as a probe.
-32P]ATP using T4 polynucleotide kinase (Nippongene).
The 32P-labeled primer (1.2 × 104 cpm)
was incubated with 16 µg of poly(A)+ RNA from HepG2 cells
at 30 °C for 16 h. The primer-RNA hybrid was precipitated with
ethanol and subjected to reverse transcription with avian
myeloblastosis virus reverse transcriptase XL at 42 °C for 60 min.
The extended product was analyzed on a 6% polyacrylamide gel
containing 8 M urea.
-32P]CTP using a T7 RNA Synthesis
Kit (Nippongene). The riboprobe for detection of the
-actin mRNA
was also synthesized using a
-actin human antisense control template
(Nippongene). RNase protection assays were performed with an RPA II kit
(Ambion, Inc., Austin, TX) according to the manufacturer's
instructions. In brief, riboprobes (each 5 × 105 cpm)
were incubated at 42 °C for 16 h with RNA samples as indicated in the figure legends. Then they were digested for 30 min at 37 °C
with 200 µl of a mixture of RNase A (2.5 unit/ml) and RNase T1 (100 unit/ml). The protected products were analyzed on a 6% polyacrylamide
gel containing 8 M urea.
![]()
RESULTS
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

View larger version (40K):
[in a new window]
Fig. 1.
Structure of cDNA and genomic clones for
human Sp1. A, products of 5' RACE using total RNA from
HepG2 cells were analyzed on a 0.7% agarose gel (lane P).
The sizes of the three major fragments are indicated. Lane M
represents the
DNA digested with HindIII as a molecular
size marker. B, schematic presentation of three 5' RACE
products. Each box is drawn on the basis of the exon-intron
structure identified in this study. The numbers of the
positions of the first and last bases of each exon are also shown
relative to the transcription start site identified in this study.
Arrowheads marked by 2095 indicate the 5' end
position of Sp1 cDNA previously deposited in GenBankTM
(accession number J03133). The translation initiation site
(ATG) is also shown in the 0.41- and 0.34-kb products.
C, the restriction enzyme map of a genomic clone,
Sp1E1.
X, XbaI; E, EcoRI;
H, HindIII. The DNA sequence of the region
(indicated below the map) was determined and registered in the
DDBJ/EMBL/GenBankTM (accession number AB039286). The
positions of exons 1-3 are also shown.
Sp1E1 (Fig.
1C). Characterization and sequence analysis revealed that
this genomic clone contained both the common Sp1 sequence and the new
sequence that was found in the present 5' RACE products (accession
number AB039286). This result indicated that the 0.41- and 0.34-kb
products contained an upstream exon of human Sp1 gene.
231,
182,
139, and
9, respectively, thus suggesting the possible
auto-regulation in Sp1 function (Fig. 2A). Although the
0.41- and 0.34-kb products did not contain the expected 5' terminus of
Sp1 mRNA, the newly determined sequence in these products had a
stop codon in the frame for Sp1 protein (Fig. 2A).
Therefore, the first methionine codon in this open reading frame
appeared to be the initiation codon and the complete amino acid
sequence of human Sp1 protein was thus deduced from the DNA sequences
of the 0.34- and 0.41-kb products. The deduced amino acid sequence of
human Sp1 is composed of 785 amino acid residues, and the calculated molecular mass is 80,691 Da. The resulting amino acid sequence of the
N-terminal region showed a high homology with those of mouse and rat
Sp1 proteins (Fig. 3).

View larger version (58K):
[in a new window]
Fig. 2.
Mapping of the transcription start site of
human Sp1 gene. A, the DNA sequence of the 5' flanking
region and the first exon of human Sp1 gene. Nucleotide numbers are
relative to the transcription start site, which is shown by a
bent arrow. The 5' end positions of 5' RACE products of 0.41 and 0.34 kb are also shown by small bent arrows. The
sequence of exon 1 is shown by capital letters, and others
are shown by small letters. A stop codon is
underlined, and the translation initiation codon is
boxed. Arrows indicate the putative Sp1 binding sites with
directions. B, primer extension analysis. Lane P,
the product by reverse transcription with a 5' end-labeled primer and
poly(A)+ RNA from HepG2 cells. Lanes A, C,
G, and T, the sequence ladders of the human Sp1
gene that was generated with the same primer. The position of the
extended product is indicated by an arrowhead. The position
corresponding to the 5' end of the 0.41-kb product is also shown with
an arrow.

View larger version (25K):
[in a new window]
Fig. 3.
Comparison of amino acid sequence of
Sp1. The amino acid sequence of the N-terminal region of human Sp1
protein that is deduced from the DNA sequences of the 0.41- and 0.34-kb
products is compared with those of mouse and rat Sp1 proteins.
Asterisks indicate the identical amino acid residues to
those of human Sp1. Gaps are inserted for the maximum
homology. The boxed amino acid residue is the first one
encoded by the DNA sequence previously deposited in
GenBankTM (accession number J03133).
Sp1E1 clone contained first
three exons of the Sp1 gene; the sizes of these exons were 178, 155, or
1513 bp, respectively. It was also shown that the 1.6-kb product had a
3'-terminal portion of exon 3 in the immediate upstream of exon 2; it
had the exon 3-2-3 alignment (Fig. 1B). The upstream exon 3 encoded exactly the same amino acid sequence coded by the downstream exon 3 in the same frame, except the codon at the junction between exons 3 and 2 (GAC), whereas the codon between exons 3 and 4 was GGT,
which encoded Asp and Gly, respectively.

View larger version (23K):
[in a new window]
Fig. 4.
Detection of the Sp1 transcript with the exon
3-2-3 alignment by RT-PCR. A, the positions and
directions of primers T1 through T6 in the cDNA structure are
indicated by arrows. The sizes of the expected fragments are
also shown. B, the amplified products were analyzed on a
1.5% agarose gel, and their sizes are shown at the side of
arrowheads. Lane 1, the product of the nested PCR with
primers T2 and T5 following the PCR with primers T1 and T6; lane
2, the product of nested PCR with primers T4 and T5 following the
PCR with primers T3 and T6; lane 3, the product of PCR with
primers T6 and T5; lane M, 50-bp ladders as size
markers.

View larger version (40K):
[in a new window]
Fig. 5.
RNase protection assay for the transcripts of
Sp1. A, the riboprobes for RNase protection assays. The
target sequences of riboprobes are shown beneath the structure of 5'
RACE products. Both riboprobes that were synthesized in
vitro from pBluescript derivatives contained a 40-bp vector
sequence as well. B, the riboprobes were incubated with 10 µg (lanes 2 and 7), 50 µg (lanes 3 and 8), or 100 µg (lanes 4 and 9) of
total RNA from HepG2 cells or 100 µg (lanes 5 and
10) of yeast RNA. The undigested probes were also loaded
(lanes 1 and 6). The positions of the products
are indicated by arrowheads, and the sizes of the products
are also shown. The nonspecific bands were marked by
asterisks, and a thin arrow shows an unidentified
fragment.
-rich RNA (Fig.
7). The distribution of the Sp1 mRNA
with the exon 3-2-3 alignment to these two fractions was similar to
that of the cis-spliced Sp1 mRNA. In addition, the
-actin mRNA that was used as a marker of fractionation, was
distributed similarly. Therefore, we concluded that the Sp1 mRNA
with the exon 3-2-3 alignment was produced by trans-splicing
between two Sp1 pre-mRNAs.

View larger version (54K):
[in a new window]
Fig. 6.
Genomic Southern analysis with exon 3 as a
probe. Human genomic DNA completely digested with BamHI
(lane 4), EcoRI (lane 5),
PstI (lane 6), or XbaI (lane
7) and a linearized plasmid containing exon 3 that was equivalent
to ten, three, or one copy per haploid of human genome
(lanes1-3) were used for hybridization. DNA size markers
are indicated on the left.

View larger version (31K):
[in a new window]
Fig. 7.
RNase protection assay with fractions rich in
poly(A)+ RNA or poly(A)
RNA. The
riboprobe 1 was incubated with either 100 µg of total RNA (lane
2), poly(A)+ RNA (lane 3), or
poly(A)
RNA (lane 4) that were prepared from
100 µg of total RNA or 100 µg of yeast RNA (lane 5). The
undigested riboprobe was also loaded (lane 1). The result
for the same samples using a
-actin probe was also shown below.
Three protected fragments were indicated by
arrowheads.

View larger version (38K):
[in a new window]
Fig. 8.
Detection of the Sp1 mRNA with the exon
2-3-2 alignment by RT-PCR. A, positions and directions
of primers used in this analysis are indicated by arrows on
an assumed trans-spliced Sp1 mRNA structure.
B, the nested PCR products were analyzed on a 1.0% agarose
gel. The primer sets used for the nested amplification are shown on the
top. The templates for the nested amplification were as
follows. Lanes 1, the first PCR product with X2 and 2R-1;
lanes 2, the first PCR product with T3 and 2R-1; lanes
G, 20 ng of human Sp1 genomic clone; lanes C, 20 ng of
the cDNA clone with the exon 1-2-3-4 alignment; lanes R,
a recombinant clone with the exon 1-2-3-2-3 alignment.
![]()
DISCUSSION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

View larger version (16K):
[in a new window]
Fig. 9.
Possible base pairing of human Sp1
pre-mRNA for trans-splicing. Exons and
introns are shown with boxes and lines,
respectively. Two sets of complementary sequences are also shown
together with their positions. A putative ESE element in exon 2 is
indicated by solid boxes.
![]()
FOOTNOTES
To whom correspondence should be addressed: Dept. of Life
Sciences (Chemistry), Graduate School of Arts and Sciences, The University of Tokyo, Komaba, Meguro, Tokyo 153-8902, Japan. Fax: 81-3-5454-6998; E-mail: csyanag@mail.ecc.u-tokyo.ac.jp.
![]()
ABBREVIATIONS
![]()
REFERENCES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES
1.
Dynan, W. S.,
and Tjian, R.
(1983)
Cell
35,
79-87
2.
Jones, K. A.,
and Tjian, R.
(1985)
Nature
317,
179-182
3.
Dynan, W. S.,
Sazer, S.,
Tjian, R.,
and Schimke, R. T.
(1986)
Nature
319,
246-248
4.
Kitadai, Y.,
Yasui, W.,
Yokozaki, H.,
Kuniyasu, H.,
Haruma, K.,
Kajiyama, G.,
and Tahara, E.
(1992)
Biochem. Cell Biol.
189,
1342-1348
5.
Viñals, F.,
Fandos, C.,
Santalucia, T.,
Ferré, J.,
Testar, X.,
Palacín, M.,
and Zorzano, A.
(1997)
J. Biol. Chem.
272,
12913-12921
6.
Daniel, S.,
Zhang, S.,
DePaoli-Roach, A. A.,
and Kim, K.-H.
(1996)
J. Biol. Chem.
271,
14692-14697
7.
Leggett, R. W.,
Armstrong, S. A.,
Barry, D.,
and Mueller, C. R.
(1995)
J. Biol. Chem.
270,
25879-25884
8.
Han, I.,
and Kudlow, J. E.
(1997)
Mol. Cell. Biol.
17,
2550-2558
9.
Kadonaga, J. T.,
Carner, K. R.,
Masiarz, F. R.,
and Tjian, R.
(1987)
Cell
51,
1079-1090
10.
Kadonaga, J. T.,
Courey, A. J.,
Ladika, J.,
and Tjian, R.
(1988)
Science
242,
1566-1570
11.
Courey, A. J.,
and Tjian, R.
(1988)
Cell
55,
887-898
12.
Sambrook, J.,
Fritsch, E. F.,
and Maniatis, T.
(1989)
Molecular Cloning: A Laboratory Manual
, 2nd Ed.
, pp. 9.14-9.23, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
13.
Pink, J. J.,
Wu, S.-Q.,
Wolf, D. M.,
Bilimoria, M. M.,
and Jordan, V. C.
(1996)
Nucleic Acids Res.
24,
962-969
14.
Hagen, G.,
Müller, S.,
Beato, M.,
and Suske, G.
(1992)
Nucleic Acids Res.
20,
5519-5525
15.
Murphy, W. J.,
Watkins, K. P.,
and Agabian, N.
(1986)
Cell
47,
517-525
16.
Sutton, R. E.,
and Boothroyd, J. C.
(1986)
Cell
47,
527-535
17.
Bonen, L.
(1993)
FASEB J.
7,
40-46
18.
Chapdelaine, Y.,
and Bonen, L.
(1991)
Cell
65,
465-472
19.
Wissinger, B.,
Schuster, W.,
and Brennicke, A.
(1991)
Cell
65,
473-482
20.
Konarska, M. M.,
Padgett, R. A.,
and Sharp, P. A.
(1985)
Cell
42,
165-171
21.
Solnick, D.
(1985)
Cell
42,
157-164
22.
Bruzik, J. P.,
and Maniatis, T.
(1992)
Nature
360,
692-695
23.
Li, B.-L.,
Li, X.-L.,
Duan, Z.-J.,
Lee, O.,
Lin, S.,
Ma, Z.-M.,
Chang, C. C. Y.,
Yang, X.-Y.,
Park, J. P.,
Mohandas, T. K.,
Noll, W.,
Chan, L.,
and Chang, T.-Y.
(1999)
J. Biol. Chem.
274,
11060-11071
24.
Shimizu, A.,
and Honjo, T.
(1993)
FASEB J.
7,
149-154
25.
Sullivan, P. M.,
Petrusz, P.,
Szpirer, C.,
and Joseph, D. R.
(1991)
J. Biol. Chem.
266,
143-154
26.
Caudevilla, C.,
Serra, D.,
Miliar, A.,
Codony, C.,
Asins, G.,
Bach, M.,
and Hegardt, F. G.
(1998)
Proc. Natl. Acad. Sci. U. S. A.
95,
12185-12190
27.
Frantz, S. A.,
Thiara, A. S.,
Lodwick, D.,
Ng, L. L.,
Eperon, I. C.,
and Samani, N. J.
(1999)
Proc. Natl. Acad. Sci. U. S. A.
96,
5400-5405
28.
Akopian, A. N.,
Okuse, K.,
Souslova, V.,
England, S.,
Ogata, N.,
and Wood, J. N.
(1999)
FEBS Lett.
445,
177-182
29.
Eul, J.,
Graessmann, M.,
and Graessmann, A.
(1995)
EMBO J.
14,
3226-3235
30.
Dirksen, W. P.,
Hampson, R. K.,
Sun, Q.,
and Rottman, F. M.
(1994)
J. Biol. Chem.
269,
6431-6436
31.
Sun, Q.,
Mayeda, A.,
Hampson, R. K.,
Krainer, A. R.,
and Rottman, F. M.
(1993)
Genes Dev.
7,
2598-2608
32.
Xu, R.,
Teng, J.,
and Cooper, T. A.
(1993)
Mol. Cell. Biol.
13,
3660-3674
33.
Bruzik, J. P.,
and Maniatis, T.
(1995)
Proc. Natl. Acad. Sci. U. S. A.
92,
7056-7059
34.
Chiara, M. D.,
and Reed, R.
(1995)
Nature
375,
510-513
35.
Pascal, E.,
and Tjian, R.
(1991)
Genes Dev.
5,
1646-1656
36.
Courey, A. J.,
Holtzman, D. A.,
Jackson, S. P.,
and Tjian, R.
(1989)
Cell
59,
827-836
37.
Su, K.,
Roos, M. D.,
Yang, X.,
Han, I.,
Paterson, A. J.,
and Kudlow, J. E.
(1999)
J. Biol. Chem.
274,
15194-15202
38.
Persengiev, S. P.,
Raval, P. J.,
Rabinovitch, S.,
Millette, C. F.,
and Kilpatrick, D. L.
(1996)
Endocrinology
137,
638-646
Copyright © 2000 by The American Society for Biochemistry and Molecular Biology, Inc.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
H. Li, J. Wang, G. Mor, and J. Sklar A Neoplastic Gene Fusion Mimics Trans-Splicing of RNAs in Normal Human Cells Science, September 5, 2008; 321(5894): 1357 - 1361. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. D. Viles and B. A. Sullenger Proximity-dependent and proximity-independent trans-splicing in mammalian cells RNA, June 1, 2008; 14(6): 1081 - 1094. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. B. Gerstein, C. Bruce, J. S. Rozowsky, D. Zheng, J. Du, J. O. Korbel, O. Emanuelsson, Z. D. Zhang, S. Weissman, and M. Snyder What is a gene, post-ENCODE? History and updated definition Genome Res., June 1, 2007; 17(6): 669 - 681. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Dixon, I. C. Eperon, and N. J. Samani Complementary intron sequence motifs associated with human exon repetition: a role for intragenic, inter-transcript interactions in gene expression Bioinformatics, January 15, 2007; 23(2): 150 - 155. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Shao, V. Shepelev, and A. Fedorov Bioinformatic analysis of exon repetition, exon scrambling and trans-splicing in humans Bioinformatics, March 15, 2006; 22(6): 692 - 698. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Dixon, I. C. Eperon, L. Hall, and N. J. Samani A genome-wide survey demonstrates widespread non-linear mRNA in expressed sequences from multiple species Nucleic Acids Res., October 19, 2005; 33(18): 5904 - 5913. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Rigatti, J.-H. Jia, N. J. Samani, and I. C. Eperon Exon repetition: a major pathway for processing mRNA of some genes is allele-specific Nucleic Acids Res., January 22, 2004; 32(2): 441 - 446. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Horiuchi, E. Giniger, and T. Aigaki Alternative trans-splicing of constant and variable exons of a Drosophila axon guidance gene, lola Genes & Dev., October 15, 2003; 17(20): 2496 - 2501. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wang, H. Su, and A. Bradley Molecular mechanisms governing Pcdh-gamma gene expression: Evidence for a multiple promoter and cis-alternative splicing model Genes & Dev., August 1, 2002; 16(15): 1890 - 1905. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Finta and P. G. Zaphiropoulos Intergenic mRNA Molecules Resulting from trans-Splicing J. Biol. Chem., February 15, 2002; 277(8): 5882 - 5890. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Warner, C. Finta, and P. G. Zaphiropoulos Intergenic Transcripts Containing a Novel Human Cytochrome P450 2C Exon 1 Spliced to Sequences from the CYP2C9 Gene Mol. Biol. Evol., October 1, 2001; 18(10): 1841 - 1848. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nicolas, V. Noe, K. B. Jensen, and C. J. Ciudad Cloning and Characterization of the 5'-Flanking Region of the Human Transcription Factor Sp1 Gene J. Biol. Chem., June 15, 2001; 276(25): 22126 - 22132. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||