![]()
|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 52, 39953-39962, December 29, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
From the Institut de Biologie Moléculaire et Cellulaire, UPR CNRS Architecture et Réactivité de l'ARN, Université Louis Pasteur, 15 Rue René Descartes, 67084 Strasbourg Cedex, France
Received for publication, September 5, 2006 , and in revised form, November 7, 2006.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
ZNF76 and ZNF143 are two human homologs of Xenopus Staf, with ZNF143 being the ortholog and ZNF76 a likely paralog (68). ZNF143 is conserved in vertebrates and urochordates, such as Ciona intestinalis, whereas ZNF76 is only present in vertebrates. In this study, human Staf will be referred to as hStaf/ZNF143.
Moreover, Staf possesses the capacity to stimulate chloramphenicol acetyltransferase expression from a synthetic mRNA promoter (SBS-tkCAT), containing SBS linked to the basal promoter of the thymidine kinase gene fused to the chloramphenicol acetyltransferase reporter (9). The presence of two physically and functionally distinct activation domains constitutes the molecular basis underlying the abilities of hStaf/ZNF143 to stimulate transcription from either snRNA-type or mRNA promoters. Indeed, although a 93-amino acid domain made of four repeat units is specialized for activating transcription from an mRNA promoter, a segment of only 18 amino acids acts specifically on snRNA and snRNA-type promoters (10). To date, only the following seven protein coding genes have been described to be regulated by hStaf/ZNF143: the cytosolic chaperonin containing t-complex polypeptide 1 (TCP1) (11); the interferon regulatory factor (IRF3) (12); the neuronal nitric-oxide synthase (NOS1) (13); the transaldolase (TALDO1) (14); the aldehyde reductase (AKR1A1) (15); the mitochondrial ribosomal protein S11 (MRPS11) (16); and the synaptobrevin-like 1 (SYBL1) (17).
In this work, we describe the results of an analysis aimed at identifying hStaf/ZNF143-binding sites across mammalian genomes. We have used in silico genome-wide analysis to identify 1175 SBS distributed in 938 promoters, among which 295 were chosen for experimental validation by chromatin immunoprecipitation (ChIP). Semi-quantitative PCR confirmed the binding of hStaf/ZNF143 to 266 (90%) of the tested loci. Most of the identified SBS were located near the transcription start site and were tightly associated with CpG islands. Extrapolated to the full sizes of the mammalian genomes, this partial mapping revealed an unexpected large number of SBS in the promoters of protein coding genes, with at least 2500 SBS distributed in 2000 promoters. Furthermore, this study established that the presence of SBS alone is sufficient to stimulate the expression of a luciferase reporter gene, suggesting that hStaf/ZNF143 can recruit the transcription machinery and up-regulate transcription of an extraordinary high number of genes.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
|
Identification of additional Staf-binding sites from the work of Xie et al. (20) was performed as follows. The coordinates of the discovered motifs corresponding to sequences ACTAYRNNNCCCR, ACTACNNNNCCC, and ACTACNNNTCCCR (Xie motifs), in direct and reverse orientations, were obtained on line. Sequences located 3' to the Xie motifs were extracted with the Human Genome Browser. 3'-Extended Xie motifs were visually examined for the presence of an associated Staf-binding site. Conservation of the newly identified Staf motifs, in the mouse, rat, and dog genomes, was evaluated by the vertebrate Multiz alignment. Sequence logos were created using WebLogo (21).
Reporter ConstructsWTSBS-Luc, mut-1 SBS-Luc, and mut-2 SBS-Luc were obtained by inserting into the promoter-less pGL3-basic vector (Promega) a 150-bp KpnI-HindIII DNA fragment containing three copies of the tRNASec gene SBS, either the WT GTACCAGCATGCCTCGCG, mut-1 GTACCATACGTACTCGCG, or mut-2 GTAAACGCATGCCTCGCG sequences, respectively (1, 9).
Transfection and Luciferase AssaysCOS-7 cells were transfected by the calcium phosphate co-precipitation procedure with 1 µg of reporter construct, 0.5 µg of pCH110 plasmid as the internal control, and carrier DNA to bring up the total DNA content to 10 µg/plate. After 24 h, cells were lysed, and the
-galactosidase activity was measured as described previously (10). The luciferase assay was performed as recommended by the manufacturer. The luciferase activity was normalized to the
-galactosidase activity. Each transfection experiment was done in triplicate.
ChIP AssayThe ChIP procedure was essentially as described in Weinmann and Farnham (22) with a few modifications. HeLa cells (106 cells/ml) were treated with 1% formaldehyde at room temperature for 10 min. The reaction was stopped by addition of glycine to a final concentration of 125 mM. Cells were washed once with ice-cold phosphate-buffered saline containing protease inhibitors, scraped, centrifuged at 1500 rpm for 10 min, then resuspended in cell lysis buffer (5 mM HEPES-NaOH, pH 8, 85 mM KCl, 0.5% Nonidet P-40, protease inhibitors), and kept on ice for 10 min. They were homogenized 10 times with a Dounce homogenizer, and the resultant homogenates were centrifuged at 4000 rpm for 10 min to pellet the nuclei. Nuclei were resuspended in 1.5 ml of shearing buffer (50 mM Tris-HCl, pH 8, 10 mM EDTA, 1% SDS, protease inhibitors). Nuclear lysates were sonicated on ice to an average chromatin length of 0.51 kb (three pulses of 20 s at a setting of 25 with a Vibra-cell sonicator; Bioblock Scientific) and then centrifuged at 14,000 rpm for 12 min. The supernatant was incubated with 10 volumes of ChIP buffer (50 mM HEPES-NaOH, pH 7.9, 140 mM NaCl, 1% Triton X-100, 0.01% SDS, protease inhibitors) with protein A-Sepharose for 2 h at 4°C. After protein A-Sepharose removal, the precleared lysates were used as the soluble chromatin for ChIP. Chromatin was incubated at 4 °C overnight with 3.5 µg of anti-hStaf/ZNF143. Control for no antibody was included. Immune complexes were recovered by incubation for 2 h at 4°C with protein A-Sepharose previously blocked in buffer containing 2 mg/ml bovine serum albumin and 1 mg/ml salmon testis DNA. Immune complexes were washed four times sequentially in the following buffers (shearing buffer; wash buffer 1: 50 mM HEPES-NaOH, pH 7.9, 300 mM NaCl, 1% Triton X-100, 0.1% SDS; wash buffer 2: 50 mM HEPES-NaOH, pH 7.9, 500 mM NaCl, 1% Triton X-100, 0.1% SDS; wash buffer 3: 50 mM Tris-HCl, pH 8, 250 mM LiCl, 0.5% Nonidet P-40, 1 mM EDTA, 1% deoxycholate; TE buffer). The DNA-protein complex was eluted in 400 µl of elution buffer (100 mM NaHCO3, 1% SDS) at room temperature for 15 min. The formaldehyde cross-link was reversed by overnight incubation at 65 °C in the presence of 190 mM NaCl. Immunoprecipitated DNA was released by treatment with 100 µg of proteinase K for 1 h at 45 °C in the presence of 10 mM EDTA and 40 mM Tris-HCl, pH 6.5. The DNA was purified by phenol extraction and ethanol-precipitated. The rabbit polyclonal antipeptide antibody against a C-terminal epitope of the Xenopus Staf protein (5) was used for ChIP.
PCR AnalysisPurified DNA was analyzed in 25-µl PCRs with primer pairs flanking the predicted SBS and in the presence of 3 µCi of [
-32P]dCTP (3000 Ci/mmol). PCR products were 140400 bp in length. Half of the samples were resolved by native PAGE and revealed by autoradiography. Typically, 1/500 and 1/2000 of the immunoprecipitated DNA were used for PCR analysis. Decreasing amounts of input DNA (1/10,000, 1/25,000, and 1/100,000) were used to determine the linear range of the PCR for each primer pair. Cycling parameters were 95 °C for 3 min, 35 cycles at 95 °C for 30 s, 5272 °C (depending on each primer pair) for 30 s, 72 °C for 30 s, and 72 °C for 5 min. For the negative controls of ChIP assay, we used the PP1 to PP4 primer pairs hybridizing to unique regions lying 2.4, 2.1, 6.5, and 2.5 kbp upstream of the tRNASec, U4 ATAC, glyceraldehyde-3-phosphate dehydrogenase (GAPDH), and Budding Uninhibited by Benzimidazoles 1 (BUB1B) genes, respectively. The primer sequences used in this study are available on request.
| RESULTS |
|---|
|
|
|---|
Among the DBTSS entries, a high number of promoters scored positive with one or more potential Staf-binding sites within the predetermined boundaries, identifying a total of 1488 potential binding sites with 648 motifs (43.5%) conserved in the orthologous mouse promoters (Table 1 and supplemental Table S1). Table 1 indicates that the score distribution is not uniform, the highest ones being obtained for the A1 and F1 sequences with 185 and 97 human/mouse conserved hits, the lowest score arising from the H1 sequence with 13 human/mouse conserved hits. Generally, a good correlation was observed at one given position between score distribution and the nucleotide preference obtained from binding site selection experiments (2, 4). For example, binding site selection indicated that a C occupied position 3 in 95% of the sequences, whereas a T occurred in only 3% of the cases (2). In the DBTSS screen, the same distribution was observed. The query sequence carrying a C at position 3 (A1) produced 185 conserved hits between human and mouse, a single C to T change (H1) leading to only 13 hits (Table 1). Interestingly, sequence alignments of putative SBS show that residues preferentially occupying positions 711, 1316, and 18 (Fig. 1A, right panel), corresponding to N, R, or Y in the queried sequences in Table 1, are identical to those derived from the binding site selection experiments (2). This strongly suggests that the identified sequences represent bona fide SBS.
Inspection of the adjacent sequences to the potential SBS revealed the presence of the 7-bp ACTACAN motif, or a 12-bp variant, among 305 (47%) of the 648 identified SBS (Table 1; Fig. 1A, left panel; supplemental Table S1). It lies immediately 5' to the SBS. Therefore, to identify novel SBS that would harbor the ACTACAN motif, we performed a second round of motif search at DBTSS with the A2 to F2, H2, and J2 motifs as queries. These 19-bp motifs are arranged in the following way (see Table 2). The 5' part is now composed of the 7-bp submotif ACTACAN, the 3' part corresponding to bp 112 of the A1 to F1, H1, and J1 sequences. This second round generated 459 hits with 183 occurrences conserved in the orthologous mouse promoters but not identified by the A1 to K1 queries. It is striking to observe that the consensus sequence derived for the 6-bp sequence directly adjacent to the 3' part of the newly identified motifs, although not included in the queries, bears high sequence similarity to sequence CC(C/T)GCG of the A1 to K1 motifs (compare positions 1318 in Fig. 1, A and B; note that the heights of the letters are not directly proportional to occurrences). Again, this strongly argues for the characterization of genuine binding sites. Next, to characterize promoters containing at least two potential SBS, we performed a third search at DBTSS (LA1LK1, Table 3). This round was implemented with two simultaneous queries: (i) one with the previous A1-K1 motifs (sequence motif 1 in Table 3); (ii) the second one with only the newly identified ACTACAN motif (sequence motif 2 in Table 3). The bioinformatic search was followed by visual inspection of the occurrences to detect the presence of potential SBS linked to the ACTACAN motif. In this way, we could identify 21 additional motifs that are conserved in orthologous mouse promoters (Table 3, Fig. 1C, and supplemental Table S1). Finally, the sequences of the identified promoters were visually examined for unidentified sequence elements that might serve as Staf-binding sites. This inspection enabled detection of 41 additional sites conserved in orthologous mouse promoters (named X in Fig. 1D and supplemental Table S1). To extend the phylogenetic comparison, a comparative analysis was performed in the orthologous promoters of the rat and dog genomes. Among the 893 identified motifs that we found conserved in the human and mouse genomes, 772 orthologous regions were detected in the rat and canine genomes. Interestingly, sequence comparisons revealed that 753 (97%) are also conserved in these two genomes. In 18 cases, the SBS is conserved in one of the two genomes, and in one single case only (NM_003085 [GenBank] ) they both lack it (supplemental Table S1).
|
|
|
Identification of 282 Other Putative hStaf/ZNF143-binding SitesA recent report described the systematic discovery of regulatory motifs in human promoters by comparing several mammalian genomes (20). Among the 174 conserved motifs that the authors discovered in the promoters of protein coding genes, the 13-bp ACTAYRNNNCCCR M4 sequence is particularly interesting because ACTAYRN shows a strong match to the SBS-adjacent ACTACAN submotif, and NNCCCR is highly similar to the 5' part of the SBS. We thus sought to use the M4 motif to uncover new SBS by searching for the presence of SBS overlapping the M4 motif. This led us to discover 282 new potential SBS (see logo sequence derived from sequence comparisons, Fig. 1E). Among these, 23 are located in promoters previously identified by our search at DBTSS, and the other 259 SBS are located in 222 new promoters containing this site (Fig. 2A). The 282 sites are conserved in the rat and dog genomes, with 273 (96.8%) occurring also in the mouse genome (supplemental Table S1). Collectively, our in silico analysis resulted in the identification of 1175 conserved putative SBS harbored in 938 promoters of mammalian protein coding genes (Fig. 2A). Considering the results of this phylogenetic footprinting, it is likely that a high proportion of the identified SBS represent target sites for hStaf/ZNF143.
Characteristic Features of the Identified SBS and SBS-containing PromotersTwo features characterize a transcription factor-binding site as follows: first, its position, and second, its orientation relative to the gene. Fig. 2B (see also supplemental Table S1) shows the number of human SBS, with assigned positions, plotted versus their position relative to the TSS. We found that 673 (62%) of the 1077 SBS with assigned positions occur within 200 bp upstream of the TSS. Similar values were obtained with the SBS identified in mouse promoters, consistent with the involvement of this motif in initiation of transcription. We observed that 32% of the human/mouse orthologous SBS are positioned at the same distance from the TSS but that a difference of up to 50 bp occurs in 47% of the cases. It is very likely that this variation is related to the lack of accuracy in determining the TSS of numerous genes.
Another feature of the SBS motif that we detected is its property to appear as multiple copies. Overall, 188 (20%) of the 939 mammalian promoters we have identified contained multiple SBS lying within 500 bp (Fig. 2C). The majority (150) contain two SBS, with three SBS being found in 31 promoters; four genes contain four SBS (CDK5 (cyclin-dependent kinase 5), ENY2 (enhancer of yellow 2 homolog), NUDCD1 (NudC domain containing 1), and PRCC (papillary renal cell carcinoma)); two genes have five SBS (LIG1 (DNA ligase 1) and BANP (BTG3 associated nuclear protein)); one single gene represents a unique situation with six SBS (NCLN (nicalin homolog)). Fig. 3 displays the multiple sequence alignment of the nicalin homolog promoter region across the four mammalian genomes. The very high degree of sequence conservation of the six potential SBS appears clearly and stands out from the nonconserved flanking sequences.
Two interesting features regarding the physical arrangement of the SBS came from our study. First, of the 1175 identified SBS, a bias occurs toward a direct orientation with respect to the gene as follows: 702 (60%) are in direct orientation; among promoters possessing two SBS, 41% are in direct orientation, and 13% are in reverse orientation, and the remainder have a combination of both. Second, we identified SBS in potential bidirectional promoters, with 146 promoters (15%) arranged in a head-to-head configuration within 1 kbp (supplemental Table S1).
624 (87%) of the 716 human SBS-containing promoters that we identified at DBTSS overlap CpG islands, sequences known to be associated with active promoters (23, 24) (supplemental Table S1). In general, the promoters we found were GC-rich, and we could not recognize any significant TATA box sequence. TATA box consensus sequences could be observed in only 19 (5%) of the 355 examined promoters, suggesting that they are not a characteristic feature of SBS-containing gene promoters. Summarized in supplemental Table S1 are various features of the identified SBS and promoters, including identification number of the human promoters, gene name and symbol, position of the SBS in human and mouse promoters, coordinates of SBS on human chromosomes, motif orientation, association with CpG islands, sequence of the SBS motif with the associated 5' submotif, alignment of the mouse and human motifs, and motif conservation in rat and dog genomes.
Occupancy of Endogenous Genes by hStaf/ZNF143To determine whether hStaf/ZNF143 actually binds to the conserved motifs, we tested by ChIP assays 430 of the sites identified in silico and contained in 295 different promoters (listed in supplemental Table 1; Fig. 4A). The 295 promoters represent a statistically valid sample because they include 107 (56%) promoters with multiple sites (multiple site promoter (MSP)) and 188 (25%) with unique sites (unique site promoter (USP)). The tested MSP corresponded essentially to genes with an attributed function. The tested USP were extracted with random choice from the results of the different queries at DBTSS. To obtain statistically valid data, the sample of tested promoters containing SBS with and without the ACTACAN motif was directly proportional to the outcome of the various interrogation results. ChIP analysis was performed in HeLa cells with antibodies directed against hStaf/ZNF143. The recovered DNA was analyzed by semi-quantitative PCR with primer pairs spanning promoters with either single or multiple SBS. Each DNA sequence was amplified in parallel reactions with each of the three DNA templates purified from the following: 1) anti-hStaf/ZNF143 ChIP; 2) control ChIP; and 3) input chromatin. For each PCR, we tested two dilutions of DNA immunoprecipitated with anti-hStaf/ZNF143 (Fig. 4, BG, lanes 1 and 2; supplemental Fig. S1) or no antibody (Fig. 4, BG, lanes 3 and 4; supplemental Fig. S1). Also, serial dilutions of input material were analyzed to demonstrate that the PCR was quantitative within a linear range of amplification (Fig. 4, BG, lanes 57; supplemental Fig. S1). The positive controls for the ChIP assay included genes previously reported to respond to hStaf/ZNF143, the tRNASec, synaptobrevin-like1, aldehyde reductase, and t-complex polypeptide 1 (1, 11, 15, 17). As expected (Fig. 4B), a specific signal of higher intensity than in the no antibody control was obtained with the DNA immunoprecipitated with anti-hStaf/ZNF143 (compare lanes 1 and 2 and 3 and 4 in Fig. 4B). In contrast, no specific signal could be obtained with the primer pairs PP1 to PP4 amplifying DNA sequences lying several kbp upstream of the tRNASec, U4 ATAC, GAPDH, and BUB1B genes because these remote regions were not expected to interact with hStaf/ZNF143 (compare lanes 1 and 2 and 3 and 4 in Fig. 4C). Among the 295 PCR amplifications, 29 (9.8%) were close to the background (supplemental Table S1 and Fig. S1). This is illustrated in Fig. 4, E and G, where the assay was performed with promoters containing two (SKB1 and MRPS25) and one (ZNF345 and ZNF451) putative SBS. Of the remaining promoters that were tested, however, 100/107 MSP (93.4%) and 166/188 USP (88%) provided clear positive signals (supplemental Table S1 and Fig. S1). Typical results are shown for 10 promoters containing two to four SBS (VAPA, WASF1, TMEM59, CDK5, SERPINI1, BRD8, DNML1, HCFC1R1, NMT1, and MRPL24; Fig. 4D) and 10 promoters containing one SBS (MTR, KIF11, SCAMP3, MDH1, SMUG1, PUM1, FLJ1311, TAF6, LACE1, and MGC29463; Fig. 4F). We observed an identical percentage of positive results regardless of the sequence used for identifying the motifs. Furthermore, no bias was introduced whether promoters with or without the ACTACAN motif were employed. By inspecting the SBS sequences that overlap the M4 motif of Xie et al. (20) (Xie in supplemental Table S1), about 15% had the highly conserved C12 substituted by A, G, or T (compare sequence logos in Fig. 1, A and E). ChIP experiments with five of them (A12, GenBankTM accession numbers NM_001320 [GenBank] and NM_006185 [GenBank] ; G12, GenBankTM accession numbers NM_004748 [GenBank] and NM_012257 [GenBank] ; T12, GenBankTM accession number NM_005111; supplemental Fig. S1) concluded that sequence variation at position 12 did not influence the binding of hStaf/ZNF143.
|
|
To further evaluate the role of the ACTACAN motif, the effects of mutations therein were evaluated on the activity of a luciferase reporter gene directed only by the SBS and ACTACAN motifs. We constructed three different reporters containing, directly fused to the luciferase coding sequence, two identical copies of the following: 1) the wild-type ACTACAN motif and SBS (WT ACTACAN and WT SBS); 2) the mutated ACTACAN and WT SBS (mut ACTACAN and WT SBS); 3) the mutated ACTACAN motif and mutated SBS (mut ACTACAN and mut SBS) (Fig. 5A). In these constructs, the SBS resides at 189 and 303 bp upstream of the luciferase translation initiation codon. The construct containing both mutated motifs (mut ACTACAN and mut SBS) was unable to drive transcription above the basal level of the empty vector (Fig. 5C). In contrast, the presence of the WT ACATCAN associated with the WT SBS induced a 49-fold induction of the luciferase activity (Fig. 5C). Moreover, mutation of the ACTACAN motif alone reduced the transcriptional activity to about 50% of the WT level (Fig. 5C). Collectively, these results demonstrated the functional importance of the ACTACAN motif associated with the SBS.
|
| DISCUSSION |
|---|
|
|
|---|
Previously, hStaf/ZNF143-binding sites were identified by studying individual genes. At the onset of this work, only seven promoters of protein coding genes were recognized as hStaf/ZNF143 targets (1117). Among the 938 promoters identified in this study, 716 were obtained by queries at DBTSS (version 4), which contains only 8793 human promoters. Extrapolation to the 25,000 human genes (26) revealed a minimum estimate of 2500 SBS in 2000 promoters of protein coding genes. This value is very similar to the number of Sp1-binding sites in protein coding genes (27). Furthermore, the number of identified targets is most likely underestimated because our screen was based solely on the SBS consensus and its variants, whereas other variant sites can also be bound by the protein. For example, we could not identify the IRF3 and AKR1A1 promoters present in DBTSS and known as hStaf/ZNF143-dependent (12, 15) because the SBS sequences in these promoters do not fit the queried sequences used in the various screens.
About 58% of the identified SBS were found associated with the functional ACTACAN submotif, although our earlier work never detected it in the SBS of snRNA and snRNA-type genes (2, 28). It may well be that this submotif serves as a binding site for an unknown transcription factor. It might also induce a particular structure to the DNA that will optimize the binding of hStaf/ZNF143 to its cognate sequence. The ACTACAN sequence is particularly interesting in the light of a recent work describing the systematic discovery of motifs in human promoters by comparing several mammalian genomes (20). This promoter analysis yielded candidate motifs, including most of the previously known transcription factor-binding sites but also novel motifs. The top six of the discovered motifs and associated factors are M1, NRF-1; M2, MYC; M3, ELK-1; M4, unknown factor; M5, NFY; M6, Sp1 with a motif conservation score (MCS) of 107.8, 85.3, 80.4, 69.5, 64.6, and 63.9, respectively. Motifs with high MCS are both highly conserved and frequently occurring in the promoters of protein genes. The MCS of the M4 motif ACTAYRNNNCCCR has conservation and occurrence frequency higher than that of Sp1 and NFY transcription factors. Our work established clearly that ACTAYRN indeed corresponds to the ACTACAN sequence we identified and that NNCCCR represents SBS residues 16 (Fig. 1A, right panel). Xie et al. (20) did not report residues 718 of the SBS as a conserved motif. This is very likely because of the lower sequence conservation in the 3' part of the SBS. Considering that 42% of the identified SBS lacked the ACTACAN submotif, the global MCS for the hStaf/ZNF143-binding site (with and without ACTACAN) is higher than the MCS of M4. This strongly suggests that the hStaf/ZNF143-binding site represents one of the most frequently occurring transcription factor-binding sites in the human genome.
|
|
We examined the results of Kim et al. (29) who performed genome-wide location analysis experiments to identify gene promoters associated with TAF1, the largest subunit of TFIID. They concluded that a total of 9328 nonredundant gene promoters are occupied by TAF1. We found that 346 hStaf/ZNF143-bound promoters also harbor TAF1-binding sites. Comparison of the positions of SBS- and TAF1-binding sites in one given promoter established that 94.5% SBS are found upstream of TAF1 locations. Analysis of the distribution of the SBS and TAF1 sites revealed the interesting feature that SBS are essentially upstream of the transcription start site, whereas TAF sites are mostly found downstream of it (Fig. 2D). Moreover, the number of common promoter targets between TFIID and hStaf/ZNF143 far exceeds a random overlap between two similar groups, suggesting that hStaf/ZNF143 functions with TFIID in regulating gene expression. In this respect, we showed that SBS are sufficient to up-regulate a reporter gene on their own, and it is tempting to speculate that hStaf/ZNF143 can directly recruit TFIID.
SBS and Bidirectional PromotersIt is well established that bidirectional promoters are relatively common in mammalian genomes, representing more than 10% of the genes whose TSS are separated by less than 1000 bp (30, 31). In 15% of the identified promoters (146 promoters distributed in 73 pairs), a common SBS is present in two promoters in a head-to-head orientation. Further studies are needed to elucidate the direct involvement of hStaf/ZNF143 in this particular gene organization.
Functional Classification of the hStaf/ZNF143 Target GenesAmong the identified promoters, 30% were associated with genes with unknown function. To determine whether hStaf/ZNF143 is specifically associated with a gene category with a particular function, we categorized the genes according to their functional annotation. As shown in Fig. 6, the identified genes are distributed along all the functional categories. The prominent classes are as follows: (i) DNA-binding proteins and transcription factors that represent 23% of the total; (ii) protein synthesis degradation turnover and modification (21% of the total). We note that many of the identified genes are indeed important for cell growth.
In summary, our ChIP-coupled comparative genomic analysis has delivered a first version of the human catalog of hStaf/ZNF143 regulatory motifs in gene promoters. This work provides a convincing demonstration that performing combined in silico and experimental studies is absolutely instrumental in analyzing gene promoters.
| FOOTNOTES |
|---|
The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EBI Data Bank with accession number(s) NM_003085
[GenBank]
, NM_001320
[GenBank]
, NM_006185
[GenBank]
, NM_004748
[GenBank]
, NM_012257
[GenBank]
, and NM_005111. ![]()
The on-line version of this article (available at http://www.jbc.org) contains supplemental Table S1 and Fig. S1. ![]()
1 To whom correspondence should be addressed. Tel.: 33-3-88-41-70-64; Fax: 33-3-88-60-22-18; E-mail: P.Carbon{at}ibmc.u-strasbg.fr.
2 The abbreviations used are: snRNA, small nuclear RNA; h, human; ChIP, chromatin immunoprecipitation; WT, wild type; GAPDH, glyceraldehyde-3-phosphate dehydrogenase; SBS, Staf-binding site; TSS, transcription start sites; USP, unique site promoter; MSP, multiple site promoter; MCS, motif conservation score; DBTSS, Data Base of Human Transcription Start Sites. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C.-C. Yuan, X. Zhao, L. Florens, S. K. Swanson, M. P. Washburn, and N. Hernandez CHD8 Associates with Human Staf and Contributes to Efficient U6 RNA Polymerase III Transcription Mol. Cell. Biol., December 15, 2007; 27(24): 8729 - 8738. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Myslinski, M.-A. Gerard, A. Krol, and P. Carbon Transcription of the human cell cycle regulated BUB1B gene requires hStaf/ZNF143 Nucleic Acids Res., May 11, 2007; 35(10): 3453 - 3464. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||