|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 279, Issue 40, 41563-41572, October 1, 2004
Imperfect CAG Repeats Form Diverse Structures in SCA1 Transcripts*![]() From the Laboratory of Cancer Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
Received for publication, May 10, 2004 , and in revised form, July 15, 2004.
The expanded CAG repeat in the coding sequence of the spinocerebellar ataxia type 1 (SCA1) gene is responsible for SCA1, one of the hereditary human neurodegenerative diseases. In the normal SCA1 alleles usually 13 CAT triplets break the continuity of the CAG repeat tracts. Here we show what is the structural role of the CAU interruptions in the SCA1 transcripts. Depending on their number and localization within the repeat tract the interruptions either enlarge the terminal loop of the hairpin formed by the repeats, nucleate the internal loops in its stem structure, or force the repeats to fold into two smaller hairpins. Thus, the interruptions destabilize the CAG repeat hairpin, which is likely to decrease its ability to participate in the putative RNA pathogenesis mechanism driven by the long CAG repeat hairpins.
The spinocerebellar ataxia type 1 (SCA1)1 belongs to a larger group of human hereditary neurological diseases that are caused by triplet repeat expansions in single affected genes (13). The best known examples of these diseases are: myotonic dystrophy (DM) caused by the expansion of CTG repeat in 3'-UTR of the DMPK gene; fragile X-associated tremor-ataxia syndrome (FXTAS), and mental retardation syndrome (FXS) resulting from, respectively, smaller or larger expansions of the CGG repeat in 5'-UTR of the FMR1 gene; as well as Huntington disease (HD) in which the CAG repeat expansion in the protein coding region of the IT-15 gene is implicated. One of the hallmarks of these diseases is genetic anticipation i.e. an earlier age of the onset and increased severity of clinical symptoms in subsequent generations, correlated with the increasing length of expanded alleles. The SCA1 usually appears in the fourth decade of life and death occurs between 10 and 15 years after the onset of symptoms. Typical clinical features of the SCA1 include ataxia, dysarthria and bulbar dysfunction. The neuropathology of the SCA1 is characterized by the progressive loss of neurons in the cerebellum, brain stem, and spinocerebellar tract.
The SCA1 gene that encodes nuclear protein ataxin-1 of a yet unknown function, harbors the expandable CAG repeat in its coding region (46). The repeat is polymorphic in length and usually contains CAT interruptions in a normal population. The SCA1 mutant alleles contain from 39 to about 80 uninterrupted CAG repeats (16) encoding the expanded polyglutamine (poly-Q) tracts in the aberrant ataxin-1. The SCA1 pathology correlates with the appearance of nuclear aggregates containing the mutated protein (7) but whether the aggregate formation is a symptom or cause of the disease is still a matter of debate (810). The aberrant ataxin-1 either in its soluble form or in the form of aggregates was shown to be toxic to neurons supporting the notion that the poly-Q tract itself may play a critical role in SCA1 pathogenesis (712). It may interfere with neuronal transcription by interacting with specific transcription factors in a poly-Q-dependent manner and induce apoptotic cell death (13). On the other hand, recent studies revealed the importance of the ataxin-1-specific sequence context for the protein function and showed that even the high level expression of an expanded poly-Q tract protein in the nucleus may be insufficient to cause disease (14). In this light the RNA-mediated mechanisms of pathogenesis, similar to those proposed for DM (15) and FXTAS (1618) were also considered for polyglutamine diseases (1721). According to this mechanism the expanded repeats in transcripts form long stable hairpins, which recruit double-stranded RNA-binding proteins in a repeat length-dependent manner, sequestering them from other transcripts and interfering with their normal functions (22). The length of CAG repeat in the SCA1 gene ranges from 6 to 43 repeats and the number of CAT interruptions varies from 0 to 3 in a normal population (16, 23). Considering both repeat length and its interruption status about 40 different SCA1 alleles are known. The alleles containing two CAT interruptions strongly predominate in all analyzed populations (7895%). They range in length from 23 to 43 repeats and a single CAG triplet always separates CAT interruptions. The alleles harboring one or three interruptions contribute to 011% and 04% of all alleles, and normal SCA1 variants containing uninterrupted repeat tracts of a maximum length of 21 repeats are also rare and contribute to 07% in different populations (23). In contrast to normal alleles, the great majority of pathologically expanded SCA1 variants do not contain repeat interruptions. The postulated role of interruptions occurring within tandem repeats in genes is to prevent repeat expansion or contraction during DNA replication and repair. Their loss in the CAG repeat of the SCA1 gene generates much longer pure repeat tracts that are genetically less stable and more prone to expansion (2327). There are several possible mechanisms by which the interruptions may enhance the stability of the repeat tracts (2527). They may inhibit slippage between nascent and template strands by keeping them in register, or destabilize the hairpin structures aberrantly formed by slipped DNA strands. The SCA1 alleles containing either the interrupted normal repeats or the expanded mutant repeats are transcribed, processed, transported to cytoplasm, and translated. The question of the possible differences in the efficiency of these processes between various normal and between the normal and mutant SCA1 transcripts has not been addressed so far. The physiological cellular function of the CAG repeats in transcripts, and the role of repeat interruptions is also completely unknown. To make the first step toward finding answers to these questions we describe here structures formed by the repeat regions in SCA1 transcripts representing different types of naturally occurring SCA1 alleles.
Preparation of DNA Templates and in Vitro TranscriptionDNA templates for in vitro transcription were synthesized by PCR from gel-purified SCA1 amplification products containing different variants of the CAG repeat tracts as described earlier (23), using the forward primer (SCA1-FT7) containing a T7 RNA polymerase promoter 5'-TAATACGACTCACTATAGGGCAGTCTGAGCCA and reverse primer (SCA1-RT7) 5'-TCTGCTGGGCTGGTGG. The PCR was performed in a 50-µl reaction containing: a DNA template, 1 µM each primer SCA1-FT7 and SCA1-RT7, 10 mM Tris-HCl, pH 9.0, 1.5 mM MgCl2, 50 m M KCl, 0.1% Triton X-100, 200 µM of each of the dNTPs, and 1.5 units of TaqDNA polymerase (Promega) in the following conditions: 94 °C, 1 min; 35 cycles: 94 °C, 1 s, 62 °C, 1 s, 72 °C, 20 s.
A transcription reaction carried out in a 50-µl volume contained about 10 pmol of DNA template (1020 µl of the PCR product described above), 400 units of T7 RNA polymerase (Epicenter), 3 mM guanosine, 0.5 mM each of ribo-NTPs, 10 mM dithiothreitol, 40 mM Tris-HCl (pH 7.9), 6 mM MgCl2, 2 mM spermidine, 10 mM NaCl, and 50 units of RNazin. Incubation was at 37 °C for 1 h. The RNA products were purified in denaturing 6% polyacrylamide gels, and 5'-end-labeled with 10 units of T4 polynucleotide kinase (Epicenter) and 50 µCi of [
Preparation of Human Fibroblast Whole Cell ExtractThe whole cell extract was prepared from 6 x 107 human fibroblast cells growing in log phase according to modified standard procedure (28). In brief, the cells were pelleted and washed twice in phosphate-buffer saline and resuspended in a double volume of hypotonic swelling buffer (7 mM Tris-HCl, pH 7.5, 7 mM KCl, 1 mM MgCl2, 1 mM Nuclease Digestions and Lead Cleavages in Water Solution and in Cellular ExtractPrior to structure probing reactions, the 32P-labeled RNAs were subjected to a denaturation and renaturation procedure in a solution containing 20 mM Tris-HCl (pH 7.2), 80 mM NaCl, 2 mM MgCl2, by heating the sample at 80 °C for 1 min and then slowly cooling to 37 °C. Limited RNA digestion was initiated by mixing 5 µl of the RNA sample (50,000 cpm) with 5 µl of a probe solution containing either lead ions or nucleases S1, Mung Bean or ribonucleases T1, T2 (final probe concentrations are specified in the figure legends). The structure probing experiments were performed either in 1 or 10 mM final concentration of MgCl2 (other reaction conditions were the same) and no significant differences in cleavage patterns were observed. RNase H digestion was performed by adding to the transcript solution, after denaturation and renaturation procedure (as above), one of the following three types of oligodeoxynucleotides: libraries of random 6-mers (100 pmol) or random 7-mers (100 pmol) or complementary to CAG repeats oligonucleotide with sequence 5'-TGCTGCT (5 pmol), and RNase H to the final concentration of 0.10 or 0.25 units/µl. All reactions were performed at 37 °C for 20 min and stopped by adding an equal volume of stop solution (7.5 M urea and 20 mM EDTA with dyes) and sample freezing. The reactions in cell extracts were performed as follows. Before the nuclease treatment 1 µl of RNA sample in water solution (100,000 cpm) was preincubated at 37 °C for 5 min with 3 µl of cellular extract (protein concentration 2 mg/ml and the exogenously added Mg2+ and K+ ions were at about 0.6 mM and 15 mM concentration, respectively). Limited enzymatic digestion was initiated by adding to the RNA sample 2 µl of probe solution with different enzyme concentration: RNase T1 (0.05 and 0.1 units/µl), RNase T2 (0.02 and 0.05 units/µl), nuclease S1 (2.5 and 5.0 units/µl). All reactions were conducted at 37 °C for 5 min and stopped by phenol/chloroform extraction. After precipitation the RNA samples were diluted with the stop solution described above and loaded on the gel. Analysis of the Reaction ProductsTo determine the cleavage sites, the products of lead-induced hydrolysis and nuclease digestion were separated in 815% polyacrylamide gels containing 7.5 M urea, 90 mM Tris borate buffer, and 2 mM EDTA, along with the products of alkaline hydrolysis and limited T1 ribonuclease digestion of the same RNA molecule. The alkaline hydrolysis ladder was generated by the incubation of the labeled RNA in formamide containing 0.5 mM MgCl2 at 100 °C for 10 min. The partial T1 ribonuclease digestion of the RNAs was performed under semi-denaturing conditions (10 mM sodium citrate, pH 5.0; 3.5 M urea) with 0.2 units/µl of the enzyme during incubation at 55 °C for 10 min. The cleavage sites characteristic for nucleases S1 and H were determined by comparison with the homologous S1 ladder (not shown). The RNA fragments generated by nucleases S1 and H terminate with 3'-hydroxyls and migrate slower than the corresponding formamide and T1 fragments. Electrophoresis was performed at 1500 V and was followed by autoradiography at 80 °C. Electrophoresis in Nondenaturing ConditionsThe RNA structure homogeneity was analyzed for each transcript by the electrophoresis of radiolabeled samples in 10% nondenaturing polyacrylamide gel 400/300/0.4 mm (acrylamide/bisacrylamide, 29/1) buffered with 45 mM Tris borate, at a fixed temperature of 37 °C. Prior to gel electrophoresis, the 32P-labeled transcripts were subjected to a denaturation and renaturation procedure in a solution containing 10 mM Tris-HCl (pH 7.2), 40 mM NaCl, 10 mM MgCl2, by heating the sample at 80 °C for 1 min. and slowly cooling it to 37 °C, and mixed with an equal volume of 7% sucrose with dyes. Electrophoresis was conducted at a constant power of 15 watts and was followed by phosphorimager analysis. RNA Secondary Structure ModelingRNA secondary structure prediction was performed using the mfold program version 3.1 (29). This program is designed to determine the optimal and suboptimal secondary structures of RNA calculated for 1 M NaCl solution at 37 °C, and to count free energy contributions for various secondary structure motifs.
Fifteen distinct SCA1 transcripts analyzed in this study differed in the number of CAG repeats from 27 to 54, while having the same specific flanking sequences 37 nt at the 5'-side and 59 nt at 3'-side of the repeat. Besides their length, the repeat regions varied also in the number and locations of the repeat interruptions (Table I). Accordingly, the transcripts could be divided into four groups. The first and the largest group (10 transcripts) included all those containing two CAU interruptions described by the formula (CAG)nCAUCAGCAU(CAG)m, in short n-1-1-1-m (interruptions are underlined). Within this group there were transcripts containing 2736 repeats and either symmetrical or asymmetrical distribution of the interruptions. In the first subgroup (2 transcripts), the interruption region was flanked by 12 or 15 pure CAG repeats. In the larger second subgroup with the asymmetrical CAU distribution the longer tract of pure repeats was located either at the 5'-side n>m, or at 3'-side m>n of the interruption region. It is noteworthy, that within this group is a transcript from SCA1 allele from healthy individual, which contains as many as 43 repeats in the configuration 23-1-1-1-17. The genetic stability of such long alleles is secured by the presence of repeat interruptions (23, 30). In the second group there are two transcripts containing a single asymmetrically located CAU interruption in the tract of either 28 or 29 CAG repeats, and a transcript containing three interruptions in the 11-1-1-1-1-1-15 configuration is the only representative of the third group. Finally, the last group contains two transcripts harboring pure pathologically expanded repeat tracts of the 53 and 54 CAG repeats derived from SCA1 patients. Analyzed transcripts (Table I) represented all possible types of the interruption patterns described so far in world populations (23). In their selection the number and localization of CAU triplets as well as the total length of a triplet repeat region were taken into account.
All investigated transcripts were end-labeled and their solution structure was analyzed with a set of well-characterized chemical and enzymatic probes. Among the single-strand specific nucleases used were the nucleases S1 and Mung Bean showing little sequence specificity, RNase T2 cutting all internucleotide bonds but showing some preference for those after A-residues and G-specific RNase T1 (31, 32). The metal ion probes were represented by lead ions that cleave single-stranded RNA sections and relaxed parts of duplex structures (3335). Also, the short oligodeoxynucleotide 5'-TGCTGCT (ctg-ODN), as well as the ODN libraries (6-mer and 7-mer) with a randomized sequence were used together with Escherichia coli RNase H, which executed RNA cleavages at the sites of RNA/DNA hybrids. Earlier studies have shown that RNA is cut at the 3'-side of the RNA/DNA hybrid and 5'-phosphate and 3'-OH groups are left at the cleavage site (3638). Pure Expanded CAG Repeats Form a Stable Hairpin Structure, Which Is Further Stabilized by Specific Flanking Sequences in the SCA1 TranscriptThe normal SCA1 alleles containing pure uninterrupted CAG repeats are very rare and they were found in a few populations only. On the other hand, all mutant alleles contain pure repeat tracts. The structure probing results obtained for transcripts from two expanded SCA1 alleles are shown in Fig. 1. The most intense cuts generated by nucleases occur between the nucleotides A26 and A28 (i.e. between the adenyl residues of the CAG repeats 26 and 28) in the pure(CAG)53, and between A26 and A29 in pure(CAG)54. The rest of the repeat region is mildly cleaved by lead ions, which show some preference for cuts at the CpA bonds over the ApG, and cleavages at the internucleotide bonds C1pA1 and A1pG1 within the first CAG repeat are enhanced. Also, the enhanced RNase T2 and RNase H cuts can be detected in that region. What is noteworthy, is that none of the probes used cleaved short 67-nt sequences directly flanking the repeat region at both its sides. These results imply that the entire repeat region forms an autonomous structure, which includes the repeats themselves (hairpin H4) and the cleavage-resistant duplex formed by the six base pairs of its flanking sequences (H3) (Fig. 1B). This kind of structure explains why the different number of the reactive internucleotide bonds is observed in the terminal loop of the pure(CAG)53 and pure(CAG)54. In the former, the 4-nucleotide terminal loop occurs, whereas the 7-nucleotide loop region is present in the latter (Fig. 1B). These results are in accord with the observations made for shorter model transcripts (CNG)16 and (CNG)17 containing the artificial G-C clamp at the base of their hairpin stem (20). For comparison the model transcripts having not-clamped repeats were shown to have freedom of alignment and formed several alternative "slipped" structures each containing 4-nucleotide terminal loop (20). The lack of enhanced cuts at the 3'-terminal CAG repeats in pure(CAG)53 and pure(CAG)54 may serve as further argument for the existence of a natural clamp formed by a repeat flanking sequence in the SCA1 transcripts. This H3 duplex adds extra stability to the CAG repeat hairpin. Importantly, the same cleavage resistance of flanking sequences, which is shown for the two mutant transcripts is also observed in all other transcripts analyzed in this study. This means that the same type of structure (H3) is formed by flanking sequences in all these transcripts (Fig. 2). At the 3'-side of the repeat clamp another stable hairpin structure occurs (H5). It is composed of 11 uninterrupted base pairs and 7-nucleotide terminal loop, which is well mapped by the nucleases but not cleaved by RNase H in the presence of the 6-mer and 7-mer ODN libraries. The close proximity of the H3 and H5 helices, together with cleavage resistance of the region, which separates them, may suggest their involvement in the coaxial stacking, which could additionally stabilize the structure formed by the CAG repeat. According to the SCA1 mRNA structure modeling by the mfold algorithm, the helices H3 and H5 are also predicted to exist in the much longer mRNA fragments. Our experimental data show that the predicted single stranded regions that occur upstream of the H3 and downstream of the H5 are both efficiently cleaved by all the nucleases used and by the RNaseH/ODN library. The analysis of the transcripts containing much longer flanking regions confirmed the existence of the helices H3, H4, and H5 (data not shown).
CAG Repeats Containing Either One or Two CAU Interruptions Usually Form a Single Hairpin StructureMost of the normal SCA1 alleles contain two interruptions in their repeat tracts. The question we asked first was: what is the structural role of two CAU sequences located symmetrically within the CAG repeats in two SCA1 transcripts 12-1-1-1-12 and 15-1-1-1-15? The structure probing results (Fig. 2) show that the highest reactivity of all the nucleases used and RNase H/ODN is observed in the central region of the repeats in which also the interruptions are located. This reactive region is between the C13 and A16 in the 12-1-1-1-12 transcript (Fig. 2, A and B). In both transcripts the terminal loops are composed of 10 nucleotides and include both CAU triplets. The hairpin stem structure of the 12-1-1-1-12 transcript is stable as revealed by its temperature-induced melting followed by structure probing with the S1 nuclease (Fig. 2C). Only at the temperature of 70 °C and higher the structure formed by the helices H4 and H3 was melted and all internucleotide bonds of that region showed similar reactivity.
The structures formed by the repeats containing asymmetrically located interruptions are somewhat different (Fig. 3). In each case the most reactive regions are those in which the interruptions are located and also their closest neighborhood. In some transcripts with a small degree of asymmetry, in which the length difference between pure repeat tracts was one or two repeats (
Transcripts containing a single asymmetrically located interruption form hairpins with two loop regions. The internal loop a harbors the interruption and terminal loop b is the 4-nucleotide long in the 11-1-16 transcript and 7-nucleotide long in 12-1-16. Both types of loops are well recognized by all the structure probes used. It is also noteworthy that in the case of all of the above described transcripts the specific oligonucleotide ctg-ODN binds to the interrupted repeats with lower efficiency than to the pure repeats located in the single stranded regions. Notably, they also bind weakly to the repeats in the stem region but only at the 5'-side of the stem (Fig. 3B).
Long Repeat Tract with Asymmetric Interruptions Forms Two Stable ConformersUnexpectedly, more complex cleavage patterns different from those generated in other transcripts were observed in the SCA1 transcript containing 43 repeats in the 23-1-1-1-17 configuration (Fig. 4A). This pattern of cleavages suggested the existence of at least two stable conformers under structure probing conditions. This prediction turned out to be correct as shown by the result of the transcript analysis by non-denaturing gel electrophoresis (Fig. 4B). Unlike other analyzed transcripts the 23-1-1-1-17 migrated as two distinct species. The question was what is the structure of each of these stable conformers? In answering this question the results of 16-1-1-1-10 transcript structure analysis turned out to be helpful (Fig. 3, C and E). This shorter transcript has a similarly localized interruption region, i.e. the length difference between the 5' and 3' pure CAG repeat tracts is identical (
Three Interruptions Rearrange the Structure of the Repeat RegionIt appears from the analysis of the transcript in the 11-1-1-1-1-1-15 configuration (Fig. 5) that its structure differs from those of transcripts harboring two asymmetrically located interruptions (Fig. 3). The nucleases do not cut the internucleotide bonds belonging to the first repeats at the pure 3'-repeat tract. Such cleavages were observed for the similar transcript 12-1-1-1-17 in which a terminal loop was present at that site. Instead, the efficient cleavages occur in central parts of pure repeat tracts at A5-A7 and A23-A25 as well as in the region of interruptions A12-G17 (Fig. 5A). This cleavage pattern resembles that observed for conformer II of the 23-1-1-1-17 transcript (Fig. 4) and supports the four way junction structure within the repeat region. In this structure the helix H3, two hairpins composed of pure CAG repeats and lower stability hairpin containing the interruptions can be distinguished (Fig. 5B). The formation of a branched structure by the repeat region of this transcript is supported by its retarded migration during electrophoresis in non-denaturing polyacrylamide gels (Fig. 5C). It migrates slower than the transcript 17-1-1-1-16, which despite being 5 repeats longer has a more regular structure, and only slightly faster than transcript pure(CAG)54 that forms a single extended hairpin.
Structures of the SCA1 Transcripts Are the Same in Cellular ExtractsThe question often asked is whether the structures determined in vitro are also present in their natural cellular environment? In an attempt to answering this question we performed structure probing experiments in the whole cell extract from the human fibroblast cell line. It turned out from the analysis of two transcripts: 12-1-1-1-17 (Fig. 6) and 12-1-1-1-14 that the cuts generated by nucleases S1, T2, and T1 closely resembled those obtained in buffered water solution in the presence of Na1+ and Mg2+ ions only, and the observed differences were rather quantitative than qualitative supporting the notion that the structures which are most stable in vitro are likely to exist also in cells (3942).
In this study we have determined the structures of numerous variants of the CAG repeat region in the SCA1 transcript. These variants are representative for the alleles most commonly occurring in a normal population and for those present in SCA1 patients. It turned out from this analysis, that the uninterrupted repeat tracts in two mutant SCA1 transcripts: pure(CAG)53 and pure(CAG)54, which are typical repeat lengths in SCA1 patients, form undistorted hairpin structures. They contain a stem composed of the periodically repeated two G-C and C-G pairs followed by A-A mismatch, and terminal loops of different sizes depending on the repeat number. This is so because a strong natural clamp is formed by the specific flanking sequences at the base of the repeat hairpin stem. This clamp forces the repeats to a single alignment, and the 7-nucleotide loop is formed in the transcript containing an even number of repeats whereas the 4-nucleotide loop is present in those having an odd repeat number. This pattern of the stem structure and loop sizes follows those observed for shorter (CAG)16 and (CAG)17 repeat tracts containing an artificial G-C clamp (20). Thus, it may be concluded from the above that both the uninterrupted normal repeats as well as expanded repeats present in mutation carriers form the same type of regular hairpin structure (Fig. 7). The only difference between transcripts from normal and mutant alleles is the length and stability of the repeat hairpin stem.
The main focus of this study was on the role of the CAU sequences interrupting the CAG repeat tracts in the SCA1 transcript. In this discussion we will consider them as the single G U base substitutions occurring at either one, two, or three closely located sites within the CAG repeat. In the regular CAG repeat hairpin stem such substitutions would result in the U-C or C-U base oppositions if they were located within the 5'- or 3'-part of the repeated sequence, respectively. Often the interruptions occur centrally within the repeat tract and the question was: whether they accommodate themselves in the repeat hairpin stem or reside in the terminal loop? The answer is that the C-U oppositions are reluctant to stay in the stem and they enlarge the loop structure. The next question was: how do the U bases behave when their locations are more remote from the terminal loop? Will they be forced by their putative stem neighbors to stay within the stem or not? The answer to this question is also negative. They either form the local internal loops, which are well recognized by the single-strand specific probes or they redesign the structure of the repeat region entirely by splitting the repeated sequence into two separate hairpin structures (Fig. 7). What is specific about the U-C and C-U oppositions that they have been selected to destabilize and reorganize hairpin structures formed by the CAG repeats? Why are they so rare in natural RNAs compared with other non-WC base pairs? Which of the postulated two geometries of the U-C pairs occurs in the RNAs: the U-C+ (with the neighboring carbonyl and imino groups involved in the formation of hydrogen bonds) or the neutral U-C pair (with N3-imino proton of U forming a water-mediated hydrogen bond to the N3 of C)? Some answers to these questions come from the crystal structures of RNA duplexes containing the U-C pairs (4345). It appears that the RNA duplex containing two tandem U-C mismatches in its central positions is very unstable compared with a similar duplex with the U-U mismatches (43). The presence of eight water molecules, four per each U-C pair, bound to stabilize these pairs may be the factor that causes a severe nearly 2-fold widening of the major groove as compared with A-RNA, resulting in duplex destabilization (43). It is tempting to speculate that a similar effect may be the driving force for the CAG repeat duplex destabilization in the SCA1 transcripts. The periodic A-A mismatches that make the stem structure more relaxed as compared with the regular duplex may facilitate both the local structure changes and its global rearrangements. Thus, the number and localization of U-residues within the CAG repeat tract in the SCA1 transcript are the factors determining structure of the repeat region. We postulate that the role of the CAG repeat interruptions is either the weakening or effective shortening of hairpins formed by pure repeats (Fig. 7), most likely to diminish their ability to interact with double-stranded RNA-binding proteins. Transcripts containing the uninterrupted double-stranded CAG repeats were shown earlier to activate the RNA-dependent protein kinase PKR (46, 47), and form a complex with the specific 63 kDa CAG repeatbinding protein (48). Such interactions resulting in protein activation or sequestration were considered as effects potentially triggering the mechanism of RNA-mediated pathogenesis also in a group of neurodegenerative diseases caused by the CAG repeat expansions (1721,49). The prominent role of RNA is well documented for myoblast pathogenesis in two forms of myotonic dystrophy (15), and RNA toxicity is becoming recognized as an underlying cause of the more recently described FXTAS (1618). In the case of DM1 and DM2 the repeat length-dependent sequestration of the muscleblind family proteins (22) is postulated, whereas the specific proteins that bind to the CGG repeat in FXTAS patients have not been characterized yet. It will be intriguing to see the extent to which the RNA-mediated pathogenesis shapes the polyglutamine toxicity in the cerebellum neurons of SCA1 patients. In Fig. 7 we presented in a simplified way the different types of structures formed by the repeat regions in the investigated SCA1 transcripts in order to see how they fit into the model of a putative RNA role in the SCA1 pathogenesis. The shortest CAG repeat tract found in SCA1 gene of spinocerebellar ataxia type 1 patients contains 39 pure CAG repeats. Both structures shown in Fig. 7 (left panel) have reached the pathogenic threshold, which is defined by the repeat hairpin stem composed of at least 38 CAG repeats. On the contrary, none of the transcripts from the normal SCA1 alleles shown in Fig. 7 (right panel) reaches this threshold. However, there are some unusual SCA1 alleles e.g. (CAG)45-CAT-CAG-CAT-(CAG)10 for which their total length (58 repeats) suggested an early onset of the disease at 22 years of age, whereas the actual age at SCA1 onset was 50 years (30). In addition, one of the SCA1 variants analyzed in this study, which belonged to an unaffected carrier, had 43 repeats in the 23-1-1-1-17 configuration (Fig. 7) i.e. did not reach the pathogenic threshold. In a 45-1-1-1-10 transcript similar type of structure to that of 23-1-1-1-17 is expected. The longer arm of this branched structure passes the pathogenic threshold reaching 45 CAG repeats. Thus, differences in the structures of transcripts from different SCA1 alleles may well explain their variable effects on phenotype. Finally, the SCA1 gene is one of three genes implicated in triplet repeat expansion diseases that are equipped in the elaborated system of interruptions in repeat tracts. The SCA1 is, however, unique in the sense that its interruption system, which was likely developed to prevent DNA repeat expansions, has the potential to operate also on the levels of RNA and protein. The destabilization of longer potentially pathogenic CAG hairpins in RNA, and the insertion of histidines into the polyglutamine tracts in ataxin-1 may be considered the possible second and third lines of defense against SCA1 pathogenesis.
* This work was supported by the State Committee for Scientific Research, Grants No 2P05A-08826, PBZ-KBN-040/P04/2001, and Foundation for Polish Science, Grant 8/2000. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 The abbreviations used are: SCA1, spinocerebellar ataxia type 1; UTR, untranslated region; ODN, oligodeoxynucleotide; nt, nucleotide.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||