JBC Origene Your Gene Company

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M406415200 on July 29, 2004

J. Biol. Chem., Vol. 279, Issue 40, 41715-41726, October 1, 2004
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
279/40/41715    most recent
M406415200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Dere, R.
Right arrow Articles by Wells, R. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dere, R.
Right arrow Articles by Wells, R. D.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Hairpin Structure-forming Propensity of the (CCTG·CAGG) Tetranucleotide Repeats Contributes to the Genetic Instability Associated with Myotonic Dystrophy Type 2*

Ruhee Dere{ddagger}, Marek Napierala{ddagger}, Laura P. W. Ranum§, and Robert D. Wells{ddagger}

From the {ddagger}Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University System Health Science Center, Texas Medical Center, Houston, Texas 77030-3303 and the §Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota 55455

Received for publication, June 9, 2004 , and in revised form, July 21, 2004.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The genetic instabilities of (CCTG·CAGG)n tetranucleotide repeats were investigated to evaluate the molecular mechanisms responsible for the massive expansions found in myotonic dystrophy type 2 (DM2) patients. DM2 is caused by an expansion of the repeat from the normal allele of 26 to as many as 11,000 repeats. Genetic expansions and deletions were monitored in an African green monkey kidney cell culture system (COS-7 cells) as a function of the length (30, 114, or 200 repeats), orientation, or proximity of the repeat tracts to the origin (SV40) of replication. As found for CTG·CAG repeats related to DM1, the instabilities were greater for the longer tetranucleotide repeat tracts. Also, the expansions and deletions predominated when cloned in orientation II (CAGG on the leading strand template) rather than I and when cloned proximal rather than distal to the replication origin. Biochemical studies on synthetic d(CAGG)26 and d(CCTG)26 as models of unpaired regions of the replication fork revealed that d(CAGG)26 has a marked propensity to adopt a defined base paired hairpin structure, whereas the complementary d(CCTG)26 lacks this capacity. The effect of orientation described above differs from all previous results with three triplet repeat sequences (including CTG·CAG), which are also involved in the etiologies of other hereditary neurological diseases. However, similar to the triplet repeat sequences, the ability of one of the two strands to form a more stable folded structure, in our case the CAGG strand, explains this unorthodox "reversed" behavior.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Myotonic dystrophy type 2 (DM2)1 is a dominantly inherited neurological disease caused by the expansion of a (CCTG·CAGG) tetranucleotide repeat in intron 1 of the zinc finger protein 9 (ZNF9) gene (1). The characteristics of DM2 are very similar to those observed for DM1 and include myotonia, proximal weakness, frontal balding, cardiac arrhythmias, insulin-resistance associated diabetes mellitus, polychromatic cataracts, and infertility (26). The largest normal allele was found to contain 26 (CCTG·CAGG) repeats, whereas the repeats were expanded to 75–11,000 (average of 5,000 repeats) in patients (1). This is the largest known repeat expansion associated with a disease; also, it is the first tetranucleotide repeat to be implicated in a hereditary neurological disease.

At least 15 hereditary neurological diseases (i.e. myotonic dystrophy, fragile X syndrome, and Friedreich's ataxia) are associated with the expansions of (CTG·CAG)n, (CGG·CCG)n, or (GAA·TTC)n repeat tracts, respectively (713). Additionally, two other non-triplet repeat neurological diseases, spinocerebellar ataxia type 10 (SCA10) (14) and progressive myoclonus epilepsy of Unverricht-Lundborg type (EPM1) (15), are caused by the expansions of pentanucleotide and dodecanucleotide repeating sequences, respectively.

Replication (9, 10, 1623), recombination (2428), and repair (10, 21, 29, 30) were shown to be responsible for the instabilities of triplet repeat sequences. Slippage of the repeats (3134) as promoted by non-B DNA structures (911, 3537) formed by these repeating sequences causes polymerase to pause during replication, as shown both in vivo as well as in vitro (17, 20, 3843), thereby generating instabilities. Furthermore, these structures are also recognized by mismatch repair (MMR) (29, 4448) and nucleotide excision repair (NER) (49, 50); both pathways have been implicated in the stability of the secondary structures, thus influencing the expansion and deletion processes. Also, double strand breaks caused by replication fork arrest or repair of the non-B DNA structures induces repair-mediated recombination that may participate in the expansions observed in both prokaryotic as well as eukaryotic model systems (21, 30, 5156). Triplet repeat sequences are hotspots for recombination, which may account for the massive expansions found in certain diseases (2428, 57, 58).

Herein, we show that the repeating tetranucleotide genetic instabilities associated with DM2 are caused by a structure-mediated replication-based slippage mechanism. The difference in the propensities of the CAGG and CCTG strands to transiently form quasistable DNA hairpin structures determines the type and level of instability. Furthermore, this instability is influenced by the length, orientation, and position of the repeats with respect to the origin of replication.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Construction of the (CCTG·CAGG)n Containing Shuttle Vector—The (CCTG·CAGG) tetranucleotide repeats (1) were obtained from three pCR2.1TOPO derivatives containing 30, 114, or 200 repeats. All three (CCTG·CAGG)n inserts (where n = 30, 114, or 200) were excised from their respective pCR2.1TOPO derivative plasmids using either an EcoRV/EagI digest or an EcoRI digest (all enzymes used in this study were purchased from New England Biolabs, Inc.). The (CCTG·CAGG) inserts are flanked on either side by vector sequences and lack any flanking non-repetitive human DM2 sequences. On excising the inserts using the EcoRV/EagI digest, in addition to the 30, 114, and 200 repeats there are 21 bp of vector flanking sequences on the EcoRV side of the insert and 24 bp of vector flanking sequence on the EagI side. The EcoRV/EagI fragments were filled-in using 1 unit of the Klenow fragment of Escherichia coli DNA polymerase I (U. S. Biochemical Corp.), purified on a 5.5% polyacrylamide gel in TAE buffer (40 mM Tris acetate, 1 mM EDTA, pH 8), and blunt-end ligated into the SmaI site of pCDNA3.1 (Invitrogen). Alternately, the (CCTG·CAGG)n tracts were excised using an EcoRI digest, purified on a 5.5% polyacrylamide gel, and ligated into the MfeI site of the same shuttle vector. This fragment had 11 bp of pCR2.1TOPO vector flanking sequence on one side and 9 bp of vector flanking sequence on the other side of the repeating tract. The ligations were performed at 16 °C for 16 h by the addition of 20 units of T4 DNA ligase (U. S. Biochemical Corp.) in the presence of 1 mM ATP followed by transformation into E. coli HB101 (New England Biolabs, Inc.) (F{Delta} (gpt-proA)62, leuB6, glnV44, ara14, galK2, lacY1, {Delta}(mcrC-mrr), rpsL20 (Strr), xyl5, mtl-1, hsdS20 (r B, m B), recA13) and plated on LB plates containing ampicillin (100 µg/ml). Individual colonies were grown in LB broth containing ampicillin (100 µg/ml) for 16 h at 37 °C. Plasmids were isolated using the alkaline lysis procedure (Promega, Wizard Plus Miniprep DNA Purification System).

The plasmids carrying the (CCTG·CAGG)n inserts were characterized using restriction mapping and DNA sequencing. An EcoRI digest was used for repeat tracts cloned proximal (SmaI site) to the SV40 origin of replication and an AflIII/BglII digest was used for inserts cloned into the site distal (MfeI site) to the same origin of replication. The restriction fragments were end-labeled with [{alpha}-32P]dATP and 1 unit of the Klenow fragment of E. coli DNA polymerase I and analyzed on 5.5% polyacrylamide gels. Furthermore, the plasmids containing the (CCTG·CAGG)n repeats were dideoxy sequenced on both strands to determine the length, purity, and orientation of the repeats with respect to the origin of replication. The sequencing reactions were performed using the Thermo Sequenase Radiolabeled Terminator Cycle Sequencing Kit (U. S. Biochemical Corp.) according to the manufacturer's recommendations. The products of the sequencing reactions were analyzed on 6% Long Ranger gels (FMC BioProducts) containing 7.5 M urea in the glycerol tolerant gel buffer (1.78 M Tris, 0.57 M taurine, and 0.01 M EDTA) (U. S. Biochemical Corp.). The (CCTG·CAGG)30 is a pure repeat (i.e. it contains no polymorphisms/interruptions) as determined by sequencing of the entire repeat containing tract. The (CCTG·CAGG)114 and the (CCTG·CAGG)200 carried a single base pair interruption 11 repeats into the tract to give the sequence (CCTG)11CCTT(CCTG)n, where n = 102 and 188, respectively. The (CCTG·CAGG)n (where n = 114 or 200) repeats were sequenced using primers located at both ends of the repeating tract. In case of the (CCTG·CAGG)114, sequencing enabled the determination of ~200–300 bp into the repeats from either side; however, on excising and analyzing the repeat containing fragment from the plasmid, an error of approximately ±5 repeats was obtained. Similarly, on analyzing the excised (CCTG·CAGG)200 fragment on polyacrylamide gels, the estimated repeat length varied by ±5 repeats. Thus, the repeat lengths were estimated by both sequencing and fragment analyses. The plasmids carried the (CCTG·CAGG)n repeat tracts in both orientations with respect to the SV40 origin of replication (Fig. 1). However, the SV40 origin is bidirectional (59) and is the operative origin in COS-7 cells (60). Orientation I refers to the orientation in which the CCTG repeats are on the leading strand template with respect to the SV40 origin of replication, whereas the orientation in which the CCTG repeats are on the lagging strand template with respect to the same origin are referred to as orientation II.



View larger version (22K):
[in this window]
[in a new window]
 
FIG. 1.
Plasmids used in the study. The (CCTG·CAGG)n repeats (where n = 30, 114, or 200) were cloned into the SmaI site (proximal to the SV40 origin of replication) or the MfeI site (distal to the SV40 origin of replication) of pcDNA3.1 in both orientations relative to the bidirectional SV40 origin of replication. Orientations I and II are defined under "Experimental Procedures."

 
Transfection of the (CCTG·CAGG)n Repeat Containing Plasmids into COS-7 Cells—The plasmids containing the (CCTG·CAGG)n tracts were transfected into COS-7 cells. The COS-7 cells were grown in Dulbecco's modified Eagle's media (Sigma) containing 10% fetal bovine serum (Invitrogen) on 10-cm diameter plates. The cells were 75% confluent when transfected with 2 µg of DNA using Lipofectamine 2000 (Invitrogen). The COS-7 cells were allowed to grow for 24 h after transfection before replacing the Dulbecco's modified Eagle's medium. The cells were then cultured for either 48 h or 2 weeks after transfection. The COS-7 cells cultured for 2 weeks were split every 36–48 h when the cells were almost 100% confluent. Antibiotic selection (geneticin (G418), 400 µg/ml) (Invitrogen) was applied 48 h after transfection and continued thereafter for the entire 2-week period. The episomal DNA was isolated using alkaline lysis (Promega, Wizard Plus Miniprep DNA Purification System). This episomal DNA was then cleaved with 10 units of DpnI at 37 °C for 2 h. DpnI cleaves only DNA that is methylated at its GATC recognition site by the DAM methylase. Thus, treatment of the episomal DNA preparation fragments only the unreplicated DNA that is methylated, leaving the newly replicated but unmethylated or hemimethylated DNA intact. The effectiveness of the DpnI cleavage was assessed by digesting the parental plasmids with DpnI, followed by transformation in E. coli HB101 and plating on LB plates containing ampicillin (100 µg/ml). The absence of colonies on LB plates confirmed the complete fragmentation of the plasmids by DpnI. The DNA after the DpnI digestion was then further purified by phenol-chloroform extractions and ethanol precipitation.

Genetic Instabilities by Individual Colony Analyses—The episomal DNA, obtained after transfection in COS-7 cells cultured for either 48 h or 2 weeks, was transformed into E. coli HB101 and plated on LB plates containing ampicillin (100 µg/ml). Individual colonies were then picked and grown in LB broth for 16 h at 37 °C. The analyses of single colonies enabled the detection of individual events during replication of the plasmids in COS-7 cells. The plasmids were isolated using alkaline lysis and the DNA analyzed by restriction mapping. The (CCTG·CAGG)n inserts were excised using an EcoRI digest for plasmids carrying the repeats cloned into the SmaI site, whereas an AflIII/BglII digest was used to excise the inserts cloned into the MfeI site. The restriction fragments were resolved on 5.5% polyacrylamide gels. The genetic instability was measured as a change in the length of the (CCTG·CAGG)n repeat containing fragments when compared with the size of the insert excised from the parental plasmid and the 1-kbp DNA ladder size standard (Invitrogen). The lengths of each insert excised from individual colonies were determined using FluorChem version 3.04 (Alpha Innotech Corp.). Furthermore, ~20 random clones containing expansions were sequenced to confirm the lengths and sequences of the repeat containing inserts.

The genetic instability (expansions and deletions) of the (CCTG·CAGG)n repeats generated in COS-7 cells is calculated by subtracting the background instability of the repeat-containing plasmids in E. coli HB101. The genetic instability of the parental plasmids was measured by transforming these plasmids into E. coli HB101 and analyzing individual colonies using restriction mapping. The restriction digests used for these analyses were similar to those used for individual colony analyses of the episomal DNA. The percentage of background instability (average of two experiments for the three tetranucleotide lengths in both orientations (total of six experiments)) varied from 2 to 7% for repeats cloned proximal (SmaI site) and from 0 to 12% for repeats cloned distal (MfeI site) (except for (CCTG·CAGG)200 in orientation I, which was 23%) to the SV40 origin (data not shown). All statistical analyses were performed using SigmaStat version 2.03.

Substrate Preparation for Chemical Modification and Enzymatic Probing—To analyze the structural features of the (CCTG·CAGG) repeats, we used "single-stranded" synthetic oligonucleotides as models of unpaired regions of the duplex repeat sequences for chemical and enzymatic modification studies. The individual oligonucleotides (Genosys), d(CCTG)26 and d(CAGG)26, were purified on a 6% denaturing polyacrylamide gel containing 7.5 M urea in glycerol tolerant gel buffer (U. S. Biochemical Corp.). The purified oligonucleotides were labeled at the 5' end with 15 units of T4 polynucleotide kinase (U. S. Biochemical Corp.) and [{gamma}-32P]ATP at 37 °C for 1 h. The labeled oligonucleotides were purified on a 6% denaturing polyacrylamide gel. These purified and labeled oligonucleotides were used as substrates for the chemical and enzymatic probing reactions.

Chemical Modifications and Enzymatic Probing—Three chemical probes, osmium tetraoxide (OsO4) (Aldrich), potassium permanganate (KMnO4) (Fisher), and diethyl pyrocarbonate (DEPC) (Sigma), each were used to modify the d(CCTG)26 oligonucleotide, whereas the latter two chemicals were used to modify the d(CAGG)26 oligonucleotide. The purified and labeled oligonucleotides (4–5 x 105 cpm/reaction) in 10 mM Tris, 40 mM NaCl, and 10 mM MgCl2 were denatured by heating at 80 °C for 5 min followed by renaturation by gradually decreasing the temperature (2 °C/min) to the indicated reaction temperature (61). The chemical and enzymatic probes were then added along with the carrier DNA (salmon sperm DNA, 1 µg/µl) (Invitrogen).

The DEPC modification of the oligonucleotides was carried out in a buffer containing 50 mM sodium cacodylate, 40 mM NaCl, and 10 mM MgCl2 (pH 7.0) at 25 °C for 60 min (40, 6267). The KMnO4 reaction was performed at 20 °C for 30 min in a buffer containing 50 mM sodium cacodylate and 1 mM EDTA (pH 7.0) (65, 6871). A final concentration of 8% DEPC and 0.1, 0.5, and 1 mM KMnO4 was used. The chemical modification with 1 mM OsO4 in the presence of 1 mM 2,2'-bipyridine (Sigma) and 150 mM sodium phosphate (pH 7.8) was carried out at 5 °C for 30 min (64, 65, 67, 7274). The modified oligonucleotides were then ethanol precipitated and cleaved at the phosphodiester bonds using 1 M piperidine (67, 72, 73, 75, 76). A series of ethanol precipitations and lyophilization steps then were used to ensure removal of the piperidine.

The enzymatic probes used included S1 nuclease (77, 78) (Invitrogen), P1 nuclease (65, 71, 7981) (Invitrogen), and mung bean nuclease (8284) (New England Biolabs Inc.). All of the enzymatic probing reactions were carried out at 5 °C for 60 min in a buffer containing 10 mM Tris, 40 mM NaCl, and 10 mM MgCl2 (pH 7.2). 1 mM ZnCl2 was included in the buffer used for the S1 nuclease reactions (75, 78). Concentrations ranging from 4.3 to 8.7 units of S1 nuclease and 0.25–1.0 units of mung bean nuclease were used in a 10-µl total reaction volume. P1 nuclease was used in concentrations ranging from 0.05 to 0.25 µg/ml. The details of the actual concentrations are in the legends to Figs. 5 and 6. The reactions were stopped by addition of a urea-EDTA dye solution (61) followed by quick freezing on dry ice.



View larger version (63K):
[in this window]
[in a new window]
 
FIG. 5.
Analyses of the products of chemical and enzymatic probes on d(CAGG)26. The preparation and characterization of the labeled synthetic oligonucleotide (CAGG)26 is described under "Experimental Procedures." A, analyses of the probing for oligonucleotide secondary structure on 10% polyacrylamide gels with 7.5 M urea. Data obtained from S1 nuclease (S1), P1 nuclease (P1), mung bean nuclease (MB), and DEPC are shown. The numbers to the right of each of the panels indicate the number of tetranucleotide repeats (as designated on the second G residue of the CAGG repeats). Increasing concentrations of enzymatic probes (4.3, 6.5, and 8.7 units of S1 nuclease, 0.15, 0.2, and 0.25 µg/ml P1 nuclease and 0.25, 0.5, and 1.0 unit of mung bean nuclease) are indicated by a triangle above each of the panels. The triangle above the DEPC panel indicates an 8% DEPC solution used over two increasing time points of 30 and 60 min. C, control (no probe added); G, size marker (Maxam-Gilbert sequencing with dimethyl sulfate). B, possible secondary structures formed by d(CAGG)26. The reactivity of the oligonucleotide to the various probes has only been shown for the 10th CAGG repeat of the stem of the proposed hairpin loop structure for clarity; however, the same cleavage pattern was seen for all of the other CAGG repeats in the stem of the hairpin loop. Furthermore, the cleavage of the phosphodiester bonds in the terminal loop of the hairpin has also been indicated. The symbols corresponding to the various probes are shown to the right of the figure. The length of the symbols corresponds to the cleavage intensity as quantitated using ImageQuant version 5.1. The filled circles between the two DNA strands indicate Watson-Crick pairing and the open circles denote non-Watson Crick pairing.

 



View larger version (94K):
[in this window]
[in a new window]
 
FIG. 6.
Analyses of the products of chemical and enzymatic probes on d(CCTG)26. The results of the probing for oligonucleotide secondary structure in d(CCTG)26 using S1 nuclease (S1), P1 nuclease (P1), DEPC, and osmium tetraoxide (OsO4) are shown. Analyses were performed in 10% polyacrylamide gels with 7.5 M urea. Increasing concentrations of the enzymatic probes (4.3, 6.5, and 8.7 units of S1 nuclease, 0.05, 0.1, and 0.15 µg/ml of P1 nuclease) are indicated as triangles above each panel. The triangle above the panel for DEPC indicates two increasing time points of 30 and 60 min using an 8% DEPC solution. The triangle above the panel for OsO4 indicates two increasing time points of 15 and 30 min using 1 mM OsO4 in the presence of 1 mM 2,2'-bipyridine. The numbers to the right denote the number of tetranucleotide repeats (as designated on the second G residue of the CCTG repeats). C, control (no probe added); G, size marker (Maxam-Gilbert sequencing using dimethyl sulfate).

 
The products of the chemical and enzymatic reactions were then analyzed on 10% denaturing polyacrylamide gels using glycerol tolerant gel buffer. The size marker was prepared using the Maxam-Gilbert sequencing reaction on each of the synthetic deoxyoligonucleotides with dimethyl sulfate (Sigma) followed by piperidine cleavage (40, 63, 75). The gels were analyzed using a Amersham Biosciences Storm 820 (Amersham Biosciences) PhosphorImager and quantitative analyses were performed using ImageQuant version 5.1.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Strategy of Study—The (CCTG·CAGG)n repeats were cloned either proximal (SmaI site) or distal (MfeI site) to the SV40 origin of replication in the pcDNA3.1 shuttle vector (Fig. 1). An African green monkey kidney cell culture system was used to determine the genetic instability of the (CCTG·CAGG)n repeats associated with DM2. COS-7 cells were transfected with the tetranucleotide repeat containing plasmids (Fig. 1) and were then cultured for either 48 h or 2 weeks (i.e. for approximately 2 and 14 cell divisions, respectively) to determine the extent of genetic instability after several rounds of replication. The effects of length, orientation, and location of the repeats with respect to the origin of replication were determined.

To evaluate the effect of the length of the (CCTG·CAGG)n repeat tract on the genetic instability, repeats of 30, 114, or 200 (CCTG·CAGG) units were studied. Previous studies showed a marked effect of orientation of trinucleotide repeats (CTG·CAG, CGG·CCG, and GAA·TTC) with respect to the origin of replication (10, 16, 18, 19, 21, 39, 85, 86). Therefore, we studied plasmids carrying the (CCTG·CAGG) repeats in both orientations. The distance of the (CCTG·CAGG)n repeats from the origin of replication may also play an important role in determining the genetic instabilities of these repeats (22).2 Thus, the repeats were cloned either proximal (~74 bp from the center of the SV40 core origin (64 bp in length) (87) at the SmaI site (map position 2078)) or distal (~1842 bp from the center of the same core origin of replication at the MfeI site (map position 162)) to the SV40 origin of replication.

Genetic Instability of the (CCTG·CAGG)n Repeats—The plasmids carrying 30, 114, or 200 (CCTG·CAGG) repeats cloned in both orientations were transfected into COS-7 cells (Figs. 1 and 2) and the cells were cultured. The episomal DNA was isolated using alkaline lysis and the DNA was digested with DpnI (see "Experimental Procedures"). To determine the genetic instability of the episomal DNA after 48-h cultures, Southern blot analyses were performed with the (CCTG)8 oligonucleotide as a probe (data not shown), which confirmed the presence of the (CCTG·CAGG) repeats in the excised fragments. Because, no instability was observed, the more sensitive single colony analyses were performed.



View larger version (16K):
[in this window]
[in a new window]
 
FIG. 2.
Experimental strategy using the mammalian cell culture assay. The (CCTG·CAGG)n tracts (where n = 30, 114, or 200) were cloned either proximal to the origin of replication in the SmaI site (map position 2078) or distal to the origin of replication in the MfeI site (map position 162) (Fig. 1). These plasmids were then transfected into COS-7 cells, cultured for both 48 h and 2 weeks, and the episomal DNA was isolated using the alkaline lysis method. The episomal DNA was digested with DpnI to fragment the unreplicated DNA. The episomal DNA was transformed into E. coli HB101 and individual colonies were analyzed using biochemical analyses. Simultaneously, plasmids that were not replicated in COS-7 cells were also transformed into E. coli HB101 and individual colonies were subjected to similar biochemical analyses (see "Experimental Procedures").

 
The episomal DNA after treatment with DpnI from 48-h and 2-week cultures was transformed into E. coli HB101 and DNA from individual colonies was analyzed by restriction digestion and the products of cleavage were analyzed in high-resolution 5.5% polyacrylamide gels (Fig. 3). These analyses on individual colonies enable the detection of relatively small amounts of expansions and deletions that were not detected (see above) by the population studies by Southern blots. The instability observed (Fig. 2, right side) is the composite instability of the (CCTG·CAGG)n repeats in the eukaryotic and prokaryotic systems. To obtain the percentage of instability (expansions and deletions) in the mammalian system alone, the percentage of instability obtained from the prokaryotic cells was subtracted as the background (left side of Fig. 2). Thus, the percentages of both expansions and deletions as depicted in the bar graphs in Fig. 4 represent the net genetic instability of the (CCTG·CAGG)n tracts in the eukaryotic system.



View larger version (57K):
[in this window]
[in a new window]
 
FIG. 3.
Genetic instability of the (CCTG·CAGG)n repeats. The episomal DNA obtained from COS-7 cells cultured for 2 weeks was digested to excise the tetranucleotide repeats and the products were analyzed on 5.5% polyacrylamide gels. A–C, representative gels showing the various products of genetic instability of the (CCTG·CAGG)n repeats (A, pRW5110; B, pRW5112; C, pRW5114) cloned proximal to the SV40 origin of replication (SmaI site). The repeats were excised using an EcoRI digestion. D–F, representative gels showing the products of genetic instability of the tetranucleotide repeats (D, pRW5116; E, pRW5118; F, pRW5120) cloned distal to the SV40 origin of replication (MfeI site). The insets were excised using an AflIII/BglII digest. Lane M, 1-kbp ladder; lane C, (CCTG·CAGG)n repeats excised from the parental plasmids used as controls. The numbers above each gel indicate the various clones used. The arrowhead on the left side of each gel indicates the size of the full-length progenitor fragment, whereas the brackets above and below the arrowheads indicate the expansions and deletions, respectively. The clones that contained "rearrangements" (see "Results") are designated by arrows above each of the gels. The arrowheads on the right side of each gel indicate the two fragments that result from the digestion of the vector and the faint bands between these two fragments result from an incomplete digestion, as seen in panels B, E, and F.

 



View larger version (20K):
[in this window]
[in a new window]
 
FIG. 4.
Relative amounts of expansions and deletions of (CCTG·CAGG)n repeats. The episomal DNA was isolated from COS-7 cells, which were cultured for 2 weeks, and the lengths of the tetranucleotide repeats were determined as described ("Experimental Procedures"). The percentages of expansions and deletions of the tetranucleotide repeats were calculated by subtracting the background percentages obtained in E. coli HB101. The expansion and deletion percentages of plasmids carrying the (CCTG·CAGG)n tracts (where n = 114 and 200) proximal (SmaI site) and distal (MfeI site) to the SV40 origin of replication are indicated as bars above and below the line marked zero, respectively. The black bars indicate the percentage of expansions and deletions for inserts in orientation I, whereas the gray bars indicate the percentage of expansions and deletions for inserts in orientation II. All data are the average of duplicate experiments with the appropriate background subtracted for each experiment.

 
Analyses of the digestion products from individual colonies by restriction digestion and polyacrylamide gels revealed four different types of products (Fig. 3): the unchanged starting length tetranucleotide repeats, expansions, deletions, and "rearrangements" that showed an alternate digestion pattern from that observed for full-length, expansions and deletions. In some cases, the rearrangements showed a digestion pattern in which the vector backbone was altered such that an expected fragment was missing (e.g. Fig. 3B, lane 5). Previous studies (8890) have shown these products to result from illegitimate recombination in vectors carrying the SV40 origin of replication. To confirm that the rearrangements arose from an inherent property of the vector rather than a result of the presence of the (CCTG·CAGG)n inserts, the pcDNA3.1 vector (lacking the (CCTG·CAGG)n repeats) was transfected into COS-7 cells. The cells were cultured for 2 weeks and the episomal DNA was isolated, DpnI cleaved and transformed into E. coli HB101. The DNA from individual colonies was analyzed on a 1% agarose gel. The number of rearrangements observed were similar to those obtained for the plasmids carrying the (CCTG·CAGG)n repeats. Furthermore, no rearrangements were observed on analyzing individual colonies obtained from transforming the parental plasmids into E. coli HB101. Likewise, rearrangements were not observed previously in other prior instability studies with triplet repeat inserts in E. coli HB101 (10, 16, 18, 19, 39, 86, 9193). This confirms that the rearrangements were because of an intrinsic property of the pcDNA3.1 shuttle vector that carries the SV40 origin of replication.

Length-dependent Instability of the (CCTG·CAGG)n Tetranucleotide Repeats—For CTG·CAG, CGG·CCG, and GAA·TTC trinucleotide repeats, the number of repeat units plays an important role in genetic instability (10, 18, 21, 39, 93, 94). To evaluate the effect of the length of the (CCTG·CAGG)n tetranucleotide repeats (where n = 30, 114, or 200) on the genetic instability, three different lengths of the repeats were studied. The percentage of expansions and deletions found for each of the (CCTG·CAGG)n repeats in COS-7 cells for a 2-week period is indicated in Fig. 4. In general, the longer the length of the (CCTG·CAGG) tract, the greater the observed instability. The (CCTG·CAGG)30 repeats were extremely stable (lack of any genetic instability) when the plasmids harboring these repeats were cultured in COS-7 cells for both 48 h and 2 weeks (Fig. 3, A and D). A statistically significant difference in the genetic instabilities was observed between pRW5114 ((CCTG· CAGG)200, orientation II) and pRW5112 ((CCTG·CAGG)114, orientation II) (p = 0.003). Also, similar statistically significant differences in the genetic instability of these repeats were observed on comparing the two pairs of plasmids pRW5121 and pRW5119 (p = 0.012) and pRW5120 and pRW5118 (p = 0.010). However, in the case of the plasmids carrying (CCTG·CAGG)200 cloned in orientation I proximal (SmaI site) to the SV40 origin of replication, a statistically significant length-dependent effect was not observed.

Furthermore, the increase in genetic instability with an increase in the length of the (CCTG·CAGG)n tract was observed for repeats cloned both proximal (SmaI site) and distal (MfeI site) to the SV40 origin of replication. Thus, there is a positive correlation between the length of the (CCTG·CAGG)n tracts and their genetic instabilities.

Effect of Orientation—Prior genetic instability studies on microsatellites revealed the inequality of leading versus lagging strand DNA replication (i.e. the direction of replication) with respect to the types and amount of products (10, 16, 18, 22, 52, 85, 95, 96). To determine whether a similar orientation effect was observed for the tetranucleotide repeats, (CCTG·CAGG)n repeats were cloned in both orientations with respect to the bidirectional SV40 origin of replication (see "Experimental Procedures"). The percentage of instability (both expansions and deletions) obtained from plasmids cultured in COS-7 cells for 2 weeks (Fig. 4) showed that orientation II was more unstable than orientation I, especially when the inserts were cloned in the SmaI site. This effect was most pronounced for plasmids carrying the longest repeats. Thus, a statistically significant difference in instability was observed when comparing pRW5115 and pRW5114 (p < 0.001). In experiments where the plasmids were cultured in COS-7 cells for 48 h, a similar effect was observed (data not shown) for (CCTG·CAGG)200. Thus, once the tetranucleotide repeats reached a threshold of about 200 repeats, an orientation effect, where orientation II was significantly more unstable than orientation I, was observed. This was true of repeats cloned proximal (SmaI site) to the SV40 origin of replication. Alternatively, for repeats cloned distal (MfeI site) to the SV40 origin, the differences were not statistically significant, (Fig. 4), even for 48-h cultures (data not shown).

Furthermore, on analyzing the types of genetic instabilities i.e. expansions and deletions, it was observed that expansions in most cases predominated in orientation II compared with orientation I (Fig. 4); this behavior was especially pronounced for (CCTG·CAGG)200 when cloned into the SmaI site. A 1.5 to at least 35-fold increase in expansions in orientation II compared with orientation I was observed, which was dependent on the length of the (CCTG·CAGG)n repeats. Analyses of the instability products from plasmids cultured in COS-7 cells for 48 h also showed a similar trend where expansions predominated in orientation II, especially in the case of the repeats cloned proximal (SmaI site) to the SV40 origin of replication (data not shown). Deletions were present in both orientations. The percentage of deletions did in most cases increase as a function of length; however, an orientation effect was not observed.

Hence, the genetic instability was greater in the case of the repeats cloned in orientation II than in I and when the repeats were cloned proximal (SmaI site) rather than distal (MfeI site) to the SV40 origin. Furthermore, as the length of the repeats increased, the orientation effect was more pronounced. Expansions predominated in orientation II compared with I, especially at the SmaI site.

Magnitude of Expansions and Deletions—The ranges of expansions and deletions of the (CCTG·CAGG)n repeats cloned proximal (SmaI site) and distal (MfeI site) to the SV40 origin of replication is shown in Table I. For the (CCTG·CAGG)n repeats cloned proximal (SmaI site) to the SV40 origin of replication (Table I), the largest expansion product obtained from (CCTG·CAGG)114 (pRW5112) was a (CCTG·CAGG)210, showing a 1.84-fold increase in the number of repeats. pRW5114 expanded from 200 repeats to a maximum of 300 repeats, indicating a 1.5-fold increase in length. Both of these expansions were obtained for the repeats cloned in orientation II (Table I). The deletions obtained for the (CCTG·CAGG)n repeats cloned proximal to the SV40 origin of replication ranged from a complete loss of the repeats to a reduction of 2 or 3 repeats (Table I). Plasmids harboring the shortest repeats (n = 30) cloned into either of the sites were completely stable in all experiments.


View this table:
[in this window]
[in a new window]
 
TABLE I
Expansions and deletions of (CCTG·CAGG)n repeats

The episomal DNA was isolated from COS-7 cells cultured for 2 weeks and the sizes of the inserts were analyzed by restriction digestion and polyacrylamide gel electrophoresis ("Experimental Procedures"). The ranges of expansions and deletions of the (CCTG·CAGG)n repeats are shown; the estimated repeat length varied by ±5 repeats for the plasmids carrying either 200 or 114 repeats. The (CCTG·CAGG)30, however, was sequenced to indicate the precise number of repeats. In general, the distribution of product sizes within these ranges was random. The bracketed values indicate the average of the observed values. pRW5110 and pRW5116 were extremely stable after 2 weeks; hence, the data for these DNAs were derived from 48 h culture studies.

 
The largest percent expansion obtained for the (CCTG·CAGG)n repeats cloned distal (MfeI site) to the SV40 origin (Table I) was 56 repeats or 1.5-fold for pRW5119. The largest increase for (CCTG·CAGG)200 was 14 repeats. The deletions ranged from a complete loss of repeats to a loss of only 2 or 3 repeats similar to those observed for repeats cloned proximal to the SV40 origin (Table I). Approximately 20 clones containing expansions were chosen at random and the inserts sequenced; all inserts contained the (CCTG·CAGG) repeat tracts within the limits of our detection.

The plasmids cultured in COS-7 cells for a period of 48 h (data not shown) showed expansions ranging between 118 and 144 repeats for (CCTG·CAGG)114 and between 202 and 370 repeats (up to 1.85-fold) for (CCTG·CAGG)200. The deletions were also within a similar range as observed for the plasmids cultured for 2 weeks. This range of expansions and deletions was similar for repeats cloned both proximal (SmaI site) and distal (MfeI site) to the SV40 origin.

In summary, a range of expansions and deletions were found. The larger expanded products were found for the repeats cloned in orientation II compared with I. Furthermore, the magnitude of expansions was much larger for repeats cloned proximal to the origin than for repeats cloned distal.

Oligonucleotide Model Studies: Enzymatic Probing—Two oligonucleotides, d(CAGG)26 and d(CCTG)26, were chemically synthesized to study their structural properties as related to the behavior of unpaired regions of the (CCTG·CAGG) repeats during replication and related processes that unwind the duplex. d(CAGG)26 and d(CCTG)26 were purified and labeled ("Experimental Procedures"). The labeled oligomers were then probed with conformation-sensitive enzymatic agents (S1 nuclease, P1 nuclease, and mung bean nuclease) (65, 71, 77, 78, 8082, 84, 97). All three enzymes have been used widely to evaluate ordered and H-bonded pseudo-duplex DNA regions within DNA tracts that are otherwise disordered, random-coil structures (65, 71, 78, 79, 97).

S1 nuclease from Aspergillus oryzae, a zinc requiring enzyme with an acidic pH optimum, cleaves single-stranded and partially unpaired DNA and is not base-specific (77, 78, 98). On probing d(CAGG)26 with S1 nuclease (Fig. 5), the greatest reactivity was observed between the cytosine and adenine residues of the 14th CAGG repeat, followed by weaker cleavage between the adenine and guanine of the 14th CAGG repeat, as well as cleavage between the first and second guanine residues of the same repeat. Thus, these residues are more accessible to the enzymatic probe and may form a terminal loop of a hairpin structure. The stem of the hairpin probed with S1 nuclease showed cleavage of the phosphodiester bonds between residues CpA, ApG, and GpG (Fig. 5), but the intensity of cleavage in the proposed stem is weaker than that found in the putative loop (Fig. 5A). Alternatively, on probing the labeled d(CCTG)26 (Fig. 6), an equal intensity of cleavage was observed between the CpT and TpG residues, suggesting the lack of formation of a stable secondary structure.

P1 nuclease from Penicillium citrium is similar in its behavior to S1 nuclease in preferentially cleaving single-stranded, non-helical DNA tracts or regions that are transiently unpaired (65, 71, 7981, 98). However, P1 nuclease has a neutral pH optimum and lacks a requirement for zinc. Probing the d(CAGG)26 substrate with P1 nuclease further supported the evidence for formation of a terminal loop as demonstrated by the hypersensitivity of the substrate to this enzyme between the residues ranging from the adenine to the guanine residues of the 13th CAGG repeat and between the adenine and guanine of the 14th CAGG repeat. Furthermore, cleavage was also observed between the cytosine and adenine residues of the 14th CAGG repeat (Fig. 5). Cleavage occurred between ApG, GpG, and CpA with the strongest cleavage between the ApG and GpG residues in the stem (as illustrated for repeat 10 but also found for all other repeats in the stem) (Fig. 5). d(CCTG)26 showed a similar cleavage pattern to that observed with S1 nuclease when probed with P1 nuclease, where the phosphodiester bonds between the CpT and TpG residues were cleaved with an equal intensity (Fig. 6), again indicating that the d(CCTG)26 oligonucleotide did not form a stable secondary structure.

Mung bean nuclease is also highly sensitive to variations in DNA structure and converts single-stranded or unpaired DNA to mono- or oligonucleotides with 5'-phosphates (8284, 98). The probing of d(CAGG)26 with mung bean nuclease showed predominant cleavage between the first and second guanines of the 13th CAGG repeat, the cytosine and adenine, as well as the first and second guanine residues of the 14th CAGG repeat. Less cleavage was observed between the adenine and guanine residues of the 14th CAGG repeat (Fig. 5). These residues form the proposed terminal hairpin loop. The bonds between the Gs of each of the CAGG repeats (Fig. 5) were more resistant to cleavage than for the bonds in the putative loops; thus, we propose that they exist in the stem. Alternatively, the probing of d(CCTG)26 with mung bean showed cleavage of the bonds between GpC and CpC of each of the CCTG repeats (data not shown), thus indicating the lack of a stable secondary structure in agreement with the results from S1 and P1 nucleases. Hence, the enzymatic probing studies revealed the preferential formation of a hairpin structure by d(CAGG)26 as opposed to d(CCTG)26.

Chemical Probe Determinations—d(CAGG)26 and d(CCTG)26 were individually probed with OsO4, DEPC, or KMnO4. OsO4 specifically reacts with thymines, and to a much lesser extent with cytosines, in single-stranded oligonucleotides and single-stranded or distorted double-stranded regions in DNA to yield mainly cis-thymine glycols (39, 64, 65, 7274). OsO4/piperidine was used to probe d(CCTG)26 (Fig. 6). An equal intensity of cleavage was observed at each of the thymine residues of the CCTG repeats. This indicates the equal accessibility of the pyrimidines and hence the oligonucleotide did not form any preferential secondary structure. This probe was not used for the complementary oligomer because it lacked thymine residues.

DEPC reacts with the N7 positions of the purines, both adenines and guanines in single-stranded DNA, to form the corresponding ring-opened dicarboxyethoxylated derivatives (39, 40, 6264, 66, 67). DEPC/piperidine was used to probe both d(CAGG)26 and d(CCTG)26. Similar to the enzymatic probing, the most effective modification occurred at the adenine and the first guanine residues of the 13th CAGG repeat and at the adenine of the 14th CAGG repeat (Fig. 5), indicating the formation of a terminal loop of the putative hairpin structure. The putative stem showed cleavage of the adenine residues to a greater extent than the guanine residues (Fig. 5) as previously stated. In the case of d(CCTG)26, DEPC modified the Gs of each of the CCTG repeats and an equal intensity of cleavage was observed at all of the Gs (Fig. 6).

KMnO4 preferentially oxidizes unpaired or unstacked thymines (and to a much lesser extent cytosines) resulting in strand cleavage upon subsequent treatment with piperidine (65, 68, 69, 71, 76). KMnO4 showed reactivity at all residues in the oligomer but a very prominent cleavage was observed at the cytosine residue in the 14th CAGG repeat of d(CAGG)26 (data not shown). However, in the case of d(CCTG)26, KMnO4 equally modified the thymine residues of each CCTG repeat (data not shown) indicating again the lack of any specific stable secondary structure. Thus, the chemical modification studies further confirmed the results obtained from the enzymatic probing studies that indicate that d(CAGG)26, in contrast to d(CCTG)26, preferentially forms a stable hairpin structure.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
DM2 is caused by the expansion of a (CCTG·CAGG) tetranucleotide repeat (1) from a normal range of ~26 to ~11,000 repeats. Using an African green monkey kidney cell system, we demonstrate that replication-based slippage contributes to the genetic instability of these repeats. The instability was dependent on the length of the repeats, their orientation, and their distance from the replication origin. Furthermore, synthetic oligonucleotides representing the unpaired repeat regions during replication were analyzed with chemical and enzymatic probes revealing the preferential formation of hairpin structures by the CAGG oligomer, further supporting our structure-mediated replication-slippage model (Fig. 8).



View larger version (14K):
[in this window]
[in a new window]
 
FIG. 8.
Model for the orientation dependent instability of (CCTG·CAGG)n repeats in mammalian cells. A, the presence of the CAGG repeats on the leading strand template and on the newly synthesized products of the lagging strand template (orientation II) can give rise to expansions because the CAGG repeats on the nascent strand can form folded-back secondary structures by strand slippage and thus generate expansions. B, the presence of the CCTG repeats on the leading strand template and on the newly synthesized product of the lagging strand template (orientation I) preferentially give rise to deletions because the CAGG repeats on the lagging strand template can form slipped structures that may be bypassed during synthesis (see "Discussion").

 
Prior genetic instability studies on (CTG·CAG), (CGG·CCG), and (GAA·TTC) triplet repeats (811, 1618, 21, 39, 86, 9193, 99) have shown a length-dependent effect on instability. Similarly, the longest (CCTG·CAGG) repeat containing plasmids were the most unstable, especially after 2 weeks of culturing in COS-7 cells, generating both expansions and deletions when cloned proximal to the SV40 origin. However, when the repeats were cloned distal to the SV40 origin, deletions were predominant. Thus, for longer repeating sequences, an increase in the genetic instability was observed, as expected.

To study the propensity of the (CCTG·CAGG) repeats to form non-B DNA structures, we analyzed the synthetic oligonucleotides d(CAGG)26 and d(CCTG)26 with chemical and enzymatic probes. The modification patterns obtained for d(CAGG)26 showed the formation of a folded back, hairpin structure. Several different fold-back structures are possible (Fig. 7). d(CAGG)26 can form hairpin structures with 6, 5, 4, or 3 unpaired residues in the terminal loop (Fig. 7, A–D, respectively). This would result in different Watson-Crick and non-Watson-Crick pairing schemes in the putative stems. For the species shown in Fig. 7, B and C, there would be no Watson-Crick pairing making them less stable than the structures shown in Fig. 7, A and D. However, the structure formed with 6 unpaired residues in the terminal loop (Fig. 7A) is more stable than that with 3 residues in the loop (Fig. 7D) because of the two non-Watson-Crick A·G pairs that flank the Watson-Crick G·C pairs (Fig. 7A) compared with the A·A and G·G oppositions that flank the G·C pairs in the structure in Fig. 7D. Although the stability of the mispairs is context dependent, an approximation of the pairing stabilities is G·C > A·T > G·G > G·T {approx} G·A > A·C+ > T·T {approx} A·A {approx} C·C+ > T·C ≥ A·C ≥ C·C (100106). Previous studies have also indicated the greater stability of the loop of a hairpin with an even number of residues compared with an odd-numbered loop (107, 108), as well as the favored closing of the loop with a 5' pyrimidine and a 3' purine pair (107, 108). Thus, our chemical and enzymatic probe results showed the formation of the structure (Fig. 7A) that the above stated rationale predicts to be most stable. Furthermore, DEPC modified both the adenines and guanines at the N7 positions showing that this position is not involved in hydrogen bonding and that the G·A pairs are of the G(anti)·A(anti) or A(anti)·G(anti) types (102, 104, 109).



View larger version (31K):
[in this window]
[in a new window]
 
FIG. 7.
Theoretical hairpin structures formed by d(CAGG) and d(CCTG) oligonucleotides. A portion of the various folded-back structures that can be formed by the d(CAGG) and d(CCTG) oligomers are shown. Hairpin structures with 6, 5, 4, or 3 residues in the terminal loop formed by slippage and misalignment of 0, 1, 2, or 3 nucleotides are shown for the d(CAGG) (A–D) and d(CCTG) (E–H) oligonucleotides. The filled circles between the two DNA strands indicate Watson-Crick pairing and the open circles denote non-Watson-Crick pairing. Although only 10 CAGG and CCTG repeats are shown for simplicity, the same types of loops and pairing arrangements would apply to oligonucleotides of any length. All oligomers are numbered from their 5'-ends.

 
Alternatively, d(CCTG)26 did not show the formation of any stable secondary structure that could be detected under our analytical conditions. We believe this is because of the extremely unstable pairing schemes as shown in Fig. 7, E–H. The structures with 4 and 3 unpaired residues in the terminal loops lack any Watson-Crick pairs (Fig. 7, G and H, respectively) making them less stable than the structures with 6 and 5 residues in the loops (Fig. 7, E and F, respectively). Furthermore, the T·T mismatches as well as the C·T mispairs are extremely destabilizing (105, 110) as stated earlier. Thus, the possible base pairing arrangements reveal a greater probability that the CAGG strand forms a more stable structure than the CCTG strand.

A distinct orientation effect was observed for repeats cloned into the SmaI site where orientation II was significantly more unstable than orientation I. For the triplet repeat sequences, instability was defined primarily as the loss of the full-length progenitor fragment. Also, deletions were the predominant products of instability. In case of the tetranucleotide repeats, however, instability refers to both expansions and deletions. Thus, upon initial consideration, our results appear to be similar to the results with the triplet repeat sequences, including CTG·CAG, CGG·CCG, and GAA·TTC, in which orientation II was shown to be more unstable (9, 10, 16, 18, 19, 21, 22, 39, 52, 85, 86, 95, 96). However, the CCTG sequences are genetically unstable in the orientation prone to expand (orientation II) (Fig. 8) as compared with the triplet repeat sequences that are unstable in the orientation prone to delete (orientation II).

Fig. 8 shows a mechanism for the orientation-dependent instability of the (CCTG·CAGG) sequences. In 1995, Kang et al. (16) proposed the original model to explain the orientation dependence of (CTG·CAG) sequences associated with DM1. For this triplet repeat sequence, the CTG strand forms a more stable secondary structure than the CAG strand. However, for the DM2 sequence, the CAGG strand (on either the nascent lagging strand or the lagging strand template) forms a more stable structure compared with the CCTG strand, generating expansions and deletions, respectively, as shown herein. Thus, the tetranucleotide repeats cloned in orientation II are prone to expand, whereas those cloned in orientation I are prone to delete (Fig. 8). This is the first case of a "reversed" orientation behavior (9, 10). However, similar to the triplet repeats, the capability of one of the two DNA strands to form a quasistable folded structure, in our case the ability of the CAGG strand to adopt a hairpin structure, explains this reversed behavior.

The distance of the triplet repeats from the origin of replication plays an important role in the genetic instabilities of these repeats (16, 19, 22, 111).2 In our studies, the (CCTG·CAGG) repeats cloned proximal to the SV40 origin were more unstable than those cloned distal to the same origin. Several hypotheses were described to explain this effect. If an Okazaki initiation zone is a region of single-stranded template DNA, averaging 135 to 145 nucleotides (in eukaryotes) (22, 112), where priming of the Okazaki fragments occurs, then depending on where the repeats fall within the Okazaki fragments (i.e. at the 5' end or the 3' end), their ability to form secondary structures would differ thereby influencing the amount of instability observed (12, 22, 112). Alternatively, we speculate that because of almost continuous initiation events occurring at the origin, the repeats cloned close to the origin would be rendered single-stranded for a longer period of time thereby giving the repeats a greater opportunity to fold-back and form a stable secondary structure. Once replication has been initiated and the fork progresses away from the origin, the repeats lying distal to the origin do not have as great an opportunity to form these slipped structures on the leading strand. However, because of the single-stranded nature of the Okazaki fragments on the lagging strand, both expansions and deletions can occur depending on the stability of the hairpin structures formed by these repeats. The aberrant processing of the Okazaki fragments involving FEN-1 and DNA ligase has also been hypothesized to play an important role in generating genetic instabilities (21, 113118).

Furthermore, polymerase switching (PolI/PolIII) has been implicated in E. coli to contribute to the genetic instability of triplet repeats (16, 47). Polymerase switching has also been proposed to occur during eukaryotic replication where following synthesis of the RNA-DNA primer by the polymerase {alpha}-primase complex, replication is continued by polymerase {delta}, both on the leading and lagging strands (87, 119). Thus, polymerase switching may further contribute to the observed instability of the tetranucleotide repeats.

In the present study, we have assayed for replication-based instability; however, transcription through the repeats could also have an influence on the levels of observed instability (49, 93, 120123).3 The (CCTG·CAGG)n repeats in our experiments, cloned into the SmaI site, fall within the promoter region of the neomycin resistance gene that is transcribed. Because this gene is transcribed to elicit the drug resistance, a further increase in the amount of instability could be obtained for repeats cloned proximal to the SV40 origin. Also, the secondary structures formed by these repeats may cause both the transcription as well as the replication machinery to pause, as shown both in vitro and in vivo for triplet repeats (17, 20, 23, 122). This in turn could lead to activation of repair (10, 12, 21, 29, 30, 44, 45, 47, 48, 52, 94, 95, 124126) and recombination (2428) processes that, in the case of triplet repeats, are known to generate instability.

Thus, a complex interplay of replication, repair, recombination, and transcription may effect the massive expansions observed in DM2. In this study, we have specifically focused on replication and its role in generating the instabilities. However, ongoing work implicates recombination as a powerful mechanism that contributes to the instabilities of the tetranucleotide repeats.4 These data provide the first insights into an understanding of the molecular mechanisms of the tetranucleotide instabilities. Our ultimate goal is to understand the molecular processes causing these expansions to develop therapeutic strategies.


    FOOTNOTES
 
* This work was supported by National Institutes of Health Grants NS37554 and ES11347, the Robert A. Welch Foundation, and the Muscular Dystrophy (Seek a Miracle) Association (to R. D. W.) and National Institutes of Health Grant NS35870 and a grant from the Muscular Dystrophy Association (to L. P. W. R.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

To whom correspondence should be addressed: Center for Genome Research, Institute of Biosciences and Technology, Texas A&M University Health Science Center, 2121 W. Holcombe Blvd., Houston, TX 77030-3303. Tel.: 713-677-7651; Fax: 713-677-7689; E-mail: rwells{at}ibt.tamushsc.edu.

1 The abbreviations used are: DM2, myotonic dystrophy type 2; DEPC, diethylpyrocarbonate. Back

2 R. R. Iyer and R. D. Wells, unpublished data. Back

3 Mochmann, L., and Wells, R. D. (2004) Nucleic Acids Res. 32, 4469–4479. Back

4 R. Dere and R. D. Wells, unpublished data. Back



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Liquori, C. L., Ricker, K., Moseley, M. L., Jacobsen, J. F., Kress, W., Naylor, S. L., Day, J. W., and Ranum, L. P. (2001) Science 293, 864–867[Abstract/Free Full Text]
  2. Moxley, R. T., 3rd, and Ricker, K. (1995) Muscle Nerve 18, 557–558[CrossRef][Medline] [Order article via Infotrieve]
  3. Finsterer, J. (2002) Eur. J. Neurol. 9, 441–447[CrossRef][Medline] [Order article via Infotrieve]
  4. Mankodi, A., and Thornton, C. A. (2002) Curr. Opin. Neurol. 15, 545–552[CrossRef][Medline] [Order article via Infotrieve]
  5. Day, J. W., Ricker, K., Jacobsen, J. F., Rasmussen, L. J., Dick, K. A., Kress, W., Schneider, C., Koch, M. C., Beilman, G. J., Harrison, A. R., Dalton, J. C., and Ranum, L. P. (2003) Neurology 60, 657–664[Abstract/Free Full Text]
  6. Ranum, L. P., and Day, J. W. (2004) Am. J. Hum. Genet. 74, 793–804[CrossRef][Medline] [Order article via Infotrieve]
  7. Cummings, C. J., and Zoghbi, H. Y. (2000) Annu. Rev. Genomics Hum. Genet. 1, 281–328[CrossRef][Medline] [Order article via Infotrieve]
  8. Cummings, C. J., and Zoghbi, H. Y. (2000) Hum. Mol. Genet. 9, 909–916[Abstract/Free Full Text]
  9. Wells, R. D., and Warren, S. T. (eds) (1998) Genetic Instabilities and Hereditary Neurological Diseases, Academic Press, San Diego, CA
  10. Bowater, R. P., and Wells, R. D. (2001) Prog. Nucleic Acids Res. Mol. Biol. 66, 159–202[Medline] [Order article via Infotrieve]
  11. Sinden, R. R., Potaman, V. N., Oussatcheva, E. A., Pearson, C. E., Lyubchenko, Y. L., and Shlyakhtenko, L. S. (2002) J. Biosci. 27, 53–65[Medline] [Order article via Infotrieve]
  12. Cleary, J. D., and Pearson, C. E. (2003) Cytogenet. Genome Res. 100, 25–55[CrossRef][Medline] [Order article via Infotrieve]
  13. Lahue, R. S., and Slater, D. L. (2003) Front. Biosci. 8, 653–665
  14. Matsuura, T., Yamagata, T., Burgess, D. L., Rasmussen, A., Grewal, R. P., Watase, K., Khajavi, M., McCall, A. E., Davis, C. F., Zu, L., Achari, M., Pulst, S. M., Alonso, E., Noebels, J. L., Nelson, D. L., Zoghbi, H. Y., and Ashizawa, T. (2000) Nat. Genet. 26, 191–194[CrossRef][Medline] [Order article via Infotrieve]
  15. Lalioti, M. D., Scott, H. S., Buresi, C., Rossier, C., Bottani, A., Morris, M. A., Malafosse, A., and Antonarakis, S. E. (1997) Nature 386, 847–851[CrossRef][Medline] [Order article via Infotrieve]
  16. Kang, S., Jaworski, A., Ohshima, K., and Wells, R. D. (1995) Nat. Genet. 10, 213–218[Medline] [Order article via Infotrieve]
  17. Kang, S., Ohshima, K., Shimizu, M., Amirhaeri, S., and Wells, R. D. (1995) J. Biol. Chem. 270, 27014–27021[Abstract/Free Full Text]
  18. Shimizu, M., Gellibolian, R., Oostra, B. A., and Wells, R. D. (1996) J. Mol. Biol. 258, 614–626[CrossRef][Medline] [Order article via Infotrieve]
  19. Maurer, D. J., O' Callaghan, B. L., and Livingston, D. M. (1996) Mol. Cell. Biol. 16, 6617–6622[Abstract]
  20. Samadashwily, G. M., Raca, G., and Mirkin, S. M. (1997) Nat. Genet. 17, 298–304[Medline] [Order article via Infotrieve]
  21. Freudenreich, C. H., Kantrow, S. M., and Zakian, V. A. (1998) Science 279, 853–856[Abstract/Free Full Text]
  22. Cleary, J. D., Nichol, K., Wang, Y. H., and Pearson, C. E. (2002) Nat. Genet. 31, 37–46[CrossRef][Medline] [Order article via Infotrieve]
  23. Pelletier, R., Krasilnikova, M. M., Samadashwily, G. M., Lahue, R., and Mirkin, S. M. (2003) Mol. Cell. Biol. 23, 1349–1357[Abstract/Free Full Text]