![]()
|
|
||||||||
J. Biol. Chem., Vol. 279, Issue 50, 52353-52360, December 10, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



**

From the
Laboratory of Developmental Genetics and Imprinting, Developmental Genetics Programme, The Babraham Institute, Cambridge CB2 4AT, United Kingdom, the ¶DNA Editing Laboratory, Cancer Research UK, Clare Hall Laboratories, South Mimms EN6 3LD, United Kingdom, and the ||Protein and Nucleic Acid Chemistry Division, Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 2QH, United Kingdom
Received for publication, July 8, 2004 , and in revised form, September 7, 2004.
| ABSTRACT |
|---|
|
|
|---|
T transition mutations in methylated DNA, or in conjunction with repair of the T:G mismatch, to demethylation. The Aid and Apobec1 genes are located in a cluster of pluripotency genes including Nanog and Stella and are co-expressed with these genes in oocytes, embryonic germ cells, and embryonic stem cells. These results suggest that Aid and perhaps some of its family members may have roles in epigenetic reprogramming and cell plasticity. Transition in CpG dinucleotides is the most frequent mutation in human genetic diseases, and sequence context analysis of CpG transitions in the APC tumor suppressor gene suggests that DNA deaminases may play a significant role in tumor etiology. | INTRODUCTION |
|---|
|
|
|---|
In addition to Aid and Apobec1, there are other members of this deaminase family, including Apobec2 and Apobec3g. Some of these can mutate DNA in E. coli (12) and single-stranded DNA in vitro (13-15). Apobec3g is also known as the cellular Cem15 protein and is critical for innate immunity to retroviruses, partly by mutating viral single-stranded DNA (14). Thus Aid and Apobec3g (and perhaps other members of the family) have roles in acquired and innate immunity by mutating cytosine in DNA. An ancestral role of DNA deaminases in protecting the organism against invading DNA may hence have been adapted to mutating endogenous DNA to better combat pathogens (1).
Another mechanism for the protection of the genome against invading DNA such as transposable elements is by methylating their DNA at CpG dinucleotides (16). This leads to transcriptional silencing of transposons so that they cannot continue to spread. The methylation of cytosine also increases the rate of mutation of C (17) so that methylated transposons accumulate mutations. It is generally thought that this occurs by spontaneous deamination (17), because enzymes that can deaminate 5-methylcytosine (5meC) have not been identified. As a result the CpG dinucleotide has become depleted in species with CpG methylation. In addition to its role in genome defense, methylation is important for epigenetic gene regulation. This includes parental imprinting, X chromosome inactivation, and possibly more general gene regulatory mechanisms in development and differentiation (18, 19).
Recently it has been found that genomic patterns of DNA methylation are reprogrammed genome-wide in early embryos and in primordial germ cells (19-21). Reprogramming may be necessary for resetting the epigenetic state of the gametic genomes so that totipotency is restored to the embryonic genome. Epigenetic reprogramming involves demethylation, which can occur passively as a result of DNA replication in the absence of DNA methyltransferase 1 (16, 18-21). However, the dramatic demethylation of the paternal genome in the zygote before replication of DNA (22-24), and the rapid demethylation in primordial germ cells (25, 26) (PGCs) are likely to take place by a replication-independent mechanism. A number of mechanisms have been proposed for the loss of methylation (27), involving direct demethylation (28), oxidative demethylation (29), 5meC base excision repair (30), or nucleotide excision repair (27). However, no enzyme activity has been convincingly identified that can initiate these reactions (31-33).
We have asked whether cytosine deaminases can deaminate 5meC in DNA. We found that Aid and Apobec1 have 5meC deaminase activities, resulting in a thymine base opposite a guanine. If this mismatch is repaired, a methylated cytosine is replaced by an unmethylated one. If it is not repaired, it results in a C
T transition mutation. The expression of Aid has previously been thought to be limited to lymphoid tissues, but embryonic tissues were not examined (34). Unexpectedly we found that Aid and Apobec1 are colocalized within a cluster of pluripotency genes and are expressed in oocytes and primordial germ cells, which undergo epigenetic reprogramming. This suggests new roles for DNA mutators in the physiology and pathology of DNA methylation.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
-D-galactopyranoside-inducible Sss I expression cassette from the pCAL7 (New England Biolabs) plasmid was transferred into pACYC177 (NEB) by standard cloning techniques (details on request). The resulting plasmid, p15ASss I Kan, was transformed into a methylation restriction-deficient E. coli, ER1821 (Elisabeth Raleigh, NEB) (F-endA1 thi-1 supE44 spoT? rfbD? mcrA5 (mrr-hsdRMA-mcrB)1-::IS10) and checked for isopropyl 1-thio-
-D-galactopyranoside-inducible methylation of DNA. We ensured the endogenous rpoB C1586 site could be methylated by bisulfite sequence analysis as described (23, 25) using rpoB-specific primers for bisulfite converted DNA (gttgttttttttgatagtagataggtagtg, ctcaatttataatccaaaacaacc). To quantitate the percentage methylation of a particular site, we isolated plasmid DNA from three independent cultures after induction of Sss I and digested them with CpG methylation-sensitive SnaBI. Digested or control undigested plasmids were transformed into a methylation non-restricting E. coli (ER1821). From this we were able to assess that in vivo methylation by Sss I caused protection of the restriction site on 82% of plasmids (unmethylated digested plasmid gave <1% transformation efficiency; see Fig. 1A). This was also consistent with the extent of methylation determined by agarose gel analysis of the digested plasmids (data not shown).
|
-D-galactopyranoside for 24 h in culture. Rifampicin-resistant colonies were selected on 100 µg/ml rifampicin LB agar plates containing ampicillin (Aid plasmid) or kanamycin (Sss I plasmid) as appropriate. The median number of rifampicin resistant-colonies/number of viable cells was determined for each experiment consisting of 20 cultures/category (Aid and Sss I, vector and Sss I, Aid, vector). The CpG to CpA transition at the target site in rpoB was identified by a PCR assay that differentiates between the single nucleotide G or A at this position (1586). The screening primer (rpoB G
A: ctgagattacgcacaaacgtca) with a 3'-A can only anneal when a G
A transition has occurred at nucleotide position 1586 (1586G
A), giving an expected product of 429 bp using rpoB reverse (caccgacggataccacctgctg) as the second primer. For the locus control PCR, forward (rpoB forward: ttggcgaaatggcggaaaacc) and reverse primers (rpoB reverse) always produce a band of 627 bp, irrespective of mutations, acting as an internal control for success of the PCR. The PCR screen was performed on two randomly selected rifampicin colonies from each of the cultures within a category. The frequency of the 5meC transition was calculated from the number of 5meC transitions (at rpoB1586) to the total number of mutations. Preparation of Recombinant Aid and Apobec1Human Aid was subcloned into a pET30 derived vector with a C-terminal His tag (plasmid sequences on request). Preparation of a rat Apobec1 expression vector was described previously (13). Production and purification of recombinant protein was as described (13).
UDG-based Deamination AssayThe assay was done as described previously (13) with minor modifications. Aid/Apobec1 samples (0.4-1 µl) were incubated at 37 °C for 15 min in 10 µl of buffer R (40 mM Tris, pH 8.0, 40 mM KCl, 50 mM NaCl, 5 mM EDTA, 1 mM dithiothreitol, 10% glycerol) with 2.5 pmol of 5'-biotinylated oligonucleotides, 3'-labeled with fluorescein. The oligonucleotides substrates SPM319-SPM330 are all based on 5'-biotin-ATAAGAATAGAATGAFFFFFA-ATGAATSSSSSATGAATAGTA-fluorescein-3' where F and S are the first and second motifs indicated in Fig. 2B. Oligonucleotides were purified on streptavidin magnetic beads (Dynal) and washed at 70 °C. Deamination was monitored by incubating the bead-immobilized oligonucleotides at 37 °C for 1 h with excess uracil-DNA glycosylase (1 unit, New England Biolabs). Resultant cleaved oligonucleotides were subjected to 15-20% PAGE-urea gel electrophoresis, and the fluorescent signal was detected with a FLA-5000 scanner (Fuji). Percent conversions were calculated from the scanned images, percent conversion = pixel volume cleaved product (minus background)/pixel volume cleaved product (minus background) + pixel volume substrate (minus background). Data for rate calculations were based on numerous experiments on SPM320 and SPM348 for C deamination, and SPM347 and SPM359 for the 5meC deamination. Other oligonucleotides used: SPM348ATTATTGTTATTAGCTATTTGTTTATTTGTTTATTTATTT-fluor, SPM351ATTATTGTTATTAGCGATTTGTTTATTTGTTTATTTATTT-fluor.
|
Semiquantitative RT-PCRTotal RNA was isolated from various mouse tissues using Qiagen miniRNAeasy reagents (Qiagen, Germany). Random hexamer and SuperscriptII (Invitrogen) reagents were used to make cDNA. Primers for Aid (forward cagggacggcatgagacct and reverse tcagccttgcggtcttcaca), Apobec1 (forward ctctgtcatgatctggatagtcacac and reverse catcgcagcaacataagctcc), and Hprt (forward cctgctggattacattaaagcact and reverse gtcaagggcatatccaa) were used to amplify a PCR product from cDNA using Promega HotStarTaq. The amplification cycles were as follows: Aid and Apobec1, 95 °C 15 min, then 23-32 cycles of 94 °C 30 s, 65 °C 30 s, and 72 °C 30 s; Hprt, 95 °C 15 min, then 20-29 cycles of 94 °C 30 s, 55 °C 30 s, and 72 °C 30 s. For the amplification of Aid for 35-44 cycles, a nested approach was used with the first 20 cycles with outer primers (outer forward tttctttaccatttcaaaaatgtccg and outer reverse aatagtctttgaaggtcatgatccc) at 95 °C 15 min, 20 cycles of 94 °C 30 s, 63 °C 30 s, and 72 °C 30 s, then 15-24 cycles as above for Aid. The products were resolved on a 2% agarose gel, stained, scanned on a Fuji FLA-3000, and analyzed using Aidi v3 two-dimensional densitometry software. For Fig. 3C the intensity of the Hprt signal at 29 cycles was arbitrarily set to 100 units, and the intensity of the Aid signal at 32 cycles is displayed for each tissue relative to Hprt.
|
| RESULTS |
|---|
|
|
|---|
We expressed the CpG specific methyltransferase Sss I in E. coli; bisulfite sequencing confirmed that the selectable rpoB gene was methylated at CpG residues including C1586 (not shown), and in vivo CpG methylation by Sss I was able to protect 82% of sites from restriction (Fig. 1A). We determined the relative frequency of deamination at the methylated C1586 in rpoB using rifampicin selection and a single nucleotide discriminating PCR assay (Fig. 1A). If the methylated C were not a substrate for Aid, there would be no increase in its mutation rate over that with Sss I alone. However, the mutation rate at the C1586 position of rpoB in three independent experiments was
8-fold higher in E. coli with both Sss I and Aid than in those having only Sss I (Fig. 1B, Table I). Thus this experimental system demonstrates Aid has a 5meC deaminase activity. It is not possible to directly compare the relative activities of Aid as a C deaminase (Aid versus vector background) to Aid as a 5meC deaminase (Aid and Sss I versus vector and Sss I), because the repair systems for U:G (without Sss I) and T:G (with Sss I) are different and their relative efficiencies are not known. The increase in frequency of mutation with expression of Sss I alone may be in part because of spontaneous deamination. Whether Apobec1 and Apobec3g can deaminate 5meC could not be tested in the E. coli assay, because they do not mutate the C1586 position.
|
DNA deaminases require particular sequence motifs as targets, on unmethylated DNA Aid prefers the sequence WRC (A/T, A/G, C) in vivo and in vitro (8, 15, 38), which we confirmed with a panel of oligonucleotides containing different target motifs (Fig. 2B). In a subset of these oligonucleotides all Cs were replaced by 5meCs, and we found that the WRC motif is still the preferred target (Fig. 2C), with a minor difference. Although in the unmethylated oligonucleotides all four possible WRC targets were deaminated to the same extent, once methylated, there does appear to be a preference for AGC (Fig. 2C, lane 1 versus lane 2, lane 4 first motif versus lane 8 first motif). From Fig. 2A we estimate that the activity of Aid on 5meC is
3-fold lower than on C in the context AGC (SPM347 lane 2, SPM348 lane 8). Oligonucleotides containing both methylated and unmethylated targets demonstrate that the sequence context of each cytosine can render the deamination reaction of 5meC equivalent to C (Fig. 2C, lane 11, AA5meC versus ATC), or 5meC preferred to C (Fig. 2C, lane 12, AGmeC versus ATC). For the analysis in Fig. 2C we used three times as much enzyme than in Fig. 2B. From Coomassie and Western analysis (data not shown) we estimate a turnover of 60-70 fmol/min/µg for unmethylated (SPM320 and SPM348) and 20-25 fmol/min/µg methylated C deamination (SPM347 and SPM359) based on a 15-min incubation of 2.5 pmol of substrate and an
40% conversion rate using 1 and 3 µg of Aid, respectively. Deaminase activity on 5meC by purified Aid has been observed previously (7).
Apobec1 has also been shown to have C deaminase activity but could not be tested for 5meC deaminase activity in the E. coli assay. We previously showed that in vitro Apobec1 has a preference for NTC (13, 15). Apobec1 was also able to deaminate 5meC, but the target preference was considerably altered to AC (Fig. 2D). Thus both Aid and Apobec1 have 5meC deaminase activity in vitro that is affected by the sequence context.
Aid and Apobec1 Genes Are Colocalized with Nanog and Stella and Are Expressed in Pluripotent TissuesAid is thought to be exclusively expressed in B cells undergoing class switch recombination or somatic hypermutation (34) and thus finding it expressed in oocytes in our initial screen was unexpected. However, bioinformatic analysis revealed a striking arrangement of the Aid and Apobec1 genes within a 200-kb cluster of pluripotency genes, including Nanog, Stella (Dppa3), and Gdf3 on mouse chromosome 6 (Fig. 3A). Nanog and Stella have important roles in embryonic stem (ES) cell identity (39, 40) and preimplantation development (41), respectively, and all three genes are expressed in pluripotent tissues. We therefore carried out semiquantitative RT-PCR analysis of Aid mRNA and found high levels of expression in oocytes and ovaries, and moderate levels in embryonic germ (EG) cells, ES cells, E12.5 PGCs (isolated from genital ridges using a green fluorescent protein-expressing transgene) that undergo methylation reprogramming (25, 26), and the genital ridges of E11.5-12.5 embryos containing PGCs (Fig. 3, B and C). Apobec1 mRNA was also found in ovaries, oocytes (albeit at lower levels than Aid), and ES cells (Fig. 3D). The expression profiles of the five genes in the cluster are summarized in Fig. 3E; all genes are transcribed in pluripotent tissues that can undergo epigenetic reprogramming (oocytes, EG/PGC, ES cells). The clustered organization of the five genes is also found in the human genome (data not shown). To what extent regulatory elements for expression in pluripotent tissues are shared between the genes needs to be determined. It is interesting to note that Stella and Gdf3 are also expressed in lymphoid tissues (42).
| DISCUSSION |
|---|
|
|
|---|
It is now firmly established that Aid, Apobec1, and Apobec3g can mutate cytosine in DNA in a variety of different systems, including mammalian cells (4-6, 12, 14). Because a considerable proportion of Cs in mammalian genomes is methylated in the CpG dinucleotide, it is of great importance to ask whether C deaminases can act on 5meC. Although E. coli cytosine deaminase can act on both free bases cytosine and 5-methylcytosine (43), no prediction is possible for the distantly related Aid/Apobec deaminases, which work exclusively in the context of DNA/RNA. A methyl group in the 5-position of cytosine can certainly lead to steric hindrance for enzymes that act on DNA (for example, restriction enzymes). On the other hand, the deamination reaction by Aid is thought to occur by nucleophilic attack on position 4 of the pyrimidine ring of cytosine (1), and this is not necessarily hindered by the methyl group in position 5. Our results clearly established that Aid and Apobec1 deaminate 5meC in single-stranded DNA. It is difficult to compare directly their relative activities on C and 5meC, but our findings suggest they are in the same order of magnitude with other factors likely to modulate activities in vivo. The difference between the rate of spontaneous deamination (44) of 5meC and the enzymatically catalyzed one is thus 5-6 orders of magnitude (calculated from Refs. 7 and 13).
The DNA deaminase activity of Aid poses a potential danger to the genome in cells in which the enzyme is expressed. Deaminated 5meC is particularly vulnerable to mutation, because the resulting T can escape correction if not immediately repaired while mispaired with G; U can always be recognized as an error in DNA. Indeed, the ectopic expression of Aid and of Apobec1 in mice causes cancer (45, 46), and the preferred target sequences of the C deaminases are found around many C transitions in oncogenes and tumor suppressor genes in human tumors (15). Our work reveals the preferred sequence context for Aid as a 5meC deaminase to be AG5meCG. We analyzed the sequence context of all CpG mutations leading to a premature stop codon of the APC tumor suppressor gene in colorectal cancer (47). The sequence context of the 27 possible CpGs is shown in Fig. 4A; yet in tumors, of 73 CpG mutations analyzed, 55 (75%) occurred in the preferred target for Aid, AG(C)G (Fig. 4B). This is an enrichment of over 25-fold over the expected frequency of random CpG mutations giving rise to a stop codon, and is consistent with the possibility that misregulated or mistargeted activity of Aid can cause CpG mutations in methylated DNA.
|
In vivo the activity and targeting of Aid is likely to be tightly regulated, probably involving control of nuclear trafficking (50, 51), sequence preference (15, 38), the requirement for single-stranded DNA, and chromatin modifications (52). Both transcription and chromatin remodeling (53) can lead to the exposure of single-stranded DNA. Thus the large scale chromatin remodeling at fertilization that removes protamines from the sperm genome and replaces them with histones is likely to lead to a widespread exposure of single-stranded DNA. Single-stranded DNA could also be exposed by transcription or chromatin remodeling occurring during reprogramming in PGCs.
It is possible that similar pathways play similar roles in organisms other than mammals. In the seed plant Arabidopsis, the DNA glycosylase Demeter is required for the expression of the imprinted genes Medea and FWA and acts antagonistically to DNA methylation (54), suggesting that a similar system (perhaps involving yet unidentified 5meC deaminases) plays a role in epigenetic reprogramming in plants. In Neurospora, duplicated DNA becomes methylated and mutated at cytosines, which is likely to involve deamination (55). It is intriguing that the only enzymes thus far identified as being able to modify bases in DNA as part of a physiological process are DNA methyltransferases and DNA deaminases. Thus an ancestral system using methylases and deaminases to protect the organism from invading DNA may have evolved to take on additional roles in acquired immunity and the regulation of epigenetic information.
| FOOTNOTES |
|---|
Funded in part by a C. J. Martin Postdoctoral Fellowship. ![]()

Supported in part by the Arthritis Research Campaign. ![]()
** To whom correspondence should be addressed. E-mail: wolf.reik{at}bbsrc.ac.uk.
1 The abbreviations used are: UDG, uracil DNA glycosylase; 5meC, 5 methylcytosine; PGCs, primordial germ cells; EG, embryonic germ; ES, embryonic stem; E, embryonic day; mTDG, Methanobacterium thermoautotrophicum; RT, reverse transcription. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. M. Phillips and J. I. Goodman Identification of Genes that May Play Critical Roles in Phenobarbital (PB)-Induced Liver Tumorigenesis due to Altered DNA Methylation Toxicol. Sci., July 1, 2008; 104(1): 86 - 99. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Perez-Duran, V. G. de Yebenes, and A. R. Ramiro Oncogenic events triggered by AID, the adverse effect of antibody diversification Carcinogenesis, December 1, 2007; 28(12): 2427 - 2433. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Gallais, F. Demay, P. Barath, L. Finot, R. Jurkowska, R. Le Guevel, F. Gay, A. Jeltsch, R. Metivier, and G. Salbert Deoxyribonucleic Acid Methyl Transferases 3a and 3b Associate with the Nuclear Orphan Receptor COUP-TFI during Gene Activation Mol. Endocrinol., September 1, 2007; 21(9): 2085 - 2098. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. T. Freberg, J. A. Dahl, S. Timoskainen, and P. Collas Epigenetic Reprogramming of OCT4 and NANOG Regulatory Regions by Embryonal Carcinoma Cell Extract Mol. Biol. Cell, May 1, 2007; 18(5): 1543 - 1553. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Phillips, Y. Yamamoto, M. Negishi, R. R. Maronpot, and J. I. Goodman Orphan Nuclear Receptor Constitutive Active/Androstane Receptor-Mediated Alterations in DNA Methylation during Phenobarbital Promotion of Liver Tumorigenesis Toxicol. Sci., March 1, 2007; 96(1): 72 - 82. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Larijani, A. P. Petrov, O. Kolenchenko, M. Berru, S. N. Krylov, and A. Martin AID Associates with Single-Stranded DNA with High Affinity and a Long Complex Half-Life in a Sequence-Independent Manner Mol. Cell. Biol., January 1, 2007; 27(1): 20 - 30. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kress, H. Thomassin, and T. Grange Active cytosine demethylation triggered by a nuclear receptor involves DNA strand breaks PNAS, July 25, 2006; 103(30): 11112 - 11117. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Cattoretti, M. Buttner, R. Shaknovich, E. Kremmer, B. Alobeid, and G. Niedobitek Nuclear and cytoplasmic AID in extrafollicular and germinal center B cells Blood, May 15, 2006; 107(10): 3967 - 3975. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Morales-Ruiz, A. P. Ortega-Galisteo, M. I. Ponferrada-Marin, M. I. Martinez-Macias, R. R. Ariza, and T. Roldan-Arjona DEMETER and REPRESSOR OF SILENCING 1 encode 5-methylcytosine DNA glycosylases PNAS, May 2, 2006; 103(18): 6853 - 6858. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Babbage, C. H. Ottensmeier, J. |