|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 282, Issue 35, 25308-25313, August 31, 2007
Activation-induced Cytidine Deaminase-mediated Sequence Diversification Is Transiently Targeted to Newly Integrated DNA Substrates*
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Activation-induced cytidine deaminase (AID) is required for GCV/SHM and is thought to initiate these reactions by deamination of cytosine to generate uracil in DNA (4–8). In DT40 cells, the uracils are processed predominantly by uracil DNA-glycosylase (UNG), and the resulting abasic site is channeled into either homology-based repair using pseudo V elements to produce GCV events or error-prone repair to produce SHM events.
Little is known about the molecular characteristics that render a gene accessible to AID-mediated diversification processes. Mouse and cell line studies of SHM have demonstrated that transcription is essential, whereas the endogenous Ig promoters and the Ig variable region sequences themselves are not required (9–13). Non-Ig genes have also been reported to undergo SHM, albeit at frequencies much lower than Ig genes, and such mis-targeting of SHM is thought to contribute to B cell malignancies (14–18).
We tested the ability of a non-Ig expression cassette to undergo sequence diversification in Ig and non-Ig loci of DT40 cells and found that it underwent AID-dependent mutation in a locus-independent fashion to generate predominantly transition mutations at G/C bp. In addition, despite stable levels of transcription, the mutability of the cassettes was transient and waned in the weeks after integration. Our results indicate that high level transcription is insufficient for stably committing a gene to AID-mediated sequence diversification. Furthermore, the lack of specific targeting information in the cassette is overcome by it being a recently incorporated component of the genome. These results suggest that newly integrated DNA assumes an AID-permissive state similar to that of Ig genes, with implications for how mis-targeting of AID occurs.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
promoter with 1 kb of its downstream intronic sequence was PCR-amplified (EFNHEF, 5'-cttgaaaggagctagcattggctccg-3'; EFLHINR, 5'-ctctagagaagctttcacgacacc-3') from the pEBB plasmid (19) to replace the
-actin promoter in a puromycin selection cassette flanked by mutant loxP sites (20). The resulting EF1
-puro cassette was cloned into targeting vectors L and E. There are two versions of vector L, both flanked by a left homology arm spanning 3.5 kb upstream of the IgL promoter and a 1-kb right homology arm that starts at the IgL leader intron and extending into the J-C intron. Targeted integration of either L construct will insert the EF1
-puro cassette upstream of the promoter; one of the constructs contains a T7 promoter that will replace the endogenous IgL promoter whereas the other preserves the endogenous promoter. The homology arms of construct E include 2.2 kb upstream and downstream of the IgL enhancer, and targeted integration will replace the enhancer with the EF1
-puro cassette. Clones included in this study have the EF1
-puro cassette integrated in either transcriptional orientation. As no phenotypic differences have been observed in clones with different transcriptional orientations in the EF1
-puro cassette, the results are discussed without distinction between the two scenarios.
Cell Culture and Transfection—All CL18-derivative clones of DT40 cells were cultured as described previously (21). Transfections were performed with 25 µg of NotI-linearized plasmid or a BamHI fragment containing the EF1
-puro cassette being electroporated into cells using a Gene Pulser (Bio-Rad) at 580 V and 25 microfarads. 10–14 h after transfection, the cells were seeded at limited dilutions and selected with 0.5 µg/ml puromycin (Sigma). Stable transfectants were identified 6–8 days after transfection.
Southern and Northern Blotting—Genomic DNA was purified, and total cellular RNA was extracted from different cell types using RNA-Bee (Tel-Test), separated on agarose gels, and transferred onto GeneScreen Plus membranes (PerkinElmer Life Sciences). Blots were then hybridized with random hexamer-labeled (Roche Applied Science) DNA probes against puromycin, the IgL constant region (CCLF1, 5'-cccaccgtcaaaggaggagctg-3'; CCLR2, 5'-gacagtgacaggtagctgctggccatatac-3'), or GAPDH (CHGAPDHF, 5'-accagggctgccgtcctctc-3'; CHGAPDHR, 5'-ttctccatggtggtgaagac-3').
Flow Cytometry—Surface IgM expression was monitored by staining cells with a phycoerythrin-
-chicken IgM antibody (clone M-1, Southern Biotech) and analyzed on a FACScan (BD Biosciences).
Sequence Analysis—Regions of interest were PCR-amplified using high fidelity Pfu polymerase (Stratagene) or Phusion DNA polymerase (New England Biolabs) (EF1
, 5EBBP, 5'-gtaagtgccgtgtgtggttcc-3', and 3EBBP, 5'-gtgtggggaaactccatcgc-3'; IgL, CVLF1-5'-ccatggcctgggctcctctcctcctg-3', and CLA2-5'-gacagcacttacctggacagctg-3'). Gel-purified PCR products were treated with Taq polymerase, TA-cloned (Invitrogen), and sequenced using the PCR forward primers at the W. M. Keck Facility, Yale University School of Medicine. Two-sample t tests were applied to determine the statistical significance of differences in mutation frequencies between samples using Data Desk 6.2 (Data Description).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
promoter, which included a 1-kb region of EF1-
intronic sequences downstream of the transcription start site known to contribute to high level transcription (22). In mammalian systems, SHM occurs at highest levels within 1 kb downstream of transcription start sites (23), and thus by separating the puromycin resistance gene from the promoter, we hoped to minimize mutation of the gene and hence the loss of mutated clones during the selection process.
|
-puro cassette was incorporated into two IgL targeting constructs. The L construct was designed to insert the cassette upstream of the coding VJ region, whereas the E construct was designed to replace the IgL enhancer with the EF1
-puro cassette (Fig. 1b). Either the L or E construct was electroporated into the CL18 clone of DT40 cells, and stably transfected single cell clones were selected in puromycin, expanded, and analyzed for targeted or random integration by Southern blotting (supplemental Fig. 1, A and B). Southern blotting was also used to determine the copy number of the integrated cassettes, and only single-copy integrants were chosen for subsequent analyses. Curiously, the majority of clones generated were single-copy integrants regardless of whether targeted integration had occurred, possibly implying that even random integrants utilized homology-based mechanisms for construct integration. Northern blots demonstrated that all clones expressed the EF1
-puro cassette, although transcription levels were higher in targeted integrants than random integrants (see below); other clones discussed in subsequent sections were also analyzed by Northern blots to measure expression of the EF1
-puro cassette (see below, and data not shown).
Heterologous Cassettes Can Undergo AID-dependent Hypermutation in Ig and Non-Ig Loci—We cultured two targeted L clones (L4 and L6) and three targeted E clones (E7, E8, and E18) for 4 weeks after transfection and sequenced the region 125–650 bp downstream of the transcription start site (the sequenced region is entirely within EF1-
intronic sequences). Both L4 and L6 were generated using the L construct that contained the T7 promoter so that the IgL promoter was replaced to avoid possible promoter competition (Fig. 1b). As there are no homologous DNA elements to the EF1
-puro cassette in the DT40 genome, we expected the majority of sequence variations to arise as untemplated point mutations and not by gene conversion (GCV). Mutations were found in all five clones, and collectively, 39 mutations were observed in 164 sequences analyzed, translating into a mutation frequency of 4.5 x 10–4 mutations per base pair (Table 1, lines 1–5). This is not much different from the combined IgL GCV/SHM frequency in other clones in which the IgL locus was unaltered (see below, 7.5 x 10–4 events per base pair; Table 2, lines 4 and 5, before and after subcloning).5 These results indicate that the EF1
-puro cassette can be mutated when integrated into the IgL locus.
|
|
We next asked if the mutability of the EF1
-puro cassette was restricted to the IgL locus by performing a similar sequence analysis of three random integrant clones L1, L12, and E20 after 4 weeks of culture. The L construct used to generate L1 and L12 contained the IgL promoter (Fig. 1b). Interestingly, the three randomly integrated clones also exhibited a substantial accumulation of mutations in the EF1-
puro cassette (combined mutation frequency of 7.6 x 10–4 mutations per base pair; Table 1, lines 6–8 pooled together). To test whether the constructs were being mutated because of a genome-wide mutation phenomenon, we sequenced the endogenous EF1-
gene, reasoning that if the cells had a generally elevated mutation activity, other highly transcribed endogenous genes should become targets as well. This was not the case, as very few mutations were found in the endogenous EF1-
gene (0.45 x 10–4 mutations per base pair; Table 1, line 10), in line with mutation frequencies of non-Ig genes reported previously in DT40 cells (3). These data suggest that mutation activity was specific to the transfected constructs.
Because the L and E constructs contained IgL locus sequences in the homology arms (albeit nonoverlapping sequences), we wondered if those sequences were contributing to the mutability of the EF1
-puro cassette. To address this, we generated single-copy, randomly integrated clones of the EF1
-puro cassette without any flanking IgL sequences (NF cells). Sequence analysis of three independent clones after at least 4 weeks of culture revealed that the cassettes were mutated at levels higher than those in AID-deficient cells (as described below), although the mutation frequency of the NF construct was lower than L or E (Table 1, line 9). We conclude that the flanking IgL sequences in constructs L and E are not required for the introduction of mutations into the EF1
-puro cassette.
As is the case for untemplated point mutations in DT40 Ig variable regions (3), the mutations observed in the EF1-
puro cassette in L, E, and NF clones were almost entirely at GC base pairs, suggesting that AID might be responsible. To test this, the L construct was transfected into AID-deficient DT40 cells (4), and six independent clones were subjected to sequence analysis after at least 4 weeks of culture. We found only three mutations out of 185 sequences, yielding a mutation frequency (0.31 x 10–4 mutations/bp; Table 1, line 11) more than 10-fold lower than that observed in wild-type CL18 cells. We conclude that most of the mutations found in the EF1
-puro cassette in wild-type cells were dependent on AID.
Heterologous Cassettes Mutate Only Transiently in Ig and Non-Ig Loci—The above results indicated that the EF1
-puro cassette was mutated regardless of its integration site. This led us to wonder if mutation of the cassette had something to do with it being a piece of DNA recently incorporated into the genome, in which case mutability might be lost after some time in culture. This was investigated by subcloning cells that had been grown for 3 weeks after transfection; randomly chosen subclones were then cultured for 4 additional weeks, which allowed mutations to accumulate for the same amount of time as in the analyses above. Sequence analysis revealed that the random integrant clones L1, L12, and E20 lost their ability to mutate the EF1-
construct after subcloning, with combined mutation levels dropping from 7.6 x 10–4 to 0.42 x 10–4 mutations/bp (Table 2, lines 4–6). Intriguingly, even cassettes integrated in the IgL locus did not mutate after subcloning (Table 2, lines 1–3). We recently observed that the EF1-
promoter has a substantial defect in supporting GCV/SHM of IgL when used to replace the endogenous IgL promoter (13). Thus the inability of the EF1
-puro cassette to support stable mutation in the IgL locus could be due to a defect in the promoter itself. Nonetheless, any defects in mutation targeting could be overcome during the initial period post-integration.
To ensure that the loss of mutability of the EF1
-puro cassette observed after subcloning was not the result of a general defect in AID-mediated diversification, we examined the IgL locus for evidence of GCV/SHM. The CL18 clone carries a frameshift mutation in the IgL variable region that can be corrected by GCV to allow surface IgM expression, and this provides a convenient readout for ongoing GCV (21). Measurements of surface IgM expression revealed that the L1 and L12 clones gave rise to comparable percentages of IgM+ cells before and after subcloning after 4 weeks of culture (Fig. 2a). Sequence analysis corroborated results from IgM reversion assays and showed that these cells were still capable of performing robust IgL GCV/SHM after they were subcloned (Table 2, lines 4 and 5). These results demonstrate that the loss of mutability that occurs after subcloning is specific to the EF1
-puro cassette and cannot be explained by the loss of an activity or factor, such as AID, required for GCV/SHM.
|
-puro cassette because of decreased transcription after subcloning. To address this, we performed Northern blots on cells before and after subcloning, and we found that expression of the heterologous cassette was unchanged after subcloning (Fig. 2b, and data not shown). Therefore, the loss in mutability was not a result of changes in expression of the cassette.
The Origin of Mutations in the Heterologous Cassettes—We analyzed the distribution and pattern of mutations occurring in the EF1
-puro cassette by combining all mutations recorded before subcloning. Mutations were detected throughout the region sequenced with two 50-bp intervals (400–450 and 500–550 bp from the transcription start site) showing the highest frequencies of mutation (Fig. 3b). This distribution is similar to that reported in mammalian Ig genes and transgenes and does not correlate in any obvious way with the distribution of SHM hot spots (RGYW motifs) in the EF1-
intronic region sequenced (Fig. 3b). In addition, there was a heavy bias for transition mutations at G and C residues (67%; Fig. 3a). This is in sharp contrast to what has been reported for SHM events in the DT40 IgL locus, which are biased toward G to C and C to G transversion mutations (3, 24, 25). It is thought that uracil DNA-glycosylase (UNG) recognizes and processes uracils introduced by AID; in UNG–/– DT40 cells, GCV is abrogated, and mutation patterns shift largely toward G/C to A/T transitions, presumably because the uracils remain in the DNA during replication (6–8, 26). The pattern we observe is therefore consistent with a model in which mutation is initiated by AID-mediated deamination of C, with replication of the resulting uracil generating G to A and C to T mutations. The high prevalence of G/C transition mutations seen in our EF1
-puro cassette suggests that UNG is not efficiently recruited to the cassette and that the repair pathways that would normally follow AID-mediated deamination of Ig genes were not recruited to the EF1
-puro cassette. This suggests that targeting of AID and targeting of error-prone repair can be dissociated from each other.
|
Our results suggest that DNA newly introduced into cells has properties, acquired or intrinsic, that distinguish it from existing genomic DNA, and such properties allow it to be targeted by AID. Although it is possible that the cassette can be accessed by AID prior to integration, all mutations reported here occurred post-integration. This is because the clones underwent singlecell cloning prior to expansion and sequencing; therefore, the expanded culture all originated from the same cell harboring a single copy of the transfected transgene. Because only unique mutations within a clone were scored, such mutations could only have arisen post-integration. The EF1
-puro cassette might have particular features that contribute to its mutability in these experiments, most notably that it contains a strong promoter. The human EF1-
promoter used here includes most of the sequences from the EF1-
first intron, a region that contains numerous transcription factor binding sites that greatly enhance transcription (22). Further studies will be required to determine how promoter strength and structure contribute to the mutability observed in our experiments.
Numerous studies using mammalian B cell lines or non-B cell lines overexpressing AID have reported the mutation of non-Ig cassettes when integrated into non-Ig loci (10, 28–31). These studies have led to the idea that transcription itself is the major determinant for AID targeting. Our results initially seemed to be in good agreement with this idea. However, it became apparent after subcloning that the mutability of the cassette was only transient and had dropped to background levels by 3 weeks post-transfection; in contrast, transcription of the cassette was unaffected by subcloning. It is still possible that mutations continued to accumulate after subcloning at rates below the sensitivity of our sequencing assay. Our results indicate that high levels of transcription alone are not sufficient to stably commit a gene to being an efficient target of AID and suggest that high level transcription is not the primary targeting mechanism of the Ig genes.
The transient nature of mutations we observed in the non-Ig cassette suggests that there are molecular properties that distinguish newly integrated DNA from genomic DNA, which allow the newly incorporated DNA to be targeted by AID. These properties are lost gradually after construct integration. The DNA used in transfections was of bacterial origin and hence differs from eukaryotic genomic DNA in several ways, including the fact that it is not packaged in chromatin and that it is not methylated at CpG islands. The relatively small number of redundant mutations found within each clone (7% on average) and the relatively high proportion of sequences without a mutation (78%) indicate that the window of AID accessibility encompasses more than just the first few cell divisions. Our results suggest that intermediate chromatin states of the newly integrated DNA allow it to bypass the lack of specific AID targeting information, possibly because these states resemble a chromatin structure that makes Ig genes predominant substrates of AID. A link between chromatin structure and AID accessibility was previously suggested by the finding that a histone deacetylase inhibitor could expand the region of the IgH locus subject to SHM (32). The existence of molecular similarities between newly incorporated DNA and mutationally active Ig genes could reflect the evolutionary origin of AID as a member of a deaminase family shown to have antiviral activities in part by editing viral genomes (see Ref. 33 and reviewed in Ref. 34). The potential of non-Ig genes to mimic features of Ig genes that confer AID accessibility also provides a plausible explanation for how mis-targeting of SHM occurs.
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains supplemental Fig. S1. ![]()
1 Supported by the Irvington Institute for Immunological Research. Present address: NIA, National Institutes of Health, 5600 Nathan Shock Dr., Baltimore, MD 21224. ![]()
2 Supported by a National Science Foundation predoctoral fellowship. ![]()
3 An investigator of the Howard Hughes Medical Institute. To whom correspondence should be addressed: 300 Cedar St., Box 208011, New Haven, CT 06520-8011. Tel.: 203-737-2255; Fax: 203-785-3855; E-mail: david.schatz{at}yale.edu.
4 The abbreviations used are: GCV, gene conversion; AID, activation-induced cytidine deaminase; IgL, immunoglobulin light chain; SHM, somatic hypermutation; UNG, uracil DNA-glycosylase; GAPDH, glyceraldehyde-3-phosphate dehydrogenase. ![]()
5 The "combined IgL GCV/SHM frequency in other clones in which the IgL locus was unaltered" was calculated by dividing the total number of mutation events in clones L1 and L12, before and after subcloning (Table 2, lines 4 and 5), by the total number of sequences obtained for both clones, before and after subcloning. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Arakawa and J.-M. Buerstedde Activation-induced cytidine deaminase-mediated hypermutation in the DT40 cell line Phil Trans R Soc B, March 12, 2009; 364(1517): 639 - 644. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Lin, S.-i. Hashimoto, H. Seo, T. Shibata, and K. Ohta Modulation of immunoglobulin gene conversion frequency and distribution by the histone deacetylase HDAC2 in chicken DT40. Genes Cells, March 1, 2008; 13(3): 255 - 268. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kothapalli, D. D. Norton, and S. D. Fugmann Cutting Edge: A cis-Acting DNA Element Targets AID-Mediated Sequence Diversification to the Chicken Ig Light Chain Gene Locus J. Immunol., February 15, 2008; 180(4): 2019 - 2023. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Molecular and Cellular Proteomics |
| Journal of Lipid Research | ASBMB Today |