|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 279, Issue 46, 47799-47807, November 12, 2004
A C-terminal Myb Extension Domain Defines a Novel Family of Double-strand Telomeric DNA-binding Proteins in Arabidopsis*![]() From the Department of Biochemistry and Biophysics, Texas A&M University, College Station, Texas 77843-2128
Received for publication, July 14, 2004 , and in revised form, September 7, 2004.
Little is known about the protein composition of plant telomeres. We queried the Arabidopsis thaliana genome data base in search of genes with similarity to the human telomere proteins hTRF1 and hTRF2. hTRF1/hTRF2 are distinguished by the presence of a single Myb-like domain in their C terminus that is required for telomeric DNA binding in vitro. Twelve Arabidopsis genes fitting this criterion, dubbed TRF-like (TRFL), fell into two distinct gene families. Notably, TRFL family 1 possessed a highly conserved region C-terminal to the Myb domain called Myb-extension (Myb-ext) that is absent in TRFL family 2 and hTRF1/hTRF2. Immunoprecipitation experiments revealed that recombinant proteins from TRFL family 1, but not those from family 2, formed homodimers and heterodimers in vitro. DNA binding studies with isolated C-terminal fragments from TRFL family 1 proteins, but not family 2, showed specific binding to double-stranded plant telomeric DNA in vitro. Removal of the Myb-ext domain from TRFL1, a family 1 member, abolished DNA binding. However, when the Myb-ext domain was introduced into the corresponding region in TRFL3, a family 2 member, telomeric DNA binding was observed. Thus, Myb-ext is required for binding plant telomeric DNA and defines a novel class of proteins in Arabidopsis.
Telomeres are the specialized nucleoprotein structures that comprise the natural ends of linear eukaryotic chromosomes and ensure their complete replication and stability (1, 2). In most eukaryotes, telomeric DNA is composed of tandem arrays of simple G-rich repeat sequences terminating in a single-strand 3'-overhang, which is maintained through the action of the telomerase reverse transcriptase (1). Both the double and single-strand regions of the telomere are coated with non-histone proteins that provide protection for telomeric DNA and regulate telomerase access to the chromosome terminus. Proteins that bind double-strand telomeric DNA are typified in vertebrates by TRF1 and TRF2 and in budding and fission yeast by Rap1 and Taz1, respectively (38). Human TRF1 (hTRF1)1 behaves as a negative regulator of telomere length; overexpression results in telomere shortening, whereas a dominant negative allele induces telomere elongation (9, 10). hTRF1 mediates telomere length control through interactions with other telomere-associated factors including tankyrase (11), TIN2 (12), PinX1 (13), and Pot1 (14). Although hTRF2 contributes to telomere length regulation (10), its major function is to conceal telomere ends from detection as double-strand breaks (15, 16). Inhibition of hTRF2 in cultured human cells results in the loss of the 3'-overhang and the formation of covalently fused telomeres (15). In addition, compromised hTRF2 function culminates in cell cycle arrest and ATM/p53-mediated apoptosis (16).
The functional domains of vertebrate TRF1 and TRF2 have been studied in some detail (17). The two proteins have similar molecular masses (5060 kDa), and resemble each other in domain structure. Although the N terminus is highly acidic in hTRF1 and highly basic in hTRF2, both proteins harbor a centrally located flexible hinge region called the TRF homology (TRFH) domain that is required for homodimer formation and interactions with other telomere-associated proteins (18). The most strongly conserved feature is a Myb/homeodomain-type helix-turn-helix DNA binding motif near the C terminus (5, 1922). The Myb domain is sufficient for specific binding to the telomeric DNA in vitro, but dimerization through the TRFH domain is required for TRF protein association with DNA in vivo (18). Bilaud et al. noted a sequence within the Myb domain of hTRF1 and hTRF2, VDLKDKWRT, that is also conserved in yeast double-strand telomere-binding proteins and dubbed it the telobox consensus (19). An NMR structure of the Myb motif from hTRF1 revealed specific contacts between amino acid residues within the telobox consensus and the human telomere repeat sequence TTAGGG (22).
Although the plant telomere repeat sequence, TTTAGGG, is closely related to that of humans (23), almost nothing is known about the protein composition of plant telomeres. Several proteins have been shown to bind double-stranded telomeric DNA in vitro (2430). From Arabidopsis these include several relatively small ( In this study we employed a BLAST search to identify Arabidopsis homologs of hTRF1 and hTRF2 using their Myb domains as the query. Although Arabidopsis harbors more than 100 genes with Myb domains (32), we found only 12 with a single Myb domain in their C terminus that contains the telobox consensus motif. We designated this group of genes TRF-like (TRFL). Here we provide a molecular characterization of the TRFL proteins. Our data reveal that the TRFL genes encode two distinct families of proteins that differ dramatically in their amino acid sequences, DNA binding properties, and protein interactions. Furthermore, we define a novel functional domain C-terminal to the Myb domain that is required for specific binding to duplex plant telomeric DNA.
Computer Search for Myb-containing Genes and Evolutionary Tree AnalysisThe Myb-domains of hTRF1, hTRF2, and Arabidopsis TRP1 were used in separate NCBI Blast searches to identify Arabidopsis thaliana genes predicted to encode proteins with a single Myb repeat at the C terminus. The telobox consensus motif was used as an additional criterion (19). Twelve TRFL genes were identified (see GenBankTM accession numbers in Table I). Multiple protein alignments were conducted using the Oxford Molecular Group's sequence analysis software MacVector 7.0 (Accelrys, San Diego, CA). A tree was constructed using the neighbor-joining method (33) with the bootstrap mode (34).
Expression Analysis, Molecular Cloning, and Production of Recombinant Proteins in VitroReverse transcription PCR was performed for 35 cycles using SuperScript III reverse transcriptase (Sigma) to determine whether TRFL genes were expressed in different Arabidopsis tissues. For most of the TRFL genes, primers flanked intron junctions, ruling out the possibility of genomic DNA contamination. Full-length cDNAs were obtained by reverse transcription PCR using total RNA from flowers. Sequence analysis of the cloned cDNAs verified the NCBI annotations. For protein interaction studies, the full-length cDNAs of TRFL genes were subcloned into pET-28A and pCITE 4A. For DNA binding studies, PCR was used to amplify either full-length coding regions or the Myb domains and the adjacent C-terminal region using plasmids that contained full-length cDNAs (see Table I for specific residues amplified). Overlapping PCR (35) was applied to create the fusion constructs shown in Fig. 6A. A T7 phage promoter was introduced into all the constructs by PCR, and the products were used as templates for in vitro transcription by T7 RNA polymerase (Stratagene). Transcripts were translated in either rabbit reticulocyte lysate (Promega) or a wheat germ translation system (Promega) in the presence or absence of [35S]methionine (Amersham Biosciences). An aliquot of the labeled protein was used to verify the expected apparent molecular weight of translated protein products by SDS-PAGE and autoradiography.
Electrophoresis Mobility Shift Assay (EMSA)Telomeric DNA probes and competitors were 5'-AGCATGCAGC(TTTAGGG)8-3', 5'-AGCATGCAGC(TTTAGGG)6-3', 5'-AGCATGCAGC(TTTAGGG)4-3', and 5'-AGCATGCAGC(TTTAGGG)2-3' from Arabidopsis, 5'-AGCATGCAGC(TTTGGG)8-3' from Paramecium, and 5'-AGCATGCAGC(TTAGGG)8 from Homo sapiens. Double-strand DNA probes were obtained by annealing the corresponding complementary oligonucleotides. For gel shift assays, peptides consisting of the Myb domain and the remainder of the C terminus (see above) were tested for DNA binding activity. Preparation of the probes and binding reactions were carried out as described previously (36). Protein-DNA complexes were loaded on a 4% polyacrylamide gel in 0.5x Tris borate-EDTA and subjected to electrophoresis for 4 h at 120 V. Dried gels were exposed for autoradiography. Co-immunoprecipitationFor each TRFL protein analyzed, two constructs were made, one with a T7 protein tag and one without. [35S]methionine-labeled non-tagged proteins or T7-tagged unlabeled proteins were synthesized in a TNT-coupled rabbit reticulocyte lysate translation system following the manufacturer's recommendations (Promega). Translation of T7-tagged proteins was verified in the presence of [35S]methionine on a small aliquot from the same master mix. T7-tagged and untagged radiolabeled proteins were combined and subjected to immunoprecipitation using agarose beads (Novagen) containing the T7 monoclonal antibody as described (37). Precipitate and supernatant fractions were analyzed by SDS-PAGE and autoradiography.
Identification and Expression of TRFL Genes in ArabidopsisWe performed a BLAST search to identify TRFL genes in Arabidopsis thaliana. Three query sequences were employed consisting of the Myb domains from human TRF1 and TRF2 and from Arabidopsis TRP1. Twelve genes were uncovered that encode proteins with a single Myb domain at their C terminus (Fig. 1 and Table I). As expected, the searches found TRP1 and TBP1 (26, 27), along with 10 uncharacterized genes that were designated TRFL110. With the hTRF1 Myb domain as the query, the most closely related sequence was the Myb domain for TRFL6 (E value = 3e05), whereas for hTRF2 the Myb domain of TRFL3 displayed the most similarity (E value = 3e06).
A conceptual protein alignment of the C-terminal Myb-containing region of TRFL proteins with the corresponding regions in hTRF1 and hTRF2 is shown in Fig. 1. The NMR structure of the Myb domain of hTRF1 bound to the human telomeric DNA repeat sequence reveals that the N-terminal arm of the first helix interacts with the TT sequence in the minor groove, whereas the third helix recognizes TAGGG in the major groove (22). The corresponding regions are well conserved among the Arabidopsis TRFL proteins and display extensive sequence similarity to hTRF1 and hTRF2 (Fig. 1A). The most highly conserved region within all of the TRFL proteins, VDLKDKWRT, lies within the telobox consensus. Evolutionary analysis indicated the presence of two distinct TRFL gene families (Fig. 1B). The TRFL family 1 includes TBP1, TRP1, TRFL1, TRFL2, TRFL4, and TRFL9, whereas the TRFL family 2 contains TRFL3, TRFL58, and TRFL10. The similarity within TRFL family 1 or family 2 members extends throughout their entire sequence, suggesting the DNA binding domains co-evolved with the remainder of the genes. Four of the TRFL genes reside in regions of the Arabidopsis genome that are known to be duplicated. These include TRP1 and TRFL1, which display 59% identity (70% similarity) and reside on chromosomes 5 and 3, respectively, and TRFL3 and TRFL6, which exhibit 52% identity (62% similarity) and are found in a duplicated region on chromosome 1. Although TBP1 and TRFL9 are not located in a duplicated region, they display a remarkably high degree of similarity (54% identity and 66% similarity) and, hence, may have been derived from a recent gene duplication. The same may also be true for TRFL2 and TRFL4 (30% identity and 40% similarity), although they share somewhat less conservation overall. Closer inspection of the predicted amino acid sequence for the TRFL proteins provided further evidence for distinct gene families. In all of the TRFL family 1 proteins, the region of amino acid conservation in the Myb domain extends further into the C terminus, creating a Myb extension (Myb-ext) domain (Fig. 1A). Interestingly, Myb-ext was absent from TRFL family 2 and from hTRF1 and hTRF2, which terminate immediately adjacent to the Myb domain. An additional region of substantial sequence conservation was detected in the central portion of TRFL family 1 members called the central domain (CD) (Fig. 2, A and B). This region bears no significant similarity to the TRFH domain in vertebrate TRF1/TRF2 proteins and is also absent in TRFL family 2. Aside from TRFL3 and TRFL6, which most likely represent a recent gene duplication, the remaining members of TRFL family 2 display no obvious sequence similarity outside their Myb domains.
Reverse transcription PCR analysis revealed that all of the TRFL genes are expressed in Arabidopsis (Fig. 3). Transcripts for each of the TRFL genes could be detected in all of the organs we examined (Fig. 3 and data not shown). Moreover, all of the genes were expressed at a relatively high level, with the exception of TRFL2 and TRFL4, which are both members of TRFL family 1. Although TRFL2 and TRFL4 were ubiquitously expressed, their transcripts were scarce and could only be observed by nested PCR (Fig. 3). Nevertheless, the constitutive expression of TRFL genes is consistent with a structural role in the telomere complex.
Proteins in TRFL Family 1 Can Form Homodimers and HeterodimersHuman TRF1 and TRF2 bind telomeric DNA as homodimers in vivo and in vitro (18, 38). Therefore, we tested whether recombinant TRFL proteins could form homodimers in vitro using co-immunoprecipitation experiments. For these studies, untagged [35S]methionine-labeled TRFL proteins were subjected to immunoprecipitation in the presence of the corresponding unlabeled T7-tagged protein using a T7 antibody (Fig. 4, A and B). Following immunoprecipitation, a homomeric interaction with a tagged protein will allow a radiolabeled but untagged protein to precipitate with the beads. Control reactions performed in the absence of tagged proteins showed no interaction between the T7 antibody and untagged proteins (Fig. 4, A and B, lane 2 in all panels). However, specific homomeric protein interactions were detected for TRP1, TRFL1, TRFL2, TRFL4, and TRFL9 (Fig. 4A, upper panels, and Table I). All of these proteins are members of TRFL family 1. In contrast, TRFL3, TRFL5, and TRFL6, members of TRFL family 2, did not exhibit the capacity for homodimerization in vitro (Fig. 4A, lower panels, and Table I).
Despite significant amino acid similarity in the TRFH domains of hTRF1 and hTRF2, steric hindrance prevents the formation of heterodimers (18). To determine whether TRFL proteins have the capacity to heterodimerize in vitro, additional co-immunoprecipitation experiments were performed with family 1 members TRP1, TRFL1, and TRFL9 (Fig. 4B). These data showed that TRP1 and TRFL1 can heterodimerize, as can TRP1 and TRFL9. Interestingly, although TRP1 is closely related to TRFL1, it bears only limited sequence similarity to TRFL9 (38.2% identity and 46.3% similarity). Thus, it is conceivable that functionally distinct TRFL proteins interact directly in vivo. DNA Binding Properties of TRFL ProteinsWe used EMSAs to see whether TRFL proteins bind telomeric DNA in vitro. Initially, several full-length recombinant proteins expressed in rabbit reticulocyte lysate were tested for DNA binding. As expected from a previous study (26), full-length TRP1 bound a duplex telomeric DNA probe consisting of eight TTTAGGG telomeric repeats (AtTR8) and formed a discrete protein-DNA complex that migrated into the gel, as well as complexes that stayed in the well (data not shown). Assays with other full-length TRFL proteins, including TRFL4 (Fig. 5A), produced complexes that failed to exit the well. However, such complexes were sequence-specific (Fig. 5A, left panel) and displayed length dependence in DNA binding (Fig. 5A, right panel). To alleviate the concern of protein aggregation and to further analyze protein-DNA interactions, we tested whether the isolated Myb domains from TRFL proteins would bind telomeric DNA in an EMSA. Previous studies showed that the corresponding regions of hTRF1, TRP1, and TBP1 were sufficient for specific interaction with duplex telomeric DNA in vitro (26, 27, 39). We expressed the corresponding region of TRFL proteins (Myb domain and the adjacent C-terminal region) in wheat germ extract, which we found yielded a higher amount of recombinant TRFL protein than did rabbit reticulocyte lysate. A summary of DNA binding results for the truncated TRFL proteins is shown in Table I.
All members of TRFL family 1 bound specifically to plant telomeric DNA (Table I and Fig. 5, B and C). Surprisingly, the minimal number of repeats required for binding varied among the different proteins. For example, TRFL1 449553 required a minimum of six telomere repeats for optimal binding (Fig. 5B, lane 6). Although a weak interaction was detected with probes consisting of four telomere repeats (Fig. 5B, lane 4), the complex appeared to be unstable, as no discrete band was observed. In contrast, binding assays with TRFL9 505620 yielded a more discrete DNA-protein complex with a probe containing four telomeric repeats (Fig. 5C, lane 4, and data not shown). Although the relative migration position of the DNA-protein complexes appears different for TRFL1 449553 and TRFL9 505620 (Fig. 5, B and C), slightly different running conditions were used in the two gels. When the complexes were analyzed side-by-side on the same gel, they migrated at approximately the same position (Fig. 5D). Competition experiments revealed that the association of TRFL family 1 proteins with plant telomeric DNA is specific (Fig. 5B, lanes 812 and Fig. 5C, lanes 1013, and data not shown). Competition was observed with probes consisting of double-stranded plant telomeric DNA, but not with nonspecific duplex DNA or duplex DNA comprised of human or Paramecium telomeric repeats (Fig. 5, AC, and data not shown). Assays with single-stranded telomeric DNA competitor also failed to compete with duplex plant telomeric DNA for binding (data not shown). Strikingly, no DNA binding was detected in the EMSA with TRFL family 2 members using either full-length proteins or isolated Myb domains (plus the remaining C terminus) (Table I, and see Fig. 6). As for TRFL family 1 members, all of the recombinant proteins generated from TRFL family 2 genes expressed well in the wheat germ translation system and were soluble (data not shown). Hence, the lack of telomeric DNA binding is likely to reflect inherent differences in the properties of the two TRFL families rather than protein misfolding. We conclude that, in contrast to the situation with human TRF1 (39), the capacity to bind to duplex plant telomeric DNA in vitro is not conveyed solely by the presence of a C-terminal Myb domain, because all of the TRFL proteins we examined have this feature. The Myb-ext Domain Is Necessary for Telomeric DNA Binding in VitroOne intriguing difference between TRFL family 1 and 2 members is the presence of the Myb-ext domain in family 1 (Fig. 1B). To examine the contribution of this region in telomeric DNA binding, we performed EMSA on several truncated versions of TRFL1 (Fig. 6A). TRFL1 449540 (Fig. 6A, construct C-2) contains the Myb domain and Myb-ext but lacks the remainder of the C terminus. TRFL1 449509 (Fig. 6A, construct C-3) includes only the Myb domain. EMSA showed that TRFL1 449540 formed a complex with telomeric DNA that resembled that of the TRFL1 449553 control (Fig. 6B, lanes 2 and 3). In contrast, no shifted complex was detected in the assay with TRFL1 449509 (Fig. 6B, lane 4). One explanation for the inability of TRFL1 449509 to bind telomeric DNA is that the C terminus is required for proper protein folding. To address this concern, we generated a construct in which the Myb domain of TRFL1 was fused directly to the C terminus of TRFL3, a member of the TRFL family 2 (Fig. 6A, construct C-4). No binding to telomeric DNA was detected with this construct (Fig. 6B, lane 5). Taken together, these data argue that the Myb-ext domain is required for binding to plant telomeric DNA. We next asked whether the Myb-ext domain was sufficient to confer telomeric DNA binding to a TRFL family 2 protein. For this experiment, a chimeric protein was generated that consisted of the TRFL3 Myb domain fused to the TRFL1 Myb-ext and C-terminal region (Fig. 6A, construct C-6). Strikingly, a DNA-protein complex was observed with this construct (Fig. 6B, lane 7). The capacity to bind telomeric DNA was not influenced by the residues C-terminal to the Myb-ext domain, as another chimeric construct (Fig. 6A, construct C-7) lacking this region also bound telomeric DNA specifically (Fig. 6B, lanes 812). This result not only indicates that the Myb-ext domain is required for binding to plant telomeric DNA but also demonstrates that, when placed next to a Myb domain of a TRFL family 2 protein, the Myb-ext domain is sufficient to convey telomeric DNA binding specificity.
TRF-like Proteins from ArabidopsisThe integrity of the nucleoprotein complex known as the telomere cap is crucial for genome stability and for cell proliferation in eukaryotes. Recent data indicate that plants display an extraordinary tolerance to telomere dysfunction as well as distinct mechanisms of telomere length regulation (40, 41). For example, mutations in the telomere-associated protein complex Ku70/80, which result in telomere shortening in yeast (42), leads to dramatic telomere elongation in Arabidopsis (4345). Unraveling these interesting evolutionary distinctions will require a greater understanding of telomere architecture in plants. In this study we used a genomic approach to search for Arabidopsis homologs of the vertebrate TRF1 and TRF2 genes. The defining feature of this class of proteins is a single C-terminal domain reminiscent of the c-Myb DNA binding domain in oncoproteins (18, 22). Among Myb-containing proteins, TRF1 and TRF2 are unique in that their Myb domains harbor an additional telobox consensus motif, which contributes to the specific recognition of human telomeric DNA. Therefore, we searched for genes harboring a single C-terminal Myb domain with a telobox consensus motif and identified 12 TRF-like genes. Because TRF1 and TRF2 are the only Myb-containing proteins known to bind directly to double-strand telomeric DNA in vertebrates, it was surprising to discover that Arabidopsis harbors so many TRFL genes. At least four TRFL genes (TRP1/TRFL1 and TRFL3/TRFL6) reside in regions of the Arabidopsis genome known to be duplicated. Two other pairs, TBP1 and TRFL9 and TRFL2 and TRFL4, may also reflect a recent duplication. Evidence for functional redundancy was obtained when we examined T-DNA insertion lines that disrupted the coding region in seven of the twelve TRFL genes (TRP1 and TRFL1, 2, 3, 4, 5, and 8). None of the single gene disruption mutants displayed defects in growth and development or showed perturbations in telomere length or genome stability.2 These data argue that at least a subset of the TRFL genes may have overlapping functions.
Nevertheless, the remarkable sequence divergence outside the DNA binding domain for many of the TRFL genes raises the distinct possibility of unique roles at telomeres or elsewhere in the Arabidopsis genome. The telomeric DNA-binding protein Rap1 from budding yeast exhibits pleiotropic functions and is required for telomere maintenance as well as for the regulation of gene expression at internal sites in the genome (46). Similarly, whereas the primary role of hTRF1 is associated with chromosome termini, TRF1 associates with interstitial blocks of telomeric DNA in some mammalian species (47). Arabidopsis also contains short stretches of interstitial telomeric DNA (typically one to two TTTAGGG repeats), predominantly located in promoter regions (48), that function in the control of gene expression (49, 50). Thus, some TRFL proteins may act in transcriptional regulation in a manner similar to Rap1. In this regard, it is noteworthy that TRFL family 1 members exhibit distinct minimal length requirements for telomeric DNA binding. For efficient in vitro binding, the Myb domain of TBP1 requires only two telomeric repeats, whereas TRP1, TRFL4, and TRFL9 require a minimum of four repeats and TRFL1 and TRFL2 need six repeats (this study) (26, 27). These apparent distinctions in telomere sequence recognition could potentially impact in vivo function. Proteins that bind efficiently to shorter stretches of telomeric DNA may preferentially act at promoters, whereas those that favor longer telomere tracts would have a higher probability of binding the long tracts of telomeric DNA associated with chromosome ends. Preliminary localization experiments indicate that all of the TRFL proteins accumulate in the Arabidopsis nucleus3; however more in-depth analysis will be required to determine which of these proteins associate specifically with chromosome ends in vivo.
Although all of the TRFL genes were identified on the basis of a C-terminal Myb-like domain, evolutionary tree analysis and inspection of the deduced protein sequence alignment indicates that these gene encode two distinct classes of proteins (Fig. 1B). In addition to the Myb domain, all of the proteins in family 1 possess two other highly conserved regions, a Myb-ext domain and another centrally located CD. The CD was previously noted by Yang et al., who suggested that it might serve as a substrate for ubiquitination as it contains a region similar to the ubiquitin domain (29). For vertebrate TRF1 and TRF2, the only conserved region outside the Myb-domain is the TRFH domain, which not only facilitates the formation of homodimers but also interacts with other telomere-associated factors such human Rap1 and Tin2 (12, 51). Homodimerization of hTRF1 and hTRF2 is required for association with telomeric DNA in vivo, but these proteins cannot form heterodimers (18). In contrast, Arabidopsis TRFL family 1 proteins form both homodimers and heterodimers in vitro (Fig. 4). Whether the CD in TRFL family 1 proteins serves as an interaction interface analogous to the TRFH domain in hTRF1 and hTRF2 remains to be determined. However, the capacity to form both homodimers and heterodimers in vitro suggests that TRFL family 1 members could participate in complex structural and functional regulation in vivo. A Novel Domain Is Necessary for TRFL Proteins to Bind Plant Telomeric DNAOne of the most striking differences among TRFL family 1 and family 2 proteins is the presence of a highly conserved Myb-ext domain in family 1. Interestingly, maize IBP1 (initiator-binding protein 1 and the parsley BPF1 (BoxP-binding factor 1 (BPF1) also harbor a Myb domain (52, 53) with an adjacent region having striking similarity to Myb-ext.4 IBP1 is known to interact at the transcription start site of the Shrunken promoter containing a perfect telomeric repeat, AGGGTTT, whereas BPF1 binds a series of GT-rich motifs. Based on their similarity to the TRFL family 1 from Arabidopsis, we predict that these proteins may have the capacity to efficiently bind longer stretches of plant telomeric DNA.
It is intriguing that the Myb domain of hTRF1 is sufficient for binding human telomeric repeat DNA in vitro, but this is not the case for TRFL proteins from Arabidopsis. Our data demonstrate that this latter class of proteins requires a more extended domain for telomeric DNA interactions. We found that Myb-ext is both necessary and sufficient to allow the Myb domains of TRFL proteins to bind double-strand plant telomeric DNA. Precisely how Myb-ext contributes to telomeric DNA binding is unclear. One possibility is that it stabilizes the essential protein contacts between the Myb domain and telomeric DNA. In eukaryotes a Myb-like domain is a defining feature of double-strand telomeric DNA-binding proteins. The large number of such proteins in Arabidopsis confounds the identification of bona fide telomere-associated factors. Although the Myb domains of TRFL family 2 proteins display even greater sequence similarity to the corresponding region of hTRF1 and hTRF2 than do TRFL family 1 proteins, TRFL family 2 proteins fail to bind plant telomeric DNA in vitro. We cannot exclude the possibility that TRFL family 2 proteins associate with telomeres in vivo through protein interactions in a manner similar to that of human Rap1 (51). Furthermore, it is evident that TRFL family 1 proteins are not the only factors capable of specific binding to double-strand plant telomeric DNA in vitro. Whereas the architecture of TRFL family 1 proteins closely resembles that of the known double-strand telomere-binding proteins in higher eukaryotes, the SMH (single Myb histone)-like proteins from maize and Arabidopsis plants harbor a single Myb domain in their N terminus, lack an associated Myb-ext domain, and yet still bind telomeric DNA in vitro (30, 31). Thus, TRFL family 1 and SMH-like proteins apparently represent two distinct classes of telomeric DNA-binding proteins. Deciphering the functions and interactions of such proteins may provide new insight into the structure and complex regulation of the telomere cap.
* This work was supported National Institutes of Health Grant GM65383 and National Science Foundation Grant MCB0349993) (to D. E. S.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 The abbreviations used are: hTRF, human TRF; TRFH, TRF homology; TRFL, TRF-like; CD, central domain; EMSA, electrophoresis mobility shift assay; Myb-ext, Myb extension.
2 L. Vespa, Y. V. Surovtseva, Z. N. Karamysheva, and D. E. Shippen, unpublished data.
3 L.Vespa, N. Kato, E. Lam and D. E. Shippen, unpublished data.
4 Z. N. Karamysheva and D. E. Shippen, unpublished data.
We thank Tom McKnight and members of the Shippen lab for many helpful discussions. We are also grateful to Tom McKnight, Carolyn Price, and Jeff Kapler for critically reading the manuscript.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||