J. Biol. Chem., Vol. 279, Issue 47, 48883-48892, November 19, 2004
Identification of an Evolutionarily Conserved Domain in Human Lens Epithelium-derived Growth Factor/Transcriptional Co-activator p75 (LEDGF/p75) That Binds HIV-1 Integrase*
| ABSTRACT |
|---|
80 amino acid residues and is both necessary and sufficient for binding to HIV-1 IN. Strikingly, the integrase binding domain (IBD) is not unique to LEDGF/p75, as a second human protein, hepatoma-derived growth factor-related protein 2 (HRP2), contains a homologous sequence. LEDGF/p75 and HRP2 IBDs avidly bound HIV-1 IN in an in vitro GST pull-down assay and each full-length protein potently stimulated HIV-1 IN activity in vitro. LEDGF/p75 and HRP2 are predicted to share a similar domain organization and have an evident evolutionary and likely functional relationship.
| INTRODUCTION |
|---|
Mutations in HIV-1 IN display a wide range of phenotypes, affecting viral replication at the integration step (class I mutants), or causing various pleiotropic effects on virion morphogenesis and reverse transcription (class II) (8). The pleiotropic phenotypes of many IN mutants suggest that IN might have additional functions in viral replication. Thus, a role for IN in reverse transcription has been proposed (9). The complex phenotypes of class II mutants could potentially be explained by failure of the mutant INs to interact with viral reverse transcriptase and/or a host cell factor(s). A number of cellular and viral proteins have been suggested to participate in retroviral integration (for a review see Ref. 10). Furthermore, several proteins were reported to interact directly with HIV-1 IN, including viral reverse transcriptase (9), a component of the SWI-SNF chromatin-remodeling complex INI1 (11), uracil DNA glycosylase UNG2 (12), heat shock protein HSP60 (13), a DNA repair protein Rad18 (14), a Polycomb group protein EED (15), and lens epithelium-derived growth factor/transcriptional coactivator p75 (LEDGF/p75) (Ref. 16, for a review see Ref. 17). The exact roles of these proteins and their importance to viral replication have yet to be determined. However, when HIV-1 or feline immunodeficiency virus (FIV) INs are expressed separately from other viral proteins, endogenous host-cell LEDGF/p75 appears to be the dominant interactor, accounting for their nuclear/chromosomal accumulation (16, 18, 19). LEDGF/p75 protein markedly stimulated HIV-1 IN activity in vitro and was recently reported to be associated with functional HIV-1 PICs (16, 19). These data cumulatively suggest that LEDGF/p75 and possibly its homologs act as cellular host factors in retroviral replication, likely at the level of chromosomal targeting and/or integration of viral cDNA (17).
LEDGF/p75 belongs to a family of hepatoma-derived growth factor (HDGF)-related proteins (HRPs). Six mammalian family members are known: HDGF, HRP1, HRP2, HRP3, HRP4, and LEDGF/p75 (see Supplementary Table I) (20, 21). The characteristic feature of these proteins is a high degree of sequence homology within their N-terminal 90–95 residues, spanning the PWWP domain (InterPro accession number IPR000313) (20, 22). This domain, named for a conserved although not invariant Pro-Trp-Trp-Pro motif, extends for about 70 residues (22). Several dozen other PWWP domain-containing proteins have been described, including Wolf-Hirschhorn syndrome candidate 1 gene product WHSC1, mismatch repair protein MSH6, mammalian DNA methyltransferases Dnmt3a and Dnmt3b, and a plant homolog of ataxia telangiectasia-mutated protein kinase (22). PWWP domains seem to be distantly related to the Tudor and Chromo domains and are thought to mediate protein-protein interactions involved in regulation of chromatin structure (22, 23). Notably, the PWWP domain of Dnmt3b methyltransferase was recently shown to be essential for chromatin association of the protein (24). Recognizable orthologs of mammalian HRPs seem to be present in all vertebrates, although proteins containing PWWP domains are more widespread and occur throughout eukaryota, including yeast (22). Apart from their homologous N-terminal PWWP domains, HRPs show little sequence similarity.
Cellular functions of LEDGF/p75 and other HRPs have not been studied in detail. Like other PWWP domain-containing proteins, HRPs are imported into the nucleus (21, 22, 25, 26). Chromatin association has so far been demonstrated only for LEDGF/p75 (16, 18, 27). LEDGF/p75 was implicated in the regulation of expression of stress response-related genes, such as Hsp27, αB-crystallin, and antioxidant protein 2, presumably through binding to heat shock and stress-related regulatory elements in the promoters of the target genes (28, 29). Overexpression of LEDGF/p75 was reported to enhance cell viability under conditions of serum starvation, thermal, and oxidative stress (28, 30). During apoptosis, LEDGF/p75 is subject to cleavage by caspases that abolishes its activity as a cell-survival factor (30).
In this work, we studied the evolutionary conservation and domain organization of LEDGF/p75. We identified the HIV-1 IN binding domain (IBD) in this protein and found that another human protein, HRP2, can bind HIV-1 IN via a homologous domain and stimulate its activity in vitro.
| EXPERIMENTAL PROCEDURES |
|---|
The 3'-terminal part of the Danio rerio (zebrafish) HRP2 cDNA was obtained from the following ESTs: BI886683, AW154327, CD586524, BQ480774, and AL916221. The resulting contig was used to search through the zebrafish genome assembly (www.ensembl.org/Danio_rerio/) using nucleotide BLAST. The gene was identified on chromosome 22 and four exons containing the available portion of HRP2 cDNA sequence could be readily matched to the genomic sequence. Only one fragment predicted to encode a PWWP domain could be identified within the upstream 30 kb genomic sequence using PROSITE (us.expasy.org/prosite/). Assuming this sequence to represent the beginning of the HRP2 open reading frame (ORF), two sets of PCR primers were designed to amplify the cDNA: 5'-GTGGACGGATAGAAACG/5'-GAAGGAAGCCAAGGTGTG and 5'-AACGAGCAGAACGAGGAG/5'-GTTTGTGAGCATAAAAGGAG. Random-primed cDNA prepared from a sample of D. rerio kidney total RNA was used as template. PCRs with both primer pairs readily amplified fragments with the expected size of about 2.1 kb. Sequence analysis agreed with the chromosomal sequence and confirmed homology to human and mouse HRP2. The D. rerio HRP2 gene spans about 24.6 kb on chromosome 22; the coding region of the cDNA is derived from 18 exons. The complete ORF from the X. laevis HRP2 cDNA was reconstructed from the following ESTs: CA789647, AW643477, BJ039312, BJ626283, BJ619541, BF612425, BU916767, CA981423, BJ054046, BJ642669, BE678817, BE678996, BX853753, BF426654, BJ622360, CD363094, BJ639036, BG234506, BJ086552, BJ050372, and BG812065. Partial G. gallus HRP2 cDNA sequence was obtained from a contig of the following ESTs: BU261466, BU324804, BU392278, CD727797, BU351707, BU347241, AI981158, BU141299, BU428133, and BU236024.
Protein Secondary Structure Prediction and Sequence Analysis. Secondary structure prediction was done using the PROFsec and NORSp programs accessed through the PredictProtein server (cubic.bioc.columbia.edu) (31, 32). Hydrophobicity profiles (33) were analyzed using BioAnnotator software (InforMax Inc.). Multiple sequence alignments were done with AlignX (InforMax Inc.) using BLOSUM62 or GONNET matrices (34, 35). Homology between IBDs and the N-terminal domain of transcription factor IIS (TFIIS) was found using InterProScan release 7.2 (www.ebi.ac.uk/InterProScan/) (36) and SMART version 4.0 (smart.embl-heidelberg.de/) (37).
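As a rough illustration of what a hydrophobicity profile is, the sketch below computes a sliding-window mean hydropathy along a protein sequence. It assumes the Kyte-Doolittle scale and an arbitrary window size; the scale, the window, the function name `hydropathy_profile`, and the toy sequence are all illustrative assumptions and do not reproduce the BioAnnotator settings used in this work.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the settings actually
# used for the published analysis are not stated in the text).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=9):
    """Return the mean hydropathy of every full window along seq."""
    seq = seq.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


if __name__ == "__main__":
    toy = "MKKRDDEELLIVVAALLFFKKRR"  # hypothetical sequence, not LEDGF/p75
    profile = hydropathy_profile(toy, window=5)
    # 1-based start positions of windows whose mean index is above zero
    hydrophobic = [i + 1 for i, h in enumerate(profile) if h > 0]
    print("windows with mean hydropathy > 0 start at:", hydrophobic)
```

Windows whose mean value lies above zero correspond to the "mostly hydrophobic" stretches that such profiles are used to flag.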
DNA Constructs for Bacterial Protein Expression. All glutathione S-transferase (GST)-LEDGF/p75 fusion constructs used for protein expression in this work were based on the pGEX-4T1 vector (Amersham Biosciences). The full-length LEDGF/p75 ORF and its fragments were PCR-amplified using Pfu-Ultra DNA polymerase (Stratagene). Sense primers were designed to incorporate a BamHI restriction site followed directly by the first codon of the relevant LEDGF/p75 fragment; anti-sense primers contained a stop codon (TGA) directly following the last codon. PCR fragments were digested with BamHI and subcloned between BamHI and SmaI sites of pGEX-4T1.
To clone the putative IBD of HRP2, a fragment coding for residues 470–593 of the human protein was PCR-amplified from random-primed HeLa cDNA using Expand DNA polymerase and the following primers: 5'-GCGTGGATCCTCCGTGGAGGAGAAGCTGCAG/5'-CCCTCACTTGTCCTCCGCCTTCTCC. The resulting PCR fragment was digested with BamHI and ligated into BamHI/SmaI-digested pGEX-4T1. The full-length HRP2 ORF was PCR-amplified from cDNA clone MGC2641 (American Type Culture Collection) using primers 5'-GCGTGGATCCATGCCACACGCCTTCAAGCC and 5'-GCTCAGCTCTCCTCGTCCAGGGCCTC; the PCR fragment digested with BamHI was subcloned between BamHI and SmaI sites of pGEX-6P3, resulting in pCP-GSTHRP2. For expression of non-tagged full-length HRP2, the entire HRP2 ORF was amplified using primers 5'-TGCCACACGCCTTCAAGCC and 5'-GTTTTCACCGTCATCAC; the resulting PCR fragment was digested with XhoI and cloned between NdeI and XhoI sites of pRSETB (Invitrogen) (the vector NdeI terminus was filled in using T4 DNA polymerase), giving pCP-NatHRP2. Non-modified pGEX-4T1 was used to produce GST as a control for pull-down experiments. Plasmids pCPNat75 and pKB-IN6H were described previously (18).
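The primer design described for the GST fusion constructs (a BamHI site placed directly before the first codon in the sense primer, and a TGA stop codon directly after the last codon in the anti-sense primer) can be checked mechanically. The sketch below applies such a check to the primer pair quoted above for the putative HRP2 IBD, on the assumption that this pair follows the same design. The helper names are hypothetical, and the stop-codon test is a simple substring search rather than a frame-aware analysis.

```python
BAMHI_SITE = "GGATCC"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def codons_after_site(primer, site=BAMHI_SITE):
    """Split the primer sequence downstream of the site into full codons."""
    start = primer.index(site) + len(site)
    tail = primer[start:]
    usable = len(tail) - len(tail) % 3  # drop any incomplete last codon
    return [tail[i:i + 3] for i in range(0, usable, 3)]


# Primer pair quoted in the text for the putative HRP2 IBD.
SENSE = "GCGTGGATCCTCCGTGGAGGAGAAGCTGCAG"
ANTISENSE = "CCCTCACTTGTCCTCCGCCTTCTCC"

if __name__ == "__main__":
    print("BamHI site offset in sense primer:", SENSE.index(BAMHI_SITE))
    print("codons after BamHI site:", codons_after_site(SENSE))
    coding_view = reverse_complement(ANTISENSE)
    print("anti-sense primer on the coding strand:", coding_view)
    # Substring search only; it does not verify the reading frame.
    print("TGA present:", "TGA" in coding_view)
```

Read on the coding strand, the anti-sense primer ends in `...AAGTGAGGG`, which is consistent with a TGA stop codon placed after the final codon of the fragment.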
DNA Constructs for Expression in Human Cells. Plasmids pBHA-P75 and pCPHA-HRP2 expressed human LEDGF/p75 and HRP2, respectively, with N-terminal influenza hemagglutinin (HA) tags under the control of the human cytomegalovirus immediate-early promoter. To make pBHA-P75, the LEDGF/p75 ORF was PCR-amplified using 5'-CCGCGGATCCGACACCATGGCATACCCATACGACGTCCCAGACTACGCTACTCGCGATTTCAAACCTGGAGACC/5'-ATAAGAATGCGGCCGCCTAGTTATCTAGTGTAGAATCC and Pfu-Ultra DNA polymerase. The resulting amplicon was digested with BamHI and NotI and ligated into BamHI/NotI-digested pcDNA6-V5-HisB (Invitrogen). The BamHI/XhoI fragment of pCP-GSTHRP2 carrying the entire HRP2 ORF was re-cloned between BglII/XhoI sites of the pCPHA-NLS vector, fusing the 5'-end of the HRP2 ORF directly to the HA tag coding sequence, resulting in pCPHA-HRP2. The HA tag fusion vector pCPHA-NLS was made by first disrupting the BglII site in pcDNA6-V5-HisB by digesting it with BglII, filling in using Pfu polymerase, and religation, resulting in pcDNA6ΔBgl. A DNA fragment obtained by annealing synthetic oligonucleotides 5'-CGGGAAGCTTAGACACCATGGCCTACCCTTACGACGTGCCCGACTACGCCAGATCTG and 5'-GGTGGGATCCCTCCACCTTCCGCTTCTTCTTGGGAGGGCCAGATCTGGCGTAGTCG followed by extension using Sequenase Version 2.0 T7 DNA polymerase (Amersham Biosciences) was restricted with HindIII and BamHI and then ligated with HindIII/BamHI-digested pcDNA6ΔBgl. The resulting pCPHA-NLS vector encodes the HA tag fused to the simian virus 40 large T antigen nuclear localization signal (NLS), with an intervening BglII restriction site. The construct from Bram et al. (38) was used to express human cyclophilin A (CypA) with a C-terminal HA tag. This plasmid will be referred to here as pCypA-HA. The construct pED-FLAG-IN was used to express FLAG-tagged HIV-1 IN (39). All expression constructs were verified to be free of inadvertent mutations by sequencing.
Protein Expression and Purification. GST fusion proteins were produced in Escherichia coli B strain BL21. Shake-flask cultures grown to an optical density of 0.9–1.0 at 600 nm were induced for 3 h by addition of 0.5 mM isopropyl-thio-β-D-galactopyranoside at 37 °C. The temperature of induction was reduced to 28 °C to increase the stability of full-length GST-LEDGF/p75, GST-LEDGF-(1–325), and GST-LEDGF-(1–471). Bacteria were disrupted by sonication in 500 mM NaCl, 5 mM dithiothreitol (DTT), 1 mM EDTA, 0.2 mM phenylmethylsulfonyl fluoride (PMSF), 50 mM Tris-HCl, pH 7.2. Because the fusion proteins differed in terms of stability, expression levels, and solubility, purification procedures had to be adjusted accordingly. Briefly, GST fusions containing full-length LEDGF/p75 and fragments LEDGF-(1–471) and LEDGF-(1–325) were isolated from soluble fractions of bacterial lysates by adsorption onto glutathione-Sepharose (Amersham Biosciences). Proteins eluted with 25 mM reduced glutathione (Sigma-Aldrich) in 500 mM NaCl/50 mM Tris-HCl, pH 7.2 were further purified by chromatography on 5-ml HiTrap heparin and SP-Sepharose columns (Amersham Biosciences) to partially remove proteolytic fragments. Both columns were operated in 50 mM NaH2PO4, pH 7.2, and proteins were eluted with a linear gradient of 0.2 M to 0.8 M NaCl. Peak fractions were pooled and diluted 1:3 with 50 mM NaH2PO4, pH 7.2 before injection onto SP-Sepharose. GST fusions of LEDGF/p75 fragments 326–530, 326–471, 347–471, 366–471, the HRP2 fragment 470–593, as well as free GST protein were purified from soluble fractions in one step on glutathione-Sepharose. GST fused to LEDGF-(347–429) was expressed in an insoluble form. To purify this protein, inclusion bodies were dissolved in 8 M urea, 100 mM NaCl, 1 mM DTT, 0.5 mM EDTA, 25 mM Tris-HCl, pH 7.2 and refolded by dilution into 10-fold excess of cold 100 mM Tris-HCl, pH 8.5. The protein was purified by chromatography on glutathione-Sepharose. GST fusion proteins dialyzed against excess 200 mM NaCl/25 mM Tris-HCl, pH 7.2 were concentrated using Centricon-30 (Millipore) when necessary, supplemented with 10% glycerol, and stored at −70 °C after flash-freezing in liquid nitrogen.
Non-tagged HRP2 was induced in Rosetta2 (DE3) cells (Novagen) for 3 h at 28 °C by addition of 0.25 mM isopropyl-thio-β-D-galactopyranoside. The bacteria were disrupted by sonication in 1 M NaCl, 50 mM NaH2PO4, 5 mM DTT, 0.3 mM PMSF, pH 7.7. The lysate precleared by centrifugation at 15,000 rpm for 30 min was diluted with 50 mM NaH2PO4, pH 7.2 to reduce conductivity to 24 mS/cm and injected into a 5-ml HiTrap heparin column. Bound proteins were eluted with a linear salt gradient in 50 mM NaH2PO4, pH 7.2. HRP2 protein eluting at 500 mM NaCl was collected, the peak fractions were pooled, diluted 1:3 in 50 mM NaH2PO4, pH 7.2 and injected into a 5-ml HiTrap SP-Sepharose column. The protein was eluted with a linear gradient of NaCl from 0.15 to 1.0 M in 50 mM NaH2PO4, pH 7.2. Fractions containing HRP2 were pooled, concentrated using a Centricon device, and further separated on a Superdex 200HR column (Amersham Biosciences) at 0.25 ml/min in 250 mM NaCl, 50 mM NaH2PO4, pH 7.2. The purified protein was concentrated to 3.5 mg/ml, supplemented with 10% glycerol, and stored at −70 °C after flash-freezing in liquid nitrogen. Non-tagged LEDGF/p75 and His6-tagged HIV-1 IN were produced in E. coli strain BL21(DE3), pLysS using pCPNat75 and pKB-IN6H, respectively, and purified according to published procedures (18, 40). LEDGF-(326–530), LEDGF-(347–471), and HRP2-(470–593) fragments released from GST by digestion with thrombin (Sigma-Aldrich) were further purified by cation exchange chromatography on SP-Sepharose using a linear 0.1–0.5 M NaCl gradient in 50 mM sodium phosphate buffer, pH 7.2. Protein concentrations were determined using the Bradford colorimetric assay (Bio-Rad) employing bovine serum albumin (BSA) as a standard.
N-terminal Microsequencing and Mass Spectrometry. To determine the N-terminal residues of trypsin-resistant (TR) 1 and TR2 peptides, tryptic fragments resulting from digestion of 10 µg of LEDGF/p75 were separated by 10–20% Tricine-SDS-PAGE (Invitrogen) and electroblotted onto Sequi-Blot polyvinylidene difluoride membrane (Bio-Rad). Bands excised from Coomassie Blue R250-stained membranes were subjected to Edman degradation in a Procise protein sequencer (Applied Biosystems). In-gel trypsin digestion and peptide extraction were done as described (41). To determine molecular masses of the intact TR1 and TR2 peptides, a mixture of digestion products was separated on a 2.1 x 250 mm Vydac C8 column. Fractions containing the TR1 and TR2 fragments were analyzed by matrix-assisted laser desorption/ionization mass spectrometry (MALDI MS).
GST Pull-down Assay. Purified GST fusion proteins were adsorbed onto glutathione-Sepharose beads (Amersham Biosciences) in 200 mM NaCl, 5 mM DTT, 25 mM Tris-HCl, pH 7.3, using 125 µl (settled volume) beads per 40 µg of protein. After 4 h at 4 °C, the beads were washed in excess buffer and stored on ice. To test for IN binding, 10 µl of glutathione-Sepharose beads carrying GST fusion proteins were resuspended in 200 µl of cold PD buffer (150 mM NaCl, 5 mM MgCl2, 5 mM DTT, 0.1% Nonidet P40, 25 mM Tris-HCl, pH 7.4) containing 10 µg of BSA. After addition of 3.8 µg of His6-tagged HIV-1 IN, the samples were gently rocked for 1.5–2 h at 4 °C and left for an additional 15–30 min without mixing. After careful aspiration of the supernatant, the settled beads were resuspended in 700 µl of fresh PD buffer and allowed to sediment without centrifugation. The wash was repeated twice and bound proteins were eluted in SDS-containing sample buffer and analyzed by SDS-PAGE. In certain cases IN pull-down was confirmed by Western blotting using polyclonal anti-IN serum (42).
Cell Transfection and Immunoprecipitation. 293T cells were maintained in Dulbecco's modified Eagle's medium containing 10% fetal calf serum (Invitrogen), 5 units/ml penicillin and 5 µg/ml streptomycin. 293T cells grown in 6-well dishes to 30–50% confluency were transfected with 0.5 µg of pCypA-HA, pBHA-P75, or pCPHA-HRP2 along with 0.5 µg of pED-FLAG-IN per well using FuGENE 6 transfection reagent (Roche Applied Science). Twenty-four hours post-transfection, cells were washed in cold phosphate-buffered saline and lysed in 400 µl of cell lysis buffer (500 mM NaCl, 0.5% Triton X-100, 50 mM HEPES pH 7.9, 5% glycerol, 2 mM MgCl2, 25 mM β-glycerophosphate, 1 mM sodium orthovanadate, supplemented with complete protease inhibitor mixture (Roche Applied Science)). The extracts were centrifuged at 19,000 x g to remove cell debris and precleared by incubation with 4 µl (settled volume) of protein G-Sepharose beads (Amersham Biosciences). Precleared supernatants were incubated with 4 µg of mouse anti-HA 12CA5 antibody (Roche Applied Science) at 4 °C, 4 µl of protein G-Sepharose beads were added, and the samples were left rocking for an additional hour. The beads were washed three times in cell lysis buffer and four times in reduced salt buffer (cell lysis buffer modified to contain 150 mM NaCl, 0.1% Triton X-100, and 0.1% Nonidet P-40). Whole cell extracts and immunoprecipitated proteins were resolved in 4–20% SDS-polyacrylamide gels. Following semi-dry transfer to polyvinylidene difluoride membranes, HA-tagged CypA, LEDGF/p75, and HRP2 proteins were detected by Western blotting using anti-HA 3F10 antibody conjugated to horseradish peroxidase (Roche Applied Science) and Western Lightning chemiluminescent reagent plus (PerkinElmer Life Sciences). FLAG-tagged IN was detected with anti-FLAG M2 antibody (Sigma-Aldrich) and goat anti-mouse IgG horseradish peroxidase conjugate (Jackson ImmunoResearch Laboratories).
| RESULTS |
|---|
48% identity between mammalian, avian, and amphibian LEDGF/p75 proteins (Supplementary Fig. S1). The plot in Fig. 1A summarizes this alignment by showing the degree of conservation along the protein sequence. Three regions of homology were evident (highlighted as shaded boxes in Fig. 1A). The most conserved fragment, spanning residues 1–94 (conserved region I), which showed about 89% identity between human, chicken, and frog, corresponded to the PWWP domain (22). A 105-residue region spanning residues 351–455 displayed about 87% identity (region III). In addition, a short fragment involving residues 178–197 (region II) showed significant homology. Intuitively, these most conserved regions likely represent functional and/or structural determinants within the protein. The most variable regions encompassed an internal fragment flanking the PWWP domain (residues 94–177 in human LEDGF/p75, showing only about 13% identity) and the 60 C-terminal residues of the protein (20% identity). The single conserved feature of the first hypervariable region was a 7-residue sequence, RRGRKRK (residues 146–152), which partially overlaps the NLS in human LEDGF/p75 (residues 148–156) (43). Both chicken and frog LEDGF/p75 contain an insertion of 39 amino acids within the first hypervariable region (Supplementary Fig. S1).
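A conservation plot of the kind summarized in Fig. 1A can be approximated by scoring each column of a multiple alignment and averaging over a sliding window. The sketch below is a minimal illustration under stated assumptions: the function names, the all-or-nothing column score, the window size, and the toy alignment are hypothetical and are not taken from the AlignX procedure used here.

```python
def column_identity(aligned):
    """Return 1 for each alignment column in which every sequence carries
    the same residue (columns containing a gap never count), else 0."""
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("aligned sequences must have equal length")
    flags = []
    for column in zip(*aligned):
        same = len(set(column)) == 1 and column[0] != "-"
        flags.append(1 if same else 0)
    return flags


def windowed_identity(flags, window=10):
    """Percentage of identical columns in each full sliding window."""
    if window < 1 or window > len(flags):
        raise ValueError("window must be between 1 and len(flags)")
    return [
        100.0 * sum(flags[i:i + window]) / window
        for i in range(len(flags) - window + 1)
    ]


if __name__ == "__main__":
    # Toy three-way alignment; the rows are hypothetical, not real orthologs.
    toy_alignment = [
        "MTRDFKPGDLAAK--SSPE",
        "MTRDFKPGDLVSKQESAPE",
        "MTRDFKPGDLTPRQ-TTPE",
    ]
    flags = column_identity(toy_alignment)
    print("identical columns:", sum(flags), "of", len(flags))
    print("windowed % identity:", windowed_identity(flags, window=5))
```

Regions where the windowed value stays high would appear as conserved blocks, while stretches with insertions or substitutions drop toward zero.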
β-strand elements followed by α-helices in that region is accurate, since a five-stranded β-barrel core and a C-terminal bundle of α-helices are conserved structural features of PWWP domains (44). The region encompassing residues 347–423 and matching homology region III (Fig. 1A) was predicted with high confidence to pack into four or five α-helices. Of note, this fragment, along with the N-terminal PWWP domain, spans two mostly hydrophobic regions of LEDGF/p75 with average hydrophobicity indices above zero (data not shown).
Limited Proteolysis of LEDGF/p75. We used limited proteolysis (45) to probe the domain organization of LEDGF/p75. As the protein is rich in charged amino acids, a cleavage site for trypsin is predicted on average every 4–5 residues. Considering all Lys and Arg residues, the largest hypothetical LEDGF/p75 tryptic peptide was just 25 residues (Thr477–Lys501) with a molecular mass of about 2.6 kDa. We found that recombinant human LEDGF/p75 was indeed very sensitive to trypsin. A mass ratio of 250:1 of LEDGF/p75:protease yielded final proteolyzed products as well as semi-stable intermediates (Fig. 2A). The protease was quenched at different time points by addition of PMSF and reaction products were analyzed using Tris-glycine or Tricine SDS-PAGE. As quantified by densitometry of Coomassie-stained gels, 60–70% of the protein was degraded after a relatively short exposure to trypsin (compare lanes 1 and 6 in Fig. 2A). As proteolysis proceeded, two distinct polypeptides, TR1 and TR2, with apparent molecular masses close to 10 kDa gradually accumulated at the expense of the intermediate cleavage products (Fig. 2A). Both TR1 and TR2 fragments persisted even after overnight digestion under these conditions (data not shown).
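The reasoning about predicted tryptic peptides can be reproduced with a simple in silico digest that cuts after every Lys and Arg, which is the rule the text describes. The sketch below is illustrative only: the function names and the toy sequence are hypothetical, and the refinement that trypsin cleaves poorly before Pro is deliberately ignored.

```python
def tryptic_peptides(seq):
    """Cut a protein sequence after every Lys (K) or Arg (R).

    All K/R positions are treated as cleavable; the Pro exception of
    real trypsin is not modeled.
    """
    peptides, start = [], 0
    for i, aa in enumerate(seq.upper()):
        if aa in "KR":
            peptides.append(seq[start:i + 1].upper())
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:].upper())  # C-terminal remainder
    return peptides


def digest_summary(seq):
    """Return (number of peptides, mean peptide length, longest peptide)."""
    peptides = tryptic_peptides(seq)
    mean_length = len(seq) / len(peptides)
    longest = max(peptides, key=len)
    return len(peptides), mean_length, longest


if __name__ == "__main__":
    toy = "AKGGRSTLLKPEER"  # hypothetical sequence, not LEDGF/p75
    count, mean_length, longest = digest_summary(toy)
    print("peptides:", tryptic_peptides(toy))
    print("count:", count, "mean length:", mean_length, "longest:", longest)
```

A fragment that survives digestion at a size far above the longest predicted peptide, as TR1 and TR2 do, therefore points to cleavage sites being protected by folded structure.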
In addition to trypsin, we tested proteinase K, thrombin, chymotrypsin, and Arg C proteases (data not shown). Unlike trypsin, digestion with proteinase K did not result in stable proteolytic products; however, transient fragments of about 10 kDa in size were observed. Incubation of GST-LEDGF/p75 with thrombin resulted in multiple cuts within the putative loop region adjoining the PWWP domain. Chymotrypsin and Arg C proteases appeared less active than trypsin. Although the fragments obtained confirmed the tryptic map, the cleavage patterns were more complex and longer incubation times were necessary to allow for accumulation of final products.
TR2 Is the Functional LEDGF/p75 IBD. To identify region(s) of LEDGF/p75 involved in the interaction with HIV-1 IN, we prepared a series of LEDGF/p75 deletion mutants. Mutants were expressed and purified as GST fusions, pre-adsorbed onto glutathione-Sepharose beads, and tested for their ability to pull down recombinant HIV-1 IN. As can be seen from Fig. 3A, both the full-length protein (residues 1–530) and the mutant lacking the variable 59 C-terminal residues (1–471) readily bound HIV-1 IN (Fig. 3A, lanes 9 and 12). However, a more extended deletion from the C terminus disrupted interaction with IN, as LEDGF-(1–325) lacking 205 residues failed to pull down IN (lane 10). This result corroborates the previous finding that LEDGF/p52, an alternative splice form containing a unique 8-residue tail in place of LEDGF/p75 residues 326–530, did not bind HIV-1 IN (18). Furthermore, the C-terminal fragment of LEDGF/p75 containing residues 326–530 was sufficient to pull down HIV-1 IN (lane 11). By making another set of deletions, the IN binding function of LEDGF/p75 was mapped to just 83 amino acids, spanning residues 347–429 (Fig. 3B, lane 15; see also Supplementary Fig. S1). Importantly, this fragment lies within conserved region III of LEDGF/p75 (Fig. 1A) and the TR2 fragment (Fig. 2; see also Fig. 1B for summary). We found that further truncations from the N terminus of 347–429 abolished the interaction with IN (lane 16) and reduced the solubility of the recombinant protein (data not shown). Deletions from the C terminus of this fragment, on the other hand, profoundly affected stability of GST fusion proteins in E. coli (data not shown). These observations indicated that residues 347–429 of LEDGF/p75 span the IBD and comprise the minimal sequence required for its proper folding.
Identification of HRP2 as a Second IBD-containing Protein. Using translated BLAST to search for human cDNAs encoding polypeptides with homology to the LEDGF/p75 IBD, we found that a second HDGF-related protein, HRP2, contains a very similar sequence within its C-terminal region. Because this region of homology is relatively short and occurs within largely divergent sequences, the similarity within C-terminal regions of LEDGF/p75 and HRP2 remained unnoticed until now. Fig. 4A presents an alignment of the human LEDGF/p75 IBD with the related sequence from HRP2 and includes their respective orthologs from different species. Human LEDGF/p75 and HRP2 proteins are about 48% identical within this region, and, considering conservative amino acid substitutions, the similarity exceeds 70%. Furthermore, predicted secondary structural elements within the two putative IBDs matched very well, with both domains demonstrating high α-helical content (Fig. 4A). We identified several ESTs encoding an HRP2 ortholog from D. rerio, which allowed us to clone and sequence its complete coding region. In addition, HRP2 cDNA from X. laevis could be completely reconstructed from available ESTs (see "Experimental Procedures"). Sequence alignment of human, frog, and fish HRP2 revealed high degrees of sequence conservation within the PWWP and IBD-like regions (regions I and III, Fig. 4B) (for a complete alignment see supplementary Fig. S2). An approximately 20-amino acid region of homology (region II) was similar to homology region II in LEDGF/p75, with each region containing several conserved Pro, Arg, and Lys residues. HRP2 region IV, however, appears unique to this protein. In addition, we also identified a hypothetical 475-residue protein CG7946 from Drosophila melanogaster (GenBank accession NP_651768, UniGene cluster Dm.4512) that contains an IBD-related sequence. This fragment, spanning CG7946 residues 318–400, shared about 21% identical and 46% similar residues with the HRP2 IBD (not shown). Intriguingly, since this protein is also predicted to possess an N-terminal PWWP domain, it likely represents an insect ortholog of HRP2. Additional searches using InterProScan and SMART (Simple Modular Architecture Research Tool) revealed homology between the IBDs and the N-terminal domain of TFIIS (SMART accession SM00509). Although the E-values reported by SMART for these hits were relatively high, equating to 3.1 and 1.2 for human LEDGF/p75 and HRP2 IBDs, respectively, the N-terminal domain of TFIIS seems to represent their closest relative among known protein domains. The TFIIS domain family includes four-helix bundle domains of TFIIS, elongin A, and CRSP70 (46).
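Percent identity and percent similarity figures such as those quoted above are derived from a pairwise alignment. The sketch below shows one common way to compute them, offered as an assumption-laden illustration: the grouping of residues treated as conservative substitutions, the choice to skip gapped columns in the denominator, and the function names are all hypothetical and may differ from the conventions of the alignment software used in this work.

```python
# Residue groups treated as conservative substitutions (assumed grouping).
SIMILAR_GROUPS = ("ILVM", "FWY", "KRH", "DE", "NQ", "ST", "AG")


def _group(aa):
    """Return the index of the similarity group holding aa, or None."""
    for idx, members in enumerate(SIMILAR_GROUPS):
        if aa in members:
            return idx
    return None


def identity_and_similarity(a, b):
    """Return (% identity, % similarity) over ungapped aligned columns."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # gapped columns are excluded from the denominator
        compared += 1
        if x == y:
            identical += 1
            similar += 1
        elif _group(x) is not None and _group(x) == _group(y):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


if __name__ == "__main__":
    # Toy aligned pair; hypothetical strings, not the real IBD sequences.
    ident, simil = identity_and_similarity("SMDSRLQ-IHAE", "SLESKLQRIHSE")
    print(f"identity {ident:.0f}%, similarity {simil:.0f}%")
```

Because similarity counts identical positions plus conservative substitutions, it is always at least as high as identity for the same alignment.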
To test whether HRP2 can stimulate HIV-1 IN in vitro, we purified full-length HRP2 protein (Fig. 6E). As can be seen from Fig. 6F, HRP2 was proficient in activating HIV-1 IN-mediated strand transfer at a concentration range similar to LEDGF/p75. Akin to the LEDGF/p75 IBD fragment (LEDGF-(347–471)), IBD-containing HRP2-(470–593) (lane 6, Fig. 5A) did not increase HIV-1 IN strand transfer (data not shown). In addition, LEDGF-(347–471) competitively inhibited HRP2-dependent stimulation (lanes 4–8 in Fig. 6G), arguing that LEDGF/p75 and HRP2 share a common binding site on HIV-1 IN.
| DISCUSSION |
|---|
The LEDGF/p75 IBD comprises about 80 residues and is predicted to fold into four or five α-helices (Figs. 1B and 4). The minimal fragment that bound HIV-1 IN via GST pull-down spanned residues Ser347–Val429. This is in agreement with a previous report that LEDGF/p52 protein lacking residues 326–530 neither bound HIV-1 IN in vitro nor co-localized with it in live cells (18). Intriguingly, we identified a homologous sequence within another HDGF-related protein, HRP2, which likewise displayed affinity for HIV-1 IN. Thus, in addition to the N-terminal PWWP domains, LEDGF/p75 and HRP2 share conserved C-terminal domains, suggesting a close evolutionary and probable functional relationship between these proteins. Although we did not analyze susceptibility of HRP2 to proteases, analysis of its predicted amino acid sequence suggests that its domain organization is similar to that of LEDGF/p75. Alignment of HRP2 orthologs from mammalian, amphibian, and fish sources showed a high degree of sequence conservation within the PWWP and IBD regions (Fig. 4B, see also Supplementary Fig. S2). Two additional fragments with significant interspecies homology (regions II and IV, Fig. 4B) were present in this protein. While HRP2 homology region II was clearly related to LEDGF/p75 region II, containing similarly spaced Pro and charged residues (Supplementary Figs. S1 and S2), region IV appears unique to HRP2. An extended α-helix involving residues Glu321–Arg356 is predicted in this fragment. Thus, it is likely that HRP2 possesses an additional small structural domain. The sequences connecting the conserved regions in HRP2 contain multiple low complexity elements composed of Pro, Ser, or Ser-Asp repeats, suggesting high flexibility (Supplementary Fig. S2). Low complexity sequences are common to eukaryotic proteins and are thought to be natively disordered (49). Such sequences are usually not conserved, in accordance with their putative roles as flexible hinges. A high prevalence of simple sequences in HRP2 explains the overall low degree of sequence conservation between orthologs compared with that of LEDGF/p75 (see Supplementary Table I and Figs. S1 and S2). In silico analysis of amino acid sequences of other HRPs suggests that although they do not possess IBD-like domains, α-helical elements are located within C-terminal regions of HDGF and HRP1 (data not shown), suggesting the presence of a second functional domain within these proteins as well.
Like HDGF, all HRPs seem to have mitogenic activity in cell culture (21, 25, 30). It is presently unclear whether the growth factor activity of such proteins, which lack classical secretory signals, is related to their functions in vivo (20). The original observation that LEDGF/p75 co-purified from HeLa nuclear extracts together with the transcription co-activator PC4 provided a clue that the protein might be involved in transcriptional regulation (50). More recently, LEDGF/p75 was reported to bind to heat shock and stress-related elements within the promoter regions of the AOP2, Hsp27, and αB-crystallin genes and trans-activate their expression (28, 29). Although an earlier study isolated LEDGF/p75 from a lens epithelial cDNA library, expression of the protein is clearly not limited to lens. In contrast to the protein's name, cDNA clones encoding LEDGF/p75 have been isolated from a wide range of primary and transformed mouse and human tissues at all stages of development (refer to EST collections associated with the UniGene entries from Supplementary Table I). The AceView database contains sequences derived from 215 cDNA clones that suggest several alternative LEDGF splice variants (for up to date information consult www.ncbi.nlm.nih.gov/IEB/Research/Acembly/). While the most abundant splice form, supported by 170 cDNA clones, encodes LEDGF/p75, only 12 cDNAs are derived from p52 mRNA. Although a detailed expression analysis of individual splice forms will require a specialized study, it would appear that LEDGF/p75 is the dominant protein product of the PSIP1 gene in most tissues.
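The clone counts quoted above can be turned into rough proportions with a short calculation. This is only a sketch: it assumes that each of the 215 clones is assigned to exactly one splice form and that clone counts are a crude proxy for transcript abundance, neither of which is established here. The variable names are illustrative.

```python
# Clone counts quoted in the text (AceView, at the time of writing).
TOTAL_CLONES = 215
P75_CLONES = 170
P52_CLONES = 12

# Assumption: each clone maps to exactly one splice form, so the
# remainder represents other or unassigned variants.
other_clones = TOTAL_CLONES - P75_CLONES - P52_CLONES

p75_share = P75_CLONES / TOTAL_CLONES
p52_share = P52_CLONES / TOTAL_CLONES
p75_to_p52 = P75_CLONES / P52_CLONES

if __name__ == "__main__":
    print(f"p75 share of clones: {p75_share:.1%}")
    print(f"p52 share of clones: {p52_share:.1%}")
    print(f"other/unassigned clones: {other_clones}")
    print(f"p75:p52 clone ratio: {p75_to_p52:.1f}")
```

Under these assumptions, p75-encoding clones make up roughly four fifths of the collection and outnumber p52 clones by about 14 to 1, which is the basis for describing LEDGF/p75 as the dominant product.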
According to the large numbers of human and mouse ESTs corresponding to LEDGF/p75 and HRP2, these proteins are ubiquitously expressed at relatively high levels (see Supplementary Table I). Although the HRP2 IBD displayed an apparent high affinity for HIV-1 IN by GST pull-down (Fig. 5A), results of co-immunoprecipitation experiments suggested that LEDGF/p75 was a more potent IN interactor than was full-length HRP2 in human cells (Fig. 5B). This was not entirely unexpected, as depletion of endogenous LEDGF/p75 alone by siRNA efficiently disrupted the nuclear and chromosomal accumulation of HIV-1 and FIV IN in cells (18, 19). However, LEDGF/p75 and HRP2 proteins stimulated HIV-1 IN to a comparable degree in vitro (Fig. 6F). Based on this result we speculate that binding of IN to HIV-1 cDNA termini might stabilize the HRP2-IN interaction. HRP2 could potentially explain the failure of persistent siRNA-mediated knockdowns of LEDGF/p75 to reduce viral replication (19). It would also be interesting to determine if LEDGF/p75 and/or HRP2 modulate the enzymatic activity of FIV and other retro/lenti-viral INs (19).
It was demonstrated that HIV-1 displays a significant bias toward integration into active genes (51, 52). A somewhat similar, but not identical, integration specificity was observed for murine leukemia virus, which prefers to integrate within transcription start regions in the human genome (52). On a practical level, specificity for integration within or near active genes poses a problem in developing retroviral vector-based gene therapies (53). Yeast retrotransposons, distant relatives of retroviruses, present the best studied paradigm of targeted integration in eukaryotes (reviewed in Ref. 54). At least in the case of the Ty5 retrotransposon, a specific interaction between Ty5 IN and the chromosomal protein Sir4p determines the specificity of retrotransposition into silent chromatin (55, 56). Integration of another yeast retrotransposon, Ty3, which has a preference for RNA polymerase III transcription start sites, is controlled by the TFIIIB transcription factor complex, although the interacting determinant on the retrotransposon side is not known (57). Putative chromodomains were identified in the C-terminal regions of INs from many LTR retrotransposons, such as fungal Cft1 and Skippy, and were hypothesized to mediate the targeting of their integration (58). In this context, a model involving a chromatin binding protein as a targeting factor for retroviral integration seems quite plausible. LEDGF/p75, a chromosomal protein and a putative regulator of transcription that binds lentiviral INs in live cells, represents such a candidate factor (16–19). Identification of LEDGF/p75 as a component of HIV-1 PICs encourages further research, as it remains to be seen whether LEDGF/p75 and/or its close relative HRP2 play a role in PIC formation or targeting during retroviral infection (19).
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains Supplementary Materials.
The nucleotide sequence(s) reported in this paper has been submitted to the GenBank™/EBI Data Bank with accession number(s) AY728140, AY728141, and AY728142.
** To whom correspondence should be addressed: Dept. of Cancer Immunology and AIDS, Dana-Farber Cancer Institute, 44 Binney St., Boston, MA 02115. Tel.: 617-632-4361; Fax: 617-632-3113; E-mail: alan_engelman{at}dfci.harvard.edu.
1 The abbreviations used are: HIV-1, human immunodeficiency virus type 1; BLAST, basic local alignment search tool; BSA, bovine serum albumin; CypA, cyclophilin A; DTT, dithiothreitol; EST, expressed sequence tag; FIV, feline immunodeficiency virus; GST, glutathione S-transferase; HA, hemagglutinin; HDGF, hepatoma-derived growth factor; HMGA, high mobility group superfamily A; HRP, HDGF-related protein; IBD, integrase-binding domain; IN, integrase; LEDGF, lens epithelium-derived growth factor; MALDI MS, matrix-assisted laser desorption/ionization mass spectrometry; NLS, nuclear localization signal; NP40, Nonidet P40; ORF, open reading frame; PIC, preintegration complex; PMSF, phenylmethylsulfonyl fluoride; SMART, simple modular architecture research tool; TFIIS, transcription factor IIS; TR, trypsin-resistant; Tricine, N-[2-hydroxy-1,1-bis(hydroxymethyl)ethyl]glycine; HPLC, high performance liquid chromatography.