![]()
|
|
||||||||
J. Biol. Chem., Vol. 280, Issue 9, 7949-7955, March 4, 2005
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||








From the
Ohio State University Health Sciences Center, College of Pharmacy, Center for Retrovirus Research and Comprehensive Cancer Center, Columbus, Ohio 43210,
HIV Drug Resistance Program and ||Laboratory of Medicinal Chemistry, NCI-Frederick, Frederick, Maryland 21702, the ¶Department of Cellular and Molecular Biology, Tbilisi State University, Tbilisi 0128, Georgia, and **NIDDK, National Institutes of Health, Bethesda, Maryland 20892
Received for publication, December 2, 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
HIV-1 IN catalyzes integration of the viral DNA, made by reverse transcription, into the host chromosome in a two-step reaction (reviewed in Ref. 1). In the first step, called 3'-processing, two nucleotides are removed at each 3'-end of the viral DNA. In the next step, called DNA strand transfer, concerted transesterification reactions integrate the viral DNA ends into the host genome.
HIV-1 IN is composed of three distinct structural and functional domains: the N-terminal domain (residues 150) that contains the HHCC zinc-binding motif, the core domain (residues 51212) that contains the catalytic site, and the C-terminal domain (residues 213270) that is thought to provide a platform for DNA binding. Crystallographic or NMR structural data are available for each of the individual domains (26). In addition, the structures of the combined core and C-terminal domains (7) and core and N-terminal domains (8) have been recently determined. However, efforts to obtain a structure of the full-length protein have been impeded by poor protein solubility.
Purified IN-based assays have been employed for screening of potential inhibitors. These studies have revealed several classes of compounds with anti-HIV-1 IN activity, including diketo acids, naphthyridines, pyranodipyrimidines, nucleotide analogs, hydroxylated aromatic compounds, DNA-interacting agents, peptides, and antibodies (922). Much effort has been devoted to dissecting the mechanism of inhibition and identifying the inhibitor binding sites. Crystallographic studies revealed two distinct binding sites in IN including the catalytic site for a diketo group-containing inhibitor (1-(5-chloroindol-3-yl)-3-hydroxy-3-(2H-tetrazol-5-yl)-propenone) and a site located near the IN dimer interface for 3,4-dihydroxyphenyltriphenylarsonium bromide (23, 24). However, these studies employed the isolated core domain of IN rather then the full-length protein.
Mass spectrometry is a powerful structural biology tool that allows the analysis of protein-inhibitor complexes under biologically relevant conditions and provides structural information complementary to NMR and crystallography. For example, we recently reported identification of a small molecule inhibitor binding site to full-length IN using the affinity acetylation and mass spectrometric analysis approach (25). These experiments were performed at low protein concentrations, where full-length IN as well as its complexes with inhibitor and DNA were fully soluble. In the present study, our efforts focused on interactions of a known nucleotide inhibitor, pyridoxal 5'-phosphate (PLP) with IN. Our interest in this compound was prompted for the following reasons. PLP has been previously shown to inhibit IN activity at low micromolar range (15, 16). More recently, certain PLP-derivatives were shown to exhibit potent antiviral activities in cell culture assays (26). In addition, PLP is known to form a Schiff base with the primary amine in the protein at the nucleotide-binding site (27). This intrinsic property coupled with mass spectrometric analysis could be exploited for determination of a nucleotide binding site in IN. Furthermore, protein interactions with the phosphate group of PLP could mimic that with the phosphate backbone of DNA. Finding the contact amino acids would benefit longstanding efforts in the field to define the DNA substrate binding cleft in HIV-1 IN. Indeed, our results reported here reveal a new plausible nucleotide-binding pocket in the protein. In particular, using mass spectrometry, we identified Lys244 as a primary binding site of PLP. The importance of this amino acid for the IN function and for viral replication was confirmed by site-directed mutagenesis.
| MATERIALS AND METHODS |
|---|
|
|
|---|
IN Activity3'-Processing and strand transfer activities were assayed in the reaction buffer containing 50 mM HEPES, pH 7.5, 50 mM NaCl, 10 mM MnCl2, 2 mM
-mercaptoethanol, 1 mM CHAPS, 50 nM 32P-labeled DNA substrate (21-mer blunt ended double-stranded DNA mimicking the U5 sequence), and 1 µM IN. The reaction products were separated by 20% denaturing PAGE and visualized by a PhosphorImager (Amersham Biosciences).
IN·DNA Photocross-linkingThe protocol for specific photocross-linking of IN to cognate DNA substrate was adopted from Esposito and Craigie (29). HIV-1 U5 LTR sequence containing DNA oligonucleotide (5'-GTGTGGAAAATCTCTAGCAGT-3') was annealed to the complementary DNA strand in which 5-iododeoxyuracil (IdU) was incorporated instead of the 5'-terminal adenine residue (5'-IdUCTGCTAGAGATTTTCCACAC-3'). The hybridized DNA substrates were radioactively labeled, purified through a Sephadex G25 desalting column, and used for cross-linking reactions. The reaction contained 50 mM HEPES, pH 7.8, 50 mM NaCl, 5 mM MnCl2,10mM DTT, 20 nM DNA substrate, and 1 µM integrase. Samples were first incubated for 15 min at room temperature and then irradiated with 302-nm light from a handheld ultraviolet lamp for 30 min. The cross-linked products were analyzed by SDS-PAGE with NuPage 412% MOPS gels (Novex, Inc.). Quantitation of reaction products was carried out using a PhosphorImager and ImageQuant software (Amersham Biosciences).
Formation and Mass Spectrometric Analysis of the IN·Pyridoxal 5'-Phosphate ComplexThe reaction mixture containing 50 mM HEPES, pH 7.8, 50 mM NaCl, 10 mM MnCl2, 10 mM DTT, 1 µM IN was incubated with varying concentrations of PLP at 37 °C for 30 min. Sodium borohydride (5 mM final concentration) was added subsequently to reduce the Schiff base formed upon the PLP interaction with a primary amine. The protein was then unfolded by adding 40 mM DTT and incubating at 70 °C for 20 min. To modify cysteine residues, the reaction mixture was complemented with 100 mM iodoacetamide and incubated at 37 °C for 45 min. The reaction was quenched by the addition of 100 mM DTT. The modified protein samples were subjected to SDS-PAGE. The IN band was excised and subjected to in-gel proteolysis using 0.5 µg of one of the following three proteases: trypsin, LysC, or GluC (Roche Applied Science). Trypsin and LysC digestions were performed in 50 mM ammonium bicarbonate buffer, pH 8.0, at 37 °C for 16 h. GluC proteolysis was carried out in 50 mM ammonium acetate, pH 4.0, at 25 °C for 16 h. The proteolytic peptide fragments were desiccated in a SpeedVac (Thermo Savant, Holbrook, NY) and resuspended in an aqueous solution of 0.1% trifluoroacetic acid.
MS and MS/MS analyses of proteolytic peptide fragments were carried out using a Micromass Q-TOF-II instrument equipped with an electrospray source and a micromass cap-LC. Peptides were separated with a Waters Symmetry300 C18 precolumn and a Micro-Tech Scientific VC-10-C18150 column using two sequential linear gradients of 540 and 4090% acetonitrile for 35 and 10 min, respectively. MS/MS analysis data and the Mascot search engine (www.matrixscience.com) were used to identify IN peptide peaks from the NCBInr primary sequence data base.
Preparation and in Vivo Analysis of K244E MutantTogether with the K244E mutation, we also analyzed HIV-1 NL43 viruses encoding wild type and D116N mutant IN as positive and negative controls, respectively. A 4.3-kb fragment of the NL43 vector spanning the SpeI and EcoRI sites was generated and subcloned into pBluescript SK(+) (Stratagene). Appropriate DNA oligonucleotides to introduce the K244E or D116N mutation were purchased from Integrated DNA Technologies (Coralville, IA). Preparation of mutant plasmids was carried out according to the Stratagene protocol using QuikChange XL site-directed mutagenesis kit (Stratagene). After confirming the mutation by DNA sequencing, the SpeI-EcoRI fragments were inserted back into the NL43 vector.
In vitro virus stock preparation and viral infectivity determination were carried out as follows. 293T cells (ATCC) were maintained in the presence of Dulbecco's modified Eagle's medium (Cellgro), 10% fetal calf serum (HyClone Laboratories), penicillin (50 units/ml; Invitrogen), and streptomycin (50 µg/ml; Invitrogen) and plated at a density of 5 x 106 cells/100-mm-diameter culture dish. After overnight incubation with 5% CO2 at 37 °C, the cells were transfected using an MBS mammalian transfection kit (Stratagene) with 10 µg of HIV-1 NL43 plasmid DNA encoding wild-type and mutant (K244E or D116N) IN. Cell culture supernatant was replaced with fresh medium 6 h after transfection and further cultured for a total of 48 h. Thereafter, cell culture supernatants containing viruses were harvested, clarified through a 0.45-µm membrane filter (Nalgene), aliquoted, and stored at -80 °C. In a separate experiment, 293T cells were also co-transfected with 10 µg of plasmid DNAs together with 1 µg of plasmid DNA of pHCMV-G expressing vesiculostomatitis virus envelope glycoprotein G (30). The viral supernatant was cleared and concentrated 40-fold by centrifugation at 25,000 rpm for 90 min at 4 °C. The concentrated virus pellet was resuspended and digested in a buffer containing 50 mM Tris-HCl (pH 8.0), 10 mM MgCl2, and 300 units/ml RNase-free DNase I (Roche Applied Science) for 60 min at room temperature. The viral p24 capsid protein amounts in the virus stocks were determined by using an enzyme-linked immunosorbent assay (p24 Core Profile Kit; DuPont). Human osteosarcoma cell line, GHOST (3) R3/X4/R5 (National Institutes of Health AIDS Research and Reference Reagent Program) transduced by HIV-2 LTR-GFP expressing both CD4 and CCR5 receptor were maintained in the same growth medium as 293T cells but with the additional selective reagents of G418 (500 µg/ml), hygromycin (1 µg/ml), and puromycin (1 µg/ml) (31). The cells were plated at a density of 1 x 105 in 0.5 ml of medium/well of a 24-well plate and cultured overnight under the conditions described above. Unconcentrated virus supernatants containing a range of 100400 ng of p24 were added in duplicates and cultured for a total of 40 h. The cells were trypsinized off of the plate and inactivated by 1% paraformaldehyde in phosphate-buffered saline. A total of 10,000 events representing cells were acquired through flow cytometry to measure the percentage of GFP-positive cells. Thereafter, the percentage of GFP-positive cells was normalized to 100 ng of p24 equivalent viral inoculum.
Viral DNA Quantitation by Real Time PCRCEMx174, the T and B lymphoid hybrid cell line (National Institutes of Health AIDS Research and Reference Reagent Program), was maintained in the presence of RPMI 1640 medium (Cellgro), 10% fetal calf serum (HyClone Laboratories), penicillin (50 units/ml; Invitrogen), and streptomycin (50 µg/ml; Invitrogen). 2 x 105 of CEMx174 cells in 0.5 ml of medium were infected by adding a 10-ng equivalent of p24 inoculum of vesiculostomatitis virus envelope glycoprotein G pseudotyped HIV-1 NL4-3 encoding wild-type and mutant (K244E or D116N) IN. Immediately and 12 h after infection, the cells were harvested, and total cellular DNA was extracted (QIAmp DNA Blood Mini Kit; Qiagen) and dissolved in water. DNAs extracted from cells immediately after infection were used as a background control for the carryover plasmid DNA derived from the transfection procedure. The extracted DNAs were used in a real time PCR to quantitatively detect viral late reverse transcription product (U5-
) and cellular DNA. The late reverse transcription product U5-
DNA was quantified as described previously (32, 33). The following primer and probe sets were employed in the real time PCR assay to detect DNA products. For the late reverse transcription product U5-
, the primer-probe sets were ES531 (5'-TGTGTGCCCGTCTGTTGTGT-3'), ES532 (5'-GAGTCCTGCGTCGAGAGATC-3'), and probe LRT-P (5'-6-carboxyfluorscein (FAM)-CAGTGGCGCCCGAACAGGGA-carboxytetramethylrhodamine (TAMRA)-3') (32). The PBGD primer-probe sets consisted of the forward primer DF3 (5'-AGGGCAGGAACCAGGGATTATG-3'), the reverse primer DR1 (5'-GGGCACCACACTCTCCTATCTTT-3'), and the probe PBGD (5'-ATGTCCACCACAGGGGACAAGATTC-3'). All real time PCRs were performed using the TaqMan Universal PCR Master Mix Kit (Roche Applied Science), and the products were quantified using the ABI Prism sequence detector 7700. Primers were used at 600 nM, and probes were used at 75 nM concentration. The conditions used for real time PCRs were as follows: one cycle at 50 °C for 2 min, one cycle at 95 °C for 10 min, and 40 cycles of amplification at 95 °C for 0.15 min and 60 °C for 1 min.
Molecular ModelingIn order to better understand the binding mode and interactions of PLP with HIV-1 IN, a structural model of PLP complexed with full-length HIV-1 IN dimer was generated. Based on our experimental results and reported crystal structures of PLP complexed with different proteins in the protein data bank (34), the IN·PLP complex was modeled as a covalent interaction between Lys244 and PLP by adding a bond between terminal NH2 of Lys244 and aldehyde carbonyl of PLP. The complex was then subjected to quenched molecular dynamics simulations using the Insight II 2000.0/Discover 97.0 modeling package (Molecular Simulations Inc., San Diego, CA), using the cff91 force field. Additional parameters for Mg2+ were determined using its ionic radius and small molecule crystallographic data. A dielectric constant of 1.00 and the cell multipole method with fine accuracy were used for all nonbonding interactions except for the quenched molecular dynamics. The quenched molecular dynamics cycle consisted of 100 ps of molecular dynamics simulation at 800 K in an NVT ensemble, keeping all protein atoms beyond 15 Å of PLP fixed. The van der Waals interactions were scaled to 2%, and the coulombic interactions were scaled to 20%. The coordinates were saved every 200 fs and subsequently minimized by 300 steps of the Polak-Ribiere conjugate gradient algorithm. The lowest energy conformation of the 500 minimized structures is depicted in Fig. 7A. The electrostatic potential on the solvent-accessible surface of the full-length IN dimer and PLP was calculated using the Delphi module in the Insight II program and is depicted in Fig. 7B. The default Delphi formal charges were applied for the protein, whereas ab initio calculation with Gaussian 98 A11 was used to obtain charges for PLP.
|
| RESULTS |
|---|
|
|
|---|
|
15 µM. Interestingly, mass spectrometric analysis of tryptic peptide fragments of the IN·PLP complex revealed that a single new peak was formed when compared with free IN digests (Fig. 2). The molecular weight of this peak corresponded to the molecular weight of IN peptide amino acids 241258 plus one molecule of PLP. This peptide contains two lysine residues, Lys244 and Lys258. Lys244, located within the peptide, was resistant to tryptic digestion. In contrast, Lys258, located at the peptide C terminus, was readily hydrolyzed by trypsin. Normally, unmodified lysine residues in proteins are easily cleaved by trypsin, whereas modified lysines become resistant to proteolysis.
|
|
|
|
viral DNA product derived from the late stage of viral reverse transcription was quantified by using a real time PCR assay. The data in Fig. 5B indicate that the wild-type as well as mutants D116N and K244E produced similar amounts of viral late reverse transcription products. These results indicated that reverse transcription was not affected by the K244E mutation. Comparison of two-LTR circle copy numbers standardized over the late reverse transcription products revealed 2.29- and 5.11-fold increases for K244E and D116N mutants, respectively, over wild type NL43. A similar picture was observed previously (35) with other DNA binding mutations (K156E and K159E). Increases in two-LTR circles were relatively modest for these two mutants in comparison with the increase seen with D116N (35). This pattern could possibly be explained by the notion that formation of the IN·DNA complex that takes place in the cytoplasm of the infected cells may be essential for the efficient nuclear import. Unlike the DNA binding mutations, D116N exhibits wild type levels of DNA binding and is inactive due to its failure to coordinate divalent metal ions. Collectively, the data in Fig. 5 suggest that the K244E mutation is detrimental for the integration step and does not impact reverse transcription.
We next performed in vitro analysis of the K244E mutant. The data depicted in Fig. 6 indicate that the mutation fully impaired 3'-processing and strand transfer activities. Taken together, in vivo and in vitro data argue strongly for the importance of Lys244 for IN function.
|
Importantly, Lys244 as well as the other PLP-interacting residues (Arg262, Lys264, and Lys266) are conserved in HIV-1, HIV-2, and SIV IN proteins. The structural fold of the HIV-1 IN C terminus is similar to that of other DNA-binding proteins exhibiting an Src homology 3-like fold (Sso7d and Sac7d) (7, 3941). However, HIV-1 IN residues Lys244, Arg262, Lys264, and Lys266 are not conserved in Sso7d and Sac7d. Therefore, a nucleotide inhibitor binding site uncovered in the present work is of interest for designing small molecule antagonists selective to retroviral IN.
| DISCUSSION |
|---|
|
|
|---|
To our knowledge, this is the first report indicating that a small molecule inhibitor can impair HIV-1 IN activity by binding to the C-terminal domain. A large molecule inhibitor, monoclonal antibody 33, was reported to bind to the C-terminal domain and impair IN coordination of the cognate DNA (17). Crystallographic and NMR efforts to determine small molecule inhibitor binding sites were restricted to analyses of the protein core domain rather than full-length IN (23, 24). Recently, using full-length IN and mass spectrometric footprinting, we mapped an acetylated inhibitor (methyl-N,O-bis-(3,4-diacetoxycinnamoyl)serinate) binding site to Lys173, which is located in the core domain (25). This inhibitor-binding site involves an architecturally critical protein dimer interface that is significantly distanced from the DNA binding cleft. Consistent with these structural data, the mechanistic studies indicated that the acetylated inhibitor does not interfere with formation of the IN·DNA complex.
Previous attempts to map a nucleotide analog binding site in IN included application of photoaffinity labeling with 3'-azido-3'-dideoxythymine analog 3',5-diazido-2',3'-dideoxyuridine 5'-monophosphate coupled with proteolytic digestion of the protein (42). Drake et al. (42) identified amino acid region 153167 of IN as the site of photocross-linking. In this cross-linking experiment, the core domain of IN (amino acids 50212) rather than the full length protein was used (42). In the present study with full-length IN, we found that PLP could also bind to the similar region by directly interacting with Lys156. However, the modification of Lys156 could only be detected at an elevated concentration (200 µM) and not at 15 µM PLP (data not shown). Equally, it should be noted that the IC50 value for inhibition of IN with 3'-azido-3'-dideoxythymine was reported to be 360 µM (42). Our results indicate that Lys244 binds PLP with a significantly higher affinity than Lys156.
Lys244 was accessible to PLP modification when the compound was introduced to the preassembled IN·DNA complex. Equally, we observed similar inhibition profiles whether PLP was added first to free IN and then DNA substrate was provided or the compound was exposed to the preassembled IN·DNA complex (data not shown). Our interpretation of this observation is as follows. The active IN protein is an oligomer where separate monomers appear to provide complementary rather than symmetrical contacts to the DNA substrate. Indeed, previous in vitro complementation assays have shown that two inactive IN proteins, one with the mutation in the core domain and another with the mutation in the C-terminal domain, can be combined to restore catalytic activity (43, 44). Thus, when an active site lysine in one monomer contacts the DNA substrate, the identical lysine in another monomer is exposed to solvent. Binding of PLP to the IN monomer in which Lys244 is accessible to the surface could destabilize the IN·DNA structure and compromise the enzymatic activities.
Our results suggest that Lys244 is an essential DNA binding residue. A study of Gao et al. (45) demonstrated that the E246C mutant preferentially interacted with position 7 of the U5 sequence. In particular, the disulfide cross-link between the cysteine residue and modified base in DNA was established through a 3-carbon linker arm. In our model, Glu246 is located within 5 Å of PLP (Fig. 7A). Therefore, it is logical to propose that Lys244 establishes direct contacts to DNA that could contribute to the observed cross-link of E246C with the respective DNA.
Our modeling studies revealed that Lys244 together with other DNA binding residues (Arg262, Lys264, and Lys266) forms a plausible nucleotide inhibitor-binding pocket. The poor membrane permeability and reduced specificity restricts application of PLP as a potent antiviral inhibitor. However, certain PLP derivatives have shown promise as potent antiviral agents in cell culture assays (26). The structural information we report here facilitates a more rational approach toward improving selectivity and potency of this class of HIV-1 IN inhibitors.
| FOOTNOTES |
|---|

To whom correspondence should be addressed: Ohio State University, 500 W. 12th Ave., Columbus, OH 43210. Tel.: 614-292-6091; Fax: 614-292-7766; E-mail: kvaratskhelia.1{at}osu.edu.
1 The abbreviations used are: HIV-1, human immunodeficiency virus type 1; IN, integrase; PLP, pyridoxal 5'-phosphate; LTR, long terminal repeats; Q-ToF, quadrupole-time of flight; DTT, dithiothreitol; CHAPS, 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonic acid; IdU, 5-iododeoxyuracil; MOPS, 4-morpholinepropanesulfonic acid; MS, mass spectrometry; GFP, green fluorescent protein; PBGD, phorphobilinogen deminase gene. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. Q. Al-Mawsawi, V. Fikkert, R. Dayam, M. Witvrouw, T. R. Burke Jr., C. H. Borchers, and N. Neamati From the Cover: Discovery of a small-molecule HIV-1 integrase inhibitor-binding site PNAS, June 27, 2006; 103(26): 10080 - 10085. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE |