|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 280, Issue 4, 2990-2997, January 28, 2005
The Crystal Structure of the Herpes Simplex Virus 1 ssDNA-binding Protein Suggests the Structural Basis for Flexible, Cooperative Single-stranded DNA Binding*
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
herpesvirus subfamily and of the family as a whole. The HSV-1 single-stranded DNA (ssDNA)-binding protein (SSB), ICP8, is a nuclear protein that, along with the six other HSV replication proteins (the viral polymerase (UL30) and its accessory factor (UL42), the trimeric helicase-primase complex (UL5-UL8-UL52), and the origin-binding protein (OBP), coded by the gene ul9), is required for viral DNA replication (1) during lytic infection. Replication has been thought to proceed by a rolling circle mechanism (2) partly because the replication product is a concatamer, although the observation of highly branched replication intermediates could be explained by other mechanisms that would link recombination and replication. ICP8 is a 128-kDa multifunctional zinc metalloprotein (3) encoded by the ul29 gene. It preferentially binds ssDNA over double-stranded DNA in a non-sequence-specific and cooperative manner (4). ICP8 has been reported to interact either directly or indirectly with several other viral proteins. There is evidence that it binds to the C terminus of the OBP and stimulates its helicase activity (5, 6), that it promotes the helicase activity of the viral helicase-primase complex (UL5-UL8-UL52) (7), and that it modulates the processivity of the viral polymerase (UL30) (8). Before viral DNA replication commences, these proteins are thought to be co-localized with ICP8 at small punctuate foci called prereplicative sites. With the onset of viral genome amplification, these proteins become redistributed into a larger globular replication compartment (9) whose location is defined by the preexisting host cell nuclear architecture, most probably at the periphery of the nuclear matrix-associated ND10 domains where the viral transactivator ICP0 and the viral input genome are believed to migrate in the early stages of infection (10). ICP8 is also involved in several other events of the DNA metabolism. It can promote DNA strand transfer (11), catalyze strand invasion in an ATP-independent manner (12), and renature complementary DNA strands (13), which indicates that ICP8 plays an important role in HSV genome recombination. The replication of HSV-1 DNA is also associated with a high degree of homologous recombination. Recently it was shown that ICP8 works together with alkaline nuclease (UL12), which is a 5'-3'-exonuclease, to effect strand exchange (14). In addition to its role in DNA synthesis, ICP8 has been shown to regulate viral gene expression by repressing transcription from the parental genome (15) and stimulating late gene expression from progeny genomes (16).
Genetic and biochemical analyses have failed to identify functionally independent domains within ICP8. Even the extent of the minimal DNA binding region has remained unclear. It has been placed in the C-terminal half of the protein (17) or in regions spanning residues 564-1110 (18) or 300-849 (19). The C-terminal 60 amino acid residues were shown to account for most of the cooperative behavior in ssDNA binding (20), possibly modulated by the two cysteines 254 and 455 (21). It has also been shown that the C-terminal 28 amino acids contain a nuclear localization signal (22), that the residues between 499 and 512 host a zinc binding motif (3), and that the residues from 1082-1169 are also important for the stimulation of late gene expression (18).
Here we have reported the first crystal structure of an ssDNA-binding protein of the Herpesviridae, a 60-amino acid C-terminal deletion mutant of ICP8, at 3.0 Å resolution. The structure consists of an unexpectedly large N-terminal folding unit and a small C-terminal
-helical domain, both with novel folds. In addition, it has provided insight into the likely mechanism of cooperative ssDNA binding and tempted us to speculate about the possible interaction with the origin-binding protein.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Ccc) have been described previously (23).
Protein Expression and Purification of Selenomethionine-ICP8
Ccc Selenomethionine (SeMet)-enriched ICP8
Ccc was expressed in High 5 insect cells grown as monolayer culture. Confluent cells were infected with the same recombinant virus used for expression of the native protein in a methionine-containing Sf-900II SFM medium (Invitrogen), supplemented with 10 µg/ml penicillin-streptomycin and 2% fetal calf serum. After 30 h, cells were washed with phosphate-buffered saline solution; methionine-free IPL-41 medium (Applichem) was added for starvation. The methionine-free medium was renewed after 4 h with SeMet-containing (50 mg/l) IPL-41 medium. Cells were incubated for a further 26 h before harvesting. The purification protocol was the same used for the native protein (20), with the only exception that all buffers were flushed with N2 and supplemented with 10 mM reducing agent (dithiothreitol or
-mercaptoethanol) to overcome a more pronounced tendency of the selenomethionine-containing protein to oxidize and aggregate.
CrystallizationSeMet-ICP8
Ccc crystals were grown at 22 °C in hanging drops by equilibration of 5 mg/ml protein in 10 mM Tris-HCl (pH 8.0), 300 mM NaBr, 20% glycerol, 10 mM dithiothreitol against 12-14% polyethylene glycol 3000 and 100 mM sodium-potassium phosphate, pH 6.3. This crystallization condition is similar to that for the native crystal growth. Within about a week, fragile, plate-like crystals (
0.2 x 0.2 x 0.05 mm3) grew by salting in. For derivatization with methyl mercury acetate (MMA), the protein was first dialyzed against a dithiothreitol-free buffer, then against a 5 mM MMA-containing buffer at pH 7.5, and was subsequently used to set crystallization drops. Plate-like crystals appeared in a week in conditions similar to the ones used for native crystal growth. Crystals formed in space group P212121 with two molecules in the asymmetric unit.
Data Collection, Structure Determination, and RefinementBoth SeMet- and MMA-containing ICP8
Ccc crystals were cryoprotected by brief soaks in 20% glycerol buffered at pH 6.3 before cryocooling in liquid nitrogen. Multiwavelength anomalous diffraction data from crystals of SeMet-ICP8
Ccc were collected at 100 K using synchrotron radiation at the 17-ID IMCA-CAT beamline of the Advanced Photon Source (Argonne) at three/two different wavelengths around the selenium absorption edge. A full diffraction data set was collected for the MMA derivative at 100 K, using the BW7B beam line of the European Molecular Biology Laboratory Hamburg Outstation. The diffraction data were processed using the HKL program package (24). Data collection statistics are shown in Table I.
|
|
Modeling of ssDNAThe structure of the EcoSSB-ssDNA complex, where two monomers of EcoSSB cover 26 nucleotides, (37) was used for the modeling of ssDNA onto the neck region of ICP8. One of the two monomers was superimposed with a monomer of human mitochondrial SSB (HsmtSSB) (38) with a root mean square deviation of 1.6 Å on the 97-C
target pair, and then the monomer of HsmtSSB was overlaid on the neck region of ICP8. In this way the relative orientation of ssDNA was modeled onto the complete structure of ICP8 with the ssDNA covering the neck region of ICP8.
The two independent molecules that form the protein chain have different spatial arrangements of the C- and N-terminal domains; however, the distance over the disordered region (1038-1048) is approximately the same (16.1 and 19.2 Å). The relative orientation of ssDNA was created for each monomer of the proposed protein chain formed by applying non-crystallographic symmetry. Visual inspection using computer graphics allowed the nucleotides between two non-crystallographic symmetry-related monomers (chains A and B) and between two crystallographically related monomers (chain B and symmetry mate of chain A) to be added while avoiding clashes with the protein chain. Again the crystallographic symmetry was applied to the newly built nucleotides. In this way, it was possible to join the ssDNA in a continuous chain. In the model the continuous chain of ssDNA contains 98 nucleotides covered by 7 monomers and the distance between 5'- and 3'-ends is 350 Å. The coordinates of the model are available from the authors upon request, and a more detailed illustration is included in the supplemental material.
| RESULTS |
|---|
|
|
|---|
20 kDa, and although the heterotrimeric eukaryotic SSB, RepA, is
116 kDa, this is believed to contain more than one DNA binding region. Crystallization studies of ICP8 have been reported earlier (23), and it was shown that crystals of full-length ICP8 diffracted too poorly to be useful. A mutant, ICP8
Ccc, with the C-terminal 60 residues deleted and two point mutations (C254S and C455S) has been crystallized under similar conditions to the full-length ICP8 and diffracted to at least 3 Å resolution. The mutant has been shown to bind ssDNA with much reduced cooperativity (22). The structure was solved by MAD and single isomorphous replacement with anomalous scattering methods using SeMet-substituted ICP8. The SeMet crystals formed in space group P212121 with two molecules in the asymmetric unit and diffracted to 3.2 Å resolution. The model (residues 9-1129 with the disordered regions described below) was refined to a crystallographic R value of 23.5% (Rfree = 28.6%) using data from 20.0 to 3.0 Å resolution (Table I).
Overall StructureThe structure of ICP8 (9-1129) (Fig. 1) is composed of a large N-terminal domain (9-1038) and a smaller
-helical C-terminal domain (1049-1129). The first 8 residues and the last 7 residues of the construct are not visible in the electron density and are presumed to be disordered. The N-terminal domain can be described as consisting of head, neck, and shoulder regions. The head consists of the eight helices
14,
15,
16,
21,
22,
23,
24, and
25 (Fig. 1B). The front side of the neck region consists of a five-stranded
-sheet (
16,
17,
23,
26, and
27) and two helices (
17 and
27), whereas the back side is a three-stranded
-sheet (
24,
25, and
28) (Fig. 1). The shoulder part of the N-terminal domain contains an
-helical and
-sheet region. The head, neck, and shoulders are interconnected in such a way that their individual structural folds are not formed by contiguous polypeptide chains. From the N terminus, the polypeptide chain forms a first helical region in the head and then one of the two
-sheet regions belonging to the neck. The strands
16 and
17 in the neck lead to strands
18-
22 in the shoulders before returning to strands
23-
26 in the neck. The strands
18-
22 are involved in interaction with residues in other strands from the N terminus (see Fig. 1A). This explains why limited proteolysis experiments have never yielded either soluble or functionally active fragments (20) and why so many mutant proteins have proven to be insoluble (see, for example, Ref. 18).
The C-terminal domain (1049-1129) is entirely helical (
28,
29,
30,
31, and
32) and is connected to the N-terminal domain
17 Å away by a disordered linker (residues 1038-1049) (Figs. 1 and 2).
|
-sheet with disordered connecting loops containing conserved basic and aromatic residues. The direction of each
-strand of the neck region that resembles the OB-fold is similar to that of HsmtSSB (38) (Fig. 3A). The proposed DNA binding region on the front side of the neck (Fig. 3A) contains elements of the sequence between amino acids 530 and 1028, similar to the boundaries suggested by Gao and Knipe (18). Limited proteolytic analysis studies had suggested that the putative boundaries of the minimal DNA binding region are between residues 300 and 849 (19). More recent evidence, based on ICP8 photo-affinity labeling with oligonucleotides, indicated a slightly different region, namely between residues 386 and 902 (43). There are a number of aromatic and positively charged residues from the front side of the neck that are exposed to the surface or lie in the disordered loops that are relatively well conserved across the Herpesviridae. These are Tyr543, Asn551, Arg772, Lys774, Arg776, Tyr988, Phe998, and Asn1002 (Fig. 4), which we believe are involved in ssDNA binding either by base stacking or electrostatic interactions.
|
|
7 and
2. It is primarily a hydrophobic interface formed by residues Leu112 and Leu115 packing against Leu501, Val511, and the hydrophobic residues on one face of helix
11, but there is also a hydrogen bond between Asn111N
2 and the carbonyl oxygen of Leu501. The second is toward the C terminus and contains the region between
24 and
26 (Fig. 1, sequence numbering as in Fig. 4). The interaction involves the positioning of His508, which hydrogen bonds with the carbonyl oxygen of Ser934 and is stabilized by a hydrogen bond between Trp933N
1 and Glu470O
1. The zinc binding region is remote to the proposed ssDNA binding region and therefore unlikely to be important for DNA binding.
|
919 Å2), both independent interaction surfaces are similar. However, the spatial arrangement between the two domains differs (Fig. 2), indicating the flexible nature of the protein chain that, in this case, is probably determined by the crystal packing. A continuous chain of molecules through the crystal lattice can also be observed in the case of T4 gene 32 (45) and the adenovirus ssDNA binding proteins (42) as well as in the organization of Escherichia Coli SSB tetramers (37). The arrangement of ICP8 molecules is similar to the beaded morphology observed in negatively stained electron micrographs (5) of ICP8 decorated ssDNA (Fig. 3C). We previously established that the 60-residue C-terminal region of the ICP8 is a principal determinant for cooperative DNA binding (23), at least on shorter oligonucleotides. Although the crystallized construct does not contain this region, we have postulated that the remaining weaker interactions generate the same chain in the crystal that would form on DNA in solution. Ser254 and Ser455 (Cys254 and Cys455 in the native protein) are located in the loop region between helices
6 and
7 and the C-cap of the
10 helix, respectively. These residues are neither solvent exposed nor close to the region in the neck that we have proposed to be involved in ssDNA binding. It is therefore unlikely that these two cysteines are involved in cooperativity as has been previously suggested (21).
We believe that, because of the nature of the ICP8 domain connection, ssDNA is covered in a flexible manner while keeping the ssDNA in an extended form (Fig. 3, B and C) that prevents formation of secondary structures. On the basis of the ICP8-ssDNA model (Fig. 3 and supplemental material), it seems that
14 oligonucleotides would be covered per ICP8 molecule, and this is in good agreement with biochemical data (23).
Interactions of the C TerminusThere is evidence (20) that the deletion of the C terminus seriously reduces cooperative binding for ICP8, suggesting that there is an additional protein-protein interaction involving this region. An F(N/D)F motif (amino acids 1142-1144) in the C terminus is identifiable in the Alphaherpes viruses and possibly in the Roseolovirus genus of the Betaherpesvirus subfamily (Fig. 4). We believe that at least one of the phenylalanines is involved in a hydrophobic interaction with a hydrophobic region formed by Phe827, Phe843, Trp844, Leu857, and Ile865 (Fig. 6) of the head. This region of ICP8 is also very well conserved among Alphaherpes viruses (Fig. 4). The C-terminal residue (Glu1129) of the model is in the vicinity of the head of the N-terminal domain of another molecule. Modeling a continuation of the C terminus would allow it to pass around the head and dock part of the F(N/D)F motif into the hydrophobic region mentioned above (Fig. 6). A similar interaction is important for the formation of the protein chain in the adenovirus SSB (42). The last C-terminal 28 residues that contain the nuclear localization signal must presumably remain free to facilitate nuclear import.
|
Potential Protein-Protein Interaction SitesWe have identified the function of head, neck, and the C-terminal helical region of ICP8; however, a large part of the shoulder region is not, according to our model, involved in cooperative ssDNA binding. It is unlikely that such a large part of the N-terminal domain would have no functional role, because the necessity of packaging the viral genome should tend to enhance the evolution of multifunctional proteins and reduce the likelihood of producing non-functional coding regions. Regulation of late gene expression could involve the ssDNA binding region, but presumably nuclear positioning involves the exposed part of the protein chain. Recent work (47) has identified, by immunocoprecipitation, a number of cellular proteins that co-localize with ICP8. Some of these co-localizations are not dependent on mediation by DNA and are involved in (cellular) DNA replication, repair, and recombination. Structurally there are two regions that are likely to be involved in some of these interactions. The first is a deep pocket (Fig. 5B) with conserved aromatic residues (Tyr20, Phe61, Tyr90) at the base and capped by the hydrogen bonding interaction between Glu58 and Arg193. The second is a cleft (Fig. 5C) containing the fully conserved Cys116 and Arg120, both on
2, at the side.
| FOOTNOTES |
|---|
* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
The on-line version of this article (available at http://www.jbc.org) contains supplemental figures. ![]()
Both authors contributed equally to this work. ![]()
Present address: European Institute of Oncology, Via Ripamonti 435, 20141 Milano, Italy. ![]()
¶ To whom correspondence should be addressed. Tel.: 49-40-8990-2129; Fax: 49-40-8990-2149; E-mail: tucker{at}embl-hamburg.de.
1 The abbreviations used are: HSV, herpes simplex virus; ssDNA, single-stranded DNA; SSB, ssDNA-binding protein; OBP, origin-binding protein; SeMet, selenomethionine; MAD, multiwavelength anomalous diffraction; MMA, methyl mercury acetate; OB, oligonucleotide/oligosaccharide binding; HsmtSSB, human mitochondrial SSB. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
I. Manolaridis, E. Mumtsidu, P. Konarev, A. M. Makhov, S. W. Fullerton, A. Sinz, S. Kalkhof, J. E. McGeehan, P. D. Cary, J. D. Griffith, et al. Structural and Biophysical Characterization of the Proteins Interacting with the Herpes Simplex Virus 1 Origin of Replication J. Biol. Chem., June 12, 2009; 284(24): 16343 - 16353. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Molecular and Cellular Proteomics |
| Journal of Lipid Research | ASBMB Today |