|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 33, 23792-23803, August 18, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

1
2
From the
Laboratoire de Biochimie, CNRS, UMR7654, Department of Biology, Ecole Polytechnique, 91128 Palaiseau, France and
Tyndall National Institute, University College Cork, Cork, Ireland
Received for publication, March 27, 2006 , and in revised form, June 12, 2006.
| ABSTRACT |
|---|
|
|
|---|
complex was shown recently to increase the Asp/Asn binding free energy difference, indicating that amino acid discrimination is substrate-assisted. Here, we used molecular dynamics free energy simulations and continuum electrostatic calculations to resolve two related questions. First, we showed that if one of the Mg2+ cations is removed, the Asp/Asn binding specificity is strongly reduced. Second, we computed the relative stabilities of the three-cation complex and the 2-cation complexes. We found that the 3-cation complex is overwhelmingly favored at ordinary magnesium concentrations, so that the protein is protected against the 2-cation state. In the homologous LysRS, the 3-cation complex was also strongly favored, but the third cation did not affect Lys binding. In tRNAbound AspRS, the single remaining Mg2+ cation strongly favored the Asp-adenylate substrate relative to Asn-adenylate. Thus, in addition to their structural and catalytic roles, the Mg2+ cations contribute to specificity in AspRS through long range electrostatic interactions with the Asp side chain in both the pre- and post-adenylation states. | INTRODUCTION |
|---|
|
|
|---|
The 20 aaRSs form two distinct classes of 10 members each (6). Below we have focused mainly on aspartyl-tRNA synthetase (AspRS), one of the best studied aaRSs. AspRS belongs to the aaRS class II, forming a subclass IIb with AsnRS and LysRS. Although aaRSs are generally very amino acid-specific, they have a complex evolutionary history (4, 12), which has led to a remarkable diversity in the modern enzymes. Within class IIb, for example, LysRS is very specific in yeast; but in Escherichia coli, it is more promiscuous (13). In E. coli, AspRS discriminates strongly against Asn, but more weakly against D-Asp (10). Several aaRSs achieve a high fidelity through their editing step. An ambiguous IleRS was constructed recently by deleting the IleRS editing domain (14); the resulting Ile/Val ambiguity actually led to a growth advantage in bacteria. As a last example, many archaebacteria lack AsnRS and produce tRNAAsn-Asn by an indirect route; tRNAAsn is aspartylated by a "nondiscriminating" AspRS (which accepts both tRNAAsp and tRNAAsn), and then the Asp moiety is amidated (1).
The mechanism of the two enzyme reactions, amino acid adenylation and tRNA aminoacylation, is qualitatively understood in both aaRS classes (1). For AspRS, crystal structures are available from several organisms, encompassing the three "kingdoms" of life and the whole reaction pathway: apoenzyme, complexes with Asp alone, ATP alone, aspartyl-adenylate alone (AspAMP) (15-20), and complexes with tRNAAsp present (21-23). Each substrate is recognized by side chains conserved throughout AspRSs (Asp sidechain recognition) or throughout all or most of class II (ATP, Asp backbone recognition). The active site is highly preorganized to receive the Asp and ATP substrates, with the apoenzyme largely superimposable on the various complex structures (22). Each substrate, taken separately, is almost exactly superimposable on the corresponding moiety in the known AspAMP or tRNA·Asp complexes. In all of class II, ATP binds in a very unusual, completely bent conformation, with three associated Mg2+ cations (or two in a few cases; see below). The principal cation coordinates both the reactive
-phosphate and the
-phosphate and is positioned by conserved side chains (24). The other two cations coordinate the
- and
-phosphates on either side of the ATP.
In AspRS, the adenylation reaction can occur in the absence of tRNA. Once Asp and ATP are in place, the Asp backbone reacts with the
-phosphate through an in-line mechanism and a pentacoordinate transition state. Inversion of the
-phosphate is clearly seen when crystals of AspRS·ATP and AspRS·AspAMP are compared (19). As with most enzyme reactions, the exact role of each surrounding group is hard to establish. It is presumed that the principal Mg2+ cation helps activate the
-phosphate by withdrawing electrons and pulling its oxygens into a pentacoordinate geometry, helping to stabilize the transition state through electrostatic interactions. The other two cations neutralize the leaving pyrophosphate product and may also contribute to transition state stabilization.
We have focused here on the role of Mg2+ in Asp/Asn discrimination by AspRS from E. coli. AspRS specificity is a complex problem. Although the enzyme is preorganized, Asp binding does induce structural reorganization in two important regions: (i) a so-called histidine loop (residues 436-449 in E. coli) shifts and becomes more ordered, with His-448 making a hydrogen bond to Asp; (ii) a flexible "flipping loop" (residues 167-173 in E. coli) closes over the aa binding site, bringing the negative Glu-171 close to the Asp ligand. The flipping loop is conserved in eukaryal, eubacterial, and archaebacterial AspRS (17, 18, 23), and a corresponding mobile loop is found in AsnRS (27) and LysRS (28). The exact populations of the open and closed loop states, with and without bound Asp, are unknown. Another difficulty is that the substrates Asp and ATP are both charged. Therefore, both short and long range electrostatic interactions are expected to play a role in both binding and specificity. Amino acid binding might couple to proton binding or release by His-448 or His-449 and to Mg2+ binding or release by ATP.
AspRS specificity has been analyzed with a powerful combination of crystallography, site-directed mutagenesis, kinetic and thermodynamic experiments, and phylogenetic analyses. These methods have limitations, however. Conserved residues may contribute to binding, binding specificity, catalysis, or all three. Experimental assays based on catalytic activity (29, 30) usually become infeasible once a single essential residue is mutated. The strength of electrostatic interactions is very difficult to infer from crystal structures, because the complex dielectric environment within a solvated protein causes large deviations from a simple Coulomb's law (31). Crystallography does not reveal the ionization states of acidic and basic residues, and pKa measurements are difficult for AspRS, which is a homodimer of 1180 residues. Most importantly, weakly populated states are difficult to study experimentally. One extensive mutation study explored changes in the apparent Asp and ATP dissociation constants spanning only 2 orders of magnitude, corresponding to a free energy span of less than 3 kcal/mol (26). Non-cognate complexes like AspRS·Asn could not be analyzed. Weakly populated states are invisible in a crystal structure. Thus, from the AspRS·ATP crystal structures, ATP binds preferentially with three associated Mg2+ cations (17). However, the crystallographic data do not reveal the exact occupancies of the three magnesium sites. A complex with ATP and only two cations, present in the crystal with a population of 10 or 20%, for example, would not have an appreciable effect on the electron density maps and would be impossible to infer from the crystallographic data.
Theoretical methods represent a valuable complementary tool that can resolve these difficulties (32-37). The specificity of Asp binding to AspRS is governed by the binding free energy difference between the cognate Asp and competitor ligands. This difference can be obtained from molecular dynamics free energy simulations (MDFE), which have matured enormously in recent years and have been used to study several aaRSs. Extensive studies (33-35) show that when the AspRS binding pocket is in the "open" state (open flipping loop), there is an enormous preference for Asp over Asn, thanks to a network of electrostatic interactions in the active site. A thermodynamic cycle (see below) was used to obtain binding free energy differences, and a group decomposition of the free energy was used to identity the residues determining amino acid binding specificity.
When the flipping loop closes (17, 23), the negative Glu-171 is brought close to the Asp ligand site. We recently predicted computationally (36) that this conformational change induces proton binding by the nearby His-448. The His-448 positive charge then accounts for most of the large, computed Asp/Asn discrimination. In another long range electrostatic effect, a substrate-assisted specificity was observed: co-binding of ATP increases the Asp/Asn discrimination further. In eukaryotic AspRSs, His-448 is absent, being replaced by an Arg that is more distant from the ligand site. In these organisms, the role of ATP as a mobile discriminator is therefore important, protecting against Asn binding. In the same study, the computational model was tested and validated by experimental measurements of Asp-stimulated pyrophosphate exchange and its inhibition by Asn (36).
The present article focuses on the precise stability of the ATP-associated Mg2+ cations in the AspRS structure and their role in the thermodynamics of Asp and Asn binding. We consider AspRS from both E. coli and the archaebacterium Pyrococcus kodakaraensis. We report free energy simulations that compare Asp and Asn binding to AspRS in the presence of either bound ATP·
or bound ATP·
. The level of Asp/Asn discrimination in each case was computed using two distinct, largely independent methods for the free energy changes. The first method, MDFE, alchemically transforms Asp into Asn during a series of molecular simulations with an explicit solvent representation (33). The second, Poisson-Boltzmann free energies (PBFE), models the ligand binding reactions using a continuum dielectric model of both protein and solvent (35). MDFE was then used to compute the relative affinities of AspRS for ATP·
and ATP·
and to demonstrate that the ATP·
complex has a negligible occupancy and does not play any role in the specificity. In the closely homologous E. coli LysRS, the ATP 3-cation complex is again strongly favored. Binding of the positively charged Lys substrate, however, is not affected by cation binding.
We also considered the post-adenylation, AspRS·tRNA·AspAMP complex. Our simulations predict a strongly bound Mg2+ cation that aids AspAMP recognition. We show that specificity is maintained in the post-adenylation state (11), with AspAMP binding more strongly than AsnAMP in the presence of one Mg2+ cation and tRNAAsp. The Mg2+ cation boosts AspAMP binding specificity, in a long range electrostatic effect similar to that of ATP·
in the preadenylation complex. Thus, the introduction of negative charge into AspRS, by conformational change (36) or tRNA binding, is compensated by histidine protonation (36) and/or cation binding to preserve Asp recognition.
Finally, in the supplemental material we have reported a survey of 238 x-ray structures of protein complexes with ATP or GTP taken from the Protein Data Bank, which sheds additional light on the role of the Mg2+ cations and supports the predicted 3-cation AspRS state. Indeed, we find that ATP binding in a completely bent conformation with three associated Mg2+ cations is a characteristic property of class II aaRSs.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
-carbon of the adenylate ligand of one monomer of the 1IL2 dimer. We overlaid either Asp or Asn on the adenylate molecule and deleted the original ligand. ATP and its associated cations were positioned by taking ATP·
from the 1.9-Å resolution P. kodakaraensis AspRS·ATP complex (Protein Data Bank entry 1B8A) (17) and building it into the 1IL2 structure so as to overlap with the AMP moiety of the original AspAMP ligand. Hydrogens were constructed with ideal stereochemistry. Protonation states of histidines were assigned by visual inspection, except for His-448 and His-449 in the active site, which were assigned earlier through extensive simulations (36). Orientations of His, Asn, and Gln side chains in the active site were taken from the crystal structure and verified by inspection (33). In addition to crystal waters, a 73-Å cubic box of water was overlaid, and waters overlapping the protein were removed. The final model contained the amino acid and ATP ligands, 357 protein residues, and around 10,000 waters, including 113 crystal waters. Periodic boundary conditions were assumed; i.e. the entire 73-Å box was replicated periodically in all directions.
We also generated AspRS·Asp·ATP complexes from the P. kodakaraensis AspRS·ATP complex (17). We built in Asp and Asn ligands by alignment with the E. coli structure (Protein Data Bank entry 1IL2; see above) and generated solvated 24-Å spheres centered on the ligand. We then used pKa calculations (see below) to determine the protonation state of His-223, a histidine residue oriented into the ATP binding pocket. LysRS structures were generated from a 2.1-Å resolution E. coli LysRS·Lys·ATP·
crystal structure (28). We took a 24-Å sphere centered on the
-carbon of the Lys ligand, replaced the Mn2+ cations with Mg2+, and then solvated the system. Histidine protonation states were assigned by visual inspection except for His-270, which points into the LysRS ATP-binding pocket. In both AspRS and LysRS, the protonation state of the histidine oriented into the ATP pocket is coupled to cation binding.
Structures for AspRS with bound AspAMP and tRNA were generated from the 2.4-Å resolution E. coli crystal structure 1C0A (18). Again, we considered a 24-Å sphere centered on the ligand
-carbon and solvated the system. pKa calculations were used to determine the predominant state of His-448, a histidine residue close to AspAMP. The cation associated with the AspAMP ligand phosphate group (15) was placed by structural alignment with the principal cation in the Asp·ATP·
state (17). The cation occupies the space assigned to water molecule 1073 in the x-ray structure (18) and remains in the same octahedral binding mode throughout the simulations, coordinating the Glu-482 and Asp-475 carboxylates and two or three water molecules and remaining 4-5 Å away from the AspAMP phosphate group.
For both AspRS and LysRS, if the entire aaRS protein were included in the model, rather than a spherical subset, a far greater number of solvating waters would be needed. To reduce artifacts due to the protein truncation, protein groups between 20 and 24 Å from the center were harmonically restrained to their positions in the crystal structure. In this way, protein regions beyond 24 Å are accounted for structurally. They will also be accounted for thermodynamically, in a separate step (see below), where the free energy to reintroduce the missing protein groups is computed from a continuum electrostatic model. This hybrid, atomic/continuum approach is accurate, because earlier work on this system showed that both structures and free energies obtained with spherical subsets of 20-, 24-, or 28-Å radii were all similar (37).
All long range electrostatic interactions were computed efficiently by the particle mesh Ewald (PME) method. Four sodium counterions were included to reduce the formal charge of the system. One nanosecond of unrestrained molecular dynamics was performed (for each complex) at constant room temperature and pressure with a Nosé-Hoover algorithm following 200 ps of thermalization. The complexes modeled are E. coli (23) AspRS·Asp·ATP·
, AspRS·Asp·ATP·
, AspRS·Asn·ATP·
, and AspRS·AsnATP·
. 500-ps trajectories were also produced for each amino acid ligand in solution, Asp or Asn, solvated at the center of a box of water molecules, with the ligand
-carbon weakly restrained at the origin throughout the dynamics.
Additional simulations were performed with a less expensive, spherical, continuum reaction field (CRF) (37, 38) model. It included the same protein residues as mentioned above, along with the water molecules inside the 24-Å sphere (about 560 waters). Water and protein outside the 24-Å sphere were treated as a single, homogeneous, dielectric medium with a dielectric constant of 80 (38). Electrostatic interactions between atoms within the sphere were computed without any cutoff, using an efficient multipole approximation for distant groups (39). A multipolar expansion with 20 terms was used to approximate the reaction field due to the surrounding continuum (37, 38). Newtonian dynamics were used for the inner 20 Å of the sphere and Langevin dynamics for the outer region (20-24 Å), with a bath temperature of 293 K. The same four complexes as above were simulated for 3 ns each, as well as the following 10 complexes (3 ns each): AspRS·Asp (or Asn)·ATP·
(or
) complexes from P. kodakaraensis (17); and E. coli (28, 18) LysRS·Lys·ATP·
(or
), AspRS·AspAMP (or AsnAMP)·Mg2+·tRNA, and AspRS·AspAMP (or AsnAMP)·tRNA complexes. The CHARMM22 force field (40) was used for the protein, ligands, and counterions. A slightly modified TIP3P model was used for the water (41). We used the CHARMM program (42), version c30b1, for all calculations.
Free Energy Calculations
The methods used to compute ligand binding free energy differences in AspRS have been described previously (33, 36, 43, 44). For Asp/Asn binding, we use the thermodynamic cycle shown in Fig. 1. We use the horizontal legs for simplified PBFE calculations and the vertical legs for more rigorous, alchemical, MDFE simulations. In the latter, the ligand Asp is reversibly transformed into Asn during a series of simulations; the corresponding work is derived from a thermodynamic integration formula (43). Similarly, to test the stability of the ATP·3-cation complex, the most weakly bound cation (see below) was reversibly transformed into a water molecule (as shown in Fig. 2). Related simulations, using the Poisson-Boltzmann linear response approximation (PB/LRA) method, were used to identify the pKa, and hence the predominant state, of protein histidine residues important for ligand binding. See two recent studies (36, 45) for details of the method.
PBFE ComputationsThe electrostatic contribution to the ligand binding free energy was obtained by subtracting the electrostatic free energy in the ligand·protein complex and in the separate ligand and protein (35). Non-electrostatic contributions to the free energy were assumed to cancel. This should be a good approximation for the Asp/Asn differences, because the two ligands have the same size and bind in the same position (35). The electrostatic potential was obtained by numerically solving the Poisson-Boltzmann equation using a cubic grid and a finite difference algorithm, implemented in CHARMM (grid size, 144 Å; grid spacing, 0.4 Å). The protein·solvent boundary was defined by the molecular surface as in an earlier work (35). PBFE was performed at zero ionic strength. In an earlier study of AspRS (35), the Asp/Asn discrimination computed with zero and physiological ionic strength differed by just 0.3 kcal/mol. The structure of the protein·ligand complex was taken from the MD simulations (above). Given the longer sampling times possible with CRF and the close similarity between the CRF and PME trajectories in terms of binding pocket geometry and flexibility, we generally used structures from CRF simulations. Calculations were performed for multiple structures, sampled every 4 ps along the equilibrated 3 ns trajectory, for a total of 750 structures. The separate ligand and protein structures were obtained by simply discarding the unwanted partner. Thus, structural relaxation on separating protein and ligand was not explicitly included (though it is implicit in the dielectric constant). The solvent dielectric constant was set to 80. The solute dielectric constant was set to 4, based on extensive earlier comparisons between MDFE and PBFE calculations for AspRS (35). Proteins groups outside the 24-Å sphere (above) were neglected. This was a convenient and harmless approximation, because they have been shown to contribute less than 1 kcal/mol to the Asp/Asn binding free energy difference (37). We also used PBFE to estimate the binding strengths of each Mg2+ cation in AspRS·ATP and LysRS·ATP, Lys binding to LysRS, AspAMP/AsnAMP binding to AspRS·tRNA, and cation binding to AspRS·aaAMP·tRNA, with the same setup.
MDFE SimulationsAsp/Asn mutation runs were performed with the PME simulation setup. The ligand side-chain geometry, atom types, and charges were reversibly changed from the Asp values to the Asn values (33-36) over a series of ten 100-ps simulations or "windows."
Having identified the most weakly bound cation from PBFE calculations (see above), we also performed MDFE simulations to determine the stability of the ATP·3-cation complex in AspRS and LysRS, reversibly changing the third cation into a water molecule over a series of MD windows. This procedure was analogous to that used for the Asp/Asn mutations. The energy function can be expressed as a linear mixture of Mg2+ and water terms,
![]() |
where
is a weight or "coupling parameter" and U0 represents interactions between parts of the system other than the hybrid ligand. We gradually mutated Mg2+ into water by changing
from 1 to zero. The successive weights were
= 0.99, 0.95, 0.9, 0.8, 0.6, 0.4, 0.2, 0.1, 0.05, and 0.01 for Mg2+ and 1-
for water. The derivative of the free energy, G, can be written,
![]() |
where the brackets represent a time average over an MD trajectory performed with the energy function U(
). The free energy derivatives were computed at each
value from a 100-ps MD simulation. Each run thus corresponds to 1.0 ns in total.
Equation 2 can be used to perform a group decomposition of the free energies (43, 46). Indeed, the term UMg2+is a sum over interactions between the cation and surrounding amino acids or waters, as is similarly true for Uwater. Thus, the free energy derivative and, ultimately, the free energy can be viewed as a sum of group contributions. Such decompositions have proven useful for identifying the sources of binding affinity (44).
Correction for Distant Parts of the ProteinIn the MD simulations above, protein groups outside of the 24-Å spherical region were only taken into account structurally through the application of harmonic restraints to protein groups near the 24-Å boundary. Their direct contribution to the free energy can be computed in a second step, where they are reversibly introduced back into the system. The free energy for this step can be obtained using a continuum model (as detailed in Refs 37 and 47). A continuum model is appropriate because the groups are more than 24 Å away from the ligands. Refs. 37 and 47 show that the net contribution of this second step to the Asp/Asn binding free energy difference is less than 1 kcal/mol; this is less than the overall uncertainty of both MDFE and PBFE. Therefore, in this work, we did not actually compute the long range correction. We simply approximated it by zero and included its effect in the overall uncertainty estimate. Note that in this work, continuum electrostatics are used in three distinct ways, which should not be confused with each other: 1) in this second, free energy step; 2) for PBFE calculations (see above); and 3) in the MDFE simulations with CRF boundary conditions.
| RESULTS |
|---|
|
|
|---|
or ATP·
and calculations to compare ATP·
and ATP·
binding to both AspRS and LysRS. Finally, we report the calculations to compare AspAMP and AsnAMP binding to AspRS·tRNA in the presence of one or no co-bound Mg2+ cations.
Structure, Dynamics, and Solvation of the Amino Acid, ATP, and tRNA Binding Sites in AspRS
The first set of simulations were done for both Asp and Asn bound to AspRS, co-bound with ATP and either three or two cations. The CRF and PME methods yield very similar binding pocket structures and dynamics. For the AspRS·Asp·ATP·
complex, the r.m.s. deviations from the starting crystal geometry were 0.7 and 1.4 Å for backbone and sidechain atoms, respectively. Fig. 3 illustrates the active site dynamics in a typical series of snapshots from the MD trajectory. Fig. 3 was prepared using Molscript (48) and rendered using Raster3D (49). The cognate Asp ligand makes a stable network of hydrogen bonds to several binding pocket residues, which agree very well with the available crystal structures (17, 18, 23) and previous MD simulations (33-35). From earlier pKa calculations (45), His-448 is doubly protonated (36) and forms a salt bridge to the Asp side-chain carboxylate. The Asp side chain and backbone amino group are also stabilized by H-bonds to Arg-489, Lys-198, a buried water molecule W1, Glu-171 of the closed flipping loop, and Gln-195. The terminal carboxylate forms H-bonds with Arg-217 and, 70% of the time, with Gln-231.
ATP retained its fully bent geometry, characteristic of ATP binding to class II aaRSs (50), throughout the simulations. The active site dynamics shown in Fig. 3 are from the E. coli structure (23), with ATP·
built in from the P. kodakaraensis structure (17) as described above under "Experimental Procedures." Here we focus on the dynamics of the ATP binding site in the P. kodakaraensis AspRS·ATP complex. The principal Mg2+ cation nearest the amino acid (labeled M1 in Table 4) is stabilized by H-bonds to the ATP
- and
-phosphates and also to Asp-475 and Glu-482. Two water molecules, or occasionally three depending on the strength of the M1/
-phosphate interaction, complete the M1 coordination sphere. One of these is a highly ordered water, always bridging M1 and Glu-482 in an interaction secondary to the direct M1-Glu-482 stabilization. Another cation, M2, is positioned between the ATP
-,
-phosphates and Glu-482, with three waters also coordinated. M2 and Glu-482 also have a secondary water-mediated interaction, although here the water is only present
50% of the time, with exchange in water molecules every few hundred ps. The ionized residues Asp-475 and Glu-482 are highly conserved in class II aaRSs; mutagenesis experiments indicate that both are functionally irreplaceable in AspRS (24).
|
-,
-phosphates but does not bind to protein. M3 remains on the more solvent-exposed side of the binding pocket, completing its octahedral coordination sphere with four water molecules. The coordinating waters have an average r.m.s. fluctuation of just 0.4 Å. The r.m.s. fluctuations of each of the cations are: M1 = 0.24 Å; M2 = 0.26 Å; M3 = 0.32 Å. Thus, M3 is predicted to have a crystallographic B-factor that is larger than M1 and M2, in agreement with the AspRS·ATP crystal structure (17). M3 has
55 water molecules within a 9-Å sphere, compared with 40 for the more strongly bound cations, 25 for the amino acid ligand backbone ammonium group, and 100 for bulk water. Finally, the ATP
-phosphate oxygen not pointing directly to either M2 or M3 has water-mediated interactions with M2 and M3 through the water molecules completing the cation coordination spheres. For M2, there is an exchange in water molecules on the sub-100-ps scale, so that only the water associated with M1 and Glu-482 above and the four waters coordinated to M3 are truly ordered. When Asn replaces Asp in the amino acid binding site, earlier free energy simulations showed that His-448 loses its labile proton and becomes neutral (36). In other respects, the structure is very similar to the Asp complex. Lifetimes of H-bonds stabilizing the Asn backbone groups are reduced by around 10% compared with the Asp complex, whereas the Asn side-chain carbonyl binds strongly to Arg-489 but not to Lys-198. Overall, the mean number of H-bonds between the amino acid and AspRS is reduced from 10 (Asp complex) to 7 (Asn complex).
When the third cation in the ATP 3-cation complex (M3) is removed, neither ATP nor its binding pocket undergoes significant displacements. The r.m.s. deviations of ATP and its immediate environment from the starting geometry are 1.7 and 1.1 Å, respectively, similar to those seen for the 3-cation complex above (1.4 and 1.2 Å, respectively). Also, the terminal oxygens of the aa ligand remain within 3.5-4.9 Å of the ATP
-phosphorus in both the 2-cation and 3-cation systems, similar to what was observed in HisRS simulations (51). Overall, the MD structures of the 2- and 3-cation complexes have a comparable agreement with the available x-ray structures. This confirms that in the AspRS·ATP crystal structure, if a 2-cation complex were present in 10 or 20% of the unit cells, it would not have a noticeable effect on the observed electron density and could not be detected. Therefore, to determine the probability of a 2-cation complex, structural data is not enough; free energies must be computed. Finally, we note that the pKa calculations (45) reported in Table 1 indicate that cation binding is coupled to nearby histidine charging. The ATP binding pockets of both AspRS and LysRS feature a histidine residue pointing toward the ATP
-phosphate (17, 28). The structure and dynamics of the LysRS substrate binding pocket will be described in detail elsewhere. This histidine is neutral in the presence of ATP·
but can gain a positive charge when ATP·
is artificially changed to ATP·
(Table 1). As shown in the energetic analysis below, this histidine charging does not fully compensate for the cost of unbinding the third cation, so that the state with three cations is strongly favored and always present.
|
Asp/Asn Discrimination by AspRS in the Presence of ATP and Either Three or Two Mg2+ Cations
We first compared the binding free energies of Asp and Asn in the presence of ATP with three associated cations. We compared MDFE results (some of them obtained earlier (36)) with the PBFE results obtained here. The data are summarized in Table 2. Each MDFE 
G value was computed from two solution simulations and four protein simulations (of ten 100-ps windows each; see "Experimental Procedures"). In solution, one run was performed in each direction, mutating Asp to Asn or the reverse, yielding
G1 (see Fig. 1). In the protein, two runs were performed in each direction, yielding
G4 (Fig. 1). When ATP·
is co-bound, 
G =+19 kcal/mol, favoring Asp. This decreases to +9 kcal/mol when His-448 is artificially maintained in the singly protonated state (36). Notice that His-448 is absent from most eukaryotic AspRSs, being replaced by an Arg that is more distant from the ligand site. PBFE free energy values are obtained from the vertical legs of the thermodynamic cycle in Fig. 1. In the three most important states for which both MDFE and PBFE simulations were performed, the PBFE estimates match the MDFE values to within ±2 kcal/mol. This is comparable with the uncertainty in the MDFE values themselves.
|
|

G = +5 kcal/mol, compared with 9 kcal/mol with ATP·
(Table 2). Agreement between MDFE and PBFE is good. Thus, even though the third Mg2+ cation is 11 Å away from the
-carbon of the ligand side chain, it interacts strongly with the amino acid ligand and strongly favors the negative Asp over neutral Asn. Consistent with this finding, PBFE estimates of amino acid binding strengths in Table 3 show that removing the third cation strongly reduces Asp binding to AspRS. The results for LysRS are also shown in Table 3. We see that Lys binding to LysRS is not sensitive to the number of Mg2+ cations present, despite its positive charge and in contrast to the large effect of the cations in AspRS. This is shown by the near equality of the Lys binding free energies for the LysRS·Lys·ATP·3-cation complex (with either neutral or charged His-270) and for the 2-cation complex (with either neutral or charged His-270). In the next section, we show that in both AspRS and LysRS, ATP always binds with all three cations.
|
Complex in AspRS and LysRS
From Table 4, it is clear that M3 is the least stable cation in all the complexes. In AspRS, the order of cation binding strengths is M2 > M1 > M3, irrespective of the amino acid ligand. M3 is on the solvent-exposed side of ATP. Its r.m.s. fluctuations are about 25% larger than those of M1 and M2. Of the three cations, M1, coordinating the ATP
,
-phosphates Asp-475 and Glu-482, is the most strongly stabilized by Asp rather than Asn binding. Its binding free energy is 19 kcal/mol lower with bound Asp than with bound Asn. M1 is the cation closest to the amino acid, and hence it is the most strongly affected by the net charge of the amino acid ligand. M2 is the most tightly bound cation, coordinating the
- and
-phosphates and Glu-482. M2 is furthest from the amino acid binding site, and hence the least affected by the amino acid identity; its binding free energy is 9 kcal/mol lower with bound Asp than with bound Asn. M3, the most weakly bound cation, coordinates the
- and
-phosphates but not protein. M3 is intermediate in distance from the amino acid pocket and intermediate in its energetics; its binding free energy is 15 kcal/mol lower with bound Asp than with bound Asn. This situation can be compared with the LysRS case. Table 4 shows clearly that in LysRS, M3 is again the most weakly bound cation. M1 and M2 have similar binding strengths; both are strongly stabilized by nearby LysRS glutamate residues.
Relative Stabilities of the ATP·
and ATP·
Complexes in Alchemical MDFE SimulationsATP co-binding along with three divalent cations is supported but not proved by the AspRS·ATP and LysRS·ATP crystal structures (17, 28). Indeed, the crystallographic data cannot rule out a partial occupancy for the third cation. In other words, the third cation could be present in fewer than 100% of the crystal cells. From the data presented above, a partial occupancy, even as high as 80 or 90%, would have a significant effect on the Asp/Asn discrimination in AspRS. Therefore, we performed alchemical free energy simulations in which the third Mg2+ cation, M3, was reversibly transformed into a water molecule, both in the enzyme and alone in solution. This cation was chosen because it is the most weakly bound of the three (Table 4). The data were analyzed with the thermodynamic cycle in Fig. 2. The double free energy difference 
G represents the standard free energy for binding the third cation to the preformed AspRS·aa·ATP·
complex.
|

G (36 kcal/mol). Thus, even though the precision of the MDFE data is low, the predicted Mg2+ binding trends are very strong and can be taken as qualitatively accurate.
|
LysRS+·Lys·ATP·
LysRS·Lys·ATP·
+ H+. Notice that the estimated uncertainty in the computed protonation free energy is moderate (Table 1). Similarly, for AspRS, we consider the process Mg2+ + AspRS+·Asp·ATP·
Mg2+ + AspRS·Asp·ATP·
+ H+
AspRS·Asp·ATP·
+ H+. The total binding free energy is +10-78 =-68 kcal/mol. Finally, the four AspRS·aa·ATP·
systems (aa = Asp or Asn; n = 2 or 3) form a thermodynamic cycle, allowing us to infer also the free energy for binding the third cation to the AspRS·Asn·ATP·
complex. Given that binding of the third cation in AspRS·Asp·ATP is favored by 68 kcal/mol, the third cation in AspRS·Asn·ATP is favored by 59 kcal/mol.
The concentration of Mg2+ in the cell is on the order of 1 mM (52). The free energy changes reported above correspond to a standard state of 1 M Mg2+. Thus, at physiological Mg2+ concentrations, the binding free energy is at most 3-5 kcal/mol lower than the standard state value. Overall, for both Asp and Asn, the third cation is much more stable bound to AspRS than alone in solution, and so ATP always binds to AspRS with all three associated Mg2+ cations; the Boltzmann probability of the 2-cation state is infinitesimal. The same qualitative result is found for LysRS; the 3-cation complex is favored by 52 kcal/mol in the standard state and by at least 47 kcal/mol at cellular concentrations. It is easy to show, using the data in Tables 1 and 5, that states in which the aaRS·aa·ATP complex binds only one or no Mg2+ cations are negligibly populated compared with even the aaRS·aa·ATP·
states. Note that Mg2+ competition with monovalent ions is also insignificant, because the most abundant ion, potassium, has a concentration of about 150 mM. The standard binding free energy of K+ can be estimated from electrostatic considerations to be weak, on the order of half that of the third Mg2+ cation. Thus, mixed complexes such as ATP·
·K+ are expected to be negligibly populated in both AspRS and LysRS.
To identify the most important interactions stabilizing the third cation, we performed a group decomposition of the computed binding free energy based on Equation 2 (46, 43). The resulting free energy components are given in Table 6. A very large, favorable contribution (+367 kcal/mol) comes from the ATP·
moiety itself, even though it is electrically neutral. Another +126 kcal/mol come from the conserved Glu-482/Asp-475 side chains, even though these are 6/9 Å away and are primarily involved in coordinating the other two cations. The contribution of solvent is small at -73 kcal/mol (opposing binding). The solvent contribution can be interpreted as a dielectric shielding of the negative charges that favor Mg2+ binding. Other protein groups make much smaller contributions. For example, the net contribution of the nearby salt bridge pair, Glu-219/Arg-225, is just +11 kcal/mol, and nearby Glu-171 and Arg-217 contribute just +14 kcal/mol. The Asp ligand contributes +45 kcal/mol. Overall, M3 binding is favored largely by ATP·
, the conserved Glu-482/Asp-475 pair, and the Asp ligand. On balance, these groups stabilize cation·protein binding compared with cation solvation in bulk water (Tables 5 and 6).
|
complexes (Table 4). PBFE calculations reported in Table 2 show that this cation is crucial for AspAMP/AsnAMP discrimination. For AspRS with the co-bound cation and a positively charged His-448, we computed an AspAMP/AsnAMP binding free energy difference 
G of +12 kcal/mol in favor of AspAMP. pKa calculations (Table 1) indicate a preference for neutral His-448 when AsnAMP replaces AspAMP, with a 3 kcal/mol free energy gain when His-448 becomes neutral. We therefore subtracted this 3 kcal/mol (Table 1) from our PBFE 
G, giving a corrected 
G of +9 kcal/mol in favor of AspAMP (Table 2). Artificially removing the cation reduces the AspAMP/AsnAMP difference strongly, with a 
G of just +4 kcal/mol. Interestingly, if one keeps the cation but artificially switches His-448 to its neutral state, one observes the same net effect, with a 
G of just +4 kcal/mol. This offers a further illustration of the coupling between cations in the ATP/AMP binding site and the ionizable His-448, near the amino acid side chain. | DISCUSSION |
|---|
|
|
|---|
For the crucial, long range, electrostatic interactions, we used two recent, sophisticated methods, including one that combines an atomic representation of groups near the active site with a continuum representation of distant groups (37, 47). The simplified treatment of distant groups is quite accurate. Indeed, calculations that use atomic regions of different sizes give very similar free energy changes. We note that although substrate interactions with distant groups have been observed experimentally and discussed extensively, they are weak compared with the effects of interest here. For example, AspRS is a homodimer, and a certain cooperativity is found for Asp and ATP binding to the two active sites (53). But the two binding constants for ATP differ by a factor of only 10, corresponding to 1.4 kcal/mol of binding free energy; Mg2+ effects are much larger.
To obtain thermodynamic information, we did MDFE simulations. MDFE has given reasonable accuracy for several proteins where direct experimental comparisons were possible; see recent work on redox potentials (54), side-chain pKa shifts (55), and protein·protein binding (56, 57). Its validity for AspRS was tested earlier by comparison with experimental measurements of Asp-stimulated pyrophosphate exchange (36). We made systematic comparisons with PBFE, an independent method that has been extensively calibrated and tested for AspRS, and found good agreement. We also explored the sensitivity of our results to system details by studying several AspRS variants: E. coli AspRS with and without tRNA and P. kodakaraensis AspRS. Considering the different amino acid ligands and the numbers of Mg2+ cations and His protonation states, we simulated 24 different systems for over 75 ns, which is several times the total simulation length in related studies (54, 55, 56).
Several predictions could be tested experimentally, in principle. For LysRS, one could measure the effect of Mg2+ on the Lys affinity (predicted here to be small) using an established fluorescence assay (58). For both AspRS and LysRS, we predicted that ATP is always bound with three Mg2+ cations, so that the rate constant, kcat, for the chemical adenylation step should be independent of Mg2+ concentration. We look forward to these predictions being tested by others.
Complexity of the Amino Acid Binding ReactionAspRS·aa recognition is especially complex because it involves "hidden" reorganization, substrate-assisted specificity, and long range interactions. Indeed, the binding pocket is largely preorganized, and rearrangement upon aa binding is mostly limited to the flipping loop, which brings its negative Glu-171 into the pocket. However, a hidden reorganization also occurs, with His-448 binding a proton to oppose the Glu-171 charge. This labile proton favors Asp binding and helps discriminate against Asn.
ATP·
acts as another mobile discriminator, so that AspRS specificity is substrate-assisted. The most distant Mg2+ cation contributes 4 kcal/mol to the Asp/Asn binding free energy difference. The strength of the coupling between the aa side chain and this cation is remarkable, given that they are 11 Å apart. We note that analogous couplings have been observed between aa and ATP binding in other aaRSs, e.g. ThrRS (59) and MetRS (60). But the coupling here was significantly larger.
|
(but not ATP·
) binds with Asn instead of Asp, His-448 will revert to its neutral form. Similarly, His-254 is predicted to be neutral when the third Mg2+ is present and positive when it is absent. Mg2+ Binding Is Governed by Charge Balance and Long Range InteractionsThe ATP phosphate groups are deprotonated at physiological pH, so that ATP interacts readily with cations. In solution, ATP usually binds one divalent cation. When complexed to a protein, it usually binds one or two (25). Yet our simulations show that the 3-cation form is overwhelmingly favored in AspRS. Given the role of the third cation, M3, in AspRS specificity, we want to understand its stability.
ATP binds preferentially to all 10 class II aaRSs with either three divalent cations or two cations and a positive protein side chain (see the data base analysis in the supplemental material). The fully bent, U-shaped geometry of ATP in AspRS is also characteristic of class II aaRSs and might be thought to play a role in M3 stabilization. Excluding aaRSs, only two of the 238 nonredundant NTP·protein complexes in the Protein Data Bank exhibit such a fully bent form (see supplemental material).
The most obvious M3 interactions (Fig. 3) are with the ATP
- and
-phosphates and surrounding waters, and ATP·Mg
does make a very large contribution to M3 binding (+367 kcal/mol; Table 6). But these groups are not unique to AspRS or aaRSs, so that the stability of M3 is something of a puzzle. In fact, longer range interactions are at play. These were identified by the group decomposition of the binding free energy. A large contribution (+126 kcal/mol) comes from the conserved Glu-482 and Asp-475 side chains, 6 and 9 Å away from M3, respectively. Their negative charges, along with the four negative phosphate charges, exactly balance the charges of the Mg2+ cations. A large, unfavorable contribution (-73 kcal/mol) arises from solvent, corresponding to a dielectric shielding of the negative charges in the binding pocket. Finally, the Asp ligand itself contributes +45 kcal/mol, another example of long range coupling in the AspRS active site.
Mg2+ Contributes to AspRS Specificity in Synergy with Other Active Site GroupsThis and earlier studies show that AspRS amino acid specificity arises from a complex network of electrostatic interactions involving conserved side chains and other factors. We have focused here on ATP and its associated cations found to play an important role. The simulations show that if the weakest bound Mg2+ cation is removed, the Asp/Asn discrimination is strongly reduced. The binding free energy difference 
G drops from 9 ± 2to5 ± 2 kcal/mol. The Asp and Asn dissociation constants then differ by a factor of between 10-2.2 and 10-5.1. The range of values reflects the uncertainty in the MDFE estimation. If the third cation were absent 10% of the time, and we averaged correspondingly over the 2- and 3-cation states, the average Asp and Asn dissociation constants would differ by a factor of between 10-3.2 and 10-6.1. This is roughly comparable with the average error rate in protein synthesis, 0.03% = 10-3.5.
This estimate assumes that His-448 is present. In fact, His-448 is conserved in prokaryotes but absent in eukaryotes. Fortunately, we found that AspRS is, in fact, protected against the 2-cation state, which is negligibly populated in all cases. By eliminating the 2-cation form and always binding ATP·
, AspRS ensures a large Asp/Asn discrimination even in eukaryotes, where His-448 is absent.
The Mg2+ cations and the conserved side chains that coordinate them are required for AspRS activity, and the cations play both a structural and a catalytic role. It seems economical that the enzyme should also use them for a third purpose, to help ensure substrate specificity. In contrast, the evolutionarily related LysRS also binds ATP only in the 3-cation form, but artificially removing the third cation does not affect binding of the (positively charged) Lys substrate.
Fig. 4 summarizes the substrate binding specificity in AspRS as determined from the present simulations and earlier studies. Many different states are potentially available, the most important of which are shown in Fig. 4. Some of them produce a low Asp/Asn specificity, and these states are invariably unstable and negligibly populated. Overall, a delicate interplay between conformational shifts, ATP·
binding, and histidine charging allows AspRS to combine a moderate Asp binding affinity with a very strong Asp/Asn binding specificity. It is striking that different steps in the aminoacylation process feature a similar charge balance in the Asp binding pocket, as reflected in the computed Asp/Asn binding free energy differences. For example, both loop closure in the preadenylation step and tRNA binding in the post-adenylation step trigger His-448 charging, and both ATP and AspAMP have very strongly bound cations. These Mg2+ cations act as mobile discriminators in AspRS. In tandem with His-448 charging, they compensate electrostatically for the Asp binding penalties associated with loop closure and tRNA binding.
|