|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 279, Issue 29, 30634-30642, July 16, 2004
Crystal Structure of PapA5, a Phthiocerol Dimycocerosyl Transferase from Mycobacterium tuberculosis*![]() ![]() ![]() ![]() ![]()
From the
Received for publication, April 12, 2004 , and in revised form, April 26, 2004.
Polyketide-associated protein A5 (PapA5) is an acyltransferase that is involved in production of phthiocerol and phthiodiolone dimycocerosate esters, a class of virulence-enhancing lipids produced by Mycobacterium tuberculosis. Structural analysis of PapA5 at 2.75-Å resolution reveals a two-domain structure that shares unexpected similarity to structures of chloramphenicol acetyltransferase, dihydrolipoyl transacetylase, carnitine acetyltransferase, and VibH, a non-ribosomal peptide synthesis condensation enzyme. The PapA5 active site includes conserved histidine and aspartic acid residues that are critical to PapA5 acyltransferase activity. PapA5 catalyzes acyl transfer reactions on model substrates that contain long aliphatic carbon chains, and two hydrophobic channels were observed linking the PapA5 surface to the active site with properties consistent with these biochemical activities and substrate preferences. An additional helix not observed in other acyltransferase structures blocks the putative entrance into the PapA5 active site, indicating that conformational changes may be associated with PapA5 activity. PapA5 represents the first structure solved for a protein involved in polyketide synthesis in Mycobacteria.
Mycobacterium tuberculosis produces polyketides, a complex family of lipids (1) that includes compounds associated with mycobacterial virulence. Up to 24 genes are predicted to encode proteins with polyketide synthase activity in the M. tuberculosis genome (H37Rv) (2). Although several of these and associated genes have been directly implicated with Mycobacterium virulence (311), several others remain less well characterized. One such group includes a family of five polyketide-associated proteins (Paps)1 that were suspected to encode activities associated with polyketide biosynthesis or transport (1215). Additional Pap orthologs have been uncovered in polyketide synthase loci in Mycobacterium leprae, Mycobacterium bovis, and other Mycobacterium species (16), suggesting that Paps may contribute activities to conserved pathways across Mycobacterium species.
Phthiocerol dimycocerosate esters and their congeners, otherwise known as PDIMs, comprise a polyketide family that has been shown to be directly involved in mycobacterial virulence (3, 4). M. tuberculosis mutants deficient in PDIM production are attenuated in mice and PDIMs produced by M. leprae promote Schwann cell tropism (17, 18). PDIM biosynthesis has been proposed to involve the activities of at least three polyketide synthase gene systems. The first and second systems include ppsA-E and mas, genes involved in phthiocerol/phthiodiolone synthesis and mycocerosic acid synthesis, respectively (10). The third includes pks15/1, a gene that has been associated with the incorporation of the phenolic group into mycoside PDIM variants (19), and possibly other pks genes to produce early biosynthetic precursors (6, 8, 21). The genes associated with diesterification of phthiocerol and phthiodiolone to mycocerosic acid have remained unclear, but recent progress has been made through the genetic and functional characterization of M. tuberculosis PapA5, a Pap family member located within the PDIM synthesis gene cluster (16). Deletion and complementation of the gene encoding PapA5 indicated that PapA5 was essential for PDIM production in M. tuberculosis, and although the involvement of PapA5 in the diesterification of phthiocerol and phthiodiolone could not be tested directly due to the unavailability of these compounds, PapA5 was assayed for CoA-dependent acyltransferase activities in an effort to define PapA5 substrate specificities for a variety of model lipid compounds. Taken together, these results suggested that papA5 encoded a protein capable of catalyzing acyl transfer chemistry. Although protein sequences of Pap family members include several amino acid motifs associated with other proteins that catalyze acyltransferase activities, the Pap family shares little sequence identity with acyltransferases outside of these regions. To elucidate structure-activity relationships for the Pap family of proteins, we determined the 2.75-Å crystal structure of PapA5 from M. tuberculosis. The structure reveals that PapA5 is related to the family of CoA-dependent acyltransferases. Further structural analysis combined with previously reported acyltransferase activities on defined lipid substrates suggests a model for PapA5 function.
Purification of M. tuberculosis PapA5PapA5-(1422) was cloned, expressed, and purified from Escherichia coli as a N-terminal His6-Smt3 fusion protein (22). The pET-based plasmid was transformed into E. coli BL21(DE3) CodonPlus RIL (Stratagene). A 5-liter culture was grown by fermentation at 37 °C to an A600 of 2, adjusted to 30 °C and 1 mM isopropyl-1-thio- -D-galactopyranoside, and incubated for 4 h. Cells were harvested by centrifugation and resuspended in 20 mM Tris-HCl (pH 8.0), 350 mM NaCl, 10 mM imidazole, 20% sucrose, 1 mM -mercaptoethanol, and 20 µg/ml lysozyme and sonicated. After insoluble material was removed by centrifugation, His6-Smt3-PapA5 was purified by metal affinity and gel filtration chromatography (Superdex 200). The His-Smt3 tag was removed by the Smt3-specific protease Ulp1, and PapA5 was further purified by gel filtration (Superdex 75). PapA5 was obtained at 10 mg/L of E. coli culture and appeared homogeneous by SDS-PAGE and Coommassie Blue staining. PapA5 was concentrated to 10 mg/ml, flash-frozen in liquid nitrogen, and stored at -80 °C.
Crystallographic AnalysisPapA5 crystals were obtained by vapor diffusion against a well solution containing 510% polyethylene glycol 4000, 0.2 M ammonium acetate, 5% glycerol, 0.1 M sodium acetate (pH 5), and 20 mM dithiothreitol. Crystals were cryo-protected by addition of 15% glycerol. Crystals of native protein diffracted X-rays to 2.5 Å, although data were only processed to 2.75 Å due to problems associated with crystal mosaicity (P3121 a = b = 172.98 Å, c = 80.54 Å,
Overview of the PapA5 StructurePurified recombinant PapA5 containing amino acids 1422 was purified and crystallized. Crystals of PapA5 belonged to space group P3121 and contained two independent monomers per asymmetric unit. Phases were calculated to 2.75 Å using 2-fold NCS averaging with data obtained from a native protein crystal and a native crystal into which thimerosal was soaked (see "Experimental Procedures"; Table I). Both PapA5 monomers contain segments of polypeptide that did not have sufficient electron density to permit model building. These regions include residues 8293, 176180, 192204, and 419422 for monomer A and 12, 8293, 176180, and 419422 for monomer B. The overall average Bfactor for the final coordinate set was 68 Å2. The termini that demarcate the disordered segments (marked by asterisks in Fig. 1) had Bfactor values of nearly twice that value, suggesting that segments not observed in the electron density maps were due to thermal motion in the crystal lattice. Despite the high overall Bfactor and disordered segments, the final model was successfully refined without NCS restraints to an Rfactor of 23.6 and Rfree of 29.5 with excellent geometry and no outliers in the Ramachandran plot. Monomer B will be referred to in subsequent discussions as few differences were observed between monomers, and monomer B contained a larger number of ordered amino acids.
The PapA5 structure can most easily be described by dividing the protein into two domains (Fig. 1). Domain 1 is composed of secondary structural elements that include -strands 18 and 13 and -helices A through D. Domain 2 includes -strains 912 and 1415 and -helices E through I. Domains 1 and 2 are self-contained with a few noted exceptions. Domain 1 includes 13, a strand that emanates from domain 2 to complete the four-stranded anti-parallel -sheet in domain 1 ( 6, 7, 2, and 13). In addition, a loop from domain 2 between -strands 10 and 11 extends into domain 1 and contacts portions of helix C and D. A large crossover loop is also observed between helix D and -strand 8 that spans nearly 50 Å between the two domains. Monomer B amino acids 192204 within the crossover loop have Bfactor values nearly twice that of the average model Bfactor, while the same region in monomer A is disordered and not present in the electron density maps. Despite these interdomain contacts, the connectivity between domains would not restrict movements of domain 1 and 2 with respect to one another. Domains 1 and 2 are structurally related and can be aligned to within 4.1 Å r.m.s.d. over 104 amino acids with 7% sequence identity. While similar, domain 1 contains the only known catalytic amino acid residues (His124 and Asp128) that have been directly implicated in PapA5 activity (16). His124 and Asp128 are located between strand 7 and helix C in the interface between domains 1 and 2 (Figs. 1 and 2).
PapA5 Is Related to CoA-dependent AcyltransferasesA structural alignment between PapA5 and the Protein Data Bank using DALI shows that PapA5 contains structural and sequence motifs that are characteristic of the CoA-dependent acyltransferase family (Fig. 2) (28). In rank order, proteins that could be aligned to PapA5 include the condensation domain from VibH (Protein Data Bank code 1l5a [PDB] ; 336 amino acids aligned to 4.0 Å r.m.s.d. with 11% sequence identity; Z-score 23.1), carnitine acetyltransferase (Protein Data Bank code 1ndf [PDB] ; 285 amino acids aligned to 3.8 Å r.m.s.d. with 10% sequence identity; Z-score 12.1), chloramphenicol acetyltransferase (CAT) (Protein Data Bank code 3cla [PDB] ; 127 amino acids aligned to 3.9 Å r.m.s.d. with 12% sequence identity; Z-score 6.9), and dihydrolipoyl transacetylase (Protein Data Bank code 1eaf [PDB] ; 108 amino acids aligned to 2.9 Å r.m.s.d. with 17% sequence identity; Z-score 5.6). As stated above, domains 1 and 2 from Papa5 are related to each other, so each can be aligned to a single protomer from chloramphenicol acetyltransferase. In addition, PapA5 domains 1 and 2 are oriented in a similar manner to that observed for two of the three chloramphenicol acetyltransferase protomers as observed in the intact chloramphenicol acetyltransferase trimer (29). Although structural alignments show PapA5 domain 2 to be more similar based on the number of amino acids that could be aligned, the alignments to domain 1 reveal higher sequence identity between structures, including the known HHX3DG catalytic amino acid motif that is conserved and observed in many CoA-dependent acyltransferase family members (Fig. 2).
Chloramphenicol acetyltransferase catalyzes CoA-dependent acetyl transfer in a reaction that is dependent on the second conserved histidine (His195) in the active site HHX3DG sequence motif (29). His195 has been proposed to be a general base that promotes deprotonation of the chloramphenicol hydroxyl prior to the transfer of the acetyl group, and mutation of this residue results in a severely defective enzyme. While the aspartic acid has also been shown to be critical for activity, it appears to play a structural role in the organization of the active site. His124 and Asp128 are the corresponding histidine and aspartic acid in the PapA5 sequence and structure (Figs. 2 and 3), and as observed for chloramphenicol acetyltransferase, His124 is essential for PapA5 CoA-dependent acyltransferase activity (see below; Ref. 16). Although conserved, the second histidine in the HHX3DG motif does not perform similar catalytic roles in all CAT family members insomuch as mutation of this residue in the yeast dihydrolipoyl transacetylase results in no observable catalytic defect (30).
PapA5 exhibits more structural similarity to VibH and carnitine acetyltransferase, since PapA5, VibH, and carnitine acetyltransferase each contain two tandem CAT-like domains with one active site located within the N-terminal CAT-like domain (31, 32). Carnitine acetyltransferase catalyzes the acyl transfer between carnitine and acetyl-CoA utilizing a similar mechanism to that proposed for CAT. In addition, the structural analysis of carnitine acetyltransferase and respective ligand complexes combined with previous biochemical data support a general base mechanism for His343 in deprotonation of the carnitine hydroxyl group prior to acyl transfer (32). VibH belongs to a large family of condensation domains that are associated with non-ribosomal peptide synthetases, large multidomain enzymes that catalyze the synthesis of a number of compounds such as antibiotics and virulence factors (3335). VibH catalyzes amide bond formation in the synthesis of vibriobactin (36, 37). VibH also contains a conserved HHX3DG catalytic amino acid motif, although structural and mutational analysis revealed that VibH utilizes a mechanism distinct from CAT and carnitine acetyltransferase in that mutation of the second histidine did not result in severe catalytic defects but mutation of the Asp residue did. These data suggest that the tandem CAT-like domain architecture can be utilized in alternative ways to achieve a variety of chemical reactions. CoA-dependent acyltransferase family members have been categorized on the structural level in the SCOP data base by virtue of their oligomeric state (38). Some CAT family members such as chloramphenicol acetyltransferase and dihydrolipoyl transacetylase are oligomeric and form their respective active sites in the intersubunit interfaces between protomers. Carnitine acetyltransferase and VibH are monomeric, but contain two tandem CAT-like domains. In both instances, VibH and carnitine acetyltransferase share similarity with the CAT intersubunit active site organization insomuch as each has its active site positioned within the interface between the two tandem CAT-like domains. PapA5 also contains two CAT-like domains and is organized in a similar manner to that observed for VibH and carnitine acetyltransferase (Fig. 1). Analysis of the aligned structures for PapA5 and carnitine acetyltransferase revealed similar locations for the active site histidine and aspartic acid residues located within domain 1 between domains 1 and 2 (Fig. 4). The carnitine acetyltransferase structure utilized for this alignment also included the substrate carnitine (32), and comparison of the PapA5 and carnitine acetyltransferase active site clefts revealed a deep substrate cleft for carnitine acetyltransferase (Fig. 4D), whereas the analogous PapA5 cleft is occluded by helix H (Fig. 4C). Helix H is unique to the Pap family (Fig. 2), and its potential role in substrate coordination is discussed further below.
PapA5 Active Site and Substrate SelectivityPapA5 has been proposed to catalyze the diesterification of phthiocerol and phthiodiolone with mycocerosate, possibly through a mechanism that is dependent on the activation of mycocerosic acids as thioesters (16). The possible mechanisms by which PapA5 might participate in this reaction were previously explored by measuring PapA5 acyltransferase activity using palmitoyl-CoA and several model substrates that included short-, medium-, and long-chain alcohols, diols, hydroxy esters, acids, amines, and thiols (16). A subset of those substrates tested is depicted in Fig. 5. PapA5 exhibited a preference for saturated medium chain alcohols in reactions that were dependent on the presence of amino acid residues His124 and Asp128, suggesting that PapA5 utilizes a similar acyl transfer mechanism to that observed for several CAT family members including carnitine acetyltransferase.
While CAT and dihydrolipoyl transacetylase utilize intersubunit interfaces to interact with respective ligands, large solvent-exposed channels were observed between the two tandem CAT-like domains in both VibH and carnitine acetyltransferase (Fig. 4, C and D) (31, 32). To gain insight into how PapA5 might organize its active site with respect to these other CAT family members, the PapA5 active site was superimposed to crystal structures of CAT, dihydrolipoyl transacetylase, and carnitine acetyltransferase to enable modeling of chloramphenicol (29), carnitine (32), and CoA (32, 39) into the PapA5 active site (Fig. 3B). Only CoA and chloramphenicol are depicted in Fig. 3B, since the ligand positions observed in carnitine acetyltransferase superimpose well to a first approximation with those ligands observed in CAT and dihydrolipoyl transacetylase (32).
A similar modeling exercise was undertaken using VibH and respective ligands from CAT and dihydrolipoyl transacetylase (31). These alignments revealed both ligand binding sites to be accessible to solvent within the VibH structure, suggesting that VibH utilizes similar surfaces and substrate clefts to interact with its substrates. Experimental structures of carnitine acetyltransferase in complex with carnitine and CoA also showed a similar arrangement of ligands, suggesting that it too utilizes the same solvent-exposed channels to bind and coordinate respective ligands (32). Inspection of the modeled ligands in the PapA5 active site shows the CoA ligand coordinated between The modeled positions of carnitine and chloramphenicol within the PapA5 active site show the respective hydroxyl moieties directly over catalytic His124 in PapA5, suggesting that the basic mechanisms employed in CAT activity are likely conserved in the PapA5 structure (Fig. 3B, carnitine shown in Fig. 4). Despite proper placement of the hydroxyl group, both carnitine and chloramphenicol encounter significant steric clashes with amino acid residues emanating from helix H, namely Phe327 and Phe331 (Fig. 3B). Helix H is unique to PapA5 and is not observed in either VibH, carnitine acetyltransferase, or other CAT family members (Fig. 2). Helix H effectively blocks access to one of the solvent exposed channels into the active sites observed in either VibH or carnitine acetyltransferase (Figs. 1, 3, and 4).
Further inspection of the PapA5 molecular surface reveals two channels that lead into the PapA5 active site (Fig. 6). Channel 1 is
The preferred substrate lengths exhibited by PapA5 in vitro suggest a possible mode of interaction with phthiocerol and phthiodiolone, the proposed in vivo substrates for PapA5 (Fig. 5). Phthiocerol and phthiodiolone include two hydroxyl moieties located at positions 9 and 11 along the aliphatic chain (Fig. 5C). The approximate length (911 carbon units) of the remaining aliphatic chain is architecturally similar in many respects to that observed for several of the preferred model substrates, namely octanol (Fig. 5, compare A and C). If analogous to carnitine acetyltransferase and chloramphenicol acetyltransferase, channel 2 would provide the most likely binding site for octanol and the analogous portions of either phthiocerol and phthiodiolone. In addition, the long aliphatic chains associated with the remaining portions of either phthiocerol and phthiodiolone could be accommodated in channel 1 (1721 carbon units; Figs. 5C and 6). Phthiocerol or phthiodiolone are not commercially available or easily purified from natural sources, so it is currently implausible to obtain the relevant complexes to enable a more detailed study of PapA5 in complex with its physiological ligands, a necessary step to provide the basis for development of structure-based inhibitors of this enzyme. Although natural ligand complexes are currently beyond the scope of this study, we do plan to obtain complexes between PapA5 and some of the model compounds previously reported (16) and represented in Fig. 5. However, the structural and biochemical characterization of PapA5 combined with the structures of carnitine acetyltransferase and VibH suggest putative roles for the catalytic residues observed in PapA5. In addition, structural and functional similarity between these protein families has likely identified the surfaces and channels utilized by PapA5 in its interactions with respective substrates, the exact details of which await further investigation. It has been previously noted that the Pap family shares weak similarity to Rif20 (16), a gene encoded within the rifamycin gene cluster (40). The PapA5 structure and the structure-based sequence alignment between PapA5 and Rif20 and the conserved catalytic elements observed in these proteins support our earlier speculation (16) that Rif20 encodes a protein with similar catalytic properties to that observed for PapA5 and is responsible for catalyzing the as yet unidentified acyltransferase activity required for C25 O-acetylation during rifamycin biosynthesis.
The atomic coordinates and structure factors (code 1Q9J [PDB] ) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
|| Supported by National Institutes of Health Grant 1 F31 AI054326
[GenBank]
-01 and Medical Scientist Training Program Grant GM07739.
** Supported in part by The Niarchos, The William Randolph Hearst, and The Potts Memorial Foundations.
1 The abbreviations used are: Pap, polyketide-associated protein; PDIM, phthiocerol dimycocerosate ester and its congener; NCS, non-crystallographic symmetry; r.m.s.d., root mean square deviation; CAT, chloramphenicol acetyltransferase.
We thank the staff of beamline X4A at the National Synchrotron Light Source.
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||