|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 283, Issue 5, 2871-2882, February 1, 2008
A New Autocatalytic Activation Mechanism for Cysteine Proteases Revealed by Prevotella intermedia Interpain A* 1 1![]() 2 ||3 4
From the
Received for publication, October 11, 2007 , and in revised form, November 7, 2007.
Prevotella intermedia is a major periodontopathogen contributing to human gingivitis and periodontitis. Such pathogens release proteases as virulence factors that cause deterrence of host defenses and tissue destruction. A new cysteine protease from the cysteine-histidine-dyad class, interpain A, was studied in its zymogenic and self-processed mature forms. The latter consists of a bivalved moiety made up by two subdomains. In the structure of a catalytic cysteine-to-alanine zymogen variant, the right subdomain interacts with an unusual prodomain, thus contributing to latency. Unlike the catalytic cysteine residue, already in its competent conformation in the zymogen, the catalytic histidine is swung out from its active conformation and trapped in a cage shaped by a backing helix, a zymogenic hairpin, and a latency flap in the zymogen. Dramatic rearrangement of up to 20Å of these elements triggered by a tryptophan switch occurs during activation and accounts for a new activation mechanism for proteolytic enzymes. These findings can be extrapolated to related potentially pathogenic cysteine proteases such as Streprococcus pyogenes SpeB and Porphyromonas gingivalis periodontain.
Periodontal disease (PD)5 affects the tissues that surround and support the teeth and may lead to loosening and eventual loss of teeth if untreated. It is caused by bacteria and affects mildly 90% and severely 10% of the population worldwide (1, 2). In addition, symptoms of PD appear in a series of systemic diseases due to its inflammatory and infective character (2, 3). Present day treatment and curettage of severe PD includes the mechanical cleansing of the affected area and is efficient in general. However, it is costly, time consuming, and painful and needs frequent repetition. In addition, it may entail the indiscriminate usage of antibiotics, which contributes to the spread of antibiotic-resistant strains (2, 4). Consequently, there is a need for innovative and specific therapeutic approaches against PD.
Prevotella intermedia is a major bacterial periodontal pathogen in humans together with Porphyromonas gingivalis, among others (5, 6). Such bacteria colonize the gingival crevice and produce virulence factors that cause disease. Bacterial infection leads to the bacterial secretion or induction of host overproduction of proteolytic enzymes such as bacterial collagenases, matrix metalloproteases, and serine and cysteine proteases (CPs) (2, 7, 8). These proteases destroy host tissue and compromise host defenses. In addition, proteases may give rise to fibrinolytic activity and inactivate components of the blood-coagulation cascade such as the protease inhibitors, Most studies on the bacterial proteolytic armamentarium in PD have been performed with P. gingivalis (9). In contrast, the factors governing P. intermedia infection, a black-pigmented Gram-negative obligate anaerobic non-motile rod bacterium, are poorly understood (7). In humans, Prevotella sp. have frequently been recovered from subgingival plaque in patients suffering from acute necrotizing gingivitis, pregnancy gingivitis, and adult periodontitis (10). In addition, Prevotella species easily acquire resistance toward antibiotics, which hampers their elimination (11). A deep molecular knowledge of how infection and resistance occur is crucial for the development of alternative treatments. In P. intermedia, several proteases have been described, among them trypsin-like serine proteases, a dipeptidyl peptidase IV and CPs (12-14), but no structural studies are available that could help in understanding their particular mode of action or facilitate the design of specific drugs. The structures of some clan-A papain-like CPs (according to the MEROPS data base (15)) from other infective bacteria are known, namely those of staphopain A and B from Staphylococcus aureus (16, 17), the avirulence putative peptidase AvrPphB from Pseudomonas syringae (18), and streptopain (alias streptococcal pyrogenic exotoxin B and SpeB) and IdeS endopeptidase, both from Streptococcus pyogenes (19, 20). Together with other bacterial enzymes such as bleomycin hydrolase from Lactococcus lactis and a calpain-like enzyme from P. gingivalis, they may be among the ancestral enzymes that gave rise to the 20 families currently identified within this clan of proteases (15, 21). They display a relatively broad substrate specificity but are restricted to a small group of related bacterial species or are even limited to a single species, thus constituting attractive targets for the selective design of antibiotics (22). All these proteases have been identified as or proposed to be secreted virulence factors that elicit nutrient generation, evasion of the adaptive immune system response through inactivation of immunoglobulins, or release of bacterial proteins from the cell surface (23). For more than 60 years, SpeB, a protein secreted by Streptococcus pyogenes (24), was considered a unique CP, unrelated to plant papains or vertebrate cathepsins, and the founding member of family C10 within clan CA (15). A recent analysis of bacterial genomes identified genes encoding potential SpeB orthologues in several species, predominantly Bacteroidetes (31). Interestingly, two forms of genes are common, either short orthologues encoding an SpeB-like protein with an N-terminal pro-domain and a catalytic CP domain or large orthologues with an additional large C-terminal extension, which shares no similarity with any other proteins sequenced. The latter orthologues are present in bacteria that are involved in pathogenicity of periodontal disease in humans. With this in mind, a genome search within P. intermedia 17 was undertaken, and three open reading frames potentially encoding CPs were identified (22). We studied the first of these potential proteases, interpain A (InpA), encoded by locus PIN0048. This gene encodes a long SpeB-orthologue of 868 residues, including a 44-residue signal peptide, a pro-domain (Ala1-Asn111, see Fig. 1), a catalytic domain (Val112-Pro359) and a further 465 C-terminal residues arranged in distinct domains, with putative regulatory and secretory functions (25). We cloned, overexpressed, purified, and functionally analyzed protein variants comprising the first two domains, the wild-type (wt) form and a variant, in which the active-site Cys154 had been mutated to alanine (C154A), hereafter termed pro-cd-InpA and pro-cd-InpA C154A, respectively. We further analyzed the three-dimensional structures of a major fragment of pro-cd-InpA C154A and of the wt catalytic domain, cd-InpA. Unexpectedly, these studies have uncovered a hitherto undescribed activation mechanism for cysteine proteases and helped us to understand a family of virulence factors produced by human pathogens.
Expression, Mutant Construction, and Purification of Pro-interpain A—Genomic DNA of P. intermedia was extracted from strain ATCC 25611. The structural gene region of InpA comprising the pro-domain and the catalytic domain, pro-cd-InpA, was amplified by PCR using forward primer 5'-ATGCCATGGCAAAGCCACGCACAAAGGAACAG-3' with an NcoI recognition site and reverse primer 5'-ATGCTCGAGTGGTTTTCCGTAAACACCC-3' with an XhoI recognition site. Because the NcoI site encompasses the ATG start codon, two bases (CA) were introduced into the forward primers immediately after the NcoI site for in-frame translation of the target protein. This genetic manipulation inserted a methionine before the N-terminal alanine residue of InpA. In addition, the reverse primer introduced two additional codons (CTC GAG) for a leucine and a glutamate following the C-terminal proline residue of pro-cd-InpA. The PCR product was purified and cloned into the NcoI/XhoI site of pET24d(+) expression vector (Novagen), which provides the coding sequence for a C-terminal hexahistidine tag (His6). The recombinant plasmid was transformed into Escherichia coli strain BL21(DE3) pLysS under the control of the T7 promoter. The wt construct was used to produce mutation C154A using overlap extension PCR (26). The correctness of the constructs was verified by double-stranded DNA sequencing.
Protein production and purification were essentially the same for the wt and the mutant protein. Cells freshly transfected with the expression plasmid were grown at 37 °C to an optical density (A600) of 0.7-0.8 in 1 liter of Luria-Bertani medium supplemented with 2% glucose and kanamycin sulfate (50 µg/ml). The culture was induced with isopropyl-1-thio-β-D-galactopyranoside to a final concentration of 0.1 mM and further incubated at 26 °C for 2-3 h for protein production. Cells were harvested, washed with phosphate-buffered saline buffer, and resuspended in binding buffer A (20 mM sodium phosphate, 500 mM NaCl, 20 mM imidazole, pH 7.4) supplemented with 1.5 mM 4',4'-dithiodipyridine (a reversible CP inhibitor), 6 mM 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate, 10 µM phenylmethylsulfonyl fluoride, 1 mM HgCl2, and 1 mM 1,4-dithio-DL-threitol (DTT). The latter compounds were added to prevent protein aggregation and autolysis. Cells were lysed by ultrasonication on ice for Activity Assay—Activity was determined with the fluorigenic substrate di-tertbutyl dicarbonate-Val-Leu-Lys-aminomethylcoumarin. Briefly, recombinant pro-cd-InpA protein was activated at 37 °C in 0.1 M Tris·HCl, 5 mM EDTA, pH 7.5, freshly supplemented with 2 mM DTT. The fluorigenic reaction was started by adding substrate (10 mM; final concentration in the reaction mixture, 250 µM) and the release of aminomethylcoumarin was recorded by measuring the increase in fluorescence using a micro-titer plate reader. Autocatalytic Assay—A total of 100 µg of pro-cd-InpA protein, alone or with 0.7 µg of active cd-InpA protein, was preincubated at 37 °C in buffer C (0.1 M Tris·HCl, 1 mM HgCl2, 2 mM DTT, pH 7.6). The autocatalytic reaction was initiated by diluting the sample with buffer D (buffer C but with 5 mM EDTA instead of 1 mM HgCl2) at 37 °C (final pro-cd-InpA and cd-InpA concentrations were 10 and 0.1 µM, respectively). Aliquots were taken at distinct time intervals and mixed with E-64 inhibitor (N-[N-{L-trans-carboxyoxiran-2-carbonyl}-L-leucyl]agmatine) to stop the reaction. At the same time intervals, samples of the incubation mixture were assessed for activity against the above fluorigenic substrate, and the initial rate of substrate turnover was determined. As a negative control, the same experiments were carried out using buffer C. To ascertain whether pro-cd-InpA autoactivation was an intra- or an intermolecular process, the zymogen was incubated as described above at 10, 2, and 0.4 µM, respectively, with samples withdrawn from the above activation reaction mixture at the mentioned time intervals. Processing of Pro-cd-InpA C154A by wt cd-InpA—Pro-cd-InpA C154A was tested as a substrate for wt cd-InpA in a reaction mixture containing 0.1 µM of the latter and 10 µM of the former protein in buffer D at 37 °C. Aliquots of 10 µl were withdrawn from the reaction mixture at distinct time intervals, and the reaction was quenched by addition of E-64. Results were analyzed by 12% SDS-PAGE.
Generation of N-terminally Truncated Pro-cd-InpA C154A—Pro-cd-InpA C154A (25 mg/ml) in 20 mM Tris-HCl, pH 7.6, was incubated with 1.7 µg of DTT-activated wt cd-InpA overnight at 21 °C. The reaction was terminated by addition of E-64 to 100 µM final concentration, and the protein was purified by ionic-exchange chromatography employing a NaCl gradient. Fractions containing the N-terminally truncated 36-kDa form of pro-cd-InpA C154A were pooled, concentrated, and dialyzed against a buffer suitable for protein crystallization. N-terminal sequencing, mass spectrometry, and Western blot analyses revealed that this protein variant ( N-terminal Sequence Analysis—Wild-type pro-cd-InpA, the C154A mutant protein, their truncated variants, as well as their cleavage products were analyzed by 12% SDS-PAGE and transferred to polyvinylidene difluoride membranes. Membranes were stained with 0.2% Amido Black and subjected to Edman degradation using a Procise 494-HT protein sequencer.
Crystallization and Data Collection and Processing—
Structure Solution and Analysis—The structure of N1pro-cd-InpA C154A was solved with program PHASER (29) using all diffraction data, and the coordinates of S. pyogenes pro-SpeB protein (Protein data bank (PDB) access codes 1pvj and 1dki (19)) were used as a searching model. A refined final solution was found comprising 170.8, 60.0, and 294.0 for , β, and (in Eulerian angles) and 0.145, -1.007, and 0.734 for x, y, and z (in fractional cell coordinates), with a log-likelihood gain value of 44.2. The appropriately rotated and translated search-model coordinates were subjected to crystallographic refinement albeit with no positive result. Accordingly, the model was given 200 cycles of refinement with the program SHELXL (30) starting from a resolution of 3 Å and increasing it by 0.01 Å with every cycle until full data resolution (1.5 Å). No data were set aside as a free Rfactor set. The resulting model phases were subjected to a density modification step with SHELXE (31). Amplitudes and phases for non-measured reflections were thereafter extrapolated from the current map to fill in the missing reflections within the experimental resolution limits and beyond, to a resolution of 1.0 Å. 60 cycles of density modification and 5 iterations of 20 cycles each of main-chain tracing with SHELXE followed. Combination of the resulting partial main-chain model with the original phases was succeeded by a new density modification run that eventually led to a partial backbone model for 192 of the residues and a set of phases with a figure-of-merit of 0.76. An electron density map was computed and subjected to a final density modification step with the program DM within CCP4. This step improved the figure-of-merit to 0.87 and enabled straightforward model completion and refinement. Manual model completion with TURBO-Frodo alternated with crystallographic refinement using REFMAC5 within the CCP4 suite. The final N1pro-cd-InpA C154A model comprised all residues from Ala39 to Pro359 (Fig. 1) except Ser295-Gln301.
The structure of wt cd-InpA was solved with program AMoRe (32) using the coordinates corresponding to residues Ala121-Pro359 from N1pro-cd-InpA C154A and structure-factor amplitudes in the range 15-3.5 Å. These calculations unambiguously confirmed P41212 as the correct space group and a unique solution was found at 48.5, 88.4, 223.7, 0.1172, 0.5977, and 0.1152 ( , β, , x, y, and z; refined values after rigid-body refinement; see Ref. 32) and 41.7, 88.6, 115.6, 0.1940, 0.0880, and 0.7122 for each of the two molecules A and B in the asymmetric unit, respectively, with a combined score CCF/crystallographic Rfactor, according to Ref. 32, of 54.4%/41.4%. The appropriately rotated and translated coordinates were subjected to rigid-body and positional refinement applying strong non-crystallographic-symmetry restraints with CNS v. 1.2 (33). Despite the weakness and low resolution of the wt cd-InpA diffraction data, the resulting electron density maps clearly disclosed the entire polypeptide chain of the mature protease moiety. It contained an unambiguous trace for the first eight residues (Val112-Tyr120) of the polypeptide chain that had not been included in the search model, a proof of concept for data quality which ruled out model bias. Careful model building alternated with crystallographic refinement under application of strong non-crystallographic-symmetry-restraints with programs CNS and REFMAC5 (at the final stages). The final wt cd-InpA model comprised residues Val112-Gly357 for molecule A and Val112-Pro359 plus two residues from the C-terminal tag (termed Leu360 and Glu361) for molecule B. These two molecules were almost equivalent in practice. Accordingly, under "Results and Discussion" we consider molecule A unless otherwise stated. Table 1 provides statistics on the final refinement steps and parameters of the quality of the resulting models.
Miscellaneous—The figures were prepared with programs TURBO-Frodo, SETOR (34), and MOLMOL (35). Structures were superimposed with TURBO-Frodo. Bioinformatic amino acid sequence similarity searches were undertaken within MEROPS data base and with the PSI-BLAST server (www.ncbi.nlm.nih.gov/blast). Structural similarity searches were performed with program DALI and secondary structure predictions with program JPRED. Close contacts and interaction surfaces (with a probe radius of 1.4 Å) were calculated with CNS taking the half of the total surface buried at the interface. The final coordinates of
Protein Purification and Characterization—Pro-cd-InpA and pro-cd-InpA C154A were overexpressed as 40-kDa proteins and purified to homogeneity. The wt zymogen was readily converted into the fully processed mature 27-kDa catalytic domain during purification so that the zymogenic form could only be obtained if reversible CP inhibitors were included during homogenization of bacterial cells and purification (Fig. 2, A-C). Subsequent inhibition release resulted in time-dependent autocatalytic processing of the zymogen with the concurrent release of activity (Fig. 2, C and E). Processing and activity release were accelerated by catalytic amounts of active cd-InpA (Fig. 2, D and E). This, together with the finding that the initial rate of activity generation was dependent on the zymogen concentration (Fig. 2F), suggested that the autocatalytic maturation of pro-cd-InpA occurred in trans (intermolecularly). Pro-cd-InpA C154A was produced to elucidate the sequence of cleavage events during activation and for structural purposes. As in the case of other CPs, pro-cd-InpA C154A was enzymatically inert and did not undergo autoprocessing. Analysis of concentration-andtime-dependent proteolysis of pro-cd-InpA C154A by the active protease revealed that the process occurred stepwise through a main 36-kDa intermediate ( N1pro-cd-InpA C154A) generated by hydrolysis of peptide bond Thr38-Ala39. Accordingly, N1pro-cd-InpA C154A lacks the first 38 residues of the full-length zymogen (see Fig. 1). In addition, minor cleavages were mapped to Lys94-Ala95 and Ala95-Ile96 (Figs. 1 and 2G). Finally, limited proteolysis of the accumulating 36-kDa intermediate at the Asn111-Val112 peptide bond released the 27-kDa mature protein, which was resistant to further degradation. The same activation pathway may operate in vivo, because a similar band pattern representing variably processed InpA species was detected in the P. intermedia culture medium (Fig. 2H). The maturating self processing of InpA resembles pro-SpeB with respect to formation of one major intermediate and several cleavages within the remaining part of the N-terminal pro-domain (36). Such a mechanism provides regulation of proteolytic activity independent of other secreted and host proteases, thus ensuring that the activity is developed when required.
Structure Solution Employing a Novel Approach—Contrary to the intact mutant protein, the N1pro-cd-InpA C154A variant crystallized. Its structure was solved by Patterson-search methods using maximum-likelihood criteria. This approach improves the definition of the target for the search by removing the contribution of unknown variables. This means that the errors attributable to lack of completeness of a search model are better estimated. In practice, this entails a larger radius of convergence (i.e. it yields a solution for structurally more distant searching models) than conventional search methods, which failed in the present case. Unfortunately, the current crystallographic refinement programs have a shorter radius of convergence. This restricted model refinement and led us to develop a novel approach based on a further development of the SHELX suite of programs (37). It consists of the application of the "free-lunch algorithm," whose theoretical basis had been developed by Giacovazzo and co-workers (38), combined with autotracing, model refinement, and density modification. This process essentially envisaged that the initially (poorly) refined model, displaying a weighted mean-phase error of 64° with respect to the final refined model (as determined a posteriori), was used to calculate an electron density map that was subjected to density modification. With this map, missing structure-factor amplitudes and phases were estimated within the resolution range of the experimental data. Further values were extrapolated to a nominal resolution of 1.0 Å. Subsequently, density modification (weighted mean-phase error = 33°), main-chain auto-tracing and phase-combination (weighted mean-phase error = 27°) eventually produced an accurate partial model for 60% of the residues. In addition, the resulting electron density map was excellent, even in those parts where the original search model showed a different chain trace (Fig. 3). This permitted straightforward manual tracing of the entire molecule and successful refinement, enabling us to ascertain three differences in comparison to the sequence of the PIN0048 open reading frame in the Institute for Genomic Research data base that were subsequently confirmed by sequencing at the DNA level (see Fig. 1).
Structure of InpA Zymogen—The protein has an elongated shape with an N-terminal pro-domain (Ala39-Asn111) and a C-terminal papain-like CP domain (Val112-Pro359), which bifurcates into a right subdomain (RSD) and a left subdomain (LSD) (see Fig. 4A). RSD and LSD interact through a surface of 1,332 Å2 establishing 69 contacts (<4 Å), among them 11 hydrogen bonds (<3.4 Å) and 22 hydrophobic interactions (Table 2). The pro-domain contacts laterally the top of the CP moiety through a surface of 1177 Å2, with 54 contacts (<4 Å), among them 12 hydrogen bonds and 19 hydrophobic interactions (Fig. 4A and Table 2). The pro-domain is stabilized by a central hydrophobic core and evinces an open-faced sandwich with a twisted antiparallel four-stranded β-sheet (sheet I; strands β1-β4) of simple up-and-down connectivity mediated by short loops. After β4, a segment in extended conformation (loop joining strands β4 and 1, Lβ4 1) leading to helix 1. The N-terminal part of the helix approaches the active-site cleft, thus contributing to latency, and is hereafter termed "backing helix." The polypeptide reaches the molecular surface after 1 and undergoes a sharp turn, folding back along the surface and entering a connecting segment that links the pro-domain with the CP domain. This segment adopts an extended conformation from Asn107 to Pro117, i.e. optimal for binding to and cleavage by an active-site cleft of a protease (39). This stretch includes the activation cleavage point, Asn111-Val112 (Fig. 4A), which is superficial and accessible for processing.
At Val112, the polypeptide chain enters the RSD of the mature enzyme moiety, which is a split subdomain (Val112-Leu127 plus Thr260-Pro359) with an open-faced sandwich topology created by a six-stranded twisted antiparallel β-sheet (sheet II; strands β11-β16). The sheet extends from the bottom of the molecule (outermost strand β11) to the interface with the pro-domain at β15 (Fig. 4A). The twist gives rise to a concave and a convex face, and the latter mediates the main interaction with the LSD. The main contact between the pro-domain and the CP part is formed by the outermost strand of sheet I, β4, and the lateral strand of sheet II, β15. This gives rise to a continuous ten-stranded β-sheet that completely traverses the zymogen from its upper right to the bottom center (Fig. 4A). After the inset of the LSD (see below), the polypeptide chain rejoins the RSD at strand β11 of sheet II, which runs outward approximately perpendicular to the view in Fig. 4A. After this strand, a short loop leads to helix 5, which nestles in the concave side of sheet II, followed by the next four strands of sheet II (β12-β15), inserted with simple up-and-down connectivity. These strands are connected by loops, which contribute to the substrate-binding cleft and the active site. The polypeptide chain is very well defined for the whole protein moiety except for the tip of a β-hairpin structure created by strands β12 and β13 and the enclosed loop, the "zymogenic hairpin" in the following. The hairpin is rigid at its trunk, because it is stabilized by six β-sheet interactions between β12 and β13, but flexible at its tip (between Ser295 and Gln301 (Fig. 4)). After β15, the polypeptide runs below the backing helix 1 and gives rise to what will now be referred to as the "latency-flap," Lβ15β16, which spans the 16 residues from Ile334 to Gln349. This structure displays a unique conformation and is stabilized by a series of internal contacts. It consists of two sequential dextrohelical elements, Ile334-Asn338 and Ser344-Gln349, connected by two residues in extended conformation (Pro339-Gly340) and a tight 1,4-turn of type I (Asn341 O-Ser344 N, 3.13 Å), which protrudes from the molecular surface. The bottom of the first dextrohelical segment is anchored to Lβ11 5 through a bidentate interaction of its main chain with the completely buried side chain of Arg267 and includes another tight 1,4-turn of type I (Ile334 O-Leu337 N; 2.94 Å). In addition to this arginine anchor, the structure of the latency flap is galvanized by a total of nine internal hydrogen bonds that confer an extraordinary rigidity to this structural element. After this flap, the protein chain enters the second strand of sheet II, β16, and leads to the surface C terminus of the molecule at Pro359, whose position permits additional downstream domains in the full-length InpA protein (Fig. 4A).
The LSD (Leu128-Phe259) is inserted into the RSD and is characterized by a central three-helical bundle made up by helices Substrate-binding Crevice and Active Site—The active-site cleft of InpA is in a crevice formed by loops connecting strands of sheet II at its carboxyl end. The walls of the crevice are provided by RSD and LSD (see Fig. 4). Classic CPs like papain, cathepsin B, and staphopain have a short, four-residue segment connecting the two residues that are topologically equivalent to Gln134 and Gly153 of InpA, respectively, as contributors to the left-side rim of the cleft on its primed side. In contrast, InpA displays between the latter two residues an 18-residue insertion that forms a unique upper-left region of the molecule. This entails that the zone ascribable to substrate binding would be reduced in InpA to Gly133-Gln135 and Thr152-Gly153, immediately preceding the catalytic cysteine, Cys154. The former stretch includes Gln134, whose position is absolutely conserved among CPs and which, by analogy, would be involved in the formation of an oxyanion hole together with the amide nitrogen of Cys154, which would bind the scissile carbonyl (21, 40). On the non-primed side of the cleft, Ser242-Met246 and Tyr264 would also contribute to the left rim. Again in contrast to classic CPs, InpA possesses a much longer connection between helices, which shapes part of the front surface and gives rise to a unique β-hairpin, novel for CPs (β9β10). This entails that the residues from Pro238 to Gly241 should further assist Ser242-Met246 in shaping the cleft rim. In even greater contrast to classic CPs, the segments shaping the right-hand rim of the cleft on its primed side may be restricted to the side chains of the strongly conserved Trp324 from Lβ14β15, which becomes rearranged upon activation, and the previously mentioned Gln134 (21). Regarding the right rim on the non-primed side of the cleft, binding may be provided by the main chain of the rearranged zymogenic hairpin, in particular His305-Ala306 and Tyr291-Gly293, as well as Asp350.
Structures Related to InpA—As might have been expected, a search for structural relatives of N1pro-cd-InpA identified pro-SpeB as the closest homologue, with an rms deviation of 2.0 Å over 275 topological equivalent residues (PDB 1dki
[PDB]
and 1pvj (19)). This protein is secreted as a zymogen, and no structural information on the mature protein is currently available. InpA and SpeB are the only members of the catalytic-dyad enzymes, i.e. those lacking a catalytic asparagine, structurally studied to date (19). Because P. intermedia has been shown to degrade connective-tissue constituents and to interfere with the tightly regulated defense mechanism of the host (9), like SpeB in S. pyogenes (22), it is tempting to speculate that InpA is a virulence factor equivalent to SpeB in P. intermedia. In addition, P. gingivalis was shown to harbor a further CP, periodontain (41), which is closely related to SpeB and InpA. Accordingly, we conclude that P. gingivalis, P. intermedia, and S. pyogenes may have inherited these homologous proteins from a common ancestor and that they may undergo a similar activation mechanism (42).
Overall, the core of the protease and the pro-domain of InpA conform to the pro-SpeB fold (Fig. 5A). However, the difficulties encountered during
There are also important differences in the surface structures of pro-cd-InpA and pro-SpeB, which affect activation and substrate binding in the mature enzymes. At the end of the first segment of the RSD, at Leu128-Thr129, a three-residue insertion creates a bulge in pro-SpeB leading to structural differences in the loop structure preceding
In contrast to these differences, there are also similarities in detail. As in
A Novel Mechanism For Latency Maintenance and Activation—The mature enzyme structure confirms that the pro-domain, including the backing helix, is removed upon activation and that it does not sterically block access to the substrate-binding cleft in
In CPs, function requires a correct spatial arrangement of the catalytic cysteine provided by the active-site helix within LSD and the catalytic histidine of the RSD to render a functional thiolate-imidazolium ion pair (46) (Fig. 4, C-E). Unlike InpA, most other CPs also have an asparagine with a supportive role (46). The position and conformation of the active-site helix and the cysteine, Cys154, are maintained in both InpA structures. In contrast, the catalytic histidine, His305, undergoes major rearrangement. In the mature enzyme it is oriented to favor the interaction with Cys154 S
As mentioned, the zymogenic hairpin is only defined until residue Gly294 and from Asp302 onwards in the
Another important element shaping the histidine cage in the zymogen is the latency flap, which anchors the catalytic histidine in the non-competent position through a hydrogen bond (Glu348 O
In summary, we have described a new cysteine protease from a highly-active pathogenic bacterium, InpA, which undergoes autolytic activation in vitro and, possibly, in vivo. The structural features reported reveal a new mechanism of activation/latency maintenance within CPs, distinct from cathepsins and plant CPs, which may also be valid for related proteins such as S. pyogenes SpeB and P. gingivalis periodontain. This mechanism starts when the backing helix is removed after proteolytic cleavage at the Asn111-Val112 scissile peptide bond (step 1 in Fig. 4E). This liberates a space that enables stopper Trp324, actually a tryptophan switch, to reorient (see Fig. 4, C and D) and contribute to a hydrophobic pillow created by the apolar side chains of segment β14-Lβ14β15-β15 of the CP moiety. This segment participates in a large hydrophobic core with the backing helix in the zymogen and remains unchanged in the active enzyme. The movement of Trp324 correlates with the large displacement and internal rearrangement observed for the latency flap which, by pivoting around Met351 and Tyr332, causes this segment to adopt a β-hairpin-like structure and to occupy the space released by the backing helix (step 2 in Fig. 4E). Consequently, the zymogenic hairpin becomes rigid at its top and folds back through a
The atomic coordinates and structure factors (code 3bb7 and 3bba) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
* This work was supported in part by the former Spanish Ministry for Science and Technology (Grants BIO2004-20369-E and BIO2003-06653); the Spanish Ministry for Education and Science (Grants BIO2006-02668, BIO2006-14139, and BFU2006-09593, and CONSOLIDER-INGENIO 2010 Project "La Factoría de Cristalización" CSD2006-00015); European Union (EU) FP6 Integrated Project LSHC-CT-2003-503297 "CANCERDEGRADOME"; EU FP6 Strep Project 18830 "CAMP"; the "AVON-Project" (Grant 2005X0648) from the Spanish Association Against Cancer; the Danish National Science Research Council (to J. J. E.); and by Ministry for Science and Higher Education (Warsaw, Poland) and National Institutes of Health Grant DE 09761 (to J. P.). Funding for synchrotron diffraction data collection was provided by the European Synchrotron Radiation Facility and the EU. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 Both authors contributed equally to this work.
2 Beneficiary of the "Ramón y Cajal" Program of the Spanish Ministry for Science and Education. 3 To whom correspondence may be addressed. Tel.: 44-151-664-6343; Fax: 44-151-706-5809; E-mail: potempa{at}mol.uj.edu.pl. 4 To whom correspondence may be addressed. Tel.: 34-934-006-144; Fax: 34-932-045-904; E-mail: xgrcri{at}ibmb.csic.es.
5 The abbreviations used are: PD, periodontal disease; cd-InpA, catalytic domain of interpain A; CP, cysteine protease; DTT, 1,4-dithio-DL-threitol; E-64, N-[N-{L-trans-carboxyoxiran-2-carbonyl}-L-leucyl]-agmatine; RSD, right subdomain; LSD, left subdomain; pro-cd-InpA, prodomain+catalytic domain of interpain A; wt, wild-type.
We thank Robin Rycroft and Mary Kopecki for helpful contributions to the manuscript and George M. Sheldrick for providing an -version of program SHELXE.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||