If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Knowledge of the molecular mechanisms of specific bacterial virulence factors can significantly contribute to antibacterial drug discovery. Helicobacter pylori is a Gram-negative microaerophilic bacterium that infects almost half of the world’s population, leading to gastric disorders and even gastric cancer. H. pylori expresses a series of virulence factors in the host, among which high-temperature requirement A (HpHtrA) is a newly identified serine protease secreted by H. pylori. HpHtrA cleaves the extracellular domain of the epithelial cell surface adhesion protein E-cadherin and disrupts gastric epithelial cell junctions, allowing H. pylori to access the intercellular space. Here we report the first crystal structure of HpHtrA at 3.0 Å resolution. The structure revealed a new type of HtrA protease trimer stabilized by unique N-terminal domain swapping distinct from other known HtrA homologs. We further observed that truncation of the N terminus completely abrogates HpHtrA trimer formation as well as protease activity. In the presence of unfolded substrate, HpHtrA assembled into cage-like 12-mers or 24-mers. Combining crystallographic, biochemical, and mutagenic data, we propose a mechanistic model of how HpHtrA recognizes and cleaves the well-folded E-cadherin substrate. Our study provides a fundamental basis for the development of anti-H. pylori agents by using a previously uncharacterized HtrA protease as a target.
The current paradigm for treatment of bacterial infection is to eradicate bacterial pathogens with antibiotics. However, rapid evolution and dissemination of antibiotic resistance among pathogens pose a huge threat to human health worldwide (
). The strategy seeks to interfere with the bacterial virulence factors that promote infection without threatening their existence. This leads to reduced selective pressure for drug-resistant mutations. Notably, development of antivirulence drugs requires an in-depth understanding of the structures and functions of virulence factors in disease processes.
Helicobacter pylori, a Gram-negative microaerophilic bacterium that colonizes the human stomach and is the leading cause of human gastric disease, such as peptic ulcers, gastritis, and even gastric cancer (
). High-temperature requirement A protein of H. pylori (HpHtrA) is a newly identified virulence factor that helps H. pylori to efficiently break through the gastric epithelium by cleaving proteins within the epithelial tight junction (occludin and claudin-8) and adherens junction (E-cadherin) (
). Structural and functional characterization of this virulence factor may facilitate the design of new types of anti-H. pylori drugs.
HtrA homologs, which are widely found in prokaryotic and eukaryotic organisms, represent a class of highly evolutionarily conserved heat shock–induced serine proteases and chaperones (
) domain. Based on the different domain organization, HtrA family proteins can be classified into three groups. The members in group 1 contain only one protease and one PDZ domain, such as Escherichia coli DegS (EcDegS) and HtrA2 in mammals (
Characterization of the structure and function of Escherichia coli DegQ as a representative of the DegQ-like proteases of bacterial HtrA family proteins.
). EcDegP is a well-characterized group 2 protease. Crystal structures revealed that the inactive EcDegP is a hexamer. The basic EcDegP trimeric unit is composed of three monomers that closely interact with each other using the protease domains (
). However, recent studies demonstrated that H. pylori actively secretes HtrA into the extracellular environment, where it cleaves the extracellular domain of the epithelial cell surface adhesion protein E-cadherin, facilitating H. pylori transmigration across gastric epithelial cells (
Rapid paracellular transmigration of Campylobacter jejuni across polarized epithelial cells without affecting TER: role of proteolytic-active HtrA cleaving E-cadherin but not fibronectin.
). HtrA in H. pylori contains a signal peptide that is important for Sec-dependent cleavage and transport of the protease across the inner membrane into the periplasm (
). Although HtrA has been reported to be enriched in H. pylori outer membrane vesicles, the detailed mechanism of how it is transported across the outer membrane remains unclear (
). Edman degradation and mass spectrometry–based proteomics demonstrated that the [VITA]-[VITA]XXD[DN] signature motifs located at the linker regions of the EC domains are preferentially cleaved by HpHtrA (
). Importantly, inhibition of HpHtrA activity by chemical compounds efficiently blocked H. pylori transmigration across the gastric epithelial barrier (
Here we report the crystal structure of HtrA from H. pylori. The structure reveals a trimeric HpHtrA containing a unique extended N terminus that is absent in other HtrA homologs. Importantly, the N terminus bridges HtrA monomers through domain swapping to stabilize the homotrimer, which has not been identified in other HtrA homologs reported so far. We further demonstrate that the N terminus is also essential for HpHtrA oligomer assembly and protease activity. Moreover, a key residue, Lys328, in the HpHtrA PDZ1 domain was identified to be essential for E-cadherin cleavage but is dispensable for unstructured β-casein substrate. All of these data reveal a novel type of HtrA family protease secreted by the human pathogen H. pylori.
Results
The N terminus of HpHtrA is critical for protease activity
Sequence alignments of HtrA homologs from different bacterial species show that HpHtrA has almost the same domain composition as other members, including a signal peptide, a protease domain, and two PDZ domains (Fig. 1A). However, HpHtrA has a relatively extended N-terminal region that contains 19 residues after the signal peptide (18GNIQIQSMPKVKERVSVDP35) (Fig. 1B). To investigate the function of each domain, a series of domain truncation mutants of HpHtrA was constructed and purified (Fig. S1), including a mutant with the N-terminal 19 residues truncated (HpHtrA-ΔN), a mutant with the PDZ2 domain truncated (HpHtrA-ΔPDZ2), and a mutant with both PDZ domains truncated (HpHtrA-ΔPDZ1–2). The protease activity of the mutants was measured using a nonspecific substrate, β-casein, as described previously (
). As shown in Fig. 1C, proteolytic activity of the HpHtrA-ΔPDZ1–2 mutant was completely abolished. In contrast, the HpHtrA-ΔPDZ2 mutant exhibited even higher protease activity compared with the WT HpHtrA (residues from Gly18 to Lys475 without the signal peptide). The results are consistent with previous reports showing that the PDZ1 domain is essential for protease activity and responsible for recognizing and sequestering unfolded substrates through C-terminal residues, whereas the PDZ2 domain is mostly involved in maintaining the hexameric cage of DegP and displays an inhibitory role in chaperone and protease activity (
). Intriguingly, removal of the short N terminus also abrogated HpHtrA protease activity, indicative of an essential role of the N terminus in HpHtrA protease activity. The proteolytic activity of the mutants was subsequently measured using E-cadherin as a substrate. In contrast, all mutants showed decreased proteolytic activity against E-cadherin compared with WT HpHtrA, implying that all of these domains might participate in the E-cadherin cleavage process (Fig. 1D).
Figure 1The N terminus of HpHtrA is critical for protease activity.A, schematic of the HpHtrA domain architecture. SP, signal peptide; NT, N terminus. B, sequence alignments of HtrA family proteins from different bacterial species. Identical or similar amino acids are highlighted. The secondary structure of HpHtrA is shown. The green box indicates the extra N terminus of HpHtrA. The abbreviated species names and their GenBank accession numbers and PDB codes are as follows: Cj Protease DO: C. jejuni serine protease Do (CAL35343); PpDegP-like: Pseudomonas putida DegP-like protein (B1J4D7); MsDegP-like: Marinomonas sp. DegP-like protein (A6VUA4); HcDegP-like: Hahella chejuensis DegP-like protein (Q2SL36); HeDegP-like: Halomonas elongata DegP-like protein (E1V4H2); SeDegP: Salmonella enterica DegP (P26982); EcDegP-1KY9: E. coli DegP (P0C0V0), PDB code 1KY9; EcDegQ-3STJ: E. coli DegQ (P39099), PDB code 3STJ; LpDegQ-4YNN: Legionella pneumophila DegQ (Q5ZVV9), PDB code 4YNN; LfDegQ-3PV5: Legionella fallonii DegQ (CEG56683), PDB code 3PV5. C, time-course β-casein cleavage assay of WT HpHtrA and mutants. The total amount of β-casein substrate was normalized as 1.0, and the hydrolyzed substrates are plotted against reaction time. D, E-cadherin cleavage assay of WT-HpHtrA and mutants. The total amount of E-cadherin in each reaction was normalized as 1.0, and hydrolyzed substrate after 12 h is plotted. All cleavage experiments were done in triplicate, and the results are shown as the mean with standard deviation. *, p < 0.05; ** p < 0.01.
Characterization of the structure and function of Escherichia coli DegQ as a representative of the DegQ-like proteases of bacterial HtrA family proteins.
). To characterize the conformational state of HpHtrA in the absence of substrate, the oligomerization states of WT-HpHtrA and different mutants were examined by size-exclusion chromatography. As shown in Fig. 2A, WT HpHtrA was eluted at 11.5 ml with a calculated molecular mass of 170 kDa, consistent with a trimeric form of HpHtrA (52 kDa for a monomer). HpHtrA-ΔPDZ2 and HpHtrA-ΔPDZ1–2 were both eluted as trimers with calculated molecular masses of 120 and 90 kDa, respectively. Unexpectedly, HpHtrA-ΔN was eluted at 14.9 ml with a molecular mass of 50 kDa, indicative of a monomeric state of the HpHtrA-ΔN mutant. It has been reported that an autoproteolytic process that results in cleavage of the N-terminal region of EcDegP led to destabilization of a hexamer and formation of a monomer of EcDegP (
The N-terminal region of HtrA heat shock protease from Escherichia coli is essential for stabilization of HtrA primary structure and maintaining of its oligomeric structure.
). However, the autocleavage process removed the N-terminal 60–80 residues of EcDegP, in contrast to truncation of the N-terminal 19 residues in HpHtrA. The results indicate that the N terminus is essential to stabilize the HpHtrA trimer. On the other hand, the trimeric form of HtrA is likely the minimal functional unit with prominent protease activity because activation of HtrA required the substrate sensor loop from one monomer to interact with the activation loop of a neighboring monomer (
). Therefore, truncation of the N terminus of HpHtrA disrupted its trimeric form and significantly abolished its protease activity.
Figure 2Crystal structure of HpHtrA.A, size-exclusion chromatography analysis of WT HpHtrA and mutants. The schematics and molecular masses for each sample are shown. mAU, milli-absorbance units. B, overall structure of the full-length WT HpHtrA monomer. The N terminus (NT), protease domain, and PDZ1 and PDZ2 domains are shown as cartoons, and surfaces are colored in green, orange, marine, and cyan, respectively. C, the overall structure of the HpHtrA-ΔPDZ2 trimer. Each chain is shown as a cartoon and surface, with the NT in different colors. D, HtrA protease domain. The protease domains of HpHtrA-ΔPDZ2 (marine), inactive EcDegP (deep salmon, PDB code 1KY9), and active EcDegP (yellow, PDB code 3CS0) are superimposed and shown as cartoons. The conserved residues of the catalytic triad of HtrA are shown as sticks and numbered according to the HpHtrA sequence.
To further understand how the N terminus stabilizes trimeric HpHtrA, we first determined the crystal structure of full-length WT HpHtrA at a resolution of 3.7 Å (PDB code 5Y2D). The full-length WT HpHtrA crystals grew in trigonal space group R32 and contained one monomeric HpHtrA molecule per asymmetric unit. In the full-length HpHtrA structure, only the protease and PDZ1 domain are well defined with clear electronic density, whereas the PDZ2 domain is partially visible (Fig. 2B). Intriguingly, weak but evident electron density was observed for the extended N terminus of HpHtrA, which is usually disordered and invisible in other HtrA homolog structures (
To obtain a structure with higher resolution, we crystallized HpHtrA-ΔPDZ2. The HpHtrA-ΔPDZ2 structure was solved by molecular replacement and refined to 3.0 Å resolution (PDB code 5Y28). The crystal form belongs to the orthorhombic P222 space group and contains one HpHtrA-ΔPDZ2 trimeric molecule per asymmetric unit. In this structure, each HpHtrA monomer is composed of a protease domain and a C-terminal PDZ1 domain. However, only two PDZ1 domains are visible with clear electron densities among three HpHtrA monomers. Typically, HpHtrA monomers are packed symmetrically around a 3-fold molecular axis as a funnel-like shape, with the protease domain forming the core and the PDZ1 domain extending outward (Fig. 2C). The HpHtrA protease domain is similar to other proteases of the trypsin family, with two perpendicular β barrel lobes and a C-terminal α helix. The backbone root mean square deviation between the EcDegP and HpHtrA protease domains is 0.83 Å. The catalytic triad is located in a crevice between two β barrel lobes, including residues His116, Asp147, and Ser221 (Fig. 2D). The relative orientation between the PDZ1 and protease domains is almost the same in both full-length HpHtrA and HpHtrA-ΔPDZ2 structures (Fig. S2). It has been reported that the PDZ1 domain tilts away from the protease domain upon EcDegP activation. Structural alignments of HpHtrA with inactive (PDB code 1KY9) and active (PDB code 3CS0) EcDegP demonstrate that the overall conformation of HpHtrA protease and the PDZ1 domain is identical to the inactive form, with a root mean square deviation of all Cû atoms of 1.6 Å (Fig. S1).
Intriguingly, the N-terminal region of HpHtrA (Gly18-Tyr44) has clear electron density in all HpHtrA-ΔPDZ2 monomers, which allows the N terminus model to be built unambiguously (Fig. S3). The N terminus of HpHtrA is stretching out from the protease domain and is mainly composed of two short β strands and two unstructured loop regions: loop1 (Gln21-Lys29), β1 (Glu30-Val32), β2 (Thr40-Ser43), and loop2 (Ser33-Asp39). Unexpectedly, the crystal structure reveals that the extended N terminus is involved in a domain-swapping event wherein the N terminus transverses the interface to the two neighboring monomers; i.e. loop1 of monomer A interacted with the protease domain of monomer C, whereas the β1 and β2 of monomer A formed parallel β-sheets with β2 of monomer B and β1 of monomer C, respectively (Fig. 3A). Because of the relatively low local resolution, the side chains of residues at loop1 could not be unambiguously identified. However, loop1 has extensive contact with the neighboring monomer protease domain, as revealed by detailed protein interface analysis (Fig. S4). Biochemical data demonstrated that truncation of loop1 also abrogated the HpHtrA trimer into a monomer (Fig. S5A). The parallel β-sheets formed by two β-strands from neighboring monomers are connected by interstrand backbone hydrogen bonds. In particular, the side chain of Arg31 from monomer A stretched out and formed a typical salt bridge with Asp173 from monomer C. This intermolecular Arg31-Asp173 salt bridge is highly conserved in all three monomers (Fig. 3B and Fig. S6). It is worth noting that the N-terminal domain swapping of HpHtrA contributes to majority of protein–protein interfaces in HpHtrA. The average interface area between two neighboring monomers in the HpHtrA trimer is 1656 Å2, which is significantly larger than that in EcDegP and EcDegQ. However, truncation of the N-terminal 18 residues (including loop1 and the β1 strand) and 26 residues (including loop1, the β1 strand, loop2, and the β2 strand) dramatically reduced the interface areas to 690 Å2 and 446 Å2, respectively (Fig. 3C), which is consistent with experimental results showing that truncation of the corresponding N-terminal regions causes trimer disassembly.
Figure 3The N terminus stabilizes the HpHtrA trimer.A, schematic of N-terminal domain swapping in HpHtrA. The N termini of HpHtrA trimer are shown as cartoons in different colors as indicated. The protease domains are presented as hexagons. β-Strands 1 and 2 in chain B are labeled. B, details of the interfaces among three different HpHtrA monomers. Three HpHtrA monomers are shown in different colors. Residues involved in the interactions are shown as sticks. Hydrogen bonds are indicated as yellow dashed lines. The salt bridge is shown as a gray dashed line. C, interface area analysis of HtrA family proteins. The interface areas between each pair of monomers in HtrA protein trimer were analyzed using the PDBePISA server. The abbreviated species names are as follows: EcDegP-1KY9, E. coli DegP (PDB code 1KY9); EcDegP-3CS0, Escherichia coli DegP (PDB code 3CS0); EcDegQ-3STJ, E. coli DegQ (PDB code 3STJ); LpDegQ-4YNN, L. pneumophila DegQ (PDB code 4YNN); HpHtrA, H. pylori HtrA; HpHtrA-ΔL1-β1, HpHtrA with N-terminal loop1 and the β1 strand truncated; HpHtrA-ΔL1-β2, HpHtrA with N-terminal loop1, the β1 strand, loop2, and the β2 strand truncated. D, HtrA family protein trimer interface analysis. The surfaces of trimeric HpHtrA, EcDegP, and EcDegQ are shown in gray. The hydrophobic residues involved in trimer interfaces are highlighted in blue. E, urea denaturation curves of the HpHtrA (blue circles) and EcDegQ (purple squares).
The data suggest that N-terminal domain swapping is important to stabilize the HpHtrA homotrimer, which is distinct from other HtrA homologs. In previously reported HtrA homolog structures, including EcDegP and EcDegQ, formation of a homotrimer is exclusively mediated by intersubunit hydrophobic interactions involving the first α-helix and two β-strands of the protease domain rather than domain swapping (Fig. 3D) (
). Moreover, urea denaturation curves revealed that HpHtrA and EcDegQ had a denaturation midpoint of 5.0 m and 4.5 m urea, respectively (Fig. 3E), indicating that HpHtrA is relatively more stable than EcDegQ, which is consistent with the fact that HpHtrA has larger intersubunit interface areas. Therefore, HpHtrA represents a novel type of homotrimer of HtrA family proteases stabilized by unique N-terminal domain swapping.
Characterization of the structure and function of Escherichia coli DegQ as a representative of the DegQ-like proteases of bacterial HtrA family proteins.
). To examine whether HpHtrA employs a similar activation mechanism, we first analyzed the HpHtrA oligomeric state in the presence of β-casein, which is a well-characterized unfolded model substrate for the HtrA family (
). Similar to EcDegP and EcDegQ, size-exclusion chromatography analysis revealed that incubation of HpHtrA with different molar equivalents of β-casein led to formation of a higher-order oligomer, i.e. a substrate-engaged 12-mer (HpHtrA12) and 24-mer (HpHtrA24) (Fig. S7A). The sizes of the formed complexes are dependent on the β-casein substrate concentration. At lower substrate concentration, the 12-mer is predominantly formed, whereas HpHtrA24 was observed with increasing concentration of substrate.
Time-dependent analysis of the HpHtrA proteolytic products by size-exclusion chromatography showed that β-casein cleavage products by HpHtrA had two major groups of elution peaks, with the first peak eluted slightly later than β-casein, denoted as the primary product, and several smaller peaks eluted much later, denoted as the secondary products (Fig. 4A). In the course of time, the amounts of primary cleavage product were decreased, accompanied by an increase of the secondary cleavage products. The amount of HpHtrA12 was also decreasing during the cleavage process, indicating that the proteolytically active HpHtrA oligomer dissociated after substrate cleavage (Fig. S7B). In contrast, incubation of HpHtrA-ΔN with β-casein yielded no detectable HpHtrA oligomers or detectable cleavage products (Fig. 4B), suggesting that monomeric HpHtrA-ΔN loses the capability to assemble into the active oligomer even in the presence of excess amount of β-casein.
Figure 4Substrate binding triggers formation of the HpHtrA proteolytically active oligomer.A, size-exclusion chromatography analysis of the HpHtrA oligomerization state. The elution profiles of WT HpHtrA (red), β-casein (green), and the mixture of WT HpHtrA and β-casein (blue) are shown. The abbreviated eluted fractions are as follows: 12-mer, dodecameric WT-HpHtrA complex with the substrate; E, trimeric WT-HpHtrA; C, β-casein substrate; PP, primary digested product of β-casein; SP, secondary digested product of β-casein. The dodecameric HpHtrA, trimeric HpHtrA, and β-casein elution peaks are highlighted in yellow, red, and green, respectively. mAU, milli-absorbance units. B, size-exclusion chromatography profiles of WT-HpHtrA (blue) and HpHtrA-ΔN (magenta) incubated with β-casein substrate. The dodecameric HpHtrA with the β-casein complex elution peak is highlighted in yellow. The WT HtrA+casein curve was reused from A for comparison with HpHtrA-ΔN+casein. It indicates that monomeric HpHtrA-ΔN cannot assemble into the active oligomer even in the presence of an excess amount of β-casein. C, atomic force microscopy analysis of the HpHtrA oligomerization state. Top panel, AFM images of WT HpHtrA particles without (−P1) and with P1 peptide (+P1). Bottom panel, assembly height distribution histograms of WT HpHtrA in the absence (blue) and presence of P1 peptide (magenta).
The cleavage sites of E-cadherin by HpHtrA were located at the linker regions of EC domains. In particular, a 21-residue P1 peptide (Ac-TGTLLLILSDVNDNAPIPEPR-COOH) derived from the cleavage site between E-cadherin domains EC4 and EC5 could bind directly to HpHtrA (
). Therefore, we further investigated the binding between P1 peptide and HpHtrA. No high-order oligomer complex was observed in size-exclusion chromatography when HpHtrA was incubated with an excess amount of P1 peptide, which was probably due to the low binding affinity. To better characterize the interaction, atomic force microscopy (AFM) imaging experiments were carried out to capture the oligomerization state of HpHtrA upon P1 peptide binding. Deposition of HpHtrA alone onto freshly cleaved mica, followed by AFM imaging, resulted in distribution of the protein with a typical triangular shape, which is consistent with the trimeric form, as revealed by the crystal structure (Fig. 4C). Incubation of HpHtrA with P1 peptide caused substantial changes in protein particle shape in AFM imaging, in which the assembly heights of complex particles are significantly larger than that of HpHtrA alone, indicative of oligomerization of HpHtrA in the presence of P1 peptide (Fig. 4C). Collectively, we demonstrate that HpHtrA is able to assemble and disassemble dynamically to form a proteolytically active oligomer, which is promoted by binding of an unstructured substrate.
A lysine residue of HpHtrA is important for E-cadherin cleavage
In contrast to β-casein or P1 peptide, E-cadherin is a well-folded, rigid substrate for HpHtrA. To investigate the mechanism of E-cadherin cleavage by HpHtrA, a protein complex structural model was built by docking E-cadherin EC1–EC2 domains to the HpHtrA timer. In the docking model, the PDZ1 domains of two HpHtrA monomers work as clamps to bind the EC1–EC2 domains (Fig. S8A). Both of the EC domains have contact interfaces with the HpHtrA PDZ1 domains, which are closed to the identified PDZ1 substrate binding groove formed by β-strand A and α-helix B (
). Typically, two lysine residues, Lys326 and Lys328 from the PDZ1 domains, are proximal to the EC1 and EC2 domains in the complex model, implying that the two residues are possibly involved in EC1–EC2 domain recognition (Fig. S8B). To test the hypothesis, two HpHtrA mutants (HpHtrA-K326A and HpHtrA-K328A) were purified, and their substrates cleavage activities were investigated. Both mutants maintained similar β-casein cleavage activity compared with WT-HpHtrA. Intriguingly, the two mutants exhibited different cleavage activities for substrate E-cadherin. The HpHtrA-K326A mutant exhibited similar E-cadherin proteolytic activity compared with WT HpHtrA. In contrast, significantly attenuated E-cadherin cleavage activity was observed for the HpHtrA-K328A mutant (Fig. S9). Time-course substrate cleavage assay results also confirmed that K328A mutagenesis attenuated HpHtrA proteolytic activity for E-cadherin but not for β-casein (Fig. 5, B and C). The data suggest that Lys328 of the HpHtrA PDZ1 domain is critical for E-cadherin cleavage, whereas it is dispensable for β-casein proteolysis. The complex structural model here may represent the first snapshot of the binding and recognition of E-cadherin substrate by HpHtrA via the Lys328 residue of the PDZ1 domains, and binding of E-cadherin would further induce a conformational change of HpHtrA, facilitating cleavage of bound E-cadherin at the domain linker region.
Figure 5Substrate recognition model and phylogenetic analysis of HpHtrA.A, details of the interfaces between the HpHtrA PDZ1 domain and EC1–EC2. The peptide binding groove of the HpHtrA PDZ1 domain is colored orange. The Lys328 residue involved in E-cadherin recognition is shown as purple spheres. B and C, time-course cleavage assay of β-casein (B) and N-terminal E-cadherin (C) by WT HpHtrA and the HpHtrA(K328A) mutant. The total amount of substrate is normalized as 1, and the hydrolyzed substrates are plotted against time. The experiments were performed in triplicate, and the results are shown as mean value with standard deviation. D, phylogenetic tree of the HtrA family. The HpHtrA and DegS branches are highlighted in purple. The abbreviated species names and their GenBank accession numbers are as follows: CpDegPL, Chlamydia pneumoniae DegPL (Q9Z6T0); SsHtrA, Synechocystis sp. HtrA (P73354); DrHtrA1A, Danio rerio HtrA1A (Q6GMI0); XtHtrA, Xenopus tropicalis HtrA (A4IHA1); MsDegPL, Marinomonas sp. DegPL (A6VUA4); HcDegPL, H. chejuensis DegPL (Q2SL36); PfDegPL (fulva), Pseudomonas fulva DegPL (F6AA62); PfDegPL (fluorescens), Pseudomonas fluorescens DegPL (Q4KGQ4); BhDegPL, Bartonella henselae DegPL (P54925); EcDegP, E. coli DegP (P0C0V0); StDegP: Salmonella enterica serovar Typhimurium DegP (P26982); BaDegP (Baizongia pistaciae), Buchnera aphidicola subsp. Baizongia pistaciae DegP (Q89AP5); BaDegP (Acyrthosiphon pisum), B. aphidicola subsp. Acyrthosiphon pisum DegP (P57322); EcDegQ, E. coli DegQ (P39099); HpHtrA, H. pylori HtrA (G2J5T2); CjHtrA, C. jejuni HtrA (A7H2F1); EcDegS, E. coli DegS (P0AEE3); StDegS, S. enterica Typhimurium DegS (D0ZY51); HiDegS: Haemophilus influenzae DegS (P44947); LhHtrA, Lactobacillus helveticus HtrA (Q9Z4H7); LlHtrA, Lactococcus lactis HtrA (A2RNT9); BsHtrA: Bacillus subtilis HtrA (P39668). E, cellular location analysis of HpHtrA; Western blot analysis of HpHtrA in different separation fractions. Ctrl, purified HpHtrA protein; Bac., bacterial pellet; Med., extracellular medium; Tot., total protein after bacterial lysis; Sol., soluble protein after bacterial lysis; Wash, wash of pellet after cell lysis; Inner, inner membrane protein; Outer, outer membrane protein; HpTatC, twin-arginine translocation protein C, an inner membrane protein of H. pylori. HpTatC was observed in the total lysate and inner membrane fractions.
Phylogenetic analysis and cellular localization of HpHtrA
In E. coli, three members of the HtrA family have been identified: EcDegP, EcDegQ, and EcDegS. The three proteases exert different bacterial physiological functions (
). In contrast, only one HtrA homolog has been identified in H. pylori, implying that HpHtrA could be multifunctional in H. pylori. Sequence alignment of HtrA family proteins from several eukaryotic and prokaryotic species revealed that a similar extended N terminus can be found in Campylobacter jejuni HtrA (CjHtrA), which is also an identified protease secreted by human pathogenic bacteria. Intriguingly, the N terminus of HpHtrA exhibits high sequence similarity with the N-terminal region of DegS homologs, which is part of the transmembrane domain (Fig. S10). Further phylogenetic analysis of HtrA family proteins also indicates that HpHtrA is more closely related to DegS than to DegP, implying a potential evolutionary relationship between DegS and HpHtrA (Fig. 5D).
HpHtrA is a virulence factor secreted by H. pylori
HpHtrA contains a signal peptide for Sec-dependent transport across the inner membrane into the periplasm (
). However, it is still unknown whether HpHtrA is simultaneously transported across the out membrane when it enters the periplasm. Therefore, we investigated the cellular localization of HpHtrA with an HpHtrA-specific antibody. As shown in Fig. 5E (Fig. S11), clear HpHtrA protein bands were visible in the bacterial culture medium and soluble fraction after cell lysis. The soluble fraction should include HpHtrA from both the bacterial cytoplasm and periplasm. Surprisingly, substantial amounts of HpHtrA were also identified in the inner membrane fraction, implying that HpHtrA might directly anchor to the inner membrane or bind tightly to an inner-membrane protein. The results demonstrate that the H. pylori periplasm and inner membrane may serve as temporary reservoirs for HpHtrA before its extracellular secretion.
Discussion
It is well established that H. pylori translocates a virulence factor, CagA, into the gastric epithelial cell cytoplasm via a type IV secretion system (
). Recent studies demonstrated that H. pylori utilizes a novel secreted serine protease, HtrA, to cleave the host Occludin, Claudin-8, as well as E-cadherin proteins, which breaks down the E-cadherin–based adherens junctions and tight junctions between gastric epithelial cells to disintegrate the epithelial barrier. After cleavage, H. pylori can efficiently enter the intercellular space and interact with the exposed α5β1 receptors for virulence factor translocation (
). The HtrA protein was initially identified as a protease, functioning in the bacterial periplasmic space for protein quality control. Recent studies demonstrated that HtrA-mediated host cell E-cadherin cleavage is a prevalent pathogenic mechanism for multiple Gram-negative bacterial species, indicating that bacterial HtrA could serve as an attractive target for the design of antibacterial agents (
Here we determined the crystal structure of H. pylori HtrA protease, which possesses an extended N terminus different from other homologs. In the crystal structure, three HpHtrA monomeric molecules assemble into a new type of trimer via unique N-terminal domain swapping. Importantly, the HpHtrA trimer is mainly stabilized by N-terminal domain swapping. This assembly pattern is distinct from that of the E. coli homologs DegP and DegQ, both of which are stabilized by hydrophobic interactions of the protease domains. Truncation of the N terminus completely abrogated HpHtrA trimer formation, leading to monomeric HpHtrA without detectable protease activity, indicative of the essential role of the N terminus. A recent study of HpHtrA demonstrated that autocleavage of the HpHtrA N terminus abolished HpHtrA secretion and protease activity, which is consistent with our protein structure data (
). Although it is still unknown why HpHtrA adopts such a unique mechanism for protein trimer assembly, one plausible explanation is that larger intersubunit interface areas contributed by domain swapping enhance HpHtrA trimer stability so that HpHtrA can sustain protease activity after secretion into the hostile gastric niche (
). Therefore, targeting the N terminus to abolish HpHtrA trimer formation may represent a new potential anti-H. pylori strategy.
Similar to other HtrA homologs, HpHtrA could assemble into a proteolytically active oligomer in the presence of β-casein substrate. Similar assembly was also observed when HpHtrA was incubated with P1 peptide, which is derived from the E-cadherin cleavage site. However, HpHtrA oligomerization is unlikely to happen when HpHtrA binds to the E-cadherin ectodomain. Unlike unstructured β-casein or peptide, E-cadherin is a rigid protein with a well-folded structure, which could prevent the higher multimer active cage formation of HpHtrA. Previous studies demonstrated that EcDegP oligomeric cage assembly is not required for its proteolytic activation (
). It would not be surprising if HpHtrA could cleave E-cadherin in the trimeric form. Therefore, the recognition and cleavage mechanism of E-cadherin by HpHtrA should be different from that of β-casein. Indeed, our biochemical studies identified that Lys328 of HpHtrA was indispensable for E-cadherin cleavage but not for β-casein.
Although HpHtrA contains two PDZ domains similar to EcDegP and EcDegQ, phylogenetic analysis reveals a closer evolutionary relationship between HpHtrA and EcDegS. EcDegS is a serine protease anchored on the E. coli inner membrane and involved in the cellular response to extracytoplasmic stress via activation of the E. coli σ factor σE (
). Cellular localization analysis revealed that a substantial amount of HpHtrA was also identified in the bacterial inner membrane. Although it is unknown how HpHtrA is attached to the inner membrane, it is possible that H. pylori adopts a strategy to preserve HpHtrA so that export of the virulence factor could occur by a triggered mechanism, which avoids unnecessary secretion. Therefore, it would be interesting to further investigate whether the cellular localization of HpHtrA is functionally relevant.
Conclusion
In summary, we report the first crystal structure of the H. pylori HtrA trimer stabilized by unique N-terminal domain swapping, which represents an unprecedented novel assembly of HtrA family proteases. The N terminus of HtrA from H. pylori is also indispensable for its protease activity. The molecular mechanism of how HpHtrA recognizes and cleaves E-cadherin was elucidated based on a structural model that is distinct from that of the conventional substrate β-casein. Given that HpHtrA is a promising target for the design of anti-H. pylori agents, the structure we report here could facilitate the development of a new type of anti-H. pylori agents by targeting HtrA both at the active site and N terminus.
Experimental procedures
Protein expression and purification
Details regarding expression and purification of H. pylori HtrA proteins can be found in the supporting Methods. In brief, N-terminal His-tagged H. pylori WT or mutant HtrA proteins lacking the signal peptide sequence were expressed in the E. coli BL21 (DE3) strain and purified by nickel affinity chromatography followed by gel filtration. Recombinant proteins were pooled and stored at −80 °C. All the PCR primers were listed in Table S1.
Protein crystallization and structure determination
Crystals of full-length HpHtrA were obtained by sitting drop diffusion at 20 °C and mixing equal volumes of the protein and the reservoir solution, consisting of 2.1 m DL-malic acid, and 0.1 m HEPES (pH 7.0). For HpHtrA-ΔPDZ2, crystals were obtained similarly, with the reservoir solution containing 20% PEG 1500, 0.1 m HEPES (pH 7.5), and 0.2 m proline. Crystals were cryoprotected with reservoir solution supplemented with 10% glycerol and flash-frozen in liquid nitrogen. Diffraction data were performed at the Shanghai Synchrotron Radiation Facility (Shanghai, China) using beamlines 17B, 17U1, and 19U1. Raw data images were processed with HKL2000 (
). For the time-course β-casein cleavage assay, 40 nm WT HpHtrA or mutants was incubated with 3 μg of β-casein in reaction buffer (20 mm Tris-HCl (pH 7.5), 250 mm NaCl, and 1 mm DTT) at 37 °C. At the indicated times, aliquots of the reactions were removed for SDS-PAGE analysis. For the E-cadherin cleavage assay, ∼0.8 μm WT HpHtrA or mutants was incubated with 50 ng of the N-terminal domain of E-cadherin in reaction buffer for 16 h at 37 °C. The remaining substrates were blotted with an antibody to E-cadherin. The SDS-PAGE and Western blot band intensities were quantified using ImageJ software (
AFM imaging was carried out in QI mode in liquid with a NanoWizard 4 microscope (JPK Instruments AG) equipped with SNL-10 cantilevers (Bruker Probes). Square images of 256 × 256 pixels were collected with a scan size of 500 nm and a set point of 300 pN. For each experiment, ∼20 μl of HpHtrA protein samples (0.01 mg/ml) in the absence or presence of P1 peptide was added onto freshly cleaved mica and allowed to absorb for 15 min. After that, the mica surface was rinsed and submerged in Tris buffer (400 mm Tris-HCl (pH 7.4) and 100 mm NaCl).
Size-exclusion chromatography analysis
Size-exclusion chromatography was performed with a Tricorn Superdex 200 Increase 10/300 GL column (GE Healthcare) at 4 °C. The column was calibrated with a gel filtration calibration kit (GE Healthcare) and pre-equilibrated with gel filtration buffer (20 mm Tris-HCl (pH 7.5), 250 mm NaCl, and 1 mm DTT). For oligomerization state analysis, 100-μl samples containing 40 μm WT HpHtrA or mutants were injected into the column. For substrate-triggered oligomer formation, ∼100 μl of protein mixtures containing 40 mmHpHtrA and 160 mm β-casein or E-cadherin EC1-EC2 domain was incubated on ice for the indicated time before loading onto the column.
Structure model of the HpHtrA and E-cadherin complex
An HpHtrA trimer structure model with three intact PDZ1 domains was generated using chain C of the HpHtrA-ΔPDZ2 structure (PDB code 5Y28) as a template. The protein complex structure was built by docking the structure of the human E-cadherin EC1–2 domain (PDB code 4ZT1) to the HpHtrA trimer using the ESCHER protein–protein automatic docking system with default parameters (
). The structure with the lowest energy was selected as the complex model.
Circular dichroism
CD spectra were recorded at ambient temperature on a JASCO J-810 spectrophotometer using a quartz cuvette with a path length of 0.1 cm. Urea denaturation experiments were performed as described previously (
). In brief, urea gradients were added to 20 μm WT HpHtrA and EcDegQ in 10 mm Tris–H2SO4 buffer supplemented with 100 mm NaSO4 (pH 7.5). After equilibrating overnight at 4 °C, CD spectra were recorded between 200 and 270 nm with an average of three scans. The absorption at 220 nm was used to characterize the unfolding state of the protein.
Sequence alignment and phylogeny analysis
HtrA family protein sequence alignment and the phylogenetic tree were done using the MAFFT online program (version 7) with default options (https://mafft.cbrc.jp/alignment/server/).
) with minor modifications. Details of the experiments are described in the supporting Methods. All cellular fractions were analyzed by Western blotting using an HpHtrA-specific antibody.
Accession codes
The coordinates for full-length HpHtrA and HpHtrA-ΔPDZ2 have been deposited in the PDB with accession codes 5Y2D and 5Y28, respectively.
Author contributions
Z. Z., Q. H., X. T., G. S., P. Z., H. L., and H. S. data curation; Z. Z. and Q. H. formal analysis; Z. Z. and Q. H. writing-original draft; Q. H. and X. T. validation; Q. H. methodology; P. Z., H. S., and W. X. supervision; H. L., H. S., and W. X. writing-review and editing; H. S. and W. X. funding acquisition; W. X. conceptualization; W. X. investigation; W. X. project administration.
Acknowledgments
We thank the staff of the BL17B, BL17U1, and BL19U1 beamlines of the Shanghai Synchrotron Radiation Facility for assistance with data collection.
Characterization of the structure and function of Escherichia coli DegQ as a representative of the DegQ-like proteases of bacterial HtrA family proteins.
Rapid paracellular transmigration of Campylobacter jejuni across polarized epithelial cells without affecting TER: role of proteolytic-active HtrA cleaving E-cadherin but not fibronectin.
The N-terminal region of HtrA heat shock protease from Escherichia coli is essential for stabilization of HtrA primary structure and maintaining of its oligomeric structure.
This work was supported by grants from the National Natural Science Foundation of China (21671203, 21771103, and 21877131); the Science and Technology Program of Guangzhou, China (201707010038); RGC of Hong Kong (17305415, 17333616, and 17307017); the Ministry of Education of China (IRT-17R111); Fundamental Research Funds for the Central Universities, Natural Science Foundation of Jiangsu Province (BK20160639); the Shuangchuang Program of Jiangsu Province; and startup funding from Sun Yat-sen University. The authors declare that they have no conflicts of interest with the contents of this article.