Mechanism for Recognition of an Unusual Mycobacterial Glycolipid by the Macrophage Receptor Mincle

Background: Mincle facilitates establishment of persistent infections of macrophages by Mycobacterium tuberculosis. Results: The mechanism of mincle binding to mycobacterial glycolipids has been defined, and inhibitors have been synthesized. Conclusion: Mincle binds to both the sugar portion of the glycolipid and the hydrocarbon tail. Significance: The results suggest ways to manipulate the response to mycobacteria and to improve adjuvants that stimulate the immune system.

Binding of the macrophage lectin mincle to trehalose dimycolate, a key glycolipid virulence factor on the surface of Mycobacterium tuberculosis and Mycobacterium bovis, initiates responses that can lead both to toxicity and to protection of these pathogens from destruction. Crystallographic structural analysis, site-directed mutagenesis, and binding studies with glycolipid mimics have been used to define an extended binding site in the C-type carbohydrate recognition domain (CRD) of bovine mincle that encompasses both the headgroup and a portion of the attached acyl chains. One glucose residue of the trehalose Glc␣1-1Glc␣ headgroup is liganded to a Ca 2؉ in a manner common to many C-type CRDs, whereas the second glucose residue is accommodated in a novel secondary binding site. The additional contacts in the secondary site lead to a 36-fold higher affinity for trehalose compared with glucose. An adjacent hydrophobic groove, not seen in other C-type CRDs, provides a docking site for one of the acyl chains attached to the trehalose, which can be targeted with small molecule analogs of trehalose dimycolate that bind with 52-fold higher affinity than trehalose. The data demonstrate how mincle bridges between the surfaces of the macrophage and the mycobacterium and suggest the possibility of disrupting this interaction. In addition, the results may provide a basis for design of adjuvants that mimic the ability of mycobacteria to stimulate a response to immunization that can be employed in vaccine development.
The balance between latent and active forms of tuberculosis lies in a complex set of interactions between mycobacteria and the host immune system (1)(2)(3). In addition to tuberculosis in humans, caused by the infectious agent Mycobacterium tuberculosis, a bovine form of tuberculosis resulting from infection with Mycobacterium bovis is of wide interest because of the agricultural impact of persistent infections in domestic herds of cattle (4). Macrophages play a key role in sequestering the mycobacteria in granulomas, leading to persistent infections (2). Recent studies have revealed that the macrophage-inducible C-type lectin CLEC4E, called mincle, is an important component of communication between infecting mycobacteria and macrophages. Mincle was initially identified as a receptor in stimulated macrophages that induces cytokine release (5). It is a type II transmembrane protein, with a C-type carbohydrate recognition domain (CRD) 2 at the C-terminal end of the extracellular domain (see Fig. 1A). Although the CRD contains all of the amino acid residues commonly required for sugar binding by C-type CRDs (6,7), ligands for mincle have only recently been described. The major sugar-containing ligands are trehalose dimycolate, a glycolipid found in the outer membrane of mycobacteria (8,9), and other mannose-or glucose-containing glycoconjugates in fungi and yeast (10,11).
Trehalose dimycolate, which is a characteristic component of the mycobacterial surface, is also referred to as cord factor. It comprises a Glc␣1-1Glc␣ headgroup and two complex branched and hydroxylated acyl chains attached to the 6-OH groups of each of the sugar residues. The importance of mincle in the interaction of macrophages with mycobacteria stems both from its ability to interact with trehalose dimycolate and from its association with the common Fc receptor-1␥ subunit, which activates the Syk-CARD signaling pathway (12,13). This interaction leads to granuloma formation and may participate in directing the macrophages to tolerate rather than destroy internalized mycobacteria (14).
The studies described here define the mechanism by which bovine mincle targets trehalose dimycolate. X-ray crystallography and mutagenesis studies have revealed an extended ligandbinding site in mincle that interacts with both the sugar headgroup and acyl portions of the glycolipid. Small molecule mimics of trehalose dimycolate have been generated to establish the importance of different parts of the glycolipid for recognition.

EXPERIMENTAL PROCEDURES
Protein Expression and Purification-Fragments of the cDNA for bovine mincle were amplified from a liver cDNA library (US Biological) by PCR (Advantage 2 Polymerase Mix, Takara), cloned into the pCR2.1-TOPO vector (Invitrogen), and sequenced using an Applied Biosystems 310 genetic analyzer. The form cloned in this way encoded a threonine residue at position 174, compared with isoleucine in the reference sequence (NCBI accession number XP_002687869), reflecting a polymorphism that has been documented in the bovine genome as rs135158086. Restriction fragments were moved into the pT5T expression vector (15) in Escherichia coli strain BL21(DE3). For expression of biotin-tagged proteins, nucleotides encoding the biotinylation sequence Gly-Leu-Asn-Asp-Ile-Phe-Glu-Ala-Gln-Lys-Ile-Glu-Trp-His-Glu were appended at the 3Ј-end of the coding sequence by inclusion in the PCR primers, and the bacteria used for expression also contained the pBirA plasmid encoding biotin ligase (16). Mutagenesis was performed by two-step PCR (17) using the original cDNA clone as template.
The wild-type CRD from cow mincle was expressed, and inclusion bodies were isolated as described previously for other C-type CRDs (18). Inclusion bodies from 6 liters of bacterial culture were dissolved in 100 ml of 6 M guanidine HCl containing 100 mM Tris-Cl (pH 7.8) and incubated in the presence of 0.01% (v/v) 2-mercaptoethanol for 30 min at 4°C. Following centrifugation for 30 min at 100,000 ϫ g in a Beckman Ti50.2 rotor, the supernatant was dialyzed against 3 changes of loading buffer containing 0.5 M NaCl, 25 mM Tris-Cl (pH 7.8), and 25 mM CaCl 2 at 4°C. Insoluble material was removed by centrifugation as described above, and the supernatant was applied to a 10-ml column of trehalose-Sepharose prepared by coupling with divinyl sulfone (19). After rinsing with 14 ml of 150 mM NaCl, 25 mM Tris-Cl (pH 7.8), and 25 mM CaCl 2 , the bound protein was eluted with 16 aliquots of 1 ml of 150 mM NaCl, 25 mM Tris-Cl (pH 7.8), and 2.5 mM EDTA. Fractions containing the CRD were determined by examining aliquots on SDS-polyacrylamide gels containing 17.5% (w/v) acrylamide, which were stained with Coomassie Blue. The purified protein was characterized by gel filtration on a 1 ϫ 30-cm Superdex 200 column (GE Healthcare) eluted with 10 mM Tris-Cl (pH 7.8), 100 mM NaCl, and 2.5 mM EDTA at a flow rate of 0.5 ml/min, with absorbance monitored at 280 nm. Mutant proteins were purified by the same protocol, except that invertase-Sepharose was used in place of trehalose-Sepharose.
Binding Assays-Glycolipid blotting experiments were performed by spotting 2-4 l of the following glycolipid solutions on polyvinylidene difluoride membrane (Millipore): trehalose dimycolate (Carbosynth Ltd.) at 2 g/l in chloroform, galactosylceramide from bovine brain (Sigma) at 2 g/l in chloroform/methanol (2:1), and lipoteichoic acid from Staphylococcus aureus (Sigma) at 2 g/l in water/methanol (2:3). After the spots were dry, the membrane was wetted with methanol, immersed immediately in a solution of 5% (w/v) BSA in binding buffer (1.5 M NaCl, 25 mM Tris-Cl (pH 7.8), and 2.5 mM CaCl 2 ), and incubated for 1 h at room temperature with shaking. The biotin-tagged CRD from bovine mincle was added to a final concentration of 1 g/ml and incubated for an additional 1.5 h at room temperature. Following three washes of 5 min each with binding buffer, the membrane was incubated with 0.5 g/ml ExtrAvidin coupled to alkaline phosphatase (Sigma) in 5% (w/v) BSA in binding buffer for 1 h at room temperature. After an additional three washes with binding buffer, the blot was developed with nitro blue tetrazolium and 5-bromo-4chloro-3Ј-indolyl phosphate substrate (Millipore).
Immobilized CRDs for competition binding assays were prepared in streptavidin-coated wells (Thermo Scientific Pierce), washed three times with binding buffer, incubated with biotintagged CRDs at 5 g/ml in 0.1% (w/v) BSA in binding buffer for 2 h at 4°C, and washed an additional three times with binding buffer. Competing ligands were incubated in the wells in the presence of 0.5 g/ml 125 I-Man 31 -BSA in 0.1% (w/v) BSA in binding buffer for 2 h at 4°C. After washing three times with binding buffer, wells were counted in a Wallac WIZARD gamma counter (PerkinElmer Life Sciences). Assays were performed in duplicate, with the average values used to calculate inhibition constants using SigmaPlot for fitting to a simple binding curve (20). The order of binding determined from the fitting was always in the range of 0.8 -1.3. Ratios of K I(trehalose) / K I(acyltrehalose) were calculated for assays performed together on one assay plate. Each set of assays was then repeated three to four times. The reported values are means Ϯ S.D. for the ratios from replicate experiments. The pH dependence of ligand binding was measured in the same assay format, with no competing ligand and with a CaCl 2 concentration of 2.5 mM. A series of buffers, each containing both 25 mM MES and 25 mM MOPS and adjusted to various pH values with NaOH, were substituted for the Tris buffer.
Crystallization and Data Collection-The untagged CRD was concentrated by extensive dialysis against water and lyophilization before being dissolved in 2.5 mM CaCl 2 , 10 mM Tris-Cl (pH 8.0), and 25 mM NaCl. Crystals were obtained by hangingdrop vapor diffusion using a mixture of 3:1 and 2:1 l of protein/reservoir in the drops for the unliganded and liganded forms, respectively. Crystals were grown at 22°C from a protein solution comprising 2.2-3.5 mg/ml mincle, 2.5 mM CaCl 2 , 10 mM Tris-Cl (pH 8.0), and 25 mM NaCl. Trehalose (50 mM) was added to the protein solution to form the complex. The protein/ trehalose solution precipitated slightly but cleared while forming the crystallization drop. The reservoir solution used to obtain crystals of the native protein contained 20% (w/v) polyethylene glycol 4000, 20% (v/v) 2-propanol, and 0.1 M sodium citrate (pH 5.0), and the reservoir solution for the trehalose complex contained 25% (w/v) polyethylene glycol 4000, 17.5% (v/v) 2-propanol, and 0.1 M sodium citrate (pH 5.0). Crystals were transferred to perfluoropolyether (PFO-XR75, Lancaster Synthesis) before being frozen in liquid nitrogen for data collection. Diffraction data were measured at 100 K at the Stanford Synchrotron Radiation Lightsource. Data were processed with XDS and scaled with SCALA (21). Data collection statistics are summarized in Table 1.
Structure Determination-The structure of the native mincle CRD was solved by molecular replacement using the program Phaser (22). The model used for molecular replacement was derived from Protein Data Bank entry 1SL4, removing the sac-charide, Ca 2ϩ , water molecules, and residues 253-255 and 283-284. The molecular replacement solution confirmed the existence of two monomers, designated A and B, in the asymmetric unit and showed that the crystals belong to space group P6 5 . Model building and refinement were performed with Coot (23) and PHENIX (24). Refinement included individual positional isotropic temperature factor and TLS refinement. A difference Fourier map showed strong peaks at two positions in each monomer, one at the expected primary Ca 2ϩ site and another Ca 2ϩ at a site found in some other C-type lectins, as well as a citrate molecule bound at the primary Ca 2ϩ site of monomer A. Refinement of models with either Ca 2ϩ or Na ϩ revealed that Ca 2ϩ is present in these sites, as indicated by the comparable temperature factors of Ca 2ϩ and its surrounding atoms versus a very low refined value for Na ϩ . The presence of Ca 2ϩ was confirmed by the presence of peaks in an anomalous difference Fourier map (⌬fЉ ϭ 0.57 e for Ca 2ϩ versus 0.05 e for Na ϩ at 12.6 keV). In the trehalose complex, these peaks are at heights of 25 and 25 in copy A and of 19 and 6 in the less well defined copy B. In the native crystal, the peaks are at 6 and 9 in copy A and at 3 and 7 in copy B. In contrast, electron density modeled as Na ϩ showed no significant peaks in the anomalous difference Fourier map.
The structure of the trehalose complex was solved by molecular replacement using monomer A of a partially refined structure of the native CRD. The solution confirmed space group P4 1 2 1 2, with two monomers in the asymmetric unit. Anisotropic temperature factors were refined in monomer A in this structure. The atomic coordinates and structure factors have been deposited in the Protein Data Bank, with code 4KZW for the native CRD and code 4KZV for the trehalose complex.
Synthesis of Trehalose Derivatives-Reagents for synthesis were obtained from Sigma, and chromatography media were from Merck. Acylated derivatives of trehalose were prepared by reacting 500 mg of trehalose with 1-5 ml of carboxylic acid in the presence of 500 mg of lipase from Candida antarctica immobilized on polystyrene beads in 25 ml of isoamyl alcohol at 40°C with mixing (25). The lower concentration of acid was used to favor synthesis of the monoacylated derivatives. Monoand diacylated derivatives were separated by chromatography on 25 ml of silica gel in chloroform/methanol/water (75:25:4). Fractions of 5 ml were collected. Purification was monitored by thin layer chromatography on aluminum-backed silica gel plates developed in the same solvent and stained with orcinol/ sulfuric acid reagent (supplemental Fig. S1) (26). Purified compounds were dried from solvent, dissolved in water, filtered through Anotop 20-nm pore filters (Whatman), and lyophilized. Stock solutions were prepared in water and assayed using the anthrone reaction (27). Compounds were characterized by matrix-assisted laser desorption ionization mass spectrometry on an Applied Biosystems 4800 instrument and by one-dimensional proton NMR on a Bruker 400-MHz spectrometer (supplemental Figs. S2 and S3).

RESULTS
Characterization of the Mincle CRD-Comparison of mouse, human, and cow mincle reveals close sequence similarity throughout the polypeptides, including in the C-terminal CRD (Fig. 1B). A minimal CRD from bovine mincle was expressed in E. coli and could be purified by affinity chromatography on immobilized trehalose ( Fig. 2A). Binding of the mincle CRD to trehalose dimycolate was demonstrated using a glycolipid over- Conserved cysteine residues that form the three disulfide bonds characteristic of C-type CRDs are highlighted in yellow, and ligands for the conserved Ca 2ϩ site are highlighted in green (7). Key residues in the secondary glucose-binding site are shaded pink, and residues that form the hydrophobic groove are shaded blue. Residues that chelate the supplemental Ca 2ϩ are indicated in violet. The mouse and cow CRDs show sequence identities of 71 and 76% to the human protein, respectively.  OCTOBER 4, 2013 • VOLUME 288 • NUMBER 40 lay assay (Fig. 2B). Competition assays using the unusual 1-1Јlinked disaccharide trehalose headgroup of the glycolipid revealed a 36-fold higher affinity for the disaccharide compared with the constituent glucose monosaccharides (Fig. 2C), indicating the presence of an extended binding site encompassing both glucose residues.

Recognition of Mycobacterial Glycolipid by Mincle
Initial crystals of the expressed CRD, obtained in citrate buffer at pH 5, were solved by molecular replacement with a search model based on the CRD of DC-SIGN (Protein Data Bank entry 1SL4). The data statistics are summarized in Table  1, and the refinement statistics are shown in Table 2. The mincle polypeptide assumes an overall fold similar to other C-type CRDs (Fig. 3A) (7). As in these other structures, the polypeptide assumes a loop-out topology, in which ␤-strands at the N and C termini are paired at the bottom of the structure as it is presented in Fig. 3A and are flanked by two helices. The upper portion of the structure consists largely of a series of irregular loops surrounding a short pair of ␤-strands. However, in the conserved Ca 2ϩ -binding site that serves as the primary sugar-binding site in C-type lectins (7), only three of the five amino acid side chains that normally form the divalent cation-binding site (Fig. 1B) are coordinated to the Ca 2ϩ (Fig.  3B). Despite these differences, a Ca 2ϩ is found at the conserved site, coordinated to three oxygens of a citrate molecule (Fig. 3, B and C).
Structure of the Mincle CRD Bound to Trehalose-Because Ca 2ϩ and sugar binding are coupled in C-type lectins (28), a high carbohydrate ligand concentration should shift the conformational equilibrium of the conserved Ca 2ϩ site toward the sugar-binding conformation. Generating the mincle CRD crystals in the presence of trehalose resulted in a remarkable change in the organization of the crystals, from space group P6 5 to P4 1 2 1 2, and the diffraction limit for the crystals improved from 1.85 to 1.4 Å. Solving the structure of this new crystal form using the original structure as a search model revealed the presence of a trehalose molecule bound to the CRD (Fig. 4, A and B). There are substantial rearrangements of the polypeptide in the trehalose-bound structure (Fig. 4, C-E). The side chains of Asp-193 and Glu-176 shift into the positions observed in other C-type CRD Ca 2ϩ -binding sites such that the trehalose com- and ͗I(h)͘ is the mean intensity obtained from multiple measurements. plex has the expected arrangement in which five side chains are now ligands for the conserved Ca 2ϩ site (Fig. 4C).
In both the native and trehalose-bound structures, a Na ϩ ion is present near the Ca 2ϩ (Fig. 4, D and E), but it interacts with different residues, shifting closer to the Ca 2ϩ in the trehalose complex so that both Asp-193 and Glu-176 bridge between the Ca 2ϩ and Na ϩ . This region appears to be flexible, as it is poorly ordered in copy B of both the native and trehalose-bound structures. The change in space group is likely due to the different ordered conformations of this region present in copy A, which is packed in different lattice environments. Given that the native and trehalose-bound crystals were obtained under virtually identical conditions, the observed differences in conformation (and therefore, lattice packing) can be attributed to the presence of saturating trehalose. However, it is possible that the apparent flexibility is a consequence of the low pH.
The unusual conformation of the conserved Ca 2ϩ site in the native crystals compared with the trehalose complex may be related to the low pH crystallization condition, which is predicted to disrupt interactions of acidic side chains with Ca 2ϩ . Because sugar binding depends on the precise arrangement of ligands in the Ca 2ϩ site, loss of sugar-binding activity at low pH,  OCTOBER 4, 2013 • VOLUME 288 • NUMBER 40 resulting from reduced affinity for Ca 2ϩ , is characteristic of C-type CRDs that release their ligands in endosomes following endocytosis (29,30). As predicted, the affinity for ligand was dramatically diminished at low pH (Fig. 5).

Recognition of Mycobacterial Glycolipid by Mincle
One of the two glucose residues in the bound trehalose molecule occupies the canonical primary binding site typical of C-type CRDs, with the equatorial 3-and 4-OH groups forming coordination bonds to the Ca 2ϩ and hydrogen bonds with four of the amino acid side chains that also ligate the Ca 2ϩ : Glu-176, Asn-192, Glu-168, and Asn-170 (Fig. 4F). This arrangement of the equatorial 3-and 4-OH groups and side chains mimics the coordination of mannose and N-acetylglucosamine residues in DC-SIGN, mannose-binding protein, and other C-type CRDligand complexes (6,7,31). The A face of the glucose is packed against the side chain of Leu-172 (Fig. 4G), which prevents an alternative arrangement in which the positions of 3-and 4-OH groups are switched by a rotation of 180°because the presence of the second glucose residue would clash with the leucine side chain.
Trehalose bound to mincle falls in the ensemble of conformations observed in other crystal structures of unrelated proteins (for example, Protein Data Bank entry 1EU8, trehalose/ maltose-binding protein from Thermococcus litoralis; entry 1F0P, M. tuberculosis antigen 85B; entry 2DXY, C-terminal lobe of bovine lactoferrin; and entry 3OIH, xylanase-␣-amylase inhibitor protein) and in solution (32). The arrangement of the two glucose residues relative to each other brings the second glucose residue into contact with additional portions of the surface of the protein, which form a secondary binding site (Fig. 4,  G and H). In this site, glucose 2-OH is part of a cooperative hydrogen bonding network in which it accepts a hydrogen bond from Arg-182 and donates to Glu-135, thereby bridging these two side chains. Glucose C3 also packs against the Arg-182 side chain, which in turn also packs against Glu-135 (Fig. 4, G and H).
Changing Glu-135 to glutamine, reducing its effectiveness as a hydrogen bond acceptor, resulted in an 8-fold reduction of the enhanced binding of trehalose compared with glucose (Fig. 6). Changing Arg-182 to lysine reduced the affinity for trehalose to being the same as that for ␣-methyl glucoside, suggesting that the lysine side chain cannot form hydrogen bonds with 2-OH of the second glucose, thus effectively destroying the secondary binding site. The importance of Arg-182 also reflects the fact that it forms a hydrogen bond with 2-OH of the glucose in the primary binding site and packs against C3 of the glucose residue in the secondary binding site (Fig. 4, G and H). These results confirm that the secondary sugar-binding site observed in the crystal structure could function as part of the binding mechanism for trehalose dimycolate.
Binding of Glycolipid Analogs to the Mincle CRD-In the predominant mycobacterial glycolipid, both of the 6-OH groups of the glucose residues in trehalose are esterified with the unusual ␣-branched, ␤-hydroxylated mycolic acid (Fig. 7A). As a soluble mimic of this structure, trehalose derivatives esterified on the 6-OH groups with simple linear acids were prepared by enzymatic synthesis using immobilized lipase under anhydrous conditions to achieve regioselective modification of the primary hydroxyl (6-OH) groups (25). Binding competition assays demonstrated that acetylation at the two 6-OH groups is tolerated, confirming that these groups are unencumbered on the surface of the CRD. In addition, acylation with 3-or 4-carbon chains resulted in significantly enhanced affinities for mincle, reflected in an increase in the K I(trehalose) /K I(acyltrehalose) values (Fig. 7B).
Examination of the surface of the mincle CRD near the primary sugar-binding site reveals a hydrophobic channel running between Phe-197/Phe-198 on one side and Leu-172/Val-173 on the other side (Fig. 8). The top of this channel is directly adjacent to the 6-OH group of the glucose residue in the primary  binding site. Modeling of octanoic acid attached to this 6-OH group suggests that at least six carbon atoms would be accommodated within the channel (Fig. 8). Because the 6-OH group of the other glucose residue is positioned away from the surface of the CRD, ligands in which a single fatty acid chain was attached to one of the two glucose residues were synthesized to maintain solubility of the ligands. As expected, the mono-n-butanoyl derivative showed enhanced affinity compared with trehalose, and increasing the length of the fatty acid from four to eight carbons resulted in further increases in affinity (Fig. 7B). The fatty acid chains alone did not compete for binding at concentrations up to 50 mM, which is 20 -1000-fold higher than the K I values of the acylated trehalose derivatives (data not shown).
The model suggests that much of the increase in affinity for the acylated trehalose derivatives may result from favorable interactions with the hydrophobic groove, although some contribution of decreased solubility of the longer chain ligands cannot be ruled out. To obtain direct evidence for interaction of the acyl chains with the protein surface, mutations were made in the amino acid residues that form the hydrophobic groove. Removal of one side of the channel by mutation of Phe-197 and Phe-198 resulted in a 4-fold loss in affinity for mono-octanoyl trehalose based on the binding competition assay (Fig. 9). The other side of the groove could only be partially removed because Leu-172 forms part of the primary glucose-binding site. However, mutation of Val-173 resulted in a 1.5-fold reduction in affinity for the acylated ligand. These results are consistent with the proposed mode of acyl chain binding shown in Fig. 8.

DISCUSSION
The structural and binding data presented here support a model for binding of trehalose dimycolate in which one of the acyl chains of the lipid interacts directly with the CRD, providing specificity directed by both the sugar and lipid portions of the mycobacterial ligand (Fig. 10). The structural and biochemical description of the ligand-binding site in mincle presented here provides a novel paradigm for interaction of a glycan-binding receptor with a glycolipid target. The binding site consists of a canonical C-type primary binding site centered on Ca 2ϩ , supplemented on one side by a secondary binding site for the second glucose residue in the trehalose headgroup and on the other side by a hydrophobic channel that can bind acyl groups. In combination, these three sites provide an ideal mechanism for recognition of the unique features of a pathogen-specific glycolipid. The glucose diglycoside is essentially clamped between the primary and secondary sugar-binding sites, with stereospecificity derived from a network of hydrogen bonds to hydroxyl groups on each sugar, as well as non-polar contacts. In addition to allowing hydrophobic interactions with the linear hydrocarbon portion of mycolic acid, the open side of the hydrophobic channel accommodates the branching and hydroxylation of this mycobacterium-specific surface structure. The interaction of both the polar and non-polar portions of trehalose dimycolate with adjacent sites on the mincle CRD contrasts with the T-cell receptor-CD1 complex, in which the headgroup and acyl portions of a glycolipid are bound by separate receptors on two different cells (33).
C-type lectins (such as DC-SIGN) that are involved in pathogen recognition in innate immunity show high degrees of structural and functional divergence between species (34). In contrast, sequence conservation (Fig. 1B) suggests that the trehalose dimycolate-binding site described here is conserved in human and mouse orthologs of bovine mincle. Conservation of the mechanism for interaction with trehalose dimycolate across mammalian species may reflect evolutionary coexistence of mammalian hosts and mycobacterial pathogens (35).
The arrangement of a hydrophobic channel adjacent to the sugar-binding site in mincle suggests how other glycolipids with alternative headgroups, such as those found in the fungus Malassezia (10), might bind to and activate mincle. On the other hand, the absence of the residues that form the hydrophobic channel from the related macrophage receptor C-type lectin (MCL) indicates that weak binding of this additional receptor to trehalose dimycolate (36) probably involves the headgroup only.
Formation of the hydrophobic groove depends on the loop between residues 170 and 177 assuming the conformation observed in the trehalose complex, so the different conformation of this region observed in the native crystals may contribute to the observed pH dependence of ligand binding. The finding that binding would be reversed under endosomal conditions in turn provides a mechanism for down-regulation of signaling following internalization of mycobacteria. Interaction of mincle with a portion of the acyl chains attached to trehalose may also result in disruption of the mycobacterial membrane organization.
The action of trehalose dimycolate or cord factor as an adjuvant has been widely exploited (37)(38)(39). In addition to yielding insights into mycobacterial interactions with host macrophages, this investigation reveals how ligands such as the synthetic adjuvant trehalose dibehenate interact with the CRD in mincle and provides a basis for further development of adjuvants. FIGURE 8. Hydrophobic channel adjacent to the primary binding site. A model of the trehalose octanoate conjugate was made by adjusting the O5-C5-C6-O6 dihedral angle of glucose residue 1 by 120°and forming a bond of the appropriate length with the planar carboxylate group of octanoic acid from Protein Data Base entry 1H2B. The model suggests that at least the first six carbon atoms of the acyl chain would interact with the channel. The surface of the protein is colored based on the underlying atoms: green for carbon, red for oxygen, and blue for nitrogen. The hydrocarbon chain of the octanoic acid is colored cyan.