![]()
|
|
||||||||
J. Biol. Chem., Vol. 281, Issue 47, 35884-35893, November 24, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




1
From the
Structural Biology Research Center, Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization, Tsukuba, Ibaraki 305-0801,
Department of Endocrinology, Faculty of Medicine, Kagawa University, 1750-1 Ikenobe, Miki-cho, Kita-gun, Kagawa 761-0793, and the ¶Department of Applied Biological Chemistry, Faculty of Agriculture, Shizuoka University, 836 Ohya, Suruga-ku, Shizuoka 422-8529, Japan
Received for publication, July 13, 2006 , and in revised form, September 13, 2006.
| ABSTRACT |
|---|
|
|
|---|
-galactoside-binding animal lectins with a conserved carbohydrate recognition domain (CRD). They have a high affinity for small
-galactosides, but binding specificity for complex glycoconjugates varies considerably within the family. The ligand recognition is essential for their proper function, and the structures of several galectins have suggested their mechanism of carbohydrate binding. Galectin-9 has two tandem CRDs with a short linker, and we report the crystal structures of mouse galectin-9 N-terminal CRD (NCRD) in the absence and the presence of four ligand complexes. All structures form the same dimer, which is quite different from the canonical 2-fold symmetric dimer seen for galectin-1 and -2. The
-galactoside recognition mechanism in the galectin-9 NCRD is highly conserved among other galectins. In the apo form structure, water molecules mimic the ligand hydrogen-bond network. The galectin-9 NCRD can bind both N-acetyllactosamine (Gal
14GlcNAc) and T-antigen (Gal
13GalNAc) with the proper location of Arg-64. Moreover, the structure of the N-acetyllactosamine dimer (Gal
14GlcNAc
13Gal
14GlcNAc) complex shows a unique binding mode of galectin-9. Finally, surface plasmon resonance assay showed that the galectin-9 NCRD forms a homophilic dimer not only in the crystal but also in solution. | INTRODUCTION |
|---|
|
|
|---|
-galactosides, but the overall binding affinity for more complex glycoconjugates varies substantially. To date, 14 members of the mammalian galectin family have been identified (1). Hirabayshi and Kasai (2) proposed designating galectin subfamilies as proto-, chimera-, and tandem-repeat types based on their domain organization. The prototype galectins (galectin-1, -2, -5, -7, -10, -11, -13, and -14) consist of a single CRD with a short N-terminal sequence, but the tandem-repeat type galectins (galectin-4, -6, -8, -9, and -12) are composed of two non-identical CRDs joined by a short linker peptide sequence. The single chimera-type galectin (galectin-3) has one CRD and an extended N-terminal tail containing several repeats of proline-tyrosine-glycine-rich motif.
The structures of several galectin CRDs have been reported, and all exhibit a
-sandwich fold containing two antiparallel
-sheets (36). However, their quaternary structures differ. Galectin-1 and -2 form non-covalently associated homodimers through extended
-sheet interactions (7). The association state of galectin-3 is regulated by its N-terminal domain, and it can exist in monomeric or oligomeric forms (8). Finally, because the tandem-repeat type galectins possess two different CRDs, they may adopt more complex assembly states.
Galectins are found in both the cytoplasm and extracellular regions where they regulate inflammation, cell adhesion, cell proliferation, and cell death (9). Galectins lack a traditional signal sequence, and several are secreted by an unorthodox mechanism to exert their extracellular functions (10). There are a variety of potential glycoconjugate targets for galectins in mammalian cells, but the molecular mechanisms of carbohydrate recognition remain unclear.
Galectin-9, a tandem-repeat type galectin, is a 40-kDa protein consisting of 353 amino acids. The sequence identity between the N- and C-terminal CRDs is 35%. The C-terminal CRD (CCRD) is highly homologous to rat galectin-5 CRD with an amino acid sequence identity of 70%, but the N-terminal CRD (NCRD) is only moderately homologous with the known galectins. Among these, the galectin-9 NCRD shows the highest sequence identity (40%) with the galectin-3 CRD.
Galectin-9 was first cloned from tumor cells from Hodgkin disease, a condition characterized by blood and tissue eosinophilia (11). Moreover, the recombinant galectin-9 causes thymocyte apoptosis in mouse cells, suggesting a possible role for galectin-9 in negative selection during T-cell development (12, 13). Interestingly, galectin-9 was shown to be related to a novel eosinophil chemoattractant produced by T lymphocytes, previously designated "ecalectin" (14). Mutation studies showed that both the NCRD and CCRD of galectin-9 were required for the eosinophil chemoattraction activity (15). Additionally, galectin-9 interacts with Tim-3, which is specifically expressed on the surface of T helper type 1 (TH1) cell, through recognition of Tim-3 carbohydrates, and the Tim3-galectin9 pathway induces cell death in TH1 cells. This suggests that galectin-9 plays a role in down-regulating the effector TH1 responses (16).
Galectin-9 interacts with carbohydrate(s) covalently attached to the surface of Tim-3, but the molecular and structural basis for this recognition is unknown. In vitro analyses showed that galectin-9 has a high affinity for a variety of oligosaccharides containing
-galactosides (17), and the NCRD and CCRD of galectin-9 have different oligosaccharide-binding affinities. The biological activities of galectin-9 may be related to the ligand binding specificity of each CRD and the multivalent binding conferred by two CRDs. To date, the structures of many CRDs from fungi to human have been solved, but there is no structural information about the structure of tandem-repeat type galectin CRDs. Such information should greatly clarify the mechanism of carbohydrate recognition by the CRDs and the multivalent properties that lead to multiple functions for a single protein.
Compared with the galectin-9 CCRD, the NCRD shows striking affinities for complex glycoconjugates such as Forssman pentasaccharide and polymerized N-acetyllactosamine (17, 32). The specific interactions of the galectin-9 NCRD with the carbohydrates is thought to be the clue for understanding the physiological mechanism of galectin-9. We report here the crystal structures of the mouse galectin-9 N-terminal CRD in the absence and the presence of carbohydrate ligands. These structures show both the basic mechanism of carbohydrate binding and suggest a potential mechanism for the specificity of carbohydrate recognition and binding. Additionally, the galectin-9 NCRD dimerizes in both crystal and solution. From these observations, we discuss the relationship between the structure and function of galectin-9.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
14Glc) and Thomsen-Friedenreich antigen (T-antigen, Gal
13GalNAc) were from Sigma-Aldrich and Merck Ltd., respectively. N-Acetyllactosamine (LacNAc, Gal
14GlcNAc) was synthesized by a previously described method (18). N-Acetyllactosamine dimer (LN2) was also synthesized by the procedure described below. All crystallization reagents were purchased from Hampton Research (Aliso Viejo, CA) and deCODE genetics. Other chemicals were obtained from Wako Pure Chemical Industries Ltd. (Japan) and Sigma-Aldrich.
Synthesis of LN2 CarbohydrateUDP-GlcNAc and UDP-Gal were kind gifts from Yamasa Corp. (Japan). Fetal bovine serum was purchased from Dainippon Pharma Co., Ltd. (Japan). LacNAc (460 mg) UDP-GlcNAc·2Na (264 mg) and UDP-Gal·2Na (244 mg) were dissolved in 20 ml of 150 mM sodium cacodylate buffer (pH 6.8) containing MnCl2 (64 mg), ATP·2Na (20 mg), and 0.02% NaN3 (w/v) followed by addition of 3.2 g of crude enzyme preparation obtained by 80% saturated ammonium sulfate precipitation from fetal bovine serum. The mixture was incubated for 8 days at 310 K, and the reaction was terminated by boiling for 5 min. The resulting precipitate was removed by centrifugation (10,000 x g, 15 min), and the supernatant was loaded onto a charcoal-Celite column (
2.5 x 25 cm) equilibrated with H2O at a flow rate of 2.5 ml/min. The column was washed with 150 ml of H2O and eluted with a linear 050% ethanol gradient (total 2,000 ml). Absorbance was monitored at 210 nm, and peak fractions were collected, concentrated, and applied onto a Toyopearl HW-40S column (
2.5 x 65 cm) equilibrated with H2O at a flow rate of 0.5 ml/min. The column was eluted with H2O, and the fraction containing the product was concentrated and lyophilized to give LN2 carbohydrate (40.1 mg).
Protein Purification and CrystallizationThe N-terminal CRD of mouse galectin (15) was expressed as a glutathione S-transferase fusion protein in Escherichia coli strain BL21(DE3) cells using plasmid pGEX4T-1 (Amersham Biosciences). The cells were disrupted by sonication at 277 K. The supernatant was applied to a glutathione S-transferase affinity column of glutathione-Sepharose 4B and washed with 50 mM Tris-HCl buffer (pH 8.0) containing 500 mM NaCl and 1 mM dithiothreitol. The fusion protein bound to the resin was eluted with 10 mM glutathione-containing buffer, and glutathione S-transferase was removed from the fusion protein by cleaving with human
-thrombin (Amersham Biosciences) at 10 units/ml for 12 h at 293 K. The cleaved proteins were collected for further purification by benzamidine-Sepharose and Superdex 75. The purified protein was a single band on SDS-PAGE stained with Coomassie Brilliant Blue.
Crystals were grown using the hanging drop vapor diffusion method from drops containing equal volumes of protein (6 mg/ml) in 10 mM Tris (pH 8.0), 100 mM NaCl, 1 mM dithiothreitol, and precipitant composed of 0.1 M CHES (pH 9.5), 15% ethanol at 289 K. The crystals (apo form1) attained dimensions of 0.1 x 0.1 x 0.02 mm3 within 12 weeks and diffracted to
2.5-Å resolution. They grew in space group P41212 with unit cell dimensions of a = b = 58.1, and c = 221.7 Å. There are two molecules in the asymmetric unit. Under almost identical conditions (0.1 M Tris-HCl (pH 7.5), 15% ethanol), another crystal (apo form2) was obtained in another space group P21212 with unit cell dimensions of a = 56.4, b = 58.6, c = 48.4 Å, and only one molecule in the asymmetric unit. The complex crystals with carbohydrate were prepared by two methods. Lactose complex crystals were obtained by co-crystallization in 0.1 M sodium citrate (pH 5.0), 5% polyethylene glycol 6000, 10 mM lactose, and T-antigen complex crystals were obtained by soaking the apo form1 crystal in 10 mM T-antigen solution for 10 min at 293 K. In the case of LacNAc and LN2, the complexes were obtained by soaking the galactose complex crystals grown in 0.1 M sodium citrate (pH 5.0), 5% polyethylene glycol 6000, 10 mM galactose in 10 mM of each solution. In the galactose complex, the electron density of sugar moiety was poor, but the addition of longer carbohydrates greatly improved the quality of the crystal. Primitive and C-centered orthorhombic galactose complex crystals appeared under the same conditions.
Data Collection, Structure Determination, and RefinementSynchrotron data were collected at beamlines BL-5A, BL-6A, and AR-NW12A at the Photon Factory (Tsukuba, Japan) and BL41XU at SPring-8 (Harima, Japan). Data reduction was carried out with the HKL2000 suite of programs (19). The phases of the apo form1 were determined by the molecular replacement method using the galectin-3 CRD (PDB code: 1A3K [PDB] ) as a search model. The program MOLREP in the CCP4 program suite (20) was used to calculate the rotation and translation functions. The crystallographic refinement was performed by CNS (21) and REFMAC5 (20). The apo form2 and the other carbohydrate complexes were solved by molecular replacement using the apo form1 structure as a search model. All carbohydrate chains are clearly identified in the initial Fo Fc map. The qualities of the models were shown to be reasonable using PROCHECK program (22). Final statistics of the crystallographic refinement are summarized in Table 1. The figures were drawn with the programs MOLSCRIPT (23), Raster3D (24), GRASP (25), and PyMOL.3 Coordinates for the galectin-9 N-terminal CRD are being deposited in the Protein Data Bank of the Research Collaboratory for Structural Bioinformatics. The Protein Data Bank accession numbers for the apo form1, apo form2, lactose complex, LacNAc complex, T-antigen complex and LN2 complex structures are 2D6K, 2D6L, 2D6M, 2D6N, 2D6P, and 2D6O, respectively.
|
Surface Plasmon Resonance MeasurementSPR binding assay was performed at 298 K using BIACORE 2000 (Biacore). HBS buffer (10 mM HEPES (pH 7.2), 150 mM NaCl, 0.005% (v/v) Surfactant P20) was used as a running buffer at a flow rate of 5 µl/min. The galectin-9 NCRD was directly immobilized on CM4 sensor chips (Biacore) by amine coupling. The binding response was measured at 150 µM concentrations of galectin-9 NCRD as analyte. The net response was calculated by subtracting the background response from the binding response. Steady-state responses (Req) were determined from the net response of sensorgrams using BIAevaluation 3.2 program (Biacore). The Req values were plotted against galectin-9 NCRD concentrations and were fitted to a simple 1:1 steadystate binding model using the BIAevaluation 3.2 software, Req = CRmax/(Kd + C), where C is the analyte galectin-9 NCRD concentration, Rmax is the maximum binding response, and Kd is the equilibrium dissociation constant.
| RESULTS |
|---|
|
|
|---|
-sheets, which together form a
-sandwich arrangement (Fig. 1A). Thus, based on this structure, the galectin-9 NCRD is not buried in membranes. The overall structure of the galectin-9 NCRD is very similar to that of galectin-3 with an r.m.s.d. of C
atoms of 0.9 Å. The carbohydrate binding site is formed by the S4, S5, and S6
strands, and the carbohydrate recognition mechanism is similar to those of other galectins (see below).
|
-sheets. They form a continuous 12-stranded antiparallel
-sheet through interactions between the
strands of chain-A S6 and chain-B S6 (Fig. 1B). On the dimer interface, the main-chain oxygen and nitrogen atoms of Arg-86 form hydrogen bonds with the corresponding main-chain atoms of Arg-86 of the other monomer (Fig. 1C). Previously, a modified galectin-9 construct lacking the linker region was generated (29), and substitution of Arg-86 with alanine increased the solubility of this protein (data not shown). Thus, a local conformational change caused by this mutation may affect the intermolecular interaction of two galectin-9 monomers, suggesting that dimer formation occurs at the S6 strands in solution. The main-chain oxygen atoms of Glu-84 of both molecules also form hydrogen bonds with the main-chain nitrogen atoms of Met-88 of the other molecules. The N and C termini of each monomer are positioned at the opposite side of the dimer interface (Fig. 1A). This suggests that the CCRD does not prevent dimer formation of the NCRD. The contact area of the dimer is 615 Å2, and this value is slightly smaller than that of galectin-1 (670 Å2, PDB code: 1GZW). Although galectin-1 and -2 form homodimers, the interfaces are formed by extended
-sheet interactions across the two monomers at both sides (F1 and S1) (3, 4). The architecture is quite different from that of the galectin-9 NCRD. In our structures, this contact is conserved in all six crystals, which contain different space groups with only slight differences in the monomer orientations.
|
atoms of 0.4 Å. The carbohydrate binding site of apo form1 is occupied by four well ordered water molecules, which correspond to O4 (Wat-1), cyclic O5 (Wat-2), and O6 (Wat-3) of the
-galactoside and O6 of the reducing sugar (Wat-4) in the complexes with lactose and other carbohydrates (Fig. 2A). In the other apo form structure (apo form2), the positions of the water molecules are slightly shifted, and the hydrogen bond network is also slightly altered (Fig. 2B). The position of wat4 is common in both structures, but wat1 and wat2 move by 1.0 Å without losing the hydrogen bond network. Wat-3 moves by 0.4 Å and makes a new hydrogen bond with Asn-74. Because Wat-2 does not directly interact with the protein in both structures, this positional flexibility may affect the hydrogen network. These waters, which are held in the carbohydrate binding cleft, help to stabilize the spatial arrangement of the amino acid side chains involved in carbohydrate recognition in the absence of the ligand. Because the water molecules in the ligand binding cleft are reported to mimic the carbohydrate binding mode in galectin-1, chick galectin CG16 and fungal galectin CGL2, galectins may generally use such a water stabilization mechanism (3032).
Carbohydrate Recognition MechanismGalectin-9 recognizes many
-galactoside-containing carbohydrates (17, 33, 34). To elucidate the recognition mechanism at an atomic level, we determined four complex structures with different saccharides. The positions of the
-galactoside moiety at the non-reducing end are virtually the same in all the carbohydrate complexes examined (Fig. 3A). The
-galactoside moiety is most deeply buried in the binding site formed by
strands S4S6. O4 of the galactose plays a central role in forming hydrogen bonds, accepting protons from two highly conserved residues His-60 and Asn-62. O6 of the galactose also interacts with Asn-74 via another hydrogen bond. Two residues with planar side chains, His-60 and Trp-81, provide contacts that help align the carbohydrate. Trp-81 participates in a stacking interaction with the galactose ring similar to that seen in a number of other galactose and lactose binding lectins (35).
In the lactose complex, O6 of the glucose moiety is recognized by Arg-64, Glu-84, and Arg-86 through hydrogen bonds (Fig. 3B). Arg-64 also makes a hydrogen bond with O4 in the galactose moiety. Replacement of Arg-64 with Ala in mouse galectin-9 impairs its capacity to bind to Tim-3 (16). Moreover, ecalectin, which was previously isolated as an eosinophil chemoattractant from a human T-cell-derived expression library, is a variant of human galectin-9 with amino acid sequence identity of 66%, and substitution of Arg-65 by Asp, which corresponds to Arg-64 in mouse galectin-9, disrupts its ability to bind lactose (15). The importance of Arg-64 for galectin-9 activity shown by these experiments is explained well by our crystal structures.
Next, we determined the complex structures of galectin-9 NCRD with LacNAc and T-antigen. In the LacNAc complex, the N-acetylglucosamine (GlcNAc) moiety is exposed to solvent and is recognized by Arg-64 and Glu-84, which make hydrogen bonds with O3 of the GlcNAc (Fig. 3C, white). The methyl group makes a van der Waals contact with the guanidino head group of Arg-86. In contrast, T-antigen adopts a different conformation compared with LacNAc at the reducing end. The O4 in N-acetylgalactosamine (GalNAc) makes hydrogen bonds to Arg-64 and Glu-84 (Fig. 3C, yellow). The methyl group of GalNAc is situated away from Arg-86 and located near Arg-43. As the distance between O7 of GalNAc and NH1 of Arg-43 is too long to interact directly with each other, a water-mediated interaction may exist between them. We could not unambiguously assign water molecules around the carbohydrate binding site because of the relatively low resolution of the structure. Comparing the LacNAc and T-antigen complex structures, the position of O3 of GlcNAc is almost identical to that of O4 of GalNAc both of which are recognized by Arg-64 and Glu-84, but the orientation of the acetyl group differs (Fig. 3C). In the case of LacNAc complex, the GlcNAc moiety points to Arg-86, and this may explain why the galectin-9 NCRD can bind both types of sugar chains,
13 and
14.
|
atom of Gly-68 in the neighboring molecule makes van der Waals contact with C1 in GlcNAc at the reducing end of LN2. This additional interaction of the forth sugar residue with the neighboring galectin-9 NCRD molecule likely increases the affinity to glycans with repeating LacNAc units. Protein-Protein InteractionWe showed that the galectin-9 NCRD forms a dimer in all the six crystal structures obtained (two apo-forms and four carbohydrate complexes). To determine whether the NCRD-NCRD interaction occurs in solution, we performed an SPR analysis using BIACORE. The resonance unit of galectin-9 NCRD immobilized on the sensor chip increased with increasing concentrations of analyte galectin-9 NCRD. The plot of analyte concentration versus steady-state resonance unit (Req) is well fitted with a 1:1 steady-state binding model with a dissociation constant (Kd) of 20 µM (Fig. 5). Thus, the galectin-9 NCRD exhibits homophilic interactions in solution, and suggests that the interacting molecules may form a dimer in solution as in the obtained crystals.
The carbohydrate binding site of galectin-9 NCRD is located close to the dimer interface. This striking feature of galectin-9 NCRD is in stark contrast to that seen for galectin-1 and -2. The dimer surfaces of human galectin-1 and toad ovary galectin have a long negatively charged cleft in the cavity containing the carbohydrate binding pocket (30, 36). In contrast, the galectin-9 NCRD has a large positively charged patch on the dimer surface (Fig. 1D). The differences in the electrostatic potential of their surfaces may reflect differences in their physiological targets.
| DISCUSSION |
|---|
|
|
|---|
|
|
-sandwich fold as previously reported for other galectin structures, however, the dimer architecture of the crystals differs substantially from the prototype galectins, such as galectin-1 and CG-16 (30, 31). Galectin-1 forms a head-to-head dimer in which the N-terminal S1 strand is used for the dimer formation, whereas the galectin-9 NCRD forms a tail-to-tail dimer in which the S6 strands located at the opposite side of the N-terminal of the molecule interact with other for dimerization. Additionally, there is a significant local conformational difference between the galectin-1 and galectin-9 NCRD structures. The N-terminal region specific for the galectin-9 NCRD shields the F1 strand, which is responsible for dimer formation in galectin-1 (supplemental Fig. S1A). Within the S1 strand, Ser-7, which forms a hydrogen bond at the dimer interface of galectin-1, is replaced by Pro-20, and the proline ring sticks out to F1 strand (supplemental Fig. S1B). This change prevents the galectin-9 NCRD from forming head-to-head dimers. In contrast, the S6 strand in galectin-1 is kinked at Glu-74, and as a result, the position of Ala-75, which corresponds to Met-88 in galectin-9, moves and prevents tail-to-tail dimerization in galectin-1 (supplemental Fig. S1C). The overall structure of the galectin-9 NCRD is also quite similar to those of the galectin-3 and galectin-7 CRDs, which exist as monomers in solution, except for the N-terminal tail and S6 strand. The positions of oxygen atoms in the carbonyl backbone of the galectin-3 and -7 S6 strands differ slightly from that of the galectin-9 NCRD (supplemental Fig. S2), and this explains the different dimer arrangements of these proteins. The amino acid sequences of the S6 strands of the galectin-9 NCRD and CCRD are not identical. This suggests that the galectin-9 CCRD may not form tail-to-tail dimers like the NCRD. Interestingly, galectin-5, which is a prototype galectin with 78% sequence identity to the galectin-9 CCRD, weakly agglutinates rat erythrocytes, suggesting oligomerization (37). The galectin-9 CCRD may form oligomers, but the protein-protein interaction mode is likely different from that of the NCRD.
Carbohydrate recognition is the first and most critical step in galectin function. The amino acid residues involved in
-galactoside recognition are well conserved among proto-, chimera-, and tandem-repeat type galectins, whereas the residues involved in disaccharide recognition differ. Arg-86 of the galectin-9 NCRD recognizes the glucose moiety in lactose and the GlcNAc moiety in LacNAc via hydrogen bonds. The galectin-8 NCRD has weaker affinities for lactose and LacNAc than the galectin-9 NCRD as assessed by SPR (38). Arg-86 in the galectin-9 NCRD is replaced by isoleucine in galectin-8, and this is presumably responsible for the weaker interactions. In contrast, the affinity for T-antigen may be similar among the tandem-repeat type galectins, because Arg-43, which interacts with the carbohydrate of the reducing end of the galectin-9 NCRD, is conserved in all tandem-repeat-type galectins. The 9-amino acid deletion from the C terminus of the human galectin-9 NCRD (Val-140 to Gln-148) does not allow lactose binding (29). This region (F1 strand) is a part of the
-sheet and interacts with both the N-terminal tail and F2 strand. Because the F1 and F2 strands are spatially distant from the carbohydrate binding cleft (S4S6), this deletion may disrupt the conformation of the
-sandwich arrangement of the galectin-9 NCRD and cause its lack of carbohydrate binding activity.
|
13GalNAc structure is commonly found in these gangliosides, our crystal structure of the galectin-9 NCRD with the T-antigen complex explains the mechanism of this interaction with high affinity. Conversely, the affinity of the human galectin-9 NCRD for GM3 is dramatically weaker than that for GM1. We built a model structure of the mouse galectin-9 NCRD·GM1 complex (data not shown) from the structures of the T-antigen complex and cholera toxin B-pentamer·GM1 complex (PDB code: 1CT1 (39)). This model was very similar to the galectin-1·GM1 complex obtained by NMR analysis (40), where the lipid portion of GM1 was observed at the opposite side of the protein molecule. This orientation would support the interaction of galectins with glycolipids on the cell surface.
The human galectin-9 NCRD also exhibits high affinity for two other types of glycolipids, Forssman pentasaccharide (GalNAc
13GalNAc
13Gal
14Gal
14Glc) and A-hexasaccharide (GalNAc
13[Fuc
12]Gal
13GlcNAc
13Gal
14Glc) (17). Although the binding sites for these oligosaccharides remain unclear, these glycans have a common structure, which is the same as lactose, at the reducing end of the carbohydrate chains. If the lactose moieties of these carbohydrates bind to the galectin-9 NCRD analogous to lactose, the remaining carbohydrate chain would interact with the extended cleft formed by the S1S3 strands. Because the amino acid sequences of the S1S3 strands are not conserved among the tandem-repeat type galectins, ligand specificity may be determined by this region. Moreover, the positively charged surface of the galectin-9 NCRD may facilitate recognition of glycans attached to negatively charged extracellular surfaces under physiological conditions.
In many cases, galectins act as mediators of cell adhesion by binding to glycoconjugates at the cell surface or the extracellular matrix. Tandem-repeat type galectins may act as a bridge between specific carbohydrates, because they have two CRDs within one molecule. In the case of mouse galectin-9, mutation of either Arg-64 in the NCRD or Arg-238 in the CCRD, which corresponds to Arg-64 of the NCRD, decreased binding to Tim-3, whereas the double mutant completely abolished binding (16). Thus, both CRDs appear required for the strong interaction with Tim-3. Additionally, in the case of human galectin-9, both the individual N- and C-terminal CRDs exhibit eosinophil chemoattractant activity, however, this activity was substantially lower than full-length wild-type galectin-9. Likewise, each human galectin-9 CRD exhibits hemagglutination activity, but this activity is also lower than that of the wild-type, full-length protein (15). Conversely, recombinant chimeric proteins consist of two NCRDs or two CCRDs joined by a linker, have virtually the same eosinophil chemoattractant activity as wild-type galectin-9 (33). These results suggest that two CRDs connected by a linker are required for galectin-9 activity, but the precise domain identity and carbohydrate binding specificity are flexible.
The ability of galectins to cross-link ligand molecules is essential for cell adhesion, signal transduction through receptor clustering, and formation of multivalent galectin-glycoprotein networks on the cell surface (4145). Galectins regulate the degree of protein cross-linking thereby affecting a variety of physiological activities. Galectin-14, which was cloned from ovine eosinophil-rich leukocytes, is a prototype galectin with a CRD highly homologous to mouse galectin-9 NCRD (amino acid sequence identity of 55%). However, it exhibits distinct physiological effects from those of human galectin-9 (46). Galectin-14 is released from eosinophils into the lumen of the lungs after challenge with house dust mite allergen. Galectin-14 may dimerize, but it cannot form galectin-9 like cross-linking networks, because it has only one CRD, which might be responsible for the difference in their physiological roles.
Two models of galectin mediated cross-linking have been proposed. Galectin-3 forms heterogeneous and disorganized cross-linking complexes with multivalent carbohydrates (47), whereas galectin-1, and many plant lectins, form homogeneous and organized cross-linked complexes (48). Tandem-repeat type galectins may generate a cross-linking network by a novel mechanism, because they have two CRDs within one molecule. Here, our crystal structures of the galectin-9 NCRD homophilic dimers provide a new possibility for tandem-repeat-type galectins to form even larger carbohydrate binding surface. The combination of the bivalent properties of tandem-repeat-type galectins and the homophilic NCRD-NCRD interactions may allow galectin-9 to achieve its carbohydrate specificity toward complex large glycoprotein and glycolipids. The intermolecular interaction between the galectin-9 NCRDs calculated by SPR analysis is somewhat weak (Kd = 20 µM) and may not be compatible with the formation of such a cross-linking network. However, the local enrichment of galectin-9 in the immediate extracellular environment could be sufficient to generate protein concentrations compatible with dimerization. In addition, the two CCRDs found in galectin-9 could facilitate protein-protein interactions, and our current study cannot assess the contribution of these domains to galectin-9 dimerization.
Galectin-9 controls various biological events, such as cell aggregation, chemoattraction of eosinophils, and induction of apoptosis of thymocytes, immunocytic T cells, and melanoma cells (1316, 26). The multiple functions of galectin-9 may arise from the strict carbohydrate recognition and cross-linking between galectin-9 and target molecules. A recent report revealed that the carbohydrate on Tim-3, a TH1-specific cell surface molecule, contains a region targeted toward mouse galectin-9, and the interaction of these molecules regulates TH1 immunity (16). Structural studies on complexes between galectin-9 and its newly identified ligands will provide further insights into the molecular and cellular functions.
| FOOTNOTES |
|---|
The atomic coordinates and structure factors (code 2D6K, 2D6L, 2D6M, 2D6N, 2D6P, and 2D6O) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/). ![]()
The on-line version of this article (available at http://www.jbc.org) contains supplemental Figs. S1 and S2. ![]()
1 To whom correspondence should be addressed. Tel.: 81-29-879-6177; Fax: 81-29-879-6179; E-mail: ryuichi.kato{at}kek.jp.
2 The abbreviations used are: CRD, carbohydrate recognition domain; NCRD, N-terminal CRD; CCRD, C-terminal CRD; LacNAc, N-acetyllactosamine; T-antigen, Thomsen-Friedenreich antigen; LN2, N-acetyllactosamine dimer; Tris, tris-hydroxymethylaminomethane; CHES, N-cyclohexyl-2-aminoethanesulfonic acid; SPR, surface plasmon resonance; GA1, GAl
13Gal-NAc
14Gal
14Glc; GM1, Gal
13GalNAc
14[NeuAc
2-3]Gal
14Glc; GD1a, NeuAc
2-3Gal
13GalNAc
14[NeuAc
23]Gal
14Glc; GD1b, Gal
13GalNAc
14[NeuAc
2-6NeuAc
2-3]Gal
14Glc; Gb4, GalNAc
13Gal
14Gal
14Glc. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. R. Stowell, C. M. Arthur, K. A. Slanina, J. R. Horton, D. F. Smith, and R. D. Cummings Dimeric Galectin-8 Induces Phosphatidylserine Exposure in Leukocytes through Polylactosamine Recognition by the C-terminal Domain J. Biol. Chem., July 18, 2008; 283(29): 20547 - 20559. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Miyanishi, N. Nishi, H. Abe, Y. Kashio, R. Shinonaga, S.-i. Nakakita, W. Sumiyoshi, A. Yamauchi, T. Nakamura, M. Hirashima, et al. Carbohydrate-recognition domains of galectin-9 are involved in intermolecular interaction with galectin-9 itself and other members of the galectin family Glycobiology, April 1, 2007; 17(4): 423 - 432. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||