Identification and Characterization of the Iridoid Synthase Involved in Oleuropein Biosynthesis in Olive (Olea europaea) Fruits*

The secoiridoids are the main class of specialized metabolites present in olive (Olea europaea L.) fruit. In particular, the secoiridoid oleuropein strongly influences olive oil quality because of its bitterness, which is a desirable trait. In addition, oleuropein possesses a wide range of pharmacological properties, including antioxidant, anti-inflammatory, and anti-cancer activities. In accordance, obtaining high oleuropein varieties is a main goal of molecular breeding programs. Here we use a transcriptomic approach to identify candidate genes belonging to the secoiridoid pathway in olive. From these candidates, we have functionally characterized the olive homologue of iridoid synthase (OeISY), an unusual terpene cyclase that couples an NAD (P)H-dependent 1,4-reduction step with a subsequent cyclization, and we provide evidence that OeISY likely generates the monoterpene scaffold of oleuropein in olive fruits. OeISY, the first pathway gene characterized for this type of secoiridoid, is a potential target for breeding programs in a high value secoiridoid-accumulating species.

The secoiridoids are the main class of specialized metabolites present in olive (Olea europaea L.) fruit. In particular, the secoiridoid oleuropein strongly influences olive oil quality because of its bitterness, which is a desirable trait. In addition, oleuropein possesses a wide range of pharmacological properties, including antioxidant, anti-inflammatory, and anti-cancer activities. In accordance, obtaining high oleuropein varieties is a main goal of molecular breeding programs. Here we use a transcriptomic approach to identify candidate genes belonging to the secoiridoid pathway in olive. From these candidates, we have functionally characterized the olive homologue of iridoid synthase (OeISY), an unusual terpene cyclase that couples an NAD (P)H-dependent 1,4-reduction step with a subsequent cyclization, and we provide evidence that OeISY likely generates the monoterpene scaffold of oleuropein in olive fruits. OeISY, the first pathway gene characterized for this type of secoiridoid, is a potential target for breeding programs in a high value secoiridoid-accumulating species.
Olive (Olea europaea L.) produces a range of secondary metabolites that strongly affect the taste and nutritional properties of olive oil and fruits. The most abundant of these secondary metabolites are the secoiridoids, monoterpenoids with a 3,4-dihydropyran skeleton. These compounds are present as oleosidic secoiridoids or oleosides that have an exocyclic olefinic functionality (1) and possess a tyrosine-derived component (see Fig. 1A). Secoiridoids, which are recovered in virgin olive oil in small amounts, strongly influence olive oil taste, being responsible for the bitterness and pungency sensory notes (2), which are desirable traits for high quality olive oil.
Oleuropein is strongly associated with the beneficial properties of olive oil for human health (2,5). In particular, oleuropein exhibits antioxidant, anti-inflammatory, anti-atherogenic, anti-cancer, antimicrobial, and antiviral activities, and it has hypolipidemic and hypoglycemic effects (6). For instance, it plays a role in prevention of atherosclerosis and inhibition of low density lipoprotein peroxidation (7). It also exhibits cancer preventive activities (8) and can contribute to the nutritional prevention of osteoporosis (9). Additionally, this compound has been implicated in plant defense. Indeed, ␤-glucosidases released from herbivore-attacked tissues can convert oleuropein into a strong protein denaturant that has protein-crosslinking and lysine-alkylating activities (10,11). Oleuropein content differs markedly among different genotypes (3). However, high oleuropein varieties are desirable, and this trait is considered a target for olive breeding programs.
Oleuropein is present in all constituent parts of the plant but accumulates at higher levels in the fruits and leaves (3,(12)(13)(14). In olive fruit, oleuropein is present at highest amounts in small unripe fruits (45 days after flowering) and then dramatically decreases during fruit development and ripening (3).
Considering the high value of oleuropein, the identification of the genes and enzymes required for its synthesis is particularly important to facilitate the development of high oleuropein varieties and to develop synthetic pathways in microbial or plant hosts using metabolic engineering approaches. Until now, only a few candidate genes of the secoiridoid pathway have been proposed in olive (3,15). Recently, the iridoid pathway for the secoiridoid secologanin has been completely elucidated in Madagascar periwinkle (Catharanthus roseus), where the pathway feeds directly into the monoterpene indole alkaloid pathway (16 -20) (see Fig. 1B). Because the secoiridoids in the Oleaceae family are derived from secologanin or a secologanin precursor (21)(22)(23)(24), it is likely that the Oleaceae contain homo-logues of these C. roseus biosynthetic genes that participate in secoiridoid biosynthesis.
Iridoid biosynthesis is initiated from geranyl-pyrophosphate, which is then converted to secologanin by a series of reactions that include oxidations, reductions, glycosylations, and methylations. Secoiridoids are derived from iridoids by opening of the cyclopentane ring, and in the Oleaceae family, the resulting carbonyl group is oxidized and conjugated with a phenolic moiety. These Olea-specific reactions have not yet been resolved, although a putative pathway has been proposed (25) (see Fig. 2).
Integrated approaches coupling co-expression analyses of transcriptomic data with functional characterization studies have been used successfully for the investigation of numerous plant specialized metabolic pathways (26 -28). Moreover, clustering of transcript and metabolite profiles is becoming a powerful technique for identifying candidate genes (29 -32). Large expressed sequence tag collections from high secoiridoid-accumulating olive fruits and leaves (33)(34)(35)(36) are a valuable resource for the identification of candidate transcripts in secoiridoid biosynthesis. With the exception of a geraniol synthase involved in the synthesis of geraniol from geranyl-diphosphate (15), up to now no secoiridoid biosynthetic genes have been biochemically characterized in olive.
In this study, three olive homologues of ISY were identified from the olive transcriptomic data. We discovered one gene that showed high similarity to CrISY and confirmed CrISY-like iridoid synthase activity with NADPH consumption assays and product characterization by GC-MS. Moreover, this gene showed a very similar co-expression profile to other putative iridoid biosynthetic genes, suggesting that this enzyme plays a physiological role in oleuropein biosynthesis. The biochemical functions of two other ISY homologues were also investigated, and their roles in secoiridoid production are discussed. Our data shed new light on oleuropein biosynthesis and more broadly on the evolutionary origin of secoiridoids and iridoids in Asterid families.

Experimental Procedures
Plant Material-Samples were harvested from field plants of cv. Leccino from the Olive Cultivar Collection of CNR-Institute of Biosciences and Bio-resources (Perugia, Italy). Fruits (mesocarp and exocarp) at different developmental stages (45,75,105, and 135 days after flowering (DAF)), flowers at anthesis stage, and young leaves were collected in three biological replicates. Immediately after harvest, samples were frozen in liquid nitrogen and stored at Ϫ80°C.
Identification of Candidate Genes-The amino acid sequences from Madagascar periwinkle (C. roseus) iridoid synthase (ISY), 8-hydroxygeraniol oxidoreductase (8HGO), iridoid oxidase (IO), 7-deoxyloganetic acid-O-glucosyl transferase (7-DLGT), and 7-deoxyloganic acid hydroxylase (7-DLH), recently shown to be involved in monoterpene indole alkaloid biosynthesis (16,17,19,20), were used to search olive-expressed sequence tags by basic local alignment (BLAST) in olive transcriptomic data of olive fruit (33) and other tissues (35). The sequences of candidate genes were assembled in silico using preliminary genomic sequences. The closest CrISY homologue, named OeISY, was selected for further analyses together with two other genes that showed high similarity to the Digitalis lanata progesterone-5␤-reductase 1 (DlP5␤R1) gene and were also expressed in fruit tissues. These genes were named Oe1,4-R1 and Oe1,4-R3. Specific primers were designed to resequence the homologues from fruit cDNA of cv. Leccino and to obtain the entire coding regions by rapid amplification of cDNA ends-PCR.
Co-expression Analyses-The analyses were performed by using a 454 transcriptomic data set from fruit (26,500 transcripts) designed for the identification of genes involved in the metabolism of phenolic and secoiridoid compounds (33). To reduce the transcriptomic data set to a workable size, we selected genes exhibiting high expression in fruits (1258 transcripts with total rpkm values over 130, calculated considering all fruit samples). In addition, we selected those genes that were differentially expressed between 45 and 135 DAF (746 transcripts) applying the statistical R test (42) (R Ͼ 10). Hierarchical clustering analysis of this filtered data set using Cluster 3.0 (43) allowed the identification of a set of co-expressed genes.
Heterologous Protein Expression and Purification-The entire coding sequences were amplified from fruit cDNA of cv. Leccino using gene specific primers for OeISY (5Ј-ATGAGCT-GGTGGTTCAACAGATCT-3Ј, 5Ј-TCAAGGAATAAACCT-ATAAGCCCTC-3Ј), Oe1,4-R1 (5Ј-ATGAGTTGGTGGTGG-AAAGGTGC-3Ј, 5Ј-TCAAGGAACAATCTTGTGTGAT-TTCA-3Ј), and Oe1,4-R3 (5Ј-ATGAGTTGGTGGTGGGCCG-GAG-3Ј, 5Ј-TTAAGGGACGATCTTGTAAGCTTTCA-3Ј) and cloned into the Gateway vector pCR8-GWTOPO-T/A (Invitrogen). Subcloning into the vector pDEST17 using Gateway LR cloning yielded the Escherichia coli expression construct in frame with an N-terminal His tag. BL21 star cells (Life Technologies) harboring the desired plasmid were grown at 37°C, induced with 1 mM isopropyl ␤-D-1-thiogalactopyranoside at A 600 nm of 0.8, and then cultured at 22°C for 5 h. The cells were lysed by lysozyme treatment. For the purification, the soluble portion of the lysate was equilibrated with nickel-nitri-lotriacetic acid-agarose resin (Qiagen). The proteins were eluted via a stepwise imidazole gradient (50, 100, 150, and 300 mM), where the enzymes of interest eluted in the 100 mM imidazole fraction. These fractions were then concentrated and buffer-exchanged into a storage buffer of 20 mM MOPS, pH 7.0. Proteins were quantified by Bradford assay using BSA protein as standard. Proteins were separated by SDS-PAGE by using 12% polyacrylamide gel and then transferred onto a PVDF membrane by electroblotting. After blocking with 5% milk in TBS buffer, the membrane was incubated with mouse 1:1000 anti-His polyclonal antibody (Roche) in TBSTT buffer. Antimouse IgG antibody (diluted 1:5000 in TBSTT) conjugated with alkaline phosphatase (Sigma-Aldrich) was used as detection antibody.
Chemical Synthesis-The synthesis of 8-oxoneral was performed as described previously for 8-oxogeranial (16) but starting with nerol instead of geraniol. The final purification step on a kugelrohr was omitted, because the washes in 15% ether in hexanes afforded sufficiently pure material in comparable yields. All other substrates and standards used in this study were either commercially available or had been synthesized as described previously (16).
GC-MS-based Assays-The enzyme assays were carried out essentially as described previously (16), using 20 mM MOPS, pH 7.0, as buffer. Substrates were kept as 50 mM stocks in tetrahydrofuran except progesterone, which was kept as a 50 mM stock in ethanol. The reactions (50 l) were set up in glass vials using 200 M of substrate, 400 M of NADPH, and 1 g of purified protein and terminated after 1 h by adding 120 l of CH 2 Cl 2 . If the substrate conversion was incomplete, longer incubation (3 h and 24 h) with substrate was carried out. Enzymatic assays with progesterone were carried out at 30°C, and the reactions were terminated after 3 h. The organic phase was used directly for GC-MS.
GC-MS analyses were carried out on an Agilent 6890N GC system coupled to an Agilent 5973 MS detector. For the initial assays with 8-oxogeranial as substrate, all nonchiral separations were performed with a Zebron ZB-5 HT column (30 m ϫ 0.25 mm ϫ 0.10 m) using helium as carrier gas at 1 ml min Ϫ1 and with an injector temperature of 220°C. The program used was the following: 5 min isothermal at 60°C, 20°C min Ϫ1 gradient up to 150°C, 45°C min Ϫ1 gradient up to 280°C, and 4 min isothermal at 280°C (run time ϭ 16.39 min). GC-MS assays using progesterone as substrate were performed with a Zebron ZB-5 HT column (35 m ϫ 0.25 mm ϫ 0.10 m) using helium as carrier gas at 1 ml min Ϫ1 and with an injector temperature of 250°C. The program used was the following: 5°C min Ϫ1 gradient up to 280°C, 10 min isothermal at 280°C (run time ϭ 36.00 min). GC-MS assays with all the other substrates [8-oxoneral, (S)-8-oxocitronellal, citral, cinnamaldehyde, ␣-methylcinnamaldehyde] were carried out by using a DB-1 column (15 m ϫ 0.25 mm ϫ 0.10 m) using helium as carrier gas at 1 ml min Ϫ1 and with an injector temperature of 220°C. The program used was the following: 2 min isothermal at 60°C, 12°C min Ϫ1 gradient up to 150°C, 45°C min Ϫ1 gradient up to 280°C, 2 min isothermal at 280°C (run time ϭ 14.39 min). All the analyses were repeated three times with similar results. Where possible, mass spectra were compared with those of authentic standards.
Spectrophotometry-based Assays for Kinetic Studies-For the determination of Michaelis-Menten parameters with 8-oxogeranial, kinetics of NADPH consumption were determined spectrophotometrically at 340 nm in cuvettes with 1-cm path length. The reactions contained 50 M NADPH, 200 mM MOPS buffer, pH 7.0, 100 mM sodium chloride, and 2 nM (OeISY) or 40 nM (Oe1,4-P3) enzyme in a total volume of 800 l. Substrate was added from a 50 mM stock solution in tetrahydrofuran resulting in a final tetrahydrofuran concentration of less than 0.006% (OeISY) or 2% (Oe1,4-P3). The reaction was equilibrated at room temperature (22°C) and started by addition of enzyme. Plots of initial velocities versus substrate concentration were nonlinearly fit to the Michaelis-Menten equation in SigmaPlot 12.5 to obtain values of k cat and K m . Assays with Oe1,4-P3 and progesterone as a substrate were carried out at 40°C in 200 l of assay volume in 96-well plates. Each well contained 60 nM of enzyme. The data were collected for 20 min in 30-s intervals. NADPH consumption rates were determined taking into account background NADPH decay.
cDNA Synthesis and Gene Expression Analysis-Total RNA was extracted from 0.2 g of olive tissue with the RNeasy plant mini kit (Qiagen) and treated with DNase I (Ambion, Austin, TX). Reverse transcription of 2 g of RNA was performed using oligo(dT) 18 and the SuperScript III Reverse Transcriptase kit (Invitrogen) according to the manufacturer's instructions.
Accession Numbers-Sequence data from this article were submitted to the GenBank TM database under accession numbers KT954038 (OeISY),
Secoiridoid Genes are Co-expressed-To find sets of co-expressed genes possibly involved in secoiridoid biosynthesis, we analyzed transcriptomic data (33) from four cDNA libraries obtained from fruits of two olive varieties (Coratina and Tendellone) at two developmental stages (45 and 165 DAF). Coratina has a higher secoiridoid content compared with Tendellone, and both varieties are richer in secoiridoids at 45 DAF than at 165 DAF. To reduce the transcriptomic data set to a workable size, we considered all genes exhibiting high expression in fruits and selected those differentially expressed among developmental stages (R Ͼ 10). Hierarchical clustering analysis of the normalized data revealed that most of the candidate genes of secoiridoid biosynthesis that we had identified grouped within the same cluster (Fig. 3). In particular, OeISY and Oe1,4-R1 grouped together with Oe8HGO, OeIO, and other candidate transcripts of the secoiridoid pathway that we had previously identified (3) (1-deoxy-D-xylulose 5-phosphate reductoisomerase (OeDXR), geraniol 8-hydroxylase (OeG8H), and secologanin synthase-like (OeSLS-like3)), indicating that this set of genes is co-expressed (p ϭ 0.966). In addition, a transcript (OeCYP76A1) putatively encoding for an uncharacterized CYP76A1 also grouped with these genes. This CYP450 enzyme might carry out an oxidation reaction later in the oleuropein pathway. Other candidate genes for the biosynthesis of the terpenic moiety (1-deoxy-D-xylulose 5-phosphate synthase (OeDXS), geraniol synthase (OeGES), Oe7-DLGT2, loganic acid methyltransferase-like (OeLAMT), and secologanin synthaseslike (OeSLS-like2 and OeSLS-like4)) and the phenolic moiety (tyrosine decarboxylase (OeTYRD)) of the secoiridoids group together (p ϭ 0.832) and with a lower score (p ϭ 0.659) also with OeISY and Oe1,4-R1. In contrast, other potential candidates such as Oe7-DLGT1, Oe7-DLH-like, OeSLS-like1 and the other 1,4-reductases that we identified (1,4-R2 and 1,4-R3) had a different expression pattern and do not group with the other secoiridoid candidates.
These data indicate that most secoiridoid pathway candidates are co-expressed and have an expression profile consistent with the oleuropein content in the fruit that is higher at 45 DAF compared with 165 DAF (3). These results support the involvement of OeISY, Oe1,4-R1, Oe8HGO, OeIO, and Oe7-DLGT2 in the biosynthesis of olive secoiridoids.
OeISY, Oe1,4-R1, and Oe1,4-R3 Encode for 1,4-Reductases with Different Substrate Specificities-Considering the key role of ISY in the formation of the iridoid scaffold, we selected OeISY and Oe1,4-R1 (both potential iridoid synthases expressed in fruit) for heterologous expression and biochemical characterization. In addition, we selected Oe1,4-R3, which does not cluster with these iridoid biosynthetic genes, as a negative control that may potentially be involved in a different pathway. Oe1,4-R2 was not included in this analysis because it was expressed at low levels in fruits, and as with Oe1,4-R3, it did not cluster with the other candidate genes.
GC-MS analyses with the other substrates indicated that OeISY accepts other compounds closely related to 8-oxogeranial as a substrate (Table 2). OeISY was able to cyclize 8-oxoneral and reduce citral, whereas it was unable to metabolize progesterone, cinnamaldehyde, ␣-methylcinnamaldehyde, and (S)-8-oxocitronellal. The inability to reduce (S)-8-oxocitronellal indicates that OeISY specifically reduces the double bond at position C2. These data suggest that 8-oxogeranial and its isomer 8-oxoneral are the preferred substrates of OeISY.
According to GC-MS assays, Oe1,4-R1 was unable to metabolize 8-oxogeranial (Table 2). However, Oe1,4-R1 reduced citral (Fig. 5A) and inefficiently cyclized 8-oxoneral (Fig. 5B). The activity with 8-oxoneral was observed only after long incubation (24 h) with the substrate, and most of the substrate was not converted. These results indicate that Oe1,4-R1 is not involved in the iridoid synthase step of secoiridoid biosynthesis. The low efficiency of conversion observed with all the substrates tested probably indicates that we have not identified the native substrate for this enzyme. However, the strong co-expression of Oe1,4-R1 with OeISY and with the other candidate genes of secoiridoid biosynthesis suggests that it might be involved in another step of this pathway.
GC-MS analyses of Oe1,4-R3 biochemical assays revealed a wide activity range for this enzyme, because it was able to reduce all the tested substrates except cinnamaldehyde and ␣-methylcinnamaldehyde (Table 2). This enzyme reduced progesterone (Fig. 6A) and cyclized 8-oxogeranial (Fig. 6B) and 8-oxoneral. The ability to reduce (S)-8-oxocitronellal indicated that this enzyme, like Oe1,4-R1, does not specifically reduce the double bond at position C2. The steady-state kinetic constants of the reaction were determined through spectrophotometric NADPH consumption assays for progesterone (k cat ϭ 3.2 Ϯ 0.3 s Ϫ1 , K m ϭ 430 Ϯ 60 M, k cat /K m ϭ 0.0074 M Ϫ1 s Ϫ1 ; all data are means Ϯ error of fit) and 8-oxogeranial (k cat ϭ 0.90 Ϯ 0.08 s Ϫ1 , K m ϭ 2900 Ϯ 400 M, k cat /K m ϭ 0.0003 M Ϫ1 s Ϫ1 ; all data are means Ϯ error of fit). These results indicate low catalytic efficiencies for both substrates compared with OeISY with 8-oxogeranial but higher efficiency for progesterone compared with 8-oxogeranial (Fig. 6C). As hypothesized based on the expression analyses, Oe1,4-R3 appears not to be involved in the secoiridoid pathway.
The Expression Patterns of OeISY and Oe1,4-R1 Are Consistent with a Role in Secoiridoid Biosynthesis-Even though secoiridoids accumulate in all the organs of the olive plant, the highest content is found in leaf and fruit tissues. We conducted an expression analysis of different tissues using real time, quantitative PCR and found that OeISY and Oe1,4-R1 mRNA levels were several hundred-fold higher in fruits and leaves compared with roots and flowers (Fig. 7A). In contrast, Oe1,4-R3 was more highly expressed in the roots, where its mRNA levels were about 4-fold higher than in leaves, the tissue characterized by the lowest relative expression (Fig. 7A).
Previous analysis of the secoiridoid content of olive fruit during development showed that, after a peak at 45 DAF (about 120 mg/g dry weight for cultivar Leccino), the levels decrease until 165 DAF, where they reach the lowest content (about 48 mg/g dry weight for the same cultivar) (3). We included four fruit developmental stages in our expression analysis by quantitative PCR. In accordance with the reported trends in secoiridoid accumulation, we found that OeISY and Oe1,4-R1 mRNA levels were highest at 45 DAF and then decreased in the subsequent stages (Fig. 7B). In particular, at 45 DAF OeISY reached mRNA levels up to 8000-fold higher compared with 75 DAF, whereas at 105 and 135 DAF, it was not detected. Oe1,4-R1 relative mRNA levels were about 10,000-fold higher at 45 DAF compared with 135 DAF, which was the stage characterized by the lowest expression. In contrast, Oe1,4-R3 mRNA levels did not change significantly during fruit development (Fig. 7B). These results indicate that OeISY and Oe1,4-R1 expression patterns correlate well with the secoiridoid content of olive tissues. In contrast, the expression profile of Oe-1,4-R3 is unrelated to the secoiridoid content, suggesting that this gene may be involved in a different metabolic pathway.
Conserved Domains and Phylogenetic Analyses of Olive 1,4-Reductases-We classified OeISY, Oe1,4-R1, and Oe1,4-R3 as 1,4-reductases according to their ability to perform 1,4-reductions and showed that the sequences of the predicted proteins, like CrISY, are highly similar to P5␤Rs. Amino acid sequence comparisons revealed that, like P5␤Rs, they belong to a class of short chain dehydrogenases/reductases (SDRs) structurally characterized by Thorn et al. (47). This class of SDRs is defined by eight conserved motifs and two conserved active site residues (Tyr-179 and Lys-147) (Fig. 8). Residue numbering refers to the crystal structure of DlP5␤R (Protein Data Bank code 2V6G) (47).
Some amino acid substitutions were observed in the conserved motifs of the olive 1,4-reductases, possibly controlling the enzymatic activity and the substrate specificity. OeISY, similarly to CrISY, showed substitution of Tyr-180 for histidine in motif 5, a motif that harbors the Tyr-179 residue that is important for catalytic activity. Oe1,4-R1 showed a substitution of Val-200 for serine in the motif 6 involved in NADPH binding. Moreover, OeISY and Oe1,4-R1 differ in three hot spots that, according to Bauer et   MARCH 11, 2016 • VOLUME 291 • NUMBER 11
We carried out a phylogenetic analysis using the amino acid sequences of olive 1,4-reductases and their homologues from other plants (Fig. 9). Along with a number of predicted sequences, this analysis included CrISY, the progesterone reductases P5␤R1 and P5␤R2 of D. lanata and D. purpurea (38,39), and, in addition, some P5␤R homologues from C. roseus (CrP5␤R1-6) and M. truncatula (MtP5␤R1-4) whose exact role in plant metabolism remains unclear (49). The analysis highlights the fact that proteins sharing homology with P5␤R are very common in the plant kingdom, being present in different families of both monocots and dicots. Two main clades (A and B) seem to have originated from a common ancestor. All the proteins included in clade A are of unknown function and include only dicot sequences. Clade B gives rise to three different subclades: c and d, both specific to dicots, and e, which contains only monocot sequences. Oe1,4-R1, Oe1,4-R2, and Oe1,4-R3 group in the subclade c, the largest subclade, which includes representatives that are widely dispersed across numerous dicot families, whereas OeISY groups together with CrISY and with DlP5␤R2 and DpP5␤R2 in subclade d, which contains only proteins belonging to the lamiids. These data suggest that iridoid synthases might have originated from an   ancestor exclusively common to Asterids, the subclass that contains the iridoid and secoiridoid synthesizing species (50), shedding new light on their possible differentiation from other SDRs.

Discussion
Secoiridoids are responsible for many of the health promoting effects of olive oil and positively affect its organoleptic properties (5,6,51). Hence, high secoiridoid content in olive fruits is a desirable trait in olive cultivation. Based on sequence and transcriptomic data mining, we have identified five genes (OeISY, Oe1,4-R1, Oe8HGO, OeIO, and Oe7-DLGT2) that are potentially involved in the biosynthesis of these economically important natural products. The genes of the secoiridoid pathway can potentially be used as targets for the identification of molecular markers in olive breeding programs aiming to increase oleuropein production. Moreover, the identification of the biosynthetic genes have also applications for metabolic engineering in microbial or plant hosts. They can be used to develop synthetic pathways for a large scale production of such valuable compounds.
The co-expression observed for the genes of the putative olive secoiridoid pathway has also been observed for genes involved in iridoid biosynthesis in C. roseus (19,56). Conservation of iridoid biosynthesis genes in C. roseus and olive suggests common biosynthetic intermediates until the formation of 7-deoxyloganic acid (Fig. 1B). However, the downstream biosynthetic steps after this point still need to be clarified in the olive pathway. It has been hypothesized (22-24) that oleoside-11-methyl ester is an intermediate of the pathway, and it is presumably synthesized from 7-deoxyloganic acid via 7-ketologanic acid and 7-ketologanin (Fig. 2), but more experimental evidence for this prediction is required. These enzymatic reactions, which include oxidoreduction, methylation, and cleavage of the cyclopentane ring, are similar to those identified in the secologanin biosynthesis of monoterpene indole alkaloids (19). Therefore, we hypothesize that, in olive, genes similar to CrDLH, CrLAMT, and CrSLS might work on these compounds.
Among the candidate genes involved in secoiridoid biosynthesis, we focused our attention on the functional characterization of iridoid synthase, an enzyme that is key for the formation of the iridoid scaffold and belongs to a new and unexplored class of enzymes. The high specificity of OeISY for 8-oxogeranial, its co-expression with other candidate genes of the pathway (3), and the correlation of its expression levels with secoiridoid content all strongly suggest that OeISY synthesizes the iridoid scaffold in olive. Although iridoids are very common in flowering plants, only in very few cases has iridoid synthase activity been demonstrated in vitro (16,49). Similar to CrISY (16,40), OeISY gave a mixture of nepetalactol and iridodials as enzymatic products. It remains unclear how this mixture of products is channeled into secoiridoid biosynthesis. However, it has been shown in C. roseus that the enzyme that acts directly downstream of ISY in the pathway, iridoid oxidase, can convert both nepetalactol and iridodials in this mixture into 7-deoxyloganetic acid, although nepetalactol was consumed faster (19).
Oe1,4-R1 was co-expressed with OeISY and other candidates of secoiridoid pathway, but it did not show iridoid synthase activity in vitro. The strong co-expression profile with the iridoid synthase gene suggests a role in this pathway, but the function remains to be identified. Based on the current knowledge on oleuropein biosynthesis, we are unable to suggest a role for Oe1,4-R1. Many reactions in oleuropein biosynthesis are still uncertain or hypothetical, particularly the downstream reactions, and Oe1,4-R1 may have a role in an unpredicted step. Moreover, we cannot exclude the existence of alternative routes to oleuropein biosynthesis.
All ISY proteins show high sequence similarity to P5␤Rs, which convert progesterone to 5␤-pregnane-3,20-dione in cardenolide-producing plants (i.e. D. lanata, D. purpurea, and Erysimum crepidifolium) but, with unknown functions, also occur in numerous cardenolide-free plants such as Arabidopsis thaliana (48), C. roseus, and M. truncatula (49). A large number of P5␤R homologues from many species, including cardenolide-and iridoid-free plants, indicate a large and heterogeneous class of enzymes that is not fully explored in terms of biological function and biocatalytic potential.
The high similarity of OeISY and CrISY in terms of sequence and biological function may indicate that they are orthologues derived from a common ancestor belonging to the P5␤R family (Fig. 9). This hypothesis was substantiated by our phylogenetic analysis, which indicates that proteins similar to Oe1,4-R1 and Oe1,4-R3 are almost ubiquitous in dicot families, whereas ISYs group in a small subclade which includes only proteins belonging to the Asterid clade. Most likely, a common ISY ancestor in this subclass can account for the chemical diversity of iridoids and secoiridoids (50) synthesized by various species in this clade. As more ISY sequences from this subclass become available, it may be possible to elucidate how ISYs have evolutionarily diverged from other SDRs.
The enzymatic cyclization of 8-oxogeranial performed by iridoid synthases is a mechanistically remarkable reaction per- FIGURE 9. Phylogenetic tree of olive 1,4-reductases. Predicted amino acid sequences from monocots and dicots were aligned with the biochemically characterized enzymes from olive. The phylogenetic tree was drawn by maximum likelihood method with 1000 bootstrap replicates (percentage values shown at branch points), using P. patens (GenBank TM accession number EDQ81106.1) as outgroup.
formed by a large number of plants. By adding new members, like OeISY, to this recently discovered class of catalysts, we can better understand how the iridoid biosynthetic pathway evolved and which molecular features are essential for the iridoid synthase reaction. Knowledge of the enzymes performing secoiridoid synthesis in olives can lay the basis for enhancing the health promoting features of this important food crop.