Sandalwood Fragrance Biosynthesis Involves Sesquiterpene Synthases of Both the Terpene Synthase (TPS)-a and TPS-b Subfamilies, including Santalene Synthases*

Sandalwood oil is one of the worlds most highly prized fragrances. To identify the genes and encoded enzymes responsible for santalene biosynthesis, we cloned and characterized three orthologous terpene synthase (TPS) genes SaSSy, SauSSy, and SspiSSy from three divergent sandalwood species; Santalum album, S. austrocaledonicum, and S. spicatum, respectively. The encoded enzymes catalyze the formation of α-, β-, epi-β-santalene, and α-exo-bergamotene from (E,E)-farnesyl diphosphate (E,E-FPP). Recombinant SaSSy was additionally tested with (Z,Z)-farnesyl diphosphate (Z,Z-FPP) and remarkably, found to produce a mixture of α-endo-bergamotene, α-santalene, (Z)-β-farnesene, epi-β-santalene, and β-santalene. Additional cDNAs that encode bisabolene/bisabolol synthases were also cloned and functionally characterized from these three species. Both the santalene synthases and the bisabolene/bisabolol synthases reside in the TPS-b phylogenetic clade, which is more commonly associated with angiosperm monoterpene synthases. An orthologous set of TPS-a synthases responsible for formation of macrocyclic and bicyclic sesquiterpenes were characterized. Strict functionality and limited sequence divergence in the santalene and bisabolene synthases are in contrast to the TPS-a synthases, suggesting these compounds have played a significant role in the evolution of the Santalum genus.

Santalum album L. is a slow-growing hemi-parasitic tree, which has long been exploited for its fragrant heartwood. Other species such as S. spicatum, native to the arid and semi-arid regions of Western Australia (WA) and S. austrocaledonicum from Vanuatu and New Caledonia, have also contributed sub-stantially to the fragrance market (1). Unsustainable demand for sandalwood has led to the establishment of plantations to add supply. Knowledge of the molecular and physiological underpinnings of sandalwood oil biosynthesis will advance plantation development through improved management and selection. Sandalwood heartwood contains a complex mixture of sesquiterpene olefins and alcohols ( Fig. 1) with (ϩ)-␣-santalene, (Ϫ)-␤-santalene, (Ϫ)-␣-exo-bergamotene, (ϩ)-epi-␤santalene, and ␤-bisabolene frequently representing about 1-2% by weight of oil (2,3). The more odor intensive compounds, ␣and ␤-santalol, ␣-exo-bergamotol, and epi-␤-santalol, as well as (Z)-lanceol and ␣-bisabolol make up the bulk of S. album oil, but are often lower and more variable in S. austrocaledonicum (4) and S. spicatum (5). The compounds found in sandalwood oil may play an important role in warding off pathogens, as sesquiterpenoids have been implicated in plant pathogen defense strategies (6). Santalol biosynthesis is proposed to proceed by multiple Wagner-Meerwein rearrangements of transoid farnesyl diphosphate, (E,E)-FPP, 3 followed by oxidation at C12, probably via a cytochrome P450 (Fig. 1). Co-occurrence patterns of the santalenes and bergamotene, as well as other olefins and bisabolol, indicated that multi-product TPS enzymes may be responsible for the production of several sandalwood oil components (2). Multiple product formation from terpene synthase (TPS) enzymes in S. album was recently confirmed by characterization of two TPSs, however these were not responsible for santalene biosynthesis (7). The commercially exploited sandalwoods possess unique chemical phenotypes (3), whereas others such as S. accuminatum and S. murrayanum, endemic to southwestern WA, produce little or no oil (8). Despite this variation, the santalenes and their hydroxylated equivalents remain key components. To better understand essential oil biosynthesis as well as the molecular origins of chemical diversity in the Santalum genus, we cloned and functionally characterized several TPS cDNAs from three divergent oil-producing Santalum species, S. album, S. austrocaledonicum, and S. spicatum. We also compared genomic TPS sequences of the three commercial Santalum species to those of S. murranyanum to test if the absence of, or open reading frame mutations in the genes accounted for the oil-deficient phenotype of this species.

EXPERIMENTAL PROCEDURES
Chemicals and Reagents-All reagents, solvents, antibiotics, cloning kits, modifying enzymes, and precursor chemicals were purchased from commercial sources. (E,E)-FPP and geranyl diphosphate (GPP) were from Sigma. (Z,Z)-FPP was synthesized using standard methods (9, 10) and products verified by GC-MS, NMR, and IR spectroscopy. Monoterpene standards were from an in-house collection of commercially available standards. An olefin fraction of sandalwood oil yielded a santalene standard (see below under GC-MS analysis and product identification).
Plant Material Collection and RNA Extraction-Several 25 mm holes were drilled into the lower stems of mature S. album, S. austrocaledonicum, S. spicatum, and S. murrayanum trees growing on land managed by the Forest Products Commission of Western Australia. Wood shavings from the heartwood-sapwood transition zone were collected and frozen immediately in liquid nitrogen. The samples were transported to the lab where RNA was extracted from 10 g of tissue using an established protocol (11). After precipitation by LiCl, RNA was stored at Ϫ80°C until needed for reverse transcription and RACE. S. album RNA was transported to UBC Vancouver, Canada for cDNA library construction.
TPS Gene Discovery and RACE-cDNA was generated for S. austrocaledonicum, S. spicatum, and S. murrayanum in the same manner as before, except the cDNA was used directly as template for PCR. Primers based on the open reading frame (ORF) of previously identified S. album TPS genes were used for amplification (7) (supplemental Fig. S1). Where products could not be amplified, 5Ј-and 3Ј-RACE using the Clontech SMARTer TM kit was used to obtain the respective untranslated regions for more specific primer design. SaSesquiTPS1 gene orthologs were amplified in two rounds using a nested primer approach. All products were first cloned into a high-copy storage vector (TOPO Zero Blunt, Invitrogen, or pJET1.2, Fermentas) for sequencing before being subcloned into pET28b(ϩ). Several individuals of S. album and S. spicatum were studied to examine if polymorphisms were present in the ORFs across populations of these two species. No TPS genes could be amplified from S. murrayanum cDNA, although this could be due to exceedingly low transcript abundance. Genomic DNA sequences for all three TPS genes were also cloned and sequenced for all three Santalum species. The same ORF primers used for successful cDNA amplifications were used on genomic DNA extracted from the same individuals from which RNA extractions were performed. These larger gDNA fragments (3-4 kb) were cloned into pJET1.2 vectors and sent for sequencing (Macrogen, Korea). Genomic DNA sequences of TPS genes from the three oil-bearing species were spliced insilico with introns following the usual 3Ј n/GT and 5Ј AG/n pattern (12). Santalum murrayanum intron-exon patterns were determined by comparing the gDNA sequences to cDNAs of S. spicatum.
Phylogenetic Analysis and Alignment of TPS Genes-Sequences were aligned using ClustalX 2.1.0, trees were constructed using Phylip (13) and visualized with the Phylodraw 0.8 program. Sequences used in the phylogenetic analysis are listed in full under supplemental Fig. S2. Genomic DNA sequences of TPS genes from the three oil-bearing species were spliced in-silico with introns following the usual 3Ј n/GT and 5Ј AG/n pattern (12). S. murrayanum intron-exon patterns were determined by comparing the gDNA sequences to cDNA sequences of S. spicatum.
Bacterial Expression and Protein Isolation-TPS genes were cloned into the pET28b(ϩ) expression vector (Novagen, San Diego CA) with a polyhistidine tag in-frame. Depending on the restriction sites available, the His 6 tag was either N-terminal or C-terminal. Primers with appropriate restriction sites (supplemental Fig. S1) were used to amplify each gene and cloned into the pET28b(ϩ) vector. Vectors containing the TPS genes were transformed into chemically competent C41 E. coli cells (Avidis, Saint-Beauzire, France) containing the pRARE 2 plasmid isolated from Rosetta 2 competent cells (Novagen). Colonies were grown on LB plates containing kanamycin and chloramphenicol (50 g ml Ϫ1 ). Three independent colonies were picked and grown in a shaker overnight at 37°C in 5 ml of LB with the same antibiotics and this culture was used to inoculate 400 ml of Terrific Broth. Cell suspensions were grown at 37°C with shaking until the A 600 ϭ 0.8 and induced with isopropyl-␤-D-thiogalacto-pyranoside (IPTG) to a final concentration of 0.2 mM, and shaken overnight at 16°C. Cell suspensions were centrifuged at 4°C and pellets (ϳ1 g) were frozen at Ϫ80°C for future use. Cell pellets were resuspended in 5 ml of lysis buffer containing 1 mg ml Ϫ1 lysozyme, 1 mM MgCl 2 , 5 mM DTT, 0.01 mg ml Ϫ1 DNAse1, and RNAse1, 100 l of protease inhibitor mixture (Sigma) and made in His-trap binding buffer (20 mM Na 2 HPO 4 , pH 7.4, 500 mM NaCl, 30 mM imidazole, pH 7.4)., Cells were stirred thoroughly on ice with a glass rod for 30 min followed by homogenized using a high pressure cell cruncher. The lysate was centrifuged at 12,000 ϫ g at 4°C for 1.25 h before being decanted. Cleared lysate (ϳ12 ml) was purified using Ni 2ϩ affinity chromatography spin-columns (GE Healthcare) and eluted in 600 l of elution buffer (20 mM Na 2 HPO 4 pH 7.4, 500 mM NaCl, 500 mM imidazole, pH 7.4). The eluted protein was desalted on a PD-10 desalting column (GE Healthcare) using 25 mM 4-(2-hydroxyethyl)-1-piperazine ethanesulfonic acid (HEPES) pH 7.4, 10% (v/v) glycerol and 100 mM KCl. Approximately 1-ml fractions from a 3.5 ml elution were collected. Protein concentrations were determined using a Nano-Drop spectrophotometer with extinction coefficients calculated by amino acid composition (14). SDS-PAGE followed by Coomassie Blue staining visualized the purified recombinant proteins (supplemental Fig. S3).
Enzyme Functional Characterization and Kinetic Assays-Enzyme assays for all recombinant proteins were done in triplicate using the GC vial method described by O'Maille et al. (15). For enzyme assays where only product identification was assessed, 10 g of protein was used in a final volume of 500 l of reaction buffer (25 mM HEPES, 10% (v/v) glycerol, 5 mM DTT, and 10 mM of either Mg 2ϩ or Mn 2ϩ . Substrates (FPP and GPP) were added to a final reaction concentration of 100 M. Vials were overlaid with 500 l of hexane to trap volatile products and incubated at 30°C for 2 h. Mixtures were vortexed for 1 min to extract all volatiles and the vials were centrifuged to separate the organic layer. For determination of steady-state enzyme kinetic constants, conditions were as described previously except the enzyme concentration was kept at 10 nM. Substrate concentrations ranged from 1 M to 100 M, and reactions were incubated at 30°C for exactly 5 min. Critically, reactions were quenched at 5 min by the addition of 500 l 0.5 M EDTA, pH 8.0 and vortexed, then stored at Ϫ80°C immediately.
In Vivo Santalene Production in E. coli-An approach similar to the previously published method for diterpene production (16) was used for in vivo production of santalenes. The SaFPPS gene was identified in the xylem EST library (GenBank TM accession no. HQ343283) and was amplified from cDNA using primers with Nco1 and Not1 ends amenable to pCDFDuet-1 vector multiple cloning site 1. Likewise, SaSSy was amplified with Nde1 and Kpn1 sites adapted for use with the In-Fusion cloning system (Clontech) and cloned into the second multiple cloning site (supplemental Fig. S1). The dual expression vector construct, pCDFDuet-1:SaFPPS:SaSSy, as well as an empty pCDFDuet-1 vector control were transformed into chemically competent C41(DE3) cells and grown on 50 g of ml Ϫ1 streptomycin selective media. Individual colonies were grown in 5 ml overnight cultures, and these were used to inoculate 200 ml shaking cultures of Terrific Broth. As cells were approaching the log-phase of growth (A 600 ϭ 0.6) at 37°C the incubator was cooled to 20°C for 1 h before inducing with IPTG at a final concentration of 0.5 mM. Cultures were shaken for a further 16 h before being centrifuged to pellet the cells. Cell pellets were lysed with 0.2 M NaOH and neutralized with acetic acid before being extracted twice with hexanes and reduced by evaporation. Media was extracted twice with 100 ml of hexane and these fractions were reduced by rotary evaporation to yield a thin oily residue. The residues were resuspended in 2 ml of hexane and analyzed by GC-MS using conditions described below.
GC-MS Analysis and Product Identification-Product mixtures were analyzed by GC-MS in scan mode for product identification. A standard containing the three santalenes and ␣-exo-bergamotene was prepared by flash chromatography of 2 ml of neat S. album oil over silica and eluted in hexane. A final yield of 25 mg was resuspended in EtOH, and purity was con-firmed by GC. Monoterpene identification was aided through an in-house standard collection. All mass spectra were compared with the NIST 2005 library and the literature (17)(18)(19)(20)(21)(22)(23). The Kovat retention indices (KIs) were determined for all compounds (supplemental Figs. S4, S7, S10) using an n-alkane standard and compared with the literature wherever DB-WAX or similar phase column data were available (17-21, 23, 24). Where reliable retention data and pure standards were not available, a combination of mass spectral and retention properties was used to infer the candidate compound. GC-MS was performed on a Shimadzu GC2010 with a DB-WAX column and He as carrier gas. Conditions were as follows: Injector 200°C, MS interface 240°C, ion source 200°C, Oven program: 40°C for 3 min, then 8°C min Ϫ1 to 180°C, held 5 min, then 10°C min Ϫ1 to 220°C, held 10 min. Solvent cut time was set to 5 min. For product identification, total ion monitoring was used, scanning from m/z 41 to m/z 250. Incubations with (Z,Z)-FPP were analyzed on a HP5890 with a DB-WAX column and an initial oven temperature of 40°C then ramped at 10°C min Ϫ1 to 230°C and held for 20 min. For kinetic assays, single ion monitoring (SIM) of the sesquiterpene base ions m/z 91, 93, and 94 were used. Likewise monoterpene base ions (m/z 69, 71, and 93) were monitored for GPP assays. An internal standard (isobutyl benzene, 30 M) was added to the hexane used to overlay each reaction. Detector response factors were calcu-lated based on the santalene standard which was prepared earlier. Product losses due to extraction inefficiency was accounted for by first adding the standard to assay buffer as an EtOH stock and extracting into the hexane layer as per sample assays.

RESULTS
A full-length cDNA, SaSSy (S. album santalene synthase) homologous to previously reported angiosperm TPS-b group genes (25) was identified in the S. album xylem EST library and cloned. It encoded a 569 amino acid protein with 56% identity to SamonoTPS1 (7) and appeared to lack an N-terminal transit peptide. Upon heterologous expression in E. coli, the affinitypurified recombinant His 6 -tagged protein had a molecular mass of ϳ66 kDa, similar to that of most monomeric TPS enzymes (supplemental Fig. S3). When incubated in the presence of 10 mM Mg 2ϩ the enzyme converted (E,E)-FPP into ␣-, ␤-, and epi-␤-santalene, ␣-exo-bergamotene, as well as traces of ␣and ␤-farnesene (Fig. 2). Orthologous TPSs identified in two other oil-bearing species (SspiSSy from S. spicatum and SauSSy from S. austrocaledonicum) were also found to convert (E,E)-FPP into the santalenes in very similar proportions as SaSSy (Fig. 2). In all three orthologs, incubations with Mn 2ϩ yielded mainly ␣-exo-bergamotene (supplemental S4). The larger metal ion likely distorts the active site, causing premature  niol, and terpineol along with traces of ␣-pinene and camphene when incubated with geranyl diphosphate (GPP) (supplemental Fig. S4). Although conversion of GPP did occur, a linear relationship existed between substrate concentration and V 0 , even at high (Ͼ100 M) concentrations, rather than an asymptotic curve indicative of active site saturation. Monoterpenes have only been reported in sandalwood oil at very low concentrations (23,26).
Continuity of santalene biosynthesis from FPP produced by a S. album FPP synthase (SaFPPS) was confirmed using an in vivo E. coli expression system. SaFPPS and SaSSy were cloned into the dual expression vector pCDFDuet-1 (Novagen) and transformed into C41 chemically competent cells. No detectable levels of the santalenes or bergamotene were found in the cell pellet extraction, or in the empty vector control extracts, but all four compounds were detected in hexane extracts of media from overnight cultures (Fig. 4). These results validated the in vivo activity of SaSSy and demonstrate the feasibility of metabolically engineering a santalene synthase into an appropriate host microorganism for in vivo production of santalenes using established methods (16,27).
To further explore the origins of chemical diversity in the genus Santalum, the orthologous TPS gene pair SauBS (S. austrocaledonicum ␤-bisabolene synthase) and SspiBS (S. spicatum ␣-bisabolol synthase) were cloned and characterized using cDNA as PCR template with primers originally developed from S. album (7). Recombinant SauBS produced almost exclusively ␤-bisabolene and only traces of ␣-bisabolol with (E,E)-FPP, while only limonene and terpineol were produced when incubated with GPP (supplemental Figs. S5 and S6). SspiBS produced a mixture of ␤-bisabolene and ␣-bisabolol, along with traces of ␣-bisabolene and farnesene isomers. The functions of these two enzymes were very similar to the previously identified SaMonoTPS1 from S. album which produced mostly ␤-bisabolene and traces of ␣-bisabolol from (E,E)-FPP, but could also convert GPP into monoterpenes analogous in structure to the bisabolenes (7). Extending our investigation of Santalum sesquiterpene synthases into the TPS-a subfamily, two orthologous cDNAs; SspiSesquiTPS and SauSesquiTPS, from S. spicatum and S. austrocaledonicum, respectively, were cloned and characterized. The translated amino acid sequences of these genes were similar to the previously characterized SaSes-quiTPS1 from S. album (7) but the recombinant TPSs of these species yielded markedly different sesquiterpene profiles. Sau-SesquiTPS produced ␣-humulene and ␦-cadinene, along with ␤-elemene, which is the thermal rearrangement product of germacrene A (28) and several other bicyclic sesquiterpenes when incubated with (E,E)-FPP (supplemental Fig. S7). The cadinenes may also be the result of heat-induced dehydration rearrangements (7). In contrast, SspiSesquiTPS produced only three main compounds; ␤-elemol (the thermal rearrangement product of hedycaryol (29)), guaiol and bulnesol (supplemental Fig. S8). Both enzymes produced only traces of linalool with GPP (supplemental Fig. S9).
Each set of orthologous TPS across the Santalum genus showed sequence homology to previously reported TPS with key domains being well conserved (supplemental Fig. S10). As with all angiosperm TPSs, the R(R/P)X 8 W motif implicated in prenyl diphosphate ionization (30) and the aspartate-rich divalent metal ion binding domain (31) (DDXXD) are present. Residues likely to be responsible for product specificity and substrate preference were identified in the ␣14 helix, and the ␣18-␣19 helix turn respectively, based on those identified by Kamparinas et al. (32).
Genomic sequences of a set of Santalum TPS genes were compared with determine whether differences at the genomic level might be related to the chemotypic differences observed among oil producing (S. album, S. spicatum, S. austrocaledonicum) and oildeficient (S. murrayanum) species. All four species have genomic copies of the target TPS genes (supplemental Fig. S11) although cDNAs could not be amplified from a xylem-derived RNA pool of S. murrayanum. The deduced amino acid sequences of the santalene synthase, bisabolene/bisabolol synthase, and TPS-a sesquiterpene synthase gDNAs in S. murrayanum are highly homologous to those of the functionally characterized TPS cDNAs of the other species (Fig. 7) and possibly encode functional TPS enzymes. Intron-exon structure of sandalwood TPS genes were typical of angiosperm TPS-a and TPS-b genes; each bearing 6 introns and 7 exons (12) (supplemental Fig. S11). Intron boundaries observed a 3Ј ϳN GT, 5Ј AG Nϳ pattern. An ortholog of SauBS in S. murrayanum, labeled SmBS, showed a mutation encoding a stop codon in exon 3. We also cloned a variant SaSSy gDNA sequence from S. album with a 10-nucleotide deletion in exon 3, producing a frameshift and subsequent stop codon. No frame shifts or premature stop codons were present in the other TPS gDNAs sequenced.

DISCUSSION
The genes and encoded enzymes for santalene biosynthesis have been isolated from three divergent sandalwood species. SaSSy, SauSSy, and SpiSSy are bona fide sesquiterpene synthases for santalene formation, despite being phylogenetically aligned with the TPS-b subfamily, which comprises mainly of angiosperm monoterpene synthases. Low K m values for each santalene synthase indicated a high affinity for (E,E)-FPP, while no active site saturation was apparent with GPP, indicating they are genuine sesquiterpene synthases. Vestigial activity with GPP suggests that these santalene synthases may have evolved from a monoterpene synthase ancestor through loss of the plastid target peptide and subsequent specialization of the active site for (E,E)-FPP. The putative ancestral enzyme which gave rise to SaSSy, SauSSy, and SspiSSy likely had latent plasticity in the active site, enabling it to accommodate both GPP and FPP. Similarly, it has been shown in the case of fruit flavor biosynthesis in wild and cultivated strawberry (33) that molecular evolution of the corresponding TPSs involved plasticity of the active site to accommodate alternative substrates when subcellular localization of the enzymes is changed through loss of a plastid target peptide. Most intriguing, SaSSy was able to convert (Z,Z)-FPP into ␣-, ␤-, and epi-␤-santalenes and ␣endo-bergamotene (Fig. 3). Sallaud et al. (34) discovered a santalene/bergamotene synthase in wild tomato, which was able to accommodate (Z,Z)-FPP but was unreactive toward (E,E)-FPP (Fig. 5). SaSSy is uniquely able to accommodate both transoid and cisoid isomers of FPP and surprisingly, produce a similar suite of compounds. These rearrangements are likely to proceed via the initial ionization of (Z,Z)-FPP into either (3S) or (3R)-nerolidyl diphosphate (Fig. 6). Cyclization may occur through an endo-conformation, resulting in ␣-endo-bergamotene and (Ϫ)-epi-␤-santalene, while the exo-cyclization would lead to ␣and ␤-santalene. Sandalwood oil contains traces of ␣-endo-bergamotene (26) thus it is plausible that the native enzyme is able to access both isomers of FPP in the cell. To our knowledge this level of plasticity in the active site of a TPS is unprecedented.
In addition to characterizing the santalene synthases, continuity of the biosynthetic pathway from FPP to the santalenes (Fig. 4) was demonstrated using an in vivo production system, highlighting the potential for metabolic engineering of microorganisms to produce precursors of valuable fragrance compounds of limited availability from plants. This system will also serve as a useful platform for exploring further downstream metabolic processes such as hydroxylation to the santalols.
Like the santalene synthases, the active sites of the bisabolene/bisabolol synthases are sufficiently plastic to accommodate the same series of carbocation rearrangements for both C10 and C15 substrates, as has been found with a cineole synthase mutant from Salvia fruticosa (32). Santalum spicatum oil contains variable amounts of ␣-bisabolol relative to other sesquiterpenoids (5) and allelic variation (35) in the ORF of SspiBS, particularly in the ␣14 helix may be partly responsible for the diversity of phenotypes observed across the distribution of S. spicatum. Within-species variation in the santalene synthase ORFs were not evident for the species studied here.
Phylogenetically, SauBS and SspiBS also cluster with the santalene synthases in the TPS-b group (Fig. 7). Thus, TPSs of two phylogenetic clades, TPS-a and TPS-b, contribute to the sesquiterpene profiles of sandalwood oils.
The TPS-a sesquiterpene synthases from the three oil-bearing Santalum species all produced markedly different combinations of sesquiterpenes (supplemental Figs. S7-S9). This was true to a lesser extent for the bisabolene/bisabolol synthases (supplemental Figs. S5 and S6) yet in both cases the diversity of function is in contrast to the highly conserved product profiles of the three orthologous santalene synthases (Fig. 2).
Genomic sequences revealed that all four species of Santalum studied here had copies of TPS genes, and that no frame shifts or mutations in the open reading frames were apparent (supplemental Fig. S11). Even the genome of S. murrayanum contains a complete ORF of the santalene synthase, despite possessing an oil-deficient phenotype. This suggests factors controlling the spatial or temporal patterns of TPS expression, rather than the absence of, or mutations in the ORFs of the genes themselves, are likely to be responsible for the low-or no-oil phenotype.
All oil producing species of Santalum, and even ancestral genera within the Santalaceae family (36), contain detectable levels of the santalenes, ␣-bergamotene, bisabolene, and bisabolol, and all species studied here contain complete genomic copies of the santalene synthase ORF. The uniformity of santalene product profiles of SaSSy, SauSSy, and SspiSSy is in contrast to the variety of compounds produced by the bisabolene/ bisabolol synthases, and more so by the TPS-a group sesquiterpene synthases in the Santalum genus. The higher amino acid sequence identity of the santalene synthase orthologs (94 -98%) compared with the TPS-a genes (89 -93%) further support a genetic basis of functional conservation of santalene biosynthesis (supplemental Fig. S10). Function of SauBS and SaMonoTPS (7) is also well conserved between S. album and S. austrocaledonicum (99% identity) however SspiBS differs in both sequence identity (92%) and catalytic function (supplemental Fig. S2). These findings may suggest positive selection for santalene and to a lesser extent bisabolene biosynthesis in the Santalum genus, and possibly in the Santalaceae family more generally. As sesquiterpenes have been implicated in defense against pathogens (6) it is possible that selection pressure in the form of disease has allowed for the survival of populations with the santalene and bisabolene phenotype.
In conclusion, we have identified several terpene synthases responsible for the production of key fragrant compounds in three commercially exploited species of sandalwood. These findings are of great significance for the flavor and fragrance industry, as well as the growing plantation sandalwood industry.