Identification and Manipulation of the Caprazamycin Gene Cluster Lead to New Simplified Liponucleoside Antibiotics and Give Insights into the Biosynthetic Pathway*

Caprazamycins are potent anti-mycobacterial liponucleoside antibiotics isolated from Streptomyces sp. MK730-62F2 and belong to the translocase I inhibitor family. Their complex structure is derived from 5′-(β-O-aminoribosyl)-glycyluridine and comprises a unique N-methyldiazepanone ring. The biosynthetic gene cluster has been identified, cloned, and sequenced, representing the first gene cluster of a translocase I inhibitor. Sequence analysis revealed the presence of 23 open reading frames putatively involved in export, resistance, regulation, and biosynthesis of the caprazamycins. Heterologous expression of the gene cluster in Streptomyces coelicolor M512 led to the production of non-glycosylated bioactive caprazamycin derivatives. A set of gene deletions validated the boundaries of the cluster and inactivation of cpz21 resulted in the accumulation of novel simplified liponucleoside antibiotics that lack the 3-methylglutaryl moiety. Therefore, Cpz21 is assigned to act as an acyltransferase in caprazamycin biosynthesis. In vivo and in silico analysis of the caprazamycin biosynthetic gene cluster allows a first proposal of the biosynthetic pathway and provides insights into the biosynthesis of related uridyl-antibiotics.

(6) (6) and the muraymycins (7) (7), which is cyclized to form a rare diazepanone ring. Attached to the 3ٞ-OH are ␤-hydroxy fatty acids of different chain length resulting in CPZs A-G (1). They differ from the LPMs in the absence of a sulfate group at the 2Љ-position of the aminoribose and the presence of a permethylated L-rhamnose ␤-glycosidically linked to the 3-methylglutaryl (3-MG) moiety.
The LPMs have been shown to inhibit biosynthesis of the bacterial cell wall by targeting the formation of lipid I (8). The CPZs are expected to act in the same way and are assigned to the growing number of translocase I inhibitors that include other nucleoside antibiotics, like the tunicamycins and mureidomycins (9). During peptidoglycan formation, translocase I catalyzes the transfer of UDP-MurNAc-pentapeptide to the undecaprenyl phosphate carrier to generate lipid I (10). This reaction is considered an unexploited and promising target for new anti-infective drugs (11).
Recent investigations indicate that the 3Љ-OH group (12), the amino group of the aminoribosyl-glycyluridine, and an intact uracil moiety (13) are essential for the inhibition of the Escherichia coli translocase I MraY. The chemical synthesis of the (ϩ)-caprazol (5) was recently accomplished (14), however, this compound only shows weak antibacterial activity. In contrast, the acylated compounds 3 and 4 exhibit strong growth inhibition of mycobacteria, suggesting a potential role of the fatty acid side chain in penetration of the bacterial cell (15,16). Apparently, the acyl-caprazols (4) represent the most simplified antibiotically active liponucleosides and a good starting point for further optimization of this class of potential therapeutics.
Although chemical synthesis and biological activity of CPZs and LPMs has been studied in some detail, their biosynthesis remains speculative and only few data exists about the formation of other translocase I inhibitors (17,18). Nevertheless, we assume that the CPZ biosynthetic pathway is partially similar to that of LPMs, FR-90043 (6), and muraymycins (7) and presents a model for the comprehension and manipulation of liponucleoside formation. Considering the unique structural features of the CPZs we also expect some unusual biotransformations to be involved in the formation of, e.g. the (ϩ)-caprazol.
Here we report the identification and analysis of the CPZ gene cluster, the first cluster of a translocase I inhibitor. A set of gene disruption experiments provide insights into the biosynthetic origin of the CPZs and moreover, heterologous expres-sion of the gene cluster allows the generation of novel bioactive derivatives by pathway engineering.

EXPERIMENTAL PROCEDURES
Bacterial Strains and General Methods-Chemicals, microbiological, and molecular biological agents were purchased from standard commercial sources. Streptomyces sp. MK730-62F2 and Streptomyces coelicolor M512 (SCP1 Ϫ , SCP2 Ϫ , ⌬actIIorf4, ⌬redD) and their respective derivatives were maintained and grown on either MS agar (2% soy flour, 2% mannitol, 2% agar; components purchased from Carl Roth, Karlsruhe, Germany) or TSB medium (BD Biosciences). E. coli strains were cultivated in LB medium (components purchased from Carl Roth) supplemented with appropriate antibiotics. Mycobacterium phlei was cultured in nutrient agar (BD Biosciences) and used as an indicator strain in agar diffusion assays for the detection of bioactivity in culture extracts of Streptomyces sp. MK730-62F2, S. coelicolor M512, and their derivatives. DNA isolation and manipulations were carried out according to standard methods for E. coli (19) and Streptomyces (20).
Production, Extraction, and Detection of Caprazamycin Derivatives-50 ml of TSB media was inoculated with spore suspension of Streptomyces sp. MK730-62F2, S. coelicolor M512, or a derivative thereof. The cultures were incubated for 2 days at 30°C and 200 rpm. For the production of CPZs, 1 ml of the pre-cultures were inoculated into 100 ml of a medium containing 1% soytone, 1% soluble starch, and 2% D-maltose adjusted to pH 6.7 (components purchased from BD Biosciences). The cultures were incubated for 7 days at 30°C and 200 rpm. For rapid identification of CPZs, cells were harvested and extracted with ice-cold methanol. The extract was directly applied to LC-MS and agar diffusion assay. Partial purification of CPZs was achieved by the adjustment of the culture supernatant to pH 4 and its subsequent extraction with an equal volume of butanol. The organic phase was evaporated and extracts were resolved in 500 l of methanol. LC-MS/MS analysis was performed on a Surveyor HPLC system equipped with a Reprosil-Pur Basic C18 (5 m, 250 ϫ 2 mm) column (Dr. Maisch, Ammerbuch, Germany) coupled to a Thermo Finnigan TSQ Quantum triple quadrupole mass spectrometer (heated capillary temperature, 320°C; sheath gas, nitrogen). For sample separation, a linear gradient from 2 to 40% acetonitrile in aqueous formic acid (0.1%) over 4 min followed by a linear gradient from 40 to 100% acetonitrile in aqueous formic acid (0.1%) over 31 min was used; the flow rate was 0.2 liters min Ϫ1 and detection at 262 nm. Positive electrospray ionization ((ϩ)-ESI) was performed with electrospray voltage of 3.8 kV and collisioninduced dissociation spectra were recorded with collision Bioactivity of culture extracts was monitored using M. phlei as an indicator strain. 50 l of a glycerol culture of M. phlei was spread out on a nutrient agar plate. 5 l of the butanolic culture extracts were applied to filter paper discs (5 mm) and placed on the top of the agar. The assay was incubated at 30°C for 48 h.
DNA Sequencing and Computer-assisted Sequence Analysis-Double-stranded sequencing of the entire cosmid clone 31C2 (42,300-bp insert) was performed by GenoTech (Baejeon, Korea) by using a shotgun library with DNA fragments of ϳ0.5-1.0 kb in length. The DNASIS software package (Hitachi Software Engineering, Tokyo, Japan) and Artemis (Wellcome Trust Genome Campus, Cambridge, UK) were used for sequence analysis and annotation. Data base comparisons were carried out in the GenBank TM data base by using the BLAST program (21). Alignment and comparison of sequences were performed using the ClustalX algorithm (22) and GeneDoc alignment editor.
Generation of ⌬cpz21 and ⌬cpz23 Mutants in S. coelicolor M512-Deletion mutants were generated in accordance with the generation of the mutants for the detection of the cluster boundaries. Primer pairs cpz21_rv/cpz21_fw and cpz23_rv/ cpz23_fw were used to amplify the apramycin resistance cassette (supplemental Table S1). The resulting mutants were To generate the expression plasmids for mutant complementation, cpz21 and cpz23 were amplified from cosmid cpzLK09 using primer pairs cpz21Eco_fw/cpz21Hind_rv and cpz23Eco_fw/cpz23Hind_rv and cloned into the vector pGEM-T (Promega). The genes were subcloned into the EcoRI/SpeI sites of expression vector pUWL201 (28) under the control of the ermE* promoter. This resulted in plasmids pLL06 (cpz21) and pLL07 (cpz23), respectively. DNA sequencing of these plasmids confirmed the correct sequence of all constructs. For protoplast transformation, the two plasmids were transferred into the non-methylating E. coli strain ET12567 and DNA was isolated by standard procedures. Transformation of the S. coelicolor mutant strains by polyethylene glycol-mediated protoplast transformation (20) finally generated strains S. coelicolor M512/cpzLL06/pLL06 and S. coelicolor M512/cpzLL07/pLL07.

Identification and Cloning of the Caprazamycin Gene Cluster-
The unusual structure of CPZs and the lack of information about the biosynthetic origin make it difficult to select genetic probes for the identification of the gene cluster. However, the formation of the permethylated L-rhamnose moiety is known from other antibiotics like elloramycin (29) and spinosyn (30). Oligonucleotides deduced from a multiple sequence alignment based on the elloramycin methyltransferase elmM1 led to the amplification of a partial sequence of cpz28 with high similarity to sugar O-methyltransferases. Primer walking revealed two adjacent genes cpz29 and cpz30 to be homologous to other O-methyltransferases suggesting the presence of the CPZ gene cluster. To our knowledge, this is the first study demonstrating the successful application of degenerated primers for O-meth-yltransferases for probe development. Commonly, methyltransferases are considered to be too diverse on the nucleotide sequence level and too widely distributed in bacterial metabolism to be useful in the identification of a specific gene cluster.
Perfect matching primers were applied to a genomic library of Streptomyces sp. MK730-62F2 constructed in a SuperCos1 vector. Eight positive cosmids of 3000 could be identified and proven to contain overlapping DNA by restriction mapping. Cosmid 31C2 was finally selected for complete shotgun sequencing (nucleotide sequence of the gene cluster has been deposited at GenBank, accession number FJ490409).
Sequence Analysis of the Caprazamycin Gene Cluster-A contiguous 42.3-kb region could be assembled with an average GC content of 70.2%, a typical value for Streptomyces DNA. In silico sequence analysis guided by BLAST homology searches (21), conserved protein domain searches (31), and the GC frame plot method (32) revealed 34 candidate genes. A total of 23 open reading frames, designated cpz9 -31, were assigned to the CPZ gene cluster putatively encoding for biosynthesis, resistance, transport, and regulatory functions (Fig. 2). Table 1 summarizes the orthologous and proposed functions of the annotated genes. Notably, most of the putative gene products did not show homology to proteins found previously in other secondary metabolite gene clusters that reflects the unusual structure of the CPZs.
As proposed, the CPZ gene cluster would start with cpz9, which encodes for a putative regulator of the AraC family. Most members of this family are positive transcriptional activators containing a helix-turn-helix motif. They are known from sugar degradation and other pathways but are rarely found in gene clusters of secondary metabolism (33). The predicted gene product of cpz22 shows homology to ABC-transporters. Similar proteins can be found in many antibiotic gene clusters and are usually involved in self-resistance and export (34). cpz12 and cpz27 are two putative sugar kinase genes similar to tunicamycin resistance proteins, e.g. TmrD from Deinococcus radiodurans, which structure has been reported recently (35). The 2Ј-, 3Ј-, and 5Ј-hydroxy groups of the uridine have been suggested as potential targets for phosphorylation by TmrD, resulting in inactivation of the nucleoside antibiotic tunicamycin.
Cpz10 exhibits similarity to the Fe(II)/2-oxoglutaratedependent oxygenase family (36). Cpz11 and Cpz26 are two putative methyltransferases that contain conserved S-adenosylmethionine-binding domains (cd02440) but share low overall homology to each other (10% identity/18% similarity). Inter-estingly, both genes are translationally coupled to possible resistance genes cpz12 and cpz27 by overlap of start and stop codons. cpz13 shows weak homology to aminotransferase genes and is most likely translationally coupled to the predicted serine hydroxymethyltransferase gene cpz14.
The genes cpz15-23 seem to be co-translated as indicated by the overlap of start and stop codons. This subcluster would encode for Cpz15, another hypothetical Fe(II)/2-oxoglutaratedependent oxygenase, a putative nucleotidyltransferase Cpz16 and Cpz17, which shows similarity to the glycosyltransferases. Cpz18 seems to belong to the class III aminotransferases, whereas Cpz19 resembles pyrimidine-nucleoside phosphorylases. Cpz20 and Cpz21 are similar to a putative acyl-CoA synthase (TMCL4) and a carboxyesterase (TMCL1) from the tautomycetin gene cluster (37). Another possible esterase is encoded by cpz23 the last gene in the proposed subcluster. The deduced gene product of cpz25 is a hypothetical alcohol dehydrogenase.
cpz28, cpz29, cpz30, and cpz31 apparently constitute an operon for the attachment and methylation of a deoxysugar as indicated by probable translational coupling of these genes. They show strong similarity to O-methyltransferases and glycosyltransferases from known antibiotic gene clusters in particular to proteins participating in the formation of elloramycin (29,38) and spinosyn (39). Both compounds contain the same permethylated L-rhamnose moiety as found in the CPZs.
Interestingly, we could not identify genes for the dTDP-Lrhamnose biosynthesis on the cosmid. This was initially surprising, because all genes for the production of a bacterial secondary metabolite are usually clustered. However, neither the gene cluster of elloramycin (40) nor of spinosyn (41), steffimycin (42), or arranciamycin (43) contain genes for dTDP-L-rhamnose formation. We therefore suggest the genes for the CPZs deoxysugar biosynthesis to be located elsewhere on the genome of the natural producer.
Heterologous Expression of the Caprazamycin Gene Cluster-To investigate whether the genes found on cosmid 31C2 were sufficient for biosynthesis of CPZs we intended to express the cosmid heterologously. For this purpose, the betalactamase (bla) gene on the backbone of 31C2 was replaced with an integration cassette of pIJ787 (23,44) containing the attP attachment site and the integrase gene (int) of phage ⌽C31, a tetracycline resistance gene (tet) and an origin of transfer (oriT) using -Red recombination. The generated cosmid cpzLK09 was introduced into S. coelicolor M512 by polyethylene glycolmediated protoplast transformation (20) and three kanamycin FIGURE 2. Organization of the caprazamycin gene cluster. The putative assignment of the genes to different steps in the biosynthesis is indicated. Bars above the cluster mark the gene deletions performed in this study. Ϫ indicates that the deletion of the respective region led to an abolishment of CPZ production. ϩ indicates that CPZ production was not influenced. resistance clones were selected, referred to as S. coelicolor M512/cpzLK09(1), -(2), and - (3). Extracts of cultures of the wild-type and mutant strains were applied to HPLC and ESI-MS/MS. In the wild-type strain the known CPZs A-G, whose different fatty acid side chains result in three different masses were detected readily as depicted in the selected ion monitoring chromatograms of Fig S1). Additional evidence was obtained by product ion scans in negative mode using the same extracts (supplemental Fig. S2).
Although the masses for the CPZs could only be detected in the natural producer Streptomyces sp. MK730-62F2 ( Fig. 3A and supplemental Fig. S4), prominent mass peaks for the CPZ aglyca were observed in S. coelicolor M512/cpzLK09 (selected ion monitoring chromatograms in Fig. 3B; for the CPZ E and F  Fig. S1). Molecular ions of m/z 558, 427, and 315, assigned to components of the caprazol structure, were found in all three spectra. S. coelicolor M512 without the gene cluster did not produce any of these new substances (supplemental Figs. S3 and S4). The analytical data strongly implicates the production of non-glycosylated CPZs, which are similar to the type-(III) LPMs (Fig. 1, 3) isolated previously (46). These compounds have been reported to show excellent activity against Mycobacteria. Thus, culture extracts from the heterologous producer were applied to an agar diffusion assay against M. phlei (Fig. 4). An inhibition zone of similar size could be observed with extracts from the wild-type and mutant strains, whereas extracts from S. coelicolor M512 without the gene cluster showed no bioactivity in this assay. The production of the non-glycosylated CPZ derivatives correlates with the absence of genes for the dTDP-L-rhamnose biosynthesis on the cosmid. Apparently, S. coelicolor M512 is unable to provide the dTDP-L-rhamnose in compensation as the corresponding enzymes are not encoded on the genome. Both, analytical and biological data verify that the genes identified on cosmid 31C2 indeed represent the CPZ biosynthetic gene cluster.
Validation of the Cluster Boundaries-A set of gene disruption experiments was carried out to determine the cluster boundaries. Sequence analysis of overlapping cosmids suggested the CPZ gene cluster to be inserted into a genomic region apparently conserved in several Streptomyces strains. cpz1 and cpz3 at the one end of the cluster and cpz33 and cpz34 at the other are almost identical with a continuous part of sequence from Streptomyces sp. SCC 2136 (47). Primer walking and terminal sequencing of overlapping cosmids showed that this similarity extends further upstream of cpz1.
Although cpz4 encodes for a hypothetical protein with unknown function, cpz5 showed homology to 3-hydroxymethylglutaryl (HMG)-CoA synthases. 3-Hydroxymethylglutaryl-CoA synthases catalyze the aldol addition of acetyl-CoA onto acetoacetyl-CoA and usually participate in the mevalonate pathway (48). A biosynthetic route to the uncommon 3-MG moiety was proposed involving a 3-hydroxymethylglutaryl-CoA synthase, a dehydratase, and a hydrogenase. No function in CPZ formation could be assigned to a putative type III polyketide synthase encoded by cpz6 and the possibly co-transcribed genes cpz7 and cpz8.
To validate the left border of the cluster cpz1, cpz2, cpz3, and cpz4 were deleted in cpzLK09 to generate cosmid cpzWP05 and cpz5 was additionally deleted to generate cpzWP06. By inactivation of the suggested biosynthetic pathway to 3-MG in  cpzWP06 we hoped to produce compounds similar to the highly bioactive type (IV) LPMs (Fig. 1, 4). cpz6 was inactivated in addition to cpz1-cpz5 in cosmid cpzWP07. At the right end of the cluster cpz33 and cpz34 encoding for a hypothetical protein and a metallophosphoesterase were deleted in cosmid cpzWP01. A possible nucleotidyltransferases encoded by cpz32 was additionally deleted in cosmid cpzWP02.
After introducing the modified cosmids into S. coelicolor M512, positive candidates were selected by their kanamycin resistance and verified by PCR. Cultivation and analysis by HPLC and ESI-MS/MS revealed production of CPZ aglyca in all mutants (data not shown). In addition, bioassays of the culture extracts against M. phlei did not show any difference in inhibitory activity compared with S. coelicolor M512/cpzLK09 containing the intact gene cluster. In the case of ⌬cpz4, ⌬cpz5, and ⌬cpz32 complementation by host genes seems unlikely as the S. coelicolor genome contains no homologues. Therefore we concluded cpz1-6 and cpz32-34 to be non-essential in CPZ biosynthesis. Given that cpz6 is most likely co-transcribed with its downstream positioned genes, a functional knock-out of cpz7 and cpz8 can be assumed in cosmid cpzWP07. Consequently, the biosynthetic gene cluster for CPZs is predicted to span from cpz9 to cpz31 (Fig. 2).
Deletion of cpz21 and cpz23 and Production of Hydroxyacylcaprazols-Because cpz5 seems not to be required for CPZ formation and the corresponding mutant S. coelicolor M512/ cpzWP06 did not accumulate the desired ␤-hydroxyacylcaprazols (Fig. 5, 4), we searched for possible acyltransferases within the gene cluster. Two acyl moieties, the 3-MG and ␤-hydroxy fatty acids have to be attached during CPZ biosynthesis and the two putative hydrolases Cpz21 and Cpz23 could be considered for these transfer reactions. Cpz21 is predicted to contain a typical ␣/␤-hydrolase fold, the catalytic triad Ser 208 -Glu 326 -His 409 , and a GXSXG motif (49). The overall homology of Cpz21 is strongest with TMCL1 from Streptomyces sp. CK4412 (37). TMCL1, also named TmcC, is assigned to the esterification of a dialkymaleic anhydrid moiety to the linear polyketide during tautomycetin formation.
The amino acid sequence deduced from cpz23 shows highest overall homology to several hypothetical proteins from Streptomyces including AviX9 of the avilamycin gene cluster from Streptomyces viridochromogenes (50). According to the conserved protein domain search the C terminus of Cpz23 is similar to SGNH-hydrolases, a diverse family of lipases and carboxyesterases (51).
Both genes were individually deleted from cosmid cpzLK09 using -Red-mediated recombination. To create in-frame deletions, the disruption cassette from pIJ773 was subsequently removed by the use of FLP-recombinase (25) generating cosmids cpzLL06 (⌬cpz21) and cpzLL07 (⌬cpz23).   S1). Overall, fragments matched exactly the predicted characteristics of ␤-hydroxyacylcaprazols, structurally minimized liponucleosides antibiotics. In a bioassay against M. phlei (Fig.  4) extracts of a S. coelicolor M512/cpzLK09 and S. coelicolor M512/cpzLL06 (⌬cpz21) cultivation broths showed both similar inhibiting activities, whereas no growth inhibition could be observed with S. coelicolor M512 and S. coelicolor M512/cp-zLL07 (data not shown) extracts. Co-expression of intact copies of the deleted genes under the constitutive ermE* promotor in the respective mutants restored the production of CPZ aglyca (data not shown).
Although similar to the type (IV) LPMs (4) the ␤-hydroxyacylcaprazols are expected to be slightly more hydrophilic due to the additional OH-group at the 3a-C position. Thus, they represent interesting novel compounds for further investigations, e.g. in structure/activity relationships. Moreover, the data indicated that Cpz21 is most likely involved in attachment of the 3-MG moiety. Beside Cpz21 two other enzymes with homology to para-nitrobenzyl esterases are known from bacterial secondary metabolism. Both of these enzymes, TmcC and TtmK from the tautomycin gene cluster (52), were proposed to catalyze the attachment of an acyl group. However, inactivation of the corresponding genes in the gene cluster did not lead to the identification of an accumulated intermediate. Therefore the data presented here provides the first functional evidence that these family of enzymes indeed act as acyltransferases.
A Model for Caprazamycin Biosynthesis-Sequence analysis of the gene cluster combined with analytical data from heterologous expression and gene inactivation experiments may allow a first proposal of the CPZ biosynthetic pathway (Fig. 5) although many of the suggested reactions remain speculative at present.
A key question in the biosynthesis of CPZs and translocase I inhibitors of the same class is the origin of the glycyluridine (Fig. 5, 9). Metabolic labeling studies have shown that uridine is incorporated directly into related uridyl antibiotics such as tunicamycins (53). A pathway to the tunicamycins has been proposed to start with the oxidation of uridine to form uridine 5Ј-aldehyde (8) (17). We suggest a similar reaction for CPZ biosynthesis, which may be catalyzed by the putative alcohol dehydrogenase Cpz25. The resulting product (8) could undergo a subsequent aldol addition with a pyridoxal phosphate-glycine adduct to generate 9. This mechanism would be very similar to that of the well studied serine hydroxymethyltransferases, which are known to produce ␤-hydroxy ␣-amino acids from glycine and various aldehydes (54). Cpz14, with significant sequence similarity to serine hydroxymethyltransferases, is an obvious candidate for the catalysis of this reaction. The next step would be the transfer of a 3-amino-3-carboxypropyl group to the 5ٞ-amino group of 9 to form 10. A corresponding reaction occurs in the nocardicin biosynthesis (55). In this pathway, the gene product Nat utilizes S-adenosyl-methionine to transfer the 3-amino-3-carboxypropyl moiety to a nucleophilic acceptor (56). Nat shows conserved domains of S-adenosylmethionine-dependent methyltransferases, which are also found in Cpz11 and Cpz26, but overall sequence similarity is low with 16% to Cpz26 and 15% to Cpz11. Hence, both genes may be candidates for a 3-amino-3-carboxypropyl transfer in CPZ biosynthesis, although we rather consider them to be involved in the two N-methylation steps discussed below. We further speculate that 10 could be a common intermediate in the caprazamycin (1), liposidomycin (2), FR900493 (6), and the muraymycin (7) biosynthesis. ␤-Hydroxylation of the 3-amino-3-carboxypropyl group of 10 would lead to 11 and could be catalyzed by either Cpz10 or Cpz15. Both proteins show homology to oxygenases.
Subsequent biosynthetic steps, including formation and transfer of the aminoribose, cyclization, and N-methylation of the diazepanone ring and attachment of the fatty acid would finally lead to 4 (Fig. 5). Compounds of this structure were accumulated in the ⌬cpz21 mutant strain and are probable intermediates of the CPZ pathway. Reasonable candidate genes for these biosynthetic steps can be found in the cluster. However, the sequence of these reactions, described in the following paragraphs, is speculative at present.
Cyclization of 11 by amide bond formation between the carboxyl group and the secondary amino group would immediately result in the characteristic diazepanone ring. For this reaction, a previous activation of the carboxyl group, e.g. in the form of an acyl adenylate, a coenzyme A ester, or an acyl phosphate would be required. The hypothetical acyl-CoA synthase Cpz20 or the putative kinases Cpz12 and Cpz27 may be involved in this reaction.
Interestingly, a contiguous set of genes, cpz16 -19, was found in the CPZ cluster, which can be assigned to all steps required for the generation and attachment of the aminoribosyl moiety. This reaction sequence may start from a second molecule of 8, derived from uridine by a Cpz25-mediated oxidation as described above. Subsequently, the 5-aldehyde group could undergo an aminotransfer reaction, yielding a 5-aminated nucleoside possibly catalyzed by the hypothetical aminotransferase Cpz18. CetH, an orthologoue of Cpz18, has recently been assigned to the aminotransfer reaction in biosynthesis of the aminocyclitol cetoniacytone (57). 5-Amino-ribose-1-phosphate and uracil would be generated from the aminated nucleoside by Cpz19, a putative pyrimidine-nucleoside phosphorylase. A similar reaction has been shown in fluorothreonine biosynthesis where a 1-phosphoribosyl derivative is formed under catalysis of the pyrimidine phosphorylase FlB (58). Subsequently, the potential nucleotidyltransferase Cpz16 may convert the 5-amino-ribose-1-phosphate to dNDP-5-aminoribose. Then, the putative glycosyltransferase Cpz17 could transfer the aminoribose moiety forming a glycosidic bond.
Generally, ribosyl moieties are attached by phosphoribosyltransferases (59) using 5-phosphoribosyl-1-diphosphate as a donor to generate a 5Ј-phosphoribosylated product. Then, the 5Ј-phospho group is removed by a phosphatase. Similar reactions have recently been shown to lead to the ribosyl moiety in butirosin biosynthesis involving BtrL and BtrP (60). However, no orthologoues to BtrL and BtrP were found in the CPZ gene cluster, making the pathway described above a more likely alternative.
The fatty acid moieties of LPMs and CPZs are probably derived from primary metabolism, as feeding studies with labeled palmitic acid in Streptomyces griseosporeus showed the direct incorporation into LPMs (61). Hydroxylation of the fatty acids could either occur within primary metabolism or by oxygenases Cpz10 or Cpz15. Cpz23 may be involved in the attachment of the hydroxy fatty acids, due to its homology to lipases.
In the diazepanone ring, both nitrogens are methylated. The N-methylation reactions are likely to be catalyzed by Cpz11 and/or Cpz26. Notably, Cpz11 shows sequence similarity (55%) to one of the few characterized N-methyltransferases AtM1 from the gene cluster of AT2433 (62).
The biosynthetic origin of the 3-MG moiety remains elusive. By our inactivation experiments we could exclude an involvement of the putative 3-hydroxymethylglutaryl-CoA synthase Cpz5. Therefore, this moiety is likely generated by enzymes encoded outside the cluster, probably in the form of a coenzyme A ester. We assign the catalysis of the subsequent acyltransfer to Cpz21 as indicated by functional investigations in this study.
Analogous to the biosynthesis of elloramycin, L-rhamnose would be synthesized from enzymes encoded elsewhere on the genome (40). The dNDP-L-rhamnose probably constitutes the substrate for a transfer reaction to the CPZ aglycon catalyzed by the putative rhamnosyltransferase Cpz31. Sequential methylation of the deoxysugar moiety is likely catalyzed by the hypothetical sugar O-methyltransferases Cpz28, Cpz29, and Cpz30.
The identification and analysis of the caprazamycin gene cluster provides the first molecular basis for the proposal of a translocase I inhibitor biosynthetic pathway. Because the formation of intermediate 10 can be speculated to be similar for other structurally related compounds, this work may help in the development of probes for the discovery of gene clusters of other uridyl antibiotics. As proposed, several biosynthetic steps to the caprazamycins seem to be distinctive and unique in bacterial secondary metabolism. Apparently they represent intriguing subjects for further functional investigations. A detailed understanding of the caprazamycins biosynthetic pathway combined with the successful establishment of a heterologous expression system sets the basis for genetic and metabolic engineering toward the production of new liponucleoside antibiotics with improved properties.