Small efficient cell-penetrating peptides derived from scorpion toxin maurocalcine.

Maurocalcine is the first demonstrated example of an animal toxin peptide with efficient cell penetration properties. Although it is a highly competitive cell-penetrating peptide (CPP), its relatively large size of 33 amino acids and the presence of three internal disulfide bridges may hamper its development for in vitro and in vivo applications. Here, we demonstrate that several efficient CPPs can be derived from maurocalcine by replacing Cys residues by isosteric 2-aminobutyric acid residues and sequence truncation down to peptides of up to 9 residues in length. A surprising finding is that all of the truncated maurocalcine analogues possessed cell penetration properties, indicating that the maurocalcine is a highly specialized CPP. Careful examination of the cell penetration properties of the truncated analogues indicates that several maurocalcine-derived peptides should be of great interest for cell delivery applications where peptide size matters.


SUMMARY
Maurocalcine is the first demonstrated example of an animal toxin peptide with efficient cell penetration properties. While it is a highly competitive cell penetrating peptide (CPP), its relatively large size of 33 amino acids and the presence of three internal disulfide bridges may hamper its development for in vitro and in vivo applications. Here, we demonstrate that several efficient CPP can be derived from maurocalcine by replacing Cys residues by isosteric 2aminobutyric acid residues and sequence truncation down to peptides of up to 9 residues in length. A surprising finding is that all the truncated maurocalcine analogues possessed cell penetrating properties indicating that the maurocalcine is a highly specialized CPP. Careful examination of the cell penetrating properties of the truncated analogues indicates that several maurocalcine-derived peptides should be of great interest for cell delivery applications where peptide size matters.
Maurocalcine (MCa) is a 33-mer peptide that was initially isolated from the venom of a Tunisian chactid scorpion, Scorpio maurus palmatus (1). The toxin belongs to a family of peptide that folds according to an Inhibitor Cystine Knot (ICK motif), and thus contains three disulfide bridges with a Cys 1 -Cys 4 , Cys 2 -Cys 5 and Cys 3 -Cys 6 connecting pattern (2). The solution structure, as defined by 1 H-NMR, illustrates that MCa contains three β-strands (strand 1 from amino acid residues 9 to 11, strand 2 from 20 to 23, and strand 3 from 30 to 33). One distinctiveness of MCa is the fact that it is greatly enriched in basic amino acid residues. Out of the 33 amino acids that compose MCa, twelve of them are basic, most of them represented by Lys residues. Interestingly, the βstrands of MCa encompass most of the basic domains (see Fig. 1A). MCa turned to be of interest to our research group for several reasons. First, it is an exquisite pharmacological activator of the ryanodine receptor type 1 (RyR1) from skeletal muscle since it promotes high Po gating modes and long-lasting subconductance states of the ion channel (3,4). On myotubes, application of MCa rapidly induces Ca 2+ release from the sarcoplasmic reticulum (SR) (5), a result further confirmed by positive effect of MCa on the release of Ca 2+ from purified SR vesicles (3,5). The interaction of MCa with RyR1 has been witnessed by increased [ 3 H]ryanodine binding onto purified RyR1 (3,5). The binding site for MCa on RyR1 has also been mapped and shown to correspond to domain(s) that have a predicted localization within the cytoplasm (6). Second, similarly to imperatoxin A (7) for which this was first noted, MCa has an interesting sequence homology with the II-III loop of the L-type calcium channel Ca v 1.1 subunit over a domain that is slightly larger than the second βstrand of MCa (see Fig. 1A) (5). This loop is predominantly involved in excitation-contraction coupling through direct molecular interactions with RyR1 (6,8). This homology has been a source of inspiration to understand how toxins may interfere with the process of excitationcontraction coupling (4,8,9). Third, and this is the scope of this manuscript, MCa has been shown to act as a cell penetrating peptide (CPP) (10). This discovery stemmed from earlier criticisms that MCa may not be an activator of RyR1 because peptide toxins were not known to cross the plasma membrane, which would be required here to bind to RyR1. Studies that were undertaken to demonstrate the ability of MCa to reach its target showed that i) MCa triggers Ca 2+ release from the sarcoplasmic reticulum a few seconds after its application in the extracellular medium (5), and ii) intracellular accumulation of fluorescentstreptavidine occurs if it incubated first with biotinylated MCa (10). Since these pioneering studies, MCa or analogues thereof proved powerful vectors for the cell entry of proteins, peptides (11), nanoparticles, or drugs such as doxorubicine (12)(13)(14)(15). Although the mode of cell penetration of MCa may vary according to cargo nature, cell type or chemical linkage employed, the data gathered so far suggest that the peptide may enter cells according to two priming steps onto the plasma membrane: first an interaction with proteoglycans with an affinity in the micromolar range, followed by a second interaction with negatively charged lipids which occurs with greater affinity (16,17). The mode of cell entry of MCa is not altered by the absence of proteoglycans, but simply reduced quantitatively, suggesting that proteoglycans do not orient the mode of cell penetration. Two modes seem to concur to MCa cell entry, as far as observed, one related to macropinocytosis and another to membrane translocation. The balance between both modes of entry was found correlated to cargo nature and the type of MCa analogue used.
It is of great interest to pursue the study of MCa as CPP in spite of the wealth of new CPP sequences that are discovered yearly. Among the competitive advantage of MCa over other CPP sequences are the facts that it has almost no associated toxicity in vitro and in vivo, penetrates into cells at very low concentrations, and is extremely stable in vivo upon intravenous injection (over 24 hrsmanuscript in preparation). While MCa appears as an elaborate and efficient CPP, its pharmacological properties represent a serious hindrance while envisioning in vitro and in vivo applications. In addition, because of its length (33 amino acid residues) and the presence of three disulfide bridges, MCa is a relatively difficult to synthesize CPP, comparatively to other CPP, and would benefit from a downsizing approach. Several strategies have been successfully employed in the past to overcome one or both of these issues. The first strategy was based on single point mutations of MCa sequence. This strategy preserved the disulfide bridges and the 3D structure of the analogues. Overall, mutations affected more seriously the pharmacology of MCa than the cell penetration properties (18). Many of the amino acids involved in RyR1 binding and pharmacology were located within the cluster of basic amino acids that presented sequence homology with the L-type Ca v 1.1 channel. Some of these residues, but not all, were also important for cell penetration properties. Hence, several analogues could be defined that kept close to intact cell penetration properties while entirely losing their pharmacological action (MCa R24A for instance). Some other analogues were actually better than MCa itself for cell penetration suggesting that pairs of mutations, aiming at disrupting pharmacology and improving penetration, may be used in the future to define still better CPP analogues of MCa. The second strategy, that has yield success, is based on the chemical synthesis of D-MCa, an analogue entirely based on the use of D-amino acids. This peptide is a mirror image of the natural L-MCa but, like other D-CPP, preserves its cell penetration properties, while losing entirely its ability to interact with RyR1 (19). This method has several advantages. It no longer is sensitive to proteases which may be an additional advantage for in vivo experiments where the half-life of the circulating peptide matters. It is also possible to improve this analogue by introducing point mutations shown previously to improve cell penetration (19). In these two strategies, while being effective, one may argue that i) the peptides are still among the longest CPP known to date, implying increased costs of production, and ii) the yield of production of these peptides is hampered by the folding process. Also, the use of peptides with internal disulfide bridges, despite have advantageous features in term of stability in vivo, makes chemical coupling of these CPP to cargoes more complicated (difficulty to add extra Cys residues to the peptides for instance without interfering with the correct folding process). The third strategy that was used to circumvent one of this criticism was the chemical synthesis of an MCa analogue in which all internal Cys residues where replaced by isosteric 2-aminobutyric acid residues (11). The resulting peptide was still 33-mer long but one step in production was saved by avoiding the folding process. In addition, an extra-Cys residue could be added to the N-terminus of the peptide in order to favor simplified cargo grafting on this CPP analogue. This peptide, termed here C-MCa UF1-33 (C for extra-Cys, UF for unfolded, and 1-33 for its length, Fig. 1B) has no longer any secondary structures, but efficiently penetrates into cells. Interestingly also, the peptide completely lacks pharmacological activity indicating that folding and secondary structures are essential for binding onto RyR1. While this peptide is an efficient CPP, it remains less potent than MCa in its folded version suggesting that further optimization should be brought to this analogue. Such optimization appears feasible on the basis of the fact that MCa fulfills three different functions (pharmacology, obligation to resemble the L-type channel, and cell penetration). We reasoned that since only cell penetration was the quality we searched for, the peptide could be further simplified and novel analogues be designed. In this study, we undertook to identify new more potent MCa analogues with three criteria in mind. First, these analogues should be shorter than the folded version of MCa (MCa F ) or the unfolded version (MCa UF ). Second, these peptides should be designed in order to better delimitate the domains of MCa responsible for cell penetration. Third, we should be able to design analogues with extra free SH functions for cargo grafting. We present several new analogues that have highly potent cell penetration capabilities, while losing pharmacological activity, preserving lack of cell toxicity, and with facilitated cargo grafting. This new generation of MCa analogues is predicted to have bright futures for CPP applications in vitro and in vivo.

Experimental procedures
Reagents -N-α-Fmoc-L-aminoacid, Wang-Tentagel resin and reagents used for peptide syntheses were obtained from Iris Biotech. Solvents were analytical grade products from Acros Organics. Cy5 maleimide mono-reactive dye was purchased from GE Healthcare. Solid-phase peptide syntheses -Chemical syntheses of MCa analogues were performed as previously described (19). Briefly, analogues of MCa were chemically synthesized by the solidphase method (20) using an automated peptide synthesizer (CEM© Liberty). Peptide chains were assembled stepwise on 0.24 mEq of Fmoc-D-Arg-Pbf-Wang-Tentagel resin using 0.24 mmol of N-αfluorenylmethyloxycarbonyl (Fmoc) L-amino-acid derivatives. The side-chain protecting groups were: Trityl for Cys and Asn, tert-butyl for Ser, Thr, Glu and Asp, Pbf for Arg and tert-butylcarbonyl for Lys.
Reagents were at the following concentrations: Fmoc-amino-acids (0.2 M Fmoc-AA-OH in dimethylformamide (DMF)), activator (0.5 M 2-(1H-benzotriazole-1-yl)-1,1,3,3tetramethyluronium hexafluorophosphate in DMF), activator base (2 M diisopropylethylamine in N-methyl-pyrrolidone (NMP)) and deprotecting agent (5% piperazine / 0.1 M 1hydroxybenzotriazole in DMF), as advised by PepDriver (CEM©). After peptide chain assembly, resins were treated 4 hrs at room temperature with a mixture of trifluoroacetic acid/water/triisopropylsilan (TIS)/dithiothreitol (DTT) (92.5/2.5/2.5/2.5). The peptide mixtures were then filtered and the filtrates were precipitated by adding cold t-butylmethyl ether. The crude peptides were pelleted by centrifugation (10.000 × g, 15 min) and the supernatants were discarded. MCa analogues were purified by HPLC using a Vydac C18 column (218TP1010, 25×10 cm). Elutions of the peptides were performed with a 10-60% acetonitrile linear gradient containing 0.1% trifluoroacetic acid. The purified fractions were analyzed by analytical RP-HPLC (Vydac C18 column 218TP104, 25 × 4.6 cm). All analogues were characterized by MALDI-TOF mass spectrometry. Labeling of peptide with Cy5 -Each peptide was labeled with Cy5 according to the manufacturer's protocol (GE Healthcare). Peptides were dissolved at 1 mg/ml in 0.1 M Na 2 CO 3 buffer, pH 9.3. 300 µl of the solubilized peptides were added to Cy5maleimide containing tubes. The mixtures were incubated during 2 hrs at room temperature and then purified by HPLC using an analytical Vydac C18 column. Elution of the Cy5-labeled peptides was performed with a 10-60% acetonitrile linear gradient containing 0.1% trifluoroacetic acid. The pure peak fractions were lyophilized and peptides quantified by UV spectrophotometer at 649 nm. Cell culture -Chinese hamster ovary (CHO) and F98 cell lines (from ATCC) were maintained at 37°C in 5% CO 2 in F-12 nutrient medium (InVitrogen) supplemented with 10% (v/v, CHO) or 2% (v/v, F98) heat-inactivated fetal bovine serum (InVitrogen) and 100 units/ml streptomycin and penicillin (InVitrogen). MTT assay -Cells were seeded into 96-well micro plates at a density of approximately 8×10 4 cells/well. After 2 days of culture, the cells were incubated for 24 hrs at 37°C with MCa analogues at a concentration of 10 μM. Control wells containing cell culture medium alone or with cells, both without peptide addition, were included in each experiment. 0.1% saponin was used as toxic agent for comparison. The cells were then incubated with 3-(4, 5-dimethylthiazol-2-yl)-2, 5diphenyl-tetrazolium bromide (MTT) for 30 min. Conversion of MTT into purple colored MTT formazan by the living cells indicates the extent of cell viability. The crystals were dissolved with dimethyl sulfoxide (DMSO) and the optical density was measured at 540 nm using a microplate reader (Biotek ELx-800, Mandel Scientific Inc.) for quantification of cell viability. All assays were run in triplicates. Confocal microscopy -For analysis of the subcellular localization of MCa-Cy5 analogues in living cells, cell cultures were incubated with the fluorescent peptides for 2 hrs, and then washed with phosphate-buffered saline (PBS) alone. The plasma membrane was stained with 5 μg/ml rhodamine-conjugated concanavalin A (Molecular Probes) for 5 min. Cells were washed once more. Live cells were then immediately analyzed by confocal laser scanning microscopy using a Leica TCS-SPE operating system. Rhodamine (580 nm) and Cy5 (670 nm) were sequentially excited and emission fluorescence were collected in z-confocal planes of 10-15 nm steps. Fluorescence Activated Cell Sorting -CHO cells were incubated with various concentrations of Cy5-labeled peptides in F-12K culture medium without serum at 37°C for 2 hrs. The cells were then washed with PBS to remove excess extracellular peptide and treated with 1 mg/ml trypsin (Invitrogen) for 5 min at 37°C to detach cells from the surface, and centrifuged at 200 g before suspension in PBS. For experiments with the macropinocytosis inhibitor, amiloride, CHO cells were initially washed with F-12K and preincubated for 30 min at 37°C with 1 mM amiloride (Sigma). The cells were then incubated for 2 hrs at 37°C with 1 µM of the Cy5-MCa analogues. For all these experimental conditions, flow cytometry analyses were performed with live cells using a Becton Dickinson FACS LSR II flow cytometer (BD Biosciences). Data were obtained and analyzed using FCS express software (De Novo). Live cells were gated by forward/side scattering from a total of 10,000 events. Preparation of heavy SR vesicles -Heavy SR vesicles were prepared following the method of Kim et al. (21). Protein concentration was measured by the Biuret method. [ 3 H]-ryanodine retained on the filters was measured by liquid scintillation. Non-specific binding was measured in the presence of 80 µM unlabeled ryanodine. The data are presented as mean ± S.E. Each experiment was performed in triplicate. Statistical analyses -All data are given as mean ± SD for n number of observations, and statistical significance (p) was calculated using Student's t test. Figure 1A illustrates the primary structure of MCa with its secondary structures (β strands) and its pattern of disulfide bridges. This peptide will be termed MCa F , for folded (F) MCa. An earlier report has demonstrated that replacing the six internal cysteine residues of MCa by Abu residues results in a pharmacologically-inert and unfolded (UF) CPP (MCa UF1-33 , Fig. 1B). This peptide loses its secondary structures (11). Since this project aims at identifying shorter CPP sequences based on MCa UF1-33 sequence by the delivery of Cy5 cargo, we first determined where at the N-terminus (C-MCa UF1-33 ) or C-terminus (MCa UF1-33 -C) the cargo could be best grafted after addition of an extra cysteine residue (C) (Fig. 1B). As shown, both vector/cargo complexes Cy5-C-MCa UF1-33 and MCa UF1-33 -C-Cy5 penetrated efficiently within CHO cells, as estimated by confocal microscopy (Fig. 1C) or by FACS (Fig. 1D). At 3 µM, a slightly better cell penetration was observed with Cy5 localized at the C-terminus of MCa UF1-33 , but this difference was not significant. Since chemical syntheses of truncated MCa UF1-33 analogues was facilitated by adding the extra cysteine residue at the Cterminus of the sequence rather than at the Nterminus, we kept on working on the basis of MCa UF1-33 -C sequence. Nevertheless, these data indicate for the first time that cargo grafting on the CPP MCa UF1-33 can be performed likewise at both extremities of the sequence. Next, we designed a series of truncated MCa UF -C peptides comprising either a C-terminal truncation (3 analogues: MCa UF1-20 -C, MCa UF1-15 -C and MCa UF1-9 -C), a N-terminal truncation (7 analogues: MCa UF8-33 -C, MCa UF11-33 -C, MCa UF14-33 -C, MCa UF18-33 -C, MCa UF20-33 -C, MCa UF22-33 -C and MCa UF25-33 -C), and Both N-and C-terminal truncations (2 analogues: MCa UF6-25 -C and MCa UF14-25 -C) ( Fig. 2A). All of these analogues were then labeled with Cy5 to investigate their cell penetration properties. Every one of these peptides has been designed in such a way that the cargo would be removed from the peptide upon trypsin cleavage. This was useful for the FACS experiments in which the fluorescence associated to the cells is measured after trypsin treatment, thereby potentially removing the cargo from peptides that would eventually be associated to the outer part of the plasma membrane. The net positive charges of the peptides were drastically different, ranging from 0 (MCa UF1-15 -C and MCa UF1-9 -C) to +8 (MCa UF8-33 -C). However, many of the peptides contained a percentage of positively charged residues equal (MCa UF-25-33 -C) or superior to MCa F or MCa UF1-33 (8 out of twelve analogues). Three analogues had a lower percentage of basic residues than MCa F (all three C-terminal truncated analogues, MCa UF1-20 -C, MCa UF1-15 -C and MCa UF1-9 -C).

Non folded truncated maurocalcine peptides are efficient CPP
We first evaluated by FACS the fluorescence accumulation within CHO cells that occurred after 2 hrs incubation with 3 µM of positively charged MCa peptides (net charge ≥ +5; Fig. 2B). This first study revealed several unexpected findings. First, all of the charged peptides (8 tested) demonstrated CPP properties. These peptides had all the K 22 R 23 R 24 sequence in common, a cluster of basic amino acid residues shown to contribute to the dose-efficacy of cell penetration of MCa F in an earlier study (18). Interestingly, removing the last 8 C-terminal amino acids of MCa had little impact on the cell penetration properties (if one compares MCa UF14-25 -C with MCa UF14-33 -C). Similarly, the removal of the amino acid region His 6 -Asn 13 did not drastically change cell penetration properties (MCa UF6-25 -C versus MCa UF14-25 -C). Second, all peptides appeared to behave better than the reference peptide MCa UF1-33 -C, suggesting that sequence truncation of MCa UF may represent a potent strategy to define more efficient CPP. Less positively charged peptides were also tested for their ability to penetrate into CHO cells (Fig.  2C). No less surprisingly, all peptides showed CPP properties, including two peptides with no net positive charge (MCa UF1-9 -C and MCa UF1-15 -C). MCa UF1-9 -C appeared as a better CPP than MCa UF1-15 -C suggesting that the Abu 10 -Asp 15 region introduces no competitive advantage and confirming results shown in Fig. 2B. This may represent an inhibitory region because of the presence of Glu 12 and Asp 15 , two negatively charged residues. The finding that mutation of Glu 12 to Ala enhances cell penetration of both MCa F (18) and MCa UF (11) further supports this conclusion. MCa UF25-33 -C turned out to have also CPP properties, even though this sequence did not confer a competitive advantage to other MCa CPP analogues as shown in Fig. 2B. Two additional experiments were conducted to confirm the specificity of these findings. First, a truncated charybdotoxin peptide was synthesized (ChTx UF1-12 -C) in which the internal Cys residue was replaced by Abu and an additional C-terminal Cys residue added for Cy5 labeling, as for our MCa UF analogues ( Supplementary Fig. 1A). As shown by confocal microscopy, ChTx UF1-12 -C was unable to deliver the Cy5 cargo at a higher concentration of 5 µM. Second, we also designed two MCa UF peptides in which, instead of using Abu derivatives, we mutated internal Cys residues by Ala residues. As shown, both MCa UF1-9(Ala) -C and MCa UF14-25(Ala) -C worked well for Cy5 delivery into CHO cells demonstrating that Abu residues were not responsible by themselves for the cell penetration properties of the peptides (Supplementary Fig.  1B,C). Ala residues were however not fully equivalent to Abu residues as dose-response curve was slightly better for MCa UF14-25(Ala) -C than for MCa UF14-25 -C, further arguing that Abu residues were not central to the cell penetrating properties of truncated MCa UF peptides (Supplementary Fig.  1D). The overall message from this first study is that all truncated MCa UF analogues can behave as CPP at the concentration tested. The findings suggest that MCa is a peptide fully specialized to achieve cell penetration including in domains that are not highly charged.
The intracellular distribution of all truncated MCa UF analogues bear resemblance with that of the full length MCa UF While all truncated derivatives of MCa UF1-33 show cell penetration properties according to the FACS analyses, we examined whether there were differences in intracellular distribution among these peptides. This question was investigated by confocal microscopy after 2 hrs of peptide accumulation into CHO cells (Fig. 3). Interestingly, all peptides showed very resembling intracellular distributions, although the degree of accumulated cell fluorescence varied somewhat with peptide sequences. In confirmation of the FACS results, the peptide that appeared to penetrate the least was the full length unfolded MCa, MCa UF1-33 -C-Cy5 (Fig.  3A). The vast majority of the fluorescence appears in punctuate dots within the cells. In many cases, these dots appear at higher concentrations within one pole of the cell (see labeling of MCa UF8-33 -C-Cy5, MCa UF11-33 -C-Cy5, MCa UF25-33 -C-Cy5, and MCa UF1-9 -C-Cy5 for instance). On various occasions also, all of the peptides tend to present a subplasma membrane distribution, forming a rim of smaller circumference than the concanavalin A labeling itself. This sub-plasma membrane rim localization was more evident for CHO cells labeled with MCa UF14-25 -C-Cy5 as illustrated in Fig.  4A. Finally, more rarely, a direct plasma membrane labeling by the peptide-cargo complex was observable (Fig. 4B). This type of labeling could be observed with N-terminal truncated vectors exclusively and was most evident for MCa UF22-33 -C-Cy5. The staining of the plasma membrane was always diffuse in contrast to intracellular staining which was mainly punctuated. Diffuse membrane labeling was also observed for MCa UF25-33 -C-Cy5 and MCa UF20-33 -C-Cy5, two peptides that differ from 2 to 3 amino acids of MCa UF22-33 -C-Cy5. It was difficult to evidence for the other vector/cargo complexes. We propose that this staining coincides with an alteration of the duration of peptide plasma membrane residency for these truncated MCa UF analogues. The lower occurrence of this diffuse staining for the other truncated variants may reflect faster internalization by endocytosis and/or membrane translocation. Globally, these effects reflect cell entry and distribution tendencies that were hard to quantify and they should therefore be interpreted with caution.
In an attempt to better apprehend peptide behavior at the plasma membrane, we quantified the extent of Cy5 / rhodamine staining colocalization. Rhodamine-positive staining was also Cy5-positive for 63% to 86% of the pixels (best performing peptides were MCa UF14-33 -C-Cy5, MCa UF18-33 -C-Cy5, MCa UF20-33 -C-Cy5 and MCa UF22-33 -C-Cy5; data not shown). This finding indicates that the peptides invade large membrane areas and that membrane interaction is not limited to small specialized surface areas. In contrast, Cy5-positive pixels were rhodamine-positive to far more variable extents (Fig. 4C). For instance, 10.1 ± 2.6% of MCa UF1-33 -C-Cy5, the reference compound, was colocalized with the plasma membrane indicator. In spite of the fact that short plasma membrane staining times were used (few minutes), a fraction of the colocalization that is quantified also corresponds to intracellular staining following ongoing endocytosis. Nevertheless, this result indicates that this peptide does not remain stuck within the plasma membrane during its 2 hrs incubation with CHO cells. It indicates relatively fast cell penetration thus. Many of the other peptides however behaved differently from MCa UF1-33 -C-Cy5. Indeed, several peptides show surprisingly higher colocalization with rhodamine (21.3 ± 2.6% for MCa UF11-33 -C-Cy5 and 30.4 ± 1.4% for MCa UF1-20 -C-Cy5 for instance). These higher values of colocalization indicate that some peptides remain for longer periods of time or at higher concentration within the plasma membrane. Alternatively, these peptides may rely more heavily on endocytosis for cell penetration and are present within intracellular organelles to which subsequent endocytotic vesicles that contain rhodamine labeling will fuse. Peptides most concerned by these behaviors were MCa UF11-33 -C-Cy5 and MCa UF14-33 -C-Cy5, that contained two or one of the CPP inhibitory negative charges (Glu 12 and Asp 15 ), and MCa UF1-9 -C-Cy5 and MCa UF1-20 -C-Cy5, that were poorly charged peptides.

Amiloride-sensitivity of the cell penetration of truncated MCa UF analogues
In earlier studies, we have demonstrated that the cell entry of MCa UF1-33 was largely sensitive to amiloride. Since amiloride is an exquisite blocker of macropinosomes (22)(23)(24), this suggested a predominant macropinocytosis mechanism for its cell penetration (17). However, it was likely that such a predominant reliance on macropinocytosis was also conferred by the cargo type transported (streptavidine in that report). We therefore conducted an in depth analysis of the amiloridesensitivity of the various truncated MCa UF peptides with Cy5 as cargo and quantified by FACS the degree of cell penetration inhibition. Fig. 5A illustrates the amiloride-sensitivity of four different truncated peptides. As shown, amiloride inhibits the cell penetration of MCa UF1-33 -C-Cy5 by 19.6% and of MCa UF1-15 -C-Cy5 by 39%. The finding that amiloride blocks to a far lesser extend the penetration of Cy5 compared to that of streptavidin (17) when MCa UF1-33 is the vector indicates the influence of the cargo nature on the mechanism of cell entry. Surprisingly, amiloride was found to enhance rather than inhibit the cell penetration of MCa UF20-33 -C-Cy5 and MCa UF18-33 -C-Cy5 (Fig. 5A). Preserving the plasma membrane from undergoing macropinocytosis may free surface areas for enhanced peptide translocation through the membrane. The effect of amiloride was always associated with a sharpening of the fluorescence intensity distribution in the x axis (see for instance MCa UF1-15 -C-Cy5), reflecting reduced cell heterogeneity for the mechanisms underlying peptide penetration. The amiloridesensitivity of cell penetration was further investigated for all truncated MCa UF peptides and the results presented in Fig. 5B. Four peptides showed higher amiloride-sensitivity than MCa UF1-

-C-Cy5
(MCa UF11-33 -C-Cy5, MCa UF25-33 -C-Cy5, MCa UF1-15 -C-Cy5 and MCa UF14-25 -C-Cy5). All other peptides showed reduced amiloride-sensitivities or a tendency for greater cell penetration under the effect of amiloride. We conclude that the Cy5 cargo does not promote macropinocytosis as the main route of peptide entry, and that truncation of MCa UF may lead to analogues that rely to a lesser extent on macropinocytosis for cell entry.
Comparative dose-dependent cell penetration of the MCa UF analogues While we compared the properties of cell penetration of truncated peptides at rather mild concentrations, we also aimed at comparing the dose-dependence of cell penetration of these peptides by FACS (Fig. 6). One example of such an analysis is shown for peptide MCa UF8-33 -C-Cy5 in Fig. 6A. 33 µM was the highest concentration that could be tested on CHO cells and obviously cell penetration did not show any sign of saturation for cell incubation times with this peptide of 2 hrs. The dose-dependent cell penetration were compared for all N-terminal truncated peptides (Fig. 6B), C-terminal truncated peptides (Fig. 6C), and double truncated analogues (Fig. 6D) with the same settings. These analyses confirm that MCa UF1-33 -C-Cy5 is the least-performing cell penetrating peptide. Most truncated peptides show detectable cell penetration at concentrations equal or above 1 µM. One remarkable exception to this rule was noticeable. MCa UF1-9 -C-Cy5 shows an unusual dosedependent penetration with detectable cell penetration at 10 nM and only small progressive increases in fluorescence intensity with higher peptide concentrations (Fig. 6C). This peptide was therefore the best performing peptide for cell penetration at low concentrations. Finally, additional information that could be taken from these analyses is that the peptides differed significantly with regard to the maximal extent of cell penetration. Among the N-terminal truncated MCa UF analogues, MCa UF18-33 -C-Cy5 performed drastically better than the other truncated peptides (Fig. 6B). The difference in cell penetration among MCa UF11-33 -C-Cy5 and MCa UF18-33 -C-Cy5 resides in the removal of KENKDAbuAbu sequence which we presume is inhibitory to some extent because of the presence of Glu 12 and Asp 15 . Among the C-terminal truncated peptides, MCa UF1-20 -C-Cy5 was performing as well as MCa UF8-33 -C-Cy5, and although not tested at higher concentrations, MCa UF1-9 -C-Cy5 would be expected to perform still better. Finally, for N-and Cterminal truncated analogues, the best peptide turns out to be MCa UF14-25 -C-Cy5 that yields the greatest fluorescence accumulation at 33 µM compared to all other truncated MCa UF analogues. While all these peptides performed quite well, we were curious to compare them to a classical CPP under similar experimental conditions. Cell penetration was observed for TAT-C in CHO cells and dose-response curve was equivalent or even less favorable for TAT than for many MCa UF peptides ( Supplementary Fig. 1E,F). This was further largely confirmed when comparing the vector properties of TAT-C and MCa UF1-9 -C in yet another cell type, the glioma F98 rat cell line ( Supplementary Fig. 2). 3 µM MCa UF1-9 -C proved better than TAT-C at 3 and 10 µM. Truncated MCa UF peptides lack pharmacological effects and are predominantly non toxic An earlier report has shown that MCa UF1-33 is unable to interact with MCa's target, the ryanodine receptor RyR1 (11). This is due to the loss of secondary structures owing to the lack of internal disulfide bridging. We did therefore expect that truncated analogues of MCa UF should also be pharmacologically inert. This hypothesis was challenged by testing the ability of the Cy5free peptides to stimulate [ 3 H]-ryanodine binding (Fig. 7A). As shown, contrary to MCa F that contains secondary structures and disulfide bridges, none of the peptides we designed had an effect on [ 3 H]ryanodine binding. Finally, the peptides were challenged for their toxicity by incubating CHO cells with 1 or 10 µM peptide concentrations for an extended duration (24 hrs) that far exceeds the duration challenged for cell penetration (Fig. 7B). A 10 µM peptide concentration was generally slightly more toxic than 1 µM, except for MCa UF14-25 -C. At 1 µM, toxicity never exceeded 8% and significances of these effects were negligible. In contrast, toxicity could reach 20% at 10 µM peptide concentration and these effects had higher significance. Most peptides behaved equally well or better than MCa UF1-33 -C indicating that truncation did not enhance cell toxicity of the peptides.

Discussion
MCa F is a rather large and complex CPP with its three disulfide bridges which makes its use in in vitro and in vivo applications more delicate because of production yield (peptide with correct disulfide bridging) and cargo coupling (thiol groups are for the moment difficult to use). One may also argue that the size of the peptide introduces additional synthesis cost compared to other popular in use CPP such as Tat and penetratin. While size and folding most likely adds a competitive advantage for in vivo applications because of peptide stability issues, it may be of interest to derive small CPP from MCa that could have a broader use than MCa itself. The study we conducted not only aimed at defining these smaller MCa-derived CPP sequences, but also provided interesting clues on how MCa may have evolved for cell penetration. The most surprising finding from this study was therefore that all of the analogues we defined could be considered as CPP, which suggests that MCa is a highly specialized sequence for the purpose of cell penetration. While evidently none of the peptides could compete with MCa F itself for cell penetration, it is nevertheless obvious that many of the truncated MCa UF peptides are better CPP than MCa UF1-33 itself. These findings lead to two general conclusions. First, folding and disulfide bridging appears to be a prerequisite to properly optimize the CPP potential of all the MCa domains. It is highly likely that the secondary structures triggered by disulfide bridging play a key role in that respect. We therefore believe that it would be worthwhile to design additional MCa analogues presenting both truncations (like in the present study) and 1 to 2 disulfide bridges in order to regain some of the secondary structures that confer a competitive advantage to MCa F for cell penetration. Second, the fact that all the truncated sequences are CPP that possess different properties and efficacies, suggests that there is further room for MCa cell penetrating optimization. In this study, our data indicate that the amino acid region Lys 11 -Ser 18 would be an ideal target for mutagenesis. Glu 12 mutation has already been shown to improve cell penetration on various analogues (11,18). We now sense that Asp 15 may also represent an excellent target for further optimization of MCa's properties. Double mutagenesis may be an option. However, it will be essential to determine whether the disulfide bridging and folding ability of the native peptide is not affected by this procedure. Truncated MCa UF peptides may also benefit from further mutagenesis of negatively charged residues, such as MCa UF1-20 . It is noteworthy that another well behaving truncated CPP, MCa UF18-33 , also contains a Glu residue at position 29 that could represent another target for mutagenesis and peptide optimization. This is also the case for MCa UF1-9 that contains an Asp residue at position 2. Several other findings of this report merit some comments. First, we demonstrate that cargo coupling can occur at the N-terminus as well as the C-terminus of MCa enhancing the flexibility of cargo coupling to our vectors. Second, while it is obvious that most of our truncated MCa UF analogues remain heavily basic, we also found out that poorly charged MCa peptides could behave as efficient CPP. This is the case for MCa UF1-9 which is one of our best performing CPP, especially if one wishes to work with low CPP concentrations. Third, small size CPP can be derived from MCa UF , MCa UF1-9 and MCa UF25-33 being our smallest peptides. While we did not test greater truncation, it is not impossible that still smaller CPP might be designed on the basis of MCa sequence. Fourth, the truncated MCa UF analogues differ somewhat in their mode of cell penetration, some being more prone to enter cells by macropinocytosis than others. Various peptides were even insensitive to amiloride application suggesting that macropinocytosis did not contribute at all to their entry. We can't exclude at this stage that these peptides rely on other modes of endocytosis than macropinocytosis. This is suggested by the punctuate nature of intracellular Cy5 fluorescence distribution. How reliably a punctuate distribution reveals endocytosis is however not known, and one can't rule out also that this distribution is due to peptide aggregation with or without lipid components.
When plasma membrane distribution could be observed, large diffuse staining was evident indicating that when peptides encountered the membrane no such aggregation occurred. In any case, many of the peptides had distribution that mostly differed from plasma membrane labeling (including these membranes that had undergone endocytosis) indicating that peptide intracellular distribution may not necessarily colocalize with the plasma membrane components. Refined studies will be required however to define how much of the peptides enter cells by direct membrane translocation versus endocytosis. To conclude on this study, we identified several interesting lead CPP based on MCa UF truncation strategy. This is the case for MCa  (macropinocytosis entry-independent), MCa  (penetrates better at low concentration), and MCa UF14-25 (yields the greatest cell entry of the dye). These peptides are easy to produce, yield good cell penetration, and we should be able to further improve their cell penetrating characteristics by mutagenesis or by reintroducing one disulfide bridge to restore some of the secondary structures. While the scope of this study was not to compare our mini MCa with other popular CPP, we were surprised to find out that many of the MCa UF vectors behaved far better than TAT for the transport of Cy5, both in CHO and F98 cells. It may be argued that the field doesn't require many more CPP sequences. However, a significant fraction of the ongoing research is based on the use of TAT, which no longer appears as the most competitive peptide. Also, increasing evidences suggest that CPP sequences differ in their cell-type targeting properties, which will undoubtedly represent an important feature to exploit when it comes to developing in vivo applications. Other essential parameters for in vivo applications will be the peptide stability upon intravenous or intra-peritoneal injection. In that respect, we found out that MCa is a particularly stable peptide with and without disulfide bridges (unpublished observations). These still "hidden" aspects of the CPP features may be advantageously exploited later on if the number of known CPP is high. The advantages of using smaller CPP over MCa F remains to be investigated as one may of course argue also that they require higher concentrations and possibly longer incubation times for effectiveness, two factors that may lead to spurious signaling events in vivo.   analogues. CHO cells were incubated 2 hrs with 3 µM of MCa UF -C-Cy5 vector/cargo complexes, before extensive washing, membrane labeling with rhodamine-conjugated concanavalin A, and live cell confocal microscopy imaging. Cy5 is in blue, and rhodamine in red. White arrows illustrate a tendency for a preferential apical localization of the Cy5 dye. Yellow arrows illustrate the tendency for a sub-plasma membrane labeling of the Cy5 dye.