Originally published In Press as doi:10.1074/jbc.M205358200 on July 31, 2002
J. Biol. Chem., Vol. 277, Issue 41, 38803-38809, October 11, 2002
Biological Potential of a Functional Human SNAIL
Retrogene*
Annamaria
Locascio
,
Sonia
Vega§,
Cristina A.
de Frutos,
Miguel
Manzanares¶
, and
M. Angela
Nieto
From the Instituto Cajal, Consejo Superior de Investigaciones
Científicas, Av. Doctor Arce 37, 28002 Madrid, Spain
Received for publication, May 30, 2002, and in revised form, July 26, 2002
 |
ABSTRACT |
Snail genes encode zinc finger
transcription factors required for the development of vertebrate and
invertebrate embryos. They trigger epithelial to mesenchymal
transitions (EMTs), thereby allowing epithelial cells to emigrate from
their place of origin and form tissues such as the mesoderm and the
neural crest. Snail genes are also involved in the EMTs
responsible for the acquisition of invasiveness during tumor
progression. This aspect of their activity is associated with their
ability to directly repress E-cadherin transcription. Here
we describe the existence of an active human Snail
retrogene, inserted within an intron of a novel evolutionarily
conserved gene and expressed in different human tissues and cell lines.
Functional analyses in cell culture show that this retrogene maintains
the potential to induce EMTs, conferring migratory and invasive
properties to epithelial cells. In light of this data, we have renamed
it SNAIL-like, a new player that must be considered in both
physiological and pathological studies of SNAIL function in humans.
 |
INTRODUCTION |
Snail genes are zinc finger transcription factors with important
functions in vertebrates and invertebrates. After the identification of
Snail in Drosophila (1), multiple family
members have been isolated, leading to their classification in two main
families, Snail and Scratch (2). A common feature of the superfamily is
a role in neural development, whereas a conserved function in mesoderm
formation is associated with the Snail family (2, 3). Additional roles
in cell division, cell survival, left-right asymmetry, and wing and
limb development have also been described in different species (3,
4).
Within the Snail family, a duplication event led to the formation of
Snail and Slug in vertebrates, which evolved by
dividing their ancestral functions and acquiring new ones. An example
of new function can be seen during the development of the neural crest,
a population of cells involved in the formation of the vertebrate head
(5, 6). These genes are required for the acquisition of migratory
properties by both the neural crest and the mesoderm, principally by
triggering the epithelial-mesenchymal transition
(EMT)1 that confers upon
epithelial cells the capacity to migrate through the extracellular
matrix (7, 8). Indeed, the lethality of Snail mutant mice at
gastrulation is due to a defect in EMT during mesoderm formation (9).
In this respect, Snail has been shown to act as a direct repressor of
E-cadherin transcription both during embryonic development
and tumor progression. A direct correlation has been observed between
Snail activation and the acquisition of invasive and metastatic
properties in human tumor cell lines of different epithelial origin
(10-12). In addition, Snail is expressed at the invasive
front of mouse skin tumors and human breast carcinomas (10, 13,
14).
In addition to SNAIL (HUGO Genome Nomenclature Committee
approved symbol: SNAI1), an extremely related sequence was
found in the human genome and classified as a nonfunctional
retro-transcribed pseudogene (SNAI1P; Refs. 15 and 16).
Retrogenes result from the reverse transcription of an mRNA and
subsequent insertion into the genome mediated by different transposable
sequences such as retrovirus, LINE, or Alu elements (17). In general,
pseudogenes are not expressed due to the absence of promoter elements
in the region of insertion. Even when they have the ability of being expressed after integration, the lack of regulated expression or the
absence of selective pressure leads to their rapid inactivation, often
as a result of accumulated changes that render the protein nonfunctional (18-24). However, an expressed retrotransposed gene can
acquire adaptive mutations that lead to its functional differentiation and the acquisition of new properties (25).
Here we show that the SNAIL retrogene inserted in the human
genome constitutes a transcription unit that we propose to call SNAIL-like (SNAI1L). It has been subjected to
positive selection leading to the maintenance of the complete open
reading frame and the conservation of its zinc finger DNA binding (3)
and SNAG transactivation domains (26). It is integrated into a
new gene (2q34-X) and possesses a regulated pattern of
expression different from that of both SNAIL and
2q34-X. We also show that in a similar manner to SNAIL, it
is able to induce EMT in epithelial cell lines and maintains the
capacity of repressing E-cadherin expression. The biological
potential of SNAIL-like makes it an important subject for
functional studies in tumor progression. In addition, its similarity to
SNAIL might have been the cause of incorrect evaluation of
SNAIL expression in some studies carried out in human cells
lines and tumors.
 |
EXPERIMENTAL PROCEDURES |
Sequence Analysis--
Sequence comparisons were performed using
BLAST (Ref. 27; www.ncbi.nlm.nih.gov/BLAST). Gene predictions were
obtained with GENSCAN (Ref. 28; genes.mit.edu/GENSCAN.html) and those
of the pufferfish and ascidian exons using sequences from the Fugu
(fugu.hgmp.mrc.ac.uk) and the Ciona intestinalis
(jgi.doe.gov/programs/ciona.htm) genome projects. Transcripts, genes,
and ESTs present in the region analyzed (2q34) were confirmed via the
Ensembl Human Genome Server (www.ensembl.org, chromosome 2, nucleotides
219550835-220050835 view). Sequence alignments were carried out using
Clustal (29) and corrected by visual inspection.
Northern Blot and RT-PCR Analyses--
Poly(A)+ RNA from cell
lines was purified by oligo(dT)-cellulose chromatography (30). For
Northern blot assays the entire SNAIL-like or
SNAIL coding fragments were labeled using the Rediprime II
kit (Amersham Biosciences), and GAPD
(glyceraldehyde-3-phosphate dehydrogenase) was used as a control of
mRNA quantity. Adult human cDNAs were obtained from BD
Biosciences, whereas poly(A)+ was isolated from the different tumor
cell lines or stable transfectant clones (Microfast Track isolation
kit, Invitrogen) and treated with DNase I before cDNA synthesis.
PCR for E-cadherin, Box A, Box B, SNAIL-like, and
5'-SNAIL-like were performed over 35 cycles at an annealing
temperature of 65-70 °C. For all SNAIL amplifications from adult tissues, nested PCR reactions were carried out under the
same conditions. GAPD was amplified after 30 cycles at an annealing temperature of 60 °C. Primer sequences are available upon
request. Semi-quantitative analysis was carried out by densitometry of
the products over different number of cycles in the linear phase of amplification.
Generation and Characterization of SNAIL and SNAIL-like Stable
Transfectants--
SNAIL and SNAIL-like coding
sequences were amplified by RT-PCR from poly(A)+ RNA from the A375P
cell line. Primer sequences are available upon request. The amplified
fragments were cloned in the pZEOSV2+ vector (Invitrogen) and
transfected with LipofectAMINE Plus (31). Stable transfectants were
generated in MDCK cells after selection with Zeocine. Six independent
clones were analyzed from each pZeo-SNAI1 and
pZeo-SNAI1L and from mock pZeo transfections. The expression
of E-cadherin, SNAIL, and SNAIL-like was analyzed in stable transfected MDCK cells by RT-PCR. For immunofluorescence analysis, cells were grown on coverslips in 6-cm cell culture dishes
and fixed 24-48 h after transfection (32, 33).
Generation and Characterization of SNAIL and SNAIL-like-inducible
Transient Transfectants--
The same amplified fragments were fused
to a mutated version of the ligand binding domain of the human estrogen
receptor that recognizes the synthetic ligand 4-OH-tamoxifen. The
fragment corresponding to the binding site was obtained from the vector
pCre-ERT2 (34), kindly provided by P. Chambon. The final
fragments either containing SNAIL or SNAIL-like
and another one containing the internal ribosomal entry site and
EGFP sequences were subcloned in pcDNA3. The constructs were
transfected with LipofectAMINE Plus (31). 4-OH-tamoxifen (200 nM) was added 24 h after transfection, and cells were
fixed for immunofluorescence analysis after 72 h of treatment.
Cell Culture Assays--
For migration assays, the cells were
seeded in T6-well culture dishes at a density of 3 × 105 cells/well. A wound was incised 24 h later, and
cells were observed at different time intervals. Invasion assays on
collagen type-IV gels were carried out using the two-compartment Boyden
chamber in duplicate samples (35).
Promoter Analysis--
For the analysis of the
E-cadherin promoter, MCA3D cells were co-transfected with 50 ng of Renilla vector and 0.4 µg of pGL2 vector (Promega)
containing the E-cadherin promoter fused to the Luc reporter
gene together with either 50 ng of pZeo-SNAI1,
pZeo-SNAI1L, or control pZeo plasmids. Luciferase and
Renilla activities were assayed using the dual-luciferase
reporter system kit (Promega), and the activity normalized to that of
the promoter cotransfected with the control pZeo vector. The mouse
Snail expression construct was as described (10).
 |
RESULTS |
Analysis and Expression of an Intronless SNAIL-like Retrotransposed
Sequence--
An exhaustive analysis of different ESTs and genomic
databases using specific and diagnostic sequences to the Snail genes revealed multiple new members from Drosophila,
Caenorhabditis elegans, zebrafish, and humans (2). Among
these, we found a genomic sequence from human chromosome 2 (GenBankTM accession number AC006385) containing a 1863-bp
segment highly related to an intronless transcript of the
SNAIL gene (SNAI1). The nucleotide similarity was
81.9% in the coding region, 75% over a 254-bp stretch of the 5'-UTR,
and 74.6% over 848 bp of the 3'-UTR. The region similar to
SNAIL was followed by an Alu-like sequence in the 5' end and
flanked by 15-bp inverted repeats at both 5' and 3' ends. Outside of
this region, similarity drops to non-significant levels. This is strong
evidence that this gene resulted from a retrotranscription of a
SNAIL mRNA followed by its insertion in the genome.
Indeed, other groups recently described it as a processed
non-transcribed pseudogene and named it SNAI1P (15, 16).
Inspection of the coding region of SNAI1P showed that the
changes from the parental SNAIL sequence involved 11.8% of
the residues not including two insertions of 3 and 9 bp and one
deletion of 30 bp. None of these changes disrupted the open reading
frame. Evidences shown below indicate that this sequence is
transcribed, and we call it hereafter SNAIL- like
(SNAI1L). The change of name and symbol for this gene
has been approved by the HGNC
(www.gene.ucl.ac.uk/nomenclature/genefamily/snail.html).
Comparison of the deduced amino acid sequences of SNAIL and SNAIL-like
(Fig. 1A) shows that the
changes are not distributed equally along the whole length of the
predicted protein. No changes were observed in the first 9 amino acids
(aa)/27 bp, the highly conserved SNAG domain, present in all vertebrate
Snail family members (2) and implicated in the repressor activity of
these transcription factors (36-38).

View larger version (35K):
[in this window]
[in a new window]
|
Fig. 1.
Sequence comparison of SNAIL
(SNAI1) and SNAIL-like
(SNAI1L) predicted products. A,
SNAI1 and SNAI1L protein sequences show 100% similarity in the SNAG
domain (light gray) and 90.1% in the zinc finger domain
(dark gray). The intermediate region of the protein shows a
much higher degree of variation (77.6% similarity). B,
analysis of the differences based on the alignment of nucleotides and
amino acids. n, number of residues; m, number of
changes; S, synonymous changes; NS,
non-synonymous changes; t, transitions; T,
transversions.
|
|
We divided the remainder of the SNAIL-like protein into the C-terminal
zinc finger DNA binding domain and an intermediate putative protein
interaction domain. The zinc finger domain showed a 9.9% change in
amino acids, whereas the intermediate region had undergone a 22.4%
variation (Fig. 1A). This bias toward changes in the
intermediate region was also reflected in the nucleotide sequence. The
analysis of the substitutions in SNAIL-like
versus SNAIL-coding region is shown in Fig.
1B. The nature of nucleotide substitutions in terms of the
ratios of transitions (t)/transvections (T) and of synonymous (S,
non-aa-changing)/non-synonymous (NS, aa-changing) variations can be
used as an indicator of the degree of positive selection within the
amino acid sequence (39). We found no difference between the
intermediate and C-terminal regions with respect to t/T, but the
percentage of NS changes was higher in the intermediate region. These
data are indicative of a selective pressure that has led to the
conservation of the coding potential of SNAIL-like, making
it prone to produce an active protein.
The first indication that this sequence is actively transcribed was the
identification of two human ESTs derived from the 3'-UTR of
SNAIL-like. To directly assess the expression of this transcript, we carried out RT-PCR studies in panels of human adult tissues and tumor cell lines using specific pairs of primers. SNAIL-like was expressed in a subset of the tissues that
normally express SNAIL (Fig.
2A) and in all the cell lines
analyzed (Fig. 2B). Semi-quantitative analysis by RT-PCR
indicates that in the tissues and cell lines where the two genes are
expressed, their transcripts are represented at similar levels.

View larger version (27K):
[in this window]
[in a new window]
|
Fig. 2.
SNAIL (SNAI1) and
SNAIL-like (SNAI1L) expression in
human tissues and cell lines was analyzed by RT-PCR. Expression in
a panel of human adult tissues (A) and carcinoma cell lines
(B) of different etiologies is shown. SNAI1 was
expressed in the majority of adult tissues analyzed (A) and
in the invasive cell lines (B). SNAI1L is
expressed in a subset of the tissues that express SNAI1
(A) and in all carcinoma cell lines analyzed (B).
s. muscle, skeletal muscle. T, tumorigenic;
I, invasive; M, metastatic.
|
|
SNAIL-like Is Integrated within an Intron of an Evolutionarily
Conserved Novel Gene--
The differential expression of
SNAIL-like indicates that it contains a functional promoter
and that its expression is controlled by cis-regulatory elements either
present at the site of genomic insertion or included in the
retrotransposed sequence. If the latter were true, such elements might
derive from the original SNAIL gene or from transposable
and/or repeated elements (17). To address these questions, we studied
the genomic organization of the region where SNAIL-like is inserted.
When searching for known genes in the vicinity of
SNAIL-like, we found that the MAP2
(microtubule-associated protein 2) gene mapped 78 kb 5' and the
RPE (ribulose 5-phosphatase 3-epimerase) gene mapped 193 kb
3' of SNAIL-like (Fig.
3A). However, the expression patterns of these genes could not account for the differential expression of SNAIL-like (neural-specific for
MAP2 (40) and ubiquitous for RPE (41)). GENSCAN
analysis (28) on the 271-kb genomic segment located between these two
genes identified a predicted gene that spanned more than 200 kb and
coded for a product of 2915 aa, which we have called 2q34-X.
The prediction contained part of the coding region of
SNAIL-like, lacked the initiation and the stop codons, and
is a composite of three independent predicted transcripts,
ENST00000236970, ENST00000272845, and ENST00000272846 (www.ensembl.org/), spanning three contiguous bacterial
artificial chromosomes (BACs) (accession numbers AC006385, AC006464, and AC007038).

View larger version (14K):
[in this window]
[in a new window]
|
Fig. 3.
Genomic organization of the region
surrounding SNAIL-like (SNAI1L) on
human chromosome 2q34. A, map showing the location of
MAP2, RPE, SNAI1L, and the putative
2q34-X gene. Predicted exons of 2q34-X and
SNAI1L are shown to scale but not MAP2 nor
RPE. Not all exons of 2q34-X are displayed. Exons
coding for boxes A and B are shown. B,
comparison of predicted proteins from C. elegans and
D. melanogaster related to human 2q34-X. Boxes
show regions where similarity is higher than 70% at the amino acid
levels among the three products.
|
|
In the search for potential homologues, we found that the C-terminal
third of this product corresponds to the KIAA1843 protein, identified
in a large scale sequencing and expression study of large cDNAs
expressed in the human brain (42). We also found a predicted protein of
3147 aa from C. elegans (F25C8.3) that showed significant
similarity to 2q34-X distributed in blocks along the whole length of
both products (Fig. 3B). In Drosophila melanogaster, the predicted gene product CG18437 (252 aa long) matches the N-terminal end of the F25C8.3 and 2q34-X proteins. Analysis
of the Drosophila genomic segment containing this gene (accession number AE003762) unveiled a putative product of 3044 aa,
similar to both human 2q34-X and C. elegans F25C8.3 along its whole length (Fig. 3B). Therefore, we have identified an
evolutionary conserved gene family that encodes proteins of ~3000 aa,
spanning 10 kb in C. elegans, 40 kb in
Drosophila, and 200 kb in human. The predicted proteins do
not contain any known domain or similarity to other known proteins that
might provide us with a clue as to their function.
These data provide evidence that 2q34-X represents a true
transcription unit encoding a functional protein. Indeed, when the human predicted protein was used in searches with TBLASTN, we detected
multiple matching ESTs distributed over its whole length. Among these,
we found evidence for fusion transcripts including 5'
regions of 2q34-X and the majority of the coding region of SNAIL-like, similar although not identical to the gene
prediction described above. Other ESTs contained exons surrounding the
point of insertion of SNAIL-like, but not retrotransposed
sequences, pointing to a complex transcriptional profile including both
independent and fusion transcripts.
Compared Expression of SNAIL-like and 2q34-X--
The possible
presence of both independent and fusion transcripts from the
2q34-X and SNAIL-like genes led us to analyze
their relative expression patterns. Because of the similarity between SNAIL and SNAIL-like sequences, they
cross-hybridize when used as probes in Northern blot analyses. Thus, we
selected several cell lines, including MCF7 because it expresses
SNAIL-like but not SNAIL. A transcript of the
expected size of complete SNAIL-like or SNAIL
transcripts (around 2 kb) was detected in MCF7, MDA, and A375P RNA
preparations, indicating that at least in the MCF-7 cell line,
independent SNAIL-like transcripts are present. In addition,
signals for transcripts of more than 6.5 kb were observed in A375P,
which may correspond to a fusion transcript with 2q34-X exons (not shown).
To compare the expression of 2q34-X and
SNAIL-like by RT-PCR, we designed primers to amplify two
regions of the putative 2q34-X protein coding sequence,
boxes A and B, that surround the point of insertion of the
retrotransposed fragment in human, C. elegans and
Drosophila (Fig. 3). These boxes were chosen since similar sequences were found in mouse ESTs and in genomic sequences from the
teleost fish Takifugu rubripes and the urochordate
Ciona intestinalis, extending the phylogenetic range where
this new gene is present (Fig.
4A). We detected Box B
transcripts in adult brain and pancreas (Fig. 4B) and in the
melanoma A375P and colon carcinoma LoVo cells (Fig. 4C), a
subset of those cell lines expressing SNAIL-like. The same
results were obtained for the amplification of Box A in all cases (not
shown).

View larger version (34K):
[in this window]
[in a new window]
|
Fig. 4.
Comparison of SNAIL-like
(SNAI1L) and 2q34-X expression
in human tissues and tumor-derived cell lines. A,
comparison of the predicted amino acid sequences encoded by boxes
A and B from human, mouse, T. rubripes,
C. intestinalis, D. melanogaster, and C. elegans. B and C, RT-PCR analysis of the
expression of box B and SNAI1L in adult tissues
(B) and human cell lines (C). Expression of
box A was identical to that of box B. In
C, SNAI1L expression was also tested using
primers from the 5'-UTR, specific for independent SNAI1L
transcripts. GAPD was used as an internal control.
|
|
To discard the possibility that some SNAIL-like
amplifications originated from fusion transcripts, we performed a
RT-PCR analysis with primers specific to its 5'-UTR, absent from both
the gene prediction and the EST fusion transcripts. The expression
profile obtained was identical to that described (Figs. 2, A
and B, and 4C), corroborating the existence of
independent SNAIL-like transcripts.
SNAIL-like Can Induce EMT in Epithelial Cells in
Culture--
To identify the possible function of
SNAIL-like, we overexpressed the coding region of the
retrotransposed gene in a cell line (MDCK) and compared its effects
with those observed after overexpression of the original
SNAIL gene. We have previously shown that stable
transfectants of mouse Snail in these cells causes a
dramatic change in phenotype (epithelial to fibroblastic), mirroring
the EMT observed in developing embryos (10).
Parental MDCK cells and mock transfectant cells grow as a polarized
monolayer with E-cadherin localized at cell junctions (Fig.
5A). In contrast, both SNAIL
and SNAIL-like transfected cells adopted a spindle-like shape,
down-regulate E-cadherin expression, and no longer
established tight cell junctions (Fig. 5A). The loss of
E-cadherin was confirmed by RT-PCR analysis (data not shown). Thus,
SNAIL-like maintains the capacity to induce EMTs associated
with the loss of E-cadherin expression in MDCK cells. This
is not a property of particular constitutively expressing clones, since
it is also observed in an inducible system. We have used a
tamoxifen-inducible system to express SNAIL and SNAIL-like in MCDK
cells and found that the transfected cells, which also express EGFP,
have lost E-cadherin expression at the cell junctions (Fig.
5B).

View larger version (49K):
[in this window]
[in a new window]
|
Fig. 5.
Transfection of SNAIL
(SNAI1) and SNAIL-like
(SNAI1L) into MDCK cells induces EMT.
A, phase contrast, E-cadherin and vimentin immunostaining of
mock, SNAI1, and SNAI1L stable transfectants. Although mock-transfected
clones retain an epithelial morphology, both SNAI1 and SNAI1L show a
fibroblastic phenotype, clear down-regulation of E-cadherin, and an
increase and redistribution of vimentin. B, double-labeled
images showing E-cadherin (red) and EGFP expression
(green) in mock, SNAI1-, and SNAI1L-transfected cells using
a tamoxifen-inducible system in transient transfection analysis. EGFP
staining depicts transfected cells. Observe that both SNAIL-
and SNAIL-like-transfected cells have lost E-cadherin
expression after tamoxifen treatment. Tam,
4-OH-tamoxifen.
|
|
To further characterize the effect of SNAIL and
SNAIL-like overexpression on MDCK cells, we analyzed their
migratory properties in a wound healing assay with the stable
transformants (Fig. 6). Twelve hours
after the incision was made, cultures of mock transfectants showed no
signs of wound healing (Fig. 6, A and B), whereas
SNAIL and SNAIL-like transfectants invaded the
area of the wound after only 6 h (Fig. 6, C-F). The
invasiveness of the cells was determined by assaying their capacity to
migrate through a collagen matrix. Only cells expressing
SNAIL or SNAIL-like were able to go through the
collagen gel (Fig. 6, G-I). Thus, SNAIL-like has
the capacity to induce a fibroblastic transformation and to render MDCK
epithelial cells migratory and invasive.

View larger version (127K):
[in this window]
[in a new window]
|
Fig. 6.
Migratory and invasive behavior of
SNAIL (SNAI1) and
SNAIL-like (SNAI1L) stable
transfectants. Motility was tested in cultures using a wound
healing assay (A-F) and invasiveness in an assay of
migration through a collagen matrix (G-I). Mock
transfectants (A, B, and G) did not
show migratory nor invasive properties, in clear contrast to SNAI1
(C, D, and H) or SNAI1L (E,
F, and I) clones, which covered the wound in
6 h and were able to migrate through the collagen matrix in
24 h.
|
|
SNAIL-like Maintains the Ability to Repress E-cadherin
Transcription--
To check the ability of SNAIL-like to
repress the activity of the E-cadherin promoter, we
co-transfected SNAI1- and SNAI1L-pZeo vectors with a reporter luciferase construct containing the mouse proximal E-cadherin promoter (10) in the mouse MCA3D
keratinocyte cell line, which expresses high levels of E-cadherin. This
promoter fragment contains the specific binding site for Snail-mediated repression (10). The average values obtained in nine independent experiments are shown in Fig. 7. The
strongest repression (60% of the control value) was observed when
cotransfecting the mouse E-cadherin promoter with a mouse
Snail construct. SNAIL and SNAIL-like are weaker repressors, surely due to being human proteins working in a
mouse context (both promoter and cell line). Nevertheless, SNAIL-like cotransfected cells showed 45% of the repressor
activity evidenced by SNAIL.

View larger version (16K):
[in this window]
[in a new window]
|
Fig. 7.
SNAIL (SNAI1) and
SNAIL-like (SNAI1L) repressor
activity on E-cadherin promoter. MCA3D keratinocyte
cells were co-transfected with the mouse E-cadherin promoter
fused to a luciferase reporter gene and either the control pZeo empty
vector, SNAI1, SNAI1L, or mouse Snail
(mSna) expression constructs. Luciferase activity was
measured in 9 independent experiments 24 h after transfection, and
each value is represented relative to that of the control
experiment ± S.D.
|
|
 |
DISCUSSION |
SNAIL-like Is an Encoding Gene Subjected to Positive
Selection--
Gene duplication is one of the driving forces that
model genomes during evolution (43). Retrotransposition has been
considered as one of the possible mechanisms responsible for these
events (25). However, although examples of reverse transcription and insertion of an mRNA are well documented (17), cases of functional duplicates with proven biological significance are very rare. In this
work we show that a retrotransposed copy of human SNAIL previously described as a non-active pseudogene, SNAI1P (15, 16), encodes a fully functional factor that retains some of the key
roles in directing cellular phenotype. This is one of the few examples,
if not the only one, of a duplicated transcription factor originated by
retrotransposition that remains active. Thus, this gene is an
additional member of the Snail gene superfamily and,
therefore, more properly named as SNAIL-like.
Sequence analysis reveals three pieces of evidence to support this view
as follows. (i) SNAIL-like conserves a complete open reading
frame. (ii) The sequence similarity to SNAIL extends to the
5'- and 3'-untranslated regions. (iii) The number and nature of
nucleotide changes are unequally distributed along the coding region.
The retrogene holds more changes in the region responsible for putative
interactions with other proteins than in the DNA binding region (the
zinc finger domain). In addition, the transactivation SNAG domain (26),
fundamental for the repressor activity of Snail family members in
several species including humans, is unaltered (36-38). A high degree
of similarity is also observed in the UTR sequences, suggesting that
these regions may also be functional. This analysis argues in favor of
a high positive selection (39, 43) and suggests that this retrogene
encodes a functional protein.
Transcriptional Control of SNAIL-like Expression--
Confirmation
that SNAIL-like is expressed is provided by the existence of
ESTs derived from the 3'-UTR of SNAIL-like and the analysis
of expression in human adult tissues and cell lines. The parental gene,
SNAIL, was expressed in many adult tissues. Because it is
known to be expressed in fibroblasts and mesenchymal cells (10, 11, 14)
we cannot discard the possibility that the observed expression in some
tissues is due to the presence of these cell types. Interestingly,
SNAIL-like shows a much more restricted expression in adult
tissues, implying a specific transcriptional control, different from
that of the original gene. Transcripts were detected in brain and
pancreas but not in heart, placenta, and lung, the tissues that were
previously analyzed and that led to the idea that this retrogene was a
non-expressed pseudogene (15).
Regarding expression in the cell lines, SNAIL-like is
expressed in cells derived from different human breast and colon
carcinomas and one melanoma, whereas SNAIL shows a more
restricted pattern. Thus, once again, SNAIL-like presents a
distinct expression pattern, indicating that promoter and regulatory
control elements must drive its expression. Because it is extremely
unlikely that a retrogene inserts in the proximity to a promoter, the
most simple explanation is that part of the 5'-UTR sequences of the
retrotransposed SNAIL-like gene fulfils this function. This
seems to be the case since a fragment containing part of the
corresponding 5'-UTR sequences of the SNAIL gene shows
promoter activity in cotransfection assays in cell
culture.2 This suggests that
SNAIL may have two alternative promoters, a more distal one,
responsible for the transcript that was the template of the reverse
transcription, and a second proximal promoter used by
SNAIL-like.
The cis-regulatory elements responsible for the differential expression
in human tissues could be present in the genomic region where
SNAIL-like was inserted or included in the retrotranscribed sequence (17). We have found that SNAIL-like is integrated
in an intron of a novel gene, 2q34-X. Therefore, this
indicates that it is located in an active region of the genome,
accessible to the transcriptional machinery.
The comparison of the expression of SNAIL, SNAIL-like, and
2q34-X indicated that SNAIL-like and
2q34-X are present in a subset of
SNAIL-expressing tissues. This suggests that the expression of SNAIL-like may be driven by regulatory elements shared
with or from the 2q34-X gene. However, the presence of
SNAIL-like transcripts in all cell lines analyzed regardless
of the expression of both SNAIL and 2q34-X
indicates that the retrotransposed sequence must have lost repressor
elements and/or contain additional positive cis-regulatory elements.
Independent SNAIL-like transcripts exist, as confirmed in
MCF-7 cells. Interestingly, human ESTs that contain sequences of both
genes exist. Indeed, A375P cells, which express both genes as assessed
by RT-PCR, reveal a transcript bigger than 6 kb in addition to one of
the expected size after hybridization with the SNAIL-like
probe. Thus, both independent and fusion transcripts are generated.
Although nothing is known about the product of the novel
2q34-X gene, its conservation in different species from C. elegans to human suggests that it fulfils a relevant
function, which can be maintained in humans due to the presence of
independent transcripts.
Conserved and Divergent Functions of SNAIL and
SNAIL-like--
Recent studies demonstrate the critical role of mouse
Snail in the acquisition of malignant properties during tumor
progression through the direct repression of E-cadherin
expression (10, 11). The expression of SNAIL-like in all
tumor cell lines analyzed does not correlate with this behavior.
However, overexpression in MDCK cells showed that as with the mouse
(13, 14) and the human parental gene (this work), SNAIL-like can induce
a complete EMT.
It is not surprising that, as the mouse and human genes,
SNAIL-like is able to bind the E-boxes present in the
E-cadherin promoter, since the DNA binding domain was
subjected to selective pressure and maintains a high similarity with
that of the original gene. However, the repression of
E-cadherin is weaker. Additional E-cadherin
repressors that, as Snail family members, bind to E-boxes, have been
recently described (44, 45). These evidences support a model in which
the different factors co-operate, compete, and/or interact in the
transcription complex (45). If the interaction with other partners is
needed to achieve maximal repressor activity, the lower efficiency
shown in different carcinoma-derived cell lines may be due to the
absence of such factors or to the alterations in its putative
protein-protein interaction domain of SNAIL-like. High
levels of expression may overcome the need for additional proteins,
something that would explain its ability to induce a complete EMT in
stable transfectants. Control of SNAIL-like activity at the
translational level cannot be excluded.
Because the ectopic expression of E-cadherin in fibroblastic cells is
not sufficient to induce a complete reversion to an epithelial
phenotype (46), SNAIL must have additional targets. Thus, it remains
possible that the structural changes in SNAIL-like may affect the
regulation of these targets in a different way to that of
E-cadherin. Finally, it is possible that additional functions have been acquired associated to the products of the independent SNAIL-like transcripts and/or of the
2q34-X/SNAIL-like fusion transcripts.
SNAIL-like and the Analysis of SNAIL Expression in
Tumors--
Having shown that SNAIL-like is widely
expressed in different tumor cell lines and some normal adult tissues,
several important issues emerge. First of all, its potential to induce
a full EMT has to be considered both in physiological and pathological
studies. It could be speculated that an increase in the amount of this gene product may render it fully competent to induce the phenotypical changes that accompany the acquisition of invasive properties. Thus,
its expression might constitute a susceptibility factor to develop
malignancy. On the other hand and in contrast to SNAIL, its
transcription does not directly correlate with invasiveness, making it
extremely important to differentiate the detection of these two very
similar genes in expression studies in human tumor cell lines or tumor
biopsies. Although several studies have shown a correlation between
SNAIL expression and dedifferentiation in human samples
(10-14), others have described SNAIL expression regardless of the level of E-cadherin expression in human tumors (47) or cell
lines (48). In these cases, due to sequence similarity and experimental
approaches (RT-PCR and Northern, respectively), the failure to detect a
correlation may be the result of the inadvertent amplification of
SNAIL-like. Indeed, as assessed by RT-PCR, the epithelial
breast tumor-derived cell line MCF-7 does not express SNAIL
(10) but expresses the retrogene (this work) and shows a
positive signal in Northern analysis that was interpreted as expression
of the parental SNAIL gene (48). In cell lines, this problem
can be avoided by the use of SNAIL-like-specific
primers such as those used in this study. In conclusion, we have shown here that SNAIL-like is a new human member of the Snail
family that must be taken into consideration for functional studies.
 |
ACKNOWLEDGEMENTS |
We are grateful to H. Peinado for help in the
promoter analysis, to A. G. de Herreros for sharing unpublished
information, to Santiago Rodriguez de Córdoba for insightful
comments on this manuscript, to C. Azuara for technical assistance, and
to M. J. Blanco and other members of the lab for helpful discussions.
 |
FOOTNOTES |
*
This work was supported by grants FIS-01/985,
Dirección General de Enseñanza Superior e
Investigación Científica (DGESIC) Grant PM98-0125, and
CAM 08.1/0044/2000 (to M. A. N.).The costs of publication of this
article were defrayed in part by the
payment of page charges. The article
must therefore be hereby marked
"advertisement" in
accordance with 18 U.S.C. Section
1734 solely to indicate this fact.
Supported by a postdoctoral fellowship from the Spanish Secretary
of Education and Universities and the European Social Fund.
§
Supported by a postdoctoral fellowship from the Spanish Ministry of Health.
¶
Present address: Instituto de Investigaciones
Biomédicas, CSIC-UAM, Arturo Duperier, 4, 28029 Madrid, Spain.
To whom correspondence should be addressed. Tel.:
34-91-5854723; Fax: 34-91-5854754; E-mail: anieto@cajal.csic.es (to
M. A. N.) or Tel.: 34-91-5854736; E-mail:
mmanzanares@iib.uam.es (to M. M.).
Published, JBC Papers in Press, July 31, 2002, DOI 10.1074/jbc.M205358200
2
A. G. de Herreros, personal communication.
 |
ABBREVIATIONS |
The abbreviations used are:
EMT, mesenchymal
transition;
SNAG, Snail/Gfi;
Alu, primate-specific short interspread
sequences;
EGFP, enhanced green fluorescent protein;
GAPD, glyceraldehyde-3-phosphate dehydrogenase;
RT, reverse transcription;
MDCK cells, Madin-Darby canine kidney cells;
UTR, untranslated region;
aa, amino acids;
kb, kilobase(s).
 |
REFERENCES |
| 1.
|
Grau, Y.,
Carteret, C.,
and Simpson, P.
(1984)
Genetics
108,
347-360[Abstract/Free Full Text]
|
| 2.
|
Manzanares, M.,
Locascio, A.,
and Nieto, M. A.
(2001)
Trends Genet.
17,
178-181[CrossRef][Medline]
[Order article via Infotrieve]
|
| 3.
|
Nieto, M. A.
(2002)
Nat. Rev. Mol. Cell Biol.
3,
155-166[CrossRef][Medline]
[Order article via Infotrieve]
|
| 4.
|
Hemavathy, K.,
Ashraf, S. I.,
and Ip, Y. T.
(2000)
Gene
257,
1-12[CrossRef][Medline]
[Order article via Infotrieve]
|
| 5.
|
Gans, C.,
and Northcutt, R. G.
(1983)
Science
220,
268-274[Abstract/Free Full Text]
|
| 6.
|
Nieto, M. A.
(2001)
Mech. Dev.
105,
27-35[CrossRef][Medline]
[Order article via Infotrieve]
|
| 7.
|
Hay, E. D.
(1995)
Acta Anat.
154,
8-20[Medline]
[Order article via Infotrieve]
|
| 8.
|
Duband, J. L.,
Monier, F.,
Delannet, M.,
and Newgreen, D.
(1995)
Acta Anat.
154,
63-78[Medline]
[Order article via Infotrieve]
|
| 9.
|
Carver, E. A.,
Jiang, R.,
Lan, Y.,
Oram, K. F.,
and Gridley, T.
(2001)
Mol. Cell. Biol.
21,
8184-8188[Abstract/Free Full Text]
|
| 10.
|
Cano, A.,
Perez-Moreno, M. A.,
Rodrigo, I.,
Locascio, A.,
Blanco, M. J.,
del Barrio, M. G.,
Portillo, F.,
and Nieto, M. A.
(2000)
Nat. Cell Biol.
2,
76-83[CrossRef][Medline]
[Order article via Infotrieve]
|
| 11.
|
Batlle, E.,
Sancho, E.,
Franci, C.,
Dominguez, D.,
Monfar, M.,
Baulida, J.,
and Garcia De Herreros, A.
(2000)
Nat. Cell Biol.
2,
84-89[CrossRef][Medline]
[Order article via Infotrieve]
|
| 12.
|
Poser, I.,
Dominguez, D.,
de Herreros, A. G.,
Varnai, A.,
Buettner, R.,
and Bosserhoff, A. K.
(2001)
J. Biol. Chem.
276,
24661-24666[Abstract/Free Full Text]
|
| 13.
|
Cheng, C. W., Wu, P. E., Yu, J. C.,
Huang, C. S.,
Yue, C. T., Wu, C. W.,
and Shen, C. Y.
(2001)
Oncogene
20,
3814-3823[CrossRef][Medline]
[Order article via Infotrieve]
|
| 14.
|
Blanco, M. J.,
Moreno-Bueno, G.,
Sarrio, D.,
Locascio, A.,
Cano, A.,
Palacios, J.,
and Nieto, M. A.
(2002)
Oncogene
21,
3241-3246[CrossRef][Medline]
[Order article via Infotrieve]
|
| 15.
|
Paznekas, W. A.,
Okajima, K.,
Schertzer, M.,
Wood, S.,
and Jabs, E. W.
(1999)
Genomics
62,
42-49[CrossRef][Medline]
[Order article via Infotrieve]
|
| 16.
|
Twigg, S. R.,
and Wilkie, A. O.
(1999)
Hum Genet.
105,
320-326[CrossRef][Medline]
[Order article via Infotrieve]
|
| 17.
|
Brosius, J.
(1999)
Gene
238,
115-134[CrossRef][Medline]
[Order article via Infotrieve]
|
| 18.
|
Boer, P. H.,
Adram, C. N.,
Lau, Y. F.,
and McBurney, M. W.
(1987)
Mol. Cell. Biol.
7,
3107-3112[Abstract/Free Full Text]
|
| 19.
|
Adra, C. N.,
Ellis, N. A.,
and McBurney, M. W.
(1988)
Somatic Cell Mol. Genet.
14,
69-81[CrossRef][Medline]
[Order article via Infotrieve]
|
| 20.
|
Dahl, H. H.,
Brown, R. M.,
Hutchison, W. M.,
Maragos, C,
and Brown, G. K.
(1990)
Genomics
8,
225-232[CrossRef][Medline]
[Order article via Infotrieve]
|
| 21.
|
Fitzgerald, J.,
Hutchison, W. M.,
and Dahl, H. H.
(1992)
Biochim. Biophys. Acta
1131,
83-90[Medline]
[Order article via Infotrieve]
|
| 22.
|
Sargent, C. A.,
Young, C.,
Marsh, S.,
Ferguson-Smith, M. A.,
and Affara, N. A.
(1994)
Hum. Mol. Genet.
3,
1317-1324[Abstract/Free Full Text]
|
| 23.
|
Lahn, B. T.,
and Page, D. C.
(1999)
Nat. Genet.
21,
429-433[CrossRef][Medline]
[Order article via Infotrieve]
|
| 24.
|
Pan, Y.,
Decker, W. K.,
Hug, A. H. H. M.,
and Craigen, W. J.
(1999)
Genomics
59,
282-290[CrossRef][Medline]
[Order article via Infotrieve]
|
| 25.
|
Brosius, J.
(1991)
Science
251,
753[Free Full Text]
|
| 26.
|
Grimes, H. L.,
Chan, T. O.,
Zweidler-McKay, P. A.,
Tong, B.,
and Tsichlis, P. N.
(1996)
Mol. Cell. Biol.
16,
6263-6272[Abstract]
|
| 27.
|
Altschul, S. F.,
Gish, W.,
Miller, W.,
Myers, E. W.,
and Lipman, D. J.
(1990)
J. Mol. Biol.
215,
403-410[CrossRef][Medline]
[Order article via Infotrieve]
|
| 28.
|
Burge, C.,
and Karlin, S.
(1997)
J. Mol. Biol.
268,
78-94[CrossRef][Medline]
[Order article via Infotrieve]
|
| 29.
|
Thompson, J. D.,
Higgins, D. G.,
and Gibson, T. J.
(1994)
Nucleic Acids Res.
22,
4673-4680[Abstract/Free Full Text]
|
| 30.
|
Vennstrom, B.,
and Bishop, J. M.
(1982)
Cell
28,
135-143[CrossRef][Medline]
[Order article via Infotrieve]
|
| 31.
|
Faraldo, M. L.,
Rodrigo, I.,
Behrens, J.,
Birchmeier, W.,
and Cano, A.
(1997)
Mol. Carcinog.
20,
33-47[CrossRef][Medline]
[Order article via Infotrieve]
|
| 32.
|
Lozano, E.,
and Cano, A.
(1998)
Cell Adhes. Commun.
6,
51-67[Medline]
[Order article via Infotrieve]
|
| 33.
|
Gómez, M.,
Navarro, P.,
and Cano, A.
(1994)
Invasion Metastasis
14,
17-26[Medline]
[Order article via Infotrieve]
|
| 34.
|
Feil, R.,
Wagner, J.,
Metzger, D.,
and Chambon, P.
(1997)
Biochem. Biophys. Res. Commun.
237,
752-757[CrossRef][Medline]
[Order article via Infotrieve]
|
| 35.
|
Llorens, A.,
Rodrigo, I.,
Lopez-Barcons, L.,
Gonzalez-Garrigues, M.,
Lozano, E.,
Vinyals, A.,
Quintanilla, M.,
Cano, A.,
and Fabra, A.
(1998)
Lab. Invest.
78,
1131-1142[Medline]
[Order article via Infotrieve]
|
| 36.
|
Nakayama, H.,
Scott, I. C.,
and Cross, J. C.
(1998)
Dev. Biol.
199,
150-163[CrossRef][Medline]
[Order article via Infotrieve]
|
| 37.
|
Kataoka, H.,
Murayama, T.,
Yokode, M.,
Mori, S.,
Sano, H.,
Ozaki, H.,
Yokota, Y.,
Nishikawa, S.,
and Kita, T.
(2000)
Nucleic Acids Res.
28,
626-633[Abstract/Free Full Text]
|
| 38.
|
Hemavathy, K.,
Guru, S. C.,
Harris, J.,
Chen, J. D.,
and Ip, Y. T.
(2000)
Mol. Cell. Biol.
20,
5087-5095[Abstract/Free Full Text]
|
| 39.
|
Hughes, A. L.,
Green, J. A.,
Garbayo, J. M.,
and Roberts, R. M.
(2000)
Proc. Natl. Acad. Sci. U. S. A.
97,
3319-3323[Abstract/Free Full Text]
|
| 40.
|
Lewis, S. A.,
Villasante, A.,
Sherline, P.,
and Cowan, N. J.
(1986)
J. Cell Biol.
102,
2098-2105[Abstract/Free Full Text]
|
| 41.
|
Spencer, N.,
and Hopkinson, D. A.
(1980)
Ann. Hum. Genet.
43,
335-342[Medline]
[Order article via Infotrieve]
|
| 42.
|
Nagase, T.,
Nakayama, M.,
Nakajima, D.,
Kikuno, R.,
and Ohara, O.
(2001)
DNA Res.
8,
85-95[Abstract]
|
| 43.
|
Lynch, M.,
and Conery, J. S.
(2000)
Science
290,
1151-1155[Abstract/Free Full Text]
|
| 44.
|
Comijn, J.,
Berx, G.,
Vermassen, P.,
Verschueren, K.,
van Grunsven, L.,
Bruyneel, E.,
Mareel, M.,
Huylebroeck, D.,
and van Roy, F.
(2001)
Mol. Cell
7,
1267-1278[CrossRef][Medline]
[Order article via Infotrieve]
|
| 45.
|
Pérez-Moreno, M.,
Locascio, A.,
Rodrigo, I.,
Dhont, G.,
Portillo, F.,
Nieto, M. A.,
and Cano, A.
(2001)
J. Biol. Chem.
276,
27424-27431[Abstract/Free Full Text]
|
| 46.
|
Navarro, P.,
Lozano, E.,
and Cano, A.
(1993)
J. Cell Sci.
105,
923-934[Abstract]
|
| 47.
|
Saito, T.,
Oda, Y.,
Sugimachi, K.,
Kawaguchi, K.,
Tamiya, S.,
Tanaka, K.,
Matsuda, S.,
Sakamoto, A.,
Iwamoto, Y.,
and Tsuneyoshi, M.
(2001)
Am. J. Pathol.
159,
2114-2117
|
| 48.
|
Hajra, K. M.,
Chen, D. Y-S.,
and Fearon, E. R.
(2002)
Cancer Res.
62,
1613-1618[Abstract/Free Full Text]
|
Copyright © 2002 by The American Society for Biochemistry and Molecular Biology, Inc.

CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
S. Banerjee, G. Dhar, I. Haque, S. Kambhampati, S. Mehta, K. Sengupta, O. Tawfik, T. A. Phillips, and S. K. Banerjee
CCN5/WISP-2 Expression in Breast Adenocarcinoma Is Associated with Less Frequent Progression of the Disease and Suppresses the Invasive Phenotypes of Tumor Cells
Cancer Res.,
September 15, 2008;
68(18):
7606 - 7612.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Olmeda, G. Moreno-Bueno, J. M. Flores, A. Fabra, F. Portillo, and A. Cano
SNAI1 Is Required for Tumor Growth and Lymph Node Metastasis of Human Breast Carcinoma MDA-MB-231 Cells
Cancer Res.,
December 15, 2007;
67(24):
11721 - 11731.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Pena, J. M. Garcia, J. Silva, V. Garcia, R. Rodriguez, I. Alonso, I. Millan, C. Salas, A. G. de Herreros, A. Munoz, et al.
E-cadherin and vitamin D receptor regulation by SNAIL and ZEB1 in colon cancer: clinicopathological correlations
Hum. Mol. Genet.,
November 15, 2005;
14(22):
3361 - 3370.
[Abstract]
[Full Text]
[PDF]
|
 |
|