Long Non-coding RNAs (LncRNA) Regulated by Transforming Growth Factor (TGF) β

Background: Long noncoding RNAs (LncRNA) are emerging as key regulators in various biological processes. However, their role in epithelial-to-mesenchymal transition (EMT) remains elusive. Results: A subset of lncRNAs are dysregulated upon transforming growth factor (TGF) β-induced EMT, and lncRNA-HIT mediates this process. Conclusion: LncRNAs such as lncRNA-HIT ((HOXA transcript induced by TGFβ) play a pivotal role in EMT and breast cancer progression. Significance: Here we profiled lncRNAs in TGFβ-induced EMT and identified a novel conserved lncRNA-HIT. Long noncoding RNAs (lncRNAs) are emerging as key regulators in various biological processes. Epithelial-to-mesenchymal transition (EMT) is a developmental process hijacked by tumor cells to depart from the primary tumor site, invade surrounding tissue, and establish distant metastases. Transforming growth factor β (TGFβ) signaling has been shown to be a major inducer of EMT and to facilitate breast cancer metastasis. However, the role of lncRNAs in this process remains largely unknown. Here we report a genome-wide lncRNA profile in mouse mammary epithelial NMuMG cells upon TGFβ induction of EMT. Among 10,802 lncRNAs profiled, over 600 were up-regulated and down-regulated during the EMT, respectively. Furthermore, we identify that lncRNA-HIT (HOXA transcript induced by TGFβ) mediates TGFβ function, i.e. depletion of lncRNA-HIT inhibits TGFβ-induced migration, invasion, and EMT in NMuMG. LncRNA-HIT is also significantly elevated in the highly metastatic 4T1 cells. Knockdown of lncRNA-HIT in 4T1 results in decrease of cell migration, invasion, tumor growth, and metastasis. E-cadherin was identified as a major target of lncRNA-HIT. Moreover, lncRNA-HIT is conserved in humans and elevated expression associates with more invasive human primary breast carcinoma. Collectively, these data suggest that a subset of lncRNAs such as lncRNA-HIT play a significant role in regulation of EMT and breast cancer invasion and metastasis, and could be potential therapeutic targets in breast cancers.


Long noncoding RNAs (lncRNAs) are emerging as key regulators in various biological processes. Epithelial-to-mesenchymal transition (EMT) is a developmental process hijacked by tumor cells to depart from the primary tumor site, invade surrounding tissue, and establish distant metastases.
Transforming growth factor ␤ (TGF␤) signaling has been shown to be a major inducer of EMT and to facilitate breast cancer metastasis. However, the role of lncRNAs in this process remains largely unknown. Here we report a genome-wide lncRNA profile in mouse mammary epithelial NMuMG cells upon TGF␤ induction of EMT. Among 10,802 lncRNAs profiled, over 600 were up-regulated and down-regulated during the EMT, respectively. Furthermore, we identify that lncRNA-HIT (HOXA transcript induced by TGF␤) mediates TGF␤ function, i.e. depletion of lncRNA-HIT inhibits TGF␤-induced migration, invasion, and EMT in NMuMG. LncRNA-HIT is also significantly elevated in the highly metastatic 4T1 cells. Knockdown of lncRNA-HIT in 4T1 results in decrease of cell migration, invasion, tumor growth, and metastasis. E-cadherin was identified as a major target of lncRNA-HIT. Moreover, lncRNA-HIT is conserved in humans and elevated expression associates with more invasive human primary breast carcinoma. Collectively, these data suggest that a subset of lncRNAs such as lncRNA-HIT play a significant role in regulation of EMT and breast cancer invasion and metastasis, and could be potential therapeutic targets in breast cancers.
Long noncoding RNAs (lncRNA) 2 are transcripts greater than 200 nucleotides that contain no open reading frame and lack protein coding capacity. Although they are much less conserved than protein coding genes and microRNA, accumulating evidence suggests that lncRNAs function in a broad range of cellular processes, such as cell growth, survival, migration, invasion, and differentiation (1)(2)(3)(4). The TGF␤ pathway is in part responsible for the epithelial to mesenchymal transition (EMT), a process by which primary epithelial cells acquire mesenchymal gene signatures to become more motile and invasive eventually leading to metastasis. LncRNAs regulated by TGF␤ and their contribution to EMT has yet to be established in mammary epithelial cells.
TGF␤ binds to a heteromeric complex of transmembrane serine/threonine kinases, the type I and II TGF␤ receptors (T␤RI and T␤RII). Following ligand binding to T␤RII, the type I receptor is recruited to the ligand-receptor complex, where the constitutively active T␤RII transactivates T␤RI. Activated T␤RI phosphorylates the receptor-specific Smad2 and Smad3. Phosphorylated Smad2/Smad3 associates with Smad4 as a heteromeric complex and translocates to the nucleus, leading to the transcriptional induction or repression of a diverse array of genes (5). Previously, we and other have shown that protein coding genes and microRNAs that are regulated by the TGF␤ pathway are functionally important in driving EMT and breast cancer metastasis (6). Therefore, we hypothesized that lncRNAs could similarly be regulated in response to TGF␤, and play a role in breast tumor progression.
In this report, we profiled changes of lncRNAs in NMuMG cells following TGF␤ induction of EMT. AK020562, an uncharacterized lncRNA that locates in the Hoxa gene cluster, was significantly induced by TGF␤, and thus was named lncRNA-HIT. Depletion of lncRNA-HIT inhibits TGF␤-induced migration, invasion, and EMT in NMuMG, and decreases primary tumor growth and metastasis in a 4T1 orthotopic mouse xenograft model. In contrast to the protein coding genome, the conservation of lncRNA across species remains poorly defined and not very well understood (7,8). Despite this discrepancy, lncRNA-HIT is well conserved in sequence and chromosomal location from mouse to human. We were able to detect lncRNA-HIT in human and observed that increased expression directly correlates with breast cancer progression.

EXPERIMENTAL PROCEDURES
Cell Line and Treatment-NMuMG epithelial cells were purchased from the American Type Culture Collection (Manassas, VA). 4T1, 4T07, 168FARN, and 67NR cells were a kind gift of Fred Miller (Wayne State University). All cells were grown in complete medium, DMEM (Invitrogen) containing 10% fetal bovine serum (FBS) supplemented with 1 mM L-glutamine, penicillin/streptomycin, and non-essential amino acids (Gibco). NMuMG cells were treated with TGF␤ at a concentration of 5 ng/ml for the times indicated in the figures and legends. Cell transfection experiments were performed with Lipofectamine 2000 (Invitrogen).
LncRNA and mRNA Microarray-NCode TM Mouse Noncoding Microarray chip from Life Technologies (Carlsbad, CA), which contains 10,802 lncRNAs and 25,178 protein-coding genes, was used to interrogate lncRNA and mRNA changes in vehicleversus TGF␤ (5 ng/ml)-treated NMuMG cells after 24 h. TFG␤ was purchased from R&D Systems (Minneapolis, MN). Total RNA was isolated by TRIzol (Life Technologies), end labeled, and hybridized to array. Hybridization and analysis were performed in the Molecular Genomics Core at the H. Lee Moffitt Cancer Center.
RNA Isolation and Reverse Transcription-Quantitative Polymerase Chain Reaction (RT-qPCR)-Total RNAs were isolated with TRIzol reagent following the manufacturer's protocol and then subjected to RT reaction using a High Capacity cDNA Reverse Transcription Kit (Life Technologies). The RT product was used for subsequent qPCR. The qPCR was performed with SYBR Green 2ϫ Master Mix (Life Technologies) on ABI HT9600 from Applied Biosystems (Foster City, CA) and data were collected and analyzed using ABI SDS version 2.3. ⌬C T values were normalized to GAPDH, and ⌬⌬C T analysis was performed to calculate relative RNA expression. RT-qPCR primers and siRNAs were obtained from Integrated DNA Technologies (Coralville, IA) and their oligonucleotide sequences for array validation are listed in supplemental Table S3. LncRNA-HIT RT-qPCR primer sequences are as follows: forward 5Ј-GAATTG-GTCTCAGCCAGATTCG-3Ј and reverse 5Ј-GTCAAGTCAG-AGCTCAGTTCCC-3Ј. siRNAs sequences designed to target lncRNA-HIT are as follows: si-HIT-1, antisense 5Ј-CUAGAGU-UAGGAAGGUAUGUGAGACUU-3Ј and sense, 5Ј-GUCUCA-CAUACCUUCCUAACUCUAG-3Ј and si-HIT-2, antisense 5Ј-UUAAGGUCACAGACCACCUUGGAGGGU-3Ј and sense, 5Ј-CCUCCAAGGUGGUCUGUGACCUUAA-3Ј.
Plasmids-Full-length lncRNA-HIT cDNA was amplified using the FastStart High Fidelity PCR System (Roche Applied Science) and NMuMG genomic DNA as a template. Cloning primers contained EcoRI sites for inserting cDNA into the pcDNA3.1(ϩ) expression vector. Primer sequences used to amplify full-length lncRNA-HIT are as follows: forward, 5Ј-GGAATTCAGGTTTCGAGGAGATGAAAGTG-3Ј and reverse, 5Ј-GGAATTCTCTGAACACAATATCTCCTGTGC-3Ј. Plasmid was sequenced validated for PCR introduced errors and insert orientation following selection of positive clone. hE-cadherin-pcDNA3 was a kind gift from Barry Gumbiner and purchased through Addgene (9).
Invasion and Migration Assays-NMuMG and 4T1 cell lines were transfected with control siRNA (si-CTL), si-HIT-1 or -2. After transfection for 48 h, NMuMG and 4T1 cells were treated with TGF␤ (5 ng/ml) or vehicle control for 24 h and then seeded into the upper chamber of Boyden Chambers coated without (migration) and with (invasion) Matrigel. Top chambers contained serum-free media, whereas lower chambers had media containing 10% fetal bovine serum. After 16 h, invasion and migration were evaluated and quantified by estimating the mean Ϯ S.D. of 4 non-biased image fields.
Immunofluorescence, Immunoblotting, and Antibodies-The cells were grown to 60% confluence on coverslips and transfected with si-CTL or si-HIT-1 or -2 for 48 h. Subsequently, TGF␤ (5 ng/ml) was added to siRNA-transfected cells for the range of times indicated in the figure legends. Briefly, cells were washed with PBS, fixed with 10% formalin containing methanol, and permeabilized with 1% Nonidet P-40 in PBS. Cells were blocked in 10% normal goat serum for 1 h, and 1:200 dilution of primary antibodies were incubated at 4 o C overnight. Coverslips were washed in PBS 3 times and then appropriate secondary antibodies were added at 1:500 dilution. Coverslips were washed in TBS 3 times, counterstained with DAPI, and fixed for visualization. Western blot was performed as previously described (10). The band intensity of Western blots was quantified using ImageJ software and represented as ratio of target gene/␤-actin. Antibody against E-cadherin was purchased from BD Transduction Labs (San Jose, CA), Lamin A/C from Santa Cruz Biotechnology (Santa Cruz, CA), and antibodies for Vimentin and GAPDH from Cell Signaling Technologies (Danvers, MA). Alexa Fluor secondary anti-mouse and anti-rabbit antibodies for immunofluorescence were purchased from Life Technologies, and HRP-linked antibody for Western blot analysis was purchased from Sigma.
Luciferase Reporter Assay-E-cadherin promoter luciferase plasmid was co-transfected with vector or pcDNA-HIT for 48 h in NMuMG. Cells were then washed in PBS, lysed, and then firefly substrate was added to detect expression of luciferase activity. The reaction was stopped using Renilla substrate as a control and performed using the Promega Dual Luciferase Assay Kit (Madison, WI).
Soft Agar Colony Formation Assay-4T1 cells were transfected with si-CTL, si-HIT-1 or -2 for 48 h then washed, trypsinized, and resuspended in RPMI. A bottom layer of 0.6% agar in RPMI was poured and a layer of 0.3% agar in RPMI was layered on top containing 1,000,000 per 12-well and allowed to grow for 2 weeks. Four replicates were plated for each condition and quantitation is represented as mean Ϯ S.D, of colonies counted in 4 non-biased fields.
Orthotopic Xenograft Model-4T1 cells were transfected with si-CTL and si-HIT-1 for 48 h then washed, trypsinized, and resuspended in PBS at a concentration of 4 ϫ 10 6 cells/100 l. Cells were mixed with a 1:1 ratio with Matrigel from Corning (Manassas, VA) and injected into the lower mammary fat pad of Nu/Nu mice from Charles River (Wilmington, MA). Primary tumor growth was monitored via standard caliper measurements. At 4 weeks end point mice primary tumors were weighed and the lungs were removed, inflated with 10% buffered formalin, and stained using Bouin solution.
Locked Nucleic Acid in Situ Hybridization of Formalin-fixed, Paraffin-embedded Tissue Microarray-LncRNA-HIT locked nucleic acid (LNA) probe was prepared by 5Ј end labeling with digoxigenin-ddUTP terminal transferase using the DIG 5Ј End Labeling Kit from Roche Applied Science. Probe sequence containing LNAs (ϩ) is as follows: 5Ј-AATGGCAϩGϩAϩTϩ TϩCϩAϩCϩAAGCATCA-3Ј. Following deparaffinization and proteinase K digestion, breast tumor tissue microarrays were prehybridized for 1 h and then hybridized with 10 nmol/ liter of LNA lncRNA-HIT probe in a hybridization buffer (Roche Applied Science) for 12 h. After three consecutive washes in 4ϫ SSC, 50% formamide, 2ϫ SSC, and 0.1 ϫ SSC, sections were treated with a blocking buffer (Roche Applied Science) for 1 h and incubated with anti-DIG-AP Fab fragments (Roche Applied Science) for 12 h. After washing three times in 1ϫ maleic acid and 0.3% Tween 20 buffer, reactions were processed in a detection solution (100 mmol/liter of Tris-HCl (pH 9.5) and 100 mmol/liter of NaCl) in the presence of nitro blue tetrazolium and 5-bromo-4-chloro-3-indolyl phosphate from Promega and then visualized under a microscope. Allred scoring system was used for quantification (11).
Statistical Analysis-Statistical significance was determined using unpaired Student's t test, and p Յ 0.05 was considered to be statistically significant.

Profile of lncRNA Expression in TGF␤-induced EMT in
NMuMG Cells-Using the widely employed NMuMG mammary epithelial cells as a model to study TGF␤-induced EMT, we have previously shown that TGF␤ up-regulates ncRNA BIC, which processes into miR-155 to contribute to EMT (6). Therefore, we hypothesized that TGF␤ could similarly regulate lncRNAs that control EMT-associated phenotypes. To address this, we treated NMuMG cells with vehicle or TGF␤ (5 ng/ml) for 24 h to induce EMT (Fig. 1A). Total RNA was labeled with digoxigenin and hybridized to NCode Mouse Noncoding RNA Microarray. This is a high-density oligoarray that contains lncRNAs and protein coding genes to detect the dynamic expression of a large subset of lncRNAs and associated protein coding genes within the same biological sample. This array contains 2 replicates for each gene per sample to ensure signal detection is consistent and reliable. Plots of relative intensity signals of replicate spots for lncRNAs in vehicle and TGF␤ treatment indicate the consistency of signal for lncRNA within given sample. Linear regression analysis provides statistically significant evidence that both vehicle and TGF␤-treated replicates are consistent (Fig. 1, B and C). The same analysis was also performed and demonstrated to be significant for protein coding genes (data not shown).
The microarray contains replicates for 10,802 lncRNAs in which the signal was normalized, averaged, and plotted to assess overall changes in expression in vehicle versus TGF␤ treatment (Fig. 1D). We determined that lncRNA expression was significantly changed if the average signal of replicates changed Ն2-fold from vehicle to TGF␤-treated samples (red and green spots in Fig. 1D). Data including average signal and fold-change for all lncRNAs are summarized (supplemental Table S1). Microarray analysis was also performed for 25,178 protein coding genes and summarized in supplemental Table  S2). Using Ն2-fold change as the cutpoint, 633 lncRNAs were down-regulated and 680 lncRNAs were up-regulated following TGF␤-induced EMT (Fig. 2, A and B). Tables 1 and 2 summarize the top 15 down-regulated and 15 up-regulated lncRNAs. Furthermore, we confirmed the expression of several top deregulated lncRNAs by real-time PCR (Fig. 2, C and D). In total, 1,096 genes increased and 2,220 decreased expression upon TGF␤-induced EMT (Fig. 3, A and B). Furthermore, RT-qPCR validation   (Fig. 3, C and D), indicating that the lncRNA data generated in this study are dependable.
LncRNA-HIT Mediates TGF␤-induced Invasion, Migration, and EMT in NMuMG Cells-LncRNA-HIT (AK020562) is one of the top TGF␤ up-regulated lncRNAs, and was of particular interest to us given its genomic location, i.e. within the homeobox domain A (Hoxa) gene cluster. LncRNA-HIT resides in the sense orientation to the Hoxa protein genes and downstream of Hoxa13. There is no overlap between these transcripts and a gap of ϳ1.4 kb between the Hoxa13 3Ј termini and lncRNA-HIT 5Ј transcriptional start site (Fig. 4A). Previous studies have shown that the HOX gene clusters (HOXA, HOXB, HOXC, and HOXD) are well conserved across species and several lncRNAs have been shown to be functionally important within these regions (1,12,13).
Consistent with the microarray finding, RT-qPCR analysis showed that lncRNA-HIT expression was induced by TGF␤ (Fig. 4B). LncRNAs have been shown to play functional roles in both the nuclear and cytoplasmic compartments (14 -17). Therefore, we performed cellular fractionation after 24 h of treatment with TGF␤ to assess the subcellular localization of lncRNA-HIT. We observed that lncRNA-HIT expression was induced and remained localized to the nucleus upon TGF␤ treatment (Fig. 4C), suggesting that lncRNA-HIT functions in the nucleus. We further designed 2 siRNAs against lncRNA-HIT (si-HIT-1 and si-HIT-2) to experimentally address its functional significance in TGF␤-induced migration, invasion, and EMT. NMuMG cells were transfected with scrambled control siRNA (si-CTL), si-HIT-1, or si-  HIT-2 for 48 h. Cells were then treated with TGF␤ (5 ng/ml) or vehicle control for 24 h. Following confirmation of knockdown of lncRNA-HIT (Fig. 5A), we performed two-chamber migration and invasion assays and found that depletion of lncRNA-HIT significantly inhibited TGF␤-induced migration and invasion compared with si-CTL (Fig. 5, B and C).  TGF␤-induced EMT in NMuMG causes the cell to move from an epithelial to a more mesenychmal and fibroblast-like gene signature. A hallmark of EMT is the loss of E-cadherin and increase of Vimentin expression. To test whether lncRNA-HIT can promote the process of EMT, we transfected the cells for 48 h with si-CTL, si-HIT-1, and si-HIT-2 and then treated cells with and without TGF-␤ for 24 h to induce EMT. Immunoblotting analysis revealed that TGF␤ treatment reduced E-cadherin and increased Vimentin in si-CTL cells. However, knockdown of lncRNA-HIT largely overrode the TGF␤ action in E-cadherin and Vimentin expression (Fig. 5D). This result was recapitulated using immunofluorescence staining to visualize expression of E-cadherin and Vimentin (Fig. 5E). We noted that TGF␤-disrupted tight junctions were largely restored by knockdown of lncRNA-HIT (Fig. 5E). Moreover, we examined the effects of overexpression of lncRNA-HIT alone on EMT in NMuMG cells. In NMuMG cells overexpressing lncRNA-HIT (Fig. 6A), we observed a significant increase in both migration and invasion (Fig. 6, B and C). Ectopic expression of lncRNA-HIT was also able to disrupt tight junction as indicated by E-cadherin immunofluorescence staining (Fig. 6D). Collectively, these data indicate that lncRNA-HIT plays a pivotal role in TGF␤-induced cell migration, invasion, and EMT.

LncRNA-HIT Is Up-regulated in 4T1 Cells and Its Depletion Inhibits Cell Migration, Invasion, Lung Metastasis, and Tumor
Growth-We further examined lncRNA-HIT expression in four well characterized mouse mammary tumor cell lines (67NR, 168FARN, 4TO7, and 4T1) derived from a single spontaneously arising mammary tumor in a BALB/c mouse. Although each of these tumor cell lines is able to form primary tumors, they have different metastatic properties. 67NR cells form primary tumors readily, but tumor cells do not intravasate. 168FARN cells can be detected in lymph nodes but rarely in other tissues, suggesting that they can enter the vasculature, but extravasate inefficiently. 4TO7 cells can disseminate from primary mammary tumors into the lungs but do not form visible lung nodules. Moreover, disseminated 4TO7 cells in the lungs rapidly disappear when the primary tumor is removed, suggesting that they are unable to colonize distant sites. 4T1 cells are fully metastatic and form macroscopic lung nodules from primary mammary tumors. Interestingly, lncRNA-HIT was significantly elevated in the 4T1 cell line and expressed at low levels in other 3 cell lines (Fig. 7A), suggesting that lncRNA-HIT is involved in breast cancer metastasis. The importance of lncRNA-HIT in 4T1 cell migration and invasion was further investigated. After cells were transfected with si-HIT-1, and si-HIT-2 as well as si-CTL for 48 h (Fig. 7B), two-chamber migration and invasion assays were performed as described above. After 16 h, we observed a significant reduction in both migration and invasion of 4T1 cells in which lncRNA-HIT was depleted (Fig. 7, C and  D). However, we did not observe that TGF␤-induced lncRNA-HIT in 4T1 cells and that knockdown of lncRNA-HIT had no effect on TGF␤-induced 4T1 cell migration and invasion (data not shown). We next examined the effect of lncRNA-HIT on anchorage independent growth in soft agar following depletion of lncRNA-HIT. Compared with si-CTL-treated 4T1 cells, si-HIT-1 and -2 knockdown cells resuspended in soft agar and cultured for 2 weeks showed markedly reduced colony formation capacity (Fig. 7E). Notably, the orthotopic breast cancer model revealed that knockdown of lncRNA-HIT dramatically inhibited the number of metastatic lung nodules, breast tumor volume, and tumor weight (Fig. 6, F-H).
Identification of E-cadherin as a Major Target of lncRNA-HIT-To identify the genes regulated by lncRNA-HIT, we performed Affymetrix gene expression analysis after 48 h ectopic expression of lncRNA-HIT in NMuMG (Fig. 8A). One of the significant deregulated genes associated with EMT was E-cadherin. Accumulating studies have shown that loss of E-cadherin is not only a hallmark of EMT but also a key driver of EMT and metastasis (18 -21). Thus, we further examined if lncRNA-HIT inhibits E-cadherin transcription by performing RT-qPCR and using the E-cadherin luciferase promoter assay. Following ectopic expression of lncRNA-HIT, we observed a significant loss of E-cadherin mRNA and promoter activity (Fig. 8, B and  C). Furthermore, this correlated with a loss of E-cadherin protein after 72 h overexpression of lncRNA-HIT (Fig. 8D), however, we did not see changes of the other EMT-associated genes ZEB1 or Snail at this time point. Furthermore, the effects of lncRNA-HITinduced EMT, migration, and invasion were rescued through introduction of ectopic E-cadherin (Fig. 8, E and F).
Expression of LncRNA-HIT Is Conserved and Is Elevated in Human Invasive Ductal Breast Cancer-LncRNA-HIT resides in the 5Ј distal Hoxa gene cluster, a region of the genome that is highly conserved (12,22). We therefore used BLAST analysis software to evaluate the conservation of lncRNA-HIT in FIGURE 7. LncRNA-HIT is elevated in highly metastatic 4T1 cells and its knockdown results in significant reduction of cell migration and invasion. A, expression of lncRNA-HIT was evaluated in 4 isogenic mouse breast cancer cell lines 67NR, 168FARN, 4T07, and 4T1 by semiquantitative RT-PCR. B, knockdown of lncRNA-HIT with 2 siRNAs in 4T1 cells. Cells were transfected with the indicated siRNAs and then analyzed lncRNA-HIT expression by real-time PCR. C and D, depletion of lncRNA-HIT dramatically reduces 4T1 cell migration and invasion. Following treatment with the indicated siRNAs, 4T1 cells were assayed for cell migration and invasion as described in the legend to Fig. 4. E, soft agar colony formation assay was performed after 4T1 cells were transfected with si-CTL, si-HIT-1, or si-HIT-2. Colony forming capacity was dramatically decreased after 2 weeks in lncRNA-HIT-depleted cells. F, orthotopic breast cancer model. Control and lncRNA-HIT knockdown 4T1 cells (4 ϫ 10 6 ) were injected to mammary fat pad of female nude mice and tumor growth was monitored for 4 weeks. The number of metastatic lung nodules (F), tumor volume (G), and tumor weight (H) were assessed at completion of experiment. Asterisks represent p Ͻ 0.05. humans and the potential for orthologs residing in the same genomic location. LncRNA-HIT maps to the syntenic loci (Fig.  9A) and shares 99% coverage and significant sequence identity (80%) to human. Having demonstrated that lncRNA-HIT mediates TGF␤-induced EMT in NMuMG and cell migration and invasion in 4T1 cells, we asked if the expression of lncRNA-HIT was associated with cancer invasiveness in primary breast carcinoma in human. A total of 89 breast cancer specimens (15 noninvasive and 74 invasive breast carcinomas) and 4 normal and 9 hyperplasia breast tissue samples were examined for the expression of lncRNA-HIT. Locked nucleic acid in situ hybridization (LNA-ISH) (Fig. 9B) analyses revealed high levels of lncRNA-HIT in 29 of 74 invasive tumors but in only 2 of 15 noninvasive cancer tissues (Fig. 9C). The level of expression of lncRNA-HIT in normal breast tissue was low and gradually increased to invasive carcinoma suggesting lncRNA-HIT may play a role in tumor progression in humans (Fig. 9D). These data further support the findings demonstrating the involvement of lncRNA-HIT in EMT and invasion as observed in NMuMG and 4T1 cells, and suggest that conserved human lncRNA-HIT could play a pivotal role in breast cancer metastasis.

DISCUSSION
Accumulating studies have demonstrated that the TGF␤ pathway plays a critical role in breast cancer metastasis and several protein-coding genes and miRNAs have been described in this process (23)(24)(25)(26). Moreover, a recent report demonstrated that lncRNA-ATB is up-regulated in TGF␤-treated SMMC-7721 hepatoma cells and plays a significant role in hepatocellular carcinoma metastasis (27). In this study, we report a lncRNA expression signature of TGF␤-induced EMT in mouse mammary gland epithelial (NMuMG) cells. Over 600 lncRNAs were significantly up-regulated or down-regulated during EMT, respectively. Furthermore, we showed that lncRNA-HIT, one of the top up-regulated lncRNA, plays an important role in TGF␤induced EMT, cell migration, and invasion. Depletion of lncRNA-HIT can reverse the process of EMT-associated gene expression of E-cadherin and Vimentin. Furthermore, lncRNA-HIT expression is significantly elevated in the highly metastatic cell line 4T1 in comparison to 3 other isogenic mouse cell lines with less metastatic capacity. Depletion of lncRNA-HIT in this cell line results in a significant reduction of migration and invasion as well as lung metastasis and tumor growth. LncRNA-HIT is conserved in sequence and genomic location from mouse to human. Human lncRNA-HIT expression is associated with more invasive tumor and breast cancer progression. These findings are important for several reasons. First, this is the first study to demonstrate an lncRNA expression signature of TGF␤-induced EMT in mouse mammary gland epithelial cells. Second, this study established a critical role of a previously uncharacterized lncRNA, lncRNA-HIT, in TGF␤-induced EMT. Finally, conserved lncRNA-HIT was shown to be an important lncRNA in mouse and human breast cancer progression.
LncRNA-HIT resides in the 5Ј distal Hoxa gene cluster between but not overlapping with Hoxa13 and Hoxa11-as. This region of the genome is highly conserved across several species and several functional lncRNAs have been demonstrated to be important within the Hoxa clusters, including neighboring HOTTIP and HOXA11-AS (12,22,28). HOTTIP has been shown to positively regulate the protein coding genes in close proximity, notable HOXA13, through the recruitment of WDR5, a component of the histone methyltransferase protein MLL complex that promotes transcriptional activation (12). It has also recently been reported that elevated levels of human HOTTIP and HOXA13 expression associates with disease progression and poorer survival in hepatocellular carcinoma (29). However, lncRNAs HOTTIP and HOXA11-AS were not induced by TGF␤ in our microarray analysis, suggesting lncRNA-HIT has a unique role in TGF␤-induced EMT. In addition, we demonstrated that E-cadherin promoter activity and the mRNA level were significantly repressed by lncRNA-HIT and that enforced expression of E-cadherin largely abrogated lncRNA-HIT-induced EMT, cell migration, and invasion. These findings suggest that E-cadherin is a key target of lncRNA-HIT, which regulates E-cadherin via trans mechanism.
Although microarray analysis revealed expression changes for a number of typical TGF␤-regulated protein-coding genes upon TGF␤-induced EMT, the array did not show altered expression of the lncRNA-HIT neighbor genes including HOXA13 and HOXA11. Further investigation is warranted to determine the mechanism of lncRNA-HIT regulation of E-cadherin, as well as its use as a therapeutic target for breast cancer metastasis intervention.