IMR Press / FBL / Volume 28 / Issue 10 / DOI: 10.31083/j.fbl2810276
Open Access Original Research
Identification of Key Novel lncRNAs RP11-400N13.3 and RP11-197K6.1 that Construct ceRNA Networks Associated with Recurrence and Metastasis in Colon Cancer
Show Less
1 School of Pharmacy, China Medical University, 110052 Shenyang, Liaoning, China
2 Department of Clinical Research Center, Central Hospital Affiliated to Shandong First Medical University, 250021 Jinan, Shandong, China
3 School of Forensic Medicine, China Medical University, 110052 Shenyang, Liaoning, China
4 School of Pharmacy, Hainan Medical University, 571100 Haikou, Hainan, China
*Correspondence: hy0207147@hainmc.edu.cn (Jing-wei Liang); fhmeng@cmu.edu.cn (Fan-hao Meng)
These authors contributed equally.
Front. Biosci. (Landmark Ed) 2023, 28(10), 276; https://doi.org/10.31083/j.fbl2810276
Submitted: 22 March 2023 | Revised: 9 June 2023 | Accepted: 29 June 2023 | Published: 31 October 2023
Copyright: © 2023 The Author(s). Published by IMR Press.
This is an open access article under the CC BY 4.0 license.
Abstract

Background: Colon adenocarcinoma (COAD) is a major cause of cancer mortality worldwide. The occurrence and development of colon cancer is regulated by complex mechanisms that require further exploration. Recently, long non-coding RNAs (lncRNAs) were found to be related to the mortality of colon cancer patients through their participation in competing endogenous RNA (ceRNA) networks. Therefore, screening the lncRNAs involved in colon cancer may contribute to clarifying the complex mechanisms. Methods: In this study, we explored the potential lncRNAs associated with colon cancer by establishing a ceRNA network using bioinformatics, followed by biological verification. Results: RP11-197K6.1 and RP11-400N13.3 were screened out owing to their involvement in the expression of CDK2NA, a gene that potentially prevents colon cancer cells from high oxygen levels. Conclusions: Our work explored the mechanisms of recurrence and metastasis in colon cancer and provided potential targets for drug development.

Keywords
colon cancer
RP11-197K6.1
RP11-400N13.3
intracellular signaling
ceRNA network
1. Introduction

Colon cancer is one of the most prevalent cancers worldwide, with characteristics involving a high incidence rate and high mortality [1]. Presently, surgery is used as the main treatment option, supplemented by radiotherapy and chemotherapy to achieve the therapeutic effect of killing and reducing the number of tumor cells as much as possible [2]. However, the 5-year survival rate for colon cancer remains alarmingly low because of the high recurrence and metastasis rates [3]. Thus, a better understanding of the underlying mechanisms and contributing factors involved in the metastasis development of colorectal carcinoma (CRC) is hugely important to provide the foundation for identifying specific targets in the treatment of CRC. The function of microRNA (miRNA) was reported to play an important role in the recurrence and metastasis of colon cancer by regulating the epithelial–mesenchymal transition (EMT) at both the post-transcriptional and post-translational levels. However, it seems that all major signaling pathways modulated by miRNA are involved in the CRC EMT program and contribute differently to the progression of CRC, thereby exhibiting the complexity of the metastatic cascade [4].

Recent research has focused on long non-coding RNAs (lncRNAs) owing to their regulatory relationship with miRNAs. They are a group of non-coding RNA molecules with a length greater than 200 nucleotides. Human genome sequencing results have shown that less than 2% of the whole genome encodes proteins and that most of the non-coding sequences were transcribed into lncRNAs. The interaction between lncRNA and miRNA is closely related to the occurrence and development of tumors [5]. The lncRNAs that regulate miRNAs through “sponge” absorption are also known as competitive endogenous RNA (competing endogenous RNA, ceRNA). Numerous research studies have indicated that lncRNAs possess high potential as novel biomarkers and therapeutic targets in cancer treatment [6]. Compared to defined lncRNAs, the functions and mechanisms of most lncRNAs remain unknown. Moreover, the current lncRNA mining works adopt bioinformatical methods that lack any systematic approach and verification.

The present study aimed to discover lncRNAs that influence the overall survival (OS) and disease-free survival (DFS) of colon patients and identify the underlying mechanisms involved in the recurrence of colorectal cancer in patients through the interaction between lncRNA and miRNA; thus, multiple bioinformatics databases were employed to establish a ceRNA network for colon cancer via a regulatory mechanism. The prediction of cellular localization indicated that lncRNA may regulate the expression of messenger RNA (mRNA) through a competitive endogenous network. Then, according to the differential expression of lncRNA-related miRNAs in colon cancer tissues, the downstream mRNA target regulated by miRNA was screened out, and a regulatory network of lncRNA was constructed (Fig. 1).

Fig. 1.

Graphical summary of the screening and verification process. LncRNA, long non-coding RNA; mRNA, messenger RNA; miRNA, microRNA; ceRNA, messenger RNA.

2. Materials and Methods
2.1 Prognostic Value Analysis and lncRNA Cellular Localization

LncRNA expression levels in colon adenocarcinoma from The Cancer Genome Atlas Program (TCGA) were visualized using circlncRNAnet [7]. Differentially expressed lncRNAs were identified according to the criterion: |Log2 fold change| >5, p-adjusted < 1.00 × 10-11. Overall survival and disease-free survival of differentially expressed lncRNAs were analyzed and plotted using Gene Expression Profiling Interactive Analysis (GEPIA) [8]. The group cutoff was set as ‘median’, and the hazard ratio and 95% CI were set as ‘Yes’. The dataset was selected as colon cancer. LncRNA sequences were obtained from UCSC and LncBook [9, 10]. Then, lncLocator was used to predict the cellular localization of the lncRNA using its sequences based on a stacked ensemble classifier [11]. The highest score in the prediction result was the lncRNA location.

2.2 Identification of Differentially Expressed MiRNAs Associated with LncRNA

The miRNA datasets were obtained from the Gene Expression Omnibus (GEO). ‘Colon cancer’ and ‘miRNA’ were selected as keywords, and the GSE125961 dataset was produced [12]. This dataset included 6 colon cancer tissues and 6 normal tissues, which contained a total of 1540 miRNAs. All miRNAs related to lncRNA in the dataset were screened and displayed with Heatmap. The lncRNA-binding miRNAs were predicted using miRcode [13].

2.3 Construction of lncRNA–miRNA–mRNA Network

The miRNA target genes were acquired using miRTarBase [14]. The genes that were validated using at least one strong experimental technique i.e., reporter gene assay, Western blotting, or quantitative reverse transcription polymerase chain reaction (PCR), were regarded as miRNA targets. The lncRNA–miRNA–mRNA network was generated using Cytoscape 3.8.0 [15].

2.4 Screening Crucial Genes Related to Colon Cancer

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed for the mRNAs in the ceRNA networks, considering the various biological processes associated with Gene Ontology. GO and KEGG pathway analyses of lncRNA-related genes were performed using Metascape v3.5 (https://metascape.org/gp/index.html) [16]. After submitting a gene list in Metascape, ‘Input species’ and ‘Analysis as species’ was set alongside ‘H. sapiens’, and ‘Custom Analysis’ was selected for the next step. When using Metascape for pathway and process enrichment analysis, the minimum overlap was set to 3, with a p-value cutoff of 0.01, and a minimum enrichment of 1.5. ‘KEGG Pathway’, ‘GO Biological Processes’, and ‘Reactome Gene Sets’ were selected as the pathway options. All other options were set to default.

STRING was used to collect data from the protein–protein interaction (PPI) network construction of lncRNA-related target genes, and Cytoscape 3.8.0 (Institute for Systems Biology, Seattle, WA, USA) was used for network visualization. The hub genes were selected using the CytoNCA network analyzer plug-in based on three different centrality measures, including ‘betweenness centrality (BC)’, ‘closeness centrality (CC)’, and ‘subgraph centrality (SC)’ [17].

In the enrichment analysis data, the genes related to colon cancer and the genes with the top 20% centrality in the PPI network were both selected for subsequent analyses since they are hub genes related to lncRNA.

2.5 COX Regression Analysis

The gene expression data were obtained from the Human Protein Atlas (HPA), selected by using ‘colon’, and a subset of colon cancer in PATHOLOGY [18]. All patient data related to gene expression used in this study were obtained from TCGA (Table 1). R packages ‘survival’ and ‘plyr’ were used to perform cox regression analysis of the selected genes. Multivariate regression modeling was performed by univariate regression (p < 0.05) following the selection of the genes.

Table 1.Identified biological pathways related to colon cancer in the cluster analysis.
Term ID Description Log P Gene
hsa05200 Pathways in cancer –36.353 CDK4, CCND1, CDKN1A, RB1, E2F1, E2F2, E2F3, TP53, AKT2, AKT1, CDKN2A, CDK2, CCND2, RAF1, EGFR, ERBB2, PTEN, BAK1, STAT3, MAPK9, BCL2, SMAD4, TGFBR2, CASP3, FAS, VEGFA, ETS1, ESR1, BBC3, FGFR2, HIF1A, EPO, RUNX1, IGF2, IL6R, BCL2L11, BMP2, SMO, GLI1, CASP7
hsa05206 MicroRNAs in cancer –28.345 HDAC4, CDKN1A, BCL2L11, BMPR2, TP53, RAF1, EGFR, STAT3, CCDN2, E2F1, PTEN, CCND1, CDKN2A, BMI1, CASP3, BMF, MCL1, ERBB3, E2F2, ERBB2, BCL2, E2F3, ATM, BAK1, SOX4, CYP24A1, VEGFA, HMGA2
GO: 0070482 Response to oxygen levels –20.009 EIF4, EBP1, TP53, BCL2, AKT1, EPO, HIF1A, E2F1, PTEN, CD34, VEGFA, HK2, SMAD4, CASP3, BMP2, ADAM15, RAF1, ATM, NR4A2, TGFBR2, ETS1, FAS, CDK4, CDKN1A, PDPN
2.6 Simulation and Verification of ceRNA Regulatory Network

The selected genes from the HPA database were verified by antibody staining after multivariate regression modeling, to determine the location of the mRNA expression. ‘Colon cancer’ was selected in ‘protein expression’ from ‘pathology atlas’, while ‘staining’ was selected as ‘high’, and ‘intensity’ and ‘quantity’ were selected as ‘strong’ and ‘>75%’, respectively. The computer validation of the lncRNA binding and the related miRNA was performed by RNA Fold Web Server, which lists the centroid structure with minimal base pair distance to all structures in the thermodynamic ensemble (which can be computed simply as the structure containing all pairs with p > 0.5) [19]. A high similarity between the centroid and minimum free energy (MFE) structure indicated a reliable prediction. A comprehensive assessment was based on the similarity between the secondary structure and the corresponding binding sites in the lncRNA and miRNA. The miRNA sequences were obtained from miRBase [20]. GEPIA was used to analyze the prognostic survival value of the validated genes.

The Extracellular Vesicles miRNA (EVmiRNA) database analyzed 462 small RNA sequencing datasets containing extracellular vesicles from 17 tissues/diseases, which provided the expression of specific miRNAs in different types of cancer [21]. Therefore, EVmiRNA was used to identify the selected miRNAs in colon cancer.

The RNAfold Web Server was used to predict secondary structures of single-stranded RNA and RNA–RNA interaction. ‘Minimum free energy (MFE) and partition function’ was selected in ‘fold algorithms and basic options’ to calculate the partition function (1) and base pairing probability matrix in addition to the MFE structure (2). ΔGuA,B represents the free energy required to make the binding region in molecule A or B accessible by removing the intra-molecular structure; Gh denotes the free energy gained from forming the intermolecular duplex. pij is the probability of forming the (i,j) pair. The lncRNA and miRNA binding information could be obtained after submitting the relevant sequence, while the ‘length of the unstructured region’, and ‘maximal length of the region of interaction’ was set to 4 and 25, respectively.

(1) Δ 𝑮 binding = Δ 𝑮 u A + Δ 𝑮 u B + 𝑮 𝒉

(2) S = - j i p i j log p i j - p i u log p i u , p i u = 1 - j i p i j

2.7 Biological Validation of RP11-197K6.1 Regulatory Network
2.7.1 Cell Culture

Human colon adenocarcinoma cells (CACO-2) were purchased from the American Type Culture Collection (ATCC). The cell line was authenticated by short tandem repeats, and mycoplasma testing was conducted. Cells were cultured routinely in high glucose Dulbecco’s Modified Eagle’s Medium (DMEM-basic, Gibco, Waltham, MA, America), containing 20% fetal bovine serum (Biological Industries, Kibbutz Beit Haemek, Israel). When the cells reached 90% confluency, they were passaged using a 0.25% trypsin–EDTA solution (Solarbio, Beijing, China). The cells were cultured in a humidified incubator at 37 °C under an atmosphere of 5% CO2 (Memmert, Cologne, Germany).

2.7.2 Cell Transfection

Using the lncRNA RP11-197K6.1 sequences in the UCSC Genome Browser (http://genome.ucsc.edu/), small interfering RNA expressing negative control (NC) RNA and lncRNA were designed and developed by HippoBio in China. All of these sequences were transfected into cells in the logarithmic growth phase using RNA TransMate transfection reagent (Sangon, Shanghai, China), in accordance with the manufacturer’s protocol. When transfection and verification were complete, the cells were used in subsequent experiments.

2.7.3 Western Blot (WB) Analysis

For WB analysis, cells were lysed in radioimmunoprecipitation assay (RIPA) buffer (Keygen Biotechnology, Nanjing, China), supplemented with 1×phosphatase inhibitor (Seven Biotechnology, Shanghai, China), and 1× protease inhibitor (Seven Biotechnology). The cell lysates were centrifuged, and the pelleted debris was discarded. Then, the total protein was mixed with 5× SDS-PAGE loading buffer and heated to 100 °C for 5 min to prevent protein degradation. Protein samples (30 µg) were electrophoresed on an SDS-PAGE gel, and then, transferred to a polyvinylidene difluoride (PVDF) membrane. Subsequently, the membrane was blocked with 5% milk for 2 h before being incubated with the indicated primary antibody overnight. After washing with PBS–Tween (PBST) solution, the membrane was incubated with the appropriate secondary antibody, and the expression was detected using enhanced chemiluminescence (ECL) reagents. Relative protein expression was analyzed using Image J software (version 1.8, National Institutes of Health, Bethesda, MD, USA).

2.7.4 Real-Time Quantitative PCR (RT-qPCR)

Total RNA was extracted from cells using TRIzol reagent (R1100, Solarbio, China). The purity and concentration of RNA were calculated by optical density (OD) at wavelengths of 260 and 280 nm. The synthesis of complementary DNA (cDNA) was performed using FastKing RT Kit (KR116, Tiangen, Beijing, China) in PCR Thermal Cycler Dice (TP600, TaKaRa, Kusatsu, Japan). Real-time PCR analyses were performed using SuperReal PreMix Plus (FP205, Tiangen, China) in the Applied Biosystems 7300 Real-Time PCR System (Applied Biosystems, Foster, CA, USA). The expression level of RP11-197K6.1 was calculated using the 2-ΔΔCT method. The primer sequences used to amplify RP11-197K6.1 were 5-GGUGCAUGAUCGCCGAAUAAATT-3 (forward) and 5-UUUAUUCGGCGAUCAUGCACCTT-3 (reverse). Experiments were conducted independently in at least triplicate.

2.8 Statistical Analysis

All data are presented as mean ± SEM and processed using GraphPad Prism software (version 8.3, GraphPad Software, Inc., San Diego, CA, USA). An unpaired two-tailed t-test was used to compare the differences between the two groups. One-way ANOVA with Bonferroni correction for multiple comparisons was used to analyze the differences between the groups. p < 0.05 was determined as statistically significant.

3. Results
3.1 Differentially Expressed LncRNA Screening and Prognostic Value Analysis

Through data collection in circlncRNAnet, it was found that 59 lncRNAs are differentially expressed in colon cancer (Supplementary Table 1). The results identified that most of the differentially expressed lncRNAs were upregulated in colon cancer, with extremely high statistical significance (p-adjust < 0.0001), whereby RP11-474D1.3, FEZF1-AS1, and RP11-143E21.3 showed the highest fold changes. To identify the lncRNA with potential clinical significance in colon cancer, an omnibus database was adopted to analyze the OS and DFS of lncRNA in colon cancer patients. The GEPIA database showed that the high expressions of three lncRNAs significantly correlated to the reduction in OS and DFS in colon cancer patients (Fig. 2a).

Fig. 2.

The correlation between the expression of the three lncRNAs and OS and DFS in colon cancer patients: (a) Overall survival and prognostic values of lncRNA in colon cancer from TCGA were analyzed using GEPIA. The overall survival curve and prognostic survival curve of three lncRNAs. (b) The expression of lncRNA in colon adenocarcinoma patients was extensively upregulated compared to normal tissues. (c) Subcellular localization of RP11-815M8.1, RP11-400N13.3, and RP11-197K6.1 predicted by lncLocator. TCGA, The Cancer Genome Atlas; GEPIA, Gene Expression Profiling Interactive Analysis; OS, overall survival; DFS, disease-free survival; lncRNA, long non-coding RNAs; COAD, colon adenocarcinoma.

3.2 Cellular Localization and Construction of ceRNAnet

The research shows that the lncRNAs in the cytoplasm rather than in the other cell components interfere with the inhibitory effect of miRNA on mRNA. Thus, we screened for the lncRNAs that participate in the ceRNA network, according to their location. The result predicted that RP11-400N13.3, RP11-197K6.1, and RP11-815M8.1 were all located in the cytoplasm (Fig. 2c; score: 0.82, 0.85, and 0.89, respectively). Therefore, the three lncRNAs can potentially regulate cellular functions through the ceRNA network. To construct the downstream lncRNA regulatory network, the target miRNAs of the lncRNAs were predicted. Subsequently, 37 miRNA families were found to be downstream and under the regulation of the three lncRNAs (Supplementary Table 2). Differentially expressed gene (DEG) analysis indicated that 12 miRNAs were significantly downregulated in colon cancer patients according to the GSE125961 dataset (Fig. 3a; |Log2 fold change| >1, p < 0.05). Whereas, in subsequent analyses, we found the levels of the mRNAs regulated by several members of the 12 miRNAs, including miR-92b, miR-302b, and miR-214, did not match the trend of being significantly upregulated in tumor tissues. As a result, these miRNAs were excluded, while a further 8 miRNAs were also screened out: miR-302c-3p, miR-125b-5p, miR-302d-3p, miR-302a-3p, miR-106a-5p, miR-125a-5p, and miR-363-3p for RP11-197K6.1; miR-24-3p for RP11-400N13.3. To further complete the ceRNA network, the mRNAs regulated by miRNA need to be identified. The experimental data obtained using three experimental methods: reporter assay, Western blotting, and quantitative reverse transcription were used to identify the target mRNAs, using the hypothesis that the mechanism of the ceRNA regulatory network involved in the upregulation of the lncRNAs would inevitably lead to a decrease in the downstream miRNA expression. Meanwhile, the downregulation trend involving the target miRNA of RP11-815M8.1 was far inferior to the other two lncRNAs, indicating that its regulatory efficiency in the ceRNA network was weak. Moreover, these miRNA-regulated mRNAs were likewise not significantly upregulated in tumor tissues. Thus, RP11-815M8.1 was eliminated from further analysis, whereas RP11-400N13.3 and RP11-197K6.1-related microRNAs and mRNAs were selected for ceRNA network construction (Fig. 3b).

Fig. 3.

DEG analysis and ceRNAnet construction: (a) Differentially expressed miRNAs in colorectal cancer. Heatmap of differentially expressed miRNAs in colorectal cancer obtained from GSE125961 dataset using GEO2R (|Log2 fold change| >1; p-value < 0.05). The dataset included 6 pairs of colon cancer tissues (tumors 1–6) and their adjacent tissues (normal 1–6), while 12 miRNAs were significantly downregulated in the tumors. (b) Construction of lncRNA–miRNA–mRNA network. Competitive endogenous RNA network (ceRNAnet) contained a total of 144 target genes (CDK4, MAP114, E2F2, and CDKN2A were associated with both RP11-400N13.3 and RP11-197K6.1), 2 lncRNAs, and 9 miRNAs. RP11-400N13.3 was only targeted by miR-24-3p. DEG, differentially expressed gene.

3.3 Functional Enrichment Analysis of LncRNA-Target mRNAs

Function and pathway enrichment analysis was performed based on the online bioinformatics portal Metascape. The result of gene enrichment analysis showed that RP11-197K6.1-targeted mRNAs were enriched in cancer pathways, especially the “pathways in cancer (hsa05200)”, “microRNAs in cancer (ko05206)”, and “response to oxygen levels (GO: 0070482)” with a -log10(P) score of 37, 28, and 20, respectively (Fig. 4a). Moreover, the pathway “microRNAs in cancer (ko05206)” and “response to oxygen levels (GO: 0070482)” were reportedly associated with colon cancer development. Oxygen level is an intracellular factor that has been reported to prevent colon cancer cells from apoptosis by promoting DNA damage repair [22, 23, 24]. Therefore, the “response to oxygen levels” identification is of great value in the current research. In order to explore the specific relationship between oxygen levels and the development of colon cancer, three datasets were selected to identify an overlap between them: two biological pathways with the highest enrichment degree of colon cancer and one pathway involving the cellular response to oxygen levels. The protein interaction network for all the ceRNA targets was constructed, and the top 20% of the centrality targets were selected for further study (Supplementary Fig. 1). Finally, according to whether there were significant differences in the expression of these target genes between colon cancer tissues and normal tissues, all target genes highly expressed in colon cancer tissues were excluded. For RP11-400N13.3, its target mRNAs were clustered into “negative regulation of growth (GO: 0045926)”, “cell cycle (R-HAS-1640170)”, and “PID E2F PATHWAY (M40)” (Fig. 4c). However, there was an inadequate number of genes required for enrichment analysis to identify a clear relationship between RP11-400N13.3 and colon cancer. Nevertheless, there were some potential biological pathways related to cancer, including “DNA replication (GO: 0006260)”, “AMP-activated protein kinase (AMPK) signaling pathway (hsa04152)”, and “negative regulation of translation (GO: 0017148)”. Significantly, similar to RP11-400N13.3, the non-cancer-related pathways predicted to be involved in the functions of RP11-197K6.1 were “cell cycle-related” and “regulation of metabolic process”, suggesting a potential common mechanism between RP11-400N13.3 and RP11-197K6.1.

Fig. 4.

GO and KEGG enrichment analyses. (a) GO and KEGG enrichment analyses of genes associated with RP11-197K6.1. (b) Interaction network illustrating the top 20 enrichment terms for RP11-197K6.1-related genes, larger nodes indicate the enrichment of more genes in the pathway. (c) GO and KEGG enrichment analyses of genes associated with RP11-400N13.3. (d) Interaction network illustrating the top 10 enrichment terms for RP11-400N13.3-related genes. It shows that compared to RP11-400N13.3, more enrichment pathways are related to cancer for RP11-197K6.1.

3.4 Identification of Crucial Genes Associated with Colon Cancer

According to the enrichment analyses, it is suggested that the expression of mRNA regulated by RP11-400N13.3 and RP11-197K6.1 is closely related to colon cancer. The RP11-197K6.1-related genes enriched in the three biological pathways are shown in Table 1. To ensure the diversity and statistical significance of the samples included in the study, three different levels of p-value cancer-related pathways were selected from the results of the enrichment analyses. In Fig. 5a, the Venn diagram demonstrates that 27 targets overlap in two or more datasets. It was observed that the mRNA expression levels of nine RP11-197K6.1–related targets (CDKN2A, E2F1, TP53, CASP3, VEGFA, CDK4, CCND1, E2F3, and BAK1) were significantly upregulated in colon tumor compared to normal tissues. Based on the network topology analysis of the lncRNA-related target genes, the top 20% of genes with the highest importance were obtained (Supplementary Fig. 1). It was found that in addition to the 27 genes described in the Venn diagram, there were another 5 genes (BRCA1, CDK1, RUNX3, APP, and PPP1CA) highly expressed in colon cancer tissues (Supplementary Fig. 2). Thus, the total 14 crucial targets were further considered to investigate the relationship to the two lncRNAs (Fig. 5b).

Fig. 5.

Identification of RP11-400N13.3 and RP11-197K6.1–related crucial genes in colon cancer. (a) Venn diagram of the overlapping genes related to “microRNAs in cancer (ko05206)”, “response to oxygen levels (GO: 0070482), and “pathway in cancer (hsa05200)”. (b) Crucial genes upregulated in colon cancer by RP11-400N13.3 and RP11-197K6.1, via several miRNAs.

3.5 Screening Downstream mRNA Regulatory Network using Clinical Data

In order to screen out the genes whose expression significantly affected the patient’s survival, the sample data from 438 colon adenocarcinoma patients in the TCGA database were employed to analyze the expression of 14 genes highly expressed in colon cancer tissues. After COX univariate regression analysis of the genes using patient survival curves, VEGFA, E2F3, RUNX3, and CDKN2A were identified since their expression significantly reduced the patient’s overall survival (p < 0.05; hazard ratio >1). Subsequently, CASP3, TP53, PPP1CA, CDK4, and BRCA1 were also synchronously screened out as protective factors involved in the patient’s overall survival (p < 0.05; hazard ratio <1). With the purpose of further screening out the independent influencing factors on the patient’s survival, COX multivariate regression was performed on the genes identified previously with a p-value of < 0.05. The results showed that E2F3 and CDKN2A were independent risk factors in the patient’s overall survival, along with BRCA1, CDK4, and CASP3, which were independent protective factors (Table 2).

Table 2.Univariate and multivariate analyses of overall survival of colon adenocarcinoma patients.
Genes Univariate analysis Multivariate analysis
HR (95% CI) p-value HR (95% CI) p-value
BRCA1 0.613 (0.398–0.943) 0.026 0.428 (0.265–0.691) 0.001
RUNX3 2.172 (1.445–3.263) <0.001 0.602 (0.366–0.992) 0.047
PPP1CA 0.535 (0.292–0.979) 0.043 0.889 (0.521–1.517) 0.666
VEGFA 1.766 (1.183–2.634) 0.005 1.215 (0.763–1.936) 0.412
TP53 0.568 (0.365–0.886) 0.013 0.766 (0.489–1.201) 0.245
E2F3 1.652 (1.110–2.457) 0.013 1.602 (1.024–2.507) 0.039
CDKN2A 1.762 (1.178–2.635) 0.006 1.990 (1.314–3.014) 0.001
CDK4 0.649 (0.437–0.966) 0.006 0.632 (0.415–0.964) 0.033
CASP3 0.613 (0.410–0.916) 0.017 0.609 (0.402–0.923) 0.020
CDK1 0.688 (0.461–1.027) 0.067
APP 0.737 (0.467–1.161) 0.188
E2F1 0.697 (0.466–1.041) 0.078
CCND1 1.502 (0.987–2.285) 0.057
BAK1 0.719 (0.458–1.129) 0.152

HR, hazard ratio.

3.6 Pathological and Prognostic Verification of Downstream targets

As previously mentioned, the cytoplasm is where RP11-400N13.3 and RP11-197K6.1 inhibit miRNA. Therefore, the expression of E2F3 and CDKN2A in colon cancer tissues was searched for in the HPA database. The result showed that miR-125b-5p and miR-24-3p regulate CDKN2A in the microvesicles in colon cancer tissues, indicating that the two miRNAs may be affected by upstream factors in the cytoplasm (Fig. 6a,b). Analysis of clinical data for CDKN2A and the subcellular locations of two further miRNAs are also shown in Fig. 6c. The HPA database lacks E2F3 staining data in colon cancer tissues. Therefore, follow-up verification only discusses CDKN2A-related regulatory pathways.

Fig. 6.

The correlation between the expression of the expression of miR-125b-5p and miR-24-3p and OS and DFS in colon cancer patients: (a) MiR-24-3p is highly expressed in cytoplasmic microvesicles in colon carcinoma. (b) MiR-125b-5p is highly expressed in cytoplasmic microvesicles in colon carcinoma. (c) Upregulated CDKN2A is associated with poor overall survival and prognostic survival of COAD patients. COAD, colon adenocarcinoma.

3.7 Computer Simulation Predicts the lncRNA and miRNA Binding Sites

After submitting RP11-400N13.3 and RP11-197K6.1 sequences to RNAfold Web Server, results for MFE and thermodynamic ensemble prediction were obtained. The calculation results for RP11-197K6.1 illustrated that the thermodynamic ensemble free energy was –519.83 kcal/mol, the optimal secondary structure had MFE of –483.90 kcal/mol and the centroid secondary structure had MFE of –407.28 kcal/mol (Fig. 7b). The calculation results for RP11-400N13.3 highlighted that the free energy of the thermodynamic ensemble was –189.54 kcal/mol, the optimal secondary structure had a MFE of –179.10 kcal/mol and the centroid secondary structure had MFE of –134.90 kcal/mol (Fig. 7c). A high similarity between the centroid and MFE structure indicated a reliable prediction. Therefore, the centroid secondary structure was the optimal secondary structure for both RP11-197K6.1 and RP11-400N13.3. Then, we calculated the binding sites based on the optimal secondary structure of the two lncRNAs.

Fig. 7.

The formula representing the interaction free energy was computed and combined with the opening energy to determine the total blinding energies. (a) The optimal secondary structure for RP11-197K6.1 binding to miR-125b-5p; (b) Mountain plot representation of the MFE structure, the thermodynamic ensemble of RNA structures, and the centroid structure; (c) The plot shows the interaction free energy (RED) Gi, and the energy needed to open existing structures in the lncRNA sequence (BLACK); (d) Mountain plot representation of the MFE structure, the thermodynamic ensemble of RP11-400N13.3, and the centroid structure; (e) The optimal secondary structure for RP11-400N13.3 binding to miR-24-3p; (f) The target region of RP11-400N13.3 binding to miR-24-3p; (g) The plot shows the interaction free energy (RED) Gi, and the energy needed to open existing structures in RP11-400N13.3 sequence (BLACK).

The optimal secondary structure upon hybridization ranged from positions 1434 to 1442 in the RP11-197K6.1 sequence and from positions 2 to 12 in miR-125b-5p, while the total free energy of binding was –8.55 kcal/mol and the energy from duplex formation was –13.90 kcal/mol. The opening energy for RP11-197K6.1 and the miR-125b-5p sequence was 5.14 kcal/mol and 0.21 kcal/mol, respectively. It should be noted that positions 1434 to 1442 in the RP11-197K6.1 sequence did not contain hairpin loops and stems as the functional area of RNA binding, although positions 1161 to 1180 in the RP11-197K6.1 sequence had similarities with 1434 to 1442 in the interaction free energy and the energy needed to open the existing structure in the RP11-197K6.1 sequence. Positions 1161 to 1180 in the RP11-197K6.1 sequence had hairpin loops as the functional area for RNA binding and stable stems (Fig. 7a).

The optimal secondary structure upon hybridization ranged from positions 172 to 189 in the RP11-400N13.3 sequence and from positions 2 to 20 in miR-24-3p, while the total free energy of binding was –10.26 kcal/mol and the energy from duplex formation was –18.35 kcal/mol. The opening energy for the RP11-400N13.3 and miR-24-3p sequences was 5.23 kcal/mol and 2.85 kcal/mol, respectively (Fig. 7d–f). The positions 172 to 189 in the RP11-400N13.3 sequence contained hairpin loops and stems, which performed the specific function of RNA binding.

3.8 The Expression of CDKN2A Decreases after Silencing RP11-197K6.1

Screening of key lncRNAs in colon cancer and the functional prediction of RP11-400N13.3 and RP11-197K6.1 has been completed. In order to further verify our prediction results, a target gene silencing experiment was conducted by siRNA transfection. The fluorescence localization of the transfection experiment showed that the transfection efficiency exceeded 70% (Fig. 8a). Western blot analysis showed that after silencing RP11-197K6.1, the expression of CDKN2A was significantly lower than in the control group, indicating that the upregulation of lncRNA promoted CDKN2A expression in the colon cancer tissues (Fig. 8d).

Fig. 8.

Experimental verification of RP11-197K6.1 regulation of CDKN2A expression. (a) The overlap of the fluorescent field and the blank field showed that most colon cancer cells were transfected by siRNA. (b) The designed siRNA sequences. (c) The expression of lncRNA RP11-197K6.1 after cell transfection. The data are presented as the mean ± SD, n = 3, **, p < 0.01. (d) The CDKN2A expression in CACO-2 cells transfected with and without siRNA. The data are presented as the mean ± SD, n = 3, **, p < 0.01.

4. Discussion

Previous studies demonstrated that cytoplasmic lncRNAs regulate cellular processes by modulating mRNA stability or translation [25]. In this study, two lncRNAs were identified: RP11-400N13.3 and RP11-197K6.1. They were differentially expressed in colon adenocarcinoma, and significantly associated with a poor survival rate in colon cancer patients. It was indicated that RP11-400N13.3 and RP11-197K6.1 acted as a sponge of miRNAs, and competitively regulate mRNA expression in colon cancer, according to the ceRNA hypothesis [26].

Based on the GEO dataset, we found that eight miRNA families were downregulated in colon cancer cells, which were probably responsible for the lncRNA functions. Among them, a reduction in miR-302a-3p, miR-302c-3p, miR-302d-3p, miR-125a-5p, and miR-363-3p was associated with poor overall survival of patients more than colon cancer [27, 28, 29, 30]. In addition, recent research suggests that changes in miR-125b-5p levels play an important role in the development of drug resistance in colon cancer cells. Overexpression of miR-125b-5p significantly sensitized cisplatin-resistant cells [31]. Increased expression of miR-24-3p predicts poor DFS and OS for colon adenocarcinoma patients, independently of currently used clinical pathological parameters for the prognosis of this human malignancy. Downregulation of miR-24-3p contributed to the development and progression of colon cancer and probably possesses a potential role in prognosis and therapy [11]. The above-mentioned studies indicate that these candidate miRNAs may be involved in colon cancer as tumor suppressors.

COX univariate regression and multivariate regression of the Kaplan–Meier curve showed that some target genes significantly correlate to the survival of colon cancer patients. However, the high expression of these genes did not relate to all the risk factors connected to their death, given that some expressed genes were protective factors for patients (hazard ratio <1). Among these target genes, high expressions of E2F3 and CDKN2A were positively correlated to patient death.

The miRcode web server identifies putative target sites based on seed complementarity and evolutionary conservation; however, the prediction results lacked specific elaboration on RNA structure and binding sites, whereby the functional characteristics of lncRNA compete with miRNA to bind to mRNA. Therefore, predicting the secondary structure of lncRNA is of great significance to verify its function. RNAfold Web Server was used to predict the optimal secondary structure of RNA and obtain the predicted RNA binding sites after submitting the predicted secondary structure. The binding energy was employed to evaluate the predicted interaction results. The results showed that the two lncRNAs were predicted to possess hairpins and stem-loop structures for binding with mRNA. Moreover, the predicted secondary structures both conform to the stability of the energy system and meet the theoretical requirements of RNA secondary functional structures. The characteristic areas of lncRNAs that act as miRNA sponges were predicted for further exploration of the functions of RP11-400N13.3 and RP11-197K6.1.

Thus, the verification process consisted of two parts. The first was regarding the cellular localization of the overall regulatory network, given that most of the lncRNAs in the cytoplasm play a role in the combination of competitive regulatory networks and microRNAs. The microRNAs inhibited by lncRNA binding and thereby indirectly affecting the mRNAs should be detected in the cytoplasm; the second relates to the verification of lncRNA and microRNA binding sites. The binding position of microRNA and lncRNA should meet the secondary structure of the lncRNA. The results of the biological experiments further prove that inhibiting the expression of RP11-197K6.1 resulted in the downregulation of the CDKN2A protein in colon cancer cells. It indicated that our screening methods for lncRNAs were highly accurate. However, whether the RP11-197K6.1/miR-125b-5p/CDKN2A axis has clinical value in the treatment of colon cancer still requires more in vivo experiments and further mechanism research.

5. Conclusions

Through our screening process, using colon adenocarcinoma as an example, we found that RP11-197K6.1 significantly reduced the overall survival rate of colon cancer patients by indirectly regulating the expression of the CDKN2A protein. Our work established a ceRNA network using bioinformatics followed by biological verification, to explore the potential mechanisms involved in recurrence and metastasis in colon cancer, and provided potential targets for drug development.

Availability of Data and Materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Author Contributions

XZ, JL and FM designed the research study. SX and YW performed the research. EX and LY provided help and advice acquired the data for the work. XZ and YW analyzed the data, and YW and XZ drafted the work wrote the manuscript. FM provided the project funding, accounted for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work, and final approved the version to be published. All authors contributed to editorial changes in the manuscript. All authors read and approved the final manuscript.

Ethics Approval and Consent to Participate

Not applicable.

Acknowledgment

We thank all members of the laboratory for useful discussions.

Funding

This research was funded by the National Natural Science Foundation of China for Young Scholars (No. 82103998); the Key Research and Development project of Liaoning (Liaoning R&D [2019]26).

Conflict of Interest

The authors declare no conflict of interest.

References
[1]
Eheman C, Henley SJ, Ballard-Barbash R, Jacobs EJ, Schymura MJ, Noone AM, et al. Annual Report to the Nation on the status of cancer, 1975–2008, featuring cancers associated with excess weight and lack of sufficient physical activity. Cancer. 2012; 118: 2338–2366.
[2]
Manoochehri H, Sheykhhasan M, Samadi P, Pourjafar M, Saidijam M. System biological and experimental validation of miRNAs target genes involved in colorectal cancer radiation response. Gene Reports. 2019; 17: 100540.
[3]
Al-Khayal K, Abdulla M, Al-Obeed O, Al Kattan W, Zubaidi A, Vaali-Mohammed MA, et al. Identification of the TP53-induced glycolysis and apoptosis regulator in various stages of colorectal cancer patients. Oncology Reports. 2016; 35: 1281–1286.
[4]
Pretzsch E, Bösch F, Neumann J, Ganschow P, Bazhin A, Guba M, et al. Mechanisms of Metastasis in Colorectal Cancer and Metastatic Organotropism: Hematogenous versus Peritoneal Spread. Journal of Oncology. 2019; 2019: 7407190.
[5]
Schmitt AM, Chang HY. Long Noncoding RNAs in Cancer Pathways. Cancer Cell. 2016; 29: 452–463.
[6]
Lyu L, Xiang W, Zhu JY, Huang T, Yuan JD, Zhang CH. Integrative analysis of the lncRNA-associated ceRNA network reveals lncRNAs as potential prognostic biomarkers in human muscle-invasive bladder cancer. Cancer Management and Research. 2019; 11: 6061–6077.
[7]
Wu SM, Liu H, Huang PJ, Chang IYF, Lee CC, Yang CY, et al. circlncRNAnet: an integrated web-based resource for mapping functional networks of long or circular forms of noncoding RNAs. GigaScience. 2018; 7: 1–10.
[8]
Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Research. 2017; 45: W98–W102.
[9]
Haeussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Research. 2019; 47: D853–D858.
[10]
Ma L, Cao J, Liu L, Du Q, Li Z, Zou D, et al. LncBook: a curated knowledgebase of human long non-coding RNAs. Nucleic Acids Research. 2019; 47: 2699.
[11]
Gao Y, Liu Y, Du L, Li J, Qu A, Zhang X, et al. Down-regulation of miR-24-3p in colorectal cancer is associated with malignant behavior. Medical Oncology. 2015; 32: 362.
[12]
Zhou F, Tang D, Xu Y, He H, Wu Y, Lin L, et al. Identification of microRNAs and their Endonucleolytic Cleavaged target mRNAs in colorectal cancer. BMC Cancer. 2020; 20: 242.
[13]
Jeggari A, Marks DS, Larsson E. miRcode: a map of putative microRNA target sites in the long non-coding transcriptome. Bioinformatics. 2012; 28: 2062–2063.
[14]
Huang HY, Lin YCD, Li J, Huang KY, Shrestha S, Hong HC, et al. miRTarBase 2020: updates to the experimentally validated microRNA-target interaction database. Nucleic Acids Research. 2020; 48: D148–D154.
[15]
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003; 13: 2498–2504.
[16]
Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nature Communications. 2019; 10: 1523.
[17]
Tang Y, Li M, Wang J, Pan Y, Wu FX. CytoNCA: a cytoscape plugin for centrality analysis and evaluation of protein interaction networks. Bio Systems. 2015; 127: 67–72.
[18]
Uhlen M, Zhang C, Lee S, Sjöstedt E, Fagerberg L, Bidkhori G, et al. A pathology atlas of the human cancer transcriptome. Science. 2017; 357: eaan2507.
[19]
Gruber AR, Lorenz R, Bernhart SH, Neuböck R, Hofacker IL. The Vienna RNA websuite. Nucleic Acids Research. 2008; 36: W70–W74.
[20]
Kozomara A, Birgaoanu M, Griffiths-Jones S. miRBase: from microRNA sequences to function. Nucleic Acids Research. 2019; 47: D155–D162.
[21]
Liu T, Zhang Q, Zhang J, Li C, Miao YR, Lei Q, et al. EVmiRNA: a database of miRNA profiling in extracellular vesicles. Nucleic Acids Research. 2019; 47: D89–D93.
[22]
Afanas’ev I. Reactive oxygen species signaling in cancer: comparison with aging. Aging and Disease. 2011; 2: 219–230.
[23]
Kumar B, Koul S, Khandrika L, Meacham RB, Koul HK. Oxidative stress is inherent in prostate cancer cells and is required for aggressive phenotype. Cancer Research. 2008; 68: 1777–1785.
[24]
Vaquero EC, Edderkaoui M, Pandol SJ, Gukovsky I, Gukovskaya AS. Reactive oxygen species produced by NAD(P)H oxidase inhibit apoptosis in pancreatic cancer cells. The Journal of Biological Chemistry. 2004; 279: 34643–34654.
[25]
Salmena L, Poliseno L, Tay Y, Kats L, Pandolfi PP. A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language? Cell. 2011; 146: 353–358.
[26]
Guttman M, Rinn JL. Modular regulatory principles of large non-coding RNAs. Nature. 2012; 482: 339–346.
[27]
Dong J, Geng J, Tan W. MiR-363-3p suppresses tumor growth and metastasis of colorectal cancer via targeting SphK2. Biomedicine & Pharmacotherapy. 2018; 105: 922–931.
[28]
Tong Z, Liu N, Lin L, Guo X, Yang D, Zhang Q. miR-125a-5p inhibits cell proliferation and induces apoptosis in colon cancer via targeting BCL2, BCL2L12 and MCL1. Biomedicine & Pharmacotherapy. 2015; 75: 129–136.
[29]
Wang J, Chen S. RACK1 promotes miR-302b/c/d-3p expression and inhibits CCNO expression to induce cell apoptosis in cervical squamous cell carcinoma. Cancer Cell International. 2020; 20: 385.
[30]
Yang C, Deng SP. Mechanism of hsa-miR-302a-3p-targeted VEGFA in the Inhibition of Proliferation of Gastric Cancer Cell. Journal of Sichuan University. Medical Science Edition. 2019; 50: 13–19.
[31]
Shi H, Li K, Feng J, Liu G, Feng Y, Zhang X. LncRNA-DANCR Interferes With miR-125b-5p/HK2 Axis to Desensitize Colon Cancer Cells to Cisplatin vis Activating Anaerobic Glycolysis. Frontiers in Oncology. 2020; 10: 1034.

Publisher’s Note: IMR Press stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share
Back to top