- Academic Editor
†These authors contributed equally.
Background: Bladder urothelial carcinoma (BLCA) is a malignancy with a
high incidence worldwide. One-third of patients may experience aggressive
progression later on, and 70% of patients who have undergone surgical
intervention will still suffer from metastasis. Materials and Methods:
RNA sequencing profiles of BLCA samples were obtained from The Cancer Genome
Atlas (TCGA) database. Differential expression and univariate Cox regression
analyses were performed to identify prognosis-related differentially expressed
immune genes (DEIGs). Subsequently, a proportional hazards model of DEIGs was
then constructed by univariate regression analysis. Differential expression and
correlation analyses, CIBERSORT, Single Sample Gene Set Enrichment Analysis
(ssGSEA), GSVA were conducted on transcription factors (TFs), immune
cells/pathways and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. The
regulation network was then constructed. Eventually, ATAC-seq, ChIP-seq,
scRNA-seq, and multiple online databases were employed for further validation.
Results: A proportional hazards model of 31 DEIGs was constructed and
risk score was calculated and proven to be a independent prognostic factor. Then
5 immune genes were characterized to be significantly correlated with bone
metastasis, stage and TF expression simultaneously. 4 TFs were identified to be
significantly correlated with prognosis and RBP7 expression. 5 immune
cells/pathways were revealed to be significantly correlated with RBP7 expression.
Only 1 KEGG pathway was identified to be significant in Gene Set Enrichment
Analysis (GSEA) and Gene Set Variation Analysis (GSVA) analyses. The regulatory
relationship was then constructed, in which the correlation between EBF1 and RBP7
(R = 0.677, p
Bladder cancer has the highest incidence among urologic malignancies in China and is the sixth most common cancer globally [1, 2]. One third of patients might have aggressive progression later, and 70% of patients who undergo surgical intervention will still experience metastasis [3]. The pathological type of most bladder cancer patients is urothelial carcinoma, and patients with the same pathological features usually have a considerable difference in prognostic outcome [4]. Its morbidity has risen from 2000 to 2011, and patients with metastasis were reported to have a higher mortality rate [1]. After radical cystectomy, nearly 50% of patients will still develop varying degrees of progression. Around 5–10% of patients have already metastasized at the time of diagnosis [5]. Bone metastasis occurs frequently in bladder urothelial carcinoma (BLCA), which has a relative incidence of 40% and only 6–9 months of median survival from diagnosis [6, 7]. Thus, there is still an unmet clinical need for further research to reveal the potential mechanism of bone metastasis in BLCA.
High throughput sequencing data analysis has been widely used for the identification of significant molecular and cellular biomarkers in the metastasis and prognosis of BLCA [8, 9, 10, 11, 12]. However, little previous research has focused on immune genes and relative networks in the bone metastasis of BLCA.
In this study, we identified differentially expressed immune genes (DEIGs) and TFs in patients with BLCA. To obtain valuable prognostic biomarkers, univariate Cox regression was applied. Based on the DEIGs, we constructed a proportional hazards model and validated it to have good prognosis-predictive performance. We employed CIBERSORT and Single Sample Gene Set Enrichment Analysis (ssGSEA) method to quantify tumor-infiltrating immune cells/pathways, which were then co-analyzed with the DEIGs. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis was carried out employing the in silico tools Gene Set Variation Analysis (GSVA) and Gene Set Enrichment Analysis (GSEA). Subsequently, KEGG signaling pathways with statistically significant correlation relationships were selected. Assay for Transposase Accessible Chromatin with high-throughput sequencing (ATAC-seq), Chromatin Immunoprecipitation sequencing (ChIP-seq), single-cell RNA sequencing (scRNA-seq), and multiple online databases were utilized for external validation. Taken together, we proposed a scientific hypothesis for bone metastasis in BLCA.
RNA sequencing profiles of 408 BLCA samples were obtained from The Cancer Genome Atlas (TCGA) database (https://www.cancer.gov/tcga), and 209 BLCA samples with complete demographic and prognostic characteristics, especially bone metastasis information integrity, were eventually selected. Among the 209 BLCA samples, 11 had bone metastasis. Additionally, information on cancer-related transcription factors (TFs) was accessed through the Cistrome database (http://cistrome.org) [13]. In addition, 2498 immune-related genes and their gene signatures were retrieved from the ImmPort database (https://immport.org) [14] and the Molecular Signatures Database (MSigDB) v7.0 (https://www.gsea-msigdb.org) [15], respectively. What is more, the single-cell RNA sequencing profiles of the tumor sample, as well as CD4 and CD8 samples of BLCA patients were downloaded from the Gene Expression Omnibus database (https://www.ncbi.nlm.nih.gov/geo/) (accession no. GSE164046, no. GSE149652).
Differentially expressed genes (DEGs) were identified between bone-metastatic
and non-metastatic samples with the cutoff value of false discovery rate (FDR)
Using univariate and multivariate Cox regression analyses, DEIGs with statistical significance were included in the proportional hazards model. Risk score for prognosis was calculated according to the following formula:
In the formula, “i” stands for the number of the patient, “n” stands for the
number of significant DEIGs included in the model, and “
Differentially expressed TFs were identified by the edgeR package. FDR was set
to be less than 0.05 while
Differential enrichment of tumor infiltrating immune cells/pathways in BLCA was evaluated using the CIBERSORT and ssGSEA tools. The correlation between key DEIGs and immune cells/pathways was evaluated by Pearson/Spearman correlation coefficient. In addition, GSVA was applied to score KEGG pathways in each sample. To explore the potential KEGG pathways correlated with prognosis and metastasis in BLCA patients, Pearson/Spearman correlation analysis and univariate Cox regression were conducted. Moreover, the Venn diagram was plotted using the VennDiagram package [17].
Focusing on the immune genes, TFs and downstream immune cells/pathways and KEGG pathways were identified which were involved in the bone metastasis in BLCA. Therefore, we constructed a regulatory network to further elucidate the mechanism of bone metastasis in BLCA by exploiting the Cytoscape package [18].
To further validate the transcriptional regulation pattern between EBF1 and RBP7, data related to EBF1 acquired from chromatin immunoprecipitation sequencing (ChIP-seq) of in vitro human cell lines were downloaded from Gene Expression Omnibus (GEO) (GEO assession: samples GSM1958038 and GSM1958039 from GSE75503, GSM935375 from GSE31477, and GSM803386 from GSE32465) [19, 20]. ENCODE Transcription Factor Targets [21] and JASPAR Predicted Transcription Factor Targets [22] algorithms were then applied to re-predict the transcriptional regulation pattern between EBF1 and RBP7. Additionally, ATAC-seq data of BLCA samples were obtained from the TGCA project of chromatin accessibility landscape of primary human cancers (https://gdc.cancer.gov/about-data/publications/ATACseq-AWG), which were then utilized to explore the chromatin accessibility in specific locations of EBF1 and RBP7. What is more, the scRNA-seq profiles of BLCA samples from GEO were obtained to successively go through quality control, normalization, cell cycle assignment, feature selection, principal component analysis (PCA), Uniform Manifold Approximation and Projection (UMAP)/t-distributed Stochastic Neighbor Embedding (t-SNE) for dimensionality reduction and visualization, unsupervised clustering, differential expression analysis, and cell communication analysis.
Multiple online databases were used to access the external validation of our scientific hypothesis. CellMarkers (http://biocc.hrbmu.edu.cn/CellMarker/) [23] and PathCards (https://pathcards.genecards.org/) [24] were applied to find biomarkers related to immune genes/pathways and KEGG pathways. What is more, UALCAN (http://ualcan.path.uab.edu/index.html) [25], GEPIA (http://gepia.cancer-pku.cn/) [16], LinkedOmics (http://linkedomics.org/) [26], cBioportal (https://www.cbioportal.org/) [27], TISIDB (http://cis.hku.hk/TISIDB/) [28], UCSC xena (https://xena.ucsc.edu/) [29], OncoLnc (http://www.oncolnc.org/) [30], and PROGgeneV2 (http://www.progtools.net/gene/) [31] were employed for the external validation for our scientific hypothesis.
Survival profiles were analyzed by Kaplan–Meier survival analysis. Additionally, correlation analysis were performed using Pearson/Spearman correlation analysis. In addition, Biomarkers with prognostic value were distinguished by Cox univariate and multivariate analyses. All bioinformatic and statistical analyses were implemented with R software (Version 3.6.1; Institute for Statistics and Mathematics, Vienna, Austria; https://www.r-project.org). A significance level of a two-sided p value less than 0.05 was adopted.
The analysis procedure of this study was demonstrated in Fig. 1. Clinical information (Table 1) and RNA-seq data of 209 BLCA samples were accessed through TCGA database. Between bone-metastatic and non-metastatic samples, 1090 DEGs were identified (Fig. 2A,B). Additionally, GO and KEGG enrichment analyses, visualized as bubble plots, were performed to uncover that these DEGs actively participated in pathways such as axonogenesis, metal ion transmembrane transporter activity, and neuroactive ligand-receptor interaction which might play a role in the bone metastasis of BLCA patients (Supplementary Fig. 1A,B). In addition, using differential analysis, 86 of 1090 DEGs were found in the list of 2498 immune genes, which were defined as DEIGs and displayed in a heatmap plot (Fig. 2C) and a volcano plot (Fig. 2D). Moreover, in univariate Cox regression analysis, 31 DEIGs were exhibited to have a significant association with overall survival, in which 29 risk factors and 2 protective factors were identified in a forest plot (Fig. 2E). What is more, the differences of disease free survival between low and high expression of the 31 DEIGs were displayed in survival plots (Supplementary Fig. 2).
The analysis flowchart.
Variables | Total patients (N = 209) | |
Age, years | ||
Mean |
67.14 | |
Median (Range) | 67 (60–75) | |
Gender | ||
Male | 160 (76.56%) | |
Female | 49 (23.44%) | |
Race | ||
Asian | 39 (18.66%) | |
Black or African American | 10 (4.78%) | |
White | 148 (70.81%) | |
Not Reported | 12 (5.74%) | |
Stage | ||
I | 3 (1.44%) | |
II | 78 (37.32%) | |
III | 70 (33.49%) | |
IV | 58 (27.75%) | |
M | ||
M0 | 198 (94.74%) | |
M1 | 11 (5.26%) | |
N | ||
N0 | 133 (63.64%) | |
N1 | 21 (10.05%) | |
N2 | 29 (13.88%) | |
N3 | 3 (1.44%) | |
NX | 22 (10.53%) |
Abbreviation: SD, Standard deviation; M, bone metastasis; N, lymph node metastasis.
Differential gene expression analysis and univariate
Cox proportional hazards model. (A) Heatmap demonstrated DEGs between
bone-metastatic and non-metastatic BLCA samples. (B) Volcano plot showed
over-expressed and under-expressed DEGs with p
Based on the 31 prognosis-related DEIGs, we constructed a multivariate
prognostic prediction model. The receiver operating characteristic (ROC) curve of
the model was exhibited in Fig. 3A, in which the AUC was 0.793, indicating a good
predictive value. Sub-group survival analysis was conducted
according to the median risk score, displaying a significant difference
(p = 1.04
Construction of a multivariate prognostic model. (A) ROC curve
revealed a good prediction reliability (AUC = 0.793). (B) Kaplan-Meier survival
curve showed a significant difference (p = 1.04
Independent prognostic analysis. (A) Univariate Cox
regression analysis with age, gender, M, stage, and risk score. (B) Multivariate
Cox regression analysis with age, gender, M, stage, and risk score. (C) Heatmap
visualized differences of censor, age, gender, grade, TNM stage, clinical stage,
and bone metastasis status between low risk and high risk groups in 196 TARGET
patients (*: p
8 differentially expressed TFs (
Identification of differentially expressed TFs and
co-expression analysis. (A) Heatmap revealed differentially expressed TFs
between metastatic and non-metastatic BLCA samples. (B) Volcano plot showed
over-expressed and under-expressed TFs with p
For the purpose of exploring the potential signaling pathways related to bone
metastasis and BLCA prognosis, CIBERSORT, ssGSEA, and GSVA algorithms were
utilized. The correlation between expression of immune cells/pathways and RBP7
expression was analyzed by Pearson’s correlation (Supplementary Fig.
4A). Moreover, 5 immune cells/pathways were identified to be
statistically significant (Supplementary Fig. 4B), including macrophages
M0 (R = 0.212, p value = 0.007), Type 2 helper T (Th2) cells (R = 0.23,
p value
Using the GSEA, 23 metastasis-related KEGG pathways were identified. Thus, as illustrated in the Venn plot (Fig. 6A), the oocyte meiosis pathway was identified to be the key KEGG pathway both significant in GSEA and GSVA co-expression (Fig. 6B). What is more, the correlation relationship between the oocyte meiosis pathway and RBP7 (R = 0.14, p = 0.042) was shown in Fig. 6C. Besides, GSEA result of the oocyte meiosis was demonstrated by the enrichment plot in Fig. 6D.
Co-expression and correlation analysis of the KEGG oocyte meiosis pathway. (A) Venn plot for the KEGG pathway selected using GSVA and GSEA algorithms. (B) GSEA analysis for the oocyte meiosis pathway. (C) RBP7 expression was positively correlated with the oocyte meiosis pathway. (D) GSEA analysis showed a positive correlation between the oocyte meiosis pathway and RBP7 expression. Abbreviation: KEGG, Kyoto Encyclopedia of Genes and Genomes; GSVA, Gene-Set Variation Analysis; GSEA, Gene-Set Enrichment Analysis.
The regulatory network based on TFs, RBP7, immune genes/pathways and KEGG pathways was constructed in Fig. 7A. Taken together, we proposed that the EBF1-RBP7 axis (representing TF-immune gene relationship) played an important role in bone metastasis of BLCA through Th2 cells and the oocyte meiosis pathway.
Construction of the regulatory network. (A) Regulatory network
based on TFs transcription factors, RBP7, the 5 filtered immune cells/pathways,
and 9 filtered KEGG pathways. (B) Correlation analysis between EBF1 and RBP7 (R =
0.71, p
ATAC-seq analysis was performed to explore the correlation
between EBF1 and RBP7. Open chromatin loci on different chromosomes were
displayed in Supplementary Fig. 6A. Additionally, the venn pie and
upsetplot showed the distribution and intersection of different pick types such
as genic, intergenic, intron, and exon in Supplementary Fig. 6B.
Moreover, the distribution of binding loci relative to TSS was visualized in
Supplementary Fig. 6C. Additionally, the correlation between
EBF1 and RBP7 was analyzed, which turned out that the
expression of EBF1 was positively correlated with RBP7 (R =
0.71, p
To further illustrate the co-expression patterns between EBF1 and RBP7, ENCODE and JASPAR algorithms were utilized. In EBF1 ChIP-seq data of activated GM12878, LCL and Mutul cell line (homo sapiens), binding peaks were all found in RBP7 sequences (Fig. 8A), and 2 motifs of EBF1 were identified (Fig. 8B,C). Consequently, the transcriptional regulation pattern between EBF1 and RBP7 could be determined.
The results of ChIP-seq validation. In EBF1 ChIP-seq data of activated GM12878, LCL and Mutul cell lines (homo sapiens), binding peaks were all found in RBP7 sequences (A). Two EBF1-binding motifs (B,C).
Furthermore, in the scRNA-seq analysis of BLCA tumor cell lines, 12 seurat clusters and bulk labels were shown in t-SNE dimension reduction analysis (Fig. 9A). Additionally, the top 5 marker genes of each cluster was demonstrated in a heatmap (Fig. 9B). In addition, up- and down-regulated marker genes of each cluster were revealed in a point plot (Fig. 9C). More importantly, the distribution of 5 marker genes expression including EBF1, RBP7, CD24, MKI67, and CD44 was illustrated in Fig. 9D, which indicated that EBF1 and PBP7 had a co-expression relationship in clusters 0, 1, 2, 3 and 4, and they might also be co-expressed with tumor stemness and proliferation markers CD24, MKI67, and CD44. What’r more, the distribution of different stages in a cell cycle and relevant scores were visualized in Fig. 9E, which suggested that BLCA cells with high EBF1 and RBP7 expression had a relative high fraction of stages G2M and S cells, including clusters 2, 3 and 4. Last but not least, the cell communication among different clusters presented that clusters 1, 2 and 4 had the strongest interaction with other clusters (Fig. 9F). Moreover, in the scRNA-seq analysis of BLCA CD4 and CD8 T cell lines (Supplementary Fig. 7), a co-expression relationship between EBF1 and RBP7 was also identified.
Single-cell RNA sequencing of BLCA tumor cell lines. (A) t-SNE dimension reduction analysis showed 12 seurat clusters and bulk labels in BLCA tumor cell lines. (B) Top 5 marker genes of each cluster in a heatmap. (C) Marker genes of each cluster in a point plot. (D) The distribution of 5 marker genes including EBF1 and RBP7. (E) UMAP dimension reduction analysis showing the distribution of different stages in a cell cycle as G1, G2M and S, along with G2M.score and S.score in violin plot. (F) Ligand-receptors plot among 12 clusters.
In order to access the external validation of our scientific hypothesis, multiple online databases were employed. Three biomarkers of Th2 cells (IL1RL1, GATA3, and IL33) and the top five genes related to the oocyte meiosis pathway (ADCY3, ADCY5, ADCY9, MAPK3, and IGF1) were presented using the CellMarker and PathCards databases, respectively. All the external validation results are summarized in Table 2.
EBF1 | RBP7 | IL1RL1 | GATA3 | IL33 | ADCY3 | ADCY5 | ADCY9 | MAPK3 | IGF1 | |
UALCAN | OS: p = 0.300 | OS: p = 0.490 | OS: p = 0.130 | OS: p = 0.096 | OS: p = 0.240 | OS: p = 0.021 | OS: p = 0.920 | OS: p = 0.062 | OS: p = 0.047 | OS: p = 0.092 |
Expression: p |
Expression: p = 0.004 | Expression: p = 0.889 | Expression: p = 0.029 | Expression: p |
Expression: p = 0.466 | Expression: p |
Expression: p |
Expression: p = 0.618 | Expression: p | |
GEPIA | OS: p = 0.014 | OS: p = 0.047 | OS: p = 0.019 | OS: p = 0.012 | OS: p = 0.026 | OS: p = 0.027 | OS: p = 0.006 | OS: p = 0.037 | OS: p = 0.006 | OS: p = 0.003 |
Correlation: R = 0.580, p |
Correlation: R |
Correlation: R = –0.150, p = 0.002 | Correlation: R = 0.017, p = 0.730 | Correlation: R = 0.130, p = 0.008 | Correlation: R = 0.071, p = 0.140 | Correlation: R = 0.054, p = 0.270 | Correlation: R = 0.097, p = 0.046 | Correlation: R = 0.340, p | ||
LinkedOmics | OS: p = 0.060 | OS: p = 0.008 | OS: p = 0.432 | OS: p = 0.005 | OS: p = 0.224 | OS: p = 0.008 | OS: p = 0.066 | OS: p = 0.004 | OS: p = 0.004 | OS: p = 0.002 |
Correlation: R = 0.546, p |
Correlation: R = 0.108, p = 0.030 | Correlation: R = –0.170, p |
Correlation: R = 0.137, p = 0.006 | Correlation: R = 0.224, p |
Correlation: R = 0.302, p |
Correlation: R = 0.265, p |
Correlation: R = 0.221, p |
Correlation: R = 0.421, p | ||
TISIDB | OS: p = 0.111 | OS: p = 0.086 | OS: p = 0.173 | OS: p = 0.026 | OS: p = 0.720 | OS: p = 0.006 | OS: p = 0.071 | OS: p = 0.028 | OS: p = 0.017 | OS: p = 0.012 |
cBioportal | Correlation: R = 0.547, p |
Correlation: R = 0.110, p = 0.026 | Correlation: R = –0.172, p |
Correlation: R = 0.138, p = 0.005 | Correlation: R = 0.224, p |
Correlation: R = 0.301, p |
Correlation: R = 0.266, p |
Correlation: R = 0.222, p |
Correlation: R = 0.421, p | |
UCSC Xena | OS: p = 0.007 | OS: p = 0.086 | OS: p |
OS: p = 0.013 | OS: p = 0.004 | OS: p = 0.004 | OS: p = 0.019 | OS: p = 0.020 | OS: p = 0.173 | OS: p = 0.003 |
OncoLnc | OS: p = 0.014 | OS: p = 0.031 | OS: p = 0.115 | OS: p = 0.023 | OS: p = 0.017 | OS: p = 0.038 | OS: p = 0.020 | OS: p = 0.032 | OS: p = 0.043 | OS: p |
PROGgeneV2 | OS: p = 0.070 | OS: p = 0.034 | OS: p = 0.068 | OS: p = 0.138 | OS: p = 0.156 | OS: p = 0.460 | OS: p = 0.092 | OS: p = 0.007 | OS: p = 0.125 | OS: p = 0.006 |
In UALCAN database validation (Supplementary Figs. 8,9), high ADCY3 (Supplementary Fig. 8F) and MAPK3 (Supplementary Fig. 8I) expression levels were characterized as being significantly correlated with worse BLCA patients’ prognosis. Additionally, expression levels of EBF1 (Supplementary Fig. 9A), RBP7 (Supplementary Fig. 9B), IL33 (Supplementary Fig. 9E), ADCY5 (Supplementary Fig. 9G), ADCY9 (Supplementary Fig. 9H), and IGF1 (Supplementary Fig. 9J) were identified to be elevated in normal tissues compared to BLCA primary tumor tissues, while GATA3 expression was escalated in BLCA primary tumor tissues (Supplementary Fig. 9D).
In GEPIA database validation (Supplementary Figs. 10,11), it was discovered that elevated EBF1, RBP7, IL1RL1, IL33, ADCY3, ADCY5, ADCY9, MAPK3, and IGF1 (Supplementary Fig. 10A–H,J) expression levels were associated with unfavorable prognoses in BLCA patients, while over-expression of GATA3 (Supplementary Fig. 10I) indicated better overall survival. In addition, ADCY3 (Supplementary Fig. 11A), IGF1 (Supplementary Fig. 11B), and EBF1 (Supplementary Fig. 11G) expression were recognized to be positively correlated with RBP7 expression, while GATA3 (Supplementary Fig. 11I) expression was negatively correlated with RBP7 expression.
In LinkedOmics database validation (Supplementary Figs. 12,13), it was shown that escalated expression of ADCY3 (Supplementary Fig. 12A), IGF1 (Supplementary Fig. 12B), ADCY9 (Supplementary Fig. 12E), MAPK3 (Supplementary Fig. 12H), and RBP7 (Supplementary Fig. 12J) were significantly correlated with poor BLCA patients’ prognosis, while over-expressed GATA3 (Supplementary Fig. 12I) suggested good prognosis. Moreover, ADCY3, IGF1, ADCY5, IL1RL1, ADCY9, IL33, EBF1, MAPK3, and GATA3 expression (Supplementary Fig. 13A–H) were illustrated to be positively correlated with RBP7 expression, while GATA3 (Supplementary Fig. 13I) expression was negatively correlated with RBP7 expression.
In TISIDB database validation (Supplementary Fig. 14), up-regulation of ADCY3 (Supplementary Fig. 14A), IGF1 (Supplementary Fig. 14B), ADCY9 (Supplementary Fig. 14E), and MAPK3 (Supplementary Fig. 14H) were significantly linked to worse BLCA patients’ prognosis, while up-regulated GATA3 (Supplementary Fig. 14I) expression was significantly related to better prognosis of BLCA patients.
In cBioportal database validation (Supplementary Fig. 15), ADCY3, IGF1, ADCY5, IL1RL1, ADCY9, IL33, EBF1, and MAPK3 (Supplementary Fig. 15A–H) expression were positively correlated with RBP7 expression, while expression of GATA3 (Supplementary Fig. 15I) was negatively associated with expression of RBP7.
In UCSC Xena database validation (Supplementary Fig. 16), down-regulation of ADCY3, IGF1, ADCY5, IL1RL1, ADCY9, IL33, and EBF1 (Supplementary Fig. 16A–G) were significantly correlated with better BLCA patients’ prognosis, while down-regulated GATA3 (Supplementary Fig. 16I) expression was significantly correlated with better BLCA patients’ prognosis.
In OncoLnc database validation (Supplementary Fig. 17), lower expression of ADCY3, IGF1, ADCY5, IL1RL1, ADCY9, IL33, EBF1, and RBP7 (Supplementary Fig. 17A–G,J) were revealed to be significantly correlated with favorable prognosis of BLCA patients, while down-regulated GATA3 (Supplementary Fig. 17I) expression was significantly correlated with favorable prognosis of BLCA patients.
In PROGgeneV2 database validation (Supplementary Figs. 18,19), low-expressed IGF1 (Supplementary Fig. 18B), ADCY9 (Supplementary Fig. 18E) and RBP7 (Supplementary Fig. 18J) were demonstrated to be significantly associated with better BLCA patients’ prognosis. What is more, down-regulated combined gene expression of EBF1, RBP7, IL1RL1, GATA3, IL33, ADCY3, ADCY5, ADCY9, MAPK3, and IGF1 was significantly correlated with favorable prognosis of BLCA patients (Supplementary Fig. 19).
BLCA is a common cancer globally, requiring further investigation since patients with metastatic lesions still lack good prognosis and effective treatments. Around 25% of new patients are diagnosed with either muscle invasiveness or metastasis [32]. Locally advanced and metastatic cases usually respond poorly to current treatments and have poorer survival outcomes [33, 34]. High-throughput technologies have been widely used to explore the molecular mechanisms of BLCA. Several biomarkers with different clinical utilities have been identified, and some of them are already capable of capturing prominent tumors and their molecular biology characteristics [35]. Immune genes have been identified as prognostic biomarkers in BLCA patients, which provide new opportunities to explore the bone-metastatic mechanisms in BLCA [36].
In this study, we identified 31 DEIGs with prognostic value in bone metastatic
BLCA samples. Based on the 31 prognosis-related DEIGs, we developed and validated a proportional hazards model with a high effectiveness in Kaplan-Meier survival
analysis (p
As a member of early B-cell factors (EBFs) family, EBF1 was reported to play a pivotal role in the development and differentiation of B cell [37]. Genetic alteration in EBF1 was found in many B-lineage acute leukemia patients [38]. Recent research have revealed that several clinical features of acute lymphoblastic leukemia, such as pathogenesis, are related with the mutation of EBF1 [39]. Additionally, genomic loss of EBF1 has been reported in solid tumors such as breast cancer and pancreatic cancer [39, 40]. In this study, we found that EBF1 was low-expressed in BLCA patients, and the difference in expression levels between normal and tumor tissue was statistically significant. Besides, high-expressed EBF1 in BLCA predicts a worse survival outlook. It is reasonable to believe that the dysfunction of EBF1 can clarify the mechanism of tumorigenesis and bone metastasis in BLCA to some extent.
It has been documented that gene-based signatures related to immunology have
significant prognostic value in patients diagnosed with papillary thyroid cancer
and nonsquamous non–small cell lung cancer [41, 42]. Retinol binding protein 7
(RBP7), is a member of the cellular retinol-binding protein (CRBP) family, whose
members are required for vitamin A stability and metabolism [43, 44]. More
importantly, RBP7 has been reported by several recent studies to be a part of
immune-related gene signatures to indicate clinical significance and predict
prognosis of BLCA patients [36, 45, 46]. What is more, it was revealed that RBP7
was a significant biomarker whose high expression indicated poor prognosis of
colon cancer patients, as well as being related to tumor invasion and epithelial
mesenchymal transition [47]. Additionally, RBP7 was also discovered to be an
immune-related gene signature in gastric cancer [48]. In this study, we used
bioinformatics analysis and identified that RBP7 was statistically correlated
with stage, bone metastasis, and overall survival in BLCA patients, which may
provide us with a better understanding of bone metastasis in BLCA. Although no
previous study has reported the relationship between EBF1 and RBP7, Pearson
correlation analysis demonstrated that they are significantly correlated with
each other (R = 0.677, p value
An increase in bladder cancer metastasis has been confirmed both in humans and mice by previous study [36]. Additionally, Th2 cells can also function as a promote factor in tumor growth [49]. In addition, aberrant high levels of cytokines were found within the immune microenvironment mediated by Th2 cells in several cancer types [50]. Moreover, one of the cytokines, IL-10, was correlated with the low efficacy of immunotherapy of Bacillus Calmette Guérin (BCG) in bladder cancer [51]. Employing CIBERSORT, ssGSEA and Pearson/Spearman correlation analysis, Th2 cells were identified to be the immune cells which were actively involved in bone metastasis in BLCA. What is more, the biomarkers of Th2 cells, including IL1RL1, GATA3, and IL33 were also tested for the prognostic values by multiple online databases.
For further exploration of the mechanism of bone metastasis in BLCA, we performed GSVA and GSEA to find metastasis-related KEGG pathways, in which the oocyte meiosis pathway was identified. The oocyte meiosis pathway was also significantly correlated with RBP7 expression using Pearson/Spearman correlation analysis. Previous studies have demonstrated that the oocyte meiosis pathway might be related to BLCA development and bone metastasis [52, 53]. Additionally, multiple online databases validated that biomarkers of the oocyte meiosis pathway, including ADCY3, ADCY5, ADCY9, MAPK3 and IGF1, were significantly correlated with bone metastasis and prognosis in BLCA patients. However, only in silico analysis was performed in this study, further validation of our hypothesis demands supplementary experiments. Unfortunately, for lack of corresponding clinical samples and with limited experimental conditions, we couldn’t perform supplementary experiments to further validate our hypothesis immediately. Nevertheless, multi-omics data analysis was utilized to make up for it. More importantly, we are now working with several clinical departments to collect corresponding clinical samples, and it is believed that relevant experiments for the validation of our findings will soon be performed in our follow-up research.
Taken together, we applied an integrated bioinformatics analysis to identify DEIGs and related pathways involved in bone metastases of BLCA and improved our understanding of the molecular mechanisms. We proposed that the EBF1-RBP7 relationship has potential importance in bone metastatic BLCA through Th2 cells and the oocyte meiosis pathway.
The datasets generated and/or analysed during the current study are available in the TCGA portal (https://portal.gdc.cancer.gov).
Conception/design—YL, MF, SX, PH, MZ, XZ, HZ, JunZ, LD, RH, JieZ, ZH. Collection and/orassembly of data—YL, MF, SX, PH, MZ, XZ, ML, WM, DH, RH, JieZ, ZH. Data analysis and interpretation—YL, MF, SX, HZ, JunZ, LD, ML, WM, DH, RH, JieZ, ZH. Manuscript writing—YL, MF, SX, XZ, HZ, JunZ, ML, WM, DH, RH, JieZ, ZH. Final approval of manuscript—YL, MF, SX, PH, MZ, LD, ML, WM, DH, RH, JieZ, ZH. All authors have participated sufficiently in the work and agreed to be accountable for all aspects of the work.
The study was approved by the Ethics Committee of First Affiliated Hospital of Zhengzhou University (No. 2021-KY-190e).
We appreciate the The Cancer Genome Atlas (TCGA) team for providing an open access resource for our study.
This study was supported in part by the Shanghai Rising-Star Program (Sailing Special Program) (No. 23YF1458400); the Shanghai Municipal Health Commission (No.201940306); Henan medical science and technology research project (No. 201602031); Key project of provincial and ministerial co-construction of Henan Medical Science and Technology (No. SBGJ202002031). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The authors declare no conflict of interest.
Publisher’s Note: IMR Press stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.