- Academic Editors
†These authors contributed equally.
Background: Metabolic reprogramming provides a new perspective for understanding cancer. The targeting of dysregulated metabolic pathways may help to reprogram the immune status of the tumor microenvironment (TME), thereby increasing the effectiveness of immune checkpoint therapy. Colorectal cancer (CRC), especially colon adenocarcinoma (COAD), is associated with poor patient survival. The aim of the present study was to identify novel pathways involved in the development and prognosis of COAD, and to explore whether these pathways could be used as targets to improve the efficacy of immunotherapy. Methods: Metabolism-related differentially expressed genes (MRDEGs) between tumor and normal tissues were identified using The Cancer Genome Atlas (TCGA) dataset, together with metabolism-related prognostic genes (MRPGs). Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was performed separately for the MRDEGs and MRPGs. Gene Set Variation Analysis (GSVA) was also performed to explore the role of purine metabolism in COAD tumorigenesis. Consensus clustering of purine metabolism genes with the overall survival (OS) of patients and with anti-tumor immunity was also performed. Pearson correlation analysis was used to identify potential targets that correlated strongly with the expression of immune checkpoints. Results: A 6-gene signature that had independent prognostic significance for COAD was identified, together with a predictive model for risk stratification and prognosis. The most significantly enriched pathway amongst MRDEGs and MRPGs was purine metabolism. Differentially expressed purine metabolism genes could divide patients into two clusters with distinct prognosis and anti-tumor immunity. Further analysis suggested that purine metabolism was involved in anti-tumor immunity. Conclusions: This study confirmed the importance of metabolism-related pathways and in particular purine metabolism in the tumorigenesis, prognosis and anti-tumor immunity of COAD. We identified a 6-gene prognostic signature comprised of EPHX2, GPX3, PTGDS, NAT2, ACOX1 and CPT2. In addition, four potential immune-metabolic checkpoints (GUCY1A1, GUCY1B1, PDE1A and PDE5A) were identified, which could be used to improve the efficacy of immunotherapy in COAD.
Colon cancer has the third highest incidence of all malignant tumors, with approximately one million new cases diagnosed worldwide in 2018 and 551,269 patients dying from this disease [1]. Colon adenocarcinoma (COAD) is the most common type of colon cancer. A better understanding of the mechanism of COAD malignancies is urgently needed to help improve therapy.
Metabolic reprogramming is a critical hallmark of human cancer and is an adaptive response to the hypoxic and hypo-nutrient conditions of the tumor microenvironment (TME) [2, 3]. Several reprogrammed metabolic pathways in cancer cells have attracted considerable attention. In particular, enhanced aerobic glycolysis, also known as the Warburg effect, is a well-known alternative metabolic pathway in tumor cells [4]. The Kyoto Encyclopedia of Genes and Genomes (KEGG) database contains approximately 50 metabolism pathways. However, few comprehensive analyses have been conducted to identify the metabolic pathways involved in the poor prognosis of COAD.
There are several ways in which metabolic reprogramming may contribute to the tumor development and progression, including tumor immunity [5, 6, 7, 8]. Tumor cells adjust their metabolism to maintain continuous proliferation. Tumor tissue is therefore very different from normal tissue in terms of nutrient intake and waste accumulation. This can impact the presentation and recognition of antigens and impose metabolic stress on infiltrating immune cells, resulting in immune escape [9, 10, 11, 12]. Cancer immunotherapy is a major breakthrough in tumor treatment. However, many patients show no response to such therapy, possibly due to abnormal metabolic reprogramming of the immunosuppressive TME, resulting in the regression of the antitumor immune response [13]. Therefore, the targeting of dysregulated metabolism pathways that affect anti-tumor immunity may help to reprogram the immune status of the TME, thereby increasing the effectiveness of immune checkpoint therapy.
The aim of the present study was to identify potential pathways involved in the development and prognosis of COAD, and to explore whether these can be targeted to improve the efficacy of immunotherapy.
mRNA expression profiles and corresponding clinical information from The Cancer
Genome Atlas (TCGA) (https://portal.gdc.cancer.gov/) COAD dataset were used as
the discovery cohort in this study. Eligible samples met the following criteria:
(1) histologically confirmed COAD; (2) mRNA expression profile data and clinical
information were both available; and (3) only samples with a patient survival
time
This study was approved by the Institutional Ethics Review Board of the Third People’s Hospital of Chengdu and conducted following the Chinese ethical guidelines for human genome/gene research.
The R package “limma” was used to perform differentiation analysis of
metabolism-related gene expression by comparing tumor and normal samples. The
thresholds for DEGs were
Univariate analysis and least absolute shrinkage and selector operation (LASSO)
Cox regression analysis were performed using the R packages “glmnet” and
“survival”. These were used to identify metabolism-related prognostic gene
signatures and included only those genes with p
Risk score =
The median value of the risk scores was set as the cutoff point, and subsequently used to divide patients into high- and low-risk groups. Survival analysis was performed using Kaplan-Meier (K-M) survival curves. Discrimination between the outcomes for observations and predictions were performed using the area under the curve (AUC) of ROC in the package “survivalROC”. The AUC value ranged 0.5 to 1.0, with 0.5 indicating a random probability and 1.0 indicating perfect predictive ability.
KEGG analysis (p-value
Tissues were fixed in 4% PFA at room temperature, embedded in paraffin and
sectioned at 3 µm, then incubated with H
The software package R 3.6.0 (R Foundation for Statistical Computing, Vienna,
Austria) was used for statistical analyses. Groupwise comparisons were performed
by Wilcox-ranked sum testing. Pearson correlation analysis was performed to
identify metabolism-related prognostic genes (MRPGs) and potential
immune-metabolic checkpoints. The overall survival (OS) rate of patients was
estimated using K-M analysis, and the log-rank test was used to evaluate
differences between groups. p
Data for 945 metabolism-related genes was extracted from the KEGG dataset for
the differential expression analysis. Using the criteria of adjusted p
Construction of a metabolism-related prognostic signature for COAD. (A) Heatmap showing the expression of MRDEGs. (B) Forest plots of univariate analysis for OS in patients from the TCGA database. (C) LASSO coefficient profile for the common genes. (D) Cross-validation for turning parameter screening in the LASSO regression model. (E) Kaplan-Meier survival curves for patients from the TCGA dataset stratified according to their risk score. (F) Distribution of the risk scores and survival times for TCGA patients. (G,H) Receiver operating characteristic (ROC) curve analyses for the prediction of 3- and 5-year OS according to the risk score and other clinical features. COAD, colon adenocarcinoma; MRDEGs, metabolism-related differentially expressed genes; TCGA, The Cancer Genome Atlas; LASSO, least absolute shrinkage and selector operation; OS, overall survival.
To determine whether the 6-gene signature was an independent prognostic factor for COAD patients, the expression of these genes was determined using publicly available transcriptome data (TCGA). The expression of EPHX2, GPX3, NAT2, PTGDS, ACOX1, and CPT2 were lower in COAD tumor tissue than in normal tissues (Supplementary Fig. 1A–F). K-M survival analysis showed that three genes were positively correlated with the OS of COAD patients (Supplementary Fig. 1G–I). Patients with high CPT2, EPHX2 and NAT2 expression levels showed better OS. Additional independent cancer datasets on immunotherapy for melanoma (GSE91061, GSE78220) were used to evaluate the predictive value of these genes for sensitivity to immunotherapy. Patients with higher expression of PTGDS were more sensitive to immunotherapy (Supplementary Fig. 1J). Patients with high CPT2 and NAT2 expression also showed significantly better response to immunotherapy than those with low expression (Supplementary Fig. 1J). The above results suggest that patients with high expression levels of PTGDS, CPT2 and NAT2 are more likely to benefit from immunotherapy.
We next performed univariate and multivariate Cox regression analysis. TNM stage and the risk score showed significant prognostic value for OS in univariate analysis (Fig. 2A). Significant factors were then entered into a multivariate Cox regression model. The 6-gene signature risk score and patient age were found to be independent predictors of OS (Fig. 2B).
Validation of the prognostic value of the MRDEG signature. (A) Univariate analysis for the OS of patients from the TCGA database. (B) Multivariate analysis for the OS of patients from the TCGA database. (C,D) Distribution of risk scores and survival times for GEO patients stratified according to their risk score. (E) K-M survival curves for patients in the GEO dataset stratified according to their risk score. GEO, Gene Expression Omnibus; K-M survival curves, Kaplan-Meier survival curves.
To validate the prognostic value of the signature, patients in the GSE17536 and GSE17537 series were separated into high- and low-risk groups according to the 6-gene signature risk score (Fig. 2C,D). Patients in the high-risk group had worse OS than those in the low-risk group (Fig. 2E), thus confirming the prognostic value of the 6-gene signature based MRDEGs.
We next investigated the metabolic pathways involved in determining the
prognosis of COAD. KEGG enrichment analysis was performed on the 129 MRDEGs to
identify reprogrammed metabolic pathways in COAD. Purine metabolism was found to
be the most significantly enriched pathway (Fig. 3A). Other significantly altered
pathways were drug metabolism, retinol metabolism, tyrosine metabolism, and
pentose and glucuronate interconversion. Pearson correlation analysis was used to
identify genes that were co-expressed with the 6-gene signature, using p
Purine metabolism is the most significantly altered pathway related to the prognosis of COAD, and plays an important role in the development of COAD. (A) The top 12 enriched pathways identified from KEGG analysis of the MRDEGs. The bar color changes gradually from blue to red in ascending order according to the adjusted p-values. (B) The top 15 enriched pathways identified from KEGG analysis of the MRPGs. The bar color changes gradually from blue to red in ascending order according to the adjusted p-values. The size of the node represents the number of counts. (C) Heatmap for the expression of purine metabolism genes. (D–G) Distribution of GSVA enrichment scores for purine metabolism stratified according to TNM stage. (H–K) Associations between purine metabolism and the P53, PI3K/AKT/mTOR, cell-cycle, and MYC signaling pathways, as assessed by GSVA enrichment scores. These pathways were activated together with purine metabolism. KEGG, The Kyoto Encyclopedia of Genes and Genomes; GSVA, Gene Set Variation Analysis; TNM, tumor-node-metastasis.
GSVA was used to confirm that purine metabolism played an important role in COAD
and to further examine the biological role of purine metabolism in COAD. The GSVA
enrichment score was calculated for each patient, who were then divided into two
groups according to the median score. The heatmap for the two groups showed a
significant difference in the expression of genes for purine metabolism (Fig. 3C). The Wilcox test was performed to compare the GSVA enrichment score between
patients with different clinical characteristics (Fig. 3D–G). A lower purine
metabolism enrichment score was associated with higher tumor-node-metastasis
(TNM) stage (p
In view of the importance of purine metabolism in COAD, we next performed
consensus clustering. The k = 2 was identified by optimal clustering stability
from k = 2 to 10 based on the similarity displayed by the expression levels of
the MRDEGs and the proportion of ambiguous clustering measure. Patients were
divided into cluster-1 (n = 177) and cluster-2 (n = 139) (Fig. 4A–C). The OS of
cluster-1 patients was significantly better (p
The expression of purine metabolism genes is closely related to the survival of COAD patients. (A) Consensus clustering cumulative distribution function for k = 2 to k = 10. (B) Relative change in the area under the CDF curve for k = 2 to k = 10. (C) Consensus clustering matrix for k = 2. (D) K-M survival curves for patients in the two clusters.
GSEA analysis was performed to compare the biological features between the two subtypes. Samples in cluster-2 were more enriched in immune-related pathways, including the immune network for IgA production, natural killer cell-mediated cytotoxicity, antigen processing and presentation, graft versus host disease, and cytokine-receptor interaction (Fig. 5A). These results suggest that purine metabolism may have some interaction with tumor immunity.
Consensus clustering of purine metabolism genes with
anti-tumor immunity. (A) GSEA showed that samples in cluster-2 were more likely
to be enriched in immune-related pathways. (B–K) Expression levels for
PD1, PD-L1, PD-L2, LGALS9, HAVCR2,
CD86, CD80, CTLA4, TNFRSF9 and
TNFSF9 in cluster-1 and cluster-2 subtypes from the TCGA cohort. (L)
Infiltrating levels of 22 immune cell types in cluster-1 and cluster-2 subtypes
from the TCGA cohort. * p
The expression of immune checkpoints are closely related to the efficacy of immunotherapy. For example, the overexpression of PD-L1 is widely used as a predictive biomarker for the response to immunotherapy with checkpoint inhibitors. The expression of immune checkpoints was therefore investigated in the two subtypes. Patients in cluster-2 expressed higher levels of PD1, PD-L1, PD-L2, LGALS9, HAVCR2, CD86, CD80, CTLA4, TNFRSF9 and TNFSF9 than patients in cluster-1 (Fig. 5B–K), indicating they may respond better to immunotherapy.
The distribution of 22 immune cell types in the two subgroups was compared using the CIBERSORT algorithm. Cluster-1 showed higher levels of infiltration with monocytes and CD4 memory-resting T cells, whereas cluster-2 showed more CD8 T-cells, M1 macrophages, and follicular helper T cells (Fig. 5L).
Purine metabolism status had closely related to the expression of immune
checkpoints [16, 17]. Therefore, targeting of the purine metabolism pathway may
be a potential strategy for improving the efficacy of immune blockade therapy.
Pearson correlation analysis was used to compare the expression of genes in the
purine metabolism pathway with immune checkpoints. With a cut-off point of
Identification of potential immuno-metabolic checkpoints in COAD. (A–D) Expression levels for GUCY1A1, GUCY1B1, PDE1A and PDE5A in tumor and normal tissues from the GEPIA database. (E) IHC staining of GUCY1A1, GUCY1B1, PDE1A and PDE5A was significantly lower in COAD tissues compared to normal tissue. Scale bars represent 100 µm. (F,G) Correlation between the expression of immune checkpoints and GUCY1A1 in the TCGA database. (H,I) Correlation between the expression of immune checkpoints and GUCY1B1 in the TCGA database. (J) Correlation between the expression of PD-L2 and PDE1A in the TCGA database. (K) Correlation between the expression of CD80 and PDE5A in the TCGA database.
The heatmap shown in Supplementary Fig. 2D provides an overview of the correlation coefficients between the signature genes and the immune checkpoints. We found no significant difference in the expression of PDE6C, NT5C1B, ENTPD1 and AMPD3 between COAD and normal tissues (Supplementary Fig. 3A–G). However, the expression levels of PDE6C, NT5C1B, ENTPD1 and AMPD3 were also positively correlated to the expression of immune checkpoints (Supplementary Fig. 3H–M).
Ever since the “Warburg effect” was first described, much attention has been paid to metabolic reprogramming. This concept has been pivotal in our understanding of tumor metabolism. In the present study, KEGG analysis of MRDEGs and MRPGs revealed the involvement of purine metabolism in COAD. Activation of the purine metabolism pathway plays a critical role in the development of COAD, TME function, and the efficacy of immunotherapy. Targeting of GUCY1A1, GUCY1B1, PDE1A or PDE5A in the purine metabolism pathway may improve the effectiveness of immune checkpoint blockade therapy.
Screening for MRDEGs identified EPHX2, GPX3, PTGDS, NAT2, ACOX1, and
CPT2 to have the strongest prognostic values. The expression and
prognostic value of these genes in COAD were validated in independent datasets.
In addition, we developed a risk score formula based on this 6-gene signature and
validated its prognostic value. MRPGs were the genes that were highly correlated
with the 6-gene signature (
Different stages of COAD tumors were shown here to have different purine
metabolism enrichment scores. Moreover, GSVA enrichment scores for the purine
metabolism pathway were strongly associated with pathways that promote COAD
tumorigenesis (
Two tumor clusters were obtained by consensus clustering based on the expression
of purine metabolism genes. The main clinical differences between these clusters
were OS and anti-tumor immunity. We conclude that purine metabolism can reprogram
the TME immune status, thereby impacting anti-tumor immunity and tumor prognosis.
The purine nucleoside adenosine (ADO) is a product of purine metabolism and has
an immunosuppressive effect [25]. ADO is able to suppress differentiation of
monocytes into macrophages via the activation of A2 receptor signaling [25, 26, 27].
This could explain the different level of infiltration of monocytes and of M1
macrophages between the two clusters. Inosine is known to serve as an alternative
carbon source for CD8
Immune checkpoint blockade therapy has revolutionized cancer treatment due to
its durable responses and fewer side-effects compared with conventional cancer
treatments [29, 30, 31]. However, a considerable proportion of patients remain
unresponsive to these treatments, while about one-third of patients relapse after
the initial response and develop adaptive resistance [32]. In the present study,
higher expression of immune checkpoints was observed in cluster-2 COAD patients,
suggesting these patients may have better immunotherapy response. Therefore, the
purine metabolism status of tumor may predict their response to immunotherapy.
Current and future research strategies may involve targeting of the purine
metabolism pathway, and the development of combination strategies to increase the
efficacy of immunotherapy. The adenosinergic pathway forms a part of the purine
metabolism pathway and represents an attractive target for cancer therapy. The
ecto-nucleotidases CD39 and CD73, also known as ENTPD1 and NT5E, respectively,
are critical mediators of adenosine accumulation in the TME [32, 33]. Extensive
pre-clinical experiments with colon cancer cells have shown promising results by
targeting the adenosinergic pathway, including CD39, CD73 and receptors for
adenosine, in combination with chemotherapy or immunotherapy [34, 35, 36, 37]. In the
current study we identified 8 genes as potential targets for increasing the
efficacy of immunotherapy. The expression of these genes was highly correlated
with the expression of immune checkpoints. GUCY1A1 and GUCY1B1 are subunits of
soluble guanylyl cyclase (sGC) and produce the second messenger cyclic GMP
(cGMP). The role of sGC in other cancers has been reported [38, 39], but there
are currently no reports on its effectiveness for improving immunotherapy. We
found that GUCY1A1 and GUCY1B1 were downregulated in COAD, and their expression
was highly correlated with PD-L2, CD80 and CD86. PDE1A and PDE5A are members of
the phosphodiesterase (PDE) family, which catalyze the hydrolysis of 3
There are several limitations to our study. First, the relationship between reprogrammed purine metabolism and COAD prognosis was detected at the transcriptome level instead of protein level. Second, it is not known whether other metabolites in the purine metabolism pathway in addition to adenosine contribute to human tumor progression. Finally, only correlations between the expression of immune checkpoints and potential immuno-metabolic genes were identified, and further inference regarding causality cannot be drawn. Additional experiments are therefore required to justify this hypothesis.
This study highlights the importance of purine metabolism in the development of COAD, and identified several important metabolism-related genes involved in anti-tumor immunity. In particular, we identified four potential targets in purine metabolism that may increase the efficacy of immunotherapy. Our findings provide new insights into purine metabolism in cancer and serve as a vital reference for the discovery of new immune-metabolic checkpoints in COAD patient.
The data that support the findings of this study are openly available in TCGA, GEO and ICGC. Data sources and handling of these data are described in the Materials and Methods section. Further details are available from the corresponding author upon request.
TL, TZ contributed to conception and design of the study. TL, HL, YZ drafted the manuscript. TL, HL, YZ organized the database and performed the statistical analysis. TL, HL, QZ performed experiments and revised the manuscript. All authors commented on the manuscript. All authors read and approved the final manuscript. All authors have participated sufficiently in the work and agreed to be accountable for all aspects of the work.
The data from TCGA and GEO are both publicly available, the patients involved in the database have obtained ethical approval. Thus, the present study was exempted from the approval of local ethics committees. The current research follows the TCGA and GEO data access policies and publication guidelines. This study was approved by the Institutional Ethics Review Board of the Third People’s Hospital of Chengdu (2021-S-90) and conducted following the Chinese ethical guidelines for human genome/gene research.
We thank many researchers that deposited their valuable public data sets in TCGA, GEO and ICGC.
This study was supported by Natural Science Foundation of Sichuan Province (2023NSFSC1886), Natural Science Foundation of Sichuan Province (2023NSFSC0739).
The authors declare no conflict of interest.
Publisher’s Note: IMR Press stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.