Academic Editor: Josef Jampílek
Background: The aim of this study was to mine cartilage damage and regeneration-related biomarkers and identify the gene regulatory networks of cartilage damage. Methods: A gene expression data set (GSE129147) containing damaged and control samples collected from the knee of the same patients was employed. R package limma was used to identify differentially expressed genes (DEGs), and clusterProfiler was performed for the GO and KEGG functional enrichment analysis. Cytoscape plug-ins of CytoHubba and MCODE were applied to investigate protein-protein interaction (PPI) network, modules, and hub genes. Results: We identified 422 DEGs that were involved in skeletal system development, bone development, ossification, mesenchyme development, mesenchymal cell differentiation, connective tissue development, osteoblast differentiation, and extracellular matrix. We dug out 30 hub genes, identified three PPI modules, and constructed a miRNA regulatory network for DEGs. The miRNAs of the DEGs were predicted by miRNet, and the miRNA-mRNA network displayed some important miRNAs such as miR-335-5p, miR-92a-3p, and miR-98-5p. Conclusions: Collectively, these results have the potential to clarify the mechanism of cartilage damage and to assist us in discovering the damage and repair-related biomarkers.
In the joint, the surface of the connecting bone is covered with a layer of articular cartilage. The articular cartilage can absorb and buffer forces between the joints to the maximum extent. Cartilage damage, one of the major reasons for disability in the elderly [1], can occur in knees, hips, ankles, and elbows. Slight cartilage injuries may get better on their own in a few weeks, while severe damage may eventually require surgery. Because of the lack of self-regeneration for the damaged cartilage, it is essential to understand the molecular mechanisms in the progression of cartilage damage [2, 3]. Some progress has been made in understanding the mechanisms of cartilage matrix degradation, and has promoted the progressive remodeling of the affected joints [4, 5]. It is reported that the complex network of signaling molecules can fine-tune the cartilage differentiation [6, 7], so mining the molecular changes of damaged cartilage from a systematic level may shed light on the discovery of specific therapeutic targets.
Microarray technology has been extensively used in identifying disease-related biomarkers [8, 9]. Besides mining the cartilage damage and regeneration-related mRNAs, emerging evidence indicates that miRNAs may also play an indispensable role in the network of regulating cartilage development [10, 11]. MiRNAs are found to comprehensively modulate cartilage development by establishing an interaction network with target genes, transcription factors, and cytokines [12]. In this study, we performed a series of bioinformatics analyses to a public microarray data containing the cartilage-damaged samples and their corresponding control samples for the sake of revealing the mechanism of cartilage damage and mining cartilage damage and repair-related mRNA, miRNA biomarkers and their regulatory network.
Microarray data GSE129147 was downloaded from the Gene Expression Omnibus database (www.ncbi.nlm.nih.gov/geo). There are 19 samples containing 10 male patients with knee focal chondral defect and 9 undamaged regions of cartilage from the same patients. The array contains 49,395 probe sets in which 46,879 are mapped to at least one gene. We combined the probe sets mapping to one identical gene by maintaining the probe set that is most often associated with the highest expression level [13]. In total, 18,837 genes were achieved. The data set was quantile normalized by the function of normalizeQuantiles in R package limma (Fig. 1A) [14].
The boxplot and volcano plot for genes. (A) The boxplot for samples before normalization and after normalization. (B) The volcano plot for DEGs.
DEGs were identified by limma package in R [14]. A contrast matrix was
constructed to distinguish samples from two groups. Then the linear modeling
approach was implemented using function of lmFit and the empirical Bayes
statistics was implemented by eBayes to identify DEGs. An adjusted p
value
GO term and KEGG pathway functional enrichment analyses for DEGs were
implemented by the R package clusterProfiler [15]. The Benjamini-Hochberg method
adjusted p value
The PPI network was downloaded from the Search Tool for the Retrieval of Interacting Genes (STRING) database (https://string-db.org) [16]. The DEGs were inputted into the database under the default parameters, and the sub-PPI network for DEGs was obtained. Cytoscape [17] was used to visualize and analyze the DEGs-related PPI network. Hub genes of the network were identified by Cytoscape plug-in CytoHubba [18], and another plug-in named MCODE was used to identify the PPI modules [19].
Upstream miRNAs of DEGs were predicted using the miRNet database, which is an easy-to-use tool for miRNA-associated studies [20]. The DEGs were submitted to the database, and “Organism-H.sapiens” and “Tissue-Bone” were set as selection criteria.
A total of 422 DEGs were identified, compared with the undamaged samples (Fig. 1B).
Functional enrichment of GO terms showed that these DEGs were involved in
the extracellular matrix (ECM) (adjusted p value = 1.16
The dot plot of the top 20 functional enrichment results. (A) Biological process. (B) Molecular function. (C) Cellular component. (D) KEGG pathway.
The PPI network was built for the DEGs using the STRING database with 353 nodes and 1565 edges (see Fig. 3A). The degrees of the network were power-law distributed, which indicated that the network was scale free (see Fig. 3B and C). CytoHubba was applied to identify the hub genes in the network by gene degrees. We selected the top 30 genes with high degrees, which were MMP2, TGAM, MYC, CCND1, COL1A1, CXCR4, ESR1, EZH2, CDK1, TYROBP, C3AR1, LOX, PTGS2, ITGB2, KIF11, CCNB1, C1QB, KIF20A, CYBB, CDKN3, TYMS, PDGFRB, TOP2A, POSTN, MMP14, COL5A1, FCGR3A, FCGR2B, WNT5A and CDKN2A (see Table 1). The PPI modules were identified by Cytoscape plug-in MCODE. We selected three densely connected modules (see Fig. 4). Genes in module 1 were involved in the cell cycle (see Fig. 4A); genes module 2 were involved in the immune response related functions (see Fig. 4B); and genes in module 3 were related to extracellular matrix-related functions (see Fig. 4C), such as extracellular matrix organization, ECM-receptor interaction, focal adhesion, and collagen catabolic process.
The PPI network and its characteristics. (A) The PPI network from STRING database. (B) The node degree distribution for the PPI network. (C) The scale-free distribution of the node degree in the PPI network.
Symbol | Description | Degree | log2(FC) | p value |
MMP2 | Matrix metallopeptidase 2 | 49 | 1.96 | 3.59 |
ITGAM | Integrin, alpha M | 48 | 1.76 | 8.08 |
MYC | BHLH Transcription Factor | 47 | 1.23 | 3.23 |
CCND1 | Cyclin D1 | 46 | 1.42 | 1.45 |
COL1A1 | Collagen Type I Alpha 1 Chain | 43 | 3.78 | 1.28 |
CXCR4 | C-X-C Motif Chemokine Receptor 4 | 35 | 1.10 | 8.80 |
ESR1 | Estrogen Receptor 1 | 35 | –1.11 | 2.57 |
EZH2 | Enhancer Of Zeste 2 Polycomb Repressive Complex 2 Subunit | 34 | 1.10 | 1.34 |
CDK1 | Cyclin Dependent Kinase 1 | 34 | 1.46 | 1.13 |
TYROBP | Transmembrane Immune Signaling Adaptor TYROBP | 33 | 1.30 | 5.07 |
C3AR1 | Complement C3a Receptor 1 | 33 | 1.92 | 1.82 |
LOX | Lysyl Oxidase | 33 | 1.15 | 1.09 |
PTGS2 | Prostaglandin-Endoperoxide Synthase 2 | 32 | 1.90 | 7.77 |
ITGB2 | Integrin Subunit Beta 2 | 32 | 1.67 | 2.43 |
KIF11 | Kinesin Family Member 11 | 32 | 1.12 | 1.82 |
CCNB1 | Cyclin B1 | 32 | 1.03 | 2.08 |
C1QB | Complement C1q B Chain | 30 | 1.74 | 4.51 |
KIF20A | Kinesin Family Member 20A | 30 | 1.21 | 3.07 |
CYBB | Cytochrome B-245 Beta Chain | 29 | 2.46 | 2.57 |
CDKN3 | Cyclin Dependent Kinase Inhibitor 3 | 29 | 1.57 | 1.94 |
TYMS | Thymidylate Synthetase | 29 | 1.80 | 3.22 |
PDGFRB | Platelet Derived Growth Factor Receptor Beta | 28 | 1.16 | 2.40 |
TOP2A | DNA Topoisomerase II Alpha | 28 | 1.66 | 3.56 |
POSTN | Periostin | 27 | 4.02 | 9.05 |
MMP14 | Matrix Metallopeptidase 14 | 27 | 1.74 | 1.64 |
COL5A1 | Collagen Type V Alpha 1 Chain | 27 | 1.09 | 2.16 |
FCGR3A | Fc Fragment Of IgG Receptor IIIa | 26 | 2.57 | 4.23 |
FCGR2B | Fc Fragment Of IgG Receptor IIb | 26 | 1.48 | 4.01 |
WNT5A | Wnt Family Member 5A | 26 | 1.60 | 4.47 |
CDKN2A | Cyclin Dependent Kinase Inhibitor 2A | 26 | 1.05 | 4.43 |
Modules identified by MCODE and module functions by clusterProfiler. (A) Module 1 is involved in the cell cycle and included eight hub genes. (B) Module 2 is involved in the immune response and included five hub genes. (C) Module 3 is involved in extracellular matrix-related functions and included four hub genes. The colors represent the log2 fold change of genes. The nodes of triangle represent the hub genes identified by CytoHubba in PPI network.
Using the miRNet database, we predicted 57 miRNAs, targeting the 442 DEGs. Within the 57 miRNAs, there were 48 miRNAs targeting the 30 hub DEGs. The regulatory network of DEGs and their upstream miRNAs was built using Cytoscape. Fig. 5A showed that mir-335-5p (degree: 82), mir-124-3p (degree: 68), mir-16-5p (degree: 38), mir-192-5p (degree: 38), let-7b-5p (degree: 30), mir-92a-3p (degree: 30), mir-21-5p (degree: 23), mir-98-5p (degree: 22), and mir-17-5p (degree: 20) targeted no less than 20 genes respectively. Additionally, gene MYC (degree: 44) was regulated by most of the miRNAs in the whole miRNA-DEG regulatory network. Fig. 5B showed the regulatory network for hub DEGs and their upstream miRNAs. Additionally, genes of MYC (degree: 44), CCND1 (degree: 17), and EZH2 (degree: 11) were targeted by at least 10 different miRNAs. MiRNAs of mir-335-5p (degree: 7), mir-34a-5p (degree: 7), and mir-92a-3p (degree: 7) were regulated by at least seven different genes (Fig. 5).
The regulatory network of cartilage damage-related genes and miRNAs. (A) The DEGs-relate miRNA-target regulatory network (whole miRNA-DEG network). The size of the nodes represents their degree in the network. (B) The top 30 DEGs-related miRNA-target regulatory network (miRNA-hub DEG network). The circles represent the genes and the hexagons represent the miRNAs. Up-regulated genes are filled in red and down-regulated genes are filled in green. The size of the nodes represents their degree in the network.
We compared the gene expression profiles between damaged and non-damaged knee cartilage, and revealed the differences between them from a system biology level. We screened out 422 DEGs that were involved in extracellular matrix, ossification, osteoblast differentiation, and mesenchyme development, suggesting that these DEGs played important roles in the occurrence of cartilage damage. The PPI network was constructed using the STRING database, and the topological analysis showed that the node degree of the network was subject to the power-law distribution, which is a typical characteristic for a biological network [21]. Thirty DEGs with high degrees identified in the PPI network may play a pivotal role and modulate the functions of this network. Three modules were identified in the PPI network, and the 30 hub genes were highlighted in the modules.
Genes in module1 were involved in cell cycles, and there were eight hub genes including CCNB1, CDK1, CDKN3, EZH2, KIF11, KIF20A, TOP2A, and TYMS. Among them, CDK1 (cyclin-dependent kinase 1) plays a key role in controlling the cell-cycle process for the eukaryotes. It can interact with multiple interphase cyclins to promote G2-M transition, and it regulates the G1 progress and G1-S transition [22, 23]. KIF11 (kinesin family member 11) is an essential molecular motor protein for mitosis. It can support cell proliferation by mediating centrosome separation and formation of the bipolar mitotic spindle [24, 25, 26].
Genes in module2 were associated with immune response related categories. The immune system is involved in the process of tissue injury, which indicates that tissue, organ, or appendage regeneration might be influenced by cartilage damage [27]. There were five hub genes including C1QB, C3AR1, CXCR4, FCGR3A and TYROBP in this module. Among the five hub genes, CXCR4 is an important molecular in cartilage degradation and the CXCL12/CXCR4 axis plays a pivotal role in injury and cartilage repair by acting as a chemo attractant of cells involved in inflammation and stem cell migration [28, 29, 30].
Module 3 enriched genes involved in ECM-related functions. Four hub genes
including COL1A1, COL5A1, POSTN and PTGS2 were highlighted in
this module. COL1A1 and COL5A1 are collagen genes. The
expression of COL1A1 encoding a pro-alpha1 chain of type I collagen is
abundant in bone. The mutations on COL1A1 are the major cause of
osteogenesis imperfecta [31]. COL5A1 is the minor component of
connective tissue and encodes the alpha chain of type V collagen. An animal study
showed that the dysfunction of COL5A1 can generate an abnormal joint
phenotype, such as joint laxity and early-onset osteoarthritis [32]. The
extracellular matrix loss can lead to destruction of cartilage through
uncontrolled production of matrix-degrading enzymes [33]. POSTN encodes
a secreted extracellular matrix protein, playing a part in tissue development and
regeneration. It can solidify connective tissues by crosslinking to other ECM
proteins [31]. POSTN-null mice showed defective collagen crosslinks and
decreases resistance to mechanical stress [34]. POSTN is re-expressed in
fibrous tissues formed after injury and recruits mesenchymal cells by interacting
with integrin, which is followed by tissue repair [35]. Therefore, POSTN
has a crucial role in tissue repair. POSTN mRNA level was significantly
higher in the OA cartilage than that in the controls [36]. In our analysis,
POSTN is an up-regulated DEG (logFC = 4.02; p = 9.05
Furthermore, upstream regulatory miRNAs were predicted for the DEGs. Mir-335-5p
can regulate bone development to promote osteogenic differentiation [38]. In our
study, the target mRNAs for miR-335 were mainly up-regulated, which suggests that
the expression level of miR-335 may be down-regulated. It was reported that
tissue damage or proinflammatory signals may cause downregulation of miR-335 and
then activate the proliferative, migratory, and differentiation capacities of
MSCs [39]. MiR-92a-3p plays a key role in chondrogenesis and cartilage
degradation, and Mao et al. [40] revealed that miR-92a-3p enhances
cartilage development and prevents degradation by targeting WNT5A. In
this study, WNT5A was significantly up-regulated with log2 fold change
1.60 and p value 4.47
Although several important clusters and genes were identified in the cartilage damage patients, there were still some limitations for the current work. It is not available for additional microarray/RNA-seq data to validate the results in GSE129147. The mechanisms underlying the findings have not been thoroughly clarified and wet experiments such as qPCR and western blot in the mouse model or in clinical patients need to be carried out.
In this work, we analyzed a microarray data for cartilage-damaged patients, compared it with the controls, and identified three important modules, key genes and miRNAs related to the cartilage damage. Most of them were validated previously and related to the cartilage development or similar functions. All of these results need to be validated by more experiments.
BL conceived and designed the study, collected and analyzed the data, wrote and revised the manuscript. XX revised the manuscript.
Not applicable.
The author thanks the researchers from Ankara Yildirim Beyazit University submitting the dataset to GEO.
This work was supported by the Innovative Special Project of Agricultural Sci-Tech (Grant No. CAASASTIP-2014-LVRI-09).
The authors declare no conflict of interest.
The GSE129147 was downloaded from GEO website of https://www.ncbi.nlm.nih.gov/gds/?term=GSE129147.