4.1. Dysregulated MicroRNAs in Gastric Cancer
Differential expression analysis identified 2,565 miRNAs with significant alterations in gastric cancer tissues compared to normal controls. Among these, 43 miRNAs were upregulated, while 2,522 were downregulated (adjusted P < 0.05, |logFC| > 1.5).
Table 1 summarizes the top differentially expressed miRNAs identified through RNA sequencing analysis of gastric cancer tissues compared to normal controls. Columns include miRNA identifiers, logFC, average expression levels (AveExpr), statistical metrics (t-statistic, P-value, adjusted P-value, B-statistic), and direction of dysregulation (up/down). The findings highlight miRNAs with significant alterations in expression, providing insights into their potential roles in gastric carcinogenesis and their utility as diagnostic or therapeutic targets.
| miRNAs | miRNA_ID_LIST | LogFC | AveExpr | t | P-Value | Adjusted P-Value | B | Logp | Down/Up |
|---|
| MIMAT0019015 | hsa-miR-4481 | 2.23191 | 3.352408 | 26.59811 | 2.39E-139 | 9.04E-139 | 307.1369 | 138.6212 | Up |
| MIMAT0004592 | hsa-miR-125b-1-3p | 2.180458 | 2.129759 | 24.64279 | 1.17E-121 | 3.56E-121 | 266.4677 | 120.9317 | Up |
| MIMAT0022259 | hsa-miR-5100 | -3.90518 | 12.22446 | -113.822 | 0 | 0 | 2424.681 | Inf | Down |
| MIMAT0019776 | hsa-miR-1343-3p | -3.06295 | 8.151446 | -91.5028 | 0 | 0 | 1939.256 | Inf | Down |
| MIMAT0031000 | hsa-miR-8073 | -2.41184 | 7.41858 | -70.3271 | 0 | 0 | 1422.276 | Inf | Down |
| MIMAT0005880 | hsa-miR-1290 | -5.81738 | 7.250913 | -64.4289 | 0 | 0 | 1269.599 | Inf | Down |
| MIMAT0019957 | hsa-miR-4787-3p | -2.08496 | 6.845924 | -64.0984 | 0 | 0 | 1260.957 | Inf | Down |
| MIMAT0016916 | hsa-miR-4286 | -2.03617 | 6.948302 | -59.5751 | 0 | 0 | 1141.932 | Inf | Down |
| MIMAT0018976 | hsa-miR-4454 | -2.29451 | 10.676 | -57.9223 | 0 | 0 | 1098.156 | Inf | Down |
| MIMAT0001631 | hsa-miR-451a | -5.18689 | 6.720472 | -57.5567 | 0 | 0 | 1088.457 | Inf | Down |
| MIMAT0025847 | hsa-miR-6511b-5p | -2.30753 | 5.895848 | -57.4031 | 0 | 0 | 1084.382 | Inf | Down |
Abbreviations: miRNA, microRNA; logFC, log2 fold change; AverExpr, average expression levels.
4.1.1. Upregulated MicroRNA
4.1.1.1. Hsa-miR-4481
Exhibited a logFC of 2.23 (adjusted P = 9.04E-139), making it one of the most significantly upregulated miRNAs. Prior studies have implicated hsa-miR-4481 in promoting tumor progression by targeting tumor suppressor genes such as PTEN and CDKN1A. Its overexpression may enhance cell proliferation and survival through activation of the PI3K/Akt signaling pathway.
4.1.1.2. Hsa-miR-125b-1-3p
With a logFC of 2.18 (adjusted P = 3.56E-121), this miRNA has been previously associated with poor prognosis in multiple cancers. It is known to regulate key oncogenic pathways, including Wnt/β-catenin signaling, which plays a critical role in gastric cancer metastasis.
4.1.2. Downregulated MicroRNAs
4.1.2.1. Hsa-miR-5100
Hsa-miR-5100 Showed the most dramatic downregulation, with a logFC of -3.91 (adjusted P = 0). Its suppression in gastric cancer suggests a tumor-suppressive role, potentially mediated by inhibiting the epithelial-to-mesenchymal transition (EMT) process. The EMT is a hallmark of cancer metastasis, and restoring hsa-miR-5100 expression could serve as a therapeutic strategy.
4.1.2.2. Hsa-miR-23a-3p
Exhibiting a logFC of -3.53 (adjusted P = 0), this miRNA has been linked to the inhibition of metastasis in colorectal cancer. Our findings suggest that its downregulation in gastric cancer may contribute to increased invasiveness and poor patient outcomes.
4.2. Visualization of Results
In
Figure 1 the volcano plot visualizes the differential expression of miRNAs between gastric cancer and non-cancer samples using data from the GEO dataset GSE164174. The x-axis represents the logFC of miRNA expression levels, while the y-axis shows the negative logarithm (base 10) of the P-values indicating statistical significance. Vertical dashed lines at logFC = -2 and logFC = 2 demarcate thresholds for identifying significantly differentially expressed miRNAs, while the horizontal dashed line at -log
10(P) = 1.301 (P = 0.05) indicates the cutoff for statistical significance. The markers highlighted within the inset indicate significant results, demonstrating the overall distribution of miRNA expression changes, with a substantial number of downregulated miRNAs observed in gastric cancer compared to non-cancer samples. Such results underscore the potential role of miRNAs as biomarkers in the context of gastric cancer.
Volcano plot of microRNA (miRNA) expression differences in gastric cancer vs. non-cancer samples (GSE164174); Gray circles: The miRNAs with no significant difference in expression (P > 0.05). Blue triangles: The miRNAs that show significant downregulation [log2 fold change (logFC) < -2 and P < 0.05]. Red triangles: The miRNAs with significant upregulation (logFC > 2 and P < 0.05).
As shown in
Figure 2, uniform manifold approximation and projection (UMAP) clustering revealed distinct expression profiles between tumor and normal samples. We generated volcano plots in
Figure 1 and UMAP clustering visualizations in
Figure 2. The volcano plot clearly distinguishes between upregulated and downregulated miRNAs based on their fold changes and statistical significance.
Uniform manifold approximation and projection (UMAP) visualization of microRNA (miRNA) expression profiles in gastric cancer (red) versus non-cancer (blue) samples
The UMAP clustering revealed distinct expression profiles between tumor and normal samples, confirming the robustness of our differential expression analysis. This UMAP plot visualizes high-dimensional miRNA expression data (n = 80 samples: 50 gastric cancer, 30 non-cancer) reduced to two dimensions, highlighting distinct clustering patterns between gastric cancer (red) and non-cancer (blue) groups. The analysis was performed using the UMAP algorithm with n_neighbors = 15, a parameter balancing local and global structure preservation. Axes represent the two primary UMAP components (UMAP1 and UMAP2), with numeric scales indicating relative distances in the reduced space. The clear separation between clusters underscores significant differences in miRNA expression profiles between cancerous and normal tissues, supporting the utility of miRNAs as biomarkers for gastric cancer classification. Overlapping regions may reflect biological heterogeneity or transitional molecular states. The distinct clustering demonstrates differential miRNA signatures between groups, reinforcing their potential as diagnostic biomarkers (Parameters: n_neighbors = 15).
This ROC plot evaluates the diagnostic performance of the top 10 miRNAs identified in the study. Each curve represents the trade-off between the true positive rate (TPR, sensitivity) and false positive rate (FPR, 1-specificity) for a miRNA-based classifier. The diagonal dashed line indicates random chance (AUC = 0.5). Curves closer to the top-left corner reflect superior classification accuracy, with higher AUC values. The miRNAs are ranked by their AUC scores, with the top-performing miRNA (hsa-miR-4286, AUC = 0.96) demonstrating near-perfect discrimination between gastric cancer and non-cancer samples. This visualization underscores the potential of miRNA expression profiles as robust diagnostic biomarkers (
Figure 3).
Receiver operating characteristic (ROC) curves of the top 10 microRNA (miRNAs) ranked by diagnostic performance in distinguishing gastric cancer from non-cancer samples. The proximity of curves to the top-left corner reflects high classification accuracy, with area under the curve (AUC) values indicating biomarker efficacy. The dashed line represents random chance (AUC = 0.5).
This heatmap illustrates the expression patterns of the top 10 differentially expressed miRNAs across gastric cancer (n = 50) and non-cancer (n = 30) tissue samples. Rows represent miRNAs (e.g., hsa-miR-5100, hsa-miR-4286), while columns correspond to individual samples, grouped by disease status. Expression levels are color-coded, with red indicating upregulation and blue indicating downregulation relative to the mean. Hierarchical clustering of samples (columns) reveals distinct molecular subgroups, highlighting the consistent dysregulation of specific miRNAs in cancer tissues. Notably, hsa-miR-5100 and hsa-miR-4286 exhibit pronounced downregulation in tumors, aligning with their putative tumor-suppressive roles. This visualization underscores the utility of miRNA expression signatures for distinguishing cancerous from non-cancerous tissues (
Figure 4).
Heatmap of microRNA (miRNA) expression in gastric cancer versus non-cancer samples. Columns are grouped by disease status (cancer: Red bar; non-cancer: Blue bar), and rows represent miRNAs with significant differential expression. Color intensity reflects normalized expression levels (red: High; blue: Low). Clustering highlights distinct molecular patterns between groups.
4.3. Enriched Gene Ontology Terms Associated with MicroRNA Targets in Gastric Cancer
Functional annotation of predicted miRNA targets using GO enrichment analysis revealed several critical pathways implicated in gastric carcinogenesis. These pathways highlight the biological mechanisms underlying tumor progression and provide insights into potential therapeutic targets. Below, we describe the top enriched pathways, their biological significance, and relevance to gastric cancer.
Table 2 presents the top enriched GO terms associated with dysregulated miRNAs in gastric cancer. Columns include pathway ID, description, fold enrichment, statistical significance (P-value), adjusted P-value, false discovery rate (Q-value), and the number of genes involved (count). The pathways reveal critical biological processes, such as miRNA-mediated mRNA destabilization and translational regulation that contribute to gastric carcinogenesis.
| ID | Description | Fold Enrichment | P-Value | Adjusted P- Value | Q-Value | Count |
|---|
| GO:0035279 | miRNA-mediated gene silencing by mRNA destabilization | 39.31439 | 4.20E-63 | 6.31E-60 | 4.00E-60 | 46 |
| GO:0035278 | miRNA-mediated gene silencing by inhibition of translation | 33.66845 | 4.18E-50 | 3.15E-47 | 2.00E-47 | 39 |
| GO:0061157 | mRNA destabilization | 19.85575 | 6.38E-47 | 3.20E-44 | 2.03E-44 | 46 |
| GO:0050779 | RNA destabilization | 19.46257 | 1.75E-46 | 6.56E-44 | 4.16E-44 | 46 |
| GO:0061014 | POSITIVE regulation of mRNA catabolic process | 19.36669 | 2.24E-46 | 6.73E-44 | 4.27E-44 | 46 |
| GO:1903313 | POSITIVE regulation of mRNA metabolic process | 16.381 | 8.78E-43 | 2.20E-40 | 1.40E-40 | 46 |
| GO:0043488 | REGULATION of mRNA stability | 14.40088 | 4.39E-40 | 9.44E-38 | 5.98E-38 | 46 |
| GO:0043487 | REGULATION of RNA stability | 13.79452 | 3.40E-39 | 6.40E-37 | 4.06E-37 | 46 |
| GO:0034249 | NEGATIVE regulation of amide metabolic process | 15.7053 | 3.87E-39 | 6.47E-37 | 4.10E-37 | 43 |
| GO:0061013 | REGULATION of mRNA catabolic process | 13.55669 | 7.76E-39 | 1.17E-36 | 7.40E-37 | 46 |
| GO:0017148 | NEGATIVE regulation of translation | 16.59535 | 2.06E-37 | 2.82E-35 | 1.79E-35 | 40 |
Abbreviation: miRNA, microRNA.
This bar plot illustrates the top enriched biological processes identified through GO analysis of miRNA targets in gastric cancer. Terms are ranked by statistical significance (adjusted P-value) and include critical pathways such as "miRNA-mediated gene silencing by mRNA destabilization" (adjusted P = 6.31e-60, count = 46) and "positive regulation of mRNA catabolic process" (adjusted P = 2.92e-37, count = 44). The GeneRatio (0.18 - 0.20) reflects the proportion of genes associated with each term relative to the background gene set. The most significant terms highlight the central role of post-transcriptional regulation, particularly miRNA-driven mRNA destabilization and translational repression, in gastric carcinogenesis. These findings align with the dysregulation of tumor-suppressive miRNAs identified in the study, emphasizing their mechanistic impact on gene expression networks.
Top enriched biological processes are ranked by adjusted P-value, with bar lengths proportional to significance. Counts indicate the number of genes associated with each term. Dominant pathways include miRNA-mediated mRNA destabilization and regulation of RNA stability, underscoring their relevance to disease progression, according to
Figure 5. As shown in
Figure 5, miRNA-mediated mRNA destabilization (GO:0035279, fold enrichment = 39.3, P = 4.20E-63): This pathway was the most significantly enriched, highlighting the central role of miRNAs in post-transcriptional regulation. Key targets included well-known tumor suppressors such as PTEN, CDKN1A, and TP53. Destabilization of these mRNAs likely contributes to uncontrolled cell proliferation and evasion of apoptosis in gastric cancer.
Gene Ontology (GO) enrichment analysis of microRNA (miRNA) targets in gastric cancer
Negative regulation of translation (GO:0017148, fold enrichment = 16.6, P = 2.06E-37): This pathway underscores the importance of translational control in cancer biology. Dysregulation of translation initiation factors and ribosomal proteins can lead to increased synthesis of oncogenic proteins, driving tumor progression.
Regulation of apoptotic signaling (GO:0043065, fold enrichment = 12.8, P = 1.54E-29): Several downregulated miRNAs were predicted to target anti-apoptotic genes, suggesting that their loss may promote resistance to cell death in gastric cancer cells. These findings are illustrated in
Figure 6, which provides a comprehensive overview of the top enriched pathways and their biological relevance.
Feature importance of top 10 microRNAs (miRNAs) in gastric cancer classification, ranked by Mean Decrease Gini scores from a random forest model. Longer bars indicate greater contributions to distinguishing cancer from non-cancer samples, emphasizing their diagnostic relevance
4.4. Machine Learning Performance
4.4.1. Top 10 MicroRNAs Ranked by Feature Importance in Gastric Cancer Classification
This bar plot ranks the top 10 miRNAs based on their importance scores derived from a random forest machine learning model, measured by the Mean Decrease Gini Index. The index quantifies each miRNA's contribution to distinguishing gastric cancer from non-cancer samples, with higher values indicating greater discriminatory power. hsa-miR-1290 (importance = ~80) and hsa-miR-5100 (importance = ~70) emerge as the most critical features, consistent with their pronounced dysregulation in prior analyses. These miRNAs, many of which are downregulated tumor suppressors (e.g., hsa-miR-5100) or oncogenic drivers (e.g., hsa-miR-1290), highlight key molecular players in gastric carcinogenesis. The ranking underscores their potential utility as biomarkers for diagnostic models and therapeutic targets.
To evaluate the diagnostic potential of differentially expressed miRNAs, we trained random forest classifiers on normalized expression profiles. The RFE identified a subset of high-confidence biomarkers, including hsa-miR-4286 and hsa-miR-23a-3p. The performance metrics of the models are summarized below:
- Random Forest model:
(A) AUC for hsa-miR-4286: 0.96
(B) AUC for hsa-miR-23a-3p: 0.90
(C) Cross-validation accuracy: 92%
- Feature importance: The RFE algorithm ranked hsa-miR-4286 as the most informative feature, followed by hsa-miR-23a-3p and hsa-miR-5100. These miRNAs collectively accounted for over 75% of the model's predictive power.
The box plots illustrate median expression levels (the line within each box), interquartile ranges (the height of each box), and outliers (points beyond the whiskers) for each miRNA in both groups. Overall, the plots suggest a differential expression pattern, with some miRNAs showing significantly higher or lower expression levels in gastric cancer samples versus non-cancer controls, as shown in
Figure 7. As illustrated in
Figure 7, several miRNAs show significant differential expression between gastric cancer and non-cancer samples, supporting their potential diagnostic value.
Box plots of microRNA (miRNA) expression levels of selected miRNAs in gastric cancer compared to non-cancer individuals. Each panel presents a different miRNA (hsa-miR-107, hsa-miR-17-3p, hsa-miR-191-5p, hsa-miR-3934-5p, hsa-miR-4489, hsa-miR-4740-5p, hsa-miR-650, hsa-miR-6717-5p, hsa-miR-6720-3p, and hsa-miR-92a-3p), with the x-axis representing the two groups: Gastric Cancer (red) and Non-Cancer (cyan).