1. Background
2. Objectives
3. Methods
3.1. Data Acquisition and Processing
3.2. Screening for DEGs in NASH
3.3. Enrichment Analysis of DEGs
3.4. PPI Network Establishment and Module Analysis
3.5. Selection and Validation of Core Genes
3.6. Identification of Immune Infiltration-related DEG Signatures
3.7. Statistical Analysis
4. Results
4.1. Comparison of DEGs Between NASH and Healthy Control Samples
Screening of differentially expressed genes (DEGs) between non-alcoholic steatohepatitis and normal samples in the E-MEXP-3291 and GSE89632 datasets. A-B. Venn diagrams showing commonly upregulated (A) and downregulated (B) genes from the GSE89632 and E-MEXP-3291 datasets. The DEGs were selected with FRD < 0.05 and |log2 fold change (FC)| > 0.263 as thresholds. C. The heatmap shows the differences in the expression of DEGs between NASH and normal samples from the GSE89632 and E-MEXP-3291 datasets.
4.2. Functional and Pathway Enrichment Analysis on DEGs
Gene ontology-biological process (GO-BP) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment of differentially expressed genes (DEGs) between normal and NASH samples. The top 20 GO-BP (A) and KEGG pathways (B) for both upregulated and downregulated DEGs ranked by P values. Significantly enriched terms were selected based on the criteria of P < 0.05 and gene count ≥ 2. The x-axis indicates the gene count while the y-axis indicates the terms of the GO-BP and KEGG pathways with a significant correlation. The triangles and circles in B represent upregulated and downregulated DEGs, respectively.
4.3. Analysis Using Constructed PPI Network
Construction and module mining analysis of the protein-protein interaction (PPI) network. A, Establishment of a PPI network based on intersected differentially expressed genes (DEGs). This PPI network contains 246 genes (upregulated DEGs are shown in orange, and downregulated DEGs are shown in blue) and 555 pairs of interactions. B, The Venn diagram shows the top 20 genes ranked by gene count in terms of the topological properties of degree, betweenness, and closeness, and the intersected 12 DEGs were identified as hub genes in the PPI network. C, The heatmap shows the top five KEGG pathways significantly enriched in DEGs, ranked by P value in each submodule.
4.4. Screening and Validation of DEG Signatures
Screening of differentially expressed gene (DEG) signatures using least absolute shrinkage and selection operator (LASSO) regression analysis and the validation of their expression in normal, simple steatosis (SS), and non-alcoholic steatohepatitis (NASH) samples from the E-MEXP-3291 and GSE89632 datasets. A. LASSO regression analysis was used to identify 17 DEG signatures as the optimized gene set. B. The expression differences of 17 DEG signatures between normal, SS, and NASH samples from the E-MEXP-3291 dataset. C. The expression differences of 17 DEG signatures between the normal, SS, and NASH groups in the GSE89632 dataset.
4.5. Immune Infiltration Analysis of DEG Signatures
Immune cell infiltration analysis in normal and non-alcoholic steatohepatitis (NASH) samples in the combined dataset. A. Histograms show the infiltration abundance of 22 types of immune cells in eight NASH samples and nine normal controls from the combined dataset (E-MEXP-3291 and GSE89632). B. The violin diagram shows the differences in the abundance of 22 types of immune cells between the NASH and control groups. Blue and red bars indicate normal and NASH samples, respectively.
| Immune Cell | Gene | R a | P-Value | Q-Value b |
|---|---|---|---|---|
| Activated dendritic cells | SYK | -0.697731584 | 0.001844931 | 0.01833562 |
| Activated dendritic cells | CXCL10 | -0.684242924 | 0.002449187 | 0.01943022 |
| Activated dendritic cells | VAMP3 | -0.659718088 | 0.003958514 | 0.024026602 |
| Activated dendritic cells | MRAS | -0.619252109 | 0.00802977 | 0.039814276 |
| Activated dendritic cells | AVPR1A | 0.70141031 | 0.00170334 | 0.01833562 |
| Resting dendritic cells | CXCR1 | -0.760303501 | 0.000396191 | 0.009438255 |
| Resting dendritic cells | CXCL2 | -0.666025867 | 0.003512852 | 0.023302488 |
| M2 macrophages | ACVR2B | -0.617647059 | 0.009698737 | 0.046165989 |
| M2 macrophages | CXCL2 | -0.607843137 | 0.011192078 | 0.046651756 |
| M2 macrophages | VAMP3 | 0.644607843 | 0.006395802 | 0.034595476 |
| Monocytes | FGFR2 | -0.637254902 | 0.007189792 | 0.037199359 |
| Monocytes | SOS1 | -0.607843137 | 0.011192078 | 0.046651756 |
| Neutrophils | FGFR2 | -0.760269916 | 0.000396565 | 0.009438255 |
| Neutrophils | CDH1 | -0.727161387 | 0.000941323 | 0.016002484 |
| Neutrophils | SYK | -0.703862793 | 0.001613999 | 0.01833562 |
| Neutrophils | VAMP3 | -0.669528023 | 0.003283597 | 0.023302488 |
| Neutrophils | CXCL10 | -0.665849297 | 0.003524746 | 0.023302488 |
| Neutrophils | MRAS | -0.602084724 | 0.010545631 | 0.046651756 |
| Neutrophils | SMAD1 | 0.690374133 | 0.002157132 | 0.01833562 |
| Neutrophils | AVPR1A | 0.692826617 | 0.002048599 | 0.01833562 |
| Neutrophils | CXCL2 | 0.7627224 | 0.000369997 | 0.009438255 |
| Neutrophils | CXCR1 | 0.902513965 | 7.20E-07 | 8.57E-05 |
| Resting memory CD4 T cells | AVPR1A | -0.710784314 | 0.001926552 | 0.01833562 |
| Resting memory CD4 T cells | CXCR1 | -0.669117647 | 0.004239989 | 0.024026602 |
| Resting memory CD4 T cells | CXCL10 | 0.669117647 | 0.004239989 | 0.024026602 |
| Resting memory CD4 T cells | SYK | 0.713235294 | 0.001831279 | 0.01833562 |
| Resting memory CD4 T cells | CDH1 | 0.75245098 | 0.00074371 | 0.014750255 |
| Resting memory CD4 T cells | FGFR2 | 0.801470588 | 0.000158696 | 0.009438255 |
Abbreviation: DEG, differentially expressed gene.
a R stands for Spearman correlation coefficient.
b Q-value represents P-value adjusted by the Benjamini & Hochberg method, and Q < 0.05 indicates statistical significance.





