1. Background
2. Methods
2.1. Data Resource
2.2. Data Preprocessing and Differentially Expressed Gene Analysis
2.3. Database for Annotation, Visualization, and Integrated Discovery Analysis
2.4. PPI Interaction Analysis
2.5. Traditional Chinese Medicine Chemical Compounds
2.6. Molecular Docking of Target Proteins Associated with EM
3. Results
3.1. Identification of Microarray Data
3.2. DAVID Analysis
| Identifier | Molecular Function | Count | Fold Change | Benjamini | ES | P-Value | FDR |
|---|---|---|---|---|---|---|---|
| Symport | Sodium ion transmembrane transporter | 111 | 4.5E-75 | 5.8E0 | 34.76 | 2.8E-77 | 3.9E-75 |
| Keratin filament | 39 | 6.9E-26 | 8.4E-24 | 32.07 | 6.9E-26 | 8.0E-24 | |
| Zinc finger C2H2-type/integrase DNA-binding domain | Cellular process | 289 | 2.5E0 | 3.4E-52 | 30.07 | 1.6E-55 | 3.3E-52 |
| Transmembrane transporter | Substance transfer | 111 | 4.9E0 | 3.7E-53 | 29.33 | 2.1E-56 | 3.4E-53 |
| Ribonucleoprotein | Ribosomal protein | 90 | 2.2E0 | 1.2E-12 | 23.88 | 6.7E-14 | 9.1E-13 |
Abbreviations: FDR, false discovery rate; ES, Enrichment Score.
a The table presents key information including molecular function, gene count, fold change, Benjamini value, enrichment score, P-value, and false discovery rate.
| Identifier | Function | Count | Fold Change | Benjamini | ES | P-Value | FDR |
|---|---|---|---|---|---|---|---|
| Krueppel-associated box | Transcriptional repression | 6 | 1.3E-1 | 1.0E0 | 0 | 1.0E0 | 1.0E0 |
| Intermediate filament | Cytoskeleton & Nuclear envelope | 5 | 2.7E-1 | 1.0E0 | 0 | 1.0E0 | 1.0E0 |
| Homeobox | DNA binding | 9 | 3.3E-1 | 1.0E0 | 0 | 1.0E0 | 1.0E0 |
| Leucine-rich repeat | 17 | 4.3E-1 | 1.0E0 | 0 | 1.0E0 | 1.0E0 | |
| GTPase | 5 | 2.4E-1 | 1.0E0 | 0 | 1.0E0 | 1.0E0 |
Abbreviations: FDR, false discovery rate; ES, Enrichment Score.
a The table presents key information including molecular function, gene count, fold change, Benjamini value, enrichment score, P-value, and false discovery rate.
| Identifier | Count | Fold Change | Benjamini | ES | P-Value | FDR |
|---|---|---|---|---|---|---|
| Cadherin | 49 | 3.4E0 | 4.8E-13 | 14.71 | 5.8E-15 | 4.5E-13 |
| Cyclin | 31 | 4.9E0 | 1.1E-14 | 11.9 | 2.2E-16 | 9.0E-15 |
| Catenin | 25 | 6.3E0 | 4.7E-14 | 8.28 | 1.0E-16 | 4.4E-14 |
| Growth factor receptor binding (GRB) | 21 | 6.0E0 | 1.2E-11 | 7.79 | 9.4E-14 | 1.1E-11 |
| Mitogen-activated protein kinase | 10 | 7.3E0 | 7.7E-6 | 4.6 | 1.5E-7 | 7.2E-6 |
| serine/threonine kinase | 73 | 1.1E0 | 6.7E-1 | 4.29 | 1.5E-1 | 6.1E-1 |
| EGF Receptor | 43 | 1.8E0 | 1.5E-3 | 1.77 | 1.2E-4 | 1.3E-3 |
| BCL2 | 7 | 4.2E0 | 6.0E-2 | 1.52 | 3.3E-3 | 5.7E-2 |
| EGF | 37 | 1.1E0 | 1.0E0 | 0.7 | 3.9E-1 | 9.5E-1 |
| Growth arrest | 4 | 1.5E0 | 1.0E0 | 0.5 | 5.3E-1 | 9.3E-1 |
Abbreviations: FDR, false discovery rate; ES, Enrichment Score.
a These annotations offer insights into the molecular functions associated with genes related to endometriosis.
| Disease | Count | % | Enrichment Score | Benjamani | P-Value | FDR |
|---|---|---|---|---|---|---|
| Cockayne syndrome | 6 | 0.2 | 5.6 | 3.8E-2 | 9.3E-4 | 3.8E-2 |
| Bardet-Biedl syndrome | 13 | 0.4 | 3.5 | 8.5E-3 | 7.7E-5 | 8.5E-3 |
| Obesity | 18 | 0.6 | 1.9 | 4.0E-1 | 1.1E-2 | 4.0E-1 |
| Diabetes mellitus | 18 | 0.6 | 1.6 | 1.0E0 | 6.1E-2 | 1.0E0 |
| Intellectual disability | 135 | 4.5 | 1.3 | 2.4E-2 | 4.3E-4 | 2.4E-2 |
Abbreviation: FDR, false discovery rate.
a The table provides an overview of diseases frequently associated with EMS, including the count, percentage, enrichment score, Benjamani value, P-value, and false discovery rate.
3.3. PPI Network Construction
| Identifier | Name | PDB ID | Confidence score | Possible Role |
|---|---|---|---|---|
| BAD | BCL2-associated agonist of cell death | NA | 0.962 | Promotes cell death. |
| BCL2 | B-cell CLL/lymphoma 2 | 2Xa0 (ChainA) | 0.944 | Suppresses apoptosis in a variety of cell systems |
| BCL2L11 | BCL2-like 11 | NA | 0.840 | Apoptosis facilitator |
| BECN1 | Beclin 1, autophagy related; | NA | 0.866 | Plays a central role in autophagy |
| CCNA1 | Cyclin A1 | NA | 0.997 | Involved in the control of the cell cycle at the G1/S |
| CCNA2 | Cyclin A2 | 2x1n | 0.898 | Essential for the control of the cell cycle at the G1/S |
| CCNB1 | Cyclin B1 | NA | 0.993 | Essential for the control of the cell cycle at the G2/M |
| CDC25C | Cell division cycle 25 homolog C (S. pombe) | NA | 0.713 | Functions as a dosage-dependent inducer |
| CDH17 | Cadherin 17 | NA | 0.949 | Calcium-dependent cell adhesion proteins. |
| CDK1 | Cyclin-dependent kinase 1 | NA | 0.999 | Plays a key role in the control of the eukaryotic cell cycle by modulating the centrosome cycle |
| CDK7 | Cyclin-dependent kinase 7 | 1ua2 | 0.957 | Involved in cell cycle control and in RNA polymerase II-mediated RNA transcription. |
| CTNNA1 | Catenin (cadherin-associated protein), alpha 1, 102kDa | NA | 0.934 | Associates with the cytoplasmic domain of a variety of cadherins |
| CTNNB1 | Catenin (cadherin-associated protein), beta 1, 88kDa | NA | 0.987 | Key downstream component of the canonical Wnt signaling pathway. |
| EGF | Epidermal growth factor | 1nql | 0.900 | EGF stimulates the growth of various epidermal and epithelial tissues |
| GAS6 | Growth arrest-specific 6 | 1h30 (Chain A) | 0.900 | Implicated in cell growth and survival, cell adhesion and cell migration. |
| MAP3K7 | Mitogen-activated protein kinase kinase kinase 7 | NA | 0.996 | |
| RIPK1 | Receptor (TNFRSF)-interacting serine-threonine kinase 1 | NA | 0.995 | Transduces inflammatory and cell-death signals (programmed necrosis) f |
| TAB1 | TGF-beta activated kinase 1 | NA | 0.969 | Play an important role in mammalian embryogenesis |
| TAB2 | TGF-beta activated kinase 2 | 2wwz (Chain C) | 0.995 | Promotes MAP3K7 activation in the IL1 signaling pathway. |
a Key molecular players associated with EMS, providing information on PDB ID (if available), confidence score, and possible roles in cellular processes are presented.
3.4. Molecular Docking Analysis
| PDB and Ligand | MolDock Score | Rerank Score | Interaction | HBond | Total |
|---|---|---|---|---|---|
| 1H30 | |||||
| Gingerenone B | -127.09 | -102.84 | -163.32 | -4.51 | -397.77 |
| Sesamin | -133.83 | -79.18 | -136.07 | -8.77 | -357.85 |
| Quercetin-3-o-galactopyranoside | -87.98 | -88.63 | -137.90 | -11.97 | -326.49 |
| Moracin D | -114.52 | -76.18 | -124.08 | -8.51 | -323.28 |
| Alexandrin | -92.31 | -71.10 | -131.31 | -5.60 | -300.32 |
| Mulberrofuran A | -100.29 | -77.12 | -114.25 | -1.96 | -293.62 |
| Astragalin | -81.96 | -76.46 | -114.49 | -12.40 | -285.31 |
| Quercetin | -89.30 | -70.07 | -114.53 | -7.12 | -281.01 |
| Campesterol | -93.43 | -71.60 | -115.14 | 0.00 | -280.17 |
| Icaritin | -81.89 | -74.72 | -109.06 | -3.29 | -268.96 |
| 1NQL | |||||
| Procyanidin B1 | -136.94 | -115.57 | -168.94 | -9.73 | -431.17 |
| Mulberrofuran A | -120.75 | -94.89 | -132.47 | -2.50 | -350.61 |
| Sigmoidin B | -105.95 | -92.33 | -130.41 | -4.97 | -333.66 |
| Kushenol E | -97.78 | -90.73 | -127.27 | -5.00 | -320.78 |
| Astragalin | -100.41 | -89.62 | -123.37 | -10.00 | -323.40 |
| Beta-Sitosterol | -110.92 | -88.04 | -120.61 | -2.35 | -321.91 |
| Kadsurin A | -100.92 | -86.59 | -121.54 | -0.75 | -309.80 |
| Quercetin-3-o-galactopyranoside | -78.87 | -81.84 | -134.43 | -11.80 | -306.95 |
| Aureusidin | -93.71 | -80.87 | -104.05 | -6.05 | -284.68 |
| Arctigenin | -96.82 | -80.30 | -124.64 | -2.50 | -304.27 |
a Scores such as MolDock Score, Rerank Score, Interaction, H-Bond, and Total score provide insights into the binding affinities and interaction profiles of ligands.
| PDB and Ligand | MolDock Score | Rerank Score | Interaction | HBond | Total |
|---|---|---|---|---|---|
| 1UA2 | |||||
| Astragalin | -122.10 | -112.04 | -155.65 | -10.75 | -400.54 |
| Sitogluside | -110.44 | -90.10 | -150.16 | -10.84 | -361.55 |
| Sigmoidin B | -112.50 | -98.56 | -136.34 | -5.04 | -352.44 |
| Moracin D | -117.45 | -97.35 | -127.42 | -6.65 | -348.86 |
| Galgravin | -112.25 | -88.77 | -134.57 | -2.49 | -338.07 |
| Mulberrofuran A | -111.09 | -88.39 | -129.19 | -5.08 | -333.75 |
| alpha1-Sitosterol | -111.97 | -91.30 | -127.04 | 0.00 | -330.31 |
| Kadsurin A | -110.84 | -92.65 | -125.65 | -0.47 | -329.61 |
| Rhamnazin | -101.90 | -86.78 | -133.74 | -6.21 | -328.62 |
| Gingerenone B | -102.55 | -88.69 | -130.95 | -4.98 | -327.18 |
| 2WWZ | |||||
| Kushenol E | -118.92 | -99.03 | -144.71 | -7.78 | -370.44 |
| Procyanidin B1 | -117.52 | -89.11 | -139.46 | -13.84 | -359.94 |
| Icaritin | -113.54 | -98.83 | -140.02 | -5.41 | -357.80 |
| Episyringaresinol | -114.67 | -94.31 | -137.47 | -4.69 | -351.15 |
| Quercetin-3-o-galactopyranoside | -89.49 | -91.99 | -151.81 | -11.44 | -344.73 |
| Mulberrofuran A | -111.97 | -81.77 | -122.68 | -5.86 | -322.28 |
| Scopolin | -101.84 | -77.98 | -128.43 | -13.70 | -321.96 |
| Phaseol | -110.79 | -86.98 | -120.61 | -3.51 | -321.89 |
| Sigmoidin B | -99.27 | -86.14 | -123.78 | -7.52 | -316.71 |
| Glycyrol | -102.38 | -82.22 | -117.57 | -6.69 | -308.86 |
a Scores such as MolDock Score, Rerank Score, Interaction, H-Bond, and Total score provide insights into the binding affinities and interaction profiles of ligands.
| PDB and Ligand | MolDock Score | Rerank Score | Interaction | HBond | Total |
|---|---|---|---|---|---|
| 2X1N | |||||
| Quercetin-3-o-galactopyranoside | -125.30 | -125.65 | -187.46 | -20.17 | -458.58 |
| Kushenol E | -127.11 | -113.63 | -152.74 | -9.17 | -402.64 |
| Sigmoidin B | -126.48 | -111.16 | -150.33 | -13.94 | -401.91 |
| Mulberrofuran A | -137.57 | -109.82 | -148.86 | -2.02 | -398.27 |
| Arctigenin | -128.78 | -105.23 | -142.28 | -4.34 | -380.63 |
| Licoflavonol | -117.25 | -109.08 | -145.93 | -7.36 | -379.62 |
| Sitogluside | -118.77 | -98.61 | -157.28 | -4.15 | -378.81 |
| Glycyrol | -120.29 | -105.09 | -138.97 | -4.94 | -369.29 |
| Astragalin | -106.89 | -103.74 | -146.31 | -9.74 | -366.68 |
| Phaseol | -124.67 | -104.51 | -134.33 | -2.50 | -366.01 |
| 2XA0 | |||||
| Quercetin-3-o-galactopyranoside | -90.58 | -86.87 | -142.10 | -6.29 | -325.84 |
| Procyanidin B1 | -113.08 | -76.83 | -125.56 | -8.35 | -323.83 |
| Sitogluside | -99.00 | -85.65 | -122.43 | 0.00 | -307.08 |
| Kushenol E | -95.33 | -82.77 | -118.94 | -2.50 | -299.54 |
| Astragalin | -94.86 | -80.67 | -109.77 | -10.00 | -295.30 |
| Sesamin | -104.48 | -77.83 | -106.39 | -1.43 | -290.13 |
| Moracin D | -97.42 | -81.05 | -107.28 | -3.72 | -289.46 |
| Glycyrol | -95.99 | -80.34 | -108.72 | -3.82 | -288.87 |
| Mulberrofuran A | -97.32 | -79.13 | -110.80 | 0.00 | -287.25 |
| Gingerenone B | -88.69 | -74.76 | -120.70 | -2.42 | -286.56 |
a Scores such as MolDock Score, Rerank Score, Interaction, H-Bond, and Total score provide insights into the binding affinities and interaction profiles of ligands.
Energy map analysis of the top three docking hits against; A, gingerenone B, sesamin, and Quercetin-3-o-galactopyranoside at the active site of 1H30; B, Procyanidin B1, Mulberrofuran A, and Sigmoidin B at the active site of 1NQL; C, astragalin, Sitogluside, and Sigmoidin B at the active site of 1UA2; D, kushenol E, Procyanidin B1, and Icaritin at the active site of 2WWZ; E, quercetin-3-o-galactopyranoside, Kushenol E, and Sigmoidin B at the active site of 2X1N; and F, quercetin-3-o-galactopyranoside, Procyanidin B1, and Sitogluside at the active site of 2XA0 respectively. Green color indicates the region that might contribute to steric interaction, turquoise color indicates hydrogen acceptor favorable regions, yellow indicates hydrogen donor favorable regions, and electrostatic potential regions with red and blue color.


