1. Background
2. Objectives
3. Patients and Methods
3.1. Study Design and Patient Selection
3.2. Image Acquisition
3.3. Segmentation
3.4. Preprocessing and Radiomics Feature Extraction
3.5. Feature Selection, Model Validation, Classification, and Performance Evaluation
3.6. Conventional Statistical Analysis
3.7. Ethical Considerations
4. Results
| Variables | Independent samples t-test (t-test for equality of means) | ||
|---|---|---|---|
| Sig. (2-tailed) | Mean difference | Std. error difference | |
| Age | 0.889 | -0.28 | 2.02 |
| Body weight (kg) | 0.003 | -9.78 | 3.18 |
| BMI (kg/m2) | 0.001 | -4.37 | 1.27 |
Abbreviation: BMI, body mass index.
| Radiomics feature a | Definition |
|---|---|
| L1-4 log-sigma-3-0-mm-3D GLCM sum squares | The sum of squares or Variance is a measure of the mean intensity level in the GLCM in the distribution of neighboring intensity level pairs. The L1-4 label represents the related segmented vertebral level. It was obtained from the images created by choosing the Laplacian of Gaussian filter 3 mm. |
| L3 log-sigma-3-0-mm-3D first order minimum | First order minimum is the minimum grey level value in the image. The L3 label represents the related segmented vertebral level. It was obtained from the images created by choosing the Laplacian of Gaussian filter 3 mm. |
| L3 wavelet-LLH first order Mean | First order mean is the average grey level value in the image. The L3 label represents the related segmented vertebral level. It was obtained from the images created by wavelet transform. LLH is one of the 8 sub volumes in the 3D wavelet transform. |
| L1 wavelet-LHH GLSZM large area high gray level emphasis | LAHGLE measures the ratio in the image of the co-distribution of larger sized regions with higher grey level values. The L1 label represents the related segmented vertebral level. It was obtained from images created by wavelet transform. LHH is one of the 8 sub volumes in the 3D wavelet transform. |
| L2 log-sigma-3-0-mm-3D GLCM IDN | One of the measures of an image's local homogeneity is IDN (inverse difference normalized). Unlike homogeneity1, IDN normalizes the difference in intensity values between neighbors by dividing it by the total number of discrete intensity values. The L2 label represents the related segmented vertebral level. It was obtained from the images created by choosing the Laplacian of Gaussian filter 3 mm. |
Abbreviations: GLSZM, gray level size zone matrix; GLCM, gray level co-occurrence matrix.
a The 5 features selected by fast correlation-based filter (FCBF).
| Variables | AUC | CA | F1 | Precision | Recall | Specificity |
|---|---|---|---|---|---|---|
| ML Algorithm | ||||||
| Naive Bayes | 0.913 | 0.907 | 0.683 | 0.7 | 0.667 | 0.95 |
| Neural Network | 0.752 | 0.864 | 0.296 | 0.667 | 0.19 | 0.983 |
| Random Forest | 0.743 | 0.864 | 0.345 | 0.625 | 0.238 | 0.975 |
| SVM | 0.738 | 0.857 | 0.091 | 1 | 0.048 | 1 |
| Logistic Regression | 0.722 | 0.864 | 0.296 | 0.67 | 0.19 | 0.983 |
| kNN | 0.709 | 0.85 | 0.16 | 0.5 | 0.095 | 0.983 |
| Decision Tree | 0.654 | 0.8 | 0.333 | 0.333 | 0.333 | 0.882 |
Abbreviation: ML, machine learning; AUC, area under the curve; SVM, support vector machine; KNN, k-nearest neighbors.
a The performance of ML algorithms.
| Variable | Predicted | ||
|---|---|---|---|
| Osteoporotic | Non-osteoporotic | Total | |
| Actual | |||
| Osteoporotic | 14 (10) | 7 (5) | 21 |
| Non-osteoporotic | 6 (4) | 113 (81) | 119 |
| Total | 20 | 120 | 140 |
a Confusion matrix for the Naive Bayes model (showing number of instances and percentages).
| Radiomics feature | AUC (%95 CI) | P | Threshold a | Sensitivity | Specificity |
|---|---|---|---|---|---|
| L1-4 log-sigma-3-0-mm-3D first order median b | 0.742 (0.631 - 0.854) | 0.0004 | 161.8236694 | 0.714 | 0.714 |
| L1 wavelet-LHH first order range b | 0.742 (0.627 - 0.856) | 0.0004 | 772.300293 | 0.857 | 0.580 |
| L2-4 log-sigma-3-0-mm-3D first order median b | 0.74 (0.631 - 0.85) | 0.0005 | 130.8338051 | 0.905 | 0.513 |
| L1 wavelet-LHH GLDM high gray Level emphasis b | 0.733 (0.633 - 0.833) | 0.0007 | 12426.98297 | 0.762 | 0.655 |
| L1 wavelet-LHL GLDM large dependence high gray level emphasis b | 0.728 (0.623 - 0.834) | 0.0009 | 316.7002581 | 0.905 | 0.538 |
| L1 wavelet-LHL gray level size zone matrix (GLSZM) small area emphasis b | 0.726 (0.627 - 0.825) | 0.0010 | 669.0405296 | 0.857 | 0.597 |
| L1 wavelet-LHL first order minimum b | 0.726 (0.603 - 0.849) | 0.0010 | 138.3913483 | 0.667 | 0.798 |
| L4 log-sigma-3-0-mm-3D first order median b | 0.723 (0.617 - 0.829) | 0.0011 | 123.4394493 | 0.905 | 0.555 |
| L1 wavelet-LHL neighborhood gray-tone difference Matrix (NGTDM) coarseness b | 0.722 (0.623 - 0.821) | 0.0012 | 820.7429058 | 0.952 | 0.487 |
| L1 wavelet-LHL first order root mean squared b | 0.721 (0.613 - 0.829) | 0.0013 | 144355620.8 | 0.810 | 0.580 |
| L1 wavelet-HHL GLSZM zone percentage b | 0.72 (0.602 - 0.838) | 0.0013 | 0.424339815 | 0.714 | 0.723 |
| L1 wavelet-LHL first order entropy b | 0.72 (0.612 - 0.829) | 0.0013 | 594726568.4 | 0.810 | 0.580 |
| L1-4 log-sigma-3-0-mm-3D first order root mean squared b | 0.719 (0.607 - 0.831) | 0.0014 | 238.6731562 | 0.714 | 0.739 |
| L1 wavelet-LHL first order variance b | 0.719 (0.606 - 0.832) | 0.0014 | 183.7174645 | 0.952 | 0.403 |
| L1 wavelet-LHL gray level run length matrix (GLRLM) long run emphasis b | 0.719 (0.607 - 0.831) | 0.0014 | 56.88197474 | 0.952 | 0.403 |
| L1 wavelet-HHL NGTDM complexity b | 0.718 (0.6 - 0.836) | 0.0015 | 0.676536483 | 0.714 | 0.723 |
| L1 wavelet-LHL first order uniformity b | 0.718 (0.593 - 0.843) | 0.0015 | 5.645465978 | 0.571 | 0.857 |
| L1 wavelet-LHL GLRLM gray level variance b | 0.718 (0.605 - 0.831) | 0.0015 | 33487.69379 | 0.952 | 0.403 |
| L1 wavelet-LHL GLDM small dependence emphasis b | 0.718 (0.605 - 0.831) | 0.0015 | 53.66430712 | 0.952 | 0.403 |
| L1-4 log-sigma-3-0-mm-3D first order mean b | 0.718 (0.593 - 0.843) | 0.0015 | 155.6497469 | 0.762 | 0.639 |
| L1 wavelet-LHL GLRLM short run low gray level emphasis b | 0.717 (0.608 - 0.826) | 0.0015 | 2414.827911 | 0.905 | 0.471 |
| L1 wavelet-LHL GLSZM size zone non-uniformity normalized b | 0.717 (0.615 - 0.819) | 0.0015 | 893.0282637 | 0.762 | 0.664 |
| L1 wavelet-LHL GLRLM long run low gray level emphasis b | 0.717 (0.615 - 0.819) | 0.0015 | 849.1502259 | 0.762 | 0.664 |
| L1 wavelet-LHL GLDM small dependence high gray level emphasis b | 0.715 (0.614 - 0.817) | 0.0017 | 883.5853389 | 0.762 | 0.664 |
| L2 log-sigma-3-0-mm-3D first order median b | 0.715 (0.59 - 0.841) | 0.0017 | 182.5972176 | 0.619 | 0.824 |
| L3 log-sigma-3-0-mm-3D first order root mean squared b | 0.715 (0.609 - 0.821) | 0.0017 | 221.0883332 | 0.810 | 0.622 |
| L1 wavelet-HHL first order range b | 0.715 (0.603 - 0.827) | 0.0017 | 277.4433441 | 0.810 | 0.580 |
| L1 wavelet-LHL gray level co-occurrence matrix (GLCM) Id b | 0.714 (0.598 - 0.831) | 0.0018 | 74.51247581 | 0.619 | 0.723 |
| L1 wavelet-LHL GLSZM gray level non-uniformity normalized b | 0.713 (0.597 - 0.83) | 0.0018 | 5.074148459 | 0.619 | 0.782 |
| L3 log-sigma-3-0-mm-3D first order median b | 0.712 (0.601 - 0.823) | 0.0020 | 154.2673759 | 0.714 | 0.672 |
| L1-4 log-sigma-3-0-mm-3D first order 90percentile b | 0.711 (0.596 - 0.826) | 0.0021 | 374.7948471 | 0.714 | 0.748 |
| L1 wavelet-LHL first order energy b | 0.711 (0.592 - 0.83) | 0.0021 | 239.8392797 | 0.524 | 0.815 |
| L1 wavelet-LHL GLCM Imc2 b | 0.711 (0.607 - 0.815) | 0.0021 | 802.288168 | 0.762 | 0.639 |
| L1 wavelet-LHL GLCM IDM b | 0.71 (0.594 - 0.826) | 0.0022 | 90.46296759 | 0.619 | 0.731 |
| L2-4 log-sigma-3-0-mm-3D first order root mean squared b | 0.709 (0.601 - 0.818) | 0.0023 | 226.992795 | 0.714 | 0.681 |
| L3 log-sigma-3-0-mm-3D NGTDM busyness c | 0.709 (0.604 - 0.814) | 0.0023 | 0.368800857 | 0.905 | 0.471 |
| L2-4 log-sigma-3-0-mm-3D first order mean b | 0.708 (0.586 - 0.83) | 0.0024 | 145.8226851 | 0.762 | 0.588 |
| L1 wavelet-LHL first order kurtosis c | 0.707 (0.601 - 0.814) | 0.0025 | -640.053894 | 0.762 | 0.622 |
| L1 wavelet-HHL GLCM Cluster prominence c | 0.707 (0.604 - 0.81) | 0.0025 | 0.093677529 | 0.857 | 0.529 |
| L1 wavelet-LHL GLSZM Small Area high gray level emphasis c | 0.707 (0.592 - 0.821) | 0.0026 | 0.04102843 | 0.905 | 0.471 |
| L1 wavelet-LHL first order mean b | 0.706 (0.584 - 0.829) | 0.0026 | 4.713948691 | 0.571 | 0.807 |
| L1 wavelet-LHL GLCM IDMN b | 0.706 (0.601 - 0.811) | 0.0026 | 27.3785354 | 0.762 | 0.622 |
| L1 wavelet-LHL GLCM joint energy b | 0.706 (0.601 - 0.811) | 0.0026 | 54.75707079 | 0.762 | 0.622 |
| L1 wavelet-LHL GLRLM run length non-uniformity normalized b | 0.706 (0.603 - 0.809) | 0.0026 | 831.1979223 | 0.952 | 0.454 |
| L1 wavelet-LHL GLCM sum entropy b | 0.705 (0.59 - 0.821) | 0.0028 | 47.99747138 | 0.714 | 0.605 |
| L1 wavelet-HHL GLCM joint average c | 0.705 (0.585 - 0.824) | 0.0028 | 0.140912352 | 0.762 | 0.639 |
| L3 wavelet-LLH first order median b | 0.704 (0.565 - 0.843) | 0.0029 | 70.33731461 | 0.571 | 0.815 |
| L1 wavelet-LHL GLCM cluster tendency b | 0.704 (0.591 - 0.817) | 0.0029 | 575635.1876 | 0.810 | 0.571 |
| L1 wavelet-LHL first order range b | 0.703 (0.587 - 0.82) | 0.0030 | 2222.682983 | 0.571 | 0.773 |
| L2 wavelet-LLL GLCM difference average b | 0.703 (0.613 - 0.793) | 0.0031 | 18.4495188 | 1.000 | 0.479 |
| L2 log-sigma-3-0-mm-3D GLSZM zone variance c | 0.703 (0.597 - 0.809) | 0.0031 | 45.36158539 | 0.714 | 0.647 |
| L1 wavelet-LHL GLCM correlation b | 0.702 (0.579 - 0.825) | 0.0032 | 3.771944675 | 0.667 | 0.723 |
| L2 log-sigma-3-0-mm-3D GLSZM large area emphasis c | 0.702 (0.596 - 0.807) | 0.0032 | 101.9358356 | 1.000 | 0.370 |
| L1-4 wavelet-LLL GLCM difference variance b | 0.702 (0.599 - 0.805) | 0.0032 | 259.307057 | 0.857 | 0.529 |
| L3 log-sigma-3-0-mm-3D first order 90percentile b | 0.702 (0.594 - 0.809) | 0.0032 | 362.6814285 | 0.714 | 0.731 |
| L2 wavelet-LLL GLCM contrast b | 0.701 (0.611 - 0.792) | 0.0033 | 619.6491217 | 0.952 | 0.521 |
| L1 wavelet-LLL GLDM high gray level emphasis b | 0.701 (0.591 - 0.81) | 0.0034 | 269.6090191 | 0.857 | 0.504 |
| L2-4 log-sigma-3-0-mm-3D first order 90 percentile b | 0.7 (0.589 - 0.811) | 0.0035 | 357.1382874 | 0.762 | 0.681 |
a Threshold values are defined by Youden Index.
b Larger test result indicates more positive test.
c Smaller test result indicates more positive test.




