Abstract
Background: Artificial intelligence (AI) models provide advanced applications in many scientific areas, including the prediction of the pathologic grade of tumors from radiologic images. Gliomas are among the most common malignant brain tumors in human adults, and their efficient diagnosis is of high clinical significance.

Objectives: Given the contribution of AI to medical diagnosis, we investigated the role of deep learning in the differential diagnosis and grading of human brain gliomas.

Methods: This study developed a new AI diagnostic model, EfficientNetB0, to grade and classify human brain gliomas using sequences from magnetic resonance imaging (MRI).

Results: We validated the new model on a standard dataset (BraTS-2019) and demonstrated that its AI components, i.e., convolutional neural networks and transfer learning, classified and graded glioma images with 98.8% accuracy.

Conclusions: The proposed model, EfficientNetB0, can classify and grade gliomas from MRI sequences with high accuracy, validity, and specificity, and it provides better performance and diagnostic results for human glioma images than the models developed in previous studies.

Keywords: Deep Learning; Convolutional Neural Networks; Glioma Grading; Magnetic Resonance Imaging; Transfer Learning
1. Background
Brain gliomas are among the most prevalent malignancies, and early management is critical to patient survival (1, 2). Gliomas are classified into 4 grades based on their histopathologic characteristics (3). The World Health Organization (WHO) has categorized gliomas into 2 main groups, low grade and high grade, spanning grades 1 to 4 according to invasiveness (4). Grade 4 gliomas, termed glioblastoma multiforme, are the most invasive type with the lowest survival rate (4). Thus, a correct diagnosis of the tumor's grade is a critical element of an effective treatment plan (5, 6).
Radiologic images obtained via MRI, using T1, T1c, T2, or fluid-attenuated inversion recovery (FLAIR) sequences, provide standard diagnostic information and support clinical decisions on an effective treatment plan (7). Grading gliomas by histopathology is costly and time-consuming. In recent years, non-invasive, rapid, safe, and inexpensive diagnostic methods that incorporate artificial intelligence (AI) algorithms have become increasingly popular in the management of brain tumors, including gliomas. Using an AI approach is prudent, since making diagnostic and treatment decisions based on MRI scans alone can be difficult and prone to irreversible errors (8). One popular AI approach is machine learning, which mimics the known patterns of human brain data processing to solve complex problems (9, 10). Other AI components, i.e., deep learning and convolutional neural networks (CNN), combined with MRI sequences, have shown promising outcomes and made significant contributions to the complex task of pathologically grading and classifying gliomas (9, 10).
Even though various AI methods have significantly helped the practice of diagnostic radiology, further advances are needed to improve lesion detection, segmentation, and classification of brain tumors, including gliomas (11). This is a challenging goal when it comes to choosing appropriate deep learning methods to detect and classify gliomas in the brain (12). Both machine learning and deep learning offer great potential to advance the radiologic diagnosis of gliomas (8, 13-15). To date, however, certain questions about the role of AI in the grading and classification of human gliomas remain unanswered, and these were the impetus for this study.
2. Objectives
This study was conducted to develop a transfer learning model to efficiently and accurately grade glioma tumor sequences, using established data derived from the MRI scans of patients with gliomas.
3. Methods
3.1. Study Design
The study design was approved by the Ethics Committee, Dept. of Information Technology Management, Faculty of Management and Economics, Science and Research Branch, Islamic Azad University, Tehran, Iran (Ethics Code: IR.IAU.SRB.REC.1399.052). This study involved only the collection and analysis of data from an existing dataset, BraTS-2019, which had been developed and validated in 2019 by the Center for Biomedical Image Computing & Analytics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA. All of the human data derived from the original dataset were anonymous and were kept strictly confidential during and after the completion of this study.
3.2. Literature Review
To select an appropriate deep learning method, we conducted a review of credible online literature. Initially, 190 published articles were identified based on their titles and keywords. Upon careful review, 35 of them met the criteria for informing the choice of a deep learning method.
3.3. Training the Model
The model was trained with the BraTS-2019 dataset, originally derived from standard MRI scans of 335 patients with brain gliomas. Of the 26,904 images in the dataset, 20% were used for validation and the remaining 80% for training the proposed model. The anonymized scans came from two groups of patients: 259 with high-grade gliomas (HGG) and 76 with low-grade gliomas (LGG). To balance the two groups of images, similar numbers of 2-D sequences from the HGG (n = 13,233) and LGG (n = 13,671) cases were used to train the proposed model.
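A minimal sketch of how such an 80/20 split over the 2-D slice files might be implemented is shown below; the file paths and list names are illustrative assumptions, not the study's actual code.

```python
# Hedged sketch: an 80/20 split of the 2-D slice file paths into training and
# validation sets. The file layout and names are hypothetical, for illustration only.
import random

random.seed(42)  # fixed seed for a reproducible split

hgg_slices = [f"slices/HGG/slice_{i:05d}.png" for i in range(13233)]  # hypothetical paths
lgg_slices = [f"slices/LGG/slice_{i:05d}.png" for i in range(13671)]

dataset = [(path, 1) for path in hgg_slices] + [(path, 0) for path in lgg_slices]  # 1 = HGG, 0 = LGG
random.shuffle(dataset)

split = int(0.8 * len(dataset))                    # 80% training, 20% validation
train_set, val_set = dataset[:split], dataset[split:]

print(len(dataset), len(train_set), len(val_set))  # 26,904 total images
```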
3.4. Deep Learning Models
Deep learning models are normally constructed by two approaches:
a) A model in which all of the layers are designed and built by the researchers. This was not our choice because it is costly and time-consuming.
b) A transfer learning model, in which the layers are adopted from other pretrained models. Such models are developed by well-known suppliers and are currently popular in AI applications. We used one of the smaller models of this family, EfficientNetB0, in the current study. Figure 1 illustrates the schematic view of the proposed transfer learning model and the processing of input images through the initial and advanced layers (a code sketch of this architecture follows Figure 1). After processing, the input MRI scans were graded and categorized into two groups, LGG or HGG images.
Figure 1. The schematic view of the proposed transfer learning model
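Below is a minimal, hedged sketch of such a transfer-learning setup in Keras, using the publicly available EfficientNetB0 ImageNet weights as a frozen feature extractor topped by a small binary (LGG vs. HGG) classification head. The head layers, optimizer, and learning rate are illustrative assumptions, not necessarily the study's exact configuration.

```python
# Hedged sketch: EfficientNetB0 (ImageNet weights) as a frozen base with a small
# binary classification head for LGG (0) vs. HGG (1).
import tensorflow as tf

base = tf.keras.applications.EfficientNetB0(
    include_top=False,            # drop the original ImageNet classifier
    weights="imagenet",
    input_shape=(240, 240, 3),    # slice size used in the study; grayscale slices
                                  # would need to be stacked or replicated to 3 channels
)
base.trainable = False            # transfer learning: keep pretrained weights fixed

model = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # estimated P(HGG)
])

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),
    loss="binary_crossentropy",
    metrics=["accuracy",
             tf.keras.metrics.Precision(name="precision"),
             tf.keras.metrics.Recall(name="sensitivity")],
)
model.summary()
```

Freezing the pretrained base means only the small head is fitted to the glioma slices, which is what distinguishes transfer learning from training a full network from scratch.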
3.5. Image Data Analyses
To prepare the images before training the model, the 3-D volumes were converted to 2-D images, and a data augmentation technique using random flips and rotations was applied, based on specific codes and parameters. We then designed the network layers and refined the parameters for transfer learning. Subsequently, we utilized a pretrained convolutional neural network, EfficientNetB0 (16), as our base model, one of the most recent and efficient transfer learning models for image classification. After training the model, we analyzed the results for accuracy, sensitivity, specificity, and precision. As indicated earlier, 20% of the 26,904 images were used for validation and the remaining 80% for model training.
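A minimal sketch of the random flip and rotation augmentation described above, written with Keras preprocessing layers, might look as follows; the flip mode and rotation factor are assumed values, not the study's reported parameters.

```python
# Hedged sketch of training-time augmentation with random flips and rotations
# (these preprocessing layers require TensorFlow >= 2.6).
import tensorflow as tf

augment = tf.keras.Sequential([
    tf.keras.layers.RandomFlip("horizontal_and_vertical"),
    tf.keras.layers.RandomRotation(0.1),   # up to +/-10% of a full turn (~36 degrees)
])

def preprocess(image, label, training=True):
    """Resize an HxWxC slice to the 240x240 model input and augment only during training."""
    image = tf.image.resize(image, (240, 240))
    if training:
        image = augment(tf.expand_dims(image, 0), training=True)[0]
    return image, label
```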
3.6. Normal vs. Abnormal Images
Due to hardware and software limitations, the proposed model was not designed to read normal (tumor-free) images; differentiating normal from abnormal scans would have required additional training. Thus, normal images were excluded from processing.
3.7. Experimental Setup
For each patient, 5 brain tumor sequences were available in the original dataset. We used 3 of them, i.e., T1ce, T2, and fluid-attenuated inversion recovery (FLAIR); the latter is an MRI sequence with the inversion recovery set to null fluids. One additional sequence was assigned as the ground truth. All sequences were in the Neuroimaging Informatics Technology Initiative (NIfTI) format, with a length, width, and slice number of 240, 240, and 155, respectively. The image size fed into the proposed model was 240 × 240, with a batch size of 32. Details are shown in Table 1.
Table 1. The Study's Dataset of Qualified Glioma Scans
We also utilized the Mango software for initial image visualization. The model was implemented in the Python programming language, using the Keras library with a TensorFlow backend. We used the SimpleITK library to read the MRI scans and the Matplotlib library to display images. Further, due to hardware limitations, the model was trained for only 50 epochs (Table 2); a short reading-and-slicing sketch follows Table 2.
Table 2. Stages and Parameters of the Prediction Model
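The following is a hedged sketch of how a single NIfTI sequence can be read with SimpleITK and sliced into 2-D axial images for visualization with Matplotlib, as described above; the file name is a placeholder, since BraTS file naming varies by release.

```python
# Hedged sketch: load one 240 x 240 x 155 NIfTI volume and split it into axial 2-D slices.
import SimpleITK as sitk
import matplotlib.pyplot as plt

volume = sitk.ReadImage("BraTS19_sample_flair.nii.gz")   # hypothetical file name
array = sitk.GetArrayFromImage(volume)                   # numpy array, shape (155, 240, 240)

slices_2d = [array[i] for i in range(array.shape[0])]    # one 2-D image per axial slice

plt.imshow(slices_2d[77], cmap="gray")                   # visualize a mid-volume slice
plt.title("FLAIR axial slice")
plt.axis("off")
plt.show()
```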
3.8. Image Reading Process
Finally, we employed Google Colab to execute the image readings. It provides a cloud-based environment for writing and running Python code and gives access to GPUs, including NVIDIA P100, P4, T4, and K80 cards.
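As a small, hedged illustration, a Colab notebook can confirm that a GPU runtime is visible to TensorFlow before training; this snippet is a generic check, not part of the study's code.

```python
# Verify that a GPU (e.g., Tesla T4, P100, or K80 on Colab) is available to TensorFlow.
import tensorflow as tf

gpus = tf.config.list_physical_devices("GPU")
print("GPUs available:", gpus if gpus else "none - switch the Colab runtime type to GPU")
```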
4. Results
Based on the methods described above, the proposed model predicted the tumor grades of the images with high accuracy. Details of the model's results and performance are presented below.
4.1. Validity and Versatility
Assuming that multiple sequences would provide more useful data for grading than a single sequence, grading was based on the T1ce, T2, and FLAIR sequences. Because we used the T1ce sequences, their T1 counterparts were not used; including them would have required additional hardware that we lacked. Table 1 presents the number of qualified images used for grading the HGG and LGG images. Based on the codes and parameters, and with image augmentation, the MRI scans were correctly fed into the proposed model (Table 2).
Figure 2 illustrates axial MRI views of LGG and HGG, based on FLAIR sequences. Figure 3 shows a series of glioma MRI scans altered by the augmentation technique after being processed by the proposed model.
Figure 2. Glioma images taken at axial angles: (A) low-grade glioma (LGG) and (B) high-grade glioma (HGG)
Figure 3. Altered images with the data augmentation technique
4.2. Performance Assessment
To assess whether the model was overfitting, we used a cross-validation technique in which the images were divided into separate training and validation sets. Figures 4 and 5 show the performance of the model during training; a plotting sketch follows the figure captions below. The proposed model was ready for grading tumor sequences after 50 epochs of training. As the number of training epochs increased, the accuracy of the model improved, and the training and validation curves progressed together at a similar rate (Figure 4). Likewise, as the number of training epochs increased, the network loss declined exponentially, and both the training and validation curves approached zero at similar rates (Figure 5).
Figure 4. Plot of accuracy with increasing epochs
Figure 5. Plot of decreasing loss with increasing epochs
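Curves like those in Figures 4 and 5 are typically drawn from the Keras training history; the helper below is a hedged sketch under that assumption, where the `history` argument is presumed to be the object returned by `model.fit(...)` over 50 epochs rather than a variable from the study's code.

```python
# Hedged sketch: plot training/validation accuracy and loss versus epoch.
import matplotlib.pyplot as plt

def plot_history(history):
    """Plot accuracy and loss curves from a Keras History object (cf. Figures 4 and 5)."""
    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))

    ax1.plot(history.history["accuracy"], label="training")
    ax1.plot(history.history["val_accuracy"], label="validation")
    ax1.set(xlabel="epoch", ylabel="accuracy")
    ax1.legend()

    ax2.plot(history.history["loss"], label="training")
    ax2.plot(history.history["val_loss"], label="validation")
    ax2.set(xlabel="epoch", ylabel="loss")
    ax2.legend()

    plt.tight_layout()
    plt.show()

# Hypothetical usage: plot_history(model.fit(train_ds, validation_data=val_ds, epochs=50))
```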
Once developed and trained, the model was capable of predicting the grades and classes of tumor sequences it had not seen during training. Table 3 compares the architecture, image dimension, base dataset, and accuracy of the proposed model for classifying HGG and LGG images with those of seven earlier models published between 2018 and 2020.
Table 3. Comparison of Earlier Models with the Proposed Model by the Current Study

| Study | Model's Architecture | Dimension | Dataset | Accuracy |
|---|---|---|---|---|
| (17) | Transfer Learning (VGG-16) | 2D | TCIA | 0.9500 |
| (18) | CNN | 3D | TCIA | 0.9125 |
| | | | In-house | 0.9196 |
| (19) | CNN | 2D | BraTS-2013 | 0.9943 |
| | | | BraTS-2014 | 0.9538 |
| | | | BraTS-2015 | 0.9978 |
| | | | BraTS-2016 | 0.9569 |
| | | | BraTS-2017 | 0.9778 |
| | | | ISLES-2015 | 0.9227 |
| (20) | Transfer Learning (AlexNet) | 2D | In-house | 0.8550 |
| | Transfer Learning (GoogLeNet) | | | 0.9090 |
| | Pre-trained AlexNet | | | 0.9270 |
| | Pre-trained GoogLeNet | | | 0.9450 |
| (21) | CNN (2D Mask R-CNN) | 2D | BraTS-2018, TCIA | 0.9630 |
| | CNN (3D ConvNet) | 3D | | 0.9710 |
| (22) | Multi-stream CNN | 2D | BraTS-2017 | 0.9087 |
| (23) | 3D CNN | 3D | BraTS-2018 | 0.9649 |
| Proposed model | Transfer Learning (EfficientNetB0) | 2D | BraTS-2019 | 0.9887 |
We compared the performance and accuracy of these earlier models with those of the proposed model. The accuracy of the earlier models ranged from 0.8550 to 0.9978, whereas the accuracy of our EfficientNetB0 model was 0.9887. The proposed model was also evaluated for its training capacity and the validity of the graded images (Table 4); a sketch of how these metrics are typically computed follows the table.
Table 4. Evaluation of the Proposed Model

| Performance | Training Set | Validation Set |
|---|---|---|
| Accuracy | 0.9905 | 0.9887 |
| Precision | 0.9915 | 0.9898 |
| Sensitivity | 0.9934 | 0.9886 |
| Specificity | 0.9920 | 0.9879 |
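The four metrics in Table 4 are standard functions of the binary confusion matrix; the snippet below is a hedged, generic sketch of how they can be computed from predicted probabilities, with the threshold and example values chosen purely for illustration.

```python
# Hedged sketch: accuracy, precision, sensitivity, and specificity for HGG (1) vs. LGG (0).
import numpy as np

def grading_metrics(y_true, y_prob, threshold=0.5):
    """Compute binary classification metrics from true labels and predicted probabilities."""
    y_pred = (np.asarray(y_prob) >= threshold).astype(int)
    y_true = np.asarray(y_true).astype(int)
    tp = np.sum((y_pred == 1) & (y_true == 1))
    tn = np.sum((y_pred == 0) & (y_true == 0))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "precision": tp / (tp + fp),
        "sensitivity": tp / (tp + fn),   # recall / true-positive rate
        "specificity": tn / (tn + fp),   # true-negative rate
    }

print(grading_metrics([1, 0, 1, 1, 0], [0.9, 0.2, 0.8, 0.4, 0.1]))  # toy example values
```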
5. Discussion
The proposed model classified the HGG and LGG images more accurately than the models reported in earlier studies (Table 3), which justifies comparing those earlier results with those of our model. Specifically, our model reliably classified images as LGG or HGG with an accuracy of almost 99%. The model's performance was consistent with the acceptable validity levels reported by numerous earlier studies on grade-prediction and classification models (3, 8, 9, 12, 13, 17, 18, 24). One of the weaknesses of classical machine learning is its reliance on the extraction and classification of specific hand-crafted features from the images it reads; deep learning has resolved this issue, hence its popularity. Also, the large number of images in a deep learning dataset provides better training and helps prevent overfitting of the model.
5.1. Validity and Versatility
Our decision to convert the 3-D images to 2-D was justified by the fact that a single 3-D volume can be segmented into roughly 150 2-D slices before being used for model training. Feeding a total of 26,904 qualified 2-D images of either HGG or LGG gliomas into the model rested on the premise that the more images used for training, the more efficient the model becomes. Using the T1ce, T2, and FLAIR sequences to develop the model made it more versatile at reading images from multiple MRI sequences than a model trained on a single sequence. Given its excellent performance, the proposed model's grading validity can be enhanced further, for example by feeding it additional images or training it on other advanced datasets.
5.2. Comparison of Deep Learning Models
The proposed model is not only comparable to older models (19, 22) in terms of validity, accuracy, and performance but may also be a better tool to assist in the grading of gliomas. The model detects more image detail because it was trained on a larger number of MRI sequences. A recent study (21) developed a diagnostic model based on the BraTS-2018 and TCIA datasets, using both 2-D and 3-D CNN models, and achieved slightly better results with the 3-D images than with the 2-D ones. In a similar study (24), a 2-D model achieved almost 99% validity for classifying various grades of gliomas. An earlier study (20) used local datasets and the transfer learning method to train pretrained networks (GoogLeNet and AlexNet) and compared their predictive and diagnostic performance; it achieved better results with GoogLeNet than with AlexNet.
5.3. Image Classification, Segmentation & Grading
The use of 2-D images in our study was justified because enough 3-D images were not available to us; we therefore expanded the study dataset by converting the limited 3-D volumes into 2-D images. Furthermore, using 3-D images to train a model requires a large amount of memory, which limits the network's resolution and lowers its representational capability (7, 25). Using 3-D images also increases the computational cost and GPU memory requirements (19). A recent study (18) developed a machine learning model for distinguishing low-grade from high-grade gliomas, using a pretrained CNN model validated on multiple versions of the BraTS dataset.
5.4. The 2D vs. 3D Advantages
Our main reason for using 2-D images was to develop a non-invasive, highly accurate transfer learning model that could validly diagnose the grades of 2-D MR glioma images. This approach allowed us to generate a large number of 2-D images from a single 3-D volume, which improved the prediction accuracy of the model to almost 99%. Despite previous research, the role of deep learning methods in improving tumor grading needs further exploration (8). A recent study achieved excellent validity, accuracy, and specificity for this purpose, using a transfer learning model pretrained with a CNN model based on the TCIA dataset (18).
5.5. Limitations
This study had limited access to the hardware and software required for processing 3-D images; therefore, it was more practical for us to convert the 3-D images into 2-D ones.
5.6. Conclusions
The proposed model can classify and grade glioma MRI sequences with high accuracy, validity, and specificity, and its performance is consistent with that reported by numerous earlier studies on tumor grading and classification. In the absence of hardware and technical limitations, future diagnostic models applicable to MRI sequences could plausibly achieve accuracy and performance superior to currently existing deep learning AI models. Presumably, using the largest model of the EfficientNet family (i.e., EfficientNetB7), which has ample trainable capacity, would provide researchers with the means to develop future AI diagnostic models with superior accuracy, performance, and versatility.
References
1. Işın A, Direkoğlu C, Şah M. Review of MRI-based Brain Tumor Image Segmentation Using Deep Learning Methods. Procedia Comput Sci. 2016;102:317-24. https://doi.org/10.1016/j.procs.2016.09.407.
2. Van Meir EG, Hadjipanayis CG, Norden AD, Shu HK, Wen PY, Olson JJ. Exciting new advances in neuro-oncology: the avenue to a cure for malignant glioma. CA Cancer J Clin. 2010;60(3):166-93. [PubMed ID: 20445000]. [PubMed Central ID: PMC2888474]. https://doi.org/10.3322/caac.20069.
3. Havaei M, Davy A, Warde-Farley D, Biard A, Courville A, Bengio Y, et al. Brain tumor segmentation with Deep Neural Networks. Med Image Anal. 2017;35:18-31. [PubMed ID: 27310171]. https://doi.org/10.1016/j.media.2016.05.004.
4. Louis DN, Perry A, Reifenberger G, von Deimling A, Figarella-Branger D, Cavenee WK, et al. The 2016 World Health Organization Classification of Tumors of the Central Nervous System: a summary. Acta Neuropathol. 2016;131(6):803-20. [PubMed ID: 27157931]. https://doi.org/10.1007/s00401-016-1545-1.
5. Tabatabai G, Stupp R, van den Bent MJ, Hegi ME, Tonn JC, Wick W, et al. Molecular diagnostics of gliomas: the clinical perspective. Acta Neuropathol. 2010;120(5):585-92. [PubMed ID: 20862485]. https://doi.org/10.1007/s00401-010-0750-6.
6. Tabatabaei M, Razaei A, Sarrami AH, Saadatpour Z, Singhal A, Sotoudeh H. Current Status and Quality of Machine Learning-Based Radiomics Studies for Glioma Grading: A Systematic Review. Oncology. 2021;99(7):433-43. [PubMed ID: 33849021]. https://doi.org/10.1159/000515597.
7. Yang T, Song J, Li L. A deep learning model integrating SK-TPCNN and random forests for brain tumor segmentation in MRI. Biocybern Biomed Eng. 2019;39(3):613-23. https://doi.org/10.1016/j.bbe.2019.06.003.
8. Tandel GS, Biswas M, Kakde OG, Tiwari A, Suri HS, Turk M, et al. A Review on a Deep Learning Perspective in Brain Cancer Classification. Cancers (Basel). 2019;11(1). [PubMed ID: 30669406]. [PubMed Central ID: PMC6356431]. https://doi.org/10.3390/cancers11010111.
9. Lotan E, Jain R, Razavian N, Fatterpekar GM, Lui YW. State of the Art: Machine Learning Applications in Glioma Imaging. AJR Am J Roentgenol. 2019;212(1):26-37. [PubMed ID: 30332296]. https://doi.org/10.2214/AJR.18.20218.
10. Ebrahimighahnavieh MA, Luo S, Chiong R. Deep learning to detect Alzheimer's disease from neuroimaging: A systematic literature review. Comput Methods Programs Biomed. 2020;187:105242. [PubMed ID: 31837630]. https://doi.org/10.1016/j.cmpb.2019.105242.
11. Abd-Ellah MK, Awad AI, Khalaf AAM, Hamed HFA. A review on brain tumor diagnosis from MRI images: Practical implications, key achievements, and lessons learned. Magn Reson Imaging. 2019;61:300-18. [PubMed ID: 31173851]. https://doi.org/10.1016/j.mri.2019.05.028.
12. Chen H, Qin Z, Ding Y, Tian L, Qin Z. Brain tumor segmentation with deep convolutional symmetric neural network. Neurocomputing. 2020;392:305-13. https://doi.org/10.1016/j.neucom.2019.01.111.
13. Sharif MI, Li JP, Khan MA, Saleem MA. Active deep neural network features selection for segmentation and recognition of brain tumors using MRI images. Pattern Recognit Lett. 2020;129:181-9. https://doi.org/10.1016/j.patrec.2019.11.019.
14. Sotoudeh H, Sadaatpour Z, Rezaei A, Shafaat O, Sotoudeh E, Tabatabaie M, et al. The Role of Machine Learning and Radiomics for Treatment Response Prediction in Idiopathic Normal Pressure Hydrocephalus. Cureus. 2021;13(10):e18497. [PubMed ID: 34754658]. [PubMed Central ID: PMC8569645]. https://doi.org/10.7759/cureus.18497.
15. Tabatabaie M, Sarrami AH, Didehdar M, Tasorian B, Shafaat O, Sotoudeh H. Accuracy of Machine Learning Models to Predict Mortality in COVID-19 Infection Using the Clinical and Laboratory Data at the Time of Admission. Cureus. 2021;13(10):e18768. [PubMed ID: 34804648]. [PubMed Central ID: PMC8592290]. https://doi.org/10.7759/cureus.18768.
16. Tan M, Le QV. EfficientNet: Rethinking model scaling for convolutional neural networks. International Conference on Machine Learning. Long Beach, California, USA. 2019. p. 6105-14.
17. Naser MA, Deen MJ. Brain tumor segmentation and grading of lower-grade glioma using deep learning in MRI images. Comput Biol Med. 2020;121:103758. [PubMed ID: 32568668]. https://doi.org/10.1016/j.compbiomed.2020.103758.
18. Decuyper M, Van Holen R. Fully automatic binary glioma grading based on pre-therapy MRI using 3D convolutional neural networks. arXiv preprint arXiv:1908.01506. 2019.
19. Amin J, Sharif M, Anjum MA, Raza M, Bukhari S. Convolutional neural network with batch normalization for glioma and stroke lesion detection using MRI. Cogn Syst Res. 2020;59:304-11. https://doi.org/10.1016/j.cogsys.2019.10.002.
20. Yang Y, Yan LF, Zhang X, Han Y, Nan HY, Hu YC, et al. Glioma Grading on Conventional MR Images: A Deep Learning Study With Transfer Learning. Front Neurosci. 2018;12:804. [PubMed ID: 30498429]. [PubMed Central ID: PMC6250094]. https://doi.org/10.3389/fnins.2018.00804.
21. Zhuge Y, Ning H, Mathen P, Cheng JY, Krauze AV, Camphausen K, et al. Automated glioma grading on conventional MRI images using deep convolutional neural networks. Med Phys. 2020;47(7):3044-53. [PubMed ID: 32277478]. [PubMed Central ID: PMC8494136]. https://doi.org/10.1002/mp.14168.
22. Ge C, Gu IY, Jakola AS, Yang J. Deep Learning and Multi-Sensor Fusion for Glioma Classification Using Multistream 2D Convolutional Networks. Annu Int Conf IEEE Eng Med Biol Soc. 2018;2018:5894-7. [PubMed ID: 30441677]. https://doi.org/10.1109/EMBC.2018.8513556.
23. Mzoughi H, Njeh I, Wali A, Slima MB, BenHamida A, Mhiri C, et al. Deep Multi-Scale 3D Convolutional Neural Network (CNN) for MRI Gliomas Brain Tumor Classification. J Digit Imaging. 2020;33(4):903-15. [PubMed ID: 32440926]. [PubMed Central ID: PMC7522155]. https://doi.org/10.1007/s10278-020-00347-9.
24. Sultan HH, Salem NM, Al-Atabany W. Multi-Classification of Brain Tumor Images Using Deep Neural Network. IEEE Access. 2019;7:69215-25. https://doi.org/10.1109/access.2019.2919122.
25. Wang G, Li W, Ourselin S, Vercauteren T. Automatic Brain Tumor Segmentation Using Cascaded Anisotropic Convolutional Neural Networks. In: Crimi A, Bakas S, Kuijf H, Menze B, Reyes M, editors. Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. Vol. 10670. Cham: Springer; 2018. p. 178-90. https://doi.org/10.1007/978-3-319-75238-9_16.