Designing a Data Mining System to Predict Treatment-Requiring Retinopathy of Prematurity in Neonates: A Pilot Study

authors:

avatar Farshid Khorasani 1 , avatar Ramak Roohi poor 2 , avatar Afsar Dastjani Farahani 3 , avatar Azam Orooji 4 , avatar Mohammad Reza Zarkesh 5 , *

Department of Pediatrics, Imam Khomeini Hospital, Tehran University of Medical Sciences, Tehran, Iran
Department of Ophthalmology, Farabi Eye Hospital, Tehran University of Medical, Tehran, Iran
Eye Research Center, Farabi Eye Hospital, Tehran University of Medical Sciences, Tehran, Iran
School of Medicine, North Khorasan University of Medical Science (NKUMS), Bojnurd, Iran
Maternal, Fetal and Neonatal Research Center, Tehran University of Medical Sciences, Tehran, Iran

how to cite: Khorasani F , Roohi poor R , Dastjani Farahani A, Orooji A , Zarkesh M R . Designing a Data Mining System to Predict Treatment-Requiring Retinopathy of Prematurity in Neonates: A Pilot Study. Iran J Pediatr. 2021;31(2):e103094. https://doi.org/10.5812/ijp.103094.

Abstract

Background:

Nowadays with advanced improvement in NICUs, more preterm infants are surviving with more risks related to ROP.

Objectives:

The aim of the present study was to collect ROP risk factors and design data mining techniques to suggest a predictive ROP treatment-requiring model.

Methods:

A cross-sectional study was carried out in an Iranian hospital (2014 - 2018). The population study consisted of 76 preterm neonates with ROP diagnosis. Of all, retinopathy was treated in 35 cases and others had not received any treatment associated with retinopathy. The pre-set questionnaire was used to extract the risk factors leading to treatment-requiring retinopathy. Then specific software models were designed for predicting ROP treatment-requiring model. In order to compare the performance of data mining methods, several performance metrics such as accuracy, precision, sensitivity, specificity, and F-measure have been used.

Results:

Seventy neonates with ROP entered the study. Results have shown that among four models, Naive Bayes had the best performance with the highest accuracy (87.14), precision (96.43), sensitivity (77.14) and F-measure (85.71). Confusion matrix for Naive Bayes classifier showed that positive predictive value and negative predictive value were 0.7714 and 0.9714, respectively. Overall 87.14% of all data were correctly classified. Moreover, of all data mining techniques, decision tree model could indicate understandable findings as follow; if oxygen therapy continues more than 16 days or blood infusion is > 6 units of packed cells then patients need treatment.

Conclusions:

The results of the present study have demonstrated that data mining techniques could be effectively implemented in ROP screening programs.

1. Background

Retinopathy of prematurity (ROP), as a vasoproliferative eye disorder, is a severe and common eye complication. ROP develops in about 64% of preterm newborns with birth weight < 1501 grams. It is the leading cause of a range of problems from visual impairment in childhood and permanent blindness (1, 2). Based on the results of a meta-analysis study that systematically reviewed 42 articles including 18,000 premature infants, the prevalence of ROP was reported to be 23.5% (95% CI: 20.4 - 26.8) in Iran. The prevalence of ROP stages 1, 2, 3, 4 and 5 was 7.9% (95% CI: 5.3 - 11.5), 9.7% (95% CI: 6.1 - 15.3), 2.8% (95% CI: 1.6 - 4.9), 2.9% (95% CI: 1.9 -4.5) and 3.6% (95% CI: 2.4 - 5.2), respectively (3).

The cause of ROP correlates to the use of supplemental oxygen therapy in preterm neonates suffering from respiratory distress. Although un-monitored treatment with 100% oxygen given in the hopes of improving the survival of neonate, it may cause more mature newborns with severe retinopathy (4). Oxygen toxicity affects the undeveloped retina and immature retinal vessels resultingin hypoxia-induced retina ischemia and detachment (5). The most typical risk factors include hyperoxemia, low birth weight in relation to weeks of pregnancy, low levels of insulin-like growth factor 1, neonatal co-morbidities like Intraventricular hemorrhage, sepsis, respiratory and heart diseases, necrotizing enterocolitis, blood infusion, lack of vitamin E, maternal complications and smoking (6, 7).

Nowadays with advanced improvement in NICUs and standards of health care systems, more extreme preterm infants are surviving with more risks related to ROP. To reduce unfavorable outcomes associated ROP, diagnosis of treatment-requiring cases in the premature newborns is of importance. Several screening strategies for ROP are published. An effective screening program is recommended by the American Academy of Pediatrics. It is stated that all neonates with birth weight ≤ 1500 g, or gestational age < 30 weeks, infants with birth weight 1500 - 2000 g and gestational age of > 30 weeks with other ROP risk factors or newborns receiving oxygen therapy for more than several days or un-monitored oxygenation, should be screened for ROP (8). The British Pediatrics and Child Health Guidelines also recommends a screening schedule for ROP in premature infants with gestational age < 32 weeks or birth weight ≤ 1501 g (9). A guideline by The Canadian Pediatric Society recommends ROP screening in preterm infants younger than 30 weeks or with a birth weight below 1500 g (10).

Different screening guidelines from different countries show some existing controversies related to gestational age or birth weight. Other investigations have also supported the necessity of some changes in present guidelines related priority of some risk factors and criteria. Moreover, because of numerous variations in regional ROP epidemiology, availability of appropriate equipment, infrastructures and skilled specialists, screening protocols need some reconsideration (10, 11).

Recently, data mining methods as valuable techniques are implemented in the health care system to include massive retrospective data, accommodate a great number of new variables and encompass a wide variety of analytical methods. Data mining techniques have been deployed to discover the implicit meaningful patterns and knowledge, use the extracted patterns for determination of disease diagnosis, prognosis, prediction algorithms models, treatment, and share the results (12, 13).

Several investigations have indicated the importance of epidemiologic factors in the prevalence of ROP (14, 15). Considering epidemiologic factors may influence the implementation of different prevention and treatment protocols in different countries.

2. Objectives

The aim of the present study was to collect ROP risk factors and predicting treatment-requiring ROP using four data mining techniques including multi-layer perceptron (MLP), Random Forest, Naive Bayes, and Decision Tree (J48). Moreover, the performance of the data mining models was compared based on different performance metrics. With the best model, early screening examinations and treatment for ROP could be implemented to avoid severe complications like permanent blindness in Iranian premature neonates.

3. Methods

A cross-sectional study was carried out in Yas Women Hospital affiliated with Tehran University of Medical Sciences, Tehran-Iran. The population study consisted of 76 hospitalized preterm neonates with ROP diagnosis from April 2014 to October 2018 and all of them are included in this study; therefore, there is no bias in sampling

Firstly risk factors related to retinopathy of prematurity requiring medical intervention were determined (16) and recorded in a checklist. The explored risk factors were as follows; gestational age, birth weight, sex, type of delivery, cord blood pH, first and 5th minute Apgar scores, hemoglobin, and C-reactive protein (CRP) levels, hyperglycemic status, history of sepsis and positive blood culture, history of blood transfusion (packed cell) or intra-lipid infusion, history of intrauterine growth restriction (IUGR), history of disseminated intravascular coagulation (DIC), abnormal brain ultrasound findings, intubation and oxygenation period, maternal age, history of maternal hypertension, eclampsia, controlled and uncontrolled diabetes. All participants' medical records were assessed. The pre-set checklist was used as a data extraction form to extract the risk factors leading to treatment-requiring retinopathy from neonates' medical records. Then, six records with more than 70% missing data, were excluded from analysis. There were 17 unregistered data elements that were replaced using mean and median for continuous and discrete attributes, respectively.

A high sensitive software model was designed to predict the need for medical intervention for retinopathy of prematurity. The final output was defined as "treatment-requiring" by positive or negative options. First, data preprocessing was done. Then, 4 data mining techniques including multi-layer perceptron (MLP), random forest, naive bayes and decision tree (J48) implemented in Weka were applied. To compare the performances of each data mining method, several performance metrics such as the accuracy, precision, sensitivity, specificity, and F-measure have been used whereas all of them were calculated based on confusion matrix (Table 1). Moreover, the 10-fold cross validation was implemented to evaluate the performance of each model. By this technique, unreliable and biased results were identified and avoided.

Table 1

. Definition of Performance Metrics

Performance MetricsDefinitions
PrecisionTP/ (TP + FP)
Specificity or true negative rate TN/N = TN/ (TN + FP)
Sensitivity or true positive rate (TPR) or recallTP/P = TP/ (TP + FN)
Accuracy (TP + TN)/ (TP + TN + FP + FN)
F-measure2 (precision × recall)/ (precision + recall)

3.1. Primary Outcomes

Our primary objective was to suggest some criteria for predicting treatment-requiring retinopathy of prematurity in neonates.

3.2. Ethical Considerations

The present study was taken from a medical student thesis with ID 9311165008. Our study was approved by the institutional review board of Tehran University of Medical Sciences (No: IR.TUMS.MEDICINE.REC.1397.547) according to the Helsinki declaration. All gathered data were considered confidential and no extra cost was imposed on our participants.

4. Results

Seventy neonates (40 males) with ROP entered the study. Retinopathy in 35 cases was treated and the others had not received any treatment associated with their retinopathy. The mean gestational age in treated and untreated groups was 29.34 ± 2.460 and 31.714 ± 2.77 weeks respectively. The mean birth weight of the treated group was also lower than that of the other group (1187.61 gr vs 1568.14 gr). Other risk factors were assessed between groups. Descriptive quantitative and qualitative data related to 23 input variables (risk factors) are presented in Tables 2 and 3. With positive and negative values, the status of treatment-requiring as the output was determined.

The accuracy, precision, sensitivity, specificity and F-measure for different predicting models were calculated and compared. Results associated to 10-fold cross validation are presented in percents in Table 4 and Figure 1.

Table 2.

Descriptive Data Related Quantitative Risk Factors in 2 Groups with and Without Treatment a

VariablesPositive group (with treatment), N = 35Negative Group (without Treatment), N = 35
Gestational age (weeks)29.34 ± 2.46031.714 ± 2.77
Weight (gr)1187.61 ± 414.151568.14 ± 694.63
Mother's age (y)29 ± 4.7829.10 ± 4.12
First minute apgar score6.62 ± 1.907.90 ± 1.13
5th Minute Apgar Score8.42 ± 0.838.91 ± 0.27
Cord blood pH7.26 ± 0.087.24 ± 0.08
Hb (mg/dL)14.55 ± 1.8916.54 ± 1.04
Blood infusion (units of packed cell)6.41 ± 4.832.41 ± 1.77
Intubation period (days)14.49 ± 12.683.50 ± 3.09
Oxygenation period (days)24.68 ± 16.747.97 ± 3.85
Table 3.

Descriptive Data Related Qualitative Risk Factors in 2 Groups with and without Treatment

VariablesPositive Group (with Treatment), N = 35Negative Group (without Treatment), N = 35
Sex
Male2020
Female1515
Intrauterine growth restriction
Yes811
No2724
C-reactive protein
Positive135
Negative2230
Hypertension
Yes512
No3023
Eclampsia
Yes99
No2626
Controlled diabetes
Yes49
No3126
Uncontrolled diabetes
Yes01
No3534
Type of delivery
C/S3126
NVD49
Intralipid infusion
Yes 3534
No01
Hyperglycemic status
Yes60
No2935
Sepsis
Yes1711
No1824
Disseminated intravascular coagulation
Yes2423
No1112
Brain ultrasound finding
Normal2630
Abnormal95
Table 4.

The Accuracy, Precision, Specificity, Sensitivity and F-Measure of Different Predicting Models

Predicting ModelsAccuracyPrecisionSpecificitySensitivityF-Measure
MLP71.4372.7374.2968.5770.59
Random forest85.7193.1094.2977.1484.38
Naive bayes87.1496.4397.1477.1485.71
Decision tree85.7110010071.4383.33
Comparison of performance metrics in four data mining thecniques using 10-fold cross validation
Comparison of performance metrics in four data mining thecniques using 10-fold cross validation

Comparing the performance of data mining methods in predicting treatment-requiring, several performance metrics (accuracy, precision, sensitivity, specificity, and F-measure) have been used. Since the aim of this study was to design a screening model, the best models were selected based on their sensitivity. In a model with high sensitivity, treatment-requiring could be recognized well. Results showed that the sensitivities correlated with Random Forestand Naive Bayes models were higher in comparison with other techniques. However, among 4 models, Naive Bayes had the best performance with the highest accuracy (87.14), precision (96.43), sensitivity (77.14) and F-measure (85.71). Confusion matrix for Naive Bayes classifier showed that 27 cases in the positive group were diagnosed correctly while 8 cases were classified in the false-negative group (positive predictive value; 0.7714). On the other hand, only one negative case was classified in the false-positive group (negative predictive value; 0.9714). Overall 87.14% of all data were correctly classified (Table 5).

Table 5.

Confusion Matrix for Naive Bayes Classifier

Model Output
No treatment requiringTreatment requiring
Number of patients in each group based on medical records
Treatment requiring827
No treatment requiring341

Of all data mining techniques, Decision Tree model could indicate understandable findings (Figure 2).

Decision tree based on input data
Decision tree based on input data

This model showed: If oxygen therapy continues more than 16 days then patients need treatment; If blood infusion is > 6 units of packed cells in cases with oxygen therapy period less than 16 days then patients need treatment; If oxygen therapy continues < 16 days oxygen therapy and blood infusion is < 6 units, then cases need no treatment.

5. Discussion

Many studies have indicated that ROP is associated with low gestational age and birth weight; however, different criteria cut-offs were produced in screening schedules (12-18). Variation in guidelines for ROP screening and implementing referral criteria maybe related to other factors including the availability of human and material resources, the health infrastructures, care programs associated with antenatal, obstetric and neonatal periods, as well as physician’s knowledge about ROP (19).

In accordance with other investigations, we found that the mean gestational age and birth weight in the ROP treated group was lower than that in the untreated group. The mean gestational age and birth weight in the treatment-requiring group was 29.34 ± 2.460 weeks and 1187.61 g. Another study from Iran by Karkhaneh et al. have demonstrated that the mean gestational age and birth weight in 953 premature infants with severe ROP was 28.8 ± 2.4 weeks and 1256 ± 389 g (20). Ahmadpour-Kacho et al. have also indicated that the mean gestational age and birth weight in 256 Iranian neonates with ROP diagnosis was 30.54 ± 2.28 weeks and 1403.47 ± 333.44 g, respectively. They have shown that the occurrence of ROP could be predicted in premature newborns by clinical risk index for babies (CRIB) as a scoring system; however, this index could not be a reliable predicting index for ROP severity or prognosis (21). Vyas et al. assessed the survival rates and rates of > stage 3 ROP in different populations. Survival rates in infants with gestational age < 26, birth weight < 751 and with CRIB > 10 were 47.5%, 41.2% and 25.2%, respectively. The rate of severe ROP was also 48.4%. They have shown that ROP group had higher mean birth weight (1403 g) and gestational age (30.54 weeks) but lower CRIB scores (22).

Pediatric screening guidelines were implemented to prevent the virulent form of ROP and childhood blindness based on Retinopathy of Prematurity Plus (23). On the other hand, Chiang et al. showed a fair diagnostic agreement of plus disease diagnosis among 22 ROP experts. Their results showed that all participants were agreeing on the same diagnosis related only 4 of 34 wide-angle retinal photographs (24). Other investigations also confirmed such findings by the mean weighted kappa statistic from 0.21 - 0.40 as fair to 0.41 - 0.60 as moderate values in diagnostic agreement of ROP treatment-requiring in plus disease (24-30).

Fortes et al. assessed the value of SNAPPE-II (score for neonatal acute physiology and perinatal extension) in predicting ROP but they could not find a significant association between the SNAPPE-II scores and the risk of ROP development (31). Fleck et al. found a correlation between international differences in ROP treatment rates within BOOST (benefits of oxygen saturation targeting) and international variation in ROP grading. They proposed strong needs for enhancement in the standardization of the diagnosis of ROP treatment-requiring, training in the grading of ROP, implementation of international approach, and ROP image analysis software (26).

The results of present study have demonstrated that designing data mining techniques to suggest a ROP treatment-requiring model could improve clinical outcomes. Both proposed diagnostic models including Random Forestand Naive Bayes models had high sensitivity by 77.14%. Regarding specificity, the Naive Bayes model with 94.29% was the best among the four techniques. Therefore Bayes' model with high sensitivity and specificity may be suggested as a screening treatment-requiring model. Previous studies have also shown that naïve Bayes is suitable for a small dataset with high correlation between the task and other non-task attribute variables (32). Moreover, of all data mining techniques, the Decision Tree model with reasonable sensitivity and specificity (71.43 & 100 %) using two variables composed of oxygen therapy duration and units of blood infusion could assess the trend of the screening process. Furthermore, the decision tree technique had the manual capability and there was no computer systems requirement. Consistent with our findings, Ray et al implemented machine learning to predict the incidence of ROP. Three class problems including No ROP, Regressed ROP and Progressed ROP were entered the study. Their results have indicated the Decision Tree model with the highest accuracy (83.26%) and the least false negative values could be used as a preferable screening ROP model (33). Other systems using the machine learning model could diagnose pre-plus and plus diseases based on ROP Imaging and Informatics data (34-37); however, with regard to ROP treatment-requiring, no significant correlation was observed. It is supposed that our designed models would be preferentially applied because of considering multiple risk factors affecting the development of ROP, as well as higher sensitivity and specificity rates.

Our study had some limitations. Machine learning algorithms require a huge number of samples for training (38, 39). However, collecting large number of samples is not always feasible. This is especially true in the medical field, when we are dealing with data on a rare disease or when, for whatever reason, limited samples are available (40, 41), besides, in this study, it was not possible to select a larger sample size due to the low prevalence of ROP in Iran.

Elimination of a few medical records that had missing data related risk factors and ROP prognosis, variance in diagnostic methods, and a single population were other limitations of the present study. These limitations could affect the accuracy of software algorithms and prediction ability. Further studies with larger sample size are strongly suggested.

5.1. Conclusion

The results of the present study have demonstrated that data mining techniques could be effectively implemented in ROP screening programs. Among the four techniques, the performance of the Naive Bayes model was the best regarding its sensitivity and specificity.

References

  • 1.

    Wang X, Tang K, Chen L, Cheng S, Xu H. Association between sepsis and retinopathy of prematurity: a systematic review and meta-analysis. BMJ Open. 2019;9(5). e025440. [PubMed ID: 31129577]. [PubMed Central ID: PMC6537987]. https://doi.org/10.1136/bmjopen-2018-025440.

  • 2.

    Kremer LJ, Reith DM, Medlicott N, Broadbent R. Systematic review of mydriatics used for screening of retinopathy in premature infants. BMJ Paediatr Open. 2019;3(1). e000448. [PubMed ID: 31206081]. [PubMed Central ID: PMC6542421]. https://doi.org/10.1136/bmjpo-2019-000448.

  • 3.

    Azami M, Jaafari Z, Rahmati S, Farahani AD, Badfar G. Prevalence and risk factors of retinopathy of prematurity in Iran: a systematic review and meta-analysis. BMC Ophthalmol. 2018;18(1):83. [PubMed ID: 29606108]. [PubMed Central ID: PMC5879798]. https://doi.org/10.1186/s12886-018-0732-3.

  • 4.

    Hellström A, Smith LEH, Dammann O. Retinopathy of prematurity. Lancet. 2013;382(9902):1445-57. https://doi.org/10.1016/s0140-6736(13)60178-6.

  • 5.

    Bashinsky AL. Retinopathy of prematurity. N C Med J. 2017;78(2):124-8. [PubMed ID: 28420777]. https://doi.org/10.18043/ncm.78.2.124.

  • 6.

    Fagerholm R, Vesti E. [Retinopathy of prematurity-from recognition of risk factors to treatment recommendations]. Duodecim. 2017;133(4):337-44. Finland. [PubMed ID: 29205980].

  • 7.

    Augsburger JJ, Bornfeld N, Yanoff M, Duker JS, St Louis. Ophtalmology. St. Louis, MO: Mosby; 2004. p. 1097-102.

  • 8.

    Fierson WM. Screening examination of premature infants for retinopathy of prematurity. Pediatrics. 2018;142(6). [PubMed ID: 30478242]. https://doi.org/10.1542/peds.2018-3061.

  • 9.

    Iu LP, Lai CH, Fan MC, et al. Screening for retinopathy of prematurity and treatment outcome in a tertiary hospital in Hong Kong. Hong Kong Med J. 2017;23(1):41-7. [PubMed ID: 28035052]. https://doi.org/10.12809/hkmj154811.

  • 10.

    Lee SK, Normand C, McMillan D. Evidence for changing guidelines for routine screening for retinopathy of prematurity. Arch Pediatr Adolesc Med. 2001;155(3):387-95. [PubMed ID: 11231807]. https://doi.org/10.1001/archpedi.155.3.387.

  • 11.

    Zin A. The increasing problem of retinopathy of prematurity. Community Eye Health. 2001;14(40):58-9. [PubMed ID: 17491934]. [PubMed Central ID: PMC1705951].

  • 12.

    Obenshain MK. Application of data mining techniques to healthcare data. Infect Control Hosp Epidemiol. 2004;25(8):690-5. [PubMed ID: 15357163]. https://doi.org/10.1086/502460.

  • 13.

    Huang M, Chen M, Lee S. Integrating data mining with case-based reasoning for chronic diseases prognosis and diagnosis. Expert syst appl. 2007;32(3):856-67. https://doi.org/10.1016/j.eswa.2006.01.038.

  • 14.

    Ludwig CA, Chen TA, Hernandez-Boussard T, Moshfeghi AA, Moshfeghi DM. The epidemiology of retinopathy of prematurity in the United States. Ophthalmic Surg Lasers Imaging Retina. 2017;48(7):553-62. [PubMed ID: 28728176]. https://doi.org/10.3928/23258160-20170630-06.

  • 15.

    Lala–Gitteau E, Majzoub S, Saliba E, Pisella P–J. Epidemiology and risk factors for retinopathy of prematurity. Invest Ophthalmol Vis Sci. 2005;46(13):4099.

  • 16.

    Solans Perez de Larraya AM, Ortega Molina JM, Uberos Fernandez J, Gonzalez Ramirez AR, Garcia Serrano JL. Speed of retinal vascularization in retinopathy of prematurity: Risk and protective factors. Biomed Res Int. 2019;2019:2721578. [PubMed ID: 31231670]. [PubMed Central ID: PMC6507164]. https://doi.org/10.1155/2019/2721578.

  • 17.

    Kim SJ, Port AD, Swan R, Campbell JP, Chan RP, Chiang MF. Retinopathy of prematurity: A review of risk factors and their clinical significance. Surv Ophthalmol. 2018;63(5):618-37. [PubMed ID: 29679617]. [PubMed Central ID: PMC6089661]. https://doi.org/10.1016/j.survophthal.2018.04.002.

  • 18.

    Wongnophirun A, Khuwuthyakorn V, Tantiprabha W, Wiwatwongwana A. Association between severe retinopathy of prematurity and postnatal weight gain in very low-birthweight infants at Chiang Mai university hospital, Thailand. Paediatr Int Child Health. 2020;40(2):85-91. [PubMed ID: 31272307]. https://doi.org/10.1080/20469047.2019.1631588.

  • 19.

    Afarid M, Hosseini H, Abtahi B. Screening for retinopathy of prematurity in South of Iran. Middle East Afr J Ophthalmol. 2012;19(3):277-81. [PubMed ID: 22837619]. [PubMed Central ID: PMC3401795]. https://doi.org/10.4103/0974-9233.97922.

  • 20.

    Karkhaneh R, Riazi Esfahani M, Ghojehzadeh L, et al. [Incidence and risk factors of retinopathy of prematurity]. Bina J Ophthalmol. 2005;11(1):81-90. Persian.

  • 21.

    Ahmadpour-kacho M, Zahed Pasha Y, Rasoulinejad R, Pourdad P. Correlation between retinopathy of prematurity and clinical risk index for babies score. Tehran Uni Med J. 2014;72(6):404-11.

  • 22.

    Vyas J, Field D, Draper ES, et al. Severe retinopathy of prematurity and its association with different rates of survival in infants of less than 1251 g birth weight. Arch Dis Child Fetal Neonatal Ed. 2000;82(2):F145-9. [PubMed ID: 10685989]. [PubMed Central ID: PMC1721052]. https://doi.org/10.1136/fn.82.2.f145.

  • 23.

    Solarte CE, Awad AH, Wilson CM, Ells A. Plus disease: Why is it important in retinopathy of prematurity? Middle East Afr J Ophthalmol. 2010;17(2):148-55. [PubMed ID: 20616922]. [PubMed Central ID: PMC2892131]. https://doi.org/10.4103/0974-9233.63080.

  • 24.

    Chiang MF, Jiang L, Gelman R, Du YE, Flynn JT. Interexpert agreement of plus disease diagnosis in retinopathy of prematurity. Arch Ophthalmol. 2007;125(7):875-80. [PubMed ID: 17620564]. https://doi.org/10.1001/archopht.125.7.875.

  • 25.

    Daniel E, Quinn GE, Hildebrand PL, et al. Validated system for centralized grading of retinopathy of prematurity: Telemedicine approaches to evaluating acute-phase retinopathy of prematurity (e-rop) study. JAMA Ophthalmol. 2015;133(6):675-82. [PubMed ID: 25811772]. [PubMed Central ID: PMC4910817]. https://doi.org/10.1001/jamaophthalmol.2015.0460.

  • 26.

    Fleck BW, Williams C, Juszczak E. An international comparison of retinopathy of prematurity grading performance within the Benefits of Oxygen Saturation Targeting II trials. Eye (Lond). 2018;32(1):74-80. [PubMed ID: 28752837]. [PubMed Central ID: PMC5669461]. https://doi.org/10.1038/eye.2017.150.

  • 27.

    Gschliesser A, Stifter E, Neumayer T, et al. Inter-expert and intra-expert agreement on the diagnosis and treatment of retinopathy of prematurity. Am J Ophthalmol. 2015;160(3):553-560 e3. [PubMed ID: 26004406]. https://doi.org/10.1016/j.ajo.2015.05.016.

  • 28.

    Campbell JP, Kalpathy-Cramer J, Erdogmus, et al. Plus disease in retinopathy of prematurity: A continuous spectrum of vascular abnormality as a basis of diagnostic variability. Ophthalmology. 2016;123(11):2338-44. [PubMed ID: 27591053]. [PubMed Central ID: PMC5077639]. https://doi.org/10.1016/j.ophtha.2016.07.026.

  • 29.

    Hripcsak G, Heitjan DF. Measuring agreement in medical informatics reliability studies. J Biomed Inform. 2002;35(2):99-110. https://doi.org/10.1016/s1532-0464(02)00500-2.

  • 30.

    Van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9(11).

  • 31.

    Fortes Filho JB, Dill JC, Ishizaki A, et al. Score for neonatal acute physiology and perinatal extension ii as a predictor of retinopathy of prematurity: Study in 304 very-low-birth-weight preterm infants. Ophthalmologica. 2009;223(3):177-82. [PubMed ID: 19174615]. https://doi.org/10.1159/000197114.

  • 32.

    Zhang Y, Xin Y, Li Q, Ma J, Li S, Lv X, et al. Empirical study of seven data mining algorithms on different characteristics of datasets for biomedical classification applications. Bio med Eng Online. 2017;16(1):125. [PubMed ID: 29096638]. [PubMed Central ID: PMC5668968]. https://doi.org/10.1186/s12938-017-0416-x.

  • 33.

    Ray A, Kumar V, Ravindran B, et al. Machine learning to predict the incidence of retinopathy of prematurity. Association for the Advancement of Artificial Intelligence. 2008:300-5.

  • 34.

    Campbell JP, Ataer-Cansizoglu E, Bolon-Canedo V, et al. Expert diagnosis of plus disease in retinopathy of prematurity from computer-based image analysis. JAMA Ophthalmol. 2016;134(6):651-7. [PubMed ID: 27077667]. [PubMed Central ID: PMC5201139]. https://doi.org/10.1001/jamaophthalmol.2016.0611.

  • 35.

    Ataer-Cansizoglu E, Bolon-Canedo V, Campbell JP, et al. Computer-based image analysis for plus disease diagnosis in retinopathy of prematurity: Performance of the "i-rop" system and image features associated with expert diagnosis. Transl Vis Sci Technol. 2015;4(6):5. [PubMed ID: 26644965]. [PubMed Central ID: PMC4669635]. https://doi.org/10.1167/tvst.4.6.5.

  • 36.

    Wittenberg LA, Jonsson NJ, Chan RV, Chiang MF. Computer-based image analysis for plus disease diagnosis in retinopathy of prematurity. J Pediatr Ophthalmol Strabismus. 2012;49(1):11-9. quiz 10, 20. [PubMed ID: 21366159]. [PubMed Central ID: PMC4036800]. https://doi.org/10.3928/01913913-20110222-01.

  • 37.

    Wallace DK, Zhao Z, Freedman SF. A pilot study using "ROPtool" to quantify plus disease in retinopathy of prematurity. J AAPOS. 2007;11(4):381-7. [PubMed ID: 17532238]. https://doi.org/10.1016/j.jaapos.2007.04.008.

  • 38.

    Majumdar A, Singh R, Vatsa M. Face verification via class sparsity based supervised encoding. IEEE Trans Pattern Anal Mach Intell (TPAMI). 2016;39(6):1273-80. [PubMed ID: 27214891]. https://doi.org/10.1109/TPAMI.2016.2569436.

  • 39.

    Huang G, Liu Z, Van Der Maaten L, Weinberger KQ. Densely connected convolutional networks. IEEE CVPR. 2017:2261-9. https://doi.org/10.1109/cvpr.2017.243.

  • 40.

    Thomas RM, Bruin W, Zhutovsky P, van Wingen G. Dealing with missing data, small sample sizes, and heterogeneity in machine learning studies of brain disorders. Machine Learning. Elsevier; 2020. p. 249-66.

  • 41.

    Keshari R, Ghosh S, Chhabra S, Vatsa M, Singh R. Unravelling Small Sample Size Problems in the Deep Learning World. Sixth International Conference on Multimedia Big Data (BigMM). New Delhi, India. IEEE; 2020. p. 134-43.