A Novel Classification Method Based on Multilayer Perceptron-Artificial Neural Network Technique for Diagnosis of Chronic Kidney Disease

authors:

avatar Ali Sharifi 1 , avatar Kamal Alizadeh ORCID 1 , *

Department of Chemistry, Lorestan University, Khorramabad, Iran

how to cite: Sharifi A, Alizadeh K. A Novel Classification Method Based on Multilayer Perceptron-Artificial Neural Network Technique for Diagnosis of Chronic Kidney Disease. Ann Mil Health Sci Res. 2020;18(1):e101585. https://doi.org/10.5812/amh.101585.

Abstract

Background:

Chronic kidney disease is one of the most common diseases. The early diagnosis of this disease will reduce the length of treatment and decrease high medical costs. In recent years, the use of computer techniques in data mining and intelligent algorithms has accelerated the early diagnosis of this disease. One of the intelligent methods to diagnose this disease is artificial intelligence networks.

Objectives:

This study aimed to investigate the diagnosis of chronic kidney disease using an artificial intelligence network based on the multilayer perceptron method.

Methods:

The data of laboratory samples were collected from 140 healthy people and patients with chronic kidney disease. After preprocessing and normalization, the data were given to a multilayer perceptron and the accuracy of disease diagnosis was evaluated. All analyses were performed using MATLAB software.

Results:

The simulation showed a 98% accuracy of diagnosis using the proposed model.

Conclusions:

The results of real data suggested that the proposed system was more effective and faster than other methods in the diagnosis of acute kidney disease and it can be used as a physician assistant tool in clinical practice. In addition, it can be a cost-effective method for patients.

1. Background

All disorders heterogeneously affecting the function and structure of the kidneys are referred to as chronic kidney disease (CKD). The physical symptoms of CKD such as reduced appetite and feeling unwell are not specific. The lack of symptoms in the early stages of the disease means that, without monitoring, CKD can easily go undetected, leading to progressive damage and loss of kidney function. This can result in the development of diseases that can be associated with complications such as cardiovascular diseases (CVD). Therefore, clinically, urine or blood tests are needed for determining specific biomarkers like creatinine, albuminuria, urea, or more specifically cystatin C (1-5). Also, that CKD is a common disease seen in 16.8% of the U.S. population aged ≥ 20 years, thus affecting the public, makes it very important to diagnose this disease in early stages (6).

As human health is being addressed in human medical research, the correct prediction of the results becomes more important. Thus, those methods should be used that have the least error and the highest certainty. Among the methods attracting the attention of many researchers are the chemometric methods and artificial neural networks. Chemometrics is the science of employing computer and mathematical methods to draw critical information from chemical systems by data-driven means (7). Pattern recognition is one of the main applications of chemometrics. Pattern recognition based on different chemometric methods has been applied in metabolomics (8), diagnosis (9), and classification (10).

Artificial neural networks (ANNs) are computer systems mimicking the human brain structure and behavior. They gather knowledge by recognizing the complex patterns of learning through experience (11) while adjusting ANN parameters by a process of minimizing errors. Any kind of input data like the gene expression profiles generated from cDNA microarrays can be employed to calibrate ANNs. The output is grouped based on the number of categories. Today, ANNs are employed in clinical practice including for the diagnosis of myocardial infarction (12) and arrhythmias based on electrocardiogram criteria (13) and the interpretation of radiographs and magnetic resonance images (14, 15). Multilayer perceptron (MLP) refers to a neural network with clearly defined architecture and a rather simple learning algorithm.

The set of source nodes in MLP consists of the input layer, the hidden layer(s), and the output layer. The system complexity is determined by the number of layers and the number of neurons in a layer, affecting the structure of the optimal network. The structure of a typical three-layer ANN is shown in Figure 1.

Multilayer perceptron neural network with an input layer, hidden layer(s), and an output layer
Multilayer perceptron neural network with an input layer, hidden layer(s), and an output layer

Limited studies conducted in recent years have shown that different methods can be used in the diagnosis of acute kidney disease. The method of ANNs in CKD diagnosis was first raised in the study by Neves et al. According to their results, ANNs had a sensitivity of 93.1% - 94.9% and a specificity of 91.9% - 94.2% in the diagnosis of CKD (16). By using ANNs, Di Noia et al. developed a software program to classify end-stage kidney disease and showed that the instrument had a 91.37% accuracy, 70.76% sensitivity, and 70.76% positive predictive power (17). Polat et al. used the support vector machine classification algorithm for CKD diagnosis. Their results showed higher accuracy (98.5%) of the best-first search algorithm than those of other available methods for CKD diagnosis (18). A problem in the diagnosis of CKD is the limited number of studies conducted in recent years. Conducting further studies and increasing the accuracy of diagnosis can be an effective step in the early diagnosis of the disease.

2. Objectives

This study aimed at the early diagnosis of CKD by increasing the accuracy of diagnosis using the MLP method to reduce the length of treatment and decrease the high cost of the disease.

3. Methods

In this study, we attempted to diagnose CKD with simple parameters through chemometric methods using MLP. The neural network used in this study was a parameterized computational non-linear algorithm for CKD data processing. The system architecture is shown in Figure 2.

The system architecture
The system architecture

3.1. Sample Collection and Preparation

We used the dataset of CKD from the Central Laboratory of Islam Abad-e-Qarb city, Kermanshah, Iran. The dataset included blood and urine samples collected from 50 healthy people and 90 patients. Informed consent was obtained from all subjects before enrolment in the study. Samples were kept at -20°C until use. For each person, we gathered the data of the concentrations of glucose (GLU), urea, creatinine, sodium (Na), potassium (K), calcium (Ca), phosphorus (P), cholesterol (Chol), triglycerides (TG), alkaline phosphatase (Alk), alanine aminotransferase (ALT), and aspartate aminotransferase (AST) to use as inputs to the ANN. The input, hidden, and output layers of the neural network for the diagnosis of CKD are shown in Table 1.

Table 1.

The Structure of Neural Network with Input, Hidden, and Output Layers for the Diagnosis of Chronic Kidney Disease

LayerStructure
Input layerFactorsGlucose (GLU)
Urea (Urea)
Creatinine (Creat)
Sodium (Na)
Potassium (K)
Calcium (Ca)
Phosphorus (P)
Cholesterol (Chol)
Triglycerides (TG)
Alkaline phosphatase (Alk)
Alanine aminotransferase (ALT)
Aspartate aminotransferase (AST)
Hidden layer(s)Number of hidden layers2
Output layerDependent variablesDisplacement
Units1
Rescaling the scale dependentsStandardized
Activation functionIdentity
Error functionSum of square

3.2. Data Cleaning and Pre-Processing

Before evaluation by the neural network, the training data were preprocessed. There are different approaches for this purpose. Since the logistic function is the most common transference function, the data were scaled normally to lie within the intervals [0, 1]. To reinforce the performance of network classification, cases with some missing data were excluded. The performance of classification is decreased if there is a database imbalance (different numbers of cases for each class) (19). Back-propagation is the most common training algorithm among others (20, 21). This algorithm is relied on two training parameters: learning rate and momentum. Learning is unstable and network generalization is poor if these parameters have higher values. The optimal values of the training parameters depend upon the complexity of the studied system. The momentum value is lower than the learning rate. The sum of the values should be approximately equal to unity.

3.3. Software

The ANNs in the current study were applied by classification toolbox 4.0 in MATLAB (MATLAB R2018a, The Mathworks Inc., Natick).

3.4. MLP-ANN Architecture

A multi-layer feed-forward neural network with four layers was utilized in the current study. To design the neural network architecture, the choice of activation is an important consideration based on the task and available data. Smooth linear activation function was applied to the input neuron while non-linear log sigmoid function was applied to hidden neurons. A constant bias was also added to the hidden and output layers (22). The designed network had one output neuron and it was formulated as a binary classification problem. Twelve inputs were chosen in the present study, which was appropriate for the prediction of CKD.

3.5. Training and Testing the ANN

The important phases of data processing, i.e., training and testing, were performed using the ANN technique. The back-propagation and gradient descent algorithms are the methods used to train or learn the designed neural networks to update the weights. Basic MATLAB was utilized for codification. The training method employed in the current study relies on the incremental mode of training (22). The data of 140 people including 90 CKD patients were utilized. The normalized inputs were fed into the network to train the ANN; for training the ANN the normalized inputs were fed into the network and the ANN was trained till the network reached sufficiently to small mean square error (MSE) error.

4. Results

In the current study, we designed and tested the ANN architecture with a hidden layer and different numbers of neurons. It had 12 input nodes and one output node and the problem was a binary classification. The output was either 0 or 1 where 0 indicated a healthy case and 1 stood for CKD. To train the network, the weights between hidden-output and input-hidden layers were randomly initialized with a small value ranging from 0 to 1. Figure 3 indicates the framework of ANN for the diagnosis of CKD. The input layer contained 12 neurons. In the hidden layers, there were 10 neurons. The output layer had only one neuron, representing CKD.

The framework of artificial neural network for the diagnosis of chronic kidney disease
The framework of artificial neural network for the diagnosis of chronic kidney disease

5. Discussion

In Figure 4, the training data, validation data, and testing data are shown as blue, green, and red bars, respectively. The histogram can give you an indication of outliers, including data points which their fit is significantly worse than the majority of data. Likewise, when most errors fall from -0.1451 to 0.0998, the validation point errors are -0.1801 to 0.0998. The target is perfectly tracked by the output for validation, testing, and training, and the overall R-value is about 0.99. This means that the network has an accuracy of 99% for the CKD dataset. The training state values are shown in Figure 5. The best validation performance is 0.0084 at epoch 3.

Error histogram presentation of neural network
Error histogram presentation of neural network
Neural network performance analysis with epoch variation
Neural network performance analysis with epoch variation

A multilayer feed-forward network was created with 12 feature inputs, 10 sigmoid hidden neurons, and one linear output neuron. By enough neurons and consistent data in the hidden layer, the created network could perfectly fit multi-dimensional mapping arbitrary problems (Figure 6). To train the network, the Levenberg-Marquardt back-propagation algorithm was utilized. The results indicated that the ANN methodology can distinguish well between normal and abnormal persons and can learn the pattern. The results of the network simulating in the testing set were also very satisfactory, classifying 97.7% of the cases in the testing set.

Neural network regression analysis
Neural network regression analysis

The ANN in the present study was measured by the calculation of absolute error between the desired target and the network response and after presenting the testing pattern to the network. The result was also validated by calculating the correlation coefficient R that was close to one, indicating the good performance of the network (23).

Relying on resolving optimization tasks, intelligent data processing is a modern approach to classify problems using ANNs. Our results suggest that ANNs may also be used to create effective, highly accurate models for CKD prediction. The 12 factors analyzed by MLP-ANN types yielded with 98% accuracy. Hence, the type of MLP-ANN and the number of variable factors can determine the diagnostic accuracy of the tool. The CKD diagnosis software can be designed based on our experiences on the NN models for CKD prediction.

Table 2 compares the performance of the proposed method in comparison with other methods in the literature using these datasets. The comparison indicates that our method has the highest diagnosis accuracy. The proposed method is relatively less complicated for CKD diagnosis. Moreover, the method is simple and can be employed in most libraries.

Table 2.

The Accuracy of Diagnosis by Different Approaches Available in the Literature Compared to our Method

MethodClassification Accuracy %Ref.
Naïve Bayes89.04(24)
SVM87.98
SVM82.41(25)
SVM76.32(26)
ANN87.70
KNN78.75(27)
SVM73.75
IFSS80(28)
FuRES99.2(29)
SVM without feature selection97.75(18)
SVM with ClassifierSubsetEval and Greedy stepwise98
SVM with WrapperSubsetEval and Best-First98.25
SVM with CfsSubsetEval and Greedy stepwise98.25
SVM with FilterSubsetEval and Best-First98.50
MLP-ANN98This work

5.1. Conclusions

This study aimed to develop a simple model with reliable analytical properties to diagnose CKD using MLP ANNs. In this study, we used the dataset of CKD from the Central Laboratory of Islam Abad-e-Qarb city, Kermanshah, Iran. The dataset included blood and urine parameters collected from 50 healthy people and 90 CKD patients. The models including both genetic and non-genetic factors had the highest accuracy. Before evaluation by the neural network, pre-processing steps were performed with data normalization. Using MATLAB software, the neural network was coded with 12 feature inputs, 10 sigmoid hidden neurons, and one linear output neuron. Models with 98% accuracy can be used for the development of software programs for reducing the cost of diagnosis without using invasive procedures for CKD prediction and diagnosis.

Acknowledgements

References

  • 1.

    Pottel H, Hoste L, Martens F. New insights in glomerular filtration rate formulas and chronic kidney disease classification. Clin Chim Acta. 2010;411(17-18):1341-7. [PubMed ID: 20501328]. https://doi.org/10.1016/j.cca.2010.05.031.

  • 2.

    Carter JL, Parker CT, Stevens PE, Eaglestone G, Knight S, Farmer CK, et al. Biological Variation of Plasma and Urinary Markers of Acute Kidney Injury in Patients with Chronic Kidney Disease. Clin Chem. 2016;62(6):876-83. [PubMed ID: 27026288]. https://doi.org/10.1373/clinchem.2015.250993.

  • 3.

    Marsik C, Endler G, Gulesserian T, Wagner OF, Sunder-Plassmann G. Classification of chronic kidney disease by estimated glomerular filtration rate. Eur J Clin Invest. 2008;38(4):253-9. [PubMed ID: 18339005]. https://doi.org/10.1111/j.1365-2362.2008.01934.x.

  • 4.

    Kong X, Ma X, Cui M, Xu D. Association of clustering of major cardiovascular risk factors with chronic kidney disease in the adult population. Clin Nephrol. 2014;82(2):92-7. [PubMed ID: 24887299]. https://doi.org/10.5414/CN108169.

  • 5.

    Peralta CA, Shlipak MG, Judd S, Cushman M, McClellan W, Zakai NA, et al. Detection of chronic kidney disease with creatinine, cystatin C, and urine albumin-to-creatinine ratio and association with progression to end-stage renal disease and mortality. JAMA. 2011;305(15):1545-52. [PubMed ID: 21482744]. [PubMed Central ID: PMC3697771]. https://doi.org/10.1001/jama.2011.468.

  • 6.

    Centers for Disease Control and Prevention (CDC). Prevalence of chronic kidney disease and associated risk factors--United States, 1999-2004. Morbidity and Mortality Weekly Report (MMWR). 2007;56(8):161.

  • 7.

    Miller JN, Miller JC. Statistics and chemometrics for analytical chemistry. 5th ed. Pearson: Harlow; 2005.

  • 8.

    Ramadan Z, Jacobs D, Grigorov M, Kochhar S. Metabolic profiling using principal component analysis, discriminant partial least squares, and genetic algorithms. Talanta. 2006;68(5):1683-91. [PubMed ID: 18970515]. https://doi.org/10.1016/j.talanta.2005.08.042.

  • 9.

    Lin X, Zhao Y, Song WM, Zhang B. Molecular classification and prediction in gastric cancer. Comput Struct Biotechnol J. 2015;13:448-58. [PubMed ID: 26380657]. [PubMed Central ID: PMC4556804]. https://doi.org/10.1016/j.csbj.2015.08.001.

  • 10.

    Perez Anton A, Del Nogal Sanchez M, Crisolino Pozas AP, Perez Pavon JL, Moreno Cordero B. Headspace-programmed temperature vaporizer-mass spectrometry and pattern recognition techniques for the analysis of volatiles in saliva samples. Talanta. 2016;160:21-7. [PubMed ID: 27591583]. https://doi.org/10.1016/j.talanta.2016.06.061.

  • 11.

    Bishop C. Neural Networks for Pattern Recognition. Oxford: Clarendon Press; 1995.

  • 12.

    Heden B, Ohlin H, Rittner R, Edenbrandt L. Acute myocardial infarction detected in the 12-lead ECG by artificial neural networks. Circulation. 1997;96(6):1798-802. [PubMed ID: 9323064]. https://doi.org/10.1161/01.cir.96.6.1798.

  • 13.

    Silipo R, Gori M, Taddei A, Varanini M, Marchesi C. Classification of arrhythmic events in ambulatory electrocardiogram, using artificial neural networks. Comput Biomed Res. 1995;28(4):305-18. [PubMed ID: 8549122]. https://doi.org/10.1006/cbmr.1995.1021.

  • 14.

    Ashizawa K, Ishida T, MacMahon H, Vyborny CJ, Katsuragawa S, Doi K. Artificial neural networks in chest radiography: application to the differential diagnosis of interstitial lung disease. Acad Radiol. 1999;6(1):2-9. [PubMed ID: 9891146]. https://doi.org/10.1016/s1076-6332(99)80055-5.

  • 15.

    Abdolmaleki P, Buadu LD, Murayama S, Murakami J, Hashiguchi N, Yabuuchi H, et al. Neural network analysis of breast cancer from MRI findings. Radiat Med. 1997;15(5):283-93. [PubMed ID: 9445150].

  • 16.

    Neves J, Martins MR, Vilhena J, Neves J, Gomes S, Abelha A, et al. A Soft Computing Approach to Kidney Diseases Evaluation. J Med Syst. 2015;39(10):131. [PubMed ID: 26310948]. https://doi.org/10.1007/s10916-015-0313-4.

  • 17.

    Di Noia T, Ostuni VC, Pesce F, Binetti G, Naso D, Schena FP, et al. An end stage kidney disease predictor based on an artificial neural networks ensemble. Expert Systems with Applications. 2013;40(11):4438-45. https://doi.org/10.1016/j.eswa.2013.01.046.

  • 18.

    Polat H, Danaei Mehr H, Cetin A. Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods. J Med Syst. 2017;41(4):55. [PubMed ID: 28243816]. https://doi.org/10.1007/s10916-017-0703-x.

  • 19.

    Mazurowski MA, Habas PA, Zurada JM, Lo JY, Baker JA, Tourassi GD. Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance. Neural Netw. 2008;21(2-3):427-36. [PubMed ID: 18272329]. [PubMed Central ID: PMC2346433]. https://doi.org/10.1016/j.neunet.2007.12.031.

  • 20.

    Morton H; Aleksander I. An introduction to neural computing. London: Int Thomson Comput Press; 1995.

  • 21.

    Ahmed FE. Artificial neural networks for diagnosis and survival prediction in colon cancer. Mol Cancer. 2005;4:29. [PubMed ID: 16083507]. [PubMed Central ID: PMC1208946]. https://doi.org/10.1186/1476-4598-4-29.

  • 22.

    Haykin S. Neural Networks- a comprehensive foundation. Prentice Hall India; 1999.

  • 23.

    Bt Jaafar SF, Mohd Ali D. Diabetes Mellitus Forecast Using Artificial Neural Network (ANN). Sensors and the International Conference on new Techniques in Pharmaceutical and Biomedical Research, 2005 Asian Conference on IEEE. 2005. p. 135-9.

  • 24.

    Akbarisanto R, Danar W, Purwarianti A. Analyzing bandung public mood using Twitter data. Fourth International Conference on Information and Communication Technologies (ICoICT). Bandung, Indonesia. 2016. p. 1-6.

  • 25.

    Jiang L, He Y, Zhang Y. Prediction of hepatotoxicity of traditional Chinese medicine compounds by support vector machine approach. The 8th International Conference on Systems Biology (ISB). Qingdao, China. 2014. p. 27-30.

  • 26.

    Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. International Journal of Computing and Business Research (IJCBR). 2015;6(2).

  • 27.

    Sinha P, Sinha P. Comparative Study of Chronic Kidney Disease Prediction using KNN and SVM. International Journal of Engineering Research and. 2015;V4(12). https://doi.org/10.17577/ijertv4is120622.

  • 28.

    Muthukumar P, Sai Sundara Krishnan G. A similarity measure of intuitionistic fuzzy soft sets and its application in medical diagnosis. Applied Soft Computing. 2016;41:148-56. https://doi.org/10.1016/j.asoc.2015.12.002.

  • 29.

    Chen Z, Zhang Z, Zhu R, Xiang Y, Harrington PB. Diagnosis of patients with chronic kidney disease by using two fuzzy classifiers. Chemometrics and Intelligent Laboratory Systems. 2016;153:140-5. https://doi.org/10.1016/j.chemolab.2016.03.004.

reza headary
2021-08-16 08:03:30
the best paper for MLP and ANN. thanks