Prediction of response to PEG-IFN plus RBV treatment based on viral and host factors using ANN model was the aim of this study. Hemoglobin was the predictive factor of SVR, IL-28b genotype was the predictive factor of relapse, and hemoglobin and IL-28b genotype were the predictive factors of non-response outcome. The ANN model was able to predict SVR, relapse and non-response outcomes with good accuracies.
The role of many factors in different responses of patients receiving chronic hepatitis C therapy was proved in previous investigations. Among these factors, younger age, female gender, absence of obesity, favorable genotype (genotypes 2 and 3 as opposed to genotypes 1 and 4), minimal or absence of fibrosis and milder hepatitis in case of liver histopathology, low baseline HCV RNA level (< 600 000 IU/mL) were associated with remarkable better response (
7-
9,
16-
21). In this study, there were significant differences between SVR, relapse and non-response groups with respect to hemoglobin, serum level of cholesterol and IL-28b genotype.
Using multivariate LR analysis, higher levels of hemoglobin were associated with increase in SVR rate, which is in agreement with shirakawa et al. results. They found higher pretreatment hemoglobin levels in SVR group compared to Non-SVR (
10). It may be against the theory that anemia induced by CHC drug therapy (exclusively due to RBV) can improve the treatment results and occasionally lead to SVR (
22). However, RBV dose reduction as a routine interventional method in such patients has been reported by dramatically lower SVR rates and prescription of erythropoiesis-stimulating agents has been shown to be a better approach to improve the general condition of patients and drug compliance (
23); therefore, reduced hemoglobin level state may be only an indicator of patient’s better corporal response to medication which increases the chance of SVR.
The level of total cholesterol in SVR group was higher than other groups, whereas it was not an independent predictive factor of treatment outcome. Harrison et al. reported in his retrospective study that elevated serum cholesterol levels have been associated with higher SVR rates through unknown mechanisms. However, increase in SVR rate can be due to statin use in patients with elevated cholesterol level and it needs further trials assessing potential advantages of statins as adjuvant therapy for CHC (
24).
In agreement with previous studies, IL-28b genotype is a strong predictor of treatment outcome in HCV patients. The global difference of alleles frequency can explain the ethnic variations in treatment response among different populations (
4,
25,
26). In the case of rs12979860 genotype, McCarthy et al. and sharafi et al. reported that patients carrying protective C-allele, had about 6-fold increase in SVR rate compared to CT and TT genotypes. According to our results, the C/C variant of the rs12979860 polymorphism was associated with an increased likelihood of SVR, whereas patients with TT genotype were more likely to be non-responders (
27-
29).
In former researches, patients who had undetectable HCV RNA at the end of therapy (48 weeks) considered to have SVR or named responders, and non-responders have been classified as patients whom HCV RNA counting did not suppress to undetectable at the end of treatment (
11,
12). Apart from these, in responders group, if HCV RNA becomes detectable again at week 24 after cessation of therapy, patient is considered to have relapsed. It is important to differentiate sustained virologic responders and relapsers, because relapsers may profit from longer courses of treatment or retreatment recommendations. Therefore, dividing the data into three SVR, relapse and non-response categories and using IL-28b SNPs polymorphism in the set of inputs made this study unique and validated the results.
In earlier studies, logistic regression (LR) models were mainly used as a non-invasive, technical method to predict treatment outcomes (
30-
32). On the other hand, in some articles the performance differences between two LR and ANN models were discussed in which ANN showed a significantly better performance (
12,
33). Considering all these cases, an ANN model was designed which is a non-linear statistical data modeling tool. ANN has the benefit of being able to learn non-linear interconnectivity of inputs and correlations between inputs and outputs by using a set of observations and put them into continuous functions to generate an accurate predictive model without the need of understanding the underlying relationships (
13,
14,
34).
Results and calculated performance parameters for each output category showed that designed ANN was able to develop an accurate, non-invasive and effective method, which can be applied on computer-based models for clinical purposes, receiving routine and inexpensive pretreatment clinical data of CHC infected patients and estimating the final response to treatment. The small number of entrance data (especially non-responders group) may be responsible for subsided accuracies and modeling could be extended using additional groups of data. This model should be validated in other populations before clinical implementation. By using such pretreatment predictive strategies in health and medical services, we can obviously reduce the number of patients who may undergo a course of treatment with potential side effects from which they would not drive a benefit. In conclusion, planning a predictive model based on simple and routine laboratory data, by utilizing the ANN, could clearly provide an estimation of how patients respond to PEG-IFN plus RBV therapy, which would be expected to be applied in interventional decision-making.