Abstract
Keywords
COX-2 inhibitors 1 4-Dihydropyridines 5-Oxo-1 4 5 6 7 8 hexahydroquinolines Multiple linear regression QSAR
Introduction
Non-steroidal anti-inflammatory drugs (NSAIDs) are a crucial category of medicine that used for the treatment of pain, fever, inflammation and arthritis-associated disorders. The main therapeutic mechanism of NSAIDs action arise from inhibition of biosynthesis of prostaglandins (PGs) by cyclooxygenase enzyme (COX). COX enzyme catalyzes the first step of the biosynthesis of PGG2 from arachidonic acid to generate PGH2. In the next step of enzyme catalysis PGH2 is converted to other prostaglandins and thromboxanes that are involved in diverse biological responses (1, 2). On the basis of crystal structures, the two isoforms, COX-1 and COX-2 have been identified. Cyclooxygenase-1 (COX-1) mainly associated with prostaglandin production in gastric mucosa and is constitutive but COX-2 is upregulated in response to inflammatory stimuli and involved in pathologic processes (3, 4). All classical NSAIDs are capable of inhibiting both COX-1 and COX-2. As a result, these drugs are associated with side effects such as gastric ulceration, bleeding and renal dysfunction (5). Accordingly, introducing the selective COX-2 inhibitor provides therapeutic benefit in inflammation without gastric ulceration. Within the last decades, various structural classes of selective COX-2 inhibitors were reported in the literature (6-12). In addition to their anti-inflammatory activity, selective COX-2 inhibitors are considered as attractive molecules for anti-cancer research and neurological disorders such as Parkinson and Alzheimer’s diseases (13-15). Thus, design and discovery of new COX-2 inhibitors without toxic side effects is required. For these reasons, in this project, as part of our ongoing program on discovering novel selective COX-2 inhibitors, we selected some 1,4-dihydropyridine and 5-oxo-1,4,5,6,7,8-hexahydroquinoline derivatives which have been synthesized and evaluated as selective COX-2 inhibitors in our laboratory for quantitative structure–activity relationship (QSAR) study (16, 17).
QSAR is an applicable approach in computer-aided drug design and development. The major role of QSAR study is correlating biological activities of chemicals to their structural features. Obtained validated QSAR models are utilized for quantitatively predicting the activities of candidate structures. Therefore, extra costs for drug design and discovery such as unnecessary synthesis processes and biological activity assays may be avoided (18-21).
In this study, we employed the stepwise (SW) selection method for the variable selection in the multiple linear regression (MLR) method to build an accurate quantitative relationship between the molecular structure and the COX-2 inhibitory activity of some 1,4-dihydropyridine and 5-oxo-1,4,5,6,7,8-hexahydroquinoline derivatives. The findings can provide helpful information for designing new biologically active molecules.
Experimental
Data set
Two series of potent 1, 4-dihydropyridine and5-oxo-1,4,5,6,7,8-hexahydroquinoline derivatives (21 compounds) which have been synthesized and evaluated as selective COX-2 inhibitors in our laboratory was taken for the study (16, 17). All the biological data expressed as IC50 were converted into pIC50 (-log IC50) values. The total set of molecules was randomly separated into a training set (17 compounds) for generating QSAR model and a test set (4 compounds) for validating the quality of the model. The general chemical structures and biological activity values of all of the compounds are shown in Table 1.
Molecular descriptors and geometry optimization
The 2D chemical structures of the molecules were built using the HyperChem 8.0 software (version 8.0; HyperChem, Alberta, Canada). The pre-optimization was conducted using the molecular mechanics force field (MM+) procedure included in HyperChem, and then semi-empirical method AM1 using the Polak-Ribiere algorithm until the root mean square gradientwas 0.001 Kcal.mol-1 was applied to optimize the molecules geometry. DRAGON software was used to calculate the descriptors and finally 1497 molecular descriptors such as constitutionaldescriptors, topological descriptors, molecular walk counts, BCUT descriptors, Galves topological charge indices, 2D autocorrelations, charge descriptors, aromaticity indices, Randic molecular profiles, geometrical descriptors, 3D-MoRSE descriptors, WHIM descriptors, GETAWAY descriptors, empirical descriptors was extracted (22).
The calculated descriptors were first analyzed for the existence of constant or near constant variables. The detected ones were then removed. Secondly, the descriptors correlation with each other and with the activity (pIC50) of the molecules was examined and the collinear descriptors (i.e. correlation coefficient between descriptors is greater than 0.9) were detected. Among the collinear descriptors, the one exhibiting the highest correlation with the activity was retained and others were removed from the data matrix and finally 366 descriptors were remained.
Results
For the selection of the most important descriptors, stepwise method-based MLR was used. According to rule of thumb, at least five compounds should be included in the equation for every descriptor. To investigate the optimum number of descriptors to be used in the equation, a graph between numbers of descriptors against statistical parameters (R2 and Standard Error of Estimate (SEE)) was plotted (Figure 1). Figure 1 shows that R2 increased with the increasing number of descriptors. However, the values of SEE decreased with the increasing number of descriptors. As can be seen, R2 and SEE remain almost parallel to the number of descriptors after three parameters and higher order models. This shows that the most suitable models are three parametric models.
The MLR analysis with a stepwise selection was carried out to relate the pIC50 to a three set of descriptors. The SPSS software (version 21.0; SPSS Inc., Chicago, IL, USA) was employed for the MLR analysis). It is described by the following equation:
pIC50 = 9.370 (± 3.76) – 7.397 (± 0.407) BEHm6 – 0.208 (± 0.19) Mor03u + 8.794 (± 1.685) IVDE
The built model produced good results for the training set and the test set (Tables 1 and 2).
It can be seen that the SW-MLR equation has acceptable quality and can predict the activity of train and test set with R2 = 0.972 and R2 = 0.531, respectively. The plots of the predicted pIC50 versus the experimental pIC50, obtained by the SW-MLR modeling, are demonstrated in
The cross-validation analysis was performed using leave-one-out (LOO) method in which one compound is removed from the data set and the activity is correlated using the rest of the data set. The cross-validated R2 (Q2) was found to be very close to the value of R2 for the train set and hence these models can be termed as statistically significant. The obtained statistical parameter of the leave-one-out cross-validation test (Q2) on SW-MLR model was 0.943 and RMSE of cross-validation was 0.13, which indicates reliability of the proposed model.
The selected variables of SW-MLR model are shown in Table 3.
Chemical structures and the corresponding observed and predicted pIC50 values by SW-MLR method.
Statistical parameters of SW-MLR model
Training Set | Test Set | F (3, 13) | Q2LOO | RMSE Q2LOO | |
---|---|---|---|---|---|
SEE | R2 | R2 | 151.3 | 0.94 | 0.13 |
0.12 | 0.97 | 0.531 |
The descriptor values were used in model construction
Name | BEHm6 | Mor03u | IVDE |
---|---|---|---|
1 | 3.085 | -6.333 | 2.102 |
2 | 3.07 | -6.379 | 2.102 |
3 | 3.167 | -8.315 | 2.09 |
4 | 3.202 | -8.583 | 2.122 |
5 | 3.128 | -9.681 | 2.09 |
6 | 3.215 | -10.729 | 2.057 |
7 | 3.164 | -9.581 | 2.1 |
8 | 3.083 | -7.199 | 2.043 |
9 | 3.083 | -7.83 | 2.083 |
10 | 3.083 | -8.38 | 2.081 |
11 | 3.083 | -7.66 | 2.083 |
12* | 3.174 | -8.569 | 2.083 |
13 | 3.294 | -8.497 | 2.083 |
14* | 3.167 | -4.367 | 2.07 |
15* | 3.115 | -7.492 | 2.043 |
16 | 3.146 | -7.588 | 2.083 |
17* | 3.155 | -7.806 | 2.081 |
18 | 3.133 | -6.644 | 2.083 |
19 | 3.221 | -5.683 | 2.083 |
20 | 3.295 | -7.389 | 2.083 |
21 | 3.213 | -4.398 | 2.07 |
Correlation coefficient matrix of the selected descriptors by SW-MLR
BEHm6 | Mor03u | IVDE | |
---|---|---|---|
BEHm6 | 1.00 | -0.06 | 0.05 |
Mor03u | 1.00 | -0.11 | |
IVDE | 1.00 |
R2 and Q2LOO values of SW-MLR after several Y-randomization test
Iteration | SW-MLR | |
---|---|---|
R2 | Q2LOO | |
1 | 0.49 | -0.08 |
2 | 0.09 | -0.96 |
3 | 0.31 | -0.39 |
4 | 0.08 | -0.76 |
5 | 0.15 | -0.49 |
6 | 0.08 | -0.52 |
7 | 0.05 | -0.96 |
8 | 0.12 | -0.70 |
9 | 0.09 | -0.76 |
10 | 0.42 | 0.10 |
Details of name of the descriptors were used in model construction
Descriptor name | Explanation | Descriptor family |
---|---|---|
BEHm6 | highest eigenvalue n. 6 of Burden matrix/weighted by atomic masses | Molecular descriptors |
Mor03u | Signal 03/unweighted | 3D MORSE descriptors |
IVDE | Mean information content on the vertex degree equality | Information indices |
Collinearity is a major disadvantage in MLR analysis methods. Thus, the inter-correlation between the three selected descriptors in SW-MLR models was calculated. Results from Table 4 indicated that the absolute correlation coefficient value of each pair descriptors was less than 0.11. Therefore, selected descriptors by stepwise method were completely independent.
The robustness of the resulting model was further validated by applying Y-randomization test. Several random shuffles were performed on dependent variable (pIC50) and new QSAR model was built. The low R2 and Q2LOO values show that the good results in obtained model regarded as reasonable and was not because of a chance correlation (Table 5).
After internal and external validation of the model, it cannot be claimed that this QSAR model is reliable for screening new compounds unless its domain of application is determined. The leverage along with the Williams plot is usually used to define applicability domain of a model. The Williams plot defines as the plot of the standardized residuals versus the leverage (h). In this plot, two horizontal lines and one vertical line mark a safety area. Compounds with standard residuals > 3 standard deviation units and leverage higher than the warning h* are regarded as outliers. The leverage (hi) of every compound is calculated by following equation:
hi = xi (XTX)-1 xiT
In this equation, xi is the descriptor-row vector of the query molecule and X is the k × n matrix containing the k descriptor values for each one of the n training molecules. The critical leverage h* (the vertical line) is fixed at 3(k + 1)/n (23, 24). From the Williams plot (Figure 3), it is obvious that all data points fall within the safety zone in both models. In addition, all compounds have the leverage lower than the warning h* value of 0.70. As a result, it can be said that the model is acceptable for prediction purpose.
Discussion
QSAR results can provide useful chemical visions for designing new compounds. For this purpose, interpretation of the descriptors appeared in the resulting models was discussed below. The interpretation of the descriptors that appeared in SW-MLR model was extracted from Handbook of Molecular Descriptors (25-27). The chemical meaning of selected descriptors is also displayed in Table 6.
The relative significance of the descriptors presented in the QSAR model was determined based on their standardized regression coefficients. The calculated MLR coefficients cannot be used because the descriptors in final MLR model have not the same units. Standardized regression coefficients of selected descriptors in SW-MLR model are shown graphically in Figure 4.
BEHm6 (highest eigenvalue n. 6 of Burden matrix/weighted by atomic masses) is one of the BCUT descriptors which has appeared in the SW-MLR model. BCUT is the eigenvalue based descriptor noted for its utility in chemical diversity. The descriptor is based on a weighted version of the Burden matrix which takes into account both the connectivity as well as atomic properties of a molecule. The weights are a variety of atom properties placed along the diagonal of the Burden matrix. This descriptor displays a main negative sign, which indicates that the pIC50 is directly related to molecules atomic masses. Therefore, increasing the BEHm6 descriptor leads to decrease in its pIC50 value.
Mor03u is one of 3-D MoRSE discriptors that defines as molecular descriptors calculated by summing atom weights viewed by a different angular scattering function. These descriptors are based on the idea of obtaining information from the 3D atomic coordinates by the transform used in electron diffraction studies for preparing theoretical scattering curves. This descriptor has a negative effect on the inhibitory activity of analogs.
IVDE is one of information indices that defines as molecular descriptors calculated as information content of molecules, based on the calculation of equivalence classes from the molecular graph. Since it presented a positive sign in MLR equation, increasing in value of this descriptor will cause to increase of the activity (pIC50).
Summarizing, it is concluded that atomic masses, mean information content on the vertex degree equality and molecules shapes had important effects on the pIC50 of the compounds studied.
Conclusion
In this study, SW-MLR was used to develop linear QSAR model for prediction of COX-2 inhibitory activity of 4-dihydropyridine (DHP) and 5-oxo-1,4,5,6,7,8-hexahydroquinoline derivatives. The built model displayed good correlations between the structure and activity of the studied compounds. The model was validated using LOO cross-validation, Y-randomization and external test set. The built model had a good self- and external-predictive power. Calculated applicability domain of model showed that obtained model was acceptable for prediction purpose. Based on QSAR models results, BEHm6 (highest eigenvalue n. 6 of Burden matrix/weighted by atomic masses), Mor03u (signal 03/unweighted) and IVDE (mean information content on the vertex degree equality) were found to be important factors controlling the COX-2 inhibitory activity.
References
-
1.
Henry DA. Side-effects of non-steroidal anti-inflammatory drugs. Bailliere Clin. Rheum. 1998;2:425-54.
-
2.
Raskin JB. Gastrointestinal effects of nonsteroidal anti-inflammatory therapy. Am. J. Med. 1999;106:4S-12S.
-
3.
Smith WL, DeWitt DL, Garavito RM. Cyclooxygenases: structural, cellular, and molecular biology. Annu. Rev. Biochem. 2000;69:145-82. [PubMed ID: 10966456].
-
4.
Abolhasani H, Dastmalchi S, Hamzeh-Mivehroud M, Daraei B, Zarghi A. Design, synthesis and biological evaluation of new tricyclic spiroisoxazoline derivatives as selective COX-2 inhibitors and study of their COX-2 binding modes via docking studies. Med. Chem. Res. 2016;25:858-69.
-
5.
Wallace JL. How do NSAIDs cause ulcer disease? Baillieres Best Pract. Res. Clin. Gastroenterol. 2000;14:147-59.
-
6.
Zarghi A, Arfaee S, Rao PNP, Knaus E. Design, synthesis, and biological evaluation of 1,3-diarylprop-2-en-1-ones: a novel class of cyclooxygenase-2 inhibitors. Bioorg. Med. Chem. 2006;14:2600-5. [PubMed ID: 16356730].
-
7.
Zarghi A, Zebardast T, Daraei B, Hedayati M. Design and synthesis of new 1,3-benzthiazinan-4-one derivatives as selective cyclooxygenase (COX-2) inhibitors. Bioorg. Med. Chem. 2009;17:5369-73. [PubMed ID: 19596198].
-
8.
Ghodsi R, Zarghi A, Daraei B, Hedayati M. Design, synthesis and biological evaluation of new 2,3-diarylquinoline derivatives as selective cyclooxygenase-2 inhibitors. Bioorg. Med. Chem. 2010;18:1029-33. [PubMed ID: 20061161].
-
9.
Arfaie S, Zarghi A. Design, synthesis and biological evaluation of new (E) - and (Z)-1,2,3-triaryl-2-propen-1-ones as selective COX-2 inhibitors. Eur. J. Med. Chem. 2010;45:4013-7. [PubMed ID: 20691338].
-
10.
Zarghi A, Arfaei S. Selective COX-2 inhibitors: a review of their structure-activity relationships. Iran. J. Pharm. Res. 2011;10:655-83. [PubMed ID: 24250402].
-
11.
Unsal-Tan O, Ozadali K, Piskin K, Balkan A. Molecular modeling, synthesis and screening of some new 4-thiazolidinone derivatives with promising selective COX-2 inhibitory activity. Eur. J. Med. Chem. 2012;57:59-64. [PubMed ID: 23047224].
-
12.
Ghodsi R, Azizi E, Ferlin MG, Pezzi V, Zarghi A. Design, synthesis and biological evaluation of 4-(Imidazolylmethyl)-2-aryl-quinoline derivatives as aromatase inhibitors and anti-breast cancer agents. Lett. Drug. Des. Discov. 2016;13:89-97.
-
13.
Song X, Lin HP, Johnson AJ, Tseng PH, Yang YT, Kulp SK, Chen CS. Cyclooxygenase-2, player or spectator in cyclooxygenase-2 inhibitor-induced apoptosis in prostate cancer cells. J. Natl. Cancer Inst. 2002;94:585-91. [PubMed ID: 11959891].
-
14.
Feng ZH, Wang TG, Li DD, Fung P, Wilson B, Liu B, Ali SF, Langenbach R, Hong JS. Cyclooxygenase-2-deficient mice are resistant to 1-methyl-4-phenyl1, 2, 3, 6-tetrahydropyridine-induced damage of dopaminergic neurons in the substantia nigra. Neurosci. Lett. 2002;329:354-8. [PubMed ID: 12183047].
-
15.
Xiang Z, Ho L, Valdellon J, Borchelt D, Kelley K, Spielman L, Aisen PS, Pasinetti GM. Cyclooxygenase (COX)-2 and cell cycle activity in a transgenicmouse model of Alzheimer’s disease neuropathology. Neurobiol. Aging. 2002;23:327-34. [PubMed ID: 11959394].
-
16.
Zarghi A, Sabakhi I, Topuzyan V, Hajimahdi Z, Daraie B. Design, synthesis and biological evaluation of 5-oxo-1,4,5,6,7,8 hexahydroquinoline derivatives as selective cyclooxygenase-2 inhibitors. Iran. J. Pharm. Res. 2014;13(supplement):61-9. [PubMed ID: 24711830].
-
17.
Sabakhi I, Topuzyan V, Hajimahdi Z, Daraie B, Arefi H, Zarghi A. Design, synthesis and biological evaluation of new 1, 4-dihydropyridine (DHP) derivatives as selective cyclooxygenase-2 inhibitors. Iran. J. Pharm. Res. 2015;14:1087-93. [PubMed ID: 26664375].
-
18.
Irannejad H, Kebriaieezadeh A, Zarghi A, Montazer-Sadegh F, Shafiee A, Assadieskandar A, Amini M. Synthesis, docking simulation, biological evaluations and 3D-QSAR study of 5-aryl-6-(4-methylsulfonyl)-3-(metylthio)-1, 2, 4-triazine as selective cyclooxygenase-2 inhibitors. Bioorg. Med. Chem. 2014;22:865-73. [PubMed ID: 24361187].
-
19.
Hajimahdi Z, Ranjbar A, Suratgar AA, Zarghi A. QSAR study on anti-HIV-1 activity of 4-oxo-1,4-dihydroquinoline and 4-oxo-4H-pyrido[1,2-a]pyrimidine derivatives using SW-MLR, artificial neural network and filtering methods. Iran. J. Pharm. Res. 2015;14(Supplement):69-75. [PubMed ID: 26185507].
-
20.
Soltani S, Abolhasani H, Zarghi A, Jouyban A. QSAR analysis of diaryl COX-2 inhibitors: comparison of feature selection and train-test data selection methods. Eur. J. Med. Chem. 2010;45:2753-60. [PubMed ID: 20332057].
-
21.
Hajimahdi Z, Safizadeh F, Zarghi A. QSAR analysis for some 1, 2-benzisothiazol-3-one derivatives as caspase-3 inhibitors by stepwise MLR method. Iran. J. Pharm. Res. 2016;15:439-48. [PubMed ID: 27642314].
-
22.
-
23.
Gramatica P. Principles of QSAR models validation: internal and external. QSAR Comb. Sci. 2007;26:694-701.
-
24.
Wu W, Zhang C, Lin W, Chen Q, Guo X, Qian Y, Zhang L. Quantitative structure-property relationship (QSPR) modeling of drug-loaded polymeric micelles via genetic function approximation. PLoS ONE. 2015;10:e0119575. [PubMed ID: 25780923].
-
25.
Todeschini R, Consonni V. Handbook of molecular descriptors. Weinheim: Wiley-VCH; 2008.
-
26.
Todeschini R, Consonni V. Handbook of Molecular Descriptors. Weinheim: Wiley-VCH; 2002.
-
27.
Todeschini R, Consonni V. Molecular descriptors for chemoinformatics (2 volumes). Weinheim: Wiley-VCH; 2009.