QSAR Analysis for Some 1, 2-Benzisothiazol-3-one Derivatives as Caspase-3 Inhibitors by Stepwise MLR Method

authors:

avatar Zahra Hajimahdi , avatar Fatemeh Safizadeh , avatar Afshin Zarghi , *

Department of Medicinal Chemistry, School of Pharmacy, Shahid Beheshti University of Medical Sciences, Tehran, Iran.

How To Cite Hajimahdi Z, Safizadeh F, Zarghi A. QSAR Analysis for Some 1, 2-Benzisothiazol-3-one Derivatives as Caspase-3 Inhibitors by Stepwise MLR Method. Iran J Pharm Res. 2016;15(2):e125202. https://doi.org/10.22037/ijpr.2016.1855.

Abstract

Caspase-3 inhibitory activities of some 1, 2-benzisothiazol-3-one derivatives were modeled by quantitative structure–activity relationship (QSAR) using stepwise-multiple linear regression (SW-MLR) method. The built model was robust and predictive with correlation coefficient (R2) of 0.91 and 0.59 for training and test groups, respectively. The quality of the model was evaluated by leave-one out (LOO) cross validation (LOO correlation coefficient, Q2) of 0.80). The results indicate that the descriptors related to the electronegativity, the atomic masses, the atomic van der Waals volumes and R--CX--R Atom-centered fragments play a more significant role in caspase-3 inhibitory activity.

Introduction

Apoptosis or programmed cell death is vital in eukaryotic organisms (1). However, dysregulation of this process can cause many diseases in human such as autoimmune disorders, stroke, neurodegenerative diseases and cancer (2).

Caspases (Cystein-dependent aspartyl proteases) have been identified as the key enzymes in initiation and execution of apoptosis (3). Two different groups of enzymes from caspase family are involved in apoptosis. The first group including caspase 2, 8, 9 and 10 are upstream regulators and activate caspases of second group (3, 6 and 7), which are the major effectors caspases in apoptosis (4).

Caspase-3, one of the dominant effectors caspases, is activated in almost every model of apoptosis with various signaling pathways. Hence, inhibition of caspase-3 has become an attractive target in the treatment of neurodegenerative diseases including Alzheimer’s, Huntington’s and Parkinson’s diseases in which excessive neuronal apoptosis occurs (5-6).

Our strategy is to identify potent caspase-3 enzyme inhibitors and study the quantitative relationship between their inhibitory activities and structures. The results of this study can provide useful chemical visions for designing new capase-3 inhibitors. Quantitative structure–activity relationship (QSAR) studies play a critical role in the rational drug design. The main aim of QSAR study is to develop quantitative models to predict biological activity of compounds (7-8). Through the years different methods were used to build QSAR models capable of accurate prediction of biological activity of compounds (9-10). In this study, we employed the stepwise (SW) selection method for the variable selection in the multiple linear regression (MLR) method. The aim of this study is to search for an efficient method to build an accurate quantitative relationship between the molecular structure and the caspase-3 inhibitory activity of some 1, 2-benzisothiazol-3-one derivatives.

Methods and data

Data set

A series of potent 1, 2-benzisothiazol-3-one derivatives (53 compounds) with experimental biological activities, which were reported by Liu et al. and Wu et al., was taken for the study (11-12). All the biological data expressed as IC50 were converted into pIC50 (-log IC50) values. The total set of molecules was randomly separated into a training set (43 compounds) for generating QSAR model and a test set (10 compounds) for validating the quality of the model. The general chemical structures and biological activity values of all of the compounds are shown in Table 1.

Table 1

Chemical structures and the corresponding observed and predicted pIC50 values by SW-MLR method.

Molecular descriptors and geometry optimization

The chemical structures of the molecules were built using the Hyperchem 8.0 software (version 8.0; Hyperchem, Alberta, Canada) (13). The pre-optimization was conducted using the molecular mechanics force field (MM+) procedure included in Hyperchem, and then semi-empirical method AM1 using the Polak–Ribiere algorithm was applied to optimize the molecules geometry. DRAGON software was used to calculate the descriptors among a total of 1200 molecular descriptors, belonging to different types of theoretical descriptors such as constitutional descriptors, topological descriptors, molecular walk counts, BCUT descriptors, Galves topological charge indices, 2D autocorrelations, charge descriptors, aromaticity indices, Randic molecular profiles, geometrical descriptors, 3D-MoRSE descriptors, WHIM descriptors, GETAWAY descriptors, empirical descriptors (14). The calculated descriptors were first analyzed for the existence of constant or near constant variables. The detected ones were then removed. Secondly, the descriptors correlation with each other and with the activity (pIC50) was of the molecules was examined and the collinear descriptors (i.e. correlation coefficient between descriptors is greater than 0.9) were detected. Among the collinear descriptors, the one exhibiting the highest correlation with the activity was retained and others were removed from the data matrix. And finally 363 descriptors were remained.

Results

For the selection of the most important descriptors, stepwise method-based MLR was used. According to the rule of thumb, at least five compounds should be included in the equation for every descriptor. To investigate the optimum number of descriptors to be used in the equation, a graph between numbers of descriptors against statistical parameters (R2 and Standard Error of Estimate (SEE)) was plotted (Figure 1). Figure 1 shows that R2 increased with the increasing number of descriptors. However, the values of SEE decreased with the increasing number of descriptors. As can be seen, R2 and SEE remain almost parallel to the number of descriptors after nine parameters and higher order models. This shows that the most suitable models are nine parametric models.

Influences of the number of descriptors on the R2 and SEE of the regression model.

The MLR analysis with a stepwise selection was carried out to relate the pIC50 to a nine set of descriptors. The SPSS software (version 13.0; SPSS Inc., Chicago, IL, USA) (15) was employed for the MLR analysis). It is described by the following equation:

pIC50 = 4.30 (± 1.54) –13.56 (± 1.46) P2v–19.68 (±8.10) R7e+ – 8.86 (± 1.33) R2m+

– 12.71 (± 1.54) MATS1e –0.59 (± 0.10) C-026 + 4.25 (± 0.59) Mor28m – 0.32 (± 0.06) RDF125m

+ 0.27 (± 0.05) RDF115m + 19.10 (± 8.62) G2e

The built model produced the good results for the training set and the test set (Table 1 and 2).

Table 2

Statistical parameters of SW-MLR model.

Training setTest setFQ2LOO
SEER2R2
0.38 0.91 0.59 37.87 0.80

The obtained statistical parameter of the leave-one-out cross-validation test (Q2) on SW-MLR model was 0.80, which indicates reliability of the proposed model. The plots of the predicted pIC50versus the experimental pIC50, obtained by the SW-MLR modeling, are demonstrated in Figure 2.

The predicted pIC50values by the SW-MLR modeling versus the observed pIC50values.

The selected variables of SW-MLR model are shown in Table 3, and the correlation matrix of these descriptors visualized is shown in Table 4. From Table 4, it could be seen that the correlation coefficient value of each pair descriptors was less than 0.65, which meant that the selected descriptors were independent.

Table 3

The descriptor values were used in model construction.

No.P2vR7e+R2m+MATS1eC-026Mor28mRDF125mRDF115mG2e
10.32700.117-0.02500.314000.204
2a0.2210.0390.0970.05100.282000.193
30.1650.0410.0930.06300.253000.185
4 a0.140.0370.0860.06400.237000.183
50.1370.0220.0560.06300.472000.177
60.1080.0240.0570.0700.44700.1090.172
70.1160.0410.0760.0700.357000.172
8 a0.0880.0320.0780.02810.3730.0020.2180.167
90.1150.040.0990.01610.391000.172
100.0850.0290.0770.07600.2070.0050.2140.168
110.0930.0250.043-0.02710.3560.0190.2190.171
12 a0.0910.0430.067-0.04320.40.0090.1120.165
130.1150.0290.075-0.04320.360.1111.9470.165
140.0650.0430.076-0.04320.2642.481.6210.165
150.1190.0360.236-0.04120.5430.0360.180.171
160.1020.0230.249-0.04120.4720.4541.1550.171
170.0790.0250.217-0.04120.4860.0290.2160.171
180.0960.0420.055-0.03920.3040.0410.1080.171
190.0940.0250.064-0.03920.3780.1441.7810.171
20 a0.0920.0250.058-0.03920.3282.0451.530.171
210.1230.0230.0990.13120.7550.0120.1550.168
220.0890.040.292-0.04320.5121.0662.7760.167
230.0850.0360.059-0.01600.3530.0050.2940.167
240.0970.0260.047-0.00700.46100.120.163
250.1060.0260.044-0.03210.4840.0421.1560.162
260.0880.0250.048-0.03210.460.0620.5290.162
270.0820.0270.06-0.03210.3731.4864.3050.162
280.10.0290.071-0.0310.3930.0070.2270.167
290.090.0480.101-0.0310.3190.0010.1770.167
30 a0.1020.0320.103-0.0310.510.0931.2850.167
31 a0.150.0280.054-0.00700.418000.163
320.0710.0160.07-0.03900.4170.7330.1490.168
330.0910.0270.087-0.03900.3840.010.1850.168
340.0550.0390.088-0.02600.351.3151.2620.18
350.10.0350.083-0.0600.30100.0360.171
360.0560.0270.118-0.02400.4790.9333.5080.167
370.0690.0190.0740.00100.4082.1711.2980.174
38 a0.2010.0290.066-0.03210.6030.0120.1660.156
39 a0.0960.0220.054-0.03210.6051.9235.2380.156
400.0550.020.062-0.03210.6141.9612.7840.169
410.2810.020.06-0.01110.6290.0220.20.153
420.1210.020.054-0.01110.6331.3571.6230.153
430.0460.0170.062-0.01110.5952.8072.7180.153
440.0510.0190.062-0.00110.4571.9183.1530.155
450.0540.0180.061-0.02110.4792.7463.9030.16
460.1080.0170.055-0.02300.4961.8792.3580.154
470.0830.0150.046-0.01510.5465.4441.8370.157
480.0540.0210.0990.0320.623.12.2970.154
490.0670.0240.080.05520.693.8576.2990.161
500.1080.0210.075-0.02310.5563.9592.3020.165
51 a0.080.0210.0730.00610.3281.9616.4490.153
520.090.0210.071-0.01310.1751.1414.810.157
530.1020.0230.079-0.01710.160.6752.2420.158
Table 4

Correlation coefficient matrix of the selected descriptors by SW-MLR.

P2vR7e+R2m+MATS1eC-026Mor28mRDF125mRDF115mG2e
P2v1-0.130.030.15-0.22-0.05-0.40-0.400.44
R7e+10.200.040.09-0.36-0.39-0.330.20
R2m+1-0.140.350.09-0.15-0.050.24
MATS1e1-0.260.06-0.08-0.120.21
C-02610.310.210.26-0.31
Mor28m10.350.20-0.41
RDF125m10.65-0.41
RDF115m1-0.50
G2e1

Discussion

QSAR results can provide useful chemical visions for designing new compounds. For this purpose, interpretation of the descriptors appeared in the resulting models was discussed below (16). The chemical meanings of selected descriptors are also displayed in Table 5.

P2v is one of the WHIM descriptors which has appeared in the SW-MLR model. WHIM descriptors are molecular descriptors based on the projections of the atoms along principal axes. WHIM descriptors are built in such a way as to capture relevant molecular 3D information regarding molecular size, shape, and symmetry and atom distribution with respect to invariant reference frames. The property in this case is van der Waals volume. This descriptor has a significant negative effect on the inhibitory activity of analogs. G2e is another WHIM descriptor in this model that has a negative influence on PIC50. The negative sign suggests that the PIC50 value is inversely related to this descriptor.

From the nine selected descriptors, three of them belong to the 2D autocorrelation descriptors (R7e +, R2m + and MATS1e). In 2D autocorrelation descriptors, the molecule atoms represent a set of discrete points in space, and the atomic property and function are evaluated at those points. The symbol for each of the autocorrelation descriptors is followed by two indices d and w; where d stands for the lag and w stands for the weight. The lag is defined as the topological distance d between pairs of atoms. The weight can be m (relative atomic mass), p (polarizability), e (Sanderson electronegativity) and v (Vander Waals volume). The physico-chemical properties (weights) for R2m+, R7e+ and MATS1e are atomic mass and Sandersonn electronegativity, respectively. Figure 4 displays that these three descriptors have negative effects on caspase-3 inhibitory activity, which indicates that pIC50 is inversely related to atomic Sanderson electronegativities and atomic mass.

The seventh and eighth descriptors are RDF115m and RDF125m, which belong to the RDF descriptors. The RDF in these forms meets all the requirements for the 3D structure descriptors. It is independent of the atom number (i.e., the size of a molecule), it is unique regarding the 3D arrangement of the atoms, and it is invariant against the translation and rotation of the entire molecule. The RDF descriptors are based on the distance distribution in the molecule. The RDF of an ensemble of n atoms can be interpreted as the probability distribution of finding an atom in a spherical volume of radius R. RDF115m and RDF125m descriptors play a main role in analogs activities. RDF115m and RDF125m have positive and negative influence on PIC50, respectively.

Mor28m is one of the 3D-MoRSE descriptors. 3D Molecule Representation of Structures based on Electron diffraction (3D MoRSE) descriptors is derived from infrared spectra simulation using a generalized scattering function. This descriptor was proposed as signal 22⁄weighted by atomic masses, which relates to masses of the molecules .

C-026 is one of the Molecular descriptors that are based on the counting of 120 atom-centered fragments. Atom-centered fragments are those defined by Ghose and Crippen (17-18). Each atom type is an atom in the molecule described by its neighboring atoms. Hydrogen and halogen atoms are classified by the hybridization and oxidation state of the carbon atom to which they are bonded. Carbon atoms are classified by their hybridization state and depending on whether their neighbors are carbon or heteroatoms. C-026 is defined as R--CX--R Atom-centered fragments which R represents any group linked through carbon; X represents any electronegative atom (O, N, S, P, Se, halogens) and -- represents an aromatic bond as in benzene. C-026 has negative effect on pIC50 .

In summary, it is concluded that atomic masses, atomic Sanderson electronegativities, atomic van der Waals volumes and atom-centered fragments play the main roles in the caspase-3 inhibitory activity of the compounds. Figure 3 shows that R7e +, MATS1e and G2e mean effects have negative and positive signs, respectively. The R7e +, MATS1e mean effects values are higher than that of G2e, which indicates that pIC50 is inversely related to atomic Sanderson electronegativities. It is also obvious that atomic masses mean effect on pIC50 is positive, because Mor28m, RDF115m mean effects values are higher than that of R2m + and RDF125m.

Standardized coefficients versus descriptor values in MLR.
Table 5

Details of name of the descriptors were used in model construction.

DescriptorsChemical meanings
P2v2nd component shape directional weighted by atomic van der Waals volumes
R7e+R maximal autocorrelation of lag 7/weighted by atomic Sanderson electronegativities
R2m+R maximal autocorrelation of lag 2/weighted by atomic masses
MATS1eMoran autocorrelation lag 1 / weighted by atomic Sanderson electronegativities
C-026R--CX--R Atom-centred fragments
Mor28m3D-MoRSE - signal 28/weighted by atomic masses
RDF125mRadial Distribution Function - 12.5/weighted by atomic masses
RDF115mRadial Distribution Function- 11.5/weighted by atomic masses
G2e2st component symmetry directional WHIM index/weighted by atomic Sanderson electronegativities

Conclusion

In this study, SW-MLR was used to develop linear QSAR model for prediction of caspase-3 inhibitory activity of 1, 2-benzisothiazol-3-one derivatives. The built model displayed good correlations between the structure and activity of the studied compounds. The model was validated using LOO cross-validation and external test set. The built model has a good self-and external-predictive power. Based on QSAR model results, electronegativity, the atomic masses, the atomic van der Waals volumes and R--CX--R Atom-centered fragments were found to be important factors controlling the caspase-3 inhibitory activity.

References

  • 1.

    Zhu QH, Gao LX, Chen ZP, Zheng SC, Shu HF, Li J, Jiang HF, Liu SW. A novel class of small-molecule caspase-3 inhibitors prepared by multicomponent reactions. Eur. J. Med. Chem. 2012;54:232-8. [PubMed ID: 22652225].

  • 2.

    Limpachayaporn P, Schäfers M, Schober O, Kopka K, Haufe G. Synthesis of new fluorinated, 2-substituted 5-pyrrolidinylsulfonyl isatin derivatives as caspase-3 and caspase-7 inhibitors: Nonradioactive counterparts of putative PET-compatible apoptosis imaging agents. Bioorg. Med. Chem. 2013;21:2025-36. [PubMed ID: 23411396].

  • 3.

    Colantonio P, Leboffe L, Bolli A, Marino M, Ascenzi P, Luisi G. Human caspase-3 inhibition by Z-tLeu-Asp-H: tLeu (P2) counterbalances Asp (P4) and Glu (P3) specific inhibitor truncation. Biochem. Biophys. Res. Commun. 2008;377:757-762. [PubMed ID: 18854175].

  • 4.

    Chu WH, Rothfuss J, Zhou D, Mach RH. Synthesis and evaluation of isatin analogs as caspase-3 inhibitors: Introduction of a hydrophilic group increases potency in a whole cell assay. Bioorg. Med. Chem. Lett. 2011;21:2192-7. [PubMed ID: 21441025].

  • 5.

    Lee D, Long SA, Murray JH, Adams JL, Nuttall ME, Nadeau DP, Kikly K, Winkler JD, Sung CM, Ryan MD, Levy MA, Keller PM, DeWolf WE Jr. Potent and selective nonpeptide inhibitors of caspases 3 and 7. J. Med. Chem. 2001;44:2015-26. [PubMed ID: 11384246].

  • 6.

    Taylor RC, Cullen SP, Martin S. Apoptosis: controlled demolition at the cellular level. J. Nat. Rev. Mol. Cell Biol. 2008;9:231-241.

  • 7.

    Hajimahdi Z, Ranjbar A, Suratgar AA, Zarghi A. QSAR study on anti-HIV-1 activity of 4-oxo-1, 4-dihydroquinoline and 4-oxo-4H-pyrido[1, 2-a]pyrimidine derivatives using SW-MLR, artificial neural network and filtering methods. Iran. J. Pharm. Res. 2015;14 (Supplement):69-75. [PubMed ID: 26185507].

  • 8.

    Ketabforoosh SHME, Amini M, Vosooghi M, Shafiee A, Azizi E, Kobarfard F. Synthesis, evaluation of anticancer activity and QSAR study of heterocyclic esters of caffeic acid. Iran. J. Pharm. Res. 2013;12:705-719. [PubMed ID: 24523750].

  • 9.

    Iman M, Davood A, Khamesipour A. Computational study of quinolone derivatives to improve their therapeutic index as anti-malaria agents: QSAR and QSTR. Iran. J. Pharm. Res. 2015. In Press.

  • 10.

    Davood A, Nematollahi A, Iman M, Shafiee A. Computational studies of new 1, 4-dihydropyridines containing 4-(5)-chloro-2-ethyl-5-(4)-imidazolyl substituent: QSAR and docking. Med. Chem. Res. 2010;19:58-70.

  • 11.

    Liu D, Tian Z, Yan Z, Wu L, Ma Y, Wang Q, Liu W, Zhou H, Yang C. Design, synthesis and evaluation of 1,2-benzisothiazol-3-one derivatives as potent caspase-3 inhibitors. Bioorg. Med. Chem. 2013;21:2960-7. [PubMed ID: 23632366].

  • 12.

    Wu L, Lu M, Yan Z, Tang X, Sun B, Liu W, Zhou H, Yang C. 1, 2-Benzisothiazol-3-one derivatives as a novel class of small-molecule caspase-3 inhibitors. Bioorg. Med. Chem. 2014;22:2416-26. [PubMed ID: 24656804].

  • 13.

    Hyper Chem Release 8. HyperCube, Inc; Availabe from: URL: http://www.hyper.com.

  • 14.

  • 15.

    SPSS for Windows. Statistical Package for IBM PC, SPSS Inc; Available from: URL: http://www.spss.com.

  • 16.

    Todeschini R, Consonni V. Handbook of Molecular Descriptors. Weinheim: Wiley-VCH; 2002.

  • 17.

    Viswanadhan VN, Ghose AK, Revankar GR, Robins RK. Atomic physicochemical parameters for three dimensional structure directed quantitative structure-activity relationships Additional parameters for hydrophobic and dispersive interactions and their application for an automated superposition of certain naturally occurring nucleoside antibiotics. J. Chem. Inf. Comput. Sci. 1989;29:163-172.

  • 18.

    Ghose AK, Viswanadhan VN, Wendoloski JJ. Prediction of hydrophobic (lipophilic) properties of small organic molecules using fragmental methods:  An analysis of ALOGP and CLOGP methods. J. Phys. Chem. A. 1998;102:3762-72.