Schizophrenia is a chronic and severe mental illness leading to a wide range of symptoms such as delusions, hallucinations, disorganized speech, thinking problems, and demotivation (
1). Schizophrenia usually affects 1% of the general population and is traditionally diagnosed by DSM-V criteria (
2). Schizophrenia is often categorized based on its dominant symptoms, including positive and negative symptoms. Positive symptoms include hallucination, delusion, and conceptual disorganization, while negative symptoms are apathy, affective blunting, and poverty of speech (
3). These symptoms can be evaluated in different ways, including clinical interviews with expert psychiatrists or psychological tests such as PANSS (
4). The diagnosis of schizophrenia based on dominant symptoms plays an important role in differential diagnosis, treatment plan, the selection of best medication by a doctor, and the prognosis of the disease. Although schizophrenic patients may have positive or negative symptoms, some patients have both types of symptoms, which one of them being more dominant. The diagnosis of schizophrenic patients with dominant positive symptoms is easier than the diagnosis of those with negative symptoms. The main challenge in diagnosing schizophrenic patients with negative symptoms is the similarity of their signs and symptoms with other psychiatric diseases, including major depressive disorder (MDD) and schizoid disorder personality (
5). Moreover, patients with dominant-negative symptoms tend to be associated with remarkably poor outcomes and inaccurate prognosis. Accordingly, the precise categorization of schizophrenic patients based on their dominant symptoms is still challenging for psychiatrists (
6). To overcome this drawback, it is better to find a quantitative biomarker for a better diagnosis of this complicated disease. Different aspects of schizophrenic patients with positive and negative syndromes have already been studied. In schizophrenic patients with both symptoms, a few brain functions and cerebrospinal fluid flow are affected and show abnormal behavior. In addition, brain structural change and deterioration in their genetics are observed (
7). To monitor the electrical and functional activities of the brain, several types of equipment, including electroencephalography (EEG) systems, functional magnetic resonance imaging (fMRI), and positron emission tomography (PET), have been developed. Note that schizophrenia and its subtypes have been studied with these signals and imaging modalities (
8,
9). Recent progress in neuroimaging modalities enables specialists to better understand the different aspects of various brain diseases. Structural abnormalities were also studied in schizophrenia with different types of symptomatology and revealed lower volume in frontal and temporal lobes (
10).
Electroencephalography is a quantitative and physiological-based tool with a high temporal resolution and has become an important modality in studying and diagnosing various brain disorders. Since the EEG signal acquisition is cheap and has no side effects, researchers have paid attention to using this physiology-based signal to reveal different abnormalities in the brain. A few studies have been conducted to differentiate schizophrenic patients with positive and negative symptoms by analyzing their EEG signals (
11-
13).
Temporal synchronization and the oscillation behavior of neurons in the EEG signals are critical factors in cognition and perception tasks, both of which are impaired in schizophrenic patients. These impairments lead to abnormal EEG behaviors. However, the relationship between the gate sensory deficit and schizophrenia symptoms is still ambiguous. Keil et al. (
11) recorded the EEG signals of 22 schizophrenic patients (with an equal number of dominant symptoms) and 22 age-matched controls in the presence of an auditory stimulus. They observed an increase in the Gamma-band power sensory gating in schizophrenic patients with positive symptoms. In contrast, alpha-band phase synchrony sensory gating was decreased in schizophrenic patients with negative symptoms.
Fan et al. (
12) asked a group of hospitalized schizophrenic patients to do a cognitive task to compare their cognitive ability with that of the healthy subjects. The results of their test illustrated some features, such as the amplitude of EEGs and the frequency distribution of EEGs, which can guide specialists to classify schizophrenic patients with different dominant symptoms more accurately. In another study, Saletu et al. (
13) assessed the brain maps of 48 schizophrenic patients and showed a severe drop and incline in their Alpha and Beta bands, respectively, compared to healthy subjects. Moreover, they compared the EEG variations of schizophrenic patients with negative and positive symptoms and observed an increase in the low-frequency EEG bands (e.g., delta/theta) of patients with dominant-negative symptoms while observed a drop in the activity of the Alpha band and an increase in the Beta band in schizophrenic patients with dominant positive symptoms. A review of the literature reveals that a few studies have aimed to discriminate schizophrenic patients with different dominant symptoms, some of which have traced the increase or decrease in the standard EEG band powers in classifying schizophrenic patients by their dominant symptoms.
As EEG signals behave irregularly and obey no specific pattern, information-theoretic measures are adopted to decode the EEG variations and reveal their content. The domain of information theory is vast and contains several issues such as coding and decoding different types of data (e.g., text, image, and video) (
14) and measuring, roughness, and complexity captured in different types of data, including EEG signals, MRI images, data stream, micro-array sequences, and radio signals. One of the main measures in information theory is entropy. In this regard, several methods have been proposed to measure entropy because the value of entropy directly relates to the amount of information included in data. The well-known entropy measures are approximate entropy (ApEn), Kolmogorov entropy (
15), Shannon entropy (in the time domain), spectral entropy, and multiscale entropy (MSE) (
16). In a study, Shannon entropy was applied to the fMRI images to identify the nonlinear dynamics of the brain functional networks in schizophrenic patients (
17). The usefulness of ApEn in diagnosing schizophrenia was studied by Taghavi et al. (
18). They showed that the value of ApEn in most EEG channels significantly decreased in schizophrenic patients compared to healthy individuals. Sabeti et al. (
19,
20) extracted ApEn, spectral entropy, Shanon entropy, Limpel-Ziv, fractal dimension, and auto-regressive (AR) coefficients from the EEG signals of healthy subjects and schizophrenic patients to differentiate these two groups of individuals (
21,
22). They achieved a maximum classification accuracy of 91% for 20 schizophrenic patients and 20 age-matched controls. Note that the estimated value of entropy by different schemes is not equal, and some measures can reveal the EEG variations better (for a specific brain disease) than the other entropy measures.
Multiscale entropy is an efficient signal processing scheme to reveal the hidden patterns captured in relatively short time-series such as segmented EEG. Other entropy measures quantify only the irregularity of time-series on a single scale. In contrast, using a coarse-graining procedure, the MSE method quantifies the degree of complexity on multiple scales (
23).
It is noteworthy to mention that extraction of entropy features (by each of the aforementioned schemes) from several EEG channels in successive time frames leads to providing high dimensional feature vectors. Moreover, some of these features are correlated. To make these features independent and also diminish the size of the features, some methods [e.g., principal component analysis (PCA)] can be used (
24). The features extracted by PCA guarantee to preserve maximum information about the original features and provide new features with no dependency and much lower dimension.
After extracting low dimensional independent features, to classify the patients automatically, we need an intelligent classifier to learn patterns in the offline phase (train phase) and then classify the test features in the online phase. Since the Bayes classifier is the optimum classifier, this classifier can be used. Bayes' theorem measures the probability of each class regarding the input feature vector. In other words, for each class, a probability density function is estimated, and an input feature vector is assigned to that class whose classifier obtains a higher posterior probability. Nonetheless, we need to know the data distribution to use the general form of the Bayes classifier. Since the distribution of data is unknown, we cannot use the standard form of this optimum classier.
Nevertheless, there is a weak form of Bayes classifier, known as Gaussian Naïve Bayes (GNB), in which each dimension obeys a Gaussian distribution. Despite the simplicity of GNB, this classifier can capture the mean and variance of the features (
25). GNB determines the label of each input by maximizing the posterior probability (
26). If we apply the features to PCA before applying them to the classifier, we can guarantee that the input features to the GNB are independent. To classify the feature vectors of each class, we need to train an exclusive GNB classifier.
This study aimed to evaluate and separate schizophrenic patients with different symptomatology by characterizing their EEGs using the MSE method. We also differentiated schizophrenic subjects from healthy subjects using the same method. Moreover, the patient’s MSE patterns on all EEG channels were also studied and reported. Few studies have quantitatively assessed the differences in the EEG signals of schizophrenic patients with different dominant symptoms; hence, this area is still spending its infancy period, and more research is required to reveal more differences in their EEG signal patterns. To the best of the authors' knowledge, no research has yet been conducted on discriminating schizophrenic patients with different dominant symptoms by eliciting the MSE of their EEGs. Furthermore, in the recent and previous studies, researchers just showed a positive or negative correlation of the band power in different EEG bands in schizophrenic patients with positive and negative symptoms; however, they did not report the classification results. To fill up this gap, in this research, after extracting MSEs from all channels using the successive time frames, we applied PCA to these feature vectors to diminish their size and provide new independent features. Then we automatically used the reduced features of the GNB classifiers (
27) to distinguish schizophrenic patients with positive and negative symptoms automatically. It should be emphasized that we aimed to classify patients automatically by adopting statistical classifiers and training them in the offline phase.
Moreover, we considered a control group and recorded their EEG signals to distinguish them from schizophrenic patients. In other words, a GNB for each class is trained, and in the test phase, the extracted features of EEG signals are applied to the three GNBs, and an input label is assigned to that class, the posterior probability of whose GNB is maximized. Another contribution of our study was to collect and prepare a new EEG dataset from 26 healthy subjects and 36 schizophrenic patients with positive and negative symptoms.
The present paper is outlined as follows. Section 2 introduces the collected EEG dataset, and the subjects and the recording protocol are specified. Then the preprocessing stage is explained, and the candidate features and classifier are expressed. Section 3 presents the empirical results of our dataset, and section 4 discusses the pros and cons of the proposed method compared to past research. The paper is finally concluded in section 5, and there are some recommendations for future research.