Depression is a common mental disorder globally, with a high rank of disease burden (
1). It is the fourth major cause of disability and has a considerable non-fatal burden of disease (
2-
4). The World Health Organization (WHO) has estimated that by 2020, depression will be the second most common health-related problem after cardiovascular disease (
5).
Although accurate information about the prevalence of depression is not available in many countries, it appears to vary widely in its prevalence (
6). Depression is the most prevalent, debilitating, and costly mental illness in the United States. Approximately 12% of youth and 16.6% of the adult population had the diagnostic criteria of depression, including major depression (
7).
A national study in Iran has demonstrated that the prevalence of depression was approximately 20%. Depression was found to be the biggest health problem among mental disorders in terms of disability-adjusted life years (DALYs) by the National Burden of Disease and Injury in Iran (
8). It is the third major cause of disability and has been considered to increase the years of life with disability in Iran from 2007 to 2017 (
9).
Previous studies show that depression has been related to various socioeconomic factors (
10), including age, sex, heredity, lifestyle, local social structure, workplace, cultural, and environmental status, which are effective in determining the health of individuals (
11). According to the relationship between socioeconomic, cultural, and environmental factors with depression, the use of spatial models is necessary to find the geographical distributions and the other related factors.
The geographical distribution of this phenomenon is important, especially in the field of health research and health economics (
12). Modeling the spatial distribution of disease is helpful to assess whether health care resources should be uniformly distributed or they should be primarily available in certain areas (
13). In many classical statistical methods, the correlation between observations that can cause inflammation of variances is not considered in data analysis.
In fact, we face non-independent observations in most sciences in which they are correlated to each other in terms of location. This is called spatial correlation and such data are known as spatial data. For analyzing spatial data, a new branch of statistics that is called spatial statistics has been developed.
One of the statistical methods used in spatial statistics is the spatial generalized linear mixed model (
14). With the assumption of independence in observations, a linear relationship is established between the mean of the observations and covariates using a link function in these models. A generalization of the models mentioned above is a generalized linear mixed model. The correlation between observations is considered a latent variable by adding random effects and independent assumption of observations change to conditional independence. Generalized linear mixed models are used when spatial-dependent data and spatial responses are both discrete distribution of the exponential family (
15).