Our study’s results suggest that the Persian version of EDI-3 would be a reliable questionnaire with good validity for measurement of eating disorders in healthy Persian adults. Also, the study findings demonstrated acceptable alpha coefficients in all domains but asceticism and interpersonal alienation subscales in both sexes. In general, the results of this study are partly in agreement with other researches that have reported that EDI-3 is a valid and reliable tool on normative samples from the USA, Canada, Europe and Australia (
12). Also, Nyman-Carlsson et al., stated acceptable reliability for all subscales of EDI-3 questionnaire except asceticism among Swedish normal controls (
13). In addition, Tachikawa et al, administered the Japanese version of eating disorder inventory (EDI-2) to 91 Japanese eating disordered inpatients and 119 matched non-clinical controls. The results of their study showed that all subscales (except asceticism) reached sufficient internal consistencies (
14). It can be attributed to the unknown feeling that some respondents declared. Also, it might be that these domains do not appear very homogenous at least in adult Iranian culture. The other possible explanation would be that the responders do not have a very strong tendency to place a positive connotation on achieving virtue through self-restrain or to experience guilt and shame surrounding the experience of pleasure. A low score scale not only is common among respondents from the nonclinical sample but also could reflect denial of current clinical state or response bias on the part of the patients.
Drive to thinness is one of the main signs and also is considered as the essential criterion for diagnosis of eating disorder. Seven questions related to this subscale evaluate these items: desire to be thin, worry about dieting, preoccupation with weight and intense fear of weight gain. The ICC value for drive to thinness was 0.8 in our study, but Clausen et al., reported that drive to thinness may score high for any reason unrelated to the pathology of eating disorders or inaccuracy in perception or recognition of bodily states so this important pathognomonic sign of the specific eating disorder psychopathology, frequently led to missteps in distinguishing signs of hunger (
4).
The bulimia scale measures the issue of binge eating and only one of its items referred to the compensatory method of vomiting. Studying 108 American men and women, Stanford et al. showed that both the male and female subjects’ scores were mainly similar for eating disorders, reflecting a diagnosis of binge eating. As he stated, which is in line with our study, the bulimia scale on the EDI-3 in point of fact, sufficiently evaluates binge eating symptoms in men (
6).
Perfectionism subscale includes six items to evaluate respondent’s desire for hard work to achieve high goals. Questions related to the subscale of perfectionism are divided into two dimensional constructs; self-oriented perfectionism and perfectionism is reflected by the community over the individual (
15,
16). This subscale showed a borderline score in our study. This could be due to these two independent dimensional constructions of items related to this subscale. For instance, one might be perfectionism, but this feeling is not related to the expectations of those around a person. Furthermore, a person may be under pressure from expectations of others indicating in this item “my parents have expected excellence of me” ,but the best in his life is not one’s own effort relating in this item “I hate being less than best at things”. It seems that if perfectionism is divided into two categories and evaluated separately, the result would be more satisfying. However, testing such hypotheses is beyond the scope of this study.
The ICC values in current study for body dissatisfaction in men and women were 0.6 and 0.8 respectively, along with the Stanford et al study, asserted that the body dissatisfaction subscale alone could not remarkably aid the recognition of an eating disorder in males (
6). However, it could be said that the Stanford et al study participants were 66 men who received treatment for eating disorder and addiction and 45 healthy ones were matched as control group (
6). The comparison between Swedish, Danish and international control samples demonstrated that the Swedish control scored notably higher than the other two controls, unless in the eating disorder risk scales, perfectionism and asceticism (
13). While Swedish controls scored higher on interoceptive deficits, emotional dysregulation and maturity fears, the international counterpart achieved higher score on interpersonal alienation subscale (
13). Also, Nyman-Carlsson et al reported that psychiatric outpatients scored higher than both ED and normal to controls just in two scales, including interpersonal insecurity and interpersonal alienation (
13). Furthermore, a study released by Waldherr et al, showed that the average EDI scores of the southern countries are similar to the North American and tendency to weight seems to be less prevalent in the Netherlands than in North America and central and southern parts of Europe. Also, due to the subscales of bulimia, perfectionism, interoceptive awareness and maturity fears, the Dutch females of all age groups scored lower than Italian females. In addition, Italians earned slightly higher scores in bulimia and interoceptive awareness than the US. In sum, the results of the Waldherr et al study revealed that the EDI scores may be affected by socio-cultural factors, cultural properties, culture-specific social requirements on young adults and differences in maturation between North and South Europe (
3).
The Mahmoodi et al study’s findings indicated that although restraint, eating concerns, laxative misuse and self-induced vomiting are rare phenomena in Iranian undergraduate women, but shape concern, objective binge eating, and dietary restraint are commonly seen.
Also, this study presented some similarities between Iranian and Spanish students on the scores of the eating, weight and shape concerns subscales. Furthermore, Iranian and Portuguese women were similar on shape concern subscale. But Iranian women obtained a higher score on restraint subscale than Japanese female students (
17).
In a review study that ED assessments in athletes have been observed, results showed that 24 studies calculated internal consistency and only three of them calculated test-retest reliability. In 14 studies, the second version of EDI has been used and most of them have cited Garner (1983, 1991) reliability coefficients except three that calculated coefficients ranged between 0.69-0.90. The writer has proposed that the other types of reliability assessments including test-retest reliability would be valuable to evaluate whether or not athletes may earn approximately the same EDs score during multiple measurements and /or make sure that changes of EDs scores overtime are not due to measurement property change of eating disorder assessments (
18).
The uses of EDI-3 vary from diagnostic perception, therapy plans, and outcomes interpretation for female eating disorder patients to ED screening in nonclinical groups. Overall, considering these parameters, such as considerably expanded patient samples, upgraded scale derivation methods and theoretical structure, inclusion of response style indices, and a better section on test analysis, the new EDI-3 seems to be experimentally superior to EDI-2. But it was designed for use with females aged 13 - 53 years, and then a lack of information about its value and appeal with men in both clinical and nonclinical settings is a primary deficit (
5).
One of the strengths of our study is the careful planning and proper execution of the test-retest reliability. A comprehensive number of university students were registered and encouraged to fill out questionnaires as completely as possible. Moreover, the time and place was arranged precisely in the same place to carry out the re-test two weeks later.
5.1. Limitations
The results of this study need interpretation with caution. First of all, it could be said that the sampling method and nature of the sample (i.e., only university students) limit the generalization of the findings into the general population and clinical sample.
Secondly, the university students should fill out a large questionnaire. In some situations, they did not complete the questionnaire completely. We prepared refreshments for them and asked to fill out the whole questionnaire completely by a gifted pencil. The examiner (H.D) should check all questionnaires to ensure that all items were answered. But some items were missed in spite of precise assessment. Finally, it's worth noting that calculating discriminant or convergent validity especially in the patients group would be much more preferable.
In conclusion, based on our study results, the validity and reliability of Persian version of EDI-3 is acceptable in all subscales except asceticism and interpersonal alienation in both men and women.