1. Background
Overweight in adults has increased significantly in developed countries in recent decades. This problem has attracted interests of researchers in the field of nutrition knowledge. Nutrition knowledge is an important factor based on which researchers are trying to develop a nutrition knowledge scale.
The nutrition knowledge questionnaires developed so far have some limitations. Either they are weak in the area of psychometric validation or they target a limited area of nutrition knowledge (1-8). For example, Towler and Shepherd showed that their questionnaire has good internal reliability and construct validity (9). Their study does not include the kind of psychometric validation. The content validity is questionable because of little explanation on how the items were generated. That questionnaire has no questions about dietary recommendations and the relationship between disease and diet, but the questionnaire evaluates knowledge about the nutrient content of foods. Anderson et al conducted a study using a questionnaire with good content validity (10). In contrast to Towler and Shepherd’s questionnaire, the items concerned familiarity with nutrition-related phrases as well as knowledge about dietary recommendations and their practical application. But the internal consistency of this questionnaire is poor, and the study was not subjected to construct validity and test-retest reliability. So we want the questionnaires that consider both aspects of psychometric validation and nutrition knowledge, as well as dietary habits. Nutrition knowledge questionnaire for adults (NKQA) (11), and the dietary habits and nutritional knowledge (DHNK) questionnaire (12) together are included nutrition knowledge and dietary habits and also consider psychometric validation. There is no overlap or similarity in nutrition knowledge questions between two questionnaires. By these two questionnaires we want to study a wide range of nutritional issues of Iranian adults for the very first time. Nutrition knowledge is important because it affects food choices. Some studies have shown that application of the beliefs models in increasing nutrition knowledge can lead to better dietary behavior and intake (13, 14). Over the last few decades, the people of Iran have changed their eating patterns. The rising incidence of obesity and chronic diseases may be due to an imbalance in dietary patterns (15). So far, no research has focused on the psychometric evaluation of a nutrition knowledge and dietary habit questionnaires with regard to Iranian adults.
On the other hand, scales in a foreign language can be bound to a certain a culture. We cannot use such questionnaires directly because of the existence of cultural bias. Translation is a common method to prepare questionnaires for cross-cultural adaptation. So after translation, it necessitates assessing psychometric evaluation of translated questionnaires (16-18).
Therefore, this study carries out a psychometric evaluation of a Persian version of the scales and confirms its structural model among Iranian medical staff at a hospital in Mashhad city, which is in northeast Iran. Also the researchers offer this scale for more studies to use in order to gather data about dietary habits and find ways to raise public awareness about nutrition and unhealthy dietary habits. Hereby overweight/obesity can be prevented.
2. Objectives
The present study aimed to evaluate the validity and reliability of the Persian version of the dietary habits and nutritional knowledge questionnaire and the nutrition knowledge questionnaire for adults, and to confirm their structural model in the Iranian population.
3. Methods
The scale used in this study combines two English questionnaires. Questionnaire 1 is a DHNK questionnaire that includes two subscales: dietary habits (18 items which are scored on a four-point Likert scale from 1 (never) to 4 (always)) and nutrition knowledge (29 items which are scored on a four-point Likert scale from 1 (strongly disagree) to 4 (strongly agree)). The reliability coefficients for original DHNK questionnaire (English version) were 0.66 and 0.645 for dietary habits and nutritional knowledge, respectively (12). Questionnaire 2 is NKQA, which contains three subscales: familiarity with MyPyramid (20 items), the nutrition content of foods (15 items), and the diet-disease relationship (11 items). Each item was scored either correct or incorrect, and the scores were then totalled for each subscale and the questionnaire overall. For original English version of NKQA Cronbach’s alpha and test-rest were 0.91 and 0.95 respectively which demonstrating internal consistency reliability and test-retest reliability (11). There are two sections in the combined questionnaire: (1) demographic variables such as gender, age, academic degree, and body mass index, and (2) the main part, which consists of 93 statements (46 items for NKQA and 47 items for DHNK) on nutritional information and dietary habits.
The research design consists of a translation procedure and psychometric evaluation. This includes assessing content validity, construct validity, and reliability. Permission for translation of the NKQA and DHNK questionnaires was obtained by email from Prof. Anna Marie Jones and Prof. Carol Biddington, respectively. The original English version of NKQA and DHNK questionnaire were disseminated in years 2015 and 2005 respectively (11, 12). A total of 332 questionnaires were distributed in the second half of 2016. 12 respondents were excluded following the exclusion criteria and 20 respondents denied to end up with the 300. The staff inclusion criteria were being able to read and write Persian, and being over 18 years of age.
3.1. Procedure
The NKQA and DHNK questionnaires were translated into Persian by using the Brislin translation method (forward translation) (19). Two experts of health education and nutrition who were familiar with English and Persian did the translation work. They translated the questionnaires to Persian independently. The two translations were then formed into a single unit. Thereafter, it was sent to health education and nutrition experts to evaluate its content validity. The translated version was also provided to the other experts of health education and nutrition for their comments. The goal was to ensure that the items of the questionnaires had no ambiguity, and that different experts had the same understanding of the items. The final version of the translated scale was approved using the most appropriate translated terms and the imperfections were removed. Then, an English language expert who was unfamiliar with the content of the primary English questionnaires performed backward translation from Persian to English. The English language expert provided his comments and, by comparing the English questionnaires, the necessary changes were applied to the Persian translation. This version of the questionnaires was then distributed among 10 experts in the field of nutrition knowledge to evaluate its content validity.
3.1.1. Content Validity
To assess the content validity index (CVI), the experts were asked to talk about each item in order to assess its simplicity, relevancy, and clarity with regard to the content as per the Waltz and Bausell index (20). The experts also were asked to rate scale items on a 4-point ordinal scale (Table 1) (14).
Details | |
---|---|
Simplicity | not simple |
item need some revision | |
simple but need minor revision | |
very simple | |
Relevancy | not relevant |
item need some revision | |
relevant but need minor revision | |
very relevant | |
Clarity | not clear |
item need some revision | |
clear but need minor revision | |
very clear |
Scoring Method
To obtain content validity index for simplicity, relevancy and clarity of each item, the number of those judging the item as simple or relevant or clear (rating 3 or 4) was divided by the number of content experts. The values for the CVI which are given greater or equal than 0.79 were considered acceptable (0.7 - 0.78: revision and correction; < 0.70: unacceptable) (20).
The experts were also requested to assess each item as ‘essential’, ‘useful but not essential’, or ‘not necessary’ in order to compute the content validity ratio (CVR). A decision was made to accept or reject CVR values using the Lawshe table (21). This table shows the critical CVR values for different values of the sample size of experts. A test evaluator could determine the size of the calculated CVR that is necessary to exceed the chance expectation. Lawshe developed following formula to calculate content validity ratio.
CVR = (ne - N/2)/(N/2)
Where ne is the number of experts assessed the item as essential and N is total number of experts.
In addition to determine the quality of the content validity, floor and ceiling effects were computed. A floor or ceiling effects of 15% or less is considered acceptable (22).
3.1.2. Reliability
The reliability of the Persian version of the questionnaires was first assessed through evaluating internal consistency. Cronbach’s alpha coefficient was employed to assess the internal consistency of the questionnaires. If the value of Cronbach’s alpha coefficient is greater than or equal to 0.7, it indicates satisfactory internal consistency (23). After that, the construct reliability (CR) of each of the subscales was evaluated (24). CR values between 0.6 and 0.7 can be accepted provided other indicators are good (25). In addition, to compute the intra-class correlation coefficient (ICC), and the stability and reliability through the test-retest, the questionnaire was given to 30 health employees in two-week intervals. The ICC values which are greater or equal than 0.60 were considered satisfactory (r ≥ 0.81 - 1.0 as excellent, 0.61 - 0.80 very good, 0.41 - 0.60 good, 0.21 - 0.40 fair, and 0.0 - 0.20 poor) (26). Data were analysed using SPSS version 23.
3.1.3. Construct Validity
To determine construct validity, confirmatory factor analysis (CFA) was performed. The LISREL program version 8.80 was used to carry out CFA in order to ensure that the factor structure of the translated version of the questionnaires is similar to the original version. LISREL is a program that can perform CFA to release model fit indices (27). The values are shown on the side of the box in LISREL outputs (path diagrams) are errors (Figures 1 and 2). In our study, as errors decrease, the factor loadings will increase.
We used following fit indices in the present study and their acceptable ranges are reported as following (26, 28, 29): the chi-squared goodness-of-fit test (chi-square/df) < 5 (< 3: good), root mean square error of approximation (RMSEA) < 0.1 (< 0.08: good), The normed fit index (NFI) > 0.9 as well as non-normed fit index (NNFI) > 0.9 and comparative fit index (CFI) > 0.9.
3.1.4. Sample Size
Determining the sample size is a prerequisite in order to perform CFA. There are several recommendations for an acceptable sample size for factor analysis. Some authors recommend 5 persons per item while others recommend 10 (30, 31). The original questionnaires used in this study have 46 and 47 items. So, the recommended samples for each individual questionnaire ranged from 230 to 470. To get the samples for factor analysis, stratified sampling was used. The hospital staff was classified into physicians, nurses, and other employees. All participants gave informed consent according to the research ethic committee of the Baqiyatallah University of Medical Sciences. Strata sample sizes are determined by the following equation:
nh = (Nh/N) * n
where nh, Nh, N and n denote the sample size for stratum h, the population size for stratum h, total population size and total sample size respectively. For example for physician stratum, we obtained (Nh/N) = 0.04. So the sample size for this stratum is 300 * 0.04.
4. Results
The participants comprised physicians, nurses, and other employees working at a hospital in Mashhad. Thereby, the content validity, internal consistency, and construct validity of the Persian version of the two questionnaires was obtained. The mean ages of the participants were 30.7 ± 8.3 years. Of the 300 respondents, 41.8% were male and 58.2% were married. The demographic characteristics of the study sample are presented in Table 2.
Variable | Valuesa |
---|---|
Gender | |
Male | 125 (41.7) |
Female | 175 (58.3) |
Job title | |
Physician | 12 (4) |
Nurse | 93 (31) |
Other employees | 195 (65) |
Academic degree | |
PhD | 14 (4.7) |
MSc | 45 (15) |
BSc | 156 (52) |
Associate Degree | 38 (12.7) |
High school diploma | 37 (12.3) |
Less than Diploma | 10 (3.3) |
BMI (body mass index) | 23.74 ± 3.6 |
Demographic Characteristics of Sample
The results of reliability of test-retest and internal consistency are reported in Table 3.
Scales (Range) | Subscale | No. of Items | r | CR | Mean | SD | Floor Effect (%) | Ceiling Effect (%) |
---|---|---|---|---|---|---|---|---|
NKQA (0-22) | Familiarity with MyPyramid | 20 | 0.95 | 0.83 | 9.3 | 3.71 | 0.3 | 0.3 |
NKQA (0-26) | Nutrition content of foods | 15 | 0.97 | 0.80 | 11.93 | 5.06 | 1.7 | 0.3 |
NKQA (0-18) | Diet-disease relationship | 11 | 0.98 | 0.78 | 8.7 | 3.92 | 0.3 | 6 |
DHNK (18-72) | Dietary habits | 18 | 0.99 | 0.96 | 51 | 4.46 | 0.3 | 1 |
DHNK (29-116) | Nutritional knowledge | 29 | 0.94 | 0.88 | 84.76 | 7.93 | 0.3 | 0.3 |
Study Measures and Descriptive Statistics for Study Participants
The Cronbach’s alpha coefficients for the total scale of NKQA and DHNK were 0.78 and 0.74 respectively. Also the ICC for the total scale of NKQA was 0.97 and for DHNK was 0.95. Table 3 also shows participants mean scores. High scores in the following subscales show good nutrition knowledge of participants: familiarity with MyPyramid, nutrition content of foods, diet-disease relationship and nutritional knowledge. High dietary habits subscale scores reveal healthy participants’ dietary habits.
In this study, 10 experts provided comments. They found the questionnaires satisfactory and approved all items in it. The CVR scores were 0.91, 0.87, 0.84, 0.9, and 0.85 for the subscales of familiarity with MyPyramid, nutrition content of foods, diet-disease relationship, dietary habits, and nutrition knowledge, respectively. Besides, the CVI scores for all items were ≥ 0.80 and the qualitative content validity was approved. The mean CVI for all items was 89% for relevancy, 87% for clarity, and 90% for simplicity. Floor and ceiling effects of all subscales are in acceptable range.
To determine construct validity, CFA was conducted separately for the two translated questionnaires. Table 4 shows the results of the CFA on the Persian versions of the NKQA and DHNK questionnaires.
Goodness of Fit Indices | Chi-Square | Df | Chi-Square/df | CFI | NFI | NNFI | RMSEA |
---|---|---|---|---|---|---|---|
NKQA | 1977.09 | 973 | 2.03 | 0.92 | 0.86 | 0.92 | 0.059 |
DHNK | 1890.16 | 998 | 1.89 | 0.97 | 0.93 | 0.96 | 0.055 |
Results of the Confirmatory Factor Analysis
The CFA results showed that the translated Persian version of the questionnaires was good enough to fit. All indices are in an acceptable range and show strong goodness of fit. Figures 1 and 2 show the confirmatory factor structure of NKQA and DHNK. Saggino and Kline suggested that items with absolute loading values of 0.3 or greater should be considered appropriate (32). In our study, all factor loadings are greater than 0.3. The negative factor loadings mean that hidden factors have the characteristic opposite of whatever the observable measures measure.
The values of construct reliability for all subscales are greater than 0.7 and show the reliability of the translated scale. In addition, the ICC values are greater than 0.81 and satisfactory. With regard to the CVI and CVR values, the Persian version of the scale has good content validity. As a reliable and valid scale, it can be used among Iranian adults.
5. Discussion
There is a relationship between the internal consistency and homogeneity of the scale (33). In well-designed scales, Cronbach’s alpha coefficient is greater than 0.7 (34). Kliemann et al. (35) evaluated an updated version of the general nutrition knowledge questionnaire (GNKQ). They reported that the reliability was greater than 0.7 for all sections, and their updated scale was reliable and valid. Then the values of construct reliability for all subscales were greater than 0.7 which indicated good reliability (24). In another study on nutrition involving a 20-item questionnaire on nutrition knowledge, internal consistency reliability and test-retest reliability were 0.74 and 0.86. This revealed the validity and reliability of the scale (36). For the present study, the level of internal consistency is satisfactory. To investigate the stability, the ICC was used and the responses showed high stability. All subscales had good internal consistency and test–retest reliability.
The studies that assessed original questionnaires, did not fit CFA models to questionnaires (11, 12). In this paper CFA models fitted to the translated questionnaires and all fit indices were in acceptance range.
However, the scale was used among health employees, but it does not mean that the scale is not applicable to the general population. Because according to Table 2, 65% of staff has no health-related field of study. Also 28.3% of staff has no any college education.
The strength of this study is that it has completed the psychometric process to provide a valid and reliable scale for the first time in Iran. The scale needs to be used by experts to assess nutritional knowledge and dietary habits of general population.
One of the limitations of our study is that, although the scale was demonstrated to be valid and reliable, the large number of items may make it boring and some participants may not complete it. Regarding to Table 3, by the time the participants start the second questionnaire (NKQA), they are tired and the scores drop. But they have better scores on items relating to dietary habits and nutrition knowledge (DHNK). The questionnaires order effect bias can be minimised by changing the order the questionnaires are administered. Moreover, the mean of the employees’ BMI staff shows that the employees are healthy, which may be due to healthy habits. A decline in the scores of the NKQA subscales could be due to the large number of questions. A shorter version of this scale could give better results because a shorter version would have reduced the variation in the answers. The questionnaire’s items can be reduced by using higher cutoff for loading factor, or using data reduction analyses such as “component factor analysis”. It is an important issue tough what items would be excluded to shorten the tool and what loss would be had in the information collected. But in this study, the aim is to evaluate the validity and reliability of the Persian version of the original questionnaires with all items. Whether and how this subject can be adapted to our scale is an interesting project for further investigations.
5.1. Conclusions
This reliable and valid scale can be used in Iranian population.