Following permission for adapting the DEAP from the developer, Prof. Barbara Dodd, and the publisher, Pearson Inc., to ensure cross-cultural translation-adaptation, a standard procedure including multiple forward and backward translations and qualitative (via expert panel sessions for cognitive debriefing) and quantitative (measuring CVR and CVI) evaluation of the translations were followed, and necessary adaptations were implemented (
19). In this article, validation processes of the articulation assessment were described. Content validity ratio (CVR) and content validity index (CVI) were measured for all individual items and the instrument as a whole (
19,
20).
Translation and adaptation of the instrument followed a multistage procedure. An official permission for the questionnaire was obtained from both test developers and the publishing company. Then an English translator, a speech and language pathologist and a linguist (all Persian native speakers), translated the questionnaire independently. In order to avoid ambiguity and cultural issues, these independent translations were presented to a panel of experts to finalize the translation. To correspond with Persian linguistic properties and the Iranian culture (the context where the test was standardized), a pool of Persian items was developed consistent with the global outline of the articulation assessment of DEAP battery. The DEAP manual (
21) provided criteria for generating items for the articulation test. It also provided the theoretical basis for developing the picture naming test (
8,
22). All words were adopted from the Persian core vocabulary for Iranian children (
23), picture-riddle dictionary (
24), children’s story books, Persian dictionary for school children (
25), Persian translation of McArthur-Bates communicative development inventory (
26) and available Persian phonetic tests. Then generated items were attached to the primary translation. Two linguists and two speech-language pathologists evaluated the quality of translated instructions and assigned items in the preliminary Persian version of the instrument. A native English/Persian bilingual speaker with linguistic background back-translated the instrument from Persian into English. The second author rated the quality of forward-backward translation in terms of clarity, common language use and conceptual equivalence. The back translation was submitted to the Pearson Inc. to test the equivalence of back-translated version with the original version. After that the preliminary form of the instrument was reviewed by a panel of experts (four speech therapists and three linguists) who rated the relevancy of the instrument (content coverage) in terms of the items’ syllable structure, items’ familiarity for the targeted age group, positions of consonants and vowels in the words, image agreement and transparency of the items, and finally the instructions’ relevancy, simplicity and clarity. This involved the use of a 100-point rating scale, ranging from completely undesirable (0) to completely desirable (100) and a box for comments. Backward translation resulted in no major linguistic or cultural concerns. Finally, a pilot study was conducted with 60 participants (male-female ratio = 1:1) in six various age groups M (SD):54.1 (± 11.1) in months. During the pilot study, three pictures seemed a bit ambiguous requiring further description to elicit the target word. These were as follows: the word /bαd/ (meaning wind) with the prompt sentence asking, ‘What makes the leaves blow?’ [bærge deræxtαro ʧi mibære?]; the word /nej/ (meaning straw) with the prompt sentence: [ʃiro bα ʧi mixori?]: ‘What would you drink your milk with?’, and the word /riʃ/(meaning beard) with the prompt sentence: [ruje suræte mærdhα ʧi dær miαd?]: ‘What grows on a man’s face?’.
The pilot data of the study led to the development of the final draft of the Persian articulation assessment (PAA). The sounds elicited cover all consonants of the targeted syllable in the initial and final positions along with all vowels. A stimulus list was also provided for eliciting speech sounds which were not produced at the previous stage.
In this study, reliability was assessed through test-retest reliability and score-rescore consistency (
27). Test-retest analyses were reported for 52 children (13.4% with the mean age of 53.3 months) who were able to return for the re-administration of the PAA within 1 - 3 weeks after their initial test. For Score-rescore reliability (Consistency) two independent examiners who had not been involved in the PAA’s development rescored audio-video recordings of 70 children being randomly selected for the interrater (score-rescore) reliability analyses (18.8% with the mean age of 54.1 months).
Audio-video recordings were made through the assessment procedure to allow the revision of online transcription difficulties and transcription reliability measurement. For score-rescore reliability two independent examiners who had not been involved in the PAA’s development, reviewed and rescored some transcription with reference to its audio-video recordings. The current study has utilized the Kappa statistics usually being used as a measure of reproducibility between repeated assessments of the same variable (
28). Kappa amounts which are over 0.75 denote great reproducibility (
29-
31).
The evidence of construct validity was provided by a priori hypothesis patterns of association with other measures (the convergent validity with PIT) and evaluating the discriminative validity. PIT was used for evaluating convergent validity as an evidence of construct validity. For this part of study, the results of PIT and PAA were compared.
For discriminative validity a sample of 36 children aged between 3 - 6 years with and without articulation deficit (18 participants in each group, respectively whose diagnosis was approved by three experienced speech language pathologists) enrolled in this part of study. An independent t-test was utilized to analyze the difference between the two groups. Alpha level of 0.05 was considered for all statistical procedures.