This was a developmental cross-sectional study conducted from May to December 2024 at Shahid Beheshti University of Medical Sciences, Tehran, Iran. The study was carried out in three main phases: (1) Identification of the minimum dataset, (2) design and implementation of the system, and (3) evaluation of usability and user satisfaction.
In the first phase, articles were initially searched in many trusted databases to compile a list of possible data elements for chronic pain management (
26-
29). An expert panel of eight pain physicians, anesthesiologists, and general practitioners then participated in a two-round Delphi study to review and refine the list. Researchers selected experts through purposive sampling, ensuring all had worked as clinicians for at least five years, had knowledge of managing chronic pain, and were willing to participate. Two weeks were allowed between each round of Delphi voting.
The questionnaire was designed using a 5-point Likert scale ranging from strongly agree (score 5) to strongly disagree (score 1) to assess the perceived necessity of data elements. The questionnaire comprised eight categories: Demographic information, patient history, pain history, a list of psychological elements and activities of daily living, diagnosis and treatment, paraclinical tests, laboratory tests, and examinations. An open-ended question was included at the end of the questionnaire for each section of data elements to allow experts to suggest additional data elements if they deemed necessary. The questionnaire was distributed to experts in the field in both paper and online formats.
Once expert feedback on the perceived necessity of the datasets was collected, the data was analyzed using descriptive statistics and SPSS software version 18. Based on the results of the questionnaire analysis, non-essential data elements that did not receive a satisfactory score were removed from the minimum dataset. Finally, the information elements that were deemed essential for recording information related to chronic pain management and that should be included in the system's database were identified. The decision to include or exclude data elements was based on the scores calculated from expert feedback. If the consensus among experts was less than 50%, the data item was removed. If the score was higher than 75%, the data item was considered approved. If a question received a score between 50 and 75%, it proceeded to a second Delphi round for further evaluation.
In the second phase, following the analysis of the data obtained from the Needs Assessment Questionnaire, the initial version of the system was developed in the visual studio code environment using NodeJS/JavaScript and Python programming languages, along with the MySQL database. To evaluate the usability and identify potential design and user interface problems in the initial prototype of the system, a heuristic evaluation was conducted using Nielsen's ten usability heuristics: Visibility of system status, match between system and the real world, user control and freedom, consistency and standards, error prevention, recognition rather than recall, flexibility and efficiency of use, aesthetic and minimalist design, help users recognize, diagnose, and recover from errors, and help and documentation.
Three medical informatics experts, each with over five years of experience and knowledge of human-computer interaction, were selected using a convenient and targeted approach to evaluate the system through a usability test. The evaluators independently assessed different parts of the system for adherence to Nielsen's ten heuristics. They identified and documented any problems in a data collection form. This form included the problem name, a complete description of the problem, the location of the problem, the violated usability principle, and the severity of the problem. The evaluators thoroughly discussed and debated the identified usability problems and their alignment with each of Nielsen's ten heuristics. Any disagreements were resolved through consensus.
The final list of problems was provided to the evaluators, who independently assessed the severity of each problem. The severity of the problem was determined based on three criteria: Repetition, continuity, and impact. For the ranking of the found problems, one of the degrees of not a problem at all (score 0), cosmetic problem only (score 1), minor problem (score 2), major problem (score 3), and catastrophic problem (score 4) was assigned (
30). The actual severity of each problem was determined by the average score of the evaluators. Finally, the identified problems were categorized based on the average severity obtained into one of five categories: Not a problem at all (score 0 - 0.5), cosmetic problem only (score 0.6 - 1.5), minor problem (score 1.5 - 2.5), major problem (score 2.6 - 3.5), and catastrophic problem (score 3.6 - 5).
In the third phase, the AramDard system was developed. Users accessed the system through dedicated panels designed for patients, medical staff, and administrators. The system enabled members to use electronic health records, submit consultation requests, upload documents, schedule appointments, follow up on referrals, and send direct messages to their doctors. Structured forms were also provided to allow the submission of paraclinical images and PDF files. The Questionnaire for User Interface Satisfaction (QUIS) was employed to evaluate user satisfaction with the system's interface (
31).
This questionnaire had 30 questions; 3 questions related to the participant's identity information and 27 other questions related to the evaluation of usability and satisfaction in 5 parts: Comments related to overall reactions to the software (6 questions), screen (4 questions), terminology and system information (6 questions), learning (6 questions), and system capabilities (5 questions) (
31). Each question had an answer with a score of 0 to 9, where the number 0 indicates the lowest level of ability and satisfaction, and the number 9 indicates the highest level of ability and satisfaction (a score of 0 - 3 is weak, 3 - 6 is moderate, 6 - 9 is good). Each user completed this questionnaire online and provided it to the researcher.
In this study, 15 patients with chronic pain and 7 pain management specialists completed the QUIS Questionnaire. Patient inclusion criteria were age ≥ 18 years, a diagnosis of chronic pain, the ability and expected use of the telemedicine platform, and willingness to participate, while patients with severe cognitive impairment or inability to provide informed consent were excluded. Specialists were selected based on relevant experience in pain management (at least 5 years) and their willingness to participate in the system evaluation and complete the QUIS Questionnaire. The summary of the implementation method is shown in
Figure 1.
Summary of the implementation method
This research was approved by the Ethics Committee of Shahid Beheshti University of Medical Sciences (ethics code:
IR.SBMU.RETECH.REC.1403.026). According to the approval granted by the Ethics Committee, verbal informed consent was obtained from all participants prior to their inclusion in the study. For each participant, verbal consent was documented at the time of the interview by the principal researcher, following procedures approved by the Ethics Committee, using an internal data collection form; This form included participant codes, interview dates, and confirmation of consent. Additionally, a member of the research team was present to witness the consent process. Since all data were collected and stored anonymously, written consent was deemed unnecessary. Prior to participation, participants were fully informed about the study’s objectives, procedures, and their right to withdraw at any time. Participants explicitly communicated their agreement to the research team.
3.1. Data Analysis
Data analysis was conducted using SPSS version 18. Descriptive statistics, including means, standard deviations, and frequency distributions, were calculated to summarize participant characteristics and responses to the QUIS Questionnaire. User satisfaction was assessed based on average scores and response distributions. Usability problems identified during the heuristic evaluation were categorized according to the heuristics they violated, and both their frequency and mean severity were computed.