Köhler Susanne M, Holtz Svea, Neff Michaela C, Schaaf Jannik, von Wagner Michael, Müller Beate S, Schütze Dania
Institute of General Practice, Goethe University Frankfurt, Theodor-Stern-Kai 7, Frankfurt am Main, 60590, Germany, 49 696301 ext 7267.
Institute of Medical Informatics, Goethe University Frankfurt, University Medicine Frankfurt, Frankfurt am Main, Germany.
JMIR Form Res. 2025 Aug 20;9:e69875. doi: 10.2196/69875.
General practitioners are confronted with a wide variety of diseases and sometimes diagnostic uncertainty. Clinical decision support systems could be valuable to improve diagnosis, but existing tools are not adapted to the requirements and workflow in the primary setting. In project SATURN (Smart physician portal for patients with unclear disease), the prototype of a clinical decision support system based on artificial intelligence is being developed together with users specifically for primary care in Germany. It aims to reduce diagnostic uncertainty in cases of unclear and rare diseases and focuses on 3 medical fields. A user-centered design approach is applied for prototype development and evaluation.
This study evaluates the usability of a high-fidelity prototype and explores aspects of user experience like the subjective impression, satisfaction, and areas of improvement.
A total of 5 general practitioners participated in the evaluation, which consisted of (1) a remote think-aloud test, (2) a postsession interview, and (3) a survey with the System Usability Scale. During the think-aloud tests, the participants verbalized their thoughts and actions and solved several vignette-based tasks. Remarkable observations were logged, transcribed with quotes, and analyzed for usability problems and positive findings. All observations and interview responses were deductively assigned to the following categories: (1) content, (2) comprehensibility, (3) user-friendliness, (4) layout, (5) feedback, and (6) navigation. Usability problems were described in detail and solutions for improvement proposed. Median and total scores were calculated for all questionnaire items.
The evaluation detected both strengths and areas for improvement. The participants particularly liked the clear and well-structured layout of the prototype. Key issues identified were content-related limitations, such as the inability to enter unlisted symptoms, medications, and examination findings. Also, participants found the terminology for laboratory not suitable to their needs. Another key issue was a lack of user-friendliness concerning the time required to input medication plans and lab values. Participants expressed a need for faster data entry, potentially through direct imports from practice management systems or laboratory files. Adding symptom duration, weighting symptoms, and incorporating hereditary factors were suggestions made for improvement. Overall, the SATURN prototype was deemed useful and promising for future clinical use, despite the need for further refinements, particularly in the areas of data entry, as this is a key obstacle to its use.
The usability evaluation methods combined proved to be location independent and easy to use. They provided important findings on usability issues and improvements that will be implemented in a second high-fidelity prototype, which will also be tested by users. Technically demanding user requirements, such as direct data transfer from the practice management system and entry options that require complex data models, were beyond the scope of this project, but should be considered in future development projects.
全科医生面临着各种各样的疾病,有时诊断存在不确定性。临床决策支持系统对于改善诊断可能很有价值,但现有工具并不适应基层医疗环境的要求和工作流程。在SATURN项目(针对疾病不明患者的智能医生门户)中,正在与用户共同开发一个基于人工智能的临床决策支持系统原型,专门用于德国的基层医疗。其目的是减少不明和罕见疾病病例中的诊断不确定性,并专注于3个医学领域。在原型开发和评估中采用了以用户为中心的设计方法。
本研究评估高保真原型的可用性,并探索用户体验方面,如主观印象、满意度和改进领域。
共有5名全科医生参与了评估,评估包括(1)远程出声思维测试,(2)测试后访谈,以及(3)使用系统可用性量表进行的调查。在出声思维测试期间,参与者说出他们的想法和行动,并解决了几个基于病例的任务。记录显著的观察结果,附上引述进行转录,并分析可用性问题和积极发现。所有观察结果和访谈回复都演绎性地归入以下类别:(1)内容,(2)可理解性,(3)用户友好性,(4)布局,(5)反馈,以及(6)导航。详细描述了可用性问题,并提出了改进解决方案。计算所有问卷项目的中位数和总分。
评估发现了优点和改进领域。参与者特别喜欢原型清晰且结构良好的布局。确定的关键问题是与内容相关的限制,例如无法输入未列出的症状、药物和检查结果。此外,参与者发现实验室术语不符合他们的需求。另一个关键问题是在输入用药计划和实验室值所需时间方面缺乏用户友好性。参与者表示需要更快的数据输入,可能通过从实践管理系统或实验室文件直接导入。增加症状持续时间、对症状进行加权以及纳入遗传因素是提出的改进建议。总体而言,尽管需要进一步完善,特别是在数据输入方面,因为这是其使用的关键障碍,但SATURN原型被认为对未来临床使用有用且有前景。
事实证明,所采用的可用性评估方法不受地点限制且易于使用。它们提供了关于可用性问题和改进的重要发现,这些将在第二个高保真原型中实施,该原型也将由用户进行测试。技术要求较高的用户需求,如从实践管理系统直接数据传输和需要复杂数据模型的输入选项,超出了本项目的范围,但应在未来的开发项目中予以考虑。