Institute of Global Health Innovation, Imperial College London, London, United Kingdom.
Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom.
JMIR Res Protoc. 2024 Sep 18;13:e58202. doi: 10.2196/58202.
Quality assessment of diagnostic accuracy studies (QUADAS), and more recently QUADAS-2, were developed to aid the evaluation of methodological quality within primary diagnostic accuracy studies. However, its current form, QUADAS-2 does not address the unique considerations raised by artificial intelligence (AI)-centered diagnostic systems. The rapid progression of the AI diagnostics field mandates suitable quality assessment tools to determine the risk of bias and applicability, and subsequently evaluate translational potential for clinical practice.
We aim to develop an AI-specific QUADAS (QUADAS-AI) tool that addresses the specific challenges associated with the appraisal of AI diagnostic accuracy studies. This paper describes the processes and methods that will be used to develop QUADAS-AI.
The development of QUADAS-AI can be distilled into 3 broad stages. Stage 1-a project organization phase had been undertaken, during which a project team and a steering committee were established. The steering committee consists of a panel of international experts representing diverse stakeholder groups. Following this, the scope of the project was finalized. Stage 2-an item generation process will be completed following (1) a mapping review, (2) a meta-research study, (3) a scoping survey of international experts, and (4) a patient and public involvement and engagement exercise. Candidate items will then be put forward to the international Delphi panel to achieve consensus for inclusion in the revised tool. A modified Delphi consensus methodology involving multiple online rounds and a final consensus meeting will be carried out to refine the tool, following which the initial QUADAS-AI tool will be drafted. A piloting phase will be carried out to identify components that are considered to be either ambiguous or missing. Stage 3-once the steering committee has finalized the QUADAS-AI tool, specific dissemination strategies will be aimed toward academic, policy, regulatory, industry, and public stakeholders, respectively.
As of July 2024, the project organization phase, as well as the mapping review and meta-research study, have been completed. We aim to complete the item generation, including the Delphi consensus, and finalize the tool by the end of 2024. Therefore, QUADAS-AI will be able to provide a consensus-derived platform upon which stakeholders may systematically appraise the methodological quality associated with AI diagnostic accuracy studies by the beginning of 2025.
AI-driven systems comprise an increasingly significant proportion of research in clinical diagnostics. Through this process, QUADAS-AI will aid the evaluation of studies in this domain in order to identify bias and applicability concerns. As such, QUADAS-AI may form a key part of clinical, governmental, and regulatory evaluation frameworks for AI diagnostic systems globally.
INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): DERR1-10.2196/58202.
质量评估诊断准确性研究(QUADAS),以及最近的 QUADAS-2,旨在帮助评估主要诊断准确性研究中的方法学质量。然而,其当前形式 QUADAS-2 并未解决人工智能(AI)为中心的诊断系统带来的独特考虑因素。人工智能诊断领域的快速发展需要合适的质量评估工具来确定偏倚和适用性风险,并随后评估其在临床实践中的转化潜力。
我们旨在开发一种专门针对 AI 的 QUADAS(QUADAS-AI)工具,以解决评估 AI 诊断准确性研究相关的特定挑战。本文描述了开发 QUADAS-AI 将采用的流程和方法。
QUADAS-AI 的开发可以概括为 3 个广泛的阶段。第 1 阶段是项目组织阶段,在此期间成立了一个项目团队和一个指导委员会。指导委员会由一个代表不同利益相关者群体的国际专家小组组成。在此之后,确定了项目的范围。第 2 阶段是将完成项目条目生成过程,包括(1)映射审查,(2)元研究,(3)国际专家范围调查,以及(4)患者和公众参与和参与工作。候选条目将提交给国际 Delphi 小组,以就纳入修订工具达成共识。将采用改良的 Delphi 共识方法,包括多个在线轮次和一次最终共识会议,以完善工具,之后将起草初始 QUADAS-AI 工具。将进行试点阶段,以确定被认为模棱两可或缺失的组件。第 3 阶段是指导委员会完成 QUADAS-AI 工具后,将分别针对学术、政策、监管、行业和公众利益相关者制定具体的传播策略。
截至 2024 年 7 月,项目组织阶段以及映射审查和元研究已经完成。我们计划在 2024 年底前完成条目生成,包括 Delphi 共识,并最终确定工具。因此,QUADAS-AI 将能够提供一个共识驱动的平台,利益相关者可以通过该平台系统地评估与 AI 诊断准确性研究相关的方法学质量,预计在 2025 年初投入使用。
人工智能驱动的系统在临床诊断研究中占比越来越大。通过这一过程,QUADAS-AI 将有助于评估该领域的研究,以识别偏倚和适用性问题。因此,QUADAS-AI 可能成为全球 AI 诊断系统临床、政府和监管评估框架的重要组成部分。
国际注册报告标识符(IRRID):DERR1-10.2196/58202。