• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

与评估人工智能的临床试验完成相关的试验因素:回顾性病例对照研究。

Trial Factors Associated With Completion of Clinical Trials Evaluating AI: Retrospective Case-Control Study.

机构信息

Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada.

Michael G. DeGroote School of Medicine, McMaster University, Hamilton, ON, Canada.

出版信息

J Med Internet Res. 2024 Sep 23;26:e58578. doi: 10.2196/58578.

DOI:10.2196/58578
PMID:39312296
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11459098/
Abstract

BACKGROUND

Evaluation of artificial intelligence (AI) tools in clinical trials remains the gold standard for translation into clinical settings. However, design factors associated with successful trial completion and the common reasons for trial failure are unknown.

OBJECTIVE

This study aims to compare trial design factors of complete and incomplete clinical trials testing AI tools. We conducted a case-control study of complete (n=485) and incomplete (n=51) clinical trials that evaluated AI as an intervention of ClinicalTrials.gov.

METHODS

Trial design factors, including area of clinical application, intended use population, and intended role of AI, were extracted. Trials that did not evaluate AI as an intervention and active trials were excluded. The assessed trial design factors related to AI interventions included the domain of clinical application related to organ systems; intended use population for patients or health care providers; and the role of AI for different applications in patient-facing clinical workflows, such as diagnosis, screening, and treatment. In addition, we also assessed general trial design factors including study type, allocation, intervention model, masking, age, sex, funder, continent, length of time, sample size, number of enrollment sites, and study start year. The main outcome was the completion of the clinical trial. Odds ratio (OR) and 95% CI values were calculated for all trial design factors using propensity-matched, multivariable logistic regression.

RESULTS

We queried ClinicalTrials.gov on December 23, 2023, using AI keywords to identify complete and incomplete trials testing AI technologies as a primary intervention, yielding 485 complete and 51 incomplete trials for inclusion in this study. Our nested propensity-matched, case-control results suggest that trials conducted in Europe were significantly associated with trial completion when compared with North American trials (OR 2.85, 95% CI 1.14-7.10; P=.03), and the trial sample size was positively associated with trial completion (OR 1.00, 95% CI 1.00-1.00; P=.02).

CONCLUSIONS

Our case-control study is one of the first to identify trial design factors associated with completion of AI trials and catalog study-reported reasons for AI trial failure. We observed that trial design factors positively associated with trial completion include trials conducted in Europe and sample size. Given the promising clinical use of AI tools in health care, our results suggest that future translational research should prioritize addressing the design factors of AI clinical trials associated with trial incompletion and common reasons for study failure.

摘要

背景

评估人工智能(AI)工具在临床试验中的应用仍然是将其转化为临床应用的金标准。然而,成功完成试验的设计因素以及试验失败的常见原因尚不清楚。

目的

本研究旨在比较完全和不完全临床试验的试验设计因素,以评估 AI 工具。我们对 ClinicalTrials.gov 评估 AI 作为干预措施的完全(n=485)和不完全(n=51)临床试验进行了病例对照研究。

方法

提取了临床试验设计因素,包括临床应用领域、预期使用人群和 AI 的预期作用。未评估 AI 作为干预措施的试验和正在进行的试验被排除在外。评估的与 AI 干预相关的试验设计因素包括与器官系统相关的临床应用领域;患者或医疗保健提供者的预期使用人群;以及 AI 在患者面向临床工作流程中的不同应用的作用,例如诊断、筛查和治疗。此外,我们还评估了一般临床试验设计因素,包括研究类型、分配、干预模型、掩蔽、年龄、性别、资助者、大陆、试验持续时间、样本量、入组地点数量和研究启动年份。主要结局是临床试验的完成情况。使用倾向评分匹配的多变量逻辑回归计算所有临床试验设计因素的比值比(OR)和 95%置信区间(CI)值。

结果

我们于 2023 年 12 月 23 日在 ClinicalTrials.gov 上使用 AI 关键字查询,以确定作为主要干预措施的 AI 技术的完全和不完全试验,共纳入 485 项完全和 51 项不完全试验进行研究。我们嵌套的倾向评分匹配病例对照结果表明,与北美试验相比,在欧洲进行的试验与试验完成显著相关(OR 2.85,95%CI 1.14-7.10;P=.03),并且试验样本量与试验完成呈正相关(OR 1.00,95%CI 1.00-1.00;P=.02)。

结论

我们的病例对照研究是首次确定与 AI 试验完成相关的试验设计因素并记录 AI 试验失败原因的研究之一。我们观察到与试验完成呈正相关的试验设计因素包括在欧洲进行的试验和样本量。鉴于 AI 工具在医疗保健中的临床应用前景广阔,我们的研究结果表明,未来的转化研究应优先解决与 AI 临床试验不完整和常见研究失败原因相关的设计因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8189/11459098/9bf44199f053/jmir_v26i1e58578_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8189/11459098/9bf44199f053/jmir_v26i1e58578_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8189/11459098/9bf44199f053/jmir_v26i1e58578_fig1.jpg

相似文献

1
Trial Factors Associated With Completion of Clinical Trials Evaluating AI: Retrospective Case-Control Study.与评估人工智能的临床试验完成相关的试验因素:回顾性病例对照研究。
J Med Internet Res. 2024 Sep 23;26:e58578. doi: 10.2196/58578.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Artificial intelligence-supported screen reading versus standard double reading in the Mammography Screening with Artificial Intelligence trial (MASAI): a clinical safety analysis of a randomised, controlled, non-inferiority, single-blinded, screening accuracy study.人工智能支持的屏幕阅读与人工智能筛查中的标准双读(MASAI)试验:一项随机、对照、非劣效、单盲、筛查准确性研究的临床安全性分析。
Lancet Oncol. 2023 Aug;24(8):936-944. doi: 10.1016/S1470-2045(23)00298-X.
4
Noncompletion and nonpublication of trials studying rare diseases: A cross-sectional analysis.罕见病研究试验的完成和发表情况:一项横断面分析。
PLoS Med. 2019 Nov 21;16(11):e1002966. doi: 10.1371/journal.pmed.1002966. eCollection 2019 Nov.
5
A Phase I/II Clinical Trial to evaluate the efficacy of baricitinib to prevent respiratory insufficiency progression in onco-hematological patients affected with COVID19: A structured summary of a study protocol for a randomised controlled trial.一项评估巴瑞替尼预防 COVID19 相关血液肿瘤患者呼吸功能不全进展的疗效的 I/II 期临床试验:一项随机对照试验研究方案的结构化总结。
Trials. 2021 Feb 5;22(1):116. doi: 10.1186/s13063-021-05072-4.
6
The role of patient-reported outcome measures in trials of artificial intelligence health technologies: a systematic evaluation of ClinicalTrials.gov records (1997-2022).患者报告结局测量在人工智能健康技术试验中的作用:对 ClinicalTrials.gov 记录的系统评价(1997-2022)。
Lancet Digit Health. 2023 Mar;5(3):e160-e167. doi: 10.1016/S2589-7500(22)00249-7.
7
Automated Large Vessel Occlusion Detection Software and Thrombectomy Treatment Times: A Cluster Randomized Clinical Trial.自动化大血管闭塞检测软件与取栓治疗时间:一项集群随机临床试验。
JAMA Neurol. 2023 Nov 1;80(11):1182-1190. doi: 10.1001/jamaneurol.2023.3206.
8
Randomized Controlled Trials of Artificial Intelligence in Clinical Practice: Systematic Review.人工智能在临床实践中的随机对照试验:系统评价。
J Med Internet Res. 2022 Aug 25;24(8):e37188. doi: 10.2196/37188.
9
Artificial intelligence for detecting keratoconus.人工智能在圆锥角膜检测中的应用。
Cochrane Database Syst Rev. 2023 Nov 15;11(11):CD014911. doi: 10.1002/14651858.CD014911.pub2.
10
Ethical Considerations in the Design and Conduct of Clinical Trials of Artificial Intelligence.人工智能临床试验设计和实施中的伦理考虑。
JAMA Netw Open. 2024 Sep 3;7(9):e2432482. doi: 10.1001/jamanetworkopen.2024.32482.

引用本文的文献

1
Concordance with SPIRIT-AI guidelines in reporting of randomized controlled trial protocols investigating artificial intelligence in oncology: a systematic review.在肿瘤学中研究人工智能的随机对照试验方案报告中与SPIRIT-AI指南的一致性:一项系统评价。
Oncologist. 2025 May 8;30(5). doi: 10.1093/oncolo/oyaf112.

本文引用的文献

1
APPRAISE-AI Tool for Quantitative Evaluation of AI Studies for Clinical Decision Support.用于临床决策支持的人工智能研究定量评估的 APPRAISE-AI 工具。
JAMA Netw Open. 2023 Sep 5;6(9):e2335377. doi: 10.1001/jamanetworkopen.2023.35377.
2
Revolutionizing healthcare: the role of artificial intelligence in clinical practice.人工智能在临床实践中的应用:医疗保健的革命。
BMC Med Educ. 2023 Sep 22;23(1):689. doi: 10.1186/s12909-023-04698-z.
3
The role of patient-reported outcome measures in trials of artificial intelligence health technologies: a systematic evaluation of ClinicalTrials.gov records (1997-2022).
患者报告结局测量在人工智能健康技术试验中的作用:对 ClinicalTrials.gov 记录的系统评价(1997-2022)。
Lancet Digit Health. 2023 Mar;5(3):e160-e167. doi: 10.1016/S2589-7500(22)00249-7.
4
Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review.导致医疗保健差距长期存在的人工智能偏差来源——一项全球综述。
PLOS Digit Health. 2022 Mar 31;1(3):e0000022. doi: 10.1371/journal.pdig.0000022. eCollection 2022 Mar.
5
Evaluation of a decided sample size in machine learning applications.机器学习应用中确定样本量的评估。
BMC Bioinformatics. 2023 Feb 14;24(1):48. doi: 10.1186/s12859-023-05156-9.
6
Randomized Clinical Trials of Machine Learning Interventions in Health Care: A Systematic Review.机器学习干预在医疗保健中的随机临床试验:系统评价。
JAMA Netw Open. 2022 Sep 1;5(9):e2233946. doi: 10.1001/jamanetworkopen.2022.33946.
7
Clinical trial registration was associated with lower risk of bias compared with non-registered trials among trials included in systematic reviews.在纳入系统评价的试验中,与未注册的试验相比,临床试验注册与较低的偏倚风险相关。
J Clin Epidemiol. 2022 May;145:164-173. doi: 10.1016/j.jclinepi.2022.01.012. Epub 2022 Jan 23.
8
AI in health and medicine.人工智能在医疗中的应用。
Nat Med. 2022 Jan;28(1):31-38. doi: 10.1038/s41591-021-01614-0. Epub 2022 Jan 20.
9
Toward Generalizability in the Deployment of Artificial Intelligence in Radiology: Role of Computation Stress Testing to Overcome Underspecification.迈向人工智能在放射学应用中的可推广性:计算压力测试在克服规格不足方面的作用
Radiol Artif Intell. 2021 Oct 27;3(6):e210097. doi: 10.1148/ryai.2021210097. eCollection 2021 Nov.
10
Design and Analysis Methods for Trials with AI-Based Diagnostic Devices for Breast Cancer.基于人工智能的乳腺癌诊断设备试验的设计与分析方法
J Pers Med. 2021 Nov 4;11(11):1150. doi: 10.3390/jpm11111150.