Randomized Clinical Trials of Machine Learning Interventions in Health Care: A Systematic Review.

Author affiliations

Harvard Medical School, Boston, Massachusetts.

Department of Medicine, Yale University, New Haven, Connecticut.

Publication information

JAMA Netw Open. 2022 Sep 1;5(9):e2233946. doi: 10.1001/jamanetworkopen.2022.33946.

DOI:10.1001/jamanetworkopen.2022.33946

PMID:36173632

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9523495/

Abstract

IMPORTANCE

Despite the potential of machine learning to improve multiple aspects of patient care, barriers to clinical adoption remain. Randomized clinical trials (RCTs) are often a prerequisite to large-scale clinical adoption of an intervention, and important questions remain regarding how machine learning interventions are being incorporated into clinical trials in health care.

OBJECTIVE

To systematically examine the design, reporting standards, risk of bias, and inclusivity of RCTs for medical machine learning interventions.

EVIDENCE REVIEW

In this systematic review, the Cochrane Library, Google Scholar, Ovid Embase, Ovid MEDLINE, PubMed, Scopus, and Web of Science Core Collection online databases were searched and citation chasing was done to find relevant articles published from the inception of each database to October 15, 2021. Search terms for machine learning, clinical decision-making, and RCTs were used. Exclusion criteria included implementation of a non-RCT design, absence of original data, and evaluation of nonclinical interventions. Data were extracted from published articles. Trial characteristics, including primary intervention, demographics, adherence to the CONSORT-AI reporting guideline, and Cochrane risk of bias were analyzed.

FINDINGS

The literature search yielded 19 737 articles, of which 41 RCTs involved a median of 294 participants (range, 17-2488 participants). A total of 16 RCTs (39%) were published in 2021, 21 (51%) were conducted at single sites, and 15 (37%) involved endoscopy. No trials adhered to all CONSORT-AI standards. Common reasons for nonadherence were not assessing poor-quality or unavailable input data (38 trials [93%]), not analyzing performance errors (38 [93%]), and not including a statement regarding code or algorithm availability (37 [90%]). Overall risk of bias was high in 7 trials (17%). Of 11 trials (27%) that reported race and ethnicity data, the median proportion of participants from underrepresented minority groups was 21% (range, 0%-51%).
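As a quick consistency check on the figures above, the following minimal sketch recomputes each reported percentage from its trial count, assuming every percentage uses the 41 included RCTs as the denominator and is rounded to the nearest whole number. The dictionary labels and the helper name `pct` are illustrative only and do not come from the article.

```python
# Recompute the percentages reported in FINDINGS from the raw trial counts.
# Assumption: every percentage uses the 41 included RCTs as its denominator.
TOTAL_TRIALS = 41

# label -> (trial count, percentage stated in the abstract)
reported = {
    "published in 2021": (16, 39),
    "conducted at single sites": (21, 51),
    "involved endoscopy": (15, 37),
    "did not assess poor-quality or unavailable input data": (38, 93),
    "did not analyze performance errors": (38, 93),
    "no statement on code or algorithm availability": (37, 90),
    "high overall risk of bias": (7, 17),
    "reported race and ethnicity data": (11, 27),
}


def pct(count, total=TOTAL_TRIALS):
    """Return count/total as a percentage rounded to the nearest integer."""
    return round(100 * count / total)


# Collect any item whose recomputed percentage differs from the stated one.
mismatches = {
    label: (pct(count), stated)
    for label, (count, stated) in reported.items()
    if pct(count) != stated
}

for label, (count, stated) in reported.items():
    print(f"{label}: {count}/{TOTAL_TRIALS} = {pct(count)}% (stated {stated}%)")

print("mismatches:", mismatches)
```

Under that assumption, the recomputed values agree with the stated percentages, so `mismatches` comes out empty.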

CONCLUSIONS AND RELEVANCE

This systematic review found that despite the large number of medical machine learning-based algorithms in development, few RCTs for these technologies have been conducted. Among published RCTs, there was high variability in adherence to reporting standards and risk of bias and a lack of participants from underrepresented minority groups. These findings merit attention and should be considered in future RCT design and reporting.
