Association of Clinician Diagnostic Performance With Machine Learning-Based Decision Support Systems: A Systematic Review.

Author affiliations

Nuffield Department of Surgical Sciences, University of Oxford, Oxford, United Kingdom.

Department of Radiology, University of Cambridge, Cambridge, United Kingdom.

Publication information

JAMA Netw Open. 2021 Mar 1;4(3):e211276. doi: 10.1001/jamanetworkopen.2021.1276.

DOI:10.1001/jamanetworkopen.2021.1276

PMID:33704476

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7953308/

Abstract

IMPORTANCE

An increasing number of machine learning (ML)-based clinical decision support systems (CDSSs) are described in the medical literature, but this research focuses almost entirely on comparing CDSSs directly with clinicians (human vs computer). Little is known about the outcomes of these systems when used as adjuncts to human decision-making (human vs human with computer).

OBJECTIVES

To conduct a systematic review to investigate the association between the interactive use of ML-based diagnostic CDSSs and clinician performance and to examine the extent of the CDSSs' human factors evaluation.

EVIDENCE REVIEW

A search of MEDLINE, Embase, PsycINFO, and grey literature was conducted for the period between January 1, 2010, and May 31, 2019. Peer-reviewed studies published in English comparing human clinician performance with and without interactive use of an ML-based diagnostic CDSS were included. All metrics used to assess human performance were considered as outcomes. The risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) and Risk of Bias in Non-Randomised Studies of Interventions (ROBINS-I). Narrative summaries were produced for the main outcomes. Given the heterogeneity of medical conditions, outcomes of interest, and evaluation metrics, no meta-analysis was performed.

FINDINGS

A total of 8112 studies were initially retrieved and 5154 abstracts were screened; of these, 37 studies met the inclusion criteria. The median number of participating clinicians was 4 (interquartile range, 3-8). Of the 107 results that reported statistical significance, 54 (50%) were increased by the use of CDSSs, 4 (4%) were decreased, and 49 (46%) showed no change or an unclear change. In the subgroup of studies carried out in representative clinical settings, no association between the use of ML-based diagnostic CDSSs and improved clinician performance could be observed. Interobserver agreement was the commonly reported outcome whose change was the most strongly associated with CDSS use. Four studies (11%) reported on user feedback, and, in all but 1 case, clinicians decided to override at least some of the algorithms' recommendations. Twenty-eight studies (76%) were rated as having a high risk of bias in at least 1 of the 4 QUADAS-2 core domains, and 6 studies (16%) were considered to be at serious or critical risk of bias using ROBINS-I.
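
The percentages quoted above can be recomputed from the reported counts. The following is a minimal sketch for checking that arithmetic only; the variable and function names are illustrative assumptions, and the denominators (107 results, 37 studies) are taken directly from the abstract.

```python
def pct(count, total):
    """Return count as a whole-number percentage of total."""
    return round(100 * count / total)

# Counts as reported in the Findings paragraph (names are illustrative).
RESULTS_TOTAL = 107   # results that reported statistical significance
STUDIES_TOTAL = 37    # studies meeting the inclusion criteria

results = {
    "increased": 54,
    "decreased": 4,
    "no or unclear change": 49,
}
studies = {
    "reported user feedback": 4,
    "high risk of bias (QUADAS-2)": 28,
    "serious or critical risk of bias (ROBINS-I)": 6,
}

# The three result categories should partition all 107 results.
assert sum(results.values()) == RESULTS_TOTAL

result_pcts = {name: pct(n, RESULTS_TOTAL) for name, n in results.items()}
study_pcts = {name: pct(n, STUDIES_TOTAL) for name, n in studies.items()}

for name, value in {**result_pcts, **study_pcts}.items():
    print(f"{name}: {value}%")
```

Under these assumptions the recomputed values match the rounded figures given in the abstract (50%, 4%, 46% of results; 11%, 76%, 16% of studies).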

CONCLUSIONS AND RELEVANCE

This systematic review found only sparse evidence that the use of ML-based CDSSs is associated with improved clinician diagnostic performance. Most studies had a low number of participants, were at high or unclear risk of bias, and showed little or no consideration for human factors. Caution should be exercised when estimating the current potential of ML to improve human diagnostic performance, and more comprehensive evaluation should be conducted before deploying ML-based CDSSs in clinical settings. The results highlight the importance of considering supported human decisions as end points rather than merely the stand-alone CDSS outputs.
