• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习模型进行医学诊断的研究报告质量:系统评价。

Reporting quality of studies using machine learning models for medical diagnosis: a systematic review.

机构信息

Health Professions, Manchester Metropolitan University, Manchester, UK

Centre for Research and Interdisciplinarity (CRI), Université Paris Descartes, Paris, Île-de-France, France.

出版信息

BMJ Open. 2020 Mar 23;10(3):e034568. doi: 10.1136/bmjopen-2019-034568.

DOI:10.1136/bmjopen-2019-034568
PMID:32205374
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7103817/
Abstract

AIMS

We conducted a systematic review assessing the reporting quality of studies validating models based on machine learning (ML) for clinical diagnosis, with a specific focus on the reporting of information concerning the participants on which the diagnostic task was evaluated on.

METHOD

Medline Core Clinical Journals were searched for studies published between July 2015 and July 2018. Two reviewers independently screened the retrieved articles, a third reviewer resolved any discrepancies. An extraction list was developed from the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis guideline. Two reviewers independently extracted the data from the eligible articles. Third and fourth reviewers checked, verified the extracted data as well as resolved any discrepancies between the reviewers.

RESULTS

The search results yielded 161 papers, of which 28 conformed to the eligibility criteria. Detail of data source was reported in 24 of the 28 papers. For all of the papers, the set of patients on which the ML-based diagnostic system was evaluated was partitioned from a larger dataset, and the method for deriving such set was always reported. Information on the diagnostic/non-diagnostic classification was reported well (23/28). The least reported items were the use of reporting guideline (0/28), distribution of disease severity (8/28 patient flow diagram (10/28) and distribution of alternative diagnosis (10/28). A large proportion of studies (23/28) had a delay between the conduct of the reference standard and ML tests, while one study did not and four studies were unclear. For 15 studies, it was unclear whether the evaluation group corresponded to the setting in which the ML test will be applied to.

CONCLUSION

All studies in this review failed to use reporting guidelines, and a large proportion of them lacked adequate detail on participants, making it difficult to replicate, assess and interpret study findings.

PROSPERO REGISTRATION NUMBER

CRD42018099167.

摘要

目的

我们进行了一项系统评价,评估了基于机器学习(ML)的临床诊断模型验证研究的报告质量,特别关注评估诊断任务所依据的参与者信息的报告情况。

方法

检索了 2015 年 7 月至 2018 年 7 月发表的研究。两名审查员独立筛选检索到的文章,第三名审查员解决任何分歧。从 Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis 指南中制定了一个提取清单。两名审查员独立从合格文章中提取数据。第三和第四名审查员检查、验证提取的数据以及解决审查员之间的任何分歧。

结果

搜索结果产生了 161 篇论文,其中 28 篇符合入选标准。28 篇论文中有 24 篇报告了数据来源的详细信息。对于所有的论文,基于 ML 的诊断系统评估的患者集合是从更大的数据集分离出来的,并且总是报告了这种集合的推导方法。关于诊断/非诊断分类的信息报告得很好(23/28)。报告最少的项目是使用报告指南(0/28)、疾病严重程度分布(8/28 患者流程图(10/28)和替代诊断分布(10/28)。很大一部分研究(23/28)在参考标准和 ML 测试之间存在延迟,而一项研究没有延迟,四项研究不清楚。对于 15 项研究,不清楚评估组是否对应于 ML 测试将应用的环境。

结论

本综述中的所有研究都没有使用报告指南,其中很大一部分研究缺乏参与者的充分细节,使得难以复制、评估和解释研究结果。

PROSPERO 注册号:CRD42018099167。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ebc/7103817/8e8b63312e69/bmjopen-2019-034568f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ebc/7103817/8e8b63312e69/bmjopen-2019-034568f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ebc/7103817/8e8b63312e69/bmjopen-2019-034568f01.jpg

相似文献

1
Reporting quality of studies using machine learning models for medical diagnosis: a systematic review.使用机器学习模型进行医学诊断的研究报告质量:系统评价。
BMJ Open. 2020 Mar 23;10(3):e034568. doi: 10.1136/bmjopen-2019-034568.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
Protocol for a systematic review on the methodological and reporting quality of prediction model studies using machine learning techniques.使用机器学习技术的预测模型研究的方法学和报告质量的系统评价议定书。
BMJ Open. 2020 Nov 11;10(11):e038832. doi: 10.1136/bmjopen-2020-038832.
5
Diagnostic accuracy of machine-learning-assisted detection for anterior cruciate ligament injury based on magnetic resonance imaging: Protocol for a systematic review and meta-analysis.基于磁共振成像的机器学习辅助检测前交叉韧带损伤的诊断准确性:系统评价与荟萃分析方案
Medicine (Baltimore). 2019 Dec;98(50):e18324. doi: 10.1097/MD.0000000000018324.
6
Consolidated standards of reporting trials (CONSORT) and the completeness of reporting of randomised controlled trials (RCTs) published in medical journals.试验报告的统一标准(CONSORT)以及医学期刊上发表的随机对照试验(RCT)的报告完整性。
Cochrane Database Syst Rev. 2012 Nov 14;11(11):MR000030. doi: 10.1002/14651858.MR000030.pub2.
7
Consolidated Reporting Guidelines for Prognostic and Diagnostic Machine Learning Modeling Studies: Development and Validation.用于预后和诊断机器学习建模研究的综合报告指南:制定和验证。
J Med Internet Res. 2023 Aug 31;25:e48763. doi: 10.2196/48763.
8
Completeness of reporting of clinical prediction models developed using supervised machine learning: a systematic review.基于监督机器学习开发的临床预测模型报告的完整性:系统评价。
BMC Med Res Methodol. 2022 Jan 13;22(1):12. doi: 10.1186/s12874-021-01469-6.
9
Beyond the black stump: rapid reviews of health research issues affecting regional, rural and remote Australia.超越黑木树:影响澳大利亚地区、农村和偏远地区的健康研究问题的快速综述。
Med J Aust. 2020 Dec;213 Suppl 11:S3-S32.e1. doi: 10.5694/mja2.50881.
10
Protocol for development of a reporting guideline (TRIPOD-AI) and risk of bias tool (PROBAST-AI) for diagnostic and prognostic prediction model studies based on artificial intelligence.基于人工智能的诊断和预后预测模型研究报告指南(TRIPOD-AI)和偏倚风险工具(PROBAST-AI)制定方案。
BMJ Open. 2021 Jul 9;11(7):e048008. doi: 10.1136/bmjopen-2020-048008.

引用本文的文献

1
TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods: a Korean translation.TRIPOD+AI声明:使用回归或机器学习方法的临床预测模型报告的更新指南:韩文翻译
Ewha Med J. 2025 Jul;48(3):e48. doi: 10.12771/emj.2025.00668. Epub 2025 Jul 31.
2
PROBAST+AI: an updated quality, risk of bias, and applicability assessment tool for prediction models using regression or artificial intelligence methods.PROBAST+AI:一种用于使用回归或人工智能方法的预测模型的更新后的质量、偏倚风险和适用性评估工具。
BMJ. 2025 Mar 24;388:e082505. doi: 10.1136/bmj-2024-082505.
3

本文引用的文献

1
A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis.深度学习在医学影像疾病检测方面的性能与医疗保健专业人员的比较:系统评价和荟萃分析。
Lancet Digit Health. 2019 Oct;1(6):e271-e297. doi: 10.1016/S2589-7500(19)30123-2. Epub 2019 Sep 25.
2
Reporting of artificial intelligence prediction models.人工智能预测模型的报告。
Lancet. 2019 Apr 20;393(10181):1577-1579. doi: 10.1016/S0140-6736(19)30037-6.
3
Design Characteristics of Studies Reporting the Performance of Artificial Intelligence Algorithms for Diagnostic Analysis of Medical Images: Results from Recently Published Papers.
Comprehensive Analysis of Cardiovascular Diseases: Symptoms, Diagnosis, and AI Innovations.
心血管疾病综合分析:症状、诊断与人工智能创新
Bioengineering (Basel). 2024 Dec 7;11(12):1239. doi: 10.3390/bioengineering11121239.
4
Integrative Stacking Machine Learning Model for Small Cell Lung Cancer Prediction Using Metabolomics Profiling.基于代谢组学分析的小细胞肺癌预测集成堆叠机器学习模型
Cancers (Basel). 2024 Dec 18;16(24):4225. doi: 10.3390/cancers16244225.
5
Studies of Artificial Intelligence/Machine Learning Registered on ClinicalTrials.gov: Cross-Sectional Study With Temporal Trends, 2010-2023.在 ClinicalTrials.gov 上注册的人工智能/机器学习研究:2010-2023 年的时间趋势横断面研究。
J Med Internet Res. 2024 Oct 25;26:e57750. doi: 10.2196/57750.
6
Machine learning-based prognostic model for 30-day mortality prediction in Sepsis-3.基于机器学习的 Sepsis-3 30 天死亡率预测预后模型。
BMC Med Inform Decis Mak. 2024 Sep 9;24(1):249. doi: 10.1186/s12911-024-02655-4.
7
Using machine learning methods to predict all-cause somatic hospitalizations in adults: A systematic review.使用机器学习方法预测成年人全因躯体住院治疗:系统评价。
PLoS One. 2024 Aug 23;19(8):e0309175. doi: 10.1371/journal.pone.0309175. eCollection 2024.
8
A Novel Machine-Learning Algorithm to Predict the Early Termination of Nutrition Support Team Follow-Up in Hospitalized Adults: A Retrospective Cohort Study.一种预测住院成人营养支持团队随访提前终止的新型机器学习算法:一项回顾性队列研究。
Nutrients. 2024 Jul 31;16(15):2492. doi: 10.3390/nu16152492.
9
AI Quality Standards in Health Care: Rapid Umbrella Review.医疗保健中的人工智能质量标准:快速伞式综述。
J Med Internet Res. 2024 May 22;26:e54705. doi: 10.2196/54705.
10
REFORMS: Consensus-based Recommendations for Machine-learning-based Science.改革:基于共识的机器学习科学建议。
Sci Adv. 2024 May 3;10(18):eadk3452. doi: 10.1126/sciadv.adk3452. Epub 2024 May 1.
研究报告用于医学图像诊断分析的人工智能算法性能的设计特点:近期发表论文的结果。
Korean J Radiol. 2019 Mar;20(3):405-410. doi: 10.3348/kjr.2019.0025.
4
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models.系统评价显示,机器学习在临床预测模型中并未优于逻辑回归。
J Clin Epidemiol. 2019 Jun;110:12-22. doi: 10.1016/j.jclinepi.2019.02.004. Epub 2019 Feb 11.
5
High-performance medicine: the convergence of human and artificial intelligence.高性能医学:人机智能融合。
Nat Med. 2019 Jan;25(1):44-56. doi: 10.1038/s41591-018-0300-7. Epub 2019 Jan 7.
6
Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction.医学诊断和预测人工智能技术临床效能评估的方法学指南
Radiology. 2018 Mar;286(3):800-809. doi: 10.1148/radiol.2017171920. Epub 2018 Jan 8.
7
STARD 2015 guidelines for reporting diagnostic accuracy studies: explanation and elaboration.《STARD 2015诊断准确性研究报告指南:解释与详述》
BMJ Open. 2016 Nov 14;6(11):e012799. doi: 10.1136/bmjopen-2016-012799.
8
Dermatologist-level classification of skin cancer with deep neural networks.基于深度神经网络的皮肤癌皮肤科医生级分类。
Nature. 2017 Feb 2;542(7639):115-118. doi: 10.1038/nature21056. Epub 2017 Jan 25.
9
Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View.生物医学研究中机器学习预测模型开发与报告指南:多学科视角
J Med Internet Res. 2016 Dec 16;18(12):e323. doi: 10.2196/jmir.5870.
10
Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.预测未来——大数据、机器学习与临床医学。
N Engl J Med. 2016 Sep 29;375(13):1216-9. doi: 10.1056/NEJMp1606181.