• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于医疗设备的基于机器学习的分类方法的基于风险的评估。

Risk-based evaluation of machine learning-based classification methods used for medical devices.

作者信息

Haimerl Martin, Reich Christoph

机构信息

Furtwangen University of Applied Sciences, Furtwangen, Germany.

出版信息

BMC Med Inform Decis Mak. 2025 Mar 11;25(1):126. doi: 10.1186/s12911-025-02909-9.

DOI:10.1186/s12911-025-02909-9
PMID:40069689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11895222/
Abstract

BACKGROUND

In the future, more medical devices will be based on machine learning (ML) methods. In general, the consideration of risks is a crucial aspect for evaluating medical devices. Accordingly, risks and their associated costs should be taken into account when assessing the performance of ML-based medical devices. This paper addresses the following three research questions towards a risk-based evaluation with a focus on ML-based classification models.

METHODS

First, we analyzed how often risk-based metrics are currently utilized in the context of ML-based classification models. This was performed using a literature research based on a sample of recent scientific publications. Second, we introduce an approach for evaluating such models where expected risks and associated costs are integrated into the corresponding performance metrics. Additionally, we analyze the impact of different risk ratios on the resulting overall performance. Third, we elaborate how such risk-based approaches relate to regulatory requirements in the field of medical devices. A set of use case scenarios were utilized to demonstrate necessities and practical implications, in this regard.

RESULTS

First, it was shown that currently most scientific publications do not include risk-based approaches for measuring performance. Second, it was demonstrated that risk-based considerations have a substantial impact on the outcome. The relative increase of the resulting overall risks can go up to 196% when the ratio between different types of risks (false negatives vs. false positives) changes by a factor of 10.0. Third, we elaborated that risk-based considerations need to be included into the assessment of ML-based medical devices, according to the relevant EU regulations and standards. In particular, this applies when a substantial impact on the clinical outcome / in terms of the risk-benefit relationship occurs.

CONCLUSION

In summary, we demonstrated the necessity of a risk-based approach for the evaluation of medical devices which include ML-based classification methods. We showed that currently many scientific papers in this area do not include risk considerations. We developed basic steps towards a risk-based assessment of ML-based classifiers and elaborated consequences that could occur, when these steps are neglected. And, we demonstrated the consistency of our approach with current regulatory requirements in the EU.

摘要

背景

未来,更多的医疗设备将基于机器学习(ML)方法。一般来说,风险考量是评估医疗设备的关键方面。因此,在评估基于ML的医疗设备性能时,应考虑风险及其相关成本。本文针对基于风险的评估提出了以下三个研究问题,重点关注基于ML的分类模型。

方法

首先,我们分析了基于ML的分类模型背景下当前使用基于风险的指标的频率。这是通过对近期科学出版物样本进行文献研究来完成的。其次,我们引入了一种评估此类模型的方法,即将预期风险和相关成本纳入相应的性能指标中。此外,我们分析了不同风险比率对最终整体性能的影响。第三,我们阐述了这种基于风险的方法与医疗器械领域监管要求的关系。在这方面,利用一组用例场景来展示必要性和实际意义。

结果

首先,结果表明目前大多数科学出版物都没有包括用于衡量性能的基于风险的方法。其次,结果表明基于风险的考量对结果有重大影响。当不同类型风险(假阴性与假阳性)之间的比率变化10.0倍时,最终整体风险的相对增加可高达196%。第三,我们阐述了根据欧盟相关法规和标准,基于风险的考量需要纳入对基于ML的医疗设备的评估中。特别是,当对临床结果/在风险效益关系方面产生重大影响时,情况更是如此。

结论

总之,我们证明了对包括基于ML的分类方法的医疗设备进行基于风险的评估的必要性。我们表明目前该领域的许多科学论文都没有考虑风险因素。我们制定了对基于ML的分类器进行基于风险评估的基本步骤,并阐述了忽略这些步骤可能产生的后果。而且,我们证明了我们的方法与欧盟当前监管要求的一致性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/51efd541fd1b/12911_2025_2909_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/64af7bfa6b97/12911_2025_2909_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/67ffac8385fc/12911_2025_2909_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/230a79d60963/12911_2025_2909_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/06a59e1f30e9/12911_2025_2909_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/f7f1ab94cd98/12911_2025_2909_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/51efd541fd1b/12911_2025_2909_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/64af7bfa6b97/12911_2025_2909_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/67ffac8385fc/12911_2025_2909_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/230a79d60963/12911_2025_2909_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/06a59e1f30e9/12911_2025_2909_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/f7f1ab94cd98/12911_2025_2909_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/11895222/51efd541fd1b/12911_2025_2909_Fig6_HTML.jpg

相似文献

1
Risk-based evaluation of machine learning-based classification methods used for medical devices.用于医疗设备的基于机器学习的分类方法的基于风险的评估。
BMC Med Inform Decis Mak. 2025 Mar 11;25(1):126. doi: 10.1186/s12911-025-02909-9.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Consideration of Cybersecurity Risks in the Benefit-Risk Analysis of Medical Devices: Scoping Review.医疗设备获益-风险分析中的网络安全风险考量:范围综述
J Med Internet Res. 2024 Dec 24;26:e65528. doi: 10.2196/65528.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
6
Legal regulation of the production and trade of medical devices and medical equipment in the EU and US: experience for Ukraine.欧盟和美国医疗器械及医疗设备生产与贸易的法律监管:乌克兰的经验
Wiad Lek. 2017;70(3 pt 2):614-618.
7
The Effectiveness of Integrated Care Pathways for Adults and Children in Health Care Settings: A Systematic Review.综合护理路径在医疗环境中对成人和儿童的有效性:一项系统评价。
JBI Libr Syst Rev. 2009;7(3):80-129. doi: 10.11124/01938924-200907030-00001.
8
From BSE to MDT, scientific opinion and public policy on risks in medical devices.从疯牛病到多学科团队协作,关于医疗设备风险的科学观点与公共政策
Med Device Technol. 1998 Nov;9(9):6-8, 10-1.
9
An integrated benefit-risk assessment of cobalt-containing alloys used in medical devices: Implications for regulatory requirements in the European Union.医疗器械中含钴合金的综合获益-风险评估:对欧盟监管要求的影响。
Regul Toxicol Pharmacol. 2021 Oct;125:105004. doi: 10.1016/j.yrtph.2021.105004. Epub 2021 Jul 10.
10
Review of Machine Learning Techniques in Soft Tissue Biomechanics and Biomaterials.机器学习技术在软组织生物力学和生物材料中的应用综述。
Cardiovasc Eng Technol. 2024 Oct;15(5):522-549. doi: 10.1007/s13239-024-00737-y. Epub 2024 Jul 2.

本文引用的文献

1
Metrics reloaded: recommendations for image analysis validation.重新加载指标:图像分析验证的建议。
Nat Methods. 2024 Feb;21(2):195-212. doi: 10.1038/s41592-023-02151-z. Epub 2024 Feb 12.
2
Comparison of Classification Success Rates of Different Machine Learning Algorithms in the Diagnosis of Breast Cancer.不同机器学习算法在乳腺癌诊断中的分类成功率比较。
Asian Pac J Cancer Prev. 2022 Oct 1;23(10):3287-3297. doi: 10.31557/APJCP.2022.23.10.3287.
3
Efficient Model for Coronary Artery Disease Diagnosis: A Comparative Study of Several Machine Learning Algorithms.
用于冠心病诊断的高效模型:几种机器学习算法的比较研究。
J Healthc Eng. 2022 Oct 18;2022:5359540. doi: 10.1155/2022/5359540. eCollection 2022.
4
Predictive Analysis of Diabetes-Risk with Class Imbalance.糖尿病风险的不平衡分类预测分析。
Comput Intell Neurosci. 2022 Oct 11;2022:3078025. doi: 10.1155/2022/3078025. eCollection 2022.
5
In-hospital risk stratification algorithm of Asian elderly patients.亚洲老年住院患者的院内风险分层算法。
Sci Rep. 2022 Oct 20;12(1):17592. doi: 10.1038/s41598-022-18839-9.
6
Detecting and Analyzing Suicidal Ideation on Social Media Using Deep Learning and Machine Learning Models.利用深度学习和机器学习模型检测和分析社交媒体上的自杀意念。
Int J Environ Res Public Health. 2022 Oct 3;19(19):12635. doi: 10.3390/ijerph191912635.
7
Automated assessment of balance: A neural network approach based on large-scale balance function data.基于大规模平衡功能数据的神经网络平衡自动评估方法。
Front Public Health. 2022 Sep 21;10:882811. doi: 10.3389/fpubh.2022.882811. eCollection 2022.
8
SYNPRED: prediction of drug combination effects in cancer using different synergy metrics and ensemble learning.SYNPRED:使用不同协同作用指标和集成学习预测癌症药物组合的效果。
Gigascience. 2022 Sep 26;11. doi: 10.1093/gigascience/giac087.
9
Machine-learning-derived predictive score for early estimation of COVID-19 mortality risk in hospitalized patients.基于机器学习的预测评分模型,用于早期评估住院患者 COVID-19 死亡风险。
PLoS One. 2022 Sep 22;17(9):e0274171. doi: 10.1371/journal.pone.0274171. eCollection 2022.
10
Application of machine learning algorithms in predicting HIV infection among men who have sex with men: Model development and validation.机器学习算法在预测男男性行为者中 HIV 感染中的应用:模型开发和验证。
Front Public Health. 2022 Aug 25;10:967681. doi: 10.3389/fpubh.2022.967681. eCollection 2022.