一种用于在医疗设备安全性评估中调整学习效应的机器学习框架。

A machine learning framework to adjust for learning effects in medical device safety evaluation.

作者信息

Koola Jejo D, Ramesh Karthik, Mao Jialin, Ahn Minyoung, Davis Sharon E, Govindarajulu Usha, Perkins Amy M, Westerman Dax, Ssemaganda Henry, Speroff Theodore, Ohno-Machado Lucila, Ramsay Craig R, Sedrakyan Art, Resnic Frederic S, Matheny Michael E

机构信息

Department of Medicine, University of California San Diego, San Diego, CA 92093, United States.

School of Medicine, University of California San Diego, San Diego, CA 92093, United States.

出版信息

J Am Med Inform Assoc. 2025 Jan 1;32(1):206-217. doi: 10.1093/jamia/ocae273.

DOI:10.1093/jamia/ocae273

PMID:39471493

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11648715/

Abstract

OBJECTIVES

Traditional methods for medical device post-market surveillance often fail to accurately account for operator learning effects, leading to biased assessments of device safety. These methods struggle with non-linearity, complex learning curves, and time-varying covariates, such as physician experience. To address these limitations, we sought to develop a machine learning (ML) framework to detect and adjust for operator learning effects.

MATERIALS AND METHODS

A gradient-boosted decision tree ML method was used to analyze synthetic datasets that replicate the complexity of clinical scenarios involving high-risk medical devices. We designed this process to detect learning effects using a risk-adjusted cumulative sum method, quantify the excess adverse event rate attributable to operator inexperience, and adjust for these alongside patient factors in evaluating device safety signals. To maintain integrity, we employed blinding between data generation and analysis teams. Synthetic data used underlying distributions and patient feature correlations based on clinical data from the Department of Veterans Affairs between 2005 and 2012. We generated 2494 synthetic datasets with widely varying characteristics including number of patient features, operators and institutions, and the operator learning form. Each dataset contained a hypothetical study device, Device B, and a reference device, Device A. We evaluated accuracy in identifying learning effects and identifying and estimating the strength of the device safety signal. Our approach also evaluated different clinically relevant thresholds for safety signal detection.

RESULTS

Our framework accurately identified the presence or absence of learning effects in 93.6% of datasets and correctly determined device safety signals in 93.4% of cases. The estimated device odds ratios' 95% confidence intervals were accurately aligned with the specified ratios in 94.7% of datasets. In contrast, a comparative model excluding operator learning effects significantly underperformed in detecting device signals and in accuracy. Notably, our framework achieved 100% specificity for clinically relevant safety signal thresholds, although sensitivity varied with the threshold applied.

DISCUSSION

A machine learning framework, tailored for the complexities of post-market device evaluation, may provide superior performance compared to standard parametric techniques when operator learning is present.

CONCLUSION

Demonstrating the capacity of ML to overcome complex evaluative challenges, our framework addresses the limitations of traditional statistical methods in current post-market surveillance processes. By offering a reliable means to detect and adjust for learning effects, it may significantly improve medical device safety evaluation.

摘要

目的

医疗设备上市后监测的传统方法往往无法准确考虑操作人员的学习效应，导致对设备安全性的评估存在偏差。这些方法在处理非线性、复杂的学习曲线以及随时间变化的协变量（如医生经验）方面存在困难。为解决这些局限性，我们试图开发一种机器学习（ML）框架来检测并调整操作人员的学习效应。

材料与方法

使用梯度提升决策树ML方法分析合成数据集，这些数据集复制了涉及高风险医疗设备的临床场景的复杂性。我们设计这个过程，使用风险调整累积和方法检测学习效应，量化因操作人员缺乏经验导致的额外不良事件发生率，并在评估设备安全信号时将这些因素与患者因素一起进行调整。为保持完整性，我们在数据生成和分析团队之间采用了盲法。合成数据基于2005年至2012年退伍军人事务部的临床数据使用基础分布和患者特征相关性。我们生成了2494个具有广泛不同特征的合成数据集，包括患者特征数量、操作人员和机构数量以及操作人员学习形式。每个数据集包含一个假设的研究设备（设备B）和一个参考设备（设备A）。我们评估了识别学习效应以及识别和估计设备安全信号强度的准确性。我们的方法还评估了安全信号检测的不同临床相关阈值。

结果

我们的框架在93.6%的数据集中准确识别了学习效应的存在与否，在93.4%的案例中正确确定了设备安全信号。在94.7%的数据集中，估计的设备优势比的95%置信区间与指定比例准确对齐。相比之下，一个排除操作人员学习效应的比较模型在检测设备信号和准确性方面表现明显较差。值得注意的是，我们的框架对于临床相关安全信号阈值实现了100%的特异性，尽管敏感性随所应用的阈值而变化。

讨论

针对上市后设备评估的复杂性量身定制的机器学习框架，在存在操作人员学习效应时，可能比标准参数技术提供更好的性能。

结论

我们的框架展示了ML克服复杂评估挑战的能力，解决了当前上市后监测过程中传统统计方法的局限性。通过提供一种检测和调整学习效应的可靠方法，它可能显著改善医疗设备安全评估。

相似文献

A machine learning framework to adjust for learning effects in medical device safety evaluation.一种用于在医疗设备安全性评估中调整学习效应的机器学习框架。

J Am Med Inform Assoc. 2025 Jan 1;32(1):206-217. doi: 10.1093/jamia/ocae273.

A Statistical Framework to Detect and Quantify Operator-Learning Curves in Medical Device Safety Evaluation.一种用于在医疗设备安全评估中检测和量化操作员学习曲线的统计框架。

Med Devices (Auckl). 2025 Jul 2;18:361-375. doi: 10.2147/MDER.S520191. eCollection 2025.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Perceptions and experiences of the prevention, detection, and management of postpartum haemorrhage: a qualitative evidence synthesis.预防、检测和管理产后出血的认知和经验：定性证据综合。

Cochrane Database Syst Rev. 2023 Nov 27;11(11):CD013795. doi: 10.1002/14651858.CD013795.pub2.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状Meta分析。

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。

Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Blood biomarkers for the non-invasive diagnosis of endometriosis.用于子宫内膜异位症无创诊断的血液生物标志物。

Cochrane Database Syst Rev. 2016 May 1;2016(5):CD012179. doi: 10.1002/14651858.CD012179.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状荟萃分析。

Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

引用本文的文献

Med Devices (Auckl). 2025 Jul 2;18:361-375. doi: 10.2147/MDER.S520191. eCollection 2025.

本文引用的文献

Simulating complex patient populations with hierarchical learning effects to support methods development for post-market surveillance.利用层次学习效应模拟复杂患者人群，以支持上市后监测方法的开发。

BMC Med Res Methodol. 2023 Apr 11;23(1):89. doi: 10.1186/s12874-023-01913-9.

The learning curve of robotic coronary arterial bypass surgery: A report from the STS database.机器人冠状动脉旁路移植术的学习曲线：来自 STS 数据库的报告。

J Card Surg. 2021 Nov;36(11):4178-4186. doi: 10.1111/jocs.15945. Epub 2021 Aug 29.

Research: Evaluation of Orthopedic Hip Device Recalls by the FDA from 2007 to 2017.研究：2007 年至 2017 年 FDA 对骨科髋关节器械召回的评估。

Biomed Instrum Technol. 2020 Nov 1;54(6):418-426. doi: 10.2345/0899-8205-54.6.418.

Trauma and Orthopedic Surgery Curriculum Concordance: An Operative Learning Curve Trajectory Perspective.创伤与骨科手术课程一致性：手术学习曲线轨迹视角

J Surg Educ. 2019 Nov-Dec;76(6):1569-1578. doi: 10.1016/j.jsurg.2019.05.009. Epub 2019 May 27.

Procedural Volume and Outcomes for Transcatheter Aortic-Valve Replacement.经导管主动脉瓣置换术的操作量与结果。

N Engl J Med. 2019 Jun 27;380(26):2541-2550. doi: 10.1056/NEJMsa1901109. Epub 2019 Apr 3.

A systematic review of the learning curve in robotic surgery: range and heterogeneity.机器人手术学习曲线的系统评价：范围和异质性。

Surg Endosc. 2019 Feb;33(2):353-365. doi: 10.1007/s00464-018-6473-9. Epub 2018 Sep 28.

Ethical Considerations for Increased Transparency and Reproducibility in the Retrospective Analysis of Health Care Data.医疗保健数据回顾性分析中提高透明度和可重复性的伦理考量

Ther Innov Regul Sci. 2015 May;49(3):342-347. doi: 10.1177/2168479015578155.

Learning Curve for Transcatheter Aortic Valve Implantation Under a Controlled Introduction System　- Initial Analysis of a Japanese Nationwide Registry.经控量引入系统行经导管主动脉瓣植入术的学习曲线：日本全国注册研究的初步分析

Circ J. 2018 Jun 25;82(7):1951-1958. doi: 10.1253/circj.CJ-18-0211. Epub 2018 May 22.

Can machine learning complement traditional medical device surveillance? A case study of dual-chamber implantable cardioverter-defibrillators.机器学习能否补充传统的医疗设备监测？双腔植入式心脏复律除颤器的案例研究。

Med Devices (Auckl). 2017 Aug 16;10:165-188. doi: 10.2147/MDER.S138158. eCollection 2017.

Procedural Experience for Transcatheter Aortic Valve Replacement and Relation to Outcomes: The STS/ACC TVT Registry.经导管主动脉瓣置换术的操作经验与结局的关系：STS/ACC TVT 注册研究。

J Am Coll Cardiol. 2017 Jul 4;70(1):29-41. doi: 10.1016/j.jacc.2017.04.056.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验