Diagnostic framework to validate clinical machine learning models locally on temporally stamped data.

Authors

Schuessler Maximilian, Fleming Scott, Meyer Shannon, Seto Tina, Hernandez-Boussard Tina

Affiliations

Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.

Department of Statistics, Stanford University, Stanford, CA, USA.

Publication information

Commun Med (Lond). 2025 Jul 1;5(1):261. doi: 10.1038/s43856-025-00965-w.

DOI:10.1038/s43856-025-00965-w
PMID:40596645
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12219301/
Abstract

BACKGROUND

Real-world medical environments such as oncology are highly dynamic due to rapid changes in medical practice, technologies, and patient characteristics. This variability, if not addressed, can result in data shifts with potentially poor model performance. Presently, there are few easy-to-implement, model-agnostic diagnostic frameworks to vet machine learning models for future applicability and temporal consistency.

METHODS

We extracted clinical data from electronic health records (EHR) for a cohort of over 24,000 patients who received antineoplastic therapy within a distinct year. The labels in this study are acute care utilization (ACU) events, i.e., emergency department visits and hospitalizations, within 180 days of treatment initiation. Our cross-sectional data span treatment initiation points from 2010 to 2022. We implemented three models within our validation framework: Least Absolute Shrinkage and Selection Operator (LASSO), Random Forest (RF), and Extreme Gradient Boosting (XGBoost).
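
To make the label definition concrete, the following is a minimal sketch of how a binary ACU label could be derived per patient. It is not the authors' implementation: the function name `acu_label`, the `window_days` parameter, and the choice to treat both window boundaries as inclusive are assumptions made for illustration only.

```python
from datetime import date, timedelta


def acu_label(treatment_start, event_dates, window_days=180):
    """Return 1 if any acute care event falls inside the follow-up window.

    treatment_start: date of treatment initiation.
    event_dates: iterable of dates of emergency department visits or
        hospitalizations for the same patient.
    window_days: length of the follow-up window (180 in the abstract).

    Assumption: events on the day of initiation and on the last day of
    the window both count; the abstract does not specify the boundaries.
    """
    window_end = treatment_start + timedelta(days=window_days)
    return int(any(treatment_start <= d <= window_end for d in event_dates))


# Hypothetical patient: treatment starts 2015-01-01, one admission in March.
label = acu_label(date(2015, 1, 1), [date(2015, 3, 1)])
```

Events that occur before treatment initiation or after the window closes do not contribute to the label under these assumptions.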

RESULTS

Here, we introduce a model-agnostic diagnostic framework to validate clinical machine learning models on time-stamped data, consisting of four stages. First, the framework evaluates performance by partitioning data from multiple years into training and validation cohorts. Second, it characterizes the temporal evolution of patient outcomes and characteristics. Third, model longevity and trade-offs between data quantity and recency are explored. Finally, feature importance and data valuation algorithms are applied for feature reduction and data quality assessment. When applied to predicting ACU in cancer patients, the framework highlights fluctuations in features, labels, and data values over time.
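
The first and third stages (year-based partitioning, and the trade-off between data quantity and recency) can be illustrated with a small split generator. This is a sketch under stated assumptions rather than the framework's actual code: `temporal_splits` and `max_window` are hypothetical names, and the scheme shown (train on the k most recent years before each test year) is one plausible way to compare short, recent training windows against longer, older ones.

```python
def temporal_splits(years, max_window=None):
    """Yield (train_years, test_year) pairs for temporal validation.

    For every test year, training windows of increasing length are built
    from the years immediately preceding it, so training data never
    comes from the test year or later. Comparing windows of different
    lengths for the same test year exposes the quantity/recency trade-off.
    """
    years = sorted(set(years))
    for i, test_year in enumerate(years):
        for window in range(1, i + 1):
            if max_window is not None and window > max_window:
                break
            yield tuple(years[i - window:i]), test_year


# Three cohort years give three splits: one for 2011 and two for 2012.
splits = list(temporal_splits([2010, 2011, 2012]))
```

Each pair can then be used to select rows by treatment-initiation year, fit any model on the training years, and score it on the test year, which keeps the procedure model-agnostic.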

CONCLUSIONS

The work in this study emphasizes the importance of data timeliness and relevance. The results on ACU in cancer patients show moderate signs of drift and corroborate the relevance of temporal considerations when validating machine learning models for deployment at the point of care.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a4/12219301/44a25331b1c1/43856_2025_965_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a4/12219301/e562b42a77e6/43856_2025_965_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a4/12219301/375499ff1627/43856_2025_965_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a4/12219301/066b83498aaf/43856_2025_965_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a4/12219301/045cb1012cda/43856_2025_965_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7a4/12219301/23548c43b40b/43856_2025_965_Fig6_HTML.jpg
