• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

虚拟患者的计算模拟可减少数据集偏差,并改善基于机器学习从嘈杂的异构重症监护病房数据集中检测急性呼吸窘迫综合征的能力。

Computational Simulation of Virtual Patients Reduces Dataset Bias and Improves Machine Learning-Based Detection of ARDS from Noisy Heterogeneous ICU Datasets.

作者信息

Sharafutdinov Konstantin, Fritsch Sebastian Johannes, Iravani Mina, Ghalati Pejman Farhadi, Saffaran Sina, Bates Declan G, Hardman Jonathan G, Polzin Richard, Mayer Hannah, Marx Gernot, Bickenbach Johannes, Schuppert Andreas

机构信息

Institute for Computational BiomedicineRWTH Aachen University 52062 Aachen Germany.

Joint Research Center for Computational BiomedicineRWTH Aachen University 52062 Aachen Germany.

出版信息

IEEE Open J Eng Med Biol. 2023 Feb 8;5:611-620. doi: 10.1109/OJEMB.2023.3243190. eCollection 2024.

DOI:10.1109/OJEMB.2023.3243190
PMID:39184970
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11342939/
Abstract

Machine learning (ML) technologies that leverage large-scale patient data are promising tools predicting disease evolution in individual patients. However, the limited generalizability of ML models developed on single-center datasets, and their unproven performance in real-world settings, remain significant constraints to their widespread adoption in clinical practice. One approach to tackle this issue is to base learning on large multi-center datasets. However, such heterogeneous datasets can introduce further biases driven by data origin, as data structures and patient cohorts may differ between hospitals. In this paper, we demonstrate how mechanistic virtual patient (VP) modeling can be used to capture specific features of patients' states and dynamics, while reducing biases introduced by heterogeneous datasets. We show how VP modeling can be used for data augmentation through identification of individualized model parameters approximating disease states of patients with suspected acute respiratory distress syndrome (ARDS) from observational data of mixed origin. We compare the results of an unsupervised learning method (clustering) in two cases: where the learning is based on original patient data and on data derived in the matching procedure of the VP model to real patient data. More robust cluster configurations were observed in clustering using the model-derived data. VP model-based clustering also reduced biases introduced by the inclusion of data from different hospitals and was able to discover an additional cluster with significant ARDS enrichment. Our results indicate that mechanistic VP modeling can be used to significantly reduce biases introduced by learning from heterogeneous datasets and to allow improved discovery of patient cohorts driven exclusively by medical conditions.

摘要

利用大规模患者数据的机器学习(ML)技术是预测个体患者疾病演变的有前景的工具。然而,基于单中心数据集开发的ML模型的有限通用性及其在现实环境中未经证实的性能,仍然是其在临床实践中广泛应用的重大限制。解决这个问题的一种方法是基于大型多中心数据集进行学习。然而,这样的异构数据集可能会引入由数据来源驱动的进一步偏差,因为不同医院之间的数据结构和患者队列可能不同。在本文中,我们展示了机械虚拟患者(VP)建模如何用于捕获患者状态和动态的特定特征,同时减少异构数据集引入的偏差。我们展示了如何通过从混合来源的观察数据中识别近似疑似急性呼吸窘迫综合征(ARDS)患者疾病状态的个性化模型参数,将VP建模用于数据增强。我们比较了无监督学习方法(聚类)在两种情况下的结果:学习基于原始患者数据以及基于VP模型与真实患者数据匹配过程中得出的数据。在使用模型得出的数据进行聚类时观察到了更稳健的聚类配置。基于VP模型的聚类还减少了因纳入不同医院的数据而引入的偏差,并且能够发现一个具有显著ARDS富集的额外聚类。我们的结果表明,机械VP建模可用于显著减少从异构数据集中学习所引入的偏差,并允许更好地发现仅由医疗状况驱动的患者队列。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/9a985f0350e5/shara3-3243190.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/95e2feb42c3c/shara1-3243190.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/97b58de2f732/shara2-3243190.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/9a985f0350e5/shara3-3243190.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/95e2feb42c3c/shara1-3243190.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/97b58de2f732/shara2-3243190.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/811e/11342939/9a985f0350e5/shara3-3243190.jpg

相似文献

1
Computational Simulation of Virtual Patients Reduces Dataset Bias and Improves Machine Learning-Based Detection of ARDS from Noisy Heterogeneous ICU Datasets.虚拟患者的计算模拟可减少数据集偏差,并改善基于机器学习从嘈杂的异构重症监护病房数据集中检测急性呼吸窘迫综合征的能力。
IEEE Open J Eng Med Biol. 2023 Feb 8;5:611-620. doi: 10.1109/OJEMB.2023.3243190. eCollection 2024.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Application of convex hull analysis for the evaluation of data heterogeneity between patient populations of different origin and implications of hospital bias in downstream machine-learning-based data processing: A comparison of 4 critical-care patient datasets.凸包分析在评估不同来源患者群体间数据异质性及医院偏差在基于机器学习的下游数据处理中的影响中的应用:4个重症监护患者数据集的比较
Front Big Data. 2022 Oct 31;5:603429. doi: 10.3389/fdata.2022.603429. eCollection 2022.
4
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
5
Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study.基于机器学习的急性呼吸窘迫综合征患者死亡率预测:一项基于人群的研究。
Ann Transl Med. 2021 May;9(9):794. doi: 10.21037/atm-20-6624.
6
SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.SCORCH:利用机器学习分类器、数据增强和不确定性估计改进基于结构的虚拟筛选。
J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.
7
A narrative review of progress in the application of artificial intelligence in acute respiratory distress syndrome: subtypes and predictive models.人工智能在急性呼吸窘迫综合征中的应用进展:亚型与预测模型的叙述性综述
Ann Transl Med. 2023 Jan 31;11(2):128. doi: 10.21037/atm-22-3153. Epub 2022 Dec 20.
8
Machine learning-based prediction model of acute kidney injury in patients with acute respiratory distress syndrome.基于机器学习的急性呼吸窘迫综合征患者急性肾损伤预测模型。
BMC Pulm Med. 2023 Oct 3;23(1):370. doi: 10.1186/s12890-023-02663-6.
9
Predicting ICU Mortality in Acute Respiratory Distress Syndrome Patients Using Machine Learning: The Predicting Outcome and STratifiCation of severity in ARDS (POSTCARDS) Study.利用机器学习预测急性呼吸窘迫综合征患者的 ICU 死亡率:急性呼吸窘迫综合征预后和严重程度分层(POSTCARDS)研究。
Crit Care Med. 2023 Dec 1;51(12):1638-1649. doi: 10.1097/CCM.0000000000006030. Epub 2023 Aug 31.
10
Machine learning-based nonlinear regression-adjusted real-time quality control modeling: a multi-center study.基于机器学习的非线性回归调整实时质量控制建模:一项多中心研究。
Clin Chem Lab Med. 2023 Nov 21;62(4):635-645. doi: 10.1515/cclm-2023-0964. Print 2024 Mar 25.

引用本文的文献

1
A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals.一种用于跨多家医院进行 ICU 死亡率的可推广和可解释预测的混合建模框架。
Sci Rep. 2024 Mar 8;14(1):5725. doi: 10.1038/s41598-024-55577-6.
2
Developing an Artificial Intelligence-Based Representation of a Virtual Patient Model for Real-Time Diagnosis of Acute Respiratory Distress Syndrome.开发基于人工智能的虚拟患者模型以实时诊断急性呼吸窘迫综合征。
Diagnostics (Basel). 2023 Jun 17;13(12):2098. doi: 10.3390/diagnostics13122098.

本文引用的文献

1
Application of convex hull analysis for the evaluation of data heterogeneity between patient populations of different origin and implications of hospital bias in downstream machine-learning-based data processing: A comparison of 4 critical-care patient datasets.凸包分析在评估不同来源患者群体间数据异质性及医院偏差在基于机器学习的下游数据处理中的影响中的应用:4个重症监护患者数据集的比较
Front Big Data. 2022 Oct 31;5:603429. doi: 10.3389/fdata.2022.603429. eCollection 2022.
2
A computational cardiopulmonary physiology simulator accurately predicts individual patient responses to changes in mechanical ventilator settings.一种计算心肺生理学模拟器可以准确预测个体患者对机械呼吸机设置变化的反应。
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3261-3264. doi: 10.1109/EMBC48229.2022.9871182.
3
A Comprehensive Review of Machine Learning Used to Combat COVID-19.用于抗击新冠疫情的机器学习综合综述
Diagnostics (Basel). 2022 Jul 31;12(8):1853. doi: 10.3390/diagnostics12081853.
4
Multitask Learning With Recurrent Neural Networks for Acute Respiratory Distress Syndrome Prediction Using Only Electronic Health Record Data: Model Development and Validation Study.仅使用电子健康记录数据的急性呼吸窘迫综合征预测的递归神经网络多任务学习:模型开发与验证研究
JMIR Med Inform. 2022 Jun 15;10(6):e36202. doi: 10.2196/36202.
5
Systematic Review and Comparison of Publicly Available ICU Data Sets-A Decision Guide for Clinicians and Data Scientists.系统综述和比较公开可用的 ICU 数据集——临床医生和数据科学家的决策指南。
Crit Care Med. 2022 Jun 1;50(6):e581-e588. doi: 10.1097/CCM.0000000000005517. Epub 2022 Mar 2.
6
Artificial intelligence for mechanical ventilation: systematic review of design, reporting standards, and bias.人工智能在机械通气中的应用:设计、报告标准和偏差的系统评价。
Br J Anaesth. 2022 Feb;128(2):343-351. doi: 10.1016/j.bja.2021.09.025. Epub 2021 Nov 9.
7
Biometric covariates and outcome in COVID-19 patients: are we looking close enough?生物识别协变量与 COVID-19 患者的结局:我们观察得足够仔细吗?
BMC Infect Dis. 2021 Nov 4;21(1):1136. doi: 10.1186/s12879-021-06823-z.
8
Algorithmic surveillance of ICU patients with acute respiratory distress syndrome (ASIC): protocol for a multicentre stepped-wedge cluster randomised quality improvement strategy.急性呼吸窘迫综合征 ICU 患者的算法监测(ASIC):一项多中心阶梯式楔形集群随机质量改进策略的方案。
BMJ Open. 2021 Apr 8;11(4):e045589. doi: 10.1136/bmjopen-2020-045589.
9
External Validation of a Widely Implemented Proprietary Sepsis Prediction Model in Hospitalized Patients.在住院患者中验证广泛实施的专有脓毒症预测模型的外部有效性。
JAMA Intern Med. 2021 Aug 1;181(8):1065-1070. doi: 10.1001/jamainternmed.2021.2626.
10
Artificial Intelligence Applications for COVID-19 in Intensive Care and Emergency Settings: A Systematic Review.人工智能在重症监护和急救环境中应用于 COVID-19:系统评价。
Int J Environ Res Public Health. 2021 Apr 29;18(9):4749. doi: 10.3390/ijerph18094749.