The parameter Houlihan: A solution to high-throughput identifiability indeterminacy for brutally ill-posed problems.

Author information

Department of Biomedical Informatics, Columbia University, 622 West 168th Street, PH-20, New York, NY, USA; Department of Pediatrics, Division of Informatics, University of Colorado Medicine, Mail: F443, 13199 E. Montview Blvd., Ste 210-12, Aurora, CO 80045, USA.

Department of Computational and Mathematical Sciences, California Institute of Technology, 1200 E California Blvd, M/C 305-16, Pasadena, CA 91125, USA.

Publication information

Math Biosci. 2019 Oct;316:108242. doi: 10.1016/j.mbs.2019.108242. Epub 2019 Aug 24.

DOI:10.1016/j.mbs.2019.108242

PMID:31454628

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6759390/

Abstract

One way to interject knowledge into clinically impactful forecasting is to use data assimilation, a nonlinear regression that projects data onto a mechanistic physiologic model, instead of a set of functions, such as neural networks. Such regressions have an advantage of being useful with particularly sparse, non-stationary clinical data. However, physiological models are often nonlinear and can have many parameters, leading to potential problems with parameter identifiability, or the ability to find a unique set of parameters that minimize forecasting error. The identifiability problems can be minimized or eliminated by reducing the number of parameters estimated, but reducing the number of estimated parameters also reduces the flexibility of the model and hence increases forecasting error. We propose a method, the parameter Houlihan, that combines traditional machine learning techniques with data assimilation, to select the right set of model parameters to minimize forecasting error while reducing identifiability problems. The method worked well: the data assimilation-based glucose forecasts and estimates for our cohort using the Houlihan-selected parameter sets generally also minimize forecasting errors compared to other parameter selection methods such as by-hand parameter selection. Nevertheless, the forecast with the lowest forecast error does not always accurately represent physiology, but further advancements of the algorithm provide a path for improving physiologic fidelity as well. Our hope is that this methodology represents a first step toward combining machine learning with data assimilation and provides a lower-threshold entry point for using data assimilation with clinical data by helping select the right parameters to estimate.
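The trade-off described in the abstract (estimating fewer parameters eases identifiability but can raise forecasting error) can be illustrated with a small sketch. This is not the Houlihan algorithm and does not use data assimilation: it is a hypothetical toy that exhaustively tries every subset of parameters to estimate, fits each subset by grid search while holding the remaining parameters at nominal values, and keeps the smallest subset with the lowest held-out forecast error. The model `y(t) = a * exp(-b * t) + c`, the names `NOMINAL`, `GRID`, `fit_subset`, and `select_subset`, and all numeric values are assumptions made for illustration only.

```python
import itertools
import math

# Hypothetical toy model, NOT the glucose model from the paper:
# y(t) = a * exp(-b * t) + c
NOMINAL = {"a": 1.0, "b": 0.5, "c": 0.0}   # assumed default parameter values
GRID = {                                    # assumed candidate values per parameter
    "a": [0.5, 1.0, 1.5, 2.0],
    "b": [0.1, 0.3, 0.5, 0.7],
    "c": [0.0, 0.5, 1.0],
}


def model(t, p):
    return p["a"] * math.exp(-p["b"] * t) + p["c"]


def mse(params, data):
    """Mean squared error of the model over (t, y) pairs."""
    return sum((model(t, params) - y) ** 2 for t, y in data) / len(data)


def fit_subset(subset, train):
    """Grid-search only the parameters in `subset`; the rest stay at NOMINAL."""
    best_params, best_err = dict(NOMINAL), mse(NOMINAL, train)
    for values in itertools.product(*(GRID[name] for name in subset)):
        candidate = dict(NOMINAL, **dict(zip(subset, values)))
        err = mse(candidate, train)
        if err < best_err:
            best_params, best_err = candidate, err
    return best_params


def select_subset(train, test):
    """Return (forecast_error, subset, fitted_params) with the lowest held-out
    error, breaking ties in favour of estimating fewer parameters."""
    names = sorted(NOMINAL)
    results = []
    for k in range(len(names) + 1):
        for subset in itertools.combinations(names, k):
            fitted = fit_subset(subset, train)
            results.append((mse(fitted, test), subset, fitted))
    return min(results, key=lambda r: (r[0], len(r[1])))


# Noise-free synthetic data in which only "a" differs from its nominal value.
TRUE = {"a": 2.0, "b": 0.5, "c": 0.0}
train = [(t, model(t, TRUE)) for t in range(5)]       # fitting window
test = [(t, model(t, TRUE)) for t in range(5, 10)]    # forecast window

error, subset, fitted = select_subset(train, test)
print(subset, error, fitted)
```

On this noise-free data, estimating `a` alone already reproduces the forecast window, so the tie-break favours the one-parameter subset over larger subsets that fit equally well. With noisy, sparse data the larger subsets would typically show the identifiability problems the abstract describes, which is the situation a selection method has to handle.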

Similar articles

Interpretable physiological forecasting in the ICU using constrained data assimilation and electronic health record data.

J Biomed Inform. 2023 Sep;145:104477. doi: 10.1016/j.jbi.2023.104477. Epub 2023 Aug 20.

Personalized glucose forecasting for type 2 diabetes using data assimilation.

PLoS Comput Biol. 2017 Apr 27;13(4):e1005232. doi: 10.1371/journal.pcbi.1005232. eCollection 2017 Apr.

Mechanistic machine learning: how data assimilation leverages physiologic knowledge using Bayesian inference to forecast the future, infer the present, and phenotype.

J Am Med Inform Assoc. 2018 Oct 1;25(10):1392-1401. doi: 10.1093/jamia/ocy106.

Identifiability Analysis of Three Control-Oriented Models for Use in Artificial Pancreas Systems.

J Diabetes Sci Technol. 2018 Sep;12(5):937-952. doi: 10.1177/1932296818788873. Epub 2018 Aug 10.

ASAS-NANP symposium: Mathematical Modeling in Animal Nutrition: The power of identifiability analysis for dynamic modeling in animal science: a practitioner approach.

J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad320.

Inference-based assessment of parameter identifiability in nonlinear biological models.

J R Soc Interface. 2018 Jul;15(144). doi: 10.1098/rsif.2018.0318.

Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.

Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.

Assessing parameter identifiability in compartmental dynamic models using a computational approach: application to infectious disease transmission models.

Theor Biol Med Model. 2019 Jan 14;16(1):1. doi: 10.1186/s12976-018-0097-6.

Identifiability and online estimation of diagnostic parameters with in the glucose insulin homeostasis.

Biosystems. 2012 Mar;107(3):135-41. doi: 10.1016/j.biosystems.2011.11.003. Epub 2011 Nov 12.

Cited by

A methodology of phenotyping ICU patients from EHR data: High-fidelity, personalized, and interpretable phenotypes estimation.

J Biomed Inform. 2023 Dec;148:104547. doi: 10.1016/j.jbi.2023.104547. Epub 2023 Nov 18.

Geometric analysis enables biological insight from complex non-identifiable models using simple surrogates.

PLoS Comput Biol. 2023 Jan 20;19(1):e1010844. doi: 10.1371/journal.pcbi.1010844. eCollection 2023 Jan.

A Damaged-Informed Lung Ventilator Model for Ventilator Waveforms.

Front Physiol. 2021 Oct 1;12:724046. doi: 10.3389/fphys.2021.724046. eCollection 2021.

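Mechanistic machine learning: how data assimilation leverages physiologic knowledge using Bayesian inference to forecast the future, infer the present, and phenotype.

J Am Med Inform Assoc. 2018 Oct 1;25(10):1392-1401. doi: 10.1093/jamia/ocy106.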
