机器学习辅助的相关协变量筛选：应用于去甲丙咪嗪的临床数据

Machine-Learning Assisted Screening of Correlated Covariates: Application to Clinical Data of Desipramine.

作者信息

Asiimwe Innocent Gerald, S'fiso Ndzamba Bonginkosi, Mouksassi Samer, Pillai Goonaseelan Colin, Lombard Aurelie, Lang Jennifer

机构信息

The Wolfson Centre for Personalized Medicine, Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK.

APT-Africa Fellowship Program, c/o Pharmacometrics Africa NPC, K45 Old Main Building, Groote Schuur Hospital, Cape Town, South Africa.

出版信息

AAPS J. 2024 May 30;26(4):63. doi: 10.1208/s12248-024-00934-6.

DOI:10.1208/s12248-024-00934-6

PMID:38816519

Abstract

Stepwise covariate modeling (SCM) has a high computational burden and can select the wrong covariates. Machine learning (ML) has been proposed as a screening tool to improve the efficiency of covariate selection, but little is known about how to apply ML on actual clinical data. First, we simulated datasets based on clinical data to compare the performance of various ML and traditional pharmacometrics (PMX) techniques with and without accounting for highly-correlated covariates. This simulation step identified the ML algorithm and the number of top covariates to select when using the actual clinical data. A previously developed desipramine population-pharmacokinetic model was used to simulate virtual subjects. Fifteen covariates were considered with four having an effect included. Based on the F1 score (an accuracy measure), ridge regression was the most accurate ML technique on 200 simulated datasets (F1 score = 0.475 ± 0.231), a performance which almost doubled when highly-correlated covariates were accounted for (F1 score = 0.860 ± 0.158). These performances were better than forwards selection with SCM (F1 score = 0.251 ± 0.274 and 0.499 ± 0.381 without/with correlations respectively). In terms of computational cost, ridge regression (0.42 ± 0.07 seconds/simulated dataset, 1 thread) was ~20,000 times faster than SCM (2.30 ± 2.29 hours, 15 threads). On the clinical dataset, prescreening with the selected ML algorithm reduced SCM runtime by 42.86% (from 1.75 to 1.00 days) and produced the same final model as SCM only. In conclusion, we have demonstrated that accounting for highly-correlated covariates improves ML prescreening accuracy. The choice of ML method and the proportion of important covariates (unknown a priori) can be guided by simulations.

摘要

逐步协变量建模（SCM）计算负担高，且可能选择错误的协变量。机器学习（ML）已被提议作为一种筛选工具，以提高协变量选择的效率，但对于如何将ML应用于实际临床数据却知之甚少。首先，我们基于临床数据模拟数据集，比较各种ML和传统药代动力学（PMX）技术在考虑和不考虑高度相关协变量情况下的性能。这个模拟步骤确定了在使用实际临床数据时要选择的ML算法和顶级协变量的数量。使用先前开发的地昔帕明群体药代动力学模型来模拟虚拟受试者。考虑了15个协变量，其中4个有影响。基于F1分数（一种准确性度量），岭回归是200个模拟数据集上最准确的ML技术（F1分数 = 0.475 ± 0.231），当考虑高度相关协变量时，性能几乎翻倍（F1分数 = 0.860 ± 0.158）。这些性能优于SCM的向前选择（分别为不考虑/考虑相关性时的F1分数 = 0.251 ± 0.274和0.499 ± 0.381）。在计算成本方面，岭回归（0.42 ± 0.07秒/模拟数据集，1个线程）比SCM（2.30 ± 2.29小时，15个线程）快约20,000倍。在临床数据集上，使用选定的ML算法进行预筛选将SCM运行时间减少了42.86%（从1.75天降至1.00天），并且产生了与仅使用SCM相同的最终模型。总之，我们已经证明考虑高度相关协变量可提高ML预筛选的准确性。ML方法的选择和重要协变量的比例（先验未知）可以通过模拟来指导。

相似文献

Machine-Learning Assisted Screening of Correlated Covariates: Application to Clinical Data of Desipramine.

AAPS J. 2024 May 30;26(4):63. doi: 10.1208/s12248-024-00934-6.

Fast screening of covariates in population models empowered by machine learning.

J Pharmacokinet Pharmacodyn. 2021 Aug;48(4):597-609. doi: 10.1007/s10928-021-09757-w. Epub 2021 May 21.

The lasso--a novel method for predictive covariate model building in nonlinear mixed effects models.

J Pharmacokinet Pharmacodyn. 2007 Aug;34(4):485-517. doi: 10.1007/s10928-007-9057-1. Epub 2007 May 22.

Application of a single-objective, hybrid genetic algorithm approach to pharmacokinetic model building.

J Pharmacokinet Pharmacodyn. 2012 Aug;39(4):393-414. doi: 10.1007/s10928-012-9258-0. Epub 2012 Jul 6.

Impact of covariate model building methods on their clinical relevance evaluation in population pharmacokinetic analyses: comparison of the full model, stepwise covariate model (SCM) and SCM+ approaches.

J Pharmacokinet Pharmacodyn. 2024 Dec;51(6):653-670. doi: 10.1007/s10928-024-09911-0. Epub 2024 Apr 9.

Efficient and relevant stepwise covariate model building for pharmacometrics.

CPT Pharmacometrics Syst Pharmacol. 2022 Sep;11(9):1210-1222. doi: 10.1002/psp4.12838. Epub 2022 Jul 19.

Application of machine learning techniques in population pharmacokinetics/pharmacodynamics modeling.

Drug Metab Pharmacokinet. 2024 Jun;56:101004. doi: 10.1016/j.dmpk.2024.101004. Epub 2024 Feb 17.

Operating characteristics of stepwise covariate selection in pharmacometric modeling.

J Pharmacokinet Pharmacodyn. 2019 Jun;46(3):273-285. doi: 10.1007/s10928-019-09635-6. Epub 2019 Apr 24.

Go beyond the limits of genetic algorithm in daily covariate selection practice.

J Pharmacokinet Pharmacodyn. 2024 Apr;51(2):109-121. doi: 10.1007/s10928-023-09875-7. Epub 2023 Jul 26.

Comparison of covariate selection methods with correlated covariates: prior information versus data information, or a mixture of both?

J Pharmacokinet Pharmacodyn. 2020 Oct;47(5):485-492. doi: 10.1007/s10928-020-09700-5. Epub 2020 Jul 13.

引用本文的文献

Risk prediction models for complications after flap repair surgery: a systematic review and meta-analysis.

BMC Surg. 2025 Aug 27;25(1):398. doi: 10.1186/s12893-025-03072-8.

Covariate Model Selection Approaches for Population Pharmacokinetics: A Systematic Review of Existing Methods, From SCM to AI.

CPT Pharmacometrics Syst Pharmacol. 2025 Apr;14(4):621-639. doi: 10.1002/psp4.13306. Epub 2025 Jan 20.

Advancing pharmacometrics in Africa-Transition from capacity development toward job creation.

CPT Pharmacometrics Syst Pharmacol. 2025 Mar;14(3):407-419. doi: 10.1002/psp4.13291. Epub 2024 Dec 9.

本文引用的文献

Integrating real-world data to accelerate and guide drug development: A clinical pharmacology perspective.

Clin Transl Sci. 2022 Oct;15(10):2293-2302. doi: 10.1111/cts.13379. Epub 2022 Aug 7.

Efficient and relevant stepwise covariate model building for pharmacometrics.

CPT Pharmacometrics Syst Pharmacol. 2022 Sep;11(9):1210-1222. doi: 10.1002/psp4.12838. Epub 2022 Jul 19.

Population pharmacokinetic model selection assisted by machine learning.

J Pharmacokinet Pharmacodyn. 2022 Apr;49(2):257-270. doi: 10.1007/s10928-021-09793-6. Epub 2021 Oct 27.

Fast screening of covariates in population models empowered by machine learning.

J Pharmacokinet Pharmacodyn. 2021 Aug;48(4):597-609. doi: 10.1007/s10928-021-09757-w. Epub 2021 May 21.

Machine learning in pharmacometrics: Opportunities and challenges.

Br J Clin Pharmacol. 2022 Feb;88(4):1482-1499. doi: 10.1111/bcp.14801. Epub 2021 Mar 17.

Pharmacometrics and Systems Pharmacology 2030.

Clin Pharmacol Ther. 2020 Jan;107(1):76-78. doi: 10.1002/cpt.1683. Epub 2019 Nov 23.

Machine learning algorithm validation with a limited sample size.

PLoS One. 2019 Nov 7;14(11):e0224365. doi: 10.1371/journal.pone.0224365. eCollection 2019.

Operating characteristics of stepwise covariate selection in pharmacometric modeling.

J Pharmacokinet Pharmacodyn. 2019 Jun;46(3):273-285. doi: 10.1007/s10928-019-09635-6. Epub 2019 Apr 24.

Variable selection - A review and recommendations for the practicing statistician.

Biom J. 2018 May;60(3):431-449. doi: 10.1002/bimj.201700067. Epub 2018 Jan 2.

A Tutorial on RxODE: Simulating Differential Equation Pharmacometric Models in R.

CPT Pharmacometrics Syst Pharmacol. 2016 Jan;5(1):3-10. doi: 10.1002/psp4.12052. Epub 2015 Dec 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

机器学习辅助的相关协变量筛选：应用于去甲丙咪嗪的临床数据

Machine-Learning Assisted Screening of Correlated Covariates: Application to Clinical Data of Desipramine.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献