Comparison of machine learning clustering algorithms for detecting heterogeneity of treatment effect in acute respiratory distress syndrome: A secondary analysis of three randomised controlled trials.

Affiliations

Division of Clinical and Translational Research, Division of Critical Care, Department of Anesthesia, Washington University School of Medicine, Saint Louis, MO.

Department of Medicine, University of Wisconsin-Madison, Madison, Wisconsin.

Publication information

EBioMedicine. 2021 Dec;74:103697. doi: 10.1016/j.ebiom.2021.103697. Epub 2021 Dec 1.

DOI:10.1016/j.ebiom.2021.103697

PMID:34861492

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8645454/

Abstract

BACKGROUND

Heterogeneity in Acute Respiratory Distress Syndrome (ARDS), a consequence of its non-specific definition, has led to a multitude of negative randomised controlled trials (RCTs). Investigators have sought to identify heterogeneity of treatment effect (HTE) in RCTs using clustering algorithms. We evaluated how well several commonly used machine-learning algorithms identify clusters in which HTE may be detected.

METHODS

Five unsupervised algorithms (latent class analysis (LCA), K-means, partition around medoids, hierarchical clustering, and spectral clustering) and four supervised algorithms (model-based recursive partitioning, Causal Forest (CF), X-learner with Random Forest (XL-RF), and X-learner with Bayesian Additive Regression Trees) were individually applied to three prior ARDS RCTs. Clinical data and research protein biomarkers were used as partitioning variables; the biomarkers were excluded in secondary analyses. For each clustering schema, HTE was evaluated using the interaction term between treatment group and cluster, with day-90 mortality as the dependent variable.
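The interaction test named in the Methods can be illustrated with a minimal sketch. It assumes exactly two clusters and a binary treatment, in which case the treatment-by-cluster interaction coefficient of a saturated logistic model equals the log ratio of the two within-cluster odds ratios, with a Wald standard error. The function name, the Wald test, and the cell counts below are illustrative assumptions and are not the authors' code or trial data.

```python
import math

def interaction_test(counts):
    """Treatment-by-cluster interaction on day-90 mortality (hypothetical helper).

    counts[(cluster, treated)] = (deaths, survivors), with cluster and
    treated each coded 0 or 1. Returns the interaction log odds ratio,
    its Wald standard error, and a two-sided normal p-value.
    """
    def log_odds(cell):
        deaths, survivors = cell
        return math.log(deaths / survivors)

    # Log odds ratio of death (treated vs control) inside each cluster.
    log_or = {c: log_odds(counts[(c, 1)]) - log_odds(counts[(c, 0)])
              for c in (0, 1)}

    # Interaction = difference of the two within-cluster log odds ratios.
    beta = log_or[1] - log_or[0]

    # Wald standard error: sum of reciprocal counts over all eight cells.
    se = math.sqrt(sum(1 / d + 1 / s for d, s in counts.values()))

    z = beta / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p from N(0, 1)
    return beta, se, p_value

# Made-up counts: treatment looks protective in cluster 0, harmful in cluster 1.
example_counts = {
    (0, 0): (30, 70),  # cluster 0, control
    (0, 1): (20, 80),  # cluster 0, treated
    (1, 0): (40, 60),  # cluster 1, control
    (1, 1): (55, 45),  # cluster 1, treated
}

beta, se, p_value = interaction_test(example_counts)
print(f"interaction log-OR = {beta:.3f}, SE = {se:.3f}, p = {p_value:.4f}")
```

With more than two clusters, or with covariate adjustment, the interaction would instead be estimated by fitting a logistic regression that includes treatment, cluster, and their product terms.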

FINDINGS

No single algorithm identified clusters with significant HTE in all three trials. LCA, XL-RF, and CF identified HTE most frequently (2/3 RCTs). Important partitioning variables in the unsupervised approaches were consistent across algorithms and RCTs. In supervised models, important partitioning variables varied between algorithms and across RCTs. When more than one algorithm yielded clusters with HTE in the same trial, patients were frequently placed in a treatment-benefit cluster by one algorithm and in a treatment-harm cluster by another. With the exception of LCA, changing the random seed substantially altered cluster composition and HTE for every algorithm. Removing research biomarkers as partitioning variables greatly reduced the chances of detecting HTE across all algorithms.
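The seed sensitivity reported above can be checked for any clustering by refitting with different seeds and measuring how often pairs of patients keep the same co-membership. The sketch below assumes a plain Lloyd-style K-means and the unadjusted Rand index, run on synthetic points with no built-in cluster structure. The data, the function names, and the choice of agreement measure are illustrative assumptions, not the procedure used in the paper.

```python
import random
from itertools import combinations

def kmeans(points, k, seed, iters=50):
    """Plain Lloyd's K-means; only the initial centres depend on the seed."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest centre (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centers[j])))
            for p in points
        ]
        # Recompute centres as member means; keep the old centre if empty.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[j])
        if new_centers == centers:
            break
        centers = new_centers
    return labels

def rand_index(a, b):
    """Share of point pairs on which two labelings agree (same vs different)."""
    pairs = list(combinations(range(len(a)), 2))
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)

# Synthetic, structureless data: 60 points drawn uniformly in the unit square.
data_rng = random.Random(0)
points = [(data_rng.random(), data_rng.random()) for _ in range(60)]

# Refit with five seeds and compare every pair of resulting labelings.
labelings = [kmeans(points, k=2, seed=s) for s in range(1, 6)]
agreements = [rand_index(x, y) for x, y in combinations(labelings, 2)]
print("pairwise Rand index across seeds:", [round(r, 2) for r in agreements])
```

A Rand index of 1 means two runs produced the same partition up to relabeling; values well below 1 indicate that cluster membership depends on the seed.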

INTERPRETATION

Machine-learning algorithms were inconsistent in their abilities to identify clusters with significant HTE. Protein biomarkers were essential in identifying clusters with HTE. Investigations using machine-learning approaches to identify clusters to seek HTE require cautious interpretation.

FUNDING

NIGMS R35 GM142992 (PS), NHLBI R35 HL140026 (CSC); NIGMS R01 GM123193, Department of Defense W81XWH-21-1-0009, NIA R21 AG068720, NIDA R01 DA051464 (MMC).

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1009/8645454/d12ad8900d78/gr1.jpg

Similar articles

Heterogeneity of treatment effect by baseline risk of mortality in critically ill patients: re-analysis of three recent sepsis and ARDS randomised controlled trials.

Crit Care. 2019 May 3;23(1):156. doi: 10.1186/s13054-019-2446-1.

Outcome risk model development for heterogeneity of treatment effect analyses: a comparison of non-parametric machine learning methods and semi-parametric statistical methods.

BMC Med Res Methodol. 2024 Jul 23;24(1):158. doi: 10.1186/s12874-024-02265-8.

Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials.

Lancet Respir Med. 2020 Mar;8(3):247-257. doi: 10.1016/S2213-2600(19)30369-8. Epub 2020 Jan 13.

Heterogeneous effects of alveolar recruitment in acute respiratory distress syndrome: a machine learning reanalysis of the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial.

Br J Anaesth. 2019 Jul;123(1):88-95. doi: 10.1016/j.bja.2019.02.026. Epub 2019 Apr 5.

Longitudinal phenotypes in patients with acute respiratory distress syndrome: a multi-database study.

Crit Care. 2022 Nov 4;26(1):340. doi: 10.1186/s13054-022-04211-w.

Preventing false discovery of heterogeneous treatment effect subgroups in randomized trials.

Trials. 2018 Jul 16;19(1):382. doi: 10.1186/s13063-018-2774-5.

Sheep's coping style can be identified by unsupervised machine learning from unlabeled data.

Behav Processes. 2022 Jan;194:104559. doi: 10.1016/j.beproc.2021.104559. Epub 2021 Nov 25.

Machine Learning Classifier Models Can Identify Acute Respiratory Distress Syndrome Phenotypes Using Readily Available Clinical Data.

Am J Respir Crit Care Med. 2020 Oct 1;202(7):996-1004. doi: 10.1164/rccm.202002-0347OC.

The potential of clustering methods to define intersection test scenarios: Assessing real-life performance of AEB.

Accid Anal Prev. 2018 Apr;113:1-11. doi: 10.1016/j.aap.2018.01.010. Epub 2018 Jan 30.

Cited by

Acute Respiratory Distress Syndrome Phenotypes After Stem Cell Transplantation: A Latent Class Analysis.

Crit Care Explor. 2025 Sep 5;7(9):e1312. doi: 10.1097/CCE.0000000000001312. eCollection 2025 Sep 1.

Mass medicine vs. personalized medicine: from mathematical methods to regulatory implications.

Front Physiol. 2025 Jul 14;16:1649114. doi: 10.3389/fphys.2025.1649114. eCollection 2025.

Identification and longitudinal assessment of sepsis phenotypes derived from routine clinical data in critically ill patients: a retrospective repeated measures latent profile analysis.

Infection. 2025 Jul 23. doi: 10.1007/s15010-025-02607-8.

Predictive Modeling of Heterogeneous Treatment Effects in RCTs: A Scoping Review.

JAMA Netw Open. 2025 Jul 1;8(7):e2522390. doi: 10.1001/jamanetworkopen.2025.22390.

Enriching patient populations in ICU trials: reducing heterogeneity through machine learning.

Curr Opin Crit Care. 2025 Aug 1;31(4):410-416. doi: 10.1097/MCC.0000000000001280. Epub 2025 May 2.

Precision Medicine in Acute Respiratory Distress Syndrome: Progress, Challenges, and the Road ahead.

Clin Chest Med. 2024 Dec;45(4):835-848. doi: 10.1016/j.ccm.2024.08.005. Epub 2024 Sep 20.

Circulating Heparan Sulfate Profiles in Pediatric Acute Respiratory Distress Syndrome.

Shock. 2024 Oct 1;62(4):496-504. doi: 10.1097/SHK.0000000000002421.

Toward Precision in Critical Care Research: Methods for Observational and Interventional Studies.

Crit Care Med. 2024 Sep 1;52(9):1439-1450. doi: 10.1097/CCM.0000000000006371. Epub 2024 Aug 15.

Machine Learning Predicts Unplanned Care Escalations for Post-Anesthesia Care Unit Patients during the Perioperative Period: A Single-Center Retrospective Study.

J Med Syst. 2024 Jul 23;48(1):69. doi: 10.1007/s10916-024-02085-9.

A systematic review of machine learning models for management, prediction and classification of ARDS.

Respir Res. 2024 Jun 4;25(1):232. doi: 10.1186/s12931-024-02834-x.
