BioSymetrics, Inc., Huntington, NY, USA.
Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA; Frank G. Zarb School of Business, Hofstra University, Hempstead, NY, USA.
Comput Biol Med. 2023 Nov;166:107483. doi: 10.1016/j.compbiomed.2023.107483. Epub 2023 Sep 16.
The most common cause of death in people with COVID-19 is Acute Respiratory Distress Syndrome (ARDS). Prior studies have demonstrated that ARDS is a heterogeneous syndrome and have identified ARDS sub-types (phenoclusters). However, non-COVID-19 ARDS phenoclusters do not clearly apply to COVID-19 ARDS patients. In this retrospective cohort study, we implemented an iterative approach, combining supervised and unsupervised machine learning methodologies, to identify clinically relevant COVID-19 ARDS phenoclusters, as well as characteristics that are predictive of the outcome for each phenocluster. To this end, we applied a supervised model to identify risk factors for hospital mortality for each phenocluster and compared these between phenoclusters and the entire cohort. We trained the models using a comprehensive, preprocessed dataset of 2,864 hospitalized COVID-19 ARDS patients. Our research demonstrates that the risk factors predicting mortality in the overall cohort of COVID-19 ARDS may not necessarily apply to specific phenoclusters. Additionally, some risk factors increase the risk of hospital mortality in some phenoclusters but decrease mortality in others. These phenocluster-specific risk factors would not have been observed with a single predictive model. Heterogeneity in phenoclusters of COVID-19 ARDS as well as the drivers of mortality may partially explain challenges in finding effective treatments for all patients with ARDS.
在 COVID-19 患者中,最常见的死亡原因是急性呼吸窘迫综合征(ARDS)。先前的研究表明,ARDS 是一种异质性综合征,并确定了 ARDS 亚型(表型簇)。然而,非 COVID-19 的 ARDS 表型簇并不完全适用于 COVID-19 ARDS 患者。在这项回顾性队列研究中,我们采用迭代方法,结合有监督和无监督机器学习方法,来识别与 COVID-19 ARDS 相关的临床表型簇,以及对每个表型簇的结果具有预测性的特征。为此,我们应用有监督模型来识别每个表型簇的住院死亡率的危险因素,并在表型簇之间和整个队列中进行比较。我们使用了一个综合的、预处理的 2864 名住院 COVID-19 ARDS 患者的数据集来训练模型。我们的研究表明,预测 COVID-19 ARDS 总体队列中死亡率的危险因素可能不一定适用于特定的表型簇。此外,一些危险因素在某些表型簇中增加了住院死亡率的风险,但在其他表型簇中降低了死亡率。如果使用单一的预测模型,这些表型簇特异性的危险因素是无法被观察到的。COVID-19 ARDS 表型簇的异质性以及死亡率的驱动因素部分解释了为什么为所有 ARDS 患者找到有效治疗方法存在挑战。