Division of Pulmonary, Critical Care, Allergy and Sleep Medicine, Department of Medicine, and.
Department of Anesthesia, University of California San Francisco, San Francisco, California; and.
Am J Respir Crit Care Med. 2020 Oct 1;202(7):996-1004. doi: 10.1164/rccm.202002-0347OC.
Two distinct phenotypes of acute respiratory distress syndrome (ARDS) with differential clinical outcomes and responses to randomly assigned treatment have consistently been identified in randomized controlled trial cohorts using latent class analysis. Plasma biomarkers, key components in phenotype identification, currently lack point-of-care assays and represent a barrier to the clinical implementation of phenotypes. The objective of this study was to develop models to classify ARDS phenotypes using readily available clinical data only. Three randomized controlled trial cohorts served as the training data set (ARMA [High vs. Low Vt], ALVEOLI [Assessment of Low Vt and Elevated End-Expiratory Pressure to Obviate Lung Injury], and FACTT [Fluids and Catheter Treatment Trial]; = 2,022), and a fourth served as the validation data set (SAILS [Statins for Acutely Injured Lungs from Sepsis]; = 745). A gradient-boosted machine algorithm was used to develop classifier models using 24 variables (demographics, vital signs, laboratory, and respiratory variables) at enrollment. In two secondary analyses, the ALVEOLI and FACTT cohorts each, individually, served as the validation data set, and the remaining combined cohorts formed the training data set for each analysis. Model performance was evaluated against the latent class analysis-derived phenotype. For the primary analysis, the model accurately classified the phenotypes in the validation cohort (area under the receiver operating characteristic curve [AUC], 0.95; 95% confidence interval [CI], 0.94-0.96). Using a probability cutoff of 0.5 to assign class, inflammatory biomarkers (IL-6, IL-8, and sTNFR-1; < 0.0001) and 90-day mortality (38% vs. 24%; = 0.0002) were significantly higher in the hyperinflammatory phenotype as classified by the model. Model accuracy was similar when ALVEOLI (AUC, 0.94; 95% CI, 0.92-0.96) and FACTT (AUC, 0.94; 95% CI, 0.92-0.95) were used as the validation cohorts. Significant treatment interactions were observed with the clinical classifier model-assigned phenotypes in both ALVEOLI ( = 0.0113) and FACTT ( = 0.0072) cohorts. ARDS phenotypes can be accurately identified using machine learning models based on readily available clinical data and may enable rapid phenotype identification at the bedside.
两种不同表型的急性呼吸窘迫综合征(ARDS),其临床结局和对随机分配治疗的反应也不同,这在使用潜在类别分析的随机对照试验队列中得到了一致的证实。在表型识别中,作为关键组成部分的血浆生物标志物目前缺乏即时检测检测方法,这也是表型临床应用的障碍。本研究的目的是开发一种仅使用现成的临床数据对 ARDS 表型进行分类的模型。三个随机对照试验队列被用作训练数据集(ARMA[高与低 VT]、ALVEOLI[评估低 VT 和升高呼气末压以避免肺损伤]和 FACTT[液体和导管治疗试验];=2022),第四个队列被用作验证数据集(SAILS[他汀类药物治疗脓毒症急性肺损伤];=745)。使用梯度提升机算法,根据入组时的 24 个变量(人口统计学、生命体征、实验室和呼吸变量)开发分类器模型。在两项二次分析中,ALVEOLI 和 FACTT 队列分别作为验证数据集,其余合并队列作为每个分析的训练数据集。使用受试者工作特征曲线下的面积(AUC)评估模型在验证队列中的性能(AUC,0.95;95%置信区间[CI],0.94-0.96)。使用概率截止值 0.5 来分配类别,模型分类的高炎症表型中炎症生物标志物(IL-6、IL-8 和 sTNFR-1;<0.0001)和 90 天死亡率(38% vs. 24%;=0.0002)明显更高。当使用 ALVEOLI(AUC,0.94;95%CI,0.92-0.96)和 FACTT(AUC,0.94;95%CI,0.92-0.95)作为验证队列时,模型的准确性也相似。在 ALVEOLI(=0.0113)和 FACTT(=0.0072)队列中,观察到与临床分类模型分配表型之间存在显著的治疗相互作用。
基于现有临床数据的机器学习模型可以准确识别 ARDS 表型,并且可能使床边快速表型识别成为可能。