Ren Yuanfang, Loftus Tyler J, Li Yanjun, Guan Ziyuan, Ruppert Matthew M, Datta Shounak, Upchurch Gilbert R, Tighe Patrick J, Rashidi Parisa, Shickel Benjamin, Ozrazgat-Baslanti Tezcan, Bihorac Azra
Intelligent Critical Care Center (IC), University of Florida, Gainesville, Florida, United States of America.
Department of Medicine, Division of Nephrology, Hypertension, and Renal Transplantation, University of Florida, Gainesville, Florida, United States of America.
PLOS Digit Health. 2022;1(10). doi: 10.1371/journal.pdig.0000110. Epub 2022 Oct 13.
During the early stages of hospital admission, clinicians use limited information to make decisions as patient acuity evolves. We hypothesized that clustering analysis of vital signs measured within six hours of hospital admission would reveal distinct patient phenotypes with unique pathophysiological signatures and clinical outcomes. We created a longitudinal electronic health record dataset for 75,762 adult patient admissions to a tertiary care center in 2014-2016 lasting six hours or longer. Physiotypes were derived via unsupervised machine learning in a training cohort of 41,502 patients applying consensus -means clustering to six vital signs measured within six hours of admission. Reproducibility and correlation with clinical biomarkers and outcomes were assessed in validation cohort of 17,415 patients and testing cohort of 16,845 patients. Training, validation, and testing cohorts had similar age (54-55 years) and sex (55% female), distributions. There were four distinct clusters. Physiotype A had physiologic signals consistent with early vasoplegia, hypothermia, and low-grade inflammation and favorable short-and long-term clinical outcomes despite early, severe illness. Physiotype B exhibited early tachycardia, tachypnea, and hypoxemia followed by the highest incidence of prolonged respiratory insufficiency, sepsis, acute kidney injury, and short- and long-term mortality. Physiotype C had minimal early physiological derangement and favorable clinical outcomes. Physiotype D had the greatest prevalence of chronic cardiovascular and kidney disease, presented with severely elevated blood pressure, and had good short-term outcomes but suffered increased 3-year mortality. Comparing sequential organ failure assessment (SOFA) scores across physiotypes demonstrated that clustering did not simply recapitulate previously established acuity assessments. In a heterogeneous cohort of hospitalized patients, unsupervised machine learning techniques applied to routine, early vital sign data identified physiotypes with unique disease categories and distinct clinical outcomes. This approach has the potential to augment understanding of pathophysiology by distilling thousands of disease states into a few physiological signatures.
在住院早期阶段,随着患者病情严重程度的发展,临床医生会利用有限的信息来做出决策。我们假设,对入院后6小时内测得的生命体征进行聚类分析,将揭示具有独特病理生理特征和临床结局的不同患者表型。我们创建了一个纵向电子健康记录数据集,该数据集涵盖了2014年至2016年期间在一家三级医疗中心入院的75762名成年患者,持续时间为6小时或更长。通过无监督机器学习,在一个由41502名患者组成的训练队列中,对入院后6小时内测得的六项生命体征应用一致性均值聚类法得出生理类型。在17415名患者的验证队列和16845名患者的测试队列中评估了可重复性以及与临床生物标志物和结局的相关性。训练、验证和测试队列的年龄(54 - 55岁)和性别(55%为女性)分布相似。有四个不同的聚类。生理类型A具有与早期血管麻痹、体温过低和低度炎症一致的生理信号,尽管早期病情严重,但短期和长期临床结局良好。生理类型B表现为早期心动过速、呼吸急促和低氧血症,随后发生长时间呼吸功能不全、脓毒症、急性肾损伤以及短期和长期死亡率的发生率最高。生理类型C早期生理紊乱最小,临床结局良好。生理类型D慢性心血管和肾脏疾病的患病率最高,表现为血压严重升高,短期结局良好,但3年死亡率增加。比较不同生理类型的序贯器官衰竭评估(SOFA)评分表明,聚类并非简单地重复先前确立的病情严重程度评估。在一组异质性住院患者中,将无监督机器学习技术应用于常规的早期生命体征数据,识别出了具有独特疾病类别和不同临床结局的生理类型。这种方法有可能通过将数千种疾病状态提炼为几种生理特征来增强对病理生理学的理解。