Dai Yang, Lokhandwala Sharukh, Long William, Mark Roger, Lehman Li-Wei H
Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA.
IEEE EMBS Int Conf Biomed Health Inform. 2017 Feb;2017:401-404. doi: 10.1109/BHI.2017.7897290. Epub 2017 Apr 13.
Among critically-ill patients, hypotension represents a failure in compensatory mechanisms and may lead to organ hypoperfusion and failure. In this work, we adopt a data-driven approach for phenotype discovery and visualization of patient similarity and cohort structure in the intensive care unit (ICU). We used Hierarchical Dirichlet Process (HDP) as a nonparametric topic modeling technique to automatically learn a d-dimensional feature representation of patients that captures the latent "topic" structure of diseases, symptoms, medications, and findings documented in hospital discharge summaries. We then used the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm to convert the d-dimensional latent structure learned from HDP into a matrix of pairwise similarities for visualizing patient similarity and cohort structure. Using discharge summaries of a large patient cohort from the MIMIC II database, we evaluated the clinical utility of the discovered topic structure in phenotyping critically-ill patients who experienced hypotensive episodes. Our results indicate that the approach is able to reveal clinically interpretable clustering structure within our cohort and may potentially provide valuable insights to better understand the association between disease phenotypes and outcomes.
在重症患者中,低血压代表代偿机制失效,可能导致器官灌注不足和功能衰竭。在这项研究中,我们采用数据驱动的方法来发现表型,并可视化重症监护病房(ICU)中患者的相似性和队列结构。我们使用分层狄利克雷过程(HDP)作为非参数主题建模技术,自动学习患者的d维特征表示,该表示捕捉了出院小结中记录的疾病、症状、药物和检查结果的潜在“主题”结构。然后,我们使用t分布随机邻域嵌入(t-SNE)算法将从HDP学到的d维潜在结构转换为成对相似性矩阵,以可视化患者相似性和队列结构。利用MIMIC II数据库中一个大型患者队列的出院小结,我们评估了所发现的主题结构在对经历低血压发作的重症患者进行表型分析中的临床效用。我们的结果表明,该方法能够揭示我们队列中具有临床可解释性的聚类结构,并可能为更好地理解疾病表型与预后之间的关联提供有价值的见解。