Dombowsky Alexander, Dunson David B, Madut Deng B, Rubach Matthew P, Herring Amy H
Department of Statistical Science, Duke University, NC, USA.
Department of Mathematics, Duke University, NC, USA.
Ann Appl Stat. 2025 Sep;19(3):2193-2217. doi: 10.1214/25-aoas2045. Epub 2025 Aug 28.
Sepsis is a life-threatening condition caused by a dysregulated host response to infection. Recently, researchers have hypothesized that sepsis consists of a heterogeneous spectrum of distinct subtypes, motivating several studies to identify clusters of sepsis patients that correspond to subtypes, with the long-term goal of using these clusters to design subtype-specific treatments. Therefore, clinicians rely on clusters having a concrete medical interpretation, usually corresponding to clinically meaningful regions of the sample space that have a concrete implication to practitioners. In this article, we propose Clustering Around Meaningful Regions (CLAMR), a Bayesian clustering approach that explicitly models the medical interpretation of each cluster center. CLAMR favors clusterings that can be summarized via meaningful feature values, leading to medically significant sepsis patient clusters. We also provide details on measuring the effect of each feature on the clustering using Bayesian hypothesis tests, so one can assess what features are relevant for cluster interpretation. Our focus is on clustering sepsis patients from Moshi, Tanzania, where patients are younger and the prevalence of HIV infection is higher than in previous sepsis subtyping cohorts.
脓毒症是一种由宿主对感染的失调反应引起的危及生命的病症。最近,研究人员推测脓毒症由不同亚型的异质性谱组成,这促使多项研究去识别与亚型相对应的脓毒症患者集群,其长期目标是利用这些集群来设计亚型特异性治疗方法。因此,临床医生依赖于具有具体医学解释的集群,这些集群通常对应于样本空间中对从业者有具体意义的临床有意义区域。在本文中,我们提出了围绕有意义区域聚类(CLAMR),这是一种贝叶斯聚类方法,它明确地对每个聚类中心的医学解释进行建模。CLAMR支持可以通过有意义的特征值进行总结的聚类,从而产生具有医学意义的脓毒症患者集群。我们还提供了使用贝叶斯假设检验来测量每个特征对聚类的影响的详细信息,这样就可以评估哪些特征与聚类解释相关。我们的重点是对来自坦桑尼亚莫希的脓毒症患者进行聚类,那里的患者更年轻,且艾滋病毒感染率高于以前的脓毒症亚型队列。