University of Bristol, Bristol BS1 5DD, UK.
University of Bristol, Bristol BS8 1TD, UK.
Exp Biol Med (Maywood). 2023 Dec;248(24):2547-2559. doi: 10.1177/15353702231214253. Epub 2023 Dec 15.
We present a pipeline in which machine learning techniques are used to automatically identify and evaluate subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. Patient clusters are determined using routinely collected hospital data, such as those used in the UK's National Early Warning Score 2 (NEWS2). An iterative, hierarchical clustering process was used to identify the minimum set of relevant features for cluster separation. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning, illustrating their robustness. In parallel, clinicians assessed intracluster similarities and intercluster differences of the identified patient subtypes within the context of their clinical knowledge. For each cluster, outcome prediction models were trained and their forecasting ability was illustrated against the NEWS2 of the unclustered patient cohort. These preliminary results suggest that subtype models can outperform the established NEWS2 method, providing improved prediction of patient deterioration. By considering both the computational outputs and clinician-based explanations in patient subtyping, we aim to highlight the mutual benefit of combining machine learning techniques with clinical expertise.
我们提出了一个管道,其中使用机器学习技术自动识别和评估 2017 年至 2021 年间在一家英国大型教学医院住院的患者亚组。使用常规收集的医院数据(如英国国家早期预警评分 2(NEWS2)中使用的数据)确定患者聚类。使用迭代、分层聚类过程确定用于聚类分离的最小相关特征集。通过使用最先进的可解释性技术,对识别出的亚组进行解释并赋予临床意义,展示其稳健性。同时,临床医生根据其临床知识评估了所识别的患者亚组内的聚类内相似性和聚类间差异。为每个聚类训练了结果预测模型,并针对未聚类患者队列的 NEWS2 展示了它们的预测能力。这些初步结果表明,亚组模型可以优于既定的 NEWS2 方法,提供对患者恶化的改善预测。通过在患者亚组中同时考虑计算输出和基于临床医生的解释,我们旨在强调将机器学习技术与临床专业知识相结合的互惠互利。