College of Science, North China University of Science and Technology, Tangshan, Hebei, China.
Department of Respiratory Medicine, North China University of Science and Technology, Affiliated Hospital, Tangshan, Hebei, China.
Sci Rep. 2024 Sep 19;14(1):21884. doi: 10.1038/s41598-024-72310-5.
Evaluating Community-Acquired Pneumonia (CAP) is crucial for determining appropriate treatment methods. In this study, we established a machine learning model using radiomics and clinical features to rapidly and accurately identify Severe Community-Acquired Pneumonia (SCAP). A total of 174 CAP patients were included in the study, with 64 cases classified as SCAP. Radiomic features were extracted from chest CT scans using radiomics techniques, and screened to remove irrelevant features. Additionally, clinical indicators of patients were similarly screened and constituted the clinical feature set. Subsequently, eight common machine learning models were employed to complete the SCAP identification task. Specifically, interpretability analysis was conducted on the models. In the end, we screened out 15 radiomic features (such as LeastAxisLength, Maximum2DDiameterColumn and ZonePercentage) and two clinical features: Lymphocyte (p = 0.041) and Albumin (p = 0.044). Using radiomic features as inputs in model predictions yielded the highest AUC of 0.85 on the test set. When using the clinical feature set as model inputs, the AUC was 0.82. Combining the two sets of features as model inputs, Ada Boost achieved the best performance with an AUC of 0.89. Our study demonstrates that combining radiomics and clinical data using machine learning methods can more accurately identify SCAP patients.
评估社区获得性肺炎(CAP)对于确定适当的治疗方法至关重要。在这项研究中,我们使用放射组学和临床特征建立了一个机器学习模型,以快速准确地识别严重社区获得性肺炎(SCAP)。共有 174 名 CAP 患者纳入研究,其中 64 例被分类为 SCAP。使用放射组学技术从胸部 CT 扫描中提取放射组学特征,并进行筛选以去除不相关的特征。此外,还对患者的临床指标进行了类似的筛选,并构成了临床特征集。随后,使用了八种常见的机器学习模型来完成 SCAP 识别任务。具体来说,对模型进行了可解释性分析。最后,我们筛选出 15 个放射组学特征(如最小轴长、最大 2D 直径柱和区域百分比)和两个临床特征:淋巴细胞(p=0.041)和白蛋白(p=0.044)。使用放射组学特征作为模型预测的输入,在测试集上获得了最高的 AUC 为 0.85。当使用临床特征集作为模型输入时,AUC 为 0.82。将两组特征结合作为模型输入,Ada Boost 表现最佳,AUC 为 0.89。我们的研究表明,使用机器学习方法结合放射组学和临床数据可以更准确地识别 SCAP 患者。