一种基于最优特征选择的可解释机器学习模型，用于识别轻度创伤性脑损伤患者的CT异常。

An interpretable machine learning model based on optimal feature selection for identifying CT abnormalities in patients with mild traumatic brain injury.

作者信息

Pan Yuling, Wei Mengqi, Jin Mengyuan, Liang Ying, Yi Tianjiao, Tu Jiancheng, Wu Shimin, Hu Fang, Liang Chunzi

机构信息

School of Laboratory Medicine, Hubei University of Chinese Medicine, 16 Huangjia Lake West Road, Wuhan, 430065, China.

Hubei Shizhen Laboratory, Hubei University of Chinese Medicine, 16 Huangjia Lake West Road, Wuhan, 430065, China.

出版信息

EClinicalMedicine. 2025 Apr 3;82:103192. doi: 10.1016/j.eclinm.2025.103192. eCollection 2025 Apr.

DOI:10.1016/j.eclinm.2025.103192

PMID:40242564

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12002887/

Abstract

BACKGROUND

Minor head trauma is a frequent cause of emergency department visits, early identification and prediction of mild traumatic brain injury (mTBI) patients with abnormal brain lesions are vital for minimizing unnecessary computed tomography (CT) scans, reducing radiation exposure, and ensuring timely effective treatment and care. This study aims to develop and validate an interpretable machine learning (ML) prediction model using routine laboratory data for guiding clinical decisions on CT scan use in mTBI patients.

METHODS

We conducted a multicentre study in China including data from January 2019 to July 2024. Our study included three patient cohorts: a retrospective training cohort (654 patients for training and 163 for internal testing) and two prospective validation cohorts (86 internal and 290 external patients). Fifty-one routine clinical laboratory characteristics, readily available from the electronic medical record (EMR) system within the first 24 h of admission, were collected. Seven ML algorithms were trained to develop predictive models, with the random forest (RF) algorithm used to optimize key feature combinations. Model predictive performance was evaluated using metrics such as the area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), and F1 scores. The SHapley Additive exPlanation (SHAP) was applied to interpret the final model, while decision curve analysis (DCA) was used to assess the clinical net benefit.

FINDINGS

In the derivation cohort, 599 (73.3%) patients had normal CT scans and 218 (26.7%) had abnormal CT scans. The Gradient boosting classifier (GBC) model performed best among the seven ML models, with an AUC of 0.932 (95% CI: 0.900-0.963). After reducing features to 21 (8 biochemical test indicators, 3 coagulation markers, and 10 complete blood cell count indicators) according to feature importance rank, an explainable GBC-final model was established. The final model accurately predicted mTBI patients with abnormal CT in both internal (AUC 0.926, 95% CI: 0.893-0.958) and external (AUC 0.904, 95% CI: 0.835-0.973) validation cohorts. In the prospective cohort, final GBC model achieved AUC of 0.885 (95% CI: 0.753-1.000) and was significantly superior to traditional TBI biomarkers GFAP (AUC: 0.745) and PGP9.5 (AUC: 0.794). DCA revealed that the final model offered greater net benefits than "full intervention" or "no intervention" strategies within a probability threshold range of 0.16-0.93. SHAP analysis identified D-dimer levels, absolute lymphocyte and neutrophil counts, and hematocrit as key high-risk features.

INTERPRETATION

Our optimal feature selection-based ML model accurately and reliably predicts CT abnormalities in mTBI patients using routine test data. By addressing clinicians' concerns regarding transparency and decision-making through SHAP and DCA analyses, we strengthen the potential clinical applicability of our ML model.

FUNDING

The Natural Science Foundation of Hubei Province, high-level Talent Research Startup Funding of Hubei University of Chinese Medicine, Wuhan Health and Family Planning Scientific Research Fund Project of Hubei Province, and Machine Learning-based Intelligent Diagnosis System for AFP-negative Liver Cancer Project.

摘要

背景

轻度头部外伤是急诊科就诊的常见原因，早期识别和预测脑损伤异常的轻度创伤性脑损伤（mTBI）患者对于减少不必要的计算机断层扫描（CT）、降低辐射暴露以及确保及时有效的治疗和护理至关重要。本研究旨在开发并验证一种可解释的机器学习（ML）预测模型，该模型使用常规实验室数据来指导mTBI患者CT扫描使用的临床决策。

方法

我们在中国进行了一项多中心研究，纳入了2019年1月至2024年7月的数据。我们的研究包括三个患者队列：一个回顾性训练队列（654例用于训练，163例用于内部测试）和两个前瞻性验证队列（86例内部患者和290例外部患者）。收集了入院后24小时内可从电子病历（EMR）系统中轻松获取的51项常规临床实验室特征。训练了七种ML算法来开发预测模型，使用随机森林（RF）算法优化关键特征组合。使用受试者操作特征曲线下面积（AUC）、阳性预测值（PPV）和F1分数等指标评估模型预测性能。应用SHapley加法解释（SHAP）来解释最终模型，同时使用决策曲线分析（DCA）评估临床净效益。

结果

在推导队列中，599例（73.3%）患者CT扫描正常，218例（26.7%）患者CT扫描异常。梯度提升分类器（GBC）模型在七种ML模型中表现最佳，AUC为0.932（95%CI：0.900 - 0.963）。根据特征重要性排名将特征减少到21个（8个生化测试指标、3个凝血标志物和10个全血细胞计数指标）后，建立了一个可解释的GBC最终模型。最终模型在内部（AUC 0.926，95%CI：0.893 - 0.958）和外部（AUC 0.904，95%CI：0.835 - 0.973）验证队列中均准确预测了CT异常的mTBI患者。在前瞻性队列中，最终GBC模型的AUC为0.885（95%CI：0.753 - 1.000），明显优于传统TBI生物标志物GFAP（AUC：0.745）和PGP9.5（AUC：0.794）。DCA显示，在概率阈值范围为0.16 - 0.93内，最终模型比“完全干预”或“不干预”策略提供了更大的净效益。SHAP分析确定D - 二聚体水平、绝对淋巴细胞和中性粒细胞计数以及血细胞比容为关键高危特征。

解读

我们基于最优特征选择的ML模型使用常规测试数据准确可靠地预测了mTBI患者的CT异常。通过SHAP和DCA分析解决了临床医生对透明度和决策的担忧，我们增强了ML模型潜在的临床适用性。

资助

湖北省自然科学基金、湖北中医药大学高层次人才科研启动基金、湖北省武汉市卫生和计划生育科研基金项目以及基于机器学习的AFP阴性肝癌智能诊断系统项目。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0c/12002887/38b5df49b5ba/gr1.jpg

相似文献

An interpretable machine learning model based on optimal feature selection for identifying CT abnormalities in patients with mild traumatic brain injury.一种基于最优特征选择的可解释机器学习模型，用于识别轻度创伤性脑损伤患者的CT异常。

EClinicalMedicine. 2025 Apr 3;82:103192. doi: 10.1016/j.eclinm.2025.103192. eCollection 2025 Apr.

Development and validation of an interpretable machine learning model for predicting the risk of distant metastasis in papillary thyroid cancer: a multicenter study.用于预测乳头状甲状腺癌远处转移风险的可解释机器学习模型的开发与验证：一项多中心研究

EClinicalMedicine. 2024 Oct 30;77:102913. doi: 10.1016/j.eclinm.2024.102913. eCollection 2024 Nov.

Identification and validation of an explainable prediction model of acute kidney injury with prognostic implications in critically ill children: a prospective multicenter cohort study.识别并验证一种对危重症儿童急性肾损伤具有预后意义的可解释预测模型：一项前瞻性多中心队列研究。

EClinicalMedicine. 2024 Jan 5;68:102409. doi: 10.1016/j.eclinm.2023.102409. eCollection 2024 Feb.

Explainable Machine Learning Model for Predicting Persistent Sepsis-Associated Acute Kidney Injury: Development and Validation Study.用于预测持续性脓毒症相关急性肾损伤的可解释机器学习模型：开发与验证研究

J Med Internet Res. 2025 Apr 28;27:e62932. doi: 10.2196/62932.

Prediction of lumbar disc degeneration based on interpretable machine learning models: retrospective cohort study.基于可解释机器学习模型的腰椎间盘退变预测：回顾性队列研究

Spine J. 2025 Apr 9. doi: 10.1016/j.spinee.2025.04.004.

Development and validation of an interpretable machine learning model for predicting the risk of hepatocellular carcinoma in patients with chronic hepatitis B: a case-control study.用于预测慢性乙型肝炎患者肝细胞癌风险的可解释机器学习模型的开发与验证：一项病例对照研究

BMC Gastroenterol. 2025 Mar 11;25(1):157. doi: 10.1186/s12876-025-03697-2.

Risk of intraoperative hemorrhage during cesarean scar ectopic pregnancy surgery: development and validation of an interpretable machine learning prediction model.剖宫产瘢痕部位异位妊娠手术中术中出血的风险：一种可解释的机器学习预测模型的开发与验证

EClinicalMedicine. 2024 Nov 29;78:102969. doi: 10.1016/j.eclinm.2024.102969. eCollection 2024 Dec.

Highly sensitive detection platform-based diagnosis of oesophageal squamous cell carcinoma in China: a multicentre, case-control, diagnostic study.基于高灵敏度检测平台的中国食管鳞状细胞癌诊断：一项多中心、病例对照诊断研究。

Lancet Digit Health. 2024 Oct;6(10):e705-e717. doi: 10.1016/S2589-7500(24)00153-5.

Non-invasive Prediction of Lymph Node Metastasis in NSCLC Using Clinical, Radiomics, and Deep Learning Features From F-FDG PET/CT Based on Interpretable Machine Learning.基于可解释机器学习，利用F-FDG PET/CT的临床、影像组学和深度学习特征对非小细胞肺癌淋巴结转移进行无创预测

Acad Radiol. 2025 Mar;32(3):1645-1655. doi: 10.1016/j.acra.2024.11.037. Epub 2024 Dec 10.

Prediction of STAS in lung adenocarcinoma with nodules ≤ 2 cm using machine learning: a multicenter retrospective study.使用机器学习预测直径≤2 cm的肺腺癌中的STAS：一项多中心回顾性研究

BMC Cancer. 2025 Mar 7;25(1):417. doi: 10.1186/s12885-025-13783-z.

引用本文的文献

Machine Learning Models for Predicting Abnormal Brain CT Scan Findings in Mild Traumatic Brain Injury Patients.用于预测轻度创伤性脑损伤患者脑部CT扫描异常结果的机器学习模型

Arch Acad Emerg Med. 2025 Jun 28;13(1):e60. doi: 10.22037/aaemj.v13i1.2709. eCollection 2025.

本文引用的文献

A machine learning prediction model for Cardiac Amyloidosis using routine blood tests in patients with left ventricular hypertrophy.一种使用常规血液检测对左心室肥厚患者的心脏淀粉样变性进行机器学习预测的模型。

Sci Rep. 2024 Nov 19;14(1):28644. doi: 10.1038/s41598-024-77466-8.

The Use of Deep Learning and Machine Learning on Longitudinal Electronic Health Records for the Early Detection and Prevention of Diseases: Scoping Review.深度学习和机器学习在纵向电子健康记录中用于疾病的早期检测和预防的应用：范围综述。

J Med Internet Res. 2024 Aug 20;26:e48320. doi: 10.2196/48320.

Automated abdominal CT contrast phase detection using an interpretable and open-source artificial intelligence algorithm.使用可解释和开源人工智能算法进行自动腹部 CT 对比期检测。

Eur Radiol. 2024 Oct;34(10):6680-6687. doi: 10.1007/s00330-024-10769-6. Epub 2024 Apr 29.

Stakeholder perspectives towards diagnostic artificial intelligence: a co-produced qualitative evidence synthesis.利益相关者对诊断人工智能的看法：一项联合生成的定性证据综合分析

EClinicalMedicine. 2024 Mar 22;71:102555. doi: 10.1016/j.eclinm.2024.102555. eCollection 2024 May.

Development of a machine learning-based model to predict hepatic inflammation in chronic hepatitis B patients with concurrent hepatic steatosis: a cohort study.基于机器学习的模型预测慢性乙型肝炎合并肝脂肪变性患者肝脏炎症的研究：一项队列研究

EClinicalMedicine. 2024 Jan 16;68:102419. doi: 10.1016/j.eclinm.2023.102419. eCollection 2024 Feb.

Optimizing Clinical Decision Making with Decision Curve Analysis: Insights for Clinical Investigators.运用决策曲线分析优化临床决策：给临床研究者的见解

Healthcare (Basel). 2023 Aug 10;11(16):2244. doi: 10.3390/healthcare11162244.

Applicability of machine learning technique in the screening of patients with mild traumatic brain injury.机器学习技术在轻度创伤性脑损伤患者筛查中的适用性。

PLoS One. 2023 Aug 24;18(8):e0290721. doi: 10.1371/journal.pone.0290721. eCollection 2023.

Systemic immune inflammation index and peripheral blood carbon dioxide concentration at admission predict poor prognosis in patients with severe traumatic brain injury.入院时的全身免疫炎症指数和外周血二氧化碳浓度可预测严重创伤性脑损伤患者的预后不良。

Front Immunol. 2023 Jan 9;13:1034916. doi: 10.3389/fimmu.2022.1034916. eCollection 2022.

Statistical and machine learning approaches to predict the necessity for computed tomography in children with mild traumatic brain injury.统计和机器学习方法预测轻度创伤性脑损伤儿童是否需要进行计算机断层扫描。

PLoS One. 2023 Jan 3;18(1):e0278562. doi: 10.1371/journal.pone.0278562. eCollection 2023.

Stockholm score of lesion detection on computed tomography following mild traumatic brain injury (SELECT-TBI): study protocol for a multicentre, retrospective, observational cohort study.轻度创伤性脑损伤（SELECT-TBI）后计算机断层扫描的病变检测斯德哥尔摩评分：一项多中心、回顾性、观察性队列研究的研究方案。

BMJ Open. 2022 Sep 1;12(9):e060679. doi: 10.1136/bmjopen-2021-060679.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种基于最优特征选择的可解释机器学习模型，用于识别轻度创伤性脑损伤患者的CT异常。

An interpretable machine learning model based on optimal feature selection for identifying CT abnormalities in patients with mild traumatic brain injury.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

FINDINGS

INTERPRETATION

FUNDING

背景

方法

结果

解读

资助

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献