Xie Si, Wu Mo, Shang Yu, Tuo Wenbin, Wang Jun, Cai Qinzhen, Yuan Chunhui, Yao Cong, Xiang Yun
Department of Laboratory Medicine, Wuhan Children's Hospital (Wuhan Maternal and Child Healthcare Hospital), Tongji Medical College, Huazhong University of Science & Technology, Wuhan, 430016, China.
Health Care Department, Wuhan Children's Hospital (Wuhan Maternal and Child Healthcare Hospital), Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430016, China.
Respir Res. 2025 May 13;26(1):182. doi: 10.1186/s12931-025-03262-1.
Pneumonia is a major threat to the health of children, especially those under the age of five. Mycoplasma pneumoniae infection is a core cause of pediatric pneumonia, and the incidence of severe mycoplasma pneumoniae pneumonia (SMPP) has increased in recent years. Therefore, there is an urgent need to establish an early warning model for SMPP to improve the prognosis of pediatric pneumonia.
The study comprised 597 SMPP patients aged between 1 month and 18 years. Clinical data were selected through Lasso regression analysis, followed by the application of eight machine learning algorithms to develop early warning model. The accuracy of the model was assessed using validation and prospective cohort. To facilitate clinical assessment, the study simplified the indicators and constructed visualized simplified model. The clinical applicability of the model was evaluated by DCA and CIC curve.
After variable selection, eight machine learning models were developed using age, sex and 21 serum indicators identified as predictive factors for SMPP. A Light Gradient Boosting Machine (LightGBM) model demonstrated strong performance, achieving AUC of 0.92 for prospective validation. The SHAP analysis was utilized to screen advantageous variables, which contains of serum S100A8/A9, tracheal computed tomography (CT), retinol-binding protein(RBP), platelet larger cell ratio(P-LCR) and CD4+CD25+Treg cell counts, for constructing a simplified model (SCRPT) to improve clinical applicability. The SCRPT diagnostic model exhibited favorable diagnostic efficacy (AUC > 0.8). Additionally, the study found that S100A8/A9 outperformed clinical inflammatory markers can also differentiate the severity of MPP.
The SCRPT model consisting of five dominant variables (S100A8/A9, CT, RBP, PLCR and Treg cell) screened based on eight machine learning is expected to be a tool for early diagnosis of SMPP. S100A8/A9 can also be used as a biomarker for validity differentiation of SMPP when medical conditions are limited.
肺炎是儿童健康的主要威胁,尤其是五岁以下儿童。肺炎支原体感染是小儿肺炎的核心病因,近年来重症肺炎支原体肺炎(SMPP)的发病率有所上升。因此,迫切需要建立SMPP的早期预警模型,以改善小儿肺炎的预后。
本研究纳入了597例年龄在1个月至18岁之间的SMPP患者。通过Lasso回归分析选择临床数据,随后应用八种机器学习算法建立早期预警模型。使用验证集和前瞻性队列评估模型的准确性。为便于临床评估,研究简化了指标并构建了可视化简化模型。通过决策曲线分析(DCA)和一致性指数曲线(CIC)评估模型的临床适用性。
经过变量筛选,利用年龄、性别和21种血清指标作为SMPP的预测因素,开发了八个机器学习模型。轻量级梯度提升机(LightGBM)模型表现出色,前瞻性验证的AUC为0.92。利用SHAP分析筛选优势变量,包括血清S100A8/A9、气管计算机断层扫描(CT)、视黄醇结合蛋白(RBP)、血小板大细胞比率(P-LCR)和CD4+CD25+调节性T细胞计数,构建简化模型(SCRPT)以提高临床适用性。SCRPT诊断模型具有良好的诊断效能(AUC>0.8)。此外,研究发现S100A8/A9优于临床炎症标志物,也可区分MPP的严重程度。
基于八种机器学习筛选出的由五个主要变量(S100A8/A9、CT、RBP、PLCR和调节性T细胞)组成的SCRPT模型有望成为SMPP早期诊断的工具。当医疗条件有限时,S100A8/A9也可作为SMPP有效性鉴别的生物标志物。