Li Tian-Ning, Liu Yan-Hong, Yiu Kwok-Leung, Liu Lu, Han Meng, Ma Wei-Jia, Zhou Chun-Lei, Mu Hong
Department of Clinical Lab, Tianjin First Central Hospital, Tianjin, China.
Tianjin Union Medical Center, Nankai University, Tianjin, China.
Immun Inflamm Dis. 2025 Aug;13(8):e70237. doi: 10.1002/iid3.70237.
In the aftermath of the COVID-19 pandemic, China witnessed a surge in respiratory virus infections, which presented considerable challenges to primary health care systems. This study developed an interpretable prediction model using complete blood count (CBC) test data. This model aims to identify common respiratory virus infections in patients.
The study's derivation cohort included 7471 patients who presented with fever at Central Hospital between November and December 2023. Each patient underwent diagnostic procedures, including influenza A (Flu A) and Mycoplasma pneumoniae (MP) antibody testing and CBC. On the basis of the results of the CBC and patients' basic information, modelling and prediction through machine learning (ML) were performed, and external verification was conducted.
Among the developed models, we constructed two distinct versions of the three-class model: one emphasizing high recall and the other balancing precision and recall. The final model was refined through manual parameter adjustments and a comprehensive network search. The high-recall model demonstrated superior performance in detecting Flu A, with a recall rate of 81.0%. Conversely, the precision‒recall balanced model exhibited enhanced accuracy in identifying MP cases, with a precision rate of 84.3%.
Our interpretable ML model not only achieves accurate identification of Flu A and MP infections in febrile patients but also addresses the prevalent "black box" concerns associated with ML techniques. This technique can aid clinicians in enhancing diagnostic efficiency and accuracy. Therefore, this improvement can lead to reduced medical expenses by minimizing unnecessary tests and treatments.
在新冠疫情之后,中国呼吸道病毒感染激增,这给基层医疗系统带来了巨大挑战。本研究利用全血细胞计数(CBC)检测数据开发了一种可解释的预测模型。该模型旨在识别患者中常见的呼吸道病毒感染。
该研究的推导队列包括2023年11月至12月在中心医院出现发热症状的7471名患者。每位患者都接受了诊断程序,包括甲型流感(Flu A)和肺炎支原体(MP)抗体检测以及CBC检测。基于CBC检测结果和患者的基本信息,通过机器学习(ML)进行建模和预测,并进行外部验证。
在开发的模型中,我们构建了两类不同版本的三类模型:一类强调高召回率,另一类平衡精确率和召回率。最终模型通过手动参数调整和全面的网络搜索进行了优化。高召回率模型在检测甲型流感方面表现出卓越性能,召回率为81.0%。相反,精确率-召回率平衡模型在识别MP病例方面表现出更高的准确性,精确率为84.3%。
我们的可解释ML模型不仅能够准确识别发热患者中的甲型流感和MP感染,还解决了与ML技术相关的普遍存在的“黑箱”问题。该技术可以帮助临床医生提高诊断效率和准确性。因此,这种改进可以通过减少不必要的检测和治疗来降低医疗费用。