Department of Respiratory and Critical Care Medicine, West China School of Medicine and West China Hospital, Sichuan University, No. 37 Guoxue Alley, Chengdu, 610041, China.
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, 610065, China.
Respir Res. 2024 Jul 24;25(1):286. doi: 10.1186/s12931-024-02911-1.
The use of machine learning(ML) methods would improve the diagnosis of small airway dysfunction(SAD) in subjects with chronic respiratory symptoms and preserved pulmonary function(PPF). This paper evaluated the performance of several ML algorithms associated with the impulse oscillometry(IOS) analysis to aid in the diagnostic of respiratory changes in SAD. We also find out the best configuration for this task.
IOS and spirometry were measured in 280 subjects, including a healthy control group (n = 78), a group with normal spirometry (n = 158) and a group with abnormal spirometry (n = 44). Various supervised machine learning (ML) algorithms and feature selection strategies were examined, such as Support Vector Machines (SVM), Random Forests (RF), Adaptive Boosting (ADABOOST), Navie Bayesian (BAYES), and K-Nearest Neighbors (KNN).
The first experiment of this study demonstrated that the best oscillometric parameter (BOP) was R5, with an AUC value of 0.642, when comparing a healthy control group(CG) with patients in the group without lung volume-defined SAD(PPFN). The AUC value of BOP in the control group was 0.769 compared with patients with spirometry defined SAD(PPFA) in the PPF population. In the second experiment, the ML technique was used. In CGvsPPFN, RF and ADABOOST had the best diagnostic results (AUC = 0.914, 0.915), with significantly higher accuracy compared to BOP (p < 0.01). In CGvsPPFA, RF and ADABOOST had the best diagnostic results (AUC = 0.951, 0.971) and significantly higher diagnostic accuracy (p < 0.01). In the third, fourth and fifth experiments, different feature selection techniques allowed us to find the best IOS parameters (R5, (R5-R20)/R5 and Fres). The results demonstrate that the performance of ADABOOST remained essentially unaltered following the application of the feature selector, whereas the diagnostic accuracy of the remaining four classifiers (RF, SVM, BAYES, and KNN) is marginally enhanced.
IOS combined with ML algorithms provide a new method for diagnosing SAD in subjects with chronic respiratory symptoms and PPF. The present study's findings provide evidence that this combination may help in the early diagnosis of respiratory changes in these patients.
使用机器学习 (ML) 方法将提高对有慢性呼吸系统症状和保留肺功能 (PPF) 的小气道功能障碍 (SAD) 患者的诊断。本文评估了几种与脉冲振荡法 (IOS) 分析相关的 ML 算法的性能,以辅助诊断 SAD 中的呼吸变化。我们还找到了完成此任务的最佳配置。
对 280 名受试者进行了 IOS 和肺量计检查,包括健康对照组 (n = 78)、肺量计正常组 (n = 158) 和肺量计异常组 (n = 44)。检查了各种监督机器学习 (ML) 算法和特征选择策略,例如支持向量机 (SVM)、随机森林 (RF)、自适应提升 (ADABOOST)、朴素贝叶斯 (BAYES) 和 K 最近邻 (KNN)。
本研究的第一个实验表明,在比较健康对照组 (CG) 与无肺容积定义 SAD 组 (PPFN) 的患者时,最佳振荡参数 (BOP) 为 R5,AUC 值为 0.642。在 PPF 人群中,BOP 在 CG 与肺量计定义的 SAD 组 (PPFA) 患者之间的 AUC 值为 0.769。在第二个实验中,使用 ML 技术。在 CGvsPPFN 中,RF 和 ADABOOST 具有最佳的诊断结果 (AUC = 0.914、0.915),准确性明显高于 BOP(p < 0.01)。在 CGvsPPFA 中,RF 和 ADABOOST 具有最佳的诊断结果 (AUC = 0.951、0.971),且诊断准确性显著提高 (p < 0.01)。在第三、四、五个实验中,不同的特征选择技术使我们能够找到最佳的 IOS 参数 (R5、(R5-R20)/R5 和 Fres)。结果表明,ADABOOST 的性能在应用特征选择器后基本保持不变,而其余四个分类器 (RF、SVM、BAYES 和 KNN) 的诊断准确性略有提高。
IOS 结合 ML 算法为诊断有慢性呼吸系统症状和 PPF 的 SAD 患者提供了一种新方法。本研究的结果表明,这种组合可能有助于早期诊断这些患者的呼吸变化。