de Moura Luís Vinícius, Mattjie Christian, Dartora Caroline Machado, Barros Rodrigo C, Marques da Silva Ana Maria
Medical Image Computing Laboratory, School of Technology, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil.
Graduate Program in Biomedical Gerontology, School of Medicine, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil.
Front Digit Health. 2022 Jan 17;3:662343. doi: 10.3389/fdgth.2021.662343. eCollection 2021.
Both reverse transcription-PCR (RT-PCR) and chest X-rays are used for the diagnosis of the coronavirus disease-2019 (COVID-19). However, COVID-19 pneumonia does not have a defined set of radiological findings. Our work aims to investigate radiomic features and classification models to differentiate chest X-ray images of COVID-19-based pneumonia and other types of lung patterns. The goal is to provide grounds for understanding the distinctive COVID-19 radiographic texture features using supervised ensemble machine learning methods based on trees through the interpretable Shapley Additive Explanations (SHAP) approach. We use 2,611 COVID-19 chest X-ray images and 2,611 non-COVID-19 chest X-rays. After segmenting the lung in three zones and laterally, a histogram normalization is applied, and radiomic features are extracted. SHAP recursive feature elimination with cross-validation is used to select features. Hyperparameter optimization of XGBoost and Random Forest ensemble tree models is applied using random search. The best classification model was XGBoost, with an accuracy of 0.82 and a sensitivity of 0.82. The explainable model showed the importance of the middle left and superior right lung zones in classifying COVID-19 pneumonia from other lung patterns.
逆转录聚合酶链反应(RT-PCR)和胸部X光检查都用于诊断2019冠状病毒病(COVID-19)。然而,COVID-19肺炎并没有明确的一组影像学表现。我们的工作旨在研究放射组学特征和分类模型,以区分基于COVID-19的肺炎的胸部X光图像和其他类型的肺部影像模式。目标是通过基于树的可解释的Shapley加性解释(SHAP)方法,使用监督集成机器学习方法,为理解COVID-19独特的放射学纹理特征提供依据。我们使用了2611张COVID-19胸部X光图像和2611张非COVID-19胸部X光图像。在将肺部分为三个区域并进行横向分割后,应用直方图归一化,并提取放射组学特征。使用带有交叉验证的SHAP递归特征消除来选择特征。使用随机搜索对XGBoost和随机森林集成树模型进行超参数优化。最佳分类模型是XGBoost,准确率为0.82,灵敏度为0.82。可解释模型显示了左中肺区和右上肺区在将COVID-19肺炎与其他肺部影像模式分类中的重要性。
Curr Med Imaging. 2023
Chaos Solitons Fractals. 2021-4
J Imaging. 2022-9-30
IEEE Access. 2021-3-9
Appl Intell (Dordr). 2021
Appl Soft Comput. 2021-11