Yang Mingyan, Tanaka Hisashi, Ishida Takayuki
Division of Health Sciences, Graduate School of Medicine, Osaka University, 1-7 yamadaoka, suita, Osaka, 565-0871, Japan.
Division of Health Sciences, Osaka University, 1-7 yamadaoka, suita, Osaka, 565-0871, Japan.
Int J Comput Assist Radiol Surg. 2023 Jan;18(1):181-189. doi: 10.1007/s11548-022-02684-2. Epub 2022 May 26.
This study aimed at developing a deep learning-based method for multi-label thoracic abnormality classification on frontal view chest X-ray (CXR). To improve the performance of classification, issues of class imbalance, noisy labels and ensemble of networks are addressed in the paper.
The experiments were performed on a public dataset called Chest X-ray 14 (CXR14), which includes 112,120 frontal view CXRs from 30,805 patients. We came up with an ensemble learning framework to improve the classification and a noisy label detection method to detect the CXRs with noisy labels. The detected CXRs were reviewed by two board-certificated radiologists in a consensus fashion to evaluate detected noisy labels. The classification was assessed on CXR14 with area under the receiver operating characteristic curve (AUC).
Report from the radiologists indicated that detected noisy labels had high possibility to be true positives. A notable improvement from baseline in performance of classification was observed with the ensemble learning framework. After removing the CXRs with detected noisy labels, 8 out of 14 abnormalities improved significantly on CXR14. The suggested framework achieved AUC score of 0.827 on CXR14.
The methods of this study boost the classification on CXR with awareness of the label noise. Expanded experimental results show that all of them were able to improve multi-label thoracic abnormality classification performance, respectively. A new state-of-the-art is achieved in this study.
本研究旨在开发一种基于深度学习的方法,用于对胸部正位X线片(CXR)上的多标签胸部异常进行分类。为提高分类性能,本文探讨了类别不平衡、噪声标签和网络集成等问题。
实验在一个名为胸部X线14(CXR14)的公共数据集上进行,该数据集包含来自30805名患者的112120张胸部正位X线片。我们提出了一个集成学习框架来改进分类,并提出了一种噪声标签检测方法来检测带有噪声标签的CXR。两名具有专业资格的放射科医生以共识方式对检测到的CXR进行复查,以评估检测到的噪声标签。使用受试者工作特征曲线下面积(AUC)在CXR14上评估分类。
放射科医生的报告表明,检测到的噪声标签很可能是真阳性。集成学习框架使分类性能相对于基线有了显著提高。在去除检测到带有噪声标签的CXR后,CXR14上14种异常中的8种有了显著改善。所提出的框架在CXR14上的AUC得分为0.827。
本研究的方法在考虑标签噪声的情况下提高了CXR的分类性能。扩展的实验结果表明,所有方法都能够分别提高多标签胸部异常的分类性能。本研究实现了新的技术水平。