Yuan Yubo, Liu Lijun, Yang Xiaobing, Liu Li, Huang Qingsong
School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500, China.
Key Laboratory of Application in Computer Technology in Yunnan Province, Kunming, 650500, China.
J Imaging Inform Med. 2024 Dec;37(6):2752-2767. doi: 10.1007/s10278-024-01133-7. Epub 2024 May 17.
Accurately identifying and locating lesions in chest X-rays has the potential to significantly enhance diagnostic efficiency, quality, and interpretability. However, current methods primarily focus on detecting of specific diseases in chest X-rays, disregarding the presence of multiple diseases in a single chest X-ray scan. Moreover, the diversity in lesion locations and attributes introduces complexity in accurately discerning specific traits for each lesion, leading to diminished accuracy when detecting multiple diseases. To address these issues, we propose a novel detection framework that enhances multi-scale lesion feature extraction and fusion, improving lesion position perception and subsequently boosting chest multi-disease detection performance. Initially, we construct a multi-scale lesion feature extraction network to tackle the uniqueness of various lesion features and locations, strengthening the global semantic correlation between lesion features and their positions. Following this, we introduce an instance-aware semantic enhancement network that dynamically amalgamates instance-specific features with high-level semantic representations across various scales. This adaptive integration effectively mitigates the loss of detailed information within lesion regions. Additionally, we perform lesion region feature mapping using candidate boxes to preserve crucial positional information, enhancing the accuracy of chest disease detection across multiple scales. Experimental results on the VinDr-CXR dataset reveal a 6% increment in mean average precision (mAP) and an 8.4% improvement in mean recall (mR) when compared to state-of-the-art baselines. This demonstrates the effectiveness of the model in accurately detecting multiple chest diseases by capturing specific features and location information.
准确识别和定位胸部X光片中的病变有可能显著提高诊断效率、质量和可解释性。然而,目前的方法主要集中在检测胸部X光片中的特定疾病,而忽略了单次胸部X光扫描中存在多种疾病的情况。此外,病变位置和属性的多样性给准确辨别每个病变的特定特征带来了复杂性,导致在检测多种疾病时准确性降低。为了解决这些问题,我们提出了一种新颖的检测框架,该框架增强了多尺度病变特征提取和融合,改善了病变位置感知,进而提高了胸部多疾病检测性能。首先,我们构建了一个多尺度病变特征提取网络,以应对各种病变特征和位置的独特性,加强病变特征与其位置之间的全局语义相关性。在此之后,我们引入了一个实例感知语义增强网络,该网络在不同尺度上动态地将实例特定特征与高级语义表示融合在一起。这种自适应集成有效地减轻了病变区域内详细信息的损失。此外,我们使用候选框进行病变区域特征映射,以保留关键的位置信息,提高跨多个尺度的胸部疾病检测准确性。在VinDr-CXR数据集上的实验结果表明,与最先进的基线相比,平均精度均值(mAP)提高了6%,平均召回率(mR)提高了8.4%。这证明了该模型通过捕捉特定特征和位置信息来准确检测多种胸部疾病的有效性。