Li Tongtong, Mao Junfeng, Yu Jiandong, Zhao Ziyang, Chen Miao, Yao Zhijun, Fang Lei, Hu Bin
School of Information Science and Engineering, Lanzhou University, Lanzhou, China.
Gansu Provincial Key Laboratory of Wearable Computing, Lanzhou University, Lanzhou, China.
Quant Imaging Med Surg. 2024 Aug 1;14(8):5526-5540. doi: 10.21037/qims-24-234. Epub 2024 Jul 22.
Lung cancer is a malignant tumor, for which pulmonary nodules are considered to be significant indicators. Early recognition and timely treatment of pulmonary nodules can contribute to improving the survival rate of patients with cancer. Positron emission tomography-computed tomography (PET/CT) is a noninvasive, fusion imaging technique that can obtain both functional and structural information of lung regions. However, studies of pulmonary nodules based on computer-aided diagnosis have primarily focused on the nodule level due to a reliance on the annotation of nodules, which is superficial and unable to contribute to the actual clinical diagnosis. The aim of this study was thus to develop a fully automated classification framework for a more comprehensive assessment of pulmonary nodules in PET/CT imaging data.
We developed a two-stage multimodal learning framework for the diagnosis of pulmonary nodules in PET/CT imaging. In this framework, Stage I focuses on pulmonary parenchyma segmentation using a pretrained U-Net and PET/CT registration. Stage II aims to extract, integrate, and recognize image-level and feature-level features by employing the three-dimensional (3D) Inception-residual net (ResNet) convolutional block attention module architecture and a dense-voting fusion mechanism.
In the experiments, the proposed model's performance was comprehensively validated using a set of real clinical data, achieving mean scores of 89.98%, 89.21%, 84.75%, 93.38%, 86.83%, and 0.9227 for accuracy, precision, recall, specificity, F1 score, and area under curve values, respectively.
This paper presents a two-stage multimodal learning approach for the automatic diagnosis of pulmonary nodules. The findings reveal that the main reason for limiting model performance is the nonsolitary property of nodules in pulmonary nodule diagnosis, providing direction for future research.
肺癌是一种恶性肿瘤,肺结节被认为是其重要指标。早期识别并及时治疗肺结节有助于提高癌症患者的生存率。正电子发射断层扫描-计算机断层扫描(PET/CT)是一种无创融合成像技术,可获取肺部区域的功能和结构信息。然而,基于计算机辅助诊断的肺结节研究主要集中在结节层面,因为依赖于结节标注,这种标注较为表面,无法对实际临床诊断提供帮助。因此,本研究的目的是开发一个全自动分类框架,以便更全面地评估PET/CT成像数据中的肺结节。
我们开发了一个用于PET/CT成像中肺结节诊断的两阶段多模态学习框架。在此框架中,第一阶段使用预训练的U-Net和PET/CT配准专注于肺实质分割。第二阶段旨在通过采用三维(3D)Inception-残差网络(ResNet)卷积块注意力模块架构和密集投票融合机制来提取、整合和识别图像级和特征级特征。
在实验中,使用一组真实临床数据对所提出模型的性能进行了全面验证,准确率、精确率、召回率、特异性、F1分数和曲线下面积值的平均得分分别为89.98%、89.21%、84.75%、93.38%、86.