Gao Yanhua, Liu Bo, Zhu Yuan, Chen Lin, Tan Miao, Xiao Xiaozhou, Yu Gang, Guo Youmin
Department of Medical Imaging, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China.
Department of Ultrasound, The Third Affiliated Hospital of Xi'an Jiaotong University, Shaanxi Provincial People's Hospital, Xi'an, China.
Quant Imaging Med Surg. 2021 Jun;11(6):2265-2278. doi: 10.21037/qims-20-12b.
The successful recognition of benign and malignant breast nodules using ultrasound images is based mainly on supervised learning that requires a large number of labeled images. However, because high-quality labeling is expensive and time-consuming, we hypothesized that semi-supervised learning could provide a low-cost and powerful alternative approach. This study aimed to develop an accurate semi-supervised recognition method and compared its performance with supervised methods and sonographers.
The faster region-based convolutional neural network was used for nodule detection from ultrasound images. A semi-supervised classifier based on the mean teacher model was proposed to recognize benign and malignant nodule images. The general performance of the proposed method on two datasets (8,966 nodules) was reported.
The detection accuracy was 0.88±0.03 and 0.86±0.02, respectively, on two testing sets (1,350 and 2,220 nodules). When 800 labeled training nodules were available, the proposed semi-supervised model plus 4,396 unlabeled nodules performed better than the supervised learning model (area under the curve (AUC): 0.934±0.026 0.83±0.050; 0.916±0.022 0.815±0.049). The performance of the semi-supervised model trained on 800 labeled and 4,396 unlabeled nodules was close to that of the supervised learning model trained on a massive number of labeled nodules (n=5,196) (AUC: 0.934±0.026 0.952±0.027; 0.916±0.022 0.918±0.017). Moreover, the semi-supervised model was better than the average accuracy of five human sonographers (AUC: 0.922 0.889).
The semi-supervised model can achieve excellent performance for nodule recognition and be useful for medical sciences. The method reduced the number of labeled images required for training, thus significantly alleviating the difficulty in data preparation of medical artificial intelligence.
利用超声图像成功识别乳腺良恶性结节主要基于监督学习,而这需要大量带标签的图像。然而,由于高质量的标注成本高昂且耗时,我们推测半监督学习可以提供一种低成本且强大的替代方法。本研究旨在开发一种准确的半监督识别方法,并将其性能与监督方法及超声医师的表现进行比较。
使用基于区域的快速卷积神经网络从超声图像中检测结节。提出了一种基于平均教师模型的半监督分类器来识别良恶性结节图像。报告了该方法在两个数据集(8966个结节)上的总体性能。
在两个测试集(分别为1350个和2220个结节)上,检测准确率分别为0.88±0.03和0.86±0.02。当有800个带标签的训练结节时,所提出的半监督模型加上4396个无标签结节的表现优于监督学习模型(曲线下面积(AUC):0.934±0.026对0.83±0.050;0.916±0.022对0.815±0.049)。在800个带标签和4396个无标签结节上训练的半监督模型的性能接近在大量带标签结节(n = 5196)上训练的监督学习模型(AUC:0.934±0.026对0.952±0.027;0.916±0.022对0.918±0.017)。此外,半监督模型优于五名超声医师的平均准确率(AUC:差0.922对0.889)。
半监督模型在结节识别方面可取得优异性能,对医学科学有用。该方法减少了训练所需的带标签图像数量,从而显著减轻了医学人工智能数据准备的难度。