IEEE Trans Med Imaging. 2022 Jul;41(7):1688-1698. doi: 10.1109/TMI.2022.3146973. Epub 2022 Jun 30.
When deep neural network (DNN) was first introduced to the medical image analysis community, researchers were impressed by its performance. However, it is evident now that a large number of manually labeled data is often a must to train a properly functioning DNN. This demand for supervision data and labels is a major bottleneck in current medical image analysis, since collecting a large number of annotations from experienced experts can be time-consuming and expensive. In this paper, we demonstrate that the eye movement of radiologists reading medical images can be a new form of supervision to train the DNN-based computer-aided diagnosis (CAD) system. Particularly, we record the tracks of the radiologists' gaze when they are reading images. The gaze information is processed and then used to supervise the DNN's attention via an Attention Consistency module. To the best of our knowledge, the above pipeline is among the earliest efforts to leverage expert eye movement for deep-learning-based CAD. We have conducted extensive experiments on knee X-ray images for osteoarthritis assessment. The results show that our method can achieve considerable improvement in diagnosis performance, with the help of gaze supervision.
当深度神经网络 (DNN) 首次被引入医学图像分析领域时,研究人员对其性能印象深刻。然而,现在很明显,大量的手动标记数据通常是训练一个正常工作的 DNN 所必需的。这种对监督数据和标签的需求是当前医学图像分析的一个主要瓶颈,因为从经验丰富的专家那里收集大量注释可能既耗时又昂贵。在本文中,我们证明了放射科医生阅读医学图像时的眼球运动可以成为一种新的监督形式,用于训练基于 DNN 的计算机辅助诊断 (CAD) 系统。特别是,我们记录了放射科医生阅读图像时的注视轨迹。然后,通过注意力一致性模块处理注视信息,并使用该信息来监督 DNN 的注意力。据我们所知,上述流水线是最早利用专家眼球运动进行基于深度学习的 CAD 的努力之一。我们已经在用于骨关节炎评估的膝关节 X 射线图像上进行了广泛的实验。结果表明,在注视监督的帮助下,我们的方法可以显著提高诊断性能。