Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China.
School of Communication, The Hang Seng University of Hong Kong, Hang Shin Link, Siu Lek Yuen, Shatin, Hong Kong, China.
Sensors (Basel). 2021 Mar 29;21(7):2375. doi: 10.3390/s21072375.
Deep reinforcement learning (DRL) has been utilized in numerous computer vision tasks, such as object detection, autonomous driving, etc. However, relatively few DRL methods have been proposed in the area of image segmentation, particularly in left ventricle segmentation. Reinforcement learning-based methods in earlier works often rely on learning proper thresholds to perform segmentation, and the segmentation results are inaccurate due to the sensitivity of the threshold. To tackle this problem, a novel DRL agent is designed to imitate the human process to perform LV segmentation. For this purpose, we formulate the segmentation problem as a Markov decision process and innovatively optimize it through DRL. The proposed DRL agent consists of two neural networks, i.e., First-P-Net and Next-P-Net. The First-P-Net locates the initial edge point, and the Next-P-Net locates the remaining edge points successively and ultimately obtains a closed segmentation result. The experimental results show that the proposed model has outperformed the previous reinforcement learning methods and achieved comparable performances compared with deep learning baselines on two widely used LV endocardium segmentation datasets, namely Automated Cardiac Diagnosis Challenge (ACDC) 2017 dataset, and Sunnybrook 2009 dataset. Moreover, the proposed model achieves higher accuracy compared with deep learning methods when training with a very limited number of samples.
深度强化学习(DRL)已被应用于许多计算机视觉任务中,如目标检测、自动驾驶等。然而,在图像分割领域,尤其是在左心室分割方面,提出的 DRL 方法相对较少。早期工作中的基于强化学习的方法通常依赖于学习适当的阈值来进行分割,但是由于阈值的敏感性,分割结果不准确。为了解决这个问题,设计了一种新的 DRL 代理来模拟人类的过程进行 LV 分割。为此,我们将分割问题表述为马尔可夫决策过程,并通过 DRL 对其进行创新优化。所提出的 DRL 代理由两个神经网络组成,即 First-P-Net 和 Next-P-Net。First-P-Net 定位初始边缘点,Next-P-Net 依次定位其余边缘点,最终获得封闭的分割结果。实验结果表明,该模型在两个广泛使用的 LV 心内膜分割数据集(即 Automated Cardiac Diagnosis Challenge(ACDC)2017 数据集和 Sunnybrook 2009 数据集)上的表现优于以前的强化学习方法,并与深度学习基线方法的性能相当。此外,当用非常有限的样本进行训练时,与深度学习方法相比,该模型具有更高的准确性。