Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan.
Clinical Effectiveness Research Group, University of Oslo, Gaustad Sykehus, Bygg 20, Sognsvannsveien 21, Oslo, 0372, Norway.
Int J Comput Assist Radiol Surg. 2021 Jun;16(6):989-1001. doi: 10.1007/s11548-021-02398-x. Epub 2021 May 17.
A three-dimensional (3D) structure extraction technique viewed from a two-dimensional image is essential for the development of a computer-aided diagnosis (CAD) system for colonoscopy. However, a straightforward application of existing depth-estimation methods to colonoscopic images is impossible or inappropriate due to several limitations of colonoscopes. In particular, the absence of ground-truth depth for colonoscopic images hinders the application of supervised machine learning methods. To circumvent these difficulties, we developed an unsupervised and accurate depth-estimation method.
We propose a novel unsupervised depth-estimation method by introducing a Lambertian-reflection model as an auxiliary task to domain translation between real and virtual colonoscopic images. This auxiliary task contributes to accurate depth estimation by maintaining the Lambertian-reflection assumption. In our experiments, we qualitatively evaluate the proposed method by comparing it with state-of-the-art unsupervised methods. Furthermore, we present two quantitative evaluations of the proposed method using a measuring device, as well as a new 3D reconstruction technique and measured polyp sizes.
Our proposed method achieved accurate depth estimation with an average estimation error of less than 1 mm for regions close to the colonoscope in both of two types of quantitative evaluations. Qualitative evaluation showed that the introduced auxiliary task reduces the effects of specular reflections and colon wall textures on depth estimation and our proposed method achieved smooth depth estimation without noise, thus validating the proposed method.
We developed an accurate depth-estimation method with a new type of unsupervised domain translation with the auxiliary task. This method is useful for analysis of colonoscopic images and for the development of a CAD system since it can extract accurate 3D information.
从二维图像中提取三维(3D)结构对于开发结肠镜检查的计算机辅助诊断(CAD)系统至关重要。然而,由于结肠镜的一些局限性,直接将现有的深度估计方法应用于结肠镜图像是不可能或不合适的。特别是,由于缺乏结肠镜图像的真实深度,这阻碍了监督机器学习方法的应用。为了解决这些困难,我们开发了一种无监督且准确的深度估计方法。
我们通过引入朗伯反射模型作为真实和虚拟结肠镜图像之间域转换的辅助任务,提出了一种新颖的无监督深度估计方法。该辅助任务通过保持朗伯反射假设有助于准确的深度估计。在我们的实验中,我们通过与最先进的无监督方法进行比较,对所提出的方法进行了定性评估。此外,我们使用测量设备以及新的 3D 重建技术和测量的息肉大小对所提出的方法进行了两种定量评估。
我们提出的方法在两种定量评估中,对于接近结肠镜的区域,平均估计误差都小于 1 毫米,实现了准确的深度估计。定性评估表明,引入的辅助任务减少了镜面反射和结肠壁纹理对深度估计的影响,并且我们提出的方法实现了没有噪声的平滑深度估计,从而验证了该方法的有效性。
我们开发了一种具有新类型的无监督域转换和辅助任务的准确深度估计方法。由于该方法可以提取准确的 3D 信息,因此对于结肠镜图像的分析和 CAD 系统的开发非常有用。