Bardozzo Francesco, Collins Toby, Forgione Antonello, Hostettler Alexandre, Tagliaferri Roberto
DISA-MIS, University of Salerno, Fisciano 84084, Italy.
IRCAD France, Strasbourg, France; IRCAD Africa, Kigali, Rwanda.
Med Image Anal. 2022 Apr;77:102380. doi: 10.1016/j.media.2022.102380. Epub 2022 Jan 30.
Developing accurate and real-time algorithms for a non-invasive three-dimensional representation and reconstruction of internal patient structures is one of the main research fields in computer-assisted surgery and endoscopy. Mono and stereo endoscopic images of soft tissues are converted into a three-dimensional representation by the estimation of depth maps. However, automatic, detailed, accurate and robust depth map estimation is a challenging problem that, in the stereo setting, is strictly dependent on a robust estimate of the disparity map. Many traditional algorithms are often inefficient or not accurate. In this work, novel self-supervised stacked and Siamese encoder/decoder neural networks are proposed to compute accurate disparity maps for 3D laparoscopy depth estimation. These networks run in real-time on standard GPU-equipped desktop computers and the outputs may be used for depth map estimation using the a known camera calibration. We compare performance on three different public datasets and on a new challenging simulated dataset and our solutions outperform state-of-the-art mono and stereo depth estimation methods. Extensive robustness and sensitivity analyses on more than 30000 frames has been performed. This work leads to important improvements in mono and stereo real-time depth map estimation of soft tissues and organs with a very low average mean absolute disparity reconstruction error with respect to ground truth.
开发用于无创三维呈现和重建患者内部结构的精确实时算法是计算机辅助手术和内窥镜检查的主要研究领域之一。通过估计深度图,将软组织的单目和立体内窥镜图像转换为三维呈现。然而,自动、详细、准确且稳健的深度图估计是一个具有挑战性的问题,在立体设置中,它严格依赖于视差图的稳健估计。许多传统算法往往效率低下或不准确。在这项工作中,提出了新颖的自监督堆叠式和暹罗编码器/解码器神经网络,用于计算用于三维腹腔镜深度估计的精确视差图。这些网络在配备标准GPU的台式计算机上实时运行,其输出可用于使用已知相机校准进行深度图估计。我们在三个不同的公共数据集和一个新的具有挑战性的模拟数据集上比较了性能,我们的解决方案优于当前最先进的单目和立体深度估计方法。已对超过30000帧进行了广泛的稳健性和敏感性分析。这项工作在软组织和器官的单目和立体实时深度图估计方面带来了重要改进,相对于地面真值,平均绝对视差重建误差非常低。