State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, 37 Xueyuan Road, Haidian District, Beijing, 100191, China; Peng Cheng Lab, 2 Xingke 1st Street, Nanshan District, Shenzhen, Guangdong Province, 518000, China.
Peng Cheng Lab, 2 Xingke 1st Street, Nanshan District, Shenzhen, Guangdong Province, 518000, China.
Comput Biol Med. 2024 Jun;175:108546. doi: 10.1016/j.compbiomed.2024.108546. Epub 2024 Apr 30.
Three-dimensional reconstruction of images acquired through endoscopes is playing a vital role in an increasing number of medical applications. Endoscopes used in the clinic are commonly classified as monocular endoscopes and binocular endoscopes. We have reviewed the classification of methods for depth estimation according to the type of endoscope. Basically, depth estimation relies on feature matching of images and multi-view geometry theory. However, these traditional techniques have many problems in the endoscopic environment. With the increasing development of deep learning techniques, there is a growing number of works based on learning methods to address challenges such as inconsistent illumination and texture sparsity. We have reviewed over 170 papers published in the 10 years from 2013 to 2023. The commonly used public datasets and performance metrics are summarized. We also give a taxonomy of methods and analyze the advantages and drawbacks of algorithms. Summary tables and result atlas are listed to facilitate the comparison of qualitative and quantitative performance of different methods in each category. In addition, we summarize commonly used scene representation methods in endoscopy and speculate on the prospects of deep estimation research in medical applications. We also compare the robustness performance, processing time, and scene representation of the methods to facilitate doctors and researchers in selecting appropriate methods based on surgical applications.
内窥镜获取的图像的三维重建在越来越多的医学应用中发挥着重要作用。临床上使用的内窥镜通常分为单目内窥镜和双目内窥镜。我们根据内窥镜的类型回顾了深度估计方法的分类。基本上,深度估计依赖于图像的特征匹配和多视图几何理论。然而,这些传统技术在内窥镜环境中存在许多问题。随着深度学习技术的不断发展,越来越多的基于学习方法的工作开始解决光照不一致和纹理稀疏等挑战。我们回顾了 2013 年至 2023 年 10 年间发表的超过 170 篇论文。总结了常用的公共数据集和性能指标。我们还对方法进行了分类,并分析了算法的优缺点。列出了汇总表和结果图谱,以方便比较不同方法在每个类别中的定性和定量性能。此外,我们总结了内窥镜中常用的场景表示方法,并对内窥镜中深度估计研究的前景进行了推测。我们还比较了方法的鲁棒性性能、处理时间和场景表示,以方便医生和研究人员根据手术应用选择合适的方法。