Stereo Matching Using Multi-Level Cost Volume and Multi-Scale Feature Constancy

Authors

Liang Zhengfa, Guo Yulan, Feng Yiliu, Chen Wei, Qiao Linbo, Zhou Li, Zhang Jianfeng, Liu Hengzhu

Publication information

IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):300-315. doi: 10.1109/TPAMI.2019.2928550. Epub 2020 Dec 4.

DOI: 10.1109/TPAMI.2019.2928550
PMID: 31329107

Abstract

For CNN-based stereo matching methods, cost volumes play an important role in achieving good matching accuracy. In this paper, we present an end-to-end trainable convolutional neural network that makes full use of cost volumes for stereo matching. Our network consists of three sub-modules: shared feature extraction, initial disparity estimation, and disparity refinement. Cost volumes are calculated at multiple levels using the shared features, and are used in both the initial disparity estimation and the disparity refinement sub-modules. To improve the efficiency of disparity refinement, multi-scale feature constancy is introduced to measure the correctness of the initial disparity in feature space. The sub-modules of our network are tightly coupled, making it compact and easy to train. Moreover, we investigate the problem of developing a robust model that performs well across multiple datasets with different characteristics. We achieve this by introducing a two-stage finetuning scheme that gently transfers the model to the target datasets. Specifically, in the first stage, the model is finetuned using both a large synthetic dataset and the target datasets with a relatively large learning rate, while in the second stage the model is trained using only the target datasets with a small learning rate. The proposed method is tested on several benchmarks, including the Middlebury 2014, KITTI 2015, ETH3D 2017, and SceneFlow datasets. Experimental results show that our method achieves state-of-the-art performance on all of these datasets. The proposed method also won first prize in the stereo task of the Robust Vision Challenge 2018.
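
The abstract does not say how the cost volumes or the feature constancy term are computed, so the following NumPy sketch is only an illustration under stated assumptions: a correlation-style cost volume over horizontal shifts, and a feature constancy measure taken as the absolute difference between the left features and the right features warped by the initial disparity. All function names, the nearest-neighbour warping, and the `stride` convention are hypothetical and are not taken from the paper.

```python
import numpy as np


def correlation_cost_volume(feat_left, feat_right, max_disp):
    """Assumed correlation cost volume of shape (max_disp, H, W).

    feat_left and feat_right have shape (C, H, W) and come from the same
    (shared) feature extractor. Entry [d, y, x] is the channel-wise mean of
    feat_left[:, y, x] * feat_right[:, y, x - d]. Columns with x < d have no
    valid match and are left at zero.
    """
    _, h, w = feat_left.shape
    volume = np.zeros((max_disp, h, w), dtype=np.float64)
    for d in range(max_disp):
        product = feat_left[:, :, d:] * feat_right[:, :, : w - d]
        volume[d, :, d:] = product.mean(axis=0)
    return volume


def warp_right_to_left(feat_right, disparity):
    """Sample feat_right at column x - disparity[y, x].

    Nearest-neighbour sampling with clamping at the image border; a trained
    network would more likely use differentiable bilinear sampling.
    """
    _, h, w = feat_right.shape
    shift = np.rint(disparity).astype(int)              # (H, W)
    xs = np.clip(np.arange(w)[None, :] - shift, 0, w - 1)
    ys = np.arange(h)[:, None]                          # (H, 1), broadcasts with xs
    return feat_right[:, ys, xs]                        # (C, H, W)


def feature_constancy_error(feat_left, feat_right, disparity):
    """Per-pixel mean absolute feature difference after warping.

    Small values suggest the disparity is consistent with the features;
    large values flag pixels where refinement is needed.
    """
    warped = warp_right_to_left(feat_right, disparity)
    return np.abs(feat_left - warped).mean(axis=0)      # (H, W)


def multi_scale_constancy(pyramid, disparity):
    """Evaluate the constancy error at several feature scales.

    pyramid is a list of (feat_left, feat_right, stride) tuples, where the
    features are `stride` times smaller than the full-resolution disparity.
    The disparity is subsampled and divided by the stride to match.
    """
    errors = []
    for feat_left, feat_right, stride in pyramid:
        scaled = disparity[::stride, ::stride] / stride
        errors.append(feature_constancy_error(feat_left, feat_right, scaled))
    return errors
```

In this sketch, an initial disparity map could be read off the cost volume with `volume.argmax(axis=0)`, and the resulting error maps from `multi_scale_constancy` would then serve as additional inputs to a refinement stage. The explicit loop over disparities favours readability over speed.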

Similar articles

1
Stereo Matching Using Multi-Level Cost Volume and Multi-Scale Feature Constancy.
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):300-315. doi: 10.1109/TPAMI.2019.2928550. Epub 2020 Dec 4.
2
Deep Stereo Matching With Hysteresis Attention and Supervised Cost Volume Construction.
IEEE Trans Image Process. 2022;31:812-822. doi: 10.1109/TIP.2021.3135485. Epub 2022 Jan 4.
3
Efficient Multi-Scale Stereo-Matching Network Using Adaptive Cost Volume Filtering.
Sensors (Basel). 2022 Jul 23;22(15):5500. doi: 10.3390/s22155500.
4
A Fast Stereo Matching Network with Multi-Cross Attention.
Sensors (Basel). 2021 Sep 8;21(18):6016. doi: 10.3390/s21186016.
5
Parallax attention stereo matching network based on the improved group-wise correlation stereo network.
PLoS One. 2022 Feb 9;17(2):e0263735. doi: 10.1371/journal.pone.0263735. eCollection 2022.
6
An end-to-end stereo matching algorithm based on improved convolutional neural network.
Math Biosci Eng. 2020 Nov 6;17(6):7787-7803. doi: 10.3934/mbe.2020396.
7
Segment-Based Disparity Refinement With Occlusion Handling for Stereo Matching.
IEEE Trans Image Process. 2019 Aug;28(8):3885-3897. doi: 10.1109/TIP.2019.2903318. Epub 2019 Mar 6.
8
Rethinking Training Strategy in Stereo Matching.
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):7796-7809. doi: 10.1109/TNNLS.2022.3146306. Epub 2023 Oct 5.
9
Disparity refinement framework for learning-based stereo matching methods in cross-domain setting for laparoscopic images.
J Med Imaging (Bellingham). 2023 Jul;10(4):045001. doi: 10.1117/1.JMI.10.4.045001. Epub 2023 Jul 14.
10
A Disparity Refinement Framework for Learning-based Stereo Matching Methods in Cross-domain Setting for Laparoscopic Images.
Proc SPIE Int Soc Opt Eng. 2023 Feb;12466. doi: 10.1117/12.2654804. Epub 2023 Apr 3.

Cited by

1
Research on 3D virtual vision matching based on interactive color segmentation.
PeerJ Comput Sci. 2024 Jun 28;10:e2114. doi: 10.7717/peerj-cs.2114. eCollection 2024.
2
TransNet: Transformer-Based Point Cloud Sampling Network.
Sensors (Basel). 2023 May 11;23(10):4675. doi: 10.3390/s23104675.
3
Robust Cost Volume Generation Method for Dense Stereo Matching in Endoscopic Scenarios.
Sensors (Basel). 2023 Mar 24;23(7):3427. doi: 10.3390/s23073427.