• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过注意力拼接体实现准确高效的立体匹配

Accurate and Efficient Stereo Matching via Attention Concatenation Volume.

作者信息

Xu Gangwei, Wang Yun, Cheng Junda, Tang Jinhui, Yang Xin

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2461-2474. doi: 10.1109/TPAMI.2023.3335480. Epub 2024 Mar 6.

DOI:10.1109/TPAMI.2023.3335480
PMID:38015702
Abstract

Stereo matching is a fundamental building block for many vision and robotics applications. An informative and concise cost volume representation is vital for stereo matching of high accuracy and efficiency. In this article, we present a novel cost volume construction method, named attention concatenation volume (ACV), which generates attention weights from correlation clues to suppress redundant information and enhance matching-related information in the concatenation volume. The ACV can be seamlessly embedded into most stereo matching networks, the resulting networks can use a more lightweight aggregation network and meanwhile achieve higher accuracy. We further design a fast version of ACV to enable real-time performance, named Fast-ACV, which generates high likelihood disparity hypotheses and the corresponding attention weights from low-resolution correlation clues to significantly reduce computational and memory cost and meanwhile maintain a satisfactory accuracy. Furthermore, we design a highly accurate network ACVNet and a real-time network Fast-ACVNet based on our ACV and Fast-ACV respectively, which achieve state-of-the-art performance on several benchmarks.

摘要

立体匹配是许多视觉和机器人应用的基本构建模块。一个信息丰富且简洁的代价体表示对于高精度和高效率的立体匹配至关重要。在本文中,我们提出了一种新颖的代价体构建方法,称为注意力拼接体(ACV),它从相关线索中生成注意力权重,以抑制冗余信息并增强拼接体中与匹配相关的信息。ACV可以无缝嵌入到大多数立体匹配网络中,由此产生的网络可以使用更轻量级的聚合网络,同时实现更高的精度。我们进一步设计了ACV的快速版本以实现实时性能,称为Fast-ACV,它从低分辨率相关线索中生成高似然性视差假设和相应的注意力权重,以显著降低计算和内存成本,同时保持令人满意的精度。此外,我们分别基于ACV和Fast-ACV设计了一个高精度网络ACVNet和一个实时网络Fast-ACVNet,它们在多个基准测试中实现了领先的性能。

相似文献

1
Accurate and Efficient Stereo Matching via Attention Concatenation Volume.通过注意力拼接体实现准确高效的立体匹配
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2461-2474. doi: 10.1109/TPAMI.2023.3335480. Epub 2024 Mar 6.
2
Efficient Multi-Scale Stereo-Matching Network Using Adaptive Cost Volume Filtering.基于自适应代价体滤波的高效多尺度立体匹配网络
Sensors (Basel). 2022 Jul 23;22(15):5500. doi: 10.3390/s22155500.
3
Accuracy and efficiency stereo matching network with adaptive feature modulation.具有自适应特征调制的精确高效立体匹配网络
PLoS One. 2024 Apr 25;19(4):e0301093. doi: 10.1371/journal.pone.0301093. eCollection 2024.
4
A Fast Stereo Matching Network with Multi-Cross Attention.一种具有多交叉注意力的快速立体匹配网络。
Sensors (Basel). 2021 Sep 8;21(18):6016. doi: 10.3390/s21186016.
5
A Joint 2D-3D Complementary Network for Stereo Matching.一种用于立体匹配的二维-三维互补网络。
Sensors (Basel). 2021 Feb 18;21(4):1430. doi: 10.3390/s21041430.
6
An efficient and accurate multi-level cascaded recurrent network for stereo matching.一种用于立体匹配的高效且准确的多级级联循环网络。
Sci Rep. 2024 Apr 8;14(1):8148. doi: 10.1038/s41598-024-57321-6.
7
Deep Stereo Matching With Hysteresis Attention and Supervised Cost Volume Construction.基于迟滞注意力和监督代价体构建的深度立体匹配
IEEE Trans Image Process. 2022;31:812-822. doi: 10.1109/TIP.2021.3135485. Epub 2022 Jan 4.
8
Single-shot 3D shape measurement using an end-to-end stereo matching network for speckle projection profilometry.使用端到端立体匹配网络进行散斑投影轮廓测量的单次3D形状测量。
Opt Express. 2021 Apr 26;29(9):13388-13407. doi: 10.1364/OE.418881.
9
Optimizing 3D Convolution Kernels on Stereo Matching for Resource Efficient Computations.优化立体匹配中的 3D 卷积核以实现资源高效计算。
Sensors (Basel). 2021 Oct 13;21(20):6808. doi: 10.3390/s21206808.
10
Unified Confidence Estimation Networks for Robust Stereo Matching.用于稳健立体匹配的统一置信估计网络。
IEEE Trans Image Process. 2019 Mar;28(3):1299-1313. doi: 10.1109/TIP.2018.2878325. Epub 2018 Oct 26.

引用本文的文献

1
ODTFormer: Efficient Obstacle Detection and Tracking with Stereo Cameras Based on Transformer.ODTFormer:基于Transformer的立体相机高效障碍物检测与跟踪
Rep U S. 2024 Oct;2024:9721-9728. doi: 10.1109/iros58592.2024.10802476. Epub 2024 Dec 25.
2
Advances in Real-Time 3D Reconstruction for Medical Endoscopy.医学内窥镜实时三维重建的进展
J Imaging. 2024 May 14;10(5):120. doi: 10.3390/jimaging10050120.
3
An Image Stereo Matching Algorithm with Multi-Spectral Attention Mechanism.一种具有多光谱注意力机制的图像立体匹配算法
Sensors (Basel). 2023 Sep 29;23(19):8179. doi: 10.3390/s23198179.
4
Efficient Stereo Depth Estimation for Pseudo-LiDAR: A Self-Supervised Approach Based on Multi-Input ResNet Encoder.基于多输入 ResNet 编码器的伪激光雷达高效立体深度估计:一种自监督方法。
Sensors (Basel). 2023 Feb 2;23(3):1650. doi: 10.3390/s23031650.