• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种具有多光谱注意力机制的图像立体匹配算法

An Image Stereo Matching Algorithm with Multi-Spectral Attention Mechanism.

作者信息

Quan Zhenhua, Wu Bin, Luo Liang

机构信息

Institute of Electronic Engineering, China Academy of Engineering Physics, Mianyang 621900, China.

School of Information Engineering, Southwest University of Science and Technology, Mianyang 621000, China.

出版信息

Sensors (Basel). 2023 Sep 29;23(19):8179. doi: 10.3390/s23198179.

DOI:10.3390/s23198179
PMID:37837009
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10574877/
Abstract

With the advancement of artificial intelligence technology and computer hardware, the stereo matching algorithm has been widely researched and applied in the field of image processing. In scenarios such as robot navigation and autonomous driving, stereo matching algorithms are used to assist robots in acquiring depth information about the surrounding environment, thereby improving the robot's ability for autonomous navigation during self-driving. In this paper, we address the issue of low matching accuracy of stereo matching algorithms in specular regions of images and propose a multi-attention-based stereo matching algorithm called MANet. The proposed algorithm embeds a multi-spectral attention module into the residual feature-extraction network of the PSMNet algorithm. It utilizes different 2D discrete cosine transforms to extract frequency-specific feature information, providing rich and effective features for cost computation in matching. The pyramid pooling module incorporates a coordinated attention mechanism, which not only maintains long-range dependencies with directional awareness but also captures more positional information during the pooling process, thereby enhancing the network's representational capacity. The MANet algorithm was evaluated on three major benchmark datasets, namely, SceneFlow, KITTI2015, and KITTI2012, and compared with relevant algorithms. Experimental results demonstrated that the MANet algorithm achieved higher accuracy in predicting disparities and exhibited stronger robustness against specular reflections, enabling more accurate disparity prediction in specular regions.

摘要

随着人工智能技术和计算机硬件的发展,立体匹配算法在图像处理领域得到了广泛的研究和应用。在机器人导航和自动驾驶等场景中,立体匹配算法用于帮助机器人获取周围环境的深度信息,从而提高机器人在自动驾驶过程中的自主导航能力。在本文中,我们解决了立体匹配算法在图像镜面区域匹配精度较低的问题,并提出了一种基于多注意力的立体匹配算法MANet。该算法将多光谱注意力模块嵌入到PSMNet算法的残差特征提取网络中。它利用不同的二维离散余弦变换来提取特定频率的特征信息,为匹配中的代价计算提供丰富而有效的特征。金字塔池化模块采用了协同注意力机制,不仅能保持具有方向感知的长距离依赖性,还能在池化过程中捕获更多的位置信息,从而增强网络的表征能力。MANet算法在SceneFlow、KITTI2015和KITTI2012这三个主要基准数据集上进行了评估,并与相关算法进行了比较。实验结果表明,MANet算法在预测视差方面具有更高的精度,并且对镜面反射表现出更强的鲁棒性,能够在镜面区域进行更准确的视差预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/55cd730586b9/sensors-23-08179-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/7abe3546c4b7/sensors-23-08179-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/2d3403f98af6/sensors-23-08179-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/4e96bc39e8e7/sensors-23-08179-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/f54727bfd759/sensors-23-08179-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/56dc47ff4de2/sensors-23-08179-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/bc2ba51775fb/sensors-23-08179-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/638805dd7207/sensors-23-08179-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/9b0722de3576/sensors-23-08179-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/cac312335354/sensors-23-08179-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/c2738ba9217e/sensors-23-08179-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/55cd730586b9/sensors-23-08179-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/7abe3546c4b7/sensors-23-08179-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/2d3403f98af6/sensors-23-08179-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/4e96bc39e8e7/sensors-23-08179-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/f54727bfd759/sensors-23-08179-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/56dc47ff4de2/sensors-23-08179-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/bc2ba51775fb/sensors-23-08179-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/638805dd7207/sensors-23-08179-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/9b0722de3576/sensors-23-08179-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/cac312335354/sensors-23-08179-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/c2738ba9217e/sensors-23-08179-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b63f/10574877/55cd730586b9/sensors-23-08179-g011.jpg

相似文献

1
An Image Stereo Matching Algorithm with Multi-Spectral Attention Mechanism.一种具有多光谱注意力机制的图像立体匹配算法
Sensors (Basel). 2023 Sep 29;23(19):8179. doi: 10.3390/s23198179.
2
A Fast Stereo Matching Network with Multi-Cross Attention.一种具有多交叉注意力的快速立体匹配网络。
Sensors (Basel). 2021 Sep 8;21(18):6016. doi: 10.3390/s21186016.
3
A stereo matching algorithm based on the improved PSMNet.基于改进的 PSMNet 的立体匹配算法。
PLoS One. 2021 Aug 19;16(8):e0251657. doi: 10.1371/journal.pone.0251657. eCollection 2021.
4
An application of stereo matching algorithm based on transfer learning on robots in multiple scenes.基于迁移学习的立体匹配算法在多场景机器人中的应用。
Sci Rep. 2023 Aug 6;13(1):12739. doi: 10.1038/s41598-023-39964-z.
5
Parallax attention stereo matching network based on the improved group-wise correlation stereo network.基于改进的分组相关立体网络的视差注意力立体匹配网络。
PLoS One. 2022 Feb 9;17(2):e0263735. doi: 10.1371/journal.pone.0263735. eCollection 2022.
6
Deep Stereo Matching With Hysteresis Attention and Supervised Cost Volume Construction.基于迟滞注意力和监督代价体构建的深度立体匹配
IEEE Trans Image Process. 2022;31:812-822. doi: 10.1109/TIP.2021.3135485. Epub 2022 Jan 4.
7
LANet: Stereo matching network based on linear-attention mechanism for depth estimation optimization in 3D reconstruction of inter-forest scene.LANet:基于线性注意力机制的立体匹配网络,用于森林间场景三维重建中的深度估计优化
Front Plant Sci. 2022 Sep 2;13:978564. doi: 10.3389/fpls.2022.978564. eCollection 2022.
8
An end-to-end stereo matching algorithm based on improved convolutional neural network.基于改进卷积神经网络的端到端立体匹配算法。
Math Biosci Eng. 2020 Nov 6;17(6):7787-7803. doi: 10.3934/mbe.2020396.
9
Dual Guided Aggregation Network for Stereo Image Matching.双通道引导聚合网络的立体图像匹配。
Sensors (Basel). 2022 Aug 16;22(16):6111. doi: 10.3390/s22166111.
10
Adaptive Aggregate Stereo Matching Network with Depth Map Super-Resolution.自适应聚合立体匹配网络与深度图超分辨率。
Sensors (Basel). 2022 Jun 16;22(12):4548. doi: 10.3390/s22124548.

引用本文的文献

1
Iterative Network for Disparity Prediction with Infrared and Visible Light Images Based on Common Features.基于共同特征的红外与可见光图像视差预测迭代网络
Sensors (Basel). 2023 Dec 28;24(1):196. doi: 10.3390/s24010196.

本文引用的文献

1
Accurate and Efficient Stereo Matching via Attention Concatenation Volume.通过注意力拼接体实现准确高效的立体匹配
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2461-2474. doi: 10.1109/TPAMI.2023.3335480. Epub 2024 Mar 6.