• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于迁移学习的立体匹配算法在多场景机器人中的应用。

An application of stereo matching algorithm based on transfer learning on robots in multiple scenes.

作者信息

Bi Yuanwei, Li Chuanbiao, Tong Xiangrong, Wang Guohui, Sun Haiwei

机构信息

School of Computer Control and Engineering, Yantai University, Yantai, 264005, China.

出版信息

Sci Rep. 2023 Aug 6;13(1):12739. doi: 10.1038/s41598-023-39964-z.

DOI:10.1038/s41598-023-39964-z
PMID:37544958
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10404586/
Abstract

Robot vision technology based on binocular vision holds tremendous potential for development in various fields, including 3D scene reconstruction, target detection, and autonomous driving. However, current binocular vision methods used in robotics engineering have limitations such as high costs, complex algorithms, and low reliability of the generated disparity map in different scenes. To overcome these challenges, a cross-domain stereo matching algorithm for binocular vision based on transfer learning was proposed in this paper, named Cross-Domain Adaptation and Transfer Learning Network (Ct-Net), which has shown valuable results in multiple robot scenes. First, this paper introduces a General Feature Extractor to extract rich general feature information for domain adaptive stereo matching tasks. Then, a feature adapter is used to adapt the general features to the stereo matching network. Furthermore, a Domain Adaptive Cost Optimization Module is designed to optimize the matching cost. A disparity score prediction module was also embedded to adaptively adjust the search range of disparity and optimize the cost distribution. The overall framework was trained using a phased strategy, and ablation experiments were conducted to verify the effectiveness of the training strategy. Compared with the prototype PSMNet, on KITTI 2015 benchmark, the 3PE-fg of Ct-Net in all regions and non-occluded regions decreased by 19.3 and 21.1% respectively, meanwhile, on the Middlebury dataset, the proposed algorithm improves the sample error rate at least 28.4%, which is the Staircase sample. The quantitative and qualitative results obtained from Middlebury, Apollo, and other datasets demonstrate that Ct-Net significantly improves the cross-domain performance of stereo matching. Stereo matching experiments in real-world scenes have shown that it can effectively address visual tasks in multiple scenes.

摘要

基于双目视觉的机器人视觉技术在包括三维场景重建、目标检测和自动驾驶等各个领域都具有巨大的发展潜力。然而,目前机器人工程中使用的双目视觉方法存在一些局限性,如成本高、算法复杂以及在不同场景中生成的视差图可靠性低等问题。为了克服这些挑战,本文提出了一种基于迁移学习的双目视觉跨域立体匹配算法,名为跨域自适应与迁移学习网络(Ct-Net),该算法在多个机器人场景中取得了有价值的成果。首先,本文引入了一个通用特征提取器,用于提取丰富的通用特征信息,以进行域自适应立体匹配任务。然后,使用一个特征适配器将通用特征适配到立体匹配网络。此外,设计了一个域自适应成本优化模块来优化匹配成本。还嵌入了一个视差分数预测模块,以自适应地调整视差搜索范围并优化成本分布。整体框架采用分阶段策略进行训练,并进行了消融实验以验证训练策略的有效性。与原型PSMNet相比,在KITTI 2015基准测试中,Ct-Net在所有区域和非遮挡区域的3PE-fg分别下降了19.3%和21.1%,同时,在Middlebury数据集中,所提出的算法将样本错误率至少提高了28.4%,即阶梯样本。从中立数据集、阿波罗数据集和其他数据集中获得的定量和定性结果表明,Ct-Net显著提高了立体匹配的跨域性能。在真实场景中的立体匹配实验表明,它可以有效地解决多个场景中的视觉任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/20e72e3dfb74/41598_2023_39964_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/3481dfad178d/41598_2023_39964_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/d8b0f6b163df/41598_2023_39964_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/73c93d734a53/41598_2023_39964_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/5323bdf0a05e/41598_2023_39964_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/c2c1c1fe2886/41598_2023_39964_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/791eed890fb5/41598_2023_39964_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/745ece0da0bd/41598_2023_39964_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/3b87c1052790/41598_2023_39964_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/5a1247e9b1d0/41598_2023_39964_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/07b407b45eda/41598_2023_39964_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/d2dcfbb02ada/41598_2023_39964_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/20e72e3dfb74/41598_2023_39964_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/3481dfad178d/41598_2023_39964_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/d8b0f6b163df/41598_2023_39964_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/73c93d734a53/41598_2023_39964_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/5323bdf0a05e/41598_2023_39964_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/c2c1c1fe2886/41598_2023_39964_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/791eed890fb5/41598_2023_39964_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/745ece0da0bd/41598_2023_39964_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/3b87c1052790/41598_2023_39964_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/5a1247e9b1d0/41598_2023_39964_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/07b407b45eda/41598_2023_39964_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/d2dcfbb02ada/41598_2023_39964_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/351a/10404586/20e72e3dfb74/41598_2023_39964_Fig12_HTML.jpg

相似文献

1
An application of stereo matching algorithm based on transfer learning on robots in multiple scenes.基于迁移学习的立体匹配算法在多场景机器人中的应用。
Sci Rep. 2023 Aug 6;13(1):12739. doi: 10.1038/s41598-023-39964-z.
2
A Miniature Binocular Endoscope with Local Feature Matching and Stereo Matching for 3D Measurement and 3D Reconstruction.一种带有局部特征匹配和立体匹配的微型双目内窥镜,用于三维测量和三维重建。
Sensors (Basel). 2018 Jul 12;18(7):2243. doi: 10.3390/s18072243.
3
A stereo matching algorithm based on the improved PSMNet.基于改进的 PSMNet 的立体匹配算法。
PLoS One. 2021 Aug 19;16(8):e0251657. doi: 10.1371/journal.pone.0251657. eCollection 2021.
4
An Image Stereo Matching Algorithm with Multi-Spectral Attention Mechanism.一种具有多光谱注意力机制的图像立体匹配算法
Sensors (Basel). 2023 Sep 29;23(19):8179. doi: 10.3390/s23198179.
5
Binocular stereo matching of real scenes based on a convolutional neural network and computer graphics.基于卷积神经网络和计算机图形学的真实场景双目立体匹配
Opt Express. 2021 Aug 16;29(17):26876-26893. doi: 10.1364/OE.433247.
6
An end-to-end stereo matching algorithm based on improved convolutional neural network.基于改进卷积神经网络的端到端立体匹配算法。
Math Biosci Eng. 2020 Nov 6;17(6):7787-7803. doi: 10.3934/mbe.2020396.
7
Parallax attention stereo matching network based on the improved group-wise correlation stereo network.基于改进的分组相关立体网络的视差注意力立体匹配网络。
PLoS One. 2022 Feb 9;17(2):e0263735. doi: 10.1371/journal.pone.0263735. eCollection 2022.
8
LANet: Stereo matching network based on linear-attention mechanism for depth estimation optimization in 3D reconstruction of inter-forest scene.LANet:基于线性注意力机制的立体匹配网络,用于森林间场景三维重建中的深度估计优化
Front Plant Sci. 2022 Sep 2;13:978564. doi: 10.3389/fpls.2022.978564. eCollection 2022.
9
A Fast Stereo Matching Network with Multi-Cross Attention.一种具有多交叉注意力的快速立体匹配网络。
Sensors (Basel). 2021 Sep 8;21(18):6016. doi: 10.3390/s21186016.
10
A Robust Monocular and Binocular Visual Ranging Fusion Method Based on an Adaptive UKF.一种基于自适应无迹卡尔曼滤波器的鲁棒单目与双目视觉测距融合方法。
Sensors (Basel). 2024 Jun 27;24(13):4178. doi: 10.3390/s24134178.

引用本文的文献

1
Development and validation of a collaborative robotic platform based on monocular vision for oral surgery: an in vitro study.基于单目视觉的口腔手术协作机器人平台的开发与验证:一项体外研究。
Int J Comput Assist Radiol Surg. 2024 Sep;19(9):1797-1808. doi: 10.1007/s11548-024-03161-8. Epub 2024 Jun 1.
2
A dense matching method for remote sensing images fused with CPS denoising.一种融合CPS去噪的遥感图像密集匹配方法。
Sci Rep. 2024 Apr 23;14(1):9309. doi: 10.1038/s41598-024-59980-x.

本文引用的文献

1
Agricultural Robot-Centered Recognition of Early-Developmental Pest Stage Based on Deep Learning: A Case Study on Fall Armyworm ().基于深度学习的农业机器人早期发育期害虫识别:以草地贪夜蛾为例。
Sensors (Basel). 2023 Mar 15;23(6):3147. doi: 10.3390/s23063147.
2
Efficient Multi-Scale Stereo-Matching Network Using Adaptive Cost Volume Filtering.基于自适应代价体滤波的高效多尺度立体匹配网络
Sensors (Basel). 2022 Jul 23;22(15):5500. doi: 10.3390/s22155500.
3
Rethinking Training Strategy in Stereo Matching.重新思考立体匹配中的训练策略
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):7796-7809. doi: 10.1109/TNNLS.2022.3146306. Epub 2023 Oct 5.
4
Learning Depth with Convolutional Spatial Propagation Network.基于卷积空间传播网络的深度学习
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2361-2379. doi: 10.1109/TPAMI.2019.2947374. Epub 2019 Oct 15.
5
The ApolloScape Open Dataset for Autonomous Driving and Its Application.阿波罗景观开放数据集在自动驾驶中的应用
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2702-2719. doi: 10.1109/TPAMI.2019.2926463. Epub 2019 Jul 2.