• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ELNet:一种用于全景深度估计的高效轻量级网络。

ELNet: An Efficient and Effective Lightweight Network for Panoramic Depth Estimation.

作者信息

Xu Jiayue, Zhao Jianping, Li Hua, Han Cheng, Xu Chao

机构信息

School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China.

出版信息

Sensors (Basel). 2023 Nov 16;23(22):9218. doi: 10.3390/s23229218.

DOI:10.3390/s23229218
PMID:38005604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10675273/
Abstract

Monocular panoramic depth estimation has various applications in robotics and autonomous driving due to its ability to perceive the entire field of view. However, panoramic depth estimation faces two significant challenges: global context capturing and distortion awareness. In this paper, we propose a new framework for panoramic depth estimation that can simultaneously address panoramic distortion and extract global context information, thereby improving the performance of panoramic depth estimation. Specifically, we introduce an attention mechanism into the multi-scale dilated convolution and adaptively adjust the receptive field size between different spatial positions, designing the adaptive attention dilated convolution module, which effectively perceives distortion. At the same time, we design the global scene understanding module to integrate global context information into the feature maps generated using the feature extractor. Finally, we trained and evaluated our model on three benchmark datasets which contains the virtual and real-world RGB-D panorama datasets. The experimental results show that the proposed method achieves competitive performance, comparable to existing techniques in both quantitative and qualitative evaluations. Furthermore, our method has fewer parameters and more flexibility, making it a scalable solution in mobile AR.

摘要

单目全景深度估计因其能够感知整个视野而在机器人技术和自动驾驶中有着广泛应用。然而,全景深度估计面临两个重大挑战:全局上下文捕捉和畸变感知。在本文中,我们提出了一种用于全景深度估计的新框架,该框架能够同时解决全景畸变问题并提取全局上下文信息,从而提高全景深度估计的性能。具体而言,我们将注意力机制引入多尺度扩张卷积,并在不同空间位置之间自适应调整感受野大小,设计了自适应注意力扩张卷积模块,该模块能有效感知畸变。同时,我们设计了全局场景理解模块,将全局上下文信息整合到使用特征提取器生成的特征图中。最后,我们在三个包含虚拟和真实世界RGB-D全景数据集的基准数据集上对我们的模型进行了训练和评估。实验结果表明,所提出的方法在定量和定性评估中均取得了具有竞争力的性能,与现有技术相当。此外,我们的方法参数更少且更具灵活性,使其成为移动增强现实中一种可扩展的解决方案。

相似文献

1
ELNet: An Efficient and Effective Lightweight Network for Panoramic Depth Estimation.ELNet:一种用于全景深度估计的高效轻量级网络。
Sensors (Basel). 2023 Nov 16;23(22):9218. doi: 10.3390/s23229218.
2
Lightweight monocular depth estimation using a fusion-improved transformer.使用融合改进型变压器的轻量级单目深度估计
Sci Rep. 2024 Sep 28;14(1):22472. doi: 10.1038/s41598-024-72682-8.
3
SPDET: Edge-Aware Self-Supervised Panoramic Depth Estimation Transformer With Spherical Geometry.SPDET:具有球面几何的边缘感知自监督全景深度估计变换器
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12474-12489. doi: 10.1109/TPAMI.2023.3272949. Epub 2023 Sep 5.
4
GLPanoDepth: Global-to-Local Panoramic Depth Estimation.GLPanoDepth:全局到局部的全景深度估计
IEEE Trans Image Process. 2024;33:2936-2949. doi: 10.1109/TIP.2024.3386403. Epub 2024 Apr 22.
5
MSDCNN: A multiscale dilated convolution neural network for fine-grained 3D shape classification.MSDCNN:一种用于细粒度 3D 形状分类的多尺度扩张卷积神经网络。
Neural Netw. 2024 Apr;172:106141. doi: 10.1016/j.neunet.2024.106141. Epub 2024 Jan 23.
6
DMCT-Net: dual modules convolution transformer network for head and neck tumor segmentation in PET/CT.DMCT-Net:用于 PET/CT 中头颈部肿瘤分割的双模块卷积变换网络。
Phys Med Biol. 2023 May 22;68(11). doi: 10.1088/1361-6560/acd29f.
7
Monocular catadioptric panoramic depth estimation via caustics-based virtual scene transition.基于焦散的虚拟场景转换的单目折反射全景深度估计
J Opt Soc Am A Opt Image Sci Vis. 2016 Sep 1;33(9):1872-9. doi: 10.1364/JOSAA.33.001872.
8
Coarse-to-fine prior-guided attention network for multi-structure segmentation on dental panoramic radiographs.基于粗到精先验引导注意力网络的口腔全景片多结构分割。
Phys Med Biol. 2023 Oct 26;68(21). doi: 10.1088/1361-6560/ad0218.
9
Locating and Counting Heads in Crowds With a Depth Prior.基于深度先验的人群中人头定位与计数。
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9056-9072. doi: 10.1109/TPAMI.2021.3124956. Epub 2022 Nov 7.
10
Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement.具有语义增强的多尺度空间注意力引导单目深度估计
IEEE Trans Image Process. 2021;30:8811-8822. doi: 10.1109/TIP.2021.3120670. Epub 2021 Oct 27.

本文引用的文献

1
GLPanoDepth: Global-to-Local Panoramic Depth Estimation.GLPanoDepth:全局到局部的全景深度估计
IEEE Trans Image Process. 2024;33:2936-2949. doi: 10.1109/TIP.2024.3386403. Epub 2024 Apr 22.
2
BiFuse++: Self-Supervised and Efficient Bi-Projection Fusion for 360° Depth Estimation.BiFuse++:用于 360° 深度估计的自监督高效双投影融合。
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5448-5460. doi: 10.1109/TPAMI.2022.3203516. Epub 2023 Apr 3.
3
Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement.
具有语义增强的多尺度空间注意力引导单目深度估计
IEEE Trans Image Process. 2021;30:8811-8822. doi: 10.1109/TIP.2021.3120670. Epub 2021 Oct 27.
4
Deep Ordinal Regression Network for Monocular Depth Estimation.用于单目深度估计的深度序数回归网络
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2018 Jun;2018:2002-2011. doi: 10.1109/CVPR.2018.00214. Epub 2018 Dec 17.
5
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.