• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过高效注意力桥接融合实现极化驱动的语义分割

Polarization-driven semantic segmentation via efficient attention-bridged fusion.

作者信息

Xiang Kaite, Yang Kailun, Wang Kaiwei

出版信息

Opt Express. 2021 Feb 15;29(4):4802-4820. doi: 10.1364/OE.416130.

DOI:10.1364/OE.416130
PMID:33726028
Abstract

Semantic segmentation (SS) is promising for outdoor scene perception in safety-critical applications like autonomous vehicles, assisted navigation and so on. However, traditional SS is primarily based on RGB images, which limits the reliability of SS in complex outdoor scenes, where RGB images lack necessary information dimensions to fully perceive unconstrained environments. As a preliminary investigation, we examine SS in an unexpected obstacle detection scenario, which demonstrates the necessity of multimodal fusion. Thereby, in this work, we present EAFNet, an Efficient Attention-bridged Fusion Network, to exploit complementary information coming from different optical sensors. Specifically, we incorporate polarization sensing to obtain supplementary information, considering its optical characteristics for robust representation of diverse materials. By using a single-shot polarization sensor, we build the first RGB-P dataset which consists of 394 annotated pixel-aligned RGB-polarization images. A comprehensive variety of experiments shows the effectiveness of EAFNet to fuse polarization and RGB information, as well as its flexibility to be adapted to other sensor combination scenarios.

摘要

语义分割(SS)在诸如自动驾驶车辆、辅助导航等安全关键应用中的户外场景感知方面具有广阔前景。然而,传统的语义分割主要基于RGB图像,这限制了其在复杂户外场景中的可靠性,因为在这些场景中RGB图像缺乏充分感知无约束环境所需的必要信息维度。作为一项初步研究,我们在意外障碍物检测场景中研究语义分割,这证明了多模态融合的必要性。因此,在这项工作中,我们提出了EAFNet,一种高效的注意力桥接融合网络,以利用来自不同光学传感器的互补信息。具体而言,考虑到偏振传感对不同材料的鲁棒表示的光学特性,我们将其纳入以获取补充信息。通过使用单镜头偏振传感器,我们构建了第一个RGB-P数据集,该数据集由394张带注释的像素对齐RGB-偏振图像组成。各种各样的实验表明了EAFNet融合偏振和RGB信息的有效性,以及其适应其他传感器组合场景的灵活性。

相似文献

1
Polarization-driven semantic segmentation via efficient attention-bridged fusion.通过高效注意力桥接融合实现极化驱动的语义分割
Opt Express. 2021 Feb 15;29(4):4802-4820. doi: 10.1364/OE.416130.
2
EPMF: Efficient Perception-Aware Multi-Sensor Fusion for 3D Semantic Segmentation.EPMF:用于3D语义分割的高效感知感知多传感器融合
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8258-8273. doi: 10.1109/TPAMI.2024.3402232. Epub 2024 Nov 6.
3
MixImages: An Urban Perception AI Method Based on Polarization Multimodalities.MixImages:一种基于偏振多模态的城市感知人工智能方法。
Sensors (Basel). 2024 Jul 28;24(15):4893. doi: 10.3390/s24154893.
4
A Preliminary Study of Deep Learning Sensor Fusion for Pedestrian Detection.深度学习传感器融合在行人检测中的初步研究。
Sensors (Basel). 2023 Apr 21;23(8):4167. doi: 10.3390/s23084167.
5
MAFFNet: real-time multi-level attention feature fusion network with RGB-D semantic segmentation for autonomous driving.MAFFNet:用于自动驾驶的具有RGB-D语义分割的实时多级注意力特征融合网络
Appl Opt. 2022 Mar 20;61(9):2219-2229. doi: 10.1364/AO.449589.
6
Mitigating Modality Discrepancies for RGB-T Semantic Segmentation.减轻RGB-T语义分割中的模态差异
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9380-9394. doi: 10.1109/TNNLS.2022.3233089. Epub 2024 Jul 8.
7
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
8
GMNet: Graded-Feature Multilabel-Learning Network for RGB-Thermal Urban Scene Semantic Segmentation.GMNet:用于RGB-热红外城市场景语义分割的分级特征多标签学习网络
IEEE Trans Image Process. 2021;30:7790-7802. doi: 10.1109/TIP.2021.3109518. Epub 2021 Sep 14.
9
Implementation of a Lightweight Semantic Segmentation Algorithm in Road Obstacle Detection.在道路障碍物检测中实现轻量级语义分割算法。
Sensors (Basel). 2020 Dec 10;20(24):7089. doi: 10.3390/s20247089.
10
Simple Scalable Multimodal Semantic Segmentation Model.简单可扩展的多模态语义分割模型
Sensors (Basel). 2024 Jan 22;24(2):0. doi: 10.3390/s24020699.

引用本文的文献

1
Polarization of road target detection under complex weather conditions.复杂天气条件下道路目标检测的极化
Sci Rep. 2024 Dec 5;14(1):30348. doi: 10.1038/s41598-024-80830-3.
2
MixImages: An Urban Perception AI Method Based on Polarization Multimodalities.MixImages:一种基于偏振多模态的城市感知人工智能方法。
Sensors (Basel). 2024 Jul 28;24(15):4893. doi: 10.3390/s24154893.
3
Polarimetric Imaging for Robot Perception: A Review.用于机器人感知的偏振成像:综述
Sensors (Basel). 2024 Jul 9;24(14):4440. doi: 10.3390/s24144440.
4
Passive Polarized Vision for Autonomous Vehicles: A Review.用于自动驾驶车辆的被动偏振视觉:综述
Sensors (Basel). 2024 May 22;24(11):3312. doi: 10.3390/s24113312.
5
Semantic Guidance Fusion Network for Cross-Modal Semantic Segmentation.用于跨模态语义分割的语义引导融合网络
Sensors (Basel). 2024 Apr 12;24(8):2473. doi: 10.3390/s24082473.
6
Simple Scalable Multimodal Semantic Segmentation Model.简单可扩展的多模态语义分割模型
Sensors (Basel). 2024 Jan 22;24(2):0. doi: 10.3390/s24020699.
7
Experimental Study on Bottom-Up Detection of Underwater Targets Based on Polarization Imaging.基于偏振成像的水下目标自下而上检测的实验研究。
Sensors (Basel). 2022 Apr 7;22(8):2827. doi: 10.3390/s22082827.
8
How deep learning is empowering semantic segmentation: Traditional and deep learning techniques for semantic segmentation: A comparison.深度学习如何助力语义分割:语义分割的传统技术与深度学习技术比较
Multimed Tools Appl. 2022;81(21):30519-30544. doi: 10.1007/s11042-022-12821-3. Epub 2022 Apr 6.
9
Unifying Obstacle Detection, Recognition, and Fusion Based on the Polarization Color Stereo Camera and LiDAR for the ADAS.基于极化彩色立体相机和 LiDAR 的 ADAS 障碍物检测、识别和融合
Sensors (Basel). 2022 Mar 23;22(7):2453. doi: 10.3390/s22072453.