• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于实时RGBD语义分割的空间信息引导卷积

Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation.

作者信息

Chen Lin-Zhuo, Lin Zheng, Wang Ziqin, Yang Yong-Liang, Cheng Ming-Ming

出版信息

IEEE Trans Image Process. 2021;30:2313-2324. doi: 10.1109/TIP.2021.3049332. Epub 2021 Jan 27.

DOI:10.1109/TIP.2021.3049332
PMID:33481707
Abstract

3D spatial information is known to be beneficial to the semantic segmentation task. Most existing methods take 3D spatial data as an additional input, leading to a two-stream segmentation network that processes RGB and 3D spatial information separately. This solution greatly increases the inference time and severely limits its scope for real-time applications. To solve this problem, we propose Spatial information guided Convolution (S-Conv), which allows efficient RGB feature and 3D spatial information integration. S-Conv is competent to infer the sampling offset of the convolution kernel guided by the 3D spatial information, helping the convolutional layer adjust the receptive field and adapt to geometric transformations. S-Conv also incorporates geometric information into the feature learning process by generating spatially adaptive convolutional weights. The capability of perceiving geometry is largely enhanced without much affecting the amount of parameters and computational cost. Based on S-Conv, we further design a semantic segmentation network, called Spatial information Guided convolutional Network (SGNet), resulting in real-time inference and state-of-the-art performance on NYUDv2 and SUNRGBD datasets.

摘要

已知3D空间信息对语义分割任务有益。大多数现有方法将3D空间数据作为额外输入,导致一个双流分割网络,该网络分别处理RGB和3D空间信息。这种解决方案大大增加了推理时间,并严重限制了其在实时应用中的范围。为了解决这个问题,我们提出了空间信息引导卷积(S-Conv),它允许高效地整合RGB特征和3D空间信息。S-Conv能够推断由3D空间信息引导的卷积核的采样偏移,帮助卷积层调整感受野并适应几何变换。S-Conv还通过生成空间自适应卷积权重将几何信息纳入特征学习过程。在不影响参数数量和计算成本的情况下,感知几何的能力得到了很大增强。基于S-Conv,我们进一步设计了一个语义分割网络,称为空间信息引导卷积网络(SGNet),在NYUDv2和SUNRGBD数据集上实现了实时推理和领先的性能。

相似文献

1
Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation.用于实时RGBD语义分割的空间信息引导卷积
IEEE Trans Image Process. 2021;30:2313-2324. doi: 10.1109/TIP.2021.3049332. Epub 2021 Jan 27.
2
OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.OBELISK-Net:稀疏可变形卷积解决三维多器官分割问题,所需层数更少。
Med Image Anal. 2019 May;54:1-9. doi: 10.1016/j.media.2019.02.006. Epub 2019 Feb 13.
3
A lightweight multi-dimension dynamic convolutional network for real-time semantic segmentation.一种用于实时语义分割的轻量级多维动态卷积网络。
Front Neurorobot. 2022 Dec 15;16:1075520. doi: 10.3389/fnbot.2022.1075520. eCollection 2022.
4
DO-Conv: Depthwise Over-Parameterized Convolutional Layer.深度可分离过参数化卷积层(DO-Conv)
IEEE Trans Image Process. 2022;31:3726-3736. doi: 10.1109/TIP.2022.3175432. Epub 2022 May 26.
5
SyS3DS: Systematic Sampling of Large-Scale LiDAR Point Clouds for Semantic Segmentation in Forestry Robotics.SyS3DS:用于林业机器人语义分割的大规模激光雷达点云系统采样
Sensors (Basel). 2024 Jan 26;24(3):823. doi: 10.3390/s24030823.
6
Superpixel Guided Deformable Convolution Network for Hyperspectral Image Classification.用于高光谱图像分类的超像素引导可变形卷积网络
IEEE Trans Image Process. 2022;31:3838-3851. doi: 10.1109/TIP.2022.3176537. Epub 2022 Jun 2.
7
MFAFNet: A Lightweight and Efficient Network with Multi-Level Feature Adaptive Fusion for Real-Time Semantic Segmentation.MFAFNet:一种用于实时语义分割的具有多级特征自适应融合的轻量级高效网络。
Sensors (Basel). 2023 Jul 13;23(14):6382. doi: 10.3390/s23146382.
8
A Novel Local-Global Graph Convolutional Method for Point Cloud Semantic Segmentation.一种用于点云语义分割的新型局部-全局图卷积方法。
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):4798-4812. doi: 10.1109/TNNLS.2022.3155282. Epub 2024 Apr 4.
9
Siamese network with a depthwise over-parameterized convolutional layer for visual tracking.用于视觉跟踪的带有深度过度参数化卷积层的暹罗网络。
PLoS One. 2022 Aug 31;17(8):e0273690. doi: 10.1371/journal.pone.0273690. eCollection 2022.
10
Zig-Zag Network for Semantic Segmentation of RGB-D Images.Zig-Zag 网络用于 RGB-D 图像的语义分割。
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2642-2655. doi: 10.1109/TPAMI.2019.2923513. Epub 2019 Jun 18.

引用本文的文献

1
An Adaptive Multi-Stage and Adjacent-Level Feature Integration Network for Brain Tumor Image Segmentation.一种用于脑肿瘤图像分割的自适应多阶段和相邻层级特征集成网络。
Interdiscip Sci. 2025 Aug 14. doi: 10.1007/s12539-025-00748-w.
2
Phenotypic dynamics and temporal heritability of tomato architectural traits using an unmanned ground vehicle-based plant phenotyping system.使用基于无人地面车辆的植物表型系统研究番茄株型性状的表型动态和时间遗传力
Hortic Res. 2025 Apr 30;12(8):uhaf109. doi: 10.1093/hr/uhaf109. eCollection 2025 Aug.
3
CFANet: The Cross-Modal Fusion Attention Network for Indoor RGB-D Semantic Segmentation.
CFANet:用于室内RGB-D语义分割的跨模态融合注意力网络
J Imaging. 2025 May 27;11(6):177. doi: 10.3390/jimaging11060177.
4
Visual image reconstructed without semantics from human brain activity using linear image decoders and nonlinear noise suppression.使用线性图像解码器和非线性噪声抑制技术,从人类大脑活动中重建无语义的视觉图像。
Cogn Neurodyn. 2025 Dec;19(1):20. doi: 10.1007/s11571-024-10184-z. Epub 2025 Jan 9.
5
DDNet: Depth Dominant Network for Semantic Segmentation of RGB-D Images.DDNet:用于RGB-D图像语义分割的深度主导网络
Sensors (Basel). 2024 Oct 28;24(21):6914. doi: 10.3390/s24216914.
6
Semantic Guidance Fusion Network for Cross-Modal Semantic Segmentation.用于跨模态语义分割的语义引导融合网络
Sensors (Basel). 2024 Apr 12;24(8):2473. doi: 10.3390/s24082473.
7
Multi-scale fusion for RGB-D indoor semantic segmentation.用于RGB-D室内语义分割的多尺度融合
Sci Rep. 2022 Nov 24;12(1):20305. doi: 10.1038/s41598-022-24836-9.