• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于机器人视觉感知的级联轮廓增强全景分割

Cascade contour-enhanced panoptic segmentation for robotic vision perception.

作者信息

Xu Yue, Liu Runze, Zhu Dongchen, Chen Lili, Zhang Xiaolin, Li Jiamao

机构信息

Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai, China.

School of Information Science and Technology, ShanghaiTech University, Shanghai, China.

出版信息

Front Neurorobot. 2024 Oct 21;18:1489021. doi: 10.3389/fnbot.2024.1489021. eCollection 2024.

DOI:10.3389/fnbot.2024.1489021
PMID:39498236
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11532083/
Abstract

Panoptic segmentation plays a crucial role in enabling robots to comprehend their surroundings, providing fine-grained scene understanding information for robots' intelligent tasks. Although existing methods have made some progress, they are prone to fail in areas with weak textures, small objects, etc. Inspired by biological vision research, we propose a cascaded contour-enhanced panoptic segmentation network called CCPSNet, attempting to enhance the discriminability of instances through structural knowledge. To acquire the scene structure, a cascade contour detection stream is designed, which extracts comprehensive scene contours using channel regulation structural perception module and coarse-to-fine cascade strategy. Furthermore, the contour-guided multi-scale feature enhancement stream is developed to boost the discrimination ability for small objects and weak textures. The stream integrates contour information and multi-scale context features through structural-aware feature modulation module and inverse aggregation technique. Experimental results show that our method improves accuracy on the Cityscapes (61.2 PQ) and COCO (43.5 PQ) datasets while also demonstrating robustness in challenging simulated real-world complex scenarios faced by robots, such as dirty cameras and rainy conditions. The proposed network promises to help the robot perceive the real scene. In future work, an unsupervised training strategy for the network could be explored to reduce the training cost.

摘要

全景分割在使机器人理解周围环境方面起着至关重要的作用,为机器人的智能任务提供细粒度的场景理解信息。尽管现有方法已经取得了一些进展,但它们在纹理较弱、物体较小等区域容易失败。受生物视觉研究的启发,我们提出了一种名为CCPSNet的级联轮廓增强全景分割网络,试图通过结构知识增强实例的可辨别性。为了获取场景结构,设计了一个级联轮廓检测流,它使用通道调节结构感知模块和由粗到细的级联策略提取综合场景轮廓。此外,还开发了轮廓引导的多尺度特征增强流,以提高对小物体和弱纹理的辨别能力。该流通过结构感知特征调制模块和逆聚合技术集成轮廓信息和多尺度上下文特征。实验结果表明,我们的方法在Cityscapes(61.2 PQ)和COCO(43.5 PQ)数据集上提高了准确率,同时在机器人面临的具有挑战性的模拟真实世界复杂场景(如脏相机和雨天条件)中也表现出了鲁棒性。所提出的网络有望帮助机器人感知真实场景。在未来的工作中,可以探索该网络的无监督训练策略以降低训练成本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/832d2791bdff/fnbot-18-1489021-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/02811fcce1f1/fnbot-18-1489021-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/86762a540d3f/fnbot-18-1489021-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/942824cb943d/fnbot-18-1489021-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/5f81ea26734c/fnbot-18-1489021-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/cd56481855e8/fnbot-18-1489021-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/0a2994c057a8/fnbot-18-1489021-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/832d2791bdff/fnbot-18-1489021-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/02811fcce1f1/fnbot-18-1489021-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/86762a540d3f/fnbot-18-1489021-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/942824cb943d/fnbot-18-1489021-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/5f81ea26734c/fnbot-18-1489021-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/cd56481855e8/fnbot-18-1489021-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/0a2994c057a8/fnbot-18-1489021-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc8f/11532083/832d2791bdff/fnbot-18-1489021-g0007.jpg

相似文献

1
Cascade contour-enhanced panoptic segmentation for robotic vision perception.用于机器人视觉感知的级联轮廓增强全景分割
Front Neurorobot. 2024 Oct 21;18:1489021. doi: 10.3389/fnbot.2024.1489021. eCollection 2024.
2
Fast Panoptic Segmentation with Soft Attention Embeddings.快速全景分割的软注意嵌入。
Sensors (Basel). 2022 Jan 20;22(3):783. doi: 10.3390/s22030783.
3
Dense Pixel-Level Interpretation of Dynamic Scenes With Video Panoptic Segmentation.基于视频全景分割的动态场景密集像素级解读
IEEE Trans Image Process. 2022;31:5383-5395. doi: 10.1109/TIP.2022.3183440. Epub 2022 Aug 17.
4
Mask-Pyramid Network: A Novel Panoptic Segmentation Method.掩码金字塔网络:一种新型全景分割方法。
Sensors (Basel). 2024 Feb 22;24(5):1411. doi: 10.3390/s24051411.
5
Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks.通过动态移位网络实现统一的3D和4D全景分割
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3480-3495. doi: 10.1109/TPAMI.2023.3349304. Epub 2024 Apr 3.
6
A visual-degradation-inspired model with HSV color-encoding for contour detection.一种基于 HSV 颜色编码的视觉退化启发模型,用于轮廓检测。
J Neurosci Methods. 2022 Mar 1;369:109423. doi: 10.1016/j.jneumeth.2021.109423. Epub 2021 Nov 24.
7
Object-Centric Representation Learning for Video Scene Understanding.用于视频场景理解的以对象为中心的表示学习
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8410-8423. doi: 10.1109/TPAMI.2024.3401409. Epub 2024 Nov 6.
8
Fully Convolutional Networks for Panoptic Segmentation With Point-Based Supervision.基于点监督的全景分割全卷积网络
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4552-4568. doi: 10.1109/TPAMI.2022.3200416. Epub 2023 Mar 7.
9
Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation.全景部分Former++:用于全景部分分割的统一解耦视图
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11087-11103. doi: 10.1109/TPAMI.2024.3453916. Epub 2024 Nov 6.
10
G2ViT: Graph Neural Network-Guided Vision Transformer Enhanced Network for retinal vessel and coronary angiograph segmentation.G2ViT:基于图神经网络引导的视觉Transformer 增强网络,用于视网膜血管和冠状动脉造影分割。
Neural Netw. 2024 Aug;176:106356. doi: 10.1016/j.neunet.2024.106356. Epub 2024 May 3.

引用本文的文献

1
Panoptic Image Segmentation Method Based on Dynamic Instance Query.基于动态实例查询的全景图像分割方法
Sensors (Basel). 2025 May 5;25(9):2919. doi: 10.3390/s25092919.

本文引用的文献

1
Remote intelligent perception system for multi-object detection.用于多目标检测的远程智能感知系统
Front Neurorobot. 2024 May 20;18:1398703. doi: 10.3389/fnbot.2024.1398703. eCollection 2024.
2
Based on cross-scale fusion attention mechanism network for semantic segmentation for street scenes.基于跨尺度融合注意力机制网络的街景语义分割
Front Neurorobot. 2023 Aug 31;17:1204418. doi: 10.3389/fnbot.2023.1204418. eCollection 2023.
3
IDNet: Information Decomposition Network for Fast Panoptic Segmentation.IDNet:用于快速全景分割的信息分解网络。
IEEE Trans Image Process. 2024;33:1487-1496. doi: 10.1109/TIP.2023.3234499. Epub 2024 Feb 21.
4
Rethinking 1D convolution for lightweight semantic segmentation.重新思考用于轻量级语义分割的一维卷积
Front Neurorobot. 2023 Feb 9;17:1119231. doi: 10.3389/fnbot.2023.1119231. eCollection 2023.
5
Dual-flow network with attention for autonomous driving.用于自动驾驶的带注意力机制的双流网络。
Front Neurorobot. 2023 Jan 9;16:978225. doi: 10.3389/fnbot.2022.978225. eCollection 2022.
6
A lightweight multi-dimension dynamic convolutional network for real-time semantic segmentation.一种用于实时语义分割的轻量级多维动态卷积网络。
Front Neurorobot. 2022 Dec 15;16:1075520. doi: 10.3389/fnbot.2022.1075520. eCollection 2022.
7
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation.用于快速全景分割的学习类别和实例感知像素嵌入
IEEE Trans Image Process. 2021;30:6013-6023. doi: 10.1109/TIP.2021.3090522. Epub 2021 Jul 1.
8
A Multimodal Saliency Model for Videos with High Audio-Visual Correspondence.一种用于具有高视听对应性视频的多模态显著性模型。
IEEE Trans Image Process. 2020 Jan 17. doi: 10.1109/TIP.2020.2966082.
9
Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network.使用语义分割网络的快速R-CNN用于稳健行人检测
Front Neurorobot. 2018 Oct 5;12:64. doi: 10.3389/fnbot.2018.00064. eCollection 2018.
10
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.