Scene Segmentation with DAG-Recurrent Neural Networks.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1480-1493. doi: 10.1109/TPAMI.2017.2712691. Epub 2017 Jun 6.

DOI:10.1109/TPAMI.2017.2712691
PMID:28600239
Abstract

In this paper, we address the challenging task of scene segmentation. To capture the rich contextual dependencies over image regions, we propose Directed Acyclic Graph-Recurrent Neural Networks (DAG-RNN), which perform context aggregation over locally connected feature maps. More specifically, DAG-RNN is placed on top of a pre-trained CNN (the feature extractor) to embed context into local features so that their representational capability is enhanced. Compared with a plain CNN (as in Fully Convolutional Networks, FCN), DAG-RNN is empirically found to be significantly more effective at aggregating context, and it therefore noticeably outperforms FCNs on scene segmentation. DAG-RNN also has dramatically fewer parameters and requires fewer computational operations, which makes it better suited to resource-constrained embedded devices. Meanwhile, class occurrence frequencies are extremely imbalanced in scene segmentation, so we propose a novel class-weighted loss to train the segmentation network. The loss assigns reasonably higher attention weights to infrequent classes during training, which is essential for boosting their parsing performance. We evaluate our segmentation network on three challenging public scene segmentation benchmarks: SIFT Flow, Pascal Context, and COCO Stuff. On all three, we achieve very impressive segmentation performance.
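
The two ideas in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the 8-neighbour grid connectivity, the tanh nonlinearity, the sharing of one parameter set across the four sweep directions, and the log-frequency weighting formula are all assumptions made for illustration, and every function and parameter name below is hypothetical. The first part sweeps a recurrent unit over a feature map along a directed acyclic ordering and sums four such sweeps; the second part derives larger loss weights for rarer classes and applies them in a cross-entropy loss.

```python
import numpy as np

def dag_sweep(x, U, W_h, b):
    """One recurrent sweep over an H x W grid, top-left to bottom-right.

    x: (H, W, C) local features; U: (D, C); W_h: (D, D); b: (D,).
    Each cell combines its own feature with the hidden states of its
    already-visited north, west and north-west neighbours (assumed DAG).
    """
    rows, cols, _ = x.shape
    dim = b.shape[0]
    h = np.zeros((rows, cols, dim))
    for i in range(rows):
        for j in range(cols):
            ctx = np.zeros(dim)
            for di, dj in ((-1, 0), (0, -1), (-1, -1)):
                pi, pj = i + di, j + dj
                if pi >= 0 and pj >= 0:
                    ctx += h[pi, pj]
            h[i, j] = np.tanh(U @ x[i, j] + W_h @ ctx + b)
    return h

def aggregate_context(x, U, W_h, b):
    """Sum four directional sweeps so every cell sees the whole map.

    Parameters are shared across directions only to keep the sketch short.
    """
    out = np.zeros(x.shape[:2] + (b.shape[0],))
    for flip_rows in (False, True):
        for flip_cols in (False, True):
            xf = x[::-1] if flip_rows else x
            xf = xf[:, ::-1] if flip_cols else xf
            h = dag_sweep(xf, U, W_h, b)
            h = h[::-1] if flip_rows else h
            h = h[:, ::-1] if flip_cols else h
            out += h
    return out

def class_weights(label_counts, cap=10.0):
    """Assumed weighting: 1 for the most frequent class, growing with rarity."""
    freq = np.maximum(label_counts / label_counts.sum(), 1e-12)
    return np.minimum(1.0 + np.log(freq.max() / freq), cap)

def weighted_cross_entropy(logits, labels, weights):
    """Weighted mean of per-pixel cross-entropy; logits (N, K), labels (N,)."""
    z = logits - logits.max(axis=1, keepdims=True)
    log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    nll = -log_probs[np.arange(len(labels)), labels]
    w = weights[labels]
    return float((w * nll).sum() / w.sum())
```

In a single sweep the state at a cell depends only on cells above and to its left, which is why four flipped sweeps are summed: together they let context from every direction reach every location. The weighted loss reduces to ordinary cross-entropy when all weights are equal.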

Similar articles

1
Scene Segmentation with DAG-Recurrent Neural Networks.
IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1480-1493. doi: 10.1109/TPAMI.2017.2712691. Epub 2017 Jun 6.
2
Automatic bladder segmentation from CT images using deep CNN and 3D fully connected CRF-RNN.
Int J Comput Assist Radiol Surg. 2018 Jul;13(7):967-975. doi: 10.1007/s11548-018-1733-7. Epub 2018 Mar 19.
3
Scene Segmentation With Dual Relation-Aware Attention Network.
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2547-2560. doi: 10.1109/TNNLS.2020.3006524. Epub 2021 Jun 2.
4
Towards Achieving Robust Low-level and High-level Scene Parsing.
IEEE Trans Image Process. 2018 Oct 31. doi: 10.1109/TIP.2018.2878975.
5
Automatic segmentation of OCT retinal boundaries using recurrent neural networks and graph search.
Biomed Opt Express. 2018 Oct 26;9(11):5759-5777. doi: 10.1364/BOE.9.005759. eCollection 2018 Nov 1.
6
ACR-SA: attention-based deep model through two-channel CNN and Bi-RNN for sentiment analysis.
PeerJ Comput Sci. 2022 Mar 17;8:e877. doi: 10.7717/peerj-cs.877. eCollection 2022.
7
Relative CNN-RNN: Learning Relative Atmospheric Visibility From Images.
IEEE Trans Image Process. 2019 Jan;28(1):45-55. doi: 10.1109/TIP.2018.2857219. Epub 2018 Jul 18.
8
Object class segmentation of RGB-D video using recurrent convolutional neural networks.
Neural Netw. 2017 Apr;88:105-113. doi: 10.1016/j.neunet.2017.01.003. Epub 2017 Jan 30.
9
Learning Contextual Dependence With Convolutional Hierarchical Recurrent Neural Networks.
IEEE Trans Image Process. 2016 Jul;25(7):2983-2996. doi: 10.1109/TIP.2016.2548241.
10
Spatial Clockwork Recurrent Neural Network for Muscle Perimysium Segmentation.
Med Image Comput Comput Assist Interv. 2016 Oct;9901:185-193. doi: 10.1007/978-3-319-46723-8_22. Epub 2016 Oct 2.

Cited by

1
MDEM: A Multi-Scale Damage Enhancement MambaOut for Pavement Damage Classification.
Sensors (Basel). 2025 Sep 4;25(17):5522. doi: 10.3390/s25175522.
2
Enhancing concealed object detection in active THz security images with adaptation-YOLO.
Sci Rep. 2025 Jan 21;15(1):2735. doi: 10.1038/s41598-024-81054-1.
3
Global domain adaptation attention with data-dependent regulator for scene segmentation.
PLoS One. 2024 Feb 14;19(2):e0295263. doi: 10.1371/journal.pone.0295263. eCollection 2024.
4
Based on the multi-scale information sharing network of fine-grained attention for agricultural pest detection.
PLoS One. 2023 Oct 5;18(10):e0286732. doi: 10.1371/journal.pone.0286732. eCollection 2023.
5
Implementation of CT Image Segmentation Based on an Image Segmentation Algorithm.
Appl Bionics Biomech. 2022 Oct 12;2022:2047537. doi: 10.1155/2022/2047537. eCollection 2022.
6
Two-Stage CNN Whole Heart Segmentation Combining Image Enhanced Attention Mechanism and Metric Classification.
J Digit Imaging. 2023 Feb;36(1):124-142. doi: 10.1007/s10278-022-00708-6. Epub 2022 Sep 29.
7
A Dataset for Temporal Semantic Segmentation Dedicated to Smart Mobility of Wheelchairs on Sidewalks.
J Imaging. 2022 Aug 7;8(8):216. doi: 10.3390/jimaging8080216.
8
Radiological identification of temporal lobe epilepsy using artificial intelligence: a feasibility study.
Brain Commun. 2021 Dec 8;4(2):fcab284. doi: 10.1093/braincomms/fcab284. eCollection 2022.
9
Tissue self-attention network for the segmentation of optical coherence tomography images on the esophagus.
Biomed Opt Express. 2021 Apr 7;12(5):2631-2646. doi: 10.1364/BOE.419809. eCollection 2021 May 1.
10
: Dual-resolution Semantic Segmentation with Rare Class-Oriented Superpixel Prior.
Multimed Tools Appl. 2021 Jan;80(2):1687-1706. doi: 10.1007/s11042-020-09691-y. Epub 2020 Sep 9.