• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于面向稀有类别的超像素先验的双分辨率语义分割

: Dual-resolution Semantic Segmentation with Rare Class-Oriented Superpixel Prior.

作者信息

Yu Liangjiang, Fan Guoliang

机构信息

School of Electrical and Computer Engineering, Oklahoma State University, USA.

出版信息

Multimed Tools Appl. 2021 Jan;80(2):1687-1706. doi: 10.1007/s11042-020-09691-y. Epub 2020 Sep 9.

DOI:10.1007/s11042-020-09691-y
PMID:33776547
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7988710/
Abstract

Rare-class objects in natural scene images that are usually small and less frequent often convey more important information for scene understanding than the common ones. However, they are often overlooked in scene labeling studies due to two main reasons, low occurrence frequency and limited spatial coverage. Many methods have been proposed to enhance overall semantic labeling performance, but only a few consider rare-class objects. In this work, we present a deep semantic labeling framework with special consideration of rare classes via three techniques. First, a novel dual-resolution coarse-to-fine superpixel representation is developed, where fine and coarse superpixels are applied to rare classes and background areas respectively. This unique dual representation allows seamless incorporation of shape features into integrated global and local convolutional neural network (CNN) models. Second, shape information is directly involved during the CNN feature learning for both frequent and rare classes from the re-balanced training data, and also explicitly involved in data inference. Third, the proposed framework incorporates both shape information and the CNN architecture into semantic labeling through a fusion of probabilistic multi-class likelihood. Experimental results demonstrate competitive semantic labeling performance on two standard datasets both qualitatively and quantitatively, especially for rare-class objects.

摘要

自然场景图像中的稀有类对象通常较小且出现频率较低,但对于场景理解而言,它们往往比常见对象传达更重要的信息。然而,由于出现频率低和空间覆盖范围有限这两个主要原因,它们在场景标注研究中常常被忽视。已经提出了许多方法来提高整体语义标注性能,但只有少数方法考虑了稀有类对象。在这项工作中,我们通过三种技术提出了一个特别考虑稀有类的深度语义标注框架。首先,开发了一种新颖的双分辨率从粗到精的超像素表示,其中精细和粗糙超像素分别应用于稀有类和背景区域。这种独特的双重表示允许将形状特征无缝整合到集成的全局和局部卷积神经网络(CNN)模型中。其次,形状信息在从重新平衡的训练数据中对频繁类和稀有类进行CNN特征学习期间直接参与,并且也明确参与数据推理。第三,所提出的框架通过概率多类似然的融合将形状信息和CNN架构都纳入语义标注中。实验结果在定性和定量方面都展示了在两个标准数据集上具有竞争力的语义标注性能,特别是对于稀有类对象。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/44581af7151f/nihms-1627718-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/8dbadb37ad71/nihms-1627718-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/e82d156e85da/nihms-1627718-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/5e589dcaad5f/nihms-1627718-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/e66fce620568/nihms-1627718-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/92d8c6179348/nihms-1627718-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/ac46511ad903/nihms-1627718-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/44581af7151f/nihms-1627718-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/8dbadb37ad71/nihms-1627718-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/e82d156e85da/nihms-1627718-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/5e589dcaad5f/nihms-1627718-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/e66fce620568/nihms-1627718-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/92d8c6179348/nihms-1627718-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/ac46511ad903/nihms-1627718-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f06/7988710/44581af7151f/nihms-1627718-f0009.jpg

相似文献

1
: Dual-resolution Semantic Segmentation with Rare Class-Oriented Superpixel Prior.基于面向稀有类别的超像素先验的双分辨率语义分割
Multimed Tools Appl. 2021 Jan;80(2):1687-1706. doi: 10.1007/s11042-020-09691-y. Epub 2020 Sep 9.
2
Attention-Based Context Aware Network for Semantic Comprehension of Aerial Scenery.基于注意力的上下文感知网络,用于对航空场景进行语义理解。
Sensors (Basel). 2021 Mar 11;21(6):1983. doi: 10.3390/s21061983.
3
High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network.高分辨率航空影像语义标注的密集金字塔网络方法
Sensors (Basel). 2018 Nov 5;18(11):3774. doi: 10.3390/s18113774.
4
A Multi-Modal, Discriminative and Spatially Invariant CNN for RGB-D Object Labeling.一种用于RGB-D目标标注的多模态、判别式且空间不变的卷积神经网络。
IEEE Trans Pattern Anal Mach Intell. 2018 Sep;40(9):2051-2065. doi: 10.1109/TPAMI.2017.2747134. Epub 2017 Aug 30.
5
Self-supervised learning for remote sensing scene classification under the few shot scenario.基于小样本场景的遥感场景分类的自监督学习。
Sci Rep. 2023 Jan 9;13(1):433. doi: 10.1038/s41598-022-27313-5.
6
Global-and-Local Context Network for Semantic Segmentation of Street View Images.用于街景图像语义分割的全局-局部上下文网络。
Sensors (Basel). 2020 May 21;20(10):2907. doi: 10.3390/s20102907.
7
Domain adaptive semantic segmentation by optimal transport.基于最优传输的域自适应语义分割
Fundam Res. 2023 Jul 1;4(5):981-991. doi: 10.1016/j.fmre.2023.06.006. eCollection 2024 Sep.
8
Coarse-to-fine airway segmentation using multi information fusion network and CNN-based region growing.基于多信息融合网络和基于 CNN 的区域生长的粗到细气道分割。
Comput Methods Programs Biomed. 2022 Mar;215:106610. doi: 10.1016/j.cmpb.2021.106610. Epub 2022 Jan 8.
9
Joint superpixel and Transformer for high resolution remote sensing image classification.用于高分辨率遥感图像分类的联合超像素与Transformer方法
Sci Rep. 2024 Mar 1;14(1):5054. doi: 10.1038/s41598-024-55482-y.
10
Superpixel-based deep convolutional neural networks and active contour model for automatic prostate segmentation on 3D MRI scans.基于超像素的深度卷积神经网络和主动轮廓模型在 3D MRI 扫描上的自动前列腺分割。
Med Biol Eng Comput. 2020 Sep;58(9):1947-1964. doi: 10.1007/s11517-020-02199-5. Epub 2020 Jun 21.

引用本文的文献

1
Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks.利用连体网络学习视觉目标跟踪中目标的几何信息。
Sensors (Basel). 2021 Nov 23;21(23):7790. doi: 10.3390/s21237790.
2
Whole-Field Reinforcement Learning: A Fully Autonomous Aerial Scouting Method for Precision Agriculture.全场强化学习:精准农业的全自动空中侦察方法。
Sensors (Basel). 2020 Nov 18;20(22):6585. doi: 10.3390/s20226585.

本文引用的文献

1
Scene Segmentation with DAG-Recurrent Neural Networks.基于有向无环图递归神经网络的场景分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1480-1493. doi: 10.1109/TPAMI.2017.2712691. Epub 2017 Jun 6.
2
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
3
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.
SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
4
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
5
Scene Parsing With Integration of Parametric and Non-Parametric Models.场景解析与参数和非参数模型的整合。
IEEE Trans Image Process. 2016 May;25(5):2379-91. doi: 10.1109/TIP.2016.2533862.
6
Learning hierarchical features for scene labeling.学习用于场景标注的层次特征。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1915-29. doi: 10.1109/TPAMI.2012.231.
7
SLIC superpixels compared to state-of-the-art superpixel methods.SLIC 超像素与最先进的超像素方法比较。
IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2274-82. doi: 10.1109/TPAMI.2012.120.
8
Nonparametric Scene Parsing via Label Transfer.基于标签转移的非参数场景解析。
IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2368-82. doi: 10.1109/TPAMI.2011.131. Epub 2011 Jun 30.