• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于语义图像分割的基于自上而下方式的深度卷积神经网络(DCNN)架构。

A top-down manner-based DCNN architecture for semantic image segmentation.

作者信息

Qiao Kai, Chen Jian, Wang Linyuan, Zeng Lei, Yan Bin

机构信息

National Digital Switching System Engineering and Technological Research Centre, Zhengzhou, China.

出版信息

PLoS One. 2017 Mar 24;12(3):e0174508. doi: 10.1371/journal.pone.0174508. eCollection 2017.

DOI:10.1371/journal.pone.0174508
PMID:28339486
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5365135/
Abstract

Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets.

摘要

鉴于深度卷积神经网络(DCNN)在识别方面具有强大的特征表示能力,它们推动了高级计算机视觉任务的快速发展。然而,其在语义图像分割方面的性能仍不尽人意。基于对视觉机制的分析,我们得出结论,自下而上的DCNN方式是不够的,因为语义图像分割任务不仅需要识别能力,还需要视觉注意力能力。在本研究中,以自上而下的方式引入了包含视觉注意力信息的超像素,并提出了一种可扩展的架构,以改善当前基于DCNN方法的分割结果。我们采用当前最先进的全卷积网络(FCN)以及带有条件随机场的FCN(DeepLab-CRF)作为基线来验证我们的架构。PASCAL VOC分割任务的实验结果定性地表明,粗糙边缘和错误分割结果得到了很好的改善。我们还在PASCAL VOC 2011和2012测试集上定量地获得了约2%-3%的交并比(IOU)精度提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/de39a346ebc8/pone.0174508.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/ea1b68389016/pone.0174508.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/600a810c2eb8/pone.0174508.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/576860df96ff/pone.0174508.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/711d116c9d11/pone.0174508.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/2db46329a570/pone.0174508.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/de39a346ebc8/pone.0174508.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/ea1b68389016/pone.0174508.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/600a810c2eb8/pone.0174508.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/576860df96ff/pone.0174508.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/711d116c9d11/pone.0174508.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/2db46329a570/pone.0174508.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8d1/5365135/de39a346ebc8/pone.0174508.g006.jpg

相似文献

1
A top-down manner-based DCNN architecture for semantic image segmentation.一种用于语义图像分割的基于自上而下方式的深度卷积神经网络(DCNN)架构。
PLoS One. 2017 Mar 24;12(3):e0174508. doi: 10.1371/journal.pone.0174508. eCollection 2017.
2
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
3
Efficient skin lesion segmentation using separable-Unet with stochastic weight averaging.使用可分离 U-Net 和随机权重平均化实现高效的皮肤病变分割。
Comput Methods Programs Biomed. 2019 Sep;178:289-301. doi: 10.1016/j.cmpb.2019.07.005. Epub 2019 Jul 8.
4
Convolutional neural network for automated mass segmentation in mammography.卷积神经网络在乳腺 X 线摄影中用于自动肿块分割。
BMC Bioinformatics. 2020 Dec 9;21(Suppl 1):192. doi: 10.1186/s12859-020-3521-y.
5
A comparative study of pre-trained convolutional neural networks for semantic segmentation of breast tumors in ultrasound.用于超声乳腺肿瘤语义分割的预训练卷积神经网络的比较研究
Comput Biol Med. 2020 Nov;126:104036. doi: 10.1016/j.compbiomed.2020.104036. Epub 2020 Oct 8.
6
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
7
Depth Density Achieves a Better Result for Semantic Segmentation with the Kinect System.深度密度可通过 Kinect 系统实现更好的语义分割效果。
Sensors (Basel). 2020 Feb 3;20(3):812. doi: 10.3390/s20030812.
8
A Lightweight Semantic Segmentation Algorithm Based on Deep Convolutional Neural Networks.基于深度卷积神经网络的轻量级语义分割算法。
Comput Intell Neurosci. 2022 Sep 6;2022:5339664. doi: 10.1155/2022/5339664. eCollection 2022.
9
Automated pectoral muscle identification on MLO-view mammograms: Comparison of deep neural network to conventional computer vision.基于 MLO 视图的乳腺钼靶片中自动胸大肌识别:深度神经网络与传统计算机视觉的比较。
Med Phys. 2019 May;46(5):2103-2114. doi: 10.1002/mp.13451. Epub 2019 Mar 12.
10
Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks.基于最先进的深度卷积神经网络的手写孟加拉语字符识别。
Comput Intell Neurosci. 2018 Aug 27;2018:6747098. doi: 10.1155/2018/6747098. eCollection 2018.

引用本文的文献

1
A Kinect-Based Segmentation of Touching-Pigs for Real-Time Monitoring.基于 Kinect 的触碰小猪实时监测分割。
Sensors (Basel). 2018 May 29;18(6):1746. doi: 10.3390/s18061746.

本文引用的文献

1
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
2
Robust Neighborhood Preserving Projection by Nuclear/L2,1-Norm Regularization for Image Feature Extraction.基于核范数/L2,1-范数正则化的鲁棒邻域保持投影的图像特征提取。
IEEE Trans Image Process. 2017 Apr;26(4):1607-1622. doi: 10.1109/TIP.2017.2654163. Epub 2017 Jan 16.
3
Real-Time Superpixel Segmentation by DBSCAN Clustering Algorithm.
基于DBSCAN聚类算法的实时超像素分割
IEEE Trans Image Process. 2016 Dec;25(12):5933-5942. doi: 10.1109/TIP.2016.2616302. Epub 2016 Oct 11.
4
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
5
Cross-Modal Retrieval With CNN Visual Features: A New Baseline.基于卷积神经网络视觉特征的跨模态检索:一个新的基线。
IEEE Trans Cybern. 2017 Feb;47(2):449-460. doi: 10.1109/TCYB.2016.2519449. Epub 2016 Mar 8.
6
Human Parsing with Contextualized Convolutional Neural Network.基于上下文卷积神经网络的人体解析
IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):115-127. doi: 10.1109/TPAMI.2016.2537339. Epub 2016 Mar 2.
7
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
8
Performance-optimized hierarchical models predict neural responses in higher visual cortex.性能优化的层次模型预测高级视觉皮层中的神经反应。
Proc Natl Acad Sci U S A. 2014 Jun 10;111(23):8619-24. doi: 10.1073/pnas.1403112111. Epub 2014 May 8.
9
Learning hierarchical features for scene labeling.学习用于场景标注的层次特征。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1915-29. doi: 10.1109/TPAMI.2012.231.
10
Guided image filtering.引导图像滤波。
IEEE Trans Pattern Anal Mach Intell. 2013 Jun;35(6):1397-409. doi: 10.1109/TPAMI.2012.213.