• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于上下文卷积神经网络的人体解析

Human Parsing with Contextualized Convolutional Neural Network.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):115-127. doi: 10.1109/TPAMI.2016.2537339. Epub 2016 Mar 2.

DOI:10.1109/TPAMI.2016.2537339
PMID:26955019
Abstract

In this work, we address the human parsing task with a novel Contextualized Convolutional Neural Network (Co-CNN) architecture, which well integrates the cross-layer context, global image-level context, semantic edge context, within-super-pixel context and cross-super-pixel neighborhood context into a unified network. Given an input human image, Co-CNN produces the pixelwise categorization in an end-to-end way. First, the cross-layer context is captured by our basic local-to-global-to-local structure, which hierarchically combines the global semantic information and the local fine details across different convolutional layers. Second, the global image-level label prediction is used as an auxiliary objective in the intermediate layer of the Co-CNN, and its outputs are further used for guiding the feature learning in subsequent convolutional layers to leverage the global image-level context. Third, semantic edge context is further incorporated into Co-CNN, where the high-level semantic boundaries are leveraged to guide pixel-wise labeling. Finally, to further utilize the local super-pixel contexts, the within-super-pixel smoothing and cross-super-pixel neighbourhood voting are formulated as natural sub-components of the Co-CNN to achieve the local label consistency in both training and testing process. Comprehensive evaluations on two public datasets well demonstrate the significant superiority of our Co-CNN over other state-of-the-arts for human parsing. In particular, the F-1 score on the large dataset [1] reaches 81.72 percent by Co-CNN, significantly higher than 62.81 percent and 64.38 percent by the state-of-the-art algorithms, M-CNN [2] and ATR [1], respectively. By utilizing our newly collected large dataset for training, our Co-CNN can achieve 85.36 percent in F-1 score.

摘要

在这项工作中,我们使用一种新颖的上下文卷积神经网络(Co-CNN)架构来处理人体解析任务,该架构将跨层上下文、全局图像级上下文、语义边缘上下文、超像素内上下文和跨超像素邻域上下文很好地集成到一个统一的网络中。给定一幅输入的人体图像,Co-CNN以端到端的方式生成像素级分类。首先,我们通过基本的局部到全局再到局部结构捕捉跨层上下文,该结构将不同卷积层的全局语义信息和局部精细细节分层结合起来。其次,全局图像级标签预测在Co-CNN的中间层用作辅助目标,其输出进一步用于指导后续卷积层的特征学习,以利用全局图像级上下文。第三,语义边缘上下文进一步融入Co-CNN,利用高层语义边界来指导像素级标注。最后,为了进一步利用局部超像素上下文,将超像素内平滑和跨超像素邻域投票作为Co-CNN的自然子组件,以在训练和测试过程中实现局部标签一致性。在两个公共数据集上的综合评估充分证明了我们的Co-CNN相对于其他最先进的人体解析方法具有显著优势。特别是,Co-CNN在大型数据集[1]上的F-1分数达到81.72%,分别显著高于最先进算法M-CNN[2]和ATR[1]的62.81%和64.38%。通过使用我们新收集的大型数据集进行训练,我们的Co-CNN在F-1分数上可以达到85.36%。

相似文献

1
Human Parsing with Contextualized Convolutional Neural Network.基于上下文卷积神经网络的人体解析
IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):115-127. doi: 10.1109/TPAMI.2016.2537339. Epub 2016 Mar 2.
2
Deep Human Parsing with Active Template Regression.深度人体解析与主动模板回归。
IEEE Trans Pattern Anal Mach Intell. 2015 Dec;37(12):2402-14. doi: 10.1109/TPAMI.2015.2408360.
3
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
4
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
5
HCP: A Flexible CNN Framework for Multi-label Image Classification.HCP:一种用于多标签图像分类的灵活卷积神经网络框架。
IEEE Trans Pattern Anal Mach Intell. 2016 Sep 1;38(9):1901-1907. doi: 10.1109/TPAMI.2015.2491929. Epub 2015 Oct 26.
6
Esophagus segmentation in CT via 3D fully convolutional neural network and random walk.基于 3D 全卷积神经网络和随机游走的 CT 食管分割。
Med Phys. 2017 Dec;44(12):6341-6352. doi: 10.1002/mp.12593. Epub 2017 Oct 23.
7
Co-trained convolutional neural networks for automated detection of prostate cancer in multi-parametric MRI.基于多参数 MRI 的协同训练卷积神经网络在前列腺癌自动检测中的应用
Med Image Anal. 2017 Dec;42:212-227. doi: 10.1016/j.media.2017.08.006. Epub 2017 Aug 24.
8
Diverse Region-Based CNN for Hyperspectral Image Classification.基于多样化区域的卷积神经网络在高光谱图像分类中的应用。
IEEE Trans Image Process. 2018 Jun;27(6):2623-2634. doi: 10.1109/TIP.2018.2809606.
9
Scene Parsing With Integration of Parametric and Non-Parametric Models.场景解析与参数和非参数模型的整合。
IEEE Trans Image Process. 2016 May;25(5):2379-91. doi: 10.1109/TIP.2016.2533862.
10
Learning a Convolutional Neural Network for Image Compact-Resolution.学习用于图像小分辨率的卷积神经网络。
IEEE Trans Image Process. 2019 Mar;28(3):1092-1107. doi: 10.1109/TIP.2018.2872876. Epub 2018 Sep 28.

引用本文的文献

1
A top-down manner-based DCNN architecture for semantic image segmentation.一种用于语义图像分割的基于自上而下方式的深度卷积神经网络(DCNN)架构。
PLoS One. 2017 Mar 24;12(3):e0174508. doi: 10.1371/journal.pone.0174508. eCollection 2017.