• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有条件核的概率图注意力网络用于逐像素预测。

Probabilistic Graph Attention Network With Conditional Kernels for Pixel-Wise Prediction.

作者信息

Xu Dan, Alameda-Pineda Xavier, Ouyang Wanli, Ricci Elisa, Wang Xiaogang, Sebe Nicu

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2673-2688. doi: 10.1109/TPAMI.2020.3043781. Epub 2022 Apr 1.

DOI:10.1109/TPAMI.2020.3043781
PMID:33301402
Abstract

Multi-scale representations deeply learned via convolutional neural networks have shown tremendous importance for various pixel-level prediction problems. In this paper we present a novel approach that advances the state of the art on pixel-level prediction in a fundamental aspect, i.e. structured multi-scale features learning and fusion. In contrast to previous works directly considering multi-scale feature maps obtained from the inner layers of a primary CNN architecture, and simply fusing the features with weighted averaging or concatenation, we propose a probabilistic graph attention network structure based on a novel Attention-Gated Conditional Random Fields (AG-CRFs) model for learning and fusing multi-scale representations in a principled manner. In order to further improve the learning capacity of the network structure, we propose to exploit feature dependant conditional kernels within the deep probabilistic framework. Extensive experiments are conducted on four publicly available datasets (i.e. BSDS500, NYUD-V2, KITTI and Pascal-Context) and on three challenging pixel-wise prediction problems involving both discrete and continuous labels (i.e. monocular depth estimation, object contour prediction and semantic segmentation). Quantitative and qualitative results demonstrate the effectiveness of the proposed latent AG-CRF model and the overall probabilistic graph attention network with feature conditional kernels for structured feature learning and pixel-wise prediction.

摘要

通过卷积神经网络深度学习的多尺度表示已在各种像素级预测问题中显示出巨大的重要性。在本文中,我们提出了一种新颖的方法,该方法在一个基本方面推进了像素级预测的技术水平,即结构化多尺度特征学习与融合。与先前直接考虑从主卷积神经网络(CNN)架构的内层获得的多尺度特征图,并简单地通过加权平均或拼接来融合特征的工作不同,我们基于一种新颖的注意力门控条件随机场(AG-CRFs)模型提出了一种概率图注意力网络结构,用于以有原则的方式学习和融合多尺度表示。为了进一步提高网络结构的学习能力,我们建议在深度概率框架内利用特征相关的条件核。我们在四个公开可用的数据集(即BSDS500、NYUD-V2、KITTI和Pascal-Context)以及三个涉及离散和连续标签的具有挑战性的像素级预测问题(即单目深度估计、物体轮廓预测和语义分割)上进行了广泛的实验。定量和定性结果证明了所提出的潜在AG-CRF模型以及带有特征条件核的整体概率图注意力网络在结构化特征学习和像素级预测方面的有效性。

相似文献

1
Probabilistic Graph Attention Network With Conditional Kernels for Pixel-Wise Prediction.具有条件核的概率图注意力网络用于逐像素预测。
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2673-2688. doi: 10.1109/TPAMI.2020.3043781. Epub 2022 Apr 1.
2
Monocular Depth Estimation Using Multi-Scale Continuous CRFs as Sequential Deep Networks.使用多尺度连续条件随机场作为序列深度网络的单目深度估计
IEEE Trans Pattern Anal Mach Intell. 2019 Jun;41(6):1426-1440. doi: 10.1109/TPAMI.2018.2839602. Epub 2018 May 22.
3
Variational Structured Attention Networks for Deep Visual Representation Learning.用于深度视觉表征学习的变分结构化注意力网络
IEEE Trans Image Process. 2022 Mar 2;PP. doi: 10.1109/TIP.2021.3137647.
4
A Novel Upsampling and Context Convolution for Image Semantic Segmentation.一种用于图像语义分割的新型上采样与上下文卷积
Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.
5
Group-Wise Learning for Weakly Supervised Semantic Segmentation.基于群体学习的弱监督语义分割。
IEEE Trans Image Process. 2022;31:799-811. doi: 10.1109/TIP.2021.3132834. Epub 2022 Jan 4.
6
SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches.SFA-MDEN:基于语义特征辅助的双通道单目深度估计网络。
Sensors (Basel). 2021 Aug 13;21(16):5476. doi: 10.3390/s21165476.
7
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
8
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.利用深度卷积神经场从单目图像中学习深度。
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2024-39. doi: 10.1109/TPAMI.2015.2505283. Epub 2015 Dec 3.
9
Building segmentation through a gated graph convolutional neural network with deep structured feature embedding.通过具有深度结构化特征嵌入的门控图卷积神经网络进行建筑物分割。
ISPRS J Photogramm Remote Sens. 2020 Jan;159:184-197. doi: 10.1016/j.isprsjprs.2019.11.004.
10
Automatic liver segmentation by integrating fully convolutional networks into active contour models.基于全卷积网络的主动轮廓模型自动肝脏分割
Med Phys. 2019 Oct;46(10):4455-4469. doi: 10.1002/mp.13735. Epub 2019 Aug 16.

引用本文的文献

1
Shearlets as feature extractor for semantic edge detection: the model-based and data-driven realm.用于语义边缘检测的剪切波作为特征提取器:基于模型和数据驱动的领域。
Proc Math Phys Eng Sci. 2020 Nov;476(2243):20190841. doi: 10.1098/rspa.2019.0841. Epub 2020 Nov 25.