
Deep Supervision with Intermediate Concepts

Authors

Li Chi, Zia M Zeeshan, Tran Quoc-Huy, Yu Xiang, Hager Gregory D, Chandraker Manmohan

Publication information

IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1828-1843. doi: 10.1109/TPAMI.2018.2863285. Epub 2018 Aug 13.

DOI: 10.1109/TPAMI.2018.2863285
PMID: 30106706

Abstract

Recent data-driven approaches to scene interpretation predominantly pose inference as an end-to-end black-box mapping, commonly performed by a Convolutional Neural Network (CNN). However, decades of work on perceptual organization in both human and machine vision suggest that there are often intermediate representations that are intrinsic to an inference task, and which provide essential structure to improve generalization. In this work, we explore an approach for injecting prior domain structure into neural network training by supervising hidden layers of a CNN with intermediate concepts that normally are not observed in practice. We formulate a probabilistic framework which formalizes these notions and predicts improved generalization via this deep supervision method. One advantage of this approach is that we are able to train only from synthetic CAD renderings of cluttered scenes, where concept values can be extracted, but apply the results to real images. Our implementation achieves the state-of-the-art performance of 2D/3D keypoint localization and image classification on real image benchmarks including KITTI, PASCAL VOC, PASCAL3D+, IKEA, and CIFAR100. We provide additional evidence that our approach outperforms alternative forms of supervision, such as multi-task networks.
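
The core idea in the abstract, supervising hidden layers with intermediate concepts in addition to the final output, can be illustrated with a combined training loss. The following is a minimal sketch under stated assumptions, not the authors' implementation: the mean squared error, the per-layer weights, and the names `squared_error` and `deep_supervision_loss` are hypothetical choices made for illustration, and plain Python lists stand in for network outputs.

```python
def squared_error(pred, target):
    """Mean squared error between two equal-length lists of floats."""
    if len(pred) != len(target):
        raise ValueError("prediction and target must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def deep_supervision_loss(concept_preds, concept_targets, weights):
    """Weighted sum of per-layer losses.

    concept_preds[i] is the (hypothetical) prediction read out from the
    i-th supervised layer, ordered from shallow to deep; the last entry
    is the final task output. concept_targets[i] is the ground-truth
    value of the matching concept, which the abstract says can be
    extracted from synthetic CAD renderings. weights[i] is an assumed
    hyperparameter scaling each term.
    """
    if not (len(concept_preds) == len(concept_targets) == len(weights)):
        raise ValueError("need one target and one weight per supervised layer")
    return sum(
        w * squared_error(p, t)
        for p, t, w in zip(concept_preds, concept_targets, weights)
    )


# Two intermediate concepts supervised at hidden layers, plus the final output.
preds = [[1.0, 0.0], [0.5, 0.5], [2.0]]
targets = [[1.0, 1.0], [0.5, 0.5], [1.0]]
weights = [0.5, 0.5, 1.0]
total = deep_supervision_loss(preds, targets, weights)
```

In an actual CNN each term would be attached to a different depth of the network, so gradients from the intermediate concepts shape the earlier layers directly rather than only through the final output.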


Similar articles

1
Deep Supervision with Intermediate Concepts.
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1828-1843. doi: 10.1109/TPAMI.2018.2863285. Epub 2018 Aug 13.
2
Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):956-970. doi: 10.1109/TPAMI.2018.2827052. Epub 2018 Apr 16.
3
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.
4
Hierarchical Scene Parsing by Weakly Supervised Learning with Image Descriptions.
IEEE Trans Pattern Anal Mach Intell. 2019 Mar;41(3):596-610. doi: 10.1109/TPAMI.2018.2799846. Epub 2018 Jan 30.
5
Multi-view and 3D deformable part models.
IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2232-45. doi: 10.1109/TPAMI.2015.2408347.
6
WHSP-Net: A Weakly-Supervised Approach for 3D Hand Shape and Pose Recovery from a Single Depth Image.
Sensors (Basel). 2019 Aug 31;19(17):3784. doi: 10.3390/s19173784.
7
Deep Visual Attention Prediction.
IEEE Trans Image Process. 2018 May;27(5):2368-2378. doi: 10.1109/TIP.2017.2787612. Epub 2017 Dec 27.
8
A novel biomedical image indexing and retrieval system via deep preference learning.
Comput Methods Programs Biomed. 2018 May;158:53-69. doi: 10.1016/j.cmpb.2018.02.003. Epub 2018 Feb 6.
9
A novel retinal vessel detection approach based on multiple deep convolution neural networks.
Comput Methods Programs Biomed. 2018 Dec;167:43-48. doi: 10.1016/j.cmpb.2018.10.021. Epub 2018 Oct 30.
10
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2024-39. doi: 10.1109/TPAMI.2015.2505283. Epub 2015 Dec 3.

Cited by

1
A Deep Learning-based Pipeline for Segmenting the Cerebral Cortex Laminar Structure in Histology Images.
Neuroinformatics. 2024 Oct;22(4):745-761. doi: 10.1007/s12021-024-09688-0. Epub 2024 Oct 17.
2
Generative Adversarial Networks With Radiomics Supervision for Lung Lesion Generation.
IEEE Trans Biomed Eng. 2025 Jan;72(1):286-296. doi: 10.1109/TBME.2024.3451409. Epub 2025 Jan 15.
3
Progressive DeepSSM: Training Methodology for Image-To-Shape Deep Models.
Shape Med Imaging (2023). 2023 Oct;14350:157-172. doi: 10.1007/978-3-031-46914-5_13. Epub 2023 Oct 31.
4
A Continuous Non-Invasive Blood Pressure Prediction Method Based on Deep Sparse Residual U-Net Combined with Improved Squeeze and Excitation Skip Connections.
Sensors (Basel). 2024 Apr 24;24(9):2721. doi: 10.3390/s24092721.
5
SM-SegNet: A Lightweight Squeeze M-SegNet for Tissue Segmentation in Brain MRI Scans.
Sensors (Basel). 2022 Jul 8;22(14):5148. doi: 10.3390/s22145148.
6
On Interpretability of Artificial Neural Networks: A Survey.
IEEE Trans Radiat Plasma Med Sci. 2021 Nov;5(6):741-760. doi: 10.1109/trpms.2021.3066428. Epub 2021 Mar 17.
7
Deep 3D attention CLSTM U-Net based automated liver segmentation and volumetry for the liver transplantation in abdominal CT volumes.
Sci Rep. 2022 Apr 16;12(1):6370. doi: 10.1038/s41598-022-09978-0.
8
Generative Adversarial Networks and Radiomics Supervision for Lung Lesion Synthesis.
Proc SPIE Int Soc Opt Eng. 2021 Feb;11595. doi: 10.1117/12.2582151. Epub 2021 Feb 15.
9
Solubility Prediction from Molecular Properties and Analytical Data Using an In-phase Deep Neural Network (Ip-DNN).
ACS Omega. 2021 May 17;6(22):14278-14287. doi: 10.1021/acsomega.1c01035. eCollection 2021 Jun 8.
10
A fast and fully-automated deep-learning approach for accurate hemorrhage segmentation and volume quantification in non-contrast whole-head CT.
Sci Rep. 2020 Nov 9;10(1):19389. doi: 10.1038/s41598-020-76459-7.