HD-MTL：用于大规模视觉识别的分层深度多任务学习

HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition.

作者信息

Fan Jianping, Zhao Tianyi, Kuang Zhenzhong, Zheng Yu, Zhang Ji, Yu Jun, Peng Jinye

出版信息

IEEE Trans Image Process. 2017 Apr;26(4):1923-1938. doi: 10.1109/TIP.2017.2667405. Epub 2017 Feb 9.

DOI:10.1109/TIP.2017.2667405

PMID:28207396

Abstract

In this paper, a hierarchical deep multi-task learning (HD-MTL) algorithm is developed to support large-scale visual recognition (e.g., recognizing thousands or even tens of thousands of atomic object classes automatically). First, multiple sets of multi-level deep features are extracted from different layers of deep convolutional neural networks (deep CNNs), and they are used to achieve more effective accomplishment of the coarseto- fine tasks for hierarchical visual recognition. A visual tree is then learned by assigning the visually-similar atomic object classes with similar learning complexities into the same group, which can provide a good environment for determining the interrelated learning tasks automatically. By leveraging the inter-task relatedness (inter-class similarities) to learn more discriminative group-specific deep representations, our deep multi-task learning algorithm can train more discriminative node classifiers for distinguishing the visually-similar atomic object classes effectively. Our hierarchical deep multi-task learning (HD-MTL) algorithm can integrate two discriminative regularization terms to control the inter-level error propagation effectively, and it can provide an end-to-end approach for jointly learning more representative deep CNNs (for image representation) and more discriminative tree classifier (for large-scale visual recognition) and updating them simultaneously. Our incremental deep learning algorithms can effectively adapt both the deep CNNs and the tree classifier to the new training images and the new object classes. Our experimental results have demonstrated that our HD-MTL algorithm can achieve very competitive results on improving the accuracy rates for large-scale visual recognition.

摘要

在本文中，我们开发了一种分层深度多任务学习（HD-MTL）算法，以支持大规模视觉识别（例如，自动识别数千甚至数万种原子对象类别）。首先，从深度卷积神经网络（深度CNN）的不同层中提取多组多级深度特征，并将它们用于更有效地完成分层视觉识别的从粗到细任务。然后，通过将具有相似学习复杂度的视觉相似原子对象类别分配到同一组中来学习视觉树，这可以为自动确定相关学习任务提供良好的环境。通过利用任务间相关性（类间相似性）来学习更具判别力的特定组深度表示，我们的深度多任务学习算法可以训练更具判别力的节点分类器，以有效地区分视觉相似的原子对象类别。我们的分层深度多任务学习（HD-MTL）算法可以集成两个判别正则化项，以有效地控制层间误差传播，并且它可以提供一种端到端的方法，用于联合学习更具代表性的深度CNN（用于图像表示）和更具判别力的树分类器（用于大规模视觉识别）并同时更新它们。我们的增量深度学习算法可以有效地使深度CNN和树分类器适应新的训练图像和新的对象类别。我们的实验结果表明，我们的HD-MTL算法在提高大规模视觉识别准确率方面可以取得非常有竞争力的结果。

相似文献

HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition.HD-MTL：用于大规模视觉识别的分层深度多任务学习

IEEE Trans Image Process. 2017 Apr;26(4):1923-1938. doi: 10.1109/TIP.2017.2667405. Epub 2017 Feb 9.

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition.用于大规模视觉识别的深度多样专家混合模型

IEEE Trans Pattern Anal Mach Intell. 2019 May;41(5):1072-1087. doi: 10.1109/TPAMI.2018.2828821. Epub 2018 Apr 20.

Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition.利用深度网络嵌入视觉层次结构进行大规模视觉识别。

IEEE Trans Image Process. 2018 Jun 7. doi: 10.1109/TIP.2018.2845118.

Hierarchical Learning of Tree Classifiers for Large-Scale Plant Species Identification.基于树分类器的层次学习方法在大规模植物物种识别中的应用。

IEEE Trans Image Process. 2015 Nov;24(11):4172-84. doi: 10.1109/TIP.2015.2457337.

Discriminative Fast Hierarchical Learning for Multiclass Image Classification.用于多类图像分类的判别式快速分层学习

IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2779-2790. doi: 10.1109/TNNLS.2019.2948881. Epub 2019 Nov 20.

IEEE Trans Image Process. 2019 Sep 5. doi: 10.1109/TIP.2019.2938321.

A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。

Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.

Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition.多尺度多视角深度特征聚合的食物识别方法。

IEEE Trans Image Process. 2020;29:265-276. doi: 10.1109/TIP.2019.2929447. Epub 2019 Jul 29.

Automated Melanoma Recognition in Dermoscopy Images via Very Deep Residual Networks.基于深度残差网络的皮肤镜图像中黑色素瘤的自动识别。

IEEE Trans Med Imaging. 2017 Apr;36(4):994-1004. doi: 10.1109/TMI.2016.2642839. Epub 2016 Dec 21.

Multi-Instance Deep Learning: Discover Discriminative Local Anatomies for Bodypart Recognition.多实例深度学习：发现身体部位识别的有判别力的局部解剖结构。

IEEE Trans Med Imaging. 2016 May;35(5):1332-1343. doi: 10.1109/TMI.2016.2524985. Epub 2016 Feb 3.

引用本文的文献

Fault Diagnosis of Lithium Battery Modules via Symmetrized Dot Pattern and Convolutional Neural Networks.基于对称点模式和卷积神经网络的锂电池模块故障诊断

Sensors (Basel). 2024 Dec 27;25(1):94. doi: 10.3390/s25010094.

An Overview of Organs-on-Chips Based on Deep Learning.基于深度学习的器官芯片概述。

Research (Wash D C). 2022 Jan 19;2022:9869518. doi: 10.34133/2022/9869518. eCollection 2022.

Modulation Recognition of Radar Signals Based on Adaptive Singular Value Reconstruction and Deep Residual Learning.基于自适应奇异值重构和深度残差学习的雷达信号调制识别

Sensors (Basel). 2021 Jan 10;21(2):449. doi: 10.3390/s21020449.

Multidataset Independent Subspace Analysis With Application to Multimodal Fusion.多数据集独立子空间分析及其在多模态融合中的应用。

IEEE Trans Image Process. 2021;30:588-602. doi: 10.1109/TIP.2020.3028452. Epub 2020 Nov 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

HD-MTL：用于大规模视觉识别的分层深度多任务学习

HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献