IEEE Trans Cybern. 2022 Sep;52(9):9546-9558. doi: 10.1109/TCYB.2021.3059631. Epub 2022 Aug 18.
Hierarchical structures of labels usually exist in large-scale classification tasks, where labels can be organized into a tree-shaped structure. Nodes near the root represent coarser labels, while nodes close to the leaves represent finer ones. In hierarchical classification, we label an unseen sample from the root node toward a leaf node and obtain multigranularity predictions along the way. Sometimes, we cannot reach a leaf decision due to uncertainty or incomplete information. In such cases, we should stop at an internal node rather than pressing ahead rashly. However, most existing hierarchical classification models aim to maximize the percentage of correct predictions and do not take the risk of misclassification into account. This risk is critically important in some real-world applications and can be measured by the distance between the ground-truth and predicted classes in the class hierarchy. In this work, we utilize the semantic hierarchy to define the classification risk and design an optimization technique to reduce it. By defining the conservative risk and the precipitant risk as two competing risk factors, we construct the balanced conservative/precipitant semantic (BCPS) risk matrix across all nodes in the semantic hierarchy, with user-defined weights to adjust the tradeoff between the two kinds of risk. We then model the classification process on the semantic hierarchy as a sequential decision-making task and design an algorithm to derive risk-minimized predictions. The model has two modules: 1) multitask hierarchical learning and 2) deep reinforced multigranularity learning. The first learns classification confidence scores at multiple levels. These scores are then fed into deep reinforced multigranularity learning to obtain a global risk-minimized prediction with flexible granularity. Experimental results show that the proposed model outperforms state-of-the-art methods on seven large-scale classification datasets with semantic trees.
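To make the risk tradeoff concrete, below is a minimal sketch of how a BCPS-style risk could be computed and minimized on a toy label tree. The parent map, the bcps_risk function, the lam weight, and the expected-risk decision rule are illustrative assumptions chosen for intuition; they are not the paper's published matrix construction or its reinforcement-learning decision process.

# Illustrative sketch only: distance definitions, the lam tradeoff, and the
# expected-risk rule below are assumptions, not the authors' exact method.

# A toy label tree given as a parent map over node ids (0 is the root):
#        0
#      /   \
#     1     2
#    / \   / \
#   3   4 5   6
PARENT = {1: 0, 2: 0, 3: 1, 4: 1, 5: 2, 6: 2}

def ancestors(n):
    """Path from node n up to the root, inclusive of both endpoints."""
    path = [n]
    while n in PARENT:
        n = PARENT[n]
        path.append(n)
    return path

def tree_distance(a, b):
    """Number of edges between two nodes in the label tree."""
    pa, pb = ancestors(a), ancestors(b)
    common = set(pa) & set(pb)
    lca = next(x for x in pa if x in common)  # lowest common ancestor
    return pa.index(lca) + pb.index(lca)

def bcps_risk(pred, truth, lam=0.5):
    """Hypothetical BCPS-style risk. If pred is an ancestor of the truth,
    the error is 'conservative' (we stopped too early on the correct path);
    otherwise it is 'precipitant' (we committed to a wrong branch).
    lam in [0, 1] is a user-defined weight trading off the two risks."""
    if pred in ancestors(truth):
        return lam * tree_distance(pred, truth)
    return (1.0 - lam) * tree_distance(pred, truth)

def risk_minimized_node(posterior, lam=0.5):
    """Pick the node with the smallest expected risk, given per-node
    confidence scores (a dict mapping node -> probability)."""
    nodes = sorted(set(PARENT) | set(PARENT.values()))
    def expected_risk(pred):
        return sum(p * bcps_risk(pred, truth, lam)
                   for truth, p in posterior.items())
    return min(nodes, key=expected_risk)

# Example: the classifier is torn between leaves 3 and 5 in different branches.
posterior = {3: 0.45, 5: 0.45, 4: 0.05, 6: 0.05}
for lam in (0.2, 0.9):
    print(lam, risk_minimized_node(posterior, lam))

With a small lam, stopping at a correct ancestor is cheap relative to committing to a wrong branch, so the ambiguous posterior above keeps the prediction at the root; raising lam penalizes conservatism more heavily and pushes the decision down to a leaf. This mirrors, in miniature, the flexible-granularity behavior the abstract describes.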