基于部分标记数据学习的统一概率框架。

A Unifying Probabilistic Framework for Partially Labeled Data Learning.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8036-8048. doi: 10.1109/TPAMI.2022.3228755. Epub 2023 Jun 5.

DOI:10.1109/TPAMI.2022.3228755

Abstract

Partially labeled data learning (PLDL), including partial label learning (PLL) and partial multi-label learning (PML), has been widely used in nowadays data science. Researchers attempt to construct different specific models to deal with the different classification tasks for PLL and PML scenarios respectively. The main challenge in training classifiers for PLL and PML is how to deal with ambiguities caused by the noisy false-positive labels in the candidate label set. The state-of-the-art strategy for both scenarios is to perform disambiguation by identifying the ground-truth label(s) directly from the candidate label set, which can be summarized into two categories: 'the identifying method' and 'the embedding method'. However, both kinds of methods are constructed by hand-designed heuristic modeling under considerations like feature/label correlations with no theoretical interpretation. Instead of adopting heuristic or specific modeling, we propose a novel unifying framework called A Unifying Probabilistic Framework for Partially Labeled Data Learning (UPF-PLDL), which is derived from a clear probabilistic formulation, and brings existing research on PLL and PML under one theoretical interpretation with respect to information theory. Furthermore, the proposed UPF-PLDL also unifies 'the identifying method' and 'the embedding method' into one integrated framework, which naturally incorporates the feature and label correlation considerations. Comprehensive experiments on synthetic and real-world datasets for both PLL and PML scenarios clearly demonstrate the superiorities of the derived framework.

摘要

部分标记数据学习（PLDL），包括部分标签学习（PLL）和部分多标签学习（PML），已在当今的数据科学中得到广泛应用。研究人员尝试构建不同的特定模型，分别用于 PLL 和 PML 场景的不同分类任务。在 PLL 和 PML 场景中训练分类器的主要挑战是如何处理候选标签集中由嘈杂的假阳性标签引起的歧义。这两种情况的最新策略是通过直接从候选标签集中识别真实标签来进行去歧义，这可以总结为两类：“识别方法”和“嵌入方法”。然而，这两种方法都是基于特征/标签相关性等考虑因素，通过手工设计的启发式建模来构建的，没有理论解释。我们没有采用启发式或特定的建模方法，而是提出了一个称为 A 统一概率框架的新的统一框架，用于部分标记数据学习（UPF-PLDL），该框架源于清晰的概率公式，并将 PLL 和 PML 的现有研究纳入一个理论解释中，与信息论有关。此外，所提出的 UPF-PLDL 还将“识别方法”和“嵌入方法”统一到一个集成框架中，自然地考虑了特征和标签相关性。在 PLL 和 PML 场景的合成和真实数据集上进行的综合实验清楚地证明了该框架的优越性。

相似文献

A Unifying Probabilistic Framework for Partially Labeled Data Learning.基于部分标记数据学习的统一概率框架。

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8036-8048. doi: 10.1109/TPAMI.2022.3228755. Epub 2023 Jun 5.

Discriminative Metric Learning for Partial Label Learning.用于部分标签学习的判别度量学习

IEEE Trans Neural Netw Learn Syst. 2023 Aug;34(8):4428-4439. doi: 10.1109/TNNLS.2021.3118362. Epub 2023 Aug 4.

Partial Multi-Label Learning With Noisy Label Identification.基于噪声标签识别的部分多标签学习

IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3676-3687. doi: 10.1109/TPAMI.2021.3059290. Epub 2022 Jun 3.

Top-k Partial Label Machine.Top-k 部分标签机

IEEE Trans Neural Netw Learn Syst. 2021 Jun 4;PP. doi: 10.1109/TNNLS.2021.3083397.

Partial Multilabel Learning Using Noise-Tolerant Broad Learning System With Label Enhancement and Dimensionality Reduction.基于标签增强和降维的抗噪声广义学习系统的部分多标签学习

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3758-3772. doi: 10.1109/TNNLS.2024.3352285. Epub 2025 Feb 6.

Partial label learning: Taxonomy, analysis and outlook.部分标签学习：分类、分析与展望。

Neural Netw. 2023 Apr;161:708-734. doi: 10.1016/j.neunet.2023.02.019. Epub 2023 Feb 16.

PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning.PiCO+：用于稳健部分标签学习的对比标签消歧

IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3183-3198. doi: 10.1109/TPAMI.2023.3342650. Epub 2024 Apr 3.

Large Margin Partial Label Machine.大间隔部分标签机

IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2594-2608. doi: 10.1109/TNNLS.2019.2933530. Epub 2019 Sep 6.

Prior Knowledge Regularized Self-Representation Model for Partial Multilabel Learning.基于先验知识正则化自表示模型的部分多标签学习。

IEEE Trans Cybern. 2023 Mar;53(3):1618-1628. doi: 10.1109/TCYB.2021.3107422. Epub 2023 Feb 15.

Towards Enabling Binary Decomposition for Partial Multi-Label Learning.迈向实现部分多标签学习的二元分解

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13203-13217. doi: 10.1109/TPAMI.2023.3290797. Epub 2023 Oct 3.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于部分标记数据学习的统一概率框架。

A Unifying Probabilistic Framework for Partially Labeled Data Learning.

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献