• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多类别AUC学习:理论与算法

Learning With Multiclass AUC: Theory and Algorithms.

作者信息

Yang Zhiyong, Xu Qianqian, Bao Shilong, Cao Xiaochun, Huang Qingming

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7747-7763. doi: 10.1109/TPAMI.2021.3101125. Epub 2022 Oct 4.

DOI:10.1109/TPAMI.2021.3101125
PMID:34329155
Abstract

The Area under the ROC curve (AUC) is a well-known ranking metric for problems such as imbalanced learning and recommender systems. The vast majority of existing AUC-optimization-based machine learning methods only focus on binary-class cases, while leaving the multiclass cases unconsidered. In this paper, we start an early trial to consider the problem of learning multiclass scoring functions via optimizing multiclass AUC metrics. Our foundation is based on the M metric, which is a well-known multiclass extension of AUC. We first pay a revisit to this metric, showing that it could eliminate the imbalance issue from the minority class pairs. Motivated by this, we propose an empirical surrogate risk minimization framework to approximately optimize the M metric. Theoretically, we show that: (i) optimizing most of the popular differentiable surrogate losses suffices to reach the Bayes optimal scoring function asymptotically; (ii) the training framework enjoys an imbalance-aware generalization error bound, which pays more attention to the bottleneck samples of minority classes compared with the traditional O(√{1/N}) result. Practically, to deal with the low scalability of the computational operations, we propose acceleration methods for three popular surrogate loss functions, including the exponential loss, squared loss, and hinge loss, to speed up loss and gradient evaluations. Finally, experimental results on 11 real-world datasets demonstrate the effectiveness of our proposed framework. The code is now available at https://github.com/joshuaas/Learning-with-Multiclass-AUC-Theory-and-Algorithms.

摘要

ROC曲线下面积(AUC)是用于诸如不平衡学习和推荐系统等问题的一种知名排序指标。绝大多数现有的基于AUC优化的机器学习方法仅关注二分类情况,而未考虑多分类情况。在本文中,我们开启了一项早期尝试,通过优化多分类AUC指标来考虑学习多分类评分函数的问题。我们的基础是基于M指标,它是AUC的一种知名多分类扩展。我们首先重新审视了这个指标,表明它可以消除少数类对中的不平衡问题。受此启发,我们提出了一个经验替代风险最小化框架来近似优化M指标。从理论上讲,我们证明了:(i)优化大多数流行的可微替代损失足以渐近地达到贝叶斯最优评分函数;(ii)训练框架具有不平衡感知泛化误差界,与传统的O(√{1/N})结果相比,它更关注少数类的瓶颈样本。在实践中,为了处理计算操作的低可扩展性,我们针对三种流行的替代损失函数提出了加速方法,包括指数损失、平方损失和铰链损失,以加快损失和梯度评估。最后,在11个真实世界数据集上的实验结果证明了我们提出的框架的有效性。代码现已在https://github.com/joshuaas/Learning-with-Multiclass-AUC-Theory-and-Algorithms上获取。

相似文献

1
Learning With Multiclass AUC: Theory and Algorithms.多类别AUC学习:理论与算法
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7747-7763. doi: 10.1109/TPAMI.2021.3101125. Epub 2022 Oct 4.
2
To Combat Multiclass Imbalanced Problems by Aggregating Evolutionary Hierarchical Classifiers.通过聚合进化层次分类器来对抗多类不平衡问题。
IEEE Trans Neural Netw Learn Syst. 2024 Apr 8;PP. doi: 10.1109/TNNLS.2024.3383672.
3
Optimizing Two-Way Partial AUC With an End-to-End Framework.用端到端框架优化双向部分 AUC。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):10228-10246. doi: 10.1109/TPAMI.2022.3185311. Epub 2023 Jun 30.
4
Parameter-Free Loss for Class-Imbalanced Deep Learning in Image Classification.无参数损失的图像分类中深度不平衡学习。
IEEE Trans Neural Netw Learn Syst. 2023 Jun;34(6):3234-3240. doi: 10.1109/TNNLS.2021.3110885. Epub 2023 Jun 1.
5
mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework.mCRF和mRD:基于新型多类标签噪声过滤学习框架的两种分类方法。
IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):2916-2930. doi: 10.1109/TNNLS.2020.3047046. Epub 2022 Jul 6.
6
Optimizing Partial Area Under the Top-k Curve: Theory and Practice.
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):5053-5069. doi: 10.1109/TPAMI.2022.3199970. Epub 2023 Mar 7.
7
Class imbalance should not throw you off balance: Choosing the right classifiers and performance metrics for brain decoding with imbalanced data.不要被类别不平衡问题困扰:选择合适的分类器和性能指标,对不平衡数据进行脑解码。
Neuroimage. 2023 Aug 15;277:120253. doi: 10.1016/j.neuroimage.2023.120253. Epub 2023 Jun 28.
8
Multiclass Imbalance Problems: Analysis and Potential Solutions.多类不平衡问题:分析与潜在解决方案
IEEE Trans Syst Man Cybern B Cybern. 2012 Aug;42(4):1119-30. doi: 10.1109/TSMCB.2012.2187280. Epub 2012 Mar 16.
9
A calibrated multiclass extension of AdaBoost.一种经过校准的AdaBoost多类扩展。
Stat Appl Genet Mol Biol. 2011 Nov 20;10(1):/j/sagmb.2011.10.issue-1/1544-6115.1731/1544-6115.1731.xml. doi: 10.2202/1544-6115.1731.
10
Radial-Based Oversampling for Multiclass Imbalanced Data Classification.基于径向基的多类不平衡数据分类过采样方法
IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2818-2831. doi: 10.1109/TNNLS.2019.2913673. Epub 2019 Jun 21.

引用本文的文献

1
KDDC: a new framework that integrates kmers, dataset filtering, dimension reduction and classification algorithms to achieve immune cell heterogeneity classification.KDDC:一种整合k-mer、数据集过滤、降维和分类算法以实现免疫细胞异质性分类的新框架。
Front Immunol. 2025 May 30;16:1602907. doi: 10.3389/fimmu.2025.1602907. eCollection 2025.
2
Advanced prognostic modeling with deep learning: assessing long-term outcomes in liver transplant recipients from deceased and living donors.基于深度学习的高级预后模型:评估已故和活体供肝肝移植受者的长期预后
J Transl Med. 2025 Feb 16;23(1):188. doi: 10.1186/s12967-025-06183-1.
3
Forecasting West Nile Virus With Graph Neural Networks: Harnessing Spatial Dependence in Irregularly Sampled Geospatial Data.
利用图神经网络预测西尼罗河病毒:利用不规则采样地理空间数据中的空间依赖性
Geohealth. 2024 Jul 3;8(7):e2023GH000784. doi: 10.1029/2023GH000784. eCollection 2024 Jul.
4
Comparing multi-class classifier performance by multi-class ROC analysis: A nonparametric approach.通过多类ROC分析比较多类分类器性能:一种非参数方法。
Neurocomputing (Amst). 2024 May 28;583. doi: 10.1016/j.neucom.2024.127520. Epub 2024 Mar 6.
5
Genistin: A Novel Estrogen Analogue Targeting ERβ to Alleviate Thrombocytopenia.染料木苷:一种靶向雌激素受体β以缓解血小板减少症的新型雌激素类似物。
Int J Biol Sci. 2024 Mar 25;20(6):2236-2260. doi: 10.7150/ijbs.90483. eCollection 2024.
6
A hierarchical multilabel graph attention network method to predict the deterioration paths of chronic hepatitis B patients.一种用于预测慢性乙型肝炎患者恶化路径的分层多标签图注意网络方法。
J Am Med Inform Assoc. 2023 Apr 19;30(5):846-858. doi: 10.1093/jamia/ocad008.
7
Cell senescence-associated genes predict the malignant characteristics of glioblastoma.细胞衰老相关基因可预测胶质母细胞瘤的恶性特征。
Cancer Cell Int. 2022 Dec 16;22(1):411. doi: 10.1186/s12935-022-02834-1.