Neyman-Pearson Multi-class Classification via Cost-sensitive Learning

Authors

Tian Ye, Feng Yang

Affiliations

Department of Statistics, Columbia University.

Department of Biostatistics, School of Global Public Health, New York University.

Publication information

J Am Stat Assoc. 2025;120(550):1164-1177. doi: 10.1080/01621459.2024.2402567. Epub 2024 Nov 19.

DOI: 10.1080/01621459.2024.2402567
PMID: 40689012
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12268361/
Abstract

Most existing classification methods aim to minimize the overall misclassification error rate. However, in applications such as loan default prediction, different types of errors can have varying consequences. To address this asymmetry issue, two popular paradigms have been developed: the Neyman-Pearson (NP) paradigm and the cost-sensitive (CS) paradigm. Previous studies on the NP paradigm have primarily focused on the binary case, while the multi-class NP problem poses a greater challenge due to its unknown feasibility. In this work, we tackle the multi-class NP problem by establishing a connection with the CS problem via strong duality and propose two algorithms. We extend the concept of NP oracle inequalities, crucial in binary classifications, to NP oracle properties in the multi-class context. Our algorithms satisfy these NP oracle properties under certain conditions. Furthermore, we develop practical algorithms to assess the feasibility and strong duality in multi-class NP problems, which can offer practitioners the landscape of a multi-class NP problem with various target error levels. Simulations and real data studies validate the effectiveness of our algorithms. To our knowledge, this is the first study to address the multi-class NP problem with theoretical guarantees. The proposed algorithms have been implemented in the R package npcs, which is available on CRAN.
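
The link between the NP and CS paradigms described in the abstract can be illustrated with a small sketch. The abstract does not state the classifier's form, so this example assumes a cost-sensitive rule that predicts the class maximizing a per-class weight times an estimated posterior probability. The function names, the toy posteriors, and the hand-picked weights are all hypothetical; this is not the npcs API, and the paper's algorithms choose the weights through the dual problem rather than by hand.

```python
import numpy as np

def per_class_error(y_true, y_pred, n_classes):
    """Empirical estimate of P(prediction != k | Y = k) for each class k."""
    return np.array([np.mean(y_pred[y_true == k] != k) for k in range(n_classes)])

def cost_sensitive_predict(posterior, class_weight):
    """Assumed CS rule: argmax over k of class_weight[k] * posterior[:, k]."""
    return np.argmax(posterior * class_weight, axis=1)

def meets_targets(errors, alpha):
    """True if every per-class error is within its target (np.inf = unconstrained)."""
    return bool(np.all(errors <= alpha))

# Toy estimated posteriors for 6 observations and 3 classes (made-up numbers).
posterior = np.array([
    [0.5, 0.3, 0.2],
    [0.4, 0.5, 0.1],
    [0.2, 0.6, 0.2],
    [0.3, 0.4, 0.3],
    [0.1, 0.3, 0.6],
    [0.3, 0.3, 0.4],
])
y_true = np.array([0, 0, 1, 1, 2, 2])

# NP-style target: class-0 error at most 0.25, other classes unconstrained.
alpha = np.array([0.25, np.inf, np.inf])

# Equal weights reduce to the usual argmax-posterior classifier.
plain_pred = cost_sensitive_predict(posterior, np.array([1.0, 1.0, 1.0]))
plain_err = per_class_error(y_true, plain_pred, 3)

# Up-weighting class 0 lowers its error at the expense of the other classes.
cs_pred = cost_sensitive_predict(posterior, np.array([1.5, 1.0, 1.0]))
cs_err = per_class_error(y_true, cs_pred, 3)

print("equal weights:", plain_err, "meets targets:", meets_targets(plain_err, alpha))
print("class 0 up-weighted:", cs_err, "meets targets:", meets_targets(cs_err, alpha))
```

On this toy data the equal-weight rule misses the class-0 target, while the up-weighted rule meets it by accepting more errors on classes 1 and 2. That trade-off is why the weights have to be tuned rather than fixed in advance, and why some combinations of target levels may not be achievable at all.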


