• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

正例无标注学习的高效训练

Efficient Training for Positive Unlabeled Learning.

作者信息

Sansone Emanuele, De Natale Francesco G B, Zhou Zhi-Hua

出版信息

IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2584-2598. doi: 10.1109/TPAMI.2018.2860995. Epub 2018 Jul 30.

DOI:10.1109/TPAMI.2018.2860995
PMID:30059293
Abstract

Positive unlabeled (PU) learning is useful in various practical situations, where there is a need to learn a classifier for a class of interest from an unlabeled data set, which may contain anomalies as well as samples from unknown classes. The learning task can be formulated as an optimization problem under the framework of statistical learning theory. Recent studies have theoretically analyzed its properties and generalization performance, nevertheless, little effort has been made to consider the problem of scalability, especially when large sets of unlabeled data are available. In this work we propose a novel scalable PU learning algorithm that is theoretically proven to provide the optimal solution, while showing superior computational and memory performance. Experimental evaluation confirms the theoretical evidence and shows that the proposed method can be successfully applied to a large variety of real-world problems involving PU learning.

摘要

正无标记(PU)学习在各种实际情况中都很有用,在这些情况下,需要从未标记的数据集中学习针对感兴趣类别的分类器,该数据集可能包含异常以及来自未知类别的样本。学习任务可以在统计学习理论框架下被表述为一个优化问题。最近的研究从理论上分析了它的性质和泛化性能,然而,在考虑可扩展性问题上几乎没有做什么工作,特别是当有大量未标记数据可用时。在这项工作中,我们提出了一种新颖的可扩展PU学习算法,该算法在理论上被证明能提供最优解,同时展现出卓越的计算和内存性能。实验评估证实了理论依据,并表明所提出的方法可以成功应用于涉及PU学习的各种现实世界问题。

相似文献

1
Efficient Training for Positive Unlabeled Learning.正例无标注学习的高效训练
IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2584-2598. doi: 10.1109/TPAMI.2018.2860995. Epub 2018 Jul 30.
2
Large-Margin Label-Calibrated Support Vector Machines for Positive and Unlabeled Learning.用于正例和无标注学习的大间隔标签校准支持向量机
IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3471-3483. doi: 10.1109/TNNLS.2019.2892403. Epub 2019 Feb 6.
3
Loss Decomposition and Centroid Estimation for Positive and Unlabeled Learning.用于正例和无标签学习的损失分解与质心估计
IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):918-932. doi: 10.1109/TPAMI.2019.2941684. Epub 2021 Feb 4.
4
Convex formulation of multiple instance learning from positive and unlabeled bags.从正例和未标记袋中进行多示例学习的凸公式化。
Neural Netw. 2018 Sep;105:132-141. doi: 10.1016/j.neunet.2018.05.001. Epub 2018 May 24.
5
Positive-unlabeled learning in bioinformatics and computational biology: a brief review.生物信息学和计算生物学中的正无标记学习:简要综述。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab461.
6
Positive-unlabeled learning for disease gene identification.基于正例无标记学习的疾病基因识别。
Bioinformatics. 2012 Oct 15;28(20):2640-7. doi: 10.1093/bioinformatics/bts504. Epub 2012 Aug 24.
7
Semisupervised Multiclass Classification Problems With Scarcity of Labeled Data: A Theoretical Study.半监督多类分类问题与标签数据的稀缺:理论研究。
IEEE Trans Neural Netw Learn Syst. 2016 Dec;27(12):2602-2614. doi: 10.1109/TNNLS.2015.2498525. Epub 2015 Nov 23.
8
Positive-Unlabeled Learning With Label Distribution Alignment.基于标签分布对齐的正例-无标签学习
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15345-15363. doi: 10.1109/TPAMI.2023.3319431. Epub 2023 Nov 3.
9
SVRG-MKL: A Fast and Scalable Multiple Kernel Learning Solution for Features Combination in Multi-Class Classification Problems.SVRG-MKL:一种用于多类分类问题中特征组合的快速且可扩展的多核学习解决方案。
IEEE Trans Neural Netw Learn Syst. 2020 May;31(5):1710-1723. doi: 10.1109/TNNLS.2019.2922123. Epub 2019 Jul 4.
10
A Robust AUC Maximization Framework With Simultaneous Outlier Detection and Feature Selection for Positive-Unlabeled Classification.一种具有异常值检测和特征选择的稳健 AUC 最大化框架,用于正-未标记分类。
IEEE Trans Neural Netw Learn Syst. 2019 Oct;30(10):3072-3083. doi: 10.1109/TNNLS.2018.2870666. Epub 2018 Oct 9.

引用本文的文献

1
A recent survey on instance-dependent positive and unlabeled learning.一项关于实例依赖型正例和无标签学习的近期调查。
Fundam Res. 2022 Oct 12;5(2):796-803. doi: 10.1016/j.fmre.2022.09.019. eCollection 2025 Mar.
2
Positive-unlabeled learning identifies vaccine candidate antigens in the malaria parasite Plasmodium falciparum.正未标记学习可识别恶性疟原虫中的疫苗候选抗原。
NPJ Syst Biol Appl. 2024 Apr 27;10(1):44. doi: 10.1038/s41540-024-00365-1.
3
Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings.
通过使用深度蛋白质序列嵌入的暹罗神经网络对病毒-宿主蛋白质-蛋白质相互作用进行准确预测。
Patterns (N Y). 2022 Jul 31;3(9):100551. doi: 10.1016/j.patter.2022.100551. eCollection 2022 Sep 9.
4
Probing lncRNA-Protein Interactions: Data Repositories, Models, and Algorithms.探索长链非编码RNA-蛋白质相互作用:数据存储库、模型和算法
Front Genet. 2020 Jan 31;10:1346. doi: 10.3389/fgene.2019.01346. eCollection 2019.
5
Revealing Drug-Target Interactions with Computational Models and Algorithms.揭示药物-靶标相互作用的计算模型和算法。
Molecules. 2019 May 2;24(9):1714. doi: 10.3390/molecules24091714.