• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动分布式主动学习的统一框架。

A Unified Framework for Automatic Distributed Active Learning.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9774-9786. doi: 10.1109/TPAMI.2021.3129793. Epub 2022 Nov 7.

DOI:10.1109/TPAMI.2021.3129793
PMID:34813465
Abstract

We propose a novel unified frameork for automated distributed active learning (AutoDAL) to address multiple challenging problems in active learning such as limited labeled data, imbalanced datasets, automatic hyperparameter selection as well as scalability to big data. First, automated graph-based semi-supervised learning is conducted by aggregating the proposed cost functions from different compute nodes and jointly optimizing hyperparameters in both the classification and query selection stages. For dense datasets, clustering-based uncertainty sampling with maximum entropy (CME) loss is applied in the optimization. For sparse and imbalanced datasets, shrinkage optimized KL-divergence regularization and local selection based active learning (SOAR) loss are further naturally adapted in AutoDAL. The optimization is efficiently resolved by iteratively executing a genetic algorithm (GA) refined with a local generating set search (GSS) and solving an integer linear programming (ILP) problem. Moreover, we propose an efficient distributed active learning algorithm which is scalable for big data. The proposed AutoDAL algorithm is applied to multiple benchmark datasets and two real-world datasets including an electrocardiogram (ECG) dataset and a credit fraud detection dataset for classification. We demonstrate that the proposed AutoDAL algorithm is capable of achieving significantly better performance compared to several state-of-the-art AutoML approaches and active learning algorithms.

摘要

我们提出了一个新颖的统一框架,用于自动化分布式主动学习(AutoDAL),以解决主动学习中的多个挑战性问题,如有限的标记数据、不平衡数据集、自动超参数选择以及大数据的可扩展性。首先,通过聚合来自不同计算节点的建议成本函数,并在分类和查询选择阶段联合优化超参数,进行自动化基于图的半监督学习。对于密集数据集,在优化中应用基于聚类的不确定性抽样最大熵(CME)损失。对于稀疏和不平衡数据集,AutoDAL 中进一步自然适应了收缩优化 KL 散度正则化和基于局部选择的主动学习(SOAR)损失。通过迭代执行遗传算法(GA)并使用局部生成集搜索(GSS)进行精炼以及解决整数线性规划(ILP)问题,有效地解决了优化问题。此外,我们提出了一种高效的分布式主动学习算法,适用于大数据。将所提出的 AutoDAL 算法应用于多个基准数据集和两个真实世界数据集,包括心电图(ECG)数据集和信用欺诈检测数据集,用于分类。我们证明,与几种最先进的 AutoML 方法和主动学习算法相比,所提出的 AutoDAL 算法能够实现显著更好的性能。

相似文献

1
A Unified Framework for Automatic Distributed Active Learning.自动分布式主动学习的统一框架。
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9774-9786. doi: 10.1109/TPAMI.2021.3129793. Epub 2022 Nov 7.
2
Neurodynamics-driven holistic approaches to semi-supervised feature selection.基于神经动力学的半监督特征选择整体方法。
Neural Netw. 2023 Jan;157:377-386. doi: 10.1016/j.neunet.2022.10.029. Epub 2022 Nov 3.
3
Self-Supervised Node Classification with Strategy and Actively Selected Labeled Set.基于策略和主动选择标记集的自监督节点分类
Entropy (Basel). 2022 Dec 23;25(1):30. doi: 10.3390/e25010030.
4
Flexible multi-view semi-supervised learning with unified graph.基于统一图的灵活多视图半监督学习。
Neural Netw. 2021 Oct;142:92-104. doi: 10.1016/j.neunet.2021.04.033. Epub 2021 May 5.
5
Semi-supervised learning via regularized boosting working on multiple semi-supervised assumptions.基于多种半监督假设的正则化提升的半监督学习。
IEEE Trans Pattern Anal Mach Intell. 2011 Jan;33(1):129-43. doi: 10.1109/TPAMI.2010.92.
6
Handling Imbalanced Data: Uncertainty-Guided Virtual Adversarial Training With Batch Nuclear-Norm Optimization for Semi-Supervised Medical Image Classification.处理不平衡数据:半监督医学图像分类中的不确定性引导的虚拟对抗训练与批量核范数优化。
IEEE J Biomed Health Inform. 2022 Jul;26(7):2983-2994. doi: 10.1109/JBHI.2022.3162748. Epub 2022 Jul 1.
7
A distributed semi-supervised learning algorithm based on manifold regularization using wavelet neural network.基于流形正则化的小波神经网络的分布式半监督学习算法。
Neural Netw. 2019 Oct;118:300-309. doi: 10.1016/j.neunet.2018.10.014. Epub 2018 Nov 14.
8
Deep semi-supervised learning via dynamic anchor graph embedding in latent space.基于潜在空间动态锚图嵌入的深度半监督学习。
Neural Netw. 2022 Feb;146:350-360. doi: 10.1016/j.neunet.2021.11.026. Epub 2021 Dec 1.
9
Distribution-free Bayesian regularized learning framework for semi-supervised learning.用于半监督学习的无分布贝叶斯正则化学习框架。
Neural Netw. 2024 Jun;174:106262. doi: 10.1016/j.neunet.2024.106262. Epub 2024 Mar 20.
10
L2-norm multiple kernel learning and its application to biomedical data fusion.L2-范数多核学习及其在生物医学数据融合中的应用。
BMC Bioinformatics. 2010 Jun 8;11:309. doi: 10.1186/1471-2105-11-309.