• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的多标签主动学习元框架。

A meta-framework for multi-label active learning based on deep reinforcement learning.

机构信息

College of Mathematics and Statistics, Shenzhen University, Shenzhen, 518060, China.

College of Mathematics and Statistics, Shenzhen University, Shenzhen, 518060, China; Shenzhen Key Lab. of Advanced Machine Learning and Applications, Shenzhen University, Shenzhen, 518060, China; Guangdong Key Lab. of Intelligent Information Process, Shenzhen University, Shenzhen, 518060, China.

出版信息

Neural Netw. 2023 May;162:258-270. doi: 10.1016/j.neunet.2023.02.045. Epub 2023 Mar 7.

DOI:10.1016/j.neunet.2023.02.045
PMID:36913822
Abstract

Multi-label Active Learning (MLAL) is an effective method to improve the performance of the classifier on multi-label problems with less annotation effort by allowing the learning system to actively select high-quality examples (example-label pairs) for labeling. Existing MLAL algorithms mainly focus on designing reasonable algorithms to evaluate the potential values (as previously mentioned quality) of the unlabeled data. These manually designed methods may show totally different results on various types of datasets due to the defect of the methods or the particularity of the datasets. In this paper, instead of manually designing an evaluation method, we propose a deep reinforcement learning (DRL) model to explore a general evaluation method on several seen datasets and eventually apply it to unseen datasets based on a meta framework. In addition, a self-attention mechanism along with a reward function is integrated into the DRL structure to address the label correlation and data imbalanced problems in MLAL. Comprehensive experiments show that our proposed DRL-based MLAL method is able to produce comparable results as compared with other methods reported in the literature.

摘要

多标签主动学习(MLAL)是一种有效的方法,可以通过允许学习系统主动选择高质量的示例(示例-标签对)进行标注,从而减少标注工作量,提高多标签问题的分类器性能。现有的 MLAL 算法主要侧重于设计合理的算法来评估未标记数据的潜在值(如前所述的质量)。由于方法的缺陷或数据集的特殊性,这些手动设计的方法在各种类型的数据集上可能会产生完全不同的结果。在本文中,我们不是手动设计评估方法,而是提出了一个深度强化学习(DRL)模型,在几个已见数据集上探索一种通用的评估方法,最终基于元框架将其应用于未见数据集。此外,我们还将自注意力机制和奖励函数集成到 DRL 结构中,以解决 MLAL 中的标签相关性和数据不平衡问题。综合实验表明,我们提出的基于 DRL 的 MLAL 方法能够产生与文献中报道的其他方法相当的结果。

相似文献

1
A meta-framework for multi-label active learning based on deep reinforcement learning.基于深度强化学习的多标签主动学习元框架。
Neural Netw. 2023 May;162:258-270. doi: 10.1016/j.neunet.2023.02.045. Epub 2023 Mar 7.
2
Deep reinforcement learning and its applications in medical imaging and radiation therapy: a survey.深度强化学习及其在医学影像和放射治疗中的应用:综述。
Phys Med Biol. 2022 Nov 11;67(22). doi: 10.1088/1361-6560/ac9cb3.
3
Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.基于双目标框架的多机器人编队分布式深度强化学习
Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.
4
What matters in reinforcement learning for tractography.在轨迹追踪中强化学习的要点。
Med Image Anal. 2024 Apr;93:103085. doi: 10.1016/j.media.2024.103085. Epub 2024 Jan 11.
5
LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning.LJIR:在合作多智能体强化学习中学习联合行动内在奖励
Neural Netw. 2023 Oct;167:450-459. doi: 10.1016/j.neunet.2023.08.016. Epub 2023 Aug 22.
6
Deep reinforcement learning for automated radiation adaptation in lung cancer.深度强化学习在肺癌放射自适应中的应用。
Med Phys. 2017 Dec;44(12):6690-6705. doi: 10.1002/mp.12625. Epub 2017 Nov 14.
7
Predictive hierarchical reinforcement learning for path-efficient mapless navigation with moving target.具有移动目标的无图路径高效导航的预测分层强化学习。
Neural Netw. 2023 Aug;165:677-688. doi: 10.1016/j.neunet.2023.06.007. Epub 2023 Jun 10.
8
Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm.基于鲸鱼优化算法优化自动驾驶中深度强化学习的超参数。
PLoS One. 2021 Jun 10;16(6):e0252754. doi: 10.1371/journal.pone.0252754. eCollection 2021.
9
Meta attention for Off-Policy Actor-Critic.用于离策略演员-评论家的元注意力机制
Neural Netw. 2023 Jun;163:86-96. doi: 10.1016/j.neunet.2023.03.024. Epub 2023 Mar 28.
10
A deep reinforcement learning algorithm for the rectangular strip packing problem.一种用于矩形带材打包问题的深度强化学习算法。
PLoS One. 2023 Mar 16;18(3):e0282598. doi: 10.1371/journal.pone.0282598. eCollection 2023.