• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于漂移数据流的在线主动学习集成框架

Online Active Learning Ensemble Framework for Drifted Data Streams.

作者信息

Shan Jicheng, Zhang Hang, Liu Weike, Liu Qingbao

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 Feb;30(2):486-498. doi: 10.1109/TNNLS.2018.2844332. Epub 2018 Jul 2.

DOI:10.1109/TNNLS.2018.2844332
PMID:29994730
Abstract

In practical applications, data stream classification faces significant challenges, such as high cost of labeling instances and potential concept drifting. We present a new online active learning ensemble framework for drifting data streams based on a hybrid labeling strategy that includes the following: 1) an ensemble classifier, which consists of a long-term stable classifier and multiple dynamic classifiers (a multilevel sliding window model is used to create and update the dynamic classifiers to effectively process both the gradual drift type and sudden drift type data stream) and 2) active learning, which takes a nonfixed labeling budget, supports on-demand request labeling, and adopts an uncertainty strategy and random strategy to label instances. The decision threshold of the uncertainty strategy is adjusted dynamically, i.e., when concept drift occurs, the threshold is gradually reduced to query the most uncertain instances in priority to reduce the request expense as much as possible. Experiments on synthetic and real data sets show that precise prediction accuracy can be obtained by the proposed method without increasing the total cost of labeling, and that the labeling cost can be dynamically allocated according to the concept drift.

摘要

在实际应用中,数据流分类面临着重大挑战,例如标记实例的成本高昂以及潜在的概念漂移。我们基于一种混合标记策略,提出了一种用于漂移数据流的新型在线主动学习集成框架,该框架包括以下内容:1)一个集成分类器,它由一个长期稳定的分类器和多个动态分类器组成(使用多级滑动窗口模型来创建和更新动态分类器,以有效处理渐变漂移类型和突发漂移类型的数据流);2)主动学习,它采用非固定的标记预算,支持按需请求标记,并采用不确定性策略和随机策略来标记实例。不确定性策略的决策阈值会动态调整,即当概念漂移发生时,阈值会逐渐降低,以便优先查询最不确定的实例,从而尽可能降低请求成本。在合成数据集和真实数据集上的实验表明,所提出的方法能够在不增加总标记成本的情况下获得精确的预测准确率,并且可以根据概念漂移动态分配标记成本。

相似文献

1
Online Active Learning Ensemble Framework for Drifted Data Streams.用于漂移数据流的在线主动学习集成框架
IEEE Trans Neural Netw Learn Syst. 2019 Feb;30(2):486-498. doi: 10.1109/TNNLS.2018.2844332. Epub 2018 Jul 2.
2
Adaptive Chunk-Based Dynamic Weighted Majority for Imbalanced Data Streams With Concept Drift.用于处理带有概念漂移的不平衡数据流的基于自适应块的动态加权多数算法
IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2764-2778. doi: 10.1109/TNNLS.2019.2951814. Epub 2019 Dec 5.
3
Active learning from stream data using optimal weight classifier ensemble.使用最优权重分类器集成从流数据中进行主动学习。
IEEE Trans Syst Man Cybern B Cybern. 2010 Dec;40(6):1607-21. doi: 10.1109/TSMCB.2010.2042445. Epub 2010 Apr 1.
4
Dynamic Ensemble Selection for Imbalanced Data Streams With Concept Drift.具有概念漂移的不平衡数据流的动态集成选择
IEEE Trans Neural Netw Learn Syst. 2022 Jun 22;PP. doi: 10.1109/TNNLS.2022.3183120.
5
Online Active Learning for Drifting Data Streams.针对漂移数据流的在线主动学习
IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):186-200. doi: 10.1109/TNNLS.2021.3091681. Epub 2023 Jan 5.
6
An ensemble learning method with GAN-based sampling and consistency check for anomaly detection of imbalanced data streams with concept drift.一种基于生成对抗网络(GAN)采样和一致性检查的集成学习方法,用于具有概念漂移的不平衡数据流的异常检测。
PLoS One. 2024 Jan 26;19(1):e0292140. doi: 10.1371/journal.pone.0292140. eCollection 2024.
7
Diverse Instance-Weighting Ensemble Based on Region Drift Disagreement for Concept Drift Adaptation.基于区域漂移分歧的多样化实例加权集成用于概念漂移适应
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):293-307. doi: 10.1109/TNNLS.2020.2978523. Epub 2021 Jan 4.
8
Reacting to different types of concept drift: the Accuracy Updated Ensemble algorithm.应对不同类型的概念漂移:准确性更新集成算法。
IEEE Trans Neural Netw Learn Syst. 2014 Jan;25(1):81-94. doi: 10.1109/TNNLS.2013.2251352.
9
Active learning with drifting streaming data.带有漂移流数据的主动学习。
IEEE Trans Neural Netw Learn Syst. 2014 Jan;25(1):27-39. doi: 10.1109/TNNLS.2012.2236570.
10
Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble.使用异构多样化动态类加权集成对漂移数据流进行分类。
PeerJ Comput Sci. 2021 Apr 1;7:e459. doi: 10.7717/peerj-cs.459. eCollection 2021.

引用本文的文献

1
Longitudinal clustering analysis and prediction of Parkinson's disease progression using radiomics and hybrid machine learning.使用放射组学和混合机器学习对帕金森病进展进行纵向聚类分析和预测。
Quant Imaging Med Surg. 2022 Feb;12(2):906-919. doi: 10.21037/qims-21-425.
2
Active Learning on Dynamic Clustering for Drift Compensation in an Electronic Nose System.电子鼻系统中基于动态聚类的主动学习用于漂移补偿
Sensors (Basel). 2019 Aug 19;19(16):3601. doi: 10.3390/s19163601.
3
Predicting Outpatient Appointment Demand Using Machine Learning and Traditional Methods.
运用机器学习和传统方法预测门诊预约需求。
J Med Syst. 2019 Jul 19;43(9):288. doi: 10.1007/s10916-019-1418-y.
4
Gas-Sensor Drift Counteraction with Adaptive Active Learning for an Electronic Nose.电子鼻中采用自适应主动学习的气体传感器漂移校正。
Sensors (Basel). 2018 Nov 19;18(11):4028. doi: 10.3390/s18114028.