• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FlexSketch:平稳和非平稳数据流的概率密度估计

FlexSketch: Estimation of Probability Density for Stationary and Non-Stationary Data Streams.

作者信息

Park Namuk, Kim Songkuk

机构信息

School of Integrated Technology, Yonsei University, Incheon 21983, Korea.

出版信息

Sensors (Basel). 2021 Feb 4;21(4):1080. doi: 10.3390/s21041080.

DOI:10.3390/s21041080
PMID:33557367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7915800/
Abstract

Efficient and accurate estimation of the probability distribution of a data stream is an important problem in many sensor systems. It is especially challenging when the data stream is non-stationary, i.e., its probability distribution changes over time. Statistical models for non-stationary data streams demand agile adaptation for concept drift while tolerating temporal fluctuations. To this end, a statistical model needs to forget old data samples and to detect concept drift swiftly. In this paper, we propose FlexSketch, an online probability density estimation algorithm for data streams. Our algorithm uses an ensemble of histograms, each of which represents a different length of data history. FlexSketch updates each histogram for a new data sample and generates probability distribution by combining the ensemble of histograms while monitoring discrepancy between recent data and existing models periodically. When it detects concept drift, a new histogram is added to the ensemble and the oldest histogram is removed. This allows us to estimate the probability density function with high update speed and high accuracy using only limited memory. Experimental results demonstrate that our algorithm shows improved speed and accuracy compared to existing methods for both stationary and non-stationary data streams.

摘要

在许多传感器系统中,高效且准确地估计数据流的概率分布是一个重要问题。当数据流是非平稳的,即其概率分布随时间变化时,这一问题尤其具有挑战性。非平稳数据流的统计模型需要在容忍时间波动的同时,对概念漂移进行灵活适应。为此,统计模型需要遗忘旧的数据样本并迅速检测概念漂移。在本文中,我们提出了FlexSketch,一种用于数据流的在线概率密度估计算法。我们的算法使用一组直方图,每个直方图代表不同长度的数据历史。FlexSketch为新的数据样本更新每个直方图,并通过组合直方图组来生成概率分布,同时定期监测近期数据与现有模型之间的差异。当检测到概念漂移时,会向直方图组中添加一个新的直方图,并移除最旧的直方图。这使我们能够仅使用有限的内存,以高更新速度和高精度估计概率密度函数。实验结果表明,与现有方法相比,我们的算法在平稳和非平稳数据流上均具有更高的速度和准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/e3b72aa06a46/sensors-21-01080-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/e1cfb255246f/sensors-21-01080-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/8bc12de46846/sensors-21-01080-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/6d826bed228a/sensors-21-01080-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/5da62371dcb1/sensors-21-01080-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/478e00ac889f/sensors-21-01080-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/fa05fc64af1e/sensors-21-01080-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/51f729ff516d/sensors-21-01080-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/0a625565f7ba/sensors-21-01080-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/440612bf55cf/sensors-21-01080-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/b2c1576ee0fa/sensors-21-01080-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/736454ccef9d/sensors-21-01080-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/e3b72aa06a46/sensors-21-01080-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/e1cfb255246f/sensors-21-01080-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/8bc12de46846/sensors-21-01080-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/6d826bed228a/sensors-21-01080-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/5da62371dcb1/sensors-21-01080-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/478e00ac889f/sensors-21-01080-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/fa05fc64af1e/sensors-21-01080-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/51f729ff516d/sensors-21-01080-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/0a625565f7ba/sensors-21-01080-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/440612bf55cf/sensors-21-01080-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/b2c1576ee0fa/sensors-21-01080-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/736454ccef9d/sensors-21-01080-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e204/7915800/e3b72aa06a46/sensors-21-01080-g012.jpg

相似文献

1
FlexSketch: Estimation of Probability Density for Stationary and Non-Stationary Data Streams.FlexSketch:平稳和非平稳数据流的概率密度估计
Sensors (Basel). 2021 Feb 4;21(4):1080. doi: 10.3390/s21041080.
2
Diverse Instance-Weighting Ensemble Based on Region Drift Disagreement for Concept Drift Adaptation.基于区域漂移分歧的多样化实例加权集成用于概念漂移适应
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):293-307. doi: 10.1109/TNNLS.2020.2978523. Epub 2021 Jan 4.
3
On the Parzen Kernel-Based Probability Density Function Learning Procedures Over Time-Varying Streaming Data With Applications to Pattern Classification.基于Parzen核的时变流数据概率密度函数学习方法及其在模式分类中的应用
IEEE Trans Cybern. 2020 Apr;50(4):1683-1696. doi: 10.1109/TCYB.2018.2877611. Epub 2018 Nov 15.
4
Adaptive Robust Local Online Density Estimation for Streaming Data.用于流数据的自适应鲁棒局部在线密度估计
Int J Mach Learn Cybern. 2021 Jun;12(6):1803-1824. doi: 10.1007/s13042-021-01275-y. Epub 2021 Feb 3.
5
Reacting to different types of concept drift: the Accuracy Updated Ensemble algorithm.应对不同类型的概念漂移:准确性更新集成算法。
IEEE Trans Neural Netw Learn Syst. 2014 Jan;25(1):81-94. doi: 10.1109/TNNLS.2013.2251352.
6
Adaptive windowing based recurrent neural network for drift adaption in non-stationary environment.基于自适应窗口的循环神经网络用于非平稳环境中的漂移自适应
J Ambient Intell Humaniz Comput. 2022 Jun 30:1-15. doi: 10.1007/s12652-022-04116-0.
7
Adaptive Chunk-Based Dynamic Weighted Majority for Imbalanced Data Streams With Concept Drift.用于处理带有概念漂移的不平衡数据流的基于自适应块的动态加权多数算法
IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2764-2778. doi: 10.1109/TNNLS.2019.2951814. Epub 2019 Dec 5.
8
A dynamic ensemble framework for mining textual streams with class imbalance.一种用于挖掘具有类别不平衡的文本流的动态集成框架。
ScientificWorldJournal. 2014;2014:497354. doi: 10.1155/2014/497354. Epub 2014 Apr 10.
9
SOMKE: kernel density estimation over data streams by sequences of self-organizing maps.序列自组织映射的数据流核密度估计
IEEE Trans Neural Netw Learn Syst. 2012 Aug;23(8):1254-68. doi: 10.1109/TNNLS.2012.2201167.
10
Concept Drift Detection via Equal Intensity k-Means Space Partitioning.通过等强度 k-均值空间分区进行概念漂移检测。
IEEE Trans Cybern. 2021 Jun;51(6):3198-3211. doi: 10.1109/TCYB.2020.2983962. Epub 2021 May 18.

本文引用的文献

1
Gaussian Process Regression-Based Video Anomaly Detection and Localization With Hierarchical Feature Representation.基于高斯过程回归的分层特征表示的视频异常检测与定位。
IEEE Trans Image Process. 2015 Dec;24(12):5288-301. doi: 10.1109/TIP.2015.2479561. Epub 2015 Sep 17.
2
Incremental learning of concept drift in nonstationary environments.非平稳环境中概念漂移的增量学习
IEEE Trans Neural Netw. 2011 Oct;22(10):1517-31. doi: 10.1109/TNN.2011.2160459. Epub 2011 Aug 4.