• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 P-Splines 的时间序列数据的快速加权模糊 C-中值聚类。

A Fast Weighted Fuzzy C-Medoids Clustering for Time Series Data Based on P-Splines.

机构信息

College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China.

Engineering Lab of Intelligence Business & Internet of Things, Xinxiang 453007, China.

出版信息

Sensors (Basel). 2022 Aug 17;22(16):6163. doi: 10.3390/s22166163.

DOI:10.3390/s22166163
PMID:36015930
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9414275/
Abstract

The rapid growth of digital information has produced massive amounts of time series data on rich features and most time series data are noisy and contain some outlier samples, which leads to a decline in the clustering effect. To efficiently discover the hidden statistical information about the data, a fast weighted fuzzy C-medoids clustering algorithm based on P-splines (PS-WFCMdd) is proposed for time series datasets in this study. Specifically, the P-spline method is used to fit the functional data related to the original time series data, and the obtained smooth-fitting data is used as the input of the clustering algorithm to enhance the ability to process the data set during the clustering process. Then, we define a new weighted method to further avoid the influence of outlier sample points in the weighted fuzzy C-medoids clustering process, to improve the robustness of our algorithm. We propose using the third version of mueen's algorithm for similarity search (MASS 3) to measure the similarity between time series quickly and accurately, to further improve the clustering efficiency. Our new algorithm is compared with several other time series clustering algorithms, and the performance of the algorithm is evaluated experimentally on different types of time series examples. The experimental results show that our new method can speed up data processing and the comprehensive performance of each clustering evaluation index are relatively good.

摘要

数字信息的快速增长产生了大量具有丰富特征的时间序列数据,而大多数时间序列数据都是嘈杂的,并包含一些异常样本,这导致聚类效果下降。为了有效地发现数据的隐藏统计信息,针对时间序列数据集,本文提出了一种基于 P-样条的快速加权模糊 C-均值聚类算法(PS-WFCMdd)。具体来说,使用 P-样条方法拟合与原始时间序列数据相关的函数型数据,并将获得的平滑拟合数据用作聚类算法的输入,以增强聚类过程中处理数据集的能力。然后,我们定义了一种新的加权方法,进一步避免了加权模糊 C-均值聚类过程中异常样本点的影响,提高了算法的稳健性。我们提出使用 mueen 算法的第三个版本(MASS 3)来快速准确地测量时间序列之间的相似性,以进一步提高聚类效率。我们将新算法与其他几种时间序列聚类算法进行了比较,并在不同类型的时间序列示例上进行了实验评估算法的性能。实验结果表明,我们的新方法可以加快数据处理速度,并且每个聚类评估指标的综合性能都相对较好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/99f1ede57e63/sensors-22-06163-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/35a014cb96be/sensors-22-06163-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/cbf14f5ba20b/sensors-22-06163-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/6b1ac65050d7/sensors-22-06163-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/dd919e105abd/sensors-22-06163-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/33d719be34c9/sensors-22-06163-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/9dbe30f0df3f/sensors-22-06163-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/99f1ede57e63/sensors-22-06163-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/35a014cb96be/sensors-22-06163-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/cbf14f5ba20b/sensors-22-06163-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/6b1ac65050d7/sensors-22-06163-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/dd919e105abd/sensors-22-06163-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/33d719be34c9/sensors-22-06163-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/9dbe30f0df3f/sensors-22-06163-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/569f/9414275/99f1ede57e63/sensors-22-06163-g007.jpg

相似文献

1
A Fast Weighted Fuzzy C-Medoids Clustering for Time Series Data Based on P-Splines.基于 P-Splines 的时间序列数据的快速加权模糊 C-中值聚类。
Sensors (Basel). 2022 Aug 17;22(16):6163. doi: 10.3390/s22166163.
2
Incremental fuzzy C medoids clustering of time series data using dynamic time warping distance.基于动态时间规整距离的时间序列数据的增量模糊 C 均值聚类。
PLoS One. 2018 May 24;13(5):e0197499. doi: 10.1371/journal.pone.0197499. eCollection 2018.
3
A robust fuzzy local information C-Means clustering algorithm.一种鲁棒的模糊局部信息 C-均值聚类算法。
IEEE Trans Image Process. 2010 May;19(5):1328-37. doi: 10.1109/TIP.2010.2040763. Epub 2010 Jan 19.
4
Information Granulation-Based Fuzzy Clustering of Time Series.基于信息粒度的时间序列模糊聚类。
IEEE Trans Cybern. 2021 Dec;51(12):6253-6261. doi: 10.1109/TCYB.2020.2970455. Epub 2021 Dec 22.
5
Point Tracking Technology of Sports Image Sequence Marks Based on Fuzzy Clustering Algorithm.基于模糊聚类算法的运动图像序列标志点跟踪技术。
Comput Intell Neurosci. 2022 Apr 28;2022:3814252. doi: 10.1155/2022/3814252. eCollection 2022.
6
Hybrid fuzzy cluster ensemble framework for tumor clustering from biomolecular data.用于从生物分子数据中进行肿瘤聚类的混合模糊聚类集成框架。
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):657-70. doi: 10.1109/TCBB.2013.59.
7
Brain tumor segmentation approach based on the extreme learning machine and significantly fast and robust fuzzy C-means clustering algorithms running on Raspberry Pi hardware.基于极端学习机和显著快速稳健的模糊 C 均值聚类算法在 Raspberry Pi 硬件上运行的脑肿瘤分割方法。
Med Hypotheses. 2020 Mar;136:109507. doi: 10.1016/j.mehy.2019.109507. Epub 2019 Nov 18.
8
Spatial robust fuzzy clustering of COVID 19 time series based on B-splines.基于B样条的COVID-19时间序列空间稳健模糊聚类
Spat Stat. 2022 Jun;49:100518. doi: 10.1016/j.spasta.2021.100518. Epub 2021 May 15.
9
The fuzzy clustering analysis based on AFS theory.基于AFS理论的模糊聚类分析。
IEEE Trans Syst Man Cybern B Cybern. 2005 Oct;35(5):1013-27. doi: 10.1109/tsmcb.2005.847747.
10
A new validity measure for a correlation-based fuzzy c-means clustering algorithm.一种基于相关性的模糊 c 均值聚类算法的新有效性度量。
Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:3865-8. doi: 10.1109/IEMBS.2009.5332582.

引用本文的文献

1
Equivalence partition based morphological similarity clustering for large-scale time series.基于等价划分的形态相似性聚类在大规模时间序列中的应用。
Sci Rep. 2023 Apr 11;13(1):5900. doi: 10.1038/s41598-023-33074-6.

本文引用的文献

1
Pythagorean fuzzy time series model based on Pythagorean fuzzy c-means and improved Markov weighted in the prediction of the new COVID-19 cases.基于勾股模糊C均值和改进马尔可夫加权的勾股模糊时间序列模型在新型冠状病毒肺炎病例预测中的应用
Soft comput. 2021;25(22):13881-13896. doi: 10.1007/s00500-021-06259-2. Epub 2021 Oct 6.
2
Particle swarm optimization of partitions and fuzzy order for fuzzy time series forecasting of COVID-19.用于COVID-19模糊时间序列预测的分区粒子群优化与模糊排序
Appl Soft Comput. 2021 Oct;110:107611. doi: 10.1016/j.asoc.2021.107611. Epub 2021 Jun 17.
3
Spatial robust fuzzy clustering of COVID 19 time series based on B-splines.
基于B样条的COVID-19时间序列空间稳健模糊聚类
Spat Stat. 2022 Jun;49:100518. doi: 10.1016/j.spasta.2021.100518. Epub 2021 May 15.
4
Kernel Probabilistic K-Means Clustering.核概率 K-均值聚类。
Sensors (Basel). 2021 Mar 8;21(5):1892. doi: 10.3390/s21051892.
5
Intuitionistic Fuzzy C-Means Algorithm Based on Membership Information Transfer-Ring and Similarity Measurement.基于隶属度信息传递环和相似度测度的直觉模糊 C 均值算法。
Sensors (Basel). 2021 Jan 20;21(3):696. doi: 10.3390/s21030696.
6
Fuzzy clustering method to compare the spread rate of Covid-19 in the high risks countries.用于比较新冠病毒在高风险国家传播率的模糊聚类方法。
Chaos Solitons Fractals. 2020 Nov;140:110230. doi: 10.1016/j.chaos.2020.110230. Epub 2020 Aug 22.
7
Incremental fuzzy C medoids clustering of time series data using dynamic time warping distance.基于动态时间规整距离的时间序列数据的增量模糊 C 均值聚类。
PLoS One. 2018 May 24;13(5):e0197499. doi: 10.1371/journal.pone.0197499. eCollection 2018.
8
Time warp edit distance with stiffness adjustment for time series matching.用于时间序列匹配的带刚度调整的时间扭曲编辑距离。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):306-18. doi: 10.1109/TPAMI.2008.76.
9
Properties of the Hubert-Arabie adjusted Rand index.休伯特 - 阿拉比调整兰德指数的性质。
Psychol Methods. 2004 Sep;9(3):386-96. doi: 10.1037/1082-989X.9.3.386.