Learning High-Dimensional Evolving Data Streams With Limited Labels.

IEEE Trans Cybern. 2022 Nov;52(11):11373-11384. doi: 10.1109/TCYB.2021.3070420. Epub 2022 Oct 17.

In the context of streaming data, learning algorithms must often confront several distinctive challenges, such as concept drift, label scarcity, and high dimensionality. Several concept drift-aware data stream learning algorithms have been proposed over the past decades to tackle these issues. However, most existing algorithms use a supervised learning framework and require all true class labels to update their models. In a streaming environment, requiring every label is infeasible and unrealistic for many real-world applications, so learning from data streams with minimal labels is the more practical scenario. To address both the curse of dimensionality and label scarcity, in this article we present a new semisupervised learning technique for streaming data. To mitigate the curse of dimensionality, we employ a denoising autoencoder to transform the high-dimensional feature space into a reduced, compact, and more informative feature representation. To reduce the dependency on true class labels, we use a cluster-and-label technique: a synchronization-based dynamic clustering technique summarizes the streaming data into a set of dynamic microclusters, which are then used for classification. In addition, we employ a disagreement-based learning method to cope with concept drift. Extensive experiments on many real-world datasets demonstrate the superior performance of the proposed method compared with several state-of-the-art methods.
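The cluster-and-label idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the abstract's synchronization-based clustering, denoising autoencoder, and disagreement-based drift handling are all omitted, and a simple fixed-radius "leader" clustering is swapped in. The names `MicroCluster`, `ClusterAndLabel`, `learn_one`, `predict_one`, and the `radius` parameter are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


class MicroCluster:
    """Running summary of nearby points: a mean center plus label votes."""

    def __init__(self, point):
        self.center = list(point)
        self.count = 1
        self.label_votes = Counter()

    def absorb(self, point):
        # Incremental mean update, so no past points need to be stored.
        self.count += 1
        for i, x in enumerate(point):
            self.center[i] += (x - self.center[i]) / self.count

    def label(self):
        # Majority label among the few labeled points seen, or None.
        if not self.label_votes:
            return None
        return self.label_votes.most_common(1)[0][0]


class ClusterAndLabel:
    """Assumed stand-in: fixed-radius clustering, not synchronization-based."""

    def __init__(self, radius):
        self.radius = radius  # hypothetical absorption threshold
        self.clusters = []

    def _nearest(self, point, labeled_only=False):
        best, best_dist = None, math.inf
        for mc in self.clusters:
            if labeled_only and mc.label() is None:
                continue
            d = math.dist(point, mc.center)
            if d < best_dist:
                best, best_dist = mc, d
        return best, best_dist

    def learn_one(self, point, label=None):
        # Absorb the point into the nearest microcluster, or open a new one.
        mc, d = self._nearest(point)
        if mc is None or d > self.radius:
            mc = MicroCluster(point)
            self.clusters.append(mc)
        else:
            mc.absorb(point)
        if label is not None:  # only a fraction of the stream is labeled
            mc.label_votes[label] += 1

    def predict_one(self, point):
        # Classify by the majority label of the nearest labeled microcluster.
        mc, _ = self._nearest(point, labeled_only=True)
        return None if mc is None else mc.label()


# Toy stream: only two of the five points carry a label.
stream = [
    ((0.0, 0.0), "a"),
    ((0.2, 0.1), None),
    ((5.0, 5.0), "b"),
    ((5.1, 4.9), None),
    ((0.1, -0.1), None),
]
model = ClusterAndLabel(radius=1.0)
for x, y in stream:
    model.learn_one(x, y)

pred_a = model.predict_one((0.3, 0.2))
pred_b = model.predict_one((4.8, 5.2))
```

In this toy run the unlabeled points inherit the label of the microcluster that absorbs them, which is the sense in which clustering reduces the need for true class labels. A real stream learner would also need to age out or adapt microclusters as the data distribution drifts.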


Similar articles

Learning High-Dimensional Evolving Data Streams With Limited Labels.

IEEE Trans Cybern. 2022 Nov;52(11):11373-11384. doi: 10.1109/TCYB.2021.3070420. Epub 2022 Oct 17.

Difficult Novel Class Detection in Semisupervised Streaming Data.

IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6872-6886. doi: 10.1109/TNNLS.2022.3213682. Epub 2023 Oct 5.

A dynamic ensemble framework for mining textual streams with class imbalance.

ScientificWorldJournal. 2014;2014:497354. doi: 10.1155/2014/497354. Epub 2014 Apr 10.

Joint Label Inference and Discriminant Embedding.

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4413-4423. doi: 10.1109/TNNLS.2021.3057270. Epub 2022 Aug 31.

A Bi-Criteria Active Learning Algorithm for Dynamic Data Streams.

IEEE Trans Neural Netw Learn Syst. 2018 Jan;29(1):74-86. doi: 10.1109/TNNLS.2016.2614393. Epub 2016 Oct 21.

Robust dimensionality reduction via feature space to feature space distance metric learning.

Neural Netw. 2019 Apr;112:1-14. doi: 10.1016/j.neunet.2019.01.001. Epub 2019 Jan 21.

Adaptive Chunk-Based Dynamic Weighted Majority for Imbalanced Data Streams With Concept Drift.

IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2764-2778. doi: 10.1109/TNNLS.2019.2951814. Epub 2019 Dec 5.

Extended T: Learning With Mixed Closed-Set and Open-Set Noisy Labels.

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3047-3058. doi: 10.1109/TPAMI.2022.3180545. Epub 2023 Feb 3.

Semisupervised Feature Selection via Structured Manifold Learning.

IEEE Trans Cybern. 2022 Jul;52(7):5756-5766. doi: 10.1109/TCYB.2021.3052847. Epub 2022 Jul 4.

Dynamic Sparse Subspace Clustering for Evolving High-Dimensional Data Streams.

IEEE Trans Cybern. 2022 Jun;52(6):4173-4186. doi: 10.1109/TCYB.2020.3023973. Epub 2022 Jun 16.
