• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于不平衡数据集的机器故障分类的 Tomk Link 和 SMOTE 方法。

Tomek Link and SMOTE Approaches for Machine Fault Classification with an Imbalanced Dataset.

机构信息

Department of Electrical and Electronics Engineering Technology, Doornfontein Campus, University of Johannesburg, Johannesburg 2028, South Africa.

Institute for Intelligent Systems, Auckland Park Campus, University of Johannesburg, Johannesburg 2006, South Africa.

出版信息

Sensors (Basel). 2022 Apr 23;22(9):3246. doi: 10.3390/s22093246.

DOI:10.3390/s22093246
PMID:35590937
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9099503/
Abstract

Data-driven methods have prominently featured in the progressive research and development of modern condition monitoring systems for electrical machines. These methods have the advantage of simplicity when it comes to the implementation of effective fault detection and diagnostic systems. Despite their many advantages, the practical implementation of data-driven approaches still faces challenges such as data imbalance. The lack of sufficient and reliable labeled fault data from machines in the field often poses a challenge in developing accurate supervised learning-based condition monitoring systems. This research investigates the use of a Naïve Bayes classifier, support vector machine, and k-nearest neighbors together with synthetic minority oversampling technique, Tomek link, and the combination of these two resampling techniques for fault classification with simulation and experimental imbalanced data. A comparative analysis of these techniques is conducted for different imbalanced data cases to determine the suitability thereof for condition monitoring on a wound-rotor induction generator. The precision, recall, and f1-score matrices are applied for performance evaluation. The results indicate that the technique combining the synthetic minority oversampling technique with the Tomek link provides the best performance across all tested classifiers. The k-nearest neighbors, together with this combination resampling technique yielded the most accurate classification results. This research is of interest to researchers and practitioners working in the area of condition monitoring in electrical machines, and the findings and presented approach of the comparative analysis will assist with the selection of the most suitable technique for handling imbalanced fault data. This is especially important in the practice of condition monitoring on electrical rotating machines, where fault data are very limited.

摘要

数据驱动方法在电机现代状态监测系统的不断研究和发展中占据重要地位。这些方法在实施有效的故障检测和诊断系统方面具有简单的优势。尽管具有许多优势,但数据驱动方法的实际实施仍然面临挑战,例如数据不平衡。现场机器缺乏足够和可靠的标记故障数据,这在开发基于监督学习的准确状态监测系统方面带来了挑战。本研究调查了朴素贝叶斯分类器、支持向量机和 k-最近邻与合成少数过采样技术、Tomek 链接以及这两种重采样技术的组合在模拟和实验不平衡数据中的故障分类中的应用。对不同不平衡数据情况的这些技术进行了比较分析,以确定它们在绕线转子感应发电机状态监测中的适用性。使用精度、召回率和 f1 分数矩阵进行性能评估。结果表明,在所有测试的分类器中,结合了合成少数过采样技术和 Tomek 链接的技术提供了最佳的性能。k-最近邻与这种组合重采样技术相结合,产生了最准确的分类结果。本研究对从事电机状态监测的研究人员和从业者具有重要意义,研究结果和提出的比较分析方法将有助于选择最适合处理不平衡故障数据的技术。这在电气旋转机器的状态监测实践中尤为重要,因为故障数据非常有限。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/3c14ce790c03/sensors-22-03246-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/e287dedf3eb5/sensors-22-03246-g001a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/bda595480912/sensors-22-03246-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/a21183f6bdd4/sensors-22-03246-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/f674e6e0ebdc/sensors-22-03246-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/beca24bb4d87/sensors-22-03246-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/4ef288ee9782/sensors-22-03246-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/49927c62f30c/sensors-22-03246-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/0ded02ebfd23/sensors-22-03246-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/1106e3f469de/sensors-22-03246-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/1fd1eb0b211a/sensors-22-03246-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/d3df6de40a1a/sensors-22-03246-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/646791611c71/sensors-22-03246-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/fbd658c749cd/sensors-22-03246-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/1bc8aa2bf047/sensors-22-03246-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/5db5523fad4b/sensors-22-03246-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/029240a375e0/sensors-22-03246-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/98573e6662c4/sensors-22-03246-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/12a705fdcc05/sensors-22-03246-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/bcb127b9729b/sensors-22-03246-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/2c157d2520cf/sensors-22-03246-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/0520cf84dcad/sensors-22-03246-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/3c14ce790c03/sensors-22-03246-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/e287dedf3eb5/sensors-22-03246-g001a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/bda595480912/sensors-22-03246-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/a21183f6bdd4/sensors-22-03246-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/f674e6e0ebdc/sensors-22-03246-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/beca24bb4d87/sensors-22-03246-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/4ef288ee9782/sensors-22-03246-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/49927c62f30c/sensors-22-03246-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/0ded02ebfd23/sensors-22-03246-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/1106e3f469de/sensors-22-03246-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/1fd1eb0b211a/sensors-22-03246-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/d3df6de40a1a/sensors-22-03246-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/646791611c71/sensors-22-03246-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/fbd658c749cd/sensors-22-03246-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/1bc8aa2bf047/sensors-22-03246-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/5db5523fad4b/sensors-22-03246-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/029240a375e0/sensors-22-03246-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/98573e6662c4/sensors-22-03246-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/12a705fdcc05/sensors-22-03246-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/bcb127b9729b/sensors-22-03246-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/2c157d2520cf/sensors-22-03246-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/0520cf84dcad/sensors-22-03246-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/086f/9099503/3c14ce790c03/sensors-22-03246-g022.jpg

相似文献

1
Tomek Link and SMOTE Approaches for Machine Fault Classification with an Imbalanced Dataset.基于不平衡数据集的机器故障分类的 Tomk Link 和 SMOTE 方法。
Sensors (Basel). 2022 Apr 23;22(9):3246. doi: 10.3390/s22093246.
2
Churn prediction in telecommunication industry using kernel Support Vector Machines.使用核支持向量机进行电信行业的流失预测。
PLoS One. 2022 May 24;17(5):e0267935. doi: 10.1371/journal.pone.0267935. eCollection 2022.
3
Comparing Sampling Strategies for Tackling Imbalanced Data in Human Activity Recognition.比较处理人体活动识别中不平衡数据的采样策略。
Sensors (Basel). 2022 Feb 11;22(4):1373. doi: 10.3390/s22041373.
4
Classification of Imbalanced Data by Oversampling in Kernel Space of Support Vector Machines.支持向量机核空间中基于过采样的不平衡数据分类
IEEE Trans Neural Netw Learn Syst. 2018 Sep;29(9):4065-4076. doi: 10.1109/TNNLS.2017.2751612. Epub 2017 Oct 10.
5
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
6
The prediction of cardiac abnormality and enhancement in minority class accuracy from imbalanced ECG signals using modified deep neural network models.使用改进的深度神经网络模型从不平衡心电图信号中预测心脏异常及少数类准确率的增强。
Comput Biol Med. 2022 Nov;150:106142. doi: 10.1016/j.compbiomed.2022.106142. Epub 2022 Sep 22.
7
A quantum-based oversampling method for classification of highly imbalanced and overlapped data.一种基于量子的过采样方法,用于分类高度不平衡和重叠的数据。
Exp Biol Med (Maywood). 2023 Dec;248(24):2500-2513. doi: 10.1177/15353702231220665. Epub 2024 Jan 28.
8
Structure-activity relationship-based chemical classification of highly imbalanced Tox21 datasets.基于结构-活性关系的高度不平衡Tox21数据集的化学分类
J Cheminform. 2020 Oct 27;12(1):66. doi: 10.1186/s13321-020-00468-x.
9
Predicting stroke events with a proactive fusion system: a comprehensive study on imbalance class handling in computational biomechanics.使用主动融合系统预测中风事件:计算生物力学中不平衡类处理的综合研究。
Comput Methods Biomech Biomed Engin. 2024 Jun 20:1-18. doi: 10.1080/10255842.2024.2363946.
10
SMOTE for high-dimensional class-imbalanced data.过采样处理高维类别不平衡数据。
BMC Bioinformatics. 2013 Mar 22;14:106. doi: 10.1186/1471-2105-14-106.

引用本文的文献

1
Identification of water sources of mine water bursts based on the FPS-DT model.基于FPS-DT模型的矿井突水水源识别
Sci Rep. 2025 Jul 27;15(1):27327. doi: 10.1038/s41598-025-13301-y.
2
An improved SMOTE algorithm for enhanced imbalanced data classification by expanding sample generation space.一种通过扩展样本生成空间来增强不平衡数据分类的改进型SMOTE算法。
Sci Rep. 2025 Jul 2;15(1):23521. doi: 10.1038/s41598-025-09506-w.
3
Linear B-cell epitope prediction for SARS and COVID-19 vaccine design: Integrating balanced ensemble learning models and resampling strategies.

本文引用的文献

1
Intelligent fault diagnosis of machines with small & imbalanced data: A state-of-the-art review and possible extensions.小数据与不平衡数据情况下机器的智能故障诊断:最新综述与可能的拓展
ISA Trans. 2022 Jan;119:152-171. doi: 10.1016/j.isatra.2021.02.042. Epub 2021 Mar 8.
用于SARS和COVID-19疫苗设计的线性B细胞表位预测:集成平衡集成学习模型和重采样策略
PeerJ Comput Sci. 2025 Jun 18;11:e2970. doi: 10.7717/peerj-cs.2970. eCollection 2025.
4
Enhancing patient rehabilitation outcomes: artificial intelligence-driven predictive modeling for home discharge in neurological and orthopedic conditions.提高患者康复效果:针对神经科和骨科疾病出院居家情况的人工智能驱动预测模型
J Neuroeng Rehabil. 2025 May 26;22(1):117. doi: 10.1186/s12984-025-01654-4.
5
An ensemble deep learning framework for emotion recognition through wearable devices multi-modal physiological signals.一种通过可穿戴设备多模态生理信号进行情感识别的集成深度学习框架。
Sci Rep. 2025 May 18;15(1):17263. doi: 10.1038/s41598-025-99858-0.
6
Explainable machine learning for predictive modeling of blowing snow detection and meteorological feature assessment using XGBoost-SHAP.基于XGBoost-SHAP的吹雪检测与气象特征评估预测建模的可解释机器学习
PLoS One. 2025 Mar 28;20(3):e0318835. doi: 10.1371/journal.pone.0318835. eCollection 2025.
7
Addressing data imbalance in collision risk prediction with active generative oversampling.通过主动生成过采样解决碰撞风险预测中的数据不平衡问题。
Sci Rep. 2025 Mar 17;15(1):9133. doi: 10.1038/s41598-025-93851-3.
8
Fault Detection in Induction Machines Using Learning Models and Fourier Spectrum Image Analysis.基于学习模型和傅里叶频谱图像分析的感应电机故障检测
Sensors (Basel). 2025 Jan 15;25(2):471. doi: 10.3390/s25020471.
9
Dual inhibition of AChE and MAO-B in Alzheimer's disease: machine learning approaches and model interpretations.阿尔茨海默病中乙酰胆碱酯酶和单胺氧化酶-B的双重抑制:机器学习方法与模型解读
Mol Divers. 2025 Jan 21. doi: 10.1007/s11030-024-11061-x.
10
Learning from Imbalanced Data: Integration of Advanced Resampling Techniques and Machine Learning Models for Enhanced Cancer Diagnosis and Prognosis.从不平衡数据中学习:先进重采样技术与机器学习模型的整合用于增强癌症诊断与预后
Cancers (Basel). 2024 Oct 8;16(19):3417. doi: 10.3390/cancers16193417.