• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Yeo-Johnson、K-means SMOTE 和最优岩爆特征维度确定的集成堆叠岩爆预测模型。

Ensemble stacking rockburst prediction model based on Yeo-Johnson, K-means SMOTE, and optimal rockburst feature dimension determination.

机构信息

School of Resources and Environmental Engineering, Wuhan University of Science and Technology, Wuhan, 430081, Hubei, China.

Wuhan Safety and Environmental Protection Research Institute of Sinosteel Group, Wuhan, 430081, Hubei, China.

出版信息

Sci Rep. 2022 Sep 12;12(1):15352. doi: 10.1038/s41598-022-19669-5.

DOI:10.1038/s41598-022-19669-5
PMID:36097043
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9468028/
Abstract

Rockburst forecasting plays a crucial role in prevention and control of rockburst disaster. To improve the accuracy of rockburst prediction at the data structure and algorithm levels, the Yeo-Johnson transform, K-means SMOTE oversampling, and optimal rockburst feature dimension determination are used to optimize the data structure. At the algorithm optimization level, ensemble stacking rockburst prediction is performed based on the data structure optimization. First, to solve the problem of many outliers and data imbalance in the distribution of rockburst data, the Yeo-Johnson transform and k-means SMOTE algorithm are respectively used to solve the problems. Then, based on six original rockburst features, 21 new features are generated using the PolynomialFeatures function in Sklearn. Principal component analysis (PCA) dimensionality reduction is applied to eliminate the correlations between the 27 features. Thirteen types of machine learning algorithms are used to predict datasets that retain different numbers of features after dimensionality reduction to determine the optimal rockburst feature dimension. Finally, the 14-feature rockburst dataset is used as the input for integrated stacking. The results show that the ensemble stacking model based on Yeo-Johnson, K-means SMOTE, and optimal rockburst feature dimension determination can improve the accuracy of rockburst prediction by 0.1602-0.3636. Compared with the 13 single machine learning models without data preprocessing, this data structure optimization and algorithm optimization method effectively improves the accuracy of rockburst prediction.

摘要

岩爆预测在岩爆灾害的防治中起着至关重要的作用。为了提高数据结构和算法层面的岩爆预测精度,采用了约翰逊变换、K-means SMOTE 过采样和最优岩爆特征维度确定来优化数据结构。在算法优化层面,基于数据结构优化进行集成堆叠岩爆预测。首先,为了解决岩爆数据分布中存在的大量异常值和数据不平衡问题,分别采用了约翰逊变换和 K-means SMOTE 算法进行处理。然后,基于六个原始岩爆特征,使用 Sklearn 中的 PolynomialFeatures 函数生成了 21 个新特征。应用主成分分析(PCA)降维来消除 27 个特征之间的相关性。使用 13 种机器学习算法来预测降维后保留不同数量特征的数据集,以确定最优的岩爆特征维度。最后,将 14 特征岩爆数据集作为集成堆叠的输入。结果表明,基于约翰逊变换、K-means SMOTE 和最优岩爆特征维度确定的集成堆叠模型可以将岩爆预测的准确率提高 0.1602-0.3636。与没有数据预处理的 13 个单机器学习模型相比,这种数据结构优化和算法优化方法有效地提高了岩爆预测的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/3dbe08a1426b/41598_2022_19669_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/28ab3c1fbaa2/41598_2022_19669_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/735f475d6119/41598_2022_19669_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/b123119c54f7/41598_2022_19669_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/ad193a4a3056/41598_2022_19669_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/31f96298d393/41598_2022_19669_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/36e177e9f82e/41598_2022_19669_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/d14ed574c0d3/41598_2022_19669_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/3152f4a1b24b/41598_2022_19669_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/3dbe08a1426b/41598_2022_19669_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/28ab3c1fbaa2/41598_2022_19669_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/735f475d6119/41598_2022_19669_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/b123119c54f7/41598_2022_19669_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/ad193a4a3056/41598_2022_19669_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/31f96298d393/41598_2022_19669_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/36e177e9f82e/41598_2022_19669_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/d14ed574c0d3/41598_2022_19669_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/3152f4a1b24b/41598_2022_19669_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffd7/9468028/3dbe08a1426b/41598_2022_19669_Fig9_HTML.jpg

相似文献

1
Ensemble stacking rockburst prediction model based on Yeo-Johnson, K-means SMOTE, and optimal rockburst feature dimension determination.基于 Yeo-Johnson、K-means SMOTE 和最优岩爆特征维度确定的集成堆叠岩爆预测模型。
Sci Rep. 2022 Sep 12;12(1):15352. doi: 10.1038/s41598-022-19669-5.
2
Intelligent prediction of rockburst in tunnels based on back propagation neural network integrated beetle antennae search algorithm.基于反向传播神经网络集成甲壳虫触角搜索算法的隧道岩爆智能预测。
Environ Sci Pollut Res Int. 2023 Mar;30(12):33960-33973. doi: 10.1007/s11356-022-24420-8. Epub 2022 Dec 11.
3
Novel ensemble intelligence methodologies for rockburst assessment in complex and variable environments.新型集成智能方法在复杂多变环境下的岩爆评估。
Sci Rep. 2022 Feb 3;12(1):1844. doi: 10.1038/s41598-022-05594-0.
4
Feature group partitioning: an approach for depression severity prediction with class balancing using machine learning algorithms.特征分组划分:一种使用机器学习算法进行抑郁严重程度预测和类别平衡的方法。
BMC Med Res Methodol. 2024 Jun 3;24(1):123. doi: 10.1186/s12874-024-02249-8.
5
Application of KNN-based isometric mapping and fuzzy c-means algorithm to predict short-term rockburst risk in deep underground projects.基于 KNN 的等距映射和模糊 C 均值算法在深部地下工程中预测短期岩爆风险的应用。
Front Public Health. 2022 Oct 20;10:1023890. doi: 10.3389/fpubh.2022.1023890. eCollection 2022.
6
Effective treatment of imbalanced datasets in health care using modified SMOTE coupled with stacked deep learning algorithms.使用改进的SMOTE与堆叠深度学习算法相结合有效处理医疗保健中的不平衡数据集。
Appl Nanosci. 2023;13(3):1829-1840. doi: 10.1007/s13204-021-02063-4. Epub 2022 Feb 3.
7
A hybrid Stacking-SMOTE model for optimizing the prediction of autistic genes.一种混合的堆叠-SMOTE 模型,用于优化自闭症基因预测。
BMC Bioinformatics. 2023 Oct 6;24(1):379. doi: 10.1186/s12859-023-05501-y.
8
Long-term prediction modeling of shallow rockburst with small dataset based on machine learning.基于机器学习的小数据集浅部岩爆长期预测建模
Sci Rep. 2024 Jul 12;14(1):16131. doi: 10.1038/s41598-024-64107-3.
9
A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods.基于 GA 堆叠的智能家居能耗预测集成方法研究:集成方法比较
J Environ Manage. 2024 Jul;364:121264. doi: 10.1016/j.jenvman.2024.121264. Epub 2024 Jun 12.
10
Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment.用于大规模评估中作弊检测的堆叠集成机器学习算法探索
Educ Psychol Meas. 2023 Aug;83(4):831-854. doi: 10.1177/00131644221117193. Epub 2022 Aug 13.

引用本文的文献

1
Event recognition technology and short-term rockburst early warning model based on microseismic monitoring and ensemble learning.基于微震监测与集成学习的事件识别技术及短期岩爆预警模型
Sci Rep. 2025 May 28;15(1):18674. doi: 10.1038/s41598-025-03657-6.
2
Interpretable real-time monitoring of short-term rockbursts in underground spaces based on microseismic activities.基于微震活动的地下空间短期岩爆可解释实时监测
Sci Rep. 2025 Jan 6;15(1):911. doi: 10.1038/s41598-024-85042-3.
3
Clustering and classification for dry bean feature imbalanced data.

本文引用的文献

1
Selective oversampling approach for strongly imbalanced data.针对严重不平衡数据的选择性过采样方法。
PeerJ Comput Sci. 2021 Jun 18;7:e604. doi: 10.7717/peerj-cs.604. eCollection 2021.
2
Do Additional Features Help or Hurt Category Learning? The Curse of Dimensionality in Human Learners.额外特征有助于还是有损于类别学习?人类学习者维度诅咒。
Cogn Sci. 2019 Mar;43(3):e12724. doi: 10.1111/cogs.12724.
3
Prior knowledge enhances the category dimensionality effect.先验知识增强了类别维度效应。
干豆特征不平衡数据的聚类与分类
Sci Rep. 2024 Dec 28;14(1):31058. doi: 10.1038/s41598-024-82253-6.
4
Investigation and application of data balancing and combined discriminant model in rock burst severity prediction.岩爆烈度预测中数据平衡与组合判别模型的研究与应用
Sci Rep. 2024 Nov 29;14(1):29657. doi: 10.1038/s41598-024-81307-z.
5
AdaBoost model for rockburst intensity prediction considering class differences and quantitative characterization of misclassification difference.考虑类别差异和误分类差异定量表征的岩爆强度预测AdaBoost模型
Sci Rep. 2024 Nov 15;14(1):28232. doi: 10.1038/s41598-024-79141-4.
6
Long-term prediction modeling of shallow rockburst with small dataset based on machine learning.基于机器学习的小数据集浅部岩爆长期预测建模
Sci Rep. 2024 Jul 12;14(1):16131. doi: 10.1038/s41598-024-64107-3.
Mem Cognit. 2008 Mar;36(2):256-70. doi: 10.3758/mc.36.2.256.
4
Category dimensionality and feature knowledge: when more features are learned as easily as fewer.类别维度与特征知识:何时学习更多特征与学习更少特征一样容易。
J Exp Psychol Learn Mem Cogn. 2006 Mar;32(2):301-315. doi: 10.1037/0278-7393.32.3.301.