• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于随机森林的特征选择算法在运动效能评估中的研究与性能分析

Research and performance analysis of random forest-based feature selection algorithm in sports effectiveness evaluation.

作者信息

Li Yujiao, Mu Yingjie

机构信息

Harbin Normal University, Harbin, 150025, China.

出版信息

Sci Rep. 2024 Nov 1;14(1):26275. doi: 10.1038/s41598-024-76706-1.

DOI:10.1038/s41598-024-76706-1
PMID:39487220
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11530685/
Abstract

The rapid progress in fields such as data mining and machine learning, as well as the explosive growth of sports big data, have posed new challenges to the research of sports big data. Most of the available sports data mining techniques concentrates on extracting and constructing effective features for basic sports data, which cannot be achieved simply by using data statistics. Especially in the targeted mining of sports data, traditional mining techniques still have shortcomings such as low classification accuracy and insufficient refinement. In order to solve the problem of low accuracy in traditional mining methods, the study combines the random forest algorithm with the artificial raindrop algorithm, and adopts a sports data mining method based on feature selection to achieve effective analysis of sports big data. This study is based on the evaluation method of motion effects using random forests, and uses feature extraction algorithms to study the motion effect impacts. It uses the information gain index to rank the importance of features and accurately gain the degree of influence of exercise on various indicators of the human body. Through simulation verification, the algorithm proposed by the research institute performs the best in accuracy and FI scores on the training and testing sets, with accuracies of 0.849 ± 0.021 and 0.819 ± 0.022, respectively, and F1 scores of 0.837 ± 0.020 and 0.864 ± 0.021, respectively. This indicates that the algorithm proposed by the research institute has high classification accuracy and performance proves that the Random Forest-based feature selection algorithm established in this study is superior to the existing traditional feature extraction and extraction methods in terms of both performance and accuracy. The proposal of this data analysis method has achieved accurate and efficient utilization of sports big data, which is of great significance for the development of the sports education industry.

摘要

数据挖掘和机器学习等领域的快速发展,以及体育大数据的爆炸式增长,给体育大数据的研究带来了新的挑战。现有的大多数体育数据挖掘技术都集中在为基础体育数据提取和构建有效特征上,而这无法简单地通过数据统计来实现。特别是在体育数据的针对性挖掘方面,传统挖掘技术仍存在分类准确率低和精细化不足等缺点。为了解决传统挖掘方法中准确率低的问题,该研究将随机森林算法与人工雨滴算法相结合,采用基于特征选择的体育数据挖掘方法,以实现对体育大数据的有效分析。本研究基于随机森林的运动效果评估方法,利用特征提取算法研究运动效果影响。它使用信息增益指数对特征的重要性进行排序,准确获取运动对人体各项指标的影响程度。通过仿真验证,该研究所提出的算法在训练集和测试集上的准确率和F1分数表现最佳,准确率分别为0.849±0.021和0.819±0.022,F1分数分别为0.837±0.020和0.864±0.021。这表明该研究所提出的算法具有较高的分类准确率,其性能证明了本研究建立的基于随机森林的特征选择算法在性能和准确率方面均优于现有的传统特征提取方法。这种数据分析方法的提出实现了对体育大数据的准确高效利用,对体育教育产业的发展具有重要意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/a64a09578f6b/41598_2024_76706_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/8348bf8e1f16/41598_2024_76706_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/7457e9cf288a/41598_2024_76706_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/4766df71ad97/41598_2024_76706_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/b186d37edad2/41598_2024_76706_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/4bac1661d2b2/41598_2024_76706_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/9ce6dd544b7e/41598_2024_76706_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/f2c2c8b676da/41598_2024_76706_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/ec54762c2ff7/41598_2024_76706_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/53ccb7e1bece/41598_2024_76706_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/cb0bf8e3234b/41598_2024_76706_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/a64a09578f6b/41598_2024_76706_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/8348bf8e1f16/41598_2024_76706_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/7457e9cf288a/41598_2024_76706_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/4766df71ad97/41598_2024_76706_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/b186d37edad2/41598_2024_76706_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/4bac1661d2b2/41598_2024_76706_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/9ce6dd544b7e/41598_2024_76706_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/f2c2c8b676da/41598_2024_76706_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/ec54762c2ff7/41598_2024_76706_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/53ccb7e1bece/41598_2024_76706_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/cb0bf8e3234b/41598_2024_76706_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7be3/11530685/a64a09578f6b/41598_2024_76706_Fig11_HTML.jpg

相似文献

1
Research and performance analysis of random forest-based feature selection algorithm in sports effectiveness evaluation.基于随机森林的特征选择算法在运动效能评估中的研究与性能分析
Sci Rep. 2024 Nov 1;14(1):26275. doi: 10.1038/s41598-024-76706-1.
2
Application of Motion Effect Evaluation Algorithm Based on Random Forest.基于随机森林的运动效果评估算法的应用。
Comput Intell Neurosci. 2022 Oct 6;2022:2039423. doi: 10.1155/2022/2039423. eCollection 2022.
3
Optimization of sports effect evaluation technology from random forest algorithm and elastic network algorithm.从随机森林算法和弹性网络算法优化运动效果评估技术。
PLoS One. 2023 Oct 20;18(10):e0292557. doi: 10.1371/journal.pone.0292557. eCollection 2023.
4
[Random forest classification of Callicarpa nudiflora from WorldView-3 imagery based on optimized feature space].基于优化特征空间的利用WorldView-3影像对裸花紫珠进行随机森林分类
Zhongguo Zhong Yao Za Zhi. 2019 Oct;44(19):4073-4077. doi: 10.19540/j.cnki.cjcmm.20190731.104.
5
Improved random forest classification model combined with C5.0 algorithm for vegetation feature analysis in non-agricultural environments.结合C5.0算法的改进随机森林分类模型用于非农业环境中的植被特征分析
Sci Rep. 2024 May 6;14(1):10367. doi: 10.1038/s41598-024-60066-x.
6
Feature Selection and Feature Stability Measurement Method for High-Dimensional Small Sample Data Based on Big Data Technology.基于大数据技术的高维小样本数据特征选择与特征稳定性测量方法。
Comput Intell Neurosci. 2021 Sep 23;2021:3597051. doi: 10.1155/2021/3597051. eCollection 2021.
7
Improving the Accuracy of Feature Selection in Big Data Mining Using Accelerated Flower Pollination (AFP) Algorithm.使用加速花授粉(AFP)算法提高大数据挖掘中特征选择的准确性。
J Med Syst. 2019 Mar 9;43(4):96. doi: 10.1007/s10916-019-1200-1.
8
RST-DE: Rough Sets-Based New Differential Evolution Algorithm for Scalable Big Data Feature Selection in Distributed Computing Platforms.基于粗糙集的新差分进化算法在分布式计算平台中的可扩展大数据特征选择。
Big Data. 2022 Aug;10(4):356-367. doi: 10.1089/big.2021.0267. Epub 2022 May 4.
9
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
10
Upper-Limb Motion Recognition Based on Hybrid Feature Selection: Algorithm Development and Validation.基于混合特征选择的上肢运动识别:算法开发与验证。
JMIR Mhealth Uhealth. 2021 Sep 2;9(9):e24402. doi: 10.2196/24402.

引用本文的文献

1
Fusing wrist pulse and ECG data for enhanced identification of coronary heart disease and its complications.融合手腕脉搏和心电图数据以增强对冠心病及其并发症的识别。
Front Physiol. 2025 Jul 29;16:1628309. doi: 10.3389/fphys.2025.1628309. eCollection 2025.
2
Hybrid lion and exponential PSO-based metaheuristic clustering approach for efficient dynamic data stream management.基于混合狮子和指数粒子群优化的元启发式聚类方法用于高效动态数据流管理
Sci Rep. 2025 Jul 1;15(1):22343. doi: 10.1038/s41598-025-07404-9.

本文引用的文献

1
Identifying low-PM exposure commuting routes for cyclists through modeling with the random forest algorithm based on low-cost sensor measurements in three Asian cities.通过基于亚洲三个城市低成本传感器测量数据的随机森林算法建模,为骑行者识别低颗粒物暴露的通勤路线。
Environ Pollut. 2022 Feb 1;294:118597. doi: 10.1016/j.envpol.2021.118597. Epub 2021 Nov 27.
2
Effects of combat sports on functional network connectivity in adolescents.格斗运动对青少年功能网络连接的影响。
Neuroradiology. 2021 Nov;63(11):1863-1871. doi: 10.1007/s00234-021-02713-y. Epub 2021 Apr 29.
3
Lupus or not? SLE Risk Probability Index (SLERPI): a simple, clinician-friendly machine learning-based model to assist the diagnosis of systemic lupus erythematosus.
是否为狼疮?SLE 风险概率指数 (SLERPI):一种简单、便于临床医生使用的基于机器学习的模型,用于辅助系统性红斑狼疮的诊断。
Ann Rheum Dis. 2021 Jun;80(6):758-766. doi: 10.1136/annrheumdis-2020-219069. Epub 2021 Feb 10.
4
Effects of chronic exercise on the inhibitory control of children and adolescents: A systematic review and meta-analysis.慢性运动对儿童和青少年抑制控制的影响:系统评价和荟萃分析。
Scand J Med Sci Sports. 2021 Jun;31(6):1196-1208. doi: 10.1111/sms.13934. Epub 2021 Mar 10.