• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用概率随机森林、主成分分析和遗传算法的新型物种分布预测混合模型。

A novel hybrid model for species distribution prediction using probabilistic random forest, principal component analysis and genetic algorithm.

作者信息

Adekunle Taiwo A, Ogundoyin Ibrahim K, Akanbi Caleb O

机构信息

Department of Computer Science, Osun State University, Osogbo, Nigeria.

出版信息

PLoS One. 2025 Sep 10;20(9):e0326122. doi: 10.1371/journal.pone.0326122. eCollection 2025.

DOI:10.1371/journal.pone.0326122
PMID:40929112
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12422458/
Abstract

Probabilistic Random Forest is an extension of the traditional Random Forest machine learning algorithm that is one of the frequently used machine learning algorithms employed for species distribution modeling. However, with the use of complex dataset for predicting the presence or absence of the species, It is essential that feature extraction is important to generate optimal prediction that can affect the model accuracy and AUC score of the model simulation. In this paper, we integrated the Genetic Algorithm Optimization technique, which is popular for its excellent feature extraction technique, to enhance the predictive performance of the PRF Model. a novel hybrid algorithm the genetically optimized probabilistic random forest algorithm, designed for predicting the distribution of mastomys natalensis in Nigeria. The model was also compared with existing models for dimensionality reduction with other optimization techniques, such as Principal Component Analysis, Grey Wolf, Optimizer optimized backpropagation neural network algorithm (GNNA), Butterfly Optimization Algorithm. These models were evaluated using four performance metrics, accuracy, the areas under curve, sensitivity, specificity, F1_score and precision. We also examined the spatial predictive distribution of the models. The results generated that the predictive performance of PRFGA, significantly improved compared to PRFPCA, GNNA and PRFBOA in predicting the presence or absence of mastomys natalensis with a presence only and pseudo-absence sample set. the PRFGA demonstrated a high predictive power in predicting the spatial distribution of the presence or absence of mastomys natalensis in Nigeria. The integration of the Genetic Algorithm optimization technique, stems from its renowned ability to address the specific challenges of data uncertainty and high-dimensionality reduction in feature extraction sets of SDMs, to enhance the performance of the PRF model.

摘要

概率随机森林是传统随机森林机器学习算法的扩展,传统随机森林是物种分布建模中常用的机器学习算法之一。然而,在使用复杂数据集预测物种的存在与否时,特征提取对于生成能够影响模型准确性和模型模拟AUC分数的最优预测至关重要。在本文中,我们集成了以其出色的特征提取技术而闻名的遗传算法优化技术,以提高PRF模型的预测性能。一种新颖的混合算法——遗传优化概率随机森林算法,旨在预测尼日利亚家鼠的分布。该模型还与使用其他优化技术(如主成分分析、灰狼优化器优化的反向传播神经网络算法(GNNA)、蝴蝶优化算法)进行降维的现有模型进行了比较。使用四个性能指标(准确率、曲线下面积、灵敏度、特异性、F1分数和精确率)对这些模型进行了评估。我们还检查了模型的空间预测分布。结果表明,在使用仅存在和伪不存在样本集预测家鼠的存在与否时,PRFGA的预测性能与PRFPCA、GNNA和PRFBOA相比有显著提高。PRFGA在预测尼日利亚家鼠存在与否的空间分布方面表现出很高的预测能力。遗传算法优化技术的集成源于其解决物种分布模型特征提取集中数据不确定性和高维降维特定挑战的卓越能力,从而提高了PRF模型的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/b273e23be8c2/pone.0326122.g025.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/8828916e931f/pone.0326122.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/03511b84975b/pone.0326122.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/4de24af22c4e/pone.0326122.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/72aa854602ed/pone.0326122.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/50f804ea960c/pone.0326122.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/de0c5d2dcad2/pone.0326122.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/f233527e803a/pone.0326122.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/5ebc9810341f/pone.0326122.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/e76d87f86ec6/pone.0326122.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/838bf3aa221f/pone.0326122.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/adfc5e220998/pone.0326122.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/11a3adda621d/pone.0326122.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/01ab90e271ad/pone.0326122.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/3d363d2cc839/pone.0326122.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/1322487ba814/pone.0326122.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/ebbb2334205e/pone.0326122.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/6c656bc949a8/pone.0326122.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/dee0bb2f70ba/pone.0326122.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/c70bb84268cb/pone.0326122.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/19059ab1fe5e/pone.0326122.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/32932de80f14/pone.0326122.g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/896e2e258a13/pone.0326122.g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/0b0402c6bdd9/pone.0326122.g023.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/46e73474a0d2/pone.0326122.g024.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/b273e23be8c2/pone.0326122.g025.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/8828916e931f/pone.0326122.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/03511b84975b/pone.0326122.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/4de24af22c4e/pone.0326122.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/72aa854602ed/pone.0326122.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/50f804ea960c/pone.0326122.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/de0c5d2dcad2/pone.0326122.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/f233527e803a/pone.0326122.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/5ebc9810341f/pone.0326122.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/e76d87f86ec6/pone.0326122.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/838bf3aa221f/pone.0326122.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/adfc5e220998/pone.0326122.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/11a3adda621d/pone.0326122.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/01ab90e271ad/pone.0326122.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/3d363d2cc839/pone.0326122.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/1322487ba814/pone.0326122.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/ebbb2334205e/pone.0326122.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/6c656bc949a8/pone.0326122.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/dee0bb2f70ba/pone.0326122.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/c70bb84268cb/pone.0326122.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/19059ab1fe5e/pone.0326122.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/32932de80f14/pone.0326122.g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/896e2e258a13/pone.0326122.g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/0b0402c6bdd9/pone.0326122.g023.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/46e73474a0d2/pone.0326122.g024.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6d8/12422458/b273e23be8c2/pone.0326122.g025.jpg

相似文献

1
A novel hybrid model for species distribution prediction using probabilistic random forest, principal component analysis and genetic algorithm.一种使用概率随机森林、主成分分析和遗传算法的新型物种分布预测混合模型。
PLoS One. 2025 Sep 10;20(9):e0326122. doi: 10.1371/journal.pone.0326122. eCollection 2025.
2
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.
3
A study on the effectiveness of machine learning models for hepatitis prediction.关于机器学习模型用于肝炎预测有效性的研究。
Sci Rep. 2025 Aug 20;15(1):30659. doi: 10.1038/s41598-025-07104-4.
4
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
5
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
6
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
9
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
10
Machine learning based screening of biomarkers associated with cell death and immunosuppression of multiple life stages sepsis populations.基于机器学习对与多生命阶段脓毒症人群细胞死亡和免疫抑制相关生物标志物的筛选。
Sci Rep. 2025 Aug 19;15(1):30302. doi: 10.1038/s41598-025-14600-0.

本文引用的文献

1
A novel hybrid model for species distribution prediction using neural networks and Grey Wolf Optimizer algorithm.一种使用神经网络和灰狼优化算法的物种分布预测的新型混合模型。
Sci Rep. 2024 May 20;14(1):11505. doi: 10.1038/s41598-024-62285-8.
2
The dimensionality reductions of environmental variables have a significant effect on the performance of species distribution models.环境变量的降维对物种分布模型的性能有显著影响。
Ecol Evol. 2023 Nov 20;13(11):e10747. doi: 10.1002/ece3.10747. eCollection 2023 Nov.
3
Classification of Diabetes Using Feature Selection and Hybrid Al-Biruni Earth Radius and Dipper Throated Optimization.
基于特征选择以及混合阿尔-比鲁尼地球半径和北斗咽喉优化的糖尿病分类
Diagnostics (Basel). 2023 Jun 12;13(12):2038. doi: 10.3390/diagnostics13122038.
4
Probabilistic Random Forest improves bioactivity predictions close to the classification threshold by taking into account experimental uncertainty.概率随机森林通过考虑实验不确定性,改进了接近分类阈值的生物活性预测。
J Cheminform. 2021 Aug 19;13(1):62. doi: 10.1186/s13321-021-00539-7.
5
Evaluating collinearity effects on species distribution models: An approach based on virtual species simulation.评估物种分布模型中的共线性效应:一种基于虚拟物种模拟的方法。
PLoS One. 2018 Sep 11;13(9):e0202403. doi: 10.1371/journal.pone.0202403. eCollection 2018.
6
Active Tuberculosis Case Finding in Haiti.海地活动性肺结核病例发现
Am J Trop Med Hyg. 2017 Aug;97(2):433-435. doi: 10.4269/ajtmh.16-0674. Epub 2017 Jul 19.
7
At Home with and : Human-Rodent Interactions and Potential for Primary Transmission of Lassa Virus in Domestic Spaces.与拉沙病毒和家鼠共处:人与啮齿动物的相互作用以及拉沙病毒在家庭环境中主要传播的可能性
Am J Trop Med Hyg. 2017 Apr;96(4):935-943. doi: 10.4269/ajtmh.16-0675. Epub 2017 Feb 6.
8
Design of Probabilistic Random Forests with Applications to Anticancer Drug Sensitivity Prediction.用于抗癌药物敏感性预测的概率随机森林设计
Cancer Inform. 2016 Mar 31;14(Suppl 5):57-73. doi: 10.4137/CIN.S30794. eCollection 2015.
9
A working guide to boosted regression trees.提升回归树实用指南。
J Anim Ecol. 2008 Jul;77(4):802-13. doi: 10.1111/j.1365-2656.2008.01390.x. Epub 2008 Apr 8.