• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用改进差分进化优化算法的优化深度森林算法:宿主-病原体蛋白质-蛋白质相互作用预测实例

An optimized deep-forest algorithm using a modified differential evolution optimization algorithm: A case of host-pathogen protein-protein interaction prediction.

作者信息

Emmanuel Jerry, Isewon Itunuoluwa, Oyelade Jelili

机构信息

Department of Computer and Information Sciences, Covenant University, Ota, Nigeria.

Covenant Applied Informatics and Communication African Centre of Excellence (CApIC-ACE), Nigeria.

出版信息

Comput Struct Biotechnol J. 2025 Jan 26;27:595-611. doi: 10.1016/j.csbj.2025.01.020. eCollection 2025.

DOI:10.1016/j.csbj.2025.01.020
PMID:39995682
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11849198/
Abstract

Deep Forest employs forest structures and leverages deep architecture to learn feature vector information adaptively. However, deep forest-based models have limitations such as manual hyperparameter optimization and time and memory usage inefficiencies. Bayesian optimization is a widely used model-based hyperparameter optimization method. Evolutionary algorithms such as Differential Evolution (DE) have recently been introduced to improve Bayesian optimization's acquisition function. Despite its effectiveness, DE has a significant drawback as it relies on randomly selecting indices from the population of target vectors to construct donor vectors in search of optimal solutions. This randomness is ineffective, as suboptimal or redundant indices may be selected. Therefore, in this research we developed a modified differential evolution (DE) acquisition function for improved host-pathogen protein-protein interaction prediction. The modified DE introduces a weighted and adaptive donor vector technique that selects the best-fitted donor vectors as opposed to the random approach. This modified optimization approach was implemented in a deep forest model for automatic hyperparameter optimization. The performance of the optimized deep forest model was evaluated on human- protein sequence datasets using 10-fold cross-validation. The results were compared with standard optimization methods such as traditional Bayesian optimization, genetic algorithms, evolutionary strategies, and other machine learning models. The optimized model achieved an accuracy of 89.3 %, outperforming other models across all metrics, including a sensitivity of 85.4 % and a precision of 91.6 %. Additionally, the optimized model predicted seven novel host-pathogen interactions. Finally, the model was implemented as a web application which is accessible at http://dfh3pi.covenantuniversity.edu.ng.

摘要

深度森林采用森林结构并利用深度架构来自适应地学习特征向量信息。然而,基于深度森林的模型存在局限性,如手动超参数优化以及时间和内存使用效率低下等问题。贝叶斯优化是一种广泛使用的基于模型的超参数优化方法。最近引入了诸如差分进化(DE)等进化算法来改进贝叶斯优化的采集函数。尽管其有效,但DE有一个显著缺点,即它依赖于从目标向量群体中随机选择索引来构建供体向量以寻找最优解。这种随机性是无效的,因为可能会选择次优或冗余索引。因此,在本研究中,我们开发了一种改进的差分进化(DE)采集函数,以改进宿主 - 病原体蛋白质 - 蛋白质相互作用预测。改进后的DE引入了加权和自适应供体向量技术,与随机方法不同,它选择最适合的供体向量。这种改进的优化方法在深度森林模型中实现,用于自动超参数优化。使用10折交叉验证在人类蛋白质序列数据集上评估优化后的深度森林模型的性能。将结果与传统贝叶斯优化、遗传算法、进化策略等标准优化方法以及其他机器学习模型进行比较。优化后的模型准确率达到89.3%,在所有指标上均优于其他模型,包括85.4%的灵敏度和91.6%的精确率。此外,优化后的模型预测了七种新的宿主 - 病原体相互作用。最后,该模型被实现为一个网络应用程序,可通过http://dfh3pi.covenantuniversity.edu.ng访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/113f4494da9e/gr14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/af67ce1822de/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/defac7420115/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/58120bda75e3/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/20cf575a72c6/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/a63c024f5ec3/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/e66e1106139b/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/a3b52c7a3d08/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/01a5ab458690/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/189c64695d1a/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/ccbe16c1760d/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/d72d58864a74/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/dd9d909a98ab/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/89c392ba773b/gr12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/0c788e4a80f8/gr13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/113f4494da9e/gr14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/af67ce1822de/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/defac7420115/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/58120bda75e3/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/20cf575a72c6/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/a63c024f5ec3/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/e66e1106139b/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/a3b52c7a3d08/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/01a5ab458690/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/189c64695d1a/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/ccbe16c1760d/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/d72d58864a74/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/dd9d909a98ab/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/89c392ba773b/gr12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/0c788e4a80f8/gr13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e53/11849198/113f4494da9e/gr14.jpg

相似文献

1
An optimized deep-forest algorithm using a modified differential evolution optimization algorithm: A case of host-pathogen protein-protein interaction prediction.一种使用改进差分进化优化算法的优化深度森林算法:宿主-病原体蛋白质-蛋白质相互作用预测实例
Comput Struct Biotechnol J. 2025 Jan 26;27:595-611. doi: 10.1016/j.csbj.2025.01.020. eCollection 2025.
2
Heuristic hyperparameter optimization of deep learning models for genomic prediction.启发式深度学习模型的基因组预测超参数优化。
G3 (Bethesda). 2021 Jul 14;11(7). doi: 10.1093/g3journal/jkab032.
3
An improved hyperparameter optimization framework for AutoML systems using evolutionary algorithms.使用进化算法改进 AutoML 系统的超参数优化框架。
Sci Rep. 2023 Mar 23;13(1):4737. doi: 10.1038/s41598-023-32027-3.
4
Optimizing Machine Learning Algorithms for Landslide Susceptibility Mapping along the Karakoram Highway, Gilgit Baltistan, Pakistan: A Comparative Study of Baseline, Bayesian, and Metaheuristic Hyperparameter Optimization Techniques.优化巴基斯坦吉尔吉特-巴尔蒂斯坦喀喇昆仑公路沿线滑坡易发性制图的机器学习算法:基线、贝叶斯和元启发式超参数优化技术的比较研究
Sensors (Basel). 2023 Aug 1;23(15):6843. doi: 10.3390/s23156843.
5
Machine Learning-Based Boosted Regression Ensemble Combined with Hyperparameter Tuning for Optimal Adaptive Learning.基于机器学习的增强回归集成与超参数调整相结合,实现最优自适应学习。
Sensors (Basel). 2022 May 16;22(10):3776. doi: 10.3390/s22103776.
6
Ensemble effort estimation with metaheuristic hyperparameters and weight optimization for achieving accuracy.使用启发式超参数和权重优化进行集成努力估计,以实现准确性。
PLoS One. 2024 Apr 4;19(4):e0300296. doi: 10.1371/journal.pone.0300296. eCollection 2024.
7
SRADHO: statistical reduction approach with deep hyper optimization for disease classification using artificial intelligence.SRADHO:使用人工智能进行疾病分类的深度超优化统计归约方法。
Sci Rep. 2025 Jan 7;15(1):1245. doi: 10.1038/s41598-024-82838-1.
8
Optimizing Deep Learning Models with Improved BWO for TEC Prediction.使用改进的二进制鲸鱼优化算法优化深度学习模型用于对流层延迟预测
Biomimetics (Basel). 2024 Sep 22;9(9):575. doi: 10.3390/biomimetics9090575.
9
Machine Learning-Based Risk Factor Analysis and Prediction Model Construction for the Occurrence of Chronic Heart Failure: Health Ecologic Study.基于机器学习的慢性心力衰竭发生风险因素分析及预测模型构建:健康生态学研究
JMIR Med Inform. 2025 Jan 31;13:e64972. doi: 10.2196/64972.
10
Genetic Algorithm-based Convolutional Neural Network Feature Engineering for Optimizing Coronary Heart Disease Prediction Performance.基于遗传算法的卷积神经网络特征工程用于优化冠心病预测性能
Healthc Inform Res. 2024 Jul;30(3):234-243. doi: 10.4258/hir.2024.30.3.234. Epub 2024 Jul 31.

本文引用的文献

1
Employing Machine Learning Techniques to Detect Protein Function: A Survey, Experimental, and Empirical Evaluations.运用机器学习技术检测蛋白质功能:一项综述、实验及实证评估
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):1965-1986. doi: 10.1109/TCBB.2024.3427381. Epub 2024 Dec 10.
2
Improving classification accuracy of fine-tuned CNN models: Impact of hyperparameter optimization.提高微调卷积神经网络(CNN)模型的分类准确率:超参数优化的影响。
Heliyon. 2024 Feb 23;10(5):e26586. doi: 10.1016/j.heliyon.2024.e26586. eCollection 2024 Mar 15.
3
A novel multi-hybrid differential evolution algorithm for optimization of frame structures.
一种用于框架结构优化的新型多混合差分进化算法。
Sci Rep. 2024 Feb 28;14(1):4877. doi: 10.1038/s41598-024-54384-3.
4
Improved Differential Evolution Algorithm Guided by Best and Worst Positions Exploration Dynamics.基于最佳和最差位置探索动态引导的改进差分进化算法
Biomimetics (Basel). 2024 Feb 16;9(2):119. doi: 10.3390/biomimetics9020119.
5
Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster.启发式支持的主动机器学习:以预测黑腹果蝇必需发育阶段和免疫反应基因为例的研究。
PLoS One. 2023 Aug 9;18(8):e0288023. doi: 10.1371/journal.pone.0288023. eCollection 2023.
6
Predicting activatory and inhibitory drug-target interactions based on structural compound representations and genetically perturbed transcriptomes.基于结构化合物表示和遗传扰动转录组预测激活和抑制性药物-靶标相互作用。
PLoS One. 2023 Apr 12;18(4):e0282042. doi: 10.1371/journal.pone.0282042. eCollection 2023.
7
An improved hyperparameter optimization framework for AutoML systems using evolutionary algorithms.使用进化算法改进 AutoML 系统的超参数优化框架。
Sci Rep. 2023 Mar 23;13(1):4737. doi: 10.1038/s41598-023-32027-3.
8
On the choice of negative examples for prediction of host-pathogen protein interactions.关于用于预测宿主-病原体蛋白质相互作用的阴性实例的选择
Front Bioinform. 2022 Dec 15;2:1083292. doi: 10.3389/fbinf.2022.1083292. eCollection 2022.
9
A systematic review on the state-of-the-art strategies for protein representation.关于蛋白质表示的最新策略的系统综述。
Comput Biol Med. 2023 Jan;152:106440. doi: 10.1016/j.compbiomed.2022.106440. Epub 2022 Dec 17.
10
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.