• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于遗传算法的集成模型,用于高效识别白细胞介素6诱导肽。

A genetic algorithm-based ensemble model for efficiently identifying interleukin 6 inducing peptides.

作者信息

Harun-Or-Roshid Md, Kurata Hiroyuki

机构信息

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan.

出版信息

Sci Rep. 2025 Jul 1;15(1):21213. doi: 10.1038/s41598-025-05491-2.

DOI:10.1038/s41598-025-05491-2
PMID:40594650
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12215643/
Abstract

Interleukin-6 (IL-6) is a cytokine with diverse biological activities that contribute to a variety of physiologic and immune responses. IL-6-inducing peptides are the short protein fragments that are critical for playing a contributing role in biological processes. Extensive research has advanced the development of IL-6-inducing peptides, but identifying these peptides experimentally remains time-consuming, labor-intensive, and costly. Therefore, computational prediction has gained attention as an alternative method. Meanwhile, some computational methods have already been developed, but they suffer from insufficient accuracy and inadequate feature engineering. In this study, we developed PredIL6, an advanced ensemble learning model that precisely identifies IL-6-inducing peptides by combining probability scores from 148 baseline machine learning and deep learning models, using a genetic algorithm-based meta-classifier. A forward feature selection method was used to construct the ensemble model, which consists of 20 baseline or single-feature models, including AAINDEX, BLOSUM62, and language models (ESM-2 and word2vec). PredIL6 outperformed existing state-of-the-art methods, achieving accuracy values of 0.934 and 0.899 on the training and test sets, respectively. Thus, PredIL6 is a powerful tool for expediting the identification of IL-6-inducing peptides. A freely available web application and a standalone PredIL6 program are provided.

摘要

白细胞介素-6(IL-6)是一种具有多种生物活性的细胞因子,可促成多种生理和免疫反应。IL-6诱导肽是短蛋白质片段,在生物过程中发挥作用至关重要。广泛的研究推动了IL-6诱导肽的发展,但通过实验鉴定这些肽仍然耗时、费力且成本高昂。因此,计算预测作为一种替代方法受到了关注。同时,已经开发了一些计算方法,但它们存在准确性不足和特征工程不够的问题。在本研究中,我们开发了PredIL6,这是一种先进的集成学习模型,通过使用基于遗传算法的元分类器,结合148个基线机器学习和深度学习模型的概率分数,精确识别IL-6诱导肽。采用前向特征选择方法构建集成模型,该模型由20个基线或单特征模型组成,包括氨基酸指数(AAINDEX)、布罗莫尔矩阵62(BLOSUM62)和语言模型(ESM-2和词向量(word2vec))。PredIL6优于现有的最先进方法,在训练集和测试集上的准确率分别达到0.934和0.899。因此,PredIL6是加速鉴定IL-6诱导肽的强大工具。我们提供了一个免费的网络应用程序和一个独立的PredIL6程序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/197beaa89277/41598_2025_5491_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/0c58b835af13/41598_2025_5491_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/83d6bb1e5cf3/41598_2025_5491_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/0a1587dd2e71/41598_2025_5491_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/60283b46de31/41598_2025_5491_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/197beaa89277/41598_2025_5491_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/0c58b835af13/41598_2025_5491_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/83d6bb1e5cf3/41598_2025_5491_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/0a1587dd2e71/41598_2025_5491_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/60283b46de31/41598_2025_5491_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ced/12215643/197beaa89277/41598_2025_5491_Fig5_HTML.jpg

相似文献

1
A genetic algorithm-based ensemble model for efficiently identifying interleukin 6 inducing peptides.一种基于遗传算法的集成模型,用于高效识别白细胞介素6诱导肽。
Sci Rep. 2025 Jul 1;15(1):21213. doi: 10.1038/s41598-025-05491-2.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
ToxinPred 3.0: An improved method for predicting the toxicity of peptides.ToxinPred 3.0:一种改进的多肽毒性预测方法。
Comput Biol Med. 2024 Sep;179:108926. doi: 10.1016/j.compbiomed.2024.108926. Epub 2024 Jul 21.
4
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
5
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
7
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
8
Blood biomarkers for the non-invasive diagnosis of endometriosis.用于子宫内膜异位症无创诊断的血液生物标志物。
Cochrane Database Syst Rev. 2016 May 1;2016(5):CD012179. doi: 10.1002/14651858.CD012179.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
10
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

本文引用的文献

1
PredIL13: Stacking a variety of machine and deep learning methods with ESM-2 language model for identifying IL13-inducing peptides.PredIL13:结合多种机器和深度学习方法以及 ESM-2 语言模型,用于识别诱导 IL13 的肽。
PLoS One. 2024 Aug 22;19(8):e0309078. doi: 10.1371/journal.pone.0309078. eCollection 2024.
2
Meta-2OM: A multi-classifier meta-model for the accurate prediction of RNA 2'-O-methylation sites in human RNA.Meta-2OM:一种用于准确预测人类 RNA 2'-O-甲基化位点的多分类器元模型。
PLoS One. 2024 Jun 26;19(6):e0305406. doi: 10.1371/journal.pone.0305406. eCollection 2024.
3
MLm5C: A high-precision human RNA 5-methylcytosine sites predictor based on a combination of hybrid machine learning models.
MLm5C:一种基于混合机器学习模型组合的高精度人类 RNA 5-甲基胞嘧啶位点预测器。
Methods. 2024 Jul;227:37-47. doi: 10.1016/j.ymeth.2024.05.004. Epub 2024 May 8.
4
Stack-DHUpred: Advancing the accuracy of dihydrouridine modification sites detection via stacking approach.Stack-DHUpred:通过堆叠方法提高二氢尿嘧啶修饰位点检测的准确性。
Comput Biol Med. 2024 Feb;169:107848. doi: 10.1016/j.compbiomed.2023.107848. Epub 2023 Dec 13.
5
Interleukin 6: at the interface of human health and disease.白细胞介素 6:在人类健康与疾病的交界处。
Front Immunol. 2023 Sep 28;14:1255533. doi: 10.3389/fimmu.2023.1255533. eCollection 2023.
6
MVIL6: Accurate identification of IL-6-induced peptides using multi-view feature learning.MVIL6:使用多视图特征学习准确识别白细胞介素-6诱导的肽段。
Int J Biol Macromol. 2023 Aug 15;246:125412. doi: 10.1016/j.ijbiomac.2023.125412. Epub 2023 Jun 14.
7
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
8
A comprehensive meta-analysis comprising 149 case-control studies to investigate the association between IL-6 gene rs1800795 polymorphism and multiple disease risk.一项综合荟萃分析,涵盖149项病例对照研究,以调查白细胞介素-6基因rs1800795多态性与多种疾病风险之间的关联。
Gene. 2023 Apr 20;861:147234. doi: 10.1016/j.gene.2023.147234. Epub 2023 Feb 1.
9
iACVP: markedly enhanced identification of anti-coronavirus peptides using a dataset-specific word2vec model.iACVP:使用特定于数据集的 word2vec 模型显著提高了抗病毒肽的鉴定能力。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac265.
10
iAtbP-Hyb-EnC: Prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model.iAtbP-Hyb-EnC:基于异质特征表示和遗传算法的集合学习模型对抗结核肽的预测。
Comput Biol Med. 2021 Oct;137:104778. doi: 10.1016/j.compbiomed.2021.104778. Epub 2021 Aug 25.