• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PredIL13:结合多种机器和深度学习方法以及 ESM-2 语言模型,用于识别诱导 IL13 的肽。

PredIL13: Stacking a variety of machine and deep learning methods with ESM-2 language model for identifying IL13-inducing peptides.

机构信息

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Kawazu, Iizuka, Fukuoka, Japan.

出版信息

PLoS One. 2024 Aug 22;19(8):e0309078. doi: 10.1371/journal.pone.0309078. eCollection 2024.

DOI:10.1371/journal.pone.0309078
PMID:39172871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11340954/
Abstract

Interleukin (IL)-13 has emerged as one of the recently identified cytokine. Since IL-13 causes the severity of COVID-19 and alters crucial biological processes, it is urgent to explore novel molecules or peptides capable of including IL-13. Computational prediction has received attention as a complementary method to in-vivo and in-vitro experimental identification of IL-13 inducing peptides, because experimental identification is time-consuming, laborious, and expensive. A few computational tools have been presented, including the IL13Pred and iIL13Pred. To increase prediction capability, we have developed PredIL13, a cutting-edge ensemble learning method with the latest ESM-2 protein language model. This method stacked the probability scores outputted by 168 single-feature machine/deep learning models, and then trained a logistic regression-based meta-classifier with the stacked probability score vectors. The key technology was to implement ESM-2 and to select the optimal single-feature models according to their absolute weight coefficient for logistic regression (AWCLR), an indicator of the importance of each single-feature model. Especially, the sequential deletion of single-feature models based on the iterative AWCLR ranking (SDIWC) method constructed the meta-classifier consisting of the top 16 single-feature models, named PredIL13, while considering the model's accuracy. The PredIL13 greatly outperformed the-state-of-the-art predictors, thus is an invaluable tool for accelerating the detection of IL13-inducing peptide within the human genome.

摘要

白细胞介素 (IL)-13 已成为最近确定的细胞因子之一。由于 IL-13 导致 COVID-19 的严重程度并改变关键的生物学过程,因此迫切需要探索能够包括 IL-13 的新型分子或肽。计算预测作为体内和体外实验鉴定 IL-13 诱导肽的补充方法受到了关注,因为实验鉴定既耗时、费力又昂贵。已经提出了一些计算工具,包括 IL13Pred 和 iIL13Pred。为了提高预测能力,我们开发了 PredIL13,这是一种基于最新 ESM-2 蛋白质语言模型的前沿集成学习方法。该方法堆叠了 168 个单特征机器/深度学习模型输出的概率得分,然后使用堆叠的概率得分向量训练基于逻辑回归的元分类器。关键技术是实现 ESM-2,并根据其用于逻辑回归的绝对权重系数 (AWCLR) 为每个单特征模型选择最佳单特征模型,这是每个单特征模型重要性的指标。特别是,基于迭代 AWCLR 排序的单特征模型序列删除 (SDIWC) 方法构建了由前 16 个单特征模型组成的元分类器,名为 PredIL13,同时考虑了模型的准确性。PredIL13 大大优于最先进的预测器,因此是加速在人类基因组中检测 IL13 诱导肽的宝贵工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/caac780d223e/pone.0309078.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/91c5951b5df0/pone.0309078.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/1777e9398074/pone.0309078.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/5ea13d519c35/pone.0309078.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/b12dce9e093d/pone.0309078.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/0cdb74f4b251/pone.0309078.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/cddfca7ad61c/pone.0309078.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/1505116da1fa/pone.0309078.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/caac780d223e/pone.0309078.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/91c5951b5df0/pone.0309078.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/1777e9398074/pone.0309078.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/5ea13d519c35/pone.0309078.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/b12dce9e093d/pone.0309078.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/0cdb74f4b251/pone.0309078.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/cddfca7ad61c/pone.0309078.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/1505116da1fa/pone.0309078.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5492/11340954/caac780d223e/pone.0309078.g008.jpg

相似文献

1
PredIL13: Stacking a variety of machine and deep learning methods with ESM-2 language model for identifying IL13-inducing peptides.PredIL13:结合多种机器和深度学习方法以及 ESM-2 语言模型,用于识别诱导 IL13 的肽。
PLoS One. 2024 Aug 22;19(8):e0309078. doi: 10.1371/journal.pone.0309078. eCollection 2024.
2
iIL13Pred: improved prediction of IL-13 inducing peptides using popular machine learning classifiers.iIL13Pred:使用流行的机器学习分类器改进 IL-13 诱导肽的预测。
BMC Bioinformatics. 2023 Apr 11;24(1):141. doi: 10.1186/s12859-023-05248-6.
3
mACPpred 2.0: Stacked Deep Learning for Anticancer Peptide Prediction with Integrated Spatial and Probabilistic Feature Representations.mACPpred 2.0:具有集成空间和概率特征表示的用于抗癌肽预测的堆叠深度学习。
J Mol Biol. 2024 Sep 1;436(17):168687. doi: 10.1016/j.jmb.2024.168687. Epub 2024 Jun 25.
4
StackIL6: a stacking ensemble model for improving the prediction of IL-6 inducing peptides.StackIL6:一种用于提高白细胞介素 6 诱导肽预测能力的堆叠集成模型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab172.
5
PLMACPred prediction of anticancer peptides based on protein language model and wavelet denoising transformation.基于蛋白质语言模型和小波去噪变换的抗癌肽 PLMACPred 预测。
Sci Rep. 2024 Jul 23;14(1):16992. doi: 10.1038/s41598-024-67433-8.
6
UsIL-6: An unbalanced learning strategy for identifying IL-6 inducing peptides by undersampling technique.UsIL-6:一种通过欠采样技术识别 IL-6 诱导肽的不平衡学习策略。
Comput Methods Programs Biomed. 2024 Jun;250:108176. doi: 10.1016/j.cmpb.2024.108176. Epub 2024 Apr 12.
7
SEP-AlgPro: An efficient allergen prediction tool utilizing traditional machine learning and deep learning techniques with protein language model features.SEP-AlgPro:一种利用传统机器学习和深度学习技术以及蛋白质语言模型特征的高效过敏原预测工具。
Int J Biol Macromol. 2024 Jul;273(Pt 2):133085. doi: 10.1016/j.ijbiomac.2024.133085. Epub 2024 Jun 11.
8
Prediction of peptide hormones using an ensemble of machine learning and similarity-based methods.基于机器学习和基于相似度的方法的组合预测肽激素。
Proteomics. 2024 Oct;24(20):e2400004. doi: 10.1002/pmic.202400004. Epub 2024 May 27.
9
MultiFeatVotPIP: a voting-based ensemble learning framework for predicting proinflammatory peptides.MultiFeatVotPIP:一种基于投票的集成学习框架,用于预测促炎肽。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae505.
10
iNP_ESM: Neuropeptide Identification Based on Evolutionary Scale Modeling and Unified Representation Embedding Features.iNP_ESM:基于进化尺度建模和统一表示嵌入特征的神经肽识别。
Int J Mol Sci. 2024 Jun 27;25(13):7049. doi: 10.3390/ijms25137049.

引用本文的文献

1
A genetic algorithm-based ensemble model for efficiently identifying interleukin 6 inducing peptides.一种基于遗传算法的集成模型,用于高效识别白细胞介素6诱导肽。
Sci Rep. 2025 Jul 1;15(1):21213. doi: 10.1038/s41598-025-05491-2.
2
EnGCI: enhancing GPCR-compound interaction prediction via large molecular models and KAN network.EnGCI:通过大分子模型和KAN网络增强GPCR-化合物相互作用预测
BMC Biol. 2025 May 15;23(1):136. doi: 10.1186/s12915-025-02238-3.

本文引用的文献

1
Stack-DHUpred: Advancing the accuracy of dihydrouridine modification sites detection via stacking approach.Stack-DHUpred:通过堆叠方法提高二氢尿嘧啶修饰位点检测的准确性。
Comput Biol Med. 2024 Feb;169:107848. doi: 10.1016/j.compbiomed.2023.107848. Epub 2023 Dec 13.
2
iIL13Pred: improved prediction of IL-13 inducing peptides using popular machine learning classifiers.iIL13Pred:使用流行的机器学习分类器改进 IL-13 诱导肽的预测。
BMC Bioinformatics. 2023 Apr 11;24(1):141. doi: 10.1186/s12859-023-05248-6.
3
Evolutionary-scale prediction of atomic-level protein structure with a language model.
用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
4
SARS-CoV2 Endotheliopathy: Insights from Single Cell RNAseq.严重急性呼吸综合征冠状病毒2型(SARS-CoV-2)血管内皮病变:来自单细胞RNA测序的见解
Am J Respir Crit Care Med. 2022 Nov 1;206(9):1178-1179. doi: 10.1164/rccm.202206-1105LE.
5
iACVP: markedly enhanced identification of anti-coronavirus peptides using a dataset-specific word2vec model.iACVP:使用特定于数据集的 word2vec 模型显著提高了抗病毒肽的鉴定能力。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac265.
6
SARS-CoV-2 infection of airway cells causes intense viral and cell shedding, two spreading mechanisms affected by IL-13.SARS-CoV-2 感染气道细胞会导致强烈的病毒和细胞脱落,这两种传播机制都受 IL-13 的影响。
Proc Natl Acad Sci U S A. 2022 Apr 19;119(16):e2119680119. doi: 10.1073/pnas.2119680119. Epub 2022 Mar 30.
7
IL13Pred: A method for predicting immunoregulatory cytokine IL-13 inducing peptides.IL13Pred:一种预测免疫调节细胞因子白细胞介素-13诱导肽的方法。
Comput Biol Med. 2022 Apr;143:105297. doi: 10.1016/j.compbiomed.2022.105297. Epub 2022 Feb 8.
8
IL-13 is a driver of COVID-19 severity.IL-13 是导致 COVID-19 严重程度的因素之一。
JCI Insight. 2021 Aug 9;6(15):e150107. doi: 10.1172/jci.insight.150107.
9
LSTM-PHV: prediction of human-virus protein-protein interactions by LSTM with word2vec.LSTM-PHV:基于词向量的 LSTM 预测人类病毒蛋白质相互作用
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab228.
10
NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning.神经肽预测模型 FRL:基于特征表示学习的神经肽识别可解释预测模型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab167.