• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iACVP:使用特定于数据集的 word2vec 模型显著提高了抗病毒肽的鉴定能力。

iACVP: markedly enhanced identification of anti-coronavirus peptides using a dataset-specific word2vec model.

机构信息

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan.

Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea.

出版信息

Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac265.

DOI:10.1093/bib/bbac265
PMID:35772910
Abstract

The COVID-19 pandemic caused several million deaths worldwide. Development of anti-coronavirus drugs is thus urgent. Unlike conventional non-peptide drugs, antiviral peptide drugs are highly specific, easy to synthesize and modify, and not highly susceptible to drug resistance. To reduce the time and expense involved in screening thousands of peptides and assaying their antiviral activity, computational predictors for identifying anti-coronavirus peptides (ACVPs) are needed. However, few experimentally verified ACVP samples are available, even though a relatively large number of antiviral peptides (AVPs) have been discovered. In this study, we attempted to predict ACVPs using an AVP dataset and a small collection of ACVPs. Using conventional features, a binary profile and a word-embedding word2vec (W2V), we systematically explored five different machine learning methods: Transformer, Convolutional Neural Network, bidirectional Long Short-Term Memory, Random Forest (RF) and Support Vector Machine. Via exhaustive searches, we found that the RF classifier with W2V consistently achieved better performance on different datasets. The two main controlling factors were: (i) the dataset-specific W2V dictionary was generated from the training and independent test datasets instead of the widely used general UniProt proteome and (ii) a systematic search was conducted and determined the optimal k-mer value in W2V, which provides greater discrimination between positive and negative samples. Therefore, our proposed method, named iACVP, consistently provides better prediction performance compared with existing state-of-the-art methods. To assist experimentalists in identifying putative ACVPs, we implemented our model as a web server accessible via the following link: http://kurata35.bio.kyutech.ac.jp/iACVP.

摘要

COVID-19 大流行导致了全球数百万人死亡。因此,开发抗冠状病毒药物迫在眉睫。与传统的非肽类药物不同,抗病毒肽类药物具有高度特异性,易于合成和修饰,不易产生耐药性。为了减少筛选数千种肽并测定其抗病毒活性的时间和费用,需要开发用于识别抗冠状病毒肽(ACVP)的计算预测器。但是,尽管已经发现了相对大量的抗病毒肽(AVP),但可用的经实验验证的 ACVP 样本却很少。在这项研究中,我们试图使用 AVP 数据集和一小部分 ACVP 来预测 ACVP。使用常规特征、二进制轮廓和词嵌入 word2vec(W2V),我们系统地探索了五种不同的机器学习方法:Transformer、卷积神经网络、双向长短期记忆、随机森林(RF)和支持向量机。通过穷尽搜索,我们发现 RF 分类器与 W2V 一致地在不同数据集上实现了更好的性能。两个主要控制因素是:(i)数据集特定的 W2V 字典是从训练和独立测试数据集生成的,而不是使用广泛的通用 UniProt 蛋白质组,(ii)进行了系统搜索并确定了 W2V 中的最佳 k-mer 值,这提供了阳性和阴性样本之间更好的区分度。因此,与现有最先进的方法相比,我们提出的方法(称为 iACVP)一致地提供了更好的预测性能。为了帮助实验人员识别潜在的 ACVP,我们将我们的模型实现为一个可通过以下链接访问的网络服务器:http://kurata35.bio.kyutech.ac.jp/iACVP。

相似文献

1
iACVP: markedly enhanced identification of anti-coronavirus peptides using a dataset-specific word2vec model.iACVP:使用特定于数据集的 word2vec 模型显著提高了抗病毒肽的鉴定能力。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac265.
2
ACP-Dnnel: anti-coronavirus peptides' prediction based on deep neural network ensemble learning.ACP-Dnnel:基于深度神经网络集成学习的抗冠状病毒肽预测
Amino Acids. 2023 Sep;55(9):1121-1136. doi: 10.1007/s00726-023-03300-6. Epub 2023 Jul 4.
3
Comparative analysis of machine learning-based approaches for identifying therapeutic peptides targeting SARS-CoV-2.基于机器学习的方法识别针对 SARS-CoV-2 的治疗性肽的比较分析。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab412.
4
Cross-attention PHV: Prediction of human and virus protein-protein interactions using cross-attention-based neural networks.交叉注意力PHV:使用基于交叉注意力的神经网络预测人类与病毒的蛋白质-蛋白质相互作用。
Comput Struct Biotechnol J. 2022;20:5564-5573. doi: 10.1016/j.csbj.2022.10.012. Epub 2022 Oct 8.
5
Current Development of Data Resources and Bioinformatics Tools for Anticoronavirus Peptide.抗冠状病毒肽的数据资源和生物信息学工具的当前发展。
Curr Med Chem. 2024;31(26):4079-4099. doi: 10.2174/0109298673264218231121104407.
6
FEOpti-ACVP: identification of novel anti-coronavirus peptide sequences based on feature engineering and optimization.FEOpti-ACVP:基于特征工程和优化的新型抗冠状病毒肽序列的鉴定。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae037.
7
Deepstacked-AVPs: predicting antiviral peptides using tri-segment evolutionary profile and word embedding based multi-perspective features with deep stacking model.深度堆叠 AVPs:使用三片段进化特征和基于单词嵌入的多视角特征与深度堆叠模型预测抗病毒肽。
BMC Bioinformatics. 2024 Mar 7;25(1):102. doi: 10.1186/s12859-024-05726-5.
8
iACVP-MR: Accurate Identification of Anti-coronavirus Peptide based on Multiple Features Information and Recurrent Neural Network.iACVP-MR:基于多特征信息和循环神经网络的抗冠状病毒肽精确识别
Curr Med Chem. 2025;32(10):2055-2067. doi: 10.2174/0109298673277663240101111507.
9
Meta-iAVP: A Sequence-Based Meta-Predictor for Improving the Prediction of Antiviral Peptides Using Effective Feature Representation.Meta-iAVP:一种基于序列的元预测器,用于使用有效的特征表示来改进抗病毒肽的预测。
Int J Mol Sci. 2019 Nov 15;20(22):5743. doi: 10.3390/ijms20225743.
10
AVP-IC50 Pred: Multiple machine learning techniques-based prediction of peptide antiviral activity in terms of half maximal inhibitory concentration (IC50).抗利尿激素半数抑制浓度预测:基于多种机器学习技术,根据半数最大抑制浓度(IC50)对肽的抗病毒活性进行预测。
Biopolymers. 2015 Nov;104(6):753-63. doi: 10.1002/bip.22703.

引用本文的文献

1
A genetic algorithm-based ensemble model for efficiently identifying interleukin 6 inducing peptides.一种基于遗传算法的集成模型,用于高效识别白细胞介素6诱导肽。
Sci Rep. 2025 Jul 1;15(1):21213. doi: 10.1038/s41598-025-05491-2.
2
GRU4ACE: Enhancing ACE inhibitory peptide prediction by integrating gated recurrent unit with multi-source feature embeddings.GRU4ACE:通过将门控循环单元与多源特征嵌入相结合来增强血管紧张素转换酶抑制肽预测
Protein Sci. 2025 Jun;34(6):e70026. doi: 10.1002/pro.70026.
3
VITALdb: to select the best viroinformatics tools for a desired virus or application.
VITALdb:为所需病毒或应用选择最佳的病毒信息学工具。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf084.
4
ACVPICPred: Inhibitory activity prediction of anti-coronavirus peptides based on artificial neural network.ACVPICPred:基于人工神经网络的抗冠状病毒肽抑制活性预测
Comput Struct Biotechnol J. 2024 Oct 2;23:3625-3633. doi: 10.1016/j.csbj.2024.09.015. eCollection 2024 Dec.
5
Innovative Alignment-Based Method for Antiviral Peptide Prediction.基于创新比对的抗病毒肽预测方法。
Antibiotics (Basel). 2024 Aug 14;13(8):768. doi: 10.3390/antibiotics13080768.
6
PredIL13: Stacking a variety of machine and deep learning methods with ESM-2 language model for identifying IL13-inducing peptides.PredIL13:结合多种机器和深度学习方法以及 ESM-2 语言模型,用于识别诱导 IL13 的肽。
PLoS One. 2024 Aug 22;19(8):e0309078. doi: 10.1371/journal.pone.0309078. eCollection 2024.
7
Meta-2OM: A multi-classifier meta-model for the accurate prediction of RNA 2'-O-methylation sites in human RNA.Meta-2OM:一种用于准确预测人类 RNA 2'-O-甲基化位点的多分类器元模型。
PLoS One. 2024 Jun 26;19(6):e0305406. doi: 10.1371/journal.pone.0305406. eCollection 2024.
8
A review of machine learning-based methods for predicting drug-target interactions.基于机器学习的药物-靶点相互作用预测方法综述。
Health Inf Sci Syst. 2024 Apr 12;12(1):30. doi: 10.1007/s13755-024-00287-6. eCollection 2024 Dec.
9
iACVP-MR: Accurate Identification of Anti-coronavirus Peptide based on Multiple Features Information and Recurrent Neural Network.iACVP-MR:基于多特征信息和循环神经网络的抗冠状病毒肽精确识别
Curr Med Chem. 2025;32(10):2055-2067. doi: 10.2174/0109298673277663240101111507.
10
EACVP: An ESM-2 LM Framework Combined CNN and CBAM Attention to Predict Anti-coronavirus Peptides.EACVP:一种结合卷积神经网络(CNN)和CBAM注意力机制的ESM-2语言模型框架,用于预测抗冠状病毒肽。
Curr Med Chem. 2025;32(10):2040-2054. doi: 10.2174/0109298673287899240303164403.