Liang Yunyun, Ma Xinyan, Li Jin, Zhang Shengli
School of Science, Xi'an Polytechnic University, Xi'an, 710048, P.R. China.
School of Mathematics and Statistics, Xidian University, Xi'an, 710071, P.R. China.
Curr Med Chem. 2025;32(10):2055-2067. doi: 10.2174/0109298673277663240101111507.
Over the years, viruses have caused human illness and threatened human health. Therefore, it is pressing to develop anti-coronavirus infection drugs with clear function, low cost, and high safety. Anti-coronavirus peptide (ACVP) is a key therapeutic agent against coronavirus. Traditional methods for finding ACVP need a great deal of money and man power. Hence, it is a significant task to establish intelligent computational tools to able rapid, efficient and accurate identification of ACVP.
In this paper, we construct an excellent model named iACVP-MR to identify ACVP based on multiple features and recurrent neural networks. Multiple features are extracted by using reduced amino acid component and dipeptide component, compositions of k-spaced amino acid pairs, BLOSUM62 encoder according to the N5C5 sequence, as well as second-order moving average approach based on 16 physicochemical properties. Then, two recurrent neural networks named long-short term memory (LSTM) and bidirectional gated recurrent unit (BiGRU) combined attention mechanism are used for feature fusion and classification, respectively.
The accuracies of ENNAVIA-C and ENNAVIA-D datasets under the 10-fold cross-validation are 99.15% and 98.92%, respectively, and other evaluation indexes have also obtained satisfactory results. The experimental results show that our model is superior to other existing models.
The iACVP-MR model can be viewed as a powerful and intelligent tool for the accurate identification of ACVP. The datasets and source codes for iACVP-MR are freely downloaded at https://github.com/yunyunliang88/iACVP-MR.
多年来,病毒已引发人类疾病并威胁人类健康。因此,开发功能明确、成本低廉且安全性高的抗冠状病毒感染药物迫在眉睫。抗冠状病毒肽(ACVP)是对抗冠状病毒的关键治疗剂。传统寻找ACVP的方法需要大量资金和人力。因此,建立智能计算工具以能够快速、高效且准确地识别ACVP是一项重大任务。
在本文中,我们构建了一个名为iACVP-MR的优秀模型,基于多种特征和循环神经网络来识别ACVP。通过使用简化氨基酸组成和二肽组成、k间隔氨基酸对组成、根据N5C5序列的BLOSUM62编码以及基于16种物理化学性质的二阶移动平均方法来提取多种特征。然后,分别使用名为长短期记忆(LSTM)和双向门控循环单元(BiGRU)的两种循环神经网络结合注意力机制进行特征融合和分类。
在10折交叉验证下,ENNAVIA-C和ENNAVIA-D数据集的准确率分别为99.15%和98.92%,其他评估指标也取得了令人满意的结果。实验结果表明我们的模型优于其他现有模型。
iACVP-MR模型可被视为准确识别ACVP的强大且智能的工具。iACVP-MR的数据集和源代码可在https://github.com/yunyunliang88/iACVP-MR免费下载。