Suppr超能文献

LABAMPsGCN:一种基于图卷积神经网络识别乳酸菌抗菌肽的框架。

LABAMPsGCN: A framework for identifying lactic acid bacteria antimicrobial peptides based on graph convolutional neural network.

作者信息

Sun Tong-Jie, Bu He-Long, Yan Xin, Sun Zhi-Hong, Zha Mu-Su, Dong Gai-Fang

机构信息

College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China.

College of Food Science and Engineering, Inner Mongolia Agricultural University, Hohhot, China.

出版信息

Front Genet. 2022 Nov 3;13:1062576. doi: 10.3389/fgene.2022.1062576. eCollection 2022.

Abstract

Lactic acid bacteria antimicrobial peptides (LABAMPs) are a class of active polypeptide produced during the metabolic process of lactic acid bacteria, which can inhibit or kill pathogenic bacteria or spoilage bacteria in food. LABAMPs have broad application in important practical fields closely related to human beings, such as food production, efficient agricultural planting, and so on. However, screening for antimicrobial peptides by biological experiment researchers is time-consuming and laborious. Therefore, it is urgent to develop a model to predict LABAMPs. In this work, we design a graph convolutional neural network framework for identifying of LABAMPs. We build heterogeneous graph based on amino acids, tripeptide and their relationships and learn weights of a graph convolutional network (GCN). Our GCN iteratively completes the learning of embedded words and sequence weights in the graph under the supervision of inputting sequence labels. We applied 10-fold cross-validation experiment to two training datasets and acquired accuracy of 0.9163 and 0.9379 respectively. They are higher that of other machine learning and GNN algorithms. In an independent test dataset, accuracy of two datasets is 0.9130 and 0.9291, which are 1.08% and 1.57% higher than the best methods of other online webservers.

摘要

乳酸菌抗菌肽(LABAMPs)是乳酸菌代谢过程中产生的一类活性多肽,可抑制或杀死食品中的病原菌或腐败菌。LABAMPs在与人类密切相关的重要实际领域有着广泛应用,如食品生产、高效农业种植等。然而,生物实验研究人员筛选抗菌肽既耗时又费力。因此,迫切需要开发一种预测LABAMPs的模型。在这项工作中,我们设计了一个用于识别LABAMPs的图卷积神经网络框架。我们基于氨基酸、三肽及其关系构建异构图,并学习图卷积网络(GCN)的权重。我们的GCN在输入序列标签的监督下,迭代完成图中嵌入词和序列权重的学习。我们对两个训练数据集进行了10折交叉验证实验,准确率分别为0.9163和0.9379。它们高于其他机器学习和GNN算法的准确率。在一个独立测试数据集中,两个数据集的准确率分别为0.9130和0.9291,比其他在线网络服务器的最佳方法分别高出1.08%和1.57%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e4/9669054/4c6534ac7816/fgene-13-1062576-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验