Sun Tong-Jie, Bu He-Long, Yan Xin, Sun Zhi-Hong, Zha Mu-Su, Dong Gai-Fang
College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China.
College of Food Science and Engineering, Inner Mongolia Agricultural University, Hohhot, China.
Front Genet. 2022 Nov 3;13:1062576. doi: 10.3389/fgene.2022.1062576. eCollection 2022.
Lactic acid bacteria antimicrobial peptides (LABAMPs) are a class of active polypeptide produced during the metabolic process of lactic acid bacteria, which can inhibit or kill pathogenic bacteria or spoilage bacteria in food. LABAMPs have broad application in important practical fields closely related to human beings, such as food production, efficient agricultural planting, and so on. However, screening for antimicrobial peptides by biological experiment researchers is time-consuming and laborious. Therefore, it is urgent to develop a model to predict LABAMPs. In this work, we design a graph convolutional neural network framework for identifying of LABAMPs. We build heterogeneous graph based on amino acids, tripeptide and their relationships and learn weights of a graph convolutional network (GCN). Our GCN iteratively completes the learning of embedded words and sequence weights in the graph under the supervision of inputting sequence labels. We applied 10-fold cross-validation experiment to two training datasets and acquired accuracy of 0.9163 and 0.9379 respectively. They are higher that of other machine learning and GNN algorithms. In an independent test dataset, accuracy of two datasets is 0.9130 and 0.9291, which are 1.08% and 1.57% higher than the best methods of other online webservers.
乳酸菌抗菌肽(LABAMPs)是乳酸菌代谢过程中产生的一类活性多肽,可抑制或杀死食品中的病原菌或腐败菌。LABAMPs在与人类密切相关的重要实际领域有着广泛应用,如食品生产、高效农业种植等。然而,生物实验研究人员筛选抗菌肽既耗时又费力。因此,迫切需要开发一种预测LABAMPs的模型。在这项工作中,我们设计了一个用于识别LABAMPs的图卷积神经网络框架。我们基于氨基酸、三肽及其关系构建异构图,并学习图卷积网络(GCN)的权重。我们的GCN在输入序列标签的监督下,迭代完成图中嵌入词和序列权重的学习。我们对两个训练数据集进行了10折交叉验证实验,准确率分别为0.9163和0.9379。它们高于其他机器学习和GNN算法的准确率。在一个独立测试数据集中,两个数据集的准确率分别为0.9130和0.9291,比其他在线网络服务器的最佳方法分别高出1.08%和1.57%。