Li Min, Shi Wenbo, Zhang Fuhao, Zeng Min, Li Yaohang
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):833-842. doi: 10.1109/TCBB.2022.3170719. Epub 2023 Apr 3.
The understanding of protein functions is critical to many biological problems such as the development of new drugs and new crops. To reduce the huge gap between the increase of protein sequences and annotations of protein functions, many methods have been proposed to deal with this problem. These methods use Gene Ontology (GO) to classify the functions of proteins and consider one GO term as a class label. However, they ignore the co-occurrence of GO terms that is helpful for protein function prediction. We propose a new deep learning model, named DeepPFP-CO, which uses Graph Convolutional Network (GCN) to explore and capture the co-occurrence of GO terms to improve the protein function prediction performance. In this way, we can further deduce the protein functions by fusing the predicted propensity of the center function and its co-occurrence functions. We use Fmax and AUPR to evaluate the performance of DeepPFP-CO and compare DeepPFP-CO with state-of-the-art methods such as DeepGOPlus and DeepGOA. The computational results show that DeepPFP-CO outperforms DeepGOPlus and other methods. Moreover, we further analyze our model at the protein level. The results have demonstrated that DeepPFP-CO improves the performance of protein function prediction. DeepPFP-CO is available at https://csuligroup.com/DeepPFP/.
对蛋白质功能的理解对于许多生物学问题至关重要,例如新药和新作物的研发。为了缩小蛋白质序列增长与蛋白质功能注释之间的巨大差距,人们提出了许多方法来解决这个问题。这些方法使用基因本体论(GO)对蛋白质功能进行分类,并将一个GO术语视为一个类别标签。然而,它们忽略了有助于蛋白质功能预测的GO术语的共现情况。我们提出了一种名为DeepPFP-CO的新型深度学习模型,该模型使用图卷积网络(GCN)来探索和捕捉GO术语的共现情况,以提高蛋白质功能预测性能。通过这种方式,我们可以通过融合中心功能及其共现功能的预测倾向来进一步推断蛋白质功能。我们使用Fmax和AUPR来评估DeepPFP-CO的性能,并将DeepPFP-CO与诸如DeepGOPlus和DeepGOA等最先进的方法进行比较。计算结果表明,DeepPFP-CO优于DeepGOPlus和其他方法。此外,我们在蛋白质水平上进一步分析了我们的模型。结果表明,DeepPFP-CO提高了蛋白质功能预测的性能。DeepPFP-CO可在https://csuligroup.com/DeepPFP/获取。