Li Gen, Zhang Ning, Fan Long
Production and R&D Center I of LSS, GenScript (Shanghai) Biotech Co., Ltd., Shanghai 200131, China.
Production and R&D Center I of LSS, GenScript Biotech Corporation, Nanjing 211122, China.
ACS Omega. 2025 Jan 24;10(4):3910-3916. doi: 10.1021/acsomega.4c09688. eCollection 2025 Feb 4.
Solubility is a key biophysical property of proteins and is essential for evaluating the effectiveness of proteins in biochemical engineering. In recent years, the prediction method of protein solubility has received extensive attention in the protein engineering research community. Many methods have been developed to predict protein solubility, but the generalization performance of existing prediction methods on independent test sets must be improved. In addition, solubility prediction methods do not work well when they are used for regression tasks. To address these issues, we developed a new method, ProG-SOL, an innovative sequence-based dual-graph convolutional network that simultaneously exploits the protein pretrained graph and the protein evolutionary graph for assessing solubility. Compared with other methods, ProG-SOL achieves better classification and regression results for different independent test sets at the same time. The model framework of our method may also be used to predict other properties of proteins such as protein function, protein-protein interaction, protein folding, and drug design, which provide broad application prospects in protein engineering.
溶解度是蛋白质的一项关键生物物理特性,对于评估蛋白质在生化工程中的有效性至关重要。近年来,蛋白质溶解度预测方法在蛋白质工程研究领域受到广泛关注。人们已开发出多种预测蛋白质溶解度的方法,但现有预测方法在独立测试集上的泛化性能仍有待提高。此外,溶解度预测方法在用于回归任务时效果不佳。为解决这些问题,我们开发了一种新方法ProG-SOL,这是一种创新的基于序列的双图卷积网络,它同时利用蛋白质预训练图和蛋白质进化图来评估溶解度。与其他方法相比,ProG-SOL在不同独立测试集上同时取得了更好的分类和回归结果。我们方法的模型框架还可用于预测蛋白质的其他特性,如蛋白质功能、蛋白质-蛋白质相互作用、蛋白质折叠和药物设计,这在蛋白质工程中具有广阔的应用前景。