• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合图嵌入与网络拓扑特征的蛋白质热稳定性预测模型研究

[Research on prediction model of protein thermostability integrating graph embedding and network topology features].

作者信息

Pan Shuyi, Xiang Xiaoyang, Yan Qunfang, Ding Yanrui

机构信息

School of Science, Jiangnan University, Wuxi, Jiangsu 214122, P. R. China.

出版信息

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2025 Aug 25;42(4):817-823. doi: 10.7507/1001-5515.202501045.

DOI:10.7507/1001-5515.202501045
PMID:40887198
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12409512/
Abstract

Protein structure determines function, and structural information is critical for predicting protein thermostability. This study proposes a novel method for protein thermostability prediction by integrating graph embedding features and network topological features. By constructing residue interaction networks (RINs) to characterize protein structures, we calculated network topological features and utilize deep neural networks (DNN) to mine inherent characteristics. Using DeepWalk and Node2vec algorithms, we obtained node embeddings and extracted graph embedding features through a TopN strategy combined with bidirectional long short-term memory (BiLSTM) networks. Additionally, we introduced the Doc2vec algorithm to replace the Word2vec module in graph embedding algorithms, generating graph embedding feature vector encodings. By employing an attention mechanism to fuse graph embedding features with network topological features, we constructed a high-precision prediction model, achieving 87.85% prediction accuracy on a bacterial protein dataset. Furthermore, we analyzed the differences in the contributions of network topological features in the model and the differences among various graph embedding methods, and found that the combination of DeepWalk features with Doc2vec and all topological features was crucial for the identification of thermostable proteins. This study provides a practical and effective new method for protein thermostability prediction, and at the same time offers theoretical guidance for exploring protein diversity, discovering new thermostable proteins, and the intelligent modification of mesophilic proteins.

摘要

蛋白质结构决定功能,结构信息对于预测蛋白质热稳定性至关重要。本研究提出了一种通过整合图嵌入特征和网络拓扑特征来预测蛋白质热稳定性的新方法。通过构建残基相互作用网络(RINs)来表征蛋白质结构,我们计算了网络拓扑特征,并利用深度神经网络(DNN)挖掘内在特征。使用DeepWalk和Node2vec算法,我们获得了节点嵌入,并通过结合双向长短期记忆(BiLSTM)网络的TopN策略提取了图嵌入特征。此外,我们引入了Doc2vec算法来取代图嵌入算法中的Word2vec模块,生成图嵌入特征向量编码。通过采用注意力机制将图嵌入特征与网络拓扑特征融合,我们构建了一个高精度预测模型,在细菌蛋白质数据集上实现了87.85%的预测准确率。此外,我们分析了模型中网络拓扑特征贡献的差异以及各种图嵌入方法之间的差异,发现DeepWalk特征与Doc2vec和所有拓扑特征的组合对于热稳定蛋白质的识别至关重要。本研究为蛋白质热稳定性预测提供了一种实用有效的新方法,同时为探索蛋白质多样性、发现新的热稳定蛋白质以及嗜温蛋白质的智能改造提供了理论指导。

相似文献

1
[Research on prediction model of protein thermostability integrating graph embedding and network topology features].整合图嵌入与网络拓扑特征的蛋白质热稳定性预测模型研究
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2025 Aug 25;42(4):817-823. doi: 10.7507/1001-5515.202501045.
2
DeepWalk-aware graph attention networks with CNN for circRNA-drug sensitivity association identification.基于 DeepWalk 感知图注意力网络和 CNN 的环状 RNA-药物敏感性关联识别方法
Brief Funct Genomics. 2024 Jul 19;23(4):418-428. doi: 10.1093/bfgp/elad053.
3
An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.一种用于准确预测蛋白质-蛋白质相互作用的端到端知识图谱融合图神经网络
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2518-2530. doi: 10.1109/TCBB.2024.3486216. Epub 2024 Dec 10.
4
Distilling knowledge from graph neural networks trained on cell graphs to non-neural student models.从在细胞图上训练的图神经网络中提取知识,用于非神经学生模型。
Sci Rep. 2025 Aug 10;15(1):29274. doi: 10.1038/s41598-025-13697-7.
5
FuseLinker: Leveraging LLM's pre-trained text embeddings and domain knowledge to enhance GNN-based link prediction on biomedical knowledge graphs.FuseLinker:利用大语言模型的预训练文本嵌入和领域知识增强基于图神经网络的生物医学知识图谱的链接预测。
J Biomed Inform. 2024 Oct;158:104730. doi: 10.1016/j.jbi.2024.104730. Epub 2024 Sep 24.
6
Accelerated prediction of molecular properties for per- and polyfluoroalkyl substances using graph neural networks with adjacency-free message passing.使用无邻接消息传递的图神经网络对全氟和多氟烷基物质的分子性质进行加速预测。
Environ Pollut. 2025 Jun 30;382:126705. doi: 10.1016/j.envpol.2025.126705.
7
MEGDTA: multi-modal drug-target affinity prediction based on protein three-dimensional structure and ensemble graph neural network.MEGDTA:基于蛋白质三维结构和集成图神经网络的多模态药物-靶点亲和力预测
BMC Genomics. 2025 Aug 11;26(1):738. doi: 10.1186/s12864-025-11943-w.
8
A Novel Framework for Enhancing Decision-Making in Autonomous Cyber Defense Through Graph Embedding.一种通过图嵌入增强自主网络防御中决策制定的新型框架。
Entropy (Basel). 2025 Jun 11;27(6):622. doi: 10.3390/e27060622.
9
Multi-view based heterogeneous graph contrastive learning for drug-target interaction prediction.基于多视图的异构图对比学习用于药物-靶点相互作用预测
J Biomed Inform. 2025 Aug;168:104852. doi: 10.1016/j.jbi.2025.104852. Epub 2025 Jun 2.
10
NPI-HetGNN: A Prediction Model of ncRNA-Protein Interactions Based on Heterogeneous Graph Neural Networks.NPI-HetGNN:一种基于异构图神经网络的非编码RNA-蛋白质相互作用预测模型。
Interdiscip Sci. 2025 Jun 2. doi: 10.1007/s12539-025-00716-4.

本文引用的文献

1
BacDive in 2025: the core database for prokaryotic strain data.2025年的细菌数据库(BacDive):原核生物菌株数据的核心数据库。
Nucleic Acids Res. 2025 Jan 6;53(D1):D748-D756. doi: 10.1093/nar/gkae959.
2
Exploring Druggable Binding Sites on the Class A GPCRs Using the Residue Interaction Network and Site Identification by Ligand Competitive Saturation.利用残基相互作用网络和配体竞争饱和法鉴定A类G蛋白偶联受体上的可成药结合位点
ACS Omega. 2024 Sep 13;9(38):40154-40171. doi: 10.1021/acsomega.4c06172. eCollection 2024 Sep 24.
3
Characterisation and engineering of a thermophilic RNA ligase from Palaeococcus pacificus.来自太平洋古球菌的嗜热RNA连接酶的表征与工程改造
Nucleic Acids Res. 2024 Apr 24;52(7):3924-3937. doi: 10.1093/nar/gkae149.
4
Screening and characterization of thermostable enzyme-producing bacteria from selected hot springs of Ethiopia.从埃塞俄比亚选定的温泉中筛选和鉴定产耐热酶的细菌。
Microbiol Spectr. 2024 Mar 5;12(3):e0371023. doi: 10.1128/spectrum.03710-23. Epub 2024 Jan 31.
5
Machine learning models based on residue interaction network for ABCG2 transportable compounds recognition.基于残基相互作用网络的 ABCG2 外排化合物识别的机器学习模型。
Environ Pollut. 2023 Nov 15;337:122620. doi: 10.1016/j.envpol.2023.122620. Epub 2023 Sep 26.
6
Protein-based materials: from sources to innovative sustainable materials for biomedical applications.基于蛋白质的材料:从来源到用于生物医学应用的创新可持续材料
J Mater Chem B. 2014 Jun 28;2(24):3715-3740. doi: 10.1039/c4tb00168k. Epub 2014 May 8.
7
Thermophilic Proteins as Versatile Scaffolds for Protein Engineering.嗜热蛋白作为蛋白质工程的通用支架
Microorganisms. 2018 Sep 25;6(4):97. doi: 10.3390/microorganisms6040097.
8
Thermophilic Adaptation in Prokaryotes Is Constrained by Metabolic Costs of Proteostasis.原核生物的耐热性受到蛋白质稳定性代谢成本的限制。
Mol Biol Evol. 2018 Jan 1;35(1):211-224. doi: 10.1093/molbev/msx282.
9
Molecular dynamics simulations and modelling of the residue interaction networks in the BRAF kinase complexes with small molecule inhibitors: probing the allosteric effects of ligand-induced kinase dimerization and paradoxical activation.BRAF激酶复合物与小分子抑制剂的残基相互作用网络的分子动力学模拟与建模:探究配体诱导的激酶二聚化和反常激活的变构效应。
Mol Biosyst. 2016 Oct 20;12(10):3146-65. doi: 10.1039/c6mb00298f. Epub 2016 Aug 2.
10
The RING 2.0 web server for high quality residue interaction networks.用于高质量残基相互作用网络的RING 2.0网络服务器。
Nucleic Acids Res. 2016 Jul 8;44(W1):W367-74. doi: 10.1093/nar/gkw315. Epub 2016 May 19.