• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于序列的蛋白质溶剂可及性预测

Sequence-Based Prediction for Protein Solvent Accessibility.

作者信息

Yang Yang, Chen Mengqi, Liu Congrui, Vihinen Mauno

机构信息

Computing Science and Artificial Intelligence College, Suzhou City University, Suzhou 215004, China.

School of Computer Science and Technology, Soochow University, Suzhou 215006, China.

出版信息

Int J Mol Sci. 2025 Jun 11;26(12):5604. doi: 10.3390/ijms26125604.

DOI:10.3390/ijms26125604
PMID:40565067
Abstract

When globular proteins fold into their characteristic three-dimensional structures, some amino acids are located on the surface, while others are situated in the protein core, where they cannot interact with molecules in the environment. Predicting the degree of solubility of amino acids provides insight into the function and relevance of residues. Residue accessibility is crucial for several protein functions, including enzymatic activity, allostery, multimer formation, binding to other molecules, and immunogenicity. We developed a novel sequence-based predictor for amino acid accessibility with features derived from three-dimensional protein structures. Several machine learning algorithms were tested, and the long short-term memory (LSTM) deep learning method demonstrated the best performance; thus, it was utilized to develop the freely available SolAcc tool. It showed superior performance compared to state-of-the-art predictors in a blind test.

摘要

当球状蛋白质折叠成其特有的三维结构时,一些氨基酸位于表面,而另一些则位于蛋白质核心,在那里它们无法与环境中的分子相互作用。预测氨基酸的溶解度程度有助于深入了解残基的功能和相关性。残基可及性对于多种蛋白质功能至关重要,包括酶活性、别构效应、多聚体形成、与其他分子的结合以及免疫原性。我们开发了一种基于序列的新型氨基酸可及性预测器,其特征源自三维蛋白质结构。测试了几种机器学习算法,长短期记忆(LSTM)深度学习方法表现最佳;因此,利用该方法开发了免费可用的SolAcc工具。在盲测中,它比现有最佳预测器表现更优。

相似文献

1
Sequence-Based Prediction for Protein Solvent Accessibility.基于序列的蛋白质溶剂可及性预测
Int J Mol Sci. 2025 Jun 11;26(12):5604. doi: 10.3390/ijms26125604.
2
Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis.揭示增强蛋白质结构预测政策的演变:全面分析。
Comput Biol Med. 2024 Sep;179:108815. doi: 10.1016/j.compbiomed.2024.108815. Epub 2024 Jul 11.
3
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
6
Systematic review and validation of prediction rules for identifying children with serious infections in emergency departments and urgent-access primary care.系统评价和验证预测规则,以识别急诊科和紧急初级保健中严重感染的儿童。
Health Technol Assess. 2012;16(15):1-100. doi: 10.3310/hta16150.
7
Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.通过综合生物信息学分析和机器学习破译妊娠期糖尿病和子痫前期之间共享的基因特征及免疫浸润特征
Reprod Sci. 2025 May 15. doi: 10.1007/s43032-025-01847-1.
8
Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences.蛋白质序列中核酸结合残基预测二十年进展
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf016.
9
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.
10
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

本文引用的文献

1
BTKbase, Bruton Tyrosine Kinase Variant Database in X-Linked Agammaglobulinemia: Looking Back and Ahead.BTKbase,X连锁无丙种球蛋白血症中的布鲁顿酪氨酸激酶变异数据库:回顾与展望。
Hum Mutat. 2023 Jul 31;2023:5797541. doi: 10.1155/2023/5797541. eCollection 2023.
2
E-pRSA: Embeddings Improve the Prediction of Residue Relative Solvent Accessibility in Protein Sequence.E-pRSA:嵌入改进了蛋白质序列中残基相对溶剂可及性的预测。
J Mol Biol. 2024 Sep 1;436(17):168494. doi: 10.1016/j.jmb.2024.168494. Epub 2024 Feb 15.
3
Accurate structure prediction of biomolecular interactions with AlphaFold 3.
利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
4
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
5
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
6
NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning.NetSurfP-3.0:通过蛋白质语言模型和深度学习实现蛋白质结构特征的准确快速预测。
Nucleic Acids Res. 2022 Jul 5;50(W1):W510-W515. doi: 10.1093/nar/gkac439.
7
Reaching alignment-profile-based accuracy in predicting protein secondary and tertiary structural properties without alignment.无需对齐即可达到基于对齐轮廓的预测蛋白质二级和三级结构性质的准确性。
Sci Rep. 2022 May 9;12(1):7607. doi: 10.1038/s41598-022-11684-w.
8
Search and sequence analysis tools services from EMBL-EBI in 2022.2022 年 EMBL-EBI 的搜索和序列分析工具服务。
Nucleic Acids Res. 2022 Jul 5;50(W1):W276-W279. doi: 10.1093/nar/gkac240.
9
SSpro/ACCpro 6: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, deep learning and structural similarity.SSpro/ACCpro 6:使用轮廓、深度学习和结构相似性进行蛋白质二级结构和相对溶剂可及性的近乎完美预测。
Bioinformatics. 2022 Mar 28;38(7):2064-2065. doi: 10.1093/bioinformatics/btac019.
10
DeepREx-WS: A web server for characterising protein-solvent interaction starting from sequence.DeepREx-WS:一个从序列开始表征蛋白质-溶剂相互作用的网络服务器。
Comput Struct Biotechnol J. 2021 Oct 13;19:5791-5799. doi: 10.1016/j.csbj.2021.10.016. eCollection 2021.