• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

支持蛋白质结构预测方法开发的蛋白质结构搜索。

Protein structure search to support the development of protein structure prediction methods.

作者信息

Ayoub Ronald, Lee Yugyung

机构信息

School of Computing and Engineering, University of Missouri at Kansas City, Kansas City, Missouri, USA.

出版信息

Proteins. 2021 Jun;89(6):648-658. doi: 10.1002/prot.26048. Epub 2021 Feb 2.

DOI:10.1002/prot.26048
PMID:33458852
Abstract

Protein structure prediction is a long-standing unsolved problem in molecular biology that has seen renewed interest with the recent success of deep learning with AlphaFold at CASP13. While developing and evaluating protein structure prediction methods, researchers may want to identify the most similar known structures to their predicted structures. These predicted structures often have low sequence and structure similarity to known structures. We show how RUPEE, a purely geometric protein structure search, is able to identify the structures most similar to structure predictions, regardless of how they vary from known structures, something existing protein structure searches struggle with. RUPEE accomplishes this through the use of a novel linear encoding of protein structures as a sequence of residue descriptors. Using a fast Needleman-Wunsch algorithm, RUPEE is able to perform alignments on the sequences of residue descriptors for every available structure. This is followed by a series of increasingly accurate structure alignments from TM-align alignments initialized with the Needleman-Wunsch residue descriptor alignments to standard TM-align alignments of the final results. By using alignment normalization effectively at each stage, RUPEE also can execute containment searches in addition to full-length searches to identify structural motifs within proteins. We compare the results of RUPEE to the protein structure searches mTM-align, SSM, CATHEDRAL, and VAST using a benchmark derived from the protein structure predictions submitted to CASP13. RUPEE identifies better alignments on average with respect to TM-score as well as scores specific to SSM and CATHEDRAL, Q-score and SSAP-score, respectively.

摘要

蛋白质结构预测是分子生物学中一个长期未解决的问题,随着深度学习在第13届蛋白质结构预测关键评估(CASP13)中借助AlphaFold取得的最新成功,该问题再次受到关注。在开发和评估蛋白质结构预测方法时,研究人员可能希望识别与其预测结构最相似的已知结构。这些预测结构通常与已知结构的序列和结构相似性较低。我们展示了RUPEE(一种纯几何蛋白质结构搜索方法)如何能够识别与结构预测最相似的结构,无论它们与已知结构有多大差异,而这是现有蛋白质结构搜索方法所难以做到的。RUPEE通过使用一种将蛋白质结构作为残基描述符序列的新型线性编码来实现这一点。利用快速的Needleman-Wunsch算法,RUPEE能够对每个可用结构的残基描述符序列进行比对。接下来是一系列越来越精确的结构比对,从以Needleman-Wunsch残基描述符比对初始化的TM-align比对到最终结果的标准TM-align比对。通过在每个阶段有效地使用比对归一化,RUPEE除了能进行全长搜索外,还可以执行包含搜索以识别蛋白质中的结构基序。我们使用源自提交给CASP13的蛋白质结构预测的基准,将RUPEE的结果与蛋白质结构搜索方法mTM-align、SSM、CATHEDRAL和VAST的结果进行比较。就TM分数以及分别特定于SSM和CATHEDRAL的分数(Q分数和SSAP分数)而言,RUPEE平均能识别出更好的比对结果。

相似文献

1
Protein structure search to support the development of protein structure prediction methods.支持蛋白质结构预测方法开发的蛋白质结构搜索。
Proteins. 2021 Jun;89(6):648-658. doi: 10.1002/prot.26048. Epub 2021 Feb 2.
2
RUPEE: A fast and accurate purely geometric protein structure search.鲁比:一种快速而准确的纯几何蛋白质结构搜索方法。
PLoS One. 2019 Mar 15;14(3):e0213712. doi: 10.1371/journal.pone.0213712. eCollection 2019.
3
ProALIGN: Directly Learning Alignments for Protein Structure Prediction via Exploiting Context-Specific Alignment Motifs.ProALIGN:通过利用上下文特定的对齐基序直接学习蛋白质结构预测的对齐。
J Comput Biol. 2022 Feb;29(2):92-105. doi: 10.1089/cmb.2021.0430. Epub 2022 Jan 21.
4
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align:一种基于片段比对和TM分数的新型蛋白质结构比对方法。
BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.
5
DNSS2: Improved ab initio protein secondary structure prediction using advanced deep learning architectures.DNSS2:使用先进深度学习架构改进从头算蛋白质二级结构预测
Proteins. 2021 Feb;89(2):207-217. doi: 10.1002/prot.26007. Epub 2020 Sep 16.
6
How to choose templates for modeling of protein complexes: Insights from benchmarking template-based docking.如何为蛋白质复合物建模选择模板:基于基准测试的模板对接的见解。
Proteins. 2020 Aug;88(8):1070-1081. doi: 10.1002/prot.25875. Epub 2020 Feb 7.
7
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.CAB比对:一种基于残基-残基接触面积的灵活蛋白质结构比对方法。
PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.
8
Protein structure prediction improves the quality of amino-acid sequence alignment.蛋白质结构预测可以提高氨基酸序列比对的质量。
Proteins. 2022 Dec;90(12):2144-2147. doi: 10.1002/prot.26392. Epub 2022 Jul 15.
9
Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。
BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.
10
Sequence and structure alignments in post-AlphaFold era.后 AlphaFold 时代的序列和结构比对。
Curr Opin Struct Biol. 2023 Apr;79:102539. doi: 10.1016/j.sbi.2023.102539. Epub 2023 Feb 6.

引用本文的文献

1
Prediction of secreted uncharacterized protein structures from Beauveria bassiana ARSEF 2860 unravels novel toxins-like families.球孢白僵菌ARSEF 2860分泌的未知蛋白质结构预测揭示了新的毒素样家族。
Sci Rep. 2025 May 22;15(1):17747. doi: 10.1038/s41598-025-02618-3.
2
AlphaFold2 assists in providing novel mechanistic insights into the interactions among the LUBAC subunits.AlphaFold2 有助于提供关于 LUBAC 亚基相互作用的新的机制见解。
Acta Biochim Biophys Sin (Shanghai). 2024 Apr 23;56(7):1034-1043. doi: 10.3724/abbs.2024047.
3
In Silico Evaluation, Phylogenetic Analysis, and Structural Modeling of the Class II Hydrophobin Family from Different Fungal Phytopathogens.
不同真菌植物病原体中II类疏水蛋白家族的计算机模拟评估、系统发育分析和结构建模
Microorganisms. 2023 Oct 26;11(11):2632. doi: 10.3390/microorganisms11112632.
4
Structure-Function Analysis of RBP7910: An Editosome Z-Binding Protein in Trypanosomatids.结构功能分析 RBP7910:一种锥虫的 editosome Z 结合蛋白。
Molecules. 2023 Oct 7;28(19):6963. doi: 10.3390/molecules28196963.
5
The Venturia inaequalis effector repertoire is dominated by expanded families with predicted structural similarity, but unrelated sequence, to avirulence proteins from other plant-pathogenic fungi.畸形外囊菌效应子组库主要由结构相似、序列无关的扩展家族组成,与其他植物病原真菌的无毒蛋白具有预测的结构相似性。
BMC Biol. 2022 Nov 3;20(1):246. doi: 10.1186/s12915-022-01442-9.