• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GR-Align:使用图元度数相似度进行快速灵活的蛋白质 3D 结构比对。

GR-Align: fast and flexible alignment of protein 3D structures using graphlet degree similarity.

机构信息

Department of Computing, Imperial College London, SW7 2AZ, UK.

出版信息

Bioinformatics. 2014 May 1;30(9):1259-65. doi: 10.1093/bioinformatics/btu020. Epub 2014 Jan 17.

DOI:10.1093/bioinformatics/btu020
PMID:24443377
Abstract

MOTIVATION

Protein structure alignment is key for transferring information from well-studied proteins to less studied ones. Structural alignment identifies the most precise mapping of equivalent residues, as structures are more conserved during evolution than sequences. Among the methods for aligning protein structures, maximum Contact Map Overlap (CMO) has received sustained attention during the past decade. Yet, known algorithms exhibit modest performance and are not applicable for large-scale comparison.

RESULTS

Graphlets are small induced subgraphs that are used to design sensitive topological similarity measures between nodes and networks. By generalizing graphlets to ordered graphs, we introduce GR-Align, a CMO heuristic that is suited for database searches. On the Proteus_300 set (44 850 protein domain pairs), GR-Align is several orders of magnitude faster than the state-of-the-art CMO solvers Apurva, MSVNS and AlEigen7, and its similarity score is in better agreement with the structural classification of proteins. On a large-scale experiment on the Gold-standard benchmark dataset (3 207 270 protein domain pairs), GR-Align is several orders of magnitude faster than the state-of-the-art protein structure comparison tools TM-Align, DaliLite, MATT and Yakusa, while achieving similar classification performances. Finally, we illustrate the difference between GR-Align's flexible alignments and the traditional ones by querying a flexible protein in the Astral-40 database (11 154 protein domains). In this experiment, GR-Align's top scoring alignments are not only in better agreement with structural classification of proteins, but also that they allow transferring more information across proteins.

摘要

动机

蛋白质结构比对对于将信息从研究充分的蛋白质转移到研究较少的蛋白质至关重要。结构比对确定了等价残基的最精确映射,因为在进化过程中结构比序列更保守。在蛋白质结构对齐的方法中,最大接触图重叠(CMO)在过去十年中一直受到关注。然而,已知的算法表现出中等的性能,并且不适用于大规模比较。

结果

图元是用于设计节点和网络之间敏感拓扑相似性度量的小诱导子图。通过将图元推广到有序图,我们引入了 GR-Align,这是一种适用于数据库搜索的 CMO 启发式算法。在 Proteus_300 数据集(44850 个蛋白质域对)上,GR-Align 的速度比最先进的 CMO 求解器 Apurva、MSVNS 和 AlEigen7 快几个数量级,其相似度得分与蛋白质的结构分类更一致。在 Gold-standard 基准数据集(3207270 个蛋白质域对)的大规模实验中,GR-Align 的速度比最先进的蛋白质结构比较工具 TM-Align、DaliLite、MATT 和 Yakusa 快几个数量级,同时实现了类似的分类性能。最后,我们通过在 Astral-40 数据库(11154 个蛋白质域)中查询一个灵活的蛋白质来展示 GR-Align 的灵活对齐与传统对齐之间的差异。在这个实验中,GR-Align 的最高得分对齐不仅与蛋白质的结构分类更一致,而且它们允许在蛋白质之间传递更多的信息。

相似文献

1
GR-Align: fast and flexible alignment of protein 3D structures using graphlet degree similarity.GR-Align:使用图元度数相似度进行快速灵活的蛋白质 3D 结构比对。
Bioinformatics. 2014 May 1;30(9):1259-65. doi: 10.1093/bioinformatics/btu020. Epub 2014 Jan 17.
2
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.CAB比对:一种基于残基-残基接触面积的灵活蛋白质结构比对方法。
PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.
3
Maximum contact map overlap revisited.重新审视最大接触图重叠
J Comput Biol. 2011 Jan;18(1):27-41. doi: 10.1089/cmb.2009.0196.
4
LS-align: an atom-level, flexible ligand structural alignment algorithm for high-throughput virtual screening.LS-align:一种适用于高通量虚拟筛选的原子级、灵活的配体结构对齐算法。
Bioinformatics. 2018 Jul 1;34(13):2209-2218. doi: 10.1093/bioinformatics/bty081.
5
Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。
BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.
6
Fast overlapping of protein contact maps by alignment of eigenvectors.通过特征向量比对实现蛋白质接触图谱的快速重叠。
Bioinformatics. 2010 Sep 15;26(18):2250-8. doi: 10.1093/bioinformatics/btq402. Epub 2010 Jul 7.
7
Selecting Near-Native Protein Structures from Predicted Decoy Sets Using Ordered Graphlet Degree Similarity.使用有序图节点度相似性从预测的伪结构集中选择近天然蛋白质结构。
Genes (Basel). 2019 Feb 11;10(2):132. doi: 10.3390/genes10020132.
8
Fast protein structure alignment using Gaussian overlap scoring of backbone peptide fragment similarity.基于骨干肽段相似性的高斯叠合评分进行快速蛋白质结构比对。
Bioinformatics. 2012 Dec 15;28(24):3274-81. doi: 10.1093/bioinformatics/bts618. Epub 2012 Oct 23.
9
BLANT-fast graphlet sampling tool.BLANT 快速图元采样工具。
Bioinformatics. 2019 Dec 15;35(24):5363-5364. doi: 10.1093/bioinformatics/btz603.
10
Comparing protein structures and inferring functions with a novel three-dimensional Yau-Hausdorff method.用一种新颖的三维 Yau-Hausdorff 方法比较蛋白质结构和推断功能。
J Biomol Struct Dyn. 2019 Oct;37(16):4151-4160. doi: 10.1080/07391102.2018.1540359. Epub 2018 Dec 5.

引用本文的文献

1
A fast approach for structural and evolutionary analysis based on energetic profile protein comparison.一种基于能量分布蛋白质比较的结构与进化分析快速方法。
Nat Commun. 2025 Mar 6;16(1):2231. doi: 10.1038/s41467-025-57374-9.
2
Current and future directions in network biology.网络生物学的当前与未来发展方向。
Bioinform Adv. 2024 Aug 14;4(1):vbae099. doi: 10.1093/bioadv/vbae099. eCollection 2024.
3
An interactive visualization tool for educational outreach in protein contact map overlap analysis.一种用于蛋白质接触图重叠分析教育推广的交互式可视化工具。
Front Bioinform. 2024 Mar 15;4:1358550. doi: 10.3389/fbinf.2024.1358550. eCollection 2024.
4
Contact-Assisted Threading in Low-Homology Protein Modeling.接触辅助线程在低同源性蛋白质建模中的应用。
Methods Mol Biol. 2023;2627:41-59. doi: 10.1007/978-1-0716-2974-1_3.
5
Virtual Screening of FDA-Approved Drugs for Enhanced Binding with Mitochondrial Aldehyde Dehydrogenase.FDA 批准药物与线粒体乙醛脱氢酶增强结合的虚拟筛选。
Molecules. 2022 Dec 10;27(24):8773. doi: 10.3390/molecules27248773.
6
ContactPFP: Protein function prediction using predicted contact information.ContactPFP:利用预测的接触信息进行蛋白质功能预测。
Front Bioinform. 2022 Jun;2. doi: 10.3389/fbinf.2022.896295. Epub 2022 Jun 2.
7
Understanding structural variability in proteins using protein structural networks.利用蛋白质结构网络理解蛋白质中的结构变异性。
Curr Res Struct Biol. 2022 Apr 27;4:134-145. doi: 10.1016/j.crstbi.2022.04.002. eCollection 2022.
8
Multi-layer sequential network analysis improves protein 3D structural classification.多层序列网络分析提高蛋白质 3D 结构分类。
Proteins. 2022 Sep;90(9):1721-1731. doi: 10.1002/prot.26349. Epub 2022 May 2.
9
Pairwise Biological Network Alignment Based on Discrete Bat Algorithm.基于离散蝙蝠算法的成对生物网络比对。
Comput Math Methods Med. 2021 Nov 3;2021:5548993. doi: 10.1155/2021/5548993. eCollection 2021.
10
DeeplyEssential: a deep neural network for predicting essential genes in microbes.深度必需:一种用于预测微生物必需基因的深度神经网络。
BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):367. doi: 10.1186/s12859-020-03688-y.