Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania.
Nat Commun. 2024 Aug 24;15(1):7305. doi: 10.1038/s41467-024-51669-z.
With protein databases growing rapidly due to advances in structural and computational biology, the ability to accurately align and rapidly search protein structures has become essential for biological research. In response to the challenge posed by vast protein structure repositories, GTalign offers an innovative solution to protein structure alignment and search-an algorithm that achieves optimal superposition at high speeds. Through the design and implementation of spatial structure indexing, GTalign parallelizes all stages of superposition search across residues and protein structure pairs, yielding rapid identification of optimal superpositions. Rigorous evaluation across diverse datasets reveals GTalign as the most accurate among structure aligners while presenting orders of magnitude in speedup at state-of-the-art accuracy. GTalign's high speed and accuracy make it useful for numerous applications, including functional inference, evolutionary analyses, protein design, and drug discovery, contributing to advancing understanding of protein structure and function.
随着结构和计算生物学的进步,蛋白质数据库迅速增长,准确对齐和快速搜索蛋白质结构已成为生物研究的关键。针对庞大的蛋白质结构库带来的挑战,GTalign 提供了一种创新的蛋白质结构对齐和搜索解决方案——一种高速实现最佳叠加的算法。通过设计和实现空间结构索引,GTalign 对残基和蛋白质结构对的所有叠加搜索阶段进行并行化,从而快速确定最佳叠加。在各种数据集上的严格评估表明,GTalign 是结构对齐器中最准确的,同时在当前最先进的精度水平上实现了数量级的加速。GTalign 的高速和准确性使其在许多应用中非常有用,包括功能推断、进化分析、蛋白质设计和药物发现,有助于提高对蛋白质结构和功能的理解。