Suppr超能文献

基于信息准则的蛋白质铰链更快更准确估计

Faster and More Accurate Estimation of Protein Hinges Based on Information Criteria.

作者信息

Koyano Bunsho, Shibuya Tetsuo

机构信息

Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo, Japan.

Human Genome Center, Institute of Medical Science, The University of Tokyo, Japan.

出版信息

J Comput Biol. 2025 May;32(5):498-519. doi: 10.1089/cmb.2024.0731. Epub 2025 Apr 28.

Abstract

Protein hinges are flexible parts connecting several rigid substructures of proteins that are crucial to determine protein function. Various methods have been developed for efficiently and accurately estimating protein hinge positions by comparing two different conformations of the same protein for a growing number of protein structures. However, few studies have focused on accurately estimating the number of hinges, and it is required to accurately estimate both the number and positions of hinges. We propose faster and more accurate algorithms for estimating the number and positions of hinges by utilizing information criteria that run in ()-time, where is the protein length. Our algorithms utilize Bayesian Information Criterion (BIC) or Akaike's Information Criterion based on a newly proposed -hinge structure generation model that models the hinge motions between two protein conformations. Our exact algorithm based on BIC outperformed the most accurate previous method in terms of both hinge number and position accuracy on our simulation dataset. Our exact algorithm was approximately as fast as the previous fastest method, DynDom, on our simulation dataset. We evaluated the hinge number and position accuracy of our exact algorithm and previous methods on one hinge-annotated dataset. The hinge number and position accuracy of our exact algorithm were comparable to the most accurate previous method on the hinge-annotated dataset. We further propose even faster ()-time heuristic algorithms, where is the protein length. Our heuristic algorithm achieved almost the same hinge number and position accuracy as our exact algorithm, and was over 18 times faster than our exact algorithm and DynDom.

摘要

蛋白质铰链是连接蛋白质几个刚性子结构的柔性部分,对确定蛋白质功能至关重要。随着蛋白质结构数量的不断增加,已经开发出各种方法,通过比较同一蛋白质的两种不同构象来高效、准确地估计蛋白质铰链位置。然而,很少有研究专注于准确估计铰链的数量,而准确估计铰链的数量和位置都是必要的。我们提出了更快、更准确的算法,通过利用运行时间为O(n)(其中n是蛋白质长度)的信息准则来估计铰链的数量和位置。我们的算法基于一种新提出的n-铰链结构生成模型,该模型对两种蛋白质构象之间的铰链运动进行建模,利用贝叶斯信息准则(BIC)或赤池信息准则。在我们的模拟数据集上,基于BIC的精确算法在铰链数量和位置准确性方面均优于之前最准确的方法。在我们的模拟数据集上,我们的精确算法速度与之前最快的方法DynDom大致相同。我们在一个带有铰链注释的数据集上评估了我们的精确算法和之前方法的铰链数量和位置准确性。在该铰链注释数据集上,我们精确算法的铰链数量和位置准确性与之前最准确的方法相当。我们进一步提出了更快的O(n)时间启发式算法,其中n是蛋白质长度。我们的启发式算法在铰链数量和位置准确性方面几乎与我们的精确算法相同,并且比我们的精确算法和DynDom快18倍以上。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验