• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向高通量、多准则蛋白质结构比较和分析。

Toward high-throughput, multicriteria protein-structure comparison and analysis.

机构信息

School of Computer Science, University of Nottingham, Nottingham NG81BB, U.K.

出版信息

IEEE Trans Nanobioscience. 2010 Jun;9(2):144-55. doi: 10.1109/TNB.2010.2043851.

DOI:10.1109/TNB.2010.2043851
PMID:20650704
Abstract

Protein-structure comparison (PSC) is an essential component of biomedical research as it impacts on, e.g., drug design, molecular docking, protein folding and structure prediction algorithms as well as being essential to the assessment of these predictions. Each of these applications, as well as many others where molecular comparison plays an important role, requires a different notion of similarity that naturally lead to the multicriteria PSC (MC-PSC) problem. Protein (Structure) Comparison, Knowledge, Similarity, and Information (ProCKSI) (www.procksi.org) provides algorithmic solutions for the MC-PSC problem by means of an enhanced structural comparison that relies on the principled application of information fusion to similarity assessments derived from multiple comparison methods. Current MC-PSC works well for moderately sized datasets and it is time consuming as it provides public service to multiple users. Many of the structural bioinformatics applications mentioned above would benefit from the ability to perform, for a dedicated user, thousands or tens of thousands of comparisons through multiple methods in real time, a capacity beyond our current technology. In this paper, we take a key step into that direction by means of a high-throughput distributed reimplementation of ProCKSI for very large datasets. The core of the proposed framework lies in the design of an innovative distributed algorithm that runs on each compute node in a cluster/grid environment to perform structure comparison of a given subset of input structures using some of the most popular PSC methods [e.g., universal similarity metric (USM), maximum contact map overlap (MaxCMO), fast alignment and search tool (FAST), distance alignment (DaliLite), combinatorial extension (CE), template modeling alignment (TMAlign)]. We follow this with a procedure of distributed consensus building. Thus, the new algorithms proposed here achieve ProCKSI's similarity assessment quality but with a fraction of the time required by it. Our results show that the proposed distributed method can be used efficiently to compare: 1) a particular protein against a very large protein structures dataset (target-against-all comparison), and 2) a particular very large-scale dataset against itself or against another very large-scale dataset (all-against-all comparison). We conclude the paper by enumerating some of the outstanding challenges for real-time MC-PSC.

摘要

蛋白质结构比较 (PSC) 是生物医学研究的一个重要组成部分,因为它会影响药物设计、分子对接、蛋白质折叠和结构预测算法等,并且对于这些预测的评估也是必不可少的。这些应用中的每一个,以及许多其他分子比较起着重要作用的应用,都需要不同的相似性概念,这自然导致了多标准 PSC(MC-PSC)问题。蛋白质(结构)比较、知识、相似性和信息(ProCKSI)(www.procksi.org)通过一种增强的结构比较,为 MC-PSC 问题提供了算法解决方案,这种比较依赖于信息融合的原则应用于从多种比较方法中得出的相似性评估。目前的 MC-PSC 适用于中等大小的数据集,而且由于它为多个用户提供公共服务,所以时间消耗很大。上述许多结构生物信息学应用程序都将受益于能够通过多种方法实时为专用用户执行数千或数万个比较的能力,这是我们当前技术所无法达到的。在本文中,我们通过对非常大的数据集进行高通量分布式重新实现 ProCKSI,朝着这个方向迈出了关键的一步。所提出框架的核心在于设计一种创新的分布式算法,该算法在集群/网格环境中的每个计算节点上运行,使用一些最流行的 PSC 方法(例如通用相似性度量 (USM)、最大接触图重叠 (MaxCMO)、快速对齐和搜索工具 (FAST)、距离对齐 (DaliLite)、组合扩展 (CE)、模板建模对齐 (TMAlign)) 对给定输入结构子集进行结构比较。然后,我们按照分布式共识构建过程进行操作。因此,这里提出的新算法实现了 ProCKSI 的相似性评估质量,但所需时间仅为其一部分。我们的结果表明,所提出的分布式方法可以有效地用于比较:1)特定蛋白质与非常大的蛋白质结构数据集(目标对所有比较),2)特定非常大规模数据集与其自身或与另一个非常大规模数据集(所有对所有比较)。最后,我们列举了实时 MC-PSC 的一些突出挑战。

相似文献

1
Toward high-throughput, multicriteria protein-structure comparison and analysis.面向高通量、多准则蛋白质结构比较和分析。
IEEE Trans Nanobioscience. 2010 Jun;9(2):144-55. doi: 10.1109/TNB.2010.2043851.
2
ProCKSI: a decision support system for Protein (structure) Comparison, Knowledge, Similarity and Information.ProCKSI:一种用于蛋白质(结构)比较、知识、相似性和信息的决策支持系统。
BMC Bioinformatics. 2007 Oct 26;8:416. doi: 10.1186/1471-2105-8-416.
3
Protein structural similarity search by Ramachandran codes.通过拉马钱德兰编码进行蛋白质结构相似性搜索。
BMC Bioinformatics. 2007 Aug 23;8:307. doi: 10.1186/1471-2105-8-307.
4
Fast tandem mass spectra-based protein identification regardless of the number of spectra or potential modifications examined.基于快速串联质谱的蛋白质鉴定,无论所检测的谱图数量或潜在修饰如何。
Bioinformatics. 2005 May 15;21(10):2177-84. doi: 10.1093/bioinformatics/bti362. Epub 2005 Mar 3.
5
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
6
Protein structure mining using a structural alphabet.使用结构字母表进行蛋白质结构挖掘。
Proteins. 2008 May 1;71(2):920-37. doi: 10.1002/prot.21776.
7
Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre.在Phyre程序中使用集成折叠识别方法探索序列/结构空间的极限。
Proteins. 2008 Feb 15;70(3):611-25. doi: 10.1002/prot.21688.
8
HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.HYPROSP II——一种基于局部预测置信度的用于蛋白质二级结构预测的基于知识的混合方法。
Bioinformatics. 2005 Aug 1;21(15):3227-33. doi: 10.1093/bioinformatics/bti524. Epub 2005 Jun 2.
9
Predicting functional sites with an automated algorithm suitable for heterogeneous datasets.使用适用于异构数据集的自动算法预测功能位点。
BMC Bioinformatics. 2005 May 13;6:116. doi: 10.1186/1471-2105-6-116.
10
Protein structure prediction based on sequence similarity.基于序列相似性的蛋白质结构预测。
Methods Mol Biol. 2009;569:129-56. doi: 10.1007/978-1-59745-524-4_7.

引用本文的文献

1
Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC.使用 pyMCPSC 进行多标准蛋白质结构比较和结构相似性分析。
PLoS One. 2018 Oct 17;13(10):e0204587. doi: 10.1371/journal.pone.0204587. eCollection 2018.
2
Efficient Multicriteria Protein Structure Comparison on Modern Processor Architectures.现代处理器架构上的高效多标准蛋白质结构比较
Biomed Res Int. 2015;2015:563674. doi: 10.1155/2015/563674. Epub 2015 Oct 28.
3
Accelerating large-scale protein structure alignments with graphics processing units.
利用图形处理单元加速大规模蛋白质结构比对
BMC Res Notes. 2012 Feb 22;5:116. doi: 10.1186/1756-0500-5-116.