使用双线性核的快速蛋白质片段相似性评分。

Fast protein fragment similarity scoring using a Binet-Cauchy kernel.

机构信息

Univ Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques in Silico, UMR 973, F-75205 Paris, France, INSERM, U973, F-75205 Paris, France and Univ Paris Diderot, Ressources Parisiennes de Bioinformatique Structurale, F-75205 Paris, France.

出版信息

Bioinformatics. 2014 Mar 15;30(6):784-91. doi: 10.1093/bioinformatics/btt618. Epub 2013 Oct 27.

DOI:10.1093/bioinformatics/btt618

PMID:24167157

Abstract

MOTIVATION

Meaningful scores to assess protein structure similarity are essential to decipher protein structure and sequence evolution. The mining of the increasing number of protein structures requires fast and accurate similarity measures with statistical significance. Whereas numerous approaches have been proposed for protein domains as a whole, the focus is progressively moving to a more local level of structure analysis for which similarity measurement still remains without any satisfactory answer.

RESULTS

We introduce a new score based on Binet-Cauchy kernel. It is normalized and bounded between 1-maximal similarity that implies exactly the same conformations for protein fragments-and -1-mirror image conformations, the unrelated conformations having a null mean score. This allows for the search of both similar and mirror conformations. In addition, such score addresses two major issue of the widely used root mean square deviation (RMSD). First, it achieves length independent statistics even for short fragments. Second, it shows better performance in the discrimination of medium range RMSD values. Being simpler and faster to compute than the RMSD, it also provides the means for large-scale mining of protein structures.

AVAILABILITY AND IMPLEMENTATION

The computer software implementing the score is available at http://bioserv.rpbs.univ-paris-diderot.fr/BCscore/

CONTACT

frederic.guyon@univ-paris-diderot.fr

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

评估蛋白质结构相似性的有意义分数对于破译蛋白质结构和序列进化至关重要。随着越来越多的蛋白质结构被挖掘，需要快速准确的具有统计学意义的相似性度量。虽然已经提出了许多用于整个蛋白质域的方法，但重点逐渐转移到更局部的结构分析层面，而对于这种分析，相似性测量仍然没有令人满意的答案。

结果

我们引入了一种基于 Binet-Cauchy 核的新分数。它是归一化的，并且在 1-最大相似性（对于蛋白质片段意味着完全相同的构象）和-1-镜像构象之间有界，不相关的构象具有零均值分数。这允许搜索相似构象和镜像构象。此外，该分数解决了广泛使用的均方根偏差（RMSD）的两个主要问题。首先，它甚至可以对短片段实现长度独立的统计。其次，它在区分中等 RMSD 值方面表现出更好的性能。由于比 RMSD 更简单、更快，它还为蛋白质结构的大规模挖掘提供了手段。

可用性和实现

实现该分数的计算机软件可在 http://bioserv.rpbs.univ-paris-diderot.fr/BCscore/ 上获得。

联系人

frederic.guyon@univ-paris-diderot.fr

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

Fast protein fragment similarity scoring using a Binet-Cauchy kernel.使用双线性核的快速蛋白质片段相似性评分。

Bioinformatics. 2014 Mar 15;30(6):784-91. doi: 10.1093/bioinformatics/btt618. Epub 2013 Oct 27.

BCSearch: fast structural fragment mining over large collections of protein structures.BCSearch：在大量蛋白质结构集合上进行快速结构片段挖掘。

Nucleic Acids Res. 2015 Jul 1;43(W1):W378-82. doi: 10.1093/nar/gkv492. Epub 2015 May 14.

Detecting protein candidate fragments using a structural alphabet profile comparison approach.利用结构字母表谱比较方法检测蛋白质候选片段。

PLoS One. 2013 Nov 26;8(11):e80493. doi: 10.1371/journal.pone.0080493. eCollection 2013.

Improving protein fold recognition with hybrid profiles combining sequence and structure evolution.利用结合序列和结构进化的混合剖面提高蛋白质折叠识别。

Bioinformatics. 2015 Dec 1;31(23):3782-9. doi: 10.1093/bioinformatics/btv462. Epub 2015 Aug 7.

RapidRMSD: rapid determination of RMSDs corresponding to motions of flexible molecules.RapidRMSD：对应柔性分子运动的 RMSD 的快速确定。

Bioinformatics. 2018 Aug 15;34(16):2757-2765. doi: 10.1093/bioinformatics/bty160.

HHalign-Kbest: exploring sub-optimal alignments for remote homology comparative modeling.HHalign-Kbest：探索远程同源性比较建模的次优比对。

Bioinformatics. 2015 Dec 1;31(23):3850-2. doi: 10.1093/bioinformatics/btv441. Epub 2015 Jul 30.

A novel exhaustive search algorithm for predicting the conformation of polypeptide segments in proteins.一种用于预测蛋白质中多肽片段构象的新型穷举搜索算法。

Proteins. 2000 Jul 1;40(1):135-44.

PEP-FOLD: an online resource for de novo peptide structure prediction.PEP-FOLD：一种用于从头预测肽结构的在线资源。

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W498-503. doi: 10.1093/nar/gkp323. Epub 2009 May 11.

PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex.PEP-FOLD3：用于溶液中和复合物中线性肽的更快的从头结构预测。

Nucleic Acids Res. 2016 Jul 8;44(W1):W449-54. doi: 10.1093/nar/gkw329. Epub 2016 Apr 29.

fRMSDPred: predicting local RMSD between structural fragments using sequence information.fRMSDPred：利用序列信息预测结构片段之间的局部均方根偏差。

Comput Syst Bioinformatics Conf. 2007;6:311-22.

引用本文的文献

PatchSearch: a web server for off-target protein identification.PatchSearch：一个用于识别脱靶蛋白的网络服务器。

Nucleic Acids Res. 2019 Jul 2;47(W1):W365-W372. doi: 10.1093/nar/gkz478.

DaReUS-Loop: accurate loop modeling using fragments from remote or unrelated proteins.DaReUS-Loop：使用来自远程或不相关蛋白质的片段进行准确的环建模。

Sci Rep. 2018 Sep 12;8(1):13673. doi: 10.1038/s41598-018-32079-w.

PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex.PEP-FOLD3：用于溶液中和复合物中线性肽的更快的从头结构预测。

Nucleic Acids Res. 2016 Jul 8;44(W1):W449-54. doi: 10.1093/nar/gkw329. Epub 2016 Apr 29.

Comparisons of Allergenic and Metazoan Parasite Proteins: Allergy the Price of Immunity.变应原与后生动物寄生虫蛋白质的比较：过敏——免疫的代价

PLoS Comput Biol. 2015 Oct 29;11(10):e1004546. doi: 10.1371/journal.pcbi.1004546. eCollection 2015 Oct.

Amplitude spectrum distance: measuring the global shape divergence of protein fragments.振幅谱距离：测量蛋白质片段的整体形状差异。

BMC Bioinformatics. 2015 Aug 14;16:256. doi: 10.1186/s12859-015-0693-y.

BCSearch: fast structural fragment mining over large collections of protein structures.BCSearch：在大量蛋白质结构集合上进行快速结构片段挖掘。

Nucleic Acids Res. 2015 Jul 1;43(W1):W378-82. doi: 10.1093/nar/gkv492. Epub 2015 May 14.

The OPEP protein model: from single molecules, amyloid formation, crowding and hydrodynamics to DNA/RNA systems.OPEP蛋白模型：从单分子、淀粉样蛋白形成、拥挤效应和流体动力学到DNA/RNA系统

Chem Soc Rev. 2014 Jul 7;43(13):4871-93. doi: 10.1039/c4cs00048j. Epub 2014 Apr 23.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用双线性核的快速蛋白质片段相似性评分。

Fast protein fragment similarity scoring using a Binet-Cauchy kernel.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

CONTACT

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

联系人

补充信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献