• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

H2rs:通过对多个序列比对进行基于熵和相似性的分析来推断进化和功能重要的残基位置。

H2rs: deducing evolutionary and functionally important residue positions by means of an entropy and similarity based analysis of multiple sequence alignments.

机构信息

Institute of Biophysics and Physical Biochemistry, University of Regensburg, D-93040 Regensburg, Germany.

出版信息

BMC Bioinformatics. 2014 Apr 27;15:118. doi: 10.1186/1471-2105-15-118.

DOI:10.1186/1471-2105-15-118
PMID:24766829
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4021312/
Abstract

BACKGROUND

The identification of functionally important residue positions is an important task of computational biology. Methods of correlation analysis allow for the identification of pairs of residue positions, whose occupancy is mutually dependent due to constraints imposed by protein structure or function. A common measure assessing these dependencies is the mutual information, which is based on Shannon's information theory that utilizes probabilities only. Consequently, such approaches do not consider the similarity of residue pairs, which may degrade the algorithm's performance. One typical algorithm is H2r, which characterizes each individual residue position k by the conn(k)-value, which is the number of significantly correlated pairs it belongs to.

RESULTS

To improve specificity of H2r, we developed a revised algorithm, named H2rs, which is based on the von Neumann entropy (vNE). To compute the corresponding mutual information, a matrix A is required, which assesses the similarity of residue pairs. We determined A by deducing substitution frequencies from contacting residue pairs observed in the homologs of 35 809 proteins, whose structure is known. In analogy to H2r, the enhanced algorithm computes a normalized conn(k)-value. Within the framework of H2rs, only statistically significant vNE values were considered. To decide on significance, the algorithm calculates a p-value by performing a randomization test for each individual pair of residue positions. The analysis of a large in silico testbed demonstrated that specificity and precision were higher for H2rs than for H2r and two other methods of correlation analysis. The gain in prediction quality is further confirmed by a detailed assessment of five well-studied enzymes. The outcome of H2rs and of a method that predicts contacting residue positions (PSICOV) overlapped only marginally. H2rs can be downloaded from http://www-bioinf.uni-regensburg.de.

CONCLUSIONS

Considering substitution frequencies for residue pairs by means of the von Neumann entropy and a p-value improved the success rate in identifying important residue positions. The integration of proven statistical concepts and normalization allows for an easier comparison of results obtained with different proteins. Comparing the outcome of the local method H2rs and of the global method PSICOV indicates that such methods supplement each other and have different scopes of application.

摘要

背景

功能重要残基位置的鉴定是计算生物学的一项重要任务。相关分析方法可用于鉴定残基对,由于蛋白质结构或功能的限制,这些残基对的占据是相互依赖的。一种常见的评估这些依赖性的度量是互信息,它基于香农信息论,仅使用概率。因此,这些方法不考虑残基对的相似性,这可能会降低算法的性能。一种典型的算法是 H2r,它通过 conn(k)-值来描述每个单独的残基位置 k,该值是其所属的显著相关对的数量。

结果

为了提高 H2r 的特异性,我们开发了一种名为 H2rs 的改进算法,该算法基于冯·诺依曼熵(vNE)。为了计算相应的互信息,需要一个评估残基对相似性的矩阵 A。我们通过从已知结构的 35809 个同源蛋白质中观察到的接触残基对中推导出取代频率来确定 A。与 H2r 类似,增强算法计算归一化 conn(k)-值。在 H2rs 的框架内,仅考虑具有统计学意义的 vNE 值。为了确定显著性,该算法通过对每个残基对进行随机化测试来计算 p 值。对大型计算机测试库的分析表明,H2rs 的特异性和精度均高于 H2r 和其他两种相关分析方法。对五个研究充分的酶的详细评估进一步证实了预测质量的提高。H2rs 的结果与预测接触残基位置的方法(PSICOV)的结果仅略有重叠。H2rs 可从 http://www-bioinf.uni-regensburg.de 下载。

结论

通过冯·诺依曼熵和 p 值考虑残基对的取代频率,提高了识别重要残基位置的成功率。结合经过验证的统计概念和归一化,允许更容易地比较使用不同蛋白质获得的结果。比较局部方法 H2rs 和全局方法 PSICOV 的结果表明,这些方法相互补充,具有不同的应用范围。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/ac06e37a1f06/1471-2105-15-118-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/be37e4d2a50f/1471-2105-15-118-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/dcc3012faaa4/1471-2105-15-118-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/3c95fa2bf121/1471-2105-15-118-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/7c25f106f98e/1471-2105-15-118-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/ea316a7f8760/1471-2105-15-118-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/ac06e37a1f06/1471-2105-15-118-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/be37e4d2a50f/1471-2105-15-118-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/dcc3012faaa4/1471-2105-15-118-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/3c95fa2bf121/1471-2105-15-118-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/7c25f106f98e/1471-2105-15-118-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/ea316a7f8760/1471-2105-15-118-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b01/4021312/ac06e37a1f06/1471-2105-15-118-6.jpg

相似文献

1
H2rs: deducing evolutionary and functionally important residue positions by means of an entropy and similarity based analysis of multiple sequence alignments.H2rs:通过对多个序列比对进行基于熵和相似性的分析来推断进化和功能重要的残基位置。
BMC Bioinformatics. 2014 Apr 27;15:118. doi: 10.1186/1471-2105-15-118.
2
H2r: identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments.H2r:通过对多序列比对进行基于熵的分析来识别进化上重要的残基。
BMC Bioinformatics. 2008 Mar 18;9:151. doi: 10.1186/1471-2105-9-151.
3
CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.CLIPS-1D:分析多重序列比对,推断残基位置在催化、配体结合或蛋白质结构中的作用。
BMC Bioinformatics. 2012 Apr 5;13:55. doi: 10.1186/1471-2105-13-55.
4
Experimental assessment of the importance of amino acid positions identified by an entropy-based correlation analysis of multiple-sequence alignments.基于多重序列比对的熵相关性分析鉴定的氨基酸位置重要性的实验评估。
Biochemistry. 2012 Jul 17;51(28):5633-41. doi: 10.1021/bi300747r. Epub 2012 Jul 6.
5
Estimating residue evolutionary conservation by introducing von Neumann entropy and a novel gap-treating approach.通过引入冯·诺依曼熵和一种新颖的缺口处理方法来估计残基进化保守性。
Amino Acids. 2008 Aug;35(2):495-501. doi: 10.1007/s00726-007-0586-0. Epub 2007 Aug 21.
6
PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.PSICOV:使用基于稀疏逆协方差估计的大型多重序列比对进行精确结构接触预测。
Bioinformatics. 2012 Jan 15;28(2):184-90. doi: 10.1093/bioinformatics/btr638. Epub 2011 Nov 17.
7
Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction.不受系统发育或熵影响的互信息显著改善了残基接触预测。
Bioinformatics. 2008 Feb 1;24(3):333-40. doi: 10.1093/bioinformatics/btm604. Epub 2007 Dec 5.
8
AL2CO: calculation of positional conservation in a protein sequence alignment.AL2CO:蛋白质序列比对中位置保守性的计算
Bioinformatics. 2001 Aug;17(8):700-12. doi: 10.1093/bioinformatics/17.8.700.
9
Improving the alignment quality of consistency based aligners with an evaluation function using synonymous protein words.利用同义蛋白质词的评估函数提高一致性比对器的比对质量。
PLoS One. 2011;6(12):e27872. doi: 10.1371/journal.pone.0027872. Epub 2011 Dec 2.
10
Estimates of statistical significance for comparison of individual positions in multiple sequence alignments.用于比较多序列比对中各个位置的统计学显著性估计。
BMC Bioinformatics. 2004 Aug 5;5:106. doi: 10.1186/1471-2105-5-106.

引用本文的文献

1
Deep Analysis of Residue Constraints (DARC): identifying determinants of protein functional specificity.深度解析残基约束(DARC):鉴定蛋白质功能特异性的决定因素。
Sci Rep. 2020 Feb 3;10(1):1691. doi: 10.1038/s41598-019-55118-6.
2
A Single Mutation Increases the Thermostability and Activity of Amine Transaminase.单个突变可提高胺转氨酶的热稳定性和活性。
Molecules. 2019 Mar 27;24(7):1194. doi: 10.3390/molecules24071194.
3
Molecular dynamics and structure function analysis show that substrate binding and specificity are major forces in the functional diversification of Eqolisins.

本文引用的文献

1
CLIPS-4D: a classifier that distinguishes structurally and functionally important residue-positions based on sequence and 3D data.CLIPS-4D:一种基于序列和 3D 数据区分结构和功能重要残基位置的分类器。
Bioinformatics. 2013 Dec 1;29(23):3029-35. doi: 10.1093/bioinformatics/btt519. Epub 2013 Sep 18.
2
MISTIC: Mutual information server to infer coevolution.MISTIC:互信息服务器推断共进化。
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W8-14. doi: 10.1093/nar/gkt427. Epub 2013 May 28.
3
Emerging methods in protein co-evolution.
分子动力学和结构功能分析表明,底物结合和特异性是 Eqolisin 功能多样化的主要力量。
BMC Bioinformatics. 2018 Sep 24;19(1):338. doi: 10.1186/s12859-018-2348-2.
4
Inferring joint sequence-structural determinants of protein functional specificity.推断蛋白质功能特异性的关节序列结构决定因素。
Elife. 2018 Jan 16;7:e29880. doi: 10.7554/eLife.29880.
5
Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations.基于统计相关性推断功能相关的N-乙酰转移酶残基
PLoS Comput Biol. 2016 Dec 21;12(12):e1005294. doi: 10.1371/journal.pcbi.1005294. eCollection 2016 Dec.
6
Emerging Computational Methods for the Rational Discovery of Allosteric Drugs.用于变构药物合理发现的新兴计算方法
Chem Rev. 2016 Jun 8;116(11):6370-90. doi: 10.1021/acs.chemrev.5b00631. Epub 2016 Apr 13.
蛋白质共进化的新兴方法。
Nat Rev Genet. 2013 Apr;14(4):249-61. doi: 10.1038/nrg3414. Epub 2013 Mar 5.
4
MAFFT multiple sequence alignment software version 7: improvements in performance and usability.MAFFT 多序列比对软件版本 7:性能和易用性的改进。
Mol Biol Evol. 2013 Apr;30(4):772-80. doi: 10.1093/molbev/mst010. Epub 2013 Jan 16.
5
Highly covarying residues have a functional role in antibody constant domains.高度变异性残基在抗体恒定域中具有功能作用。
Proteins. 2013 May;81(5):884-95. doi: 10.1002/prot.24247. Epub 2013 Feb 27.
6
Update on activities at the Universal Protein Resource (UniProt) in 2013.2013 年 泛蛋白资源库(UniProt)活动更新。
Nucleic Acids Res. 2013 Jan;41(Database issue):D43-7. doi: 10.1093/nar/gks1068. Epub 2012 Nov 17.
7
Protein structure prediction from sequence variation.从序列变异预测蛋白质结构。
Nat Biotechnol. 2012 Nov;30(11):1072-80. doi: 10.1038/nbt.2419.
8
Reliable and robust detection of coevolving protein residues.可靠且稳健的共进化蛋白质残基检测。
Protein Eng Des Sel. 2012 Nov;25(11):705-13. doi: 10.1093/protein/gzs081. Epub 2012 Oct 16.
9
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction.解析进化信号:保守性、特异性决定位置和共进化。对催化残基预测的启示。
BMC Bioinformatics. 2012 Sep 14;13:235. doi: 10.1186/1471-2105-13-235.
10
Coupled mutation finder: a new entropy-based method quantifying phylogenetic noise for the detection of compensatory mutations.耦合突变发现器:一种新的基于熵的方法,用于量化系统发育噪声以检测补偿突变。
BMC Bioinformatics. 2012 Sep 11;13:225. doi: 10.1186/1471-2105-13-225.