• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

耦合突变发现器:一种新的基于熵的方法,用于量化系统发育噪声以检测补偿突变。

Coupled mutation finder: a new entropy-based method quantifying phylogenetic noise for the detection of compensatory mutations.

机构信息

Institute of Computer Science, University of Göttingen, Goldschmidtstr. 7, Göttingen, 37077, Germany.

出版信息

BMC Bioinformatics. 2012 Sep 11;13:225. doi: 10.1186/1471-2105-13-225.

DOI:10.1186/1471-2105-13-225
PMID:22963049
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3577461/
Abstract

BACKGROUND

The detection of significant compensatory mutation signals in multiple sequence alignments (MSAs) is often complicated by noise. A challenging problem in bioinformatics is remains the separation of significant signals between two or more non-conserved residue sites from the phylogenetic noise and unrelated pair signals. Determination of these non-conserved residue sites is as important as the recognition of strictly conserved positions for understanding of the structural basis of protein functions and identification of functionally important residue regions. In this study, we developed a new method, the Coupled Mutation Finder (CMF) quantifying the phylogenetic noise for the detection of compensatory mutations.

RESULTS

To demonstrate the effectiveness of this method, we analyzed essential sites of two human proteins: epidermal growth factor receptor (EGFR) and glucokinase (GCK). Our results suggest that the CMF is able to separate significant compensatory mutation signals from the phylogenetic noise and unrelated pair signals. The vast majority of compensatory mutation sites found by the CMF are related to essential sites of both proteins and they are likely to affect protein stability or functionality.

CONCLUSIONS

The CMF is a new method, which includes an MSA-specific statistical model based on multiple testing procedures that quantify the error made in terms of the false discovery rate and a novel entropy-based metric to upscale BLOSUM62 dissimilar compensatory mutations. Therefore, it is a helpful tool to predict and investigate compensatory mutation sites of structural or functional importance in proteins. We suggest that the CMF could be used as a novel automated function prediction tool that is required for a better understanding of the structural basis of proteins. The CMF server is freely accessible at http://cmf.bioinf.med.uni-goettingen.de.

摘要

背景

在多重序列比对 (MSA) 中检测显著的补偿突变信号通常会受到噪声的干扰。生物信息学中的一个挑战性问题仍然是从系统发育噪声和不相关的对信号中分离两个或多个非保守残基位点之间的显著信号。确定这些非保守残基位点对于理解蛋白质功能的结构基础和识别功能重要的残基区域与识别严格保守的位置同样重要。在这项研究中,我们开发了一种新方法,即耦合突变发现器 (CMF),用于量化检测补偿突变的系统发育噪声。

结果

为了证明该方法的有效性,我们分析了两个人类蛋白质的必需位点:表皮生长因子受体 (EGFR) 和葡萄糖激酶 (GCK)。我们的结果表明,CMF 能够将显著的补偿突变信号与系统发育噪声和不相关的对信号区分开来。CMF 发现的绝大多数补偿突变位点与这两种蛋白质的必需位点有关,它们可能会影响蛋白质的稳定性或功能。

结论

CMF 是一种新方法,它包括一个基于多重测试程序的 MSA 特定的统计模型,该模型以假发现率来量化错误,并使用一种新的基于熵的度量来放大 BLOSUM62 不相似的补偿突变。因此,它是预测和研究蛋白质结构或功能重要性补偿突变位点的有用工具。我们建议将 CMF 用作一种新的自动化功能预测工具,以更好地理解蛋白质的结构基础。CMF 服务器可在 http://cmf.bioinf.med.uni-goettingen.de 免费访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/6874dc0483b7/1471-2105-13-225-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/5e49b646d7cb/1471-2105-13-225-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/4e32ec12193c/1471-2105-13-225-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/c82ca03bfd0a/1471-2105-13-225-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/3c6df6d7d7a4/1471-2105-13-225-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/ed4f022c9283/1471-2105-13-225-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/6874dc0483b7/1471-2105-13-225-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/5e49b646d7cb/1471-2105-13-225-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/4e32ec12193c/1471-2105-13-225-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/c82ca03bfd0a/1471-2105-13-225-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/3c6df6d7d7a4/1471-2105-13-225-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/ed4f022c9283/1471-2105-13-225-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0b/3577461/6874dc0483b7/1471-2105-13-225-6.jpg

相似文献

1
Coupled mutation finder: a new entropy-based method quantifying phylogenetic noise for the detection of compensatory mutations.耦合突变发现器:一种新的基于熵的方法,用于量化系统发育噪声以检测补偿突变。
BMC Bioinformatics. 2012 Sep 11;13:225. doi: 10.1186/1471-2105-13-225.
2
Quantum coupled mutation finder: predicting functionally or structurally important sites in proteins using quantum Jensen-Shannon divergence and CUDA programming.量子耦合突变发现器:使用量子 Jensen-Shannon 散度和 CUDA 编程预测蛋白质中的功能或结构重要位点。
BMC Bioinformatics. 2014 Apr 3;15:96. doi: 10.1186/1471-2105-15-96.
3
Using information theory to search for co-evolving residues in proteins.利用信息论寻找蛋白质中共同进化的残基。
Bioinformatics. 2005 Nov 15;21(22):4116-24. doi: 10.1093/bioinformatics/bti671. Epub 2005 Sep 13.
4
Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.CCMgen 通过合成蛋白比对量化残基残基接触预测中的噪声。
PLoS Comput Biol. 2018 Nov 5;14(11):e1006526. doi: 10.1371/journal.pcbi.1006526. eCollection 2018 Nov.
5
H2r: identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments.H2r:通过对多序列比对进行基于熵的分析来识别进化上重要的残基。
BMC Bioinformatics. 2008 Mar 18;9:151. doi: 10.1186/1471-2105-9-151.
6
Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction.不受系统发育或熵影响的互信息显著改善了残基接触预测。
Bioinformatics. 2008 Feb 1;24(3):333-40. doi: 10.1093/bioinformatics/btm604. Epub 2007 Dec 5.
7
Improved multiple sequence alignments using coupled pattern mining.使用耦合模式挖掘改进多序列比对。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Sep-Oct;10(5):1098-112. doi: 10.1109/TCBB.2013.36.
8
Using multiple interdependency to separate functional from phylogenetic correlations in protein alignments.利用多重相互依赖性在蛋白质比对中区分功能相关性与系统发育相关性。
Bioinformatics. 2003 Apr 12;19(6):750-5. doi: 10.1093/bioinformatics/btg072.
9
Phylogenetic and Other Conservation-Based Approaches to Predict Protein Functional Sites.基于系统发育和其他保护的方法来预测蛋白质功能位点。
Methods Mol Biol. 2018;1762:51-69. doi: 10.1007/978-1-4939-7756-7_4.
10
PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.PSICOV:使用基于稀疏逆协方差估计的大型多重序列比对进行精确结构接触预测。
Bioinformatics. 2012 Jan 15;28(2):184-90. doi: 10.1093/bioinformatics/btr638. Epub 2011 Nov 17.

引用本文的文献

1
MIDESP: Mutual Information-Based Detection of Epistatic SNP Pairs for Qualitative and Quantitative Phenotypes.MIDESP:基于互信息的定性和定量表型上位性SNP对检测
Biology (Basel). 2021 Sep 16;10(9):921. doi: 10.3390/biology10090921.
2
Extensive expansion of A1 family aspartic proteinases in fungi revealed by evolutionary analyses of 107 complete eukaryotic proteomes.通过对107个完整真核生物蛋白质组的进化分析揭示真菌中A1家族天冬氨酸蛋白酶的广泛扩张
Genome Biol Evol. 2014 Jun;6(6):1480-94. doi: 10.1093/gbe/evu110.
3
H2rs: deducing evolutionary and functionally important residue positions by means of an entropy and similarity based analysis of multiple sequence alignments.

本文引用的文献

1
Glucokinase (GCK) mutations and their characterization in MODY2 children of southern Italy.意大利南部 MODY2 儿童的葡萄糖激酶(GCK)突变及其特征。
PLoS One. 2012;7(6):e38906. doi: 10.1371/journal.pone.0038906. Epub 2012 Jun 20.
2
Functional characterization of MODY2 mutations highlights the importance of the fine-tuning of glucokinase and its role in glucose sensing.MODY2 突变的功能特征强调了葡萄糖激酶精细调节及其在葡萄糖感应中的作用的重要性。
PLoS One. 2012;7(1):e30518. doi: 10.1371/journal.pone.0030518. Epub 2012 Jan 24.
3
New methods to measure residues coevolution in proteins.
H2rs:通过对多个序列比对进行基于熵和相似性的分析来推断进化和功能重要的残基位置。
BMC Bioinformatics. 2014 Apr 27;15:118. doi: 10.1186/1471-2105-15-118.
4
Quantum coupled mutation finder: predicting functionally or structurally important sites in proteins using quantum Jensen-Shannon divergence and CUDA programming.量子耦合突变发现器:使用量子 Jensen-Shannon 散度和 CUDA 编程预测蛋白质中的功能或结构重要位点。
BMC Bioinformatics. 2014 Apr 3;15:96. doi: 10.1186/1471-2105-15-96.
用于测量蛋白质中残基共进化的新方法。
BMC Bioinformatics. 2011 May 26;12:206. doi: 10.1186/1471-2105-12-206.
4
Ensembl 2011.Ensembl 2011年版
Nucleic Acids Res. 2011 Jan;39(Database issue):D800-6. doi: 10.1093/nar/gkq1064. Epub 2010 Nov 2.
5
Computing and visually analyzing mutual information in molecular co-evolution.计算和可视化分析分子共进化中的互信息。
BMC Bioinformatics. 2010 Jun 17;11:330. doi: 10.1186/1471-2105-11-330.
6
Sequence and structure signatures of cancer mutation hotspots in protein kinases.癌症激酶蛋白热点突变的序列和结构特征。
PLoS One. 2009 Oct 16;4(10):e7485. doi: 10.1371/journal.pone.0007485.
7
Integration of evolutionary features for the identification of functionally important residues in major facilitator superfamily transporters.整合进化特征以鉴定主要转运子超家族转运蛋白中功能重要的残基。
PLoS Comput Biol. 2009 Oct;5(10):e1000522. doi: 10.1371/journal.pcbi.1000522. Epub 2009 Oct 2.
8
Quantitative prediction of fold resistance for inhibitors of EGFR.表皮生长因子受体(EGFR)抑制剂的抗折叠性定量预测
Biochemistry. 2009 Sep 8;48(35):8435-48. doi: 10.1021/bi900729a.
9
Why should we care about molecular coevolution?我们为什么要关注分子协同进化?
Evol Bioinform Online. 2008 Feb 14;4:29-38.
10
Prediction by graph theoretic measures of structural effects in proteins arising from non-synonymous single nucleotide polymorphisms.利用图论方法预测非同义单核苷酸多态性引起的蛋白质结构效应
PLoS Comput Biol. 2008 Jul 25;4(7):e1000135. doi: 10.1371/journal.pcbi.1000135.