检测蛋白质结构域内部及之间的协同进化。

Detecting coevolution in and among protein domains.

作者信息

Yeang Chen-Hsiang, Haussler David

机构信息

Simons Center for Systems Biology, Institute for Advanced Study, Princeton, New Jersey, United States of America.

出版信息

PLoS Comput Biol. 2007 Nov;3(11):e211. doi: 10.1371/journal.pcbi.0030211. Epub 2007 Sep 18.

DOI:10.1371/journal.pcbi.0030211

PMID:17983264

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2098842/

Abstract

Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level.

摘要

核酸或氨基酸的相关变化为分子的结构和相互作用提供了有力信息。尽管在共进化序列分析方面有丰富的文献，但先前的方法往往不得不在通用性、简单性、系统发育信息以及关于相互作用的特定知识之间进行权衡。此外，尽管在选定的蛋白质家族中有共进化的证据，但仍缺乏对所有蛋白质结构域之间共进化的全面筛选。我们提出了一种用于序列共进化的增强型连续时间马尔可夫过程模型。该模型可以处理不同类型的相互作用，纳入系统发育信息和序列替换，只有一个额外的自由参数，并且不需要关于相互作用规则的知识。我们将此模型应用于对整个蛋白质结构域数据库（Pfam）的大规模筛选。令人惊讶的是，在执行了1万亿次测试的情况下，大多数推断出的共进化蛋白质结构域在功能上是相关的，并且共进化的氨基酸残基在空间上是耦合的。此外，许多共进化位点位于蛋白质/蛋白质复合物的功能重要位点上，例如超氧化物歧化酶的亚基连接区、核糖体的tRNA结合位点、RNA聚合酶的DNA结合区域以及各种酶的活性和配体结合位点。结果表明序列共进化体现了蛋白质的结构和功能限制。序列共进化与各种选择限制之间的复杂关系值得在更深层次上进行探究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3328/2098842/690b6dfdb076/pcbi.0030211.g001.jpg

相似文献

Detecting coevolution in and among protein domains.检测蛋白质结构域内部及之间的协同进化。

PLoS Comput Biol. 2007 Nov;3(11):e211. doi: 10.1371/journal.pcbi.0030211. Epub 2007 Sep 18.

Improving homology models for protein-ligand binding sites.改进蛋白质-配体结合位点的同源模型。

Comput Syst Bioinformatics Conf. 2008;7:211-22.

Relating destabilizing regions to known functional sites in proteins.将不稳定区域与蛋白质中已知的功能位点相关联。

BMC Bioinformatics. 2007 Apr 30;8:141. doi: 10.1186/1471-2105-8-141.

A novel method for detecting intramolecular coevolution: adding a further dimension to selective constraints analyses.一种检测分子内协同进化的新方法：为选择性限制分析增添新维度。

Genetics. 2006 May;173(1):9-23. doi: 10.1534/genetics.105.053249. Epub 2006 Mar 17.

Prediction of protein interdomain linker regions by a hidden Markov model.利用隐马尔可夫模型预测蛋白质结构域间连接区域

Bioinformatics. 2005 May 15;21(10):2264-70. doi: 10.1093/bioinformatics/bti363. Epub 2005 Mar 3.

Sequence coevolution between RNA and protein characterized by mutual information between residue triplets.基于残基三联体之间互信息的 RNA 和蛋白质序列共进化特征。

PLoS One. 2012;7(1):e30022. doi: 10.1371/journal.pone.0030022. Epub 2012 Jan 18.

Bioinformatics identification of coevolving residues.共同进化残基的生物信息学鉴定。

Methods Mol Biol. 2014;1123:223-43. doi: 10.1007/978-1-62703-968-0_15.

Detecting the coevolution of biosequences--an example of RNA interaction prediction.检测生物序列的协同进化——RNA相互作用预测的一个例子。

Mol Biol Evol. 2007 Sep;24(9):2119-31. doi: 10.1093/molbev/msm142. Epub 2007 Jul 17.

MAGOS: multiple alignment and modelling server.MAGOS：多序列比对与建模服务器。

Bioinformatics. 2006 Sep 1;22(17):2164-5. doi: 10.1093/bioinformatics/btl349. Epub 2006 Jul 4.

Reducing the false positive rate in the non-parametric analysis of molecular coevolution.降低分子协同进化非参数分析中的假阳性率。

BMC Evol Biol. 2008 Apr 10;8:106. doi: 10.1186/1471-2148-8-106.

引用本文的文献

Protein Structural Phylogenetics.蛋白质结构系统发育学

Genome Biol Evol. 2025 Jul 30;17(8). doi: 10.1093/gbe/evaf139.

Identification of coevolving positions by ancestral reconstruction.通过祖先重建鉴定协同进化位点。

Commun Biol. 2025 Feb 28;8(1):329. doi: 10.1038/s42003-025-07676-x.

Mutual information networks reveal evolutionary relationships within the influenza A virus polymerase.互信息网络揭示了甲型流感病毒聚合酶内部的进化关系。

Virus Evol. 2023 May 27;9(1):vead037. doi: 10.1093/ve/vead037. eCollection 2023.

CastNet: a systems-level sequence evolution simulator.CastNet：一种系统级序列进化模拟器。

BMC Bioinformatics. 2023 Jun 12;24(1):247. doi: 10.1186/s12859-023-05366-1.

Predicting mutational function using machine learning.利用机器学习预测突变功能。

Mutat Res Rev Mutat Res. 2023 Jan-Jun;791:108457. doi: 10.1016/j.mrrev.2023.108457. Epub 2023 Mar 23.

Mutual information networks reveal evolutionary relationships within the influenza A virus polymerase.互信息网络揭示了甲型流感病毒聚合酶内部的进化关系。

bioRxiv. 2023 Feb 17:2023.02.16.528850. doi: 10.1101/2023.02.16.528850.

A Novel Information-Theory-Based Genetic Distance That Approximates Phenotypic Differences.一种基于信息论的新遗传距离，可近似表型差异。

J Comput Biol. 2023 Apr;30(4):420-431. doi: 10.1089/cmb.2022.0395. Epub 2023 Jan 3.

Local fitness and epistatic effects lead to distinct patterns of linkage disequilibrium in protein-coding genes.局部适应性和上位性效应对蛋白质编码基因中的连锁不平衡模式有显著影响。

Genetics. 2022 Jul 30;221(4). doi: 10.1093/genetics/iyac097.

Modulating Glycoside Hydrolase Activity between Hydrolysis and Transfer Reactions Using an Evolutionary Approach.利用进化方法调节糖苷水解酶在水解和转移反应之间的活性。

Molecules. 2021 Oct 30;26(21):6586. doi: 10.3390/molecules26216586.

Evolutionary conservation of RNA sequence and structure.RNA 序列和结构的进化保守性。

Wiley Interdiscip Rev RNA. 2021 Sep;12(5):e1649. doi: 10.1002/wrna.1649. Epub 2021 Mar 22.

本文引用的文献

Detecting the coevolution of biosequences--an example of RNA interaction prediction.检测生物序列的协同进化——RNA相互作用预测的一个例子。

Mol Biol Evol. 2007 Sep;24(9):2119-31. doi: 10.1093/molbev/msm142. Epub 2007 Jul 17.

Specificity in protein interactions and its relationship with sequence diversity and coevolution.蛋白质相互作用中的特异性及其与序列多样性和共同进化的关系。

Proc Natl Acad Sci U S A. 2007 May 8;104(19):7999-8004. doi: 10.1073/pnas.0609962104. Epub 2007 Apr 27.

Identification and classification of conserved RNA secondary structures in the human genome.人类基因组中保守RNA二级结构的鉴定与分类

PLoS Comput Biol. 2006 Apr;2(4):e33. doi: 10.1371/journal.pcbi.0020033. Epub 2006 Apr 21.

Genetics. 2006 May;173(1):9-23. doi: 10.1534/genetics.105.053249. Epub 2006 Mar 17.

Predicting functional gene links from phylogenetic-statistical analyses of whole genomes.通过对全基因组进行系统发育统计分析来预测功能基因联系。

PLoS Comput Biol. 2005 Jun;1(1):e3. doi: 10.1371/journal.pcbi.0010003. Epub 2005 Jun 24.

Mutual information in protein multiple sequence alignments reveals two classes of coevolving positions.蛋白质多序列比对中的互信息揭示了两类共同进化的位点。

Biochemistry. 2005 May 17;44(19):7156-65. doi: 10.1021/bi050293e.

Context dependence and coevolution among amino acid residues in proteins.蛋白质中氨基酸残基之间的上下文依赖性和协同进化。

Methods Enzymol. 2005;395:779-90. doi: 10.1016/S0076-6879(05)95040-4.

Novel catalytic mechanism of glycoside hydrolysis based on the structure of an NAD+/Mn2+ -dependent phospho-alpha-glucosidase from Bacillus subtilis.基于枯草芽孢杆菌中一种NAD⁺/Mn²⁺依赖性磷酸-α-葡萄糖苷酶的结构的新型糖苷水解催化机制

Structure. 2004 Sep;12(9):1619-29. doi: 10.1016/j.str.2004.06.020.

Architecture of the photosynthetic oxygen-evolving center.光合放氧中心的结构

Science. 2004 Mar 19;303(5665):1831-8. doi: 10.1126/science.1093087. Epub 2004 Feb 5.

Multiple sequence alignment with the Clustal series of programs.使用Clustal系列程序进行多序列比对。

Nucleic Acids Res. 2003 Jul 1;31(13):3497-500. doi: 10.1093/nar/gkg500.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

检测蛋白质结构域内部及之间的协同进化。

Detecting coevolution in and among protein domains.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献