利用蛋白质趋同的纠错率检测宏观进化的基因型-表型关联。

Detecting macroevolutionary genotype-phenotype associations using error-corrected rates of protein convergence.

机构信息

Institute for Molecular Plant Physiology and Biophysics, University of Würzburg, Würzburg, Germany.

Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO, USA.

出版信息

Nat Ecol Evol. 2023 Jan;7(1):155-170. doi: 10.1038/s41559-022-01932-7. Epub 2023 Jan 5.

DOI:10.1038/s41559-022-01932-7

PMID:36604553

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9834058/

Abstract

On macroevolutionary timescales, extensive mutations and phylogenetic uncertainty mask the signals of genotype-phenotype associations underlying convergent evolution. To overcome this problem, we extended the widely used framework of non-synonymous to synonymous substitution rate ratios and developed the novel metric ω, which measures the error-corrected convergence rate of protein evolution. While ω distinguishes natural selection from genetic noise and phylogenetic errors in simulation and real examples, its accuracy allows an exploratory genome-wide search of adaptive molecular convergence without phenotypic hypothesis or candidate genes. Using gene expression data, we explored over 20 million branch combinations in vertebrate genes and identified the joint convergence of expression patterns and protein sequences with amino acid substitutions in functionally important sites, providing hypotheses on undiscovered phenotypes. We further extended our method with a heuristic algorithm to detect highly repetitive convergence among computationally non-trivial higher-order phylogenetic combinations. Our approach allows bidirectional searches for genotype-phenotype associations, even in lineages that diverged for hundreds of millions of years.

摘要

在宏观进化时间尺度上，广泛的突变和系统发育不确定性掩盖了趋同进化背后基因型-表型关联的信号。为了解决这个问题，我们扩展了广泛使用的非同义到同义替换率比值的框架，并开发了新的度量ω，用于衡量蛋白质进化的纠错趋同率。虽然 ω 在模拟和真实示例中区分了自然选择、遗传噪声和系统发育错误，但它的准确性允许在没有表型假设或候选基因的情况下，对适应性分子趋同进行探索性的全基因组搜索。利用基因表达数据，我们探索了脊椎动物基因中超过 2000 万个分支组合，并确定了表达模式和蛋白质序列与功能重要位点上氨基酸替换的联合趋同，为未发现的表型提供了假设。我们进一步通过启发式算法扩展了我们的方法，以检测计算上复杂的高阶系统发育组合之间的高度重复趋同。我们的方法允许双向搜索基因型-表型关联，即使在已经分化了数亿年的谱系中也是如此。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cd4/9834058/620bf7b8e82c/41559_2022_1932_Fig1_HTML.jpg

相似文献

Detecting macroevolutionary genotype-phenotype associations using error-corrected rates of protein convergence.利用蛋白质趋同的纠错率检测宏观进化的基因型-表型关联。

Nat Ecol Evol. 2023 Jan;7(1):155-170. doi: 10.1038/s41559-022-01932-7. Epub 2023 Jan 5.

Convergent evolution of marine mammals is associated with distinct substitutions in common genes.海洋哺乳动物的趋同进化与常见基因中的独特替换有关。

Sci Rep. 2015 Nov 9;5:16550. doi: 10.1038/srep16550.

Avian phenotypic convergence is subject to low genetic constraints based on genomic evidence.基于基因组证据，鸟类表型趋同受遗传限制较低。

BMC Evol Biol. 2020 Nov 7;20(1):147. doi: 10.1186/s12862-020-01711-7.

Gene Tree Discordance Can Generate Patterns of Diminishing Convergence over Time.基因树分歧可以随时间产生趋异收敛程度降低的模式。

Mol Biol Evol. 2016 Dec;33(12):3299-3307. doi: 10.1093/molbev/msw197. Epub 2016 Sep 15.

Phenotypic Convergence Is Not Mirrored at the Protein Level in a Lizard Adaptive Radiation.表型趋同并未在蜥蜴适应性辐射的蛋白质水平上得到反映。

Mol Biol Evol. 2020 Jun 1;37(6):1604-1614. doi: 10.1093/molbev/msaa028.

A Phenotype-Genotype Codon Model for Detecting Adaptive Evolution.一种用于检测适应性进化的表型-基因型密码子模型。

Syst Biol. 2020 Jul 1;69(4):722-738. doi: 10.1093/sysbio/syz075.

Hundreds of Genes Experienced Convergent Shifts in Selective Pressure in Marine Mammals.数百个基因在海洋哺乳动物的选择压力中经历了趋同变化。

Mol Biol Evol. 2016 Sep;33(9):2182-92. doi: 10.1093/molbev/msw112. Epub 2016 Jun 21.

Quantification provides a conceptual basis for convergent evolution.量化为趋同进化提供了概念基础。

Biol Rev Camb Philos Soc. 2017 May;92(2):815-829. doi: 10.1111/brv.12257. Epub 2016 Mar 1.

Adaptive molecular convergence: Molecular evolution versus molecular phylogenetics.适应性分子趋同：分子进化与分子系统发育

Commun Integr Biol. 2010 Jan;3(1):67-9. doi: 10.4161/cib.3.1.10174.

Determining the Null Model for Detecting Adaptive Convergence from Genomic Data: A Case Study using Echolocating Mammals.确定用于从基因组数据中检测适应性趋同的零模型：以回声定位哺乳动物为例的研究。

Mol Biol Evol. 2015 May;32(5):1232-6. doi: 10.1093/molbev/msv013. Epub 2015 Jan 27.

引用本文的文献

From Trees to Traits: A Review of Advances in PhyloG2P Methods and Future Directions.从树到性状：系统发育基因到性状（PhyloG2P）方法的进展回顾与未来方向

Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf150.

Novel genomics insights into the molecular evolution of long-distance migratory mammals.关于长途迁徙哺乳动物分子进化的新基因组学见解。

BMC Genomics. 2025 Sep 2;26(1):795. doi: 10.1186/s12864-025-12022-w.

Convergent Molecular Evolution Associated With Repeated Transitions to Gregarious Larval Behavior in Heliconiini.与赫利孔亚族幼虫群居行为的反复转变相关的趋同分子进化。

Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf179.

Chromosome-level genome of Zoysia sinica in the intertidal zone reveals genomic insights into waterlogging stress adaptation.潮间带中华结缕草的染色体水平基因组揭示了对涝渍胁迫适应的基因组见解。

Plant Genome. 2025 Sep;18(3):e70070. doi: 10.1002/tpg2.70070.

Evolutionary sparse learning reveals the shared genetic basis of convergent traits.进化稀疏学习揭示了趋同性状的共同遗传基础。

Nat Commun. 2025 Apr 4;16(1):3217. doi: 10.1038/s41467-025-58428-8.

Comparative genomics uncovers evolutionary drivers of locust migratory adaptation.比较基因组学揭示了蝗虫迁徙适应性的进化驱动因素。

BMC Genomics. 2025 Feb 28;26(1):203. doi: 10.1186/s12864-025-11376-5.

Convergent evolution of noncoding elements associated with short tarsus length in birds.鸟类中与短跗骨长度相关的非编码元件的趋同进化。

BMC Biol. 2025 Feb 21;23(1):52. doi: 10.1186/s12915-025-02156-4.

Convergent Evolution and Predictability of Gene Copy Numbers Associated with Diets in Mammals.哺乳动物中与饮食相关的基因拷贝数的趋同进化与可预测性

Genome Biol Evol. 2025 Feb 3;17(2). doi: 10.1093/gbe/evaf008.

Evolutionary sparse learning with paired species contrast reveals the shared genetic basis of convergent traits.基于配对物种对比的进化稀疏学习揭示了趋同性状的共享遗传基础。

bioRxiv. 2025 Jan 8:2025.01.08.631987. doi: 10.1101/2025.01.08.631987.

The macroevolutionary dynamics of pharyngognathy in fishes fail to support the key innovation hypothesis.鱼类咽颅形态的宏观进化动态不能支持关键创新假说。

Nat Commun. 2024 Nov 28;15(1):10325. doi: 10.1038/s41467-024-53141-4.

本文引用的文献

The Earth BioGenome Project 2020: Starting the clock.地球生物基因组计划2020：开启计时。

Proc Natl Acad Sci U S A. 2022 Jan 25;119(4). doi: 10.1073/pnas.2115635118.

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库：用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。

Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.

Parallel adaptation in autopolyploid Arabidopsis arenosa is dominated by repeated recruitment of shared alleles.同源多倍体拟南芥（Arabidopsis arenosa）的平行适应主要由共享等位基因的重复招募决定。

Nat Commun. 2021 Aug 17;12(1):4979. doi: 10.1038/s41467-021-25256-5.

Rare variant contribution to human disease in 281,104 UK Biobank exomes.281104 名英国生物银行外显子组中罕见变异对人类疾病的贡献。

Nature. 2021 Sep;597(7877):527-532. doi: 10.1038/s41586-021-03855-y. Epub 2021 Aug 10.

Up to date on cholesterol 7 alpha-hydroxylase (CYP7A1) in bile acid synthesis.胆汁酸合成中胆固醇7α-羟化酶（CYP7A1）的最新进展。

Liver Res. 2020 Jun;4(2):47-63. doi: 10.1016/j.livres.2020.05.001. Epub 2020 Jun 3.

Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。

Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.

Array programming with NumPy.使用 NumPy 进行数组编程。

Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.

Amalgamated cross-species transcriptomes reveal organ-specific propensity in gene expression evolution.合并跨物种转录组揭示了基因表达进化中的器官特异性倾向。

Nat Commun. 2020 Sep 8;11(1):4459. doi: 10.1038/s41467-020-18090-8.

NCBI Taxonomy: a comprehensive update on curation, resources and tools.NCBI 分类学：在管理、资源和工具方面的全面更新。

Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa062.

GeneRax: A Tool for Species-Tree-Aware Maximum Likelihood-Based Gene Family Tree Inference under Gene Duplication, Transfer, and Loss.GeneRax：一种在基因复制、转移和丢失情况下基于最大似然法的物种树感知的基因家族树推断工具。

Mol Biol Evol. 2020 Sep 1;37(9):2763-2774. doi: 10.1093/molbev/msaa141.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用蛋白质趋同的纠错率检测宏观进化的基因型-表型关联。

Detecting macroevolutionary genotype-phenotype associations using error-corrected rates of protein convergence.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献