• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组规模数据集时代的系统发育推断和实验设计的最佳速率。

Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets.

机构信息

North Carolina Museum of Natural Sciences, Raleigh, 1671 Goldstar Drive, NC 27601, USA.

Department of Ecology and Evolutionary Biology, Yale University, New Haven, 165 Prospect Street, CT 06525, USA.

出版信息

Syst Biol. 2019 Jan 1;68(1):145-156. doi: 10.1093/sysbio/syy047.

DOI:10.1093/sysbio/syy047
PMID:29939341
Abstract

With the rise of genome-scale data sets, there has been a call for increased data scrutiny and careful selection of loci that are appropriate to use in an attempt to resolve a phylogenetic problem. Such loci should maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve at an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Herein, we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters-not genes-are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogenetic experimental design. [Divergence time; GC bias; homoplasy; incongruence; information content; internode length; optimal rates; phylogenetic informativeness; phylogenetic theory; phylogenetic utility; phylogenomics; signal and noise; subtending branch length; state space; taxon and character sampling.].

摘要

随着基因组规模数据集的兴起,人们呼吁加强数据审查,并仔细选择适合解决系统发育问题的基因座。这些基因座应该最大限度地提高系统发育信息量,同时最大限度地降低同型性的风险。理论假设存在以最佳速率进化的特征,并且确定最佳推断速率的努力一直是系统发育实验设计的基石已有二十多年。然而,对最佳速率的理论和经验研究的结论差异很大:从没有关系到变化率和系统发育效用之间的紧密关系。在此,我们综合了这些看似矛盾的观点,证明了在每种情况下都是正确的经验和理论条件。我们发现,字符(而非基因)的最佳速率通常对大多数实验设计决策具有鲁棒性。此外,在给定基因座内考虑位点速率异质性对于准确预测效用至关重要。分类群采样或支持拓扑结构的目标字符数量等因素对于基于字符变化率的系统发育效用预测也至关重要。此外,速率的最优性和系统发育效用的预测并不等效,这表明需要进一步发展全面的系统发育实验设计理论。

相似文献

1
Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets.基因组规模数据集时代的系统发育推断和实验设计的最佳速率。
Syst Biol. 2019 Jan 1;68(1):145-156. doi: 10.1093/sysbio/syy047.
2
More on the Best Evolutionary Rate for Phylogenetic Analysis.关于系统发育分析的最佳进化速率的更多内容。
Syst Biol. 2017 Sep 1;66(5):769-785. doi: 10.1093/sysbio/syx051.
3
Optimal selection of gene and ingroup taxon sampling for resolving phylogenetic relationships.最佳基因选择和内群分类单元抽样,用于解决系统发育关系。
Syst Biol. 2010 Jul;59(4):446-57. doi: 10.1093/sysbio/syq025. Epub 2010 May 19.
4
Profiling phylogenetic informativeness.分析系统发育信息性。
Syst Biol. 2007 Apr;56(2):222-31. doi: 10.1080/10635150701311362.
5
Is homoplasy or lineage sorting the source of incongruent mtdna and nuclear gene trees in the stiff-tailed ducks (Nomonyx-Oxyura)?在硬尾鸭(Nomonyx - Oxyura)中,同塑性或谱系分选是线粒体DNA和核基因树不一致的根源吗?
Syst Biol. 2005 Feb;54(1):35-55. doi: 10.1080/10635150590910249.
6
Utility of characters evolving at diverse rates of evolution to resolve quartet trees with unequal branch lengths: analytical predictions of long-branch effects.以不同进化速率演变的性状在解析具有不等分支长度的四重树时的效用:长分支效应的分析预测
BMC Evol Biol. 2015 May 14;15:86. doi: 10.1186/s12862-015-0364-7.
7
Relative character-state space, amount of potential phylogenetic information, and heterogeneity of nucleotide and amino acid characters.相对性状状态空间、潜在系统发育信息的量以及核苷酸和氨基酸性状的异质性。
Mol Phylogenet Evol. 2004 Sep;32(3):913-26. doi: 10.1016/j.ympev.2004.04.011.
8
How Should Genes and Taxa be Sampled for Phylogenomic Analyses with Missing Data? An Empirical Study in Iguanian Lizards.基因和分类单元应该如何采样以进行含缺失数据的系统基因组分析?鬣蜥类蜥蜴的实证研究。
Syst Biol. 2016 Jan;65(1):128-45. doi: 10.1093/sysbio/syv058. Epub 2015 Sep 1.
9
Why Do Phylogenomic Data Sets Yield Conflicting Trees? Data Type Influences the Avian Tree of Life more than Taxon Sampling.为什么系统发育基因组数据集会产生相互冲突的树?数据类型对鸟类生命树的影响大于分类群抽样。
Syst Biol. 2017 Sep 1;66(5):857-879. doi: 10.1093/sysbio/syx041.
10
The information content of a character under a Markov model of evolution.进化的马尔可夫模型下一个字符的信息内容。
Mol Phylogenet Evol. 2000 Nov;17(2):231-43. doi: 10.1006/mpev.2000.0846.

引用本文的文献

1
When the Past Fades: Detecting Phylogenetic Signal with SatuTe.当过去消逝:使用SatuTe检测系统发育信号。
Mol Biol Evol. 2025 Apr 30;42(5). doi: 10.1093/molbev/msaf090.
2
Short branch attraction in phylogenomic inference under the multispecies coalescent.多物种溯祖模型下系统发育基因组学推断中的短枝吸引问题
Front Ecol Evol. 2023;11. doi: 10.3389/fevo.2023.1134764. Epub 2023 Jun 28.
3
ClockstaRX: Testing Molecular Clock Hypotheses With Genomic Data.ClockstaRX:利用基因组数据检验分子钟假说。
Genome Biol Evol. 2024 Apr 2;16(4). doi: 10.1093/gbe/evae064.
4
Identifying Impacts of Contact Tracing on Epidemiological Inference from Phylogenetic Data.确定接触者追踪对基于系统发育数据的流行病学推断的影响。
bioRxiv. 2024 Sep 6:2023.11.30.567148. doi: 10.1101/2023.11.30.567148.
5
Lineage-specific genes are clustered with HET-domain genes and respond to environmental and genetic manipulations regulating reproduction in Neurospora.谱系特异性基因与 HET 结构域基因聚类,并对环境和遗传操作做出响应,从而调节 Neurospora 中的生殖。
PLoS Genet. 2023 Nov 7;19(11):e1011019. doi: 10.1371/journal.pgen.1011019. eCollection 2023 Nov.
6
Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea.困惑将是我的墓志铭:基因组尺度的不一致性抑制了海参纲的系统发育分辨率。
Proc Biol Sci. 2023 Jul 12;290(2002):20230988. doi: 10.1098/rspb.2023.0988.
7
Filtration of Gene Trees From 9,000 Exons, Introns, and UCEs Disentangles Conflicting Phylogenomic Relationships in Tree Frogs (Hylidae).从 9000 个外显子、内含子和 UCEs 中过滤基因树,厘清树蛙(树蛙科)中冲突的系统发育关系。
Genome Biol Evol. 2023 May 5;15(5). doi: 10.1093/gbe/evad070.
8
Placing human gene families into their evolutionary context.将人类基因家族置于其进化背景之中。
Hum Genomics. 2022 Nov 11;16(1):56. doi: 10.1186/s40246-022-00429-5.
9
A Practical Guide to Design and Assess a Phylogenomic Study.《系统发育基因组学研究设计与评估实用指南》
Genome Biol Evol. 2022 Sep 6;14(9). doi: 10.1093/gbe/evac129.
10
Consequences of Substitution Model Selection on Protein Ancestral Sequence Reconstruction.替代模型选择对蛋白质祖先序列重建的影响。
Mol Biol Evol. 2022 Jul 2;39(7). doi: 10.1093/molbev/msac144.