Suppr超能文献

基因树分歧、单形图和合并模型下的统计检验。

Gene Tree Discord, Simplex Plots, and Statistical Tests under the Coalescent.

机构信息

Department of Mathematics and Statistics, University of Alaska Fairbanks, Fairbanks, AK 99709, USA.

Unité Bioinformatique Evolutive, C3BI USR 3756, Institut Pasteur & CNRS, Paris, France.

出版信息

Syst Biol. 2022 Jun 16;71(4):929-942. doi: 10.1093/sysbio/syab008.

Abstract

A simple graphical device, the simplex plot of quartet concordance factors, is introduced to aid in the exploration of a collection of gene trees on a common set of taxa. A single plot summarizes all gene tree discord and allows for visual comparison to the expected discord from the multispecies coalescent model (MSC) of incomplete lineage sorting on a species tree. A formal statistical procedure is described that can quantify the deviation from expectation for each subset of four taxa, suggesting when the data are not in accord with the MSC, and thus that either gene tree inference error is substantial or a more complex model such as that on a network may be required. If the collection of gene trees is in accord with the MSC, the plots reveal when substantial incomplete lineage sorting is present. Applications to both simulated and empirical multilocus data sets illustrate the insights provided. [Gene tree discordance; hypothesis test; multispecies coalescent model; quartet concordance factor; simplex plot; species tree].

摘要

引入了一种简单的图形设备,即四分体一致因子的单纯形图,以帮助探索一组常见分类单元上的基因树。单个图总结了所有基因树分歧,并允许与物种树上不完全谱系分选的多物种合并模型 (MSC) 的预期分歧进行直观比较。描述了一种正式的统计程序,可以量化每个四个分类单元子集的偏离预期的程度,这表明数据与 MSC 不一致,因此要么基因树推断错误很大,要么需要更复杂的模型,例如网络上的模型。如果基因树的集合与 MSC 一致,则这些图揭示了存在大量不完全谱系分选的情况。对模拟和经验多基因数据集的应用说明了所提供的见解。[基因树分歧;假设检验;多物种合并模型;四分体一致因子;单纯形图;种系树]。

相似文献

引用本文的文献

1
NANUQ: A divide-and-conquer approach to network estimation.NANUQ:一种用于网络估计的分治法。
Algorithms Mol Biol. 2025 Jul 25;20(1):14. doi: 10.1186/s13015-025-00274-w.

本文引用的文献

1
Hypothesis testing near singularities and boundaries.奇点和边界附近的假设检验。
Electron J Stat. 2019;13(1):2150-2193. doi: 10.1214/19-ejs1576. Epub 2019 Jun 28.
5
Topological Metrizations of Trees, and New Quartet Methods of Tree Inference.树的拓扑度量及其新的四重树推断方法。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2107-2118. doi: 10.1109/TCBB.2019.2917204. Epub 2020 Dec 8.
9
DiscoVista: Interpretable visualizations of gene tree discordance.DiscoVista:基因树分歧的可解释可视化
Mol Phylogenet Evol. 2018 May;122:110-115. doi: 10.1016/j.ympev.2018.01.019. Epub 2018 Feb 5.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验