• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估多物种网络合并模型对多基因座数据的拟合度。

Assessing the fit of the multi-species network coalescent to multi-locus data.

机构信息

Department of Statistics, University of Wisconsin - Madison, Madison, WI 53706, USA.

Department of Botany, University of Wisconsin - Madison, Madison, WI 53706, USA.

出版信息

Bioinformatics. 2021 May 5;37(5):634-641. doi: 10.1093/bioinformatics/btaa863.

DOI:10.1093/bioinformatics/btaa863
PMID:33027508
Abstract

MOTIVATION

With growing genome-wide molecular datasets from next-generation sequencing, phylogenetic networks can be estimated using a variety of approaches. These phylogenetic networks include events like hybridization, gene flow or horizontal gene transfer explicitly. However, the most accurate network inference methods are computationally heavy. Methods that scale to larger datasets do not calculate a full likelihood, such that traditional likelihood-based tools for model selection are not applicable to decide how many past hybridization events best fit the data. We propose here a goodness-of-fit test to quantify the fit between data observed from genome-wide multi-locus data, and patterns expected under the multi-species coalescent model on a candidate phylogenetic network.

RESULTS

We identified weaknesses in the previously proposed TICR test, and proposed corrections. The performance of our new test was validated by simulations on real-world phylogenetic networks. Our test provides one of the first rigorous tools for model selection, to select the adequate network complexity for the data at hand. The test can also work for identifying poorly inferred areas on a network.

AVAILABILITY AND IMPLEMENTATION

Software for the goodness-of-fit test is available as a Julia package at https://github.com/cecileane/QuartetNetworkGoodnessFit.jl.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

随着下一代测序产生的越来越多的全基因组分子数据集,可以使用各种方法来估计系统发育网络。这些系统发育网络明确地包括杂交、基因流或水平基因转移等事件。然而,最准确的网络推断方法计算量很大。可扩展到更大数据集的方法并没有计算完整的似然度,因此传统的基于似然度的模型选择工具不适用于确定过去的杂交事件中有多少最适合数据。我们在这里提出了一种拟合优度检验,以量化全基因组多基因座数据中观察到的数据与候选系统发育网络上多物种合并模型下预期模式之间的吻合程度。

结果

我们发现了之前提出的 TICR 检验的弱点,并提出了修正方法。我们的新检验的性能通过对真实系统发育网络的模拟进行了验证。我们的检验提供了第一个严格的模型选择工具之一,用于为手头的数据选择适当的网络复杂度。该检验还可以用于识别网络上推断不佳的区域。

可用性和实现

拟合优度检验的软件作为 Julia 包可在 https://github.com/cecileane/QuartetNetworkGoodnessFit.jl 获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Assessing the fit of the multi-species network coalescent to multi-locus data.评估多物种网络合并模型对多基因座数据的拟合度。
Bioinformatics. 2021 May 5;37(5):634-641. doi: 10.1093/bioinformatics/btaa863.
2
PhyloCoalSimulations: A Simulator for Network Multispecies Coalescent Models, Including a New Extension for the Inheritance of Gene Flow.PhyloCoalSimulations:一种用于网络多物种合并模型的模拟器,包括一个新的用于遗传基因流的扩展。
Syst Biol. 2023 Nov 1;72(5):1171-1179. doi: 10.1093/sysbio/syad030.
3
Transfer index, NetUniFrac and some useful shortest path-based distances for community analysis in sequence similarity networks.转移指数、NetUniFrac 和一些基于最短路径的有用距离,用于序列相似性网络中的群落分析。
Bioinformatics. 2020 May 1;36(9):2740-2749. doi: 10.1093/bioinformatics/btaa043.
4
PhyloNetworks: A Package for Phylogenetic Networks.PhyloNetworks:一个用于系统发育网络的软件包。
Mol Biol Evol. 2017 Dec 1;34(12):3292-3298. doi: 10.1093/molbev/msx235.
5
NetRAX: accurate and fast maximum likelihood phylogenetic network inference.NetRAX:准确快速的最大似然系统发育网络推断。
Bioinformatics. 2022 Aug 2;38(15):3725-3733. doi: 10.1093/bioinformatics/btac396.
6
Inference of Phylogenetic Networks From Sequence Data Using Composite Likelihood.使用复合似然法从序列数据推断系统发育网络
Syst Biol. 2025 Feb 10;74(1):53-69. doi: 10.1093/sysbio/syae054.
7
MASCOT: parameter and state inference under the marginal structured coalescent approximation.最大相似度算法:边缘结构合并近似下的参数和状态推断。
Bioinformatics. 2018 Nov 15;34(22):3843-3848. doi: 10.1093/bioinformatics/bty406.
8
A divide-and-conquer method for scalable phylogenetic network inference from multilocus data.一种用于从多基因座数据推断可扩展系统发育网络的分而治之方法。
Bioinformatics. 2019 Jul 15;35(14):i370-i378. doi: 10.1093/bioinformatics/btz359.
9
A LASSO-based approach to sample sites for phylogenetic tree search.基于套索法的系统发育树搜索采样位点选择方法。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i118-i124. doi: 10.1093/bioinformatics/btac252.
10
syntenet: an R/Bioconductor package for the inference and analysis of synteny networks.syntenet:用于推断和分析同线性网络的 R/Bioconductor 包。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac806.

引用本文的文献

1
Reticulate allopolyploidy and subsequent dysploidy drive evolution and diversification in the cotton family.网状异源多倍体及随后的染色体数异常推动了棉属植物的进化与多样化。
Nat Commun. 2025 Aug 12;16(1):7480. doi: 10.1038/s41467-025-62644-7.
2
Phylogenetic networks empower biodiversity research.系统发育网络助力生物多样性研究。
Proc Natl Acad Sci U S A. 2025 Aug 5;122(31):e2410934122. doi: 10.1073/pnas.2410934122. Epub 2025 Jul 28.
3
Misspecification Strikes: ASTRAL can Mislead in the Presence of Hybridization, even for Nonanomalous Scenarios.
错误设定的影响:即使在非异常情况下,当存在杂交现象时,ASTRAL也可能产生误导。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf049.
4
Phylogenetics and biogeography of the olive family (Oleaceae).橄榄科的系统发育和生物地理学。
Ann Bot. 2024 Oct 30;134(4):577-592. doi: 10.1093/aob/mcae100.
5
Phylogenomic Discordance is Driven by Wide-Spread Introgression and Incomplete Lineage Sorting During Rapid Species Diversification Within Rattlesnakes (Viperidae: Crotalus and Sistrurus).系统基因组学的不协调性是由响尾蛇(蝰蛇科:Crotalus 和 Sistrurus)快速物种多样化过程中的广泛基因渗入和不完全谱系分类驱动的。
Syst Biol. 2024 Oct 25;73(4):722-741. doi: 10.1093/sysbio/syae018.
6
Hybridization and Transgressive Evolution Generate Diversity in an Adaptive Radiation of Anolis Lizards.杂种形成和越界进化在拟蜥蜴的适应性辐射中产生多样性。
Syst Biol. 2023 Aug 7;72(4):874-884. doi: 10.1093/sysbio/syad026.
7
Phylogenomic approaches to detecting and characterizing introgression.基于系统发生基因组学的渐渗检测与特征分析方法
Genetics. 2022 Feb 4;220(2). doi: 10.1093/genetics/iyab173.