Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs.

Author information

Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ.

Department of Herpetology, California Academy of Sciences, USA.

Publication information

Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad109.

DOI:10.1093/molbev/msad109

PMID:37140129

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10202597/

Abstract

The data available for reconstructing molecular phylogenies have become wildly disparate. Phylogenomic studies can generate data for thousands of genetic markers for dozens of species, but for hundreds of other taxa, data may be available from only a few genes. Can these two types of data be integrated to combine the advantages of both, addressing the relationships of hundreds of species with thousands of genes? Here, we show that this is possible, using data from frogs. We generated a phylogenomic data set for 138 ingroup species and 3,784 nuclear markers (ultraconserved elements [UCEs]), including new UCE data from 70 species. We also assembled a supermatrix data set, including data from 97% of frog genera (441 total), with 1-307 genes per taxon. We then produced a combined phylogenomic-supermatrix data set (a "gigamatrix") containing 441 ingroup taxa and 4,091 markers but with 86% missing data overall. Likelihood analysis of the gigamatrix yielded a generally well-supported tree among families, largely consistent with trees from the phylogenomic data alone. All terminal taxa were placed in the expected families, even though 42.5% of these taxa each had >99.5% missing data and 70.2% had >90% missing data. Our results show that missing data need not be an impediment to successfully combining very large phylogenomic and supermatrix data sets, and they open the door to new studies that simultaneously maximize sampling of genes and taxa.
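To make the missing-data figures in the abstract concrete, here is a minimal sketch of how merging a phylogenomic data set with a sparse supermatrix produces a combined matrix in which some taxa have nearly all markers and others have very few. It is an illustration only, not the authors' pipeline: the function names, taxon names, and marker names are hypothetical, and missing data are counted per marker (presence or absence), whereas a real analysis would more likely count aligned sites.

```python
from typing import Dict, Set

# Hypothetical representation: each data set maps a taxon name to the set of
# markers (UCEs or genes) for which that taxon has sequence data.
MarkerSets = Dict[str, Set[str]]


def combine(phylogenomic: MarkerSets, supermatrix: MarkerSets) -> MarkerSets:
    """Merge two data sets, taking the union of markers for each taxon."""
    combined: MarkerSets = {}
    for dataset in (phylogenomic, supermatrix):
        for taxon, markers in dataset.items():
            combined.setdefault(taxon, set()).update(markers)
    return combined


def all_markers(combined: MarkerSets) -> Set[str]:
    """Return every marker sampled in at least one taxon."""
    return set().union(*combined.values())


def missing_per_taxon(combined: MarkerSets) -> Dict[str, float]:
    """Fraction of markers absent in each taxon (marker-level, not site-level)."""
    total = len(all_markers(combined))
    return {taxon: 1 - len(markers) / total for taxon, markers in combined.items()}


def missing_overall(combined: MarkerSets) -> float:
    """Fraction of empty taxon-by-marker cells in the combined matrix."""
    cells = len(combined) * len(all_markers(combined))
    filled = sum(len(markers) for markers in combined.values())
    return 1 - filled / cells


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    phylogenomic = {
        "taxon_A": {"uce1", "uce2", "uce3", "uce4"},
        "taxon_B": {"uce1", "uce2", "uce3"},
    }
    supermatrix = {
        "taxon_A": {"gene1"},
        "taxon_C": {"gene1"},  # sampled for a single gene only
    }
    merged = combine(phylogenomic, supermatrix)
    for taxon, fraction in sorted(missing_per_taxon(merged).items()):
        print(f"{taxon}: {fraction:.0%} of markers missing")
    print(f"overall: {missing_overall(merged):.0%} of cells missing")
```

In this toy case the taxon sampled for one gene is missing most markers, while the matrix as a whole is far less sparse. This mirrors, at a much smaller scale, how the gigamatrix can have 86% missing data overall while many individual taxa exceed 99.5%.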

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/561d/10202597/e764c49138bd/msad109f1.jpg

Similar articles

Frog phylogeny: A time-calibrated, species-level tree based on hundreds of loci and 5,242 species.

Mol Phylogenet Evol. 2023 Nov;188:107907. doi: 10.1016/j.ympev.2023.107907. Epub 2023 Aug 25.

How Should Genes and Taxa be Sampled for Phylogenomic Analyses with Missing Data? An Empirical Study in Iguanian Lizards.

Syst Biol. 2016 Jan;65(1):128-45. doi: 10.1093/sysbio/syv058. Epub 2015 Sep 1.

A large-scale phylogeny of Amphibia including over 2800 species, and a revised classification of extant frogs, salamanders, and caecilians.

Mol Phylogenet Evol. 2011 Nov;61(2):543-83. doi: 10.1016/j.ympev.2011.06.012. Epub 2011 Jun 23.

A phylogenomic supermatrix of Galliformes (Landfowl) reveals biased branch lengths.

Mol Phylogenet Evol. 2021 May;158:107091. doi: 10.1016/j.ympev.2021.107091. Epub 2021 Feb 2.

Combining phylogenomic and supermatrix approaches, and a time-calibrated phylogeny for squamate reptiles (lizards and snakes) based on 52 genes and 4162 species.

Mol Phylogenet Evol. 2016 Jan;94(Pt B):537-547. doi: 10.1016/j.ympev.2015.10.009. Epub 2015 Oct 22.

Insight from an ultraconserved element bait set designed for hemipteran phylogenetics integrated with genomic resources.

Mol Phylogenet Evol. 2019 Jan;130:297-303. doi: 10.1016/j.ympev.2018.10.026. Epub 2018 Oct 22.

Evaluating methods for phylogenomic analyses, and a new phylogeny for a major frog clade (Hyloidea) based on 2214 loci.

Mol Phylogenet Evol. 2018 Feb;119:128-143. doi: 10.1016/j.ympev.2017.10.013. Epub 2017 Oct 27.

Analysis of a Rapid Evolutionary Radiation Using Ultraconserved Elements: Evidence for a Bias in Some Multispecies Coalescent Methods.

Syst Biol. 2016 Jul;65(4):612-27. doi: 10.1093/sysbio/syw014. Epub 2016 Feb 10.

Larger, unfiltered datasets are more effective at resolving phylogenetic conflict: Introns, exons, and UCEs resolve ambiguities in Golden-backed frogs (Anura: Ranidae; genus Hylarana).

Mol Phylogenet Evol. 2020 Oct;151:106899. doi: 10.1016/j.ympev.2020.106899. Epub 2020 Jun 24.

Cited by

High speciation in the cryptic clade (Anura: Strabomantidae) of the Mira river basin, Ecuador-Colombia.

PeerJ. 2025 Jan 29;13:e18680. doi: 10.7717/peerj.18680. eCollection 2025.

The Amphibian Genomics Consortium: advancing genomic and genetic resources for amphibian research and conservation.

BMC Genomics. 2024 Nov 1;25(1):1025. doi: 10.1186/s12864-024-10899-7.

Global shortfalls of knowledge on anuran tadpoles.

NPJ Biodivers. 2023 Oct 30;2(1):22. doi: 10.1038/s44185-023-00027-1.

The Amphibian Genomics Consortium: advancing genomic and genetic resources for amphibian research and conservation.

bioRxiv. 2024 Oct 3:2024.06.27.601086. doi: 10.1101/2024.06.27.601086.

Marine introgressions and Andean uplift have driven diversification in neotropical Monkey tree frogs (Anura, Phyllomedusinae).

PeerJ. 2024 Apr 16;12:e17232. doi: 10.7717/peerj.17232. eCollection 2024.

Practical guidelines for Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC).

Open Res Eur. 2024 Aug 5;3:204. doi: 10.12688/openreseurope.16679.1. eCollection 2023.

Filtration of Gene Trees From 9,000 Exons, Introns, and UCEs Disentangles Conflicting Phylogenomic Relationships in Tree Frogs (Hylidae).

Genome Biol Evol. 2023 May 5;15(5). doi: 10.1093/gbe/evad070.

