• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

复合分类单元在系统发育分析中的使用和有效性。

The use and validity of composite taxa in phylogenetic analysis.

机构信息

Département de sciences biologiques, Université de Montréal, C.P. 6128, Succ. Centre-ville, Montréal, Québec H3C 3J7, Canada.

出版信息

Syst Biol. 2009 Dec;58(6):560-72. doi: 10.1093/sysbio/syp056. Epub 2009 Sep 21.

DOI:10.1093/sysbio/syp056
PMID:20525609
Abstract

In phylogenetic analysis, one possible approach to minimize missing data in DNA supermatrices consists in sampling sequences from different species to obtain a complete sequence for all genes included in the study. We refer to those complete sequences as composite taxa because DNA sequences that are combined belong to different species. An alternative approach is to analyze incomplete supermatrices by coding unavailable DNA sequences as missing. The accuracy of phylogenetic trees estimated using matrices that include composite taxa has recently been questioned, and the best approach for analyzing incomplete supermatrices is highly debated. Through computer simulations, we compared the phylogenetic accuracy of the 2 competing approaches. We explored the effect of composite taxa when inferring higher level relationships, that is, relationships between monophyletic groups. DNA sequences were simulated on a 42-taxon model tree and incomplete supermatrices containing different percentages of missing data were generated. These incomplete supermatrices were analyzed either by coding the missing data with "?" or by reducing the amount of missing data through the combination of 2 or more taxa to generate composite taxa. Of 180 comparisons (18 simulation cases with 2 different inference methods and 5 levels of incompleteness), we observed significantly higher phylogenetic accuracies for composite matrices in 46 comparisons, whereas missing data matrices outperformed composites in 8 comparisons. In all other cases, the phylogenetic accuracy obtained with composite matrices was not significantly different from that of missing data matrices. This study demonstrates that composite taxa represent an interesting approach to minimize the amount of missing data in supermatrices and we suggest that it is the optimal approach to use in phylogenomic studies to reduce computing time.

摘要

在系统发育分析中,一种最小化 DNA 超矩阵中缺失数据的可能方法是从不同物种中采样序列,以获得研究中包含的所有基因的完整序列。我们将这些完整序列称为复合分类单元,因为组合在一起的 DNA 序列属于不同的物种。另一种方法是通过将不可用的 DNA 序列编码为缺失来分析不完整的超矩阵。最近,人们对使用包含复合分类单元的矩阵估计系统发育树的准确性提出了质疑,并且分析不完整超矩阵的最佳方法存在很大争议。通过计算机模拟,我们比较了两种竞争方法的系统发育准确性。我们探讨了在推断更高水平的关系(即单系群之间的关系)时复合分类单元的影响。在 42 分类单元模型树上模拟了 DNA 序列,并生成了包含不同缺失数据百分比的不完整超矩阵。这些不完整的超矩阵要么通过用“?”编码缺失数据进行分析,要么通过将两个或更多分类单元组合起来生成复合分类单元来减少缺失数据的数量。在 180 次比较(18 个模拟案例,使用 2 种不同的推断方法和 5 个不完整级别)中,我们观察到在 46 次比较中,复合矩阵的系统发育准确性明显更高,而在 8 次比较中,缺失数据矩阵的表现优于复合矩阵。在所有其他情况下,使用复合矩阵获得的系统发育准确性与缺失数据矩阵没有显著差异。这项研究表明,复合分类单元是一种减少超矩阵中缺失数据量的有趣方法,我们建议在基因组学研究中使用它来减少计算时间。

相似文献

1
The use and validity of composite taxa in phylogenetic analysis.复合分类单元在系统发育分析中的使用和有效性。
Syst Biol. 2009 Dec;58(6):560-72. doi: 10.1093/sysbio/syp056. Epub 2009 Sep 21.
2
Retrieving a mitogenomic mammal tree using composite taxa.利用复合分类单元构建线粒体基因组哺乳动物系统发育树
Mol Phylogenet Evol. 2011 Feb;58(2):149-56. doi: 10.1016/j.ympev.2010.11.017. Epub 2010 Dec 4.
3
From phylogenetics to phylogenomics: the evolutionary relationships of insect endosymbiotic gamma-Proteobacteria as a test case.从系统发育学到系统基因组学:以昆虫内共生γ-变形菌的进化关系为例
Syst Biol. 2007 Feb;56(1):1-16. doi: 10.1080/10635150601109759.
4
Phylogeny of extant and fossil Juglandaceae inferred from the integration of molecular and morphological data sets.基于分子和形态数据集整合推断的现存和化石胡桃科系统发育
Syst Biol. 2007 Jun;56(3):412-30. doi: 10.1080/10635150701408523.
5
Impact of missing data on phylogenies inferred from empirical phylogenomic data sets.缺失数据对从经验系统发育基因组数据集推断的系统发育的影响。
Mol Biol Evol. 2013 Jan;30(1):197-214. doi: 10.1093/molbev/mss208. Epub 2012 Aug 28.
6
Towards building the tree of life: a simulation study for all angiosperm genera.迈向构建生命之树:对所有被子植物属的模拟研究
Syst Biol. 2005 Apr;54(2):183-96. doi: 10.1080/10635150590923254.
7
Sparse supermatrices for phylogenetic inference: taxonomy, alignment, rogue taxa, and the phylogeny of living turtles.用于系统发育推断的稀疏超级矩阵:分类学、比对、异常分类单元和活海龟的系统发育。
Syst Biol. 2010 Jan;59(1):42-58. doi: 10.1093/sysbio/syp075. Epub 2009 Nov 11.
8
Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design.最大似然估计物种树:系统发育推断的准确性如何取决于分歧历史和采样设计。
Syst Biol. 2009 Oct;58(5):501-8. doi: 10.1093/sysbio/syp045. Epub 2009 Aug 20.
9
The phylogeny of Cetartiodactyla: the importance of dense taxon sampling, missing data, and the remarkable promise of cytochrome b to provide reliable species-level phylogenies.鲸偶蹄目的系统发育:密集分类群抽样、缺失数据的重要性,以及细胞色素b在提供可靠的物种水平系统发育方面的显著前景。
Mol Phylogenet Evol. 2008 Sep;48(3):964-85. doi: 10.1016/j.ympev.2008.05.046. Epub 2008 Jun 12.
10
Is homoplasy or lineage sorting the source of incongruent mtdna and nuclear gene trees in the stiff-tailed ducks (Nomonyx-Oxyura)?在硬尾鸭(Nomonyx - Oxyura)中,同塑性或谱系分选是线粒体DNA和核基因树不一致的根源吗?
Syst Biol. 2005 Feb;54(1):35-55. doi: 10.1080/10635150590910249.

引用本文的文献

1
Phylogenomics supports a single origin of terrestriality in isopods.系统发生基因组学支持等足目动物在陆地上的单一起源。
Proc Biol Sci. 2024 Oct;291(2033):20241042. doi: 10.1098/rspb.2024.1042. Epub 2024 Oct 30.
2
Diversification of flowering plants in space and time.开花植物在时间和空间上的多样化。
Nat Commun. 2023 Nov 22;14(1):7609. doi: 10.1038/s41467-023-43396-8.
3
DrosoPhyla: Resources for Drosophilid Phylogeny and Systematics.双翅目:果蝇系统发育和分类学资源。
Genome Biol Evol. 2021 Aug 3;13(8). doi: 10.1093/gbe/evab179.
4
What Is ? Evidence from nuclear and mitochondrial DNA sequences suggests the need for a new classification.是什么?来自核DNA和线粒体DNA序列的证据表明需要一种新的分类方法。
J Mammal. 2015 Aug 3;96(4):708-719. doi: 10.1093/jmammal/gyv067. Epub 2015 May 13.
5
Resolving the evolutionary history of Campanula (Campanulaceae) in western North America.解析北美西部风铃草属(桔梗科)的演化历史。
PLoS One. 2011;6(9):e23559. doi: 10.1371/journal.pone.0023559. Epub 2011 Sep 9.
6
Phylogenomics with incomplete taxon coverage: the limits to inference.不完全分类群覆盖的系统基因组学:推断的局限性。
BMC Evol Biol. 2010 May 25;10:155. doi: 10.1186/1471-2148-10-155.
7
Molecular systematics: A synthesis of the common methods and the state of knowledge.分子系统学:通用方法与知识现状的综合。
Cell Mol Biol Lett. 2010 Jun;15(2):311-41. doi: 10.2478/s11658-010-0010-8. Epub 2010 Mar 5.