• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

缺失数据、不完整分类单元和系统发育准确性。

Missing data, incomplete taxa, and phylogenetic accuracy.

作者信息

Wiens John J

机构信息

Department of Ecology and Evolution, State University of New York, Stony Brook, New York 11794-5245, USA.

出版信息

Syst Biol. 2003 Aug;52(4):528-38. doi: 10.1080/10635150390218330.

DOI:10.1080/10635150390218330
PMID:12857643
Abstract

The problem of missing data is often considered to be the most important obstacle in reconstructing the phylogeny of fossil taxa and in combining data from diverse characters and taxa for phylogenetic analysis. Empirical and theoretical studies show that including highly incomplete taxa can lead to multiple equally parsimonious trees, poorly resolved consensus trees, and decreased phylogenetic accuracy. However, the mechanisms that cause incomplete taxa to be problematic have remained unclear. It has been widely assumed that incomplete taxa are problematic because of the proportion or amount of missing data that they bear. In this study, I use simulations to show that the reduced accuracy associated with including incomplete taxa is caused by these taxa bearing too few complete characters rather than too many missing data cells. This seemingly subtle distinction has a number of important implications. First, the so-called missing data problem for incomplete taxa is, paradoxically, not directly related to their amount or proportion of missing data. Thus, the level of completeness alone should not guide the exclusion of taxa (contrary to common practice), and these results may explain why empirical studies have sometimes found little relationship between the completeness of a taxon and its impact on an analysis. These results also (1) suggest a more effective strategy for dealing with incomplete taxa, (2) call into question a justification of the controversial phylogenetic supertree approach, and (3) show the potential for the accurate phylogenetic placement of highly incomplete taxa, both when combining diverse data sets and when analyzing relationships of fossil taxa.

摘要

数据缺失问题通常被认为是重建化石类群系统发育以及整合来自不同性状和类群的数据进行系统发育分析时最重要的障碍。实证研究和理论研究表明,纳入高度不完整的类群会导致出现多个同等简约的树、分辨率低的合意树,以及系统发育准确性的降低。然而,导致不完整类群产生问题的机制仍不明确。人们普遍认为,不完整类群存在问题是因为它们所具有的缺失数据的比例或数量。在本研究中,我通过模拟表明,纳入不完整类群导致的准确性降低是由于这些类群具有的完整性状太少,而非缺失数据单元格太多。这种看似细微的区别具有许多重要意义。首先,矛盾的是,不完整类群所谓的数据缺失问题与其缺失数据的数量或比例并无直接关系。因此,仅完整性水平不应指导类群的排除(与通常做法相反),这些结果或许可以解释为什么实证研究有时发现一个类群的完整性与其对分析的影响之间几乎没有关系。这些结果还(1)提出了一种处理不完整类群的更有效策略,(2)对有争议的系统发育超树方法的一种正当理由提出质疑,以及(3)显示了在整合不同数据集以及分析化石类群关系时,对高度不完整类群进行准确系统发育定位的潜力。

相似文献

1
Missing data, incomplete taxa, and phylogenetic accuracy.缺失数据、不完整分类单元和系统发育准确性。
Syst Biol. 2003 Aug;52(4):528-38. doi: 10.1080/10635150390218330.
2
The phylogenetic trunk: maximal inclusion of taxa with missing data in an analysis of the lepospondyli (Vertebrata, Tetrapoda).系统发育主干:在离片椎类(脊椎动物,四足动物)分析中对缺失数据的分类单元进行最大程度纳入。
Syst Biol. 2001 Apr;50(2):170-93. doi: 10.1080/10635150119889.
3
Phylogeny of extant and fossil Juglandaceae inferred from the integration of molecular and morphological data sets.基于分子和形态数据集整合推断的现存和化石胡桃科系统发育
Syst Biol. 2007 Jun;56(3):412-30. doi: 10.1080/10635150701408523.
4
Does adding characters with missing data increase or decrease phylogenetic accuracy?添加具有缺失数据的字符会提高还是降低系统发育准确性?
Syst Biol. 1998 Dec;47(4):625-40. doi: 10.1080/106351598260635.
5
The importance of even highly incomplete fossil taxa in reconstructing the phylogenetic relationships of the tetraodontiformes (acanthomorpha: pisces).重建四齿鲀形目(棘背鱼目:硬骨鱼)系统发育关系时,即使是高度不完全的化石分类群也很重要。
Integr Comp Biol. 2004 Nov;44(5):349-57. doi: 10.1093/icb/44.5.349.
6
Bias and sensitivity in the placement of fossil taxa resulting from interpretations of missing data.由于对缺失数据的解释而导致化石分类群定位中的偏差和敏感性。
Syst Biol. 2015 Mar;64(2):256-66. doi: 10.1093/sysbio/syu093. Epub 2014 Nov 27.
7
The use and validity of composite taxa in phylogenetic analysis.复合分类单元在系统发育分析中的使用和有效性。
Syst Biol. 2009 Dec;58(6):560-72. doi: 10.1093/sysbio/syp056. Epub 2009 Sep 21.
8
Highly incomplete taxa can rescue phylogenetic analyses from the negative impacts of limited taxon sampling.高度不完全分类单元可以从有限的分类单元采样的负面影响中拯救系统发育分析。
PLoS One. 2012;7(8):e42925. doi: 10.1371/journal.pone.0042925. Epub 2012 Aug 10.
9
Combining phylogenomics and fossils in higher-level squamate reptile phylogeny: molecular data change the placement of fossil taxa.系统发生基因组学与化石相结合在高级有鳞目爬行动物系统发育中的应用:分子数据改变了化石分类单元的位置。
Syst Biol. 2010 Dec;59(6):674-88. doi: 10.1093/sysbio/syq048. Epub 2010 Oct 7.
10
Can incomplete taxa rescue phylogenetic analyses from long-branch attraction?不完整的分类单元能否挽救系统发育分析于长枝吸引问题?
Syst Biol. 2005 Oct;54(5):731-42. doi: 10.1080/10635150500234583.

引用本文的文献

1
Plastome characterization and its phylogenetic implications on Lithocarpus (Fagaceae).柯属(壳斗科)的质体基因组特征及其系统发育意义
BMC Plant Biol. 2024 Dec 30;24(1):1277. doi: 10.1186/s12870-024-05874-z.
2
Data-driven guidelines for phylogenomic analyses using SNP data.使用单核苷酸多态性(SNP)数据进行系统发育基因组分析的数据驱动指南。
Appl Plant Sci. 2024 Aug 9;12(6):e11611. doi: 10.1002/aps3.11611. eCollection 2024 Nov-Dec.
3
An alignment-free method for detection of missing regions for phylogenetic analysis.一种用于系统发育分析中缺失区域检测的无比对方法。
Heliyon. 2024 Jun 4;10(11):e32227. doi: 10.1016/j.heliyon.2024.e32227. eCollection 2024 Jun 15.
4
A Guide to Phylogenomic Inference.系统发育基因组推断指南。
Methods Mol Biol. 2024;2802:267-345. doi: 10.1007/978-1-0716-3838-5_11.
5
Marine introgressions and Andean uplift have driven diversification in neotropical Monkey tree frogs (Anura, Phyllomedusinae).海洋入侵和安第斯山脉隆升推动了新热带地区树蛙(无尾目,叶泡蛙亚科)的物种多样化。
PeerJ. 2024 Apr 16;12:e17232. doi: 10.7717/peerj.17232. eCollection 2024.
6
In-depth Phylogenomic Analysis of Arbuscular Mycorrhizal Fungi Based on a Comprehensive Set of Genome Assemblies.基于一套全面的基因组组装对丛枝菌根真菌进行深入的系统基因组分析。
Front Fungal Biol. 2021 Sep 29;2:716385. doi: 10.3389/ffunb.2021.716385. eCollection 2021.
7
A taxonomic revision of , including three new species and its phylogenetic realignment with Ehretiaceae (Boraginales).对[具体内容未给出]的分类学修订,包括三个新物种及其与紫草科(紫草目)的系统发育重新排列。
PhytoKeys. 2023 Feb 20;219:145-170. doi: 10.3897/phytokeys.219.101779. eCollection 2023.
8
Redefining Possible: Combining Phylogenomic and Supersparse Data in Frogs.重新定义可能:结合系统基因组学和超级稀疏数据研究蛙类。
Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad109.
9
A plastid phylogenomic framework for the palm family (Arecaceae).一个用于棕榈科(Arecaceae)的质体系统发育基因组学框架。
BMC Biol. 2023 Mar 8;21(1):50. doi: 10.1186/s12915-023-01544-y.
10
A pipeline for assembling low copy nuclear markers from plant genome skimming data for phylogenetic use.用于组装植物基因组刮削数据中低拷贝核标记的流水线,以便进行系统发育分析。
PeerJ. 2022 Dec 6;10:e14525. doi: 10.7717/peerj.14525. eCollection 2022.