Suppr超能文献

字符编码在贝叶斯形态系统发生学中的基础作用。

The Fundamental Role of Character Coding in Bayesian Morphological Phylogenetics.

机构信息

Department of Biological Sciences, Southeastern Louisiana University, Hammond, LA 70401, USA.

GeoBio-Center, Ludwig-Maximilians-Universität München, 80333 Munich, Germany.

出版信息

Syst Biol. 2024 Oct 30;73(5):861-871. doi: 10.1093/sysbio/syae033.

Abstract

Phylogenetic trees establish a historical context for the study of organismal form and function. Most phylogenetic trees are estimated using a model of evolution. For molecular data, modeling evolution is often based on biochemical observations about changes between character states. For example, there are 4 nucleotides, and we can make assumptions about the probability of transitions between them. By contrast, for morphological characters, we may not know a priori how many characters states there are per character, as both extant sampling and the fossil record may be highly incomplete, which leads to an observer bias. For a given character, the state space may be larger than what has been observed in the sample of taxa collected by the researcher. In this case, how many evolutionary rates are needed to even describe transitions between morphological character states may not be clear, potentially leading to model misspecification. To explore the impact of this model misspecification, we simulated character data with varying numbers of character states per character. We then used the data to estimate phylogenetic trees using models of evolution with the correct number of character states and an incorrect number of character states. The results of this study indicate that this observer bias may lead to phylogenetic error, particularly in the branch lengths of trees. If the state space is wrongly assumed to be too large, then we underestimate the branch lengths, and the opposite occurs when the state space is wrongly assumed to be too small.

摘要

系统发育树为研究生物形态和功能提供了历史背景。大多数系统发育树是使用进化模型来估计的。对于分子数据,进化建模通常基于对字符状态之间变化的生化观察。例如,有 4 个核苷酸,我们可以假设它们之间转换的概率。相比之下,对于形态特征,我们可能不知道每个特征有多少特征状态,因为现生物种采样和化石记录可能高度不完整,这导致了观察者偏见。对于给定的特征,状态空间可能大于研究人员收集的分类群样本中观察到的状态空间。在这种情况下,甚至描述形态特征状态之间的转换需要多少个进化率可能不清楚,这可能导致模型指定不当。为了探索这种模型指定不当的影响,我们模拟了具有不同特征状态数量的特征数据。然后,我们使用这些数据使用具有正确字符状态数量和不正确字符状态数量的进化模型来估计系统发育树。这项研究的结果表明,这种观察者偏见可能导致系统发育错误,特别是在树的分支长度上。如果错误地假设状态空间过大,则会低估分支长度,而当错误地假设状态空间过小时,则会出现相反的情况。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验