Didier Gilles, Fau Marine, Laurin Michel
Aix Marseille Univ, CNRS, Centrale Marseille, I2M, Marseille, France.
Département Histoire de la Terre CR2P, Sorbonne Universités, CNRS/MNHN/UPMC, Muséum National d'Histoire Naturelle, 75005 Paris, France.
Syst Biol. 2017 Nov 1;66(6):964-987. doi: 10.1093/sysbio/syx045.
Since the diversification process cannot be directly observed at the human scale, it has to be studied from the information available, namely the extant taxa and the fossil record. In this sense, phylogenetic trees including both extant taxa and fossils are the most complete representations of the diversification process that one can get. Such phylogenetic trees can be reconstructed from molecular and morphological data, to some extent. Among the temporal information of such phylogenetic trees, fossil ages are by far the most precisely known (divergence times are inferences calibrated mostly with fossils). We propose here a method to compute the likelihood of a phylogenetic tree with fossils in which the only considered time information is the fossil ages, and apply it to the estimation of the diversification rates from such data. Since it is required in our computation, we provide a method for determining the probability of a tree topology under the standard diversification model. Testing our approach on simulated data shows that the maximum likelihood rate estimates from the phylogenetic tree topology and the fossil dates are almost as accurate as those obtained by taking into account all the data, including the divergence times. Moreover, they are substantially more accurate than the estimates obtained only from the exact divergence times (without taking into account the fossil record). We also provide an empirical example composed of 50 Permo-Carboniferous eupelycosaur (early synapsid) taxa ranging in age from about 315 Ma (Late Carboniferous) to 270 Ma (shortly after the end of the Early Permian). Our analyses suggest a speciation (cladogenesis, or birth) rate of about 0.1 per lineage and per myr, a marginally lower extinction rate, and a considerable hidden paleobiodiversity of early synapsids. [Extinction rate; fossil ages; maximum likelihood estimation; speciation rate.].
由于多样化过程无法在人类尺度上直接观察到,因此必须从现有的信息,即现存分类群和化石记录来进行研究。从这个意义上说,包含现存分类群和化石的系统发育树是人们能够得到的多样化过程的最完整表述。这种系统发育树可以在一定程度上从分子和形态数据重建。在这种系统发育树的时间信息中,化石年龄是目前已知最精确的(分歧时间大多是用化石校准的推断)。我们在此提出一种方法来计算包含化石的系统发育树的似然性,其中唯一考虑的时间信息是化石年龄,并将其应用于从这些数据估计多样化速率。由于在我们的计算中需要,我们提供了一种在标准多样化模型下确定树拓扑结构概率的方法。在模拟数据上测试我们的方法表明,从系统发育树拓扑结构和化石日期得到的最大似然率估计几乎与考虑所有数据(包括分歧时间)得到的估计一样准确。此外,它们比仅从精确分歧时间(不考虑化石记录)得到的估计要准确得多。我们还提供了一个实证例子,由50个二叠纪 - 石炭纪真盘龙目(早期合弓纲)分类群组成,年龄范围从约3.15亿年(晚石炭世)到2.7亿年(早二叠世末期后不久)。我们的分析表明,每个谱系和每百万年的物种形成(分支发生,或诞生)速率约为0.1,灭绝速率略低,早期合弓纲存在相当数量的隐藏古生物多样性。[灭绝速率;化石年龄;最大似然估计;物种形成速率。]