Maximize Resolution or Minimize Error? Using Genotyping-By-Sequencing to Investigate the Recent Diversification of Helianthemum (Cistaceae).

Author information

Martín-Hernanz Sara, Aparicio Abelardo, Fernández-Mazuecos Mario, Rubio Encarnación, Reyes-Betancort J Alfredo, Santos-Guerra Arnoldo, Olangua-Corral María, Albaladejo Rafael G

Affiliations

Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain.

Departamento de Biodiversidad y Conservación, Real Jardín Botánico (RJB-CSIC), Madrid, Spain.

Publication information

Front Plant Sci. 2019 Nov 11;10:1416. doi: 10.3389/fpls.2019.01416. eCollection 2019.

Abstract

A robust phylogenetic framework, in terms of extensive geographical and taxonomic sampling, well-resolved species relationships and high certainty of tree topologies and branch length estimations, is critical in the study of macroevolutionary patterns. Whereas Sanger sequencing-based methods usually recover insufficient phylogenetic signal, especially in recently diversified lineages, reduced-representation sequencing methods tend to provide well-supported phylogenetic relationships, but usually entail remarkable bioinformatic challenges due to the inherent trade-off between the number of SNPs and the magnitude of associated error rates. The genus Helianthemum (Cistaceae) is a species-rich and taxonomically complex Palearctic group of plants that diversified mainly since the Upper Miocene. It is a challenging case study since previous attempts using Sanger sequencing were unable to resolve the intrageneric phylogenetic relationships. Aiming to obtain a robust phylogenetic reconstruction based on genotyping-by-sequencing (GBS), we established a rigorous methodological workflow in which we i) explored how variable settings during dataset assembly have an impact on error rates and on the degree of resolution under concatenation and coalescent approaches, ii) assessed the effect of two extreme parameter configurations (minimizing error rates vs. maximizing phylogenetic resolution) on tree topology and branch lengths, and iii) evaluated the effects of these two configurations on estimates of divergence times and diversification rates. Our analyses produced highly supported, topologically congruent phylogenetic trees for both configurations. However, minimizing error rates did produce more reliable branch lengths, critically affecting the accuracy of downstream analyses (i.e., divergence times and diversification rates). In addition to recommending a revision of intrageneric systematics, our results enabled us to identify three highly diversified lineages in Helianthemum in contrasting geographical areas and ecological conditions, which started radiating in the Upper Miocene.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57e2/6859804/463fd932dde5/fpls-10-01416-g001.jpg
