
Phylogenomics and biogeography of Castanea (chestnut) and Hamamelis (witch-hazel) - Choosing between RAD-seq and Hyb-Seq approaches.

Authors

Zhou Wenbin, Xiang Qiu-Yun Jenny

Affiliations

Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27965, USA.

Publication information

Mol Phylogenet Evol. 2022 Nov;176:107592. doi: 10.1016/j.ympev.2022.107592. Epub 2022 Aug 1.

Abstract

Hyb-Seq and RAD-seq are well-established high-throughput sequencing technologies that have been increasingly used for plant phylogenomic studies. Each method has its own pros and cons. The choice between them is a practical issue for plant systematists studying the evolutionary histories of biodiversity of relatively recent origin. However, few studies have compared the congruence and conflict between results from the two methods within the same group of plants. In this study, we employed RAD-seq and Hyb-Seq of Angiosperms353 genes in phylogenomic and biogeographic studies of Hamamelis (the witch-hazels) and Castanea (chestnuts), two classic examples exhibiting the well-known eastern Asian (EA)-eastern North American (ENA) disjunct distribution, and compared them side by side. Our results showed congruence in phylogenetic inference and divergence time dating between the two data sets obtained through our customized procedures of library preparation and sequence trimming, although they differed in the number of loci and informative sites, the amount of missing data, and sampling within species. We provide recommendations regarding the selection of the two methods for phylogenomic study at the generic level based on fund availability and sampling scale. If funds and time are not constrained, we recommend Hyb-Seq. If funds and time are somewhat limited and sampling is large, we recommend RAD-seq. However, we found greater conflict among gene trees from the RAD-seq data due to the short sequences per locus. Therefore, species tree building and network detection with RAD-seq data consisting of short loci (e.g., <150 bp) should avoid analytical methods that rely on gene trees of individual loci and should instead use site-based methods such as SVDQuartets and the D-statistic method. Our phylogenetic analyses of RAD-seq and Hyb-Seq data resulted in well-resolved species relationships.
Analyses of the data using the D-statistic test and PhyloNet revealed ancient introgressions in both genera. Biogeographic analyses including fossil data, using a total evidence-based dated tree and a DEC model applying specific inter-area dispersal probabilities, revealed a complicated history for each genus, indicating multiple inter-area dispersals and local extinctions within and outside the taxa's modern ranges in both the Paleogene and Neogene. The study demonstrates the importance of including fossil taxa for a more accurate reconstruction of biogeographic histories of taxa to understand the EA and ENA floristic disjunction. Our results support a widespread ancestral range in EA-western North America (WNA) followed by early diversification in EA and expansion to North America (NA) and Europe for Castanea and a more widespread ancestral range in EA-ENA-WNA for Hamamelis. The origins of the modern EA-ENA disjunction in both genera were suggested to be the result of vicariance from widespread ancestors in Eurasia-ENA of the mid-Miocene and in EA-NA of the late Oligocene, respectively.
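
For readers unfamiliar with the site-based D-statistic named in the abstract, the following is a minimal sketch of how the statistic is formed from site-pattern counts. It is not the authors' pipeline: the function name `d_statistic` and the toy sequences are hypothetical, and the sketch assumes one haploid sequence per taxon on a fixed four-taxon tree (((P1, P2), P3), Outgroup). It counts ABBA and BABA sites and returns D = (ABBA - BABA) / (ABBA + BABA).

```python
def d_statistic(p1, p2, p3, outgroup):
    """Return (n_abba, n_baba, D) for four aligned sequences.

    Assumes the tree (((P1, P2), P3), Outgroup) and treats the outgroup
    base as the ancestral allele "A"; the other base is the derived "B".
    """
    valid = set("ACGT")
    abba = baba = 0
    for a, b, c, o in zip(p1.upper(), p2.upper(), p3.upper(), outgroup.upper()):
        if not {a, b, c, o} <= valid:
            continue  # skip gaps, Ns, and ambiguity codes
        if c == o:
            continue  # P3 must carry the derived allele
        if a == o and b == c:
            abba += 1  # pattern P1=A, P2=B, P3=B, O=A
        elif b == o and a == c:
            baba += 1  # pattern P1=B, P2=A, P3=B, O=A
    total = abba + baba
    d = (abba - baba) / total if total else float("nan")
    return abba, baba, d


# Toy alignment (hypothetical data, six sites, last site masked by N in P1)
result = d_statistic("AGTCAN", "CGATGA", "CGTTGA", "AGACAA")
print(result)
```

A D value near zero is what incomplete lineage sorting alone would be expected to produce, while a strong excess of one pattern is usually read as a signal of introgression between P3 and either P1 or P2. In practice significance is typically assessed with a block jackknife across the genome, which this sketch omits.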

