Nicholls James A, Pennington R Toby, Koenen Erik J M, Hughes Colin E, Hearn Jack, Bunnefeld Lynsey, Dexter Kyle G, Stone Graham N, Kidner Catherine A
Ashworth Labs, Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK; Royal Botanic Garden Edinburgh, Edinburgh, UK.
Royal Botanic Garden Edinburgh, Edinburgh, UK.
Front Plant Sci. 2015 Sep 17;6:710. doi: 10.3389/fpls.2015.00710. eCollection 2015.
Evolutionary radiations are prominent and pervasive across many plant lineages in diverse geographical and ecological settings; in neotropical rainforests there is growing evidence that a significant fraction of species richness is the result of recent radiations. Understanding the evolutionary trajectories and mechanisms underlying these radiations demands much greater phylogenetic resolution than is currently available for these groups. The neotropical tree genus Inga (Leguminosae) is a good example, with ~300 extant species and a crown age of 2-10 MY, yet over 6 kb of plastid and nuclear DNA sequence data gives only poor phylogenetic resolution among species. Here we explore the use of larger-scale nuclear gene data obtained through targeted enrichment to increase phylogenetic resolution within Inga. Transcriptome data from three Inga species were used to select 264 nuclear loci for targeted enrichment and sequencing. Following quality control to remove probable paralogs from these sequence data, the final dataset comprised 259,313 bases from 194 loci for 24 accessions representing 22 Inga species and an outgroup (Zygia). Bayesian analyses, using either all loci concatenated or a gene-tree/species-tree approach, yielded highly resolved phylogenies. We used coalescent approaches to show that the same targeted enrichment data also have significant power to discriminate among alternative population histories within the widespread species I. umbellifera. In either application, targeted enrichment simplifies the informatics challenge of identifying orthologous loci associated with de novo genome sequencing. We conclude that targeted enrichment provides the large volumes of phylogenetically informative sequence data required to resolve relationships within recent plant species radiations, both at the species level and for within-species phylogeographic studies.