Department of Entomology, Washington State University, Pullman, Washington, USA.
Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA.
Mol Ecol Resour. 2024 Apr;24(3):e13925. doi: 10.1111/1755-0998.13925. Epub 2024 Jan 6.
Sequence data assembly is a foundational step in high-throughput sequencing, with untold consequences for downstream analyses. Despite this, few studies have interrogated the many methods for assembling phylogenomic UCE data for their comparative efficacy, or for how outputs may be impacted. We study this by comparing the most commonly used assembly methods for UCEs in the under-studied bee lineage Nomiinae and a representative sampling of relatives. Data for 63 UCE-only and 75 mixed taxa were assembled with five methods, including ABySS, HybPiper, SPAdes, Trinity and Velvet, and then benchmarked for their relative performance in terms of locus capture parameters and phylogenetic reconstruction. Unexpectedly, Trinity and Velvet trailed the other methods in terms of locus capture and DNA matrix density, whereas SPAdes performed favourably in most assessed metrics. In comparison with SPAdes, the guided-assembly approach HybPiper generally recovered the highest quality loci but in lower numbers. Based on our results, we formally move Clavinomia to Dieunomiini and render Epinomia once more a subgenus of Dieunomia. We strongly advise that future studies more closely examine the influence of assembly approach on their results, or, minimally, use better-performing assembly methods such as SPAdes or HybPiper. In this way, we can move forward with phylogenomic studies in a more standardized, comparable manner.
序列数据组装是高通量测序的基础步骤,对下游分析有不可估量的影响。尽管如此,很少有研究探讨过组装系统发生 UCE 数据的许多方法的比较效果,或者它们的输出可能会受到怎样的影响。我们通过比较在研究较少的熊蜂科 Nomiinae 及其代表性亲缘种中最常用的 UCE 组装方法来研究这一点。使用包括 ABySS、HybPiper、SPAdes、Trinity 和 Velvet 在内的五种方法对 63 个仅 UCE 和 75 个混合分类群的数据进行了组装,然后根据它们在基因座捕获参数和系统发育重建方面的相对性能进行了基准测试。出乎意料的是,在基因座捕获和 DNA 矩阵密度方面,Trinity 和 Velvet 落后于其他方法,而 SPAdes 在大多数评估指标中表现良好。与 SPAdes 相比,引导组装方法 HybPiper 通常能恢复更高质量的基因座,但数量较少。根据我们的结果,我们正式将 Clavinomia 移至 Dieunomiini,并再次将 Epinomia 归入 Dieunomia 的一个亚属。我们强烈建议未来的研究更仔细地检查组装方法对其结果的影响,或者至少使用性能更好的组装方法,如 SPAdes 或 HybPiper。通过这种方式,我们可以以更标准化、更可比的方式推进系统发生基因组学研究。