Wold Jana R, Guhlin Joseph G, Dearden Peter K, Santure Anna W, Steeves Tammy E
University of Canterbury, Christchurch, New Zealand.
Genomics Aotearoa and Biochemistry Department, University of Otago, Dunedin, New Zealand.
Mol Ecol Resour. 2023 Mar 14. doi: 10.1111/1755-0998.13783.
There is growing interest in the role of structural variants (SVs) as drivers of local adaptation and speciation. From a biodiversity genomics perspective, the characterization of genome-wide SVs provides an exciting opportunity to complement single nucleotide polymorphisms (SNPs). However, little is known about the impacts of SV discovery and genotyping strategies on the characterization of genome-wide SV diversity within and among populations. Here, we explore a near whole-species resequence data set, and long-read sequence data for a subset of highly represented individuals in the critically endangered kākāpō (Strigops habroptilus). We demonstrate that even when using a highly contiguous reference genome, different discovery and genotyping strategies can significantly impact the type, size and location of SVs characterized genome-wide. Further, we found that the mean number of SVs in each of two kākāpō lineages differed both within and across generations. These combined results suggest that genome-wide characterization of SVs remains challenging at the population-scale. We are optimistic that increased accessibility to long-read sequencing and advancements in bioinformatic approaches including multireference approaches like genome graphs will alleviate at least some of the challenges associated with resolving SV characteristics below the species level. In the meantime, we address caveats, highlight considerations, and provide recommendations for the characterization of genome-wide SVs in biodiversity genomic research.
结构变异(SVs)作为局部适应和物种形成驱动因素的作用正受到越来越多的关注。从生物多样性基因组学的角度来看,全基因组SVs的表征为补充单核苷酸多态性(SNPs)提供了一个令人兴奋的机会。然而,关于SV发现和基因分型策略对种群内和种群间全基因组SV多样性表征的影响,我们知之甚少。在这里,我们探索了一个近乎全物种的重测序数据集,以及极度濒危的鸮鹦鹉(Strigops habroptilus)中一部分高代表性个体的长读长序列数据。我们证明,即使使用高度连续的参考基因组,不同的发现和基因分型策略也会显著影响全基因组表征的SVs的类型、大小和位置。此外,我们发现两个鸮鹦鹉谱系中每个谱系的SVs平均数量在代内和代间都有所不同。这些综合结果表明,在种群规模上,全基因组SVs的表征仍然具有挑战性。我们乐观地认为,长读长测序的可及性增加以及包括基因组图谱等多参考方法在内的生物信息学方法的进步,将至少缓解一些与解析物种水平以下的SV特征相关的挑战。与此同时,我们讨论了注意事项,强调了考虑因素,并为生物多样性基因组研究中全基因组SVs的表征提供了建议。