Du Ze-Zhen, He Jia-Bao, Jiao Wen-Biao
National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, 430070 China.
Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070 China.
aBIOTECH. 2025 Mar 28;6(2):361-376. doi: 10.1007/s42994-025-00206-7. eCollection 2025 Jun.
Innovations in DNA sequencing technologies have greatly boosted population-level genomic studies in plants, facilitating the identification of key genetic variations for investigating population diversity and accelerating the molecular breeding of crops. Conventional methods for genomic analysis typically rely on small variants, such as SNPs and indels, and use single linear reference genomes, which introduces biases and reduces performance in highly divergent genomic regions. By integrating the population level of sequences, pangenomes, particularly graph pangenomes, offer a promising solution to these challenges. To date, numerous algorithms have been developed for constructing pangenome graphs, aligning reads to these graphs, and performing variant genotyping based on these graphs. As demonstrated in various plant pangenomic studies, these advancements allow for the detection of previously hidden variants, especially structural variants, thereby enhancing applications such as genetic mapping of agronomically important genes. However, noteworthy challenges remain to be overcome in applying pangenome graph approaches to plants. Addressing these issues will require the development of more sophisticated algorithms tailored specifically to plants. Such improvements will contribute to the scalability of this approach, facilitating the production of super-pangenomes, in which hundreds or even thousands of de novo-assembled genomes from one species or genus can be integrated. This, in turn, will promote broader pan-omic studies, further advancing our understanding of genetic diversity and driving innovations in crop breeding.
DNA测序技术的创新极大地推动了植物群体水平的基因组研究,有助于识别用于研究群体多样性的关键遗传变异,并加速作物的分子育种。传统的基因组分析方法通常依赖于单核苷酸多态性(SNP)和插入缺失等小变异,并使用单一的线性参考基因组,这会引入偏差并降低在高度分化的基因组区域中的性能。通过整合群体水平的序列,泛基因组,特别是图形泛基因组,为应对这些挑战提供了一个有前景的解决方案。迄今为止,已经开发了许多算法用于构建泛基因组图谱、将 reads 比对到这些图谱以及基于这些图谱进行变异基因分型。正如在各种植物泛基因组研究中所证明的那样,这些进展使得能够检测到以前隐藏的变异,特别是结构变异,从而增强了诸如重要农艺基因的遗传定位等应用。然而,在将泛基因组图谱方法应用于植物方面,仍有值得注意的挑战有待克服。解决这些问题将需要开发专门针对植物的更复杂算法。这种改进将有助于提高该方法的可扩展性,促进超级泛基因组的产生,其中可以整合来自一个物种或属的数百甚至数千个从头组装的基因组。反过来,这将促进更广泛的泛组学研究,进一步加深我们对遗传多样性的理解,并推动作物育种的创新。