Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Geneva, NY 14456, United States.
Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, United States.
DNA Res. 2024 Oct 1;31(5). doi: 10.1093/dnares/dsae026.
Wild Malus species flourished in North America long before Europeans introduced domesticated apples. Malus coronaria and M. ioensis are native to the mid-western and eastern United States, while M. angustifolia and M. fusca grow in the southeast and west, respectively. They offer disease resistance, climate and soil adaptability, and horticultural traits for apple breeding. However, their utilization remains limited due to insufficient genomic resources and specific genetics. We report high-quality phased chromosome-scale assemblies of M. coronaria and M. ioensis, generated using long-read and conformation capture sequencing. Phylogenetic and synteny analysis indicated high relatedness between these 2 genomes and previously published genome of M. angustifolia, and lower relatedness with M. fusca. Gene family-based pangenome of North American Malus identified 60,211 orthogroups containing 340,087 genes. Genes involved in basic cellular and metabolic processes, growth, and development were core to the existence of these species, whereas genes involved in secondary metabolism, stress response, and interactions with other organisms were accessory and are likely associated with adaptation to specific environments. Structural variation hotspots were mostly overlapping with high gene density. This study offers novel native North American Malus genome resources that can be used to identify genes for apple breeding and understand their evolution and adaptation.
野生苹果属植物在欧洲人引入栽培苹果之前就在北美的野外大量生长。冠状苹果(Malus coronaria)和爱奥尼斯苹果(M. ioensis)原产于美国中西部和东部,而狭叶苹果(M. angustifolia)和黑果腺肋花楸(M. fusca)分别生长在美国东南部和西部。它们为苹果的选育提供了抗病性、对气候和土壤的适应性以及园艺特性。然而,由于基因组资源不足和特定的遗传特性,它们的利用仍然受到限制。我们报告了使用长读长和构象捕获测序生成的冠状苹果和爱奥尼斯苹果高质量的相染色体级别的组装。系统发育和共线性分析表明,这 2 个基因组与先前发表的狭叶苹果基因组高度相关,与黑果腺肋花楸的相关性较低。北美苹果属基于基因家族的泛基因组鉴定出 60,211 个直系同源群,包含 340,087 个基因。参与基本细胞和代谢过程、生长和发育的基因是这些物种存在的核心,而参与次生代谢、应激反应和与其他生物相互作用的基因是辅助性的,可能与适应特定环境有关。结构变异热点与高基因密度大多重叠。这项研究提供了新的北美本土苹果属基因组资源,可用于鉴定苹果选育的基因,并了解它们的进化和适应。