Zhang Huiting, Ko Itsuhiro, Eaker Abigail, Haney Sabrina, Khuu Ninh, Ryan Kara, Appleby Aaron B, Hoffmann Brendan, Landis Henry, Pierro Kenneth A, Willsea Noah, Hargarten Heidi, Yocca Alan E, Harkess Alex, Honaas Loren, Ficklin Stephen
Department of Horticulture, Washington State University, Pullman, WA, 99164, USA.
USDA Agricultural Research Service, Wenatchee, WA, USA 98801, USA.
G3 (Bethesda). 2024 Sep 17;14(12). doi: 10.1093/g3journal/jkae222.
Genome sequencing for agriculturally important Rosaceous crops has made rapid progress both in completeness and annotation quality. Whole genome sequence and annotation gives breeders, researchers, and growers information about cultivar specific traits such as fruit quality and disease resistance, and informs strategies to enhance postharvest storage. Here we present a haplotype-phased, chromosomal level genome of Malus domestica, 'WA 38', a new apple cultivar released to market in 2017 as Cosmic Crisp®. Using both short and long read sequencing data with a k-mer based approach, chromosomes originating from each parent were assembled and segregated. This is the first pome fruit genome fully phased into parental haplotypes in which chromosomes from each parent are identified and separated into their unique, respective haplomes. The two haplome assemblies, 'Honeycrisp' originated HapA and 'Enterprise' originated HapB, are about 650 Megabases each, and both have a BUSCO score of 98.7% complete. A total of 53,028 and 54,235 genes were annotated from HapA and HapB, respectively. Additionally, we provide genome-scale comparisons to 'Gala', 'Honeycrisp', and other relevant cultivars highlighting major differences in genome structure and gene family circumscription. This assembly and annotation was done in collaboration with the American Campus Tree Genomes project that includes 'WA 38' (Washington State University), 'd'Anjou' pear (Auburn University), and many more. To ensure transparency, reproducibility, and applicability for any genome project, our genome assembly and annotation workflow is recorded in detail and shared under a public GitLab repository. All software is containerized, offering a simple implementation of the workflow.
在农业上具有重要意义的蔷薇科作物的基因组测序在完整性和注释质量方面都取得了迅速进展。全基因组序列和注释为育种者、研究人员和种植者提供了有关品种特定性状(如果实品质和抗病性)的信息,并为加强采后储存的策略提供了依据。在这里,我们展示了苹果品种‘WA 38’(2017年作为Cosmic Crisp®推向市场的一个新苹果品种)的单倍型定相、染色体水平的基因组。使用短读长和长读长测序数据以及基于k-mer的方法,组装并分离了来自每个亲本的染色体。这是第一个完全定相为亲本单倍型的梨果基因组,其中来自每个亲本的染色体被识别并分离成其独特的、各自的单倍体基因组。两个单倍体基因组组装,即源自‘蜜脆’的单倍型A和源自‘企业’的单倍型B,每个大约有650兆碱基,并且两者的BUSCO完整性得分均为98.7%。分别从单倍型A和单倍型B注释出了总共53,028个和54,235个基因。此外,我们提供了与‘嘎啦’、‘蜜脆’和其他相关品种的全基因组比较,突出了基因组结构和基因家族界定的主要差异。这个组装和注释是与美国校园树木基因组项目合作完成的,该项目包括‘WA 38’(华盛顿州立大学))、‘安茹’梨(奥本大学)等更多品种。为确保任何基因组项目的透明度、可重复性和适用性,我们的基因组组装和注释工作流程被详细记录并在一个公共的GitLab仓库中共享。所有软件都被容器化,提供了工作流程的简单实现。