Corporación Colombiana de Investigación Agropecuaria (AGROSAVIA), Bogotá, Cundinamarca 250047, Colombia.
Boyce Thompson Institute, Ithaca, NY 14850, USA.
G3 (Bethesda). 2024 Sep 4;14(9). doi: 10.1093/g3journal/jkae139.
Potato (Solanum tuberosum) is an essential crop for food security and ranks as the third most important crop worldwide for human consumption. The Diacol Capiro cultivar holds the dominant position in Colombian cultivation, primarily supplying the food processing industry. This highly heterozygous, autotetraploid cultivar belongs to the Andigenum group and stands out for its adaptation to a wide variety of environments spanning altitudes from 1,800 to 3,200 meters above sea level. Here, a chromosome-scale assembly, referred to as DC, is presented for this cultivar. The assembly was generated by combining circular consensus sequencing with proximity ligation Hi-C for scaffolding. It represents 2.369 Gb, with 48 pseudochromosomes covering 2.091 Gb and an anchor rate of 88.26%. The reference genome metrics, including an N50 of 50.5 Mb, a BUSCO (Benchmarking Universal Single-Copy Orthologue) score of 99.38%, and a Long Terminal Repeat (LTR) Assembly Index score of 13.53, collectively indicate high assembly quality. A comprehensive annotation yielded a total of 154,114 genes, and the associated BUSCO score of 95.78% for the annotated sequences attests to their completeness. The number of predicted NLR (nucleotide-binding and leucine-rich-repeat) genes was 2,107, of which 99.85% contained an NBARC domain (the nucleotide-binding domain shared by Apaf-1, certain R gene products, and CED-4). Further comparative analysis of the annotated assembly against known high-quality potato genomes showed similar genome metrics, with differences in total gene number related to ploidy status. The genome assembly and annotation of DC presented in this study represent a valuable asset for understanding potato genetics. This resource aids targeted breeding initiatives and contributes to the creation of enhanced, resilient, and more productive potato varieties, which is particularly beneficial for countries in Latin America.
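The anchor rate and N50 quoted above are simple summary statistics, and a minimal sketch may help make them concrete. The function names `anchor_rate` and `n50` are hypothetical and are not taken from the study's pipeline. The anchored length is taken as 2.091 Gb, the value consistent with the reported 2.369 Gb total and 88.26% anchor rate, and the scaffold lengths used for N50 are made-up toy numbers.

```python
from typing import Sequence


def anchor_rate(anchored_bp: float, total_bp: float) -> float:
    """Percentage of the assembly placed on pseudochromosomes."""
    if total_bp <= 0:
        raise ValueError("total_bp must be positive")
    return 100.0 * anchored_bp / total_bp


def n50(lengths: Sequence[int]) -> int:
    """Largest length L such that sequences of length >= L
    together cover at least half of the total assembly length."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    half = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half:
            return length
    raise AssertionError("unreachable for non-empty input")


# Rounded figures from the abstract, in Gb (assumed anchored length: 2.091 Gb).
print(f"anchor rate: {anchor_rate(2.091, 2.369):.1f}%")

# Toy scaffold lengths (illustrative only, not from the DC assembly).
print("toy N50:", n50([50, 40, 30, 20, 10]))
```

Because the inputs are rounded to three decimals, the recomputed anchor rate agrees with the reported 88.26% only to about one decimal place.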