Academy of Agriculture and Forestry Sciences, Qinghai University, Xining, China.
National Key Laboratory of Sanjiangyuan Ecology and Plateau Agriculture and Animal Husbandry, Qinghai University, Xining, China.
Plant Biotechnol J. 2022 Oct;20(10):1996-2005. doi: 10.1111/pbi.13883. Epub 2022 Aug 1.
Potato (Solanum tuberosum L.) originated in the Andes and evolved its vegetative propagation strategy through short day-dependent tuber development. Herein, we present a high-quality, chromosome-scale reference genome sequence of a tetraploid potato cultivar. The total length of this genome assembly was 2.67 Gb, with scaffold N50 and contig N50 sizes of 46.24 and 2.19 Mb, respectively. In total, 1.69 Gb repetitive sequences were obtained through de novo annotation, and long terminal repeats were the main transposable elements. A total of 126 070 protein-coding genes were annotated, of which 125 077 (99.21%) were located on chromosomes. The 48 chromosomes were classified into four haplotypes. We annotated 31 506 homologous genes, including 5913 (18.77%) genes with four homologues, 11 103 (35.24%) with three homologues, 12 177 (38.65%) with two homologues and 2313 (7.34%) with one homologue. MLH3, MSH6/7 and RFC3, which are the genes involved in the mismatch repair pathway, were found to be significantly expanded in the tetraploid potato genome relative to the diploid potato genome. Genome-wide association analysis revealed that cytochrome P450, flavonoid synthesis, chalcone enzyme, glycosyl hydrolase and glycosyl transferase genes were significantly correlated with the flesh colours of potato tuber in 150 tetraploid potatoes. This study provides valuable insights into the highly heterozygous autotetraploid potato genome and may facilitate the development of tools for potato cultivar breeding and further studies on autotetraploid crops.
马铃薯(Solanum tuberosum L.)起源于安第斯山脉,通过短日照依赖性块茎发育进化出营养繁殖策略。在此,我们呈现了一个四倍体马铃薯品种的高质量、染色体水平的参考基因组序列。该基因组组装的总长度为 26.7 亿碱基对,支架 N50 和 Contig N50 大小分别为 46.24 和 2.19 Mb。总共通过从头注释获得了 1.69 亿碱基对的重复序列,长末端重复是主要的转座元件。总共注释了 126070 个蛋白质编码基因,其中 125077 个(99.21%)位于染色体上。48 条染色体分为四个单倍型。我们注释了 31506 个同源基因,包括 5913 个(18.77%)具有四个同源基因,11103 个(35.24%)具有三个同源基因,12177 个(38.65%)具有两个同源基因,2313 个(7.34%)具有一个同源基因。与二倍体马铃薯基因组相比,多倍体马铃薯基因组中 MLH3、MSH6/7 和 RFC3 等参与错配修复途径的基因明显扩张。全基因组关联分析表明,细胞色素 P450、类黄酮合成、查尔酮酶、糖苷水解酶和糖基转移酶基因与 150 个四倍体马铃薯块茎的肉色显著相关。本研究为高度杂合的同源四倍体马铃薯基因组提供了有价值的见解,并可能促进马铃薯品种选育工具的开发和同源四倍体作物的进一步研究。