Wang Linhai, Xia Qiuju, Zhang Yanxin, Zhu Xiaodong, Zhu Xiaofeng, Li Donghua, Ni Xuemei, Gao Yuan, Xiang Haitao, Wei Xin, Yu Jingyin, Quan Zhiwu, Zhang Xiurong
Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Key Laboratory of Biology and Genetic Improvement of Oil Crops of the Ministry of Agriculture, Wuhan, 430062, China.
Shenzhen Engineering Laboratory of Crop Molecular Design Breeding, BGI-agro, 518083, Shenzhen, China.
BMC Genomics. 2016 Jan 5;17:31. doi: 10.1186/s12864-015-2316-4.
Sesame is an important high-quality oil seed crop. The sesame genome was de novo sequenced and assembled in 2014 (version 1.0); however, the number of anchored pseudomolecules was higher than the chromosome number (2n = 2x = 26) due to the lack of a high-density genetic map with 13 linkage groups.
We resequenced a permanent population consisting of 430 recombinant inbred lines and constructed a genetic map to improve the sesame genome assembly. We successfully anchored 327 scaffolds onto 13 pseudomolecules. The new genome assembly (version 2.0) included 97.5 % of the scaffolds greater than 150 kb in size present in assembly version 1.0 and increased the total pseudomolecule length from 233.7 to 258.4 Mb with 94.3 % of the genome assembled and 97.2 % of the predicted gene models anchored. Based on the new genome assembly, a bin map including 1,522 bins spanning 1090.99 cM was generated and used to identified 41 quantitative trait loci (QTLs) for sesame plant height and 9 for seed coat color. The plant height-related QTLs explained 3-24 % the phenotypic variation (mean value, 8 %), and 29 of them were detected in at least two field trials. Two major loci (qPH-8.2 and qPH-3.3) that contributed 23 and 18 % of the plant height were located in 350 and 928-kb spaces on Chr8 and Chr3, respectively. qPH-3.3, is predicted to be responsible for the semi-dwarf sesame plant phenotype and contains 102 candidate genes. This is the first report of a sesame semi-dwarf locus and provides an interesting opportunity for a plant architecture study of the sesame. For the sesame seed coat color, the QTLs of the color spaces L*, a*, and b* were detected with contribution rates of 3-46 %. qSCb-4.1 contributed approximately 39 % of the b* value and was located on Chr4 in a 199.9-kb space. A list of 32 candidate genes for the locus, including a predicted black seed coat-related gene, was determined by screening the newly anchored genome.
This study offers a high-density genetic map and an improved assembly of the sesame genome. The number of linkage groups and pseudomolecules in this assembly equals the number of sesame chromosomes for the first time. The map and updated genome assembly are expected to serve as a platform for future comparative genomics and genetic studies.
芝麻是一种重要的优质油料作物。芝麻基因组于2014年进行了从头测序和组装(版本1.0);然而,由于缺乏具有13个连锁群的高密度遗传图谱,锚定的假分子数量高于染色体数(2n = 2x = 26)。
我们对由430个重组自交系组成的永久群体进行了重测序,并构建了遗传图谱以改进芝麻基因组组装。我们成功地将327个支架锚定到13个假分子上。新的基因组组装(版本2.0)包含了版本1.0中大小大于150 kb的支架的97.5%,并将假分子总长度从233.7 Mb增加到258.4 Mb,基因组组装率为94.3%,预测基因模型的锚定率为97.2%。基于新的基因组组装,生成了一个包含1522个bin、跨度为1090.99 cM的bin图谱,并用于鉴定芝麻株高的41个数量性状位点(QTL)和种皮颜色的9个QTL。与株高相关的QTL解释了3%-24%的表型变异(平均值为8%),其中29个在至少两个田间试验中被检测到。两个主要位点(qPH-8.2和qPH-3.3)分别位于第8号和第3号染色体上350 kb和928 kb的区间内,对株高的贡献率分别为23%和18%。qPH-3.3预计是导致芝麻半矮化植株表型的原因,包含102个候选基因。这是芝麻半矮化位点的首次报道,为芝麻的株型研究提供了一个有趣的机会。对于芝麻种皮颜色,检测到颜色空间L*、a和b的QTL,贡献率为3%-46%。qSCb-4.1对b*值的贡献率约为39%,位于第4号染色体上199.9 kb的区间内。通过筛选新锚定的基因组,确定了该位点的32个候选基因列表,其中包括一个预测的与黑色种皮相关的基因。
本研究提供了一个高密度遗传图谱和改进的芝麻基因组组装。该组装中的连锁群和假分子数量首次与芝麻染色体数量相等。该图谱和更新的基因组组装有望成为未来比较基因组学和遗传研究的平台。