Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China.
School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
Sci Data. 2018 May 2;5:180079. doi: 10.1038/sdata.2018.79.
Genomic diversity within a species genome is the genetic basis of its phenotypic diversity essential for its adaptation to environments. The big picture of the total genetic diversity within Asian cultivated rice has been uncovered since the sequencing of 3,000 rice genomes, including the SNP data publicly available in the SNP-Seek database. Here we report other aspects of the genetic diversity, including rice sequences assembled from over 3,000 accessions but absent in the Nipponbare reference genome, structural variations (SVs) and gene presence/absence variations (PAVs) in 453 accessions with sequencing depth over 20x. Using either SVs or gene PAVs, we were able to reconstruct the population structure of O. sativa, which was consistent with previous result based on SNPs. Moreover, we demonstrated the usefulness of the new data sets by successfully detecting the strong association of the "Green Revolution gene", sd1, with plant height. Our data provide a more comprehensive view of the genetic diversity within rice, as well as additional genomic resources for research in rice breeding and plant biology.
物种基因组内的基因组多样性是其表型多样性的遗传基础,对于其适应环境至关重要。自 3000 个水稻基因组测序以来,揭示了亚洲栽培稻总遗传多样性的全貌,包括 SNP-Seek 数据库中公开的 SNP 数据。在这里,我们报告了遗传多样性的其他方面,包括来自 3000 多个品系的水稻序列,但在 Nipponbare 参考基因组中不存在,以及在测序深度超过 20 倍的 453 个品系中的结构变异(SVs)和基因存在/缺失变异(PAVs)。使用 SVs 或基因 PAVs,我们能够重建 O. sativa 的种群结构,这与基于 SNP 的先前结果一致。此外,我们通过成功检测到“绿色革命基因”sd1 与株高的强关联,证明了这些新数据集的有用性。我们的数据提供了一个更全面的视角来了解水稻内部的遗传多样性,以及为水稻育种和植物生物学研究提供了更多的基因组资源。