BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.
China National GeneBank, Jinsha Road, Dapeng New District, Shenzhen 518120, China.
Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giz007.
Genome sequencing has been widely used in plant research to construct reference genomes and provide evolutionary insights. However, few plant species have had their whole genome sequenced, which limits the utility of these data. We collected 1,093 samples of vascular plant species growing in the Ruili Botanical Garden, located in southwest China. Of these, we sequenced 761 samples and collected voucher specimens, which are stored in the Herbarium of China National GeneBank.
The 761 sequenced samples represented 689 vascular plant species from 137 families belonging to 49 orders. Of these, 257 samples were identified to the species level and 504 to the family level, using specimens and chloroplast sequences. In total, we generated 54 Tb of sequencing data, with an average sequencing depth of 60X per species, as estimated from genome sizes. A reference phylogeny was reconstructed from 78 chloroplast genes for molecular identification and other possible applications.
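The depth estimate mentioned above is conventionally the number of sequenced bases divided by the estimated genome size. The following is a minimal sketch of that arithmetic, not the authors' pipeline. The function name `sequencing_depth` and the sample figures (72 Gb of reads, a 1.2 Gb genome) are hypothetical and chosen only for illustration.

```python
def sequencing_depth(total_bases: float, genome_size_bp: float) -> float:
    """Estimate depth as sequenced bases divided by estimated genome size."""
    if genome_size_bp <= 0:
        raise ValueError("genome size must be positive")
    return total_bases / genome_size_bp


# Hypothetical sample (not from the study): 72 Gb of reads, 1.2 Gb genome.
depth = sequencing_depth(72e9, 1.2e9)
print(f"estimated depth: {depth:.0f}X")
```

Because depth scales inversely with genome size, the same read yield gives much lower depth for a large genome. An average across species is therefore computed per sample rather than from the pooled data total.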
The large dataset of vascular plant genomes generated in this study, which includes both high-depth whole-genome sequencing data and associated voucher specimens, is valuable for plant genome research and other applications. This project also provides insight into the feasibility and technical requirements for "planetary-scale" projects such as the 10,000 Plant Genomes Project and the Earth BioGenome Project.