State Forestry Administration Key Open Laboratory on the Science and Technology of Bamboo and Rattan, Institute of Gene Science for Bamboo and Rattan Resources, International Center for Bamboo and Rattan, Futongdong Rd, WangJing, Chaoyang District, Beijing 100102, China.
BGI Genomics, BGI-Shenzhen, Building No. 7, BGI Park, No. 21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China.
Gigascience. 2018 Sep 1;7(9):giy097. doi: 10.1093/gigascience/giy097.
Calamus simplicifolius and Daemonorops jenkinsiana are two representative rattan species that serve as the most significant material sources for the rattan industry. However, the lack of reference genome sequences is a major obstacle to basic and applied biological research on rattan.
We produced chromosome-level genome assemblies of C. simplicifolius and D. jenkinsiana using Illumina, Pacific Biosciences, and Hi-C sequencing data. A total of ∼730 Gb and ∼682 Gb of raw data were generated, covering the predicted genome lengths (∼1.98 Gb for C. simplicifolius and ∼1.61 Gb for D. jenkinsiana) at read depths of ∼372× and ∼426×, respectively. The two de novo genome assemblies, ∼1.94 Gb and ∼1.58 Gb in size, have scaffold N50s of ∼160 Mb and ∼119 Mb for C. simplicifolius and D. jenkinsiana, respectively. The C. simplicifolius and D. jenkinsiana genomes were predicted to harbor 51,235 and 53,342 intact protein-coding gene models, respectively. Benchmarking Universal Single-Copy Orthologs (BUSCO) evaluation indicated genome completeness of 96.4% and 91.3% for the C. simplicifolius and D. jenkinsiana genomes, respectively. Genome evolution analysis showed that four Arecaceae species clustered together and that the two rattans diverged ∼19.3 million years ago. Additionally, we identified 193 and 172 genes involved in the lignin biosynthesis pathway in the C. simplicifolius and D. jenkinsiana genomes, respectively.
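As a rough check on the sequencing figures, read depth can be approximated as total raw data divided by predicted genome length, and assembly coverage as assembled length divided by predicted length. The sketch below uses only the rounded values quoted in the abstract; the function and variable names are illustrative and not taken from the study. With these rounded inputs the depths come out near 369× and 424×, slightly below the reported ∼372× and ∼426×, which is what one would expect if the published depths were computed from unrounded totals.

```python
# Rounded values quoted in the abstract, all in gigabases (Gb).
# Names here are illustrative assumptions, not the authors' pipeline.
genomes = {
    "C. simplicifolius": {"raw_gb": 730, "predicted_gb": 1.98, "assembled_gb": 1.94},
    "D. jenkinsiana": {"raw_gb": 682, "predicted_gb": 1.61, "assembled_gb": 1.58},
}


def read_depth(raw_gb: float, genome_gb: float) -> float:
    """Approximate read depth: total raw bases over predicted genome length."""
    return raw_gb / genome_gb


def assembled_fraction(assembled_gb: float, predicted_gb: float) -> float:
    """Fraction of the predicted genome length captured by the assembly."""
    return assembled_gb / predicted_gb


# Map each species to (approximate depth, assembled fraction).
summary = {
    name: (
        read_depth(g["raw_gb"], g["predicted_gb"]),
        assembled_fraction(g["assembled_gb"], g["predicted_gb"]),
    )
    for name, g in genomes.items()
}

for name, (depth, fraction) in summary.items():
    print(f"{name}: ~{depth:.0f}x depth, {fraction:.1%} of predicted length assembled")
```

Both assemblies account for roughly 98% of the predicted genome length under this simple ratio.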
We present the first de novo assemblies of two rattan genomes (C. simplicifolius and D. jenkinsiana). These data will provide a fundamental resource for functional genomics, particularly for promoting germplasm utilization in breeding, and will also serve as reference genomes for comparative studies among different species.