State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen, 518083, China.
BGI Life Science Joint Research Center, Northeast Forestry University, Harbin, 150400, China.
Sci Data. 2023 Aug 3;10(1):512. doi: 10.1038/s41597-023-02420-8.
Wood is the most important natural and endlessly renewable source of energy. Despite the ecological and economic importance of wood, many aspects of its formation have not yet been investigated. We performed chromosome-scale genome assemblies of three timber trees (Ochroma pyramidale, Mesua ferrea, and Tectona grandis) which exhibit different wood properties such as wood density, hardness, growth rate, and fiber cell wall thickness. The combination of 10X, stLFR, Hi-Fi sequencing and HiC data led us to assemble high-quality genomes evident by scaffold N50 length of 55.97 Mb (O. pyramidale), 22.37 Mb (M. ferrea) and 14.55 Mb (T. grandis) with >97% BUSCO completeness of the assemblies. A total of 35774, 24027, and 44813 protein-coding genes were identified in M. ferrea, T. grandis and O. pyramidale, respectively. The data generated in this study is anticipated to serve as a valuable genetic resource and will promote comparative genomic analyses, and it is of practical importance in gaining a further understanding of the wood properties in non-model woody species.
木材是最重要的天然、可再生能源。尽管木材具有生态和经济的重要性,但它的许多形成过程尚未得到研究。我们对三种木材树种(可可树、麻疯树和柚木)进行了染色体级别的基因组组装,这些树种表现出不同的木材特性,如木材密度、硬度、生长速度和纤维细胞壁厚度。10X、stLFR、Hi-Fi 测序和 HiC 数据的结合使我们能够组装出高质量的基因组,其明显特征是支架 N50 长度分别为 55.97Mb(可可树)、22.37Mb(麻疯树)和 14.55Mb(柚木),组装的完整性大于 97%。在麻疯树、柚木和可可树中分别鉴定到 35774、24027 和 44813 个蛋白质编码基因。本研究产生的数据有望成为有价值的遗传资源,并将促进比较基因组分析,对深入了解非模式木本物种的木材特性具有实际意义。