Guangdong Forestry Survey and Planning Institute, Guangzhou, 510520, China.
Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
BMC Genom Data. 2023 Nov 28;24(1):73. doi: 10.1186/s12863-023-01176-9.
Erythrophleum is a genus in the Fabaceae family. The genus contains only about 10 species, and it is best known for its hardwood and medical properties worldwide. Erythrophleum fordii Oliv. is the only species of this genus distributed in China. It has superior wood and can be used in folk medicine, which leads to its overexploitation in the wild. For its effective conservation and elucidation of the distinctive genetic traits of wood formation and medical components, we present its first genome assembly.
This work generated ~ 160.8 Gb raw Nanopore whole genome sequencing (WGS) long reads, ~ 126.0 Gb raw MGI WGS short reads and ~ 29.0 Gb raw RNA-seq reads using E. fordii leaf tissues. The de novo assembly contained 864,825,911 bp in the E. fordii genome, with 59 contigs and a contig N50 of 30,830,834 bp. Benchmarking Universal Single-Copy Orthologs (BUSCO) revealed 98.7% completeness of the assembly. The assembly contained 471,006,885 bp (54.4%) repetitive sequences and 28,761 genes that coded for 33,803 proteins. The protein sequences were functionally annotated against multiple databases, facilitating comparative genomic analysis.
血桐是豆科血桐属植物。该属约有 10 个种,以其优质硬木和药用价值闻名于世。分布于中国的血桐是该属唯一的物种。它具有优良的木材,可用于民间医学,这导致其在野外被过度开发。为了有效保护和阐明木材形成和药用成分的独特遗传特征,我们首次对其基因组进行了组装。
这项工作使用血桐叶片组织生成了约 160.8 Gb 的原始纳米孔全基因组测序(WGS)长读长、约 126.0 Gb 的原始 MGI WGS 短读长和约 29.0 Gb 的原始 RNA-seq 读长。血桐基因组的从头组装包含 864,825,911 bp,由 59 个 contigs 组成,contig N50 为 30,830,834 bp。基准通用单拷贝直系同源物(BUSCO)显示组装的完整性为 98.7%。该组装包含 471,006,885 bp(54.4%)的重复序列和 28,761 个编码 33,803 个蛋白质的基因。蛋白质序列被多个数据库进行了功能注释,促进了比较基因组分析。