Niu Yingfeng, Li Guohua, Ni Shubang, He Xiyong, Zheng Cheng, Liu Ziyan, Gong Lidan, Kong Guanghong, Li Wei, Liu Jin
Yunnan Institute of Tropical Crops, Xishuangbanna, China.
School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China.
Front Genet. 2022 Feb 23;13:835363. doi: 10.3389/fgene.2022.835363. eCollection 2022.
Macadamia is an evergreen tree belonging to the Proteaceae family. The two commercial macadamia species, and , are highly prized for their edible kernels. The genome was recently sequenced, but the genome of has to date not been published, which limits the study of biological research and breeding in this species. This study reports a high-quality genome sequence of based on the Oxford Nanopore Technologies technology and high-throughput chromosome conformation capture techniques (Hi-C). An assembly of 750.87 Mb with 51.11 Mb N50 length was generated, close to the 740 and 758 Mb size estimates by flow cytometry and k-mer analysis, respectively. Genome annotation indicated that 61.42% of the genome is composed of repetitive sequences and 34.95% is composed of long terminal repeat retrotransposons. Up to 31,571 protein-coding genes were predicted, of which 92.59% were functionally annotated. The average gene length was 6,055 bp. Comparative genome analysis revealed that the gene families associated with defense response, lipid transport, steroid biosynthesis, triglyceride lipase activity, and fatty acid metabolism are expanded in the genome. The distribution of fourfold synonymous third-codon transversion showed a recent whole-genome duplication event in . Genomic and transcriptomic analysis identified 187 genes encoding 33 crucial oil biosynthesis enzymes, depicting a comprehensive map of macadamia lipid biosynthesis. Besides, the 55 identified genes exhibited preferential expression in root as compared to that in other tissues. The genome sequence of provides novel insights for breeding novel varieties and genetic improvement of agronomic traits.
澳洲坚果是一种属于山龙眼科的常绿乔木。两种商业种植的澳洲坚果品种,即[品种一]和[品种二],因其可食用的果仁而备受珍视。[品种一]的基因组最近已被测序,但[品种二]的基因组迄今为止尚未发表,这限制了对该物种生物学研究和育种的研究。本研究基于牛津纳米孔技术和高通量染色体构象捕获技术(Hi-C)报告了[品种二]的高质量基因组序列。生成了一个大小为750.87 Mb、N50长度为51.11 Mb的组装序列,分别接近通过流式细胞术和k-mer分析估计的740和758 Mb大小。基因组注释表明,61.42%的基因组由重复序列组成,34.95%由长末端重复反转录转座子组成。预测出多达31,571个蛋白质编码基因,其中92.59%在功能上得到注释。平均基因长度为6,055 bp。比较基因组分析表明,与防御反应、脂质转运、类固醇生物合成、甘油三酯脂肪酶活性和脂肪酸代谢相关的基因家族在[品种二]基因组中有所扩展。四倍同义第三密码子颠换的分布显示[品种二]最近发生了一次全基因组复制事件。基因组和转录组分析鉴定出187个编码33种关键油脂生物合成酶的基因,描绘了澳洲坚果脂质生物合成的全面图谱。此外,与其他组织相比,鉴定出的55个[品种二]基因在根中表现出优先表达。[品种二]的基因组序列为培育新品种和农艺性状的遗传改良提供了新的见解。