Lim Leonard Whye Kit, Lau Melinda Mei Lin, Chung Hung Hui, Hussain Hasnain, Gan Han Ming
Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia.
Centre for Sago Research (CoSAR), Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia.
Data Brief. 2022 Jan 6;40:107800. doi: 10.1016/j.dib.2022.107800. eCollection 2022 Feb.
The sago palm ( Rottboll) is a tropical halophytic starch-producing, economically important crop palm mainly located in Southeast Asian countries. Recently, a genome survey was conducted on this palm using the Illumina sequencing platform, with a very low (21.5%) BUSCO genome completeness score, and most of them (∼78%) are either fragmented or missing. Thus, in this study, the sago palm genome completeness was further improved with the utilization of the Nanopore sequencing platform that produced longer reads. A hybrid genome assembly was conducted, and the outcome was a much complete sago palm genome with BUSCO completeness achieved at as high as 97.9%, with only ∼2% of them either fragmented or missing. The estimated genome size of the sago palm is 509,812,790 bp in this study. A sum of 33,242 protein-coding genes was revealed from the sago palm genome and around 96.39% of them had been functionally annotated. An investigation on the carbohydrate metabolism KEGG pathways also unearthed that starch synthesis was one of the major sago palm activities. The genome data obtained from this work is indispensable for future molecular evolutionary and genome-wide association studies on the economically important sago palm.
西米棕榈(Rottboll)是一种热带盐生淀粉生产作物,是经济上重要的棕榈树,主要分布在东南亚国家。最近,利用Illumina测序平台对这种棕榈进行了基因组调查,BUSCO基因组完整性得分非常低(21.5%),其中大部分(约78%)是片段化或缺失的。因此,在本研究中,利用产生更长读长的Nanopore测序平台进一步提高了西米棕榈基因组的完整性。进行了混合基因组组装,结果得到了一个更完整的西米棕榈基因组,BUSCO完整性高达97.9%,只有约2%是片段化或缺失的。本研究中,西米棕榈的估计基因组大小为509,812,790 bp。从西米棕榈基因组中鉴定出33,242个蛋白质编码基因,其中约96.39%已进行功能注释。对碳水化合物代谢KEGG途径的研究还发现,淀粉合成是西米棕榈的主要活动之一。这项工作获得的基因组数据对于未来对经济上重要的西米棕榈进行分子进化和全基因组关联研究不可或缺。