

Chromosome-scale genome assembly of areca palm (Areca catechu).

Author information

Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China.

BGI Genomics, BGI-Shenzhen, Shenzhen, China.

Publication information

Mol Ecol Resour. 2021 Oct;21(7):2504-2519. doi: 10.1111/1755-0998.13446. Epub 2021 Jun 29.

Abstract

Areca palm (Areca catechu L.; family Arecaceae) is an important tropical medicinal crop and is also used for masticatory and religious purposes in Asia. Improvement of areca traits through traditional breeding has been very slow, and further advances in its cultivation and practical use require genomic information, which is still unavailable. Here, we present a chromosome-scale reference genome assembly for areca by combining Illumina and PacBio data with Hi-C mapping technologies, covering the predicted A. catechu genome length (2.59 Gb, variety "Reyan#1") to an estimated 240× read depth. The assembly was 2.51 Gb in length with a scaffold N50 of 1.7 Mb. The scaffolds were then further assembled into 16 pseudochromosomes, with an N50 of 172 Mb. Transposable elements comprised 80.37% of the areca genome, and 68.68% of them were long-terminal repeat retrotransposon elements. The areca palm genome was predicted to harbour 31,571 protein-coding genes and, overall, 92.92% of genes were functionally annotated, including enriched and expanded families of genes responsible for the biosynthesis of flavonoids, anthocyanins, monoterpenoids and their derivatives. Comparative analyses indicated that A. catechu probably diverged from its close relatives Elaeis guineensis and Cocos nucifera approximately 50.3 million years ago (Ma). Two whole-genome duplication events in areca palm were found to be shared by palms and monocots, respectively. This genome assembly and associated resources represent an important addition to the palm genomics community and will be a valuable resource that will facilitate areca palm breeding and improve our understanding of areca palm biology and evolution.
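
For readers unfamiliar with the N50 statistic quoted in the abstract, the following is a minimal sketch of how it is conventionally computed: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The function name `n50` and the toy scaffold lengths are hypothetical and chosen only for illustration; they are not the authors' pipeline or data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    account for at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence downwards, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point issues with total / 2.
        if 2 * running >= total:
            return length


# Hypothetical toy scaffold lengths (arbitrary units), not real areca data.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # prints 70: 80 + 70 = 150 is half of the total 300
```

A larger N50 means that more of the assembly sits in long sequences, which is why the value rises from the scaffold level to the pseudochromosome level after Hi-C scaffolding.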

