Genome Assembly of the Canadian two-row Malting Barley cultivar AAC Synergy.

Affiliations

Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100 Morden, MB R6M 1Y5, Canada.

Brandon Research and Development Centre, Agriculture and Agri-Food Canada, 2701 Grand Valley Road, Brandon, MB R7A 5Y3, Canada.

Publication information

G3 (Bethesda). 2021 Apr 15;11(4). doi: 10.1093/g3journal/jkab031.

Abstract

Barley (Hordeum vulgare L.) is one of the most important global crops. The reference genome of the six-row barley cultivar Morex has been used by the barley research community worldwide. However, this reference genome can have limitations for genomic and genetic diversity analyses, gene discovery, and marker development in two-row germplasm, which is more common in Canadian barley. Here, we assembled, for the first time, the genome sequence of a Canadian two-row malting barley, cultivar AAC Synergy. We applied deep Illumina paired-end reads, long mate-pair reads, PacBio sequences, 10x Chromium linked-read libraries, and chromosome conformation capture sequencing (Hi-C) to generate a contiguous assembly. The genome assembled from super-scaffolds had a size of 4.85 Gb, an N50 of 2.32 Mb, and an estimated 93.9% of complete genes from a plant database (BUSCO, benchmarking universal single-copy orthologous genes). After removal of small scaffolds (<300 kb), the assembly was arranged into pseudomolecules of 4.14 Gb in size, comprising seven chromosomes plus unanchored scaffolds. The completeness and annotation of the assembly were assessed by comparing it with the updated version of the six-row Morex assembly and the recently released two-row Golden Promise genome assembly.
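For readers unfamiliar with the N50 statistic and the small-scaffold filter mentioned in the abstract, the following is a minimal Python sketch of how such figures are commonly computed from a list of scaffold lengths. It is not the authors' pipeline: the function names `n50` and `drop_small`, the toy lengths, and the reading of "300 kb" as 300,000 bp are assumptions made purely for illustration.

```python
from typing import Iterable, List


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a set of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    if total == 0:
        raise ValueError("no sequence lengths given")
    running = 0
    for length in ordered:
        running += length
        # Compare 2 * running with total to stay in integer arithmetic.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for a non-empty input")


def drop_small(lengths: Iterable[int], min_length: int = 300_000) -> List[int]:
    """Keep only scaffolds of at least min_length bp.

    The 300_000 default mirrors the "<300 kb" cutoff in the abstract,
    assuming 1 kb = 1,000 bp (an assumption of this sketch).
    """
    return [x for x in lengths if x >= min_length]


# Toy scaffold lengths in bp (hypothetical, not from the AAC Synergy assembly).
toy_scaffolds = [3_000_000, 2_400_000, 900_000, 250_000, 120_000, 2_000_000]
toy_n50 = n50(toy_scaffolds)
kept_scaffolds = drop_small(toy_scaffolds)
```

In this toy case the two scaffolds below 300,000 bp are dropped, while the N50 is unchanged because the largest scaffolds dominate the total length.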
