

The First Draft Genome Assembly of Snow Sheep (Ovis nivicola).

Author affiliations

Population Genomics Group, Department of Veterinary Sciences, Ludwig Maximilian University of Munich, Munich, Germany.

Laboratory for Functional Genome Analysis, Gene Center, Ludwig Maximilian University of Munich, Munich, Germany.

Publication information

Genome Biol Evol. 2020 Aug 1;12(8):1330-1336. doi: 10.1093/gbe/evaa124.

Abstract

The snow sheep (Ovis nivicola), which is endemic to the mountain ranges of northeastern Siberia, is well adapted to the harsh, cold climate of its habitat. In this study, whole-genome sequencing, assembly, and gene annotation of a snow sheep were carried out using Nanopore long reads. RNA-seq reads from several tissues were also generated to support gene prediction in the snow sheep genome. The assembled genome was ∼2.62 Gb in length and was represented by 7,157 scaffolds with an N50 of about 2 Mb. Repetitive sequences comprised 41% of the total genome. BUSCO analysis revealed that the snow sheep assembly contained full-length or partial fragments of 97% of mammalian universal single-copy orthologs (n = 4,104), indicating that the assembly is largely complete. In addition, a total of 20,045 protein-coding sequences were identified using a comprehensive gene prediction pipeline, of which 19,240 (∼96%) were annotated using protein databases. Moreover, homology-based searches and de novo identification detected 1,484 tRNAs, 243 rRNAs, 1,931 snRNAs, and 782 miRNAs in the snow sheep genome. To conclude, we generated the first de novo genome assembly of the snow sheep using long reads; these data are expected to contribute significantly to our understanding of evolution and adaptation within the Ovis genus.
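
For readers unfamiliar with the N50 statistic quoted in the abstract, the following is a minimal sketch of how it is conventionally computed from a list of scaffold lengths. The function name `n50` and the toy lengths are illustrative assumptions; this is not the authors' pipeline and the numbers are not snow sheep data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that scaffolds of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)  # longest scaffold first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy scaffold lengths in bp (made up for illustration only).
toy_scaffolds = [8, 5, 4, 3, 2, 2, 1]
print(n50(toy_scaffolds))  # prints 5: 8 + 5 = 13 covers half of the total 25
```

Under this definition, an N50 of about 2 Mb means that scaffolds of roughly 2 Mb or longer account for at least half of the ∼2.62 Gb assembly.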


