Illumina short-read sequencing data, assembly and annotations of the genome.

Authors

DSouza Stafny, Ponnanna Koushik, Chokkanna Amruthavalli, Ramachandra Nallur

Affiliation

Department of Studies in Genetics and Genomics, University of Mysore, Mysuru, India.

Publication information

Data Brief. 2020 Dec 19;34:106674. doi: 10.1016/j.dib.2020.106674. eCollection 2021 Feb.

Abstract

The () is a member of subgroup of species group of widely distributed across South-East Asia and central to Southern Africa. It displays morphological similarities with other members of the subgroup, from which it diverged recently. The genomic DNA of the Coorg strain was paired-end sequenced using Illumina HiSeq 2500 technology, yielding a draft genome assembly of 145.64 Mb. The assembly recovered 93.6% of the conserved dipteran BUSCO orthologs. Approximately 85% of the predicted proteins exhibit sequence similarity to the proteins of which is the closest annotated species. This draft genome sequence is a valuable resource for geneticists and evolutionary biologists seeking to understand the molecular organisation of the genome and its evolution during the early stages of speciation.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d51e/7773860/ddadad14cf5f/gr1.jpg
