Suppr超能文献

严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组的数据流数据集。

Data stream dataset of SARS-CoV-2 genome.

作者信息

Barbosa Raquel de M, Fernandes Marcelo A C

机构信息

Laboratory of Drug Development, Department of Pharmacy, Federal University of Rio Grande do Norte, Natal, RN59078-970, Brazil.

Laboratory of Machine Learning and Intelligent Instrumentation, IMD/nPITI, Federal University of Rio Grande do Norte, Natal 59078-970, Brazil.

出版信息

Data Brief. 2020 Jun 10;31:105829. doi: 10.1016/j.dib.2020.105829. eCollection 2020 Aug.

Abstract

As of May 25, 2020, the novel coronavirus disease (called COVID-19) spread to more than 185 countries/regions with more than 348,000 deaths and more than 5,550,000 confirmed cases. In the bioinformatics area, one of the crucial points is the analysis of the virus nucleotide sequences using approaches such as data stream techniques and algorithms. However, to make feasible this approach, it is necessary to transform the nucleotide sequences string to numerical stream representation. Thus, the dataset provides four kinds of data stream representation (DSR) of SARS-CoV-2 virus nucleotide sequences. The dataset provides the DSR of 1557 instances of SARS-CoV-2 virus, 11540 other instances of other viruses from the Virus-Host DB dataset, and three instances of Riboviria viruses from NCBI (Betacoronavirus RaTG13, bat-SL-CoVZC45, and bat-SL-CoVZXC21).

摘要

截至2020年5月25日,新型冠状病毒病(称为COVID-19)已传播到185多个国家/地区,死亡人数超过34.8万,确诊病例超过555万。在生物信息学领域,关键点之一是使用诸如数据流技术和算法等方法分析病毒核苷酸序列。然而,为了使这种方法可行,有必要将核苷酸序列字符串转换为数字流表示形式。因此,该数据集提供了四种严重急性呼吸综合征冠状病毒2(SARS-CoV-2)病毒核苷酸序列的数据流表示(DSR)。该数据集提供了1557个SARS-CoV-2病毒实例、来自病毒-宿主数据库(Virus-Host DB)数据集的11540个其他病毒实例以及来自美国国立医学图书馆(NCBI)的三种核糖病毒实例(β冠状病毒RaTG13、蝙蝠严重急性呼吸综合征冠状病毒ZC45和蝙蝠严重急性呼吸综合征冠状病毒ZXC21)的DSR。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dd4/7306612/7f518b710601/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验