Kashkak Elena S, Kataev Vladimir Ya, Khlopko Yuri A, Budagaeva Valentina G, Danilova Erzhena V, Oorzhak Urana S, Dagurova Olga P, Plotnikov Andrey O
The Department of Chemistry, Tuvan State University, 36 Lenin St., Kyzyl 667000, Russian Federation.
Institute for Cellular and Intracellular Symbiosis, Ural Branch of Russian Academy of Sciences, 11 Pionerskaya St., Orenburg 460000, Russian Federation.
Data Brief. 2020 Sep 4;32:106278. doi: 10.1016/j.dib.2020.106278. eCollection 2020 Oct.
sp. SAM-B was isolated from Uzharlyg Mineral Cold Spring, Samagaltay Settlement, Republic of Tyva (Southern Siberia), Russian Federation. A whole genome sequencing of sp. SAM-B was performed using an Illumina MiSeq platform. The resulting draft genome contains 4,253,956 bp with 66.48% GC-content and 71 contigs; the longest contig contains 968,648 bp, and the N has a length of 401,736 bp. The genome includes 3816 protein-coding genes, among which 23 are responsible for protein degradation, 65 are associated with stress response, and 31 are associated with virulence, disease, and defense, including beta-lactamase and resistance to fluoroquinolones. The genome data on the SAM-B strain provides fundamental knowledge that would allow a better understanding of the microorganisms inhabiting cold water environments. Moreover, the results of the genome annotation indicated that diverse metabolic pathways are encoded in the genome of the SAM-B strain and that it has biotechnological potential. The draft genome sequence of sp. SAM-B has been deposited in DDBJ/ENA/GenBank under the accession number JABBXB000000000; the accession number of the genome sequence referred to in this paper is JABBXB010000000.
菌株SAM - B从俄罗斯联邦图瓦共和国(南西伯利亚)萨马加尔泰定居点的乌扎尔利格矿泉冷泉中分离得到。使用Illumina MiSeq平台对菌株SAM - B进行了全基因组测序。所得基因组草图包含4,253,956 bp,GC含量为66.48%,有71个重叠群;最长的重叠群包含968,648 bp,N50长度为401,736 bp。该基因组包括3816个蛋白质编码基因,其中23个负责蛋白质降解,65个与应激反应相关,31个与毒力、疾病和防御相关,包括β - 内酰胺酶和对氟喹诺酮类的抗性。关于SAM - B菌株的基因组数据提供了基础知识,有助于更好地了解栖息在冷水环境中的微生物。此外,基因组注释结果表明,SAM - B菌株的基因组编码了多种代谢途径,具有生物技术潜力。菌株SAM - B的基因组草图序列已保存在DDBJ/ENA/GenBank中,登录号为JABBXB000000000;本文引用的基因组序列登录号为JABBXB010000000。