ZCURVE_CoV: a new system to recognize protein coding genes in coronavirus genomes, and its applications in analyzing SARS-CoV genomes.

Authors

Chen Ling-Ling, Ou Hong-Yu, Zhang Ren, Zhang Chun-Ting

Affiliation

Department of Physics, Tianjin University, PR China.

Publication

Biochem Biophys Res Commun. 2003 Jul 25;307(2):382-8. doi: 10.1016/s0006-291x(03)01192-6.

Abstract

This paper proposes a new system for recognizing protein-coding genes in coronavirus genomes, one especially suited to SARS-CoV genomes. Compared with some existing systems, the new program package offers simplicity, high accuracy, reliability, and speed. ZCURVE_CoV was run on each of the 11 newly sequenced SARS-CoV genomes. As a result, six previously unannotated genomes have been annotated, and some problems with the earlier annotations of the remaining five genomes are pointed out and discussed. In addition to the polyprotein ORFs 1a and 1b and the four genes coding for the major structural proteins, spike (S), small envelope (E), membrane (M), and nucleocapsid (N), ZCURVE_CoV also predicts 5-6 putative proteins of unknown function, between 39 and 274 amino acids in length. Some single-nucleotide mutations within these putative coding sequences were detected, and their biological implications are discussed. A web service is provided through which a user can obtain the annotation immediately by pasting a SARS-CoV genome sequence into the input window on the web site (http://tubic.tju.edu.cn/sars/). The ZCURVE_CoV software can also be downloaded freely from the same address and run on computers under Windows or Linux.
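To make the idea of candidate ORFs in a given length range concrete, here is a minimal sketch in Python. It is not the ZCURVE_CoV algorithm, which the abstract does not describe. It only enumerates forward-strand ATG-to-stop open reading frames and keeps those whose length falls in the 39 to 274 amino acid range quoted above. The function name `find_orfs`, its parameters, and the use of the standard stop codons are assumptions made for illustration.

```python
# Illustrative only: a naive ORF scan, NOT the ZCURVE_CoV method.
# Assumes the standard genetic code stop codons and the forward strand only.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_aa=39, max_aa=274):
    """Return sorted (start, end, length_aa) tuples for ATG..stop ORFs.

    start is 0-based, end is exclusive and includes the stop codon,
    length_aa counts codons from the ATG up to (not including) the stop.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # position of the first ATG since the last stop
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                length_aa = (i - start) // 3
                if min_aa <= length_aa <= max_aa:
                    orfs.append((start, i + 3, length_aa))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)


if __name__ == "__main__":
    # Hypothetical toy sequence: ATG + 40 alanine codons + stop.
    toy = "ATG" + "GCT" * 40 + "TAA"
    print(find_orfs(toy))
```

A real gene finder must also decide which of these candidates are actually coding, which is the problem a system such as ZCURVE_CoV addresses; this sketch applies a length filter only.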
