Qin E'de, Zhu Qingyu, Yu Man, Fan Baochang, Chang Guohui, Si Bingyin, Yang Bao'an, Peng Wenming, Jiang Tao, Liu Bohua, Deng Yongqiang, Liu Hong, Zhang Yu, Wang Cui'e, Li Yuquan, Gan Yonghua, Li Xiaoyu, Lü Fushuang, Tan Gang, Cao Wuchun, Yang Ruifu, Wang Jian, Li Wei, Xu Zuyuan, Li Yan, Wu Qingfa, Lin Wei, Chen Weijun, Tang Lin, Deng Yajun, Han Yujun, Li Changfeng, Lei Meng, Li Guoqing, Li Wenjie, Lü Hong, Shi Jianping, Tong Zongzhong, Zhang Feng, Li Songgang, Liu Bin, Liu Siqi, Dong Wei, Wang Jun, Wong Gane K-S, Yu Jun, Yang Huanming
1Institute of Microbiology and Epidemiology, Chinese Academy of Military Medical Sciences, 100071 Beijing, China.
2Beijing Genomics Institute, Chinese Academy of Sciences, 101300 Beijing.
Chin Sci Bull. 2003;48(10):941-948. doi: 10.1007/BF03184203.
The genome sequence of the Severe Acute Respiratory Syndrome (SARS)-associated virus provides essential information for the identification of pathogen(s), exploration of etiology and evolution, interpretation of transmission and pathogenesis, development of diagnostics, prevention by future vaccination, and treatment by developing new drugs. We report the complete genome sequence and comparative analysis of an isolate (BJ01) of the coronavirus that has been recognized as a pathogen for SARS. The genome is 29725 nt in size and has 11 ORFs (Open Reading Frames). It is composed of a stable region encoding an RNA-dependent RNA polymerase (composed of 2 ORFs) and a variable region representing 4 CDSs (coding sequences) for viral structural genes (the S, E, M, N proteins) and 5 PUPs (putative uncharacterized proteins). Its gene order is identical to that of other known coronaviruses. The sequence alignment with all known RNA viruses places this virus as a member in the family of Coronaviridae. Thirty putative substitutions have been identified by comparative analysis of the 5 SARS-associated virus genome sequences in GenBank. Fifteen of them lead to possible amino acid changes (non-synonymous mutations) in the proteins. Three amino acid changes, with predicted alteration of physical and chemical features, have been detected in the S protein that is postulated to be involved in the immunoreactions between the virus and its host. Two amino acid changes have been detected in the M protein, which could be related to viral envelope formation. Phylogenetic analysis suggests the possibility of non-human origin of the SARS-associated viruses but provides no evidence that they are man-made. Further efforts should focus on identifying the etiology of the SARS-associated virus and ruling out conclusively the existence of other possible SARS-related pathogen(s).
严重急性呼吸综合征(SARS)相关病毒的基因组序列为病原体鉴定、病因及进化探索、传播与发病机制阐释、诊断方法开发、未来疫苗预防以及新药研发治疗提供了重要信息。我们报告了一株已被确认为SARS病原体的冠状病毒分离株(BJ01)的完整基因组序列及比较分析结果。该基因组大小为29725个核苷酸,有11个开放阅读框(ORF)。它由一个编码依赖RNA的RNA聚合酶的稳定区域(由2个ORF组成)和一个可变区域组成,可变区域包含4个病毒结构基因(S、E、M、N蛋白)的编码序列(CDS)以及5个假定的未表征蛋白(PUP)。其基因顺序与其他已知冠状病毒相同。与所有已知RNA病毒的序列比对将该病毒归为冠状病毒科成员。通过对GenBank中5个SARS相关病毒基因组序列的比较分析,已鉴定出30个假定的替换位点。其中15个导致蛋白质中可能的氨基酸变化(非同义突变)。在假定参与病毒与其宿主免疫反应的S蛋白中检测到3个氨基酸变化,这些变化预测会改变物理和化学特性。在M蛋白中检测到2个氨基酸变化,这可能与病毒包膜形成有关。系统发育分析表明SARS相关病毒可能起源于非人类,但没有证据表明它们是人造的。进一步的努力应集中在确定SARS相关病毒的病因并最终排除其他可能的SARS相关病原体的存在。