Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
Biodiversity Research Center, Academia Sinica, Taipei, 115, Taiwan; Institute for Comparative Genomics, American Museum of Natural History, New York City, NY, 10024, USA.
Virology. 2021 Jun;558:145-151. doi: 10.1016/j.virol.2021.02.013. Epub 2021 Mar 17.
At least six small alternative-frame open reading frames (ORFs) overlapping well-characterized SARS-CoV-2 genes have been hypothesized to encode accessory proteins. Researchers have used different names for the same ORF, or the same name for different ORFs, resulting in erroneous inferences about homology and function. We propose standard names for these ORFs and their shorter isoforms, developed in consultation with the Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. We recommend calling the 39-codon Spike-overlapping ORF ORF2b; the 41-, 57-, and 22-codon ORF3a-overlapping ORFs ORF3c, ORF3d, and ORF3b, respectively; the 33-codon ORF3d isoform ORF3d-2; and the 97- and 73-codon Nucleocapsid-overlapping ORFs ORF9b and ORF9c, respectively. Finally, we document conflicting usage of the name ORF3b in 32 studies, and the erroneous inferences that followed, stressing the importance of reserving identical names for homologs. We recommend that authors referring to these ORFs provide lengths and coordinates to minimize ambiguity caused by prior usage of alternative names.
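The recommended names can be read as a lookup keyed on the overlapped gene and the ORF length in codons. The following is a minimal sketch of that mapping in Python, not part of the published proposal: the `OrfName` record, the `PROPOSED_NAMES` list, and the `lookup` helper are hypothetical names introduced for illustration, and listing ORF3d-2 under ORF3a is an inference from its being an isoform of the ORF3a-overlapping ORF3d. Genome coordinates are deliberately left out because the abstract does not give them.

```python
from typing import NamedTuple, Optional


class OrfName(NamedTuple):
    """One proposed name, keyed by overlapped gene and length in codons."""
    name: str
    overlaps: str                      # well-characterized gene the ORF overlaps
    codons: int                        # length as stated in the abstract
    isoform_of: Optional[str] = None   # set only for shorter isoforms


# Values taken from the abstract; the structure itself is illustrative.
PROPOSED_NAMES = [
    OrfName("ORF2b", "Spike", 39),
    OrfName("ORF3c", "ORF3a", 41),
    OrfName("ORF3d", "ORF3a", 57),
    OrfName("ORF3b", "ORF3a", 22),
    # Assumption: as an isoform of ORF3d, ORF3d-2 also overlaps ORF3a.
    OrfName("ORF3d-2", "ORF3a", 33, isoform_of="ORF3d"),
    OrfName("ORF9b", "Nucleocapsid", 97),
    OrfName("ORF9c", "Nucleocapsid", 73),
]


def lookup(overlaps: str, codons: int) -> str:
    """Return the proposed name for an ORF given its overlapped gene and length."""
    matches = [o.name for o in PROPOSED_NAMES
               if o.overlaps == overlaps and o.codons == codons]
    if len(matches) != 1:
        raise KeyError(f"no unique proposed name for {codons}-codon ORF overlapping {overlaps}")
    return matches[0]
```

Keying on both gene and length, rather than on name alone, mirrors the recommendation to report lengths alongside names, since a bare name such as ORF3b has been applied to different ORFs.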