Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Québec, Canada; PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Canada.
Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Québec, Canada; PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Canada.
Exp Cell Res. 2020 Aug 1;393(1):112057. doi: 10.1016/j.yexcr.2020.112057. Epub 2020 May 6.
The discovery of functional yet non-annotated open reading frames (ORFs) throughout the genome of several species presents an unprecedented challenge in current genome annotation. These novel ORFs are shorter than annotated ones and many can be found on the same RNA, in opposition to current assumptions in annotation methodologies. Whilst the literature lacks consensus, these novel ORFs are commonly referred to as small ORFs (sORFs) or alternative ORFs (alt-ORFs). Unannotated ORFs represent an overlooked layer of complexity in the coding potential of genomes and are transforming our current vision of the nature of coding genes. In this review, we outline what constitutes a sORF or an alt-ORF and emphasize differences between both nomenclatures. We then describe complementary large-scale methods to accurately discover novel ORFs as well as yield functional insights on the novel proteins they encode. While serendipitous discoveries highlighted the functional importance of some novel ORFs, omics methods facilitate and improve their characterization to better understand physiological and pathological pathways. Functional annotation of sORFs, alt-ORFs and their corresponding microproteins will likely help fundamental and clinical research.
在多个物种的基因组中发现具有功能但未注释的开放阅读框(ORF),这对当前的基因组注释提出了前所未有的挑战。这些新的 ORF 比已注释的 ORF 短,许多可以在同一 RNA 上找到,与当前注释方法的假设相反。虽然文献中缺乏共识,但这些新的 ORF 通常被称为小 ORF(sORF)或替代 ORF(alt-ORF)。未注释的 ORF 代表了基因组编码潜力中被忽视的复杂性层次,并正在改变我们对编码基因本质的现有认识。在这篇综述中,我们概述了什么构成 sORF 或 alt-ORF,并强调了两者命名法之间的差异。然后,我们描述了互补的大规模方法,以准确发现新的 ORF,并对它们编码的新蛋白质产生功能见解。虽然偶然的发现强调了一些新的 ORF 的功能重要性,但组学方法促进并改善了它们的表征,以更好地理解生理和病理途径。sORF、alt-ORF 及其相应的微蛋白的功能注释可能有助于基础研究和临床研究。