Department of Biochemistry, Université de Sherbrooke, Quebec, Canada.
Univ. Lille, INSERM U1192, Laboratoire Protéomique, Réponse Inflammatoire & Spectrométrie de Masse (PRISM), Lille, France.
Proteomics. 2018 May;18(10):e1700058. doi: 10.1002/pmic.201700058. Epub 2017 Oct 11.
Short ORF-encoded peptides and small proteins in eukaryotes have been hiding in the shadow of large proteins for a long time. Recently, improved identifications in MS-based proteomics and ribosome profiling resulted in the detection of large numbers of small proteins. The variety of functions of small proteins is also emerging. It seems to be the right time to reflect on why small proteins remained invisible. In addition to the obvious technical challenge of detecting small proteins, they were mostly forgotten from annotations and they escaped detection because they were not sought. In this review, we identify conventions that need to be revisited, including the assumption that mature mRNAs carry only one coding sequence. The large-scale discovery of small proteins and of their functions will require changing some paradigms and undertaking the annotation of ORFs that are still largely perceived as irrelevant coding information compared to already annotated coding sequences.
长期以来,真核生物中的短开放阅读框编码肽和小蛋白一直隐藏在大型蛋白的阴影之下。最近,基于 MS 的蛋白质组学和核糖体分析的改进鉴定导致大量小蛋白的检测。小蛋白的各种功能也在不断涌现。现在似乎是时候反思一下为什么小蛋白仍然不可见了。除了检测小蛋白的明显技术挑战外,它们在注释中大多被遗忘了,而且由于没有被寻找,它们也逃避了检测。在这篇综述中,我们确定了需要重新审视的惯例,包括成熟 mRNA 仅携带一个编码序列的假设。大规模发现小蛋白及其功能将需要改变一些范例,并对 ORFs 进行注释,这些 ORFs 与已经注释的编码序列相比,仍然被视为与编码信息不相关。