Centro Andaluz de Biologia del Desarrollo, CSIC-UPO, Sevilla 41013, Spain.
Brighton and Sussex Medical School, University of Sussex, Brighton BN1 9PX, UK.
Nat Rev Mol Cell Biol. 2017 Sep;18(9):575-589. doi: 10.1038/nrm.2017.58. Epub 2017 Jul 12.
Small open reading frames (smORFs) of 100 codons or fewer are usually, if arbitrarily, excluded from proteome annotations. Despite this, the genomes of many metazoans, including humans, contain millions of smORFs, some of which fulfil key physiological functions. Recently, the transcriptome of Drosophila melanogaster was shown to contain thousands of smORFs of different classes that actively undergo translation, which produces peptides of mostly unknown function. Here, we present a comprehensive analysis of smORFs in flies, mice and humans. We propose the existence of several functional classes of smORFs, ranging from inert DNA sequences to transcribed and translated cis-regulators of translation and peptides with a propensity to function as regulators of membrane-associated proteins, or as components of ancient protein complexes in the cytoplasm. We suggest that the different smORF classes could represent steps in gene, peptide and protein evolution. Our analysis introduces a distinction between different peptide-coding classes of smORFs in animal genomes, and highlights the role of model organisms for the study of small peptide biology in the context of development, physiology and human disease.