Department of Microbiology and Immunology, Renaissance School of Medicine, Stony Brook University, Stony Brook, NY 11794, USA.
Int J Mol Sci. 2024 Mar 31;25(7):3924. doi: 10.3390/ijms25073924.
In the human genome, two short open reading frames (ORFs) separated by a transcriptional silencer and a small intervening sequence stem from the gene . The two ORFs show different translational characteristics, and they also show divergent patterns of evolutionary development. The studies presented here describe the evolution of the components of . One ORF consists of an ultra-conserved 68 amino acid (aa) sequence, whose origins can be traced beyond the evolutionary age of divergence of the elephant shark, ~462 MYA. The silencer also has ancient origins, but it has a complex and divergent pattern of evolutionary formation, as it overlaps both at the 68 aa ORF and the intervening sequence. The other ORF consists of 107 aa. It develops during primate evolution but is found to originate de novo from an ancestral non-coding genomic region with root origins within the Afrothere clade of placental mammals, whose evolutionary age of divergence is ~99 MYA. The formation of the complete 107 aa ORF during primate evolution is outlined, whereby sequence development is found to occur through biased mutations, with disruptive random mutations that also occur but lead to a dead-end. The 107 aa ORF is of particular significance, as there is evidence to suggest it is a protein that may function in human brain development. Its evolutionary formation presents a view of a human-specific ORF and its linked silencer that were predetermined in non-primate ancestral species. The genomic position of the silencer offers interesting possibilities for the regulation of transcription of the 107 aa ORF. A hypothesis is presented with respect to possible spatiotemporal expression of the 107 aa ORF in embryonic tissues.
在人类基因组中,两个短的开放阅读框(ORFs)由转录沉默子和小的间隔序列隔开,来自基因。这两个 ORF 表现出不同的翻译特征,并且它们也表现出不同的进化发展模式。这里介绍的研究描述了成分的进化。一个 ORF 由一个超保守的 68 个氨基酸(aa)序列组成,其起源可以追溯到比象鲨的进化年龄更早的时期,约 4.62 亿年前。沉默子也有古老的起源,但它具有复杂和不同的进化形成模式,因为它在 68 个 aa ORF 和间隔序列上都重叠。另一个 ORF 由 107 个 aa 组成。它在灵长类动物进化过程中发展,但起源于一个古老的非编码基因组区域,其起源根位于胎盘哺乳动物 Afrothere 分支内,其进化年龄为约 9900 万年前。概述了在灵长类动物进化过程中形成完整的 107 个 aa ORF 的过程,发现序列的发展是通过偏向性突变发生的,也会发生破坏性的随机突变,但会导致死胡同。107 个 aa ORF 具有特殊意义,因为有证据表明它是一种可能在人类大脑发育中起作用的蛋白质。它的进化形成提供了一个观点,即在非灵长类祖先物种中,人类特有的 ORF 及其相关沉默子是预先确定的。沉默子的基因组位置为 107 个 aa ORF 的转录调控提供了有趣的可能性。提出了一个假说,涉及 107 个 aa ORF 在胚胎组织中的时空表达。