Martin Reese, Tate Ann T
Department of Biological Sciences, Vanderbilt University, Nashville TN, 37235.
Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, USA.
bioRxiv. 2024 Nov 21:2024.11.19.624372. doi: 10.1101/2024.11.19.624372.
Fundamental traits of genes, including function, length and GC content, all vary with gene age. Pleiotropy, where a single gene affects multiple traits, arises through selection for novel traits and is expected to be removed from the genome through subfunctionalization following duplication events. It is unclear, however, how these opposing forces shape the prevalence of pleiotropy through time. We hypothesized that the prevalence of pleiotropy would be lowest in young genes, peak in middle aged genes, and then either decrease to a middling level in ancient genes or stay near the middle-aged peak, depending on the balance between exaptation and subfunctionalization. To address this question, we have calculated gene age and pleiotropic status for several model multicellular eukaryotes, including , , , , , and . Gene age was determined by finding the most distantly related species that shared an ortholog using the Open Tree of Life and the Orthologous Matrix Database (OMAdb). Pleiotropic status was determined using both protein-protein interactions (STRINGdb) and associated biological processes (Gene Ontology). We found that middle-aged and ancient genes tend to be more pleiotropic than young genes, and that this relationship holds across all species evaluated and across both modalities of measuring pleiotropy. We also found absolute differences in the degree of pleiotropy based on gene functional class, but only when looking at biological process count. From these results we propose that there is a fundamental relationship between pleiotropy and gene age and further study of this relationship may shed light on the mechanism behind the functional changes genes undergo as they age.
基因的基本特征,包括功能、长度和GC含量,均随基因年龄而变化。多效性是指单个基因影响多个性状,它通过对新性状的选择而产生,并且在复制事件后有望通过亚功能化从基因组中消除。然而,目前尚不清楚这些相反的力量如何随着时间塑造多效性的普遍程度。我们假设多效性的普遍程度在年轻基因中最低,在中年基因中达到峰值,然后根据适应性进化和亚功能化之间的平衡,在古老基因中要么降至中等水平,要么保持在中年峰值附近。为了解决这个问题,我们计算了几种模式多细胞真核生物的基因年龄和多效性状态,包括 、 、 、 、 和 。基因年龄通过使用生命之树开放数据库(Open Tree of Life)和直系同源矩阵数据库(OMAdb)找到共享直系同源基因的关系最远的物种来确定。多效性状态使用蛋白质-蛋白质相互作用(STRINGdb)和相关生物学过程(基因本体论)来确定。我们发现中年基因和古老基因往往比年轻基因具有更多的多效性,并且这种关系在所有评估的物种以及两种多效性测量方式中都成立。我们还发现基于基因功能类别的多效性程度存在绝对差异,但仅在查看生物学过程计数时如此。从这些结果中我们提出,多效性与基因年龄之间存在根本关系,对这种关系的进一步研究可能会揭示基因随着年龄增长所经历的功能变化背后的机制。