CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal.
Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK.
Hum Genet. 2024 Dec;143(12):1465-1479. doi: 10.1007/s00439-024-02715-9. Epub 2024 Nov 2.
Over the past decade, variations of the coding portion of the human genome have become increasingly evident. In this study, we focus on polymorphic pseudogenes, a unique and relatively unexplored type of pseudogene whose inactivating mutations have not yet been fixed in the human genome at the global population level. Thus, polymorphic pseudogenes are characterized by the presence in the population of both coding alleles and non-coding alleles originating from Loss-of-Function (LoF) mutations. These alleles can be found both in heterozygosity and in homozygosity in different human populations and thus represent pseudogenes that have not yet been fixed in the population.
A methodical cross-population analysis of 232 polymorphic pseudogenes, including 35 new examples, reveals that human olfactory signalling, drug metabolism and immunity are among the systems most impacted by the variable presence of LoF variants at high frequencies. Within this dataset, a total of 179 genes presented polymorphic LoF variants in all analysed populations. Transcriptome and proteome analysis confirmed that although these genes may harbour LoF alleles, when the coding allele is present, the gene remains active and can play a functional role in various metabolic pathways, including drug/xenobiotic metabolism and immunity. The observation that many polymorphic pseudogenes are members of multigene families argues that genetic redundancy may play a key role in compensating for the inactivation of one paralogue.
The distribution, expression and integration of cellular/biological networks in relation to human polymorphic pseudogenes, provide novel insights into the architecture of the human genome and the dynamics of gene gain and loss with likely functional impact.
在过去的十年中,人类基因组编码部分的变异变得越来越明显。在这项研究中,我们专注于多态性假基因,这是一种独特且相对未被探索的假基因类型,其失活突变尚未在全球人群水平的人类基因组中固定。因此,多态性假基因的特征是在人群中同时存在编码等位基因和来自功能丧失(LoF)突变的非编码等位基因。这些等位基因可以在不同人群的杂合子和纯合子中找到,因此代表尚未在人群中固定的假基因。
对 232 个多态性假基因进行了系统的跨人群分析,包括 35 个新的例子,结果表明,人类嗅觉信号、药物代谢和免疫是受 LoF 变体高频可变存在影响最大的系统之一。在这个数据集内,共有 179 个基因在所有分析的人群中均存在多态性 LoF 变体。转录组和蛋白质组分析证实,尽管这些基因可能含有 LoF 等位基因,但当编码等位基因存在时,该基因仍然活跃,并可在各种代谢途径中发挥功能作用,包括药物/外源性代谢物和免疫。观察到许多多态性假基因是多基因家族的成员,这表明遗传冗余可能在补偿一个同源基因失活方面发挥关键作用。
人类多态性假基因与细胞/生物网络的分布、表达和整合提供了有关人类基因组结构和基因获得和丢失动态的新见解,可能具有功能影响。