Department of Pediatrics, University of California, San Diego, School of Medicine, La Jolla, California, United States of America.
Department of Biology and Cell Biology, University of Virginia, Charlottesville, Virginia, United States of America.
PLoS Comput Biol. 2022 Jul 13;18(7):e1010295. doi: 10.1371/journal.pcbi.1010295. eCollection 2022 Jul.
The concept of "housekeeping gene" has been used for four decades but remains loosely defined. Housekeeping genes are commonly described as "essential for cellular existence regardless of their specific function in the tissue or organism", and "stably expressed irrespective of tissue type, developmental stage, cell cycle state, or external signal". However, experimental support for the tenet that gene essentiality is linked to stable expression across cell types, conditions, and organisms has been limited. Here we use genome-scale functional genomic screens together with bulk and single-cell sequencing technologies to test this link and optimize a quantitative and experimentally validated definition of housekeeping gene. Using the optimized definition, we identify, characterize, and provide as resources, housekeeping gene lists extracted from several human datasets, and 10 other animal species that include primates, chicken, and C. elegans. We find that stably expressed genes are not necessarily essential, and that the individual genes that are essential and stably expressed can considerably differ across organisms; yet the pathways enriched among these genes are conserved. Further, the level of conservation of housekeeping genes across the analyzed organisms captures their taxonomic groups, showing evolutionary relevance for our definition. Therefore, we present a quantitative and experimentally supported definition of housekeeping genes that can contribute to better understanding of their unique biological and evolutionary characteristics.
“管家基因”的概念已经使用了四十年,但仍然没有明确的定义。管家基因通常被描述为“无论其在组织或生物体中的特定功能如何,对细胞生存都是必不可少的”,并且“无论组织类型、发育阶段、细胞周期状态或外部信号如何,都稳定表达”。然而,实验支持基因的必需性与跨细胞类型、条件和生物体的稳定表达相关的这一原则一直受到限制。在这里,我们使用全基因组功能基因组筛选以及批量和单细胞测序技术来检验这一联系,并优化管家基因的定量和经过实验验证的定义。使用优化的定义,我们从几个人类数据集以及包括灵长类动物、鸡和秀丽隐杆线虫在内的 10 种其他动物物种中识别、描述和提供管家基因列表,作为资源。我们发现稳定表达的基因不一定是必需的,而且必需且稳定表达的单个基因在不同的生物体中可能有很大的不同;然而,这些基因中富集的途径是保守的。此外,在所分析的生物体中管家基因的保守程度可以捕获它们的分类群,表明我们的定义具有进化相关性。因此,我们提出了一种定量的和经过实验支持的管家基因定义,可以帮助更好地理解它们独特的生物学和进化特征。