Bermudez Rosa M, Wu Peter I-Fan, Callerame Deanna, Hammer Staci, Hu James C, Polymenis Michael
Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843 and.
Department of Genetics and Molecular Biology, Centro de Investigación y Estudios Avanzados del IPN, Mexico City, 07360, Mexico.
G3 (Bethesda). 2020 Jul 7;10(7):2345-2351. doi: 10.1534/g3.120.401350.
A long-standing effort in biology is to precisely define and group phenotypes that characterize a biological process, and the genes that underpin them. In and other organisms, functional screens have generated rich lists of phenotypes associated with individual genes. However, it is often challenging to identify sets of phenotypes and genes that are most closely associated with a given biological process. Here, we focused on the 166 phenotypes arising from loss-of-function and the 86 phenotypes from gain-of-function mutations in 571 genes currently assigned to cell cycle-related ontologies in To reduce this complexity, we applied unbiased, computational approaches of correspondence analysis to identify a minimum set of phenotypic variables that accounts for as much of the variability in the data as possible. Loss-of-function phenotypes can be reduced to 20 dimensions, while gain-of-function ones to 14 dimensions. We also pinpoint the contributions of phenotypes and genes in each set. The approach we describe not only simplifies the categorization of phenotypes associated with cell cycle progression but might also potentially serve as a discovery tool for gene function.
生物学领域长期以来的一项工作是精确界定和归类表征生物过程的表型以及支撑这些表型的基因。在酵母及其他生物体中,功能筛选已生成了与单个基因相关的丰富表型列表。然而,要确定与特定生物过程最密切相关的表型和基因组合往往具有挑战性。在此,我们聚焦于目前分配到酵母细胞周期相关本体论中的571个基因的功能丧失所产生的166种表型以及功能获得性突变产生的86种表型。为降低这种复杂性,我们应用了无偏的对应分析计算方法,以确定一组最少的表型变量,使其尽可能多地解释数据中的变异性。功能丧失表型可简化为20个维度,而功能获得性表型可简化为14个维度。我们还明确了每组中表型和基因的贡献。我们所描述的方法不仅简化了与细胞周期进程相关的表型分类,还可能潜在地用作基因功能的发现工具。