Zhou Haosheng, Lin Wei, Labra Sergio R, Lipton Stuart A, Elman Jeremy A, Schork Nicholas J, Rangan Aaditya V
IEEE/ACM Trans Comput Biol Bioinform. 2024 Oct 29;PP. doi: 10.1109/TCBB.2024.3487434.
Many traditional methods for analyzing gene-gene relationships focus on positive and negative correlations, both of which are a kind of 'symmetric' relationship. Biclustering is one such technique that typically searches for subsets of genes exhibiting correlated expression among a subset of samples. However, genes can also exhibit 'asymmetric' relationships, such as 'if-then' relationships used in boolean circuits. In this paper we develop a very general method that can be used to detect biclusters within gene-expression data that involve subsets of genes which are enriched for these 'boolean-asymmetric' relationships (BARs). These BAR-biclusters can correspond to heterogeneity that is driven by asymmetric gene-gene interactions, e.g., reflecting regulatory effects of one gene on another, rather than more standard symmetric interactions. Unlike typical approaches that search for BARs across the entire population, BAR-biclusters can detect asymmetric interactions that only occur among a subset of samples. We apply our method to a single-cell RNA-sequencing data-set, demonstrating that the statistically-significant BARbiclusters indeed contain additional information not present within the more traditional 'boolean-symmetric'-biclusters. For example, the BAR-biclusters involve different subsets of cells, and highlight different gene-pathways within the data-set. Moreover, by combining the boolean-asymmetric- and boolean-symmetricsignals, one can build linear classifiers which outperform those built using only traditional boolean-symmetric signals.
许多用于分析基因-基因关系的传统方法都集中在正相关和负相关上,这两种都是一种“对称”关系。双聚类就是这样一种技术,它通常在样本子集中搜索表现出相关表达的基因子集。然而,基因也可以表现出“不对称”关系,例如布尔电路中使用的“if-then”关系。在本文中,我们开发了一种非常通用的方法,可用于在基因表达数据中检测双聚类,这些双聚类涉及富含这些“布尔不对称”关系(BAR)的基因子集。这些BAR双聚类可以对应于由不对称基因-基因相互作用驱动的异质性,例如,反映一个基因对另一个基因的调控作用,而不是更标准的对称相互作用。与在整个群体中搜索BAR的典型方法不同,BAR双聚类可以检测仅在样本子集中发生的不对称相互作用。我们将我们的方法应用于单细胞RNA测序数据集,证明具有统计学意义的BAR双聚类确实包含了更多传统“布尔对称”双聚类中不存在的信息。例如,BAR双聚类涉及不同的细胞子集,并突出了数据集中不同的基因途径。此外,通过结合布尔不对称和布尔对称信号,可以构建线性分类器,其性能优于仅使用传统布尔对称信号构建的分类器。