School of Mathematics and Statistics, Xidian University, Xi'an, China.
BMC Genomics. 2023 May 25;24(1):280. doi: 10.1186/s12864-023-09374-6.
Cell clustering is a prerequisite for identifying differentially expressed genes (DEGs) in single-cell RNA sequencing (scRNA-seq) data. Obtaining a perfect clustering result is of central importance for subsequent analyses, but not easy. Additionally, the increase in cell throughput due to the advancement of scRNA-seq protocols exacerbates many computational issues, especially regarding method runtime. To address these difficulties, a new, accurate, and fast method for detecting DEGs in scRNA-seq data is needed.
Here, we propose single-cell minimum enclosing ball (scMEB), a novel and fast method for detecting single-cell DEGs without prior cell clustering results. The proposed method utilizes a small part of known non-DEGs (stably expressed genes) to build a minimum enclosing ball and defines the DEGs based on the distance of a mapped gene to the center of the hypersphere in a feature space.
We compared scMEB to two different approaches that could be used to identify DEGs without cell clustering. The investigation of 11 real datasets revealed that scMEB outperformed rival methods in terms of cell clustering, predicting genes with biological functions, and identifying marker genes. Moreover, scMEB was much faster than the other methods, making it particularly effective for finding DEGs in high-throughput scRNA-seq data. We have developed a package scMEB for the proposed method, which could be available at https://github.com/FocusPaka/scMEB .
细胞聚类是识别单细胞 RNA 测序 (scRNA-seq) 数据中差异表达基因 (DEGs) 的前提。获得完美的聚类结果对于后续分析至关重要,但并不容易。此外,由于 scRNA-seq 协议的进步,细胞通量的增加加剧了许多计算问题,尤其是关于方法运行时间的问题。为了解决这些困难,需要一种新的、准确的、快速的 scRNA-seq 数据中检测 DEGs 的方法。
在这里,我们提出了单细胞最小包围球 (scMEB),这是一种无需事先进行细胞聚类即可检测单细胞 DEGs 的新方法。该方法利用已知的非 DEGs(稳定表达基因)的一小部分构建最小包围球,并根据映射基因到特征空间中超球体中心的距离来定义 DEGs。
我们将 scMEB 与两种可用于在没有细胞聚类的情况下识别 DEGs 的不同方法进行了比较。对 11 个真实数据集的研究表明,scMEB 在细胞聚类、预测具有生物学功能的基因和识别标记基因方面优于竞争方法。此外,scMEB 比其他方法快得多,使其特别适用于在高通量 scRNA-seq 数据中寻找 DEGs。我们已经为所提出的方法开发了一个名为 scMEB 的软件包,可在 https://github.com/FocusPaka/scMEB 上获得。