Romero Selim, Gupta Shreyan, Gatlin Victoria, Chapkin Robert S, Cai James J
Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77843, USA.
Department of Nutrition, Texas A&M University, College Station, TX 77843, USA.
ArXiv. 2025 Aug 15:arXiv:2408.08867v3.
Feature selection is a machine learning technique for identifying relevant variables in classification and regression models. In single-cell RNA sequencing (scRNA-seq) data analysis, feature selection is used to identify relevant genes that are crucial for understanding cellular processes. Traditional feature selection methods often struggle with the complexity of scRNA-seq data and suffer from interpretation difficulties. Quantum annealing presents a promising alternative approach. In this study, we implement quantum annealing-empowered quadratic unconstrained binary optimization (QUBO) for feature selection in scRNA-seq data. Using data from a human cell differentiation system and an anticancer drug resistance study, we demonstrate that QUBO feature selection effectively identifies genes whose expression patterns reflect critical cell state transitions associated with differentiation and drug resistance development. Our findings indicate that quantum annealing-powered QUBO reveals complex gene expression patterns potentially missed by traditional methods, thereby enhancing scRNA-seq data analysis and interpretation.
特征选择是一种用于在分类和回归模型中识别相关变量的机器学习技术。在单细胞RNA测序(scRNA-seq)数据分析中,特征选择用于识别对理解细胞过程至关重要的相关基因。传统的特征选择方法常常难以应对scRNA-seq数据的复杂性,并且存在解释困难的问题。量子退火提供了一种有前景的替代方法。在本研究中,我们实现了用于scRNA-seq数据特征选择的量子退火赋能二次无约束二进制优化(QUBO)。使用来自人类细胞分化系统和抗癌药物耐药性研究的数据,我们证明QUBO特征选择能够有效地识别出其表达模式反映与分化和耐药性发展相关的关键细胞状态转变的基因。我们的研究结果表明,由量子退火驱动的QUBO揭示了传统方法可能遗漏的复杂基因表达模式,从而增强了scRNA-seq数据分析和解释。