Davalos Oscar A, Heydari A Ali, Fertig Elana J, Sindi Suzanne S, Hoyer Katrina K
Quantitative and Systems Biology Graduate Program, University of California, Merced, CA, USA.
Department of Applied Mathematics, University of California, Merced, CA, USA.
bioRxiv. 2023 Jun 1:2023.05.29.542760. doi: 10.1101/2023.05.29.542760.
A limitation of current deep learning (DL) approaches for single-cell RNA sequencing (scRNAseq) analysis is the lack of interpretability. Moreover, existing pipelines are designed and trained for specific tasks used disjointly for different stages of analysis. We present scANNA, a novel interpretable DL model for scRNAseq studies that leverages neural attention to learn gene associations. After training, the learned gene importance (interpretability) is used to perform downstream analyses (e.g., global marker selection and cell-type classification) without retraining. ScANNA's performance is comparable to or better than state-of-the-art methods designed and trained for specific standard scRNAseq analyses even though scANNA was not trained for these tasks explicitly. ScANNA enables researchers to discover meaningful results without extensive prior knowledge or training separate task-specific models, saving time and enhancing scRNAseq analyses.
当前用于单细胞RNA测序(scRNAseq)分析的深度学习(DL)方法存在一个局限性,即缺乏可解释性。此外,现有的流程是为特定任务设计和训练的,在不同的分析阶段 disjointly 使用。我们提出了scANNA,这是一种用于scRNAseq研究的新型可解释DL模型,它利用神经注意力来学习基因关联。训练后,学习到的基因重要性(可解释性)用于执行下游分析(例如,全局标记选择和细胞类型分类),而无需重新训练。尽管scANNA没有针对这些任务进行明确训练,但其性能与为特定标准scRNAseq分析设计和训练的现有最先进方法相当或更好。scANNA使研究人员无需广泛的先验知识或训练单独的特定任务模型就能发现有意义的结果,节省了时间并增强了scRNAseq分析。