Suppr超能文献

使用变分自编码器的深度非负矩阵分解及其在单细胞RNA测序数据中的应用

Deep Nonnegative Matrix Factorization Using a Variational Autoencoder With Application to Single-Cell RNA Sequencing Data.

作者信息

Jee Dong Jun, Kong Yixin, Chun Hyonho

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):883-893. doi: 10.1109/TCBB.2022.3172723. Epub 2023 Apr 3.

Abstract

Single-cell RNA sequencing is used to analyze the gene expression data of individual cells, thereby adding to existing knowledge of biological phenomena. Accordingly, this technology is widely used in numerous biomedical studies. Recently, the variational autoencoder has emerged and has been adopted for the analysis of single-cell data owing to its high capacity to manage large-scale data. Many different variants of the variational autoencoder have been applied, and have yielded superior results. However, because it is nonlinear, the model does not provide parameters that can be used to explain the underlying biological patterns. In this paper, we propose an interpretable nonnegative matrix factorization method that decomposes parameters into those shared across cells and those that are cell-specific. Effective nonlinear dimension reduction was achieved via a variational autoencoder applied to the cell-specific parameters. In addition to achieving nonlinear dimension reduction, our model could estimate the cell-type-specific gene expression. To improve the estimation accuracy, we introduced log-regularization, which reflects the single-cell property. Overall, our approach displayed excellent performance in a simulation study and in real data analyses, while maintaining good biological interpretability.

摘要

单细胞RNA测序用于分析单个细胞的基因表达数据,从而增加对生物现象的现有认识。因此,这项技术在众多生物医学研究中被广泛应用。最近,变分自编码器出现了,并因其处理大规模数据的高能力而被用于单细胞数据分析。变分自编码器的许多不同变体已被应用,并取得了优异的结果。然而,由于它是非线性的,该模型没有提供可用于解释潜在生物学模式的参数。在本文中,我们提出了一种可解释的非负矩阵分解方法,该方法将参数分解为细胞间共享的参数和细胞特异性的参数。通过应用于细胞特异性参数的变分自编码器实现了有效的非线性降维。除了实现非线性降维外,我们的模型还可以估计细胞类型特异性的基因表达。为了提高估计精度,我们引入了反映单细胞特性的对数正则化。总体而言,我们的方法在模拟研究和实际数据分析中表现出优异的性能,同时保持了良好的生物学可解释性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验