• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度生成解码器:表示的 MAP 估计可改进单细胞 RNA 数据的建模。

The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data.

机构信息

Center for Health Data Science, University of Copenhagen, 2200 Copenhagen, Denmark.

Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark.

出版信息

Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad497.

DOI:10.1093/bioinformatics/btad497
PMID:37572301
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10483129/
Abstract

MOTIVATION

Learning low-dimensional representations of single-cell transcriptomics has become instrumental to its downstream analysis. The state of the art is currently represented by neural network models, such as variational autoencoders, which use a variational approximation of the likelihood for inference.

RESULTS

We here present the Deep Generative Decoder (DGD), a simple generative model that computes model parameters and representations directly via maximum a posteriori estimation. The DGD handles complex parameterized latent distributions naturally unlike variational autoencoders, which typically use a fixed Gaussian distribution, because of the complexity of adding other types. We first show its general functionality on a commonly used benchmark set, Fashion-MNIST. Secondly, we apply the model to multiple single-cell datasets. Here, the DGD learns low-dimensional, meaningful, and well-structured latent representations with sub-clustering beyond the provided labels. The advantages of this approach are its simplicity and its capability to provide representations of much smaller dimensionality than a comparable variational autoencoder.

AVAILABILITY AND IMPLEMENTATION

scDGD is available as a python package at https://github.com/Center-for-Health-Data-Science/scDGD. The remaining code is made available here: https://github.com/Center-for-Health-Data-Science/dgd.

摘要

动机

学习单细胞转录组学的低维表示已经成为其下游分析的重要手段。目前的技术水平代表是神经网络模型,如变分自动编码器,它使用似然的变分逼近进行推理。

结果

我们在这里提出了深度生成解码器(DGD),这是一种简单的生成模型,它通过最大后验估计直接计算模型参数和表示。与变分自动编码器不同,DGD 可以处理复杂的参数化潜在分布,因为添加其他类型的分布通常很复杂。我们首先在常用的基准集 Fashion-MNIST 上展示了它的一般功能。其次,我们将模型应用于多个单细胞数据集。在这里,DGD 学习到了低维的、有意义的、结构良好的潜在表示,并在提供的标签之外进行了子聚类。这种方法的优点是它的简单性,以及它能够提供比可比变分自动编码器小得多的维度的表示。

可用性和实现

scDGD 可作为 python 包在 https://github.com/Center-for-Health-Data-Science/scDGD 获得。其余代码可在此处获得:https://github.com/Center-for-Health-Data-Science/dgd。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/7ca3c90b7e5f/btad497f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/f9a6cf5a53a4/btad497f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/f9218ed6f205/btad497f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/836843f84a69/btad497f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/746ddfba1778/btad497f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/7ca3c90b7e5f/btad497f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/f9a6cf5a53a4/btad497f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/f9218ed6f205/btad497f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/836843f84a69/btad497f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/746ddfba1778/btad497f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ad/10483129/7ca3c90b7e5f/btad497f5.jpg

相似文献

1
The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data.深度生成解码器:表示的 MAP 估计可改进单细胞 RNA 数据的建模。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad497.
2
Mixture-of-Experts Variational Autoencoder for clustering and generating from similarity-based representations on single cell data.基于相似性表示的单细胞数据聚类和生成的专家混合变分自动编码器。
PLoS Comput Biol. 2021 Jun 30;17(6):e1009086. doi: 10.1371/journal.pcbi.1009086. eCollection 2021 Jun.
3
Parameter tuning is a key part of dimensionality reduction via deep variational autoencoders for single cell RNA transcriptomics.参数调整是通过深度变分自编码器进行单细胞RNA转录组学降维的关键部分。
Pac Symp Biocomput. 2019;24:362-373.
4
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
5
MichiGAN: sampling from disentangled representations of single-cell data using generative adversarial networks.MichiGAN:使用生成对抗网络从单细胞数据的解缠表示中进行采样。
Genome Biol. 2021 May 20;22(1):158. doi: 10.1186/s13059-021-02373-4.
6
A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations.基于变分推理和图自动编码器的 lncRNA-疾病关联预测的表示学习模型。
BMC Bioinformatics. 2021 Mar 21;22(1):136. doi: 10.1186/s12859-021-04073-z.
7
Deep Clustering Analysis via Dual Variational Autoencoder With Spherical Latent Embeddings.基于具有球形潜在嵌入的对偶变分自编码器的深度聚类分析
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6303-6312. doi: 10.1109/TNNLS.2021.3135460. Epub 2023 Sep 1.
8
Network-principled deep generative models for designing drug combinations as graph sets.基于网络原理的深度生成模型,用于将药物组合设计为图集合。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i445-i454. doi: 10.1093/bioinformatics/btaa317.
9
Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder.基于改进深度变分自动编码器的单细胞 RNA-seq 数据降维和可视化。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad152.
10
scVAE: variational auto-encoders for single-cell gene expression data.scVAE:用于单细胞基因表达数据的变分自动编码器。
Bioinformatics. 2020 Aug 15;36(16):4415-4422. doi: 10.1093/bioinformatics/btaa293.

引用本文的文献

1
multiDGD: A versatile deep generative model for multi-omics data.多 DGD:一种用于多组学数据的多功能深度生成模型。
Nat Commun. 2024 Nov 20;15(1):10031. doi: 10.1038/s41467-024-53340-z.
2
MOSAIC: An Artificial Intelligence-Based Framework for Multimodal Analysis, Classification, and Personalized Prognostic Assessment in Rare Cancers.MOSAIC:一种基于人工智能的罕见癌症多模态分析、分类和个性化预后评估框架。
JCO Clin Cancer Inform. 2024 Jun;8:e2400008. doi: 10.1200/CCI.24.00008.
3
N-of-one differential gene expression without control samples using a deep generative model.

本文引用的文献

1
Automatic cell-type harmonization and integration across Human Cell Atlas datasets.自动细胞类型协调和整合人类细胞图谱数据集。
Cell. 2023 Dec 21;186(26):5876-5891.e20. doi: 10.1016/j.cell.2023.11.026.
2
Cross-tissue immune cell analysis reveals tissue-specific features in humans.跨组织免疫细胞分析揭示人类组织特异性特征。
Science. 2022 May 13;376(6594):eabl5197. doi: 10.1126/science.abl5197.
3
A Python library for probabilistic analysis of single-cell omics data.一个用于单细胞组学数据概率分析的Python库。
使用深度生成模型进行无对照样本的 N-of-one 差异基因表达分析。
Genome Biol. 2023 Nov 16;24(1):263. doi: 10.1186/s13059-023-03104-7.
Nat Biotechnol. 2022 Feb;40(2):163-166. doi: 10.1038/s41587-021-01206-w.
4
A Manifold Learning Perspective on Representation Learning: Learning Decoder and Representations without an Encoder.表征学习的流形学习视角:无编码器学习解码器和表征
Entropy (Basel). 2021 Oct 25;23(11):1403. doi: 10.3390/e23111403.
5
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models.深度生成模型:VAE、GAN、归一化流、基于能量和自回归模型的比较综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7327-7347. doi: 10.1109/TPAMI.2021.3116668. Epub 2022 Oct 4.
6
VEGA is an interpretable generative model for inferring biological network activity in single-cell transcriptomics.VEGA 是一种可解释的生成模型,可用于推断单细胞转录组学中的生物网络活性。
Nat Commun. 2021 Sep 28;12(1):5684. doi: 10.1038/s41467-021-26017-0.
7
A robust nonlinear low-dimensional manifold for single cell RNA-seq data.单细胞 RNA-seq 数据的稳健非线性低维流形。
BMC Bioinformatics. 2020 Jul 21;21(1):324. doi: 10.1186/s12859-020-03625-z.
8
scVAE: variational auto-encoders for single-cell gene expression data.scVAE:用于单细胞基因表达数据的变分自动编码器。
Bioinformatics. 2020 Aug 15;36(16):4415-4422. doi: 10.1093/bioinformatics/btaa293.
9
Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model.基于多项模型的单细胞 RNA-Seq 特征选择和降维。
Genome Biol. 2019 Dec 23;20(1):295. doi: 10.1186/s13059-019-1861-6.
10
From Louvain to Leiden: guaranteeing well-connected communities.从鲁汶到莱顿:保障互联互通的社区。
Sci Rep. 2019 Mar 26;9(1):5233. doi: 10.1038/s41598-019-41695-z.