• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RVAgene:基因表达时间序列数据的生成式建模

RVAgene: generative modeling of gene expression time series data.

作者信息

Mitra Raktim, MacLean Adam L

机构信息

University of Southern California, Los Angeles, CA 90007, USA.

出版信息

Bioinformatics. 2021 Oct 11;37(19):3252-3262. doi: 10.1093/bioinformatics/btab260.

DOI:10.1093/bioinformatics/btab260
PMID:33974008
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8504625/
Abstract

MOTIVATION

Methods to model dynamic changes in gene expression at a genome-wide level are not currently sufficient for large (temporally rich or single-cell) datasets. Variational autoencoders offer means to characterize large datasets and have been used effectively to characterize features of single-cell datasets. Here, we extend these methods for use with gene expression time series data.

RESULTS

We present RVAgene: a recurrent variational autoencoder to model gene expression dynamics. RVAgene learns to accurately and efficiently reconstruct temporal gene profiles. It also learns a low dimensional representation of the data via a recurrent encoder network that can be used for biological feature discovery, and from which we can generate new gene expression data by sampling the latent space. We test RVAgene on simulated and real biological datasets, including embryonic stem cell differentiation and kidney injury response dynamics. In all cases, RVAgene accurately reconstructed complex gene expression temporal profiles. Via cross validation, we show that a low-error latent space representation can be learnt using only a fraction of the data. Through clustering and gene ontology term enrichment analysis on the latent space, we demonstrate the potential of RVAgene for unsupervised discovery. In particular, RVAgene identifies new programs of shared gene regulation of Lox family genes in response to kidney injury.

AVAILABILITY AND IMPLEMENTATION

All datasets analyzed in this manuscript are publicly available and have been published previously. RVAgene is available in Python, at GitHub: https://github.com/maclean-lab/RVAgene; Zenodo archive: http://doi.org/10.5281/zenodo.4271097.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

目前,在全基因组水平上对基因表达动态变化进行建模的方法尚不足以处理大型(时间丰富或单细胞)数据集。变分自编码器提供了表征大型数据集的方法,并已有效地用于表征单细胞数据集的特征。在这里,我们扩展这些方法以用于基因表达时间序列数据。

结果

我们提出了RVAgene:一种用于对基因表达动态进行建模的循环变分自编码器。RVAgene学习学习准确有效地重建时间基因谱。它还通过循环编码器网络学习数据的低维表示,该网络可用于生物特征发现,并且我们可以通过对潜在空间进行采样从中生成新的基因表达数据。我们在模拟和真实生物数据集上测试了RVAgene,包括胚胎干细胞分化和肾损伤反应动态。在所有情况下,RVAgene都能准确重建复杂的基因表达时间谱。通过交叉验证,我们表明仅使用一小部分数据就能学习到低误差的潜在空间表示。通过对潜在空间进行聚类和基因本体术语富集分析,我们展示了RVAgene进行无监督发现的潜力。特别是,RVAgene识别出了Lox家族基因在肾损伤反应中共享基因调控的新程序。

可用性和实现方式

本手稿中分析的所有数据集均可公开获取且已在之前发表。RVAgene以Python语言提供,可在GitHub上获取:https://github.com/maclean-lab/RVAgene;Zenodo存档:http://doi.org/10.5281/zenodo.4271097。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/4680622ab4b4/btab260f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/00297be68d8d/btab260f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/12106077ad30/btab260f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/8b7f9faf2ab6/btab260f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/79d651cef460/btab260f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/8475ba6bd762/btab260f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/4680622ab4b4/btab260f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/00297be68d8d/btab260f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/12106077ad30/btab260f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/8b7f9faf2ab6/btab260f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/79d651cef460/btab260f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/8475ba6bd762/btab260f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cfe/8504625/4680622ab4b4/btab260f6.jpg

相似文献

1
RVAgene: generative modeling of gene expression time series data.RVAgene:基因表达时间序列数据的生成式建模
Bioinformatics. 2021 Oct 11;37(19):3252-3262. doi: 10.1093/bioinformatics/btab260.
2
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
3
The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data.深度生成解码器:表示的 MAP 估计可改进单细胞 RNA 数据的建模。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad497.
4
ACTIVA: realistic single-cell RNA-seq generation with automatic cell-type identification using introspective variational autoencoders.ACTIVA:使用内省变分自动编码器实现自动细胞类型识别的真实单细胞 RNA-seq 生成。
Bioinformatics. 2022 Apr 12;38(8):2194-2201. doi: 10.1093/bioinformatics/btac095.
5
Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations.生物学启发的变分自动编码器允许对遗传和药物诱导的扰动进行预测建模。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad387.
6
bmVAE: a variational autoencoder method for clustering single-cell mutation data.基于变分自编码器的单细胞突变聚类方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac790.
7
A machine learning-based method for automatically identifying novel cells in annotating single-cell RNA-seq data.基于机器学习的方法,用于自动识别注释单细胞 RNA-seq 数据中的新型细胞。
Bioinformatics. 2022 Oct 31;38(21):4885-4892. doi: 10.1093/bioinformatics/btac617.
8
scVAE: variational auto-encoders for single-cell gene expression data.scVAE:用于单细胞基因表达数据的变分自动编码器。
Bioinformatics. 2020 Aug 15;36(16):4415-4422. doi: 10.1093/bioinformatics/btaa293.
9
scGAC: a graph attentional architecture for clustering single-cell RNA-seq data.scGAC:一种用于聚类单细胞 RNA-seq 数据的图注意力架构。
Bioinformatics. 2022 Apr 12;38(8):2187-2193. doi: 10.1093/bioinformatics/btac099.
10
Deep cross-omics cycle attention model for joint analysis of single-cell multi-omics data.用于单细胞多组学数据联合分析的深度跨组学循环注意力模型。
Bioinformatics. 2021 Nov 18;37(22):4091-4099. doi: 10.1093/bioinformatics/btab403.

引用本文的文献

1
Single-nucleus RNA sequencing reveals the specific molecular signatures of myeloid cells responding to brain injury after microglial replacement.单核RNA测序揭示了小胶质细胞替代后骨髓细胞对脑损伤作出反应的特定分子特征。
Front Immunol. 2025 Jul 24;16:1625673. doi: 10.3389/fimmu.2025.1625673. eCollection 2025.
2
A deep generative model for deciphering cellular dynamics and in silico drug discovery in complex diseases.一种用于解析复杂疾病中细胞动力学和计算机辅助药物发现的深度生成模型。
Nat Biomed Eng. 2025 Jun 20. doi: 10.1038/s41551-025-01423-7.
3
Data-driven model discovery and model selection for noisy biological systems.

本文引用的文献

1
Network inference with Granger causality ensembles on single-cell transcriptomics.基于单细胞转录组学的格兰杰因果关系集成的网络推断。
Cell Rep. 2022 Feb 8;38(6):110333. doi: 10.1016/j.celrep.2022.110333.
2
TENET: gene network reconstruction using transfer entropy reveals key regulatory factors from single cell transcriptomic data.TENET:利用传递熵重建基因网络,从单细胞转录组数据中揭示关键调控因子。
Nucleic Acids Res. 2021 Jan 11;49(1):e1. doi: 10.1093/nar/gkaa1014.
3
Gene regulation inference from single-cell RNA-seq data with linear differential equations and velocity inference.
针对有噪声的生物系统的数据驱动模型发现与模型选择
PLoS Comput Biol. 2025 Jan 21;21(1):e1012762. doi: 10.1371/journal.pcbi.1012762. eCollection 2025 Jan.
4
Generation of synthetic whole-slide image tiles of tumours from RNA-sequencing data via cascaded diffusion models.通过级联扩散模型从RNA测序数据生成肿瘤的合成全切片图像块
Nat Biomed Eng. 2025 Mar;9(3):320-332. doi: 10.1038/s41551-024-01193-8. Epub 2024 Mar 21.
5
SaPt-CNN-LSTM-AR-EA: a hybrid ensemble learning framework for time series-based multivariate DNA sequence prediction.SaPt-CNN-LSTM-AR-EA:一种用于基于时间序列的多变量DNA序列预测的混合集成学习框架。
PeerJ. 2023 Oct 4;11:e16192. doi: 10.7717/peerj.16192. eCollection 2023.
6
Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review.深度学习在单细胞 RNA 测序数据分析中的应用:综述。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):814-835. doi: 10.1016/j.gpb.2022.11.011. Epub 2022 Dec 14.
7
bmVAE: a variational autoencoder method for clustering single-cell mutation data.基于变分自编码器的单细胞突变聚类方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac790.
8
Single-cell analysis of the aging female mouse hypothalamus.衰老雌性小鼠下丘脑的单细胞分析。
Nat Aging. 2022 Jul;2(7):662-678. doi: 10.1038/s43587-022-00246-4. Epub 2022 Jul 4.
9
Single-nucleus chromatin accessibility and transcriptomic characterization of Alzheimer's disease.阿尔茨海默病中单细胞核染色质可及性和转录组特征。
Nat Genet. 2021 Aug;53(8):1143-1155. doi: 10.1038/s41588-021-00894-z. Epub 2021 Jul 8.
基于线性微分方程和速度推断的单细胞 RNA-seq 数据基因调控推断。
Bioinformatics. 2020 Sep 15;36(18):4774-4780. doi: 10.1093/bioinformatics/btaa576.
4
Array programming with NumPy.使用 NumPy 进行数组编程。
Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.
5
Interpretable factor models of single-cell RNA-seq via variational autoencoders.基于变分自动编码器的单细胞 RNA-seq 可解释因子模型。
Bioinformatics. 2020 Jun 1;36(11):3418-3421. doi: 10.1093/bioinformatics/btaa169.
6
Inference of gene regulatory networks based on nonlinear ordinary differential equations.基于非线性常微分方程的基因调控网络推断。
Bioinformatics. 2020 Dec 8;36(19):4885-4893. doi: 10.1093/bioinformatics/btaa032.
7
Single-Cell Profiling Reveals Sex, Lineage, and Regional Diversity in the Mouse Kidney.单细胞分析揭示了小鼠肾脏中的性别、谱系和区域多样性。
Dev Cell. 2019 Nov 4;51(3):399-413.e7. doi: 10.1016/j.devcel.2019.10.005.
8
Data denoising with transfer learning in single-cell transcriptomics.基于迁移学习的单细胞转录组学数据去噪。
Nat Methods. 2019 Sep;16(9):875-878. doi: 10.1038/s41592-019-0537-1. Epub 2019 Aug 30.
9
A comparison of single-cell trajectory inference methods.单细胞轨迹推断方法比较。
Nat Biotechnol. 2019 May;37(5):547-554. doi: 10.1038/s41587-019-0071-9. Epub 2019 Apr 1.
10
Scalable analysis of cell-type composition from single-cell transcriptomics using deep recurrent learning.利用深度循环神经网络对单细胞转录组学数据进行可扩展的细胞类型组成分析。
Nat Methods. 2019 Apr;16(4):311-314. doi: 10.1038/s41592-019-0353-7. Epub 2019 Mar 18.