• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

cfDiffusion:基于扩散的高质量单细胞RNA测序数据高效生成,无分类器引导。

cfDiffusion: diffusion-based efficient generation of high quality scRNA-seq data with classifier-free guidance.

作者信息

Zhang Tianjiao, Zhao Zhongqian, Ren Jixiang, Zhang Ziheng, Zhang Hongfei, Wang Guohua

机构信息

College of Computer and Control Engineering, Northeast Forestry University, No. 26, Hexing Road, Xiangfang District, Harbin 150040, China.

Faculty of Computing, Harbin Institute of Technology, No. 92 Xidazhi Street, Nangang District, Harbin 150001, China.

出版信息

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf071.

DOI:10.1093/bib/bbaf071
PMID:39987461
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11846686/
Abstract

Single-cell RNA sequencing (scRNA-seq) technology provides a powerful means to measure gene expression at the individual cell level, thereby uncovering the intricate cellular heterogeneity that underlies various biological processes, including embryonic development, tumor metastasis, and microbial reproduction. However, the variable amounts of data generated across different cell types within tissues can compromise the accuracy of downstream analyses. Traditional approaches for generating scRNA-seq simulation data often rely on predefined data distributions, which can negatively impact the quality of the simulated data. Furthermore, these methods typically focus on simulating single-attribute cells, necessitating substantial additional data for the simulation of multi-attribute cells, which can lead to increased training times. To address these limitations, we propose cfDiffusion, a novel method grounded in diffusion models that incorporates Classifier-Free Guidance and a high-level feature caching mechanism. By leveraging Classifier-Free Guidance, cfDiffusion significantly reduces the training costs associated with model development compared to traditional Classifier Guidance methods. The integration of a caching mechanism further enhances efficiency by shortening inference times. While the inference duration of cfDiffusion remains longer than that of scDiffusion, it exhibits superior expressiveness and efficiency in generating multi-attribute single-cell data. Evaluated across datasets from multiple sequencing platforms, cfDiffusion consistently outperforms state-of-the-art models across various performance metrics. Additionally, cfDiffusion enables the simulation of single-cell data along a pseudo-time scale, facilitating advanced analyses such as tracking cell differentiation, investigating intercellular communication, and elucidating cellular heterogeneity.

摘要

单细胞RNA测序(scRNA-seq)技术提供了一种在单个细胞水平上测量基因表达的强大方法,从而揭示了各种生物过程(包括胚胎发育、肿瘤转移和微生物繁殖)背后复杂的细胞异质性。然而,组织内不同细胞类型产生的数据量不同,可能会影响下游分析的准确性。传统的生成scRNA-seq模拟数据的方法通常依赖于预定义的数据分布,这可能会对模拟数据的质量产生负面影响。此外,这些方法通常侧重于模拟单属性细胞,对于多属性细胞的模拟需要大量额外的数据,这可能会导致训练时间增加。为了解决这些局限性,我们提出了cfDiffusion,这是一种基于扩散模型的新方法,它结合了无分类器引导和高级特征缓存机制。通过利用无分类器引导,与传统的分类器引导方法相比,cfDiffusion显著降低了与模型开发相关的训练成本。缓存机制的集成通过缩短推理时间进一步提高了效率。虽然cfDiffusion的推理持续时间仍然比scDiffusion长,但它在生成多属性单细胞数据方面表现出卓越的表现力和效率。在来自多个测序平台的数据集上进行评估时,cfDiffusion在各种性能指标上始终优于现有模型。此外,cfDiffusion能够沿着伪时间尺度模拟单细胞数据,便于进行高级分析,如跟踪细胞分化、研究细胞间通讯和阐明细胞异质性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/78ce2fed47f4/bbaf071f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/2ea76599459a/bbaf071f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/8556d3eb4c5e/bbaf071f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/5b480781d3e2/bbaf071f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/a8d25b9bca37/bbaf071f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/ad76061dc9de/bbaf071f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/78ce2fed47f4/bbaf071f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/2ea76599459a/bbaf071f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/8556d3eb4c5e/bbaf071f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/5b480781d3e2/bbaf071f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/a8d25b9bca37/bbaf071f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/ad76061dc9de/bbaf071f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e43/11846686/78ce2fed47f4/bbaf071f6.jpg

相似文献

1
cfDiffusion: diffusion-based efficient generation of high quality scRNA-seq data with classifier-free guidance.cfDiffusion:基于扩散的高质量单细胞RNA测序数据高效生成,无分类器引导。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf071.
2
scDiffusion: conditional generation of high-quality single-cell data using diffusion model.scDiffusion:使用扩散模型生成高质量单细胞数据的条件生成。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae518.
3
scAMZI: attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data.scAMZI:用于scRNA序列数据聚类的带零膨胀层的基于注意力的深度自动编码器。
BMC Genomics. 2025 Apr 7;26(1):350. doi: 10.1186/s12864-025-11511-2.
4
GraCEImpute: A novel graph clustering autoencoder approach for imputation of single-cell RNA-seq data.GraCEImpute:一种用于单细胞RNA测序数据插补的新型图聚类自动编码器方法。
Comput Biol Med. 2025 Jan;184:109400. doi: 10.1016/j.compbiomed.2024.109400. Epub 2024 Nov 18.
5
Inferring gene regulatory networks from time-series scRNA-seq data via GRANGER causal recurrent autoencoders.通过格兰杰因果循环自动编码器从时间序列单细胞RNA测序数据推断基因调控网络。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf089.
6
scDFN: enhancing single-cell RNA-seq clustering with deep fusion networks.scDFN:利用深度融合网络增强单细胞 RNA-seq 聚类
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae486.
7
scIDPMs: Single-Cell RNA-Seq Imputation Using Diffusion Probabilistic Models.scIDPMs:使用扩散概率模型的单细胞RNA测序插补
IEEE J Biomed Health Inform. 2025 Apr;29(4):3057-3068. doi: 10.1109/JBHI.2024.3430554. Epub 2025 Apr 4.
8
Mcadet: A feature selection method for fine-resolution single-cell RNA-seq data based on multiple correspondence analysis and community detection.基于多重对应分析和社区检测的精细分辨率单细胞 RNA-seq 数据特征选择方法
PLoS Comput Biol. 2024 Oct 28;20(10):e1012560. doi: 10.1371/journal.pcbi.1012560. eCollection 2024 Oct.
9
scMUG: deep clustering analysis of single-cell RNA-seq data on multiple gene functional modules.scMUG:基于多个基因功能模块的单细胞RNA测序数据深度聚类分析
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf138.
10
scDMAE: A Generative Denoising Model Adopted Mask Strategy for scRNA-Seq Data Recovery.scDMAE:一种采用掩模策略的生成去噪模型,用于 scRNA-Seq 数据恢复。
IEEE J Biomed Health Inform. 2024 Jun;28(6):3772-3780. doi: 10.1109/JBHI.2024.3383921. Epub 2024 Jun 6.

本文引用的文献

1
A multichannel graph neural network based on multisimilarity modality hypergraph contrastive learning for predicting unknown types of cancer biomarkers.一种基于多相似模态超图对比学习的多通道图神经网络,用于预测未知类型的癌症生物标志物。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae575.
2
scDiffusion: conditional generation of high-quality single-cell data using diffusion model.scDiffusion:使用扩散模型生成高质量单细胞数据的条件生成。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae518.
3
scPriorGraph: constructing biosemantic cell-cell graphs with prior gene set selection for cell type identification from scRNA-seq data.
scPriorGraph:使用预先选择的基因集从 scRNA-seq 数据中构建生物语义细胞-细胞图,以进行细胞类型识别。
Genome Biol. 2024 Aug 5;25(1):207. doi: 10.1186/s13059-024-03357-w.
4
scZAG: Integrating ZINB-Based Autoencoder with Adaptive Data Augmentation Graph Contrastive Learning for scRNA-seq Clustering.scZAG:基于 ZINB 的自动编码器与自适应数据增强图对比学习在 scRNA-seq 聚类中的整合。
Int J Mol Sci. 2024 May 29;25(11):5976. doi: 10.3390/ijms25115976.
5
CPPLS-MLP: a method for constructing cell-cell communication networks and identifying related highly variable genes based on single-cell sequencing and spatial transcriptomics data.CPPLS-MLP:一种基于单细胞测序和空间转录组学数据构建细胞间通讯网络和识别相关高变基因的方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae198.
6
scPML: pathway-based multi-view learning for cell type annotation from single-cell RNA-seq data.scPML:基于通路的单细胞 RNA-seq 数据的细胞类型注释的多视图学习。
Commun Biol. 2023 Dec 14;6(1):1268. doi: 10.1038/s42003-023-05634-z.
7
Single cell transcriptome sequencing of stimulated and frozen human peripheral blood mononuclear cells.刺激和冷冻的人外周血单个核细胞的单细胞转录组测序。
Sci Data. 2023 Jul 6;10(1):433. doi: 10.1038/s41597-023-02348-z.
8
LSH-GAN enables in-silico generation of cells for small sample high dimensional scRNA-seq data.LSH-GAN 可实现小样本高维 scRNA-seq 数据的计算机细胞生成。
Commun Biol. 2022 Jun 10;5(1):577. doi: 10.1038/s42003-022-03473-y.
9
HyPR-seq: Single-cell quantification of chosen RNAs via hybridization and sequencing of DNA probes.HyPR-seq:通过杂交和 DNA 探针测序对选定的 RNA 进行单细胞定量。
Proc Natl Acad Sci U S A. 2020 Dec 29;117(52):33404-33413. doi: 10.1073/pnas.2010738117. Epub 2020 Dec 21.
10
scIGANs: single-cell RNA-seq imputation using generative adversarial networks.scIGANs:基于生成对抗网络的单细胞 RNA-seq 插补。
Nucleic Acids Res. 2020 Sep 4;48(15):e85. doi: 10.1093/nar/gkaa506.