• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。

CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.

机构信息

School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018, China; Key Laboratory of Complex Systems Modeling and Simulation Ministry of Education, Ministry of Education, China.

School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018, China; School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, China.

出版信息

Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.

DOI:10.1016/j.compbiomed.2023.107263
PMID:37531858
Abstract

BACKGROUND

Single-cell RNA-sequencing (scRNA-seq) technology has revolutionized the study of cell heterogeneity and biological interpretation at the single-cell level. However, the dropout events commonly present in scRNA-seq data can markedly reduce the reliability of downstream analysis. Existing imputation methods often overlook the discrepancy between the established cell relationship from dropout noisy data and reality, which limits their performances due to the learned untrustworthy cell representations.

METHOD

Here, we propose a novel approach called the CL-Impute (Contrastive Learning-based Impute) model for estimating missing genes without relying on preconstructed cell relationships. CL-Impute utilizes contrastive learning and a self-attention network to address this challenge. Specifically, the proposed CL-Impute model leverages contrastive learning to learn cell representations from the self-perspective of dropout events, whereas the self-attention network captures cell relationships from the global-perspective.

RESULTS

Experimental results on four benchmark datasets, including quantitative assessment, cell clustering, gene identification, and trajectory inference, demonstrate the superior performance of CL-Impute compared with that of existing state-of-the-art imputation methods. Furthermore, our experiment reveals that combining contrastive learning and masking cell augmentation enables the model to learn actual latent features from noisy data with a high rate of dropout events, enhancing the reliability of imputed values.

CONCLUSIONS

CL-Impute is a novel contrastive learning-based method to impute scRNA-seq data in the context of high dropout rate. The source code of CL-Impute is available at https://github.com/yuchen21-web/Imputation-for-scRNA-seq.

摘要

背景

单细胞 RNA 测序(scRNA-seq)技术彻底改变了单细胞水平的细胞异质性和生物学解释的研究。然而,scRNA-seq 数据中常见的缺失事件会显著降低下游分析的可靠性。现有的插补方法通常忽略了由缺失噪声数据建立的细胞关系与现实之间的差异,这限制了它们的性能,因为学习到的不可信的细胞表示。

方法

在这里,我们提出了一种称为 CL-Impute(基于对比学习的插补)的新方法,用于在不依赖预先构建的细胞关系的情况下估计缺失的基因。CL-Impute 利用对比学习和自注意力网络来解决这个问题。具体来说,所提出的 CL-Impute 模型利用对比学习从缺失事件的自视角学习细胞表示,而自注意力网络从全局视角捕捉细胞关系。

结果

在包括定量评估、细胞聚类、基因识别和轨迹推断在内的四个基准数据集上的实验结果表明,CL-Impute 与现有最先进的插补方法相比具有优越的性能。此外,我们的实验表明,结合对比学习和掩蔽细胞增强可以使模型以高缺失率的噪声数据中学习实际的潜在特征,从而提高插补值的可靠性。

结论

CL-Impute 是一种在高缺失率背景下对 scRNA-seq 数据进行插补的新的基于对比学习的方法。CL-Impute 的源代码可在 https://github.com/yuchen21-web/Imputation-for-scRNA-seq 上获得。

相似文献

1
CL-Impute: A contrastive learning-based imputation for dropout single-cell RNA-seq data.CL-Impute:基于对比学习的 dropout 单细胞 RNA-seq 数据插补方法。
Comput Biol Med. 2023 Sep;164:107263. doi: 10.1016/j.compbiomed.2023.107263. Epub 2023 Jul 23.
2
GE-Impute: graph embedding-based imputation for single-cell RNA-seq data.GE-Impute:基于图嵌入的单细胞 RNA-seq 数据插补。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac313.
3
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
4
I-Impute: a self-consistent method to impute single cell RNA sequencing data.I-Impute:一种用于单细胞 RNA 测序数据插补的自洽方法。
BMC Genomics. 2020 Nov 18;21(Suppl 10):618. doi: 10.1186/s12864-020-07007-w.
5
AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification.AGImpute:基于带有缺失值识别的混合生成对抗网络对单细胞RNA测序数据进行插补
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae068.
6
SAE-Impute: imputation for single-cell data via subspace regression and auto-encoders.SAE-Impute:通过子空间回归和自动编码器对单细胞数据进行插补
BMC Bioinformatics. 2024 Oct 1;25(1):317. doi: 10.1186/s12859-024-05944-x.
7
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
8
scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network.scDCCA:基于自动编码器网络的单细胞RNA测序数据深度对比聚类
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac625.
9
Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data.Bubble:一种利用受批量RNA测序数据约束的自动编码器进行的快速单细胞RNA测序插补方法。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac580.
10
scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network.scGGAN:基于图的生成对抗网络的单细胞RNA测序数据插补
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad040.

引用本文的文献

1
AcImpute: a constraint-enhancing smooth-based approach for imputing single-cell RNA sequencing data.AcImpute:一种用于估算单细胞RNA测序数据的基于约束增强平滑的方法。
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btae711.
2
PbImpute: Precise Zero Discrimination and Balanced Imputation in Single-Cell RNA Sequencing Data.PbImpute:单细胞RNA测序数据中的精确零判别与平衡插补
J Chem Inf Model. 2025 Mar 10;65(5):2670-2684. doi: 10.1021/acs.jcim.4c02125. Epub 2025 Feb 17.
3
CPARI: a novel approach combining cell partitioning with absolute and relative imputation to address dropout in single-cell RNA-seq data.
CPARI:一种将细胞划分与绝对和相对插补相结合以解决单细胞RNA测序数据缺失值的新方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae668.
4
scVAG: Unified single-cell clustering via variational-autoencoder integration with Graph Attention Autoencoder.scVAG:通过变分自编码器与图注意力自编码器集成实现统一的单细胞聚类
Heliyon. 2024 Nov 27;10(23):e40732. doi: 10.1016/j.heliyon.2024.e40732. eCollection 2024 Dec 15.
5
scDTL: enhancing single-cell RNA-seq imputation through deep transfer learning with bulk cell information.scDTL:通过利用批量细胞信息进行深度迁移学习增强单细胞 RNA-seq 推断。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae555.
6
Recover then aggregate: unified cross-modal deep clustering with global structural information for single-cell data.恢复然后聚合:利用单细胞数据的全局结构信息进行统一的跨模态深度聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae485.
7
DCRELM: dual correlation reduction network-based extreme learning machine for single-cell RNA-seq data clustering.基于双相关降维网络的极限学习机用于单细胞 RNA-seq 数据聚类。
Sci Rep. 2024 Jun 12;14(1):13541. doi: 10.1038/s41598-024-64217-y.
8
scCRT: a contrastive-based dimensionality reduction model for scRNA-seq trajectory inference.scCRT:一种用于单细胞RNA测序轨迹推断的基于对比的降维模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae204.