• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度生成模型学习罕见细胞类型的判别和结构样本。

Learning discriminative and structural samples for rare cell types with deep generative model.

机构信息

School of Computer Science and Technology, Xidian University, Xi'an, 710071, China.

出版信息

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac317.

DOI:10.1093/bib/bbac317
PMID:35914950
Abstract

Cell types (subpopulations) serve as bio-markers for the diagnosis and therapy of complex diseases, and single-cell RNA-sequencing (scRNA-seq) measures expression of genes at cell level, paving the way for the identification of cell types. Although great efforts have been devoted to this issue, it remains challenging to identify rare cell types in scRNA-seq data because of the few-shot problem, lack of interpretability and separation of generating samples and clustering of cells. To attack these issues, a novel deep generative model for leveraging the small samples of cells (aka scLDS2) is proposed by precisely estimating the distribution of different cells, which discriminate the rare and non-rare cell types with adversarial learning. Specifically, to enhance interpretability of samples, scLDS2 generates the sparse faked samples of cells with $\ell _1$-norm, where the relations among cells are learned, facilitating the identification of cell types. Furthermore, scLDS2 directly obtains cell types from the generated samples by learning the block structure such that cells belonging to the same types are similar to each other with the nuclear-norm. scLDS2 joins the generation of samples, classification of the generated and truth samples for cells and feature extraction into a unified generative framework, which transforms the rare cell types detection problem into a classification problem, paving the way for the identification of cell types with joint learning. The experimental results on 20 datasets demonstrate that scLDS2 significantly outperforms 17 state-of-the-art methods in terms of various measurements with 25.12% improvement in adjusted rand index on average, providing an effective strategy for scRNA-seq data with rare cell types. (The software is coded using python, and is freely available for academic https://github.com/xkmaxidian/scLDS2).

摘要

细胞类型(亚群)可作为复杂疾病诊断和治疗的生物标志物,单细胞 RNA 测序(scRNA-seq)可测量细胞水平的基因表达,为鉴定细胞类型铺平道路。尽管已经付出了巨大的努力,但由于样本数量少、缺乏可解释性以及样本生成和细胞聚类的分离,仍然难以从 scRNA-seq 数据中鉴定稀有细胞类型。为了解决这些问题,我们提出了一种新的深度生成模型,即利用细胞小样本(scLDS2)的深度生成模型,通过精确估计不同细胞的分布,利用对抗学习来区分稀有和非稀有细胞类型。具体来说,为了增强样本的可解释性,scLDS2 通过生成具有 L1 范数的稀疏伪造细胞样本,学习细胞之间的关系,从而有助于识别细胞类型。此外,scLDS2 通过学习块结构,直接从生成的样本中获得细胞类型,使得属于同一类型的细胞彼此相似,核范数也相似。scLDS2 将样本生成、生成样本和真实样本的分类以及细胞特征提取纳入一个统一的生成框架,将稀有细胞类型检测问题转化为分类问题,为联合学习识别细胞类型铺平了道路。在 20 个数据集上的实验结果表明,scLDS2 在各种指标上都明显优于 17 种最先进的方法,平均调整兰德指数提高了 25.12%,为具有稀有细胞类型的 scRNA-seq 数据提供了一种有效的策略。(该软件使用 Python 编写,可在学术上免费使用:https://github.com/xkmaxidian/scLDS2)。

相似文献

1
Learning discriminative and structural samples for rare cell types with deep generative model.利用深度生成模型学习罕见细胞类型的判别和结构样本。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac317.
2
Learning deep features and topological structure of cells for clustering of scRNA-sequencing data.学习 scRNA-seq 数据聚类的细胞深度特征和拓扑结构。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac068.
3
jSRC: a flexible and accurate joint learning algorithm for clustering of single-cell RNA-sequencing data.jSRC:一种用于单细胞 RNA-seq 数据聚类的灵活准确的联合学习算法。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa433.
4
Network-Based Structural Learning Nonnegative Matrix Factorization Algorithm for Clustering of scRNA-Seq Data.用于scRNA-Seq数据聚类的基于网络的结构学习非负矩阵分解算法
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):566-575. doi: 10.1109/TCBB.2022.3161131. Epub 2023 Feb 3.
5
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
6
Joint learning dimension reduction and clustering of single-cell RNA-sequencing data.单细胞 RNA 测序数据的联合降维和聚类学习。
Bioinformatics. 2020 Jun 1;36(12):3825-3832. doi: 10.1093/bioinformatics/btaa231.
7
Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network.基于自动编码器和图神经网络的单细胞 RNA-seq 数据深度结构聚类。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac018.
8
scMultiGAN: cell-specific imputation for single-cell transcriptomes with multiple deep generative adversarial networks.scMultiGAN:使用多个深度生成对抗网络进行单细胞转录组的细胞特异性插补。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad384.
9
scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network.scGGAN:基于图的生成对抗网络的单细胞RNA测序数据插补
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad040.
10
TripletCell: a deep metric learning framework for accurate annotation of cell types at the single-cell level.三重细胞:一种用于单细胞水平准确注释细胞类型的深度度量学习框架。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad132.

引用本文的文献

1
sCIN: a contrastive learning framework for single-cell multi-omics data integration.sCIN:用于单细胞多组学数据整合的对比学习框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf411.
2
Flu-CNN: identifying host specificity of Influenza A virus using convolutional networks.流感卷积神经网络(Flu-CNN):利用卷积网络识别甲型流感病毒的宿主特异性
Hum Genomics. 2025 Aug 22;19(1):96. doi: 10.1186/s40246-025-00812-y.
3
scaLR: a low-resource deep neural network-based platform for single cell analysis and biomarker discovery.scaLR:一个基于低资源深度神经网络的单细胞分析和生物标志物发现平台。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf243.
4
Deciphering cell states and the cellular ecosystem to improve risk stratification in acute myeloid leukemia.解析细胞状态和细胞生态系统以改善急性髓系白血病的风险分层
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf028.
5
STCGAN: a novel cycle-consistent generative adversarial network for spatial transcriptomics cellular deconvolution.STCGAN:一种用于空间转录组学细胞反卷积的新型循环一致生成对抗网络。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae670.
6
scVAG: Unified single-cell clustering via variational-autoencoder integration with Graph Attention Autoencoder.scVAG:通过变分自编码器与图注意力自编码器集成实现统一的单细胞聚类
Heliyon. 2024 Nov 27;10(23):e40732. doi: 10.1016/j.heliyon.2024.e40732. eCollection 2024 Dec 15.
7
Exploring structured molecular landscape from single-cell multi-omics data by an explainable multimodal model.通过可解释的多模态模型从单细胞多组学数据中探索结构化分子景观。
iScience. 2024 Nov 2;27(12):111131. doi: 10.1016/j.isci.2024.111131. eCollection 2024 Dec 20.
8
Identifying cell types by lasso-constraint regularized Gaussian graphical model based on weighted distance penalty.基于加权距离惩罚的lasso 约束正则化高斯图形模型识别细胞类型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae572.
9
Unlocking cross-modal interplay of single-cell joint profiling with CellMATE.CellMATE 解锁单细胞联合分析的跨模态相互作用。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae582.
10
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.