• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProtoCell4P:一种基于原型的可解释神经网络,用于使用单细胞 RNA-seq 进行患者分类。

ProtoCell4P: an explainable prototype-based neural network for patient classification using single-cell RNA-seq.

机构信息

Department of Computer Science, University of Virginia, Charlottesville, VA, United States.

Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, United States.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad493.

DOI:10.1093/bioinformatics/btad493
PMID:37540223
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10444962/
Abstract

MOTIVATION

The rapid advance in single-cell RNA sequencing (scRNA-seq) technology over the past decade has provided a rich resource of gene expression profiles of single cells measured on patients, facilitating the study of many biological questions at the single-cell level. One intriguing research is to study the single cells which play critical roles in the phenotypes of patients, which has the potential to identify those cells and genes driving the disease phenotypes. To this end, deep learning models are expected to well encode the single-cell information and achieve precise prediction of patients' phenotypes using scRNA-seq data. However, we are facing critical challenges in designing deep learning models for classifying patient samples due to (i) the samples collected in the same dataset contain a variable number of cells-some samples might only have hundreds of cells sequenced while others could have thousands of cells, and (ii) the number of samples available is typically small and the expression profile of each cell is noisy and extremely high-dimensional. Moreover, the black-box nature of existing deep learning models makes it difficult for the researchers to interpret the models and extract useful knowledge from them.

RESULTS

We propose a prototype-based and cell-informed model for patient phenotype classification, termed ProtoCell4P, that can alleviate problems of the sample scarcity and the diverse number of cells by leveraging the cell knowledge with representatives of cells (called prototypes), and precisely classify the patients by adaptively incorporating information from different cells. Moreover, this classification process can be explicitly interpreted by identifying the key cells for decision making and by further summarizing the knowledge of cell types to unravel the biological nature of the classification. Our approach is explainable at the single-cell resolution which can identify the key cells in each patient's classification. The experimental results demonstrate that our proposed method can effectively deal with patient classifications using single-cell data and outperforms the existing approaches. Furthermore, our approach is able to uncover the association between cell types and biological classes of interest from a data-driven perspective.

AVAILABILITY AND IMPLEMENTATION

https://github.com/Teddy-XiongGZ/ProtoCell4P.

摘要

动机

过去十年单细胞 RNA 测序 (scRNA-seq) 技术的快速发展为测量患者单细胞的基因表达谱提供了丰富的资源,促进了在单细胞水平上研究许多生物学问题。一个有趣的研究是研究在患者表型中起关键作用的单细胞,这有可能识别出那些驱动疾病表型的细胞和基因。为此,深度学习模型有望很好地编码单细胞信息,并使用 scRNA-seq 数据实现对患者表型的精确预测。然而,由于 (i) 同一数据集中收集的样本包含数量不同的细胞-一些样本可能只有数百个测序细胞,而其他样本可能有数千个细胞,以及 (ii) 可用的样本数量通常较少,每个细胞的表达谱是嘈杂且极高维的,因此,我们在设计用于对患者样本进行分类的深度学习模型时面临着重大挑战。此外,现有深度学习模型的黑盒性质使得研究人员难以对模型进行解释并从中提取有用的知识。

结果

我们提出了一种基于原型和细胞信息的患者表型分类模型,称为 ProtoCell4P,它可以通过利用细胞知识和细胞代表(称为原型)来缓解样本稀缺和细胞数量多样化的问题,并通过自适应地整合来自不同细胞的信息来精确地对患者进行分类。此外,通过识别决策的关键细胞并进一步总结细胞类型的知识来揭示分类的生物学性质,可以显式地解释这个分类过程。我们的方法在单细胞分辨率上是可解释的,可以识别每个患者分类中的关键细胞。实验结果表明,我们提出的方法可以有效地使用单细胞数据进行患者分类,并优于现有方法。此外,我们的方法能够从数据驱动的角度揭示细胞类型和感兴趣的生物学类别之间的关联。

可用性和实现

https://github.com/Teddy-XiongGZ/ProtoCell4P。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/d04bd6e78f31/btad493f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/896e2e8c38e8/btad493f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/5b6f253dbbd1/btad493f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/b5e8ce3d189a/btad493f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/d8b6bce49d3b/btad493f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/d04bd6e78f31/btad493f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/896e2e8c38e8/btad493f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/5b6f253dbbd1/btad493f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/b5e8ce3d189a/btad493f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/d8b6bce49d3b/btad493f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699b/10444962/d04bd6e78f31/btad493f5.jpg

相似文献

1
ProtoCell4P: an explainable prototype-based neural network for patient classification using single-cell RNA-seq.ProtoCell4P:一种基于原型的可解释神经网络,用于使用单细胞 RNA-seq 进行患者分类。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad493.
2
Phenotype prediction from single-cell RNA-seq data using attention-based neural networks.基于注意力机制神经网络的单细胞 RNA-seq 数据表型预测。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae067.
3
DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data.DeepGSEA:单细胞转录组数据的可解释深度基因集富集分析。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae434.
4
Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data.基于对比学习的深度增强约束聚类算法在单细胞 RNA-seq 数据分析中的应用。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad222.
5
scNAME: neighborhood contrastive clustering with ancillary mask estimation for scRNA-seq data.scNAME:基于辅助掩模估计的 scRNA-seq 数据邻域对比聚类。
Bioinformatics. 2022 Mar 4;38(6):1575-1583. doi: 10.1093/bioinformatics/btac011.
6
CellTICS: an explainable neural network for cell-type identification and interpretation based on single-cell RNA-seq data.CellTICS:一种基于单细胞 RNA-seq 数据的可解释神经网络,用于细胞类型识别和解释。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad449.
7
SPANN: annotating single-cell resolution spatial transcriptome data with scRNA-seq data.SPANN:利用单细胞RNA测序数据注释单细胞分辨率空间转录组数据。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad533.
8
scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network.scDCCA:基于自动编码器网络的单细胞RNA测序数据深度对比聚类
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac625.
9
Deep single-cell RNA-seq data clustering with graph prototypical contrastive learning.基于图原型对比学习的深度单细胞 RNA-seq 数据聚类。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad342.
10
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.

引用本文的文献

1
Incorporating hierarchical information into multiple instance learning for patient phenotype prediction with single-cell RNA-sequencing data.将层次信息整合到多实例学习中,用于利用单细胞RNA测序数据预测患者表型。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i96-i104. doi: 10.1093/bioinformatics/btaf241.
2
cytoGPNet: Enhancing Clinical Outcome Prediction Accuracy Using Longitudinal Cytometry Data in Small Cohort Studies.细胞GP网络:在小队列研究中利用纵向细胞计数数据提高临床结果预测准确性。
bioRxiv. 2025 May 7:2025.05.01.651729. doi: 10.1101/2025.05.01.651729.
3
Explainable deep neural networks for predicting sample phenotypes from single-cell transcriptomics.

本文引用的文献

1
Single-nucleus profiling of human dilated and hypertrophic cardiomyopathy.人类扩张型和肥厚型心肌病的单细胞分析。
Nature. 2022 Aug;608(7921):174-180. doi: 10.1038/s41586-022-04817-8. Epub 2022 Jun 22.
2
Revealing the Immune Heterogeneity between Systemic Lupus Erythematosus and Rheumatoid Arthritis Based on Multi-Omics Data Analysis.基于多组学数据分析揭示系统性红斑狼疮和类风湿关节炎之间的免疫异质性。
Int J Mol Sci. 2022 May 5;23(9):5166. doi: 10.3390/ijms23095166.
3
Single-cell RNA-seq reveals cell type-specific molecular and genetic associations to lupus.
用于从单细胞转录组学预测样本表型的可解释深度神经网络。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae673.
4
scPanel: a tool for automatic identification of sparse gene panels for generalizable patient classification using scRNA-seq datasets.scPanel:一种使用 scRNA-seq 数据集进行通用患者分类的自动识别稀疏基因面板的工具。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae482.
5
DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data.DeepGSEA:单细胞转录组数据的可解释深度基因集富集分析。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae434.
单细胞 RNA 测序揭示了狼疮相关的细胞类型特异性分子和遗传关联。
Science. 2022 Apr 8;376(6589):eabf1970. doi: 10.1126/science.abf1970.
4
Single-cell RNA sequencing technologies and applications: A brief overview.单细胞 RNA 测序技术及应用:简述。
Clin Transl Med. 2022 Mar;12(3):e694. doi: 10.1002/ctm2.694.
5
Fully-automated and ultra-fast cell-type identification using specific marker combinations from single-cell transcriptomic data.利用单细胞转录组数据中的特定标记组合进行全自动超快速细胞类型识别。
Nat Commun. 2022 Mar 10;13(1):1246. doi: 10.1038/s41467-022-28803-w.
6
SARS-CoV-2-triggered mast cell rapid degranulation induces alveolar epithelial inflammation and lung injury.SARS-CoV-2 触发肥大细胞快速脱颗粒,导致肺泡上皮炎症和肺损伤。
Signal Transduct Target Ther. 2021 Dec 17;6(1):428. doi: 10.1038/s41392-021-00849-0.
7
CloudPred: Predicting Patient Phenotypes From Single-cell RNA-seq.云预测:从单细胞 RNA-seq 预测患者表型。
Pac Symp Biocomput. 2022;27:337-348.
8
Impaired local intrinsic immunity to SARS-CoV-2 infection in severe COVID-19.严重 COVID-19 中 SARS-CoV-2 感染局部固有免疫力受损。
Cell. 2021 Sep 2;184(18):4713-4733.e22. doi: 10.1016/j.cell.2021.07.023. Epub 2021 Jul 23.
9
Nasal ciliated cells are primary targets for SARS-CoV-2 replication in the early stage of COVID-19.鼻腔纤毛细胞是 COVID-19 早期 SARS-CoV-2 复制的主要靶标。
J Clin Invest. 2021 Jul 1;131(13). doi: 10.1172/JCI148517.
10
Bayesian inference of gene expression states from single-cell RNA-seq data.基于单细胞RNA测序数据的基因表达状态的贝叶斯推断。
Nat Biotechnol. 2021 Aug;39(8):1008-1016. doi: 10.1038/s41587-021-00875-x. Epub 2021 Apr 29.