• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于概率的基因表达条码,用于注释单细胞 RNA-seq 数据中的细胞类型。

A probabilistic gene expression barcode for annotation of cell types from single-cell RNA-seq data.

机构信息

Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA.

Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA and Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA.

出版信息

Biostatistics. 2022 Oct 14;23(4):1150-1164. doi: 10.1093/biostatistics/kxac021.

DOI:10.1093/biostatistics/kxac021
PMID:35770795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9802389/
Abstract

Single-cell RNA sequencing (scRNA-seq) quantifies gene expression for individual cells in a sample, which allows distinct cell-type populations to be identified and characterized. An important step in many scRNA-seq analysis pipelines is the annotation of cells into known cell types. While this can be achieved using experimental techniques, such as fluorescence-activated cell sorting, these approaches are impractical for large numbers of cells. This motivates the development of data-driven cell-type annotation methods. We find limitations with current approaches due to the reliance on known marker genes or from overfitting because of systematic differences, or batch effects, between studies. Here, we present a statistical approach that leverages public data sets to combine information across thousands of genes, uses a latent variable model to define cell-type-specific barcodes and account for batch effect variation, and probabilistically annotates cell-type identity from a reference of known cell types. The barcoding approach also provides a new way to discover marker genes. Using a range of data sets, including those generated to represent imperfect real-world reference data, we demonstrate that our approach substantially outperforms current reference-based methods, particularly when predicting across studies.

摘要

单细胞 RNA 测序 (scRNA-seq) 可定量分析样本中单个细胞的基因表达情况,从而识别和描述不同的细胞类型。在许多 scRNA-seq 分析流程中,一个重要步骤是将细胞注释为已知的细胞类型。虽然可以使用荧光激活细胞分选等实验技术来实现这一目标,但对于大量细胞来说,这些方法并不实用。这促使人们开发了数据驱动的细胞类型注释方法。我们发现,由于依赖已知的标记基因,或者由于研究之间存在系统性差异(或批次效应)而导致过度拟合,当前方法存在局限性。在这里,我们提出了一种统计方法,该方法利用公共数据集来整合数千个基因的信息,使用潜在变量模型定义细胞类型特异性的条码,并解释批次效应变化,从已知细胞类型的参考中概率性地注释细胞类型身份。条码方法还提供了一种发现标记基因的新方法。使用一系列数据集,包括为代表不完美的真实世界参考数据而生成的数据集,我们证明了我们的方法显著优于当前基于参考的方法,特别是在跨研究预测时。

相似文献

1
A probabilistic gene expression barcode for annotation of cell types from single-cell RNA-seq data.一种基于概率的基因表达条码,用于注释单细胞 RNA-seq 数据中的细胞类型。
Biostatistics. 2022 Oct 14;23(4):1150-1164. doi: 10.1093/biostatistics/kxac021.
2
scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets.scAnno:一种基于去卷积策略的单细胞 RNA 测序数据集自动细胞类型注释工具。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad179.
3
Detection of high variability in gene expression from single-cell RNA-seq profiling.从单细胞RNA测序分析中检测基因表达的高变异性。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):508. doi: 10.1186/s12864-016-2897-6.
4
Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview.单细胞 RNA 测序分析:分步概述。
Methods Mol Biol. 2021;2284:343-365. doi: 10.1007/978-1-0716-1307-8_19.
5
CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data.CellMixS:量化和可视化单细胞 RNA-seq 数据中的批次效应。
Life Sci Alliance. 2021 Mar 23;4(6). doi: 10.26508/lsa.202001004. Print 2021 Jun.
6
Detection of cell markers from single cell RNA-seq with sc2marker.使用 sc2marker 从单细胞 RNA-seq 中检测细胞标记物。
BMC Bioinformatics. 2022 Jul 12;23(1):276. doi: 10.1186/s12859-022-04817-5.
7
Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis.单细胞 Mayo 图谱 (scMayoMap):单细胞 RNA 测序数据分析中用于细胞类型注释的易用工具。
BMC Biol. 2023 Oct 20;21(1):223. doi: 10.1186/s12915-023-01728-6.
8
Random forest based similarity learning for single cell RNA sequencing data.基于随机森林的单细胞 RNA 测序数据相似性学习。
Bioinformatics. 2018 Jul 1;34(13):i79-i88. doi: 10.1093/bioinformatics/bty260.
9
ASAP: a web-based platform for the analysis and interactive visualization of single-cell RNA-seq data.ASAP:一个用于单细胞 RNA-seq 数据分析和交互式可视化的基于网络的平台。
Bioinformatics. 2017 Oct 1;33(19):3123-3125. doi: 10.1093/bioinformatics/btx337.
10
Identification of new marker genes from plant single-cell RNA-seq data using interpretable machine learning methods.利用可解释机器学习方法从植物单细胞 RNA-seq 数据中鉴定新的标记基因。
New Phytol. 2022 May;234(4):1507-1520. doi: 10.1111/nph.18053. Epub 2022 Mar 26.

引用本文的文献

1
IL-23 Promotes γδT Cell Activity in Dry Eye Disease Progression.白细胞介素-23在干眼病进展过程中促进γδT细胞活性。
Invest Ophthalmol Vis Sci. 2025 Feb 3;66(2):10. doi: 10.1167/iovs.66.2.10.
2
Baseline immune state and T-cell clonal kinetics are associated with durable response to CAR-T therapy in large B-cell lymphoma.基线免疫状态和T细胞克隆动力学与大B细胞淋巴瘤对CAR-T疗法的持久反应相关。
Blood. 2024 Dec 12;144(24):2490-2502. doi: 10.1182/blood.2024024381.
3
SIMS: A deep-learning label transfer tool for single-cell RNA sequencing analysis.SIMS:一种用于单细胞RNA测序分析的深度学习标签转移工具。
Cell Genom. 2024 Jun 12;4(6):100581. doi: 10.1016/j.xgen.2024.100581. Epub 2024 May 31.
4
Immune cell type signature discovery and random forest classification for analysis of single cell gene expression datasets.免疫细胞类型特征发现和随机森林分类用于分析单细胞基因表达数据集。
Front Immunol. 2023 Aug 4;14:1194745. doi: 10.3389/fimmu.2023.1194745. eCollection 2023.
5
Significance analysis for clustering with single-cell RNA-sequencing data.基于单细胞 RNA-seq 数据的聚类意义分析。
Nat Methods. 2023 Aug;20(8):1196-1202. doi: 10.1038/s41592-023-01933-9. Epub 2023 Jul 10.
6
Predictive and robust gene selection for spatial transcriptomics.用于空间转录组学的预测和稳健基因选择。
Nat Commun. 2023 Apr 12;14(1):2091. doi: 10.1038/s41467-023-37392-1.
7
Identifying tumor cells at the single-cell level using machine learning.利用机器学习在单细胞水平上识别肿瘤细胞。
Genome Biol. 2022 May 30;23(1):123. doi: 10.1186/s13059-022-02683-1.
8
Single cell gene regulatory networks in plants: Opportunities for enhancing climate change stress resilience.植物单细胞基因调控网络:提高应对气候变化应激能力的机会。
Plant Cell Environ. 2021 Jul;44(7):2006-2017. doi: 10.1111/pce.14012. Epub 2021 Feb 18.
9
Complete deconvolution of DNA methylation signals from complex tissues: a geometric approach.从复杂组织中完全反卷积 DNA 甲基化信号:一种几何方法。
Bioinformatics. 2021 May 23;37(8):1052-1059. doi: 10.1093/bioinformatics/btaa930.

本文引用的文献

1
Robust decomposition of cell type mixtures in spatial transcriptomics.空间转录组学中细胞类型混合物的稳健分解。
Nat Biotechnol. 2022 Apr;40(4):517-526. doi: 10.1038/s41587-021-00830-w. Epub 2021 Feb 18.
2
Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling.单细胞 RNA-seq 数据中肿瘤微环境细胞类型的概率分配。
Nat Methods. 2019 Oct;16(10):1007-1015. doi: 10.1038/s41592-019-0529-1. Epub 2019 Sep 9.
3
Supervised classification enables rapid annotation of cell atlases.监督分类可实现细胞图谱的快速标注。
Nat Methods. 2019 Oct;16(10):983-986. doi: 10.1038/s41592-019-0535-3. Epub 2019 Sep 9.
4
A comparison of automatic cell identification methods for single-cell RNA sequencing data.单细胞 RNA 测序数据的自动细胞识别方法比较。
Genome Biol. 2019 Sep 9;20(1):194. doi: 10.1186/s13059-019-1795-z.
5
SingleCellNet: A Computational Tool to Classify Single Cell RNA-Seq Data Across Platforms and Across Species.SingleCellNet:一种跨平台和跨物种对单细胞 RNA-Seq 数据进行分类的计算工具。
Cell Syst. 2019 Aug 28;9(2):207-213.e2. doi: 10.1016/j.cels.2019.06.004. Epub 2019 Jul 31.
6
Development of a Chimeric Model to Study and Manipulate Human Microglia In Vivo.体内研究和操纵人类小胶质细胞的嵌合模型的开发。
Neuron. 2019 Sep 25;103(6):1016-1033.e10. doi: 10.1016/j.neuron.2019.07.002. Epub 2019 Jul 30.
7
CHETAH: a selective, hierarchical cell type identification method for single-cell RNA sequencing.CHETAH:一种用于单细胞 RNA 测序的选择性、层次化细胞类型识别方法。
Nucleic Acids Res. 2019 Sep 19;47(16):e95. doi: 10.1093/nar/gkz543.
8
Comprehensive Integration of Single-Cell Data.单细胞数据的综合整合。
Cell. 2019 Jun 13;177(7):1888-1902.e21. doi: 10.1016/j.cell.2019.05.031. Epub 2019 Jun 6.
9
PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data.PanglaoDB:一个用于探索小鼠和人类单细胞 RNA 测序数据的网络服务器。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz046.
10
Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage.基于参考的肺单细胞测序分析揭示了一种过渡性成纤维细胞样巨噬细胞。
Nat Immunol. 2019 Feb;20(2):163-172. doi: 10.1038/s41590-018-0276-y. Epub 2019 Jan 14.