Optimal marker gene selection for cell type discrimination in single cell analyses.

Affiliations

Department of Computer Science and Technology, University of Cambridge, Cambridge, UK.

Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD, USA.

Publication information

Nat Commun. 2021 Feb 19;12(1):1186. doi: 10.1038/s41467-021-21453-4.

DOI:10.1038/s41467-021-21453-4
PMID:33608535
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7895823/
Abstract

Single-cell technologies characterize complex cell populations across multiple data modalities at unprecedented scale and resolution. Multi-omic data for single cell gene expression, in situ hybridization, or single cell chromatin states are increasingly available across diverse tissue types. When isolating specific cell types from a sample of dissociated cells or performing in situ sequencing in collections of heterogeneous cells, one challenging task is to select a small set of informative markers that robustly enable the identification and discrimination of specific cell types or cell states as precisely as possible. Given single cell RNA-seq data and a set of cellular labels to discriminate, scGeneFit selects gene markers that jointly optimize cell label recovery using label-aware compressive classification methods. This results in a substantially more robust and less redundant set of markers than existing methods, most of which identify markers that separate each cell label from the rest. When applied to a data set given a hierarchy of cell types as labels, the markers found by our method improve the recovery of the cell type hierarchy with fewer markers than existing methods, using a computationally efficient and principled optimization.
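The abstract does not spell out the optimization, so the following is only a rough sketch of what jointly selecting markers for label recovery could look like as a linear program. Everything in it is an assumption made for illustration: the function name `select_markers`, the parameters `num_markers` and `eps`, the pairwise margin constraints, and the toy data. It is not the scGeneFit implementation or its API.

```python
import numpy as np
from scipy.optimize import linprog


def select_markers(X, labels, num_markers, eps=1.0):
    """Pick num_markers genes that jointly separate differently labeled cells.

    Hypothetical simplified LP (an assumption, not the published code):
      minimize   sum(beta)
      subject to sum_j alpha_j * (x_j - y_j)^2 >= eps - beta_pair
                 sum_j alpha_j <= num_markers
                 0 <= alpha_j <= 1,  beta_pair >= 0
    with one constraint per pair of cells (x, y) that carry different labels.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n_cells, n_genes = X.shape

    # Squared per-gene differences for every pair of differently labeled cells.
    pairs = [(i, k) for i in range(n_cells) for k in range(i + 1, n_cells)
             if labels[i] != labels[k]]
    D = np.array([(X[i] - X[k]) ** 2 for i, k in pairs])
    n_pairs = len(pairs)

    # Variables are [alpha (one per gene), beta (one per pair)].
    # Only the slack variables beta are penalized in the objective.
    c = np.concatenate([np.zeros(n_genes), np.ones(n_pairs)])

    # Margin constraints rewritten as: -D @ alpha - beta <= -eps
    A_margin = np.hstack([-D, -np.eye(n_pairs)])
    b_margin = -eps * np.ones(n_pairs)
    # Budget constraint: sum(alpha) <= num_markers
    A_budget = np.concatenate([np.ones(n_genes), np.zeros(n_pairs)])[None, :]

    A_ub = np.vstack([A_margin, A_budget])
    b_ub = np.concatenate([b_margin, [num_markers]])
    bounds = [(0, 1)] * n_genes + [(0, None)] * n_pairs

    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)

    # Keep the genes with the largest weights, reported in index order.
    alpha = res.x[:n_genes]
    return np.sort(np.argsort(alpha)[-num_markers:])


# Toy data: gene 0 is on only in label A, gene 1 only in label B,
# gene 2 varies within labels, gene 3 is constant.
X_toy = np.array([
    [1.0, 0.0, 0.0, 0.2],
    [1.0, 0.0, 0.1, 0.2],
    [0.0, 1.0, 0.0, 0.2],
    [0.0, 1.0, 0.1, 0.2],
    [0.0, 0.0, 0.0, 0.2],
    [0.0, 0.0, 0.1, 0.2],
])
labels_toy = ["A", "A", "B", "B", "C", "C"]
markers = select_markers(X_toy, labels_toy, num_markers=2)
print(markers.tolist())
```

In this toy case the two selected genes are shared across all three label comparisons rather than chosen one label at a time, which is the contrast the abstract draws with one-versus-rest marker methods. The sketch enumerates all cross-label pairs, so it would only be practical for small inputs.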

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8c/7895823/6769e68b98ae/41467_2021_21453_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8c/7895823/fb5ebcb23abb/41467_2021_21453_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8c/7895823/110f00c76c76/41467_2021_21453_Fig2_HTML.jpg