Selecting gene features for unsupervised analysis of single-cell gene expression data.

Author affiliations

Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA.

Department of Biostatistics and Epidemiology, Rutgers School of Public Health, Piscataway, NJ 08854, USA.

Publication information

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab295.

DOI: 10.1093/bib/bbab295
PMID: 34351383
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8574996/
Abstract

Single-cell RNA sequencing (scRNA-seq) technologies facilitate the characterization of transcriptomic landscapes in diverse species, tissues, and cell types with unprecedented molecular resolution. To evaluate various biological hypotheses using high-dimensional single-cell gene expression data, most computational and statistical methods depend on a gene feature selection step to identify genes with high biological variability and reduce computational complexity. Even though many gene selection methods have been developed for scRNA-seq analysis, a systematic comparison of the assumptions, statistical models, and selection criteria used by these methods is lacking. In this article, we summarize and discuss 17 computational methods for selecting gene features in unsupervised analysis of single-cell gene expression data, with unified notations and statistical frameworks. Our discussion provides a useful summary to help practitioners select appropriate methods based on their assumptions and applicability, and to assist method developers in designing new computational tools for unsupervised learning of scRNA-seq data.
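
As a rough illustration of what a gene feature selection step can look like, the sketch below ranks genes by their Fano factor (variance divided by mean) and keeps the top few. This is a generic heuristic chosen for illustration; the abstract does not say which 17 methods are reviewed, so it should not be read as one of them. The function name `select_top_variable_genes`, the toy matrix, and the cells-by-genes layout are all assumptions.

```python
import numpy as np

def select_top_variable_genes(counts, n_top):
    """Return column indices of the n_top genes with the largest Fano factor.

    counts: array-like of shape (n_cells, n_genes) holding expression counts
    (an assumed layout, not taken from the article).
    """
    counts = np.asarray(counts, dtype=float)
    mean = counts.mean(axis=0)
    var = counts.var(axis=0, ddof=1)  # sample variance per gene
    # Genes that are never expressed (mean 0) get a score of 0.
    fano = np.divide(var, mean, out=np.zeros_like(mean), where=mean > 0)
    order = np.argsort(fano)[::-1]  # highest score first
    return order[:n_top]

# Toy data: 4 cells x 4 genes (hypothetical values).
toy = np.array([
    [1, 0, 2, 0],
    [1, 0, 3, 0],
    [1, 10, 2, 0],
    [1, 10, 3, 0],
])
selected = select_top_variable_genes(toy, n_top=2)
print(selected)
```

Here the second gene (index 1) switches between 0 and 10 across cells and ranks first, while the constant and unexpressed genes score zero. Raw variance-to-mean ratios do not separate biological from technical variability, which is one reason more elaborate statistical models exist.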
