• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scDetect:一种基于排序的集成学习算法,用于癌症中单细胞RNA测序的细胞类型识别。

scDetect: a rank-based ensemble learning algorithm for cell type identification of single-cell RNA sequencing in cancer.

作者信息

Shen Yifei, Chu Qinjie, Timko Michael P, Fan Longjiang

机构信息

China Department of Laboratory Medicine, First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, China.

China Key Laboratory of Clinical In Vitro Diagnostic Techniques of Zhejiang Province, Hangzhou 310003, China.

出版信息

Bioinformatics. 2021 Nov 18;37(22):4115-4122. doi: 10.1093/bioinformatics/btab410.

DOI:10.1093/bioinformatics/btab410
PMID:34048541
Abstract

MOTIVATION

Single-cell RNA sequencing (scRNA-seq) has enabled the characterization of different cell types in many tissues and tumor samples. Cell type identification is essential for single-cell RNA profiling, currently transforming the life sciences. Often, this is achieved by searching for combinations of genes that have previously been implicated as being cell-type specific, an approach that is not quantitative and does not explicitly take advantage of other scRNA-seq studies. Batch effects and different data platforms greatly decrease the predictive performance in inter-laboratory and different data type validation.

RESULTS

Here, we present a new ensemble learning method named as 'scDetect' that combines gene expression rank-based analysis and a majority vote ensemble machine-learning probability-based prediction method capable of highly accurate classification of cells based on scRNA-seq data by different sequencing platforms. Because of tumor heterogeneity, in order to accurately predict tumor cells in the single-cell RNA-seq data, we have also incorporated cell copy number variation consensus clustering and epithelial score in the classification. We applied scDetect to scRNA-seq data from pancreatic tissue, mononuclear cells and tumor biopsies cells and show that scDetect classified individual cells with high accuracy and better than other publicly available tools.

AVAILABILITY AND IMPLEMENTATION

scDetect is an open source software. Source code and test data is freely available from Github (https://github.com/IVDgenomicslab/scDetect/) and Zenodo (https://zenodo.org/record/4764132#.YKCOlrH5AYN). The examples and tutorial page is at https://ivdgenomicslab.github.io/scDetect-Introduction/. And scDetect will be available from Bioconductor.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

单细胞RNA测序(scRNA-seq)已能够对许多组织和肿瘤样本中的不同细胞类型进行表征。细胞类型识别对于单细胞RNA分析至关重要,目前正在改变生命科学。通常,这是通过寻找先前被认为是细胞类型特异性的基因组合来实现的,这种方法不是定量的,也没有明确利用其他scRNA-seq研究。批次效应和不同的数据平台大大降低了实验室间和不同数据类型验证中的预测性能。

结果

在此,我们提出了一种名为“scDetect”的新集成学习方法,该方法结合了基于基因表达排名的分析和基于多数投票集成机器学习概率的预测方法,能够基于不同测序平台的scRNA-seq数据对细胞进行高精度分类。由于肿瘤异质性,为了准确预测单细胞RNA-seq数据中的肿瘤细胞,我们还在分类中纳入了细胞拷贝数变异共识聚类和上皮评分。我们将scDetect应用于胰腺组织、单核细胞和肿瘤活检细胞的scRNA-seq数据,并表明scDetect能够高精度地对单个细胞进行分类,且优于其他公开可用的工具。

可用性和实现

scDetect是一个开源软件。源代码和测试数据可从Github(https://github.com/IVDgenomicslab/scDetect/)和Zenodo(https://zenodo.org/record/4764132#.YKCOlrH5AYN)免费获取。示例和教程页面位于https://ivdgenomicslab.github.io/scDetect-Introduction/。并且scDetect将可从Bioconductor获得。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
scDetect: a rank-based ensemble learning algorithm for cell type identification of single-cell RNA sequencing in cancer.scDetect:一种基于排序的集成学习算法,用于癌症中单细胞RNA测序的细胞类型识别。
Bioinformatics. 2021 Nov 18;37(22):4115-4122. doi: 10.1093/bioinformatics/btab410.
2
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
3
A machine learning-based method for automatically identifying novel cells in annotating single-cell RNA-seq data.基于机器学习的方法,用于自动识别注释单细胞 RNA-seq 数据中的新型细胞。
Bioinformatics. 2022 Oct 31;38(21):4885-4892. doi: 10.1093/bioinformatics/btac617.
4
Autoencoder-based cluster ensembles for single-cell RNA-seq data analysis.基于自动编码器的单细胞 RNA-seq 数据分析聚类集成。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 19):660. doi: 10.1186/s12859-019-3179-5.
5
CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones.CONICS 将单细胞 RNA-seq 与 DNA 测序相结合,将基因表达映射到肿瘤亚克隆。
Bioinformatics. 2018 Sep 15;34(18):3217-3219. doi: 10.1093/bioinformatics/bty316.
6
scPNMF: sparse gene encoding of single cells to facilitate gene selection for targeted gene profiling.scPNMF:稀疏的单细胞基因编码,以方便选择用于靶向基因分析的基因。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i358-i366. doi: 10.1093/bioinformatics/btab273.
7
CTISL: a dynamic stacking multi-class classification approach for identifying cell types from single-cell RNA-seq data.CTISL:一种动态堆叠多类分类方法,用于从单细胞 RNA-seq 数据中识别细胞类型。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae063.
8
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.
9
scGCL: an imputation method for scRNA-seq data based on graph contrastive learning.scGCL:一种基于图对比学习的 scRNA-seq 数据插补方法。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad098.
10
Joint learning dimension reduction and clustering of single-cell RNA-sequencing data.单细胞 RNA 测序数据的联合降维和聚类学习。
Bioinformatics. 2020 Jun 1;36(12):3825-3832. doi: 10.1093/bioinformatics/btaa231.

引用本文的文献

1
Mapping Cell Identity from scRNA-seq: A primer on computational methods.从单细胞RNA测序映射细胞身份:计算方法入门
Comput Struct Biotechnol J. 2025 Apr 2;27:1559-1569. doi: 10.1016/j.csbj.2025.03.051. eCollection 2025.
2
Applications of gene pair methods in clinical research: advancing precision medicine.基因对方法在临床研究中的应用:推动精准医学发展。
Mol Biomed. 2025 Apr 9;6(1):22. doi: 10.1186/s43556-025-00263-w.
3
CTISL: a dynamic stacking multi-class classification approach for identifying cell types from single-cell RNA-seq data.
CTISL:一种动态堆叠多类分类方法,用于从单细胞 RNA-seq 数据中识别细胞类型。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae063.
4
Artificial Intelligence-Assisted Transcriptomic Analysis to Advance Cancer Immunotherapy.人工智能辅助转录组分析推动癌症免疫治疗
J Clin Med. 2023 Feb 6;12(4):1279. doi: 10.3390/jcm12041279.