DIscBIO: A User-Friendly Pipeline for Biomarker Discovery in Single-Cell Transcriptomics.

Affiliations

Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, 0372 Oslo, Norway.

Oslo Centre for Biostatistics and Epidemiology, Faculty of Medicine, University of Oslo, 0372 Oslo, Norway.

Publication information

Int J Mol Sci. 2021 Jan 30;22(3):1399. doi: 10.3390/ijms22031399.

DOI:10.3390/ijms22031399

PMID:33573289

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7866810/

Abstract

The growing attention toward the benefits of single-cell RNA sequencing (scRNA-seq) has led to a myriad of computational packages for analyzing different aspects of scRNA-seq data. For researchers without advanced programming skills, combining several packages to perform the desired analysis in a simple and reproducible way is very challenging. Here we present DIscBIO, an open-source, multi-algorithmic pipeline for easy, efficient and reproducible analysis of cellular sub-populations at the transcriptomic level. The pipeline integrates multiple scRNA-seq packages. Starting from single-cell sequencing read counts, it performs clustering and differential analysis, and then supports biomarker discovery with decision trees and gene enrichment analysis in a network context. DIscBIO is freely available as an R package. It can be run either in command-line mode or through a user-friendly computational pipeline using Jupyter notebooks. We showcase all pipeline features using two scRNA-seq datasets. The first dataset consists of circulating tumor cells from patients with breast cancer. The second is a cell cycle regulation dataset in myxoid liposarcoma. All analyses are available as notebooks that integrate R code with explanatory text, output data and images in a sequential narrative. R users can use the notebooks to understand the different steps of the pipeline and as a guide for exploring their own scRNA-seq data. We also provide a cloud version using Binder that allows the pipeline to be executed without downloading R, Jupyter or any of the packages used by the pipeline. The cloud version can serve as a tutorial for training purposes, especially for those who are not R users or have limited programming skills. However, in order to do meaningful scRNA-seq analyses, all users will need to understand the implemented methods and their possible options and limitations.
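To make the sequence of steps named in the abstract concrete (read counts, clustering, differential analysis, decision-tree biomarker selection), here is a minimal sketch in plain Python on synthetic data. It is not DIscBIO's code and does not use its R API. The function names, the synthetic count model, the two-cluster k-means, the mean-difference ranking and the depth-1 decision tree are all illustrative assumptions chosen for brevity.

```python
import math
import random


def make_counts(n_cells=40, n_genes=30, marker=2, seed=0):
    """Synthetic read counts (assumption): the second half of the cells
    over-expresses a single marker gene."""
    rng = random.Random(seed)
    counts = []
    for cell in range(n_cells):
        row = [rng.randint(25, 35) for _ in range(n_genes)]
        if cell >= n_cells // 2:
            row[marker] += 300
        counts.append(row)
    return counts


def normalize(counts, scale=1e4):
    """Library-size normalization followed by log1p."""
    return [[math.log1p(v / sum(row) * scale) for v in row] for row in counts]


def kmeans2(data, iters=20):
    """Two-cluster k-means, seeded with the first and last cell so the
    result is deterministic."""
    centers = [data[0][:], data[-1][:]]
    labels = [0] * len(data)
    for _ in range(iters):
        labels = [
            min((0, 1), key=lambda k: sum((x - m) ** 2
                                          for x, m in zip(row, centers[k])))
            for row in data
        ]
        for k in (0, 1):
            members = [row for row, lab in zip(data, labels) if lab == k]
            if members:
                centers[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def rank_genes(data, labels):
    """Rank genes by the absolute difference of cluster means, a crude
    stand-in for a differential expression test."""
    scores = []
    for g in range(len(data[0])):
        a = [row[g] for row, lab in zip(data, labels) if lab == 0]
        b = [row[g] for row, lab in zip(data, labels) if lab == 1]
        scores.append((abs(sum(b) / len(b) - sum(a) / len(a)), g))
    return [g for _, g in sorted(scores, reverse=True)]


def decision_stump(data, labels, genes):
    """Depth-1 decision tree over candidate genes: return the
    (errors, gene, threshold) split with the fewest misclassified cells."""
    best = None
    for g in genes:
        values = sorted({row[g] for row in data})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            wrong = sum((row[g] > t) != bool(lab)
                        for row, lab in zip(data, labels))
            wrong = min(wrong, len(data) - wrong)  # either orientation
            if best is None or wrong < best[0]:
                best = (wrong, g, t)
    return best


if __name__ == "__main__":
    data = normalize(make_counts())
    labels = kmeans2(data)
    top = rank_genes(data, labels)[:3]
    print("clusters:", labels)
    print("top-ranked genes:", top)
    print("best split (errors, gene, threshold):",
          decision_stump(data, labels, top))
```

Restricting the decision stump to the top-ranked genes mirrors the order described in the abstract, where differential analysis precedes decision-tree biomarker selection. Real analyses would replace each step with a statistically sound method and validate the result on independent data.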

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e1/7866810/cac181f53589/ijms-22-01399-g001.jpg
