RASflow：一个基于 Snakemake 的 RNA-Seq 分析工作流程。

RASflow: an RNA-Seq analysis workflow with Snakemake.

机构信息

Computational Biology Unit, Department of Informatics, University of Bergen, Thormohlens Gate 55, Bergen, 5009, Norway.

出版信息

BMC Bioinformatics. 2020 Mar 18;21(1):110. doi: 10.1186/s12859-020-3433-x.

DOI:10.1186/s12859-020-3433-x

PMID:32183729

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7079470/

Abstract

BACKGROUND

With the cost of DNA sequencing decreasing, increasing amounts of RNA-Seq data are being generated giving novel insight into gene expression and regulation. Prior to analysis of gene expression, the RNA-Seq data has to be processed through a number of steps resulting in a quantification of expression of each gene/transcript in each of the analyzed samples. A number of workflows are available to help researchers perform these steps on their own data, or on public data to take advantage of novel software or reference data in data re-analysis. However, many of the existing workflows are limited to specific types of studies. We therefore aimed to develop a maximally general workflow, applicable to a wide range of data and analysis approaches and at the same time support research on both model and non-model organisms. Furthermore, we aimed to make the workflow usable also for users with limited programming skills.

RESULTS

Utilizing the workflow management system Snakemake and the package management system Conda, we have developed a modular, flexible and user-friendly RNA-Seq analysis workflow: RNA-Seq Analysis Snakemake Workflow (RASflow). Utilizing Snakemake and Conda alleviates challenges with library dependencies and version conflicts and also supports reproducibility. To be applicable for a wide variety of applications, RASflow supports the mapping of reads to both genomic and transcriptomic assemblies. RASflow has a broad range of potential users: it can be applied by researchers interested in any organism and since it requires no programming skills, it can be used by researchers with different backgrounds. The source code of RASflow is available on GitHub: https://github.com/zhxiaokang/RASflow.

CONCLUSIONS

RASflow is a simple and reliable RNA-Seq analysis workflow covering many use cases.

摘要

背景

随着 DNA 测序成本的降低，越来越多的 RNA-Seq 数据被生成，为基因表达和调控提供了新的见解。在分析基因表达之前，必须对 RNA-Seq 数据进行一系列处理，从而对每个分析样本中的每个基因/转录本的表达进行量化。有许多工作流程可帮助研究人员对自己的数据或公共数据执行这些步骤，以利用数据重新分析中的新型软件或参考数据。然而，许多现有的工作流程仅限于特定类型的研究。因此，我们旨在开发一种最大限度通用的工作流程，适用于广泛的数据和分析方法，同时支持对模型和非模型生物的研究。此外，我们旨在使该工作流程也可供具有有限编程技能的用户使用。

结果

我们利用工作流管理系统 Snakemake 和包管理系统 Conda，开发了一个模块化、灵活且用户友好的 RNA-Seq 分析工作流程：RNA-Seq 分析 Snakemake 工作流程 (RASflow)。利用 Snakemake 和 Conda，可以缓解库依赖和版本冲突带来的挑战，同时支持可重复性。为了适用于各种应用，RASflow 支持将读取映射到基因组和转录组组装。RASflow 拥有广泛的潜在用户：它可以应用于对任何生物体感兴趣的研究人员，并且由于它不需要编程技能，因此可以供具有不同背景的研究人员使用。RASflow 的源代码可在 GitHub 上获得：https://github.com/zhxiaokang/RASflow。

结论

RASflow 是一种简单可靠的 RNA-Seq 分析工作流程，涵盖了许多用例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f9a/7079470/a2f261187160/12859_2020_3433_Fig1_HTML.jpg

相似文献

RASflow: an RNA-Seq analysis workflow with Snakemake.

BMC Bioinformatics. 2020 Mar 18;21(1):110. doi: 10.1186/s12859-020-3433-x.

Natrix: a Snakemake-based workflow for processing, clustering, and taxonomically assigning amplicon sequencing reads.

BMC Bioinformatics. 2020 Nov 16;21(1):526. doi: 10.1186/s12859-020-03852-4.

RNA-Seq in Nonmodel Organisms.

Methods Mol Biol. 2021;2243:143-167. doi: 10.1007/978-1-0716-1103-6_8.

kGWASflow: a modular, flexible, and reproducible Snakemake workflow for k-mers-based GWAS.

G3 (Bethesda). 2023 Dec 29;14(1). doi: 10.1093/g3journal/jkad246.

VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis.

BMC Bioinformatics. 2018 Apr 12;19(1):135. doi: 10.1186/s12859-018-2139-9.

Single Cell Explorer, collaboration-driven tools to leverage large-scale single cell RNA-seq data.

BMC Genomics. 2019 Aug 27;20(1):676. doi: 10.1186/s12864-019-6053-y.

MosaiCatcher v2: a single-cell structural variations detection and analysis reference framework based on Strand-seq.

Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad633.

WASP: a versatile, web-accessible single cell RNA-Seq processing platform.

BMC Genomics. 2021 Mar 18;22(1):195. doi: 10.1186/s12864-021-07469-6.

OneStopRNAseq: A Web Application for Comprehensive and Efficient Analyses of RNA-Seq Data.

Genes (Basel). 2020 Oct 2;11(10):1165. doi: 10.3390/genes11101165.

ATLAS: a Snakemake workflow for assembly, annotation, and genomic binning of metagenome sequence data.

BMC Bioinformatics. 2020 Jun 22;21(1):257. doi: 10.1186/s12859-020-03585-4.

引用本文的文献

Reversing Preeclampsia Pathology: AXL Inhibition Restores Mitochondrial Function and ECM Balance.

Cells. 2025 Aug 8;14(16):1229. doi: 10.3390/cells14161229.

Amino acid changes in two viral proteins drive attenuation of the yellow fever 17D vaccine.

Nat Microbiol. 2025 Jul 8. doi: 10.1038/s41564-025-02047-y.

Abstraction hierarchy to define biofoundry workflows and operations for interoperable synthetic biology research and applications.

Nat Commun. 2025 Jul 1;16(1):6056. doi: 10.1038/s41467-025-61263-6.

Introgression dynamics of sex-linked chromosomal inversions shape the Malawi cichlid radiation.

Science. 2025 Jun 12;388(6752):eadr9961. doi: 10.1126/science.adr9961.

Effect of Corticosterone on Gene Expression in the Context of Global Hippocampal Transcription.

Int J Mol Sci. 2025 May 21;26(10):4889. doi: 10.3390/ijms26104889.

Small nucleolar RNAs promote the restoration of muscle differentiation defects in cells from myotonic dystrophy type 1.

Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf232.

Transcriptome analysis reveals effects of ethynylestradiol and bisphenol A on multiple endocrine and metabolic pathways in the pituitary and liver of female Atlantic cod ().

Front Endocrinol (Lausanne). 2025 Jan 27;15:1491432. doi: 10.3389/fendo.2024.1491432. eCollection 2024.

TrAnnoScope: A Modular Snakemake Pipeline for Full-Length Transcriptome Analysis and Functional Annotation.

Genes (Basel). 2024 Nov 29;15(12):1547. doi: 10.3390/genes15121547.

The evolution of reduced facilitation in a four-species bacterial community.

Evol Lett. 2024 Jul 19;8(6):828-840. doi: 10.1093/evlett/qrae036. eCollection 2024 Dec.

Iroquois homeobox 4 (IRX4) derived micropeptide promotes prostate cancer progression and chemoresistance through Wnt signalling dysregulation.

Commun Med (Lond). 2024 Nov 1;4(1):224. doi: 10.1038/s43856-024-00613-9.

本文引用的文献

Variant analysis pipeline for accurate detection of genomic variants from transcriptome sequencing data.

PLoS One. 2019 Sep 23;14(9):e0216838. doi: 10.1371/journal.pone.0216838. eCollection 2019.

RNA sequencing: the teenage years.

Nat Rev Genet. 2019 Nov;20(11):631-656. doi: 10.1038/s41576-019-0150-2. Epub 2019 Jul 24.

ARMOR: An utomated eproducible dular Workflow for Preprocessing and Differential Analysis of NA-seq Data.

G3 (Bethesda). 2019 Jul 9;9(7):2089-2096. doi: 10.1534/g3.119.400185. Print 2019 Jul 1.

UTAP: User-friendly Transcriptome Analysis Pipeline.

BMC Bioinformatics. 2019 Mar 25;20(1):154. doi: 10.1186/s12859-019-2728-2.

BioJupies: Automated Generation of Interactive Notebooks for RNA-Seq Data Analysis in the Cloud.

Cell Syst. 2018 Nov 28;7(5):556-561.e3. doi: 10.1016/j.cels.2018.10.007. Epub 2018 Nov 14.

Ensembl 2019.

Nucleic Acids Res. 2019 Jan 8;47(D1):D745-D751. doi: 10.1093/nar/gky1113.

ArrayExpress update - from bulk to single-cell expression data.

Nucleic Acids Res. 2019 Jan 8;47(D1):D711-D715. doi: 10.1093/nar/gky964.

RNA-Seq analysis of transcriptome responses in Atlantic cod (Gadus morhua) precision-cut liver slices exposed to benzo[a]pyrene and 17α-ethynylestradiol.

Aquat Toxicol. 2018 Aug;201:174-186. doi: 10.1016/j.aquatox.2018.06.003. Epub 2018 Jun 7.

VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis.

BMC Bioinformatics. 2018 Apr 12;19(1):135. doi: 10.1186/s12859-018-2139-9.

Gaining comprehensive biological insight into the transcriptome by performing a broad-spectrum RNA-seq analysis.

Nat Commun. 2017 Jul 5;8(1):59. doi: 10.1038/s41467-017-00050-4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

RASflow：一个基于 Snakemake 的 RNA-Seq 分析工作流程。

RASflow: an RNA-Seq analysis workflow with Snakemake.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献