孔隙大师：牛津纳米孔直接RNA测序数据集分析工作流程

MasterOfPores: A Workflow for the Analysis of Oxford Nanopore Direct RNA Sequencing Datasets.

作者信息

Cozzuto Luca, Liu Huanle, Pryszcz Leszek P, Pulido Toni Hermoso, Delgado-Tejedor Anna, Ponomarenko Julia, Novoa Eva Maria

机构信息

Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain.

International Institute of Molecular and Cell Biology, Warsaw, Poland.

出版信息

Front Genet. 2020 Mar 17;11:211. doi: 10.3389/fgene.2020.00211. eCollection 2020.

DOI:10.3389/fgene.2020.00211

PMID:32256520

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7089958/

Abstract

The direct RNA sequencing platform offered by Oxford Nanopore Technologies allows for direct measurement of RNA molecules without the need of conversion to complementary DNA, fragmentation or amplification. As such, it is virtually capable of detecting any given RNA modification present in the molecule that is being sequenced, as well as provide polyA tail length estimations at the level of individual RNA molecules. Although this technology has been publicly available since 2017, the complexity of the raw Nanopore data, together with the lack of systematic and reproducible pipelines, have greatly hindered the access of this technology to the general user. Here we address this problem by providing a fully benchmarked workflow for the analysis of direct RNA sequencing reads, termed . The pipeline starts with a pre-processing module, which converts raw current intensities into multiple types of processed data including FASTQ and BAM, providing metrics of the quality of the run, quality-filtering, demultiplexing, base-calling and mapping. In a second step, the pipeline performs downstream analyses of the mapped reads, including prediction of RNA modifications and estimation of polyA tail lengths. Four direct RNA MinION sequencing runs can be fully processed and analyzed in 10 h on 100 CPUs. The pipeline can also be executed in GPU locally or in the cloud, decreasing the run time fourfold. The software is written using the NextFlow framework for parallelization and portability, and relies on Linux containers such as Docker and Singularity for achieving better reproducibility. The workflow can be executed on any Unix-compatible OS on a computer, cluster or cloud without the need of installing any additional software or dependencies, and is freely available in Github (https://github.com/biocorecrg/master_of_pores). This workflow simplifies direct RNA sequencing data analyses, facilitating the study of the (epi)transcriptome at single molecule resolution.

摘要

牛津纳米孔技术公司提供的直接RNA测序平台能够直接测量RNA分子，无需将其转化为互补DNA、片段化或扩增。因此，它几乎能够检测正在测序的分子中存在的任何给定RNA修饰，并能在单个RNA分子水平上提供聚腺苷酸尾长度估计。尽管这项技术自2017年起就已公开可用，但原始纳米孔数据的复杂性，以及缺乏系统且可重复的流程，极大地阻碍了普通用户使用这项技术。在此，我们通过提供一个经过全面基准测试的工作流程来分析直接RNA测序读数，即。该流程从一个预处理模块开始，它将原始电流强度转换为多种类型的处理后数据，包括FASTQ和BAM，提供运行质量、质量过滤、解复用、碱基识别和映射的指标。第二步，该流程对映射后的读数进行下游分析，包括RNA修饰预测和聚腺苷酸尾长度估计。在100个CPU上，四个直接RNA MinION测序运行可以在10小时内完全处理和分析完毕。该流程也可以在本地或云端的GPU上执行，将运行时间缩短四倍。该软件使用NextFlow框架编写以实现并行化和可移植性，并依赖于Docker和Singularity等Linux容器以实现更好的可重复性。工作流程可以在计算机、集群或云端的任何Unix兼容操作系统上执行，无需安装任何额外软件或依赖项，并且可以在Github（https://github.com/biocorecrg/master_of_pores）上免费获取。这个工作流程简化了直接RNA测序数据分析，便于在单分子分辨率下研究（表观）转录组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f76/7089958/2f28db470f75/fgene-11-00211-g001.jpg

相似文献

MasterOfPores: A Workflow for the Analysis of Oxford Nanopore Direct RNA Sequencing Datasets.孔隙大师：牛津纳米孔直接RNA测序数据集分析工作流程

Front Genet. 2020 Mar 17;11:211. doi: 10.3389/fgene.2020.00211. eCollection 2020.

Nanopore Direct RNA Sequencing Data Processing and Analysis Using MasterOfPores.使用MasterOfPores进行纳米孔直接RNA测序数据处理与分析

Methods Mol Biol. 2023;2624:185-205. doi: 10.1007/978-1-0716-2962-8_13.

NanoSPC: a scalable, portable, cloud compatible viral nanopore metagenomic data processing pipeline.NanoSPC：一种可扩展、便携、与云兼容的病毒纳米孔宏基因组数据处理管道。

Nucleic Acids Res. 2020 Jul 2;48(W1):W366-W371. doi: 10.1093/nar/gkaa413.

DolphinNext: a distributed data processing platform for high throughput genomics.海豚下一代：一个用于高通量基因组学的分布式数据处理平台。

BMC Genomics. 2020 Apr 19;21(1):310. doi: 10.1186/s12864-020-6714-x.

Trans-NanoSim characterizes and simulates nanopore RNA-sequencing data.跨纳米模拟技术对纳米孔 RNA 测序数据进行了特征描述和模拟。

Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa061.

FASTdRNA: a workflow for the analysis of ONT direct RNA sequencing.FASTdRNA：一种用于纳米孔直接RNA测序分析的工作流程。

Bioinform Adv. 2023 Jul 20;3(1):vbad099. doi: 10.1093/bioadv/vbad099. eCollection 2023.

ModPhred: an integrative toolkit for the analysis and storage of nanopore sequencing DNA and RNA modification data.ModPhred：一个用于分析和存储纳米孔测序 DNA 和 RNA 修饰数据的集成工具包。

Bioinformatics. 2021 Dec 22;38(1):257-260. doi: 10.1093/bioinformatics/btab539.

COSAP: Comparative Sequencing Analysis Platform.COSAP：比较测序分析平台。

BMC Bioinformatics. 2024 Mar 26;25(1):130. doi: 10.1186/s12859-024-05756-z.

poreCov-An Easy to Use, Fast, and Robust Workflow for SARS-CoV-2 Genome Reconstruction Nanopore Sequencing.poreCov——一种用于SARS-CoV-2基因组重建的易于使用、快速且强大的纳米孔测序工作流程。

Front Genet. 2021 Jul 28;12:711437. doi: 10.3389/fgene.2021.711437. eCollection 2021.

MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction.MicroPIPE：验证用于高质量完整细菌基因组构建的端到端工作流程。

BMC Genomics. 2021 Jun 25;22(1):474. doi: 10.1186/s12864-021-07767-z.

引用本文的文献

MicroRNAs in long COVID: roles, diagnostic biomarker potential and detection.长新冠中的微小RNA：作用、诊断生物标志物潜力及检测

Hum Genomics. 2025 Aug 13;19(1):90. doi: 10.1186/s40246-025-00810-0.

Investigating RNA dynamics from single molecule transcriptomes.从单分子转录组研究RNA动态变化。

Trends Genet. 2025 Jun 4. doi: 10.1016/j.tig.2025.05.001.

Direct profiling of non-adenosines in poly(A) tails of endogenous and therapeutic mRNAs with Ninetails.使用九尾狐对内源和治疗性mRNA的聚腺苷酸尾中的非腺苷进行直接分析。

Nat Commun. 2025 Mar 18;16(1):2664. doi: 10.1038/s41467-025-57787-6.

Toward the use of nanopore RNA sequencing technologies in the clinic: challenges and opportunities.迈向纳米孔RNA测序技术在临床中的应用：挑战与机遇

Nucleic Acids Res. 2025 Feb 27;53(5). doi: 10.1093/nar/gkaf128.

Rapid and accurate demultiplexing of direct RNA nanopore sequencing data with SeqTagger.使用SeqTagger对直接RNA纳米孔测序数据进行快速准确的解复用。

Genome Res. 2025 Apr 14;35(4):956-966. doi: 10.1101/gr.279290.124.

Native RNA nanopore sequencing reveals antibiotic-induced loss of rRNA modifications in the A- and P-sites.天然 RNA 纳米孔测序揭示抗生素诱导 A 位和 P 位 rRNA 修饰的丢失。

Nat Commun. 2024 Nov 29;15(1):10054. doi: 10.1038/s41467-024-54368-x.

Mitochondrial transcriptome of Candida albicans in flagranti - direct RNA sequencing reveals a new layer of information.白色念珠菌活跃状态下的线粒体转录组 - 直接 RNA 测序揭示了新的信息层。

BMC Genomics. 2024 Sep 14;25(1):860. doi: 10.1186/s12864-024-10791-4.

The Use of Nanopore Sequencing to Analyze the Chloroplast Transcriptome Part I: Library Preparation.利用纳米孔测序分析叶绿体转录组第一部分：文库制备。

Methods Mol Biol. 2024;2776:243-257. doi: 10.1007/978-1-0716-3726-5_15.

Comprehensive map of ribosomal 2'-O-methylation and C/D box snoRNAs in Drosophila melanogaster.全面绘制果蝇核糖体 2'-O-甲基化和 C/D 盒 snoRNAs 图谱。

Nucleic Acids Res. 2024 Apr 12;52(6):2848-2864. doi: 10.1093/nar/gkae139.

The lncRNA Snhg11, a new candidate contributing to neurogenesis, plasticity, and memory deficits in Down syndrome.长链非编码 RNA Snhg11 是促进唐氏综合征神经发生、可塑性和记忆缺陷的新候选基因。

Mol Psychiatry. 2024 Jul;29(7):2117-2134. doi: 10.1038/s41380-024-02440-9. Epub 2024 Feb 27.

本文引用的文献

RNA modifications detection by comparative Nanopore direct RNA sequencing.通过比较纳米孔直接 RNA 测序检测 RNA 修饰。

Nat Commun. 2021 Dec 10;12(1):7198. doi: 10.1038/s41467-021-27393-3.

Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and mA modification.纳米孔直接 RNA 测序绘制拟南芥 mRNA 加工和 mA 修饰的复杂性图谱。

Elife. 2020 Jan 14;9:e49658. doi: 10.7554/eLife.49658.

Nanopore native RNA sequencing of a human poly(A) transcriptome.人 poly(A) 转录组的纳米孔天然 RNA 测序。

Nat Methods. 2019 Dec;16(12):1297-1305. doi: 10.1038/s41592-019-0617-2. Epub 2019 Nov 18.

Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules.使用 cDNA 和 RNA 分子的纳米孔测序对小鼠样本进行转录组谱分析。

Sci Rep. 2019 Oct 17;9(1):14908. doi: 10.1038/s41598-019-51470-9.

Accurate detection of mA RNA modifications in native RNA sequences.准确检测天然 RNA 序列中的 mA RNA 修饰。

Nat Commun. 2019 Sep 9;10(1):4079. doi: 10.1038/s41467-019-11713-9.

: alignment-free poly(A) length measurement for Oxford Nanopore RNA and DNA sequencing.无比对的 Oxford Nanopore RNA 和 DNA 测序的 poly(A) 长度测量。

RNA. 2019 Oct;25(10):1229-1241. doi: 10.1261/rna.071332.119. Epub 2019 Jul 2.

GenPipes: an open-source framework for distributed and scalable genomic analyses.GenPipes：一个用于分布式和可扩展基因组分析的开源框架。

Gigascience. 2019 Jun 1;8(6). doi: 10.1093/gigascience/giz037.

Stage-specific requirement for Mettl3-dependent mA mRNA methylation during haematopoietic stem cell differentiation.在造血干细胞分化过程中，Mettl3 依赖性 mA mRNA 甲基化的阶段特异性要求。

Nat Cell Biol. 2019 Jun;21(6):700-709. doi: 10.1038/s41556-019-0318-1. Epub 2019 May 6.

SciPipe: A workflow library for agile development of complex and dynamic bioinformatics pipelines.SciPipe：一个用于敏捷开发复杂和动态生物信息学管道的工作流库。

Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz044.

m6A-Dependent RNA Dynamics in T Cell Differentiation.m6A 依赖性 RNA 动力学在 T 细胞分化中的作用。

Genes (Basel). 2019 Jan 8;10(1):28. doi: 10.3390/genes10010028.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

孔隙大师：牛津纳米孔直接RNA测序数据集分析工作流程

MasterOfPores: A Workflow for the Analysis of Oxford Nanopore Direct RNA Sequencing Datasets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献