简而言之：用短读长差异表达分析工具解锁纳米孔长读长RNA测序数据。

The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools.

作者信息

Dong Xueyi, Tian Luyi, Gouil Quentin, Kariyawasam Hasaru, Su Shian, De Paoli-Iseppi Ricardo, Prawer Yair David Joseph, Clark Michael B, Breslin Kelsey, Iminitoff Megan, Blewitt Marnie E, Law Charity W, Ritchie Matthew E

机构信息

Epigenetics and Development Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.

Centre for Stem Cell Systems, Department of Anatomy and Neuroscience, The University of Melbourne, Parkville, Victoria 3010, Australia.

出版信息

NAR Genom Bioinform. 2021 Apr 26;3(2):lqab028. doi: 10.1093/nargab/lqab028. eCollection 2021 Jun.

DOI:10.1093/nargab/lqab028

PMID:33937765

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8074342/

Abstract

Application of Oxford Nanopore Technologies' long-read sequencing platform to transcriptomic analysis is increasing in popularity. However, such analysis can be challenging due to the high sequence error and small library sizes, which decreases quantification accuracy and reduces power for statistical testing. Here, we report the analysis of two nanopore RNA-seq datasets with the goal of obtaining gene- and isoform-level differential expression information. A dataset of synthetic, spliced, spike-in RNAs ('sequins') as well as a mouse neural stem cell dataset from samples with a null mutation of the epigenetic regulator was analysed using a mix of long-read specific tools for preprocessing together with established short-read RNA-seq methods for downstream analysis. We used to perform differential gene expression analysis, and the novel pipeline to perform isoform identification and quantification, followed by and (with ) to perform differential transcript usage analysis. We compared results from the sequins dataset to the ground truth, and results of the mouse dataset to a previous short-read study on equivalent samples. Overall, our work shows that transcriptomic analysis of long-read nanopore data using long-read specific preprocessing methods together with short-read differential expression methods and software that are already in wide use can yield meaningful results.

摘要

牛津纳米孔技术公司的长读长测序平台在转录组分析中的应用越来越普遍。然而，由于序列错误率高和文库规模小，这种分析可能具有挑战性，这会降低定量准确性并减少统计检验的效力。在此，我们报告了对两个纳米孔RNA测序数据集的分析，目的是获得基因和异构体水平的差异表达信息。使用长读长特有的预处理工具组合以及用于下游分析的既定短读长RNA测序方法，分析了合成的、剪接的、掺入的RNA（“测序标准品”）数据集以及来自表观遗传调控因子无效突变样本的小鼠神经干细胞数据集。我们使用进行差异基因表达分析，并使用新颖的流程进行异构体鉴定和定量，随后使用和（搭配）进行差异转录本使用分析。我们将测序标准品数据集的结果与真实情况进行比较，并将小鼠数据集的结果与之前对等效样本的短读长研究结果进行比较。总体而言，我们的工作表明，使用长读长特有的预处理方法以及已广泛使用的短读长差异表达方法和软件对长读长纳米孔数据进行转录组分析可以产生有意义的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8611/8074342/c1346ef8d29a/lqab028fig1.jpg

相似文献

The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools.简而言之：用短读长差异表达分析工具解锁纳米孔长读长RNA测序数据。

NAR Genom Bioinform. 2021 Apr 26;3(2):lqab028. doi: 10.1093/nargab/lqab028. eCollection 2021 Jun.

Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures.基于计算机模拟混合物对长读 RNA 测序分析工具进行基准测试。

Nat Methods. 2023 Nov;20(11):1810-1821. doi: 10.1038/s41592-023-02026-3. Epub 2023 Oct 2.

Methodologies for Transcript Profiling Using Long-Read Technologies.使用长读长技术进行转录本分析的方法

Front Genet. 2020 Jul 7;11:606. doi: 10.3389/fgene.2020.00606. eCollection 2020.

Long-read RNA sequencing identifies region- and sex-specific C57BL/6J mouse brain mRNA isoform expression and usage.长读 RNA 测序鉴定 C57BL/6J 小鼠脑 mRNA 异构体表达和使用的区域和性别特异性。

Mol Brain. 2024 Jun 20;17(1):40. doi: 10.1186/s13041-024-01112-7.

PSI-Sigma: a comprehensive splicing-detection method for short-read and long-read RNA-seq analysis.PSI-Sigma：一种用于短读长读 RNA-seq 分析的综合剪接检测方法。

Bioinformatics. 2019 Dec 1;35(23):5048-5054. doi: 10.1093/bioinformatics/btz438.

Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析

Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.

Comparative assessment of long-read error correction software applied to Nanopore RNA-sequencing data.应用于纳米孔RNA测序数据的长读长纠错软件的比较评估

Brief Bioinform. 2020 Jul 15;21(4):1164-1181. doi: 10.1093/bib/bbz058.

Identification of Protein Isoforms Using Reference Databases Built from Long and Short Read RNA-Sequencing.使用基于长读和短读 RNA 测序构建的参考数据库鉴定蛋白质同工型。

J Proteome Res. 2022 Jul 1;21(7):1628-1639. doi: 10.1021/acs.jproteome.1c00968. Epub 2022 May 25.

Identification of cell barcodes from long-read single-cell RNA-seq with BLAZE.使用 BLAZE 从长读单细胞 RNA-seq 中识别细胞条码。

Genome Biol. 2023 Apr 6;24(1):66. doi: 10.1186/s13059-023-02907-y.

Long-read RNA sequencing identifies region- and sex-specific C57BL/6J mouse brain mRNA isoform expression and usage.长读长RNA测序鉴定了C57BL/6J小鼠脑区和性别特异性mRNA异构体的表达及使用情况。

bioRxiv. 2024 Jan 11:2024.01.11.575219. doi: 10.1101/2024.01.11.575219.

引用本文的文献

MicroRNAs in long COVID: roles, diagnostic biomarker potential and detection.长新冠中的微小RNA：作用、诊断生物标志物潜力及检测

Hum Genomics. 2025 Aug 13;19(1):90. doi: 10.1186/s40246-025-00810-0.

Comparison of single-cell long-read and short-read transcriptome sequencing via cDNA molecule matching: quality evaluation of the MAS-ISO-seq approach.通过cDNA分子匹配比较单细胞长读长和短读长转录组测序：MAS-ISO-seq方法的质量评估

NAR Genom Bioinform. 2025 Jul 4;7(3):lqaf089. doi: 10.1093/nargab/lqaf089. eCollection 2025 Sep.

Transcriptomics in the era of long-read sequencing.长读长测序时代的转录组学

Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00828-z.

Identification of reproduction-related genes in the hypothalamus of sheep (Ovis aries) using the nanopore full-length transcriptome sequencing technology.利用纳米孔全长转录组测序技术鉴定绵羊（Ovis aries）下丘脑的生殖相关基因。

Sci Rep. 2024 Nov 13;14(1):27884. doi: 10.1038/s41598-024-79140-5.

Accurate long-read transcript discovery and quantification at single-cell, pseudo-bulk and bulk resolution with Isosceles.使用等腰 Iso-Seq 技术实现单细胞、拟群体和群体水平的准确长读转录本发现和定量。

Nat Commun. 2024 Aug 25;15(1):7316. doi: 10.1038/s41467-024-51584-3.

DNMT3B splicing dysregulation mediated by SMCHD1 loss contributes to DUX4 overexpression and FSHD pathogenesis.SMCHD1 缺失导致的 DNMT3B 剪接失调导致 DUX4 过表达和 FSHD 发病机制。

Sci Adv. 2024 May 31;10(22):eadn7732. doi: 10.1126/sciadv.adn7732. Epub 2024 May 29.

Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data.长读测序数据中 mRNA 异构体检测方法的综合评估。

Nat Commun. 2024 May 10;15(1):3972. doi: 10.1038/s41467-024-48117-3.

Merging short and stranded long reads improves transcript assembly.短读和单链长读的合并提高了转录本组装。

PLoS Comput Biol. 2023 Oct 26;19(10):e1011576. doi: 10.1371/journal.pcbi.1011576. eCollection 2023 Oct.

Identification and quantification of small exon-containing isoforms in long-read RNA sequencing data.在长读 RNA 测序数据中鉴定和定量含有小外显子的异构体。

Nucleic Acids Res. 2023 Nov 10;51(20):e104. doi: 10.1093/nar/gkad810.

Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures.基于计算机模拟混合物对长读 RNA 测序分析工具进行基准测试。

Nat Methods. 2023 Nov;20(11):1810-1821. doi: 10.1038/s41592-023-02026-3. Epub 2023 Oct 2.

本文引用的文献

Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing.利用长读测序技术全面描述人类和小鼠单细胞全长异构体。

Genome Biol. 2021 Nov 11;22(1):310. doi: 10.1186/s13059-021-02525-6.

Full-length transcript characterization of SF3B1 mutation in chronic lymphocytic leukemia reveals downregulation of retained introns.全长转录本特征分析揭示慢性淋巴细胞白血病 SF3B1 突变导致内含子滞留下调。

Nat Commun. 2020 Mar 18;11(1):1438. doi: 10.1038/s41467-020-15171-6.

De Novo Clustering of Long-Read Transcriptome Data Using a Greedy, Quality Value-Based Algorithm.使用基于质量值的贪婪算法对长读长转录组数据进行从头聚类

J Comput Biol. 2020 Apr;27(4):472-484. doi: 10.1089/cmb.2019.0299. Epub 2020 Mar 16.

Direct full-length RNA sequencing reveals unexpected transcriptome complexity during development.直接全长 RNA 测序揭示了发育过程中意想不到的转录组复杂性。

Genome Res. 2020 Feb;30(2):287-298. doi: 10.1101/gr.251512.119. Epub 2020 Feb 5.

Generation of a Transcriptional Radiation Exposure Signature in Human Blood Using Long-Read Nanopore Sequencing.使用长读长纳米孔测序在人血中生成转录辐射暴露特征。

Radiat Res. 2020 Feb;193(2):143-154. doi: 10.1667/RR15476.1. Epub 2019 Dec 12.

A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes.对纳米孔天然 RNA 测序进行全面考察，以对复杂转录组进行特征分析。

Nat Commun. 2019 Jul 31;10(1):3359. doi: 10.1038/s41467-019-11272-z.

The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads.Rsubread 软件包在 RNA 测序reads 的比对和定量方面，具有更简单、更快、更便宜和更好的优势。

Nucleic Acids Res. 2019 May 7;47(8):e47. doi: 10.1093/nar/gkz114.

GENCODE reference annotation for the human and mouse genomes.GENCODE 人类和小鼠基因组参考注释。

Nucleic Acids Res. 2019 Jan 8;47(D1):D766-D773. doi: 10.1093/nar/gky955.

Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.顺流而下：鲑鱼定量后差异转录本使用情况的统计分析

F1000Res. 2018 Jun 27;7:952. doi: 10.12688/f1000research.15398.3. eCollection 2018.

Single-cell isoform RNA sequencing characterizes isoforms in thousands of cerebellar cells.单细胞异构体RNA测序对数千个小脑细胞中的异构体进行了表征。

Nat Biotechnol. 2018 Oct 15. doi: 10.1038/nbt.4259.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

简而言之：用短读长差异表达分析工具解锁纳米孔长读长RNA测序数据。

The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献