• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

scnanoseq:用于牛津纳米孔单细胞RNA测序的nf-core流程

scnanoseq: an nf-core pipeline for Oxford Nanopore single-cell RNA-sequencing.

作者信息

Trull Austyn, Worthey Elizabeth A, Ianov Lara

机构信息

Institutional Research Core Program-Biological Data Science Core, University of Alabama at Birmingham, Birmingham, AL, 35233, United States.

Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, 35233, United States.

出版信息

Bioinformatics. 2025 Sep 1;41(9). doi: 10.1093/bioinformatics/btaf487.

DOI:10.1093/bioinformatics/btaf487
PMID:40905625
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12449243/
Abstract

MOTIVATION

Recent advancements in long-read single-cell RNA sequencing (scRNA-seq) have facilitated the quantification of full-length transcripts and isoforms at the single-cell level. Historically, long-read data would need to be complemented with short-read single-cell data in order to overcome the higher sequencing errors to correctly identify cellular barcodes and unique molecular identifiers. Improvements in Oxford Nanopore sequencing, and development of novel computational methods have removed this requirement. Though these methods now exist, the limited availability of modular and portable workflows remains a challenge.

RESULTS

Here, we present, nf-core/scnanoseq, a secondary analysis pipeline for long-read single-cell and single-nuclei RNA that delivers gene and transcript-level quantification. The scnanoseq pipeline is implemented using Nextflow and is built upon the nf-core framework, enabling portability across computational environments, scalability and reproducibility of results across pipeline runs. The nf-core/scnanoseq workflow follows best practices for analyzing single-cell and single-nuclei data, performing barcode detection and correction, genome and transcriptome read alignment, unique molecular identifier deduplication, gene and transcript quantification, and extensive quality control reporting.

AVAILABILITY AND IMPLEMENTATION

The source code, and detailed documentation are freely available at https://github.com/nf-core/scnanoseq and https://nf-co.re/scnanoseq under the MIT License. Documentation for the version of nf-core/scnanoseq used for this paper, including default parameters and descriptions of output files are available at https://nf-co.re/scnanoseq/1.1.0.

摘要

动机

长读长单细胞RNA测序(scRNA-seq)的最新进展推动了在单细胞水平上对全长转录本和异构体进行定量分析。从历史上看,长读长数据需要与短读长单细胞数据相结合,以克服较高的测序错误率,从而正确识别细胞条形码和独特分子标识符。牛津纳米孔测序技术的改进以及新型计算方法的开发消除了这一需求。尽管现在已有这些方法,但模块化和便携式工作流程的可用性有限仍然是一个挑战。

结果

在此,我们展示了nf-core/scnanoseq,这是一个用于长读长单细胞和单细胞核RNA的二级分析流程,可实现基因和转录本水平的定量分析。scnanoseq流程使用Nextflow实现,并基于nf-core框架构建,能够在不同计算环境中实现可移植性,以及在不同流程运行中实现结果的可扩展性和可重复性。nf-core/scnanoseq工作流程遵循分析单细胞和单细胞核数据的最佳实践,执行条形码检测和校正、基因组和转录组读段比对、独特分子标识符去重、基因和转录本定量分析以及广泛的质量控制报告。

可用性与实现方式

源代码和详细文档可在https://github.com/nf-core/scnanoseq和https://nf-co.re/scnanoseq上免费获取,遵循MIT许可协议。本文使用的nf-core/scnanoseq版本的文档,包括默认参数和输出文件描述,可在https://nf-co.re/scnanoseq/1.1.0获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb8d/12449243/f08a9e4537af/btaf487f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb8d/12449243/5432a256f62e/btaf487f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb8d/12449243/f08a9e4537af/btaf487f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb8d/12449243/5432a256f62e/btaf487f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb8d/12449243/f08a9e4537af/btaf487f2.jpg

相似文献

1
scnanoseq: an nf-core pipeline for Oxford Nanopore single-cell RNA-sequencing.scnanoseq:用于牛津纳米孔单细胞RNA测序的nf-core流程
Bioinformatics. 2025 Sep 1;41(9). doi: 10.1093/bioinformatics/btaf487.
2
An open-source nanopore-only sequencing workflow for analysis of clonal outbreaks delivers short-read level accuracy.一种用于分析克隆性暴发的仅基于纳米孔的开源测序工作流程可实现短读长水平的准确性。
J Clin Microbiol. 2025 Jul 18:e0066425. doi: 10.1128/jcm.00664-25.
3
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.
4
nf-core/pacvar: a pipeline for analyzing long-read PacBio whole genome and repeat expansion sequencing data.nf-core/pacvar:一个用于分析长读长PacBio全基因组和重复序列扩增测序数据的流程。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf116.
5
Dogme: A nextflow pipeline for reprocessing nanopore RNA and DNA modifications.Dogme:用于重新处理纳米孔RNA和DNA修饰的Nextflow管道。
bioRxiv. 2025 Jun 11:2025.06.04.657941. doi: 10.1101/2025.06.04.657941.
6
clonevdjseq: A workflow and bioinformatics management system for sequencing, archiving, and analysis of VDJ sequences from clonal libraries.克隆VDJ序列分析:一种用于克隆文库中VDJ序列测序、存档和分析的工作流程及生物信息学管理系统。
BMC Bioinformatics. 2025 Jul 21;26(1):186. doi: 10.1186/s12859-025-06107-2.
7
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
8
Lessons learned: overcoming common challenges in reconstructing the SARS-CoV-2 genome from short-read sequencing data via CoVpipe2.经验教训:通过CoVpipe2从短读长测序数据重建严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组时克服常见挑战。
F1000Res. 2024 Apr 16;12:1091. doi: 10.12688/f1000research.136683.1. eCollection 2023.
9
SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.SQANTI:用于全长转录组鉴定和定量的长读转录序列的广泛特征化,以进行质量控制。
Genome Res. 2018 Mar 1;28(3):396-411. doi: 10.1101/gr.222976.117.
10
nf-core/marsseq: systematic preprocessing pipeline for MARS-seq experiments.nf-core/marsseq:用于MARS-seq实验的系统预处理流程。
Bioinform Adv. 2025 May 23;5(1):vbaf089. doi: 10.1093/bioadv/vbaf089. eCollection 2025.

本文引用的文献

1
Empowering bioinformatics communities with Nextflow and nf-core.借助Nextflow和nf-core助力生物信息学社区。
Genome Biol. 2025 Jul 29;26(1):228. doi: 10.1186/s13059-025-03673-9.
2
Oarfish: enhanced probabilistic modeling leads to improved accuracy in long read transcriptome quantification.皇带鱼:增强的概率模型可提高长读长转录组定量的准确性。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i304-i313. doi: 10.1093/bioinformatics/btaf240.
3
Scywalker: scalable end-to-end data analysis workflow for long-read single-cell transcriptome sequencing.
Scywalker:适用于长读长单细胞转录组测序的可扩展端到端数据分析工作流程。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae549.
4
Advances in long-read single-cell transcriptomics.长读长时程单细胞转录组学的进展。
Hum Genet. 2024 Oct;143(9-10):1005-1020. doi: 10.1007/s00439-024-02678-x. Epub 2024 May 24.
5
NanoPack2: population-scale evaluation of long-read sequencing data.NanoPack2:长读测序数据的大规模评估。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad311.
6
High-throughput and high-accuracy single-cell RNA isoform analysis using PacBio circular consensus sequencing.使用 PacBio 环形一致测序进行高通量、高精度的单细胞 RNA 异构体分析。
Nat Commun. 2023 May 6;14(1):2631. doi: 10.1038/s41467-023-38324-9.
7
Identification of cell barcodes from long-read single-cell RNA-seq with BLAZE.使用 BLAZE 从长读单细胞 RNA-seq 中识别细胞条码。
Genome Biol. 2023 Apr 6;24(1):66. doi: 10.1186/s13059-023-02907-y.
8
Accurate isoform discovery with IsoQuant using long reads.利用长读长 IsoQuant 进行准确的异构体发现。
Nat Biotechnol. 2023 Jul;41(7):915-918. doi: 10.1038/s41587-022-01565-y. Epub 2023 Jan 2.
9
Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing.利用长读测序技术全面描述人类和小鼠单细胞全长异构体。
Genome Biol. 2021 Nov 11;22(1):310. doi: 10.1186/s13059-021-02525-6.
10
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.