• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Tranquillyzer:一种用于长读长转录组结构注释和多路分解的灵活神经网络框架。

Tranquillyzer: A Flexible Neural Network Framework for Structural Annotation and Demultiplexing of Long-Read Transcriptomes.

作者信息

Semwal Ayush, Morrison Jacob, Beddows Ian, Palmer Theron, Majewski Mary F, Jang H Josh, Johnson Benjamin K, Shen Hui

出版信息

bioRxiv. 2025 Jul 31:2025.07.25.666829. doi: 10.1101/2025.07.25.666829.

DOI:10.1101/2025.07.25.666829
PMID:40766630
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12324178/
Abstract

Long-read single-cell RNA sequencing using platforms such as Oxford Nanopore Technologies (ONT) enables full-length transcriptome profiling at single-cell resolution. However, high sequencing error rates, diverse library architectures, and increasing dataset scale introduce major challenges for accurately identifying cell barcodes (CBCs) and unique molecular identifiers (UMIs) - key prerequisites for reliable demultiplexing and deduplication, respectively. Existing pipelines rely on hard-coded heuristics or local transition rules that cannot fully capture this broader structural context and often fail to robustly interpret reads with indel-induced shifts, truncated segments, or non-canonical element ordering. We introduce (TRANscript QUantification In Long reads-anaLYZER), a flexible, architecture-aware deep learning framework for processing long-read single-cell RNA-seq data. employs a hybrid neural network architecture and a global, context-aware design, and enables precise identification of structural elements - even when elements are shifted, partially degraded, or repeated due to sequencing noise or library construction variability. In addition to supporting established single-cell protocols, accommodates custom library formats through rapid, one-time model training on user-defined label schemas, typically completed within a few hours on standard GPUs. Additional features such as scalability across large datasets and comprehensive visualization capabilities further position as a flexible and scalable framework solution for processing long-read single-cell transcriptomic datasets.

摘要

使用牛津纳米孔技术(ONT)等平台进行的长读长单细胞RNA测序能够在单细胞分辨率下进行全长转录组分析。然而,高测序错误率、多样的文库结构以及不断增加的数据集规模,给准确识别细胞条形码(CBC)和独特分子标识符(UMI)带来了重大挑战,而这分别是可靠的多路分解和重复数据删除的关键前提条件。现有的流程依赖于硬编码的启发式方法或局部转换规则,这些方法无法完全捕捉这种更广泛的结构背景,并且常常无法稳健地解读因插入缺失导致移位、片段截断或非规范元件排序的 reads。我们引入了TRANscript QUantification In Long reads-anaLYZER(长读长转录本定量分析器),这是一个灵活的、具有结构感知能力的深度学习框架,用于处理长读长单细胞RNA测序数据。该框架采用混合神经网络架构和全局的、上下文感知设计,即使元件因测序噪声或文库构建变异性而移位、部分降解或重复时,也能精确识别结构元件。除了支持已有的单细胞协议外,通过对用户定义的标签模式进行快速的一次性模型训练,该框架还能适应自定义文库格式,在标准GPU上通常只需几个小时即可完成。诸如跨大型数据集的可扩展性和全面的可视化功能等其他特性,进一步将该框架定位为处理长读长单细胞转录组数据集的灵活且可扩展的框架解决方案。

相似文献

1
Tranquillyzer: A Flexible Neural Network Framework for Structural Annotation and Demultiplexing of Long-Read Transcriptomes.Tranquillyzer:一种用于长读长转录组结构注释和多路分解的灵活神经网络框架。
bioRxiv. 2025 Jul 31:2025.07.25.666829. doi: 10.1101/2025.07.25.666829.
2
Short-Term Memory Impairment短期记忆障碍
3
Comparison of Illumina and Oxford Nanopore Technology systems for the genomic characterization of .用于……基因组特征分析的Illumina和牛津纳米孔技术系统的比较
Microbiol Spectr. 2025 Jul;13(7):e0129424. doi: 10.1128/spectrum.01294-24. Epub 2025 May 28.
4
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.
5
Single-Cell Omics for Transcriptome CHaracterization (SCOTCH): isoform-level characterization of gene expression through long-read single-cell RNA sequencing.用于转录组表征的单细胞组学(SCOTCH):通过长读长单细胞RNA测序进行基因表达的异构体水平表征。
bioRxiv. 2025 Feb 6:2024.04.29.590597. doi: 10.1101/2024.04.29.590597.
6
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
7
An open-source nanopore-only sequencing workflow for analysis of clonal outbreaks delivers short-read level accuracy.一种用于分析克隆性暴发的仅基于纳米孔的开源测序工作流程可实现短读长水平的准确性。
J Clin Microbiol. 2025 Jul 18:e0066425. doi: 10.1128/jcm.00664-25.
8
Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.高质量的长读长序列对于实现生物多样性基因组学的潜力至关重要。
BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9.
9
Influence of Sequencing Technology on Pangenome-level Analysis and Detection of Antimicrobial Resistance Genes in ESKAPE Pathogens.测序技术对泛基因组水平分析及ESKAPE病原体中抗菌药物耐药基因检测的影响
bioRxiv. 2025 Jan 10:2025.01.08.631980. doi: 10.1101/2025.01.08.631980.
10
stGNN: Spatially Informed Cell-Type Deconvolution Based on Deep Graph Learning and Statistical Modeling.stGNN:基于深度图学习和统计建模的空间信息细胞类型反卷积
Interdiscip Sci. 2025 Jun 26. doi: 10.1007/s12539-025-00728-0.

本文引用的文献

1
Advances in single-cell long-read sequencing technologies.单细胞长读长测序技术的进展
NAR Genom Bioinform. 2024 May 20;6(2):lqae047. doi: 10.1093/nargab/lqae047. eCollection 2024 Jun.
2
High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors.高通量单细胞长读测序分析人类肿瘤中同一细胞的基因型和表型。
Nat Commun. 2023 Jul 11;14(1):4124. doi: 10.1038/s41467-023-39813-7.
3
High-throughput RNA isoform sequencing using programmed cDNA concatenation.使用可编程 cDNA 连接的高通量 RNA 异构体测序。
Nat Biotechnol. 2024 Apr;42(4):582-586. doi: 10.1038/s41587-023-01815-7. Epub 2023 Jun 8.
4
Identification of cell barcodes from long-read single-cell RNA-seq with BLAZE.使用 BLAZE 从长读单细胞 RNA-seq 中识别细胞条码。
Genome Biol. 2023 Apr 6;24(1):66. doi: 10.1186/s13059-023-02907-y.
5
Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments.在单细胞RNA测序实验的短读长和长读长数据中实现细胞条形码的快速准确匹配。
iScience. 2022 Jun 7;25(7):104530. doi: 10.1016/j.isci.2022.104530. eCollection 2022 Jul 15.
6
From bulk, single-cell to spatial RNA sequencing.从批量、单细胞到空间 RNA 测序。
Int J Oral Sci. 2021 Nov 15;13(1):36. doi: 10.1038/s41368-021-00146-0.
7
Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing.利用长读测序技术全面描述人类和小鼠单细胞全长异构体。
Genome Biol. 2021 Nov 11;22(1):310. doi: 10.1186/s13059-021-02525-6.
8
Nanopore sequencing of single-cell transcriptomes with scCOLOR-seq.使用scCOLOR-seq对单细胞转录组进行纳米孔测序。
Nat Biotechnol. 2021 Dec;39(12):1517-1520. doi: 10.1038/s41587-021-00965-w. Epub 2021 Jul 1.
9
Single cell transcriptome sequencing on the Nanopore platform with ScNapBar.使用ScNapBar在纳米孔平台上进行单细胞转录组测序。
RNA. 2021 Apr 27;27(7):763-70. doi: 10.1261/rna.078154.120.
10
HTSlib: C library for reading/writing high-throughput sequencing data.HTSlib:用于读取/写入高通量测序数据的 C 库。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab007.