• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过TranSigner对长RNA测序读数进行准确分配来增强转录组表达定量。

Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.

作者信息

Ji Hyun Joo, Pertea Mihaela

机构信息

Center for Computational Biology, Johns Hopkins University; Baltimore, MD.

Department of Computer Science, Johns Hopkins University; Baltimore, MD.

出版信息

bioRxiv. 2024 Aug 17:2024.04.13.589356. doi: 10.1101/2024.04.13.589356.

DOI:10.1101/2024.04.13.589356
PMID:39185147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11343119/
Abstract

Recently developed long-read RNA sequencing technologies promise to provide a more accurate and comprehensive view of transcriptomes compared to short-read sequencers, primarily due to their capability to achieve full-length sequencing of transcripts. However, realizing this potential requires computational tools tailored to process long reads, which exhibit a higher error rate than short reads. Existing methods for assembling and quantifying long-read data often disagree on expressed transcripts and their abundance levels, leading researchers to lack confidence in the transcriptomes produced using this data. One approach to address the uncertainties in transcriptome assembly and quantification is by assigning the long reads to transcripts, enabling a more detailed characterization of transcript support at the read level. Here, we introduce TranSigner, a versatile tool that assigns long reads to any input transcriptome. TranSigner consists of three consecutive modules performing: read alignment to the given transcripts, computation of read-to-transcript compatibility based on alignment scores and positions, and execution of an expectation-maximization algorithm to probabilistically assign reads to transcripts and estimate transcript abundances. Using simulated data and experimental datasets from three well-studied organisms - , and - we demonstrate that TranSigner achieves accurate read assignments, obtaining higher accuracy in transcript abundance estimation compared to existing tools.

摘要

与短读长测序仪相比,最近开发的长读长RNA测序技术有望提供更准确、更全面的转录组视图,这主要归功于它们能够实现转录本的全长测序。然而,要实现这一潜力,需要有专门用于处理长读长的计算工具,因为长读长的错误率比短读长高。现有的长读长数据组装和定量方法在表达的转录本及其丰度水平上常常存在分歧,这使得研究人员对使用这些数据生成的转录组缺乏信心。解决转录组组装和定量不确定性的一种方法是将长读长分配到转录本上,从而能够在读取水平上更详细地描述转录本支持情况。在此,我们介绍TranSigner,这是一种通用工具,可将长读长分配到任何输入的转录组。TranSigner由三个连续的模块组成,分别执行:将读取与给定的转录本进行比对、根据比对分数和位置计算读取与转录本的兼容性,以及执行期望最大化算法以概率方式将读取分配到转录本并估计转录本丰度。使用来自三种深入研究的生物——、和——的模拟数据和实验数据集,我们证明TranSigner实现了准确的读取分配,与现有工具相比,在转录本丰度估计方面获得了更高的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/558914f687b7/nihpp-2024.04.13.589356v3-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/068d6aec214b/nihpp-2024.04.13.589356v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/5b7e3d1600ad/nihpp-2024.04.13.589356v3-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/09b2c856bf5d/nihpp-2024.04.13.589356v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/fd03cf2cbb4f/nihpp-2024.04.13.589356v3-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/a11817864ef0/nihpp-2024.04.13.589356v3-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/477ffa7462ec/nihpp-2024.04.13.589356v3-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/1e093d845c41/nihpp-2024.04.13.589356v3-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/b1bab4c1ced5/nihpp-2024.04.13.589356v3-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/8e01ca49cf6a/nihpp-2024.04.13.589356v3-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/558914f687b7/nihpp-2024.04.13.589356v3-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/068d6aec214b/nihpp-2024.04.13.589356v3-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/5b7e3d1600ad/nihpp-2024.04.13.589356v3-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/09b2c856bf5d/nihpp-2024.04.13.589356v3-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/fd03cf2cbb4f/nihpp-2024.04.13.589356v3-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/a11817864ef0/nihpp-2024.04.13.589356v3-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/477ffa7462ec/nihpp-2024.04.13.589356v3-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/1e093d845c41/nihpp-2024.04.13.589356v3-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/b1bab4c1ced5/nihpp-2024.04.13.589356v3-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/8e01ca49cf6a/nihpp-2024.04.13.589356v3-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5b8/11343119/558914f687b7/nihpp-2024.04.13.589356v3-f0010.jpg

相似文献

1
Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.通过TranSigner对长RNA测序读数进行准确分配来增强转录组表达定量。
bioRxiv. 2024 Aug 17:2024.04.13.589356. doi: 10.1101/2024.04.13.589356.
2
Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.通过使用TranSigner准确分配长RNA测序读数来增强转录组表达定量。
Genome Biol. 2025 Aug 28;26(1):257. doi: 10.1186/s13059-025-03723-2.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Oarfish: enhanced probabilistic modeling leads to improved accuracy in long read transcriptome quantification.皇带鱼:增强的概率模型可提高长读长转录组定量的准确性。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i304-i313. doi: 10.1093/bioinformatics/btaf240.
5
Oarfish: Enhanced probabilistic modeling leads to improved accuracy in long read transcriptome quantification.皇带鱼:增强的概率模型可提高长读长转录组定量的准确性。
bioRxiv. 2024 Mar 1:2024.02.28.582591. doi: 10.1101/2024.02.28.582591.
6
SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.SQANTI:用于全长转录组鉴定和定量的长读转录序列的广泛特征化,以进行质量控制。
Genome Res. 2018 Mar 1;28(3):396-411. doi: 10.1101/gr.222976.117.
7
De novo Genome Assembly Using Long Reads and Chromosome Conformation Capture.使用长读长和染色体构象捕获进行从头基因组组装
Methods Mol Biol. 2025;2935:1-27. doi: 10.1007/978-1-0716-4583-3_1.
8
Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.高质量的长读长序列对于实现生物多样性基因组学的潜力至关重要。
BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9.
9
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.
10
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.

本文引用的文献

1
A systematic benchmark of Nanopore long-read RNA sequencing for transcript-level analysis in human cell lines.用于人类细胞系转录本水平分析的纳米孔长读长RNA测序的系统基准测试。
Nat Methods. 2025 Apr;22(4):801-812. doi: 10.1038/s41592-025-02623-4. Epub 2025 Mar 13.
2
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification.系统评估长读 RNA-seq 方法在转录本鉴定和定量中的应用。
Nat Methods. 2024 Jul;21(7):1349-1363. doi: 10.1038/s41592-024-02298-3. Epub 2024 Jun 7.
3
CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure.
CHESS 3:基于大规模表达数据、系统发育分析和蛋白质结构,改进和综合的人类基因和转录本目录。
Genome Biol. 2023 Oct 30;24(1):249. doi: 10.1186/s13059-023-03088-4.
4
The status of the human gene catalogue.人类基因目录的现状。
Nature. 2023 Oct;622(7981):41-47. doi: 10.1038/s41586-023-06490-x. Epub 2023 Oct 4.
5
Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures.基于计算机模拟混合物对长读 RNA 测序分析工具进行基准测试。
Nat Methods. 2023 Nov;20(11):1810-1821. doi: 10.1038/s41592-023-02026-3. Epub 2023 Oct 2.
6
Context-aware transcript quantification from long-read RNA-seq data with Bambu.使用 Bambu 从长读 RNA-seq 数据中进行上下文感知的转录本定量。
Nat Methods. 2023 Aug;20(8):1187-1195. doi: 10.1038/s41592-023-01908-w. Epub 2023 Jun 12.
7
ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data.ESPRESSO:从易错的长读 RNA-seq 数据中稳健地发现和定量转录本异构体。
Sci Adv. 2023 Jan 20;9(3):eabq5072. doi: 10.1126/sciadv.abq5072.
8
Native RNA or cDNA Sequencing for Transcriptomic Analysis: A Case Study on .用于转录组分析的天然RNA或cDNA测序:一项关于……的案例研究
Front Bioeng Biotechnol. 2022 Apr 12;10:842299. doi: 10.3389/fbioe.2022.842299. eCollection 2022.
9
Nanopore sequencing of RNA and cDNA molecules in .在. 中对 RNA 和 cDNA 分子进行纳米孔测序。
RNA. 2022 Mar;28(3):400-417. doi: 10.1261/rna.078937.121. Epub 2021 Dec 14.
10
Accurate expression quantification from nanopore direct RNA sequencing with NanoCount.利用 NanoCount 从纳米孔直接 RNA 测序中进行准确的表达定量。
Nucleic Acids Res. 2022 Feb 28;50(4):e19. doi: 10.1093/nar/gkab1129.