• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过剪接图进行精确的转录本定量分析。

Exact transcript quantification over splice graphs.

作者信息

Ma Cong, Zheng Hongyu, Kingsford Carl

机构信息

Computer Science Department, School of Engineering and Applied Science, Princeton University, 35 Olden Street, Princeton, NJ, 08544, USA.

Computational Biology Department, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, 15213, USA.

出版信息

Algorithms Mol Biol. 2021 May 10;16(1):5. doi: 10.1186/s13015-021-00184-7.

DOI:10.1186/s13015-021-00184-7
PMID:33971903
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8112020/
Abstract

BACKGROUND

The probability of sequencing a set of RNA-seq reads can be directly modeled using the abundances of splice junctions in splice graphs instead of the abundances of a list of transcripts. We call this model graph quantification, which was first proposed by Bernard et al. (Bioinformatics 30:2447-55, 2014). The model can be viewed as a generalization of transcript expression quantification where every full path in the splice graph is a possible transcript. However, the previous graph quantification model assumes the length of single-end reads or paired-end fragments is fixed.

RESULTS

We provide an improvement of this model to handle variable-length reads or fragments and incorporate bias correction. We prove that our model is equivalent to running a transcript quantifier with exactly the set of all compatible transcripts. The key to our method is constructing an extension of the splice graph based on Aho-Corasick automata. The proof of equivalence is based on a novel reparameterization of the read generation model of a state-of-art transcript quantification method.

CONCLUSION

We propose a new approach for graph quantification, which is useful for modeling scenarios where reference transcriptome is incomplete or not available and can be further used in transcriptome assembly or alternative splicing analysis.

摘要

背景

可以使用剪接图中剪接连接的丰度直接对一组RNA测序读数进行测序的概率建模,而不是使用转录本列表的丰度。我们将此模型称为图谱定量,这是由伯纳德等人首次提出的(《生物信息学》30:2447 - 55,2014年)。该模型可被视为转录本表达定量的一种推广,其中剪接图中的每条完整路径都是一个可能的转录本。然而,先前的图谱定量模型假定单端读数或双端片段的长度是固定的。

结果

我们对该模型进行了改进,以处理可变长度的读数或片段,并纳入偏差校正。我们证明,我们的模型等同于使用所有兼容转录本的集合精确运行转录本定量器。我们方法的关键是基于Aho - Corasick自动机构建剪接图的扩展。等价性证明基于对一种先进转录本定量方法的读数生成模型的新颖重新参数化。

结论

我们提出了一种新的图谱定量方法,该方法对于参考转录组不完整或不可用的建模场景很有用,并且可进一步用于转录组组装或可变剪接分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/4fddb9ff8bb9/13015_2021_184_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/dfb6f4ca7a2b/13015_2021_184_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/6cf64e32985c/13015_2021_184_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/4fddb9ff8bb9/13015_2021_184_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/dfb6f4ca7a2b/13015_2021_184_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/6cf64e32985c/13015_2021_184_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f269/8112020/4fddb9ff8bb9/13015_2021_184_Fig3_HTML.jpg

相似文献

1
Exact transcript quantification over splice graphs.通过剪接图进行精确的转录本定量分析。
Algorithms Mol Biol. 2021 May 10;16(1):5. doi: 10.1186/s13015-021-00184-7.
2
A high-resolution single-molecule sequencing-based Arabidopsis transcriptome using novel methods of Iso-seq analysis.利用 Iso-seq 分析的新方法进行高分辨率的单个分子测序的拟南芥转录组。
Genome Biol. 2022 Jul 7;23(1):149. doi: 10.1186/s13059-022-02711-0.
3
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.RSEM:有或无参考基因组的 RNA-Seq 数据的准确转录本定量。
BMC Bioinformatics. 2011 Aug 4;12:323. doi: 10.1186/1471-2105-12-323.
4
Ryūtō: network-flow based transcriptome reconstruction.龙童:基于网络流的转录组重构。
BMC Bioinformatics. 2019 Apr 16;20(1):190. doi: 10.1186/s12859-019-2786-5.
5
Visualization and analysis of RNA-Seq assembly graphs.RNA-Seq 组装图的可视化和分析。
Nucleic Acids Res. 2019 Aug 22;47(14):7262-7275. doi: 10.1093/nar/gkz599.
6
A Full-Length mRNA Transcriptome Generated From Hybrid-Corrected PacBio Long-Reads Improves the Transcript Annotation and Identifies Thousands of Novel Splice Variants in Atlantic Salmon.通过混合校正的PacBio长读长生成的全长mRNA转录组改善了转录本注释并鉴定了大西洋鲑鱼中数千种新的剪接变体。
Front Genet. 2021 Apr 27;12:656334. doi: 10.3389/fgene.2021.656334. eCollection 2021.
7
Deriving Ranges of Optimal Estimated Transcript Expression due to Nonidentifiability.由于不可识别性导致的最优转录本表达范围的推导。
J Comput Biol. 2022 Feb;29(2):121-139. doi: 10.1089/cmb.2021.0444. Epub 2022 Jan 17.
8
Strawberry: Fast and accurate genome-guided transcript reconstruction and quantification from RNA-Seq.草莓:基于RNA测序的快速且准确的基因组引导转录本重建与定量分析
PLoS Comput Biol. 2017 Nov 27;13(11):e1005851. doi: 10.1371/journal.pcbi.1005851. eCollection 2017 Nov.
9
A probabilistic framework for aligning paired-end RNA-seq data.用于比对 RNA-seq 数据的概率框架。
Bioinformatics. 2010 Aug 15;26(16):1950-7. doi: 10.1093/bioinformatics/btq336. Epub 2010 Jun 23.
10
A robust method for transcript quantification with RNA-seq data.一种利用RNA测序数据进行转录本定量的可靠方法。
J Comput Biol. 2013 Mar;20(3):167-87. doi: 10.1089/cmb.2012.0230.

引用本文的文献

1
Classification and Quantification of Unproductive Splicing Events.非生产性剪接事件的分类与定量
Acta Naturae. 2025 Apr-Jun;17(2):75-85. doi: 10.32607/actanaturae.27572.
2
A safety framework for flow decomposition problems via integer linear programming.通过整数线性规划对流量分解问题进行安全框架构建。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad640.
3
Quantification of mutant-allele expression at isoform level in cancer from RNA-seq data.从RNA测序数据中对癌症中异构体水平的突变等位基因表达进行定量分析。

本文引用的文献

1
GENCODE reference annotation for the human and mouse genomes.GENCODE 人类和小鼠基因组参考注释。
Nucleic Acids Res. 2019 Jan 8;47(D1):D766-D773. doi: 10.1093/nar/gky955.
2
Theory and A Heuristic for the Minimum Path Flow Decomposition Problem.最小路径流分解问题的理论与启发式方法。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Mar-Apr;16(2):658-670. doi: 10.1109/TCBB.2017.2779509. Epub 2017 Dec 4.
3
SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions.SUPPA2:快速、准确且能感知不确定性的跨多种条件差异剪接分析。
NAR Genom Bioinform. 2022 Jul 13;4(3):lqac052. doi: 10.1093/nargab/lqac052. eCollection 2022 Sep.
4
Deriving Ranges of Optimal Estimated Transcript Expression due to Nonidentifiability.由于不可识别性导致的最优转录本表达范围的推导。
J Comput Biol. 2022 Feb;29(2):121-139. doi: 10.1089/cmb.2021.0444. Epub 2022 Jan 17.
Genome Biol. 2018 Mar 23;19(1):40. doi: 10.1186/s13059-018-1417-1.
4
SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.SQANTI:用于全长转录组鉴定和定量的长读转录序列的广泛特征化,以进行质量控制。
Genome Res. 2018 Mar 1;28(3):396-411. doi: 10.1101/gr.222976.117.
5
Accurate assembly of transcripts through phase-preserving graph decomposition.通过保留相位的图分解实现转录本的精确组装。
Nat Biotechnol. 2017 Dec;35(12):1167-1169. doi: 10.1038/nbt.4020. Epub 2017 Nov 13.
6
Salmon provides fast and bias-aware quantification of transcript expression.鲑鱼提供快速且无偏倚的转录本表达定量。
Nat Methods. 2017 Apr;14(4):417-419. doi: 10.1038/nmeth.4197. Epub 2017 Mar 6.
7
TransComb: genome-guided transcriptome assembly via combing junctions in splicing graphs.TransComb:通过梳理剪接图中的连接点进行基因组引导的转录组组装。
Genome Biol. 2016 Oct 19;17(1):213. doi: 10.1186/s13059-016-1074-1.
8
Near-optimal probabilistic RNA-seq quantification.近乎最优的概率 RNA-seq 定量。
Nat Biotechnol. 2016 May;34(5):525-7. doi: 10.1038/nbt.3519. Epub 2016 Apr 4.
9
Fast and accurate approximate inference of transcript expression from RNA-seq data.从RNA测序数据中快速准确地进行转录本表达的近似推断。
Bioinformatics. 2015 Dec 15;31(24):3881-9. doi: 10.1093/bioinformatics/btv483. Epub 2015 Aug 26.
10
Polyester: simulating RNA-seq datasets with differential transcript expression.聚酯:模拟具有差异转录本表达的RNA测序数据集。
Bioinformatics. 2015 Sep 1;31(17):2778-84. doi: 10.1093/bioinformatics/btv272. Epub 2015 Apr 28.