• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于 RNA-Seq 估计转录本表达量的新型最小成本流方法。

A novel min-cost flow method for estimating transcript expression with RNA-Seq.

机构信息

Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Finland.

出版信息

BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S15. doi: 10.1186/1471-2105-14-S5-S15. Epub 2013 Apr 10.

DOI:10.1186/1471-2105-14-S5-S15
PMID:23734627
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3622638/
Abstract

BACKGROUND

Through transcription and alternative splicing, a gene can be transcribed into different RNA sequences (isoforms), depending on the individual, on the tissue the cell is in, or in response to some stimuli. Recent RNA-Seq technology allows for new high-throughput ways for isoform identification and quantification based on short reads, and various methods have been put forward for this non-trivial problem.

RESULTS

In this paper we propose a novel radically different method based on minimum-cost network flows. This has a two-fold advantage: on the one hand, it translates the problem as an established one in the field of network flows, which can be solved in polynomial time, with different existing solvers; on the other hand, it is general enough to encompass many of the previous proposals under the least sum of squares model. Our method works as follows: in order to find the transcripts which best explain, under a given fitness model, a splicing graph resulting from an RNA-Seq experiment, we find a min-cost flow in an offset flow network, under an equivalent cost model. Under very weak assumptions on the fitness model, the optimal flow can be computed in polynomial time. Parsimoniously splitting the flow back into few path transcripts can be done with any of the heuristics and approximations available from the theory of network flows. In the present implementation, we choose the simple strategy of repeatedly removing the heaviest path.

CONCLUSIONS

We proposed a new very general method based on network flows for a multiassembly problem arising from isoform identification and quantification with RNA-Seq. Experimental results on prediction accuracy show that our method is very competitive with popular tools such as Cufflinks and IsoLasso. Our tool, called Traph (Transcrips in gRAPHs), is available at: http://www.cs.helsinki.fi/gsa/traph/.

摘要

背景

通过转录和选择性剪接,一个基因可以根据个体、细胞所在的组织或对某些刺激的反应,转录为不同的 RNA 序列(异构体)。最近的 RNA-Seq 技术允许基于短读长进行新的高通量异构体鉴定和定量方法,为此已经提出了各种方法。

结果

在本文中,我们提出了一种基于最小成本网络流的全新方法。这种方法有两个优势:一方面,它将问题转化为网络流领域的一个已建立问题,可以使用不同的现有求解器在多项式时间内求解;另一方面,它足够通用,可以包含许多以前基于最小二乘法模型的建议。我们的方法如下:为了找到在给定适合度模型下最好地解释 RNA-Seq 实验产生的剪接图的转录本,我们在等效成本模型下的偏移流网络中找到最小成本流。在非常弱的适合度模型假设下,最优流可以在多项式时间内计算。通过任何来自网络流理论的启发式和近似方法,可以将流巧妙地拆分为少数路径转录本。在当前的实现中,我们选择了简单的策略,即反复删除最重的路径。

结论

我们提出了一种新的基于网络流的通用方法,用于解决 RNA-Seq 中异构体鉴定和定量所产生的多组装问题。在预测准确性的实验结果表明,我们的方法与 Cufflinks 和 IsoLasso 等流行工具非常有竞争力。我们的工具名为 Traph(Graphs 中的转录本),可在 http://www.cs.helsinki.fi/gsa/traph/ 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/d8c2b4e64d6a/1471-2105-14-S5-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/d32b552cad7d/1471-2105-14-S5-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/8a6adb64aba9/1471-2105-14-S5-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/e2668872f11f/1471-2105-14-S5-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/d8c2b4e64d6a/1471-2105-14-S5-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/d32b552cad7d/1471-2105-14-S5-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/8a6adb64aba9/1471-2105-14-S5-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/e2668872f11f/1471-2105-14-S5-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9503/3622638/d8c2b4e64d6a/1471-2105-14-S5-S15-4.jpg

相似文献

1
A novel min-cost flow method for estimating transcript expression with RNA-Seq.一种基于 RNA-Seq 估计转录本表达量的新型最小成本流方法。
BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S15. doi: 10.1186/1471-2105-14-S5-S15. Epub 2013 Apr 10.
2
ORMAN: optimal resolution of ambiguous RNA-Seq multimappings in the presence of novel isoforms.ORMAN:在存在新的异构体的情况下,实现 RNA-Seq 多重比对的最佳分辨率。
Bioinformatics. 2014 Mar 1;30(5):644-51. doi: 10.1093/bioinformatics/btt591. Epub 2013 Oct 15.
3
Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.利用网络流从RNA测序数据中高效鉴定和定量RNA异构体
Bioinformatics. 2014 Sep 1;30(17):2447-55. doi: 10.1093/bioinformatics/btu317. Epub 2014 May 9.
4
TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.TIGAR:一种通过变分贝叶斯推断进行 RNA-Seq 数据缺口对齐的转录本丰度估计方法。
Bioinformatics. 2013 Sep 15;29(18):2292-9. doi: 10.1093/bioinformatics/btt381. Epub 2013 Jul 2.
5
NURD: an implementation of a new method to estimate isoform expression from non-uniform RNA-seq data.NURD:一种从非均匀 RNA-seq 数据估计异构体表达的新方法的实现。
BMC Bioinformatics. 2013 Jul 10;14:220. doi: 10.1186/1471-2105-14-220.
6
FDM: a graph-based statistical method to detect differential transcription using RNA-seq data.FDM:一种基于图的统计方法,用于检测使用 RNA-seq 数据的差异转录。
Bioinformatics. 2011 Oct 1;27(19):2633-40. doi: 10.1093/bioinformatics/btr458. Epub 2011 Aug 8.
7
SSP: an interval integer linear programming for de novo transcriptome assembly and isoform discovery of RNA-seq reads.SSP:一种用于 RNA-seq reads 从头转录组组装和异构体发现的区间整数线性规划方法。
Genomics. 2013 Nov-Dec;102(5-6):507-14. doi: 10.1016/j.ygeno.2013.10.003. Epub 2013 Oct 23.
8
Visualization and analysis of RNA-Seq assembly graphs.RNA-Seq 组装图的可视化和分析。
Nucleic Acids Res. 2019 Aug 22;47(14):7262-7275. doi: 10.1093/nar/gkz599.
9
Piecing the puzzle together: a revisit to transcript reconstruction problem in RNA-seq.拼凑拼图:RNA-seq 中转录本重建问题的再探讨。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S3. doi: 10.1186/1471-2105-15-S9-S3. Epub 2014 Sep 10.
10
RNA-Skim: a rapid method for RNA-Seq quantification at transcript level.RNA-Skim:一种在转录水平上进行 RNA-Seq 定量的快速方法。
Bioinformatics. 2014 Jun 15;30(12):i283-i292. doi: 10.1093/bioinformatics/btu288.

引用本文的文献

1
Open-Field Blast Injury Disrupts Corneal Gene Expression Linked to Ion Transport, Sensory Perception, and Neural Signaling.开放性爆炸伤会破坏与离子转运、感觉感知和神经信号传导相关的角膜基因表达。
Invest Ophthalmol Vis Sci. 2025 Aug 1;66(11):68. doi: 10.1167/iovs.66.11.68.
2
Cov-trans: an efficient algorithm for discontinuous transcript assembly in coronaviruses.Cov-trans:一种用于冠状病毒中不连续转录本组装的高效算法。
BMC Genomics. 2024 Dec 30;25(1):1257. doi: 10.1186/s12864-024-11179-0.
3
Data-driven AI system for learning how to run transcript assemblers.

本文引用的文献

1
MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.MetaVelvet:Velvet 组装器的扩展,用于从短序列读取进行从头宏基因组组装。
Nucleic Acids Res. 2012 Nov 1;40(20):e155. doi: 10.1093/nar/gks678. Epub 2012 Jul 19.
2
The clonal and mutational evolution spectrum of primary triple-negative breast cancers.原发性三阴性乳腺癌的克隆和突变进化图谱。
Nature. 2012 Apr 4;486(7403):395-9. doi: 10.1038/nature10933.
3
Sparse linear modeling of next-generation mRNA sequencing (RNA-Seq) data for isoform discovery and abundance estimation.
用于学习如何运行转录本组装程序的数据驱动型人工智能系统。
bioRxiv. 2024 Oct 30:2024.01.25.577290. doi: 10.1101/2024.01.25.577290.
4
Transcriptomic landscape of quiescent and proliferating human corneal stromal fibroblasts.人角膜基质成纤维细胞静息态和增殖态的转录组图谱
Exp Eye Res. 2024 Nov;248:110073. doi: 10.1016/j.exer.2024.110073. Epub 2024 Sep 5.
5
RNA-Seq Analysis Unraveling Novel Genes and Pathways Influencing Corneal Wound Healing.RNA-Seq 分析揭示了影响角膜伤口愈合的新基因和途径。
Invest Ophthalmol Vis Sci. 2024 Sep 3;65(11):13. doi: 10.1167/iovs.65.11.13.
6
Floria: fast and accurate strain haplotyping in metagenomes.弗洛里亚:宏基因组中快速准确的菌株单倍型分型。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i30-i38. doi: 10.1093/bioinformatics/btae252.
7
A safety framework for flow decomposition problems via integer linear programming.通过整数线性规划对流量分解问题进行安全框架构建。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad640.
8
Phables: from fragmented assemblies to high-quality bacteriophage genomes.噬菌体:从碎片化组装到高质量噬菌体基因组。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad586.
9
Long noncoding RNA study: Genome-wide approaches.长链非编码RNA研究:全基因组方法。
Genes Dis. 2022 Nov 29;10(6):2491-2510. doi: 10.1016/j.gendis.2022.10.024. eCollection 2023 Nov.
10
Phables: from fragmented assemblies to high-quality bacteriophage genomes.噬菌体组装拼接软件(Phables):从片段化组装到高质量噬菌体基因组
bioRxiv. 2023 Sep 11:2023.04.04.535632. doi: 10.1101/2023.04.04.535632.
基于下一代 mRNA 测序(RNA-Seq)数据的稀疏线性建模用于发现异构体和丰度估计。
Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):19867-72. doi: 10.1073/pnas.1113972108. Epub 2011 Dec 1.
4
IsoLasso: a LASSO regression approach to RNA-Seq based transcriptome assembly.IsoLasso:一种基于RNA测序的转录组组装的套索回归方法。
J Comput Biol. 2011 Nov;18(11):1693-707. doi: 10.1089/cmb.2011.0171. Epub 2011 Sep 27.
5
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.通过 RNA-Seq 进行转录本组装和定量分析揭示了细胞分化过程中未注释的转录本和异构体转换。
Nat Biotechnol. 2010 May;28(5):511-5. doi: 10.1038/nbt.1621. Epub 2010 May 2.
6
Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs.从头构建小鼠细胞类型特异性转录组揭示了 lincRNAs 的保守多外显子结构。
Nat Biotechnol. 2010 May;28(5):503-10. doi: 10.1038/nbt.1633. Epub 2010 May 2.
7
Computation for ChIP-seq and RNA-seq studies.染色质免疫沉淀测序(ChIP-seq)和RNA测序(RNA-seq)研究的计算
Nat Methods. 2009 Nov;6(11 Suppl):S22-32. doi: 10.1038/nmeth.1371.
8
TopHat: discovering splice junctions with RNA-Seq.TopHat:利用RNA测序发现剪接接头
Bioinformatics. 2009 May 1;25(9):1105-11. doi: 10.1093/bioinformatics/btp120. Epub 2009 Mar 16.
9
Mapping and quantifying mammalian transcriptomes by RNA-Seq.通过RNA测序对哺乳动物转录组进行定位和定量分析。
Nat Methods. 2008 Jul;5(7):621-8. doi: 10.1038/nmeth.1226. Epub 2008 May 30.
10
The multiassembly problem: reconstructing multiple transcript isoforms from EST fragment mixtures.多组装问题:从EST片段混合物中重建多个转录本异构体。
Genome Res. 2004 Mar;14(3):426-41. doi: 10.1101/gr.1304504. Epub 2004 Feb 12.