• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

龙童:基于网络流的转录组重构。

Ryūtō: network-flow based transcriptome reconstruction.

机构信息

Bioinformatics Group, Department of Computer Science & Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, Leipzig, 04107, Germany.

Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, 04103, Germany.

出版信息

BMC Bioinformatics. 2019 Apr 16;20(1):190. doi: 10.1186/s12859-019-2786-5.

DOI:10.1186/s12859-019-2786-5
PMID:30991937
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6469118/
Abstract

BACKGROUND

The rapid increase in High-throughput sequencing of RNA (RNA-seq) has led to tremendous improvements in the detection and reconstruction of both expressed coding and non-coding RNA transcripts. Yet, the complete and accurate annotation of the complex transcriptional output of not only the human genome has remained elusive. One of the critical bottlenecks in this endeavor is the computational reconstruction of transcript structures, due to high noise levels, technological limits, and other biases in the raw data.

RESULTS

We introduce several new and improved algorithms in a novel workflow for transcript assembly and quantification. We propose an extension of the common splice graph framework that combines aspects of overlap and bin graphs and makes it possible to efficiently use both multi-splice and paired-end information to the fullest extent. Phasing information of reads is used to further resolve loci. The decomposition of read coverage patterns is modeled as a minimum-cost flow problem to account for the unavoidable non-uniformities of RNA-seq data.

CONCLUSION

Its performance compares favorably with state of the art methods on both simulated and real-life datasets. Ryūtō calls 1-4% more true transcripts, while calling 5-35% less false predictions compared to the next best competitor.

摘要

背景

高通量测序 RNA(RNA-seq)的快速发展,使得对表达编码和非编码 RNA 转录本的检测和重构都有了巨大的改进。然而,不仅人类基因组的复杂转录产物的完整和准确注释仍然难以实现。在这方面努力的一个关键瓶颈是由于高噪声水平、技术限制和原始数据中的其他偏差,转录本结构的计算重建。

结果

我们在一个新的转录物组装和定量的工作流程中引入了几个新的和改进的算法。我们提出了一种常见剪接图框架的扩展,该框架结合了重叠和 bin 图的方面,使得能够充分利用多剪接和配对末端信息。读取的相位信息用于进一步解决基因座。读取覆盖模式的分解被建模为最小成本流问题,以解释 RNA-seq 数据不可避免的非均匀性。

结论

它的性能在模拟和真实数据集上都优于最先进的方法。与下一个最佳竞争对手相比,Ryūtō 调用的真实转录本多 1-4%,而错误预测少 5-35%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/2f52cf7bcd9e/12859_2019_2786_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/c5ffdbc6ce3e/12859_2019_2786_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/0bab93c9f72f/12859_2019_2786_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/5c005f745062/12859_2019_2786_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/ea004e772c8f/12859_2019_2786_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/42ec680baa6a/12859_2019_2786_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/c1679aba8d80/12859_2019_2786_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/2f52cf7bcd9e/12859_2019_2786_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/c5ffdbc6ce3e/12859_2019_2786_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/0bab93c9f72f/12859_2019_2786_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/5c005f745062/12859_2019_2786_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/ea004e772c8f/12859_2019_2786_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/42ec680baa6a/12859_2019_2786_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/c1679aba8d80/12859_2019_2786_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1039/6469118/2f52cf7bcd9e/12859_2019_2786_Fig7_HTML.jpg

相似文献

1
Ryūtō: network-flow based transcriptome reconstruction.龙童:基于网络流的转录组重构。
BMC Bioinformatics. 2019 Apr 16;20(1):190. doi: 10.1186/s12859-019-2786-5.
2
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA:用于基于参考的细菌RNA测序转录组自动分析的简单程序。
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
3
Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification.信息转导能力降低了无注释异构体发现和定量中的不确定性。
Nucleic Acids Res. 2017 Sep 6;45(15):e143. doi: 10.1093/nar/gkx585.
4
Improved transcriptome quantification and reconstruction from RNA-Seq reads using partial annotations.利用部分注释改进RNA测序读数的转录组定量和重建
In Silico Biol. 2011;11(5-6):251-61. doi: 10.3233/ISB-2012-0459.
5
BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data.BinPacker:基于装箱法的RNA测序数据从头转录组组装
PLoS Comput Biol. 2016 Feb 19;12(2):e1004772. doi: 10.1371/journal.pcbi.1004772. eCollection 2016 Feb.
6
CIDANE: comprehensive isoform discovery and abundance estimation.CIDANE:全面的异构体发现与丰度估计
Genome Biol. 2016 Jan 30;17:16. doi: 10.1186/s13059-015-0865-0.
7
Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析
Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.
8
MITIE: Simultaneous RNA-Seq-based transcript identification and quantification in multiple samples.MITIE:在多个样本中基于 RNA-Seq 的同时转录本鉴定和定量。
Bioinformatics. 2013 Oct 15;29(20):2529-38. doi: 10.1093/bioinformatics/btt442. Epub 2013 Aug 25.
9
Piecing the puzzle together: a revisit to transcript reconstruction problem in RNA-seq.拼凑拼图:RNA-seq 中转录本重建问题的再探讨。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S3. doi: 10.1186/1471-2105-15-S9-S3. Epub 2014 Sep 10.
10
A robust method for transcript quantification with RNA-seq data.一种利用RNA测序数据进行转录本定量的可靠方法。
J Comput Biol. 2013 Mar;20(3):167-87. doi: 10.1089/cmb.2012.0230.

引用本文的文献

1
Cov-trans: an efficient algorithm for discontinuous transcript assembly in coronaviruses.Cov-trans:一种用于冠状病毒中不连续转录本组装的高效算法。
BMC Genomics. 2024 Dec 30;25(1):1257. doi: 10.1186/s12864-024-11179-0.
2
StringFix: an annotation-guided transcriptome assembler improves the recovery of amino acid sequences from RNA-Seq reads.StringFix:一种基于注释指导的转录组组装方法,可提高从 RNA-Seq 读段中恢复氨基酸序列的能力。
Genes Genomics. 2023 Dec;45(12):1599-1609. doi: 10.1007/s13258-023-01458-7. Epub 2023 Oct 14.
3
Phables: from fragmented assemblies to high-quality bacteriophage genomes.

本文引用的文献

1
Rare Splice Variants in Long Non-Coding RNAs.长链非编码RNA中的罕见剪接变体
Noncoding RNA. 2017 Jul 5;3(3):23. doi: 10.3390/ncrna3030023.
2
Strawberry: Fast and accurate genome-guided transcript reconstruction and quantification from RNA-Seq.草莓:基于RNA测序的快速且准确的基因组引导转录本重建与定量分析
PLoS Comput Biol. 2017 Nov 27;13(11):e1005851. doi: 10.1371/journal.pcbi.1005851. eCollection 2017 Nov.
3
Accurate assembly of transcripts through phase-preserving graph decomposition.通过保留相位的图分解实现转录本的精确组装。
噬菌体:从碎片化组装到高质量噬菌体基因组。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad586.
4
Phables: from fragmented assemblies to high-quality bacteriophage genomes.噬菌体组装拼接软件(Phables):从片段化组装到高质量噬菌体基因组
bioRxiv. 2023 Sep 11:2023.04.04.535632. doi: 10.1101/2023.04.04.535632.
5
Improving RNA Assembly via Safety and Completeness in Flow Decompositions.通过流分解中的安全性和完整性提高 RNA 组装
J Comput Biol. 2022 Dec;29(12):1270-1287. doi: 10.1089/cmb.2022.0261. Epub 2022 Oct 25.
6
Efficient Minimum Flow Decomposition via Integer Linear Programming.通过整数线性规划实现有效的最小流量分解。
J Comput Biol. 2022 Nov;29(11):1252-1267. doi: 10.1089/cmb.2022.0257. Epub 2022 Oct 18.
7
Deriving Ranges of Optimal Estimated Transcript Expression due to Nonidentifiability.由于不可识别性导致的最优转录本表达范围的推导。
J Comput Biol. 2022 Feb;29(2):121-139. doi: 10.1089/cmb.2021.0444. Epub 2022 Jan 17.
8
Chromosome-level Thlaspi arvense genome provides new tools for translational research and for a newly domesticated cash cover crop of the cooler climates.染色体水平的野豌豆基因组为转化研究和新驯化的凉爽气候下的现金覆盖作物提供了新工具。
Plant Biotechnol J. 2022 May;20(5):944-963. doi: 10.1111/pbi.13775. Epub 2022 Feb 6.
9
SAUTE: sequence assembly using target enrichment.SAUTE:基于目标富集的序列组装。
BMC Bioinformatics. 2021 Jul 21;22(1):375. doi: 10.1186/s12859-021-04174-9.
10
RefShannon: A genome-guided transcriptome assembler using sparse flow decomposition.RefShannon:一种基于基因组指导的使用稀疏流分解的转录组组装方法。
PLoS One. 2020 Jun 2;15(6):e0232946. doi: 10.1371/journal.pone.0232946. eCollection 2020.
Nat Biotechnol. 2017 Dec;35(12):1167-1169. doi: 10.1038/nbt.4020. Epub 2017 Nov 13.
4
Ckmeans.1d.dp: Optimal -means Clustering in One Dimension by Dynamic Programming.Ckmeans.1d.dp:通过动态规划实现的一维最优均值聚类
R J. 2011 Dec;3(2):29-33.
5
TransComb: genome-guided transcriptome assembly via combing junctions in splicing graphs.TransComb:通过梳理剪接图中的连接点进行基因组引导的转录组组装。
Genome Biol. 2016 Oct 19;17(1):213. doi: 10.1186/s13059-016-1074-1.
6
Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data.用于从RNA测序数据中确定和定量全长mRNA剪接形式的算法的基准分析。
Bioinformatics. 2015 Dec 15;31(24):3938-45. doi: 10.1093/bioinformatics/btv488. Epub 2015 Sep 3.
7
HISAT: a fast spliced aligner with low memory requirements.HISAT:一种内存需求低的快速剪接比对器。
Nat Methods. 2015 Apr;12(4):357-60. doi: 10.1038/nmeth.3317. Epub 2015 Mar 9.
8
StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.StringTie能够从RNA测序读数中更完善地重建转录组。
Nat Biotechnol. 2015 Mar;33(3):290-5. doi: 10.1038/nbt.3122. Epub 2015 Feb 18.
9
IVT-seq reveals extreme bias in RNA sequencing.体外转录测序(IVT-seq)揭示了RNA测序中的极端偏差。
Genome Biol. 2014 Jun 30;15(6):R86. doi: 10.1186/gb-2014-15-6-r86.
10
Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.利用网络流从RNA测序数据中高效鉴定和定量RNA异构体
Bioinformatics. 2014 Sep 1;30(17):2447-55. doi: 10.1093/bioinformatics/btu317. Epub 2014 May 9.