MultiTrans: An Algorithm for Path Extraction Through Mixed Integer Linear Programming for Transcriptome Assembly.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):48-56. doi: 10.1109/TCBB.2021.3083277. Epub 2022 Feb 3.

DOI:10.1109/TCBB.2021.3083277
PMID:34033544
Abstract

Recent advances in RNA-seq technology have made the identification of expressed genes affordable, thus boosting the rapid development of transcriptomic studies. Transcriptome assembly, the reconstruction of all expressed transcripts from RNA-seq reads, is an essential step toward understanding genes, proteins, and cell functions. It remains a challenging problem because of complications arising from splicing variants, varying expression levels, uneven coverage, and sequencing errors. Here, we formulate the transcriptome assembly problem as path extraction on splicing graphs (or assembly graphs) and propose a novel algorithm, MultiTrans, for path extraction using mixed integer linear programming. MultiTrans can take into account coverage constraints on vertices and edges, the number of paths, and paired-end information simultaneously. We benchmarked MultiTrans against two state-of-the-art transcriptome assemblers, TransLiG and rnaSPAdes. Experimental results show that MultiTrans generates more accurate transcripts than TransLiG (using the same splicing graphs) and rnaSPAdes (using the same assembly graphs). MultiTrans is freely available at https://github.com/jzbio/MultiTrans.
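The idea of path extraction under coverage and path-count constraints can be illustrated on a toy splicing graph. The sketch below is not the MultiTrans formulation, which the abstract does not spell out. It assumes a simplified objective (minimize the total absolute gap between each edge's coverage and the summed abundance of the paths that use it), ignores vertex coverage and paired-end information, and replaces the MILP solver with exhaustive search over integer abundances. The graph `EDGES` and the function names are hypothetical.

```python
from itertools import product

# Hypothetical toy splicing graph: edge -> observed read coverage.
# Two "bubbles" (a/b and d/e) give four possible source-to-sink paths.
EDGES = {
    ("s", "a"): 6, ("s", "b"): 3,
    ("a", "c"): 6, ("b", "c"): 3,
    ("c", "d"): 6, ("c", "e"): 3,
    ("d", "t"): 6, ("e", "t"): 3,
}


def enumerate_paths(edges, source, sink):
    """Return every source-to-sink path of a DAG as a list of vertices."""
    successors = {}
    for u, v in edges:
        successors.setdefault(u, []).append(v)
    paths, stack = [], [[source]]
    while stack:
        path = stack.pop()
        if path[-1] == sink:
            paths.append(path)
            continue
        for nxt in successors.get(path[-1], []):
            stack.append(path + [nxt])
    return paths


def extract_paths(edges, source, sink, max_paths):
    """Pick at most `max_paths` paths with integer abundances that best
    explain the edge coverage (assumed objective: total absolute error).

    Exhaustive search stands in for a MILP solver; it is only feasible
    on very small graphs.
    """
    paths = enumerate_paths(edges, source, sink)
    path_edges = [set(zip(p, p[1:])) for p in paths]
    max_cov = max(edges.values())
    best = None
    for weights in product(range(max_cov + 1), repeat=len(paths)):
        # Path-count constraint: limit how many paths get nonzero abundance.
        if sum(w > 0 for w in weights) > max_paths:
            continue
        error = 0
        for edge, cov in edges.items():
            explained = sum(w for w, pe in zip(weights, path_edges) if edge in pe)
            error += abs(cov - explained)
        if best is None or error < best[0]:
            best = (error, weights)
    error, weights = best
    chosen = {tuple(p): w for p, w in zip(paths, weights) if w > 0}
    return error, chosen


if __name__ == "__main__":
    error, chosen = extract_paths(EDGES, "s", "t", max_paths=2)
    print("total coverage error:", error)
    for path, abundance in chosen.items():
        print(" -> ".join(path), "abundance", abundance)
```

With `max_paths=2`, the only zero-error answer on this graph is the pair s-a-c-d-t (abundance 6) and s-b-c-e-t (abundance 3). If the limit is raised to 3, a second zero-error decomposition appears (three paths with abundance 3 each), which shows why edge coverage alone may not pin down the transcripts and why constraining the number of paths matters.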

Similar articles

1
MultiTrans: An Algorithm for Path Extraction Through Mixed Integer Linear Programming for Transcriptome Assembly.
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):48-56. doi: 10.1109/TCBB.2021.3083277. Epub 2022 Feb 3.
2
Accurate inference of isoforms from multiple sample RNA-Seq data.
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S15. doi: 10.1186/1471-2164-16-S2-S15. Epub 2015 Jan 21.
3
IsoTree: A New Framework for de novo Transcriptome Assembly from RNA-seq Reads.
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):938-948. doi: 10.1109/TCBB.2018.2808350. Epub 2018 Feb 21.
4
SSP: an interval integer linear programming for de novo transcriptome assembly and isoform discovery of RNA-seq reads.
Genomics. 2013 Nov-Dec;102(5-6):507-14. doi: 10.1016/j.ygeno.2013.10.003. Epub 2013 Oct 23.
5
Transcriptome assembly and quantification from Ion Torrent RNA-Seq data.
BMC Genomics. 2014;15 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2164-15-S5-S7. Epub 2014 Jul 14.
6
TransRef enables accurate transcriptome assembly by redefining accurate neo-splicing graphs.
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab261.
7
rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data.
Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz100.
8
TransLiG: a de novo transcriptome assembler that uses line graph iteration.
Genome Biol. 2019 Apr 23;20(1):81. doi: 10.1186/s13059-019-1690-7.
9
A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data.
BMC Genomics. 2014;15 Suppl 5(Suppl 5):S6. doi: 10.1186/1471-2164-15-S5-S6. Epub 2014 Jul 14.
10
Freddie: annotation-independent detection and discovery of transcriptomic alternative splicing isoforms using long-read sequencing.
Nucleic Acids Res. 2023 Jan 25;51(2):e11. doi: 10.1093/nar/gkac1112.

Articles citing this paper

1
VirDiG: a transcriptome assembler for coronavirus.
Bioinform Adv. 2025 Apr 8;5(1):vbaf075. doi: 10.1093/bioadv/vbaf075. eCollection 2025.
2
Cov-trans: an efficient algorithm for discontinuous transcript assembly in coronaviruses.
BMC Genomics. 2024 Dec 30;25(1):1257. doi: 10.1186/s12864-024-11179-0.
3
A split Bregman method solving optimal reactive power dispatch for a doubly-fed induction generator-based wind farm.
Sci Rep. 2022 Nov 10;12(1):19222. doi: 10.1038/s41598-022-17761-4.
4
Efficient Minimum Flow Decomposition via Integer Linear Programming.
J Comput Biol. 2022 Nov;29(11):1252-1267. doi: 10.1089/cmb.2022.0257. Epub 2022 Oct 18.
5
Jumper enables discontinuous transcript assembly in coronaviruses.
Nat Commun. 2021 Nov 18;12(1):6728. doi: 10.1038/s41467-021-26944-y.