• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于学习如何运行转录本组装程序的数据驱动型人工智能系统。

Data-driven AI system for learning how to run transcript assemblers.

作者信息

Shen Yihang, Yan Zhiwen, Kingsford Carl

机构信息

Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA.

出版信息

bioRxiv. 2024 Oct 30:2024.01.25.577290. doi: 10.1101/2024.01.25.577290.

DOI:10.1101/2024.01.25.577290
PMID:39554123
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11565938/
Abstract

We introduce AutoTuneX, a data-driven, AI system designed to automatically predict optimal parameters for transcript assemblers - tools for reconstructing expressed transcripts from the reads in a given RNA-seq sample. AutoTuneX is built by learning parameter knowledge from existing RNA-seq samples and transferring this knowledge to unseen samples. On 1588 human RNA-seq samples tested with two transcript assemblers, AutoTuneX predicts parameters that resulted in 98% of samples achieving more accurate transcript assembly compared to using default parameter settings, with some samples experiencing up to a 600% improvement in AUC. AutoTuneX offers a new strategy for automatically optimizing use of sequence analysis tools.

摘要

我们推出了AutoTuneX,这是一个数据驱动的人工智能系统,旨在自动预测转录本组装工具的最佳参数,转录本组装工具用于从给定RNA测序样本中的 reads 重建表达的转录本。AutoTuneX通过从现有的RNA测序样本中学习参数知识,并将这些知识转移到未见样本中构建而成。在用两种转录本组装工具测试的1588个人类RNA测序样本上,与使用默认参数设置相比,AutoTuneX预测的参数使得98%的样本实现了更准确的转录本组装,一些样本的AUC提高了600%。AutoTuneX为自动优化序列分析工具的使用提供了一种新策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/a80b783f0f6b/nihpp-2024.01.25.577290v2-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/5e6bdd5f67d9/nihpp-2024.01.25.577290v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/5dca5b9be4de/nihpp-2024.01.25.577290v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/3f34c941814c/nihpp-2024.01.25.577290v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/591f66f0d04e/nihpp-2024.01.25.577290v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/e9d785b49468/nihpp-2024.01.25.577290v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/3a00ce9d1930/nihpp-2024.01.25.577290v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/13ec752a1e10/nihpp-2024.01.25.577290v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/294da7b0c9f2/nihpp-2024.01.25.577290v2-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/bf68e67a5c5a/nihpp-2024.01.25.577290v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/55d5d8a76f05/nihpp-2024.01.25.577290v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/a3b32b52025e/nihpp-2024.01.25.577290v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/a80b783f0f6b/nihpp-2024.01.25.577290v2-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/5e6bdd5f67d9/nihpp-2024.01.25.577290v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/5dca5b9be4de/nihpp-2024.01.25.577290v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/3f34c941814c/nihpp-2024.01.25.577290v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/591f66f0d04e/nihpp-2024.01.25.577290v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/e9d785b49468/nihpp-2024.01.25.577290v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/3a00ce9d1930/nihpp-2024.01.25.577290v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/13ec752a1e10/nihpp-2024.01.25.577290v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/294da7b0c9f2/nihpp-2024.01.25.577290v2-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/bf68e67a5c5a/nihpp-2024.01.25.577290v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/55d5d8a76f05/nihpp-2024.01.25.577290v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/a3b32b52025e/nihpp-2024.01.25.577290v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ec/11565938/a80b783f0f6b/nihpp-2024.01.25.577290v2-f0012.jpg

相似文献

1
Data-driven AI system for learning how to run transcript assemblers.用于学习如何运行转录本组装程序的数据驱动型人工智能系统。
bioRxiv. 2024 Oct 30:2024.01.25.577290. doi: 10.1101/2024.01.25.577290.
2
More Accurate Transcript Assembly via Parameter Advising.通过参数建议实现更准确的转录本组装。
J Comput Biol. 2020 Aug;27(8):1181-1189. doi: 10.1089/cmb.2019.0286. Epub 2020 Apr 21.
3
Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study.优化从头转录组组装从短读 RNA-Seq 数据:一项比较研究。
BMC Bioinformatics. 2011 Dec 14;12 Suppl 14(Suppl 14):S2. doi: 10.1186/1471-2105-12-S14-S2.
4
Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data.利用短读 RNA-Seq 数据优化从头构建的普通小麦转录组组装。
BMC Genomics. 2012 Aug 14;13:392. doi: 10.1186/1471-2164-13-392.
5
StringFix: an annotation-guided transcriptome assembler improves the recovery of amino acid sequences from RNA-Seq reads.StringFix:一种基于注释指导的转录组组装方法,可提高从 RNA-Seq 读段中恢复氨基酸序列的能力。
Genes Genomics. 2023 Dec;45(12):1599-1609. doi: 10.1007/s13258-023-01458-7. Epub 2023 Oct 14.
6
Playing hide and seek with repeats in local and global de novo transcriptome assembly of short RNA-seq reads.在短RNA测序读数的局部和全局从头转录组组装中与重复序列玩捉迷藏游戏。
Algorithms Mol Biol. 2017 Feb 22;12:2. doi: 10.1186/s13015-017-0091-2. eCollection 2017.
7
Comparison of assembly algorithms for improving rate of metatranscriptomic functional annotation.比较提高宏转录组功能注释率的组装算法。
Microbiome. 2014 Oct 28;2:39. doi: 10.1186/2049-2618-2-39. eCollection 2014.
8
scRNAss: a single-cell RNA-seq assembler via imputing dropouts and combing junctions.scRNAss:一种通过填补缺失值和组合连接点来进行单细胞 RNA-seq 组装的方法。
Bioinformatics. 2019 Nov 1;35(21):4264-4271. doi: 10.1093/bioinformatics/btz240.
9
Accurate assembly of multiple RNA-seq samples with Aletsch.利用 Aletsch 对多个 RNA-seq 样本进行精确组装。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i307-i317. doi: 10.1093/bioinformatics/btae215.
10
Accurate assembly of multi-end RNA-seq data with Scallop2.使用Scallop2对多端RNA测序数据进行精确组装。
Nat Comput Sci. 2022 Mar;2(3):148-152. doi: 10.1038/s43588-022-00216-1. Epub 2022 Mar 28.

本文引用的文献

1
Accurate assembly of multi-end RNA-seq data with Scallop2.使用Scallop2对多端RNA测序数据进行精确组装。
Nat Comput Sci. 2022 Mar;2(3):148-152. doi: 10.1038/s43588-022-00216-1. Epub 2022 Mar 28.
2
Practical selection of representative sets of RNA-seq samples using a hierarchical approach.使用层次方法对 RNA-seq 样本进行有代表性的集合的实际选择。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i334-i341. doi: 10.1093/bioinformatics/btab315.
3
Automating parameter selection to avoid implausible biological pathway models.自动化参数选择,避免不合理的生物途径模型。
NPJ Syst Biol Appl. 2021 Feb 23;7(1):12. doi: 10.1038/s41540-020-00167-1.
4
RNA-Bloom enables reference-free and reference-guided sequence assembly for single-cell transcriptomes.RNA-Bloom 能够实现无参考和有参考的单细胞转录组序列组装。
Genome Res. 2020 Aug;30(8):1191-1200. doi: 10.1101/gr.260174.119. Epub 2020 Aug 17.
5
GFF Utilities: GffRead and GffCompare.
F1000Res. 2020 Apr 28;9. doi: 10.12688/f1000research.23297.2. eCollection 2020.
6
RefShannon: A genome-guided transcriptome assembler using sparse flow decomposition.RefShannon:一种基于基因组指导的使用稀疏流分解的转录组组装方法。
PLoS One. 2020 Jun 2;15(6):e0232946. doi: 10.1371/journal.pone.0232946. eCollection 2020.
7
More Accurate Transcript Assembly via Parameter Advising.通过参数建议实现更准确的转录本组装。
J Comput Biol. 2020 Aug;27(8):1181-1189. doi: 10.1089/cmb.2019.0286. Epub 2020 Apr 21.
8
Constrained Bayesian optimization for automatic chemical design using variational autoencoders.使用变分自编码器进行自动化学设计的约束贝叶斯优化。
Chem Sci. 2019 Nov 18;11(2):577-586. doi: 10.1039/c9sc04026a. eCollection 2020 Jan 14.
9
Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads.量化 scallop-LR 在单分子长读段上进行转录本组装带来的益处。
Genome Biol. 2019 Dec 18;20(1):287. doi: 10.1186/s13059-019-1883-0.
10
Transcriptome assembly from long-read RNA-seq alignments with StringTie2.基于长读 RNA-seq 比对的转录组组装与 StringTie2。
Genome Biol. 2019 Dec 16;20(1):278. doi: 10.1186/s13059-019-1910-1.