A fast and globally optimal solution for RNA-seq quantification.

Author affiliations

Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, 97 Buxin Rd, Shenzhen, 518000, Guangdong, China.

School of Life Sciences, Southern University of Science and Technology, 1088 Xueyuan Blvd, Shenzhen 518055, Guangdong, China.

Publication information

Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad298.

DOI:10.1093/bib/bbad298
PMID:37595963
Abstract

Alignment-based RNA-seq quantification methods typically involve a time-consuming alignment process prior to estimating transcript abundances. In contrast, alignment-free RNA-seq quantification methods bypass this step, resulting in significant speed improvements. Existing alignment-free methods rely on the Expectation-Maximization (EM) algorithm for estimating transcript abundances. However, EM algorithms only guarantee locally optimal solutions, leaving room for further accuracy improvement by finding a globally optimal solution. In this study, we present TQSLE, the first alignment-free RNA-seq quantification method that provides a globally optimal solution for transcript abundance estimation. TQSLE adopts a two-step approach: first, it constructs a k-mer frequency matrix A for the reference transcriptome and a k-mer frequency vector b for the RNA-seq reads; then, it directly estimates transcript abundances by solving the linear equation $A^{\top}Ax = A^{\top}b$. We evaluated the performance of TQSLE using simulated and real RNA-seq data sets and observed that, while its speed is comparable to that of other alignment-free methods, TQSLE outperforms them in terms of accuracy. TQSLE is freely available at https://github.com/yhg926/TQSLE.
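
The two-step approach described in the abstract can be illustrated with a toy example. The sketch below is not the TQSLE implementation: the function names, the tiny sequences, and the choice of k = 3 are all assumptions made for illustration, and it ignores details a real tool would need (sequencing noise, reverse complements, normalization, sparse storage for large k-mer sets). It only shows why solving $A^{\top}Ax = A^{\top}b$ yields a single global optimum: the equation is the stationarity condition of the convex least-squares problem $\min_x \|Ax - b\|^2$, so when $A^{\top}A$ is invertible the solution is unique.

```python
from collections import Counter

import numpy as np


def kmer_counts(seq, k):
    """Count every k-mer (substring of length k) in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def build_system(transcripts, reads, k):
    """Build the k-mer frequency matrix A and the read k-mer vector b.

    Rows are k-mers found in the reference transcripts, columns are
    transcripts. K-mers seen only in reads are ignored in this sketch.
    """
    t_counts = [kmer_counts(t, k) for t in transcripts]
    r_counts = Counter()
    for read in reads:
        r_counts.update(kmer_counts(read, k))
    kmers = sorted(set().union(*t_counts))
    A = np.array([[c[km] for c in t_counts] for km in kmers], dtype=float)
    b = np.array([r_counts[km] for km in kmers], dtype=float)
    return A, b


def solve_normal_equations(A, b):
    """Solve A^T A x = A^T b; assumes A^T A is invertible."""
    return np.linalg.solve(A.T @ A, A.T @ b)


# Hypothetical toy data: two transcripts that share the k-mers GTA and TAC.
transcripts = ["ACGTAC", "GTACTT"]
# Idealized "reads": three full copies of transcript 0, one of transcript 1.
reads = ["ACGTAC"] * 3 + ["GTACTT"]

A, b = build_system(transcripts, reads, k=3)
x = solve_normal_equations(A, b)
print(x)  # estimated abundance per transcript
```

In this idealized setting b is an exact linear combination of the columns of A, so the solver recovers the copy numbers used to generate the reads. With real data the fit is only approximate, and `np.linalg.lstsq` would be a more robust choice when $A^{\top}A$ is ill-conditioned.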

Similar articles

1. RNA-Skim: a rapid method for RNA-Seq quantification at transcript level.
   Bioinformatics. 2014 Jun 15;30(12):i283-i292. doi: 10.1093/bioinformatics/btu288.
2. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.
   BMC Bioinformatics. 2011 Aug 4;12:323. doi: 10.1186/1471-2105-12-323.
3. TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.
   Bioinformatics. 2013 Sep 15;29(18):2292-9. doi: 10.1093/bioinformatics/btt381. Epub 2013 Jul 2.
4. Strawberry: Fast and accurate genome-guided transcript reconstruction and quantification from RNA-Seq.
   PLoS Comput Biol. 2017 Nov 27;13(11):e1005851. doi: 10.1371/journal.pcbi.1005851. eCollection 2017 Nov.
5. Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data.
   Bioinformatics. 2020 Feb 1;36(3):805-812. doi: 10.1093/bioinformatics/btz640.
6. A mixture model for expression deconvolution from RNA-seq in heterogeneous tissues.
   BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S11. doi: 10.1186/1471-2105-14-S5-S11. Epub 2013 Apr 10.
7. AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana.
   New Phytol. 2015 Oct;208(1):96-101. doi: 10.1111/nph.13545. Epub 2015 Jun 25.
8. EMSAR: estimation of transcript abundance from RNA-seq data by mappability-based segmentation and reclustering.
   BMC Bioinformatics. 2015 Sep 3;16:278. doi: 10.1186/s12859-015-0704-z.
9. Fast and accurate approximate inference of transcript expression from RNA-seq data.
   Bioinformatics. 2015 Dec 15;31(24):3881-9. doi: 10.1093/bioinformatics/btv483. Epub 2015 Aug 26.