• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TIGAR2:利用更长的RNA测序读段对转录本异构体表达进行灵敏且准确的估计。

TIGAR2: sensitive and accurate estimation of transcript isoform expression with longer RNA-Seq reads.

作者信息

Nariai Naoki, Kojima Kaname, Mimori Takahiro, Sato Yukuto, Kawai Yosuke, Yamaguchi-Kabata Yumi, Nagasaki Masao

出版信息

BMC Genomics. 2014;15 Suppl 10(Suppl 10):S5. doi: 10.1186/1471-2164-15-S10-S5. Epub 2014 Dec 12.

DOI:10.1186/1471-2164-15-S10-S5
PMID:25560536
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4304212/
Abstract

BACKGROUND

High-throughput RNA sequencing (RNA-Seq) enables quantification and identification of transcripts at single-base resolution. Recently, longer sequence reads become available thanks to the development of new types of sequencing technologies as well as improvements in chemical reagents for the Next Generation Sequencers. Although several computational methods have been proposed for quantifying gene expression levels from RNA-Seq data, they are not sufficiently optimized for longer reads (e.g. >250 bp).

RESULTS

We propose TIGAR2, a statistical method for quantifying transcript isoforms from fixed and variable length RNA-Seq data. Our method models substitution, deletion, and insertion errors of sequencers based on gapped-alignments of reads to the reference cDNA sequences so that sensitive read-aligners such as Bowtie2 and BWA-MEM are effectively incorporated in our pipeline. Also, a heuristic algorithm is implemented in variational Bayesian inference for faster computation. We apply TIGAR2 to both simulation data and real data of human samples and evaluate performance of transcript quantification with TIGAR2 in comparison to existing methods.

CONCLUSIONS

TIGAR2 is a sensitive and accurate tool for quantifying transcript isoform abundances from RNA-Seq data. Our method performs better than existing methods for the fixed-length reads (100 bp, 250 bp, 500 bp, and 1000 bp of both single-end and paired-end) and variable-length reads, especially for reads longer than 250 bp.

摘要

背景

高通量RNA测序(RNA-Seq)能够在单碱基分辨率下对转录本进行定量和鉴定。近来,由于新型测序技术的发展以及新一代测序仪化学试剂的改进,可获得更长的序列 reads。尽管已经提出了几种从RNA-Seq数据中定量基因表达水平的计算方法,但它们对于更长的reads(例如>250 bp)并未进行充分优化。

结果

我们提出了TIGAR2,一种用于从固定长度和可变长度RNA-Seq数据中定量转录本异构体的统计方法。我们的方法基于reads与参考cDNA序列的间隙比对,对测序仪的替换、缺失和插入错误进行建模,以便将诸如Bowtie2和BWA-MEM等灵敏的read-aligner有效地纳入我们的流程。此外,在变分贝叶斯推理中实现了一种启发式算法以加快计算速度。我们将TIGAR2应用于人类样本的模拟数据和真实数据,并与现有方法相比评估TIGAR2在转录本定量方面的性能。

结论

TIGAR2是一种用于从RNA-Seq数据中定量转录本异构体丰度的灵敏且准确的工具。我们的方法在固定长度reads(单端和双端的100 bp、250 bp、500 bp和1000 bp)和可变长度reads方面比现有方法表现更好,尤其是对于长度超过250 bp的reads。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/d967a885b00d/1471-2164-15-S10-S5-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/9dc792caa471/1471-2164-15-S10-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/5054dac2fe10/1471-2164-15-S10-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/dd72dd86f70c/1471-2164-15-S10-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/0a596bb6862d/1471-2164-15-S10-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/23c91b1f8595/1471-2164-15-S10-S5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/6e6255309f82/1471-2164-15-S10-S5-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/d967a885b00d/1471-2164-15-S10-S5-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/9dc792caa471/1471-2164-15-S10-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/5054dac2fe10/1471-2164-15-S10-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/dd72dd86f70c/1471-2164-15-S10-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/0a596bb6862d/1471-2164-15-S10-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/23c91b1f8595/1471-2164-15-S10-S5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/6e6255309f82/1471-2164-15-S10-S5-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3da8/4304212/d967a885b00d/1471-2164-15-S10-S5-7.jpg

相似文献

1
TIGAR2: sensitive and accurate estimation of transcript isoform expression with longer RNA-Seq reads.TIGAR2:利用更长的RNA测序读段对转录本异构体表达进行灵敏且准确的估计。
BMC Genomics. 2014;15 Suppl 10(Suppl 10):S5. doi: 10.1186/1471-2164-15-S10-S5. Epub 2014 Dec 12.
2
TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.TIGAR:一种通过变分贝叶斯推断进行 RNA-Seq 数据缺口对齐的转录本丰度估计方法。
Bioinformatics. 2013 Sep 15;29(18):2292-9. doi: 10.1093/bioinformatics/btt381. Epub 2013 Jul 2.
3
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.RSEM:有或无参考基因组的 RNA-Seq 数据的准确转录本定量。
BMC Bioinformatics. 2011 Aug 4;12:323. doi: 10.1186/1471-2105-12-323.
4
Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms.旗鱼能够使用轻量级算法从RNA测序读段中进行无比对的异构体定量分析。
Nat Biotechnol. 2014 May;32(5):462-4. doi: 10.1038/nbt.2862. Epub 2014 Apr 20.
5
A robust method for transcript quantification with RNA-seq data.一种利用RNA测序数据进行转录本定量的可靠方法。
J Comput Biol. 2013 Mar;20(3):167-87. doi: 10.1089/cmb.2012.0230.
6
A Bayesian approach for estimating allele-specific expression from RNA-Seq data with diploid genomes.一种用于从具有二倍体基因组的RNA测序数据中估计等位基因特异性表达的贝叶斯方法。
BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):2. doi: 10.1186/s12864-015-2295-5.
7
Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads.利用多映射 RNA-seq reads 进行单倍型和异构体特异性表达估计。
Genome Biol. 2011;12(2):R13. doi: 10.1186/gb-2011-12-2-r13. Epub 2011 Feb 10.
8
RNA-Seq gene expression estimation with read mapping uncertainty.基于读段比对不确定性的 RNA-Seq 基因表达估计。
Bioinformatics. 2010 Feb 15;26(4):493-500. doi: 10.1093/bioinformatics/btp692. Epub 2009 Dec 18.
9
Statistical inferences for isoform expression in RNA-Seq.RNA测序中异构体表达的统计推断。
Bioinformatics. 2009 Apr 15;25(8):1026-32. doi: 10.1093/bioinformatics/btp113. Epub 2009 Feb 25.
10
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.

引用本文的文献

1
Analysis of gene expression of Babesia gibsoni cultured with diminazene aceturate using RNA sequencing.使用RNA测序分析用乙酰马啉培养的吉氏巴贝斯虫的基因表达。
J Vet Med Sci. 2025 Feb 4;87(2):181-188. doi: 10.1292/jvms.24-0395. Epub 2024 Dec 30.
2
Clock Gene Expression in Eel Retina and Hypothalamus: Response to Photoperiod and Moonlight.时钟基因在鳗鱼视网膜和下丘脑的表达:对光周期和月光的反应。
J Exp Zool A Ecol Integr Physiol. 2025 Jan;343(1):81-94. doi: 10.1002/jez.2870. Epub 2024 Oct 7.
3
Oarfish: Enhanced probabilistic modeling leads to improved accuracy in long read transcriptome quantification.

本文引用的文献

1
Fast and accurate approximate inference of transcript expression from RNA-seq data.从RNA测序数据中快速准确地进行转录本表达的近似推断。
Bioinformatics. 2015 Dec 15;31(24):3881-9. doi: 10.1093/bioinformatics/btv483. Epub 2015 Aug 26.
2
Improved variational Bayes inference for transcript expression estimation.用于转录本表达估计的改进变分贝叶斯推理
Stat Appl Genet Mol Biol. 2014 Apr 1;13(2):203-16. doi: 10.1515/sagmb-2013-0054.
3
TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference.
皇带鱼:增强的概率模型可提高长读长转录组定量的准确性。
bioRxiv. 2024 Mar 1:2024.02.28.582591. doi: 10.1101/2024.02.28.582591.
4
Perplexity: evaluating transcript abundance estimation in the absence of ground truth.困惑度:在缺乏真实对照的情况下评估转录本丰度估计
Algorithms Mol Biol. 2022 Mar 25;17(1):6. doi: 10.1186/s13015-022-00214-y.
5
Antiemetic effects of baclofen in a shrew model of postoperative nausea and vomiting: Whole-transcriptome analysis in the nucleus of the solitary tract.巴氯芬在术后恶心呕吐的鼩鼱模型中的止吐作用:孤束核的全转录组分析。
CNS Neurosci Ther. 2022 Jun;28(6):922-931. doi: 10.1111/cns.13823. Epub 2022 Mar 3.
6
LIQA: long-read isoform quantification and analysis.LIQA:长读 isoform 定量分析。
Genome Biol. 2021 Jun 17;22(1):182. doi: 10.1186/s13059-021-02399-8.
7
Factorial study of the RNA-seq computational workflow identifies biases as technical gene signatures.RNA测序计算工作流程的析因研究将偏差识别为技术基因特征。
NAR Genom Bioinform. 2020 Jun 29;2(2):lqaa043. doi: 10.1093/nargab/lqaa043. eCollection 2020 Jun.
8
A Bayesian framework for inter-cellular information sharing improves dscRNA-seq quantification.贝叶斯框架用于细胞间信息共享可提高 dscRNA-seq 的定量分析。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i292-i299. doi: 10.1093/bioinformatics/btaa450.
9
Gene expression patterns of novel visual and non-visual opsin families in immature and mature Japanese eel males.未成熟和成熟日本鳗鱼雄性中新型视觉和非视觉视蛋白家族的基因表达模式
PeerJ. 2020 Feb 27;8:e8326. doi: 10.7717/peerj.8326. eCollection 2020.
10
Uncovering full-length transcript isoforms of sugarcane cultivar Khon Kaen 3 using single-molecule long-read sequencing.利用单分子长读长测序揭示甘蔗品种孔敬3号的全长转录本异构体
PeerJ. 2018 Oct 30;6:e5818. doi: 10.7717/peerj.5818. eCollection 2018.
TIGAR:一种通过变分贝叶斯推断进行 RNA-Seq 数据缺口对齐的转录本丰度估计方法。
Bioinformatics. 2013 Sep 15;29(18):2292-9. doi: 10.1093/bioinformatics/btt381. Epub 2013 Jul 2.
4
Shining a light on dark sequencing: characterising errors in Ion Torrent PGM data.照亮黑暗测序:剖析 Ion Torrent PGM 数据中的错误。
PLoS Comput Biol. 2013 Apr;9(4):e1003031. doi: 10.1371/journal.pcbi.1003031. Epub 2013 Apr 11.
5
A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers.三代测序平台的故事:Ion Torrent、Pacific Biosciences 和 Illumina MiSeq 测序仪的比较。
BMC Genomics. 2012 Jul 24;13:341. doi: 10.1186/1471-2164-13-341.
6
Identifying differentially expressed transcripts from RNA-seq data with biological variation.从具有生物学变异的 RNA-seq 数据中鉴定差异表达的转录本。
Bioinformatics. 2012 Jul 1;28(13):1721-8. doi: 10.1093/bioinformatics/bts260. Epub 2012 May 3.
7
Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration.综合基因组浏览器(IGV):高性能基因组学数据可视化和探索。
Brief Bioinform. 2013 Mar;14(2):178-92. doi: 10.1093/bib/bbs017. Epub 2012 Apr 19.
8
Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。
Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.
9
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.RNA-seq 实验中使用 TopHat 和 Cufflinks 的差异基因和转录本表达分析。
Nat Protoc. 2012 Mar 1;7(3):562-78. doi: 10.1038/nprot.2012.016.
10
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.RSEM:有或无参考基因组的 RNA-Seq 数据的准确转录本定量。
BMC Bioinformatics. 2011 Aug 4;12:323. doi: 10.1186/1471-2105-12-323.