• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

与参考基因组比较转录组分析方法。

A comparison of transcriptome analysis methods with reference genome.

机构信息

Department of Medical Genetics and Developmental Biology, School of Basic Medical Sciences, Capital Medical University, Beijing, China.

Beijing Key Laboratory of Neural Regeneration and Repair, Capital Medical University, Beijing, China.

出版信息

BMC Genomics. 2022 Mar 25;23(1):232. doi: 10.1186/s12864-022-08465-0.

DOI:10.1186/s12864-022-08465-0
PMID:35337265
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8957167/
Abstract

BACKGROUND

The application of RNA-seq technology has become more extensive and the number of analysis procedures available has increased over the past years. Selecting an appropriate workflow has become an important issue for researchers in the field.

METHODS

In our study, six popular analytical procedures/pipeline were compared using four RNA-seq datasets from mouse, human, rat, and macaque, respectively. The gene expression value, fold change of gene expression, and statistical significance were evaluated to compare the similarities and differences among the six procedures. qRT-PCR was performed to validate the differentially expressed genes (DEGs) from all six procedures.

RESULTS

Cufflinks-Cuffdiff demands the highest computing resources and Kallisto-Sleuth demands the least. Gene expression values, fold change, p and q values of differential expression (DE) analysis are highly correlated among procedures using HTseq for quantification. For genes with medium expression abundance, the expression values determined using the different procedures were similar. Major differences in expression values come from genes with particularly high or low expression levels. HISAT2-StringTie-Ballgown is more sensitive to genes with low expression levels, while Kallisto-Sleuth may only be useful to evaluate genes with medium to high abundance. When the same thresholds for fold change and p value are chosen in DE analysis, StringTie-Ballgown produce the least number of DEGs, while HTseq-DESeq2, -edgeR or -limma generally produces more DEGs. The performance of Cufflinks-Cuffdiff and Kallisto-Sleuth varies in different datasets. For DEGs with medium expression levels, the biological verification rates were similar among all procedures.

CONCLUSION

Results are highly correlated among RNA-seq analysis procedures using HTseq for quantification. Difference in gene expression values mainly come from genes with particularly high or low expression levels. Moreover, biological validation rates of DEGs from all six procedures were similar for genes with medium expression levels. Investigators can choose analytical procedures according to their available computer resources, or whether genes of high or low expression levels are of interest. If computer resources are abundant, one can utilize multiple procedures to obtain the intersection of results to get the most reliable DEGs, or to obtain a combination of results to get a more comprehensive DE profile for transcriptomes.

摘要

背景

近年来,RNA-seq 技术的应用越来越广泛,可用的分析流程也越来越多。选择合适的工作流程已成为该领域研究人员的重要问题。

方法

在我们的研究中,分别使用来自小鼠、人类、大鼠和猕猴的四个 RNA-seq 数据集,比较了六种流行的分析程序/流程。通过评估基因表达值、基因表达的倍数变化和统计显著性,比较了这六种程序之间的异同。对所有六种程序的差异表达基因(DEG)进行 qRT-PCR 验证。

结果

Cufflinks-Cuffdiff 对计算资源的要求最高,Kallisto-Sleuth 对计算资源的要求最低。使用 HTseq 进行定量分析时,六种程序的基因表达值、倍数变化、差异表达(DE)分析的 p 和 q 值高度相关。对于中等表达丰度的基因,不同程序确定的表达值相似。表达值的主要差异来自表达水平特别高或低的基因。HISAT2-StringTie-Ballgown 对低表达水平的基因更敏感,而 Kallisto-Sleuth 可能仅对中高丰度的基因有用。当在 DE 分析中选择相同的倍数变化和 p 值阈值时,StringTie-Ballgown 产生的 DEG 数量最少,而 HTseq-DESeq2、edgeR 或 limma 通常会产生更多的 DEG。Cufflinks-Cuffdiff 和 Kallisto-Sleuth 在不同的数据集上表现不同。对于中等表达水平的 DEG,所有程序的生物学验证率相似。

结论

使用 HTseq 进行定量分析时,RNA-seq 分析程序的结果高度相关。基因表达值的差异主要来自表达水平特别高或低的基因。此外,对于中等表达水平的 DEG,所有六种程序的生物学验证率相似。研究人员可以根据可用的计算机资源或高或低表达水平的基因是否感兴趣来选择分析程序。如果计算机资源丰富,可以利用多种程序获得结果的交集,以获得最可靠的 DEG,或者获得结果的组合,以获得转录组更全面的 DE 图谱。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1a5e31b1a59e/12864_2022_8465_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1ea02dd84617/12864_2022_8465_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/189dce2e13ca/12864_2022_8465_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/a5f8aebf260a/12864_2022_8465_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1c7292580395/12864_2022_8465_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/16e3e2d5a677/12864_2022_8465_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/55b23cc19991/12864_2022_8465_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1a5e31b1a59e/12864_2022_8465_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1ea02dd84617/12864_2022_8465_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/189dce2e13ca/12864_2022_8465_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/a5f8aebf260a/12864_2022_8465_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1c7292580395/12864_2022_8465_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/16e3e2d5a677/12864_2022_8465_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/55b23cc19991/12864_2022_8465_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c47e/8957167/1a5e31b1a59e/12864_2022_8465_Fig7_HTML.jpg

相似文献

1
A comparison of transcriptome analysis methods with reference genome.与参考基因组比较转录组分析方法。
BMC Genomics. 2022 Mar 25;23(1):232. doi: 10.1186/s12864-022-08465-0.
2
hppRNA-a Snakemake-based handy parameter-free pipeline for RNA-Seq analysis of numerous samples.hppRNA-a 是一个基于 SnakeMake 的、方便易用的无参 RNA-Seq 分析流水线,可用于大量样本的分析。
Brief Bioinform. 2018 Jul 20;19(4):622-626. doi: 10.1093/bib/bbw143.
3
Plant Transcriptome Analysis with HISAT-StringTie-Ballgown and TopHat-Cufflinks Pipelines.使用 HISAT-StringTie-Ballgown 和 TopHat-Cufflinks 流程进行植物转录组分析。
Methods Mol Biol. 2024;2812:203-213. doi: 10.1007/978-1-0716-3886-6_11.
4
RNA-Seq Data Analysis for Differential Gene Expression Using HISAT2-StringTie-Ballgown Pipeline.基于 HISAT2-StringTie-Ballgown 分析流程的 RNA-Seq 数据分析用于差异基因表达
Methods Mol Biol. 2024;2812:101-113. doi: 10.1007/978-1-0716-3886-6_5.
5
Robust identification of differentially expressed genes from RNA-seq data.从 RNA-seq 数据中稳健地识别差异表达基因。
Genomics. 2020 Mar;112(2):2000-2010. doi: 10.1016/j.ygeno.2019.11.012. Epub 2019 Nov 20.
6
bestDEG: a web-based application automatically combines various tools to precisely predict differentially expressed genes (DEGs) from RNA-Seq data.bestDEG:一个基于网络的应用程序,可以自动结合各种工具,从 RNA-Seq 数据中准确预测差异表达基因(DEGs)。
PeerJ. 2022 Nov 10;10:e14344. doi: 10.7717/peerj.14344. eCollection 2022.
7
A transcriptome software comparison for the analyses of treatments expected to give subtle gene expression responses.用于分析预期产生微妙基因表达响应的处理的转录组软件比较。
BMC Genomics. 2022 Jun 20;23(1):452. doi: 10.1186/s12864-022-08673-8.
8
Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2.三种 RNA 测序差异表达分析方法:limma、EdgeR、DESeq2。
J Vis Exp. 2021 Sep 18(175). doi: 10.3791/62528.
9
Transcriptome software results show significant variation among different commercial pipelines.转录组软件的结果表明,不同商业管道之间存在显著差异。
BMC Genomics. 2023 Nov 3;24(1):662. doi: 10.1186/s12864-023-09683-w.
10
RNA-seq Data Analysis for Differential Expression.用于差异表达分析的RNA测序数据分析
Methods Mol Biol. 2022;2391:45-54. doi: 10.1007/978-1-0716-1795-3_4.

引用本文的文献

1
Explicit Scale Simulation for analysis of RNA-sequencing count data with ALDEx2.使用ALDEx2对RNA测序计数数据进行分析的显式尺度模拟。
NAR Genom Bioinform. 2025 Aug 19;7(3):lqaf108. doi: 10.1093/nargab/lqaf108. eCollection 2025 Sep.
2
Biosynthesis of Camphane Volatile Terpenes in Lour: Involved Genes and Enzymes.樟属植物中莰烷挥发性萜类化合物的生物合成:相关基因和酶
Plants (Basel). 2025 Jun 10;14(12):1767. doi: 10.3390/plants14121767.
3
Transcriptomic signatures of prostate cancer progression: a comprehensive RNA-seq study.前列腺癌进展的转录组特征:一项全面的RNA测序研究。

本文引用的文献

1
Similarities and Differences in the Acute-Phase Response to SARS-CoV-2 in Rhesus Macaques and African Green Monkeys.恒河猴和绿猴对 SARS-CoV-2 的急性期反应的异同。
Front Immunol. 2021 Oct 6;12:754642. doi: 10.3389/fimmu.2021.754642. eCollection 2021.
2
Construction and analysis of the abnormal lncRNA-miRNA-mRNA network in hypoxic pulmonary hypertension.构建并分析低氧性肺动脉高压异常长链非编码 RNA-微小 RNA-信使 RNA 网络。
Biosci Rep. 2021 Aug 27;41(8). doi: 10.1042/BSR20210021.
3
Systematic comparison and assessment of RNA-seq procedures for gene expression quantitative analysis.
3 Biotech. 2025 May;15(5):135. doi: 10.1007/s13205-025-04297-3. Epub 2025 Apr 19.
4
Multi-Omics Profiling of Lipid Variation and Regulatory Mechanisms in Poultry Breast Muscles.家禽胸肌脂质变化及调控机制的多组学分析
Animals (Basel). 2025 Feb 27;15(5):694. doi: 10.3390/ani15050694.
5
Oocyte transcriptomes and follicular fluid proteomics of ovine atretic follicles reveal the underlying mechanisms of oocyte degeneration.绵羊闭锁卵泡的卵母细胞转录组和卵泡液蛋白质组学揭示了卵母细胞退化的潜在机制。
BMC Genomics. 2025 Feb 1;26(1):97. doi: 10.1186/s12864-025-11291-9.
6
Comparison of baseline global gene expression profiles of prostate cancer cell lines LNCaP and DU145.前列腺癌细胞系LNCaP和DU145的基线全基因表达谱比较。
BMC Res Notes. 2024 Dec 31;17(1):398. doi: 10.1186/s13104-024-07050-w.
7
Response of young rice panicles to salt stress: insights based on phenotype and transcriptome analysis.幼穗对盐胁迫的响应:基于表型和转录组分析的见解
Front Plant Sci. 2024 Oct 21;15:1451469. doi: 10.3389/fpls.2024.1451469. eCollection 2024.
8
Where the "ruber" Meets the Road: Using the Genome of the Red Diamond Rattlesnake to Unravel the Evolutionary Processes Driving Venom Evolution.橡胶之路:利用红钻响尾蛇基因组揭示驱动毒液进化的演化过程。
Genome Biol Evol. 2024 Sep 3;16(9). doi: 10.1093/gbe/evae198.
9
Adjuvant Effects of a CC Chemokine for Enhancing the Efficacy of an Inactivated Vaccine in Nile Tilapia ().一种CC趋化因子对增强尼罗罗非鱼灭活疫苗效力的佐剂作用()。 (括号内内容原文缺失,所以无法完整准确翻译)
Vaccines (Basel). 2024 Jun 8;12(6):641. doi: 10.3390/vaccines12060641.
10
A comprehensive workflow for optimizing RNA-seq data analysis.一种优化 RNA-seq 数据分析的综合工作流程。
BMC Genomics. 2024 Jun 24;25(1):631. doi: 10.1186/s12864-024-10414-y.
系统比较和评估 RNA-seq 程序进行基因表达定量分析。
Sci Rep. 2020 Nov 12;10(1):19737. doi: 10.1038/s41598-020-76881-x.
4
RNA sequencing: the teenage years.RNA 测序:青少年时期。
Nat Rev Genet. 2019 Nov;20(11):631-656. doi: 10.1038/s41576-019-0150-2. Epub 2019 Jul 24.
5
Abnormalities in Prefrontal Cortical Gene Expression Profiles Relevant to Schizophrenia in MK-801-Exposed C57BL/6 Mice.MK-801 暴露的 C57BL/6 小鼠前额叶皮层基因表达谱与精神分裂症相关的异常。
Neuroscience. 2018 Oct 15;390:60-78. doi: 10.1016/j.neuroscience.2018.07.046. Epub 2018 Aug 11.
6
Gaining comprehensive biological insight into the transcriptome by performing a broad-spectrum RNA-seq analysis.通过进行广谱RNA测序分析,全面深入了解转录组的生物学特性。
Nat Commun. 2017 Jul 5;8(1):59. doi: 10.1038/s41467-017-00050-4.
7
Differential analysis of RNA-seq incorporating quantification uncertainty.整合定量不确定性的 RNA-seq 差异分析。
Nat Methods. 2017 Jul;14(7):687-690. doi: 10.1038/nmeth.4324. Epub 2017 Jun 5.
8
A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.用于RNA测序数据差异表达分析的每个样本全局缩放和每个基因归一化方法的比较。
PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.
9
Salmon provides fast and bias-aware quantification of transcript expression.鲑鱼提供快速且无偏倚的转录本表达定量。
Nat Methods. 2017 Apr;14(4):417-419. doi: 10.1038/nmeth.4197. Epub 2017 Mar 6.
10
Empirical assessment of analysis workflows for differential expression analysis of human samples using RNA-Seq.使用RNA测序对人类样本进行差异表达分析的分析流程的实证评估。
BMC Bioinformatics. 2017 Jan 17;18(1):38. doi: 10.1186/s12859-016-1457-z.