• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在半合成 RNA-seq 数据模拟中忽略标准化的影响会产生人为的假阳性。

Neglecting the impact of normalization in semi-synthetic RNA-seq data simulations generates artificial false positives.

机构信息

Univ. Bordeaux, INSERM Bordeaux Population Health Research Center, U1219, INRIA SISTM, Bordeaux, F-33000, France.

Vaccine Research Institute, Créteil, F-94000, France.

出版信息

Genome Biol. 2024 Oct 30;25(1):281. doi: 10.1186/s13059-024-03231-9.

DOI:10.1186/s13059-024-03231-9
PMID:39478633
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11523660/
Abstract

A recent study reported exaggerated false positives by popular differential expression methods when analyzing large population samples. We reproduce the differential expression analysis simulation results and identify a caveat in the data generation process. Data not truly generated under the null hypothesis led to incorrect comparisons of benchmark methods. We provide corrected simulation results that demonstrate the good performance of dearseq and argue against the superiority of the Wilcoxon rank-sum test as suggested in the previous study.

摘要

最近的一项研究报告称,在分析大型人群样本时,流行的差异表达方法得出了夸张的假阳性结果。我们重现了差异表达分析的模拟结果,并发现了数据生成过程中的一个注意事项。数据并非真正在零假设下生成,导致基准方法的比较出现错误。我们提供了经过修正的模拟结果,证明了 dearseq 的良好性能,并反驳了之前研究中提出的 Wilcoxon 秩和检验具有优越性的观点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc8/11523660/984f841cd53f/13059_2024_3231_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc8/11523660/944d14967d7b/13059_2024_3231_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc8/11523660/984f841cd53f/13059_2024_3231_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc8/11523660/944d14967d7b/13059_2024_3231_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc8/11523660/984f841cd53f/13059_2024_3231_Fig2_HTML.jpg

相似文献

1
Neglecting the impact of normalization in semi-synthetic RNA-seq data simulations generates artificial false positives.在半合成 RNA-seq 数据模拟中忽略标准化的影响会产生人为的假阳性。
Genome Biol. 2024 Oct 30;25(1):281. doi: 10.1186/s13059-024-03231-9.
2
Response to "Neglecting normalization impact in semi-synthetic RNA-seq data simulation generates artificial false positives" and "Winsorization greatly reduces false positives by popular differential expression methods when analyzing human population samples".回应“在半合成 RNA-seq 数据模拟中忽略归一化影响会产生人为的假阳性”和“在分析人类群体样本时,流行的差异表达方法通过峰度化极大地减少了假阳性”。
Genome Biol. 2024 Oct 30;25(1):283. doi: 10.1186/s13059-024-03232-8.
3
Winsorization greatly reduces false positives by popular differential expression methods when analyzing human population samples.Winsorization 极大地减少了分析人类群体样本时常用的差异表达方法中的假阳性。
Genome Biol. 2024 Oct 30;25(1):282. doi: 10.1186/s13059-024-03230-w.
4
Exaggerated false positives by popular differential expression methods when analyzing human population samples.分析人类群体样本时,常用差异表达方法会导致假阳性结果夸大。
Genome Biol. 2022 Mar 15;23(1):79. doi: 10.1186/s13059-022-02648-4.
5
Benchmarking RNA-seq differential expression analysis methods using spike-in and simulation data.使用 Spike-in 和模拟数据进行 RNA-seq 差异表达分析方法的基准测试。
PLoS One. 2020 Apr 30;15(4):e0232271. doi: 10.1371/journal.pone.0232271. eCollection 2020.
6
Benchmarking differential expression analysis tools for RNA-Seq: normalization-based vs. log-ratio transformation-based methods.RNA-Seq 差异表达分析工具的基准测试:基于标准化与基于对数比变换的方法。
BMC Bioinformatics. 2018 Jul 18;19(1):274. doi: 10.1186/s12859-018-2261-8.
7
A benchmark of RNA-seq data normalization methods for transcriptome mapping on human genome-scale metabolic networks.基于人类基因组规模代谢网络的转录组映射的 RNA-seq 数据标准化方法基准测试。
NPJ Syst Biol Appl. 2024 Oct 24;10(1):124. doi: 10.1038/s41540-024-00448-z.
8
A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data.用于RNA测序数据差异表达分析的每个样本全局缩放和每个基因归一化方法的比较。
PLoS One. 2017 May 1;12(5):e0176185. doi: 10.1371/journal.pone.0176185. eCollection 2017.
9
Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions.从假设的角度选择样本间 RNA-Seq 标准化方法。
Brief Bioinform. 2018 Sep 28;19(5):776-792. doi: 10.1093/bib/bbx008.
10
Choice of library size normalization and statistical methods for differential gene expression analysis in balanced two-group comparisons for RNA-seq studies.RNA-seq 研究中平衡两组比较差异基因表达分析的库大小标准化和统计方法选择。
BMC Genomics. 2020 Jan 28;21(1):75. doi: 10.1186/s12864-020-6502-7.

引用本文的文献

1
Metformin-Enhanced Secretome from Periodontal Ligament Stem Cells Promotes Functional Recovery in an Inflamed Periodontal Model: In Vitro Study.二甲双胍增强的牙周膜干细胞分泌组促进炎症性牙周模型中的功能恢复:体外研究
J Funct Biomater. 2025 May 13;16(5):177. doi: 10.3390/jfb16050177.
2
Response to "Neglecting normalization impact in semi-synthetic RNA-seq data simulation generates artificial false positives" and "Winsorization greatly reduces false positives by popular differential expression methods when analyzing human population samples".回应“在半合成 RNA-seq 数据模拟中忽略归一化影响会产生人为的假阳性”和“在分析人类群体样本时,流行的差异表达方法通过峰度化极大地减少了假阳性”。
Genome Biol. 2024 Oct 30;25(1):283. doi: 10.1186/s13059-024-03232-8.

本文引用的文献

1
Exaggerated false positives by popular differential expression methods when analyzing human population samples.分析人类群体样本时,常用差异表达方法会导致假阳性结果夸大。
Genome Biol. 2022 Mar 15;23(1):79. doi: 10.1186/s13059-022-02648-4.
2
dearseq: a variance component score test for RNA-seq differential analysis that effectively controls the false discovery rate.DearSeq:一种用于RNA测序差异分析的方差成分评分检验,可有效控制错误发现率。
NAR Genom Bioinform. 2020 Nov 19;2(4):lqaa093. doi: 10.1093/nargab/lqaa093. eCollection 2020 Dec.
3
Data quality aware analysis of differential expression in RNA-seq with NOISeq R/Bioc package.
使用NOISeq R/Bioc软件包对RNA测序中的差异表达进行数据质量感知分析。
Nucleic Acids Res. 2015 Dec 2;43(21):e140. doi: 10.1093/nar/gkv711. Epub 2015 Jul 16.
4
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.使用DESeq2对RNA测序数据的倍数变化和离散度进行适度估计。
Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8.
5
Error estimates for the analysis of differential expression from RNA-seq count data.RNA-seq 计数数据差异表达分析的误差估计。
PeerJ. 2014 Sep 23;2:e576. doi: 10.7717/peerj.576. eCollection 2014.
6
voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.voom:精确权重为RNA测序读数计数解锁线性模型分析工具。
Genome Biol. 2014 Feb 3;15(2):R29. doi: 10.1186/gb-2014-15-2-r29.
7
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.edgeR:一个用于数字基因表达数据差异表达分析的 Bioconductor 包。
Bioinformatics. 2010 Jan 1;26(1):139-40. doi: 10.1093/bioinformatics/btp616. Epub 2009 Nov 11.