• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

最佳α值可降低基因表达研究中的错误率:一种荟萃分析方法。

Optimal alpha reduces error rates in gene expression studies: a meta-analysis approach.

作者信息

Mudge J F, Martyniuk C J, Houlahan J E

机构信息

Department of Biology, Canadian Rivers Institute, University of New Brunswick, Saint John, NB, E2L 4L5, Canada.

Center for Environmental and Human Toxicology & Department of Physiological Sciences, UF Genetics Institute, University of Florida, Gainesville, Florida, 32611, USA.

出版信息

BMC Bioinformatics. 2017 Jun 21;18(1):312. doi: 10.1186/s12859-017-1728-3.

DOI:10.1186/s12859-017-1728-3
PMID:28637422
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5480162/
Abstract

BACKGROUND

Transcriptomic approaches (microarray and RNA-seq) have been a tremendous advance for molecular science in all disciplines, but they have made interpretation of hypothesis testing more difficult because of the large number of comparisons that are done within an experiment. The result has been a proliferation of techniques aimed at solving the multiple comparisons problem, techniques that have focused primarily on minimizing Type I error with little or no concern about concomitant increases in Type II errors. We have previously proposed a novel approach for setting statistical thresholds with applications for high throughput omics-data, optimal α, which minimizes the probability of making either error (i.e. Type I or II) and eliminates the need for post-hoc adjustments.

RESULTS

A meta-analysis of 242 microarray studies extracted from the peer-reviewed literature found that current practices for setting statistical thresholds led to very high Type II error rates. Further, we demonstrate that applying the optimal α approach results in error rates as low or lower than error rates obtained when using (i) no post-hoc adjustment, (ii) a Bonferroni adjustment and (iii) a false discovery rate (FDR) adjustment which is widely used in transcriptome studies.

CONCLUSIONS

We conclude that optimal α can reduce error rates associated with transcripts in both microarray and RNA-seq experiments, but point out that improved statistical techniques alone cannot solve the problems associated with high throughput datasets - these approaches need to be coupled with improved experimental design that considers larger sample sizes and/or greater study replication.

摘要

背景

转录组学方法(微阵列和RNA测序)在所有学科的分子科学领域都取得了巨大进展,但由于在一个实验中要进行大量比较,使得假设检验的解释变得更加困难。结果是大量旨在解决多重比较问题的技术不断涌现,这些技术主要侧重于将I型错误最小化,而很少或根本不考虑随之而来的II型错误增加。我们之前提出了一种用于设置统计阈值的新方法——最优α,适用于高通量组学数据,该方法能将犯任何一种错误(即I型或II型)的概率最小化,并且无需进行事后调整。

结果

对从同行评审文献中提取的242项微阵列研究进行的荟萃分析发现,当前设置统计阈值的做法导致了非常高的II型错误率。此外,我们证明,应用最优α方法所得到的错误率与使用(i)不进行事后调整、(ii)Bonferroni调整和(iii)转录组研究中广泛使用的错误发现率(FDR)调整时所获得的错误率一样低或更低。

结论

我们得出结论,最优α可以降低微阵列和RNA测序实验中转录本相关的错误率,但指出仅靠改进的统计技术无法解决与高通量数据集相关的问题——这些方法需要与考虑更大样本量和/或更高研究重复性的改进实验设计相结合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a4d/5480162/b25c5074ef58/12859_2017_1728_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a4d/5480162/b25c5074ef58/12859_2017_1728_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a4d/5480162/b25c5074ef58/12859_2017_1728_Fig1_HTML.jpg

相似文献

1
Optimal alpha reduces error rates in gene expression studies: a meta-analysis approach.最佳α值可降低基因表达研究中的错误率:一种荟萃分析方法。
BMC Bioinformatics. 2017 Jun 21;18(1):312. doi: 10.1186/s12859-017-1728-3.
2
Comparative evaluation of gene set analysis approaches for RNA-Seq data.RNA测序数据基因集分析方法的比较评估
BMC Bioinformatics. 2014 Dec 5;15(1):397. doi: 10.1186/s12859-014-0397-8.
3
Sample size calculation while controlling false discovery rate for differential expression analysis with RNA-sequencing experiments.在RNA测序实验的差异表达分析中控制错误发现率时的样本量计算。
BMC Bioinformatics. 2016 Mar 31;17:146. doi: 10.1186/s12859-016-0994-9.
4
LPEseq: Local-Pooled-Error Test for RNA Sequencing Experiments with a Small Number of Replicates.LPEseq:针对少量重复样本的RNA测序实验的局部合并误差检验
PLoS One. 2016 Aug 17;11(8):e0159182. doi: 10.1371/journal.pone.0159182. eCollection 2016.
5
A note on the false discovery rate and inconsistent comparisons between experiments.关于错误发现率及实验间不一致比较的说明
Bioinformatics. 2008 May 15;24(10):1225-8. doi: 10.1093/bioinformatics/btn120. Epub 2008 Apr 19.
6
Towards the integration, annotation and association of historical microarray experiments with RNA-seq.实现历史基因芯片实验与 RNA-seq 的整合、注释和关联。
BMC Bioinformatics. 2013;14 Suppl 14(Suppl 14):S4. doi: 10.1186/1471-2105-14-S14-S4. Epub 2013 Oct 9.
7
Correlation between RNA-Seq and microarrays results using TCGA data.使用TCGA数据的RNA测序与微阵列结果之间的相关性。
Gene. 2017 Sep 10;628:200-204. doi: 10.1016/j.gene.2017.07.056. Epub 2017 Jul 20.
8
RNA-Seq provides new insights in the transcriptome responses induced by the carcinogen benzo[a]pyrene.RNA-Seq 为致癌物质苯并[a]芘诱导的转录组反应提供了新的见解。
Toxicol Sci. 2012 Dec;130(2):427-39. doi: 10.1093/toxsci/kfs250. Epub 2012 Aug 13.
9
A weighted sample size for microarray datasets that considers the variability of variance and multiplicity.一种加权样本大小的微阵列数据集,考虑了方差和多重性的可变性。
J Biosci Bioeng. 2009 Sep;108(3):252-8. doi: 10.1016/j.jbiosc.2009.03.017.
10
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline.RNA测序数据的基因集分析方法:性能评估与应用指南
Brief Bioinform. 2016 May;17(3):393-407. doi: 10.1093/bib/bbv069. Epub 2015 Sep 4.

引用本文的文献

1
Chronic arsenic exposure induces malignant transformation of human HaCaT cells through both deterministic and stochastic changes in transcriptome expression.慢性砷暴露通过转录组表达的确定性和随机性变化诱导人 HaCaT 细胞的恶性转化。
Toxicol Appl Pharmacol. 2024 Mar;484:116865. doi: 10.1016/j.taap.2024.116865. Epub 2024 Feb 17.
2
Mbnl2 loss alters novel context processing and impairs object recognition memory.Mbnl2缺失会改变新奇情境处理并损害物体识别记忆。
iScience. 2023 Apr 25;26(5):106732. doi: 10.1016/j.isci.2023.106732. eCollection 2023 May 19.
3
Applications of large-scale molecular profiling techniques to the study of the corpus luteum.

本文引用的文献

1
Comprehensive Assessments of RNA-seq by the SEQC Consortium: FDA-Led Efforts Advance Precision Medicine.SEQC联盟对RNA测序的全面评估:由美国食品药品监督管理局牵头的工作推动精准医学发展。
Pharmaceutics. 2016 Mar 15;8(1):8. doi: 10.3390/pharmaceutics8010008.
2
The Overlooked Fact: Fundamental Need for Spike-In Control for Virtually All Genome-Wide Analyses.被忽视的事实:几乎所有全基因组分析对掺入对照的根本需求。
Mol Cell Biol. 2015 Dec 28;36(5):662-7. doi: 10.1128/MCB.00970-14.
3
Considerations for the use of transcriptomics in identifying the 'genes that matter' for environmental adaptation.
大规模分子谱分析技术在黄体研究中的应用。
Anim Reprod. 2018 Aug 3;15(Suppl 1):791-804. doi: 10.21451/1984-3143-AR2018-0038. eCollection 2018 Jul-Sep.
4
Insights from two independent transcriptomic studies of the bovine corpus luteum during pregnancy.妊娠期牛黄体的两个独立转录组学研究的见解。
J Anim Sci. 2022 Jul 1;100(7). doi: 10.1093/jas/skac115.
5
The p value wars (again).p值之争(再次上演)。
Eur J Nucl Med Mol Imaging. 2019 Nov;46(12):2421-2423. doi: 10.1007/s00259-019-04467-5.
6
Molecular tumor analysis and liquid biopsy: a feasibility investigation analyzing circulating tumor DNA in patients with central nervous system lymphomas.分子肿瘤分析和液体活检:一项分析中枢神经系统淋巴瘤患者循环肿瘤 DNA 的可行性研究。
BMC Cancer. 2019 Mar 1;19(1):192. doi: 10.1186/s12885-019-5394-x.
7
The quest for an optimal alpha.追求最优阿尔法。
PLoS One. 2019 Jan 2;14(1):e0208631. doi: 10.1371/journal.pone.0208631. eCollection 2019.
8
Expression analysis of RNA sequencing data from human neural and glial cell lines depends on technical replication and normalization methods.从人类神经和神经胶质细胞系的 RNA 测序数据的表达分析取决于技术复制和归一化方法。
BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):412. doi: 10.1186/s12859-018-2382-0.
关于使用转录组学来识别环境适应性“关键基因”的考量因素。
J Exp Biol. 2015 Jun;218(Pt 12):1925-35. doi: 10.1242/jeb.114306.
4
Transcriptomics of developing embryos and organs: A raising tool for evo-devo.发育胚胎和器官的转录组学:一种用于进化发育生物学研究的新兴工具。
J Exp Zool B Mol Dev Evol. 2015 Jun;324(4):363-71. doi: 10.1002/jez.b.22595. Epub 2014 Nov 11.
5
Points of significance: Comparing samples—part I.重要要点:样本比较——第一部分。
Nat Methods. 2014 Mar;11(3):215-6. doi: 10.1038/nmeth.2858.
6
voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.voom:精确权重为RNA测序读数计数解锁线性模型分析工具。
Genome Biol. 2014 Feb 3;15(2):R29. doi: 10.1186/gb-2014-15-2-r29.
7
Revisiting global gene expression analysis.重新审视全球基因表达分析。
Cell. 2012 Oct 26;151(3):476-82. doi: 10.1016/j.cell.2012.10.012.
8
Negative consequences of using α = 0.05 for environmental monitoring decisions: a case study from a decade of Canada's Environmental Effects Monitoring Program.使用 α = 0.05 进行环境监测决策的负面后果:来自加拿大环境影响监测计划十年的案例研究。
Environ Sci Technol. 2012 Sep 4;46(17):9249-55. doi: 10.1021/es301320n. Epub 2012 Aug 17.
9
If all of your friends used α = 0.05, would you do it too?如果你所有的朋友都使用α = 0.05,你也会这么做吗?
Integr Environ Assess Manag. 2012 Jul;8(3):563-4. doi: 10.1002/ieam.1313.
10
Setting an optimal α that minimizes errors in null hypothesis significance tests.设置一个最优的α,使零假设显著性检验中的错误最小化。
PLoS One. 2012;7(2):e32734. doi: 10.1371/journal.pone.0032734. Epub 2012 Feb 28.