ROBUST HYPERPARAMETER ESTIMATION PROTECTS AGAINST HYPERVARIABLE GENES AND IMPROVES POWER TO DETECT DIFFERENTIAL EXPRESSION.

Authors

Phipson Belinda, Lee Stanley, Majewski Ian J, Alexander Warren S, Smyth Gordon K

Affiliations

Murdoch Childrens Research Institute.

The Walter and Eliza Hall Institute of Medical Research; The University of Melbourne.

Publication information

Ann Appl Stat. 2016 Jun;10(2):946-963. doi: 10.1214/16-AOAS920. Epub 2016 Jul 22.

DOI:10.1214/16-AOAS920
PMID:28367255
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5373812/
Abstract

One of the most common analysis tasks in genomic research is to identify genes that are differentially expressed (DE) between experimental conditions. Empirical Bayes (EB) statistical tests using moderated genewise variances have been very effective for this purpose, especially when the number of biological replicate samples is small. The EB procedures can however be heavily influenced by a small number of genes with very large or very small variances. This article improves the differential expression tests by robustifying the hyperparameter estimation procedure. The robust procedure has the effect of decreasing the informativeness of the prior distribution for outlier genes while increasing its informativeness for other genes. This effect has the double benefit of reducing the chance that hypervariable genes will be spuriously identified as DE while increasing statistical power for the main body of genes. The robust EB algorithm is fast and numerically stable. The procedure allows exact small-sample null distributions for the test statistics and reduces exactly to the original EB procedure when no outlier genes are present. Simulations show that the robustified tests have similar performance to the original tests in the absence of outlier genes but have greater power and robustness when outliers are present. The article includes case studies for which the robust method correctly identifies and downweights genes associated with hidden covariates and detects more genes likely to be scientifically relevant to the experimental conditions. The new procedure is implemented in the limma software package freely available from the Bioconductor repository.
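
To make the shrinkage idea in the abstract concrete, the following minimal Python sketch shows the standard moderated-variance formula, in which each gene's sample variance is averaged with a prior variance, weighted by their degrees of freedom. It does not estimate the hyperparameters and is not the paper's robust algorithm: the prior values `s0_sq` and `d0`, the function names, and all numbers are illustrative assumptions. It only illustrates why giving an outlier gene a less informative prior (a smaller `d0`) leaves its large variance mostly intact and so lowers its test statistic.

```python
import math


def moderated_variance(s2, df, s0_sq, d0):
    """Weighted average of the prior variance and the gene's sample variance.

    s2    : sample variance for one gene
    df    : residual degrees of freedom for that gene
    s0_sq : prior variance (assumed given here, not estimated)
    d0    : prior degrees of freedom (assumed given here, not estimated)
    """
    return (d0 * s0_sq + df * s2) / (d0 + df)


def moderated_t(beta_hat, unscaled_sd, s2, df, s0_sq, d0):
    """Moderated t-statistic and its degrees of freedom (d0 + df)."""
    s2_tilde = moderated_variance(s2, df, s0_sq, d0)
    t = beta_hat / (unscaled_sd * math.sqrt(s2_tilde))
    return t, d0 + df


# Hypothetical hypervariable gene: sample variance far above the prior variance.
s2, df, s0_sq = 4.0, 4, 0.25

# Same prior for every gene (illustrative d0 = 8): strong shrinkage toward s0_sq.
var_shared = moderated_variance(s2, df, s0_sq, d0=8)   # 1.5

# Less informative prior for the outlier gene (illustrative d0 = 1): weak shrinkage.
var_outlier = moderated_variance(s2, df, s0_sq, d0=1)  # 3.25

t_shared, df_shared = moderated_t(2.0, 1.0, s2, df, s0_sq, d0=8)
t_outlier, df_outlier = moderated_t(2.0, 1.0, s2, df, s0_sq, d0=1)
print(var_shared, var_outlier, t_shared > t_outlier)
```

With the shared prior the gene's variance is pulled down from 4.0 to 1.5, which inflates its t-statistic; with the smaller prior degrees of freedom it stays at 3.25 and the statistic is correspondingly smaller. In the limma package itself, the robust procedure is, to our understanding, selected with `eBayes(fit, robust=TRUE)`.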


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08f6/5373812/eca92eac1a58/nihms853855f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08f6/5373812/2a2d7dbaeb0d/nihms853855f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08f6/5373812/58bed7e45b91/nihms853855f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08f6/5373812/c4dc79090466/nihms853855f4.jpg

Similar articles

1
ROBUST HYPERPARAMETER ESTIMATION PROTECTS AGAINST HYPERVARIABLE GENES AND IMPROVES POWER TO DETECT DIFFERENTIAL EXPRESSION.
Ann Appl Stat. 2016 Jun;10(2):946-963. doi: 10.1214/16-AOAS920. Epub 2016 Jul 22.
2
Use of within-array replicate spots for assessing differential expression in microarray experiments.
Bioinformatics. 2005 May 1;21(9):2067-75. doi: 10.1093/bioinformatics/bti270. Epub 2005 Jan 18.
3
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns.
PLoS One. 2015 Sep 28;10(9):e0138810. doi: 10.1371/journal.pone.0138810. eCollection 2015.
4
Robust principal component analysis for accurate outlier sample detection in RNA-Seq data.
BMC Bioinformatics. 2020 Jun 29;21(1):269. doi: 10.1186/s12859-020-03608-0.
5
Bayesian robust inference for differential gene expression in microarrays with multiple samples.
Biometrics. 2006 Mar;62(1):10-8. doi: 10.1111/j.1541-0420.2005.00397.x.
6
A Robust Approach for Identification of Cancer Biomarkers and Candidate Drugs.
Medicina (Kaunas). 2019 Jun 11;55(6):269. doi: 10.3390/medicina55060269.
7
Empirical Bayes screening of many p-values with applications to microarray studies.
Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.
8
Variance adaptive shrinkage (vash): flexible empirical Bayes estimation of variances.
Bioinformatics. 2016 Nov 15;32(22):3428-3434. doi: 10.1093/bioinformatics/btw483. Epub 2016 Jul 19.
9
No counts, no variance: allowing for loss of degrees of freedom when assessing biological variability from RNA-seq data.
Stat Appl Genet Mol Biol. 2017 Apr 25;16(2):83-93. doi: 10.1515/sagmb-2017-0010.
10
aFold - using polynomial uncertainty modelling for differential gene expression estimation from RNA sequencing data.
BMC Genomics. 2019 May 10;20(1):364. doi: 10.1186/s12864-019-5686-1.

Cited by

1
UBE3A reinstatement restores behavior and proteome in an Angelman syndrome mouse model of imprinting defects.
Mol Autism. 2025 Aug 28;16(1):45. doi: 10.1186/s13229-025-00675-z.
2
Obesity impact on leukocyte telomere shortening and immune aging assessed by Mendelian randomization and transcriptomics analysis.
Sci Rep. 2025 Aug 23;15(1):30983. doi: 10.1038/s41598-025-16817-5.
3
Identification of Biomarkers for Acute Myocardial Infarction Based on Cell Senescence Genes and Machine Learning.
Anatol J Cardiol. 2025 Aug;29(8):409-422. doi: 10.14744/AnatolJCardiol.2025.5129.
4
Convergent Molecular Evolution Associated With Repeated Transitions to Gregarious Larval Behavior in Heliconiini.
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf179.
5
Frontal cortex pyramidal neuron expression profiles differentiate the prodromal stage from progressive degeneration across the Alzheimer's disease spectrum.
Alzheimers Dement. 2025 Jul;21(7):e70395. doi: 10.1002/alz.70395.
6
Identification of shared pathogenetic mechanisms between endometriosis and RSA based on comprehensive bioinformatics analysis.
J Assist Reprod Genet. 2025 Jul 24. doi: 10.1007/s10815-025-03596-1.
7
Multi-ancestry genome-wide meta-analysis of 56,241 individuals identifies known and novel cross-population and ancestry-specific associations as novel risk loci for Alzheimer's disease.
Genome Biol. 2025 Jul 17;26(1):210. doi: 10.1186/s13059-025-03564-z.
8
Molecular Phenogroups in Heart Failure: Large-Scale Proteomics in a Population-Based Cohort.
Circ Genom Precis Med. 2025 Jul 16:e004953. doi: 10.1161/CIRCGEN.124.004953.
9
Privacy-preserving multicenter differential protein abundance analysis with FedProt.
Nat Comput Sci. 2025 Aug;5(8):675-688. doi: 10.1038/s43588-025-00832-7. Epub 2025 Jul 11.
10
Optimizing Proximity Proteomics on the EvoSep-timsTOF LC-MS System.
Proteomics. 2025 Aug;25(15):58-71. doi: 10.1002/pmic.70010. Epub 2025 Jul 11.

References

1
It's DE-licious: A Recipe for Differential Expression Analyses of RNA-seq Experiments Using Quasi-Likelihood Methods in edgeR.
Methods Mol Biol. 2016;1418:391-416. doi: 10.1007/978-1-4939-3578-9_19.
2
From reads to regions: a Bioconductor workflow to detect differential binding in ChIP-seq data.
F1000Res. 2015 Oct 16;4:1080. doi: 10.12688/f1000research.7016.2. eCollection 2015.
3
csaw: a Bioconductor package for differential binding analysis of ChIP-seq data using sliding windows.
Nucleic Acids Res. 2016 Mar 18;44(5):e45. doi: 10.1093/nar/gkv1191. Epub 2015 Nov 17.
4
diffHic: a Bioconductor package to detect differential genomic interactions in Hi-C data.
BMC Bioinformatics. 2015 Aug 19;16:258. doi: 10.1186/s12859-015-0683-0.
5
MOZ and BMI1 play opposing roles during Hox gene activation in ES cells and in body segment identity specification in vivo.
Proc Natl Acad Sci U S A. 2015 Apr 28;112(17):5437-42. doi: 10.1073/pnas.1422872112. Epub 2015 Apr 14.
6
limma powers differential expression analyses for RNA-sequencing and microarray studies.
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
7
Regulation of germinal center responses and B-cell memory by the chromatin modifier MOZ.
Proc Natl Acad Sci U S A. 2014 Jul 1;111(26):9585-90. doi: 10.1073/pnas.1402485111. Epub 2014 Jun 16.
8
Robustly detecting differential expression in RNA sequencing data using observation weights.
Nucleic Acids Res. 2014 Jun;42(11):e91. doi: 10.1093/nar/gku310. Epub 2014 Apr 20.
9
voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.
Genome Biol. 2014 Feb 3;15(2):R29. doi: 10.1186/gb-2014-15-2-r29.
10
Prior robust empirical Bayes inference for large-scale data by conditioning on rank with application to microarray data.
Biostatistics. 2014 Jan;15(1):60-73. doi: 10.1093/biostatistics/kxt026. Epub 2013 Aug 8.