• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于功能富集分析的超几何检验的贝叶斯扩展。

A Bayesian extension of the hypergeometric test for functional enrichment analysis.

作者信息

Cao Jing, Zhang Song

机构信息

Department of Statistical Science, Southern Methodist University, Dallas, Texas 75275, U.S.A.

出版信息

Biometrics. 2014 Mar;70(1):84-94. doi: 10.1111/biom.12122. Epub 2013 Dec 9.

DOI:10.1111/biom.12122
PMID:24320951
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3954234/
Abstract

Functional enrichment analysis is conducted on high-throughput data to provide functional interpretation for a list of genes or proteins that share a common property, such as being differentially expressed (DE). The hypergeometric P-value has been widely used to investigate whether genes from pre-defined functional terms, for example, Gene Ontology (GO), are enriched in the DE genes. The hypergeometric P-value has three limitations: (1) computed independently for each term, thus neglecting biological dependence; (2) subject to a size constraint that leads to the tendency of selecting less-specific terms; (3) repeated use of information due to overlapping annotations by the true-path rule. We propose a Bayesian approach based on the non-central hypergeometric model. The GO dependence structure is incorporated through a prior on non-centrality parameters. The likelihood function does not include overlapping information. The inference about enrichment is based on posterior probabilities that do not have a size constraint. This method can detect moderate but consistent enrichment signals and identify sets of closely related and biologically meaningful functional terms rather than isolated terms. We also describe the basic ideas of assumption and implementation of different methods to provide some theoretical insights, which are demonstrated via a simulation study. A real application is presented.

摘要

对高通量数据进行功能富集分析,以便为具有共同特性(如差异表达)的一组基因或蛋白质提供功能解释。超几何P值已被广泛用于研究预定义功能术语(如基因本体论(GO))中的基因是否在差异表达基因中富集。超几何P值有三个局限性:(1)针对每个术语独立计算,从而忽略了生物学依赖性;(2)受大小约束,导致倾向于选择特异性较低的术语;(3)由于真实路径规则的重叠注释而重复使用信息。我们提出了一种基于非中心超几何模型的贝叶斯方法。通过对非中心参数的先验纳入GO依赖结构。似然函数不包括重叠信息。关于富集的推断基于没有大小约束的后验概率。该方法可以检测到适度但一致的富集信号,并识别出一组密切相关且具有生物学意义的功能术语,而不是孤立的术语。我们还描述了不同方法的假设和实现的基本思想,以提供一些理论见解,并通过模拟研究进行了验证。展示了一个实际应用。

相似文献

1
A Bayesian extension of the hypergeometric test for functional enrichment analysis.用于功能富集分析的超几何检验的贝叶斯扩展。
Biometrics. 2014 Mar;70(1):84-94. doi: 10.1111/biom.12122. Epub 2013 Dec 9.
2
GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.GO-Bayes:基于贝叶斯方法的基因本体论过表达分析。
Bioinformatics. 2010 Apr 1;26(7):905-11. doi: 10.1093/bioinformatics/btq059. Epub 2010 Feb 21.
3
GOMA: functional enrichment analysis tool based on GO modules.GOMA:基于基因本体(GO)模块的功能富集分析工具。
Chin J Cancer. 2013 Apr;32(4):195-204. doi: 10.5732/cjc.012.10151. Epub 2012 Dec 7.
4
The Neural/Immune Gene Ontology: clipping the Gene Ontology for neurological and immunological systems.神经/免疫基因本体论:为神经系统和免疫系统裁剪基因本体论。
BMC Bioinformatics. 2010 Sep 12;11:458. doi: 10.1186/1471-2105-11-458.
5
Empirical Bayes estimation of posterior probabilities of enrichment: a comparative study of five estimators of the local false discovery rate.经验贝叶斯估计富集后验概率:局部错误发现率五个估计量的比较研究。
BMC Bioinformatics. 2013 Mar 6;14:87. doi: 10.1186/1471-2105-14-87.
6
Biomarker identification and cancer classification based on microarray data using Laplace naive Bayes model with mean shrinkage.基于微阵列数据的拉普拉斯朴素贝叶斯模型均值收缩的生物标志物识别和癌症分类。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1649-62. doi: 10.1109/TCBB.2012.105.
7
Investigating the concordance of Gene Ontology terms reveals the intra- and inter-platform reproducibility of enrichment analysis.研究基因本体论术语的一致性揭示了富集分析的平台内和平台间可重复性。
BMC Bioinformatics. 2013 Apr 29;14:143. doi: 10.1186/1471-2105-14-143.
8
Functional enrichment analysis based on long noncoding RNA associations.基于长链非编码RNA关联的功能富集分析。
BMC Syst Biol. 2018 Apr 24;12(Suppl 4):45. doi: 10.1186/s12918-018-0571-0.
9
Multivariate hierarchical Bayesian model for differential gene expression analysis in microarray experiments.用于微阵列实验中差异基因表达分析的多元分层贝叶斯模型。
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-9-S1-S9.
10
VAMPIRE microarray suite: a web-based platform for the interpretation of gene expression data.VAMPIRE微阵列套件:一个基于网络的基因表达数据解读平台。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W627-32. doi: 10.1093/nar/gki443.

引用本文的文献

1
To explore the prognostic efficacy and mechanism of ABCC5 clinical scoring model in hepatocellular carcinoma.探讨ABCC5临床评分模型在肝细胞癌中的预后疗效及机制。
Front Oncol. 2025 Aug 11;15:1519533. doi: 10.3389/fonc.2025.1519533. eCollection 2025.
2
Association of the Serpine1 with the Obesity Paradox in Ischemic Heart Failure.丝氨酸蛋白酶抑制剂E1(Serpine1)与缺血性心力衰竭中的肥胖悖论的关联。
Iran J Public Health. 2025 Jul;54(7):1516-1529. doi: 10.18502/ijph.v54i7.19158.
3
Global hinge sites of proteins as target sites for drug binding.蛋白质的全局铰链位点作为药物结合的靶位点。
Proc Natl Acad Sci U S A. 2024 Dec 3;121(49):e2414333121. doi: 10.1073/pnas.2414333121. Epub 2024 Nov 25.
4
NCOA3 knockdown delays human embryo development.NCOA3基因敲低会延迟人类胚胎发育。
Heliyon. 2024 Sep 13;10(18):e37639. doi: 10.1016/j.heliyon.2024.e37639. eCollection 2024 Sep 30.
5
Development and experimental validation of an energy metabolism-related gene signature for diagnosing of osteoporosis.开发和验证一个与能量代谢相关的基因特征用于骨质疏松症的诊断。
Sci Rep. 2024 Apr 8;14(1):8153. doi: 10.1038/s41598-024-59062-y.
6
Exploring the role of mitochondrial-associated and peripheral neuropathy genes in the pathogenesis of diabetic peripheral neuropathy.探讨线粒体相关和周围神经病变基因在糖尿病周围神经病变发病机制中的作用。
BMC Neurol. 2024 Mar 13;24(1):95. doi: 10.1186/s12883-024-03589-0.
7
Pyroptosis-Related Gene Signature Predicts Prognosis and Response to Immunotherapy and Medication in Pediatric and Young Adult Osteosarcoma Patients.焦亡相关基因特征预测小儿及青年骨肉瘤患者的预后、免疫治疗及药物反应。
J Inflamm Res. 2024 Jan 20;17:417-445. doi: 10.2147/JIR.S440425. eCollection 2024.
8
Oxidative Post-translational Protein Modifications upon Ischemia/Reperfusion Injury.缺血/再灌注损伤后的蛋白质氧化翻译后修饰
Antioxidants (Basel). 2024 Jan 16;13(1):106. doi: 10.3390/antiox13010106.
9
Comparative transcriptome, digital gene expression and proteome profiling analyses provide insights into the brachyurization from the megalopa to the first juvenile in .比较转录组、数字基因表达和蛋白质组分析为[具体物种]从大眼幼体到第一期幼体的短尾化过程提供了见解。
Heliyon. 2023 Jan 7;9(1):e12736. doi: 10.1016/j.heliyon.2022.e12736. eCollection 2023 Jan.
10
Genomic and transcriptomic analyses provide insights into valuable fatty acid biosynthesis and environmental adaptation of yellowhorn.基因组和转录组分析为黄果冷杉宝贵的脂肪酸生物合成及环境适应性提供了见解。
Front Plant Sci. 2022 Sep 6;13:991197. doi: 10.3389/fpls.2022.991197. eCollection 2022.

本文引用的文献

1
INCORPORATING BIOLOGICAL INFORMATION INTO LINEAR MODELS: A BAYESIAN APPROACH TO THE SELECTION OF PATHWAYS AND GENES.将生物信息整合到线性模型中:一种选择通路和基因的贝叶斯方法。
Ann Appl Stat. 2011 Sep 1;5(3):1978-2002. doi: 10.1214/11-AOAS463.
2
Bayesian Joint Modeling of Multiple Gene Networks and Diverse Genomic Data to Identify Target Genes of a Transcription Factor.用于识别转录因子靶基因的多基因网络和多样基因组数据的贝叶斯联合建模
Ann Appl Stat. 2012 Jan 1;6(1):334-355. doi: 10.1214/11-AOAS502.
3
Functional analysis beyond enrichment: non-redundant reciprocal linkage of genes and biological terms.功能分析超越富集:基因和生物术语的非冗余相互关联。
PLoS One. 2011;6(9):e24289. doi: 10.1371/journal.pone.0024289. Epub 2011 Sep 16.
4
NOA: a novel Network Ontology Analysis method.NOA:一种新颖的网络本体分析方法。
Nucleic Acids Res. 2011 Jul;39(13):e87. doi: 10.1093/nar/gkr251. Epub 2011 May 4.
5
Fraternal twins: Swiprosin-1/EFhd2 and Swiprosin-2/EFhd1, two homologous EF-hand containing calcium binding adaptor proteins with distinct functions.同卵双胞胎:Swiprosin-1/EFhd2 和 Swiprosin-2/EFhd1,两种具有不同功能的同源 EF 手钙离子结合衔接蛋白。
Cell Commun Signal. 2011 Jan 18;9:2. doi: 10.1186/1478-811X-9-2.
6
GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.GO-Bayes:基于贝叶斯方法的基因本体论过表达分析。
Bioinformatics. 2010 Apr 1;26(7):905-11. doi: 10.1093/bioinformatics/btq059. Epub 2010 Feb 21.
7
GOing Bayesian: model-based gene set analysis of genome-scale data.GOing Bayesian:基于模型的全基因组数据基因集分析。
Nucleic Acids Res. 2010 Jun;38(11):3523-32. doi: 10.1093/nar/gkq045. Epub 2010 Feb 19.
8
GeneCodis: interpreting gene lists through enrichment analysis and integration of diverse biological information.GeneCodis:通过富集分析和整合多种生物信息来解读基因列表。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W317-22. doi: 10.1093/nar/gkp416. Epub 2009 May 22.
9
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.利用DAVID生物信息学资源对大型基因列表进行系统和综合分析。
Nat Protoc. 2009;4(1):44-57. doi: 10.1038/nprot.2008.211.
10
Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists.生物信息学富集工具:通向大型基因列表全面功能分析的途径
Nucleic Acids Res. 2009 Jan;37(1):1-13. doi: 10.1093/nar/gkn923. Epub 2008 Nov 25.