• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过父子分析改进基因本体注释过度代表性的检测。

Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.

作者信息

Grossmann Steffen, Bauer Sebastian, Robinson Peter N, Vingron Martin

机构信息

Max-Planck-Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany.

出版信息

Bioinformatics. 2007 Nov 15;23(22):3024-31. doi: 10.1093/bioinformatics/btm440. Epub 2007 Sep 11.

DOI:10.1093/bioinformatics/btm440
PMID:17848398
Abstract

MOTIVATION

High-throughput experiments such as microarray hybridizations often yield long lists of genes found to share a certain characteristic such as differential expression. Exploring Gene Ontology (GO) annotations for such lists of genes has become a widespread practice to get first insights into the potential biological meaning of the experiment. The standard statistical approach to measuring overrepresentation of GO terms cannot cope with the dependencies resulting from the structure of GO because they analyze each term in isolation. Especially the fact that annotations are inherited from more specific descendant terms can result in certain types of false-positive results with potentially misleading biological interpretation, a phenomenon which we term the inheritance problem.

RESULTS

We present here a novel approach to analysis of GO term overrepresentation that determines overrepresentation of terms in the context of annotations to the term's parents. This approach reduces the dependencies between the individual term's measurements, and thereby avoids producing false-positive results owing to the inheritance problem. ROC analysis using study sets with overrepresented GO terms showed a clear advantage for our approach over the standard algorithm with respect to the inheritance problem. Although there can be no gold standard for exploratory methods such as analysis of GO term overrepresentation, analysis of biological datasets suggests that our algorithm tends to identify the core GO terms that are most characteristic of the dataset being analyzed.

摘要

动机

诸如微阵列杂交等高通量实验常常会产生一长串被发现具有某种共同特征(如差异表达)的基因列表。探索这些基因列表的基因本体论(GO)注释已成为一种广泛采用的做法,以便初步了解实验潜在的生物学意义。测量GO术语过度代表性的标准统计方法无法应对因GO结构而产生的依赖性,因为它们是孤立地分析每个术语。特别是注释从更具体的后代术语继承这一事实,可能会导致某些类型的假阳性结果,并产生潜在的误导性生物学解释,我们将这种现象称为继承问题。

结果

我们在此提出一种分析GO术语过度代表性的新方法,该方法在术语的父级注释背景下确定术语的过度代表性。这种方法减少了各个术语测量之间的依赖性,从而避免了因继承问题而产生假阳性结果。使用具有过度代表性GO术语的研究集进行的ROC分析表明,相对于继承问题,我们的方法比标准算法具有明显优势。尽管对于诸如GO术语过度代表性分析等探索性方法不存在金标准,但对生物数据集的分析表明,我们的算法倾向于识别出最能表征所分析数据集特征的核心GO术语。

相似文献

1
Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.通过父子分析改进基因本体注释过度代表性的检测。
Bioinformatics. 2007 Nov 15;23(22):3024-31. doi: 10.1093/bioinformatics/btm440. Epub 2007 Sep 11.
2
GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.GO-Bayes:基于贝叶斯方法的基因本体论过表达分析。
Bioinformatics. 2010 Apr 1;26(7):905-11. doi: 10.1093/bioinformatics/btq059. Epub 2010 Feb 21.
3
SEGS: search for enriched gene sets in microarray data.SEGS:在微阵列数据中搜索富集的基因集。
J Biomed Inform. 2008 Aug;41(4):588-601. doi: 10.1016/j.jbi.2007.12.001. Epub 2007 Dec 15.
4
ADGO: analysis of differentially expressed gene sets using composite GO annotation.ADGO:使用复合基因本体注释分析差异表达基因集
Bioinformatics. 2006 Sep 15;22(18):2249-53. doi: 10.1093/bioinformatics/btl378. Epub 2006 Jul 12.
5
Integration of GO annotations in Correspondence Analysis: facilitating the interpretation of microarray data.在对应分析中整合基因本体论注释:助力微阵列数据的解读
Bioinformatics. 2005 May 15;21(10):2424-9. doi: 10.1093/bioinformatics/bti367. Epub 2005 Mar 3.
6
Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps.使用自组织映射对酿酒酵母的基因表达数据和基因本体术语进行共聚类和可视化。
J Biomed Inform. 2007 Apr;40(2):160-73. doi: 10.1016/j.jbi.2006.05.001. Epub 2006 May 20.
7
BioLattice: a framework for the biological interpretation of microarray gene expression data using concept lattice analysis.生物格架:一种使用概念格分析对微阵列基因表达数据进行生物学解释的框架。
J Biomed Inform. 2008 Apr;41(2):232-41. doi: 10.1016/j.jbi.2007.10.003. Epub 2007 Nov 1.
8
Amplification of the Gene Ontology annotation of Affymetrix probe sets.Affymetrix探针集的基因本体注释的扩增。
BMC Bioinformatics. 2006 Mar 20;7:159. doi: 10.1186/1471-2105-7-159.
9
Group testing for pathway analysis improves comparability of different microarray datasets.用于通路分析的分组检验可提高不同微阵列数据集的可比性。
Bioinformatics. 2006 Oct 15;22(20):2500-6. doi: 10.1093/bioinformatics/btl424. Epub 2006 Aug 7.
10
Assessment and integration of publicly available SAGE, cDNA microarray, and oligonucleotide microarray expression data for global coexpression analyses.评估和整合公开可用的SAGE、cDNA微阵列和寡核苷酸微阵列表达数据以进行全局共表达分析。
Genomics. 2005 Oct;86(4):476-88. doi: 10.1016/j.ygeno.2005.06.009.

引用本文的文献

1
GeneSetCluster 2.0: a comprehensive toolset for summarizing and integrating gene-sets analysis.基因集聚类2.0:一个用于总结和整合基因集分析的综合工具集。
BMC Bioinformatics. 2025 Aug 21;26(1):219. doi: 10.1186/s12859-025-06249-3.
2
Reconciling multiple connectivity-based systems biology methods for drug repurposing.协调多种基于连通性的系统生物学方法用于药物再利用。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf387.
3
EnrichDO: a global weighted model for Disease Ontology enrichment analysis.EnrichDO:一种用于疾病本体富集分析的全局加权模型。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf021.
4
GA4GH Phenopacket-Driven Characterization of Genotype-Phenotype Correlations in Mendelian Disorders.GA4GH孟德尔疾病中基因型-表型相关性的表型数据包驱动特征分析
medRxiv. 2025 Mar 6:2025.03.05.25323315. doi: 10.1101/2025.03.05.25323315.
5
IsopretGO-analysing and visualizing the functional consequences of differential splicing.IsopretGO——分析和可视化可变剪接的功能后果。
NAR Genom Bioinform. 2024 Dec 5;6(4):lqae165. doi: 10.1093/nargab/lqae165. eCollection 2024 Dec.
6
Managing the tradeoff between reproduction and survival requires flexibility in behaviour and gene regulation in three-spined stickleback.在三刺鱼中,平衡繁殖与生存之间的权衡需要行为和基因调控方面的灵活性。
Proc Biol Sci. 2024 Dec;291(2036):20242296. doi: 10.1098/rspb.2024.2296. Epub 2024 Dec 4.
7
A transcriptome atlas of zygotic and somatic embryogenesis in Norway spruce.挪威云杉合子胚和体细胞胚发生的转录组图谱
Plant J. 2024 Dec;120(5):2238-2252. doi: 10.1111/tpj.17087. Epub 2024 Oct 27.
8
A transcriptomic hourglass in brown algae.褐藻中的转录组沙漏。
Nature. 2024 Nov;635(8037):129-135. doi: 10.1038/s41586-024-08059-8. Epub 2024 Oct 23.
9
Evolutionary and biomedical implications of sex differences in the primate brain transcriptome.灵长类动物大脑转录组中性别差异的进化和生物医学意义。
Cell Genom. 2024 Jul 10;4(7):100589. doi: 10.1016/j.xgen.2024.100589. Epub 2024 Jun 27.
10
A time course analysis through diapause reveals dynamic temporal patterns of microRNAs associated with endocrine regulation in the butterfly Pieris napi.通过滞育进行的时间进程分析揭示了与蝴蝶小苎麻赤蛱蝶内分泌调节相关的微小RNA的动态时间模式。
Mol Ecol. 2025 Aug;34(15):e17348. doi: 10.1111/mec.17348. Epub 2024 Apr 10.