注释概念综合和富集分析：一种基于逻辑的高通量实验解释方法。

Annotation concept synthesis and enrichment analysis: a logic-based approach to the interpretation of high-throughput experiments.

机构信息

School of Information Technology and Engineering, University of Ottawa, Ottawa, Ontario, K1N 6N5 Canada.

出版信息

Bioinformatics. 2011 Sep 1;27(17):2391-8. doi: 10.1093/bioinformatics/btr337. Epub 2011 Jul 9.

DOI:10.1093/bioinformatics/btr337

PMID:21743060

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3157920/

Abstract

MOTIVATION

Annotation Enrichment Analysis (AEA) is a widely used analytical approach to process data generated by high-throughput genomic and proteomic experiments such as gene expression microarrays. The analysis uncovers and summarizes discriminating background information (e.g. GO annotations) for sets of genes identified by experiments (e.g. a set of differentially expressed genes, a cluster). The discovered information is utilized by human experts to find biological interpretations of the experiments. However, AEA isolates and tests for overrepresentation only individual annotation terms or groups of similar terms and is limited in its ability to uncover complex phenomena involving relationship between multiple annotation terms from various knowledge bases. Also, AEA assumes that annotations describe the whole object of interest, which makes it difficult to apply it to sets of compound objects (e.g. sets of protein-protein interactions) and to sets of objects having an internal structure (e.g. protein complexes).

RESULTS

We propose a novel logic-based Annotation Concept Synthesis and Enrichment Analysis (ACSEA) approach. ACSEA fuses inductive logic reasoning with statistical inference to uncover more complex phenomena captured by the experiments. We evaluate our approach on large-scale datasets from several microarray experiments and on a clustered genome-wide genetic interaction network using different biological knowledge bases. The discovered interpretations have lower P-values than the interpretations found by AEA, are highly integrative in nature, and include analysis of quantitative and structured information present in the knowledge bases. The results suggest that ACSEA can boost effectiveness of the processing of high-throughput experiments.

CONTACT

mjiline@site.uottawa.ca.

摘要

动机

注释富集分析（AEA）是一种广泛使用的分析方法，用于处理高通量基因组和蛋白质组实验（如基因表达微阵列）生成的数据。该分析揭示并总结了实验（例如一组差异表达基因、一个聚类）确定的基因集的区分背景信息（例如 GO 注释）。发现的信息被人类专家用于寻找实验的生物学解释。然而，AEA 仅隔离和测试单个注释项或类似术语的组的过表达，并且在发现涉及来自各种知识库的多个注释项之间的关系的复杂现象方面能力有限。此外，AEA 假设注释描述了感兴趣的整个对象，这使得它难以将其应用于化合物对象集（例如蛋白质-蛋白质相互作用集）和具有内部结构的对象集（例如蛋白质复合物）。

结果

我们提出了一种新颖的基于逻辑的注释概念综合和富集分析（ACSEA）方法。ACSEA 将归纳逻辑推理与统计推断相结合，以发现实验捕获的更复杂现象。我们使用不同的生物知识库在来自多个微阵列实验的大规模数据集和全基因组遗传相互作用网络的聚类上评估我们的方法。发现的解释比 AEA 找到的解释具有更低的 P 值，本质上高度综合，并包括对知识库中存在的定量和结构化信息的分析。结果表明，ACSEA 可以提高高通量实验处理的效果。

联系方式

mjiline@site.uottawa.ca。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfa2/3157920/774f0029e1d7/btr337f1.jpg

相似文献

Annotation concept synthesis and enrichment analysis: a logic-based approach to the interpretation of high-throughput experiments.注释概念综合和富集分析：一种基于逻辑的高通量实验解释方法。

Bioinformatics. 2011 Sep 1;27(17):2391-8. doi: 10.1093/bioinformatics/btr337. Epub 2011 Jul 9.

Cluster analysis of protein array results via similarity of Gene Ontology annotation.通过基因本体注释的相似性对蛋白质阵列结果进行聚类分析。

BMC Bioinformatics. 2006 Jul 12;7:338. doi: 10.1186/1471-2105-7-338.

Optimizing gene set annotations combining GO structure and gene expression data.结合基因本体结构和基因表达数据优化基因集注释

BMC Syst Biol. 2018 Dec 31;12(Suppl 9):133. doi: 10.1186/s12918-018-0659-6.

Structural and functional-annotation of an equine whole genome oligoarray.马全基因组寡核苷酸芯片的结构和功能注释。

BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S8. doi: 10.1186/1471-2105-10-S11-S8.

Optimization of gene set annotations via entropy minimization over variable clusters (EMVC).通过对可变聚类进行熵最小化（EMVC）优化基因集注释。

Bioinformatics. 2014 Jun 15;30(12):1698-706. doi: 10.1093/bioinformatics/btu110. Epub 2014 Feb 25.

Comparing gene annotation enrichment tools for functional modeling of agricultural microarray data.比较基因注释富集工具在农业微阵列数据分析中的功能建模。

BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S9. doi: 10.1186/1471-2105-10-S11-S9.

ADGO: analysis of differentially expressed gene sets using composite GO annotation.ADGO：使用复合基因本体注释分析差异表达基因集

Bioinformatics. 2006 Sep 15;22(18):2249-53. doi: 10.1093/bioinformatics/btl378. Epub 2006 Jul 12.

Detecting phenotype-specific interactions between biological processes from microarray data and annotations.从微阵列数据和注释中检测生物过程的表型特异性相互作用。

IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1399-409. doi: 10.1109/TCBB.2012.65.

Large-scale gene co-expression network as a source of functional annotation for cattle genes.大规模基因共表达网络作为牛基因功能注释的来源

BMC Genomics. 2016 Nov 2;17(1):846. doi: 10.1186/s12864-016-3176-2.

PIGNON: a protein-protein interaction-guided functional enrichment analysis for quantitative proteomics.PIGNON：一种基于蛋白质-蛋白质相互作用的定量蛋白质组学功能富集分析方法。

BMC Bioinformatics. 2021 Jun 4;22(1):302. doi: 10.1186/s12859-021-04042-6.

引用本文的文献

Prognostic model of lung adenocarcinoma based on immunoprognosis-related genes and related drug prediction.基于免疫预后相关基因的肺腺癌预后模型及相关药物预测

J Thorac Dis. 2024 Sep 30;16(9):5860-5877. doi: 10.21037/jtd-24-569. Epub 2024 Sep 26.

Exploring MPC1 as a potential ferroptosis-linked biomarker in the cervical cancer tumor microenvironment: a comprehensive analysis.探索 MPC1 作为宫颈癌肿瘤微环境中潜在的铁死亡相关生物标志物：全面分析。

BMC Cancer. 2024 Oct 10;24(1):1258. doi: 10.1186/s12885-024-12622-x.

本文引用的文献

The genetic landscape of a cell.细胞的基因图谱。

Science. 2010 Jan 22;327(5964):425-31. doi: 10.1126/science.1180823.

DRYGIN: a database of quantitative genetic interaction networks in yeast.DRYGIN：酵母中定量遗传互作网络数据库。

Nucleic Acids Res. 2010 Jan;38(Database issue):D502-7. doi: 10.1093/nar/gkp820. Epub 2009 Oct 30.

CLEAN: CLustering Enrichment ANalysis.CLEAN：聚类富集分析。

BMC Bioinformatics. 2009 Jul 29;10:234. doi: 10.1186/1471-2105-10-234.

Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists.生物信息学富集工具：通向大型基因列表全面功能分析的途径

Nucleic Acids Res. 2009 Jan;37(1):1-13. doi: 10.1093/nar/gkn923. Epub 2008 Nov 25.

Ontologizer 2.0--a multifunctional tool for GO term enrichment analysis and data exploration.Ontologizer 2.0——一款用于基因本体论（GO）术语富集分析和数据探索的多功能工具。

Bioinformatics. 2008 Jul 15;24(14):1650-1. doi: 10.1093/bioinformatics/btn250. Epub 2008 May 29.

PaLS: filtering common literature, biological terms and pathway information.PaLS：筛选常见文献、生物学术语和通路信息。

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W364-7. doi: 10.1093/nar/gkn251. Epub 2008 May 8.

ProfCom: a web tool for profiling the complex functionality of gene groups identified from high-throughput data.ProfCom：一种用于剖析从高通量数据中识别出的基因群组复杂功能的网络工具。

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W347-51. doi: 10.1093/nar/gkn239. Epub 2008 May 6.

DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis.大卫知识库：一个以基因为中心的数据库，整合了异构基因注释资源，以促进高通量基因功能分析。

BMC Bioinformatics. 2007 Nov 2;8:426. doi: 10.1186/1471-2105-8-426.

Functional profiling of microarray experiments using text-mining derived bioentities.利用文本挖掘衍生生物实体对微阵列实验进行功能分析。

Bioinformatics. 2007 Nov 15;23(22):3098-9. doi: 10.1093/bioinformatics/btm445. Epub 2007 Sep 13.

Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis.通过父子分析改进基因本体注释过度代表性的检测。

Bioinformatics. 2007 Nov 15;23(22):3024-31. doi: 10.1093/bioinformatics/btm440. Epub 2007 Sep 11.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

注释概念综合和富集分析：一种基于逻辑的高通量实验解释方法。

Annotation concept synthesis and enrichment analysis: a logic-based approach to the interpretation of high-throughput experiments.

机构信息

出版信息

MOTIVATION

RESULTS

CONTACT

动机

结果

联系方式

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献