增强对微阵列数据生物学解读的信心：显著GO类别的功能一致性。

Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories.

作者信息

Yang Da, Li Yanhui, Xiao Hui, Liu Qing, Zhang Min, Zhu Jing, Ma Wencai, Yao Chen, Wang Jing, Wang Dong, Guo Zheng, Yang Baofeng

机构信息

Department of Bioinformatics, Bio-pharmaceutical Key Laboratory of Heilongjiang Province-Incubator of State Key Laboratory, Harbin Medical University, Harbin 150086, China.

出版信息

Bioinformatics. 2008 Jan 15;24(2):265-71. doi: 10.1093/bioinformatics/btm558. Epub 2007 Nov 15.

DOI:10.1093/bioinformatics/btm558

PMID:18006543

Abstract

MOTIVATION

In microarray studies, numerous tools are available for functional enrichment analysis based on GO categories. Most of these tools, due to their requirement of a prior threshold for designating genes as differentially expressed genes (DEGs), are categorized as threshold-dependent methods that often suffer from a major criticism on their changing results with different thresholds.

RESULTS

In the present article, by considering the inherent correlation structure of the GO categories, a continuous measure based on semantic similarity of GO categories is proposed to investigate the functional consistence (or stability) of threshold-dependent methods. The results from several datasets show when simply counting overlapping categories between two groups, the significant category groups selected under different DEG thresholds are seemingly very different. However, based on the semantic similarity measure proposed in this article, the results are rather functionally consistent for a wide range of DEG thresholds. Moreover, we find that the functional consistence of gene lists ranked by SAM metric behaves relatively robust against changing DEG thresholds.

AVAILABILITY

Source code in R is available on request from the authors.

摘要

动机

在微阵列研究中，有许多工具可用于基于基因本体（GO）类别的功能富集分析。这些工具中的大多数，由于需要事先设定一个阈值来将基因指定为差异表达基因（DEG），因此被归类为依赖阈值的方法，而这些方法常常因不同阈值会导致结果变化而受到主要批评。

结果

在本文中，通过考虑GO类别的内在相关结构，提出了一种基于GO类别语义相似性的连续度量方法，以研究依赖阈值方法的功能一致性（或稳定性）。几个数据集的结果表明，当简单地计算两组之间的重叠类别时，在不同的DEG阈值下选择的显著类别组似乎非常不同。然而，基于本文提出的语义相似性度量，在广泛的DEG阈值范围内，结果在功能上相当一致。此外，我们发现，按SAM度量排名的基因列表的功能一致性在DEG阈值变化时表现得相对稳健。

可用性

可向作者索取R语言的源代码。

相似文献

Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories.

Bioinformatics. 2008 Jan 15;24(2):265-71. doi: 10.1093/bioinformatics/btm558. Epub 2007 Nov 15.

Algebraic stability indicators for ranked lists in molecular profiling.

Bioinformatics. 2008 Jan 15;24(2):258-64. doi: 10.1093/bioinformatics/btm550. Epub 2007 Nov 16.

Large scale data mining approach for gene-specific standardization of microarray gene expression data.

Bioinformatics. 2006 Dec 1;22(23):2898-904. doi: 10.1093/bioinformatics/btl500. Epub 2006 Oct 10.

Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data.

Bioinformatics. 2007 Aug 15;23(16):2063-72. doi: 10.1093/bioinformatics/btm289. Epub 2007 May 31.

Integration of GO annotations in Correspondence Analysis: facilitating the interpretation of microarray data.

Bioinformatics. 2005 May 15;21(10):2424-9. doi: 10.1093/bioinformatics/bti367. Epub 2005 Mar 3.

Annotation-based distance measures for patient subgroup discovery in clinical microarray studies.

Bioinformatics. 2007 Sep 1;23(17):2256-64. doi: 10.1093/bioinformatics/btm322. Epub 2007 Jun 22.

Data-adaptive test statistics for microarray data.

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii108-14. doi: 10.1093/bioinformatics/bti1119.

Cross platform microarray analysis for robust identification of differentially expressed genes.

BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-8-S1-S5.

Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis.

Bioinformatics. 2008 Jan 15;24(2):192-201. doi: 10.1093/bioinformatics/btm583. Epub 2007 Nov 27.

Structured polychotomous machine diagnosis of multiple cancer types using gene expression.

Bioinformatics. 2006 Apr 15;22(8):950-8. doi: 10.1093/bioinformatics/btl029. Epub 2006 Feb 1.

引用本文的文献

Neurotransmitter and metabolic effects of interferon-alpha in association with decreased striatal dopamine in a non-human primate model of cytokine-Induced depression.

Brain Behav Immun. 2025 Mar;125:308-318. doi: 10.1016/j.bbi.2025.01.010. Epub 2025 Jan 16.

Repeated social defeat stress leads to immunometabolic shifts in innate immune cells of the spleen.

Brain Behav Immun Health. 2023 Sep 25;34:100690. doi: 10.1016/j.bbih.2023.100690. eCollection 2023 Dec.

Label-Free Quantitative Proteomics Reveal the Involvement of PRT6 in Seed Responsiveness to Ethylene.

Int J Mol Sci. 2022 Aug 19;23(16):9352. doi: 10.3390/ijms23169352.

Cellular and immunometabolic mechanisms of inflammation in depression: Preliminary findings from single cell RNA sequencing and a tribute to Bruce McEwen.

Neurobiol Stress. 2022 May 24;19:100462. doi: 10.1016/j.ynstr.2022.100462. eCollection 2022 Jul.

Transcriptomic signatures of psychomotor slowing in peripheral blood of depressed patients: evidence for immunometabolic reprogramming.

Mol Psychiatry. 2021 Dec;26(12):7384-7392. doi: 10.1038/s41380-021-01258-z. Epub 2021 Sep 17.

Approximate search for known gene clusters in new genomes using PQ-trees.

Algorithms Mol Biol. 2021 Jul 9;16(1):16. doi: 10.1186/s13015-021-00190-9.

GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms.

Sci Rep. 2018 Oct 10;8(1):15107. doi: 10.1038/s41598-018-33219-y.

A rank-based algorithm of differential expression analysis for small cell line data with statistical control.

Brief Bioinform. 2019 Mar 22;20(2):482-491. doi: 10.1093/bib/bbx135.

Identifying disease-associated pathways in one-phenotype data based on reversal gene expression orderings.

Sci Rep. 2017 May 2;7(1):1348. doi: 10.1038/s41598-017-01536-3.

An improved method for functional similarity analysis of genes based on Gene Ontology.

BMC Syst Biol. 2016 Dec 23;10(Suppl 4):119. doi: 10.1186/s12918-016-0359-z.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

增强对微阵列数据生物学解读的信心：显著GO类别的功能一致性。

Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献