Meta-analysis of gene expression data: a predictor-based approach.

Author information

Fishel Irit, Kaufman Alon, Ruppin Eytan

Affiliation

School of Medicine, Tel-Aviv University, Tel-Aviv 69978, Israel.

Publication information

Bioinformatics. 2007 Jul 1;23(13):1599-606. doi: 10.1093/bioinformatics/btm149. Epub 2007 Apr 26.

DOI:10.1093/bioinformatics/btm149

PMID:17463023

Abstract

MOTIVATION

With the increasing availability of cancer microarray data sets, there is a growing need for integrative computational methods that evaluate multiple independent microarray data sets investigating a common theme or disorder. Meta-analysis techniques are designed to overcome the low sample size typical of microarray experiments and to yield more valid and informative results than any single experiment analyzed separately.

RESULTS

We propose a new meta-analysis technique that aims to find a set of classifying genes whose expression levels may be used to answer the classification question at hand. Specifically, we apply our method to two independent lung cancer microarray data sets and identify a joint core subset of genes that putatively play an important role in lung tumorigenesis. The robustness of the identified joint core set is demonstrated on a third, unseen lung cancer data set, where it leads to successful classification using very few top-ranked genes. Identifying such a set of genes is of significant importance when searching for biologically meaningful biomarkers.
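
The abstract does not spell out the procedure, so the following is only an illustrative sketch of the general idea of a "joint core" of top-ranked genes shared by two data sets. It is not the authors' predictor-based method: the signal-to-noise score, the intersection rule, the function names `rank_genes` and `joint_core`, and the toy expression values are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def rank_genes(expr, labels):
    """Rank genes by a simple two-class signal-to-noise score (assumed scoring rule).

    expr:   dict mapping gene name -> list of expression values, one per sample
    labels: list of 0/1 class labels in the same sample order
    """
    scores = {}
    for gene, values in expr.items():
        a = [v for v, y in zip(values, labels) if y == 0]
        b = [v for v, y in zip(values, labels) if y == 1]
        spread = (pstdev(a) + pstdev(b)) or 1e-9  # avoid division by zero
        scores[gene] = abs(mean(a) - mean(b)) / spread
    # Highest-scoring (most discriminative) genes first
    return sorted(scores, key=scores.get, reverse=True)


def joint_core(rankings, top_k):
    """Return the genes found among the top_k genes of every ranking."""
    core = set(rankings[0][:top_k])
    for ranking in rankings[1:]:
        core &= set(ranking[:top_k])
    return core


# Toy data (hypothetical values): two independent data sets, four genes each.
labels_a = [0, 0, 1, 1]
data_a = {
    "G1": [1.0, 1.2, 3.0, 3.2],
    "G2": [2.0, 2.1, 2.0, 2.2],
    "G3": [0.5, 0.7, 2.5, 2.9],
    "G4": [1.0, 3.0, 1.1, 2.9],
}
labels_b = [0, 0, 0, 1, 1, 1]
data_b = {
    "G1": [0.9, 1.0, 1.1, 2.9, 3.0, 3.1],
    "G2": [1.0, 1.1, 1.2, 2.0, 2.1, 2.2],
    "G3": [1.0, 2.0, 3.0, 1.5, 2.5, 3.5],
    "G4": [2.0, 2.0, 2.1, 2.0, 2.1, 2.0],
}

rank_a = rank_genes(data_a, labels_a)
rank_b = rank_genes(data_b, labels_b)
core = joint_core([rank_a, rank_b], top_k=2)
print(sorted(core))
```

In this toy setting only the gene that ranks highly in both data sets survives the intersection. In a real analysis, a core set obtained this way would then be evaluated on a third, held-out data set, as the abstract describes.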

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

FM-test: a fuzzy-set-theory-based approach to differential gene expression data analysis.

BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S7. doi: 10.1186/1471-2105-7-S4-S7.

Large scale data mining approach for gene-specific standardization of microarray gene expression data.

Bioinformatics. 2006 Dec 1;22(23):2898-904. doi: 10.1093/bioinformatics/btl500. Epub 2006 Oct 10.

Independent component analysis-based penalized discriminant method for tumor classification using gene expression data.

Bioinformatics. 2006 Aug 1;22(15):1855-62. doi: 10.1093/bioinformatics/btl190. Epub 2006 May 18.

Visualization-based cancer microarray data classification analysis.

Bioinformatics. 2007 Aug 15;23(16):2147-54. doi: 10.1093/bioinformatics/btm312. Epub 2007 Jun 22.

Gene selection via the BAHSIC family of algorithms.

Bioinformatics. 2007 Jul 1;23(13):i490-8. doi: 10.1093/bioinformatics/btm216.

SEGS: search for enriched gene sets in microarray data.

J Biomed Inform. 2008 Aug;41(4):588-601. doi: 10.1016/j.jbi.2007.12.001. Epub 2007 Dec 15.

A meta-data based method for DNA microarray imputation.

BMC Bioinformatics. 2007 Mar 29;8:109. doi: 10.1186/1471-2105-8-109.

Meta-analysis of cancer gene-profiling data.

Methods Mol Biol. 2010;576:409-26. doi: 10.1007/978-1-59745-545-9_21.

Integration of GO annotations in Correspondence Analysis: facilitating the interpretation of microarray data.

Bioinformatics. 2005 May 15;21(10):2424-9. doi: 10.1093/bioinformatics/bti367. Epub 2005 Mar 3.

Cited by

Integrative OMICS Data-Driven Procedure Using a Derivatized Meta-Analysis Approach.

Front Genet. 2022 Feb 4;13:828786. doi: 10.3389/fgene.2022.828786. eCollection 2022.

MetaGxData: Clinically Annotated Breast, Ovarian and Pancreatic Cancer Datasets and their Use in Generating a Multi-Cancer Gene Signature.

Sci Rep. 2019 Jun 19;9(1):8770. doi: 10.1038/s41598-019-45165-4.

Meta-analysis approach as a gene selection method in class prediction: does it improve model performance? A case study in acute myeloid leukemia.

BMC Bioinformatics. 2017 Apr 11;18(1):210. doi: 10.1186/s12859-017-1619-7.

Integrating multiple immunogenetic data sources for feature extraction and mining somatic hypermutation patterns: the case of "towards analysis" in chronic lymphocytic leukaemia.

BMC Bioinformatics. 2016 Jun 6;17 Suppl 5(Suppl 5):173. doi: 10.1186/s12859-016-1044-3.

Breast cancer prognosis risk estimation using integrated gene expression and clinical data.

Biomed Res Int. 2014;2014:459203. doi: 10.1155/2014/459203. Epub 2014 May 14.

On integrating multi-experiment microarray data.

Philos Trans A Math Phys Eng Sci. 2014 Apr 21;372(2016):20130136. doi: 10.1098/rsta.2013.0136. Print 2014 May 28.

Computational evaluation of cellular metabolic costs successfully predicts genes whose expression is deleterious.

Proc Natl Acad Sci U S A. 2013 Nov 19;110(47):19166-71. doi: 10.1073/pnas.1312361110. Epub 2013 Nov 6.

Maximizing biomarker discovery by minimizing gene signatures.

BMC Genomics. 2011 Dec 23;12 Suppl 5(Suppl 5):S6. doi: 10.1186/1471-2164-12-S5-S6.

A simple but highly effective approach to evaluate the prognostic performance of gene expression signatures.

PLoS One. 2011;6(12):e28320. doi: 10.1371/journal.pone.0028320. Epub 2011 Dec 7.

Ontology-based meta-analysis of global collections of high-throughput public data.

PLoS One. 2010 Sep 29;5(9):e13066. doi: 10.1371/journal.pone.0013066.
