Division of Molecular Genetic Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
PLoS One. 2011 Apr 25;6(4):e18867. doi: 10.1371/journal.pone.0018867.
BACKGROUND: A large number of gene expression profiling (GEP) studies on prognosis of colorectal cancer (CRC) has been performed, but no reliable gene signature for prediction of CRC prognosis has been found. Bioinformatic enrichment tools are a powerful approach to identify biological processes in high-throughput data analysis. PRINCIPAL FINDINGS: We have for the first time collected the results from the 23 so far published independent GEP studies on CRC prognosis. In these 23 studies, 1475 unique, mapped genes were identified, from which 124 (8.4%) were reported in at least two studies, with 54 of them showing consisting direction in expression change between the single studies. Using these data, we attempted to overcome the lack of reproducibility observed in the genes reported in individual GEP studies by carrying out a pathway-based enrichment analysis. We used up to ten tools for overrepresentation analysis of Gene Ontology (GO) categories or Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways in each of the three gene lists (1475, 124 and 54 genes). This strategy, based on testing multiple tools, allowed us to identify the oxidative phosphorylation chain and the extracellular matrix receptor interaction categories, as well as a general category related to cell proliferation and apoptosis, as the only significantly and consistently overrepresented pathways in the three gene lists, which were reported by several enrichment tools. CONCLUSIONS: Our pathway-based enrichment analysis of 23 independent gene expression profiling studies on prognosis of CRC identified significantly and consistently overrepresented prognostic categories for CRC. These overrepresented categories have been functionally clearly related with cancer progression, and deserve further investigation.
背景:大量的基因表达谱(GEP)研究已经针对结直肠癌(CRC)的预后进行了,但尚未发现用于预测 CRC 预后的可靠基因特征。生物信息学富集工具是一种强大的方法,可以在高通量数据分析中识别生物学过程。
主要发现:我们首次收集了迄今为止发表的 23 项关于 CRC 预后的独立 GEP 研究的结果。在这 23 项研究中,鉴定出了 1475 个独特的映射基因,其中 124 个(8.4%)在至少两项研究中报告过,其中 54 个在单研究中表现出一致的表达变化方向。使用这些数据,我们试图通过进行基于途径的富集分析来克服单个 GEP 研究中报告的基因缺乏可重复性的问题。我们在三个基因列表(1475、124 和 54 个基因)中的每个基因列表中使用多达十种工具进行基因本体论(GO)类别或京都基因与基因组百科全书(KEGG)途径的过度表达分析。这种基于测试多种工具的策略使我们能够识别氧化磷酸化链和细胞外基质受体相互作用类别,以及与细胞增殖和凋亡相关的一般类别,作为仅在三个基因列表中显著且一致地过度表达的途径,这些途径被几个富集工具报告。
结论:我们对 23 项关于 CRC 预后的独立基因表达谱研究的基于途径的富集分析确定了 CRC 明显且一致地过度表达的预后类别。这些过度表达的类别在功能上与癌症进展明确相关,值得进一步研究。
J Cell Physiol. 2018-8-21
Math Biosci Eng. 2019-4-10
Int J Mol Sci. 2015-6-15
BMC Med. 2015-4-8
Environ Mol Mutagen. 2010-6
Nat Rev Cancer. 2010-1
Nat Rev Cancer. 2009-7
Genet Epidemiol. 2009-7