Camargo Anyela, Azuaje Francisco
School of Computing and Mathematics, University of Ulster at Jordanstown, Shore Road, Newtownabbey, County Antrim BT37 0QB, Northern Ireland, UK.
Genomics. 2008 Dec;92(6):404-13. doi: 10.1016/j.ygeno.2008.05.007. Epub 2008 Jul 1.
Dilated cardiomyopathy (DCM) is a leading cause of heart failure (HF) and cardiac transplantations in Western countries. Single-source gene expression analysis studies have identified potential disease biomarkers and drug targets. However, because of the diversity of experimental settings and relative lack of data, concerns have been raised about the robustness and reproducibility of the predictions. This study presents the identification of robust and reproducible DCM signature genes based on the integration of several independent data sets and functional network information. Gene expression profiles from three public data sets containing DCM and non-DCM samples were integrated and analyzed, which allowed the implementation of clinical diagnostic models. Differentially expressed genes were evaluated in the context of a global protein-protein interaction network, constructed as part of this study. Potential associations with HF were identified by searching the scientific literature. From these analyses, classification models were built and their effectiveness in differentiating between DCM and non-DCM samples was estimated. The main outcome was a set of integrated, potentially novel DCM signature genes, which may be used as reliable disease biomarkers. An empirical demonstration of the power of the integrative classification models against single-source models is also given.
扩张型心肌病(DCM)是西方国家心力衰竭(HF)和心脏移植的主要原因。单源基因表达分析研究已经确定了潜在的疾病生物标志物和药物靶点。然而,由于实验设置的多样性和相对缺乏数据,人们对预测的稳健性和可重复性提出了担忧。本研究基于整合多个独立数据集和功能网络信息,提出了对稳健且可重复的DCM特征基因的鉴定。整合并分析了来自三个包含DCM和非DCM样本的公共数据集的基因表达谱,这使得临床诊断模型得以实施。在作为本研究一部分构建的全局蛋白质-蛋白质相互作用网络的背景下,对差异表达基因进行了评估。通过搜索科学文献确定了与HF的潜在关联。通过这些分析,建立了分类模型,并评估了它们在区分DCM和非DCM样本方面的有效性。主要结果是一组整合的、可能新颖的DCM特征基因,它们可作为可靠的疾病生物标志物。还给出了综合分类模型相对于单源模型的功效的实证证明。