Department of Computer and Information Science (DISI), Università degli Studi di Genova, Via Dodecaneso 35, Genova, I-16146, Italy.
BMC Med Genomics. 2011 Jul 5;4:55. doi: 10.1186/1755-8794-4-55.
A molecular characterization of Alzheimer's Disease (AD) is the key to the identification of altered gene sets that lead to AD progression. We rely on the assumption that candidate marker genes for a given disease belong to specific pathogenic pathways, and we aim at unveiling those pathways stable across tissues, treatments and measurement systems. In this context, we analyzed three heterogeneous datasets, two microarray gene expression sets and one protein abundance set, applying a recently proposed feature selection method based on regularization.
For each dataset we identified a signature that was successively evaluated both from the computational and functional characterization viewpoints, estimating the classification error and retrieving the most relevant biological knowledge from different repositories. Each signature includes genes already known to be related to AD and genes that are likely to be involved in the pathogenesis or in the disease progression. The integrated analysis revealed a meaningful overlap at the functional level.
The identification of three gene signatures showing a relevant overlap of pathways and ontologies, increases the likelihood of finding potential marker genes for AD.
阿尔茨海默病(AD)的分子特征是确定导致 AD 进展的改变基因集的关键。我们假设给定疾病的候选标记基因属于特定的致病途径,我们的目标是揭示跨组织、治疗和测量系统稳定的途径。在这种情况下,我们分析了三个异构数据集,两个微阵列基因表达集和一个蛋白质丰度集,应用了一种最近提出的基于正则化的特征选择方法。
对于每个数据集,我们都确定了一个签名,然后从计算和功能表征的角度对其进行了评估,估计了分类错误,并从不同的存储库中检索了最相关的生物学知识。每个签名都包含已知与 AD 相关的基因以及可能参与发病机制或疾病进展的基因。综合分析在功能水平上显示出有意义的重叠。
鉴定出三个基因特征,这些特征在途径和本体上有显著的重叠,增加了发现 AD 潜在标记基因的可能性。