Hofmann-Apitius Martin, Ball Gordon, Gebel Stephan, Bagewadi Shweta, de Bono Bernard, Schneider Reinhard, Page Matt, Kodamullil Alpha Tom, Younesi Erfan, Ebeling Christian, Tegnér Jesper, Canard Luc
Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Institutszentrum Birlinghoven, Sankt Augustin D-53754, Germany.
Rheinische Friedrich-Wilhelms-Universitaet Bonn, University of Bonn, Bonn 53113, Germany.
Int J Mol Sci. 2015 Dec 7;16(12):29179-206. doi: 10.3390/ijms161226148.
Since the decoding of the Human Genome, techniques from bioinformatics, statistics, and machine learning have been instrumental in uncovering patterns in increasing amounts and types of different data produced by technical profiling technologies applied to clinical samples, animal models, and cellular systems. Yet, progress on unravelling biological mechanisms, causally driving diseases, has been limited, in part due to the inherent complexity of biological systems. Whereas we have witnessed progress in the areas of cancer, cardiovascular and metabolic diseases, the area of neurodegenerative diseases has proved to be very challenging. This is in part because the aetiology of neurodegenerative diseases such as Alzheimer´s disease or Parkinson´s disease is unknown, rendering it very difficult to discern early causal events. Here we describe a panel of bioinformatics and modeling approaches that have recently been developed to identify candidate mechanisms of neurodegenerative diseases based on publicly available data and knowledge. We identify two complementary strategies-data mining techniques using genetic data as a starting point to be further enriched using other data-types, or alternatively to encode prior knowledge about disease mechanisms in a model based framework supporting reasoning and enrichment analysis. Our review illustrates the challenges entailed in integrating heterogeneous, multiscale and multimodal information in the area of neurology in general and neurodegeneration in particular. We conclude, that progress would be accelerated by increasing efforts on performing systematic collection of multiple data-types over time from each individual suffering from neurodegenerative disease. The work presented here has been driven by project AETIONOMY; a project funded in the course of the Innovative Medicines Initiative (IMI); which is a public-private partnership of the European Federation of Pharmaceutical Industry Associations (EFPIA) and the European Commission (EC).
自人类基因组解码以来,生物信息学、统计学和机器学习技术在揭示应用于临床样本、动物模型和细胞系统的技术分析技术所产生的不同数据的数量和类型不断增加的模式方面发挥了重要作用。然而,在揭示因果驱动疾病的生物学机制方面进展有限,部分原因是生物系统固有的复杂性。尽管我们在癌症、心血管疾病和代谢疾病领域取得了进展,但神经退行性疾病领域已被证明极具挑战性。部分原因是阿尔茨海默病或帕金森病等神经退行性疾病的病因不明,使得很难辨别早期因果事件。在此,我们描述了一组最近开发的生物信息学和建模方法,这些方法基于公开可用的数据和知识来识别神经退行性疾病的候选机制。我们确定了两种互补策略——以遗传数据为起点的数据挖掘技术,使用其他数据类型进一步丰富,或者在支持推理和富集分析的基于模型的框架中编码关于疾病机制的先验知识。我们的综述说明了在神经病学领域,特别是神经退行性变领域整合异构、多尺度和多模态信息所带来的挑战。我们得出结论,通过加大力度对患有神经退行性疾病的每个个体进行长期系统收集多种数据类型,进展将会加快。这里介绍的工作是由AETIONOMY项目推动的;该项目由创新药物计划(IMI)资助;创新药物计划是欧洲制药工业协会联合会(EFPIA)和欧盟委员会(EC)的公私合作项目。