Zanin Massimiliano
Innaxis Foundation & Research Institute, Madrid, Spain; Department of Electrical Engineering, Faculty of Sciences and Technology, Universidade Nova de Lisboa, Caparica, Portugal.
F1000Res. 2016 Nov 16;5:2675. doi: 10.12688/f1000research.8282.1. eCollection 2016.
Complex network theory has been used, during the last decade, to understand the structures behind complex biological problems, yielding new knowledge in a large number of situations. Nevertheless, such knowledge has remained mostly qualitative. In this contribution, I show how information extracted from a network representation can be used in a quantitative way, to improve the score of a classification task. As a test bed, I consider a dataset corresponding to patients suffering from prostate cancer, and the task of successfully prognosing their survival. When information from a complex network representation is added on top of a simple classification model, the error is reduced from 27.9% to 23.8%. This confirms that network theory can be used to synthesize information that may not readily be accessible by standard data mining algorithms.
在过去十年中,复杂网络理论已被用于理解复杂生物学问题背后的结构,在大量情况下产生了新知识。然而,这些知识大多仍停留在定性层面。在本论文中,我展示了如何以定量方式使用从网络表示中提取的信息,以提高分类任务的得分。作为测试平台,我考虑了一个与前列腺癌患者相关的数据集,以及成功预测其生存情况的任务。当在简单分类模型之上添加来自复杂网络表示的信息时,错误率从27.9%降至23.8%。这证实了网络理论可用于合成标准数据挖掘算法可能无法轻易获取的信息。