Hidalgo César A, Blumm Nicholas, Barabási Albert-László, Christakis Nicholas A
Center for International Development and Harvard Kennedy School, Harvard University, Cambridge, Massachusetts, United States of America.
PLoS Comput Biol. 2009 Apr;5(4):e1000353. doi: 10.1371/journal.pcbi.1000353. Epub 2009 Apr 10.
The use of networks to integrate different genetic, proteomic, and metabolic datasets has been proposed as a viable path toward elucidating the origins of specific diseases. Here we introduce a new phenotypic database summarizing correlations obtained from the disease history of more than 30 million patients in a Phenotypic Disease Network (PDN). We present evidence that the structure of the PDN is relevant to the understanding of illness progression by showing that (1) patients develop diseases close in the network to those they already have; (2) the progression of disease along the links of the network is different for patients of different genders and ethnicities; (3) patients diagnosed with diseases which are more highly connected in the PDN tend to die sooner than those affected by less connected diseases; and (4) diseases that tend to be preceded by others in the PDN tend to be more connected than diseases that precede other illnesses, and are associated with higher degrees of mortality. Our findings show that disease progression can be represented and studied using network methods, offering the potential to enhance our understanding of the origin and evolution of human diseases. The dataset introduced here, released concurrently with this publication, represents the largest relational phenotypic resource publicly available to the research community.
利用网络整合不同的基因、蛋白质组和代谢数据集,已被视为阐明特定疾病起源的可行途径。在此,我们引入了一个新的表型数据库,该数据库总结了在表型疾病网络(PDN)中从3000多万患者的疾病史中获得的相关性。我们通过以下几点证明了PDN的结构与理解疾病进展相关:(1)患者倾向于患上在网络中与他们已患疾病相近的疾病;(2)不同性别和种族的患者,疾病在网络链接上的进展有所不同;(3)被诊断患有在PDN中连接性更高的疾病的患者,往往比受连接性较低疾病影响的患者死亡更早;(4)在PDN中倾向于先于其他疾病出现的疾病,往往比先于其他疾病的疾病连接性更高,且与更高的死亡率相关。我们的研究结果表明,疾病进展可以通过网络方法来表示和研究,这为增强我们对人类疾病起源和演变的理解提供了潜力。本文同时发布的数据集,是研究界可公开获取的最大的关系型表型资源。