College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.
Eur J Hum Genet. 2013 Jun;21(6):666-72. doi: 10.1038/ejhg.2012.218. Epub 2012 Oct 10.
Analysis of the biological pathways involved in complex human diseases is an important step in elucidating the pathogenesis and mechanism of diseases. Most pathway analysis approaches identify disease-related biological pathways using overlapping genes between pathways and diseases. However, these approaches ignore the functional biological association between pathways and diseases. In this paper, we designed a novel computational framework for prioritising disease-risk pathways based on functional profiling. The disease gene set and biological pathways were translated into functional profiles in the context of GO annotations. We then implemented a semantic similarity measurement for calculating the concordance score between a functional profile of disease genes and a functional profile of pathways (FPP); the concordance score was then used to prioritise and infer disease-risk pathways. A freely accessible web toolkit, 'Functional Profiling-based Pathway Prioritisation' (FPPP), was developed (http://bioinfo.hrbmu.edu.cn/FPPP). During validation, our method successfully identified known disease-pathway pairs with area under the ROC curve (AUC) values of 96.73 and 95.02% in tests using both pathway randomisation and disease randomisation. A robustness analysis showed that FPPP is reliable even when using data containing noise. A case study based on a dilated cardiomyopathy data set indicated that the high-ranking pathways from FPPP are well known to be linked with this disease. Furthermore, we predicted the risk pathways of 413 diseases by using FPPP to build a disease similarity landscape that systematically reveals the global modular organisation of disease associations.
分析涉及复杂人类疾病的生物途径是阐明疾病发病机制和机制的重要步骤。大多数途径分析方法通过途径和疾病之间的重叠基因来识别与疾病相关的生物途径。然而,这些方法忽略了途径和疾病之间的功能生物学关联。在本文中,我们设计了一种基于功能谱的疾病风险途径优先排序的新计算框架。疾病基因集和生物途径被转化为 GO 注释背景下的功能谱。然后,我们实施了一种语义相似性测量方法,用于计算疾病基因功能谱与途径功能谱(FPP)之间的一致性得分;然后使用一致性得分来优先排序和推断疾病风险途径。开发了一个免费的网络工具包“基于功能谱的途径优先排序”(FPPP)(http://bioinfo.hrbmu.edu.cn/FPPP)。在验证过程中,我们的方法使用途径随机化和疾病随机化测试成功识别了具有 96.73%和 95.02%的 AUC 值的已知疾病途径对。稳健性分析表明,即使使用包含噪声的数据,FPPP 也是可靠的。基于扩张型心肌病数据集的案例研究表明,FPPP 中排名较高的途径与该疾病密切相关。此外,我们通过使用 FPPP 构建疾病相似性景观来预测 413 种疾病的风险途径,该景观系统地揭示了疾病关联的全局模块化组织。