IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1627-1634. doi: 10.1109/TCBB.2018.2812886. Epub 2018 Mar 6.
In recent years, there has been numerous studies constructing a disease network with diverse sources of data. Many researchers attempted to extend the usage of the disease network by employing machine learning algorithms on various problems such as prediction of comorbidity. The relations between diseases can further be specified into causal relations. When causality is laid on the edges in the network, prediction for comorbid diseases can be more improved. However, not many machine learning algorithms have been developed to concern causality. In this study, we exploit a network based machine learning algorithm that generates comorbidity scores from a causal disease network. In order to find comorbid diseases, semi-supervised scoring for causal networks is proposed. It computes scores of entire nodes in the network when a specific node is labeled. Each score is calculated one at a time and affects to the others along causal edges. The algorithm iterates until it converges. We compared the scoring results of the causal disease network and those of simple association network. As a gold standard, we referenced the values of relative risk from prevalence database, HuDiNe. Scoring by the proposed method provides clearer distinguishability between the top-ranked diseases in the comorbidity list. This is a benefit because it allows the choosing of the most significant ones on an easier fashion. To present typical use of the resulting list, comorbid diseases of Huntington disease and pnuemonia are validated via PubMed literature, respectively.
近年来,已有大量研究利用来自不同数据源的方法构建疾病网络。许多研究人员尝试通过在各种问题(如共病预测)上使用机器学习算法来扩展疾病网络的用途。疾病之间的关系可以进一步细化为因果关系。当因果关系置于网络的边缘时,可以进一步改善共病的预测。然而,目前还没有多少机器学习算法被开发出来以关注因果关系。在这项研究中,我们利用基于网络的机器学习算法,从因果疾病网络中生成共病评分。为了找到共病,我们提出了一种因果网络的半监督评分方法。当标记特定节点时,它会计算网络中所有节点的得分。每次计算一个分数,并沿着因果边缘影响其他分数。算法会不断迭代,直到收敛。我们比较了因果疾病网络和简单关联网络的评分结果。作为黄金标准,我们参考了患病率数据库 HuDiNe 中相对风险的值。拟议方法的评分提供了共病清单中排名最高的疾病之间更清晰的可区分性。这是有益的,因为它允许以更简单的方式选择最重要的疾病。为了展示由此产生的列表的典型用途,我们分别通过 PubMed 文献验证了亨廷顿病和肺炎的共病。