Institute of Computing Science and Technology, Guangzhou University, Guangzhou 510006, China.
School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China.
Cells. 2019 Nov 7;8(11):1405. doi: 10.3390/cells8111405.
It is known that many diseases are caused by mutations or abnormalities in microRNA (miRNA). The usual method to predict miRNA disease relationships is to build a high-quality similarity network of diseases and miRNAs. All unobserved associations are ranked by their similarity scores, such that a higher score indicates a greater probability of a potential connection. However, this approach does not utilize information within the network. Therefore, in this study, we propose a machine learning method, called STIM, which uses network topology information to predict disease-miRNA associations. In contrast to the conventional approach, STIM constructs features according to information on similarity and topology in networks and then uses a machine learning model to predict potential associations. To verify the reliability and accuracy of our method, we compared STIM to other classical algorithms. The results of fivefold cross validation demonstrated that STIM outperforms many existing methods, particularly in terms of the area under the curve. In addition, the top 30 candidate miRNAs recommended by STIM in a case study of lung neoplasm have been confirmed in previous experiments, which proved the validity of the method.
已知许多疾病是由 microRNA (miRNA) 的突变或异常引起的。通常预测 miRNA 与疾病关系的方法是构建疾病和 miRNA 的高质量相似性网络。通过比较相似性得分来对所有未观察到的关联进行排序,得分越高表示潜在关联的可能性越大。然而,这种方法没有利用网络内的信息。因此,在这项研究中,我们提出了一种名为 STIM 的机器学习方法,该方法利用网络拓扑信息来预测疾病-miRNA 关联。与传统方法不同,STIM 根据网络中的相似性和拓扑信息构建特征,然后使用机器学习模型来预测潜在的关联。为了验证我们方法的可靠性和准确性,我们将 STIM 与其他经典算法进行了比较。五重交叉验证的结果表明,STIM 优于许多现有的方法,特别是在曲线下面积方面。此外,在肺癌病例研究中,STIM 推荐的前 30 个候选 miRNA 已在前实验中得到证实,这证明了该方法的有效性。