School of Computing and Mathematics, Computer Science Research Institute, University of Ulster at Jordanstown, Northern Ireland, UK.
Comput Biol Med. 2010 Mar;40(3):306-17. doi: 10.1016/j.compbiomed.2010.01.002. Epub 2010 Feb 6.
This study applied a knowledge-driven data integration framework for the inference of protein-protein interactions (PPI). Evidence from diverse genomic features is integrated using a knowledge-driven Bayesian network (KD-BN). Receiver operating characteristic (ROC) curves may not be the optimal assessment method to evaluate a classifier's performance in PPI prediction as the majority of the area under the curve (AUC) may not represent biologically meaningful results. It may be of benefit to interpret the AUC of a partial ROC curve whereby biologically interesting results are represented. Therefore, the novel application of the assessment method referred to as the partial ROC has been employed in this study to assess predictive performance of PPI predictions along with calculating the True positive/false positive rate and true positive/positive rate. By incorporating domain knowledge into the construction of the KD-BN, we demonstrate improvement in predictive performance compared with previous studies based upon the Naive Bayesian approach.
本研究应用知识驱动的数据集成框架来推断蛋白质-蛋白质相互作用(PPI)。使用知识驱动的贝叶斯网络(KD-BN)整合来自多种基因组特征的证据。受试者工作特征(ROC)曲线可能不是评估 PPI 预测中分类器性能的最佳评估方法,因为大多数曲线下面积(AUC)可能无法代表具有生物学意义的结果。解释 AUC 的局部 ROC 曲线可能会有所帮助,因为其中代表了具有生物学意义的结果。因此,本研究采用了一种称为局部 ROC 的新评估方法,用于评估 PPI 预测的预测性能,同时计算真阳性/假阳性率和真阳性/阳性率。通过将领域知识纳入 KD-BN 的构建中,与基于朴素贝叶斯方法的先前研究相比,我们展示了预测性能的提高。