You Zhu-Hong, Li Shuai, Gao Xin, Luo Xin, Ji Zhen
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guangdong 518060, China.
Department of Computing, Hong Kong Polytechnic University, Hong Kong.
Biomed Res Int. 2014;2014:598129. doi: 10.1155/2014/598129. Epub 2014 Aug 18.
Protein-protein interactions are the basis of biological functions, and studying these interactions on a molecular level is of crucial importance for understanding the functionality of a living cell. During the past decade, biosensors have emerged as an important tool for the high-throughput identification of proteins and their interactions. However, the high-throughput experimental methods for identifying PPIs are both time-consuming and expensive. On the other hand, high-throughput PPI data are often associated with high false-positive and high false-negative rates. Targeting at these problems, we propose a method for PPI detection by integrating biosensor-based PPI data with a novel computational model. This method was developed based on the algorithm of extreme learning machine combined with a novel representation of protein sequence descriptor. When performed on the large-scale human protein interaction dataset, the proposed method achieved 84.8% prediction accuracy with 84.08% sensitivity at the specificity of 85.53%. We conducted more extensive experiments to compare the proposed method with the state-of-the-art techniques, support vector machine. The achieved results demonstrate that our approach is very promising for detecting new PPIs, and it can be a helpful supplement for biosensor-based PPI data detection.
蛋白质-蛋白质相互作用是生物功能的基础,在分子水平上研究这些相互作用对于理解活细胞的功能至关重要。在过去十年中,生物传感器已成为高通量鉴定蛋白质及其相互作用的重要工具。然而,用于鉴定蛋白质-蛋白质相互作用的高通量实验方法既耗时又昂贵。另一方面,高通量蛋白质-蛋白质相互作用数据往往伴随着高假阳性率和高假阴性率。针对这些问题,我们提出了一种将基于生物传感器的蛋白质-蛋白质相互作用数据与一种新型计算模型相结合的蛋白质-蛋白质相互作用检测方法。该方法是基于极限学习机算法并结合蛋白质序列描述符的一种新表示方法开发的。在大规模人类蛋白质相互作用数据集上进行测试时,该方法在特异性为85.53%的情况下,预测准确率达到84.8%,灵敏度为84.08%。我们进行了更广泛的实验,将所提出的方法与最先进的技术——支持向量机进行比较。所取得的结果表明,我们的方法在检测新的蛋白质-蛋白质相互作用方面非常有前景,并且它可以作为基于生物传感器的蛋白质-蛋白质相互作用数据检测的有益补充。