Suppr超能文献

Clustering-based approach for predicting motif pairs from protein interaction data.

作者信息

Leung Henry Chi-Ming, Siu Man-Hung, Yiu Siu-Ming, Chin Francis Yuk-Lun, Sung Ken Wing-Kin

机构信息

Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong, China.

出版信息

J Bioinform Comput Biol. 2009 Aug;7(4):701-16. doi: 10.1142/s0219720009004266.

Abstract

UNLABELLED

Predicting motif pairs from a set of protein sequences based on the protein-protein interaction data is an important, but difficult computational problem. Tan et al. proposed a solution to this problem. However, the scoring function (using chi(2) testing) used in their approach is not adequate and their approach is also not scalable. It may take days to process a set of 5000 protein sequences with about 20,000 interactions. Later, Leung et al. proposed an improved scoring function and faster algorithms for solving the same problem. But, the model used in Leung et al. is complicated. The exact value of the scoring function is not easy to compute and an estimated value is used in practice. In this paper, we derive a better model to capture the significance of a given motif pair based on a clustering notion. We develop a fast heuristic algorithm to solve the problem. The algorithm is able to locate the correct motif pair in the yeast data set in about 45 minutes for 5000 protein sequences and 20,000 interactions. Moreover, we derive a lower bound result for the p-value of a motif pair in order for it to be distinguishable from random motif pairs. The lower bound result has been verified using simulated data sets.

AVAILABILITY

http://alse.cs.hku.hk/motif_pair.

摘要

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验