Wei Rong, Gao Lei, Zhang Tongliang
College of Science, Hebei Polytechnic University, Tangshan, Hebei 063009, P.R. China.
Protein Pept Lett. 2010 Oct;17(10):1215-22. doi: 10.2174/092986610792231429.
Discovering a protein motif is an important research topic in both bioinformatics and protein sciences. This paper presents a novel motif discovery algorithm which is capable of finding a motif set to represent a protein family. The algorithm involves an abstraction method of important features, a location-sensitive connection approach to link two features, and a repeated connection procedure to generate a motif set. The novel algorithm is applied to discovering motifs in 21 ligase subfamilies. The results show that the obtained motifs are able to represent the characteristics of the subfamilies effectively. The proposed algorithm could become a potential useful tool for protein family prediction.
发现蛋白质基序是生物信息学和蛋白质科学中的一个重要研究课题。本文提出了一种新颖的基序发现算法,该算法能够找到一个基序集来代表一个蛋白质家族。该算法包括重要特征的抽象方法、连接两个特征的位置敏感连接方法以及生成基序集的重复连接过程。将该新算法应用于发现21个连接酶亚家族中的基序。结果表明,所获得的基序能够有效地代表这些亚家族的特征。所提出的算法可能成为蛋白质家族预测的一个潜在有用工具。