College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, Shandong, China.
Int J Biol Sci. 2013;9(1):22-33. doi: 10.7150/ijbs.5371. Epub 2012 Dec 19.
The increasing protein sequences from the genome project require theoretical methods to predict transmembrane helical segments (TMHs). So far, several prediction methods have been reported, but there are some deficiencies in prediction accuracy and adaptability in these methods. In this paper, a method based on discrete wavelet transform (DWT) has been developed to predict the number and location of TMHs in membrane proteins. PDB coded as 1KQG is chosen as an example to describe the prediction process by this method. 80 proteins with known 3D structure from Mptopo database are chosen at random as data sets (including 325 TMHs) and 80 sequences are divided into 13 groups according to their function and type. TMHs prediction is carried out for each group of membrane protein sequences and obtain satisfactory result. To verify the feasibility of this method, 80 membrane protein sequences are treated as test sets, 308 TMHs can be predicted and the prediction accuracy is 96.3%. Compared with the main prediction results of seven popular prediction methods, the obtained results indicate that the proposed method in this paper has higher prediction accuracy.
基因组计划产生的越来越多的蛋白质序列需要理论方法来预测跨膜螺旋区 (TMHs)。到目前为止,已经报道了几种预测方法,但这些方法在预测准确性和适应性方面存在一些不足。本文提出了一种基于离散小波变换 (DWT) 的方法,用于预测膜蛋白中 TMHs 的数量和位置。选择 PDB 编码为 1KQG 的蛋白质作为示例,通过该方法描述预测过程。从 Mptopo 数据库中随机选择了 80 种具有已知 3D 结构的蛋白质作为数据集(包括 325 个 TMHs),并根据其功能和类型将 80 个序列分为 13 组。对每组膜蛋白序列进行 TMHs 预测,得到满意的结果。为了验证该方法的可行性,将 80 个膜蛋白序列作为测试集,可预测 308 个 TMHs,预测准确率为 96.3%。与七种流行的预测方法的主要预测结果进行比较,得到的结果表明,本文提出的方法具有更高的预测准确性。