Xiao Xuan, Zou Hong-Liang, Lin Wei-Zhong
Computer Department, Jing-De-Zhen Ceramic Institute, Jingdezhen, 333046, China,
J Membr Biol. 2015 Aug;248(4):745-52. doi: 10.1007/s00232-015-9787-8. Epub 2015 Mar 22.
Predicting membrane protein type is a challenging problem, particularly when the query proteins may simultaneously have two or more different types. Most of the existing methods can only be used to deal with the single-label proteins. Actually, multiple-label proteins should not be ignored because they usually bear some special functions worthy of in-depth studies. By introducing the "multi-labeled learning" and hybridizing evolution information through Grey-PSSM, a novel predictor called iMem-Seq is developed that can be used to deal with the systems containing both single and multiple types of membrane proteins. As a demonstration, the jackknife cross-validation was performed with iMem-Seq on a benchmark dataset of membrane proteins classified into the eight types, where some proteins belong to two or there types, but none has ≥25% pairwise sequence identity to any other in a same subset. It was demonstrated via the rigorous cross-validations that the new predictor remarkably outperformed all its counterparts. As a user-friendly web-server, iMem-Seq is freely accessible to the public at the website http://www.jci-bioinfo.cn/iMem-Seq .
预测膜蛋白类型是一个具有挑战性的问题,尤其是当查询蛋白可能同时具有两种或更多不同类型时。现有的大多数方法只能用于处理单标签蛋白。实际上,多标签蛋白不应被忽视,因为它们通常具有一些值得深入研究的特殊功能。通过引入“多标签学习”并通过Grey-PSSM混合进化信息,开发了一种名为iMem-Seq的新型预测器,可用于处理包含单类型和多类型膜蛋白的系统。作为演示,在一个分为八种类型的膜蛋白基准数据集上使用iMem-Seq进行了留一法交叉验证,其中一些蛋白属于两种或三种类型,但在同一子集中没有任何一个与其他蛋白具有≥25%的成对序列同一性。通过严格的交叉验证证明,新的预测器明显优于所有同类预测器。作为一个用户友好的网络服务器,公众可以通过网站http://www.jci-bioinfo.cn/iMem-Seq免费访问iMem-Seq 。