Zuo Yong-Chun, Su Wen-Xia, Zhang Shi-Hua, Wang Shan-Shan, Wu Cheng-Yan, Yang Lei, Li Guang-Peng
The Key Laboratory of Mammalian Reproductive Biology and Biotechnology of the Ministry of Education, College of Life Sciences, Inner Mongolia University, Hohhot, 010021, China.
Mol Biosyst. 2015 Mar;11(3):950-7. doi: 10.1039/c4mb00681j. Epub 2015 Jan 21.
Membrane transporters play crucial roles in the fundamental cellular processes of living organisms. Computational techniques are very necessary to annotate the transporter functions. In this study, a multi-class K nearest neighbor classifier based on the increment of diversity (KNN-ID) was developed to discriminate the membrane transporter types when the increment of diversity (ID) was introduced as one of the novel similarity distances. Comparisons with multiple recently published methods showed that the proposed KNN-ID method outperformed the other methods, obtaining more than 20% improvement for overall accuracy. The overall prediction accuracy reached was 83.1%, when the K was selected as 2. The prediction sensitivity achieved 76.7%, 89.1%, 80.1% for channels/pores, electrochemical potential-driven transporters, primary active transporters, respectively. Discrimination and comparison between any two different classes of transporters further demonstrated that the proposed method is a potential classifier and will play a complementary role for facilitating the functional assignment of transporters.
膜转运蛋白在生物体的基本细胞过程中发挥着关键作用。计算技术对于注释转运蛋白功能非常必要。在本研究中,当引入多样性增量(ID)作为一种新的相似性距离时,开发了一种基于多样性增量的多类K近邻分类器(KNN-ID)来区分膜转运蛋白类型。与多种最近发表的方法进行比较表明,所提出的KNN-ID方法优于其他方法,总体准确率提高了20%以上。当K选择为2时,总体预测准确率达到83.1%。通道/孔、电化学势驱动转运蛋白、初级主动转运蛋白的预测灵敏度分别达到76.7%、89.1%、80.1%。对任意两类不同转运蛋白之间的区分和比较进一步证明,所提出的方法是一种有潜力的分类器,将在促进转运蛋白功能分配方面发挥补充作用。