School of Internet of Things Engineering, Wuxi City College of Vocational Technology, Wuxi, 214153, China.
Sci Rep. 2018 Feb 5;8(1):2359. doi: 10.1038/s41598-018-20819-x.
Human immunodeficiency virus (HIV) is the retroviral agent that causes acquired immune deficiency syndrome (AIDS). The number of HIV caused deaths was about 4 million in 2016 alone; it was estimated that about 33 million to 46 million people worldwide living with HIV. The HIV disease is especially harmful because the progressive destruction of the immune system prevents the ability of forming specific antibodies and to maintain an efficacious killer T cell activity. Successful prediction of HIV protein has important significance for the biological and pharmacological functions. In this study, based on the concept of Chou's pseudo amino acid (PseAA) composition and increment of diversity (ID), support vector machine (SVM), logisitic regression (LR), and multilayer perceptron (MP) were presented to predict HIV-1 proteins and HIV-2 proteins. The results of the jackknife test indicated that the highest prediction accuracy and CC values were obtained by the SVM and MP were 0.9909 and 0.9763, respectively, indicating that the classifiers presented in this study were suitable for predicting two groups of HIV proteins.
人类免疫缺陷病毒 (HIV) 是导致获得性免疫缺陷综合征 (AIDS) 的逆转录病毒。仅在 2016 年,HIV 导致的死亡人数就约为 400 万;据估计,全球约有 3300 万至 4600 万人感染了 HIV。HIV 疾病尤其有害,因为免疫系统的逐渐破坏会阻止形成特定抗体的能力,并维持有效的杀伤性 T 细胞活性。成功预测 HIV 蛋白对其生物和药理学功能具有重要意义。在这项研究中,基于周元的伪氨基酸(PseAA)组成和多样性增量(ID)的概念,提出了支持向量机(SVM)、逻辑回归(LR)和多层感知器(MP)来预测 HIV-1 蛋白和 HIV-2 蛋白。Jackknife 测试的结果表明,SVM 和 MP 分别获得了最高的预测准确率和 CC 值,分别为 0.9909 和 0.9763,表明本研究提出的分类器适用于预测两组 HIV 蛋白。