Faculty of Biotechnology, Atma Jaya Catholic University of Indonesia, Jakarta, 12930, Indonesia.
Faculty of Biotechnology, Atma Jaya Catholic University of Indonesia, Jakarta, 12930, Indonesia.
Comput Biol Chem. 2021 Jun;92:107492. doi: 10.1016/j.compbiolchem.2021.107492. Epub 2021 Apr 24.
Streptococcus pneumoniae is a major cause of mortality in children under five years old. In recent years, the emergence of antibiotic-resistant strains of S. pneumoniae increases the threat level of this pathogen. For that reason, the exploration of S. pneumoniae protein virulence factors should be considered in developing new drugs or vaccines, for instance by the analysis of host-pathogen protein-protein interactions (HP-PPIs). In this research, prediction of protein-protein interactions was performed with a logistic regression model with the number of protein domain occurrences as features. By utilizing HP-PPIs of three different pathogens as training data, the model achieved 57-77 % precision, 64-75 % recall, and 96-98 % specificity. Prediction of human-S. pneumoniae protein-protein interactions using the model yielded 5823 interactions involving thirty S. pneumoniae proteins and 324 human proteins. Pathway enrichment analysis showed that most of the pathways involved in the predicted interactions are immune system pathways. Network topology analysis revealed β-galactosidase (BgaA) as the most central among the S. pneumoniae proteins in the predicted HP-PPI networks, with a degree centrality of 1.0 and a betweenness centrality of 0.451853. Further experimental studies are required to validate the predicted interactions and examine their roles in S. pneumoniae infection.
肺炎链球菌是导致五岁以下儿童死亡的主要原因。近年来,肺炎链球菌对抗生素的耐药菌株的出现增加了这种病原体的威胁程度。因此,在开发新药物或疫苗时,应该考虑肺炎链球菌蛋白毒力因子,例如通过分析宿主-病原体蛋白-蛋白相互作用(HP-PPIs)。在这项研究中,使用具有蛋白域出现次数作为特征的逻辑回归模型进行蛋白-蛋白相互作用预测。通过利用三种不同病原体的 HP-PPIs 作为训练数据,该模型实现了 57-77%的精度、64-75%的召回率和 96-98%的特异性。使用该模型预测人类-肺炎链球菌蛋白-蛋白相互作用,得到了涉及三十种肺炎链球菌蛋白和 324 个人类蛋白的 5823 个相互作用。途径富集分析表明,预测相互作用中涉及的大多数途径都是免疫系统途径。网络拓扑分析表明,在预测的 HP-PPI 网络中,β-半乳糖苷酶(BgaA)是肺炎链球菌蛋白中最核心的蛋白,其度中心度为 1.0,介数中心度为 0.451853。需要进一步的实验研究来验证预测的相互作用,并研究它们在肺炎链球菌感染中的作用。