Suppr超能文献

AtbPpred:使用极端随机树对抗结核肽进行基于序列的稳健预测。

AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees.

作者信息

Manavalan Balachandran, Basith Shaherin, Shin Tae Hwan, Wei Leyi, Lee Gwang

机构信息

Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea.

School of Computer Science and Technology, Tianjin University, China.

出版信息

Comput Struct Biotechnol J. 2019 Jul 3;17:972-981. doi: 10.1016/j.csbj.2019.06.024. eCollection 2019.

Abstract

is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent need for novel and effective alternative therapies. Peptide-based therapy has several advantages, such as diverse mechanisms of action, low immunogenicity, and selective affinity to bacterial cell envelopes. However, the identification of anti-tubercular peptides (AtbPs) via experimentation is laborious and expensive; hence, the development of an efficient computational method is necessary for the prediction of AtbPs prior to both in vitro and in vivo experiments. To this end, we developed a two-layer machine learning (ML)-based predictor called AtbPpred for the identification of AtbPs. In the first layer, we applied a two-step feature selection procedure and identified the optimal feature set individually for nine different feature encodings, whose corresponding models were developed using extremely randomized tree (ERT). In the second-layer, the predicted probability of AtbPs from the above nine models were considered as input features to ERT and developed the final predictor. AtbPpred respectively achieved average accuracies of 88.3% and 87.3% during cross-validation and an independent evaluation, which were ~8.7% and 10.0% higher than the state-of-the-art method. Furthermore, we established a user-friendly webserver which is currently available at http://thegleelab.org/AtbPpred. We anticipate that this predictor could be useful in the high-throughput prediction of AtbPs and also provide mechanistic insights into its functions.

摘要

是人类最危险的病原体之一。它是结核病(TB)的病原体,感染了世界近三分之一的人口。由于耐多药结核病和广泛耐药结核病的高发病率,迫切需要新颖有效的替代疗法。基于肽的疗法具有多种优势,例如作用机制多样、免疫原性低以及对细菌细胞膜具有选择性亲和力。然而,通过实验鉴定抗结核肽(AtbPs)既费力又昂贵;因此,在体外和体内实验之前,开发一种高效的计算方法来预测AtbPs是必要的。为此,我们开发了一种基于两层机器学习(ML)的预测器AtbPpred,用于鉴定AtbPs。在第一层,我们应用了两步特征选择程序,并针对九种不同的特征编码分别确定了最佳特征集,其相应的模型是使用极端随机树(ERT)开发的。在第二层,将上述九个模型预测的AtbPs概率作为ERT的输入特征,开发了最终的预测器。在交叉验证和独立评估期间,AtbPpred的平均准确率分别达到88.3%和87.3%,比现有方法分别高出约8.7%和10.0%。此外,我们建立了一个用户友好的网络服务器,目前可在http://thegleelab.org/AtbPpred上获取。我们预计该预测器可用于AtbPs的高通量预测,并为其功能提供机制性见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af06/6658830/14de823be31b/ga1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验