Suppr超能文献

pLoc-mGneg:通过基于通用伪氨基酸组成的深度基因本体学习预测革兰氏阴性菌蛋白质的亚细胞定位。

pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC.

作者信息

Cheng Xiang, Xiao Xuan, Chou Kuo-Chen

机构信息

Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China; The Gordon Life Science Institute, Boston, MA 02478, USA.

The Gordon Life Science Institute, Boston, MA 02478, USA; Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China; Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia.

出版信息

Genomics. 2017 Oct 6. doi: 10.1016/j.ygeno.2017.10.002.

Abstract

Information of the proteins' subcellular localization is crucially important for revealing their biological functions in a cell, the basic unit of life. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop computational tools for timely identifying their subcellular locations based on the sequence information alone. The current study is focused on the Gram-negative bacterial proteins. Although considerable efforts have been made in protein subcellular prediction, the problem is far from being solved yet. This is because mounting evidences have indicated that many Gram-negative bacterial proteins exist in two or more location sites. Unfortunately, most existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions important for both basic research and drug design. In this study, by using the multi-label theory, we developed a new predictor called "pLoc-mGneg" for predicting the subcellular localization of Gram-negative bacterial proteins with both single and multiple locations. Rigorous cross-validation on a high quality benchmark dataset indicated that the proposed predictor is remarkably superior to "iLoc-Gneg", the state-of-the-art predictor for the same purpose. For the convenience of most experimental scientists, a user-friendly web-server for the novel predictor has been established at http://www.jci-bioinfo.cn/pLoc-mGneg/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.

摘要

蛋白质亚细胞定位信息对于揭示其在细胞(生命的基本单位)中的生物学功能至关重要。随着后基因组时代产生的蛋白质序列大量涌现,迫切需要开发仅基于序列信息就能及时识别其亚细胞定位的计算工具。当前的研究聚焦于革兰氏阴性菌蛋白质。尽管在蛋白质亚细胞定位预测方面已经付出了相当大的努力,但问题远未得到解决。这是因为越来越多的证据表明,许多革兰氏阴性菌蛋白质存在于两个或更多的定位位点。不幸的是,大多数现有方法只能用于处理单定位蛋白质。实际上,多定位蛋白质可能具有一些对基础研究和药物设计都很重要的特殊生物学功能。在本研究中,我们利用多标签理论开发了一种名为“pLoc-mGneg”的新预测器,用于预测革兰氏阴性菌单定位和多定位蛋白质的亚细胞定位。在一个高质量基准数据集上进行的严格交叉验证表明,所提出的预测器明显优于用于相同目的的最先进预测器“iLoc-Gneg”。为方便大多数实验科学家,已在http://www.jci-bioinfo.cn/pLoc-mGneg/建立了一个用户友好的新型预测器网络服务器,用户通过该服务器可以轻松获得所需结果,而无需处理复杂的数学问题。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验