• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepLoc:使用深度学习进行蛋白质亚细胞定位预测。

DeepLoc: prediction of protein subcellular localization using deep learning.

机构信息

Department of Bio and Health Informatics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.

The Bioinformatics Centre, Department of Biology, University of Copenhagen, 2200 Copenhagen N, Denmark.

出版信息

Bioinformatics. 2017 Nov 1;33(21):3387-3395. doi: 10.1093/bioinformatics/btx431.

DOI:10.1093/bioinformatics/btx431
PMID:29036616
Abstract

MOTIVATION

The prediction of eukaryotic protein subcellular localization is a well-studied topic in bioinformatics due to its relevance in proteomics research. Many machine learning methods have been successfully applied in this task, but in most of them, predictions rely on annotation of homologues from knowledge databases. For novel proteins where no annotated homologues exist, and for predicting the effects of sequence variants, it is desirable to have methods for predicting protein properties from sequence information only.

RESULTS

Here, we present a prediction algorithm using deep neural networks to predict protein subcellular localization relying only on sequence information. At its core, the prediction model uses a recurrent neural network that processes the entire protein sequence and an attention mechanism identifying protein regions important for the subcellular localization. The model was trained and tested on a protein dataset extracted from one of the latest UniProt releases, in which experimentally annotated proteins follow more stringent criteria than previously. We demonstrate that our model achieves a good accuracy (78% for 10 categories; 92% for membrane-bound or soluble), outperforming current state-of-the-art algorithms, including those relying on homology information.

AVAILABILITY AND IMPLEMENTATION

The method is available as a web server at http://www.cbs.dtu.dk/services/DeepLoc. Example code is available at https://github.com/JJAlmagro/subcellular_localization. The dataset is available at http://www.cbs.dtu.dk/services/DeepLoc/data.php.

CONTACT

jjalma@dtu.dk.

摘要

动机

由于在蛋白质组学研究中具有相关性,预测真核蛋白质亚细胞定位是生物信息学中一个研究得很好的课题。许多机器学习方法已成功应用于该任务,但在大多数方法中,预测依赖于从知识数据库中注释同源物。对于没有注释同源物的新蛋白质,并且对于预测序列变体的影响,仅使用序列信息预测蛋白质特性的方法是可取的。

结果

在这里,我们提出了一种使用深度神经网络的预测算法,该算法仅依赖于序列信息来预测蛋白质亚细胞定位。在其核心,预测模型使用循环神经网络来处理整个蛋白质序列和注意力机制,该机制识别对亚细胞定位重要的蛋白质区域。该模型在从最新的 UniProt 版本之一提取的蛋白质数据集上进行了训练和测试,其中实验注释的蛋白质遵循比以前更严格的标准。我们证明,我们的模型实现了较高的准确性(10 个类别中的 78%;膜结合或可溶性的 92%),优于当前最先进的算法,包括依赖同源信息的算法。

可用性和实现

该方法可作为网络服务器在 http://www.cbs.dtu.dk/services/DeepLoc 使用。示例代码可在 https://github.com/JJAlmagro/subcellular_localization 获得。数据集可在 http://www.cbs.dtu.dk/services/DeepLoc/data.php 获得。

联系

jjalma@dtu.dk。

相似文献

1
DeepLoc: prediction of protein subcellular localization using deep learning.DeepLoc:使用深度学习进行蛋白质亚细胞定位预测。
Bioinformatics. 2017 Nov 1;33(21):3387-3395. doi: 10.1093/bioinformatics/btx431.
2
DeepLoc 2.0: multi-label subcellular localization prediction using protein language models.DeepLoc 2.0:使用蛋白质语言模型进行多标签亚细胞定位预测。
Nucleic Acids Res. 2022 Jul 5;50(W1):W228-W234. doi: 10.1093/nar/gkac278.
3
DeepLoc 2.1: multi-label membrane protein type prediction using protein language models.DeepLoc 2.1:使用蛋白质语言模型进行多标签膜蛋白类型预测。
Nucleic Acids Res. 2024 Jul 5;52(W1):W215-W220. doi: 10.1093/nar/gkae237.
4
An introduction to deep learning on biological sequence data: examples and solutions.深度学习在生物序列数据上的应用:实例与解决方案。
Bioinformatics. 2017 Nov 15;33(22):3685-3690. doi: 10.1093/bioinformatics/btx531.
5
DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.DeepMito:使用卷积神经网络准确预测蛋白质亚线粒体定位
Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.
6
Deep Forest-based Prediction of Protein Subcellular Localization.基于深度森林的蛋白质亚细胞定位预测。
Curr Gene Ther. 2018;18(5):268-274. doi: 10.2174/1566523218666180913110949.
7
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
8
Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multi-label learning based on gene ontology annotation and profile alignment.利用 Chou 的 5 步规则,通过基于基因本体论注释和序列比对的多标签学习,预测革兰氏阴性和革兰氏阳性细菌蛋白质的亚细胞定位。
J Integr Bioinform. 2020 Jun 29;18(1):51-79. doi: 10.1515/jib-2019-0091.
9
DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding.DeepLncLoc:一种基于子序列嵌入的深度学习框架,用于长非编码 RNA 亚细胞定位预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab360.
10
Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.基于知识的真核生物蛋白质亚细胞定位预测。
BMC Bioinformatics. 2009 Dec 3;10 Suppl 15(Suppl 15):S8. doi: 10.1186/1471-2105-10-S15-S8.

引用本文的文献

1
ProtLoc-GRPO: Cell line-specific subcellular localization prediction using a graph-based model and reinforcement learning.ProtLoc-GRPO:使用基于图的模型和强化学习进行细胞系特异性亚细胞定位预测。
bioRxiv. 2025 Jul 22:2025.07.17.665451. doi: 10.1101/2025.07.17.665451.
2
Expanded genetic and functional diversity of oceanic fungi.海洋真菌不断扩展的遗传和功能多样性。
Microbiome. 2025 Aug 4;13(1):179. doi: 10.1186/s40168-025-02162-2.
3
BloodProST: prediction of blood-secretory proteins through self-training.BloodProST:通过自我训练预测血液分泌蛋白
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf385.
4
Phylogenetic and Structural Insights into Melatonin Receptors in Plants: Case Study in Jacq.植物中褪黑素受体的系统发育和结构洞察:以 Jacq. 为例
Plants (Basel). 2025 Jun 26;14(13):1952. doi: 10.3390/plants14131952.
5
RSLpred2: An Integrated Web Server for the Annotation of Rice Proteome Subcellular Localization Using Deep Learning.RSLpred2:一个使用深度学习对水稻蛋白质组亚细胞定位进行注释的集成网络服务器。
Rice (N Y). 2025 Jul 4;18(1):58. doi: 10.1186/s12284-025-00767-7.
6
Practical Applications of Language Models in Protein Sorting Prediction: SignalP 6.0, DeepLoc 2.1, and DeepLocPro 1.0.语言模型在蛋白质分选预测中的实际应用:SignalP 6.0、DeepLoc 2.1和DeepLocPro 1.0
Methods Mol Biol. 2025;2941:153-175. doi: 10.1007/978-1-0716-4623-6_10.
7
Identification of Small Open Reading Frame-Encoded Peptides in Glioma by an Optimized Proteomics Strategy.通过优化的蛋白质组学策略鉴定胶质瘤中小开放阅读框编码的肽段
Mol Cell Proteomics. 2025 Jun 11;24(7):101016. doi: 10.1016/j.mcpro.2025.101016.
8
From genome to drug targets: computational subtractive genomics reveals novel anti-filarial targets in Wuchereria bancrofti and identifies plant-based inhibitors of β-1,4-mannosyltransferase, a high-priority target.从基因组到药物靶点:计算性消减基因组学揭示了班氏吴策线虫新的抗丝虫靶点,并鉴定出β-1,4-甘露糖基转移酶的植物源抑制剂,该酶是一个高度优先的靶点。
Mol Divers. 2025 Jun 11. doi: 10.1007/s11030-025-11229-z.
9
Prediction of protein subcellular localization in single cells.单细胞中蛋白质亚细胞定位的预测。
Nat Methods. 2025 May 13. doi: 10.1038/s41592-025-02696-1.
10
Cdc48 plays a crucial role in redox homeostasis through dynamic reshaping of its interactome during early stationary phase.在稳定期早期,Cdc48通过动态重塑其相互作用组在氧化还原稳态中发挥关键作用。
Redox Biol. 2025 Jul;84:103651. doi: 10.1016/j.redox.2025.103651. Epub 2025 May 1.