• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iDeepSubMito:基于深度学习的蛋白质亚线粒体定位鉴定。

iDeepSubMito: identification of protein submitochondrial localization with deep learning.

机构信息

School of Artificial Intelligence, Jilin University, Jilin, China.

Information Science and Technology, Northeast Normal University, Jilin, China.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab288.

DOI:10.1093/bib/bbab288
PMID:34337657
Abstract

Mitochondria are membrane-bound organelles containing over 1000 different proteins involved in mitochondrial function, gene expression and metabolic processes. Accurate localization of those proteins in the mitochondrial compartments is critical to their operation. A few computational methods have been developed for predicting submitochondrial localization from the protein sequences. Unfortunately, most of these computational methods focus on employing biological features or evolutionary information to extract sequence features, which greatly limits the performance of subsequent identification. Moreover, the efficiency of most computational models is still under explored, especially the deep learning feature, which is promising but requires improvement. To address these limitations, we propose a novel computational method called iDeepSubMito to predict the location of mitochondrial proteins to the submitochondrial compartments. First, we adopted a coding scheme using the ProteinELMo to model the probability distribution over the protein sequences and then represent the protein sequences as continuous vectors. Then, we proposed and implemented convolutional neural network architecture based on the bidirectional LSTM with self-attention mechanism, to effectively explore the contextual information and protein sequence semantic features. To demonstrate the effectiveness of our proposed iDeepSubMito, we performed cross-validation on two datasets containing 424 proteins and 570 proteins respectively, and consisting of four different mitochondrial compartments (matrix, inner membrane, outer membrane and intermembrane regions). Experimental results revealed that our method outperformed other computational methods. In addition, we tested iDeepSubMito on the M187, M983 and MitoCarta3.0 to further verify the efficiency of our method. Finally, the motif analysis and the interpretability analysis were conducted to reveal novel insights into subcellular biological functions of mitochondrial proteins. iDeepSubMito source code is available on GitHub at https://github.com/houzl3416/iDeepSubMito.

摘要

线粒体是一种具有膜结构的细胞器,包含超过 1000 种不同的蛋白质,这些蛋白质参与线粒体功能、基因表达和代谢过程。这些蛋白质在细胞器中的准确定位对于它们的功能至关重要。已经开发了一些计算方法来根据蛋白质序列预测亚线粒体定位。不幸的是,大多数这些计算方法都集中于利用生物特征或进化信息来提取序列特征,这极大地限制了后续识别的性能。此外,大多数计算模型的效率仍有待探索,特别是深度学习特征,虽然很有前途,但需要改进。为了解决这些限制,我们提出了一种新的计算方法,称为 iDeepSubMito,用于预测线粒体蛋白质的亚线粒体定位。首先,我们采用了一种编码方案,使用 ProteinELMo 来对蛋白质序列的概率分布进行建模,然后将蛋白质序列表示为连续向量。然后,我们提出并实现了基于双向 LSTM 的卷积神经网络架构,该架构具有自注意力机制,可有效探索上下文信息和蛋白质序列语义特征。为了证明我们提出的 iDeepSubMito 的有效性,我们在两个分别包含 424 个和 570 个蛋白质的数据集上进行了交叉验证,这些数据集包含四个不同的线粒体区室(基质、内膜、外膜和膜间区室)。实验结果表明,我们的方法优于其他计算方法。此外,我们还在 M187、M983 和 MitoCarta3.0 上测试了 iDeepSubMito,以进一步验证我们方法的效率。最后,进行了基序分析和可解释性分析,以揭示线粒体蛋白质亚细胞生物功能的新见解。iDeepSubMito 的源代码可在 GitHub 上获得,网址为 https://github.com/houzl3416/iDeepSubMito。

相似文献

1
iDeepSubMito: identification of protein submitochondrial localization with deep learning.iDeepSubMito:基于深度学习的蛋白质亚线粒体定位鉴定。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab288.
2
DeepPred-SubMito: A Novel Submitochondrial Localization Predictor Based on Multi-Channel Convolutional Neural Network and Dataset Balancing Treatment.DeepPred-SubMito:一种基于多通道卷积神经网络和数据集平衡处理的新型亚线粒体定位预测器。
Int J Mol Sci. 2020 Aug 9;21(16):5710. doi: 10.3390/ijms21165710.
3
DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.DeepMito:使用卷积神经网络准确预测蛋白质亚线粒体定位
Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.
4
Mammalian Oxa1 protein is useful for assessment of submitochondrial protein localization and mitochondrial membrane integrity.哺乳动物 Oxa1 蛋白可用于评估亚线粒体蛋白定位和线粒体膜完整性。
Anal Biochem. 2010 Feb 15;397(2):250-2. doi: 10.1016/j.ab.2009.10.035. Epub 2009 Oct 23.
5
Computer-Aided Prediction of Protein Mitochondrial Localization.计算机辅助预测蛋白质的线粒体定位。
Methods Mol Biol. 2021;2275:433-452. doi: 10.1007/978-1-0716-1262-0_28.
6
Large-scale prediction and analysis of protein sub-mitochondrial localization with DeepMito.DeepMito 实现大规模预测和分析蛋白质亚线粒体定位
BMC Bioinformatics. 2020 Sep 16;21(Suppl 8):266. doi: 10.1186/s12859-020-03617-z.
7
Protein submitochondrial localization from integrated sequence representation and SVM-based backward feature extraction.基于整合序列表示和支持向量机的反向特征提取的蛋白质亚线粒体定位
Mol Biosyst. 2015 Jan;11(1):170-7. doi: 10.1039/c4mb00340c. Epub 2014 Oct 21.
8
iCircRBP-DHN: identification of circRNA-RBP interaction sites using deep hierarchical network.iCircRBP-DHN:使用深度层次网络识别 circRNA-RBP 相互作用位点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa274.
9
DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding.DeepLncLoc:一种基于子序列嵌入的深度学习框架,用于长非编码 RNA 亚细胞定位预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab360.
10
Predicting protein-ligand binding residues with deep convolutional neural networks.使用深度卷积神经网络预测蛋白质-配体结合残基。
BMC Bioinformatics. 2019 Feb 26;20(1):93. doi: 10.1186/s12859-019-2672-1.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
Protein subcellular localization prediction tools.蛋白质亚细胞定位预测工具。
Comput Struct Biotechnol J. 2024 Apr 15;23:1796-1807. doi: 10.1016/j.csbj.2024.04.032. eCollection 2024 Dec.
3
A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.
一种基于Transformer的蛋白质-蛋白质相互作用位点预测集成框架。
Research (Wash D C). 2023 Sep 27;6:0240. doi: 10.34133/research.0240. eCollection 2023.
4
Transformer Architecture and Attention Mechanisms in Genome Data Analysis: A Comprehensive Review.基因组数据分析中的Transformer架构与注意力机制:全面综述
Biology (Basel). 2023 Jul 22;12(7):1033. doi: 10.3390/biology12071033.
5
Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics.蛋白质亚细胞定位预测及相关主题的最新进展
Front Bioinform. 2022 May 19;2:910531. doi: 10.3389/fbinf.2022.910531. eCollection 2022.
6
SAWRPI: A Stacking Ensemble Framework With Adaptive Weight for Predicting ncRNA-Protein Interactions Using Sequence Information.SAWRPI:一种基于序列信息预测非编码RNA-蛋白质相互作用的具有自适应权重的堆叠集成框架。
Front Genet. 2022 Feb 28;13:839540. doi: 10.3389/fgene.2022.839540. eCollection 2022.
7
ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs.ECM-LSE:利用k间隔氨基酸对的深度潜在空间编码预测细胞外基质蛋白
Front Bioeng Biotechnol. 2021 Oct 14;9:752658. doi: 10.3389/fbioe.2021.752658. eCollection 2021.