• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用蛋白质相互作用网络预测蛋白质亚线粒体定位

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks.

作者信息

Hoseini Adele Sadat Haghighat, Mirzarezaee Mitra

机构信息

Department of Computer Engineering, Science and Research branch, Islamic Azad University, Tehran, Iran.

School of Biological Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.

出版信息

Iran J Biotechnol. 2018 Aug 11;16(3):e1933. doi: 10.15171/ijb.1933. eCollection 2018 Aug.

DOI:10.15171/ijb.1933
PMID:31457027
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6697825/
Abstract

BACKGROUND

Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from protein sequences. In contrast, protein interactions have been less investigated.

OBJECTIVES

As protein interactions usually occur in the same or adjacent places, using this feature to find the location would be efficient and impressive. This study did not aim at increasing the total accuracy of the conducted research. The study has focused on the features of the proteins' interaction and their employment which lead to a higher accuracy.

MATERIALS AND METHODS

In this study, we have examined the protein interaction network as one of the features for prediction of the protein localization and its effects on the prediction results. In this regards, we have gathered some of the most common features including Amino Acid Composition, Dipeptide Compositions, Pseudo Amino Acid Compositions (PseAAC), Position Specific Scoring Matrix (PSSM), Functional Domain, Gene Ontology information, and the Pair-wise sequence alignment. The results of the classification are compared to the ones using protein interactions. For achieving this goal different machine learning algorithms were tested.

RESULTS

The best-obtained results of using single feature set obtained using SVM classifier for PseAAC feature. The accuracy of combining all features with PPI data, using the Decision Tree and Random Forest classifiers, was 82.49% and 83.35%, respectively. In another experiment, using just protein interaction data with the different cutting points resulted in obtaining an accuracy of 93.035% for the protein location prediction.

CONCLUSION

In total, it was shown that protein(s) interaction has a significant impact on the prediction of the mitochondrial proteins' location. This feature can separately distinguish the locations well. Using this feature the accuracy of the results is raised up to 5%.

摘要

背景

蛋白质定位预测是生物信息学中最重要的问题之一,用于预测细胞和细胞器(如线粒体)中的蛋白质。在本研究中,应用了几种机器学习算法来预测细胞内蛋白质的位置。这些算法使用从蛋白质序列中提取的特征。相比之下,蛋白质相互作用的研究较少。

目的

由于蛋白质相互作用通常发生在相同或相邻的位置,利用这一特征来确定位置将是高效且令人印象深刻的。本研究并非旨在提高所开展研究的总体准确性。该研究专注于蛋白质相互作用的特征及其应用,从而实现更高的准确性。

材料与方法

在本研究中,我们将蛋白质相互作用网络作为预测蛋白质定位的特征之一进行了研究,并探讨了其对预测结果的影响。为此,我们收集了一些最常见的特征,包括氨基酸组成、二肽组成、伪氨基酸组成(PseAAC)、位置特异性评分矩阵(PSSM)、功能域、基因本体信息以及成对序列比对。将分类结果与使用蛋白质相互作用的结果进行比较。为实现这一目标,测试了不同的机器学习算法。

结果

使用支持向量机(SVM)分类器对PseAAC特征获得的单特征集取得了最佳结果。使用决策树和随机森林分类器将所有特征与蛋白质-蛋白质相互作用(PPI)数据相结合时,准确率分别为82.49%和83.35%。在另一项实验中,仅使用具有不同切点的蛋白质相互作用数据,蛋白质定位预测的准确率达到了93.035%。

结论

总体而言,研究表明蛋白质相互作用对线粒体蛋白质定位的预测有显著影响。这一特征能够很好地分别区分不同位置。利用这一特征,结果的准确率提高了5%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51aa/6697825/bd92ef9fce96/ijb-2018-03-e1933-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51aa/6697825/8a272d6bf6cd/ijb-2018-03-e1933-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51aa/6697825/bd92ef9fce96/ijb-2018-03-e1933-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51aa/6697825/8a272d6bf6cd/ijb-2018-03-e1933-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51aa/6697825/bd92ef9fce96/ijb-2018-03-e1933-g002.jpg

相似文献

1
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks.利用蛋白质相互作用网络预测蛋白质亚线粒体定位
Iran J Biotechnol. 2018 Aug 11;16(3):e1933. doi: 10.15171/ijb.1933. eCollection 2018 Aug.
2
Predicting protein submitochondrial locations by incorporating the pseudo-position specific scoring matrix into the general Chou's pseudo-amino acid composition.通过将伪位置特异性评分矩阵纳入广义 Chou 的伪氨基酸组成来预测蛋白质亚线粒体位置。
J Theor Biol. 2018 Aug 7;450:86-103. doi: 10.1016/j.jtbi.2018.04.026. Epub 2018 Apr 18.
3
Accurate prediction of subcellular location of apoptosis proteins combining Chou's PseAAC and PsePSSM based on wavelet denoising.基于小波去噪结合周氏伪氨基酸组成和伪位置特异性得分矩阵对凋亡蛋白亚细胞定位的准确预测
Oncotarget. 2017 Nov 21;8(64):107640-107665. doi: 10.18632/oncotarget.22585. eCollection 2017 Dec 8.
4
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
5
Mito-GSAAC: mitochondria prediction using genetic ensemble classifier and split amino acid composition.Mito-GSAAC:基于遗传集成分类器和分裂氨基酸组成的线粒体预测。
Amino Acids. 2012 Apr;42(4):1443-54. doi: 10.1007/s00726-011-0888-0. Epub 2011 Mar 29.
6
CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition.CE-PLoc:一种通过融合不同模式的伪氨基酸组成来预测蛋白质亚细胞位置的集成分类器。
Comput Biol Chem. 2011 Aug 10;35(4):218-29. doi: 10.1016/j.compbiolchem.2011.05.003. Epub 2011 May 27.
7
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
8
Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.基于概率潜在语义索引的核转位信号预测核蛋白。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.
9
Accurate prediction of multi-label protein subcellular localization through multi-view feature learning with RBRL classifier.通过使用 RBRL 分类器的多视图特征学习实现多标签蛋白质亚细胞定位的准确预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab012.
10
Plant-mSubP: a computational framework for the prediction of single- and multi-target protein subcellular localization using integrated machine-learning approaches.植物微小肽:一种使用集成机器学习方法预测单靶点和多靶点蛋白质亚细胞定位的计算框架。
AoB Plants. 2019 Oct 17;12(3):plz068. doi: 10.1093/aobpla/plz068. eCollection 2020 Jun.

引用本文的文献

1
SubmitoLoc: Identification of mitochondrial sub cellular locations of proteins using support vector machine.SubmitoLoc:使用支持向量机鉴定蛋白质的线粒体亚细胞定位
Bioinformation. 2019 Dec 31;15(12):863-868. doi: 10.6026/97320630015863. eCollection 2019.

本文引用的文献

1
Prediction of relative solvent accessibility by support vector regression and best-first method.通过支持向量回归和最佳优先方法预测相对溶剂可及性。
EXCLI J. 2010 Feb 8;9:29-38. eCollection 2010.
2
rpiCOOL: A tool for In Silico RNA-protein interaction detection using random forest.rpiCOOL:一种使用随机森林进行计算机模拟RNA-蛋白质相互作用检测的工具。
J Theor Biol. 2016 Aug 7;402:1-8. doi: 10.1016/j.jtbi.2016.04.025. Epub 2016 Apr 28.
3
Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.
通过将二肽组成纳入周氏广义伪氨基酸组成来预测蛋白质的亚线粒体定位
J Membr Biol. 2016 Jun;249(3):293-304. doi: 10.1007/s00232-015-9868-8. Epub 2016 Jan 8.
4
LocFuse: human protein-protein interaction prediction via classifier fusion using protein localization information.LocFuse:利用蛋白质定位信息通过分类器融合进行人类蛋白质-蛋白质相互作用预测
Genomics. 2014 Dec;104(6 Pt B):496-503. doi: 10.1016/j.ygeno.2014.10.006. Epub 2014 Oct 16.
5
Using over-represented tetrapeptides to predict protein submitochondria locations.使用过度代表性的四肽预测蛋白质亚线粒体定位。
Acta Biotheor. 2013 Jun;61(2):259-68. doi: 10.1007/s10441-013-9181-9. Epub 2013 Mar 10.
6
Predicting protein submitochondria locations by combining different descriptors into the general form of Chou's pseudo amino acid composition.通过将不同描述符组合到 Chou 的伪氨基酸组成的通用形式中,预测蛋白质亚线粒体位置。
Amino Acids. 2012 Aug;43(2):545-55. doi: 10.1007/s00726-011-1143-4. Epub 2011 Nov 20.
7
Multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization.基于 Chou 的 PseAAC 构建的多核转移学习在蛋白质亚线粒体定位中的应用。
J Theor Biol. 2012 Jan 21;293:121-30. doi: 10.1016/j.jtbi.2011.10.015. Epub 2011 Oct 21.
8
CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition.CE-PLoc:一种通过融合不同模式的伪氨基酸组成来预测蛋白质亚细胞位置的集成分类器。
Comput Biol Chem. 2011 Aug 10;35(4):218-29. doi: 10.1016/j.compbiolchem.2011.05.003. Epub 2011 May 27.
9
Recent progress in predicting protein sub-subcellular locations.蛋白质亚细胞定位预测的最新进展。
Expert Rev Proteomics. 2011 Jun;8(3):391-404. doi: 10.1586/epr.11.20.
10
Identify submitochondria and subchloroplast locations with pseudo amino acid composition: approach from the strategy of discrete wavelet transform feature extraction.用伪氨基酸组成识别亚线粒体和亚叶绿体位置:基于离散小波变换特征提取策略的方法
Biochim Biophys Acta. 2011 Mar;1813(3):424-30. doi: 10.1016/j.bbamcr.2011.01.011. Epub 2011 Jan 19.