• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TESTLoc:从 EST 数据预测蛋白质亚细胞定位。

TESTLoc: protein subcellular localization prediction from EST data.

机构信息

Robert-Cedergren Center for Bioinformatics and Genomics; Biochemistry Department, Université de Montréal, 2900 Edouard-Montpetit, Montreal, QC, H3T 1J4, Canada.

出版信息

BMC Bioinformatics. 2010 Nov 15;11:563. doi: 10.1186/1471-2105-11-563.

DOI:10.1186/1471-2105-11-563
PMID:21078192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3000424/
Abstract

BACKGROUND

The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes.

RESULTS

We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%).

CONCLUSIONS

TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html.

摘要

背景

真核细胞的结构错综复杂,其中的隔室和亚结构专门用于特定的生物过程。了解蛋白质的亚细胞定位不仅表明了生物过程在不同的细胞隔室中是如何组织的,而且有助于揭示单个蛋白质的功能。仅基于序列信息就可以进行计算定位预测,并且已经成功应用于来自几乎所有亚细胞隔室和所有生命领域的蛋白质。然而,我们意识到,当前的预测工具在部分蛋白质序列(例如从表达序列标签(EST)数据推断出的序列)上的性能并不理想,限制了对来自真核生物的大量且在分类上最全面的序列信息的利用。

结果

我们开发了一种新的预测器 TESTLoc,适用于基于从 EST(EST-肽)概念上翻译的部分序列的蛋白质亚细胞定位预测。支持向量机(SVM)被用作计算方法,EST-肽由不同的特征表示,如氨基酸组成和理化性质。当将 TESTLoc 应用于最具挑战性的测试案例(植物数据)时,它产生了高准确性(约 85%)。

结论

TESTLoc 是一个专为 EST 数据设计的定位预测工具。它为用户提供了多种模型可供选择,并可在 http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html 下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/78a7a22878bd/1471-2105-11-563-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/0bec2df57551/1471-2105-11-563-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/93047aaf6bd0/1471-2105-11-563-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/feb192a19a90/1471-2105-11-563-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/78a7a22878bd/1471-2105-11-563-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/0bec2df57551/1471-2105-11-563-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/93047aaf6bd0/1471-2105-11-563-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/feb192a19a90/1471-2105-11-563-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f86/3000424/78a7a22878bd/1471-2105-11-563-4.jpg

相似文献

1
TESTLoc: protein subcellular localization prediction from EST data.TESTLoc:从 EST 数据预测蛋白质亚细胞定位。
BMC Bioinformatics. 2010 Nov 15;11:563. doi: 10.1186/1471-2105-11-563.
2
'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.“团结起来攻克难关”:通过整合多种专业工具增强蛋白质亚细胞定位预测
BMC Bioinformatics. 2007 Oct 29;8:420. doi: 10.1186/1471-2105-8-420.
3
pSLIP: SVM based protein subcellular localization prediction using multiple physicochemical properties.pSLIP:基于支持向量机并利用多种物理化学性质进行蛋白质亚细胞定位预测
BMC Bioinformatics. 2005 Jun 17;6:152. doi: 10.1186/1471-2105-6-152.
4
ESAP plus: a web-based server for EST-SSR marker development.ESAP plus:一个用于EST-SSR标记开发的基于网络的服务器。
BMC Genomics. 2016 Dec 22;17(Suppl 13):1035. doi: 10.1186/s12864-016-3328-4.
5
Support vector machine approach for protein subcellular localization prediction.用于蛋白质亚细胞定位预测的支持向量机方法
Bioinformatics. 2001 Aug;17(8):721-8. doi: 10.1093/bioinformatics/17.8.721.
6
ProLoc-GO: utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization.ProLoc-GO:利用信息丰富的基因本体术语进行基于序列的蛋白质亚细胞定位预测。
BMC Bioinformatics. 2008 Feb 1;9:80. doi: 10.1186/1471-2105-9-80.
7
Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.基于知识的真核生物蛋白质亚细胞定位预测。
BMC Bioinformatics. 2009 Dec 3;10 Suppl 15(Suppl 15):S8. doi: 10.1186/1471-2105-10-S15-S8.
8
SherLoc: high-accuracy prediction of protein subcellular localization by integrating text and protein sequence data.SherLoc:通过整合文本和蛋白质序列数据对蛋白质亚细胞定位进行高精度预测。
Bioinformatics. 2007 Jun 1;23(11):1410-7. doi: 10.1093/bioinformatics/btm115. Epub 2007 Mar 28.
9
Supervised learning method for the prediction of subcellular localization of proteins using amino acid and amino acid pair composition.使用氨基酸和氨基酸对组成预测蛋白质亚细胞定位的监督学习方法。
BMC Genomics. 2008;9 Suppl 1(Suppl 1):S16. doi: 10.1186/1471-2164-9-S1-S16.
10
ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST.ESLpred:基于支持向量机的方法,利用二肽组成和PSI-BLAST对真核蛋白质进行亚细胞定位。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W414-9. doi: 10.1093/nar/gkh350.

引用本文的文献

1
Identification and Expression Pattern of EZH2 in Pig Developing Fetuses.鉴定和表达模式 EZH2 在猪胎儿发育。
Biomed Res Int. 2020 Oct 5;2020:5315930. doi: 10.1155/2020/5315930. eCollection 2020.
2
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
3
PSI: a comprehensive and integrative approach for accurate plant subcellular localization prediction.PSI:一种用于准确预测植物亚细胞定位的全面综合方法。

本文引用的文献

1
Combining machine learning and homology-based approaches to accurately predict subcellular localization in Arabidopsis.结合机器学习和基于同源性的方法准确预测拟南芥的亚细胞定位。
Plant Physiol. 2010 Sep;154(1):36-54. doi: 10.1104/pp.110.156851. Epub 2010 Jul 20.
2
Going from where to why--interpretable prediction of protein subcellular localization.从何处到为何——蛋白质亚细胞定位的可解释预测。
Bioinformatics. 2010 May 1;26(9):1232-8. doi: 10.1093/bioinformatics/btq115. Epub 2010 Mar 17.
3
A systemic proteomic analysis of Populus chloroplast by using shotgun method.
PLoS One. 2013 Oct 23;8(10):e75826. doi: 10.1371/journal.pone.0075826. eCollection 2013.
4
A hybrid distance measure for clustering expressed sequence tags originating from the same gene family.一种用于聚类来自同一基因家族的表达序列标签的混合距离度量方法。
PLoS One. 2012;7(10):e47216. doi: 10.1371/journal.pone.0047216. Epub 2012 Oct 11.
5
An ensemble classifier for eukaryotic protein subcellular location prediction using gene ontology categories and amino acid hydrophobicity.基于基因本体论类别和氨基酸疏水性的真核蛋白质亚细胞定位预测的集成分类器。
PLoS One. 2012;7(1):e31057. doi: 10.1371/journal.pone.0031057. Epub 2012 Jan 30.
利用鸟枪法对毛白杨叶绿体进行系统蛋白质组分析。
Mol Biol Rep. 2011 Jun;38(5):3045-54. doi: 10.1007/s11033-010-9971-y. Epub 2010 Feb 5.
4
Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.基于知识的真核生物蛋白质亚细胞定位预测。
BMC Bioinformatics. 2009 Dec 3;10 Suppl 15(Suppl 15):S8. doi: 10.1186/1471-2105-10-S15-S8.
5
MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction.MultiLoc2:整合系统发育和基因本体论术语可提高亚细胞蛋白质定位预测。
BMC Bioinformatics. 2009 Sep 1;10:274. doi: 10.1186/1471-2105-10-274.
6
Importing mitochondrial proteins: machineries and mechanisms.导入线粒体蛋白:机制与原理
Cell. 2009 Aug 21;138(4):628-44. doi: 10.1016/j.cell.2009.08.005.
7
Proteomic analysis of mitochondria from Caenorhabditis elegans.秀丽隐杆线虫线粒体的蛋白质组学分析
Proteomics. 2009 Oct;9(19):4539-53. doi: 10.1002/pmic.200900101.
8
Supervised ensembles of prediction methods for subcellular localization.用于亚细胞定位的预测方法的监督集成
J Bioinform Comput Biol. 2009 Apr;7(2):269-85. doi: 10.1142/s0219720009004072.
9
Expressed sequence tags: an overview.表达序列标签:概述
Methods Mol Biol. 2009;533:1-12. doi: 10.1007/978-1-60327-136-3_1.
10
Plasticity of a key metabolic pathway in fungi.真菌中关键代谢途径的可塑性
Funct Integr Genomics. 2009 May;9(2):145-51. doi: 10.1007/s10142-008-0095-6. Epub 2008 Sep 16.