• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用进化计算对蛋白质序列的序列长度模式进行自动推导和优化。

Automated derivation and refinement of sequence length patterns for protein sequences using evolutionary computation.

作者信息

Sadowski M I, Parish J H, Westhead D R

机构信息

Bioinformatics Unit, Department of Computer Science, University College London, Gower Street, London WC1E 6BT, UK.

出版信息

Biosystems. 2005 Sep;81(3):247-54. doi: 10.1016/j.biosystems.2005.05.001.

DOI:10.1016/j.biosystems.2005.05.001
PMID:16076522
Abstract

Several stratagems are used in protein bioinformatics for the classification of proteins based on sequence, structure or function. We explore the concept of a minimal signature embedded in a sequence that defines the likely position of a protein in a classification. Specifically, we address the derivation of sparse profiles for the G-protein coupled receptor (GPCR) clan of integral membrane proteins. We present an evolutionary algorithm (EA) for the derivation of sparse profiles (signatures) without the need to supply a multiple alignment. We also apply an evolution strategy (ES) to the problem of pattern and profile refinement. Patterns were derived for the GPCR 'superfamily' and GPCR families 1-3 individually from starting populations of randomly generated signatures, using a database of integral membrane protein sequences and an objective function using a modified receiver operator characteristic (ROC) statistic. The signature derived for the family 1 GPCR sequences was shown to perform very well in a stringent cross-validation test, detecting 76% of unseen GPCR sequences at 5% error. Application of the ES refinement method to a signature developed by a previously described method [Sadowski, M.I., Parish, J.H., 2003. Automated generation and refinement of protein signatures: case study with G-protein coupled receptors. Bioinformatics 19, 727-734] resulted in a 6% increase of coverage for 5% error as measured in the validation test. We note that there might be a limit to this or any classification of proteins based on patterns or schemata.

摘要

在蛋白质生物信息学中,有几种策略可用于根据序列、结构或功能对蛋白质进行分类。我们探索了序列中嵌入的最小特征的概念,该特征定义了蛋白质在分类中可能的位置。具体而言,我们研究了整合膜蛋白的G蛋白偶联受体(GPCR)家族的稀疏特征的推导。我们提出了一种进化算法(EA),用于推导稀疏特征(签名),而无需提供多序列比对。我们还将进化策略(ES)应用于模式和特征细化问题。使用整合膜蛋白序列数据库和使用修正的接收者操作特征(ROC)统计量的目标函数,从随机生成的特征的起始群体中分别为GPCR“超家族”和GPCR家族1-3推导模式。在严格的交叉验证测试中,为家族1 GPCR序列推导的特征表现非常出色,在5%的错误率下检测到76%的未见过的GPCR序列。将ES细化方法应用于先前描述的方法[Sadowski, M.I., Parish, J.H., 2003. 蛋白质特征的自动生成和细化:G蛋白偶联受体的案例研究。生物信息学19, 727-734]开发的特征,在验证测试中,5%错误率下的覆盖率提高了6%。我们注意到,基于模式或图式对蛋白质进行这种或任何分类可能存在局限性。

相似文献

1
Automated derivation and refinement of sequence length patterns for protein sequences using evolutionary computation.使用进化计算对蛋白质序列的序列长度模式进行自动推导和优化。
Biosystems. 2005 Sep;81(3):247-54. doi: 10.1016/j.biosystems.2005.05.001.
2
A novel and efficient technique for identification and classification of GPCRs.一种用于G蛋白偶联受体识别与分类的新颖且高效的技术。
IEEE Trans Inf Technol Biomed. 2008 Jul;12(4):541-8. doi: 10.1109/TITB.2007.911308.
3
Protein classification based on text document classification techniques.基于文本文档分类技术的蛋白质分类。
Proteins. 2005 Mar 1;58(4):955-70. doi: 10.1002/prot.20373.
4
GPCR-GIA: a web-server for identifying G-protein coupled receptors and their families with grey incidence analysis.GPCR-GIA:一个利用灰色关联分析识别 G 蛋白偶联受体及其家族的网络服务器。
Protein Eng Des Sel. 2009 Nov;22(11):699-705. doi: 10.1093/protein/gzp057. Epub 2009 Sep 22.
5
HMM-ModE--improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences.HMM-ModE——通过优化判别阈值并利用负训练序列修改发射概率,使用轮廓隐马尔可夫模型改进分类。
BMC Bioinformatics. 2007 Mar 27;8:104. doi: 10.1186/1471-2105-8-104.
6
Classifying G-protein coupled receptors with bagging classification tree.利用装袋分类树对G蛋白偶联受体进行分类。
Comput Biol Chem. 2004 Oct;28(4):275-80. doi: 10.1016/j.compbiolchem.2004.08.001.
7
Automatic generation and evaluation of sparse protein signatures for families of protein structural domains.蛋白质结构域家族的稀疏蛋白质特征的自动生成与评估
Protein Sci. 2005 Jan;14(1):13-23. doi: 10.1110/ps.04929005.
8
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
9
Dimerization in aminergic G-protein-coupled receptors: application of a hidden-site class model of evolution.胺能G蛋白偶联受体中的二聚化:进化的隐藏位点类模型的应用
Biochemistry. 2003 Dec 16;42(49):14522-31. doi: 10.1021/bi035097r.
10
Statistical analysis and prediction of functional residues effective for GPCR-G-protein coupling selectivity.对GPCR-G蛋白偶联选择性有效的功能残基的统计分析与预测
Protein Eng Des Sel. 2006 Jun;19(6):277-83. doi: 10.1093/protein/gzl010. Epub 2006 Mar 24.