• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

最大最小相关分析在蛋白质环境分类中的应用及其功能预测。

Application of maximin correlation analysis to classifying protein environments for function prediction.

机构信息

School of Electrical Engineering, Korea University, Seoul 136-713, Republic of Korea.

出版信息

Biochem Biophys Res Commun. 2010 Sep 17;400(2):219-24. doi: 10.1016/j.bbrc.2010.08.042. Epub 2010 Aug 16.

DOI:10.1016/j.bbrc.2010.08.042
PMID:20719237
Abstract

More and more protein structures are being discovered, but most of these still have little functional information. Based on the assumption that structural resemblance would lead to functional similarity, researchers computationally compare a new structure with functionally annotated structures, for high-throughput function prediction. The effectiveness of this approach depends critically upon the quality of comparison. In particular, robust classification often becomes difficult when a function class is an aggregate of multiple subclasses, as is the case with protein annotations. For such multiple-subclass classification problems, an optimal method termed the maximin correlation analysis (MCA) was proposed. However, MCA has never been applied to automated protein function prediction although MCA can minimize the misclassification risk in the correlation-based nearest neighbor classification, thus increasing classification accuracy. In this article, we apply MCA to classifying three-dimensional protein local environment data derived from a subset of the protein data bank (PDB). In our framework, the MCA-based classifier outperformed the compared alternatives by 7-19% and 6-27% in terms of average sensitivity and specificity, respectively. Given that correlation-based similarity measures have been widely used for mining protein data, we expect that MCA would be employed to enhance other types of automated function prediction methods.

摘要

越来越多的蛋白质结构被发现,但其中大多数仍然缺乏功能信息。基于结构相似性会导致功能相似性的假设,研究人员通过计算将新结构与具有功能注释的结构进行比较,以实现高通量功能预测。这种方法的有效性取决于比较的质量。特别是,当功能类别是多个子类的组合时,例如蛋白质注释的情况,稳健的分类通常变得困难。对于这种多子类分类问题,提出了一种称为最大最小相关分析(MCA)的最优方法。然而,尽管 MCA 可以最小化基于相关性的最近邻分类中的分类错误风险,从而提高分类准确性,但它从未应用于自动化蛋白质功能预测。在本文中,我们将 MCA 应用于从蛋白质数据库(PDB)子集派生的三维蛋白质局部环境数据的分类。在我们的框架中,基于 MCA 的分类器在平均灵敏度和特异性方面分别优于比较的替代方法 7-19%和 6-27%。鉴于基于相关性的相似性度量已被广泛用于挖掘蛋白质数据,我们预计 MCA 将被用于增强其他类型的自动化功能预测方法。

相似文献

1
Application of maximin correlation analysis to classifying protein environments for function prediction.最大最小相关分析在蛋白质环境分类中的应用及其功能预测。
Biochem Biophys Res Commun. 2010 Sep 17;400(2):219-24. doi: 10.1016/j.bbrc.2010.08.042. Epub 2010 Aug 16.
2
Analysis and prediction of functional sub-types from protein sequence alignments.基于蛋白质序列比对的功能亚类型分析与预测。
J Mol Biol. 2000 Oct 13;303(1):61-76. doi: 10.1006/jmbi.2000.4036.
3
Variable predictive model based classification algorithm for effective separation of protein structural classes.基于可变预测模型的分类算法用于有效分离蛋白质结构类别。
Comput Biol Chem. 2008 Aug;32(4):302-6. doi: 10.1016/j.compbiolchem.2008.03.009. Epub 2008 Apr 1.
4
Rapid model quality assessment for protein structure predictions using the comparison of multiple models without structural alignments.使用不进行结构比对的多模型比较进行蛋白质结构预测的快速模型质量评估。
Bioinformatics. 2010 Jan 15;26(2):182-8. doi: 10.1093/bioinformatics/btp629. Epub 2009 Nov 6.
5
Combining evolutionary and structural information for local protein structure prediction.结合进化和结构信息进行局部蛋白质结构预测。
Proteins. 2004 Sep 1;56(4):782-94. doi: 10.1002/prot.20158.
6
The use of gene ontology evidence codes in preventing classifier assessment bias.基因本体证据代码在防止分类器评估偏差中的应用。
Bioinformatics. 2009 May 1;25(9):1173-7. doi: 10.1093/bioinformatics/btp122. Epub 2009 Mar 2.
7
Using ensemble classifier to identify membrane protein types.使用集成分类器识别膜蛋白类型。
Amino Acids. 2007;32(4):483-8. doi: 10.1007/s00726-006-0439-2. Epub 2006 Oct 12.
8
Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction.利用基因本体论评估蛋白质相似性及其在核亚定位预测中的应用。
BMC Bioinformatics. 2006 Nov 7;7:491. doi: 10.1186/1471-2105-7-491.
9
Using Nearest Feature Line and Tunable Nearest Neighbor methods for prediction of protein subcellular locations.使用最近特征线和可调最近邻方法预测蛋白质亚细胞定位。
Comput Biol Chem. 2005 Oct;29(5):388-92. doi: 10.1016/j.compbiolchem.2005.08.002. Epub 2005 Oct 5.
10
AutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings.AutoSCOP:使用独特的模式-类别映射自动预测SCOP分类
Bioinformatics. 2007 May 15;23(10):1203-10. doi: 10.1093/bioinformatics/btm089. Epub 2007 Mar 22.

引用本文的文献

1
Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently.定向进化蛋白质生物催化剂的合成生物学:智能导航序列空间。
Chem Soc Rev. 2015 Mar 7;44(5):1172-239. doi: 10.1039/c4cs00351a.