• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

论可理解的分类模型对于蛋白质功能预测的重要性。

On the importance of comprehensible classification models for protein function prediction.

机构信息

Computing Laboratory, University of Kent, Canterbury, UK.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):172-82. doi: 10.1109/TCBB.2008.47.

DOI:10.1109/TCBB.2008.47
PMID:20150679
Abstract

The literature on protein function prediction is currently dominated by works aimed at maximizing predictive accuracy, ignoring the important issues of validation and interpretation of discovered knowledge, which can lead to new insights and hypotheses that are biologically meaningful and advance the understanding of protein functions by biologists. The overall goal of this paper is to critically evaluate this approach, offering a refreshing new perspective on this issue, focusing not only on predictive accuracy but also on the comprehensibility of the induced protein function prediction models. More specifically, this paper aims to offer two main contributions to the area of protein function prediction. First, it presents the case for discovering comprehensible protein function prediction models from data, discussing in detail the advantages of such models, namely, increasing the confidence of the biologist in the system's predictions, leading to new insights about the data and the formulation of new biological hypotheses, and detecting errors in the data. Second, it presents a critical review of the pros and cons of several different knowledge representations that can be used in order to support the discovery of comprehensible protein function prediction models.

摘要

目前,蛋白质功能预测的文献主要集中在最大限度地提高预测准确性的工作上,而忽略了验证和解释发现的知识的重要问题,这可能会导致新的有生物学意义的见解和假设,并促进生物学家对蛋白质功能的理解。本文的总体目标是批判性地评估这种方法,为这个问题提供一个新的视角,不仅关注预测准确性,还关注所诱导的蛋白质功能预测模型的可理解性。更具体地说,本文旨在为蛋白质功能预测领域做出两个主要贡献。首先,它从数据中提出了发现可理解的蛋白质功能预测模型的案例,详细讨论了此类模型的优势,即提高生物学家对系统预测的信心,导致对数据的新见解和提出新的生物学假设,并检测数据中的错误。其次,它对可用于支持发现可理解的蛋白质功能预测模型的几种不同知识表示形式的优缺点进行了批判性回顾。

相似文献

1
On the importance of comprehensible classification models for protein function prediction.论可理解的分类模型对于蛋白质功能预测的重要性。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):172-82. doi: 10.1109/TCBB.2008.47.
2
Prediction of protein structural class for the twilight zone sequences.对处于模糊界限区域的序列进行蛋白质结构类别的预测。
Biochem Biophys Res Commun. 2007 Jun 1;357(2):453-60. doi: 10.1016/j.bbrc.2007.03.164. Epub 2007 Apr 5.
3
Prediction of protein structural class with Rough Sets.基于粗糙集的蛋白质结构类预测
BMC Bioinformatics. 2006 Jan 14;7:20. doi: 10.1186/1471-2105-7-20.
4
Prediction of protein structure classes with flexible neural tree.使用灵活神经树预测蛋白质结构类别。
Biomed Mater Eng. 2014;24(6):3797-806. doi: 10.3233/BME-141209.
5
CHORAL: a differential geometry approach to the prediction of the cores of protein structures.CHORAL:一种用于预测蛋白质结构核心的微分几何方法。
Bioinformatics. 2005 Oct 1;21(19):3719-25. doi: 10.1093/bioinformatics/bti595. Epub 2005 Jul 26.
6
Improved method for predicting beta-turn using support vector machine.使用支持向量机预测β-转角的改进方法。
Bioinformatics. 2005 May 15;21(10):2370-4. doi: 10.1093/bioinformatics/bti358. Epub 2005 Mar 29.
7
Modeling protein loops with knowledge-based prediction of sequence-structure alignment.基于知识的序列-结构比对预测对蛋白质环进行建模。
Bioinformatics. 2007 Nov 1;23(21):2836-42. doi: 10.1093/bioinformatics/btm456. Epub 2007 Sep 7.
8
RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins.RONN:应用于检测蛋白质天然无序区域的生物基础功能神经网络技术。
Bioinformatics. 2005 Aug 15;21(16):3369-76. doi: 10.1093/bioinformatics/bti534. Epub 2005 Jun 9.
9
Predicting protein structure classes from function predictions.从功能预测中预测蛋白质结构类别。
Bioinformatics. 2004 Mar 22;20(5):770-6. doi: 10.1093/bioinformatics/btg483. Epub 2004 Jan 29.
10
Identification of function-associated loop motifs and application to protein function prediction.功能相关环基序的鉴定及其在蛋白质功能预测中的应用。
Bioinformatics. 2006 Sep 15;22(18):2237-43. doi: 10.1093/bioinformatics/btl382. Epub 2006 Jul 26.

引用本文的文献

1
An interpretable machine learning model for diagnosis of Alzheimer's disease.一种用于阿尔茨海默病诊断的可解释机器学习模型。
PeerJ. 2019 Mar 1;7:e6543. doi: 10.7717/peerj.6543. eCollection 2019.
2
A review of supervised machine learning applied to ageing research.对应用于衰老研究的监督式机器学习的综述。
Biogerontology. 2017 Apr;18(2):171-188. doi: 10.1007/s10522-017-9683-y. Epub 2017 Mar 6.
3
Learning Classification Models of Cognitive Conditions from Subtle Behaviors in the Digital Clock Drawing Test.从数字时钟绘画测试中的细微行为学习认知状况的分类模型。
Mach Learn. 2016 Mar;102(3):393-441. doi: 10.1007/s10994-015-5529-5. Epub 2015 Oct 20.
4
Hierarchical ensemble methods for protein function prediction.用于蛋白质功能预测的分层集成方法。
ISRN Bioinform. 2014 May 4;2014:901419. doi: 10.1155/2014/901419. eCollection 2014.
5
Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients.基于决策树回归方法预测分布容积,使用预测的组织:血浆分配系数。
J Cheminform. 2015 Feb 26;7:6. doi: 10.1186/s13321-015-0054-x. eCollection 2015.
6
An infrastructure to mine molecular descriptors for ligand selection on virtual screening.一种用于在虚拟筛选中挖掘分子描述符以进行配体选择的基础设施。
Biomed Res Int. 2014;2014:325959. doi: 10.1155/2014/325959. Epub 2014 Apr 9.
7
Context-based preprocessing of molecular docking data.基于上下文的分子对接数据预处理。
BMC Genomics. 2013;14 Suppl 6(Suppl 6):S6. doi: 10.1186/1471-2164-14-S6-S6. Epub 2013 Oct 25.
8
Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data.自动设计决策树归纳算法,以适应柔性受体对接数据。
BMC Bioinformatics. 2012 Nov 21;13:310. doi: 10.1186/1471-2105-13-310.
9
A data mining approach for classifying DNA repair genes into ageing-related or non-ageing-related.一种将 DNA 修复基因分类为与衰老相关或非衰老相关的方法。
BMC Genomics. 2011 Jan 12;12:27. doi: 10.1186/1471-2164-12-27.
10
Mining flexible-receptor docking experiments to select promising protein receptor snapshots.挖掘柔性受体对接实验,以选择有前途的蛋白质受体快照。
BMC Genomics. 2010 Dec 22;11 Suppl 5(Suppl 5):S6. doi: 10.1186/1471-2164-11-S5-S6.