• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算与系统生物学中基于决策树方法的监督学习

Supervised learning with decision tree-based methods in computational and systems biology.

作者信息

Geurts Pierre, Irrthum Alexandre, Wehenkel Louis

机构信息

Department of EE and CS & GIGA-Research, University of Liège, Belgium.

出版信息

Mol Biosyst. 2009 Dec;5(12):1593-605. doi: 10.1039/b907946g. Epub 2009 Oct 5.

DOI:10.1039/b907946g
PMID:20023720
Abstract

At the intersection between artificial intelligence and statistics, supervised learning allows algorithms to automatically build predictive models from just observations of a system. During the last twenty years, supervised learning has been a tool of choice to analyze the always increasing and complexifying data generated in the context of molecular biology, with successful applications in genome annotation, function prediction, or biomarker discovery. Among supervised learning methods, decision tree-based methods stand out as non parametric methods that have the unique feature of combining interpretability, efficiency, and, when used in ensembles of trees, excellent accuracy. The goal of this paper is to provide an accessible and comprehensive introduction to this class of methods. The first part of the review is devoted to an intuitive but complete description of decision tree-based methods and a discussion of their strengths and limitations with respect to other supervised learning methods. The second part of the review provides a survey of their applications in the context of computational and systems biology.

摘要

在人工智能与统计学的交叉领域,监督学习使算法能够仅根据对一个系统的观测自动构建预测模型。在过去二十年中,监督学习一直是分析分子生物学背景下不断增加且日益复杂的数据的首选工具,在基因组注释、功能预测或生物标志物发现等方面都有成功应用。在监督学习方法中,基于决策树的方法作为非参数方法脱颖而出,具有将可解释性、效率以及在树的集成中使用时的出色准确性相结合的独特特性。本文的目的是对这类方法进行通俗易懂且全面的介绍。综述的第一部分致力于对基于决策树的方法进行直观而完整的描述,并讨论它们相对于其他监督学习方法的优缺点。综述的第二部分对它们在计算生物学和系统生物学背景下的应用进行了概述。

相似文献

1
Supervised learning with decision tree-based methods in computational and systems biology.计算与系统生物学中基于决策树方法的监督学习
Mol Biosyst. 2009 Dec;5(12):1593-605. doi: 10.1039/b907946g. Epub 2009 Oct 5.
2
Accuracy-based learning classifier systems: models, analysis and applications to classification tasks.基于准确性的学习分类器系统:模型、分析及其在分类任务中的应用。
Evol Comput. 2003 Fall;11(3):209-38. doi: 10.1162/106365603322365289.
3
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.
4
Neural networks.神经网络
Methods Mol Biol. 2010;609:197-222. doi: 10.1007/978-1-60327-241-4_12.
5
Protein function prediction with high-throughput data.利用高通量数据进行蛋白质功能预测。
Amino Acids. 2008 Oct;35(3):517-30. doi: 10.1007/s00726-008-0077-y. Epub 2008 Apr 22.
6
EvDTree: structure-dependent substitution profiles based on decision tree classification of 3D environments.EvDTree:基于三维环境决策树分类的结构相关替代概况
BMC Bioinformatics. 2005 Jan 10;6:4. doi: 10.1186/1471-2105-6-4.
7
A greedy algorithm for supervised discretization.一种用于监督离散化的贪心算法。
J Biomed Inform. 2004 Aug;37(4):285-92. doi: 10.1016/j.jbi.2004.07.006.
8
Efficient prediction methods for selecting effective siRNA sequences.有效 siRNA 序列选择的高效预测方法。
Comput Biol Med. 2010 Feb;40(2):149-58. doi: 10.1016/j.compbiomed.2009.11.011.
9
On the use of multi-objective evolutionary algorithms for the induction of fuzzy classification rule systems.关于多目标进化算法在模糊分类规则系统归纳中的应用
Biosystems. 2005 Aug;81(2):101-12. doi: 10.1016/j.biosystems.2005.02.003.
10
Prediction of acetylcholinesterase inhibitors and characterization of correlative molecular descriptors by machine learning methods.通过机器学习方法预测乙酰胆碱酯酶抑制剂并对相关分子描述符进行特征化。
Eur J Med Chem. 2010 Mar;45(3):1167-72. doi: 10.1016/j.ejmech.2009.12.038. Epub 2009 Dec 28.

引用本文的文献

1
Chemical imaging for biological systems: techniques, AI-driven processing, and applications.生物系统的化学成像:技术、人工智能驱动的处理及应用
J Mater Chem B. 2025 Jun 18;13(24):6916-6948. doi: 10.1039/d4tb02876g.
2
Leveraging mixed-effects regression trees for the analysis of high-dimensional longitudinal data to identify the low and high-risk subgroups: simulation study with application to genetic study.利用混合效应回归树分析高维纵向数据以识别低风险和高风险亚组:应用于基因研究的模拟研究
BioData Min. 2025 Mar 19;18(1):22. doi: 10.1186/s13040-025-00437-w.
3
Predicting the risk of invasive fungal infections in ICU sepsis population: the AMI risk assessment tool.
预测重症监护病房脓毒症患者侵袭性真菌感染的风险:AMI风险评估工具
Infection. 2025 Feb 3. doi: 10.1007/s15010-024-02465-w.
4
Explainable Machine-Learning Models to Predict Weekly Risk of Hyperglycemia, Hypoglycemia, and Glycemic Variability in Patients With Type 1 Diabetes Based on Continuous Glucose Monitoring.基于持续葡萄糖监测的可解释机器学习模型预测1型糖尿病患者高血糖、低血糖和血糖变异性的每周风险
J Diabetes Sci Technol. 2024 Oct 8:19322968241286907. doi: 10.1177/19322968241286907.
5
Constructing a Hospital Department Development-Level Assessment Model: Machine Learning and Expert Consultation Approach in Complex Hospital Data Environments.构建医院科室发展水平评估模型:复杂医院数据环境下的机器学习与专家咨询方法。
JMIR Form Res. 2024 Sep 4;8:e54638. doi: 10.2196/54638.
6
Predictive model and risk analysis for coronary heart disease in people living with HIV using machine learning.基于机器学习的 HIV 感染者冠心病预测模型和风险分析。
BMC Med Inform Decis Mak. 2024 Apr 25;24(1):110. doi: 10.1186/s12911-024-02511-5.
7
Bridging systems biology and tissue engineering: Unleashing the full potential of complex 3D tissue models of disease.连接系统生物学与组织工程:释放疾病复杂三维组织模型的全部潜力。
Biophys Rev (Melville). 2024 Apr 10;5(2):021301. doi: 10.1063/5.0179125. eCollection 2024 Jun.
8
Machine Learning Isotropic Values of Radical Polymers.自由基聚合物的机器学习各向同性值。
J Chem Theory Comput. 2024 Mar 26;20(6):2592-2604. doi: 10.1021/acs.jctc.3c01252. Epub 2024 Mar 8.
9
A Machine Learning-Based Image Segmentation Method to Quantify In Vitro Osteoclast Culture Endpoints.基于机器学习的体外破骨细胞培养终点定量图像分割方法。
Calcif Tissue Int. 2023 Oct;113(4):437-448. doi: 10.1007/s00223-023-01121-z. Epub 2023 Aug 11.
10
Learning Relationships Between Chemical and Physical Stability for Peptide Drug Development.学习化学和物理稳定性之间的关系,以促进肽类药物的开发。
Pharm Res. 2023 Mar;40(3):701-710. doi: 10.1007/s11095-023-03475-3. Epub 2023 Feb 16.