Suppr
超能文献

机器学习：生物信息学中不可或缺的工具。

Machine learning: an indispensable tool in bioinformatics.

作者信息

Inza Iñaki, Calvo Borja, Armañanzas Rubén, Bengoetxea Endika, Larrañaga Pedro, Lozano José A

机构信息

Intelligent Systems Group, Donostia - San Sebastián, Basque Country, Spain.

出版信息

Methods Mol Biol. 2010;593:25-48. doi: 10.1007/978-1-60327-194-3_2.

DOI:10.1007/978-1-60327-194-3_2

PMID:19957143

Abstract

The increase in the number and complexity of biological databases has raised the need for modern and powerful data analysis tools and techniques. In order to fulfill these requirements, the machine learning discipline has become an everyday tool in bio-laboratories. The use of machine learning techniques has been extended to a wide spectrum of bioinformatics applications. It is broadly used to investigate the underlying mechanisms and interactions between biological molecules in many diseases, and it is an essential tool in any biomarker discovery process. In this chapter, we provide a basic taxonomy of machine learning algorithms, and the characteristics of main data preprocessing, supervised classification, and clustering techniques are shown. Feature selection, classifier evaluation, and two supervised classification topics that have a deep impact on current bioinformatics are presented. We make the interested reader aware of a set of popular web resources, open source software tools, and benchmarking data repositories that are frequently used by the machine learning community.

摘要

生物数据库数量的增加及其复杂性的提升，使得对现代且强大的数据分析工具和技术的需求不断增长。为了满足这些需求，机器学习学科已成为生物实验室中的日常工具。机器学习技术的应用已扩展到广泛的生物信息学应用领域。它被广泛用于研究许多疾病中生物分子之间的潜在机制和相互作用，并且是任何生物标志物发现过程中的重要工具。在本章中，我们提供了机器学习算法的基本分类，并展示了主要数据预处理、监督分类和聚类技术的特点。介绍了特征选择、分类器评估以及对当前生物信息学有深远影响的两个监督分类主题。我们让感兴趣的读者了解机器学习社区经常使用的一组流行网络资源、开源软件工具和基准测试数据存储库。

相似文献

Machine learning: an indispensable tool in bioinformatics.

Methods Mol Biol. 2010;593:25-48. doi: 10.1007/978-1-60327-194-3_2.

A review of feature selection techniques in bioinformatics.

Bioinformatics. 2007 Oct 1;23(19):2507-17. doi: 10.1093/bioinformatics/btm344. Epub 2007 Aug 24.

Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics.

Prog Brain Res. 2006;158:83-108. doi: 10.1016/S0079-6123(06)58004-5.

Machine learning in bioinformatics: a brief survey and recommendations for practitioners.

Comput Biol Med. 2006 Oct;36(10):1104-25. doi: 10.1016/j.compbiomed.2005.09.002. Epub 2005 Oct 13.

Data mining in bioinformatics using Weka.

Bioinformatics. 2004 Oct 12;20(15):2479-81. doi: 10.1093/bioinformatics/bth261. Epub 2004 Apr 8.

Open-source tools for data mining.

Clin Lab Med. 2008 Mar;28(1):37-54, vi. doi: 10.1016/j.cll.2007.10.002.

Support vector machine classification on the web.

Bioinformatics. 2004 Mar 1;20(4):586-7. doi: 10.1093/bioinformatics/btg461. Epub 2004 Jan 22.

Knowledge discovery via machine learning for neurodegenerative disease researchers.

Methods Mol Biol. 2009;569:173-96. doi: 10.1007/978-1-59745-524-4_9.

Biowep: a workflow enactment portal for bioinformatics applications.

BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S19. doi: 10.1186/1471-2105-8-S1-S19.

A primer on gene expression and microarrays for machine learning researchers.

J Biomed Inform. 2004 Aug;37(4):293-303. doi: 10.1016/j.jbi.2004.07.002.

引用本文的文献

Chemical imaging for biological systems: techniques, AI-driven processing, and applications.

J Mater Chem B. 2025 Jun 18;13(24):6916-6948. doi: 10.1039/d4tb02876g.

Machine learning identifies key metabolic reactions in bacterial growth on different carbon sources.

Mol Syst Biol. 2024 Mar;20(3):170-186. doi: 10.1038/s44320-024-00017-w. Epub 2024 Jan 30.

Statistical Analysis and Tokenization of Epitopes to Construct Artificial Neoepitope Libraries.

ACS Synth Biol. 2023 Oct 20;12(10):2812-2818. doi: 10.1021/acssynbio.3c00201. Epub 2023 Sep 13.

Classification of porcine reproductive and respiratory syndrome clinical impact in Ontario sow herds using machine learning approaches.

Front Vet Sci. 2023 Jun 7;10:1175569. doi: 10.3389/fvets.2023.1175569. eCollection 2023.

A User's Guide to Machine Learning for Polymeric Biomaterials.

ACS Polym Au. 2022 Nov 17;3(2):141-157. doi: 10.1021/acspolymersau.2c00037. eCollection 2023 Apr 12.

KLF9 and EPYC acting as feature genes for osteoarthritis and their association with immune infiltration.

J Orthop Surg Res. 2022 Jul 28;17(1):365. doi: 10.1186/s13018-022-03247-6.

A comparative analysis of machine learning classifiers for predicting protein-binding nucleotides in RNA sequences.

Comput Struct Biotechnol J. 2022 Jun 17;20:3195-3207. doi: 10.1016/j.csbj.2022.06.036. eCollection 2022.

The risk of ischemic stroke and hemorrhagic stroke in Chinese adults with low-density lipoprotein cholesterol concentrations < 70 mg/dL.

BMC Med. 2021 Jun 16;19(1):142. doi: 10.1186/s12916-021-02014-4.

Application of machine learning to predict reduction in total PANSS score and enrich enrollment in schizophrenia clinical trials.

Clin Transl Sci. 2021 Sep;14(5):1864-1874. doi: 10.1111/cts.13035. Epub 2021 May 3.

A Priori Estimation of the Narrow-Band UVB Phototherapy Outcome for Moderate-to-Severe Psoriasis Based on the Patients' Questionnaire and Blood Tests Using Random Forest Classifier.

Clin Cosmet Investig Dermatol. 2021 Mar 18;14:253-259. doi: 10.2147/CCID.S296604. eCollection 2021.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

机器学习：生物信息学中不可或缺的工具。

Machine learning: an indispensable tool in bioinformatics.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译