Suppr超能文献

使用基因表达数据发现生物标志物的机器学习方法

Machine Learning Approaches for Biomarker Discovery Using Gene Expression Data

作者信息

Zhang Xiaokang, Jonassen Inge, Goksøyr Anders

机构信息

Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway

Center for Cancer Biomarkers, Department of Informatics, University of Bergen, Bergen, Norway

Abstract

Biomarkers are of great importance in many fields, such as cancer research, toxicology, diagnosis and treatment of diseases, and to better understand biological response mechanisms to internal or external intervention. High-throughput gene expression profiling technologies, such as DNA microarrays and RNA sequencing, provide large gene expression data sets which enable data-driven biomarker discovery. Traditional statistical tests have been the mainstream for identifying differentially expressed genes as biomarkers. In recent years, machine learning techniques such as feature selection have gained more popularity. Given many options, picking the most appropriate method for a particular data becomes essential. Different evaluation metrics have therefore been proposed. Being evaluated on different aspects, a method’s varied performance across different datasets leads to the idea of integrating multiple methods. Many integration strategies are proposed and have shown great potential. This chapter gives an overview of the current research advances and existing issues in biomarker discovery using machine learning approaches on gene expression data.

摘要

生物标志物在许多领域都非常重要,如癌症研究、毒理学、疾病的诊断和治疗,以及为了更好地理解对内部或外部干预的生物反应机制。高通量基因表达谱技术,如DNA微阵列和RNA测序,提供了大量的基因表达数据集,从而能够进行数据驱动的生物标志物发现。传统统计测试一直是识别差异表达基因作为生物标志物的主流方法。近年来,诸如特征选择等机器学习技术越来越受欢迎。面对众多选择,为特定数据挑选最合适的方法变得至关重要。因此,人们提出了不同的评估指标。由于在不同方面进行评估,一种方法在不同数据集上的表现各异,这就催生了整合多种方法的想法。人们提出了许多整合策略,并已显示出巨大潜力。本章概述了使用机器学习方法处理基因表达数据进行生物标志物发现的当前研究进展和存在的问题。

相似文献

2
3
Robust biomarker screening from gene expression data by stable machine learning-recursive feature elimination methods.
Comput Biol Chem. 2022 Oct;100:107747. doi: 10.1016/j.compbiolchem.2022.107747. Epub 2022 Jul 29.
5
Deciphering the role of lipid metabolism-related genes in Alzheimer's disease: a machine learning approach integrating Traditional Chinese Medicine.
Front Endocrinol (Lausanne). 2024 Oct 23;15:1448119. doi: 10.3389/fendo.2024.1448119. eCollection 2024.
6
Advancements within Modern Machine Learning Methodology: Impacts and Prospects in Biomarker Discovery.
Curr Med Chem. 2021;28(32):6512-6531. doi: 10.2174/0929867328666210208111821.
7
Identifying candidate RNA-seq biomarkers for severity discrimination in chemical injuries: A machine learning and molecular dynamics approach.
Int Immunopharmacol. 2025 Feb 20;148:114090. doi: 10.1016/j.intimp.2025.114090. Epub 2025 Jan 22.
8
Deep learning facilitates multi-data type analysis and predictive biomarker discovery in cancer precision medicine.
Comput Struct Biotechnol J. 2023 Jan 31;21:1372-1382. doi: 10.1016/j.csbj.2023.01.043. eCollection 2023.
9
Integration of RNA-Seq data with heterogeneous microarray data for breast cancer profiling.
BMC Bioinformatics. 2017 Nov 21;18(1):506. doi: 10.1186/s12859-017-1925-0.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验