一种结合图神经网络和基因关系的新型生物标志物选择方法，应用于微阵列数据。

A novel biomarker selection method combining graph neural network and gene relationships applied to microarray data.

机构信息

School of Computer Science and Engineering, Northeastern University, Shenyang, China.

Key Laboratory of Intelligent Computing in Medical Image (MIIC), Northeastern University, Ministry of Education, Shenyang, China.

出版信息

BMC Bioinformatics. 2022 Jul 26;23(1):303. doi: 10.1186/s12859-022-04848-y.

DOI:10.1186/s12859-022-04848-y

PMID:35883022

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9327232/

Abstract

BACKGROUND

The discovery of critical biomarkers is significant for clinical diagnosis, drug research and development. Researchers usually obtain biomarkers from microarray data, which comes from the dimensional curse. Feature selection in machine learning is usually used to solve this problem. However, most methods do not fully consider feature dependence, especially the real pathway relationship of genes.

RESULTS

Experimental results show that the proposed method is superior to classical algorithms and advanced methods in feature number and accuracy, and the selected features have more significance.

METHOD

This paper proposes a feature selection method based on a graph neural network. The proposed method uses the actual dependencies between features and the Pearson correlation coefficient to construct graph-structured data. The information dissemination and aggregation operations based on graph neural network are applied to fuse node information on graph structured data. The redundant features are clustered by the spectral clustering method. Then, the feature ranking aggregation model using eight feature evaluation methods acts on each clustering sub-cluster for different feature selection.

CONCLUSION

The proposed method can effectively remove redundant features. The algorithm's output has high stability and classification accuracy, which can potentially select potential biomarkers.

摘要

背景

关键生物标志物的发现对于临床诊断、药物研发具有重要意义。研究人员通常从微阵列数据中获取生物标志物，这些数据来源于维度诅咒。机器学习中的特征选择通常用于解决这个问题。然而，大多数方法并没有充分考虑特征之间的依赖性，尤其是基因的实际通路关系。

结果

实验结果表明，所提出的方法在特征数量和准确性方面优于经典算法和先进方法，并且选择的特征具有更高的意义。

方法

本文提出了一种基于图神经网络的特征选择方法。所提出的方法使用特征之间的实际依赖性和皮尔逊相关系数来构建图结构数据。基于图神经网络的信息传播和聚合操作应用于融合图结构数据上的节点信息。通过谱聚类方法对冗余特征进行聚类。然后，使用八种特征评估方法的特征排序聚合模型对每个聚类子聚类进行不同的特征选择。

结论

所提出的方法可以有效地去除冗余特征。该算法的输出具有较高的稳定性和分类准确性，有可能选择潜在的生物标志物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f83f/9327232/657b68013704/12859_2022_4848_Fig1_HTML.jpg

相似文献

A novel biomarker selection method combining graph neural network and gene relationships applied to microarray data.

BMC Bioinformatics. 2022 Jul 26;23(1):303. doi: 10.1186/s12859-022-04848-y.

ILRC: a hybrid biomarker discovery algorithm based on improved L1 regularization and clustering in microarray data.

BMC Bioinformatics. 2021 Oct 22;22(1):514. doi: 10.1186/s12859-021-04443-7.

Determination of biomarkers from microarray data using graph neural network and spectral clustering.

Sci Rep. 2021 Dec 13;11(1):23828. doi: 10.1038/s41598-021-03316-6.

Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer.

Artif Intell Med. 2024 May;151:102840. doi: 10.1016/j.artmed.2024.102840. Epub 2024 Mar 11.

Dual regularized subspace learning using adaptive graph learning and rank constraint: Unsupervised feature selection on gene expression microarray datasets.

Comput Biol Med. 2023 Dec;167:107659. doi: 10.1016/j.compbiomed.2023.107659. Epub 2023 Nov 4.

Joint feature selection and optimal bipartite graph learning for subspace clustering.

Neural Netw. 2023 Jul;164:408-418. doi: 10.1016/j.neunet.2023.04.044. Epub 2023 May 5.

Multi-view projected clustering with graph learning.

Neural Netw. 2020 Jun;126:335-346. doi: 10.1016/j.neunet.2020.03.020. Epub 2020 Apr 1.

A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization.

BMC Bioinformatics. 2023 Apr 4;24(1):130. doi: 10.1186/s12859-023-05247-7.

Spectral embedding network for attributed graph clustering.

Neural Netw. 2021 Oct;142:388-396. doi: 10.1016/j.neunet.2021.05.026. Epub 2021 May 27.

Attributed graph clustering with multi-task embedding learning.

Neural Netw. 2022 Aug;152:224-233. doi: 10.1016/j.neunet.2022.04.018. Epub 2022 Apr 20.

引用本文的文献

Navigating the microarray landscape: a comprehensive review of feature selection techniques and their applications.

Front Big Data. 2025 Jul 10;8:1624507. doi: 10.3389/fdata.2025.1624507. eCollection 2025.

Identifying Cancer Stage-Related Biomarkers for Lung Adenocarcinoma by Integrating Both Node and Edge Features.

Genes (Basel). 2025 Feb 24;16(3):261. doi: 10.3390/genes16030261.

Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification.

Biomolecules. 2023 Sep 15;13(9):1391. doi: 10.3390/biom13091391.

Applications of Neural Networks in Biomedical Data Analysis.

Biomedicines. 2022 Jun 21;10(7):1469. doi: 10.3390/biomedicines10071469.

本文引用的文献

Determination of biomarkers from microarray data using graph neural network and spectral clustering.

Sci Rep. 2021 Dec 13;11(1):23828. doi: 10.1038/s41598-021-03316-6.

Transcriptome profiling by combined machine learning and statistical R analysis identifies TMEM236 as a potential novel diagnostic biomarker for colorectal cancer.

Sci Rep. 2021 Jul 12;11(1):14304. doi: 10.1038/s41598-021-92692-0.

Machine Learning Based Computational Gene Selection Models: A Survey, Performance Evaluation, Open Issues, and Future Research Directions.

Front Genet. 2020 Dec 10;11:603808. doi: 10.3389/fgene.2020.603808. eCollection 2020.

Wrapper-based gene selection with Markov blanket.

Comput Biol Med. 2017 Feb 1;81:11-23. doi: 10.1016/j.compbiomed.2016.12.002. Epub 2016 Dec 5.

Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm.

BMC Bioinformatics. 2014 Feb 20;15:49. doi: 10.1186/1471-2105-15-49.

TIGRESS: Trustful Inference of Gene REgulation using Stability Selection.

BMC Syst Biol. 2012 Nov 22;6:145. doi: 10.1186/1752-0509-6-145.

The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function.

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W214-20. doi: 10.1093/nar/gkq537.

A modified T-test feature selection method and its application on the HapMap genotype data.

Genomics Proteomics Bioinformatics. 2007 Dec;5(3-4):242-9. doi: 10.1016/S1672-0229(08)60011-X.

Ant system: optimization by a colony of cooperating agents.

IEEE Trans Syst Man Cybern B Cybern. 1996;26(1):29-41. doi: 10.1109/3477.484436.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种结合图神经网络和基因关系的新型生物标志物选择方法，应用于微阵列数据。

A novel biomarker selection method combining graph neural network and gene relationships applied to microarray data.

机构信息

出版信息

BACKGROUND

RESULTS

METHOD

CONCLUSION

背景

结果

方法

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献