• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图卷积网络的高维小样本数据特征选择。

Graph convolutional network-based feature selection for high-dimensional and low-sample size data.

机构信息

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, United States.

Center for Artificial Intelligence and Modeling, The Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Champaign, IL 61820, United States.

出版信息

Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad135.

DOI:10.1093/bioinformatics/btad135
PMID:37084264
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10126323/
Abstract

MOTIVATION

Feature selection is a powerful dimension reduction technique which selects a subset of relevant features for model construction. Numerous feature selection methods have been proposed, but most of them fail under the high-dimensional and low-sample size (HDLSS) setting due to the challenge of overfitting.

RESULTS

We present a deep learning-based method-GRAph Convolutional nEtwork feature Selector (GRACES)-to select important features for HDLSS data. GRACES exploits latent relations between samples with various overfitting-reducing techniques to iteratively find a set of optimal features which gives rise to the greatest decreases in the optimization loss. We demonstrate that GRACES significantly outperforms other feature selection methods on both synthetic and real-world datasets.

AVAILABILITY AND IMPLEMENTATION

The source code is publicly available at https://github.com/canc1993/graces.

摘要

动机

特征选择是一种强大的降维技术,它可以为模型构建选择一组相关特征。已经提出了许多特征选择方法,但由于过拟合的挑战,它们大多数在高维低样本量(HDLSS)环境下失败。

结果

我们提出了一种基于深度学习的方法-GRAPH Convolutional nEtwork feature Selector(GRACES)-用于选择 HDLSS 数据的重要特征。GRACES 利用各种过拟合减少技术来挖掘样本之间的潜在关系,以迭代地找到一组最优特征,从而导致优化损失的最大减少。我们证明了 GRACES 在合成数据集和真实数据集上都显著优于其他特征选择方法。

可用性和实现

源代码可在 https://github.com/canc1993/graces 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/81739e4ebfa7/btad135f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/6bafcfc07706/btad135f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/67c51f91f200/btad135f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/ea63852a89ee/btad135f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/024c834659c4/btad135f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/81739e4ebfa7/btad135f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/6bafcfc07706/btad135f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/67c51f91f200/btad135f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/ea63852a89ee/btad135f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/024c834659c4/btad135f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f904/10126323/81739e4ebfa7/btad135f5.jpg

相似文献

1
Graph convolutional network-based feature selection for high-dimensional and low-sample size data.基于图卷积网络的高维小样本数据特征选择。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad135.
2
Unsupervised domain selective graph convolutional network for preoperative prediction of lymph node metastasis in gastric cancer.无监督域选择图卷积网络用于胃癌术前淋巴结转移预测。
Med Image Anal. 2022 Jul;79:102467. doi: 10.1016/j.media.2022.102467. Epub 2022 Apr 28.
3
Network-based drug sensitivity prediction.基于网络的药物敏感性预测。
BMC Med Genomics. 2020 Dec 28;13(Suppl 11):193. doi: 10.1186/s12920-020-00829-3.
4
forgeNet: a graph deep neural network model using tree-based ensemble classifiers for feature graph construction.forgeNet:一种基于图的深度神经网络模型,使用基于树的集成分类器进行特征图构建。
Bioinformatics. 2020 Jun 1;36(11):3507-3515. doi: 10.1093/bioinformatics/btaa164.
5
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
6
Adversarial dense graph convolutional networks for single-cell classification.对抗密集图卷积网络用于单细胞分类。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad043.
7
A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data.基于基因表达数据的疾病预后分类和特征选择的图嵌入深度前馈网络。
Bioinformatics. 2018 Nov 1;34(21):3727-3737. doi: 10.1093/bioinformatics/bty429.
8
SELECTOR: Heterogeneous graph network with convolutional masked autoencoder for multimodal robust prediction of cancer survival.基于卷积掩蔽自动编码器的异质图网络用于癌症生存的多模态稳健预测。
Comput Biol Med. 2024 Apr;172:108301. doi: 10.1016/j.compbiomed.2024.108301. Epub 2024 Mar 15.
9
Scaling tree-based automated machine learning to biomedical big data with a feature set selector.使用特征集选择器将基于树的自动化机器学习扩展到生物医学大数据。
Bioinformatics. 2020 Jan 1;36(1):250-256. doi: 10.1093/bioinformatics/btz470.
10
Applying stability selection to consistently estimate sparse principal components in high-dimensional molecular data.应用稳定性选择方法在高维分子数据中一致估计稀疏主成分。
Bioinformatics. 2015 Aug 15;31(16):2683-90. doi: 10.1093/bioinformatics/btv197. Epub 2015 Apr 10.

引用本文的文献

1
BIM-Ken: Identifying Disease-Related miRNA Biomarkers Based on Knowledge-Enhanced Bio-Network.BIM-Ken:基于知识增强生物网络识别疾病相关的微小RNA生物标志物
Genes (Basel). 2025 Jul 28;16(8):902. doi: 10.3390/genes16080902.
2
Graph feature selection for enhancing radiomic stability and reproducibility across multiple institutions in head and neck cancer.用于提高头颈癌多机构间影像组学稳定性和可重复性的图形特征选择
Sci Rep. 2025 Jul 31;15(1):27995. doi: 10.1038/s41598-025-12161-w.
3
An effective heuristic for developing hybrid feature selection in high dimensional and low sample size datasets.

本文引用的文献

1
Noise-trained deep neural networks effectively predict human vision and its neural responses to challenging images.经噪声训练的深度神经网络能有效预测人类视觉及其对挑战性图像的神经反应。
PLoS Biol. 2021 Dec 9;19(12):e3001418. doi: 10.1371/journal.pbio.3001418. eCollection 2021 Dec.
2
Deep Feature Selection: Theory and Application to Identify Enhancers and Promoters.深度特征选择:识别增强子和启动子的理论与应用
J Comput Biol. 2016 May;23(5):322-36. doi: 10.1089/cmb.2015.0189. Epub 2016 Jan 22.
3
A global reference for human genetic variation.
一种用于在高维低样本量数据集中开发混合特征选择的有效启发式方法。
BMC Bioinformatics. 2024 Dec 26;25(1):390. doi: 10.1186/s12859-024-06017-9.
4
Spatially resolved subcellular protein-protein interactomics in drug-perturbed lung-cancer cultures and tissues.药物扰动的肺癌培养物和组织中的空间分辨亚细胞蛋白质-蛋白质相互作用组学
Nat Biomed Eng. 2024 Oct 30. doi: 10.1038/s41551-024-01271-x.
5
Biobjective gradient descent for feature selection on high dimension, low sample size data.高维小样本数据的特征选择的双目标梯度下降。
PLoS One. 2024 Jul 18;19(7):e0305654. doi: 10.1371/journal.pone.0305654. eCollection 2024.
6
Cauchy non-convex sparse feature selection method for the high-dimensional small-sample problem in motor imagery EEG decoding.用于运动想象脑电信号解码中高维小样本问题的柯西非凸稀疏特征选择方法
Front Neurosci. 2023 Nov 3;17:1292724. doi: 10.3389/fnins.2023.1292724. eCollection 2023.
人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
4
Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.通过深度学习预测 DNA 和 RNA 结合蛋白的序列特异性。
Nat Biotechnol. 2015 Aug;33(8):831-8. doi: 10.1038/nbt.3300. Epub 2015 Jul 27.
5
RNA Sequencing and Analysis.RNA测序与分析。
Cold Spring Harb Protoc. 2015 Apr 13;2015(11):951-69. doi: 10.1101/pdb.top084970.
6
Similarity network fusion for aggregating data types on a genomic scale.基于基因组尺度聚合数据类型的相似网络融合。
Nat Methods. 2014 Mar;11(3):333-7. doi: 10.1038/nmeth.2810. Epub 2014 Jan 26.
7
High-dimensional feature selection by feature-wise kernelized Lasso.基于特征核 Lasso 的高维特征选择。
Neural Comput. 2014 Jan;26(1):185-207. doi: 10.1162/NECO_a_00537. Epub 2013 Oct 8.
8
Evaluating feature selection strategies for high dimensional, small sample size datasets.评估高维小样本数据集的特征选择策略。
Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:949-52. doi: 10.1109/IEMBS.2011.6090214.
9
Bayesian feature and model selection for Gaussian mixture models.高斯混合模型的贝叶斯特征与模型选择
IEEE Trans Pattern Anal Mach Intell. 2006 Jun;28(6):1013-8. doi: 10.1109/TPAMI.2006.111.
10
Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.基于互信息的特征选择:最大依赖、最大相关和最小冗余准则。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38. doi: 10.1109/TPAMI.2005.159.