• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于层次图卷积网络的基因表达数据表示学习

A Hierarchical Graph Convolution Network for Representation Learning of Gene Expression Data.

出版信息

IEEE J Biomed Health Inform. 2021 Aug;25(8):3219-3229. doi: 10.1109/JBHI.2021.3052008. Epub 2021 Aug 5.

DOI:10.1109/JBHI.2021.3052008
PMID:33449889
Abstract

The curse of dimensionality, which is caused by high-dimensionality and low-sample-size, is a major challenge in gene expression data analysis. However, the real situation is even worse: labelling data is laborious and time-consuming, so only a small part of the limited samples will be labelled. Having such few labelled samples further increases the difficulty of training deep learning models. Interpretability is an important requirement in biomedicine. Many existing deep learning methods are trying to provide interpretability, but rarely apply to gene expression data. Recent semi-supervised graph convolution network methods try to address these problems by smoothing the label information over a graph. However, to the best of our knowledge, these methods only utilize graphs in either the feature space or sample space, which restrict their performance. We propose a transductive semi-supervised representation learning method called a hierarchical graph convolution network (HiGCN) to aggregate the information of gene expression data in both feature and sample spaces. HiGCN first utilizes external knowledge to construct a feature graph and a similarity kernel to construct a sample graph. Then, two spatial-based GCNs are used to aggregate information on these graphs. To validate the model's performance, synthetic and real datasets are provided to lend empirical support. Compared with two recent models and three traditional models, HiGCN learns better representations of gene expression data, and these representations improve the performance of downstream tasks, especially when the model is trained on a few labelled samples. Important features can be extracted from our model to provide reliable interpretability.

摘要

高维低样本量导致的维度灾难是基因表达数据分析中的一个主要挑战。然而,实际情况甚至更糟:标记数据既费力又耗时,因此只有一小部分有限的样本会被标记。如此少的标记样本进一步增加了训练深度学习模型的难度。可解释性是生物医学的一个重要要求。许多现有的深度学习方法都在努力提供可解释性,但很少应用于基因表达数据。最近的半监督图卷积网络方法试图通过在图上平滑标签信息来解决这些问题。然而,据我们所知,这些方法仅在特征空间或样本空间中使用图,这限制了它们的性能。我们提出了一种称为层次图卷积网络(HiGCN)的转导半监督表示学习方法,用于聚合基因表达数据在特征和样本空间中的信息。HiGCN 首先利用外部知识构建特征图和相似性核来构建样本图。然后,使用两个基于空间的 GCN 在这些图上聚合信息。为了验证模型的性能,提供了合成和真实数据集以提供经验支持。与两个最近的模型和三个传统模型相比,HiGCN 学习了更好的基因表达数据表示,并且这些表示提高了下游任务的性能,特别是在模型仅用少量标记样本进行训练时。可以从我们的模型中提取重要特征,以提供可靠的可解释性。

相似文献

1
A Hierarchical Graph Convolution Network for Representation Learning of Gene Expression Data.基于层次图卷积网络的基因表达数据表示学习
IEEE J Biomed Health Inform. 2021 Aug;25(8):3219-3229. doi: 10.1109/JBHI.2021.3052008. Epub 2021 Aug 5.
2
A unified deep semi-supervised graph learning scheme based on nodes re-weighting and manifold regularization.一种基于节点重新加权和流形正则化的统一深度半监督图学习方案。
Neural Netw. 2023 Jan;158:188-196. doi: 10.1016/j.neunet.2022.11.017. Epub 2022 Nov 19.
3
A novel candidate disease gene prioritization method using deep graph convolutional networks and semi-supervised learning.一种使用深度图卷积网络和半监督学习的新型候选疾病基因优先级排序方法。
BMC Bioinformatics. 2022 Oct 14;23(1):422. doi: 10.1186/s12859-022-04954-x.
4
Graph Convolution Networks with manifold regularization for semi-supervised learning.图卷积网络与流形正则化的半监督学习。
Neural Netw. 2020 Jul;127:160-167. doi: 10.1016/j.neunet.2020.04.016. Epub 2020 Apr 23.
5
SFWN: A Novel Semi-Supervised Feature Weighted Neural Network for Gene Data Feature Learning and Mining With Graph Modeling.SFWN:一种新型的半监督特征加权神经网络,用于基因数据特征学习和挖掘的图建模。
IEEE J Biomed Health Inform. 2024 Nov;28(11):6405-6416. doi: 10.1109/JBHI.2023.3309842. Epub 2024 Nov 6.
6
Deep semi-supervised learning via dynamic anchor graph embedding in latent space.基于潜在空间动态锚图嵌入的深度半监督学习。
Neural Netw. 2022 Feb;146:350-360. doi: 10.1016/j.neunet.2021.11.026. Epub 2021 Dec 1.
7
Patient Representation Learning From Heterogeneous Data Sources and Knowledge Graphs Using Deep Collective Matrix Factorization: Evaluation Study.使用深度集体矩阵分解从异构数据源和知识图谱中进行患者表示学习:评估研究
JMIR Med Inform. 2022 Jan 20;10(1):e28842. doi: 10.2196/28842.
8
KHGCN: Knowledge-Enhanced Recommendation with Hierarchical Graph Capsule Network.KHGCN:基于层次图胶囊网络的知识增强推荐
Entropy (Basel). 2023 Apr 20;25(4):697. doi: 10.3390/e25040697.
9
L1-norm locally linear representation regularization multi-source adaptation learning.L1 范数局部线性表示正则化多源自适应学习。
Neural Netw. 2015 Sep;69:80-98. doi: 10.1016/j.neunet.2015.01.009. Epub 2015 Feb 25.
10
Multi-Scale Aggregation Graph Neural Networks Based on Feature Similarity for Semi-Supervised Learning.基于特征相似性的多尺度聚合图神经网络用于半监督学习
Entropy (Basel). 2021 Mar 28;23(4):403. doi: 10.3390/e23040403.

引用本文的文献

1
HiRAND: A novel GCN semi-supervised deep learning-based framework for classification and feature selection in drug research and development.HiRAND:一种基于图卷积网络的新型半监督深度学习框架,用于药物研发中的分类和特征选择。
Front Oncol. 2023 Jan 26;13:1047556. doi: 10.3389/fonc.2023.1047556. eCollection 2023.
2
Extraction and Analysis of Foot Bone Shape Features Based on Deep Learning.基于深度学习的足部骨骼形态特征提取与分析。
Comput Math Methods Med. 2022 Aug 10;2022:2372160. doi: 10.1155/2022/2372160. eCollection 2022.