• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iSOM-GSN:一种通过自组织映射将多组学数据转化为基因相似网络的综合方法。

iSOM-GSN: an integrative approach for transforming multi-omic data into gene similarity networks via self-organizing maps.

机构信息

School of Computer Science, University of Windsor, Windsor, ON N9B 3P4, Canada.

出版信息

Bioinformatics. 2020 Aug 1;36(15):4248-4254. doi: 10.1093/bioinformatics/btaa500.

DOI:10.1093/bioinformatics/btaa500
PMID:32407457
Abstract

MOTIVATION

One of the main challenges in applying graph convolutional neural networks (CNNs) on gene-interaction data is the lack of understanding of the vector space to which they belong, and also the inherent difficulties involved in representing those interactions on a significantly lower dimension, viz Euclidean spaces. The challenge becomes more prevalent when dealing with various types of heterogeneous data. We introduce a systematic, generalized method, called iSOM-GSN, used to transform 'multi-omic' data with higher dimensions onto a 2D grid. Afterwards, we apply a CNN to predict disease states of various types. Based on the idea of Kohonen's self-organizing map, we generate a 2D grid for each sample for a given set of genes that represent a gene similarity network.

RESULTS

We have tested the model to predict breast and prostate cancer using gene expression, DNA methylation and copy number alteration. Prediction accuracies in the 94-98% range were obtained for tumor stages of breast cancer and calculated Gleason scores of prostate cancer with just 14 input genes for both cases. The scheme not only outputs nearly perfect classification accuracy, but also provides an enhanced scheme for representation learning, visualization, dimensionality reduction and interpretation of multi-omic data.

AVAILABILITY AND IMPLEMENTATION

The source code and sample data are available via a Github project at https://github.com/NaziaFatima/iSOM_GSN.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

将图卷积神经网络 (CNN) 应用于基因相互作用数据时面临的主要挑战之一是缺乏对其所属向量空间的理解,以及在低维(即欧几里得空间)上表示这些相互作用所固有的困难。当处理各种类型的异构数据时,挑战变得更加突出。我们引入了一种系统的、广义的方法,称为 iSOM-GSN,用于将具有更高维度的“多组学”数据转换到 2D 网格上。然后,我们应用 CNN 来预测各种类型的疾病状态。基于 Kohonen 的自组织映射的思想,我们为给定的一组代表基因相似性网络的基因生成每个样本的 2D 网格。

结果

我们已经使用基因表达、DNA 甲基化和拷贝数改变来测试模型,以预测乳腺癌和前列腺癌。对于乳腺癌的肿瘤分期和前列腺癌的计算 Gleason 评分,仅使用两种情况下的 14 个输入基因,就获得了 94%-98%的预测准确率。该方案不仅输出近乎完美的分类准确率,而且还提供了一种增强的表示学习、可视化、降维和多组学数据解释方案。

可用性和实现

源代码和示例数据可通过位于 https://github.com/NaziaFatima/iSOM_GSN 的 Github 项目获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
iSOM-GSN: an integrative approach for transforming multi-omic data into gene similarity networks via self-organizing maps.iSOM-GSN:一种通过自组织映射将多组学数据转化为基因相似网络的综合方法。
Bioinformatics. 2020 Aug 1;36(15):4248-4254. doi: 10.1093/bioinformatics/btaa500.
2
Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network.基于UMAP嵌入和卷积神经网络的多组学数据整合模型
Cancer Inform. 2022 Sep 28;21:11769351221124205. doi: 10.1177/11769351221124205. eCollection 2022.
3
Explaining decisions of graph convolutional neural networks: patient-specific molecular subnetworks responsible for metastasis prediction in breast cancer.解释图卷积神经网络决策:乳腺癌转移预测中与患者特异性相关的分子子网络。
Genome Med. 2021 Mar 11;13(1):42. doi: 10.1186/s13073-021-00845-7.
4
Multi-Omic Graph Diagnosis (MOGDx): a data integration tool to perform classification tasks for heterogeneous diseases.多组学图谱诊断(MOGDx):一种用于对异质疾病进行分类任务的数据集成工具。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae523.
5
A learning-based framework for miRNA-disease association identification using neural networks.基于神经网络的 miRNA-疾病关联识别学习框架。
Bioinformatics. 2019 Nov 1;35(21):4364-4371. doi: 10.1093/bioinformatics/btz254.
6
MODIG: integrating multi-omics and multi-dimensional gene network for cancer driver gene identification based on graph attention network model.MODIG:基于图注意力网络模型的多组学和多维基因网络整合用于癌症驱动基因识别。
Bioinformatics. 2022 Oct 31;38(21):4901-4907. doi: 10.1093/bioinformatics/btac622.
7
Signal metrics analysis of oscillatory patterns in bacterial multi-omic networks.细菌多组学网络中振荡模式的信号指标分析
Bioinformatics. 2021 Jun 16;37(10):1411-1419. doi: 10.1093/bioinformatics/btaa966.
8
An efficient approach based on multi-sources information to predict circRNA-disease associations using deep convolutional neural network.基于多源信息的深度学习卷积神经网络预测 circRNA 疾病关联的有效方法。
Bioinformatics. 2020 Jul 1;36(13):4038-4046. doi: 10.1093/bioinformatics/btz825.
9
Adversarial dense graph convolutional networks for single-cell classification.对抗密集图卷积网络用于单细胞分类。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad043.
10
TGSA: protein-protein association-based twin graph neural networks for drug response prediction with similarity augmentation.TGSA:基于蛋白质-蛋白质关联的孪生图神经网络,用于通过相似性增强进行药物反应预测。
Bioinformatics. 2022 Jan 3;38(2):461-468. doi: 10.1093/bioinformatics/btab650.

引用本文的文献

1
PaCMAP-embedded convolutional neural network for multi-omics data integration.用于多组学数据整合的嵌入PaCMAP的卷积神经网络。
Heliyon. 2023 Dec 5;10(1):e23195. doi: 10.1016/j.heliyon.2023.e23195. eCollection 2024 Jan 15.
2
Multi-omics Pathways Workflow (MOPAW): An Automated Multi-omics Workflow on the Cancer Genomics Cloud.多组学通路工作流程(MOPAW):癌症基因组学云上的自动化多组学工作流程。
Cancer Inform. 2023 Jun 16;22:11769351231180992. doi: 10.1177/11769351231180992. eCollection 2023.
3
Translational Bioinformatics for Human Reproductive Biology Research: Examples, Opportunities and Challenges for a Future Reproductive Medicine.
生殖生物学研究的转化生物信息学:未来生殖医学的实例、机遇与挑战。
Int J Mol Sci. 2022 Dec 20;24(1):4. doi: 10.3390/ijms24010004.
4
Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions.用于基因表达建模的Transformer(T-GEM):一种用于基于基因表达的表型预测的可解释深度学习模型。
Cancers (Basel). 2022 Sep 29;14(19):4763. doi: 10.3390/cancers14194763.
5
Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network.基于UMAP嵌入和卷积神经网络的多组学数据整合模型
Cancer Inform. 2022 Sep 28;21:11769351221124205. doi: 10.1177/11769351221124205. eCollection 2022.
6
Integrated Multi-Omics Maps of Lower-Grade Gliomas.低级别胶质瘤的综合多组学图谱
Cancers (Basel). 2022 Jun 4;14(11):2797. doi: 10.3390/cancers14112797.
7
A roadmap for multi-omics data integration using deep learning.利用深度学习进行多组学数据整合的路线图。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab454.
8
Predictions, Pivots, and a Pandemic: a Review of 2020's Top Translational Bioinformatics Publications.预测、转变和大流行:2020 年顶级转化生物信息学出版物综述。
Yearb Med Inform. 2021 Aug;30(1):219-225. doi: 10.1055/s-0041-1726540. Epub 2021 Sep 3.
9
CancerSiamese: one-shot learning for predicting primary and metastatic tumor types unseen during model training.癌症连体网络:用于预测模型训练期间未见过的原发性和转移性肿瘤类型的一次性学习。
BMC Bioinformatics. 2021 May 12;22(1):244. doi: 10.1186/s12859-021-04157-w.