• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于使用L1000数据的图神经网络的基因表达推断

Gene expression inference based on graph neural networks using L1000 data.

作者信息

Kim Tae Hyun, Kim Harim, Hwang Hyunjin, Kang Shinwhan, Shin Kijung, Baek Inwha

机构信息

Department of Regulatory Science, Graduate School, Kyung Hee University, 26 Kyungheedae-ro, Dongdaemun District, Seoul 02447, South Korea.

College of Pharmacy, Kyung Hee University, 26 Kyungheedae-ro, Dongdaemun District, Seoul 02447, South Korea.

出版信息

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf273.

DOI:10.1093/bib/bbaf273
PMID:40505083
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12161499/
Abstract

Gene expression profiles can serve as proxies for cellular states and provide valuable insights into the discovery of functional connections across diverse cellular contexts. A cost-effective method called L1000 has been developed to generate gene expression profiles for over a million different conditions. Since gene expression inference of this method relies on linear regression, nonlinear regression methods, including deep learning models, have been assessed. However, these approaches process gene expression data as a vector structure, motivating us to investigate whether nonlinear models based on a graph structure are more effective in capturing the relationships between genes underlying gene expression profiles. In this work, we show that the graph neural network (GNN) model with genes as nodes outperforms both linear and nonlinear non-GNN models in predicting gene expression values and expression-based gene rankings. Importantly, our GNN model requires ~10-fold less information than other models to achieve comparable performance. A strategic selection of input features, or incorporating an organ feature, from which the gene expression data are derived, further improves gene expression inference performance of the GNN model. Additionally, we evaluate the cross-platform generality of gene expression inference. Our study demonstrates that the transformation of RNA expression data into a graph structure effectively captures nonlinear correlations between genes, thereby enabling highly accurate and efficient prediction of gene expression profiles.

摘要

基因表达谱可作为细胞状态的代理,并为发现不同细胞环境中的功能联系提供有价值的见解。一种名为L1000的经济高效方法已被开发出来,用于生成超过一百万个不同条件下的基因表达谱。由于该方法的基因表达推断依赖于线性回归,因此包括深度学习模型在内的非线性回归方法已被评估。然而,这些方法将基因表达数据作为向量结构进行处理,这促使我们研究基于图结构的非线性模型在捕捉基因表达谱背后基因之间的关系方面是否更有效。在这项工作中,我们表明以基因为节点的图神经网络(GNN)模型在预测基因表达值和基于表达的基因排名方面优于线性和非线性非GNN模型。重要的是,我们的GNN模型在实现可比性能时所需的信息比其他模型少约10倍。从基因表达数据所源自的器官特征中进行输入特征的策略性选择或纳入该器官特征,可进一步提高GNN模型的基因表达推断性能。此外,我们评估了基因表达推断的跨平台通用性。我们的研究表明,将RNA表达数据转换为图结构可有效捕捉基因之间的非线性相关性,从而实现对基因表达谱的高度准确和高效预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df11/12161499/94ec28d21917/bbaf273f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df11/12161499/54eb91a50e0e/bbaf273f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df11/12161499/94ec28d21917/bbaf273f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df11/12161499/54eb91a50e0e/bbaf273f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df11/12161499/94ec28d21917/bbaf273f2.jpg

相似文献

1
Gene expression inference based on graph neural networks using L1000 data.基于使用L1000数据的图神经网络的基因表达推断
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf273.
2
An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.一种用于准确预测蛋白质-蛋白质相互作用的端到端知识图谱融合图神经网络
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2518-2530. doi: 10.1109/TCBB.2024.3486216. Epub 2024 Dec 10.
3
DGSIST: Clustering spatial transcriptome data based on deep graph structure Infomax.DGSIST:基于深度图结构 Infomax 的聚类空间转录组数据。
Methods. 2024 Nov;231:226-236. doi: 10.1016/j.ymeth.2024.10.002. Epub 2024 Oct 15.
4
MIGGRI: A multi-instance graph neural network model for inferring gene regulatory networks for Drosophila from spatial expression images.MIGGRI:一种用于从果蝇空间表达图像推断基因调控网络的多实例图神经网络模型。
PLoS Comput Biol. 2023 Nov 8;19(11):e1011623. doi: 10.1371/journal.pcbi.1011623. eCollection 2023 Nov.
5
Amogel: a multi-omics classification framework using associative graph neural networks with prior knowledge for biomarker identification.Amogel:一种使用具有先验知识的关联图神经网络进行生物标志物识别的多组学分类框架。
BMC Bioinformatics. 2025 Mar 28;26(1):94. doi: 10.1186/s12859-025-06111-6.
6
Hierarchical Graph Transformer With Contrastive Learning for Gene Regulatory Network Inference.用于基因调控网络推理的具有对比学习的分层图变换器
IEEE J Biomed Health Inform. 2025 Jan;29(1):690-699. doi: 10.1109/JBHI.2024.3476490. Epub 2025 Jan 7.
7
Network-based drug sensitivity prediction.基于网络的药物敏感性预测。
BMC Med Genomics. 2020 Dec 28;13(Suppl 11):193. doi: 10.1186/s12920-020-00829-3.
8
LR-GNN: a graph neural network based on link representation for predicting molecular associations.LR-GNN:一种基于链接表示的图神经网络,用于预测分子关联。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab513.
9
Augmented Graph Neural Network with hierarchical global-based residual connections.基于层次全局残差连接的增强图神经网络。
Neural Netw. 2022 Jun;150:149-166. doi: 10.1016/j.neunet.2022.03.008. Epub 2022 Mar 10.
10
MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data.MICRAT:一种使用时间序列基因表达数据推断基因调控网络的新算法。
BMC Syst Biol. 2018 Dec 14;12(Suppl 7):115. doi: 10.1186/s12918-018-0635-1.

本文引用的文献

1
Reinventing gene expression connectivity through regulatory and spatial structural empowerment via principal node aggregation graph neural network.通过主节点聚合图神经网络,从调控和空间结构赋权的角度重新定义基因表达连通性。
Nucleic Acids Res. 2024 Jul 22;52(13):e60. doi: 10.1093/nar/gkae514.
2
Machine learning and related approaches in transcriptomics.转录组学中的机器学习及相关方法。
Biochem Biophys Res Commun. 2024 Sep 10;724:150225. doi: 10.1016/j.bbrc.2024.150225. Epub 2024 Jun 4.
3
Transforming L1000 profiles to RNA-seq-like profiles with deep learning.
利用深度学习将 L1000 数据转化为 RNA-seq 数据。
BMC Bioinformatics. 2022 Sep 13;23(1):374. doi: 10.1186/s12859-022-04895-5.
4
A nonlinear correlation measure with applications to gene expression data.一种具有应用于基因表达数据的非线性相关度量方法。
PLoS One. 2022 Jun 21;17(6):e0270270. doi: 10.1371/journal.pone.0270270. eCollection 2022.
5
Deep neural network prediction of genome-wide transcriptome signatures - beyond the Black-box.深度神经网络预测全基因组转录组特征——超越黑盒。
NPJ Syst Biol Appl. 2022 Feb 23;8(1):9. doi: 10.1038/s41540-022-00218-9.
6
Explaining decisions of graph convolutional neural networks: patient-specific molecular subnetworks responsible for metastasis prediction in breast cancer.解释图卷积神经网络决策:乳腺癌转移预测中与患者特异性相关的分子子网络。
Genome Med. 2021 Mar 11;13(1):42. doi: 10.1186/s13073-021-00845-7.
7
Drug-induced cell viability prediction from LINCS-L1000 through WRFEN-XGBoost algorithm.通过 WRfen-XGBoost 算法从 LINCS-L1000 预测药物诱导的细胞活力。
BMC Bioinformatics. 2021 Jan 6;22(1):13. doi: 10.1186/s12859-020-03949-w.
8
Deep Learning on High-Throughput Transcriptomics to Predict Drug-Induced Liver Injury.基于高通量转录组学的深度学习预测药物性肝损伤
Front Bioeng Biotechnol. 2020 Nov 27;8:562677. doi: 10.3389/fbioe.2020.562677. eCollection 2020.
9
Biological network analysis with deep learning.基于深度学习的生物网络分析。
Brief Bioinform. 2021 Mar 22;22(2):1515-1530. doi: 10.1093/bib/bbaa257.
10
Network inference in systems biology: recent developments, challenges, and applications.系统生物学中的网络推断:最新进展、挑战与应用。
Curr Opin Biotechnol. 2020 Jun;63:89-98. doi: 10.1016/j.copbio.2019.12.002. Epub 2020 Jan 9.