• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

图神经网络在提取候选基因和生物学关联中的应用系统综述

A Systematic Review of the Application of Graph Neural Networks to Extract Candidate Genes and Biological Associations.

作者信息

Saxena Ankita, Nixon Bridgette, Boyd Amelia, Evans James, Faraone Stephen V

机构信息

Department of Neuroscience and Physiology, State University of New York-Norton College of Medicine at Upstate Medical University, New York, USA.

Department of Psychiatry and Behavioral Sciences, State University of new York-Norton College of Medicine at Upstate Medical University, New York, USA.

出版信息

Am J Med Genet B Neuropsychiatr Genet. 2025 Sep;198(6):3-18. doi: 10.1002/ajmg.b.33031. Epub 2025 May 2.

DOI:10.1002/ajmg.b.33031
PMID:40317893
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12335376/
Abstract

The development of high throughput technologies has resulted in the collection of large quantities of genomic and transcriptomic data. However, identifying disease-associated genes or networks from these data has remained an ongoing challenge. In recent years, graph neural networks (GNNs) have emerged as a promising analytical tool, but it is not well understood which characteristics of these models result in improved performance. We conducted a systematic search and review of publications that used GNNs to identify disease-associated biological interactions. Information was extracted about model characteristics and performance with the goal of examining the relationship between these factors and performance. Data leakage was found in 31% of these models. For node level tasks, univariate positive associations were identified between model accuracy and use of hyper parameter optimization, data leakage via hyperparameter optimization, test set size, and total dataset size. Among graph level tasks, an increase in AUC was identified in association with testing method and a decrease with optimization reporting. Data leakage may pose an issue for GNN-based approaches; the adoption of best practice guidelines and consistent reporting of model design would be beneficial for future studies.

摘要

高通量技术的发展使得大量基因组和转录组数据得以收集。然而,从这些数据中识别疾病相关基因或网络仍然是一个持续存在的挑战。近年来,图神经网络(GNN)已成为一种很有前景的分析工具,但对于这些模型的哪些特征能带来性能提升,人们还了解得不够透彻。我们对使用GNN识别疾病相关生物相互作用的出版物进行了系统的检索和综述。提取了有关模型特征和性能的信息,目的是研究这些因素与性能之间的关系。在这些模型中,31%存在数据泄露问题。对于节点级任务,在模型准确性与超参数优化的使用、通过超参数优化导致的数据泄露、测试集大小以及总数据集大小之间发现了单变量正相关关系。在图级任务中,发现AUC的增加与测试方法有关,而与优化报告有关的则有所下降。数据泄露可能给基于GNN的方法带来问题;采用最佳实践指南并一致报告模型设计将对未来的研究有益。

相似文献

1
A Systematic Review of the Application of Graph Neural Networks to Extract Candidate Genes and Biological Associations.图神经网络在提取候选基因和生物学关联中的应用系统综述
Am J Med Genet B Neuropsychiatr Genet. 2025 Sep;198(6):3-18. doi: 10.1002/ajmg.b.33031. Epub 2025 May 2.
2
Distilling knowledge from graph neural networks trained on cell graphs to non-neural student models.从在细胞图上训练的图神经网络中提取知识,用于非神经学生模型。
Sci Rep. 2025 Aug 10;15(1):29274. doi: 10.1038/s41598-025-13697-7.
3
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
4
Antibody tests for identification of current and past infection with SARS-CoV-2.抗体检测用于鉴定 SARS-CoV-2 的现症感染和既往感染。
Cochrane Database Syst Rev. 2022 Nov 17;11(11):CD013652. doi: 10.1002/14651858.CD013652.pub2.
5
What is the value of routinely testing full blood count, electrolytes and urea, and pulmonary function tests before elective surgery in patients with no apparent clinical indication and in subgroups of patients with common comorbidities: a systematic review of the clinical and cost-effective literature.在没有明显临床指征的患者和常见合并症患者亚组中,在择期手术前常规检测全血细胞计数、电解质和尿素以及肺功能测试的价值:对临床和成本效益文献的系统评价。
Health Technol Assess. 2012 Dec;16(50):i-xvi, 1-159. doi: 10.3310/hta16500.
6
Short-Term Memory Impairment短期记忆障碍
7
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
8
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
10
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.

本文引用的文献

1
Integration of multi-source gene interaction networks and omics data with graph attention networks to identify novel disease genes.将多源基因相互作用网络和组学数据与图注意力网络相结合以识别新型疾病基因。
Bioinformatics. 2025 Apr 23. doi: 10.1093/bioinformatics/btaf181.
2
GREMI: An Explainable Multi-Omics Integration Framework for Enhanced Disease Prediction and Module Identification.GREMI:一种用于增强疾病预测和模块识别的可解释多组学集成框架。
IEEE J Biomed Health Inform. 2024 Nov;28(11):6983-6996. doi: 10.1109/JBHI.2024.3439713. Epub 2024 Nov 6.
3
Predicting disease-gene associations through self-supervised mutual infomax graph convolution network.通过自监督互信息最大化图卷积网络预测疾病-基因关联。
Comput Biol Med. 2024 Mar;170:108048. doi: 10.1016/j.compbiomed.2024.108048. Epub 2024 Jan 30.
4
Cord blood lipid correlation network profiles are associated with subsequent attention-deficit/hyperactivity disorder and autism spectrum disorder symptoms at 2 years: a prospective birth cohort study.脐带血脂质相关网络特征与 2 岁时注意力缺陷/多动障碍和自闭症谱系障碍症状相关:一项前瞻性出生队列研究。
EBioMedicine. 2024 Feb;100:104949. doi: 10.1016/j.ebiom.2023.104949. Epub 2024 Jan 9.
5
A primer on the use of machine learning to distil knowledge from data in biological psychiatry.机器学习在生物精神病学中从数据中提取知识的基础教程。
Mol Psychiatry. 2024 Feb;29(2):387-401. doi: 10.1038/s41380-023-02334-2. Epub 2024 Jan 4.
6
Genomic Machine Learning Meta-regression: Insights on Associations of Study Features With Reported Model Performance.基因组机器学习元回归:研究特征与报告模型性能关联的见解。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Jan-Feb;21(1):169-177. doi: 10.1109/TCBB.2023.3343808. Epub 2024 Feb 5.
7
Predicting cell-type specific disease genes of diabetes with the biological network.利用生物网络预测糖尿病的细胞类型特异性疾病基因。
Comput Biol Med. 2024 Feb;169:107849. doi: 10.1016/j.compbiomed.2023.107849. Epub 2023 Dec 13.
8
RMDGCN: Prediction of RNA methylation and disease associations based on graph convolutional network with attention mechanism.RMDGCN:基于图卷积网络和注意力机制的 RNA 甲基化和疾病关联预测。
PLoS Comput Biol. 2023 Dec 6;19(12):e1011677. doi: 10.1371/journal.pcbi.1011677. eCollection 2023 Dec.
9
The Reactome Pathway Knowledgebase 2024.Reactome 通路知识库 2024.
Nucleic Acids Res. 2024 Jan 5;52(D1):D672-D678. doi: 10.1093/nar/gkad1025.
10
Biological informed graph neural network for tumor mutation burden prediction and immunotherapy-related pathway analysis in gastric cancer.用于胃癌肿瘤突变负荷预测和免疫治疗相关通路分析的生物信息图神经网络
Comput Struct Biotechnol J. 2023 Sep 22;21:4540-4551. doi: 10.1016/j.csbj.2023.09.021. eCollection 2023.