• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CSGDN:用于预测作物基因-表型关联的对比符号图扩散网络。

CSGDN: contrastive signed graph diffusion network for predicting crop gene-phenotype associations.

作者信息

Pan Yiru, Ji Xingyu, You Jiaqi, Li Lu, Liu Zhenping, Zhang Xianlong, Zhang Zeyu, Wang Maojun

机构信息

National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, 430070 Hubei, China.

出版信息

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf062.

DOI:10.1093/bib/bbaf062
PMID:39976387
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11840565/
Abstract

Positive and negative association prediction between gene and phenotype helps to illustrate the underlying mechanism of complex traits in organisms. The transcription and regulation activity of specific genes will be adjusted accordingly in different cell types, developmental timepoints, and physiological states. There are the following two problems in obtaining the positive/negative associations between gene and phenotype: (1) high-throughput DNA/RNA sequencing and phenotyping are expensive and time-consuming due to the need to process large sample sizes; (2) experiments introduce both random and systematic errors, and, meanwhile, calculations or predictions using software or models may produce noise. To address these two issues, we propose a Contrastive Signed Graph Diffusion Network, CSGDN, to learn robust node representations with fewer training samples to achieve higher link prediction accuracy. CSGDN uses a signed graph diffusion method to uncover the underlying regulatory associations between genes and phenotypes. Then, stochastic perturbation strategies are used to create two views for both original and diffusive graphs. Lastly, a multiview contrastive learning paradigm loss is designed to unify the node presentations learned from the two views to resist interference and reduce noise. We perform experiments to validate the performance of CSGDN in three crop datasets: Gossypium hirsutum, Brassica napus, and Triticum turgidum. The results show that the proposed model outperforms state-of-the-art methods by up to 9. 28% AUC for the prediction of link sign in the G. hirsutum dataset. The source code of our model is available at https://github.com/Erican-Ji/CSGDN.

摘要

基因与表型之间正负关联预测有助于阐明生物体复杂性状的潜在机制。特定基因的转录和调控活性会在不同细胞类型、发育时间点和生理状态下相应地进行调整。在获取基因与表型之间的正负关联时存在以下两个问题:(1)由于需要处理大量样本,高通量DNA/RNA测序和表型分析既昂贵又耗时;(2)实验会引入随机误差和系统误差,同时,使用软件或模型进行计算或预测可能会产生噪声。为了解决这两个问题,我们提出了一种对比符号图扩散网络(CSGDN),以使用较少的训练样本学习稳健的节点表示,从而实现更高的链接预测准确性。CSGDN使用符号图扩散方法来揭示基因与表型之间潜在的调控关联。然后,采用随机扰动策略为原始图和扩散图创建两个视图。最后,设计了一种多视图对比学习范式损失,以统一从两个视图中学到的节点表示,从而抵抗干扰并减少噪声。我们进行实验以验证CSGDN在三个作物数据集(陆地棉、甘蓝型油菜和硬粒小麦)中的性能。结果表明,对于陆地棉数据集中链接符号的预测,所提出的模型比现有方法的AUC最高高出9.28%。我们模型的源代码可在https://github.com/Erican-Ji/CSGDN获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/57115ee8ff5d/bbaf062f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/36f2c3367c08/bbaf062f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/3e45fc07dc1f/bbaf062f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/d9a2dabce198/bbaf062f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/e962ce3774a4/bbaf062f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/57115ee8ff5d/bbaf062f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/36f2c3367c08/bbaf062f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/3e45fc07dc1f/bbaf062f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/d9a2dabce198/bbaf062f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/e962ce3774a4/bbaf062f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee2/11840565/57115ee8ff5d/bbaf062f5.jpg

相似文献

1
CSGDN: contrastive signed graph diffusion network for predicting crop gene-phenotype associations.CSGDN:用于预测作物基因-表型关联的对比符号图扩散网络。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf062.
2
Global-local aware Heterogeneous Graph Contrastive Learning for multifaceted association prediction in miRNA-gene-disease networks.基于全局-局部感知的异质图对比学习在 miRNA-基因-疾病网络中的多方面关联预测
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae443.
3
SGCLDGA: unveiling drug-gene associations through simple graph contrastive learning.SGCLDGA:通过简单的图对比学习揭示药物-基因关联。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae231.
4
SGCLMD: Signed graph-based contrastive learning model for predicting somatic mutation-drug association.SGCLMD:用于预测体细胞突变与药物关联的基于符号图的对比学习模型。
Comput Biol Med. 2025 May;190:110067. doi: 10.1016/j.compbiomed.2025.110067. Epub 2025 Mar 26.
5
GCLink: a graph contrastive link prediction framework for gene regulatory network inference.GCLink:一种用于基因调控网络推断的图对比链接预测框架。
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf074.
6
Signed graph embedding via multi-order neighborhood feature fusion and contrastive learning.基于多阶邻域特征融合与对比学习的带符号图嵌入
Neural Netw. 2025 Feb;182:106897. doi: 10.1016/j.neunet.2024.106897. Epub 2024 Nov 17.
7
scRGCL: a cell type annotation method for single-cell RNA-seq data using residual graph convolutional neural network with contrastive learning.scRGCL:一种使用带有对比学习的残差图卷积神经网络对单细胞RNA测序数据进行细胞类型注释的方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae662.
8
Multi-task prediction-based graph contrastive learning for inferring the relationship among lncRNAs, miRNAs and diseases.基于多任务预测的图对比学习推断 lncRNAs、miRNAs 和疾病之间的关系。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad276.
9
scMGATGRN: a multiview graph attention network-based method for inferring gene regulatory networks from single-cell transcriptomic data.scMGATGRN:一种基于多视图图注意力网络的方法,用于从单细胞转录组数据推断基因调控网络。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae526.
10
Graph Convolutional Network and Contrastive Learning Small Nucleolar RNA (snoRNA) Disease Associations (GCLSDA): Predicting snoRNA-Disease Associations via Graph Convolutional Network and Contrastive Learning.图卷积网络与对比学习小核仁 RNA(snoRNA)疾病关联(GCLSDA):通过图卷积网络和对比学习预测 snoRNA-疾病关联。
Int J Mol Sci. 2023 Sep 22;24(19):14429. doi: 10.3390/ijms241914429.

引用本文的文献

1
Multi-omics analysis constructs a novel neuroendocrine prostate cancer classifier and classification system.多组学分析构建了一种新型的神经内分泌前列腺癌分类器和分类系统。
Sci Rep. 2025 Apr 22;15(1):13901. doi: 10.1038/s41598-025-96683-3.
2
Cellular mechanical memory: a potential tool for mesenchymal stem cell-based therapy.细胞机械记忆:基于间充质干细胞治疗的潜在工具。
Stem Cell Res Ther. 2025 Mar 31;16(1):159. doi: 10.1186/s13287-025-04249-x.

本文引用的文献

1
Structural variation reshapes population gene expression and trait variation in 2,105 Brassica napus accessions.结构变异重塑了 2105 个甘蓝型油菜品系的群体基因表达和性状变异。
Nat Genet. 2024 Nov;56(11):2538-2550. doi: 10.1038/s41588-024-01957-7. Epub 2024 Nov 5.
2
Integrating GWAS, RNA-Seq and functional analysis revealed that BnaA02.SE mediates silique elongation by affecting cell proliferation and expansion in Brassica napus.GWAS、RNA-Seq 和功能分析的整合表明,BnaA02.SE 通过影响油菜细胞的增殖和扩张来介导角果伸长。
Plant Biotechnol J. 2024 Oct;22(10):2907-2920. doi: 10.1111/pbi.14413. Epub 2024 Jun 20.
3
A deep learning framework for predicting disease-gene associations with functional modules and graph augmentation.
一种基于功能模块和图增强的深度学习框架,用于预测疾病-基因关联。
BMC Bioinformatics. 2024 Jun 14;25(1):214. doi: 10.1186/s12859-024-05841-3.
4
From genetic associations to genes: methods, applications, and challenges.从基因关联到基因:方法、应用及挑战。
Trends Genet. 2024 Aug;40(8):642-667. doi: 10.1016/j.tig.2024.04.008. Epub 2024 May 11.
5
TabHLH27 orchestrates root growth and drought tolerance to enhance water use efficiency in wheat.TabHLH27 调控根系生长和耐旱性,提高小麦水分利用效率。
J Integr Plant Biol. 2024 Jul;66(7):1295-1312. doi: 10.1111/jipb.13670. Epub 2024 May 2.
6
Biolinguistic graph fusion model for circRNA-miRNA association prediction.生物语言学图融合模型用于 circRNA-miRNA 关联预测。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae058.
7
Combined GWAS and eGWAS reveals the genetic basis underlying drought tolerance in emmer wheat (Triticum turgidum L.).联合全基因组关联分析和电子 GWAS 揭示了硬粒小麦(Triticum turgidum L.)抗旱性的遗传基础。
New Phytol. 2024 Jun;242(5):2115-2131. doi: 10.1111/nph.19589. Epub 2024 Feb 15.
8
Systematic identification of wheat spike developmental regulators by integrated multi-omics, transcriptional network, GWAS, and genetic analyses.通过整合多组学、转录网络、GWAS 和遗传分析系统鉴定小麦穗发育调控因子。
Mol Plant. 2024 Mar 4;17(3):438-459. doi: 10.1016/j.molp.2024.01.010. Epub 2024 Feb 2.
9
Regulatory controls of duplicated gene expression during fiber development in allotetraploid cotton.在异源四倍体棉花纤维发育过程中复制基因表达的调控控制。
Nat Genet. 2023 Nov;55(11):1987-1997. doi: 10.1038/s41588-023-01530-8. Epub 2023 Oct 16.
10
A high-resolution genotype-phenotype map identifies the TaSPL17 controlling grain number and size in wheat.高分辨率基因型-表型图谱鉴定控制小麦粒数和大小的 TaSPL17。
Genome Biol. 2023 Aug 28;24(1):196. doi: 10.1186/s13059-023-03044-2.