GeneWalk identifies relevant gene functions for a biological context using network representation learning.

Affiliations

Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, 02115, USA.

Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, 02115, USA.

Publication information

Genome Biol. 2021 Feb 2;22(1):55. doi: 10.1186/s13059-021-02264-8.

DOI: 10.1186/s13059-021-02264-8
PMID: 33526072
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7852222/

Abstract

A bottleneck in high-throughput functional genomics experiments is identifying the most important genes and their relevant functions from a list of gene hits. Gene Ontology (GO) enrichment methods provide insight at the gene set level. Here, we introduce GeneWalk (github.com/churchmanlab/genewalk) that identifies individual genes and their relevant functions critical for the experimental setting under examination. After the automatic assembly of an experiment-specific gene regulatory network, GeneWalk uses representation learning to quantify the similarity between vector representations of each gene and its GO annotations, yielding annotation significance scores that reflect the experimental context. By performing gene- and condition-specific functional analysis, GeneWalk converts a list of genes into data-driven hypotheses.
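
The similarity step described in the abstract can be illustrated with a small, self-contained sketch. It is not the GeneWalk implementation: the toy network, the gene and GO names, and all function names and parameters below are hypothetical, and co-occurrence counts from uniform random walks stand in for a learned embedding. The abstract does not say how significance is derived from the similarities, so the sketch stops at the similarity value for each gene-GO annotation pair.

```python
import math
import random
from collections import Counter, defaultdict

# Hypothetical toy network: gene-gene regulatory links and gene-GO annotation links.
EDGES = [
    ("GENE_A", "GENE_B"),
    ("GENE_B", "GENE_C"),
    ("GENE_C", "GENE_D"),
    ("GENE_A", "GO:signaling"),
    ("GENE_B", "GO:signaling"),
    ("GENE_C", "GO:signaling"),
    ("GENE_A", "GO:transport"),
    ("GENE_D", "GO:transport"),
]


def build_graph(edges):
    """Return an undirected adjacency map: node -> sorted list of neighbours."""
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)
    return {node: sorted(adj) for node, adj in neighbours.items()}


def random_walks(graph, walks_per_node=50, walk_length=8, seed=0):
    """Sample fixed-length uniform random walks starting from every node."""
    rng = random.Random(seed)
    walks = []
    for start in sorted(graph):
        for _ in range(walks_per_node):
            walk = [start]
            while len(walk) < walk_length:
                walk.append(rng.choice(graph[walk[-1]]))
            walks.append(walk)
    return walks


def cooccurrence_vectors(walks, window=2):
    """Count how often each node appears within `window` steps of another.

    Assumption: these sparse count vectors replace the learned vector
    representations mentioned in the abstract.
    """
    vectors = defaultdict(Counter)
    for walk in walks:
        for i, node in enumerate(walk):
            lo, hi = max(0, i - window), min(len(walk), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[node][walk[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def annotation_similarities(edges):
    """Score every gene-GO annotation edge by the similarity of its endpoints."""
    graph = build_graph(edges)
    vectors = cooccurrence_vectors(random_walks(graph))
    return {
        (a, b): cosine(vectors[a], vectors[b])
        for a, b in edges
        if b.startswith("GO:")
    }


if __name__ == "__main__":
    for (gene, term), score in sorted(annotation_similarities(EDGES).items()):
        print(f"{gene}  {term}  {score:.3f}")
```

Running the script prints one similarity per annotation pair. A fixed seed keeps the walks reproducible, which makes it easier to see how a change to the toy network shifts the scores.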
