• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于生物医学文献的无监督事件图表示和相似性学习。

Unsupervised Event Graph Representation and Similarity Learning on Biomedical Literature.

机构信息

Department of Computer Science and Engineering (DISI), University of Bologna, 40126 Bologna, Italy.

Independent Researcher, 48018 Faenza, Italy.

出版信息

Sensors (Basel). 2021 Dec 21;22(1):3. doi: 10.3390/s22010003.

DOI:10.3390/s22010003
PMID:35009544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8747118/
Abstract

The automatic extraction of biomedical events from the scientific literature has drawn keen interest in the last several years, recognizing complex and semantically rich graphical interactions otherwise buried in texts. However, very few works revolve around learning embeddings or similarity metrics for event graphs. This gap leaves biological relations unlinked and prevents the application of machine learning techniques to promote discoveries. Taking advantage of recent deep graph kernel solutions and pre-trained language models, we propose Deep Divergence Event Graph Kernels (DDEGK), an unsupervised inductive method to map events into low-dimensional vectors, preserving their structural and semantic similarities. Unlike most other systems, DDEGK operates at a graph level and does not require task-specific labels, feature engineering, or known correspondences between nodes. To this end, our solution compares events against a small set of anchor ones, trains cross-graph attention networks for drawing pairwise alignments (bolstering interpretability), and employs transformer-based models to encode continuous attributes. Extensive experiments have been done on nine biomedical datasets. We show that our learned event representations can be effectively employed in tasks such as graph classification, clustering, and visualization, also facilitating downstream semantic textual similarity. Empirical results demonstrate that DDEGK significantly outperforms other state-of-the-art methods.

摘要

近年来,从科学文献中自动提取生物医学事件引起了极大的兴趣,因为这些事件中包含着复杂且语义丰富的图形交互,而这些信息通常隐藏在文本中。然而,很少有工作致力于学习事件图的嵌入或相似性度量。这一差距导致生物关系无法关联,也阻碍了机器学习技术在促进发现方面的应用。我们利用最近的深度图核解决方案和预训练的语言模型,提出了深度分歧事件图核(DDEGK),这是一种无监督的归纳方法,可将事件映射到低维向量中,同时保留它们的结构和语义相似性。与大多数其他系统不同,DDEGK 在图级别上运行,不需要特定于任务的标签、特征工程或节点之间的已知对应关系。为此,我们的解决方案将事件与一小部分锚定事件进行比较,为绘制成对对齐(增强可解释性)训练交叉图注意力网络,并采用基于转换器的模型对连续属性进行编码。我们在九个生物医学数据集上进行了广泛的实验。结果表明,我们学习到的事件表示可以有效地用于图分类、聚类和可视化等任务,同时也有助于下游语义文本相似度。实验结果表明,DDEGK 明显优于其他最先进的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/b77977eb6198/sensors-22-00003-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/2bed3b805b9c/sensors-22-00003-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/2f6217f63955/sensors-22-00003-g0A1a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/bd56db075a12/sensors-22-00003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/51dc88f18391/sensors-22-00003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/0d26c8982b22/sensors-22-00003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/86d449ec2313/sensors-22-00003-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/6c3a3cd2f02e/sensors-22-00003-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/4728293ba1cb/sensors-22-00003-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/90d4f5d83eff/sensors-22-00003-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/acbb1ddab783/sensors-22-00003-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/4d16ea446353/sensors-22-00003-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/b77977eb6198/sensors-22-00003-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/2bed3b805b9c/sensors-22-00003-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/2f6217f63955/sensors-22-00003-g0A1a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/bd56db075a12/sensors-22-00003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/51dc88f18391/sensors-22-00003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/0d26c8982b22/sensors-22-00003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/86d449ec2313/sensors-22-00003-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/6c3a3cd2f02e/sensors-22-00003-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/4728293ba1cb/sensors-22-00003-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/90d4f5d83eff/sensors-22-00003-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/acbb1ddab783/sensors-22-00003-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/4d16ea446353/sensors-22-00003-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ab/8747118/b77977eb6198/sensors-22-00003-g011.jpg

相似文献

1
Unsupervised Event Graph Representation and Similarity Learning on Biomedical Literature.基于生物医学文献的无监督事件图表示和相似性学习。
Sensors (Basel). 2021 Dec 21;22(1):3. doi: 10.3390/s22010003.
2
Effective type label-based synergistic representation learning for biomedical event trigger detection.用于生物医学事件触发检测的基于有效类型标签的协同表示学习
BMC Bioinformatics. 2024 Jul 31;25(1):251. doi: 10.1186/s12859-024-05851-1.
3
Explaining protein-protein interactions with knowledge graph-based semantic similarity.用基于知识图的语义相似度解释蛋白质-蛋白质相互作用。
Comput Biol Med. 2024 Mar;170:108076. doi: 10.1016/j.compbiomed.2024.108076. Epub 2024 Feb 1.
4
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
5
edge2vec: Representation learning using edge semantics for biomedical knowledge discovery.边向量模型:利用边语义的表示学习方法进行生物医学知识发现。
BMC Bioinformatics. 2019 Jun 10;20(1):306. doi: 10.1186/s12859-019-2914-2.
6
Multi-Ontology Refined Embeddings (MORE): A hybrid multi-ontology and corpus-based semantic representation model for biomedical concepts.多本体精炼嵌入模型(MORE):一种基于混合多本体和语料库的生物医学概念语义表示模型。
J Biomed Inform. 2020 Nov;111:103581. doi: 10.1016/j.jbi.2020.103581. Epub 2020 Oct 1.
7
Learning Multi-Types of Neighbor Node Attributes and Semantics by Heterogeneous Graph Transformer and Multi-View Attention for Drug-Related Side-Effect Prediction.通过异构图 Transformer 和多视图注意学习多类型邻居节点属性和语义,用于药物相关副作用预测。
Molecules. 2023 Sep 9;28(18):6544. doi: 10.3390/molecules28186544.
8
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
9
Survey on graph embeddings and their applications to machine learning problems on graphs.关于图嵌入及其在图上机器学习问题中的应用的综述。
PeerJ Comput Sci. 2021 Feb 4;7:e357. doi: 10.7717/peerj-cs.357. eCollection 2021.
10
Exploiting graph kernels for high performance biomedical relation extraction.利用图核进行高性能生物医学关系提取。
J Biomed Semantics. 2018 Jan 30;9(1):7. doi: 10.1186/s13326-017-0168-3.

引用本文的文献

1
Efficient Memory-Enhanced Transformer for Long-Document Summarization in Low-Resource Regimes.高效记忆增强型Transformer 用于低资源环境下的长文档摘要
Sensors (Basel). 2023 Mar 28;23(7):3542. doi: 10.3390/s23073542.

本文引用的文献

1
Efficient Self-Supervised Metric Information Retrieval: A Bibliography Based Method Applied to COVID Literature.高效的自监督度量信息检索:一种应用于 COVID 文献的基于文献的方法。
Sensors (Basel). 2021 Sep 26;21(19):6430. doi: 10.3390/s21196430.
2
Text-Graph Enhanced Knowledge Graph Representation Learning.文本-图增强的知识图谱表示学习
Front Artif Intell. 2021 Aug 17;4:697856. doi: 10.3389/frai.2021.697856. eCollection 2021.
3
Diagnostic Prediction with Sequence-of-sets Representation Learning for Clinical Events.用于临床事件的集序列表示学习的诊断预测
Artif Intell Med Conf Artif Intell Med (2005-). 2020 Aug;12299:348-358. doi: 10.1007/978-3-030-59137-3_31. Epub 2020 Sep 26.
4
A Survey on Knowledge Graphs: Representation, Acquisition, and Applications.知识图谱综述:表示、获取与应用
IEEE Trans Neural Netw Learn Syst. 2022 Feb;33(2):494-514. doi: 10.1109/TNNLS.2021.3070843. Epub 2022 Feb 3.
5
Survey on graph embeddings and their applications to machine learning problems on graphs.关于图嵌入及其在图上机器学习问题中的应用的综述。
PeerJ Comput Sci. 2021 Feb 4;7:e357. doi: 10.7717/peerj-cs.357. eCollection 2021.
6
DeepEventMine: end-to-end neural nested event extraction from biomedical texts.DeepEventMine:从生物医学文本中进行端到端的神经嵌套事件提取。
Bioinformatics. 2020 Dec 8;36(19):4910-4917. doi: 10.1093/bioinformatics/btaa540.
7
A Comprehensive Survey on Graph Neural Networks.图神经网络综述。
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.
8
Orthogonal Deep Neural Networks.正交深度神经网络。
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1352-1368. doi: 10.1109/TPAMI.2019.2948352. Epub 2021 Mar 4.
9
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
10
Similarity corpus on microbial transcriptional regulation.微生物转录调控相似性语料库。
J Biomed Semantics. 2019 May 22;10(1):8. doi: 10.1186/s13326-019-0200-x.