基于图的生物医学文献综述的聚类团分析。

Clustering cliques for graph-based summarization of the biomedical research literature.

机构信息

Department of Medical Informatics, China Medical University, Shenyang, Liaoning 110001, China.

出版信息

BMC Bioinformatics. 2013 Jun 7;14:182. doi: 10.1186/1471-2105-14-182.

DOI:10.1186/1471-2105-14-182

PMID:23742159

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3682874/

Abstract

BACKGROUND

Graph-based notions are increasingly used in biomedical data mining and knowledge discovery tasks. In this paper, we present a clique-clustering method to automatically summarize graphs of semantic predications produced from PubMed citations (titles and abstracts).

RESULTS

SemRep is used to extract semantic predications from the citations returned by a PubMed search. Cliques were identified from frequently occurring predications with highly connected arguments filtered by degree centrality. Themes contained in the summary were identified with a hierarchical clustering algorithm based on common arguments shared among cliques. The validity of the clusters in the summaries produced was compared to the Silhouette-generated baseline for cohesion, separation and overall validity. The theme labels were also compared to a reference standard produced with major MeSH headings.

CONCLUSIONS

For 11 topics in the testing data set, the overall validity of clusters from the system summary was 10% better than the baseline (43% versus 33%). While compared to the reference standard from MeSH headings, the results for recall, precision and F-score were 0.64, 0.65, and 0.65 respectively.

摘要

背景

基于图的概念在生物医学数据挖掘和知识发现任务中越来越多地被使用。在本文中，我们提出了一种团簇聚类方法，用于自动总结从 PubMed 引文中提取的语义谓词的图（标题和摘要）。

结果

SemRep 用于从 PubMed 搜索返回的引文中提取语义谓词。通过基于节点度的中心度过滤，识别出具有高度连接参数的频繁出现的谓词的团簇。基于团簇之间共享的常见参数，使用层次聚类算法来识别摘要中的主题。对生成的摘要中的聚类的有效性进行了比较，以确定凝聚、分离和整体有效性的 Silhouette 生成基线。主题标签还与使用主要 MeSH 标题生成的参考标准进行了比较。

结论

在测试数据集的 11 个主题中，系统摘要中的聚类的整体有效性比基线提高了 10%（43%比 33%）。与 MeSH 标题的参考标准相比，召回率、精度和 F 分数分别为 0.64、0.65 和 0.65。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1cb/3682874/1c0aa2411eae/1471-2105-14-182-1.jpg

相似文献

Clustering cliques for graph-based summarization of the biomedical research literature.基于图的生物医学文献综述的聚类团分析。

BMC Bioinformatics. 2013 Jun 7;14:182. doi: 10.1186/1471-2105-14-182.

Context-driven automatic subgraph creation for literature-based discovery.用于基于文献的发现的上下文驱动自动子图创建

J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.

Graph-based biomedical text summarization: An itemset mining and sentence clustering approach.基于图的生物医学文本摘要：一种基于项集挖掘和句子聚类的方法。

J Biomed Inform. 2018 Aug;84:42-58. doi: 10.1016/j.jbi.2018.06.005. Epub 2018 Jun 15.

Degree centrality for semantic abstraction summarization of therapeutic studies.治疗研究语义抽象总结的度中心性。

J Biomed Inform. 2011 Oct;44(5):830-8. doi: 10.1016/j.jbi.2011.05.001. Epub 2011 May 8.

A graph-based recovery and decomposition of Swanson's hypothesis using semantic predications.基于图的 Swanson 假说恢复和分解，使用语义谓词。

J Biomed Inform. 2013 Apr;46(2):238-51. doi: 10.1016/j.jbi.2012.09.004. Epub 2012 Sep 28.

A Knowledge Graph of Combined Drug Therapies Using Semantic Predications From Biomedical Literature: Algorithm Development.利用生物医学文献中的语义谓词构建的联合药物治疗知识图谱：算法开发

JMIR Med Inform. 2020 Apr 28;8(4):e18323. doi: 10.2196/18323.

Enhancing the coverage of SemRep using a relation classification approach.利用关系分类方法增强 SemRep 的覆盖范围。

J Biomed Inform. 2024 Jul;155:104658. doi: 10.1016/j.jbi.2024.104658. Epub 2024 May 21.

SemMedDB: a PubMed-scale repository of biomedical semantic predications.SemMedDB：一个基于 PubMed 规模的生物医学语义断言知识库。

Bioinformatics. 2012 Dec 1;28(23):3158-60. doi: 10.1093/bioinformatics/bts591. Epub 2012 Oct 8.

Publishing Biomedical Predication Repository About MeSH Co-Occurrences in MEDLINE.发布关于医学主题词（MeSH）在医学在线数据库（MEDLINE）中共现情况的生物医学预测知识库。

Stud Health Technol Inform. 2016;228:765-9.

Knowledge Extraction from MEDLINE by Combining Clustering with Natural Language Processing.通过聚类与自然语言处理相结合从医学在线数据库中提取知识

AMIA Annu Symp Proc. 2015 Nov 5;2015:915-24. eCollection 2015.

引用本文的文献

Classification of clinically useful sentences in clinical evidence resources.临床证据资源中临床有用句子的分类。

J Biomed Inform. 2016 Apr;60:14-22. doi: 10.1016/j.jbi.2016.01.003. Epub 2016 Jan 13.

Context-driven automatic subgraph creation for literature-based discovery.用于基于文献的发现的上下文驱动自动子图创建

J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.

Text summarization in the biomedical domain: a systematic review of recent research.生物医学领域的文本摘要：近期研究的系统综述

J Biomed Inform. 2014 Dec;52:457-67. doi: 10.1016/j.jbi.2014.06.009. Epub 2014 Jul 10.

Natural language processing pipelines to annotate BioC collections with an application to the NCBI disease corpus.用于注释BioC文集的自然语言处理管道及其在NCBI疾病语料库中的应用。

Database (Oxford). 2014 Jun 16;2014. doi: 10.1093/database/bau056. Print 2014.

本文引用的文献

Constructing a semantic predication gold standard from the biomedical literature.从生物医学文献中构建语义谓词黄金标准。

BMC Bioinformatics. 2011 Dec 20;12:486. doi: 10.1186/1471-2105-12-486.

Degree centrality for semantic abstraction summarization of therapeutic studies.治疗研究语义抽象总结的度中心性。

J Biomed Inform. 2011 Oct;44(5):830-8. doi: 10.1016/j.jbi.2011.05.001. Epub 2011 May 8.

Clustering more than two million biomedical publications: comparing the accuracies of nine text-based similarity approaches.对两百多万篇生物医学文献进行聚类：比较九种基于文本的相似度方法的准确性。

PLoS One. 2011 Mar 17;6(3):e18029. doi: 10.1371/journal.pone.0018029.

An overview of MetaMap: historical perspective and recent advances.MetaMap 概述：历史视角与最新进展。

J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.

Clique-based data mining for related genes in a biomedical database.生物医学数据库中基于团的数据挖掘相关基因

BMC Bioinformatics. 2009 Jul 1;10:205. doi: 10.1186/1471-2105-10-205.

Complex discovery from weighted PPI networks.基于加权 PPI 网络的复杂发现。

Bioinformatics. 2009 Aug 1;25(15):1891-7. doi: 10.1093/bioinformatics/btp311. Epub 2009 May 12.

Automatic summarization of MEDLINE citations for evidence-based medical treatment: a topic-oriented evaluation.基于证据的医学治疗的 MEDLINE 引文自动摘要：面向主题的评估。

J Biomed Inform. 2009 Oct;42(5):801-13. doi: 10.1016/j.jbi.2008.10.002. Epub 2008 Nov 5.

Automatic summarization of mouse gene information by clustering and sentence extraction from MEDLINE abstracts.通过对MEDLINE摘要进行聚类和句子提取来自动汇总小鼠基因信息。

AMIA Annu Symp Proc. 2007 Oct 11;2007:831-5.

Exploration of a collection of documents in neuroscience and extraction of topics by clustering.探索神经科学领域的一系列文献并通过聚类提取主题。

Neural Netw. 2008 Oct;21(8):1205-11. doi: 10.1016/j.neunet.2008.05.009. Epub 2008 Jun 7.

Identifying gene-disease associations using centrality on a literature mined gene-interaction network.利用文献挖掘的基因相互作用网络中的中心性来识别基因与疾病的关联。

Bioinformatics. 2008 Jul 1;24(13):i277-85. doi: 10.1093/bioinformatics/btn182.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于图的生物医学文献综述的聚类团分析。

Clustering cliques for graph-based summarization of the biomedical research literature.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献