MKEM：一种用于挖掘未被发现的公共知识的多层次知识涌现模型。

MKEM: a Multi-level Knowledge Emergence Model for mining undiscovered public knowledge.

机构信息

Department of Bio and Brain Engineering, KAIST, South Korea.

出版信息

BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S3. doi: 10.1186/1471-2105-11-S2-S3.

DOI:10.1186/1471-2105-11-S2-S3

PMID:20406501

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3165192/

Abstract

BACKGROUND

Since Swanson proposed the Undiscovered Public Knowledge (UPK) model, there have been many approaches to uncover UPK by mining the biomedical literature. These earlier works, however, required substantial manual intervention to reduce the number of possible connections and are mainly applied to disease-effect relation. With the advancement in biomedical science, it has become imperative to extract and combine information from multiple disjoint researches, studies and articles to infer new hypotheses and expand knowledge.

METHODS

We propose MKEM, a Multi-level Knowledge Emergence Model, to discover implicit relationships using Natural Language Processing techniques such as Link Grammar and Ontologies such as Unified Medical Language System (UMLS) MetaMap. The contribution of MKEM is as follows: First, we propose a flexible knowledge emergence model to extract implicit relationships across different levels such as molecular level for gene and protein and Phenomic level for disease and treatment. Second, we employ MetaMap for tagging biological concepts. Third, we provide an empirical and systematic approach to discover novel relationships.

RESULTS

We applied our system on 5000 abstracts downloaded from PubMed database. We performed the performance evaluation as a gold standard is not yet available. Our system performed with a good precision and recall and we generated 24 hypotheses.

CONCLUSIONS

Our experiments show that MKEM is a powerful tool to discover hidden relationships residing in extracted entities that were represented by our Substance-Effect-Process-Disease-Body Part (SEPDB) model.

摘要

背景

自从 Swanson 提出未被发现的公共知识（UPK）模型以来，已经有许多通过挖掘生物医学文献来发现 UPK 的方法。然而，这些早期的工作需要大量的手动干预来减少可能的连接数量，并且主要应用于疾病-效应关系。随着生物医学科学的进步，从多个不相关的研究、研究和文章中提取和组合信息以推断新的假设和扩展知识变得至关重要。

方法

我们提出了 MKEM，一种多层次知识发现模型，使用自然语言处理技术（如链接语法和本体论，如统一医学语言系统（UMLS）MetaMap）来发现隐含关系。MKEM 的贡献如下：首先，我们提出了一种灵活的知识发现模型，以提取不同层次（如基因和蛋白质的分子水平以及疾病和治疗的表型水平）之间的隐含关系。其次，我们使用 MetaMap 对生物概念进行标记。第三，我们提供了一种发现新关系的经验和系统方法。

结果

我们将我们的系统应用于从 PubMed 数据库下载的 5000 篇摘要。由于尚未提供黄金标准，因此我们进行了性能评估。我们的系统具有良好的精度和召回率，并且生成了 24 个假设。

结论

我们的实验表明，MKEM 是一种强大的工具，可以发现隐藏在我们的物质-效应-过程-疾病-身体部位（SEPDB）模型所表示的提取实体中存在的关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ecb/3165192/bc681f0ee23b/1471-2105-11-S2-S3-1.jpg

相似文献

MKEM: a Multi-level Knowledge Emergence Model for mining undiscovered public knowledge.MKEM：一种用于挖掘未被发现的公共知识的多层次知识涌现模型。

BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S3. doi: 10.1186/1471-2105-11-S2-S3.

Knowledge based word-concept model estimation and refinement for biomedical text mining.用于生物医学文本挖掘的基于知识的词概念模型估计与优化。

J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.

Text mining facilitates database curation - extraction of mutation-disease associations from Bio-medical literature.文本挖掘有助于数据库管理——从生物医学文献中提取突变与疾病的关联。

BMC Bioinformatics. 2015 Jun 6;16:185. doi: 10.1186/s12859-015-0609-x.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Mining biomedical data using MetaMap Transfer (MMtx) and the Unified Medical Language System (UMLS).使用MetaMap Transfer（MMtx）和统一医学语言系统（UMLS）挖掘生物医学数据。

Methods Mol Biol. 2007;408:153-69. doi: 10.1007/978-1-59745-547-3_9.

Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.生物医学文本到UMLS元词表的有效映射：MetaMap程序

Proc AMIA Symp. 2001:17-21.

Assessment of disease named entity recognition on a corpus of annotated sentences.基于带注释句子语料库的疾病命名实体识别评估。

BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2105-9-S3-S3.

Using data-driven sublanguage pattern mining to induce knowledge models: application in medical image reports knowledge representation.利用数据驱动的子语言模式挖掘来诱导知识模型：在医学图像报告知识表示中的应用。

BMC Med Inform Decis Mak. 2018 Jul 6;18(1):61. doi: 10.1186/s12911-018-0645-3.

A Unified Medical Language System (UMLS) based system for Literature-Based Discovery in medicine.一种基于统一医学语言系统（UMLS）的医学文献发现系统。

Stud Health Technol Inform. 2013;192:412-6.

Extracting biomedical events from pairs of text entities.从文本实体对中提取生物医学事件。

BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-16-S10-S8. Epub 2015 Jul 13.

引用本文的文献

Computational Literature-based Discovery for Natural Products Research: Current State and Future Prospects.基于计算文献的天然产物研究发现：现状与未来展望。

Front Bioinform. 2022 Mar 15;2:827207. doi: 10.3389/fbinf.2022.827207. eCollection 2022.

A systematic review on literature-based discovery workflow.基于文献的发现工作流程的系统综述。

PeerJ Comput Sci. 2019 Nov 18;5:e235. doi: 10.7717/peerj-cs.235. eCollection 2019.

A context-based ABC model for literature-based discovery.基于上下文的文献发现 ABC 模型。

PLoS One. 2019 Apr 24;14(4):e0215313. doi: 10.1371/journal.pone.0215313. eCollection 2019.

Generation and application of drug indication inference models using typed network motif comparison analysis.使用类型化网络基元比较分析生成和应用药物适应证推断模型。

BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S2. doi: 10.1186/1472-6947-13-S1-S2. Epub 2013 Apr 5.

Discovering context-specific relationships from biological literature by using multi-level context terms.通过使用多层次语境术语从生物文献中发现特定语境关系。

BMC Med Inform Decis Mak. 2012 Apr 30;12 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-12-S1-S1.

本文引用的文献

Reactive oxygen species up-regulate p53 and Puma; a possible mechanism for apoptosis during combined treatment with TRAIL and wogonin.活性氧上调 p53 和 Puma；TRAIL 和白杨素联合治疗诱导细胞凋亡的可能机制。

Br J Pharmacol. 2009 Aug;157(7):1189-202. doi: 10.1111/j.1476-5381.2009.00245.x. Epub 2009 May 11.

Discovering novel causal patterns from biomedical natural-language texts using Bayesian nets.使用贝叶斯网络从生物医学自然语言文本中发现新的因果模式。

IEEE Trans Inf Technol Biomed. 2008 Nov;12(6):714-22. doi: 10.1109/TITB.2008.920793.

Genistein, a dietary isoflavone, down-regulates the MDM2 oncogene at both transcriptional and posttranslational levels.染料木黄酮是一种膳食异黄酮，它在转录和翻译后水平下调MDM2癌基因。

Cancer Res. 2005 Sep 15;65(18):8200-8. doi: 10.1158/0008-5472.CAN-05-1302.

The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.自然语言处理中领域知识与语言结构的相互作用：解读生物医学文本中的上位命题

J Biomed Inform. 2003 Dec;36(6):462-77. doi: 10.1016/j.jbi.2003.11.003.

Integrating a hypernymic proposition interpreter into a semantic processor for biomedical texts.将一个上位命题解释器集成到用于生物医学文本的语义处理器中。

AMIA Annu Symp Proc. 2003;2003:239-43.

Generating hypotheses by discovering implicit associations in the literature: a case report of a search for new potential therapeutic uses for thalidomide.通过发现文献中的隐性关联来生成假设：关于寻找沙利度胺新潜在治疗用途的病例报告

J Am Med Inform Assoc. 2003 May-Jun;10(3):252-9. doi: 10.1197/jamia.M1158. Epub 2003 Jan 28.

Exploring text mining from MEDLINE.探索来自医学文献数据库（MEDLINE）的文本挖掘。

Proc AMIA Symp. 2002:722-6.

Supporting discovery in medicine by association rule mining in Medline and UMLS.通过在医学文献数据库（Medline）和一体化医学语言系统（UMLS）中进行关联规则挖掘来支持医学发现。

Stud Health Technol Inform. 2001;84(Pt 2):1344-8.

Fish oil, Raynaud's syndrome, and undiscovered public knowledge.鱼油、雷诺综合征与未被发现的公共知识。

Perspect Biol Med. 1986 Autumn;30(1):7-18. doi: 10.1353/pbm.1986.0087.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

MKEM：一种用于挖掘未被发现的公共知识的多层次知识涌现模型。

MKEM: a Multi-level Knowledge Emergence Model for mining undiscovered public knowledge.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献