使用康斯坦茨信息挖掘器（KNIME）平台对生物医学文章进行文本挖掘：以溶血尿毒综合征为例

Text Mining of Biomedical Articles Using the Konstanz Information Miner (KNIME) Platform: Hemolytic Uremic Syndrome as a Case Study.

作者信息

Dorr Ricardo A, Casal Juan J, Toriano Roxana

机构信息

Facultad de Medicina, Instituto de Fisiología y Biofísica Bernardo Houssay (IFIBIO Houssay), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina.

出版信息

Healthc Inform Res. 2022 Jul;28(3):276-283. doi: 10.4258/hir.2022.28.3.276. Epub 2022 Jul 31.

DOI:10.4258/hir.2022.28.3.276

PMID:35982602

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9388920/

Abstract

OBJECTIVES

Automated systems for information extraction are becoming very useful due to the enormous scale of the existing literature and the increasing number of scientific articles published worldwide in the field of medicine. We aimed to develop an accessible method using the open-source platform KNIME to perform text mining (TM) on indexed publications. Material from scientific publications in the field of life sciences was obtained and integrated by mining information on hemolytic uremic syndrome (HUS) as a case study.

METHODS

Text retrieved from Europe PubMed Central (PMC) was processed using specific KNIME nodes. The results were presented in the form of tables or graphical representations. Data could also be compared with those from other sources.

RESULTS

By applying TM to the scientific literature on HUS as a case study, and by selecting various fields from scientific articles, it was possible to obtain a list of individual authors of publications, build bags of words and study their frequency and temporal use, discriminate topics (HUS vs. atypical HUS) in an unsupervised manner, and cross-reference information with a list of FDA-approved drugs.

CONCLUSIONS

Following the instructions in the tutorial, researchers without programming skills can successfully perform TM on the indexed scientific literature. This methodology, using KNIME, could become a useful tool for performing statistics, analyzing behaviors, following trends, and making forecast related to medical issues. The advantages of TM using KNIME include enabling the integration of scientific information, helping to carry out reviews, and optimizing the management of resources dedicated to basic and clinical research.

摘要

目标

由于现有文献规模巨大且全球医学领域发表的科学文章数量不断增加，信息提取自动化系统变得非常有用。我们旨在开发一种使用开源平台KNIME的可访问方法，对索引出版物进行文本挖掘（TM）。作为案例研究，通过挖掘溶血尿毒综合征（HUS）的信息，获取并整合了生命科学领域科学出版物的材料。

方法

使用特定的KNIME节点处理从欧洲 PubMed 中心（PMC）检索到的文本。结果以表格或图形表示的形式呈现。数据也可以与其他来源的数据进行比较。

结果

通过将TM应用于以HUS为案例研究的科学文献，并从科学文章中选择各个字段，可以获得出版物的个人作者列表，构建词袋并研究其频率和时间使用情况，可以以无监督方式区分主题（HUS与非典型HUS），并将信息与FDA批准的药物列表进行交叉引用。

结论

按照教程中的说明，没有编程技能的研究人员可以成功地对索引科学文献进行TM。这种使用KNIME的方法可能成为进行统计、分析行为、跟踪趋势以及对医学问题进行预测的有用工具。使用KNIME进行TM的优点包括能够整合科学信息、帮助进行综述以及优化用于基础研究和临床研究资源的管理

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/602b/9388920/419fe9683830/hir-2022-28-3-276f1.jpg

相似文献

Text Mining of Biomedical Articles Using the Konstanz Information Miner (KNIME) Platform: Hemolytic Uremic Syndrome as a Case Study.使用康斯坦茨信息挖掘器（KNIME）平台对生物医学文章进行文本挖掘：以溶血尿毒综合征为例

Healthc Inform Res. 2022 Jul;28(3):276-283. doi: 10.4258/hir.2022.28.3.276. Epub 2022 Jul 31.

[Obtaining new information on hemolytic uremic syndrome by text mining].[通过文本挖掘获取溶血尿毒综合征的新信息]

Medicina (B Aires). 2022;82(4):513-524.

Searching and Evaluating Publications and Preprints Using Europe PMC.利用 Europe PMC 搜索和评估出版物及预印本

Curr Protoc. 2023 Mar;3(3):e694. doi: 10.1002/cpz1.694.

Text mining in livestock animal science: introducing the potential of text mining to animal sciences.文本挖掘在畜牧动物科学中的应用：介绍文本挖掘在动物科学中的应用潜力。

J Anim Sci. 2012 Oct;90(10):3666-76. doi: 10.2527/jas.2011-4841. Epub 2012 Jun 4.

[Text mining in scientific publications with Argentine authors].[对阿根廷作者的科学出版物进行文本挖掘]

Medicina (B Aires). 2021;81(2):214-223.

Automated curation of gene name normalization results using the Konstanz information miner.使用康斯坦茨信息挖掘器对基因名称标准化结果进行自动管理。

J Biomed Inform. 2015 Feb;53:58-64. doi: 10.1016/j.jbi.2014.08.016. Epub 2014 Sep 10.

Biomedical Literature Mining and Its Components.生物医学文献挖掘及其组成部分。

Methods Mol Biol. 2022;2496:1-16. doi: 10.1007/978-1-0716-2305-3_1.

Drug discovery applications for KNIME: an open source data mining platform.KNIME 在药物发现中的应用：一个开源的数据挖掘平台。

Curr Top Med Chem. 2012;12(18):1965-79. doi: 10.2174/156802612804910331.

Hemolytic uremic syndrome: differential diagnosis with the onset of inflammatory bowel diseases.溶血性尿毒症综合征：与炎症性肠病发作的鉴别诊断。

Acta Biomed. 2018 Dec 17;89(9-S):153-157. doi: 10.23750/abm.v89i9-S.7911.

KNIME-CDK: Workflow-driven cheminformatics.KNIME-CDK：基于工作流的化学信息学。

BMC Bioinformatics. 2013 Aug 22;14:257. doi: 10.1186/1471-2105-14-257.

本文引用的文献

[Obtaining new information on hemolytic uremic syndrome by text mining].[通过文本挖掘获取溶血尿毒综合征的新信息]

Medicina (B Aires). 2022;82(4):513-524.

[Text mining in scientific publications with Argentine authors].[对阿根廷作者的科学出版物进行文本挖掘]

Medicina (B Aires). 2021;81(2):214-223.

Review and Analysis of Massively Registered Clinical Trials of COVID-19 using the Text Mining Approach.利用文本挖掘方法对大规模注册的 COVID-19 临床试验进行回顾和分析。

Rev Recent Clin Trials. 2021;16(3):242-257. doi: 10.2174/1574887115666201202110919.

Unsupervised word embeddings capture latent knowledge from materials science literature.无监督词嵌入方法可以从材料科学文献中提取潜在知识。

Nature. 2019 Jul;571(7763):95-98. doi: 10.1038/s41586-019-1335-8. Epub 2019 Jul 3.

Pathogenic role of inflammatory response during Shiga toxin-associated hemolytic uremic syndrome (HUS).志贺毒素相关性溶血尿毒综合征（HUS）中炎症反应的致病作用。

Pediatr Nephrol. 2018 Nov;33(11):2057-2071. doi: 10.1007/s00467-017-3876-0. Epub 2018 Jan 25.

Text Mining in Biomedical Domain with Emphasis on Document Clustering.生物医学领域中的文本挖掘，重点在于文档聚类

Healthc Inform Res. 2017 Jul;23(3):141-146. doi: 10.4258/hir.2017.23.3.141. Epub 2017 Jul 31.

HUS and atypical HUS.溶血尿毒综合征和非典型溶血尿毒综合征。

Blood. 2017 May 25;129(21):2847-2856. doi: 10.1182/blood-2016-11-709865. Epub 2017 Apr 17.

The Virtual Physiological Human: Ten Years After.虚拟生理人：十年之后

Annu Rev Biomed Eng. 2016 Jul 11;18:103-23. doi: 10.1146/annurev-bioeng-110915-114742.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用康斯坦茨信息挖掘器（KNIME）平台对生物医学文章进行文本挖掘：以溶血尿毒综合征为例

Text Mining of Biomedical Articles Using the Konstanz Information Miner (KNIME) Platform: Hemolytic Uremic Syndrome as a Case Study.

作者信息

机构信息

出版信息

OBJECTIVES

METHODS

RESULTS

CONCLUSIONS

目标

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献