Suppr超能文献

pubmed.mineR:一个带有文本挖掘算法的R包,用于分析PubMed摘要。

pubmed.mineR: an R package with text-mining algorithms to analyse PubMed abstracts.

作者信息

Rani Jyoti, Shah A B Rauf, Ramachandran Srinivasan

机构信息

GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, New Delhi 110 025, India.

出版信息

J Biosci. 2015 Oct;40(4):671-82. doi: 10.1007/s12038-015-9552-2.

Abstract

The PubMed literature database is a valuable source of information for scientific research. It is rich in biomedical literature with more than 24 million citations. Data-mining of voluminous literature is a challenging task. Although several text-mining algorithms have been developed in recent years with focus on data visualization, they have limitations such as speed, are rigid and are not available in the open source. We have developed an R package, pubmed.mineR, wherein we have combined the advantages of existing algorithms, overcome their limitations, and offer user flexibility and link with other packages in Bioconductor and the Comprehensive R Network (CRAN) in order to expand the user capabilities for executing multifaceted approaches. Three case studies are presented, namely, 'Evolving role of diabetes educators', 'Cancer risk assessment' and 'Dynamic concepts on disease and comorbidity' to illustrate the use of pubmed.mineR. The package generally runs fast with small elapsed times in regular workstations even on large corpus sizes and with compute intensive functions. The pubmed.mineR is available at http://cran.rproject. org/web/packages/pubmed.mineR.

摘要

PubMed文献数据库是科学研究的宝贵信息来源。它拥有丰富的生物医学文献,引用次数超过2400万次。对大量文献进行数据挖掘是一项具有挑战性的任务。尽管近年来已经开发了几种侧重于数据可视化的文本挖掘算法,但它们存在速度慢、过于僵化以及不开源等局限性。我们开发了一个R包pubmed.mineR,在其中我们结合了现有算法的优点,克服了它们的局限性,并为用户提供灵活性,以及与生物导体(Bioconductor)和综合R网络(CRAN)中的其他包建立链接,以扩展用户执行多方面方法的能力。本文展示了三个案例研究,即“糖尿病教育者角色的演变”、“癌症风险评估”和“疾病与共病的动态概念”,以说明pubmed.mineR的使用。即使处理大型语料库和计算密集型函数,该包在常规工作站上通常运行速度很快,耗时较短。pubmed.mineR可在http://cran.rproject.org/web/packages/pubmed.mineR获取。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验