Department of Computer Science, Virginia Commonwealth University, 401 S. Main St., Rm E4222, Richmond, VA 23284, USA.
Department of Computer Science, Virginia Commonwealth University, 401 S. Main St., Rm E4222, Richmond, VA 23284, USA.
J Biomed Inform. 2017 Oct;74:20-32. doi: 10.1016/j.jbi.2017.08.011. Epub 2017 Aug 31.
This paper provides an introduction and overview of literature based discovery (LBD) in the biomedical domain. It introduces the reader to modern and historical LBD models, key system components, evaluation methodologies, and current trends. After completion, the reader will be familiar with the challenges and methodologies of LBD. The reader will be capable of distinguishing between recent LBD systems and publications, and be capable of designing an LBD system for a specific application.
From biomedical researchers curious about LBD, to someone looking to design an LBD system, to an LBD expert trying to catch up on trends in the field. The reader need not be familiar with LBD, but knowledge of biomedical text processing tools is helpful.
This paper describes a unifying framework for LBD systems. Within this framework, different models and methods are presented to both distinguish and show overlap between systems. Topics include term and document representation, system components, and an overview of models including co-occurrence models, semantic models, and distributional models. Other topics include uninformative term filtering, term ranking, results display, system evaluation, an overview of the application areas of drug development, drug repurposing, and adverse drug event prediction, and challenges and future directions. A timeline showing contributions to LBD, and a table summarizing the works of several authors is provided. Topics are presented from a high level perspective. References are given if more detailed analysis is required.
本文介绍了生物医学领域中基于文献的发现(LBD)的介绍和概述。它向读者介绍了现代和历史 LBD 模型、关键系统组件、评估方法以及当前趋势。完成后,读者将熟悉 LBD 的挑战和方法。读者将能够区分最近的 LBD 系统和出版物,并能够为特定应用设计 LBD 系统。
从对 LBD 感兴趣的生物医学研究人员,到想要设计 LBD 系统的人员,再到试图了解该领域最新趋势的 LBD 专家。读者不必熟悉 LBD,但了解生物医学文本处理工具会有所帮助。
本文描述了 LBD 系统的统一框架。在这个框架内,提出了不同的模型和方法来区分和展示系统之间的重叠。主题包括术语和文档表示、系统组件以及包括共现模型、语义模型和分布模型在内的模型概述。其他主题包括无信息术语过滤、术语排名、结果显示、系统评估、药物开发、药物再利用和药物不良事件预测的应用领域概述以及挑战和未来方向。提供了一个显示 LBD 贡献的时间线和一个总结几位作者工作的表格。从高层次的角度介绍了主题。如果需要更详细的分析,则会提供参考。