Suppr超能文献

利用图像增强生物医学搜索界面。

Enhancing biomedical search interfaces with images.

作者信息

Trelles Trabucco Juan, Arighi Cecilia, Shatkay Hagit, Marai G Elisabeta

机构信息

Department of Computer Science, University of Illinois Chicago, Chicago, IL 60607, USA.

Department of Computer and Information Science, University of Delaware, Newark, DE 19716, USA.

出版信息

Bioinform Adv. 2023 Jul 17;3(1):vbad095. doi: 10.1093/bioadv/vbad095. eCollection 2023.

Abstract

MOTIVATION

Figures in biomedical papers communicate essential information with the potential to identify relevant documents in biomedical and clinical settings. However, academic search interfaces mainly search over text fields.

RESULTS

We describe a search system for biomedical documents that leverages image modalities and an existing index server. We integrate a problem-specific taxonomy of image modalities and image-based data into a custom search system. Our solution features a front-end interface to enhance classical document search results with image-related data, including page thumbnails, figures, captions and image-modality information. We demonstrate the system on a subset of the CORD-19 document collection. A quantitative evaluation demonstrates higher precision and recall for biomedical document retrieval. A qualitative evaluation with domain experts further highlights our solution's benefits to biomedical search.

AVAILABILITY AND IMPLEMENTATION

A demonstration is available at https://runachay.evl.uic.edu/scholar. Our code and image models can be accessed via github.com/uic-evl/bio-search. The dataset is continuously expanded.

摘要

动机

生物医学论文中的图表传达了重要信息,有可能在生物医学和临床环境中识别相关文档。然而,学术搜索界面主要是在文本字段上进行搜索。

结果

我们描述了一种用于生物医学文档的搜索系统,该系统利用图像模态和现有的索引服务器。我们将特定问题的图像模态分类法和基于图像的数据集成到一个定制的搜索系统中。我们的解决方案具有一个前端界面,可通过与图像相关的数据(包括页面缩略图、图表、标题和图像模态信息)来增强经典文档搜索结果。我们在CORD-19文档集合的一个子集中演示了该系统。定量评估表明,生物医学文档检索的精度和召回率更高。与领域专家进行的定性评估进一步突出了我们的解决方案对生物医学搜索的益处。

可用性和实现方式

可在https://runachay.evl.uic.edu/scholar上进行演示。我们的代码和图像模型可通过github.com/uic-evl/bio-search访问。数据集在不断扩展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eb8/10359625/243fb1e74a7a/vbad095f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验