Literature Services, EMBL-EBI, Wellcome Trust Genome Campus, Cambridge, UK.
Nucleic Acids Res. 2024 Jan 5;52(D1):D1668-D1676. doi: 10.1093/nar/gkad1085.
Europe PMC (https://europepmc.org/) is an open access database of life science journal articles and preprints, which contains over 42 million abstracts and over 9 million full text articles accessible via the website, APIs and bulk download. This publication outlines new developments to the Europe PMC platform since the last database update in 2020 (1) and focuses on five main areas. (i) Improving discoverability, reproducibility and trust in preprints by indexing new preprint content, enriching preprint metadata and identifying withdrawn and removed preprints. (ii) Enhancing support for text and data mining by expanding the types of annotations provided and developing the Europe PMC Annotations Corpus, which can be used to train machine learning models to increase their accuracy and precision. (iii) Developing the Article Status Monitor tool and email alerts, to notify users about new articles and updates to existing records. (iv) Positioning Europe PMC as an open scholarly infrastructure through increasing the portion of open source core software, improving sustainability and accessibility of the service.
Europe PMC(https://europepmc.org/)是一个开放获取的生命科学期刊文章和预印本数据库,其中包含超过 4200 万条摘要和超过 900 万篇全文文章,可以通过网站、API 和批量下载访问。本出版物概述了自 2020 年上一次数据库更新以来 Europe PMC 平台的新发展(1),并重点介绍了五个主要领域。(i)通过索引新的预印本内容、丰富预印本元数据以及识别撤回和删除的预印本来提高预印本的可发现性、可重复性和可信度。(ii)通过扩展提供的注释类型并开发 Europe PMC 注释语料库来增强对文本和数据挖掘的支持,该语料库可用于训练机器学习模型以提高其准确性和精度。(iii)开发文章状态监控工具和电子邮件警报,通知用户有关新文章和现有记录更新的信息。(iv)通过增加开源核心软件的比例、提高服务的可持续性和可访问性,将 Europe PMC 定位为一个开放的学术基础设施。