Development of an information retrieval tool for biomedical patents.

Affiliations

Centre Biological Engineering, University of Minho, Braga 4710-057, Portugal; Silicolife Lda, Braga 4715-387, Portugal.

Centre Biological Engineering, University of Minho, Braga 4710-057, Portugal.

Publication information

Comput Methods Programs Biomed. 2018 Jun;159:125-134. doi: 10.1016/j.cmpb.2018.03.012. Epub 2018 Mar 14.

DOI: 10.1016/j.cmpb.2018.03.012
PMID: 29650307
Abstract

BACKGROUND AND OBJECTIVE

The volume of biomedical literature has grown steadily in recent years. Patent documents have followed the same trend and are important sources of biomedical knowledge, technical details and curated data, which are assembled during the granting process. The field of Biomedical Text Mining (BioTM) develops solutions to the problems posed by the unstructured nature of natural language, which makes searching for information a challenging task. Several BioTM techniques can be applied to patents. Among them, Information Retrieval (IR) covers the processes by which relevant data are obtained from collections of documents. The main goal of this work was to build a patent pipeline that addresses IR tasks over patent repositories, making these documents amenable to BioTM tasks.
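
As a rough illustration of what an IR step does, the sketch below ranks a tiny document collection against a free-text query using a plain TF-IDF score. It is not taken from the paper: the scoring scheme, the function names `tokenize` and `rank`, and the three toy documents are all assumptions made up for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rank(query, documents):
    """Return (doc_id, score) pairs sorted by descending TF-IDF score."""
    tokenized = {doc_id: tokenize(text) for doc_id, text in documents.items()}
    n_docs = len(tokenized)
    # Document frequency: the number of documents in which each term occurs.
    df = Counter(term for tokens in tokenized.values() for term in set(tokens))
    scores = {}
    for doc_id, tokens in tokenized.items():
        tf = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term in tf:
                # Smoothed inverse document frequency; rarer terms weigh more.
                idf = math.log(n_docs / df[term]) + 1.0
                score += (tf[term] / len(tokens)) * idf
        scores[doc_id] = score
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy collection, invented for illustration only (not real patent data).
docs = {
    "P1": "Microbial production of vanillin from ferulic acid",
    "P2": "Optical character recognition of scanned documents",
    "P3": "Vanillin flavour composition for food products",
}
ranking = rank("vanillin production", docs)
for doc_id, score in ranking:
    print(doc_id, round(score, 3))
```

Real patent retrieval relies on repository search services and far richer indexing; the point here is only the shape of the task: a query goes in, and a ranked subset of the collection comes out.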

METHODS

The pipeline was developed within @Note2, an open-source computational framework for BioTM, by adding a number of modules to the core libraries, including patent metadata and full-text retrieval, PDF-to-text conversion and optical character recognition. User interfaces for the main operations were also developed and delivered in a new @Note2 plug-in.
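
The stages listed above can be pictured as a small chain of steps. The following is a minimal sketch, not the @Note2 implementation: every name in it (`PatentRecord`, `process_patent`, the stage callables, the `min_chars` threshold and the identifier `EX-0001`) is hypothetical, and the choice to run OCR only when the PDF yields almost no text is an assumption of this example rather than something the abstract states.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PatentRecord:
    """Result of running one patent through the sketched pipeline."""
    patent_id: str
    metadata: dict
    text: str
    used_ocr: bool


def process_patent(
    patent_id: str,
    fetch_metadata: Callable[[str], dict],
    fetch_pdf: Callable[[str], bytes],
    pdf_to_text: Callable[[bytes], str],
    ocr: Callable[[bytes], str],
    min_chars: int = 20,
) -> PatentRecord:
    """Retrieve metadata and full text for a single patent.

    Each stage is passed in as a callable so that any repository client,
    PDF converter or OCR engine could be plugged in.
    """
    metadata = fetch_metadata(patent_id)
    pdf_bytes = fetch_pdf(patent_id)
    text = pdf_to_text(pdf_bytes).strip()
    used_ocr = False
    # Assumed policy: a near-empty text layer suggests a scanned PDF,
    # so fall back to OCR on the same file.
    if len(text) < min_chars:
        text = ocr(pdf_bytes).strip()
        used_ocr = True
    return PatentRecord(patent_id, metadata, text, used_ocr)


# Stand-in stages for demonstration; none of these touch a real service.
record = process_patent(
    "EX-0001",  # hypothetical identifier
    fetch_metadata=lambda pid: {"title": "Example patent"},
    fetch_pdf=lambda pid: b"%PDF-stub",
    pdf_to_text=lambda pdf: "",  # simulates a scanned PDF with no text layer
    ocr=lambda pdf: "Text recovered from the scanned page images.",
)
print(record.used_ocr, record.text)
```

Passing the stages in as functions keeps the control flow testable without network access or an OCR engine installed.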

RESULTS

Integrating these tools into @Note2 makes it possible to run BioTM tools over patent texts, including Information Extraction tasks such as Named Entity Recognition or Relation Extraction. We demonstrated the pipeline's main functions in a case study using an available benchmark dataset from the BioCreative challenges. We also show the use of the plug-in with a user query related to the production of vanillin.

CONCLUSIONS

This work makes all the relevant content from patents available to the scientific community, drastically decreasing the time required for this task, and provides graphical interfaces that ease the use of these tools.

