• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Design, implementation, and operation of a rapid, robust named entity recognition web service.一个快速、强大的命名实体识别网络服务的设计、实现与运营。
J Cheminform. 2019 Mar 8;11(1):19. doi: 10.1186/s13321-019-0344-9.
2
Next generation community assessment of biomedical entity recognition web servers: metrics, performance, interoperability aspects of BeCalm.生物医学实体识别网络服务器的下一代社区评估:BeCalm的指标、性能及互操作性方面
J Cheminform. 2019 Jun 24;11(1):42. doi: 10.1186/s13321-019-0363-6.
3
Web services-based text-mining demonstrates broad impacts for interoperability and process simplification.基于网络服务的文本挖掘对互操作性和流程简化具有广泛影响。
Database (Oxford). 2014 Jun 10;2014. doi: 10.1093/database/bau050. Print 2014.
4
BCC-NER: bidirectional, contextual clues named entity tagger for gene/protein mention recognition.BCC-NER:用于基因/蛋白质提及识别的双向上下文线索命名实体标记器。
EURASIP J Bioinform Syst Biol. 2017 Dec;2017(1):7. doi: 10.1186/s13637-017-0060-6. Epub 2017 May 5.
5
Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations.利用词向量将领域知识融入化学和生物医学命名实体识别。
J Cheminform. 2015 Jan 19;7(Suppl 1 Text mining for chemistry and the CHEMDNER track):S9. doi: 10.1186/1758-2946-7-S1-S9. eCollection 2015.
6
MER: a shell script and annotation server for minimal named entity recognition and linking.MER:用于最小命名实体识别与链接的 shell 脚本及注释服务器。
J Cheminform. 2018 Dec 5;10(1):58. doi: 10.1186/s13321-018-0312-9.
7
Improving spaCy dependency annotation and PoS tagging web service using independent NER services.使用独立命名实体识别服务改进spaCy依存关系标注和词性标注网络服务。
Genomics Inform. 2019 Jun;17(2):e21. doi: 10.5808/GI.2019.17.2.e21. Epub 2019 Jun 24.
8
LSTMVoter: chemical named entity recognition using a conglomerate of sequence labeling tools.LSTMVoter:使用序列标注工具集合进行化学命名实体识别。
J Cheminform. 2019 Jan 10;11(1):3. doi: 10.1186/s13321-018-0327-2.
9
Biomedical named entity recognition using deep neural networks with contextual information.基于上下文信息的深度神经网络的生物医学命名实体识别。
BMC Bioinformatics. 2019 Dec 27;20(1):735. doi: 10.1186/s12859-019-3321-4.
10
A hybrid named entity tagger for tagging human proteins/genes.一种用于标记人类蛋白质/基因的混合命名实体标记器。
Int J Data Min Bioinform. 2014;10(3):315-28. doi: 10.1504/ijdmb.2014.064545.

引用本文的文献

1
Getting Started with the IDG KMC Datasets and Tools.开始使用 IDG KMC 数据集和工具。
Curr Protoc. 2022 Jan;2(1):e355. doi: 10.1002/cpz1.355.
2
Microbe-set enrichment analysis facilitates functional interpretation of microbiome profiling data.微生物组富集分析有助于对微生物组分析数据进行功能解释。
Sci Rep. 2020 Dec 8;10(1):21466. doi: 10.1038/s41598-020-78511-y.
3
Improving spaCy dependency annotation and PoS tagging web service using independent NER services.使用独立命名实体识别服务改进spaCy依存关系标注和词性标注网络服务。
Genomics Inform. 2019 Jun;17(2):e21. doi: 10.5808/GI.2019.17.2.e21. Epub 2019 Jun 24.
4
Next generation community assessment of biomedical entity recognition web servers: metrics, performance, interoperability aspects of BeCalm.生物医学实体识别网络服务器的下一代社区评估:BeCalm的指标、性能及互操作性方面
J Cheminform. 2019 Jun 24;11(1):42. doi: 10.1186/s13321-019-0363-6.
5
Geneshot: search engine for ranking genes from arbitrary text queries.Geneshot:从任意文本查询中对基因进行排名的搜索引擎。
Nucleic Acids Res. 2019 Jul 2;47(W1):W571-W577. doi: 10.1093/nar/gkz393.

本文引用的文献

1
TISSUES 2.0: an integrative web resource on mammalian tissue expression.TISSUES 2.0:一个关于哺乳动物组织表达的综合网络资源。
Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay003.
2
RAIN: RNA-protein Association and Interaction Networks.RAIN:RNA-蛋白质关联与相互作用网络
Database (Oxford). 2017 Jan 10;2017. doi: 10.1093/database/baw167. Print 2017.
3
BRENDA in 2017: new perspectives and new tools in BRENDA.2017年的BRENDA:BRENDA中的新视角与新工具。
Nucleic Acids Res. 2017 Jan 4;45(D1):D380-D388. doi: 10.1093/nar/gkw952. Epub 2016 Oct 19.
4
The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.2017年的STRING数据库:质量可控的蛋白质-蛋白质相互作用网络,广泛可用。
Nucleic Acids Res. 2017 Jan 4;45(D1):D362-D368. doi: 10.1093/nar/gkw937. Epub 2016 Oct 18.
5
Ensembl 2017.Ensembl 2017年
Nucleic Acids Res. 2017 Jan 4;45(D1):D635-D642. doi: 10.1093/nar/gkw1104. Epub 2016 Nov 28.
6
Expansion of the Gene Ontology knowledgebase and resources.基因本体知识库及资源的扩展。
Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338. doi: 10.1093/nar/gkw1108. Epub 2016 Nov 29.
7
Overview of the interactive task in BioCreative V.生物创意V中交互式任务概述。
Database (Oxford). 2016 Sep 1;2016. doi: 10.1093/database/baw119. Print 2016.
8
EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation.摘要:用于宏基因组样本注释的环境元数据交互式提取和术语建议
Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/baw005. Print 2016.
9
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D7-19. doi: 10.1093/nar/gkv1290. Epub 2015 Nov 28.
10
STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data.STITCH 5:利用组织和亲和力数据扩充蛋白质-化学相互作用网络。
Nucleic Acids Res. 2016 Jan 4;44(D1):D380-4. doi: 10.1093/nar/gkv1277. Epub 2015 Nov 20.

一个快速、强大的命名实体识别网络服务的设计、实现与运营。

Design, implementation, and operation of a rapid, robust named entity recognition web service.

作者信息

Pletscher-Frankild Sune, Jensen Lars Juhl

机构信息

Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.

Intomics A/S, Lyngby, Denmark.

出版信息

J Cheminform. 2019 Mar 8;11(1):19. doi: 10.1186/s13321-019-0344-9.

DOI:10.1186/s13321-019-0344-9
PMID:30850898
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6419787/
Abstract

Most BioCreative tasks to date have focused on assessing the quality of text-mining annotations in terms of precision and recall. Interoperability, speed, and stability are, however, other important factors to consider for practical applications of text mining. For about a decade, we have run named entity recognition (NER) web services, which are designed to be efficient, implemented using a multi-threaded queueing system to robustly handle many simultaneous requests, and hosted at a supercomputer facility. To participate in this new task, we extended the existing NER tagging service with support for the BeCalm API. The tagger suffered no downtime during the challenge and, as in earlier tests, proved to be highly efficient, consistently processing requests of 5000 abstracts in less than half a minute. In fact, the majority of this time was spent not on the NER task but rather on retrieving the document texts from the challenge servers. The latter was found to be the main bottleneck even when hosting a copy of the tagging service on a Raspberry Pi 3, showing that local document storage or caching would be desirable features to include in future revisions of the API standard.

摘要

到目前为止,大多数生物创意任务都集中在根据精确率和召回率来评估文本挖掘注释的质量。然而,对于文本挖掘的实际应用而言,互操作性、速度和稳定性是其他需要考虑的重要因素。大约十年来,我们一直在运行命名实体识别(NER)网络服务,该服务设计高效,使用多线程排队系统实现,以稳健处理许多同时请求,并托管在超级计算机设施中。为了参与这项新任务,我们扩展了现有的NER标记服务,以支持BeCalm API。在挑战期间,标记器没有停机,并且和早期测试一样,被证明效率很高,始终能在不到半分钟的时间内处理5000篇摘要的请求。实际上,这段时间的大部分并非花在NER任务上,而是花在从挑战服务器检索文档文本上。即使在树莓派3上托管标记服务的副本时,后者也被发现是主要瓶颈,这表明本地文档存储或缓存将是未来API标准修订版中值得纳入的功能。