• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Argonaut:一个集成的、交互的、基于文本挖掘的工作平台,支持管理。

Argo: an integrative, interactive, text mining-based workbench supporting curation.

机构信息

National Centre for Text Mining and School of Computer Science, University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK.

出版信息

Database (Oxford). 2012 Mar 20;2012:bas010. doi: 10.1093/database/bas010. Print 2012.

DOI:10.1093/database/bas010
PMID:22434844
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3308166/
Abstract

Curation of biomedical literature is often supported by the automatic analysis of textual content that generally involves a sequence of individual processing components. Text mining (TM) has been used to enhance the process of manual biocuration, but has been focused on specific databases and tasks rather than an environment integrating TM tools into the curation pipeline, catering for a variety of tasks, types of information and applications. Processing components usually come from different sources and often lack interoperability. The well established Unstructured Information Management Architecture is a framework that addresses interoperability by defining common data structures and interfaces. However, most of the efforts are targeted towards software developers and are not suitable for curators, or are otherwise inconvenient to use on a higher level of abstraction. To overcome these issues we introduce Argo, an interoperable, integrative, interactive and collaborative system for text analysis with a convenient graphic user interface to ease the development of processing workflows and boost productivity in labour-intensive manual curation. Robust, scalable text analytics follow a modular approach, adopting component modules for distinct levels of text analysis. The user interface is available entirely through a web browser that saves the user from going through often complicated and platform-dependent installation procedures. Argo comes with a predefined set of processing components commonly used in text analysis, while giving the users the ability to deposit their own components. The system accommodates various areas and levels of user expertise, from TM and computational linguistics to ontology-based curation. One of the key functionalities of Argo is its ability to seamlessly incorporate user-interactive components, such as manual annotation editors, into otherwise completely automatic pipelines. As a use case, we demonstrate the functionality of an in-built manual annotation editor that is well suited for in-text corpus annotation tasks. DATABASE URL: http://www.nactem.ac.uk/Argo.

摘要

生物医学文献的编纂通常需要借助对文本内容的自动分析来完成,而这通常涉及一系列单独的处理组件。文本挖掘 (TM) 已被用于增强手动生物编纂的过程,但它主要集中在特定的数据库和任务上,而不是在一个集成 TM 工具的编纂流程的环境中,以满足各种任务、信息类型和应用的需求。处理组件通常来自不同的来源,并且往往缺乏互操作性。成熟的非结构化信息管理架构 (Unstructured Information Management Architecture) 是一个通过定义通用数据结构和接口来解决互操作性问题的框架。然而,大多数努力都针对软件开发人员,而不适合编纂人员,或者在更高的抽象层次上使用起来很不方便。为了解决这些问题,我们引入了 Argo,这是一个用于文本分析的可互操作、集成、交互和协作的系统,它具有方便的图形用户界面,可以简化处理工作流程的开发,并在劳动密集型的手动编纂中提高生产力。稳健、可扩展的文本分析采用模块化方法,采用组件模块来进行不同层次的文本分析。用户界面完全可以通过网络浏览器使用,用户无需经历通常复杂且依赖于平台的安装过程。Argo 提供了一组常用的处理组件,同时允许用户存储自己的组件。该系统可以适应各种领域和用户专业水平,从 TM 和计算语言学到基于本体的编纂。Argo 的一个关键功能是能够将用户交互组件(例如手动注释编辑器)无缝地集成到其他完全自动化的管道中。作为一个用例,我们展示了内置手动注释编辑器的功能,它非常适合用于文本内语料库注释任务。数据库 URL:http://www.nactem.ac.uk/Argo。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/efe80e177d43/bas010f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/07b587130ca3/bas010f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/d94fe7bcf9c5/bas010f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/efe80e177d43/bas010f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/07b587130ca3/bas010f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/d94fe7bcf9c5/bas010f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18a2/3308166/efe80e177d43/bas010f3.jpg

相似文献

1
Argo: an integrative, interactive, text mining-based workbench supporting curation.Argonaut:一个集成的、交互的、基于文本挖掘的工作平台,支持管理。
Database (Oxford). 2012 Mar 20;2012:bas010. doi: 10.1093/database/bas010. Print 2012.
2
Text-mining-assisted biocuration workflows in Argo.阿尔戈中基于文本挖掘的生物编目工作流程。
Database (Oxford). 2014 Jul 18;2014. doi: 10.1093/database/bau070. Print 2014.
3
Argo: enabling the development of bespoke workflows and services for disease annotation.阿尔戈:助力开发用于疾病注释的定制工作流程和服务。
Database (Oxford). 2016 May 17;2016. doi: 10.1093/database/baw066. Print 2016.
4
Processing biological literature with customizable Web services supporting interoperable formats.通过支持可互操作格式的可定制网络服务处理生物学文献。
Database (Oxford). 2014 Jul 8;2014. doi: 10.1093/database/bau064. Print 2014.
5
Egas: a collaborative and interactive document curation platform.Egas:一个协作式交互式文档管理平台。
Database (Oxford). 2014 Jun 11;2014. doi: 10.1093/database/bau048. Print 2014.
6
Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature.Textpresso 中心:一个可定制的平台,用于搜索、文本挖掘、查看和管理生物医学文献。
BMC Bioinformatics. 2018 Mar 9;19(1):94. doi: 10.1186/s12859-018-2103-8.
7
An overview of the BioCreative 2012 Workshop Track III: interactive text mining task.BioCreative 2012 研讨会第三轨道:交互式文本挖掘任务概述。
Database (Oxford). 2013 Jan 17;2013:bas056. doi: 10.1093/database/bas056. Print 2013.
8
@Note: a workbench for biomedical text mining.注意:一个用于生物医学文本挖掘的工作台。
J Biomed Inform. 2009 Aug;42(4):710-20. doi: 10.1016/j.jbi.2009.04.002. Epub 2009 Apr 22.
9
LiverCancerMarkerRIF: a liver cancer biomarker interactive curation system combining text mining and expert annotations.肝癌标志物资源整合框架(LiverCancerMarkerRIF):一种结合文本挖掘和专家注释的肝癌生物标志物交互式管理系统。
Database (Oxford). 2014 Aug 27;2014. doi: 10.1093/database/bau085. Print 2014.
10
tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles.tagtog:在 PLoS 全文文章中进行基因提及的交互式和文本挖掘辅助注释。
Database (Oxford). 2014 Apr 7;2014(0):bau033. doi: 10.1093/database/bau033. Print 2014.

引用本文的文献

1
MedTAG: a portable and customizable annotation tool for biomedical documents.MedTAG:一个用于生物医学文档的可移植和可定制的注释工具。
BMC Med Inform Decis Mak. 2021 Dec 18;21(1):352. doi: 10.1186/s12911-021-01706-4.
2
BENTO: A Visual Platform for Building Clinical NLP Pipelines Based on CodaLab.BENTO:一个基于CodaLab构建临床自然语言处理管道的可视化平台。
Proc Conf Assoc Comput Linguist Meet. 2020 Jul;2020:95-100. doi: 10.18653/v1/2020.acl-demos.13.
3
Constructing knowledge graphs and their biomedical applications.构建知识图谱及其生物医学应用。

本文引用的文献

1
Using workflows to explore and optimise named entity recognition for chemistry.使用工作流探索和优化化学命名实体识别。
PLoS One. 2011;6(5):e20181. doi: 10.1371/journal.pone.0020181. Epub 2011 May 25.
2
Event extraction for systems biology by text mining the literature.通过文献挖掘进行系统生物学的事件抽取。
Trends Biotechnol. 2010 Jul;28(7):381-90. doi: 10.1016/j.tibtech.2010.04.005. Epub 2010 Jun 1.
3
Galaxy: a web-based genome analysis tool for experimentalists.Galaxy:一款面向实验人员的基于网络的基因组分析工具。
Comput Struct Biotechnol J. 2020 Jun 2;18:1414-1428. doi: 10.1016/j.csbj.2020.05.017. eCollection 2020.
4
TeamTat: a collaborative text annotation tool.TeamTat:一个协作文本注释工具。
Nucleic Acids Res. 2020 Jul 2;48(W1):W5-W11. doi: 10.1093/nar/gkaa333.
5
Health Professionals' Perception about Big Data Technology in Greece.希腊医疗专业人员对大数据技术的认知
Acta Inform Med. 2020 Mar;28(1):48-51. doi: 10.5455/aim.2020.28.48-51.
6
Annotating and detecting phenotypic information for chronic obstructive pulmonary disease.标注与检测慢性阻塞性肺疾病的表型信息。
JAMIA Open. 2019 Apr 26;2(2):261-271. doi: 10.1093/jamiaopen/ooz009. eCollection 2019 Jul.
7
An extensive review of tools for manual annotation of documents.对文档手动标注工具的全面回顾。
Brief Bioinform. 2021 Jan 18;22(1):146-163. doi: 10.1093/bib/bbz130.
8
PubTator central: automated concept annotation for biomedical full text articles.PubTator 中心:用于生物医学全文文章的自动概念标注。
Nucleic Acids Res. 2019 Jul 2;47(W1):W587-W593. doi: 10.1093/nar/gkz389.
9
Open Agile text mining for bioinformatics: the PubAnnotation ecosystem.开放的生物信息学敏捷文本挖掘:PubAnnotation 生态系统。
Bioinformatics. 2019 Nov 1;35(21):4372-4380. doi: 10.1093/bioinformatics/btz227.
10
COPIOUS: A gold standard corpus of named entities towards extracting species occurrence from biodiversity literature.COPIOUS:一个用于从生物多样性文献中提取物种出现信息的命名实体黄金标准语料库。
Biodivers Data J. 2019 Jan 22(7):e29626. doi: 10.3897/BDJ.7.e29626. eCollection 2019.
Curr Protoc Mol Biol. 2010 Jan;Chapter 19:Unit 19.10.1-21. doi: 10.1002/0471142727.mb1910s89.
4
Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD).文本挖掘和化学-基因-疾病网络的人工整理用于比较毒理学基因组数据库(CTD)。
BMC Bioinformatics. 2009 Oct 8;10:326. doi: 10.1186/1471-2105-10-326.
5
Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation.蛋白质亚细胞定位的半自动管理:一种基于文本挖掘的基因本体论(GO)细胞组分管理方法。
BMC Bioinformatics. 2009 Jul 21;10:228. doi: 10.1186/1471-2105-10-228.
6
U-Compare: share and compare text mining tools with UIMA.U-Compare:与 UIMA 共享和比较文本挖掘工具。
Bioinformatics. 2009 Aug 1;25(15):1997-8. doi: 10.1093/bioinformatics/btp289. Epub 2009 May 4.
7
FACTA: a text search engine for finding associated biomedical concepts.FACTA:一个用于查找相关生物医学概念的文本搜索引擎。
Bioinformatics. 2008 Nov 1;24(21):2559-60. doi: 10.1093/bioinformatics/btn469. Epub 2008 Sep 4.
8
New challenges for text mining: mapping between text and manually curated pathways.文本挖掘的新挑战:文本与人工整理通路之间的映射
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-9-S3-S5.
9
Natural language processing in aid of FlyBase curators.自然语言处理辅助果蝇数据库(FlyBase)的编辑人员。
BMC Bioinformatics. 2008 Apr 14;9:193. doi: 10.1186/1471-2105-9-193.
10
An open-source framework for large-scale, flexible evaluation of biomedical text mining systems.一个用于大规模、灵活评估生物医学文本挖掘系统的开源框架。
J Biomed Discov Collab. 2008 Jan 29;3:1. doi: 10.1186/1747-5333-3-1.