• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Marky:一种支持多用户和迭代文档注释项目中注释一致性的工具。

Marky: a tool supporting annotation consistency in multi-user and iterative document annotation projects.

作者信息

Pérez-Pérez Martín, Glez-Peña Daniel, Fdez-Riverola Florentino, Lourenço Anália

机构信息

ESEI - Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, Universidad de Vigo, 32004 Ourense, Spain(1).

ESEI - Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n, Universidad de Vigo, 32004 Ourense, Spain(1); Centre of Biological Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal.

出版信息

Comput Methods Programs Biomed. 2015 Feb;118(2):242-51. doi: 10.1016/j.cmpb.2014.11.005. Epub 2014 Nov 25.

DOI:10.1016/j.cmpb.2014.11.005
PMID:25480679
Abstract

BACKGROUND AND OBJECTIVES

Document annotation is a key task in the development of Text Mining methods and applications. High quality annotated corpora are invaluable, but their preparation requires a considerable amount of resources and time. Although the existing annotation tools offer good user interaction interfaces to domain experts, project management and quality control abilities are still limited. Therefore, the current work introduces Marky, a new Web-based document annotation tool equipped to manage multi-user and iterative projects, and to evaluate annotation quality throughout the project life cycle.

METHODS

At the core, Marky is a Web application based on the open source CakePHP framework. User interface relies on HTML5 and CSS3 technologies. Rangy library assists in browser-independent implementation of common DOM range and selection tasks, and Ajax and JQuery technologies are used to enhance user-system interaction.

RESULTS

Marky grants solid management of inter- and intra-annotator work. Most notably, its annotation tracking system supports systematic and on-demand agreement analysis and annotation amendment. Each annotator may work over documents as usual, but all the annotations made are saved by the tracking system and may be further compared. So, the project administrator is able to evaluate annotation consistency among annotators and across rounds of annotation, while annotators are able to reject or amend subsets of annotations made in previous rounds. As a side effect, the tracking system minimises resource and time consumption.

CONCLUSIONS

Marky is a novel environment for managing multi-user and iterative document annotation projects. Compared to other tools, Marky offers a similar visually intuitive annotation experience while providing unique means to minimise annotation effort and enforce annotation quality, and therefore corpus consistency. Marky is freely available for non-commercial use at http://sing.ei.uvigo.es/marky.

摘要

背景与目标

文档注释是文本挖掘方法与应用开发中的一项关键任务。高质量的注释语料库非常宝贵,但其准备工作需要大量资源和时间。尽管现有注释工具为领域专家提供了良好的用户交互界面,但项目管理和质量控制能力仍然有限。因此,当前工作引入了Marky,这是一种基于网络的新型文档注释工具,能够管理多用户和迭代项目,并在项目生命周期内评估注释质量。

方法

Marky的核心是一个基于开源CakePHP框架的网络应用程序。用户界面依赖于HTML5和CSS3技术。Rangy库有助于在与浏览器无关的情况下实现常见的DOM范围和选择任务,Ajax和JQuery技术用于增强用户与系统的交互。

结果

Marky能够对注释者之间和内部的工作进行可靠管理。最值得注意的是,其注释跟踪系统支持系统的和按需的一致性分析以及注释修正。每个注释者可以像往常一样处理文档,但所做的所有注释都会由跟踪系统保存,并可以进一步比较。因此,项目管理员能够评估注释者之间以及多轮注释之间的注释一致性,而注释者能够拒绝或修正前几轮中所做注释的子集。作为一个附带效果,跟踪系统将资源和时间消耗降至最低。

结论

Marky是管理多用户和迭代文档注释项目的一个新颖环境。与其他工具相比,Marky提供了类似的视觉直观注释体验,同时提供了独特的方法来最小化注释工作量并确保注释质量,从而保证语料库的一致性。Marky可在http://sing.ei.uvigo.es/marky免费用于非商业用途。

相似文献

1
Marky: a tool supporting annotation consistency in multi-user and iterative document annotation projects.Marky:一种支持多用户和迭代文档注释项目中注释一致性的工具。
Comput Methods Programs Biomed. 2015 Feb;118(2):242-51. doi: 10.1016/j.cmpb.2014.11.005. Epub 2014 Nov 25.
2
TeamTat: a collaborative text annotation tool.TeamTat:一个协作文本注释工具。
Nucleic Acids Res. 2020 Jul 2;48(W1):W5-W11. doi: 10.1093/nar/gkaa333.
3
BIOMedical Search Engine Framework: Lightweight and customized implementation of domain-specific biomedical search engines.生物医学搜索引擎框架:特定领域生物医学搜索引擎的轻量级定制实现。
Comput Methods Programs Biomed. 2016 Jul;131:63-77. doi: 10.1016/j.cmpb.2016.03.030. Epub 2016 Apr 8.
4
The EU-ADR corpus: annotated drugs, diseases, targets, and their relationships.欧盟不良反应数据库:标注药物、疾病、靶点及其相互关系。
J Biomed Inform. 2012 Oct;45(5):879-84. doi: 10.1016/j.jbi.2012.04.004. Epub 2012 Apr 25.
5
De-identification of clinical notes in French: towards a protocol for reference corpus development.法语临床记录的去识别化:迈向参考语料库开发协议
J Biomed Inform. 2014 Aug;50:151-61. doi: 10.1016/j.jbi.2013.12.014. Epub 2013 Dec 29.
6
GeneTools--application for functional annotation and statistical hypothesis testing.基因工具——用于功能注释和统计假设检验的应用程序。
BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470.
7
ESTIMA, a tool for EST management in a multi-project environment.ESTIMA,一种用于多项目环境中EST管理的工具。
BMC Bioinformatics. 2004 Nov 4;5:176. doi: 10.1186/1471-2105-5-176.
8
Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction.PubMed 查询的半自动语义标注:一项关于质量、效率和满意度的研究。
J Biomed Inform. 2011 Apr;44(2):310-8. doi: 10.1016/j.jbi.2010.11.001. Epub 2010 Nov 20.
9
Integrating UIMA annotators in a web-based text processing framework.将UIMA注释器集成到基于Web的文本处理框架中。
Stud Health Technol Inform. 2013;192:1191.
10
Syntactic parsing of clinical text: guideline and corpus development with handling ill-formed sentences.临床文本的句法分析:处理不规范句子的指南和语料库开发。
J Am Med Inform Assoc. 2013 Nov-Dec;20(6):1168-77. doi: 10.1136/amiajnl-2013-001810. Epub 2013 Aug 1.

引用本文的文献

1
MedTAG: a portable and customizable annotation tool for biomedical documents.MedTAG:一个用于生物医学文档的可移植和可定制的注释工具。
BMC Med Inform Decis Mak. 2021 Dec 18;21(1):352. doi: 10.1186/s12911-021-01706-4.
2
Markup: A Web-Based Annotation Tool Powered by Active Learning.标记:一种由主动学习驱动的基于网络的注释工具。
Front Digit Health. 2021 Jul 26;3:598916. doi: 10.3389/fdgth.2021.598916. eCollection 2021.
3
TeamTat: a collaborative text annotation tool.TeamTat:一个协作文本注释工具。
Nucleic Acids Res. 2020 Jul 2;48(W1):W5-W11. doi: 10.1093/nar/gkaa333.
4
An extensive review of tools for manual annotation of documents.对文档手动标注工具的全面回顾。
Brief Bioinform. 2021 Jan 18;22(1):146-163. doi: 10.1093/bib/bbz130.
5
Collaborative relation annotation and quality analysis in Markyt environment.马克提环境中的协作关系标注与质量分析。
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax090.
6
Construction of antimicrobial peptide-drug combination networks from scientific literature based on a semi-automated curation workflow.基于半自动整理工作流程,从科学文献构建抗菌肽-药物组合网络。
Database (Oxford). 2016 Dec 26;2016. doi: 10.1093/database/baw143. Print 2016.
7
The Markyt visualisation, prediction and benchmark platform for chemical and gene entity recognition at BioCreative/CHEMDNER challenge.用于生物创意/化学命名实体识别挑战赛中化学和基因实体识别的Markyt可视化、预测和基准测试平台。
Database (Oxford). 2016 Aug 19;2016. doi: 10.1093/database/baw120. Print 2016.