• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于硬新闻事件监测的封闭域事件抽取:一项系统研究。

Closed-domain event extraction for hard news event monitoring: a systematic study.

作者信息

Dukić David, Došilović Filip Karlo, Pluščec Domagoj, Šnajder Jan

机构信息

University of Zagreb Faculty of Electrical Engineering and Computing, TakeLab, Zagreb, Croatia.

出版信息

PeerJ Comput Sci. 2024 Oct 10;10:e2355. doi: 10.7717/peerj-cs.2355. eCollection 2024.

DOI:10.7717/peerj-cs.2355
PMID:39650353
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11623237/
Abstract

News event monitoring systems allow real-time monitoring of a large number of events reported in the news, including the urgent and critical events comprising the so-called hard news. These systems heavily rely on natural language processing (NLP) to perform automatic event extraction at scale. While state-of-the-art event extraction models are readily available, integrating them into a news event monitoring system is not as straightforward as it seems due to practical issues related to model selection, robustness, and scale. To address this gap, we present a study on the practical use of event extraction models for news event monitoring. Our study focuses on the key task of closed-domain main event extraction (CDMEE), which aims to determine the type of the story's main event and extract its arguments from the text. We evaluate a range of state-of-the-art NLP models for this task, including those based on pre-trained language models. Aiming at a more realistic evaluation than done in the literature, we introduce a new dataset manually labeled with event types and their arguments. Additionally, we assess the scalability of CDMEE models and analyze the trade-off between accuracy and inference speed. Our results give insights into the performance of state-of-the-art NLP models on the CDMEE task and provide recommendations for developing effective, robust, and scalable news event monitoring systems.

摘要

新闻事件监测系统允许对新闻中报道的大量事件进行实时监测,包括构成所谓硬新闻的紧急和关键事件。这些系统严重依赖自然语言处理(NLP)来大规模执行自动事件提取。虽然最先进的事件提取模型很容易获得,但由于与模型选择、鲁棒性和规模相关的实际问题,将它们集成到新闻事件监测系统中并不像看起来那么简单。为了弥补这一差距,我们提出了一项关于事件提取模型在新闻事件监测中的实际应用的研究。我们的研究重点是封闭域主要事件提取(CDMEE)的关键任务,该任务旨在确定故事主要事件的类型并从文本中提取其论据。我们针对此任务评估了一系列最先进的NLP模型,包括基于预训练语言模型的模型。为了进行比文献中更现实的评估,我们引入了一个新的数据集,该数据集手动标注了事件类型及其论据。此外,我们评估了CDMEE模型的可扩展性,并分析了准确性和推理速度之间的权衡。我们的结果深入了解了最先进的NLP模型在CDMEE任务上的性能,并为开发有效、鲁棒和可扩展的新闻事件监测系统提供了建议。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/eb37ccf28263/peerj-cs-10-2355-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/1b77153696a1/peerj-cs-10-2355-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/42b53e2c78dc/peerj-cs-10-2355-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/b388785ac634/peerj-cs-10-2355-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/eb37ccf28263/peerj-cs-10-2355-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/1b77153696a1/peerj-cs-10-2355-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/42b53e2c78dc/peerj-cs-10-2355-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/b388785ac634/peerj-cs-10-2355-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04e6/11623237/eb37ccf28263/peerj-cs-10-2355-g004.jpg

相似文献

1
Closed-domain event extraction for hard news event monitoring: a systematic study.用于硬新闻事件监测的封闭域事件抽取:一项系统研究。
PeerJ Comput Sci. 2024 Oct 10;10:e2355. doi: 10.7717/peerj-cs.2355. eCollection 2024.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
4
Extraction of Information Related to Adverse Drug Events from Electronic Health Record Notes: Design of an End-to-End Model Based on Deep Learning.从电子健康记录笔记中提取与药物不良事件相关的信息:基于深度学习的端到端模型设计
JMIR Med Inform. 2018 Nov 26;6(4):e12159. doi: 10.2196/12159.
5
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
6
Targeting COVID-19 and Human Resources for Health News Information Extraction: Algorithm Development and Validation.针对新冠肺炎与卫生新闻信息提取的卫生人力资源:算法开发与验证
JMIR AI. 2024 Oct 30;3:e55059. doi: 10.2196/55059.
7
Semantic aware enhanced event causality identification.语义感知增强的事件因果关系识别。
Sci Rep. 2024 Dec 30;14(1):32078. doi: 10.1038/s41598-024-83678-9.
8
A systematic review of natural language processing for classification tasks in the field of incident reporting and adverse event analysis.自然语言处理在事件报告和不良事件分析领域分类任务中的系统评价
Int J Med Inform. 2019 Dec;132:103971. doi: 10.1016/j.ijmedinf.2019.103971. Epub 2019 Oct 5.
9
Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques.利用自然语言处理技术从韩国不良事件报告系统的不良药物事件叙述中自动提取全面的药物安全信息。
Drug Saf. 2023 Aug;46(8):781-795. doi: 10.1007/s40264-023-01323-2. Epub 2023 Jun 17.
10
Multilingual event extraction for epidemic detection.用于疫情检测的多语言事件提取
Artif Intell Med. 2015 Oct;65(2):131-43. doi: 10.1016/j.artmed.2015.06.005. Epub 2015 Jul 17.

本文引用的文献

1
A Survey on Deep Learning Event Extraction: Approaches and Applications.深度学习事件抽取研究综述:方法与应用
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6301-6321. doi: 10.1109/TNNLS.2022.3213168. Epub 2024 May 2.
2
The measurement of observer agreement for categorical data.分类数据观察者一致性的测量。
Biometrics. 1977 Mar;33(1):159-74.