• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个用于文档级中文金融事件提取的数据集。

A dataset for document level Chinese financial event extraction.

作者信息

Chen Yubo, Zhou Tong, Li Sirui, Zhao Jun

机构信息

The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China.

School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 101408, China.

出版信息

Sci Data. 2025 May 10;12(1):772. doi: 10.1038/s41597-025-05083-9.

DOI:10.1038/s41597-025-05083-9
PMID:40348739
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12065892/
Abstract

Financial event modeling is fundamental to financial investment decisions and risk management, crucial for the stability and growth of financial institutions, and helps ensure the stability and quality of people's lives. Utilizing state-of-the-art natural language processing techniques for automated financial event extraction addresses the inefficiencies and high costs associated with traditional event identification and modeling, which rely heavily on financial domain experts. However, existing datasets fail to tackle the issues with long documents in practical situations. To address this, we first propose DocFEE, a large-scale Document-level Chinese Financial Event Extraction dataset. It reflects the length of announcement documents and the long-distance dependencies of event arguments in real-world scenarios.

摘要

金融事件建模是金融投资决策和风险管理的基础,对金融机构的稳定和发展至关重要,并有助于确保人们生活的稳定和质量。利用最先进的自然语言处理技术进行自动金融事件提取,解决了与传统事件识别和建模相关的效率低下和成本高昂的问题,传统方法严重依赖金融领域专家。然而,现有数据集未能解决实际情况下长文档的问题。为了解决这个问题,我们首先提出了DocFEE,一个大规模的文档级中文金融事件提取数据集。它反映了公告文档的长度以及现实场景中事件论据的长距离依赖关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/44ba440a0fab/41597_2025_5083_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/8414385c58a4/41597_2025_5083_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/bd4860bce242/41597_2025_5083_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/6eadf781f2fc/41597_2025_5083_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/de017433a8fe/41597_2025_5083_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/f62ae1047da8/41597_2025_5083_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/2b44d319a405/41597_2025_5083_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/045e3b671606/41597_2025_5083_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/44ba440a0fab/41597_2025_5083_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/8414385c58a4/41597_2025_5083_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/bd4860bce242/41597_2025_5083_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/6eadf781f2fc/41597_2025_5083_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/de017433a8fe/41597_2025_5083_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/f62ae1047da8/41597_2025_5083_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/2b44d319a405/41597_2025_5083_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/045e3b671606/41597_2025_5083_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91dd/12065892/44ba440a0fab/41597_2025_5083_Fig8_HTML.jpg

相似文献

1
A dataset for document level Chinese financial event extraction.一个用于文档级中文金融事件提取的数据集。
Sci Data. 2025 May 10;12(1):772. doi: 10.1038/s41597-025-05083-9.
2
Closed-domain event extraction for hard news event monitoring: a systematic study.用于硬新闻事件监测的封闭域事件抽取:一项系统研究。
PeerJ Comput Sci. 2024 Oct 10;10:e2355. doi: 10.7717/peerj-cs.2355. eCollection 2024.
3
MFEE: a multi-word lexical feature enhancement framework for Chinese geological hazard event extraction.MFEE:一种用于中文地质灾害事件提取的多词词汇特征增强框架。
PeerJ Comput Sci. 2023 Mar 13;9:e1275. doi: 10.7717/peerj-cs.1275. eCollection 2023.
4
Improving unified information extraction in Chinese mental health domain with instruction-tuned LLMs and type-verification component.使用指令微调的语言模型和类型验证组件改进中文心理健康领域的统一信息提取
Artif Intell Med. 2025 Apr;162:103087. doi: 10.1016/j.artmed.2025.103087. Epub 2025 Feb 19.
5
Joint variation and ZhuYin dataset for Traditional Chinese document enhancement.用于中医文献增强的联合变化与诸因数据集。
Sci Data. 2024 Nov 27;11(1):1295. doi: 10.1038/s41597-024-04146-7.
6
TGIN: Document-level event extraction with two-phase graph inference network.TGIN:基于两阶段图推理网络的文档级事件抽取。
Neural Netw. 2024 Aug;176:106343. doi: 10.1016/j.neunet.2024.106343. Epub 2024 Apr 25.
7
Active learning for ontological event extraction incorporating named entity recognition and unknown word handling.结合命名实体识别和未知词处理的本体事件抽取的主动学习
J Biomed Semantics. 2016 Apr 27;7:22. doi: 10.1186/s13326-016-0059-z. eCollection 2016.
8
Advancing document-level event extraction: Integration across texts and reciprocal feedback.
Math Biosci Eng. 2023 Nov 3;20(11):20050-20072. doi: 10.3934/mbe.2023888.
9
A large-scale dataset for Chinese historical document recognition and analysis.一个用于中国历史文献识别与分析的大规模数据集。
Sci Data. 2025 Jan 29;12(1):169. doi: 10.1038/s41597-025-04495-x.
10
Semantic aware enhanced event causality identification.语义感知增强的事件因果关系识别。
Sci Rep. 2024 Dec 30;14(1):32078. doi: 10.1038/s41598-024-83678-9.

本文引用的文献

1
The psychophysiology of real-time financial risk processing.实时金融风险处理的心理生理学
J Cogn Neurosci. 2002 Apr 1;14(3):323-39. doi: 10.1162/089892902317361877.