Causal graph extraction from news: a comparative study of time-series causality learning techniques.

Authors

Maisonnave Mariano, Delbianco Fernando, Tohme Fernando, Milios Evangelos, Maguitman Ana G

Affiliations

Departamento de Ciencias e Ingeniería de la Computación, Universidad Nacional del Sur, Bahía Blanca, Buenos Aires, Argentina.

Faculty of Computer Science, Dalhousie University, Halifax, Canada.

Publication information

PeerJ Comput Sci. 2022 Aug 3;8:e1066. doi: 10.7717/peerj-cs.1066. eCollection 2022.

DOI:10.7717/peerj-cs.1066
PMID:35967930
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9374167/
Abstract

Causal graph extraction from news has the potential to aid in the understanding of complex scenarios. In particular, it can help explain and predict events, as well as conjecture about possible cause-effect connections. However, limited work has addressed the problem of large-scale extraction of causal graphs from news articles. This article presents a novel framework for extracting causal graphs from digital text media. The framework relies on topic-relevant variables representing terms and ongoing events that are selected from a domain under analysis by applying specially developed information retrieval and natural language processing methods. Events are represented as event-phrase embeddings, which make it possible to group similar events into semantically cohesive clusters. A time series of the selected variables is given as input to a causal structure learning technique to learn a causal graph associated with the topic that is being examined. The complete framework is applied to the New York Times dataset, which covers news for a period of 246 months (roughly 20 years), and is illustrated through a case study. An initial evaluation based on synthetic data is carried out to gain insight into the most effective time-series causality learning techniques. This evaluation comprises a systematic analysis of nine state-of-the-art causal structure learning techniques and two novel ensemble methods derived from the most effective techniques. Subsequently, the complete framework based on the most promising causal structure learning technique is evaluated with domain experts in a real-world scenario through the use of the presented case study. The proposed analysis offers valuable insights into the problems of identifying topic-relevant variables from large volumes of news and learning causal graphs from time series.
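
The last stage described above, feeding a time series of variables to a causal structure learner, can be illustrated with a minimal sketch. The abstract does not name the nine techniques it compares, so the pairwise Granger-style lag-1 test below is only a stand-in chosen for brevity, not the authors' method. The function names (`rss`, `lagged_edges`), the threshold of 15, and the synthetic monthly series `x` and `y` are all assumptions made for this example.

```python
import numpy as np


def rss(target, predictors):
    """Residual sum of squares of an ordinary least-squares fit with intercept."""
    design = np.column_stack([np.ones(len(target))] + predictors)
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    resid = target - design @ coef
    return float(resid @ resid)


def lagged_edges(series, threshold=15.0):
    """Return directed edges (cause, effect) found by a lag-1 Granger-style F-test.

    `series` maps a variable name to a 1-D array of equal length.
    `threshold` is a hypothetical cut-off on the F statistic.
    """
    edges = set()
    for cause, x in series.items():
        for effect, y in series.items():
            if cause == effect:
                continue
            y_now, y_prev, x_prev = y[1:], y[:-1], x[:-1]
            # Restricted model: the effect explained by its own past only.
            rss_restricted = rss(y_now, [y_prev])
            # Full model: the effect's own past plus the candidate cause's past.
            rss_full = rss(y_now, [y_prev, x_prev])
            dof = len(y_now) - 3  # three fitted parameters in the full model
            f_stat = (rss_restricted - rss_full) / (rss_full / dof)
            if f_stat > threshold:
                edges.add((cause, effect))
    return edges


# Synthetic example: 246 monthly observations in which x drives y with a one-month lag.
rng = np.random.default_rng(0)
n_months = 246
x = rng.normal(size=n_months)
y = np.zeros(n_months)
y[1:] = 0.8 * x[:-1] + 0.5 * rng.normal(size=n_months - 1)

edges = lagged_edges({"x": x, "y": y})
print(edges)
```

A pairwise test like this ignores confounding by third variables and considers only one lag, which is one reason a study would compare several dedicated structure learning techniques rather than rely on it.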

