• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

证据三角测量器:利用大语言模型跨研究设计提取和综合因果证据。

Evidence triangulator: using large language models to extract and synthesize causal evidence across study designs.

作者信息

Shi Xuanyu, Zhao Wenjing, Chen Ting, Yang Chao, Du Jian

机构信息

Institute of Medical Technology, Peking University, Beijing, China.

National Institute of Health Data Science, Peking University, Beijing, China.

出版信息

Nat Commun. 2025 Aug 9;16(1):7355. doi: 10.1038/s41467-025-62783-x.

DOI:10.1038/s41467-025-62783-x
PMID:40783407
Abstract

Health strategies increasingly emphasize both behavioural and biomedical interventions, yet the complex and often contradictory guidance on diet, behavior, and health outcomes complicates evidence-based decision-making. Evidence triangulation across diverse study designs is essential for balancing biases and establishing causality, but scalable, automated methods for achieving this are lacking. In this study, we assess the performance of large language models in extracting both ontological and methodological information from scientific literature to automate evidence triangulation. A two-step extraction approach-focusing on exposure-outcome concepts first, followed by relation extraction-outperforms a one-step method, particularly in identifying the direction of effect (F1 = 0.86) and statistical significance (F1 = 0.96). Using salt intake and blood pressure as a case study, we calculate the Convergency of Evidence and Level of Convergency, finding a strong excitatory effect of salt on blood pressure (942 studies), and weak excitatory effect on cardiovascular diseases and deaths (124 studies). This approach complements traditional meta-analyses by integrating evidence across study designs, and enabling rapid, dynamic assessment of scientific controversies.

摘要

健康策略越来越强调行为和生物医学干预措施,然而,关于饮食、行为和健康结果的复杂且常常相互矛盾的指导意见,使得基于证据的决策变得复杂。跨多种研究设计进行证据三角验证对于平衡偏差和确定因果关系至关重要,但缺乏可扩展的自动化方法来实现这一点。在本研究中,我们评估了大语言模型从科学文献中提取本体论和方法论信息以实现证据三角验证自动化的性能。一种两步提取方法——首先关注暴露-结果概念,然后进行关系提取——优于一步法,特别是在确定效应方向(F1 = 0.86)和统计显著性(F1 = 0.96)方面。以盐摄入量和血压为例进行研究,我们计算了证据的收敛性和收敛水平,发现盐对血压有强烈的兴奋作用(942项研究),而对心血管疾病和死亡有较弱的兴奋作用(124项研究)。这种方法通过整合不同研究设计的证据,并能够对科学争议进行快速、动态的评估,对传统的荟萃分析起到了补充作用。

相似文献

1
Evidence triangulator: using large language models to extract and synthesize causal evidence across study designs.证据三角测量器:利用大语言模型跨研究设计提取和综合因果证据。
Nat Commun. 2025 Aug 9;16(1):7355. doi: 10.1038/s41467-025-62783-x.
2
Replacing salt with low-sodium salt substitutes (LSSS) for cardiovascular health in adults, children and pregnant women.用低钠盐替代物(LSSS)代替盐以促进成年人、儿童和孕妇的心血管健康。
Cochrane Database Syst Rev. 2022 Aug 10;8(8):CD015207. doi: 10.1002/14651858.CD015207.
3
Healthcare outcomes assessed with observational study designs compared with those assessed in randomized trials.与随机试验中评估的医疗保健结果相比,观察性研究设计评估的医疗保健结果。
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):MR000034. doi: 10.1002/14651858.MR000034.pub2.
4
Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.改善消费者安全有效用药的干预措施:系统评价概述
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.
5
Individual-level interventions to reduce personal exposure to outdoor air pollution and their effects on people with long-term respiratory conditions.个体层面的干预措施以减少个人接触室外空气污染及其对长期呼吸系统疾病患者的影响。
Cochrane Database Syst Rev. 2021 Aug 9;8(8):CD013441. doi: 10.1002/14651858.CD013441.pub2.
6
Effects of a gluten-reduced or gluten-free diet for the primary prevention of cardiovascular disease.减少或无麸质饮食对心血管疾病一级预防的影响。
Cochrane Database Syst Rev. 2022 Feb 24;2(2):CD013556. doi: 10.1002/14651858.CD013556.pub2.
7
Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.用于识别下肢溃疡患者外周动脉疾病的自动化设备:证据综合和成本效益分析。
Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.
8
Reduced dietary salt for prevention of cardiovascular disease.减少膳食盐摄入以预防心血管疾病。
Cochrane Database Syst Rev. 2003(1):CD003656. doi: 10.1002/14651858.CD003656.
9
Reduced dietary salt for prevention of cardiovascular disease.减少膳食盐摄入以预防心血管疾病。
Cochrane Database Syst Rev. 2003(2):CD003656. doi: 10.1002/14651858.CD003656.
10
Reduced dietary salt for prevention of cardiovascular disease.减少膳食盐摄入以预防心血管疾病。
Cochrane Database Syst Rev. 2003(3):CD003656. doi: 10.1002/14651858.CD003656.

本文引用的文献

1
Including non-randomized studies of interventions in meta-analyses of randomized controlled trials changed the estimates in more than a third of the studies: evidence from an empirical analysis.在随机对照试验的荟萃分析中纳入干预措施的非随机研究,改变了超过三分之一研究中的估计值:来自实证分析的证据。
J Clin Epidemiol. 2025 May 5;183:111815. doi: 10.1016/j.jclinepi.2025.111815.
2
Future of Evidence Synthesis: Automated, Living, and Interactive Systematic Reviews and Meta-analyses.证据综合的未来:自动化、动态及交互式系统评价与Meta分析
Mayo Clin Proc Digit Health. 2024 Jun 8;2(3):361-365. doi: 10.1016/j.mcpdig.2024.05.023. eCollection 2024 Sep.
3
Quantifying convergence and consistency.
量化收敛和一致性。
Eur J Neurosci. 2024 Nov;60(10):6391-6394. doi: 10.1111/ejn.16561. Epub 2024 Oct 15.
4
Using causal diagrams within the Grading of Recommendations, Assessment, Development and Evaluation framework to evaluate confounding adjustment in observational studies.在推荐评估、发展和评估框架内使用因果图来评估观察性研究中的混杂调整。
J Clin Epidemiol. 2024 Nov;175:111532. doi: 10.1016/j.jclinepi.2024.111532. Epub 2024 Sep 18.
5
Causal inference on human behaviour.人类行为的因果推断。
Nat Hum Behav. 2024 Aug;8(8):1448-1459. doi: 10.1038/s41562-024-01939-z. Epub 2024 Aug 23.
6
Triangulating evidence in health sciences with Annotated Semantic Queries.健康科学中使用带注释语义查询的三角证据。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae519.
7
Performance of a Large Language Model in Screening Citations.大语言模型在引文筛选中的表现。
JAMA Netw Open. 2024 Jul 1;7(7):e2420496. doi: 10.1001/jamanetworkopen.2024.20496.
8
Performance of two large language models for data extraction in evidence synthesis.两种大型语言模型在证据综合数据提取中的性能比较。
Res Synth Methods. 2024 Sep;15(5):818-824. doi: 10.1002/jrsm.1732. Epub 2024 Jun 19.
9
Enhancing the coverage of SemRep using a relation classification approach.利用关系分类方法增强 SemRep 的覆盖范围。
J Biomed Inform. 2024 Jul;155:104658. doi: 10.1016/j.jbi.2024.104658. Epub 2024 May 21.
10
Sensitivity and Specificity of Using GPT-3.5 Turbo Models for Title and Abstract Screening in Systematic Reviews and Meta-analyses.使用 GPT-3.5 Turbo 模型进行系统评价和荟萃分析的标题和摘要筛选的灵敏度和特异性。
Ann Intern Med. 2024 Jun;177(6):791-799. doi: 10.7326/M23-3389. Epub 2024 May 21.