• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

个体与BARD:结构化协作贝叶斯推理在线系统的实验评估

Individuals vs. BARD: Experimental Evaluation of an Online System for Structured, Collaborative Bayesian Reasoning.

作者信息

Korb Kevin B, Nyberg Erik P, Oshni Alvandi Abraham, Thakur Shreshth, Ozmen Mehmet, Li Yang, Pearson Ross, Nicholson Ann E

机构信息

Faculty of Information Technology, Monash University, Melbourne, VIC, Australia.

Department of Economics, University of Melbourne, Melbourne, VIC, Australia.

出版信息

Front Psychol. 2020 Jun 18;11:1054. doi: 10.3389/fpsyg.2020.01054. eCollection 2020.

DOI:10.3389/fpsyg.2020.01054
PMID:32625129
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7314942/
Abstract

US intelligence analysts must weigh up relevant evidence to assess the probability of their conclusions, and express this reasoning clearly in written reports for decision-makers. Typically, they work alone with no special analytic tools, and sometimes succumb to common probabilistic and causal reasoning errors. So, the US government funded a major research program (CREATE) for four large academic teams to develop new structured, collaborative, software-based methods that might achieve better results. Our team's method (BARD) is the first to combine two key techniques: constructing causal Bayesian network models (BNs) to represent analyst knowledge, and small-group collaboration via the Delphi technique. BARD also incorporates compressed, high-quality online training allowing novices to use it, and checklist-inspired report templates with a rudimentary AI tool for generating text explanations from analysts' BNs. In two prior experiments, our team showed BARD's BN-building assists probabilistic reasoning when used by individuals, with a large effect (Glass' Δ 0.8) (Cruz et al., 2020), and even minimal Delphi-style interactions improve the BN structures individuals produce, with medium to very large effects (Glass' Δ 0.5-1.3) (Bolger et al., 2020). This experiment is the critical test of BARD as an integrated system and possible alternative to business-as-usual for intelligence analysis. Participants were asked to solve three probabilistic reasoning problems spread over 5 weeks, developed by our team to test both quantitative accuracy and susceptibility to tempting qualitative fallacies. Our 256 participants were randomly assigned to form 25 teams of 6-9 using BARD and 58 individuals using Google Suite and (if desired) the best pen-and-paper techniques. For each problem, BARD outperformed this control with very large to huge effects (Glass' Δ 1.4-2.2), greatly exceeding CREATE's initial target. We conclude that, for suitable problems, BARD already offers significant advantages over both business-as-usual and existing BN software. Our effect sizes also suggest BARD's BN-building and collaboration combined beneficially and cumulatively, although implementation differences decreased performances compared to Cruz et al. (2020), so interaction may have contributed. BARD has enormous potential for further development and testing of specific components and on more complex problems, and many potential applications beyond intelligence analysis.

摘要

美国情报分析师必须权衡相关证据,以评估其结论的可能性,并在为决策者撰写的报告中清晰地阐述这一推理过程。通常情况下,他们独立工作,没有特殊的分析工具,有时会陷入常见的概率和因果推理错误。因此,美国政府资助了一个大型研究项目(CREATE),让四个大型学术团队开发新的结构化、协作式、基于软件的方法,以期获得更好的结果。我们团队的方法(BARD)首次结合了两项关键技术:构建因果贝叶斯网络模型(BNs)来表示分析师的知识,以及通过德尔菲技术进行小组协作。BARD还包含压缩的高质量在线培训,使新手也能使用,以及受清单启发的报告模板,并配有一个初级人工智能工具,用于根据分析师的贝叶斯网络生成文本解释。在之前的两项实验中,我们团队表明,BARD的贝叶斯网络构建在个人使用时有助于概率推理,效果显著(格拉斯效应量Δ为0.8)(克鲁兹等人,2020年),即使是最少的德尔菲式互动也能改善个人生成的贝叶斯网络结构,效果从中等到非常大(格拉斯效应量Δ为0.5 - 1.3)(博尔格等人,2020年)。本次实验是对BARD作为一个集成系统以及情报分析常规方法可能替代方案的关键测试。参与者被要求在5周内解决我们团队设计的三个概率推理问题,这些问题旨在测试定量准确性以及对诱人的定性谬误的易感性。我们的256名参与者被随机分配,组成25个由6 - 9人组成的团队使用BARD,另外58人使用谷歌套件以及(如有需要)最佳的纸笔技术。对于每个问题,BARD的表现均远超这个对照组,效果非常大到极其显著(格拉斯效应量Δ为1.4 - 2.2),大大超出了CREATE的初始目标。我们得出结论,对于合适的问题,BARD已经比常规方法和现有的贝叶斯网络软件具有显著优势。我们的效应量还表明,BARD的贝叶斯网络构建和协作相结合产生了有益的累积效果,尽管与克鲁兹等人(2020年)相比,实施差异导致了性能下降,所以互动可能起到了作用。BARD在特定组件的进一步开发和测试以及处理更复杂问题方面具有巨大潜力,并且在情报分析之外还有许多潜在应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/de6871f67956/fpsyg-11-01054-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/492c865911bb/fpsyg-11-01054-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/3a13730e431a/fpsyg-11-01054-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/fd98cc3c9c01/fpsyg-11-01054-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/3332e1062a21/fpsyg-11-01054-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/de6871f67956/fpsyg-11-01054-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/492c865911bb/fpsyg-11-01054-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/3a13730e431a/fpsyg-11-01054-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/fd98cc3c9c01/fpsyg-11-01054-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/3332e1062a21/fpsyg-11-01054-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e4b/7314942/de6871f67956/fpsyg-11-01054-g0005.jpg

相似文献

1
Individuals vs. BARD: Experimental Evaluation of an Online System for Structured, Collaborative Bayesian Reasoning.个体与BARD:结构化协作贝叶斯推理在线系统的实验评估
Front Psychol. 2020 Jun 18;11:1054. doi: 10.3389/fpsyg.2020.01054. eCollection 2020.
2
BARD: A Structured Technique for Group Elicitation of Bayesian Networks to Support Analytic Reasoning.BARD:一种用于支持分析推理的贝叶斯网络的结构化群组启发式技术。
Risk Anal. 2022 Jun;42(6):1155-1178. doi: 10.1111/risa.13759. Epub 2021 Jun 19.
3
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
Evaluating the effectiveness of artificial intelligence-based tools in detecting and understanding sleep health misinformation: Comparative analysis using Google Bard and OpenAI ChatGPT-4.评估基于人工智能的工具在检测和理解睡眠健康错误信息方面的有效性:使用 Google Bard 和 OpenAI ChatGPT-4 的比较分析。
J Sleep Res. 2024 Dec;33(6):e14210. doi: 10.1111/jsr.14210. Epub 2024 Apr 5.
5
Analysing the Applicability of ChatGPT, Bard, and Bing to Generate Reasoning-Based Multiple-Choice Questions in Medical Physiology.分析ChatGPT、Bard和必应在医学生理学中生成基于推理的多项选择题的适用性。
Cureus. 2023 Jun 26;15(6):e40977. doi: 10.7759/cureus.40977. eCollection 2023 Jun.
6
Measuring the effect of commuting on the performance of the Bayesian Aerosol Release Detector.测量通勤对贝叶斯气溶胶释放探测器性能的影响。
BMC Med Inform Decis Mak. 2009 Nov 3;9 Suppl 1(Suppl 1):S7. doi: 10.1186/1472-6947-9-S1-S7.
7
8
Developing Bayesian networks from a dependency-layered ontology: A proof-of-concept in radiation oncology.从依赖分层本体中开发贝叶斯网络:放射肿瘤学中的概念验证。
Med Phys. 2017 Aug;44(8):4350-4359. doi: 10.1002/mp.12340. Epub 2017 Jun 30.
9
Bard Versus the 2022 American Society of Plastic Surgeons In-Service Examination: Performance on the Examination in Its Intern Year.巴德与2022年美国整形外科医师学会在职考试:实习年度考试表现
Aesthet Surg J Open Forum. 2023 Jul 19;6:ojad066. doi: 10.1093/asjof/ojad066. eCollection 2024.
10
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

1
Assessing the Capability of ChatGPT, Google Bard, and Microsoft Bing in Solving Radiology Case Vignettes.评估ChatGPT、谷歌巴德和微软必应解决放射学病例 vignettes的能力。
Indian J Radiol Imaging. 2023 Dec 29;34(2):276-282. doi: 10.1055/s-0043-1777746. eCollection 2024 Apr.
2
BARD: A Structured Technique for Group Elicitation of Bayesian Networks to Support Analytic Reasoning.BARD:一种用于支持分析推理的贝叶斯网络的结构化群组启发式技术。
Risk Anal. 2022 Jun;42(6):1155-1178. doi: 10.1111/risa.13759. Epub 2021 Jun 19.

本文引用的文献

1
BARD: A Structured Technique for Group Elicitation of Bayesian Networks to Support Analytic Reasoning.BARD:一种用于支持分析推理的贝叶斯网络的结构化群组启发式技术。
Risk Anal. 2022 Jun;42(6):1155-1178. doi: 10.1111/risa.13759. Epub 2021 Jun 19.
2
Widening Access to Bayesian Problem Solving.扩大贝叶斯问题解决方法的应用范围。
Front Psychol. 2020 Apr 9;11:660. doi: 10.3389/fpsyg.2020.00660. eCollection 2020.
3
The Zero-Sum Fallacy in Evidence Evaluation.证据评估中的零和谬误。
Psychol Sci. 2019 Feb;30(2):250-260. doi: 10.1177/0956797618818484. Epub 2018 Dec 31.
4
A Biased Bayesian Inference for Decision-Making and Cognitive Control.一种用于决策和认知控制的有偏贝叶斯推理
Front Neurosci. 2018 Oct 12;12:734. doi: 10.3389/fnins.2018.00734. eCollection 2018.
5
Eliciting improved quantitative judgements using the IDEA protocol: A case study in natural resource management.使用 IDEA 协议得出改进的定量判断:自然资源管理中的案例研究。
PLoS One. 2018 Jun 22;13(6):e0198468. doi: 10.1371/journal.pone.0198468. eCollection 2018.
6
Approaches to Cognitive Modeling in Dynamic Systems Control.动态系统控制中的认知建模方法。
Front Psychol. 2017 Nov 29;8:2032. doi: 10.3389/fpsyg.2017.02032. eCollection 2017.
7
Individual versus group decision making: Jurors' reliance on central and peripheral information to evaluate expert testimony.个体决策与群体决策:陪审员在评估专家证词时对核心信息和边缘信息的依赖
PLoS One. 2017 Sep 20;12(9):e0183580. doi: 10.1371/journal.pone.0183580. eCollection 2017.
8
ANOVA and the variance homogeneity assumption: Exploring a better gatekeeper.方差分析与方差齐性假设:探寻更好的把关方法。
Br J Math Stat Psychol. 2018 Feb;71(1):1-12. doi: 10.1111/bmsp.12103. Epub 2017 Jun 1.
9
Are groups more rational than individuals? A review of interactive decision making in groups.群体比个体更理性吗?对群体互动决策的综述。
Wiley Interdiscip Rev Cogn Sci. 2012 Jul;3(4):471-482. doi: 10.1002/wcs.1184. Epub 2012 May 11.
10
The Bayesian boom: good thing or bad?贝叶斯方法的兴起:是好是坏?
Front Psychol. 2014 Aug 8;5:765. doi: 10.3389/fpsyg.2014.00765. eCollection 2014.