How Good (Or Bad) Are LLMs at Detecting Misleading Visualizations?

Authors

Lo Leo Yu-Ho, Qu Huamin

Publication information

IEEE Trans Vis Comput Graph. 2025 Jan;31(1):1116-1125. doi: 10.1109/TVCG.2024.3456333. Epub 2024 Nov 25.

DOI: 10.1109/TVCG.2024.3456333
PMID: 39264775
Abstract

In this study, we address the growing issue of misleading charts, a prevalent problem that undermines the integrity of information dissemination. Misleading charts can distort the viewer's perception of data, leading to misinterpretations and decisions based on false information. The development of effective automatic detection methods for misleading charts is an urgent field of research. The recent advancement of multimodal Large Language Models (LLMs) has introduced a promising direction for addressing this challenge. We explored the capabilities of these models in analyzing complex charts and assessing the impact of different prompting strategies on the models' analyses. We utilized a dataset of misleading charts collected from the internet by prior research and crafted nine distinct prompts, ranging from simple to complex, to test the ability of four different multimodal LLMs in detecting over 21 different chart issues. Through three experiments, from initial exploration to detailed analysis, we progressively gained insights into how to effectively prompt LLMs to identify misleading charts and developed strategies to address the scalability challenges encountered as we expanded our detection range from the initial five issues to 21 issues in the final experiment. Our findings reveal that multimodal LLMs possess a strong capability for chart comprehension and critical thinking in data interpretation. There is significant potential in employing multimodal LLMs to counter misleading information by supporting critical thinking and enhancing visualization literacy. This study demonstrates the applicability of LLMs in addressing the pressing concern of misleading charts.
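The abstract describes comparing several models under several prompts on charts that each carry known issues. As a rough illustration only, the sketch below shows one way such a comparison could be scored. It is not the authors' evaluation code: the issue names, model and prompt labels, record layout, and the use of per-chart F1 are all assumptions made for this example.

```python
from collections import defaultdict


def score_run(predicted, truth):
    """Return (precision, recall, f1) for one chart's predicted issue set."""
    predicted, truth = set(predicted), set(truth)
    true_positives = len(predicted & truth)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def summarize(records):
    """Average per-chart F1 for every (model, prompt) pair."""
    buckets = defaultdict(list)
    for record in records:
        _, _, f1 = score_run(record["predicted"], record["truth"])
        buckets[(record["model"], record["prompt"])].append(f1)
    return {key: sum(values) / len(values) for key, values in buckets.items()}


# Hypothetical records: labels and values are invented for illustration
# and do not come from the paper's dataset or taxonomy.
records = [
    {"model": "model_a", "prompt": "simple",
     "predicted": ["truncated_axis"],
     "truth": ["truncated_axis", "dual_axis"]},
    {"model": "model_a", "prompt": "simple",
     "predicted": [],
     "truth": ["3d_effect"]},
    {"model": "model_a", "prompt": "checklist",
     "predicted": ["truncated_axis", "dual_axis"],
     "truth": ["truncated_axis", "dual_axis"]},
]

summary = summarize(records)
for (model, prompt), mean_f1 in sorted(summary.items()):
    print(f"{model} / {prompt}: mean F1 = {mean_f1:.2f}")
```

Keying results by (model, prompt) keeps the comparison of prompting strategies separate from the comparison of models, which matches the two questions the abstract raises.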

