ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset.

Authors

Jin Zhihua, Wang Xingbo, Cheng Furui, Sun Chunhui, Liu Qun, Qu Huamin

Publication information

IEEE Trans Vis Comput Graph. 2024 Jul;30(7):3594-3608. doi: 10.1109/TVCG.2023.3236380. Epub 2024 Jun 27.

DOI:10.1109/TVCG.2023.3236380

Abstract

Benchmark datasets play an important role in evaluating Natural Language Understanding (NLU) models. However, shortcuts (unwanted biases in the benchmark datasets) can damage the effectiveness of benchmark datasets in revealing models' real capabilities. Since shortcuts vary in coverage, productivity, and semantic meaning, it is challenging for NLU experts to systematically understand and avoid them when creating benchmark datasets. In this paper, we develop a visual analytics system, ShortcutLens, to help NLU experts explore shortcuts in NLU benchmark datasets. The system allows users to conduct multi-level exploration of shortcuts. Specifically, the Statistics View helps users grasp statistics such as the coverage and productivity of shortcuts in the benchmark dataset. The Template View employs hierarchical and interpretable templates to summarize different types of shortcuts. The Instance View allows users to check the corresponding instances covered by the shortcuts. We conduct case studies and expert interviews to evaluate the effectiveness and usability of the system. The results demonstrate that ShortcutLens helps users gain a better understanding of benchmark dataset issues through shortcuts, inspiring them to create challenging and pertinent benchmark datasets.
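The abstract names coverage and productivity as shortcut statistics but does not define them. As a minimal sketch, assume coverage is the fraction of instances that contain a candidate shortcut token, and productivity is the share of those covered instances that carry the most frequent label. Both definitions, the function name `shortcut_stats`, the whitespace tokenization, and the toy data are illustrative assumptions, not the ShortcutLens implementation.

```python
from collections import Counter


def shortcut_stats(instances, token):
    """Return (coverage, productivity) for a single-token shortcut candidate.

    instances: list of (text, label) pairs.
    Both metric definitions are assumptions made for illustration.
    """
    # Labels of instances whose lowercased, whitespace-split text contains `token`.
    covered = [label for text, label in instances if token in text.lower().split()]
    if not covered:
        return 0.0, 0.0
    # Coverage: covered instances as a fraction of the whole dataset.
    coverage = len(covered) / len(instances)
    # Productivity: share of covered instances with the most frequent label.
    productivity = Counter(covered).most_common(1)[0][1] / len(covered)
    return coverage, productivity


# Hypothetical toy dataset in the style of a natural language inference task.
toy = [
    ("a man is not sleeping", "contradiction"),
    ("nobody is outside", "contradiction"),
    ("the dog is not red", "contradiction"),
    ("a woman is not tall", "neutral"),
    ("a child plays outside", "entailment"),
]
coverage, productivity = shortcut_stats(toy, "not")
print(f"coverage={coverage:.2f}, productivity={productivity:.2f}")
```

Under these assumptions, a token with high coverage and high productivity would be a strong shortcut candidate, because a model could predict the label for many instances from that token alone.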
