• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TG-CSR:一个基于九个形式化常识类别的人工标注数据集。

TG-CSR: A human-labeled dataset grounded in nine formal commonsense categories.

作者信息

Santos Henrique, Mulvehill Alice M, Shen Ke, Kejriwal Mayank, McGuinness Deborah L

机构信息

Rensselaer Polytechnic Institute 110 8th St., Troy, NY 12180, USA.

University of Southern California 4676 Admiralty Way, Suite 1001 Marina del Rey CA, 90292, USA.

出版信息

Data Brief. 2023 Oct 11;51:109666. doi: 10.1016/j.dib.2023.109666. eCollection 2023 Dec.

DOI:10.1016/j.dib.2023.109666
PMID:37876745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10590714/
Abstract

Machine Common Sense Reasoning is the subfield of Artificial Intelligence that aims to enable machines to behave or make decisions similarly to humans in everyday and ordinary situations. To measure progress, benchmarks in the form of question-answering datasets have been developed and published in the community to evaluate machine commonsense models, including large language models. We describe the individual label data produced by six human annotators originally used in computing ground truth for the Theoretically-Grounded Commonsense Reasoning (TG-CSR) benchmark's composing datasets. According to a set of instructions, annotators were provided with spreadsheets containing the original TG-CSR prompts and asked to insert labels in specific spreadsheet cells during annotation sessions. TG-CSR data is organized in JSON files, individual raw label data in a spreadsheet file, and individual normalized label data in JSONL files. The release of individual labels can enable the analysis of the labeling process itself, including studies of noise and consistency across annotators.

摘要

机器常识推理是人工智能的一个子领域,旨在使机器在日常和普通情况下的行为或决策方式与人类相似。为了衡量进展,以问答数据集形式的基准已经在社区中开发并发布,用于评估机器常识模型,包括大语言模型。我们描述了最初用于计算理论基础常识推理(TG-CSR)基准组成数据集的地面真值的六名人类注释者产生的个体标签数据。根据一组说明,向注释者提供了包含原始TG-CSR提示的电子表格,并要求他们在注释会话期间在特定的电子表格单元格中插入标签。TG-CSR数据以JSON文件形式组织,个体原始标签数据存储在电子表格文件中,个体标准化标签数据存储在JSONL文件中。个体标签的发布可以对标签过程本身进行分析,包括对注释者之间的噪声和一致性的研究。

相似文献

1
TG-CSR: A human-labeled dataset grounded in nine formal commonsense categories.TG-CSR:一个基于九个形式化常识类别的人工标注数据集。
Data Brief. 2023 Oct 11;51:109666. doi: 10.1016/j.dib.2023.109666. eCollection 2023 Dec.
2
The plausibility machine commonsense (PMC) dataset: A massively crowdsourced human-annotated dataset for studying plausibility in large language models.似真性机器常识(PMC)数据集:一个用于研究大语言模型中似真性的大规模众包人工标注数据集。
Data Brief. 2024 Aug 24;57:110869. doi: 10.1016/j.dib.2024.110869. eCollection 2024 Dec.
3
A noise audit of human-labeled benchmarks for machine commonsense reasoning.机器常识推理的人工标注基准的噪声审计。
Sci Rep. 2024 Apr 14;14(1):8609. doi: 10.1038/s41598-024-58937-4.
4
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense.CRIC:一个用于视觉与常识组合推理的视觉问答数据集。
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5561-5578. doi: 10.1109/TPAMI.2022.3210780. Epub 2023 Apr 3.
5
CommonsenseVIS: Visualizing and Understanding Commonsense Reasoning Capabilities of Natural Language Models.常识视觉化(CommonsenseVIS):可视化与理解自然语言模型的常识推理能力
IEEE Trans Vis Comput Graph. 2023 Oct 26;PP. doi: 10.1109/TVCG.2023.3327153.
6
Community annotation experiment for ground truth generation for the i2b2 medication challenge.社区注释实验,为 i2b2 药物挑战赛生成真实数据。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):519-23. doi: 10.1136/jamia.2010.004200.
7
Robust Commonsense Reasoning Against Noisy Labels Using Adaptive Correction.使用自适应校正对有噪声标签进行稳健的常识推理
IEEE Trans Cybern. 2024 Jul;54(7):4138-4149. doi: 10.1109/TCYB.2023.3339629. Epub 2024 Jul 11.
8
Hurdles to Artificial Intelligence Deployment: Noise in Schemas and "Gold" Labels.人工智能部署的障碍:模式中的噪声和“黄金”标签。
Radiol Artif Intell. 2023 Jan 11;5(2):e220056. doi: 10.1148/ryai.220056. eCollection 2023 Mar.
9
Leveraging Symbolic Knowledge Bases for Commonsense Natural Language Inference Using Pattern Theory.利用符号知识库和模式理论进行常识自然语言推理。
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13185-13202. doi: 10.1109/TPAMI.2023.3287837. Epub 2023 Oct 3.
10
Joint Answering and Explanation for Visual Commonsense Reasoning.视觉常识推理的联合回答和解释。
IEEE Trans Image Process. 2023;32:3836-3846. doi: 10.1109/TIP.2023.3286259. Epub 2023 Jul 12.