• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大型语言模型能够像人类一样对叙述性事件进行分割。

Large language models can segment narrative events similarly to humans.

作者信息

Michelmann Sebastian, Kumar Manoj, Norman Kenneth A, Toneva Mariya

出版信息

ArXiv. 2023 Jan 24:arXiv:2301.10297v1.

PMID:36748005
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9900968/
Abstract

Humans perceive discrete events such as "restaurant visits" and "train rides" in their continuous experience. One important prerequisite for studying human event perception is the ability of researchers to quantify when one event ends and another begins. Typically, this information is derived by aggregating behavioral annotations from several observers. Here we present an alternative computational approach where event boundaries are derived using a large language model, GPT-3, instead of using human annotations. We demonstrate that GPT-3 can segment continuous narrative text into events. GPT-3-annotated events are significantly correlated with human event annotations. Furthermore, these GPT-derived annotations achieve a good approximation of the "consensus" solution (obtained by averaging across human annotations); the boundaries identified by GPT-3 are closer to the consensus, on average, than boundaries identified by individual human annotators. This finding suggests that GPT-3 provides a feasible solution for automated event annotations, and it demonstrates a further parallel between human cognition and prediction in large language models. In the future, GPT-3 may thereby help to elucidate the principles underlying human event perception.

摘要

人类在其连续的体验中感知离散事件,如“去餐厅”和“乘坐火车”。研究人类事件感知的一个重要前提是研究人员能够量化一个事件何时结束以及另一个事件何时开始。通常,此信息是通过汇总多个观察者的行为注释得出的。在这里,我们提出了一种替代的计算方法,即使用大语言模型GPT-3来推导事件边界,而不是使用人工注释。我们证明GPT-3可以将连续的叙述文本分割成事件。GPT-3注释的事件与人工事件注释显著相关。此外,这些由GPT得出的注释很好地近似了“共识”解决方案(通过对人工注释求平均值获得);平均而言,GPT-3识别的边界比单个人工注释者识别的边界更接近共识。这一发现表明GPT-3为自动事件注释提供了一个可行的解决方案,并且它证明了人类认知与大语言模型中的预测之间的进一步平行关系。未来,GPT-3可能因此有助于阐明人类事件感知背后的原理。

相似文献

1
Large language models can segment narrative events similarly to humans.大型语言模型能够像人类一样对叙述性事件进行分割。
ArXiv. 2023 Jan 24:arXiv:2301.10297v1.
2
Large language models can segment narrative events similarly to humans.大型语言模型能够像人类一样对叙述性事件进行分段。
Behav Res Methods. 2025 Jan 3;57(1):39. doi: 10.3758/s13428-024-02569-z.
3
Using Large Language Models to Annotate Complex Cases of Social Determinants of Health in Longitudinal Clinical Records.使用大语言模型注释纵向临床记录中健康社会决定因素的复杂病例。
medRxiv. 2024 Apr 27:2024.04.25.24306380. doi: 10.1101/2024.04.25.24306380.
4
GPT is an effective tool for multilingual psychological text analysis.GPT 是一种用于多语言心理文本分析的有效工具。
Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2308950121. doi: 10.1073/pnas.2308950121. Epub 2024 Aug 12.
5
Bayesian Surprise Predicts Human Event Segmentation in Story Listening.贝叶斯惊奇预测故事倾听中的人类事件分割。
Cogn Sci. 2023 Oct;47(10):e13343. doi: 10.1111/cogs.13343.
6
CACER: Clinical concept Annotations for Cancer Events and Relations.CACER:癌症事件与关系的临床概念注释。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2583-2594. doi: 10.1093/jamia/ocae231.
7
Automated Paper Screening for Clinical Reviews Using Large Language Models: Data Analysis Study.使用大型语言模型对临床综述进行自动化论文筛选:数据分析研究。
J Med Internet Res. 2024 Jan 12;26:e48996. doi: 10.2196/48996.
8
Comparing GPT-4 and Human Researchers in Health Care Data Analysis: Qualitative Description Study.GPT-4 与人类研究人员在医疗数据分析中的比较:定性描述研究。
J Med Internet Res. 2024 Aug 21;26:e56500. doi: 10.2196/56500.
9
irAE-GPT: Leveraging large language models to identify immune-related adverse events in electronic health records and clinical trial datasets.免疫相关不良事件生成式预训练变换器(irAE-GPT):利用大语言模型在电子健康记录和临床试验数据集中识别免疫相关不良事件。
medRxiv. 2025 Mar 6:2025.03.05.25323445. doi: 10.1101/2025.03.05.25323445.
10
The plausibility machine commonsense (PMC) dataset: A massively crowdsourced human-annotated dataset for studying plausibility in large language models.似真性机器常识(PMC)数据集:一个用于研究大语言模型中似真性的大规模众包人工标注数据集。
Data Brief. 2024 Aug 24;57:110869. doi: 10.1016/j.dib.2024.110869. eCollection 2024 Dec.