Orwig William, Edenbaum Emma R, Greene Joshua D, Schacter Daniel L
Harvard University.
J Creat Behav. 2024 Mar;58(1):128-136. doi: 10.1002/jocb.636. Epub 2024 Jan 11.
Recent developments in computerized scoring via semantic distance have provided automated assessments of verbal creativity. Here, we extend past work, applying computational linguistic approaches to characterize salient features of creative text. We hypothesize that, in addition to semantic diversity, the degree to which a story includes perceptual details, thus transporting the reader to another time and place, would be predictive of creativity. Additionally, we explore the use of generative language models to supplement human data collection and examine the extent to which machine-generated stories can mimic human creativity. We collect 600 short stories from human participants and GPT-3, subsequently randomized and assessed on their creative quality. Results indicate that the presence of perceptual details, in conjunction with semantic diversity, is highly predictive of creativity. These results were replicated in an independent sample of stories ( = 120) generated by GPT-4. We do not observe a significant difference between human and AI-generated stories in terms of creativity ratings, and we also observe positive correlations between human and AI assessments of creativity. Implications and future directions are discussed.
通过语义距离进行计算机评分的最新进展为言语创造力提供了自动化评估。在此,我们拓展以往的工作,应用计算语言学方法来刻画创造性文本的显著特征。我们假设,除了语义多样性之外,一个故事包含感知细节从而将读者带入另一个时间和地点的程度将能够预测创造力。此外,我们探索使用生成式语言模型来补充人类数据收集,并研究机器生成的故事在多大程度上能够模仿人类创造力。我们从人类参与者和GPT-3收集了600篇短篇小说,随后将其随机分组并对其创造性质量进行评估。结果表明,感知细节的存在与语义多样性相结合,对创造力具有高度预测性。这些结果在由GPT-4生成的一个独立故事样本( = 120)中得到了复制。我们在创造力评分方面未观察到人类生成的故事与人工智能生成的故事之间存在显著差异,并且我们还观察到人类与人工智能对创造力的评估之间存在正相关。我们讨论了研究结果的意义和未来方向。