• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

双词语义距离作为自然语言中连续语义流的指标:理论、工具和应用。

Bigram semantic distance as an index of continuous semantic flow in natural language: Theory, tools, and applications.

机构信息

Eleanor M. Saffran Center for Cognitive Neuroscience, Temple University.

Technion-Israel Institute of Technology.

出版信息

J Exp Psychol Gen. 2023 Sep;152(9):2578-2590. doi: 10.1037/xge0001389. Epub 2023 Apr 20.

DOI:10.1037/xge0001389
PMID:37079833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10790181/
Abstract

Much of our understanding of word meaning has been informed through studies of single words. High-dimensional semantic space models have recently proven instrumental in elucidating connections words. Here we show how bigram semantic distance can yield novel insights into conceptual cohesion and topic flow when computed over continuous language samples. For example, "Cats drink milk" is comprised of an ordered vector of bigrams (cat-drink, drink-milk). Each of these bigrams has a unique semantic distance. These distances in turn may provide a metric of dispersion or the flow of concepts as language unfolds. We offer an R-package ("semdistflow") that transforms any user-specified language transcript into a vector of ordered bigrams, appending two metrics of semantic distance to each pair. We validated these distance metrics on a continuous stream of simulated verbal fluency data assigning predicted switch markers between alternating semantic clusters (animals, musical instruments, fruit). We then generated bigram distance norms on a large sample of text and demonstrated applications of the technique to a classic work of short fiction, (London, 1908). In one application, we showed that bigrams spanning sentence boundaries are punctuated by jumps in the semantic distance. We discuss the promise of this technique for characterizing semantic processing in real-world narratives and for bridging findings at the single word level with macroscale discourse analyses. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

摘要

我们对单词含义的理解在很大程度上是通过对单个单词的研究得到的。高维语义空间模型最近被证明在阐明单词之间的联系方面非常有用。在这里,我们展示了如何在连续的语言样本上计算双词语义距离,从而为概念内聚和主题流动提供新的见解。例如,“猫喝牛奶”由有序的双词向量(cat-drink,drink-milk)组成。这些双词中的每一个都有一个独特的语义距离。这些距离反过来又可以提供一个分散或概念流动的度量,因为语言在展开。我们提供了一个 R 包(“semdistflow”),可以将任何用户指定的语言转录转换为有序双词向量,并为每对双词添加两个语义距离度量。我们在连续的模拟口头流畅性数据流上验证了这些距离度量,为交替语义簇(动物、乐器、水果)之间的预测切换标记分配了这些距离度量。然后,我们在大量文本上生成了双词距离规范,并展示了该技术在经典短篇小说(London,1908)中的应用。在一个应用中,我们表明跨越句子边界的双词被语义距离的跳跃打断。我们讨论了这种技术在描述现实世界叙事中的语义处理以及将单字水平的发现与宏观话语分析联系起来的潜力。(PsycInfo 数据库记录(c)2023 APA,保留所有权利)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/9ac0de4c2789/nihms-1957540-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/de864770e74e/nihms-1957540-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/f45726c503af/nihms-1957540-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/9ac0de4c2789/nihms-1957540-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/de864770e74e/nihms-1957540-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/f45726c503af/nihms-1957540-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f992/10790181/9ac0de4c2789/nihms-1957540-f0003.jpg

相似文献

1
Bigram semantic distance as an index of continuous semantic flow in natural language: Theory, tools, and applications.双词语义距离作为自然语言中连续语义流的指标:理论、工具和应用。
J Exp Psychol Gen. 2023 Sep;152(9):2578-2590. doi: 10.1037/xge0001389. Epub 2023 Apr 20.
2
Semantic flow and its relation to controlled semantic retrieval deficits in the narrative production of people with aphasia.语义流及其与失语症患者叙事产生中受控语义检索缺陷的关系。
Neuropsychologia. 2022 Jun 6;170:108235. doi: 10.1016/j.neuropsychologia.2022.108235. Epub 2022 Apr 14.
3
Letter processing in upright bigrams predicts reading fluency variations in children.正立双字母词加工预测儿童阅读流畅性变化。
J Exp Psychol Gen. 2022 Sep;151(9):2237-2249. doi: 10.1037/xge0001175. Epub 2022 Feb 10.
4
Deep Artificial Neural Networks Reveal a Distributed Cortical Network Encoding Propositional Sentence-Level Meaning.深度人工神经网络揭示命题句级意义的分布式皮层网络编码。
J Neurosci. 2021 May 5;41(18):4100-4119. doi: 10.1523/JNEUROSCI.1152-20.2021. Epub 2021 Mar 22.
5
Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies.语言习得和词汇加工中的特征独特性效应:来自大型研究的见解
Cogn Process. 2020 Nov;21(4):669-685. doi: 10.1007/s10339-019-00947-6. Epub 2020 Jan 23.
6
Tracking word semantic change in biomedical literature.追踪生物医学文献中的词汇语义变化。
Int J Med Inform. 2018 Jan;109:76-86. doi: 10.1016/j.ijmedinf.2017.11.006. Epub 2017 Nov 13.
7
Unsupervised low-dimensional vector representations for words, phrases and text that are transparent, scalable, and produce similarity metrics that are not redundant with neural embeddings.用于单词、短语和文本的无监督低维向量表示,具有透明性、可扩展性,并能产生与神经嵌入不冗余的相似性度量。
J Biomed Inform. 2019 Feb;90:103096. doi: 10.1016/j.jbi.2019.103096. Epub 2019 Jan 14.
8
The text-package: An R-package for analyzing and visualizing human language using natural language processing and transformers.文本包:一个用于使用自然语言处理和转换器分析和可视化人类语言的 R 包。
Psychol Methods. 2023 Dec;28(6):1478-1498. doi: 10.1037/met0000542. Epub 2023 May 1.
9
Lexical factors and cerebral regions influencing verbal fluency performance in MCI.影响 MCI 患者言语流畅性表现的词汇因素和大脑区域。
Neuropsychologia. 2014 Feb;54:98-111. doi: 10.1016/j.neuropsychologia.2013.12.010. Epub 2013 Dec 30.
10
Exposure to co-occurrence regularities in language drives semantic integration of new words.接触语言中的共现规律会促进新单词的语义整合。
J Exp Psychol Learn Mem Cogn. 2022 Jul;48(7):1064-1081. doi: 10.1037/xlm0001122. Epub 2022 Apr 7.

引用本文的文献

1
Modality-Specificity is not a Necessary Condition for Grounded Semantic Cognition: Commentary on.模态特异性并非基于语义认知的必要条件:评……
Lang Cogn Neurosci. 2024;39(7):854-858. doi: 10.1080/23273798.2023.2210234. Epub 2023 May 9.
2
What we mean when we say semantic: Toward a multidisciplinary semantic glossary.我们所说的语义是什么意思:迈向一个多学科语义词汇表。
Psychon Bull Rev. 2025 Feb;32(1):243-280. doi: 10.3758/s13423-024-02556-7. Epub 2024 Sep 4.

本文引用的文献

1
Neural evidence of switch processes during semantic and phonetic foraging in human memory.人类记忆中语义和语音觅食过程转换时的神经证据。
Proc Natl Acad Sci U S A. 2023 Oct 17;120(42):e2312462120. doi: 10.1073/pnas.2312462120. Epub 2023 Oct 12.
2
Divergent semantic integration (DSI): Extracting creativity from narratives with distributional semantic modeling.发散语义整合(DSI):通过分布语义建模从叙事中提取创造力。
Behav Res Methods. 2023 Oct;55(7):3726-3759. doi: 10.3758/s13428-022-01986-2. Epub 2022 Oct 17.
3
Sensorimotor distance: A grounded measure of semantic similarity for 800 million concept pairs.
感觉运动距离:用于衡量 8 亿对概念对语义相似性的基础测度。
Behav Res Methods. 2023 Oct;55(7):3416-3432. doi: 10.3758/s13428-022-01965-7. Epub 2022 Sep 21.
4
An investigation of the cognitive and neural correlates of semantic memory search related to creative ability.对与创造力相关的语义记忆搜索的认知和神经相关性的调查。
Commun Biol. 2022 Jun 16;5(1):604. doi: 10.1038/s42003-022-03547-x.
5
Semantic flow and its relation to controlled semantic retrieval deficits in the narrative production of people with aphasia.语义流及其与失语症患者叙事产生中受控语义检索缺陷的关系。
Neuropsychologia. 2022 Jun 6;170:108235. doi: 10.1016/j.neuropsychologia.2022.108235. Epub 2022 Apr 14.
6
Semantic projection recovers rich human knowledge of multiple object features from word embeddings.语义投射从词嵌入中恢复了人类对多个对象特征的丰富知识。
Nat Hum Behav. 2022 Jul;6(7):975-987. doi: 10.1038/s41562-022-01316-8. Epub 2022 Apr 14.
7
Different computational relations in language are captured by distinct brain systems.不同的语言计算关系由不同的大脑系统捕捉。
Cereb Cortex. 2023 Feb 7;33(4):997-1013. doi: 10.1093/cercor/bhac117.
8
Decoding the information structure underlying the neural representation of concepts.解码概念的神经表示背后的信息结构。
Proc Natl Acad Sci U S A. 2022 Feb 8;119(6). doi: 10.1073/pnas.2108091119.
9
Editors' Introduction to Networks of the Mind: How Can Network Science Elucidate Our Understanding of Cognition?《心智网络:网络科学如何深化我们对认知的理解》编辑引言
Top Cogn Sci. 2022 Jan;14(1):45-53. doi: 10.1111/tops.12598.
10
Semantic network analysis (SemNA): A tutorial on preprocessing, estimating, and analyzing semantic networks.语义网络分析(SemNA):关于语义网络预处理、估计和分析的教程。
Psychol Methods. 2023 Aug;28(4):860-879. doi: 10.1037/met0000463. Epub 2021 Dec 23.