斯特鲁德尔：基于属性和类型的基于语料库的语义模型。

Strudel: a corpus-based semantic model based on properties and types.

机构信息

Center for Mind Brain Sciences, University of Trento.

出版信息

Cogn Sci. 2010 Mar;34(2):222-54. doi: 10.1111/j.1551-6709.2009.01068.x. Epub 2009 Sep 30.

DOI:10.1111/j.1551-6709.2009.01068.x

Abstract

Computational models of meaning trained on naturally occurring text successfully model human performance on tasks involving simple similarity measures, but they characterize meaning in terms of undifferentiated bags of words or topical dimensions. This has led some to question their psychological plausibility (Murphy, 2002;Schunn, 1999). We present here a fully automatic method for extracting a structured and comprehensive set of concept descriptions directly from an English part-of-speech-tagged corpus. Concepts are characterized by weighted properties, enriched with concept-property types that approximate classical relations such as hypernymy and function. Our model outperforms comparable algorithms in cognitive tasks pertaining not only to concept-internal structures (discovering properties of concepts, grouping properties by property type) but also to inter-concept relations (clustering into superordinates), suggesting the empirical validity of the property-based approach.

摘要

基于自然语言文本训练的意义计算模型在涉及简单相似性度量的任务中成功模拟了人类的表现，但它们将意义描述为不分青红皂白的词袋或主题维度。这使得一些人对其心理合理性产生了质疑（Murphy，2002；Schunn，1999）。我们在这里提出了一种从英语词性标注语料库中直接提取结构化和全面的概念描述的全自动方法。概念的特点是具有加权属性，并丰富了概念属性类型，这些类型近似于超类和功能等经典关系。我们的模型在认知任务中的表现优于可比算法，不仅涉及概念内部结构（发现概念的属性，按属性类型对属性进行分组），还涉及概念间关系（聚类为上级概念），这表明基于属性的方法具有经验有效性。

相似文献

Strudel: a corpus-based semantic model based on properties and types.

Cogn Sci. 2010 Mar;34(2):222-54. doi: 10.1111/j.1551-6709.2009.01068.x. Epub 2009 Sep 30.

More data trumps smarter algorithms: comparing pointwise mutual information with latent semantic analysis.

Behav Res Methods. 2009 Aug;41(3):647-56. doi: 10.3758/BRM.41.3.647.

Automatic extraction of property norm-like data from large text corpora.

Cogn Sci. 2014 May-Jun;38(4):638-82. doi: 10.1111/cogs.12091.

A knowledge-driven approach to biomedical document conceptualization.

Artif Intell Med. 2010 Jun;49(2):67-78. doi: 10.1016/j.artmed.2010.02.005. Epub 2010 Apr 3.

Grounding co-occurrence: Identifying features in a lexical co-occurrence model of semantic memory.

Behav Res Methods. 2009 Nov;41(4):1210-23. doi: 10.3758/BRM.41.4.1210.

An ontology on property for physical, chemical, and biological systems.

APMIS Suppl. 2004(117):1-210.

Learning semantic and visual similarity for endomicroscopy video retrieval.

IEEE Trans Med Imaging. 2012 Jun;31(6):1276-88. doi: 10.1109/TMI.2012.2188301. Epub 2012 Feb 16.

Gender identity and its implications for the concepts of masculinity and femininity.

Nebr Symp Motiv. 1984;32:59-95.

Enhancing MEDLINE document clustering by incorporating MeSH semantic similarity.

Bioinformatics. 2009 Aug 1;25(15):1944-51. doi: 10.1093/bioinformatics/btp338. Epub 2009 Jun 3.

[Foundations of the new phylogenetics].

Zh Obshch Biol. 2004 Jul-Aug;65(4):334-66.

引用本文的文献

The Three Terms Task - an open benchmark to compare human and artificial semantic representations.

Sci Data. 2023 Mar 2;10(1):117. doi: 10.1038/s41597-023-02015-3.

How the Brain Dynamically Constructs Sentence-Level Meanings From Word-Level Features.

Front Artif Intell. 2022 Apr 21;5:733163. doi: 10.3389/frai.2022.733163. eCollection 2022.

Semantic projection recovers rich human knowledge of multiple object features from word embeddings.

Nat Hum Behav. 2022 Jul;6(7):975-987. doi: 10.1038/s41562-022-01316-8. Epub 2022 Apr 14.

Exploring the Representations of Individual Entities in the Brain Combining EEG and Distributional Semantics.

Front Artif Intell. 2022 Feb 23;5:796793. doi: 10.3389/frai.2022.796793. eCollection 2022.

The Representation of Coordinate Relations in Lexical Semantic Memory.

Front Psychol. 2020 Feb 11;11:98. doi: 10.3389/fpsyg.2020.00098. eCollection 2020.

Modeling the Structure and Dynamics of Semantic Processing.

Cogn Sci. 2018 Nov;42(8):2890-2917. doi: 10.1111/cogs.12690. Epub 2018 Oct 7.

Predicting Lexical Priming Effects from Distributional Semantic Similarities: A Replication with Extension.

Front Psychol. 2016 Oct 24;7:1646. doi: 10.3389/fpsyg.2016.01646. eCollection 2016.

Understanding Karma Police: The Perceived Plausibility of Noun Compounds as Predicted by Distributional Models of Semantic Representation.

PLoS One. 2016 Oct 12;11(10):e0163200. doi: 10.1371/journal.pone.0163200. eCollection 2016.

Language networks associated with computerized semantic indices.

Neuroimage. 2015 Jan 1;104:125-37. doi: 10.1016/j.neuroimage.2014.10.008. Epub 2014 Oct 12.

PLoS One. 2013 Jun 14;8(6):e65366. doi: 10.1371/journal.pone.0065366. Print 2013.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

斯特鲁德尔：基于属性和类型的基于语料库的语义模型。

Strudel: a corpus-based semantic model based on properties and types.

机构信息

Center for Mind Brain Sciences, University of Trento.

出版信息

Cogn Sci. 2010 Mar;34(2):222-54. doi: 10.1111/j.1551-6709.2009.01068.x. Epub 2009 Sep 30.

DOI:10.1111/j.1551-6709.2009.01068.x

PMID:21564211

Abstract

摘要

斯特鲁德尔：基于属性和类型的基于语料库的语义模型。

Strudel: a corpus-based semantic model based on properties and types.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

斯特鲁德尔：基于属性和类型的基于语料库的语义模型。

Strudel: a corpus-based semantic model based on properties and types.

机构信息

出版信息

相似文献

引用本文的文献