Suppr超能文献

语言编码地理信息。

Language encodes geographical information.

机构信息

University of Memphis Erasmus University Rotterdam.

出版信息

Cogn Sci. 2009 Jan;33(1):51-73. doi: 10.1111/j.1551-6709.2008.01003.x.

Abstract

Population counts and longitude and latitude coordinates were estimated for the 50 largest cities in the United States by computational linguistic techniques and by human participants. The mathematical technique Latent Semantic Analysis applied to newspaper texts produced similarity ratings between the 50 cities that allowed for a multidimensional scaling (MDS) of these cities. MDS coordinates correlated with the actual longitude and latitude of these cities, showing that cities that are located together share similar semantic contexts. This finding was replicated using a first-order co-occurrence algorithm. The computational estimates of geographical location as well as population were akin to human estimates. These findings show that language encodes geographical information that language users in turn may use in their understanding of language and the world.

摘要

通过计算语言学技术和人工参与者,估计了美国 50 个最大城市的人口数量和经纬度坐标。应用于报纸文本的潜在语义分析数学技术产生了 50 个城市之间的相似性评分,允许对这些城市进行多维缩放 (MDS)。 MDS 坐标与这些城市的实际经纬度相关联,表明位于一起的城市具有相似的语义背景。使用一阶共现算法复制了这一发现。地理位置和人口的计算估计与人类估计相似。这些发现表明,语言编码了地理信息,而语言使用者可能会在理解语言和世界时使用这些信息。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验