Mapping Lexical Dialect Variation in British English Using Twitter.

Authors

Grieve Jack, Montgomery Chris, Nini Andrea, Murakami Akira, Guo Diansheng

Affiliations

Department of English Language and Linguistics, University of Birmingham, Birmingham, United Kingdom.

School of English, University of Sheffield, Sheffield, United Kingdom.

Publication information

Front Artif Intell. 2019 Jul 12;2:11. doi: 10.3389/frai.2019.00011. eCollection 2019.

Abstract

There is a growing trend in regional dialectology to analyse large corpora of social media data, but it is unclear if the results of these studies can be generalized to language as a whole. To assess the generalizability of Twitter dialect maps, this paper presents the first systematic comparison of regional lexical variation in Twitter corpora and traditional survey data. We compare the regional patterns found in 139 lexical dialect maps based on a 1.8 billion word corpus of geolocated UK Twitter data and the BBC Voices dialect survey. A spatial analysis of these 139 map pairs finds a broad alignment between these two data sources, offering evidence that both approaches to data collection allow for the same basic underlying regional patterns to be identified. We argue that these results license the use of Twitter corpora for general inquiries into regional lexical variation and change.
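The comparison of map pairs described above can be illustrated with a minimal sketch. The abstract does not state which statistic the spatial analysis uses, so the Pearson correlation below is an assumption chosen for illustration, not the authors' method. The region names, counts, and function names (`variant_share`, `pearson`, `map_alignment`) are all hypothetical.

```python
from math import sqrt


def variant_share(counts):
    """Turn {region: (variant_count, total_count)} into {region: proportion}.

    Regions with a zero total are skipped because no proportion is defined.
    """
    return {
        region: variant / total
        for region, (variant, total) in counts.items()
        if total > 0
    }


def pearson(xs, ys):
    """Plain Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("a constant map has no defined correlation")
    return cov / sqrt(var_x * var_y)


def map_alignment(twitter_counts, survey_counts):
    """Correlate two maps of the same variant over the regions they share.

    Assumption: alignment is measured as a correlation of regional
    proportions; the source abstract does not specify the measure.
    """
    twitter = variant_share(twitter_counts)
    survey = variant_share(survey_counts)
    regions = sorted(set(twitter) & set(survey))
    if len(regions) < 2:
        raise ValueError("need at least two shared regions")
    return pearson(
        [twitter[r] for r in regions],
        [survey[r] for r in regions],
    )


# Invented toy counts for one lexical variant in three made-up regions.
toy_twitter = {"region_a": (80, 100), "region_b": (50, 100), "region_c": (20, 100)}
toy_survey = {"region_a": (9, 10), "region_b": (6, 10), "region_c": (1, 10)}

alignment = map_alignment(toy_twitter, toy_survey)
print(f"alignment for the toy map pair: {alignment:.3f}")
```

A value near 1 would mean the two sources rank the regions similarly for that variant. Repeating the calculation over every map pair would give a distribution of alignment scores rather than a single figure.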

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9da4/7861259/e13ef8b5fa4e/frai-02-00011-g0001.jpg