• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

社交媒体中词汇变化的传播

Diffusion of lexical change in social media.

作者信息

Eisenstein Jacob, O'Connor Brendan, Smith Noah A, Xing Eric P

机构信息

School of Interactive Computing, Georgia Institute of Technology, Atlanta, Georgia, United States of America.

School of Computer Science, University of Massachusetts, Amherst, Massachusetts, United States of America.

出版信息

PLoS One. 2014 Nov 19;9(11):e113114. doi: 10.1371/journal.pone.0113114. eCollection 2014.

DOI:10.1371/journal.pone.0113114
PMID:25409166
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4237389/
Abstract

Computer-mediated communication is driving fundamental changes in the nature of written language. We investigate these changes by statistical analysis of a dataset comprising 107 million Twitter messages (authored by 2.7 million unique user accounts). Using a latent vector autoregressive model to aggregate across thousands of words, we identify high-level patterns in diffusion of linguistic change over the United States. Our model is robust to unpredictable changes in Twitter's sampling rate, and provides a probabilistic characterization of the relationship of macro-scale linguistic influence to a set of demographic and geographic predictors. The results of this analysis offer support for prior arguments that focus on geographical proximity and population size. However, demographic similarity - especially with regard to race - plays an even more central role, as cities with similar racial demographics are far more likely to share linguistic influence. Rather than moving towards a single unified "netspeak" dialect, language evolution in computer-mediated communication reproduces existing fault lines in spoken American English.

摘要

计算机介导的交流正在推动书面语言本质的根本性变化。我们通过对一个包含1.07亿条推特消息(由270万个唯一用户账户撰写)的数据集进行统计分析来研究这些变化。使用潜向量自回归模型对数千个单词进行汇总,我们识别出美国语言变化传播中的高层次模式。我们的模型对推特采样率的不可预测变化具有鲁棒性,并提供了宏观语言影响与一组人口和地理预测因素之间关系的概率表征。该分析结果为先前关注地理 proximity 和人口规模的论点提供了支持。然而,人口统计学上的相似性——尤其是在种族方面——发挥着更为核心的作用,因为具有相似种族人口统计学特征的城市更有可能共享语言影响。计算机介导的交流中的语言演变并非朝着单一统一的“网络语言”方言发展,而是再现了美国英语口语中现有的断层线。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/1ef91a0a9bd7/pone.0113114.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/bab9c357ab58/pone.0113114.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/66004fc4e640/pone.0113114.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/98c0bb051e42/pone.0113114.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/1ef91a0a9bd7/pone.0113114.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/bab9c357ab58/pone.0113114.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/66004fc4e640/pone.0113114.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/98c0bb051e42/pone.0113114.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca31/4237389/1ef91a0a9bd7/pone.0113114.g007.jpg

相似文献

1
Diffusion of lexical change in social media.社交媒体中词汇变化的传播
PLoS One. 2014 Nov 19;9(11):e113114. doi: 10.1371/journal.pone.0113114. eCollection 2014.
2
Do Global Cities Enable Global Views? Using Twitter to Quantify the Level of Geographical Awareness of U.S. Cities.全球城市能带来全球视野吗?利用推特量化美国城市的地理认知水平。
PLoS One. 2015 Jul 13;10(7):e0132464. doi: 10.1371/journal.pone.0132464. eCollection 2015.
3
Using Social Media to Predict Food Deserts in the United States: Infodemiology Study of Tweets.利用社交媒体预测美国的食物荒漠:推特信息流行病学研究。
JMIR Public Health Surveill. 2022 Jul 5;8(7):e34285. doi: 10.2196/34285.
4
Geolocation of multiple sociolinguistic markers in Buenos Aires.布宜诺斯艾利斯多种社会语言标记的定位。
PLoS One. 2022 Sep 9;17(9):e0274114. doi: 10.1371/journal.pone.0274114. eCollection 2022.
5
The human geography of Twitter: Quantifying regional identity and inter-region communication in England and Wales.推特上的人文地理学:定量分析英格兰和威尔士的地区特征和地区间交流。
PLoS One. 2019 Apr 15;14(4):e0214466. doi: 10.1371/journal.pone.0214466. eCollection 2019.
6
Examining Twitter-Derived Negative Racial Sentiment as Indicators of Cultural Racism: Observational Associations With Preterm Birth and Low Birth Weight Among a Multiracial Sample of Mothers, 2011-2021.考察推特衍生的负面种族情绪作为文化种族主义的指标:2011-2021 年间,多民族母亲样本中与早产和低出生体重的观察性关联。
J Med Internet Res. 2023 Apr 28;25:e44990. doi: 10.2196/44990.
7
The Association Between State-Level Racial Attitudes Assessed From Twitter Data and Adverse Birth Outcomes: Observational Study.从 Twitter 数据评估的州级种族态度与不良生育结局之间的关联:观察性研究。
JMIR Public Health Surveill. 2020 Jul 6;6(3):e17103. doi: 10.2196/17103.
8
Methods to Establish Race or Ethnicity of Twitter Users: Scoping Review.方法建立种族或族裔的 Twitter 用户:范围审查。
J Med Internet Res. 2022 Apr 29;24(4):e35788. doi: 10.2196/35788.
9
Palliative care experiences of adult cancer patients from ethnocultural groups: a qualitative systematic review protocol.不同种族文化群体成年癌症患者的姑息治疗体验:一项定性系统评价方案
JBI Database System Rev Implement Rep. 2015 Jan;13(1):99-111. doi: 10.11124/jbisrir-2015-1809.
10
Twitter and public health.推特与公共卫生。
J Public Health Manag Pract. 2015 Jul-Aug;21(4):375-83. doi: 10.1097/PHH.0000000000000041.

引用本文的文献

1
Semantic change in adults is not primarily a generational phenomenon.成年人的语义变化并非主要是一种代际现象。
Proc Natl Acad Sci U S A. 2025 Aug 5;122(31):e2426815122. doi: 10.1073/pnas.2426815122. Epub 2025 Jul 28.
2
When dialects collide: how socioeconomic mixing affects language use.当方言碰撞时:社会经济融合如何影响语言使用。
EPJ Data Sci. 2025;14(1):47. doi: 10.1140/epjds/s13688-025-00563-9. Epub 2025 Jul 10.
3
The sociolinguistic foundations of language modeling.语言建模的社会语言学基础。

本文引用的文献

1
Personality, gender, and age in the language of social media: the open-vocabulary approach.社交媒体语言中的个性、性别和年龄:开放词汇方法。
PLoS One. 2013 Sep 25;8(9):e73791. doi: 10.1371/journal.pone.0073791. eCollection 2013.
2
The geography of happiness: connecting twitter sentiment and expression, demographics, and objective characteristics of place.幸福地理学:连接推特情绪和表达、人口统计学以及地点的客观特征。
PLoS One. 2013 May 29;8(5):e64417. doi: 10.1371/journal.pone.0064417. Print 2013.
3
Principles of parametric estimation in modeling language competition.
Front Artif Intell. 2025 Jan 13;7:1472411. doi: 10.3389/frai.2024.1472411. eCollection 2024.
4
Patterns of linguistic simplification on social media platforms over time.随着时间推移,社交媒体平台上语言简化的模式。
Proc Natl Acad Sci U S A. 2024 Dec 10;121(50):e2412105121. doi: 10.1073/pnas.2412105121. Epub 2024 Dec 6.
5
Lexical innovations are rarely passed on during one's lifetime: Epidemiological perspectives on estimating the basic reproductive ratio of words.词汇创新在一个人的一生中很少会传承下去:关于估计词汇基本繁殖率的流行病学观点。
PLoS One. 2024 Dec 5;19(12):e0312336. doi: 10.1371/journal.pone.0312336. eCollection 2024.
6
African American English intensifier : Using twitter to investigate syntactic change in low-frequency forms.非裔美国英语强化词:利用推特研究低频形式的句法变化。
Front Artif Intell. 2023 Jan 27;5:683104. doi: 10.3389/frai.2022.683104. eCollection 2022.
7
Social Networks of Lexical Innovation. Investigating the Social Dynamics of Diffusion of Neologisms on Twitter.词汇创新的社交网络。探究推特上新词传播的社会动态。
Front Artif Intell. 2021 Nov 1;4:648583. doi: 10.3389/frai.2021.648583. eCollection 2021.
8
Socioeconomic Patterns of Twitter User Activity.推特用户活动的社会经济模式。
Entropy (Basel). 2021 Jun 19;23(6):780. doi: 10.3390/e23060780.
9
Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary.对比词汇扩散系数:量化平凡事物的粘性
Proc Int World Wide Web Conf. 2021 Apr;2021:565-574. doi: 10.1145/3442381.3449819.
10
Digital Articulation: Examining Text-Based Linguistic Performances in Mobile Communication Through Keystroke-Logging Analysis.数字表达:通过按键记录分析研究移动通信中基于文本的语言表现。
Front Artif Intell. 2021 Jan 25;3:539920. doi: 10.3389/frai.2020.539920. eCollection 2020.
建模语言竞争中的参数估计原理。
Proc Natl Acad Sci U S A. 2013 Jun 11;110(24):9698-703. doi: 10.1073/pnas.1303108110. Epub 2013 May 28.
4
A 61-million-person experiment in social influence and political mobilization.一项涉及 6100 万人的社会影响和政治动员实验。
Nature. 2012 Sep 13;489(7415):295-8. doi: 10.1038/nature11421.
5
Mapping the origins and expansion of the Indo-European language family.绘制印欧语系的起源和扩张图。
Science. 2012 Aug 24;337(6097):957-60. doi: 10.1126/science.1219669.
6
Identifying influential and susceptible members of social networks.识别社交网络中的有影响力和易感染成员。
Science. 2012 Jul 20;337(6092):337-41. doi: 10.1126/science.1215842. Epub 2012 Jun 21.
7
Temporal patterns of happiness and information in a global social network: hedonometrics and Twitter.全球社交网络中的快乐和信息的时间模式:快乐计量学和 Twitter。
PLoS One. 2011;6(12):e26752. doi: 10.1371/journal.pone.0026752. Epub 2011 Dec 7.
8
Quantitative social dialectology: explaining linguistic variation geographically and socially.定量社会方言学:从地理和社会角度解释语言变异。
PLoS One. 2011;6(9):e23613. doi: 10.1371/journal.pone.0023613. Epub 2011 Sep 1.
9
Modeling users' activity on twitter networks: validation of Dunbar's number.在推特网络上模拟用户的活动:邓巴数的验证。
PLoS One. 2011;6(8):e22656. doi: 10.1371/journal.pone.0022656. Epub 2011 Aug 3.
10
Evolved structure of language shows lineage-specific trends in word-order universals.语言的进化结构显示了语序共性在谱系特异性方面的趋势。
Nature. 2011 May 5;473(7345):79-82. doi: 10.1038/nature09923. Epub 2011 Apr 13.