• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

哈哈哈,老兄,没错!:可拉伸单词的双参数特征以及打字错误和拼写错误的动态。

Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.

机构信息

Department of Mathematics and Statistics, University of Vermont, Burlington, VT, United States of America.

Vermont Complex Systems Center, University of Vermont, Burlington, VT, United States of America.

出版信息

PLoS One. 2020 May 27;15(5):e0232938. doi: 10.1371/journal.pone.0232938. eCollection 2020.

DOI:10.1371/journal.pone.0232938
PMID:32459802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7252599/
Abstract

Stretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of 'stretchable words' found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, 'balance' and 'stretch', that capture their main characteristics, and explore their dynamics by creating visual tools we call 'balance plots' and 'spelling trees'. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics.

摘要

拉长的单词,如“heellllp”或“heyyyyy”,是口语中的常见特征,常用于强调或夸大词根的含义。虽然拉长的单词在正式书面语言和词典中很少见,但它们在社交媒体中很常见。在本文中,我们研究了在大约 1000 亿条推文作者在 8 年时间内的频率分布,这些推文都使用了拉长的单词。我们引入了两个核心参数,“平衡”和“拉伸”,来捕捉它们的主要特征,并通过创建我们称之为“平衡图”和“拼写树”的可视化工具来探索它们的动态。我们讨论了如何使用我们在这里开发的工具和方法来研究错别字和拼写错误的统计模式,并将其作为涉及可拉伸单词的其他语言研究的基础,以及在字典增强、语言处理和任何序列构建重要的领域中的潜在应用,例如遗传学。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/a0eb6c262956/pone.0232938.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/1fcc419e0ef2/pone.0232938.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/9c5e0f430785/pone.0232938.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/bf7785659dda/pone.0232938.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/06a2a1f2e544/pone.0232938.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/1d9f7ff6e252/pone.0232938.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/cbce8db13b12/pone.0232938.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/2cf6bf33d99f/pone.0232938.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/28f107418211/pone.0232938.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/9e321a219db0/pone.0232938.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/2ac07b6b9c98/pone.0232938.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/efc97833cd36/pone.0232938.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/551c8bfe39c9/pone.0232938.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/a0eb6c262956/pone.0232938.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/1fcc419e0ef2/pone.0232938.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/9c5e0f430785/pone.0232938.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/bf7785659dda/pone.0232938.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/06a2a1f2e544/pone.0232938.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/1d9f7ff6e252/pone.0232938.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/cbce8db13b12/pone.0232938.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/2cf6bf33d99f/pone.0232938.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/28f107418211/pone.0232938.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/9e321a219db0/pone.0232938.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/2ac07b6b9c98/pone.0232938.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/efc97833cd36/pone.0232938.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/551c8bfe39c9/pone.0232938.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/870c/7252599/a0eb6c262956/pone.0232938.g013.jpg

相似文献

1
Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.哈哈哈,老兄,没错!:可拉伸单词的双参数特征以及打字错误和拼写错误的动态。
PLoS One. 2020 May 27;15(5):e0232938. doi: 10.1371/journal.pone.0232938. eCollection 2020.
2
A Multilinguistic Approach to Evaluating Student Spelling in Writing Samples.一种评估写作样本中学生拼写的多语言方法。
Lang Speech Hear Serv Sch. 2018 Jul 5;49(3):509-523. doi: 10.1044/2018_LSHSS-17-0095.
3
The language of well-being: Tracking fluctuations in emotion experience through everyday speech.幸福感的语言:通过日常言语追踪情绪体验的波动。
J Pers Soc Psychol. 2020 Feb;118(2):364-387. doi: 10.1037/pspp0000244. Epub 2019 Apr 4.
4
The strong, the weak, and the first: The impact of phonological stress on processing of orthographic errors in silent reading.强者、弱者与首因效应:语音重音对默读中拼写错误处理的影响
Brain Res. 2016 Apr 1;1636:208-218. doi: 10.1016/j.brainres.2016.01.003. Epub 2016 Jan 12.
5
THE INFLUENCE OF SYLLABIFICATION RULES IN L1 ON L2 WORD RECOGNITION.母语音节划分规则对二语单词识别的影响。
Psychol Rep. 2015 Oct;117(2):535-53. doi: 10.2466/28.PR0.117c17z9. Epub 2015 Sep 4.
6
Spelling patterns in preadolescents with atypical language skills: phonological, morphological, and orthographic factors.具有非典型语言技能的青春期前儿童的拼写模式:语音、形态和正字法因素。
Dev Neuropsychol. 2006;29(1):93-123. doi: 10.1207/s15326942dn2901_6.
7
Single or dual orthographic representations for reading and spelling? A study of Italian dyslexic-dysgraphic and normal children.阅读和拼写的单一或双重正字法表示?对意大利阅读障碍和正常儿童的研究。
Cogn Neuropsychol. 2010;27(4):305-33. doi: 10.1080/02643294.2010.543539. Epub 2011 Jan 12.
8
The acquisition of allophonic rules: statistical learning with linguistic constraints.音位变体规则的习得:受语言限制的统计学习。
Cognition. 2006 Oct;101(3):B31-41. doi: 10.1016/j.cognition.2005.10.006. Epub 2005 Dec 20.
9
[Spelling development in the Spanish language].[西班牙语的拼写发展]
Psicothema. 2008 Nov;20(4):786-94.
10
The Basis of the Adoption of Borrowed Letters in the Kazakh Alphabet.哈萨克语字母中采用外来字母的依据。
J Psycholinguist Res. 2023 Dec;52(6):2979-2999. doi: 10.1007/s10936-023-10030-z. Epub 2023 Nov 11.

本文引用的文献

1
English verb regularization in books and tweets.图书和推文的英语动词正则化。
PLoS One. 2018 Dec 28;13(12):e0209651. doi: 10.1371/journal.pone.0209651. eCollection 2018.
2
Mapping the Americanization of English in space and time.描绘英语在时空上的美国化。
PLoS One. 2018 May 25;13(5):e0197741. doi: 10.1371/journal.pone.0197741. eCollection 2018.
3
Crowdsourcing dialect characterization through Twitter.通过推特众包方言特征分析
PLoS One. 2014 Nov 19;9(11):e112074. doi: 10.1371/journal.pone.0112074. eCollection 2014.
4
Diffusion of lexical change in social media.社交媒体中词汇变化的传播
PLoS One. 2014 Nov 19;9(11):e113114. doi: 10.1371/journal.pone.0113114. eCollection 2014.
5
Positivity of the English language.英语的积极性。
PLoS One. 2012;7(1):e29484. doi: 10.1371/journal.pone.0029484. Epub 2012 Jan 11.
6
Efficiency of coding in macaque vocal communication.猕猴声音通讯中的编码效率。
Biol Lett. 2010 Aug 23;6(4):469-71. doi: 10.1098/rsbl.2009.1062. Epub 2010 Jan 27.