• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

社交媒体语言中的个性、性别和年龄:开放词汇方法。

Personality, gender, and age in the language of social media: the open-vocabulary approach.

机构信息

Positive Psychology Center, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America ; Computer & Information Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

出版信息

PLoS One. 2013 Sep 25;8(9):e73791. doi: 10.1371/journal.pone.0073791. eCollection 2013.

DOI:10.1371/journal.pone.0073791
PMID:24086296
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3783449/
Abstract

We analyzed 700 million words, phrases, and topic instances collected from the Facebook messages of 75,000 volunteers, who also took standard personality tests, and found striking variations in language with personality, gender, and age. In our open-vocabulary technique, the data itself drives a comprehensive exploration of language that distinguishes people, finding connections that are not captured with traditional closed-vocabulary word-category analyses. Our analyses shed new light on psychosocial processes yielding results that are face valid (e.g., subjects living in high elevations talk about the mountains), tie in with other research (e.g., neurotic people disproportionately use the phrase 'sick of' and the word 'depressed'), suggest new hypotheses (e.g., an active life implies emotional stability), and give detailed insights (males use the possessive 'my' when mentioning their 'wife' or 'girlfriend' more often than females use 'my' with 'husband' or 'boyfriend'). To date, this represents the largest study, by an order of magnitude, of language and personality.

摘要

我们分析了从 75000 名志愿者的 Facebook 消息中收集到的 7 亿个单词、短语和主题实例,这些志愿者还接受了标准的个性测试,发现语言在个性、性别和年龄方面存在显著差异。在我们的开放式词汇技术中,数据本身驱动了对语言的全面探索,这种探索可以区分人群,发现传统的封闭式词汇词类分析无法捕捉到的联系。我们的分析为社会心理过程提供了新的视角,得出的结果具有直观性(例如,居住在高海拔地区的人会谈论山脉),与其他研究结果相吻合(例如,神经质的人过度使用“sick of”和“depressed”这两个词),提出了新的假设(例如,积极的生活意味着情绪稳定),并提供了详细的见解(男性在提到他们的“妻子”或“女朋友”时比女性更频繁地使用“my”,而女性在提到“丈夫”或“男朋友”时则较少使用“my”)。到目前为止,这是语言和个性方面规模最大的研究,数量级提高了一个数量级。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/f10d482fed92/pone.0073791.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/f37e7027cd03/pone.0073791.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/b9cd65388ddd/pone.0073791.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/e9bf354faf1e/pone.0073791.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/62b5e4a1ac2d/pone.0073791.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/e73d0d7c57c0/pone.0073791.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/f10d482fed92/pone.0073791.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/f37e7027cd03/pone.0073791.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/b9cd65388ddd/pone.0073791.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/e9bf354faf1e/pone.0073791.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/62b5e4a1ac2d/pone.0073791.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/e73d0d7c57c0/pone.0073791.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6762/3783449/f10d482fed92/pone.0073791.g006.jpg

相似文献

1
Personality, gender, and age in the language of social media: the open-vocabulary approach.社交媒体语言中的个性、性别和年龄:开放词汇方法。
PLoS One. 2013 Sep 25;8(9):e73791. doi: 10.1371/journal.pone.0073791. eCollection 2013.
2
The online social self: an open vocabulary approach to personality.网络社交自我:一种研究人格的开放词汇方法。
Assessment. 2014 Apr;21(2):158-69. doi: 10.1177/1073191113514104. Epub 2013 Dec 8.
3
From "Sooo excited!!!" to "So proud": using language to study development.从“太兴奋了!!!”到“太自豪了”:用语言研究发展。
Dev Psychol. 2014 Jan;50(1):178-88. doi: 10.1037/a0035048. Epub 2013 Nov 25.
4
Closed- and open-vocabulary approaches to text analysis: A review, quantitative comparison, and recommendations.封闭式和开放式词汇方法在文本分析中的应用:综述、定量比较和建议。
Psychol Methods. 2021 Aug;26(4):398-427. doi: 10.1037/met0000349.
5
Latent human traits in the language of social media: An open-vocabulary approach.社交媒体语言中的潜在人类特征:一种开放式词汇方法。
PLoS One. 2018 Nov 28;13(11):e0201703. doi: 10.1371/journal.pone.0201703. eCollection 2018.
6
Automatic personality assessment through social media language.通过社交媒体语言进行自动人格评估。
J Pers Soc Psychol. 2015 Jun;108(6):934-52. doi: 10.1037/pspp0000020. Epub 2014 Nov 3.
7
Women are Warmer but No Less Assertive than Men: Gender and Language on Facebook.女性比男性更热情,但同样坚定:脸书上的性别与语言
PLoS One. 2016 May 25;11(5):e0155885. doi: 10.1371/journal.pone.0155885. eCollection 2016.
8
The language of character strengths: Predicting morally valued traits on social media.品德优势语言:社交媒体上道德价值观特质的预测。
J Pers. 2020 Apr;88(2):287-306. doi: 10.1111/jopy.12491. Epub 2019 May 29.
9
Cross-Cultural Sex/Gender Differences in Produced Word Content Before the Age of 3 Years.3 岁前产生的词汇内容中的跨文化性别差异。
Psychol Sci. 2023 Apr;34(4):411-423. doi: 10.1177/09567976221146537. Epub 2023 Feb 2.
10
Personality expression in Chinese language use.汉语使用中的个性表达。
Int J Psychol. 2017 Dec;52(6):463-472. doi: 10.1002/ijop.12259. Epub 2016 Feb 10.

引用本文的文献

1
Current applications and future directions in natural language processing for news media and mental health.新闻媒体与心理健康领域自然语言处理的当前应用及未来发展方向
Sci Rep. 2025 Sep 15;15(1):32532. doi: 10.1038/s41598-025-18413-z.
2
On sex, drugs, and alcohol: a mixed-method analysis of youth posts on social media in the United States.关于性、毒品和酒精:对美国年轻人社交媒体帖子的混合方法分析。
J Child Media. 2022;16(4):514-531. doi: 10.1080/17482798.2022.2059537. Epub 2022 Apr 6.
3
User preferences in multi-objective routes: The role of gradient visualization and personality measures.

本文引用的文献

1
The Power of Personality: The Comparative Validity of Personality Traits, Socioeconomic Status, and Cognitive Ability for Predicting Important Life Outcomes.人格的力量:人格特质、社会经济地位和认知能力在预测重要人生结果方面的比较效度
Perspect Psychol Sci. 2007 Dec;2(4):313-45. doi: 10.1111/j.1745-6916.2007.00047.x.
2
Private traits and attributes are predictable from digital records of human behavior.个人特质和属性可从人类行为的数字记录中预测出来。
Proc Natl Acad Sci U S A. 2013 Apr 9;110(15):5802-5. doi: 10.1073/pnas.1218772110. Epub 2013 Mar 11.
3
Temporal patterns of happiness and information in a global social network: hedonometrics and Twitter.
多目标路线中的用户偏好:梯度可视化和个性测量的作用。
PLoS One. 2025 Aug 6;20(8):e0329387. doi: 10.1371/journal.pone.0329387. eCollection 2025.
4
The narratives of war (NoW) corpus of written testimonies of the Russia-Ukraine war.俄乌战争书面证词的战争叙事(NoW)语料库。
Lang Resour Eval. 2025;59(3):2415-2426. doi: 10.1007/s10579-025-09813-8. Epub 2025 Feb 19.
5
Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction.用于人格特质预测的大语言模型嵌入的心理测量评估
J Med Internet Res. 2025 Jul 8;27:e75347. doi: 10.2196/75347.
6
The construction of emotional meaning in language.语言中情感意义的构建。
Commun Psychol. 2025 Jul 7;3(1):99. doi: 10.1038/s44271-025-00255-0.
7
On the conversational persuasiveness of GPT-4.论GPT-4的对话说服力。
Nat Hum Behav. 2025 May 19. doi: 10.1038/s41562-025-02194-6.
8
Using the Language of elite athletes to predict their personality and on court transgressions.运用精英运动员的语言来预测他们的个性和场上违规行为。
Sci Rep. 2025 May 16;15(1):17002. doi: 10.1038/s41598-025-99667-5.
9
Computational analysis of 100 K choice dilemmas: Decision attributes, trade-off structures, and model-based prediction.10万个选择困境的计算分析:决策属性、权衡结构及基于模型的预测
Proc Natl Acad Sci U S A. 2025 Apr 29;122(17):e2406489122. doi: 10.1073/pnas.2406489122. Epub 2025 Apr 21.
10
Inferring Depression and Its Semantic Underpinnings from Simple Lexical Choices.从简单的词汇选择推断抑郁症及其语义基础。
Depress Anxiety. 2024 Mar 28;2024:3010831. doi: 10.1155/2024/3010831. eCollection 2024.
全球社交网络中的快乐和信息的时间模式:快乐计量学和 Twitter。
PLoS One. 2011;6(12):e26752. doi: 10.1371/journal.pone.0026752. Epub 2011 Dec 7.
4
Diurnal and seasonal mood vary with work, sleep, and daylength across diverse cultures.在不同的文化中,昼夜节律和季节性情绪会随工作、睡眠和日照时间的变化而变化。
Science. 2011 Sep 30;333(6051):1878-81. doi: 10.1126/science.1202775.
5
Sociology. Social scientists wade into the tweet stream.社会学。社会科学家涉足推特信息流。
Science. 2011 Sep 30;333(6051):1814-5. doi: 10.1126/science.333.6051.1814.
6
Web of war.战争之网
Nature. 2011 Mar 31;471(7340):566-8. doi: 10.1038/471566a.
7
Quantitative analysis of culture using millions of digitized books.利用数百万本数字化书籍进行文化的定量分析。
Science. 2011 Jan 14;331(6014):176-82. doi: 10.1126/science.1199644. Epub 2010 Dec 16.
8
Personality in 100,000 Words: A large-scale analysis of personality and word use among bloggers.十万字中的个性:对博主个性与词汇使用的大规模分析
J Res Pers. 2010 Jun 1;44(3):363-373. doi: 10.1016/j.jrp.2010.04.001.
9
Facebook profiles reflect actual personality, not self-idealization.脸书个人资料反映的是真实个性,而非自我理想化。
Psychol Sci. 2010 Mar;21(3):372-4. doi: 10.1177/0956797609360756. Epub 2010 Jan 29.
10
Social science. Computational social science.社会科学。计算社会科学。
Science. 2009 Feb 6;323(5915):721-3. doi: 10.1126/science.1167742.