• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在线阅读习惯能够揭示个性特征:迈向心理微观定位检测

Online reading habits can reveal personality traits: towards detecting psychological microtargeting.

作者信息

Simchon Almog, Sutton Adam, Edwards Matthew, Lewandowsky Stephan

机构信息

School of Psychological Science, University of Bristol, Bristol BS8 1QU, UK.

Department of Computer Science, University of Bristol, Bristol BS8 1QU, UK.

出版信息

PNAS Nexus. 2023 Jun 7;2(6):pgad191. doi: 10.1093/pnasnexus/pgad191. eCollection 2023 Jun.

DOI:10.1093/pnasnexus/pgad191
PMID:37333766
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10276193/
Abstract

Building on big data from , we generated two computational text models: (i) Predicting the personality of users from the text they have written and (ii) predicting the personality of users based on the text they have consumed. The second model is novel and without precedent in the literature. We recruited active Reddit users () of fiction-writing communities. The participants completed a Big Five personality questionnaire and consented for their Reddit activity to be scraped and used to create a machine learning model. We trained an natural language processing model [Bidirectional Encoder Representations from Transformers (BERT)], predicting personality from produced text (average performance: ). We then applied this model to a new set of Reddit users (), predicted their personality based on their produced text, and trained a second BERT model to predict their predicted-personality scores based on consumed text (average performance: ). By doing so, we provide the first glimpse into the linguistic markers of personality-congruent consumed content.

摘要

基于来自……的大数据,我们生成了两个计算文本模型:(i)根据用户所写文本预测其个性,以及(ii)根据用户所消费的文本预测其个性。第二个模型是新颖的,在文献中没有先例。我们招募了Reddit上小说写作社区的活跃用户(……)。参与者完成了一份大五人格问卷,并同意对他们在Reddit上的活动进行抓取,用于创建一个机器学习模型。我们训练了一个自然语言处理模型[来自变换器的双向编码器表示(BERT)],根据所生成的文本预测个性(平均性能:……)。然后,我们将这个模型应用于一组新的Reddit用户(……),根据他们所生成的文本预测他们的个性,并训练第二个BERT模型,根据所消费的文本预测他们的预测个性分数(平均性能:……)。通过这样做,我们首次初步了解了与个性相符的消费内容的语言标记。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/9449ff6c9fc0/pgad191f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/7c85aeb48014/pgad191f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/aa12b7c390f8/pgad191f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/7018092ea577/pgad191f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/9449ff6c9fc0/pgad191f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/7c85aeb48014/pgad191f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/aa12b7c390f8/pgad191f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/7018092ea577/pgad191f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e58/10276193/9449ff6c9fc0/pgad191f4.jpg

相似文献

1
Online reading habits can reveal personality traits: towards detecting psychological microtargeting.在线阅读习惯能够揭示个性特征:迈向心理微观定位检测
PNAS Nexus. 2023 Jun 7;2(6):pgad191. doi: 10.1093/pnasnexus/pgad191. eCollection 2023 Jun.
2
Understanding Mental Health Issues in Different Subdomains of Social Networking Services: Computational Analysis of Text-Based Reddit Posts.理解不同社交网络服务领域的心理健康问题:基于文本的 Reddit 帖子的计算分析。
J Med Internet Res. 2023 Nov 30;25:e49074. doi: 10.2196/49074.
3
Towards Transfer Learning Techniques-BERT, DistilBERT, BERTimbau, and DistilBERTimbau for Automatic Text Classification from Different Languages: A Case Study.面向迁移学习技术——BERT、DistilBERT、BERTimbau 和 DistilBERTimbau 用于来自不同语言的自动文本分类:案例研究。
Sensors (Basel). 2022 Oct 26;22(21):8184. doi: 10.3390/s22218184.
4
Explainable Personality Prediction Using Answers to Open-Ended Interview Questions.利用开放式面试问题的答案进行可解释的人格预测
Front Psychol. 2022 Nov 18;13:865841. doi: 10.3389/fpsyg.2022.865841. eCollection 2022.
5
Extracting Multiple Worries From Breast Cancer Patient Blogs Using Multilabel Classification With the Natural Language Processing Model Bidirectional Encoder Representations From Transformers: Infodemiology Study of Blogs.使用基于Transformer的自然语言处理模型双向编码器表征的多标签分类从乳腺癌患者博客中提取多种担忧:博客的信息流行病学研究
JMIR Cancer. 2022 Jun 3;8(2):e37840. doi: 10.2196/37840.
6
Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext + CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study.基于RoBERTa-WWM-ext + CNN(带有全词掩码扩展的基于变换器预训练方法的稳健优化双向编码器表示与卷积神经网络相结合)模型的医患对话多标签分类:命名实体研究
JMIR Med Inform. 2022 Apr 21;10(4):e35606. doi: 10.2196/35606.
7
Predicting Age Groups of Reddit Users Based on Posting Behavior and Metadata: Classification Model Development and Validation.基于发帖行为和元数据预测 Reddit 用户年龄组:分类模型的开发和验证。
JMIR Public Health Surveill. 2021 Mar 16;7(3):e25807. doi: 10.2196/25807.
8
A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.一种基于 Bi-LSTM 和 BERT 模型的心理健康预测新型文本挖掘方法。
Comput Intell Neurosci. 2022 Mar 3;2022:7893775. doi: 10.1155/2022/7893775. eCollection 2022.
9
Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation.通过语言模型检测基于网络论坛用户的抑郁症实现早期医疗干预:纵向分析与评估
JMIR AI. 2023 Mar 24;2:e41205. doi: 10.2196/41205.
10
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.

引用本文的文献

1
Advancing psychological assessment: quantifying self-compassion through free-text responses and language model BERT.推进心理评估:通过自由文本回答和语言模型BERT量化自我同情。
Sci Rep. 2025 Jul 16;15(1):25778. doi: 10.1038/s41598-025-02801-6.
2
Warning people that they are being microtargeted fails to eliminate persuasive advantage.警告人们他们正成为微观目标受众并不能消除说服力优势。
Commun Psychol. 2025 Jan 29;3(1):15. doi: 10.1038/s44271-025-00188-8.
3
Predicting attitudes toward ambiguity using natural language processing on free descriptions for open-ended question measurements.

本文引用的文献

1
A computational text analysis investigation of the relation between personal and linguistic agency.一项关于个人能动性与语言能动性之间关系的计算文本分析研究。
Commun Psychol. 2023 Sep 25;1(1):23. doi: 10.1038/s44271-023-00020-1.
2
Digital public health interventions at scale: The impact of social media advertising on beliefs and outcomes related to COVID vaccines.大规模数字化公共卫生干预措施:社交媒体广告对 COVID 疫苗相关信念和结果的影响。
Proc Natl Acad Sci U S A. 2023 Jan 31;120(5):e2208110120. doi: 10.1073/pnas.2208110120. Epub 2023 Jan 26.
3
Natural language analyzed with AI-based transformers predict traditional subjective well-being measures approaching the theoretical upper limits in accuracy.
使用自然语言处理对开放式问题测量的自由描述进行歧义预测。
Sci Rep. 2024 Apr 9;14(1):8276. doi: 10.1038/s41598-024-59118-z.
4
The persuasive effects of political microtargeting in the age of generative artificial intelligence.生成式人工智能时代政治微观目标定位的说服效果。
PNAS Nexus. 2024 Jan 29;3(2):pgae035. doi: 10.1093/pnasnexus/pgae035. eCollection 2024 Feb.
基于人工智能的转换器分析的自然语言在准确性上接近传统主观幸福感测量的理论上限。
Sci Rep. 2022 Mar 10;12(1):3918. doi: 10.1038/s41598-022-07520-w.
4
Closed- and open-vocabulary approaches to text analysis: A review, quantitative comparison, and recommendations.封闭式和开放式词汇方法在文本分析中的应用:综述、定量比较和建议。
Psychol Methods. 2021 Aug;26(4):398-427. doi: 10.1037/met0000349.
5
Boosting people's ability to detect microtargeted advertising.增强人们发现精准定向广告的能力。
Sci Rep. 2021 Jul 30;11(1):15541. doi: 10.1038/s41598-021-94796-z.
6
Beyond Openness to Experience and Conscientiousness: Testing links between lower-level personality traits and American political orientation.超越经验开放性和尽责性:检验低层级人格特质与美国政治倾向之间的联系。
J Pers. 2021 Aug;89(4):754-773. doi: 10.1111/jopy.12613. Epub 2021 Jan 12.
7
The small effects of political advertising are small regardless of context, message, sender, or receiver: Evidence from 59 real-time randomized experiments.无论背景、信息、发送者或接收者如何,政治广告的效果都很小:来自59项实时随机实验的证据。
Sci Adv. 2020 Sep 2;6(36). doi: 10.1126/sciadv.abc4046. Print 2020 Sep.
8
Field studies of psychologically targeted ads face threats to internal validity.针对心理的定向广告的实地研究面临内部效度方面的威胁。
Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):E5254-E5255. doi: 10.1073/pnas.1805363115. Epub 2018 May 18.
9
Reply to Eckles et al.: Facebook's optimization algorithms are highly unlikely to explain the effects of psychological targeting.回复埃克尔斯等人:脸书的优化算法极不可能解释心理定向的效果。
Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):E5256-E5257. doi: 10.1073/pnas.1806854115. Epub 2018 May 18.
10
Psychological targeting as an effective approach to digital mass persuasion.心理定位作为一种有效的数字大众说服方法。
Proc Natl Acad Sci U S A. 2017 Nov 28;114(48):12714-12719. doi: 10.1073/pnas.1710966114. Epub 2017 Nov 13.