Coles Nicholas A, Perz Bartosz, Behnke Maciej, Eichstaedt Johannes C, Kim Soo Hyung, Vu Tu N, Raman Chirag, Tejada Julian, Huynh Van-Thong, Zhang Guangyi, Cui Tanming, Podder Sharanyak, Chavda Rushi, Pandey Shubham, Upadhyay Arpit, Padilla-Buritica Jorge I, Barrera Causil Carlos J, Ji Linying, Dollack Felix, Kiyokawa Kiyoshi, Liu Huakun, Perusquia-Hernandez Monica, Uchiyama Hideaki, Wei Xin, Cao Houwei, Yang Ziqing, Iancarelli Alessia, McVeigh Kieran, Wang Yiyu, Berwian Isabel M, Chiu Jamie C, Mirea Dan-Mircea, Nook Erik C, Vartiainen Henna I, Whiting Claire, Cho Young Won, Chow Sy-Miin, Fisher Zachary F, Li Yanling, Xiong Xiaoyue, Shen Yuqi, Tagliazucchi Enzo, Bugnon Leandro A, Ospina Raydonal, Bruno Nicolas M, D'Amelio Tomas A, Zamberlan Federico, Mercado Diaz Luis R, Pinzon-Arenas Javier O, Posada-Quintero Hugo F, Bilalpur Maneesh, Hinduja Saurabh, Marmolejo-Ramos Fernando, Canavan Shaun, Jivnani Liza, Saganowski Stanisław
University of Florida, Gainesville, FL, USA.
Wrocław University of Science and Technology, Wrocław, Lower Silesia, Poland.
R Soc Open Sci. 2025 Jun 25;12(6):241778. doi: 10.1098/rsos.241778. eCollection 2025 Jun.
Researchers are increasingly using machine learning to study physiological markers of emotion. We evaluated the promises and limitations of this approach via a big team science competition. Twelve teams competed to predict self-reported affective experiences from a multimodal set of peripheral nervous system measures. Models were trained and tested in multiple ways: with data divided by participant, targeted emotion, induction method, and time. In 100% of tests, teams outperformed baseline models that made random predictions. In 46% of tests, teams also outperformed baseline models that relied on the simple average of ratings from the training datasets. More notably, results uncovered a methodological challenge: multiplicative constraints on generalizability. Inferences about the accuracy and theoretical implications of machine learning efforts depended not only on their architecture, but also on how they were trained, tested, and evaluated. For example, some teams performed better when tested on observations from the same (vs. different) subjects seen during training. Such results could be interpreted as evidence against claims of universality. However, such conclusions would be premature because other teams exhibited the opposite pattern. Taken together, results illustrate how big team science can be leveraged to understand the promises and limitations of machine learning methods in affective science and beyond.
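The evaluation scheme described above (splitting by participant, and comparing against random and training-mean baselines) can be sketched in a minimal toy example. All data, names, and parameters below are hypothetical illustrations, not the study's actual pipeline or measures:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: 10 subjects x 20 trials, one self-reported
# rating per trial on a 1-9 scale (the real study used multimodal
# peripheral physiology as predictors).
subjects = np.repeat(np.arange(10), 20)
ratings = rng.normal(loc=5.0, scale=1.5, size=subjects.size).clip(1, 9)

def subject_split(subjects, held_out):
    """Split by participant: held-out subjects never appear in training."""
    test = np.isin(subjects, held_out)
    return ~test, test

train_idx, test_idx = subject_split(subjects, held_out=[8, 9])

# Baseline 1: predict the simple average of training-set ratings.
mean_pred = np.full(test_idx.sum(), ratings[train_idx].mean())

# Baseline 2: predict uniformly at random on the rating scale.
rand_pred = rng.uniform(1, 9, size=test_idx.sum())

def rmse(pred):
    return float(np.sqrt(np.mean((pred - ratings[test_idx]) ** 2)))

print(f"mean-baseline RMSE:   {rmse(mean_pred):.2f}")
print(f"random-baseline RMSE: {rmse(rand_pred):.2f}")
```

The mean baseline is the harder benchmark: a team's model only demonstrates that physiology carries predictive signal if it beats this constant prediction, not merely the random one. The other splits the abstract mentions (by targeted emotion, induction method, or time) follow the same pattern with a different grouping variable.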