行为科学开源大语言模型教程。

A tutorial on open-source large language models for behavioral science.

机构信息

University of Basel, Basel, Switzerland.

Max Planck Institute for Human Development, Berlin, Germany.

出版信息

Behav Res Methods. 2024 Dec;56(8):8214-8237. doi: 10.3758/s13428-024-02455-8. Epub 2024 Aug 15.

DOI:10.3758/s13428-024-02455-8

PMID:39147947

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11525391/

Abstract

Large language models (LLMs) have the potential to revolutionize behavioral science by accelerating and improving the research cycle, from conceptualization to data analysis. Unlike closed-source solutions, open-source frameworks for LLMs can enable transparency, reproducibility, and adherence to data protection standards, which gives them a crucial advantage for use in behavioral science. To help researchers harness the promise of LLMs, this tutorial offers a primer on the open-source Hugging Face ecosystem and demonstrates several applications that advance conceptual and empirical work in behavioral science, including feature extraction, fine-tuning of models for prediction, and generation of behavioral responses. Executable code is made available at github.com/Zak-Hussain/LLM4BeSci.git . Finally, the tutorial discusses challenges faced by research with (open-source) LLMs related to interpretability and safety and offers a perspective on future research at the intersection of language modeling and behavioral science.

摘要

大型语言模型 (LLMs) 有潜力通过加速和改进研究周期，从概念化到数据分析，从而彻底改变行为科学。与闭源解决方案不同，LLM 的开源框架可以实现透明度、可重复性和遵守数据保护标准，这为它们在行为科学中的应用提供了至关重要的优势。为了帮助研究人员利用 LLM 的潜力，本教程提供了对开源 Hugging Face 生态系统的简介，并展示了几个应用程序，这些应用程序推进了行为科学中的概念和实证工作，包括特征提取、模型微调以进行预测以及生成行为反应。可执行代码可在 github.com/Zak-Hussain/LLM4BeSci.git 上获得。最后，本教程讨论了与（开源）LLM 相关的可解释性和安全性研究所面临的挑战，并对语言模型和行为科学交叉领域的未来研究提供了一个视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/69a559a95d90/13428_2024_2455_Fig1_HTML.jpg

相似文献

A tutorial on open-source large language models for behavioral science.

Behav Res Methods. 2024 Dec;56(8):8214-8237. doi: 10.3758/s13428-024-02455-8. Epub 2024 Aug 15.

Open-Source Large Language Models in Radiology: A Review and Tutorial for Practical Research and Clinical Deployment.

Radiology. 2025 Jan;314(1):e241073. doi: 10.1148/radiol.241073.

MacBehaviour: An R package for behavioural experimentation on large language models.

Behav Res Methods. 2024 Dec 18;57(1):19. doi: 10.3758/s13428-024-02524-y.

[Large language models from OpenAI, Google, Meta, X and Co. : The role of "closed" and "open" models in radiology].

Radiologie (Heidelb). 2024 Oct;64(10):779-786. doi: 10.1007/s00117-024-01327-8. Epub 2024 Jun 7.

Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.

Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.

Behavioral science labs: How to solve the multi-user problem.

Behav Res Methods. 2024 Dec;56(8):8238-8258. doi: 10.3758/s13428-024-02467-4. Epub 2024 Aug 12.

Large language models for error detection in radiology reports: a comparative analysis between closed-source and privacy-compliant open-source models.

Eur Radiol. 2025 Feb 20. doi: 10.1007/s00330-025-11438-y.

Distilling large language models for matching patients to clinical trials.

J Am Med Inform Assoc. 2024 Sep 1;31(9):1953-1963. doi: 10.1093/jamia/ocae073.

Performance and Reproducibility of Large Language Models in Named Entity Recognition: Considerations for the Use in Controlled Environments.

Drug Saf. 2025 Mar;48(3):287-303. doi: 10.1007/s40264-024-01499-1. Epub 2024 Dec 11.

Leveraging Open-Source Large Language Models for Data Augmentation in Hospital Staff Surveys: Mixed Methods Study.

JMIR Med Educ. 2024 Nov 19;10:e51433. doi: 10.2196/51433.

引用本文的文献

Human Expertise and Large Language Model Embeddings in the Content Validity Assessment of Personality Tests.

Educ Psychol Meas. 2025 Aug 14:00131644251355485. doi: 10.1177/00131644251355485.

Examining Chat GPT with nonwords and machine psycholinguistic techniques.

PLoS One. 2025 Jun 6;20(6):e0325612. doi: 10.1371/journal.pone.0325612. eCollection 2025.

Using large language models to facilitate academic work in the psychological sciences.

Curr Psychol. 2025;44(9):7910-7918. doi: 10.1007/s12144-025-07438-2. Epub 2025 Jan 28.

Semantic embeddings reveal and address taxonomic incommensurability in psychological measurement.

Nat Hum Behav. 2025 Mar 11. doi: 10.1038/s41562-024-02089-y.

Predicting MBTI personality of YouTube users.

Sci Rep. 2025 Feb 28;15(1):7221. doi: 10.1038/s41598-025-85183-z.

SEMbeddings: how to evaluate model misfit before data collection using large-language models.

Front Psychol. 2025 Feb 4;15:1433339. doi: 10.3389/fpsyg.2024.1433339. eCollection 2024.

AI can outperform humans in predicting correlations between personality items.

Commun Psychol. 2025 Feb 12;3(1):23. doi: 10.1038/s44271-025-00205-w.

Mapping Mental Representations With Free Associations: A Tutorial Using the R Package associatoR.

J Cogn. 2025 Jan 6;8(1):3. doi: 10.5334/joc.407. eCollection 2025.

Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals.

Sustain Sci. 2024;19(5):1773-1787. doi: 10.1007/s11625-024-01516-3. Epub 2024 Jul 24.

Novel embeddings improve the prediction of risk perception.

EPJ Data Sci. 2024;13(1):38. doi: 10.1140/epjds/s13688-024-00478-x. Epub 2024 May 22.

本文引用的文献

Studying and improving reasoning in humans and machines.

Commun Psychol. 2024 Jun 3;2(1):51. doi: 10.1038/s44271-024-00091-8.

GPT is an effective tool for multilingual psychological text analysis.

Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2308950121. doi: 10.1073/pnas.2308950121. Epub 2024 Aug 12.

Novel embeddings improve the prediction of risk perception.

EPJ Data Sci. 2024;13(1):38. doi: 10.1140/epjds/s13688-024-00478-x. Epub 2024 May 22.

Living guidelines for generative AI - why scientists must oversee its use.

Nature. 2023 Oct;622(7984):693-696. doi: 10.1038/d41586-023-03266-1.

AI and science: what 1,600 researchers think.

Nature. 2023 Sep;621(7980):672-675. doi: 10.1038/d41586-023-02980-0.

A deep learning approach to personality assessment: Generalizing across items and expanding the reach of survey-based research.

J Pers Soc Psychol. 2024 Feb;126(2):312-331. doi: 10.1037/pspp0000480. Epub 2023 Sep 7.

Emergent analogical reasoning in large language models.

Nat Hum Behav. 2023 Sep;7(9):1526-1541. doi: 10.1038/s41562-023-01659-w. Epub 2023 Jul 31.

ChatGPT outperforms crowd workers for text-annotation tasks.

Proc Natl Acad Sci U S A. 2023 Jul 25;120(30):e2305016120. doi: 10.1073/pnas.2305016120. Epub 2023 Jul 18.

How do we know how smart AI systems are?

Science. 2023 Jul 14;381(6654):adj5957. doi: 10.1126/science.adj5957. Epub 2023 Jul 13.

The debate over understanding in AI's large language models.

Proc Natl Acad Sci U S A. 2023 Mar 28;120(13):e2215907120. doi: 10.1073/pnas.2215907120. Epub 2023 Mar 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

行为科学开源大语言模型教程。

A tutorial on open-source large language models for behavioral science.

机构信息

University of Basel, Basel, Switzerland.

Max Planck Institute for Human Development, Berlin, Germany.

出版信息

Behav Res Methods. 2024 Dec;56(8):8214-8237. doi: 10.3758/s13428-024-02455-8. Epub 2024 Aug 15.

DOI:10.3758/s13428-024-02455-8

PMID:39147947

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11525391/

Abstract

摘要

行为科学开源大语言模型教程。

A tutorial on open-source large language models for behavioral science.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

行为科学开源大语言模型教程。

A tutorial on open-source large language models for behavioral science.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献