• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

行为科学开源大语言模型教程。

A tutorial on open-source large language models for behavioral science.

机构信息

University of Basel, Basel, Switzerland.

Max Planck Institute for Human Development, Berlin, Germany.

出版信息

Behav Res Methods. 2024 Dec;56(8):8214-8237. doi: 10.3758/s13428-024-02455-8. Epub 2024 Aug 15.

DOI:10.3758/s13428-024-02455-8
PMID:39147947
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11525391/
Abstract

Large language models (LLMs) have the potential to revolutionize behavioral science by accelerating and improving the research cycle, from conceptualization to data analysis. Unlike closed-source solutions, open-source frameworks for LLMs can enable transparency, reproducibility, and adherence to data protection standards, which gives them a crucial advantage for use in behavioral science. To help researchers harness the promise of LLMs, this tutorial offers a primer on the open-source Hugging Face ecosystem and demonstrates several applications that advance conceptual and empirical work in behavioral science, including feature extraction, fine-tuning of models for prediction, and generation of behavioral responses. Executable code is made available at github.com/Zak-Hussain/LLM4BeSci.git . Finally, the tutorial discusses challenges faced by research with (open-source) LLMs related to interpretability and safety and offers a perspective on future research at the intersection of language modeling and behavioral science.

摘要

大型语言模型 (LLMs) 有潜力通过加速和改进研究周期,从概念化到数据分析,从而彻底改变行为科学。与闭源解决方案不同,LLM 的开源框架可以实现透明度、可重复性和遵守数据保护标准,这为它们在行为科学中的应用提供了至关重要的优势。为了帮助研究人员利用 LLM 的潜力,本教程提供了对开源 Hugging Face 生态系统的简介,并展示了几个应用程序,这些应用程序推进了行为科学中的概念和实证工作,包括特征提取、模型微调以进行预测以及生成行为反应。可执行代码可在 github.com/Zak-Hussain/LLM4BeSci.git 上获得。最后,本教程讨论了与(开源)LLM 相关的可解释性和安全性研究所面临的挑战,并对语言模型和行为科学交叉领域的未来研究提供了一个视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/5ed5a9e51caa/13428_2024_2455_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/69a559a95d90/13428_2024_2455_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/8a9108d76cd6/13428_2024_2455_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/40431d16754f/13428_2024_2455_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/669bf8c6fac3/13428_2024_2455_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/defb2498711e/13428_2024_2455_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/0680544073e0/13428_2024_2455_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/5ed5a9e51caa/13428_2024_2455_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/69a559a95d90/13428_2024_2455_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/8a9108d76cd6/13428_2024_2455_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/40431d16754f/13428_2024_2455_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/669bf8c6fac3/13428_2024_2455_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/defb2498711e/13428_2024_2455_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/0680544073e0/13428_2024_2455_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0217/11525391/5ed5a9e51caa/13428_2024_2455_Fig7_HTML.jpg

相似文献

1
A tutorial on open-source large language models for behavioral science.行为科学开源大语言模型教程。
Behav Res Methods. 2024 Dec;56(8):8214-8237. doi: 10.3758/s13428-024-02455-8. Epub 2024 Aug 15.
2
Open-Source Large Language Models in Radiology: A Review and Tutorial for Practical Research and Clinical Deployment.放射学中的开源大语言模型:实践研究与临床应用综述及教程
Radiology. 2025 Jan;314(1):e241073. doi: 10.1148/radiol.241073.
3
MacBehaviour: An R package for behavioural experimentation on large language models.MacBehaviour:一个用于对大语言模型进行行为实验的R软件包。
Behav Res Methods. 2024 Dec 18;57(1):19. doi: 10.3758/s13428-024-02524-y.
4
[Large language models from OpenAI, Google, Meta, X and Co. : The role of "closed" and "open" models in radiology].[来自OpenAI、谷歌、Meta、X及其他公司的大语言模型:“封闭”和“开放”模型在放射学中的作用]
Radiologie (Heidelb). 2024 Oct;64(10):779-786. doi: 10.1007/s00117-024-01327-8. Epub 2024 Jun 7.
5
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.在卫生经济学与结果研究中使用生成式人工智能:技术与突破入门
Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.
6
Behavioral science labs: How to solve the multi-user problem.行为科学实验室:如何解决多用户问题。
Behav Res Methods. 2024 Dec;56(8):8238-8258. doi: 10.3758/s13428-024-02467-4. Epub 2024 Aug 12.
7
Large language models for error detection in radiology reports: a comparative analysis between closed-source and privacy-compliant open-source models.用于放射学报告错误检测的大语言模型:闭源模型与符合隐私规定的开源模型的对比分析
Eur Radiol. 2025 Feb 20. doi: 10.1007/s00330-025-11438-y.
8
Distilling large language models for matching patients to clinical trials.提炼大型语言模型以实现患者与临床试验的匹配。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1953-1963. doi: 10.1093/jamia/ocae073.
9
Performance and Reproducibility of Large Language Models in Named Entity Recognition: Considerations for the Use in Controlled Environments.大型语言模型在命名实体识别中的性能与可重复性:在受控环境中使用的考量
Drug Saf. 2025 Mar;48(3):287-303. doi: 10.1007/s40264-024-01499-1. Epub 2024 Dec 11.
10
Leveraging Open-Source Large Language Models for Data Augmentation in Hospital Staff Surveys: Mixed Methods Study.利用开源大型语言模型在医院员工调查中进行数据扩充:混合方法研究。
JMIR Med Educ. 2024 Nov 19;10:e51433. doi: 10.2196/51433.

引用本文的文献

1
Human Expertise and Large Language Model Embeddings in the Content Validity Assessment of Personality Tests.人格测试内容效度评估中的人类专业知识与大语言模型嵌入
Educ Psychol Meas. 2025 Aug 14:00131644251355485. doi: 10.1177/00131644251355485.
2
Examining Chat GPT with nonwords and machine psycholinguistic techniques.运用非词和机器心理语言学技术对ChatGPT进行研究。
PLoS One. 2025 Jun 6;20(6):e0325612. doi: 10.1371/journal.pone.0325612. eCollection 2025.
3
Using large language models to facilitate academic work in the psychological sciences.

本文引用的文献

1
Studying and improving reasoning in humans and machines.研究并改进人类和机器的推理能力。
Commun Psychol. 2024 Jun 3;2(1):51. doi: 10.1038/s44271-024-00091-8.
2
GPT is an effective tool for multilingual psychological text analysis.GPT 是一种用于多语言心理文本分析的有效工具。
Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2308950121. doi: 10.1073/pnas.2308950121. Epub 2024 Aug 12.
3
Novel embeddings improve the prediction of risk perception.新型嵌入改进了风险感知的预测。
使用大语言模型促进心理科学领域的学术工作。
Curr Psychol. 2025;44(9):7910-7918. doi: 10.1007/s12144-025-07438-2. Epub 2025 Jan 28.
4
Semantic embeddings reveal and address taxonomic incommensurability in psychological measurement.语义嵌入揭示并解决了心理测量中的分类不可通约性问题。
Nat Hum Behav. 2025 Mar 11. doi: 10.1038/s41562-024-02089-y.
5
Predicting MBTI personality of YouTube users.预测YouTube用户的MBTI性格类型。
Sci Rep. 2025 Feb 28;15(1):7221. doi: 10.1038/s41598-025-85183-z.
6
SEMbeddings: how to evaluate model misfit before data collection using large-language models.语义嵌入:如何在使用大语言模型进行数据收集之前评估模型不匹配情况。
Front Psychol. 2025 Feb 4;15:1433339. doi: 10.3389/fpsyg.2024.1433339. eCollection 2024.
7
AI can outperform humans in predicting correlations between personality items.在预测人格项目之间的相关性方面,人工智能的表现可能优于人类。
Commun Psychol. 2025 Feb 12;3(1):23. doi: 10.1038/s44271-025-00205-w.
8
Mapping Mental Representations With Free Associations: A Tutorial Using the R Package associatoR.通过自由联想映射心理表征:使用R包associatoR的教程
J Cogn. 2025 Jan 6;8(1):3. doi: 10.5334/joc.407. eCollection 2025.
9
Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals.使用新数据和集成模型改进可持续发展目标的自动标注。
Sustain Sci. 2024;19(5):1773-1787. doi: 10.1007/s11625-024-01516-3. Epub 2024 Jul 24.
10
Novel embeddings improve the prediction of risk perception.新型嵌入改进了风险感知的预测。
EPJ Data Sci. 2024;13(1):38. doi: 10.1140/epjds/s13688-024-00478-x. Epub 2024 May 22.
EPJ Data Sci. 2024;13(1):38. doi: 10.1140/epjds/s13688-024-00478-x. Epub 2024 May 22.
4
Living guidelines for generative AI - why scientists must oversee its use.生成式人工智能的实用指南——为何科学家必须监督其使用。
Nature. 2023 Oct;622(7984):693-696. doi: 10.1038/d41586-023-03266-1.
5
AI and science: what 1,600 researchers think.人工智能与科学:1600名研究人员的看法。
Nature. 2023 Sep;621(7980):672-675. doi: 10.1038/d41586-023-02980-0.
6
A deep learning approach to personality assessment: Generalizing across items and expanding the reach of survey-based research.一种基于深度学习的人格评估方法:跨项目泛化和扩大基于调查研究的范围。
J Pers Soc Psychol. 2024 Feb;126(2):312-331. doi: 10.1037/pspp0000480. Epub 2023 Sep 7.
7
Emergent analogical reasoning in large language models.大语言模型中的紧急类比推理。
Nat Hum Behav. 2023 Sep;7(9):1526-1541. doi: 10.1038/s41562-023-01659-w. Epub 2023 Jul 31.
8
ChatGPT outperforms crowd workers for text-annotation tasks.在文本注释任务中,ChatGPT的表现优于众包工作者。
Proc Natl Acad Sci U S A. 2023 Jul 25;120(30):e2305016120. doi: 10.1073/pnas.2305016120. Epub 2023 Jul 18.
9
How do we know how smart AI systems are?我们如何知道人工智能系统有多聪明?
Science. 2023 Jul 14;381(6654):adj5957. doi: 10.1126/science.adj5957. Epub 2023 Jul 13.
10
The debate over understanding in AI's large language models.人工智能大型语言模型中的理解之争。
Proc Natl Acad Sci U S A. 2023 Mar 28;120(13):e2215907120. doi: 10.1073/pnas.2215907120. Epub 2023 Mar 21.