使用ChatGPT进行人机交互研究：入门指南。

Using ChatGPT for human-computer interaction research: a primer.

作者信息

Tabone Wilbert, de Winter Joost

机构信息

Department of Cognitive Robotics, Faculty of Mechanical, Maritime and Materials Engineering, Delft University of Technology, Delft 2628CD, The Netherlands.

出版信息

R Soc Open Sci. 2023 Sep 13;10(9):231053. doi: 10.1098/rsos.231053. eCollection 2023 Sep.

DOI:10.1098/rsos.231053

PMID:37711151

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10498031/

Abstract

ChatGPT could serve as a tool for text analysis within the field of Human-Computer Interaction, though its validity requires investigation. This study applied ChatGPT to: (1) textbox questionnaire responses on nine augmented-reality interfaces, (2) interview data from participants who experienced these interfaces in a virtual simulator, and (3) transcribed think-aloud data of participants who viewed a real painting and its replica. Using a hierarchical approach, ChatGPT produced scores or summaries of text batches, which were then aggregated. Results showed that (1) ChatGPT generated sentiment scores of the interfaces that correlated extremely strongly ( > 0.99) with human rating scale outcomes and with a rule-based sentiment analysis method (criterion validity). Additionally, (2) by inputting automatically transcribed interviews to ChatGPT, it provided meaningful meta-summaries of the qualities of the interfaces (face validity). One meta-summary analysed in depth was found to have substantial but imperfect overlap with a content analysis conducted by an independent researcher (criterion validity). Finally, (3) ChatGPT's summary of the think-aloud data highlighted subtle differences between the real painting and the replica (face validity), a distinction corresponding with a keyword analysis (criterion validity). In conclusion, our research indicates that, with appropriate precautions, ChatGPT can be used as a valid tool for analysing text data.

摘要

ChatGPT可以作为人机交互领域内文本分析的一种工具，不过其有效性有待研究。本研究将ChatGPT应用于：（1）关于九个增强现实界面的文本框问卷回复；（2）来自在虚拟模拟器中体验过这些界面的参与者的访谈数据；（3）观看一幅真实画作及其复制品的参与者的出声思考数据转录。采用分层方法，ChatGPT生成了文本批次的分数或总结，然后进行汇总。结果表明：（1）ChatGPT生成的界面情感分数与人类评分量表结果以及基于规则的情感分析方法高度相关（>0.99）（效标效度）。此外，（2）通过将自动转录的访谈输入ChatGPT，它提供了关于界面质量的有意义的元总结（表面效度）。深入分析的一个元总结被发现与独立研究人员进行的内容分析有实质性但并不完美的重叠（效标效度）。最后，（3）ChatGPT对出声思考数据的总结突出了真实画作和复制品之间的细微差异（表面效度），这种差异与关键词分析相符（效标效度）。总之，我们的研究表明，采取适当预防措施后，ChatGPT可作为分析文本数据的有效工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe65/10498031/219b3a4c0b5c/rsos231053f01.jpg

相似文献

Using ChatGPT for human-computer interaction research: a primer.使用ChatGPT进行人机交互研究：入门指南。

R Soc Open Sci. 2023 Sep 13;10(9):231053. doi: 10.1098/rsos.231053. eCollection 2023 Sep.

ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。

Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.

A Multidisciplinary Assessment of ChatGPT's Knowledge of Amyloidosis: Observational Study.对ChatGPT关于淀粉样变性知识的多学科评估：观察性研究。

JMIR Cardio. 2024 Apr 19;8:e53421. doi: 10.2196/53421.

Performance and exploration of ChatGPT in medical examination, records and education in Chinese: Pave the way for medical AI.ChatGPT 在中文体检、病历和教育方面的表现和探索：为医疗 AI 铺平道路。

Int J Med Inform. 2023 Sep;177:105173. doi: 10.1016/j.ijmedinf.2023.105173. Epub 2023 Aug 4.

Prompts, Pearls, Imperfections: Comparing ChatGPT and a Human Researcher in Qualitative Data Analysis.提示、要点与不足：在定性数据分析中比较ChatGPT和人类研究者

Qual Health Res. 2024 May 22:10497323241244669. doi: 10.1177/10497323241244669.

Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study.ChatGPT 在粤语情感分析中的有效性：对比研究。

J Med Internet Res. 2024 Jan 30;26:e51069. doi: 10.2196/51069.

Assessing ChatGPT's capacity for clinical decision support in pediatrics: A comparative study with pediatricians using KIDMAP of Rasch analysis.评估 ChatGPT 在儿科临床决策支持方面的能力：使用 KIDMAP 的 Rasch 分析与儿科医生进行的比较研究。

Medicine (Baltimore). 2023 Jun 23;102(25):e34068. doi: 10.1097/MD.0000000000034068.

Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.ChatGPT 在临床医学研究生入学考试中的表现：调查研究。

JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.

ChatGPT's Performance on the Hand Surgery Self-Assessment Exam: A Critical Analysis.ChatGPT在手外科自我评估考试中的表现：一项批判性分析。

J Hand Surg Glob Online. 2024 Jan 2;6(2):200-205. doi: 10.1016/j.jhsg.2023.11.014. eCollection 2024 Mar.

ChatGPT's Ability to Assess Quality and Readability of Online Medical Information: Evidence From a Cross-Sectional Study.ChatGPT评估在线医学信息质量和可读性的能力：一项横断面研究的证据。

Cureus. 2023 Jul 20;15(7):e42214. doi: 10.7759/cureus.42214. eCollection 2023 Jul.

引用本文的文献

Evaluating the ability of large Language models to predict human social decisions.评估大语言模型预测人类社会决策的能力。

Sci Rep. 2025 Sep 2;15(1):32290. doi: 10.1038/s41598-025-17188-7.

Multilingual capabilities of GPT: A study of structural ambiguity.GPT的多语言能力：结构歧义研究

PLoS One. 2025 Jul 7;20(7):e0326943. doi: 10.1371/journal.pone.0326943. eCollection 2025.

An open-source reproducible chess robot for human-robot interaction research.一种用于人机交互研究的开源可重现国际象棋机器人。

Front Robot AI. 2025 Apr 16;12:1436674. doi: 10.3389/frobt.2025.1436674. eCollection 2025.

Urban walkability through different lenses: A comparative study of GPT-4o and human perceptions.不同视角下的城市步行适宜性：GPT-4o与人类认知的比较研究

PLoS One. 2025 Apr 29;20(4):e0322078. doi: 10.1371/journal.pone.0322078. eCollection 2025.

Complementing but Not Replacing: Comparing the Impacts of GPT-4 and Native-Speaker Interaction on Chinese L2 Writing Outcomes.互补而非替代：比较GPT-4与母语者互动对汉语二语写作成果的影响

Behav Sci (Basel). 2025 Apr 17;15(4):540. doi: 10.3390/bs15040540.

Human Factors and Organizational Issues in Health Informatics: Review of Recent Developments and Advances.健康信息学中的人为因素与组织问题：近期发展与进展综述

Yearb Med Inform. 2024 Aug;33(1):196-209. doi: 10.1055/s-0044-1800744. Epub 2025 Apr 8.

"Having providers who are trained and have empathy is life-saving": Improving primary care communication through thematic analysis with ChatGPT and human expertise.“拥有经过培训且富有同理心的医护人员能拯救生命”：通过与ChatGPT及人类专业知识进行主题分析来改善初级医疗保健沟通。

PEC Innov. 2024 Dec 28;6:100371. doi: 10.1016/j.pecinn.2024.100371. eCollection 2025 Jun.

Use of ChatGPT to Explore Gender and Geographic Disparities in Scientific Peer Review.使用ChatGPT探索科学同行评审中的性别和地域差异。

J Med Internet Res. 2024 Dec 9;26:e57667. doi: 10.2196/57667.

Putting ChatGPT vision (GPT-4V) to the test: risk perception in traffic images.对ChatGPT视觉模型（GPT-4V）进行测试：交通图像中的风险感知。

R Soc Open Sci. 2024 May 29;11(5):231676. doi: 10.1098/rsos.231676. eCollection 2024 May.

ChatGPT for Automated Qualitative Research: Content Analysis.ChatGPT 在定性研究中的自动化应用：内容分析。

J Med Internet Res. 2024 Jul 25;26:e59050. doi: 10.2196/59050.

本文引用的文献

GPT-4 passes the bar exam.GPT-4通过了律师资格考试。

Philos Trans A Math Phys Eng Sci. 2024 Apr 15;382(2270):20230254. doi: 10.1098/rsta.2023.0254. Epub 2024 Feb 26.

Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers.使用检测器和不知情的人类评审员，将ChatGPT生成的科学摘要与真实摘要进行比较。

NPJ Digit Med. 2023 Apr 26;6(1):75. doi: 10.1038/s41746-023-00819-6.

ChatGPT: the future of discharge summaries?ChatGPT：出院小结的未来？

Lancet Digit Health. 2023 Mar;5(3):e107-e108. doi: 10.1016/S2589-7500(23)00021-3. Epub 2023 Feb 6.

ChatGPT: five priorities for research.ChatGPT：研究的五个优先事项。

Nature. 2023 Feb;614(7947):224-226. doi: 10.1038/d41586-023-00288-7.

ChatGPT is fun, but not an author.ChatGPT 很有趣，但不是作者。

Science. 2023 Jan 27;379(6630):313. doi: 10.1126/science.adg7879. Epub 2023 Jan 26.

Concerns About the Potential Risks of Artificial Intelligence in Manuscript Writing. Letter.关于人工智能在稿件撰写中潜在风险的担忧。信函。

J Urol. 2023 Apr;209(4):682-683. doi: 10.1097/JU.0000000000003131. Epub 2022 Dec 23.

AI bot ChatGPT writes smart essays - should professors worry?人工智能聊天机器人ChatGPT能写出很巧妙的文章——教授们应该担心吗？

Nature. 2022 Dec 9. doi: 10.1038/d41586-022-04397-7.

How do people distribute their attention while observing ?人们在观察时如何分配注意力？

Perception. 2022 Nov;51(11):763-788. doi: 10.1177/03010066221122697. Epub 2022 Sep 29.

Thematic analysis of qualitative data: AMEE Guide No. 131.定性数据分析的主题分析：AMEE 指南第 131 号。

Med Teach. 2020 Aug;42(8):846-854. doi: 10.1080/0142159X.2020.1755030. Epub 2020 May 1.

A Review of the Quality Indicators of Rigor in Qualitative Research.一项关于定性研究严谨性质量指标的综述。

Am J Pharm Educ. 2020 Jan;84(1):7120. doi: 10.5688/ajpe7120.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用ChatGPT进行人机交互研究：入门指南。

Using ChatGPT for human-computer interaction research: a primer.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献