文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

对ChatGPT作为数据分析工具的性能考察。

Examination of ChatGPT's Performance as a Data Analysis Tool.

作者信息

Koçak Duygu

机构信息

Alanya Alaaddin Keykubat University, Alanya/Antalya, Turkey.

出版信息

Educ Psychol Meas. 2025 Jan 3:00131644241302721. doi: 10.1177/00131644241302721.


DOI:10.1177/00131644241302721
PMID:39759537
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11696938/
Abstract

This study examines the performance of ChatGPT, developed by OpenAI and widely used as an AI-based conversational tool, as a data analysis tool through exploratory factor analysis (EFA). To this end, simulated data were generated under various data conditions, including normal distribution, response category, sample size, test length, factor loading, and measurement models. The generated data were analyzed using ChatGPT-4o twice with a 1-week interval under the same prompt, and the results were compared with those obtained using R code. In data analysis, the Kaiser-Meyer-Olkin (KMO) value, total variance explained, and the number of factors estimated using the empirical Kaiser criterion, Hull method, and Kaiser-Guttman criterion, as well as factor loadings, were calculated. The findings obtained from ChatGPT at two different times were found to be consistent with those obtained using R. Overall, ChatGPT demonstrated good performance for steps that require only computational decisions without involving researcher judgment or theoretical evaluation (such as KMO, total variance explained, and factor loadings). However, for multidimensional structures, although the estimated number of factors was consistent across analyses, biases were observed, suggesting that researchers should exercise caution in such decisions.

摘要

本研究通过探索性因子分析(EFA)检验了由OpenAI开发并被广泛用作基于人工智能的对话工具的ChatGPT作为数据分析工具的性能。为此,在各种数据条件下生成了模拟数据,包括正态分布、响应类别、样本量、测验长度、因子载荷和测量模型。在相同提示下,使用ChatGPT-4o对生成的数据进行了两次分析,间隔为1周,并将结果与使用R代码获得的结果进行比较。在数据分析中,计算了Kaiser-Meyer-Olkin(KMO)值、解释的总方差、使用经验Kaiser准则、赫尔方法和Kaiser-Guttman准则估计的因子数量以及因子载荷。发现ChatGPT在两个不同时间获得的结果与使用R获得的结果一致。总体而言,ChatGPT在仅需要计算决策而不涉及研究者判断或理论评估的步骤(如KMO、解释的总方差和因子载荷)中表现良好。然而,对于多维结构,尽管各分析中估计的因子数量一致,但仍观察到偏差,这表明研究者在做出此类决策时应谨慎。

相似文献

[1]
Examination of ChatGPT's Performance as a Data Analysis Tool.

Educ Psychol Meas. 2025-1-3

[2]
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.

Front Med (Lausanne). 2023-12-13

[3]
Evaluating the Influence of Role-Playing Prompts on ChatGPT's Misinformation Detection Accuracy: Quantitative Study.

JMIR Infodemiology. 2024-9-26

[4]
Assessing the Accuracy of Generative Conversational Artificial Intelligence in Debunking Sleep Health Myths: Mixed Methods Comparative Study With Expert Analysis.

JMIR Form Res. 2024-4-16

[5]
Application of Large Language Models in Medical Training Evaluation-Using ChatGPT as a Standardized Patient: Multimetric Assessment.

J Med Internet Res. 2025-1-1

[6]
ChatGPT's Attitude, Knowledge, and Clinical Application in Geriatrics Practice and Education: Exploratory Observational Study.

JMIR Form Res. 2025-1-3

[7]
Sailing the Seven Seas: A Multinational Comparison of ChatGPT's Performance on Medical Licensing Examinations.

Ann Biomed Eng. 2024-6

[8]
Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.

JMIR Med Educ. 2024-2-9

[9]
Optimizing ChatGPT's Interpretation and Reporting of Delirium Assessment Outcomes: Exploratory Study.

JMIR Form Res. 2024-10-1

[10]
Assessing question characteristic influences on ChatGPT's performance and response-explanation consistency: Insights from Taiwan's Nursing Licensing Exam.

Int J Nurs Stud. 2024-5

引用本文的文献

[1]
ChatGPT's performance in sample size estimation: a preliminary study on the capabilities of artificial intelligence.

Fam Pract. 2025-8-14

[2]
Statistical Rigor and Reproducibility in the AI Era.

Balkan Med J. 2025-9-1

本文引用的文献

[1]
Factor Retention Using Machine Learning With Ordinal Data.

Appl Psychol Meas. 2022-7

[2]
Investigating the performance of exploratory graph analysis and traditional techniques to identify the number of latent factors: A simulation and tutorial.

Psychol Methods. 2020-6

[3]
Descriptive Statistics for Modern Test Score Distributions: Skewness, Kurtosis, Discreteness, and Ceiling Effects.

Educ Psychol Meas. 2015-6

[4]
Psychometric Properties and Factor Structure of a Long and Shortened Version of the Cognitive and Behavioural Responses Questionnaire.

Psychosom Med. 2018

[5]
Validity and Reliability of the Positive Aspects of Caregiving (PAC) Scale and Development of Its Shorter Version (S-PAC) Among Family Caregivers of Older Adults.

Gerontologist. 2017-8-1

[6]
An empirical Kaiser criterion.

Psychol Methods. 2016-3-31

[7]
Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares.

Behav Res Methods. 2016-9

[8]
Exploratory factor analysis in validation studies: uses and recommendations.

Psicothema. 2014

[9]
The use of Likert scales with children.

J Pediatr Psychol. 2014-4

[10]
An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data.

Psychol Methods. 2004-12

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索