• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Examination of ChatGPT's Performance as a Data Analysis Tool.对ChatGPT作为数据分析工具的性能考察。
Educ Psychol Meas. 2025 Jan 3:00131644241302721. doi: 10.1177/00131644241302721.
2
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
3
Evaluating the Influence of Role-Playing Prompts on ChatGPT's Misinformation Detection Accuracy: Quantitative Study.评估角色扮演提示对 ChatGPT 错误信息检测准确率的影响:定量研究。
JMIR Infodemiology. 2024 Sep 26;4:e60678. doi: 10.2196/60678.
4
Assessing the Accuracy of Generative Conversational Artificial Intelligence in Debunking Sleep Health Myths: Mixed Methods Comparative Study With Expert Analysis.评估生成式对话人工智能在破除睡眠健康误区方面的准确性:采用专家分析的混合方法比较研究
JMIR Form Res. 2024 Apr 16;8:e55762. doi: 10.2196/55762.
5
Application of Large Language Models in Medical Training Evaluation-Using ChatGPT as a Standardized Patient: Multimetric Assessment.大语言模型在医学培训评估中的应用——以ChatGPT作为标准化病人:多指标评估
J Med Internet Res. 2025 Jan 1;27:e59435. doi: 10.2196/59435.
6
ChatGPT's Attitude, Knowledge, and Clinical Application in Geriatrics Practice and Education: Exploratory Observational Study.ChatGPT在老年医学实践与教育中的态度、知识及临床应用:探索性观察研究
JMIR Form Res. 2025 Jan 3;9:e63494. doi: 10.2196/63494.
7
Sailing the Seven Seas: A Multinational Comparison of ChatGPT's Performance on Medical Licensing Examinations.航海七海:ChatGPT 在医学执照考试中的表现的跨国比较。
Ann Biomed Eng. 2024 Jun;52(6):1542-1545. doi: 10.1007/s10439-023-03338-3. Epub 2023 Aug 8.
8
Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.ChatGPT 在临床医学研究生入学考试中的表现:调查研究。
JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.
9
Optimizing ChatGPT's Interpretation and Reporting of Delirium Assessment Outcomes: Exploratory Study.优化 ChatGPT 对谵妄评估结果的解释和报告:探索性研究。
JMIR Form Res. 2024 Oct 1;8:e51383. doi: 10.2196/51383.
10
Assessing question characteristic influences on ChatGPT's performance and response-explanation consistency: Insights from Taiwan's Nursing Licensing Exam.评估问题特征对 ChatGPT 表现和回应解释一致性的影响:来自台湾护理执照考试的见解。
Int J Nurs Stud. 2024 May;153:104717. doi: 10.1016/j.ijnurstu.2024.104717. Epub 2024 Feb 8.

引用本文的文献

1
ChatGPT's performance in sample size estimation: a preliminary study on the capabilities of artificial intelligence.ChatGPT在样本量估计方面的表现:关于人工智能能力的初步研究。
Fam Pract. 2025 Aug 14;42(5). doi: 10.1093/fampra/cmaf069.
2
Statistical Rigor and Reproducibility in the AI Era.人工智能时代的统计严谨性与可重复性。
Balkan Med J. 2025 Sep 1;42(5):386-387. doi: 10.4274/balkanmedj.galenos.2025.2025.040825. Epub 2025 Aug 11.

本文引用的文献

1
Factor Retention Using Machine Learning With Ordinal Data.使用机器学习处理有序数据的因子保留
Appl Psychol Meas. 2022 Jul;46(5):406-421. doi: 10.1177/01466216221089345. Epub 2022 May 4.
2
Investigating the performance of exploratory graph analysis and traditional techniques to identify the number of latent factors: A simulation and tutorial.探索性图分析和传统技术识别潜在因素数量的性能研究:模拟与教程。
Psychol Methods. 2020 Jun;25(3):292-320. doi: 10.1037/met0000255. Epub 2020 Mar 19.
3
Descriptive Statistics for Modern Test Score Distributions: Skewness, Kurtosis, Discreteness, and Ceiling Effects.现代测试分数分布的描述性统计:偏度、峰度、离散度和天花板效应。
Educ Psychol Meas. 2015 Jun;75(3):365-388. doi: 10.1177/0013164414548576. Epub 2014 Sep 15.
4
Psychometric Properties and Factor Structure of a Long and Shortened Version of the Cognitive and Behavioural Responses Questionnaire.认知与行为反应问卷长版和简版的心理计量学特性和因子结构。
Psychosom Med. 2018 Feb/Mar;80(2):230-237. doi: 10.1097/PSY.0000000000000536.
5
Validity and Reliability of the Positive Aspects of Caregiving (PAC) Scale and Development of Its Shorter Version (S-PAC) Among Family Caregivers of Older Adults.照顾积极面量表(PAC)的有效性和可靠性及其在老年患者家庭照顾者中的更简短版本(S-PAC)的开发。
Gerontologist. 2017 Aug 1;57(4):e75-e84. doi: 10.1093/geront/gnw198.
6
An empirical Kaiser criterion.经验 Kaiser 准则。
Psychol Methods. 2017 Sep;22(3):450-466. doi: 10.1037/met0000074. Epub 2016 Mar 31.
7
Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares.有序数据的验证性因子分析:稳健极大似然法与对角加权最小二乘法的比较
Behav Res Methods. 2016 Sep;48(3):936-49. doi: 10.3758/s13428-015-0619-7.
8
Exploratory factor analysis in validation studies: uses and recommendations.验证性研究中的探索性因素分析:用途与建议。
Psicothema. 2014;26(3):395-400. doi: 10.7334/psicothema2013.349.
9
The use of Likert scales with children.在儿童中使用李克特量表。
J Pediatr Psychol. 2014 Apr;39(3):369-79. doi: 10.1093/jpepsy/jst079. Epub 2013 Oct 24.
10
An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data.对有序数据验证性因子分析的替代估计方法的实证评估。
Psychol Methods. 2004 Dec;9(4):466-91. doi: 10.1037/1082-989X.9.4.466.

对ChatGPT作为数据分析工具的性能考察。

Examination of ChatGPT's Performance as a Data Analysis Tool.

作者信息

Koçak Duygu

机构信息

Alanya Alaaddin Keykubat University, Alanya/Antalya, Turkey.

出版信息

Educ Psychol Meas. 2025 Jan 3:00131644241302721. doi: 10.1177/00131644241302721.

DOI:10.1177/00131644241302721
PMID:39759537
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11696938/
Abstract

This study examines the performance of ChatGPT, developed by OpenAI and widely used as an AI-based conversational tool, as a data analysis tool through exploratory factor analysis (EFA). To this end, simulated data were generated under various data conditions, including normal distribution, response category, sample size, test length, factor loading, and measurement models. The generated data were analyzed using ChatGPT-4o twice with a 1-week interval under the same prompt, and the results were compared with those obtained using R code. In data analysis, the Kaiser-Meyer-Olkin (KMO) value, total variance explained, and the number of factors estimated using the empirical Kaiser criterion, Hull method, and Kaiser-Guttman criterion, as well as factor loadings, were calculated. The findings obtained from ChatGPT at two different times were found to be consistent with those obtained using R. Overall, ChatGPT demonstrated good performance for steps that require only computational decisions without involving researcher judgment or theoretical evaluation (such as KMO, total variance explained, and factor loadings). However, for multidimensional structures, although the estimated number of factors was consistent across analyses, biases were observed, suggesting that researchers should exercise caution in such decisions.

摘要

本研究通过探索性因子分析(EFA)检验了由OpenAI开发并被广泛用作基于人工智能的对话工具的ChatGPT作为数据分析工具的性能。为此,在各种数据条件下生成了模拟数据,包括正态分布、响应类别、样本量、测验长度、因子载荷和测量模型。在相同提示下,使用ChatGPT-4o对生成的数据进行了两次分析,间隔为1周,并将结果与使用R代码获得的结果进行比较。在数据分析中,计算了Kaiser-Meyer-Olkin(KMO)值、解释的总方差、使用经验Kaiser准则、赫尔方法和Kaiser-Guttman准则估计的因子数量以及因子载荷。发现ChatGPT在两个不同时间获得的结果与使用R获得的结果一致。总体而言,ChatGPT在仅需要计算决策而不涉及研究者判断或理论评估的步骤(如KMO、解释的总方差和因子载荷)中表现良好。然而,对于多维结构,尽管各分析中估计的因子数量一致,但仍观察到偏差,这表明研究者在做出此类决策时应谨慎。