由 ChatGPT 生成的跑步者训练计划并未得到教练专家的最佳评级，但随着更多输入信息的增加，其质量会有所提高。

ChatGPT Generated Training Plans for Runners are not Rated Optimal by Coaching Experts, but Increase in Quality with Additional Input Information.

机构信息

Department of Sports Science and Movement Pedagogy, Technische Universität Braunschweig, Braunschweig, Germany.

Integrative and Experimental Exercise Science, Department of Sport Science, University of Würzburg, Würzburg, Germany.

出版信息

J Sports Sci Med. 2024 Mar 1;23(1):56-72. doi: 10.52082/jssm.2024.56. eCollection 2024 Mar.

DOI:10.52082/jssm.2024.56

PMID:38455449

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10915606/

Abstract

ChatGPT may be used by runners to generate training plans to enhance performance or health aspects. However, the quality of ChatGPT generated training plans based on different input information is unknown. The objective of the study was to evaluate ChatGPT-generated six-week training plans for runners based on different input information granularity. Three training plans were generated by ChatGPT using different input information granularity. 22 quality criteria for training plans were drawn from the literature and used to evaluate training plans by coaching experts on a 1-5 Likert Scale. A Friedmann test assessed significant differences in quality between training plans. For training plans 1, 2 and 3, a median rating of <3 was given 19, 11, and 1 times, a median rating of 3 was given 3, 5, and 8 times and a median rating of >3 was given 0, 6, 13 times, respectively. Training plan 1 received significantly lower ratings compared to training plan 2 for 3 criteria, and 15 times significantly lower ratings compared to training plan 3 (p < 0.05). Training plan 2 received significantly lower ratings (p < 0.05) compared to plan 3 for 9 criteria. ChatGPT generated plans are ranked sub-optimally by coaching experts, although the quality increases when more input information are provided. An understanding of aspects relevant to programming distance running training is important, and we advise avoiding the use of ChatGPT generated training plans without an expert coach's feedback.

摘要

ChatGPT 可能被跑步者用于生成训练计划，以提高表现或健康方面。然而，基于不同输入信息的 ChatGPT 生成的训练计划的质量是未知的。本研究的目的是评估基于不同输入信息粒度的 ChatGPT 生成的六周跑步者训练计划。ChatGPT 使用不同的输入信息粒度生成了三个训练计划。从文献中提取了 22 个训练计划质量标准，并由教练专家使用 1-5 分的李克特量表对训练计划进行评估。弗里德曼检验评估了训练计划之间质量的显著差异。对于训练计划 1、2 和 3，分别有 19、11 和 1 次被评为<3，3、5 和 8 次被评为 3，0、6 和 13 次被评为>3。与训练计划 2 相比，训练计划 1 有 3 项标准的评分明显较低，与训练计划 3 相比有 15 项标准的评分明显较低（p < 0.05）。与训练计划 3 相比，训练计划 2 有 9 项标准的评分明显较低（p < 0.05）。虽然提供更多的输入信息会提高质量，但教练专家对 ChatGPT 生成的计划评价不高。了解与编程长跑训练相关的方面很重要，我们建议在没有专家教练反馈的情况下避免使用 ChatGPT 生成的训练计划。

相似文献

ChatGPT Generated Training Plans for Runners are not Rated Optimal by Coaching Experts, but Increase in Quality with Additional Input Information.由 ChatGPT 生成的跑步者训练计划并未得到教练专家的最佳评级，但随着更多输入信息的增加，其质量会有所提高。

J Sports Sci Med. 2024 Mar 1;23(1):56-72. doi: 10.52082/jssm.2024.56. eCollection 2024 Mar.

Reproducibility and quality of hypertrophy-related training plans generated by GPT-4 and Google Gemini as evaluated by coaching experts.由GPT-4和谷歌Gemini生成的与肥大相关的训练计划的可重复性和质量，由教练专家评估。

Biol Sport. 2025 Apr;42(2):289-329. doi: 10.5114/biolsport.2025.145911. Epub 2024 Dec 18.

ChatGPT-4o-Generated Exercise Plans for Patients with Type 2 Diabetes Mellitus-Assessment of Their Safety and Other Quality Criteria by Coaching Experts.ChatGPT-4生成的2型糖尿病患者运动计划——由指导专家评估其安全性及其他质量标准

Sports (Basel). 2025 Mar 24;13(4):92. doi: 10.3390/sports13040092.

Acceptance and trust in AI-generated exercise plans among recreational athletes and quality evaluation by experienced coaches: a pilot study.休闲运动员对人工智能生成的锻炼计划的接受度与信任度以及经验丰富教练的质量评估：一项试点研究

BMC Res Notes. 2025 Mar 13;18(1):112. doi: 10.1186/s13104-025-07172-9.

Assessing the Accuracy, Completeness and Safety of ChatGPT-4o Responses on Pressure Injuries in Infants: Clinical Applications and Future Implications.评估ChatGPT-4o对婴儿压力性损伤回答的准确性、完整性和安全性：临床应用及未来影响

Nurs Rep. 2025 Apr 14;15(4):130. doi: 10.3390/nursrep15040130.

Evaluating the application of ChatGPT in China's residency training education: An exploratory study.评估ChatGPT在中国住院医师规范化培训教育中的应用：一项探索性研究。

Med Teach. 2025 May;47(5):858-864. doi: 10.1080/0142159X.2024.2377808. Epub 2024 Jul 12.

Assessing the Accuracy of Generative Conversational Artificial Intelligence in Debunking Sleep Health Myths: Mixed Methods Comparative Study With Expert Analysis.评估生成式对话人工智能在破除睡眠健康误区方面的准确性：采用专家分析的混合方法比较研究

JMIR Form Res. 2024 Apr 16;8:e55762. doi: 10.2196/55762.

Chat-GPT on brain tumors: An examination of Artificial Intelligence/Machine Learning's ability to provide diagnoses and treatment plans for example neuro-oncology cases.Chat-GPT 与脑肿瘤：人工智能/机器学习提供神经肿瘤学等案例诊断和治疗方案的能力评估。

Clin Neurol Neurosurg. 2024 Apr;239:108238. doi: 10.1016/j.clineuro.2024.108238. Epub 2024 Mar 9.

Could artificial intelligence write mental health nursing care plans?人工智能能写心理健康护理计划吗？

J Psychiatr Ment Health Nurs. 2024 Feb;31(1):79-86. doi: 10.1111/jpm.12965. Epub 2023 Aug 4.

ChatGPT as a Source for Patient Information on Patellofemoral Surgery-A Comparative Study Amongst Laymen, Doctors, and Experts.ChatGPT作为髌股关节手术患者信息来源的比较研究——非专业人士、医生和专家之间的对比

Clin Pract. 2024 Nov 5;14(6):2376-2384. doi: 10.3390/clinpract14060186.

引用本文的文献

Assessment of Recommendations Provided to Athletes Regarding Sleep Education by GPT-4o and Google Gemini: Comparative Evaluation Study.GPT-4o和谷歌Gemini向运动员提供的关于睡眠教育的建议评估：比较评估研究

JMIR Form Res. 2025 Jul 8;9:e71358. doi: 10.2196/71358.

The sports nutrition knowledge of large language model (LLM) artificial intelligence (AI) chatbots: An assessment of accuracy, completeness, clarity, quality of evidence, and test-retest reliability.大语言模型（LLM）人工智能（AI）聊天机器人的运动营养知识：准确性、完整性、清晰度、证据质量及重测信度评估

PLoS One. 2025 Jun 13;20(6):e0325982. doi: 10.1371/journal.pone.0325982. eCollection 2025.

Sports (Basel). 2025 Mar 24;13(4):92. doi: 10.3390/sports13040092.

Biol Sport. 2025 Apr;42(2):289-329. doi: 10.5114/biolsport.2025.145911. Epub 2024 Dec 18.

BMC Res Notes. 2025 Mar 13;18(1):112. doi: 10.1186/s13104-025-07172-9.

[What is the potential of ChatGPT for qualified patient information? : Attempt of a structured analysis on the basis of a survey regarding complementary and alternative medicine (CAM) in rheumatology].[ChatGPT在提供合格患者信息方面的潜力如何？：基于一项关于风湿病补充和替代医学（CAM）的调查进行结构化分析的尝试]

Z Rheumatol. 2025 Apr;84(3):179-187. doi: 10.1007/s00393-024-01535-6. Epub 2024 Jul 10.

本文引用的文献

Evaluating ChatGPT as an adjunct for the multidisciplinary tumor board decision-making in primary breast cancer cases.评估 ChatGPT 在原发性乳腺癌多学科肿瘤委员会决策中的辅助作用。

Arch Gynecol Obstet. 2023 Dec;308(6):1831-1844. doi: 10.1007/s00404-023-07130-5. Epub 2023 Jul 17.

Evaluating Chatbot Efficacy for Answering Frequently Asked Questions in Plastic Surgery: A ChatGPT Case Study Focused on Breast Augmentation.评估聊天机器人在回答整形手术常见问题方面的效果：以聚焦隆胸手术的ChatGPT为例的研究

Aesthet Surg J. 2023 Sep 14;43(10):1126-1135. doi: 10.1093/asj/sjad140.

Appropriateness of ophthalmic symptoms triage by a popular online artificial intelligence chatbot.一个广受欢迎的在线人工智能聊天机器人对眼科症状进行分诊的适宜性。

Eye (Lond). 2023 Dec;37(17):3692-3693. doi: 10.1038/s41433-023-02556-2. Epub 2023 Apr 29.

Ethics of large language models in medicine and medical research.医学及医学研究中大型语言模型的伦理问题。

Lancet Digit Health. 2023 Jun;5(6):e333-e335. doi: 10.1016/S2589-7500(23)00083-3. Epub 2023 Apr 27.

Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum.比较医生和人工智能聊天机器人对发布在公共社交媒体论坛上的患者问题的回复。

JAMA Intern Med. 2023 Jun 1;183(6):589-596. doi: 10.1001/jamainternmed.2023.1838.

Polarized Training Is Optimal for Endurance Athletes.极化训练对耐力运动员最为适宜。

Med Sci Sports Exerc. 2022 Jun 1;54(6):1028-1031. doi: 10.1249/MSS.0000000000002871. Epub 2022 Feb 8.

Polarized Training Is Not Optimal for Endurance Athletes.极化训练对耐力运动员并非最佳选择。

Med Sci Sports Exerc. 2022 Jun 1;54(6):1032-1034. doi: 10.1249/MSS.0000000000002869. Epub 2022 Feb 8.

Defining Training and Performance Caliber: A Participant Classification Framework.定义培训和绩效水平：参与者分类框架。

Int J Sports Physiol Perform. 2022 Feb 1;17(2):317-331. doi: 10.1123/ijspp.2021-0451. Epub 2022 Dec 29.

Development of a Revised Conceptual Framework of Physical Training for Use in Research and Practice.用于研究与实践的体育训练修订概念框架的开发。

Sports Med. 2022 Apr;52(4):709-724. doi: 10.1007/s40279-021-01551-5. Epub 2021 Sep 14.

The Promise of Sleep: A Multi-Sensor Approach for Accurate Sleep Stage Detection Using the Oura Ring.《睡眠的承诺：使用 Oura 戒指进行准确睡眠阶段检测的多传感器方法》

Sensors (Basel). 2021 Jun 23;21(13):4302. doi: 10.3390/s21134302.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验