Köroğlu Ekin Yiğit, Ersoy Reyhan, Saçıkara Muhammed, Dellal Kahramanca Fatma Dilek, Polat Şefika Burçak, Topaloğlu Oya, Çakır Bekir
Ankara Bilkent City Hospital, Endocrinology and Metabolism Department, Üniversiteler Mahallesi, 1604, Cadde No: 9 Çankaya, Ankara, Türkiye.
Ankara Yıldırım Beyazıt Faculty of Medicine, Endocrinology and Metabolism Department, Üniversiteler Mahallesi, 1604, Cadde No: 9 Çankaya, Ankara, Türkiye.
Endocrine. 2025 Mar;87(3):1141-1149. doi: 10.1007/s12020-024-04086-7. Epub 2024 Nov 5.
ChatGPT is a widely used artificial intelligence modeling tool. Healthcare is one potential area of use of ChatGPT. This study aimed to test the usability and reliability of ChatGPT in acromegaly, which is less known in society and should be evaluated by a group of specialized physicians.
The study is designed in two parts. For the first part, 35 questions regarding acromegaly that patients frequently ask were identified, and these questions were asked to ChatGPT. In the second part, four patient examples were presented to ChatGPT using medical terminology. Three experts evaluated ChatGPT's answers to the questions and approaches in case management using 7-point scales in terms of safety, reliability, correctness, and usability.
When the ChatGPT answers to the patient's questions were evaluated, a mean score of 6.78 ± 0.55 was given for correctness and 6.69 ± 0.60 for reliability. The mean scores given by the raters for correctness, safety and usability in the evaluation of the cases were as follows: 6.33 ± 0.88, 6.16 ± 0. 71 and 6.08 ± 0.79 points for case 1; 5.35 ± 1.88, 5.29 ± 1.80 and 5.20 ± 1.86 points for case 2; 6.08 ± 0.97, 6.00 ± 0.93 and 5.91 ± 0.82 points for case 3; 6.10 ± 1.29, 6.13 ± 1.30 and 6.16 ± 1.14 points for case 4.
ChatGPT can actively answer the questions of acromegaly patients. Although it is not a reliable source alone in managing patients with acromegaly, it can be a supportive tool for physicians.
ChatGPT是一种广泛使用的人工智能建模工具。医疗保健是ChatGPT的一个潜在应用领域。本研究旨在测试ChatGPT在肢端肥大症方面的可用性和可靠性,肢端肥大症在社会上鲜为人知,应由一组专科医生进行评估。
该研究分为两个部分。第一部分,确定了35个患者经常问到的关于肢端肥大症的问题,并将这些问题提交给ChatGPT。第二部分,使用医学术语向ChatGPT呈现了四个患者案例。三位专家使用7分制从安全性、可靠性、正确性和可用性方面评估了ChatGPT对问题的回答及病例管理方法。
在评估ChatGPT对患者问题的回答时,正确性的平均得分为6.78±0.55,可靠性的平均得分为6.69±0.60。评估病例时,评分者给出的正确性、安全性和可用性的平均得分如下:病例1分别为6.33±0.88、6.16±0.71和6.08±0.79分;病例2分别为5.35±1.88,、5.29±1.80和5.20±1.86分;病例3分别为6.08±0.97、6.0±0.93和5.91±0.82分;病例4分别为6.10±1.29、6.13±1.30和6.16±1.14分。
ChatGPT可以积极回答肢端肥大症患者的问题。虽然它在管理肢端肥大症患者方面单独使用时不可靠,但它可以成为医生的辅助工具。