Abdelgadir Yasir, Thongprayoon Charat, Miao Jing, Suppadungsuk Supawadee, Pham Justin H, Mao Michael A, Craici Iasmina M, Cheungpasitporn Wisit
Division of Nephrology and Hypertension, Mayo Clinic, Rochester, MN, United States.
Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Samut Prakan, Thailand.
Front Artif Intell. 2024 Sep 2;7:1457586. doi: 10.3389/frai.2024.1457586. eCollection 2024.
Accurate ICD-10 coding is crucial for healthcare reimbursement, patient care, and research. AI implementation, like ChatGPT, could improve coding accuracy and reduce physician burden. This study assessed ChatGPT's performance in identifying ICD-10 codes for nephrology conditions through case scenarios for pre-visit testing.
Two nephrologists created 100 simulated nephrology cases. ChatGPT versions 3.5 and 4.0 were evaluated by comparing AI-generated ICD-10 codes against predetermined correct codes. Assessments were conducted in two rounds, 2 weeks apart, in April 2024.
In the first round, the accuracy of ChatGPT for assigning correct diagnosis codes was 91 and 99% for version 3.5 and 4.0, respectively. In the second round, the accuracy of ChatGPT for assigning the correct diagnosis code was 87% for version 3.5 and 99% for version 4.0. ChatGPT 4.0 had higher accuracy than ChatGPT 3.5 ( = 0.02 and 0.002 for the first and second round respectively). The accuracy did not significantly differ between the two rounds ( > 0.05).
ChatGPT 4.0 can significantly improve ICD-10 coding accuracy in nephrology through case scenarios for pre-visit testing, potentially reducing healthcare professionals' workload. However, the small error percentage underscores the need for ongoing review and improvement of AI systems to ensure accurate reimbursement, optimal patient care, and reliable research data.
准确的ICD - 10编码对于医疗报销、患者护理和研究至关重要。像ChatGPT这样的人工智能应用可以提高编码准确性并减轻医生负担。本研究通过就诊前测试的病例场景评估了ChatGPT在识别肾脏病ICD - 10编码方面的表现。
两名肾病学家创建了100个模拟肾病病例。通过将人工智能生成的ICD - 10编码与预先确定的正确编码进行比较,对ChatGPT 3.5版和4.0版进行评估。评估在2024年4月分两轮进行,两轮间隔2周。
在第一轮中,ChatGPT 3.5版和4.0版分配正确诊断编码的准确率分别为91%和99%。在第二轮中,ChatGPT 3.5版分配正确诊断编码的准确率为87%,4.0版为99%。ChatGPT 4.0的准确率高于ChatGPT 3.5(第一轮和第二轮分别为 = 0.02和0.002)。两轮之间的准确率没有显著差异(> 0.05)。
ChatGPT 4.0可以通过就诊前测试的病例场景显著提高肾脏病ICD - 10编码的准确性,有可能减轻医疗专业人员的工作量。然而,小误差百分比凸显了持续审查和改进人工智能系统以确保准确报销、优化患者护理和可靠研究数据的必要性。