Accuracy, Clarity, and Comprehensiveness of ChatGPT Outputs for Commonly Asked Questions About Living Kidney Donation.

Authors

Singla Ria, Lodhi Sumiya, Kibret Taddele, Jegatheswaran Januvi, Glavinovic Tamara, Massicotte-Azarniouch David, Karpinski Jolanta, Powell Rinu, Burns Kevin, Sood Manish M, Bugeja Ann

Affiliations

Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada.

School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada.

Publication information

Clin Transplant. 2025 Sep;39(9):e70303. doi: 10.1111/ctr.70303.

DOI:10.1111/ctr.70303

PMID:40891338

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12402969/

Abstract

INTRODUCTION

The effectiveness of ChatGPT responses to common living kidney donation (LKD) queries remains unclear.

METHODS

We surveyed nephrologists and living kidney donors/candidates to evaluate ChatGPT-3.5's accuracy, comprehensiveness, and clarity in answering common donation questions in English and French. Ratings used a 5-point Likert scale, with percentage agreement and modified Fleiss' Kappa measuring inter-rater consistency.

RESULTS

The evaluation of ChatGPT-3.5's responses varied between nephrologists and kidney donors/candidates. Nephrologists showed moderate percentage agreement for English responses (50%-59%) and poor agreement for French responses (9%-45%). Kidney donors/candidates exhibited high agreement for English (90%-100%) but low for French (0%-77%). Inter-rater agreement among nephrologists was moderate for both English (Kappa 0.74, 95% CI: 0.67, 0.79, p < 0.0001) and French (Kappa 0.70, 95% CI: 0.64, 0.77, p < 0.0001). In contrast, inter-rater agreement was poor among donors/candidates for both English (Kappa -0.10, 95% CI: -0.14, -0.07, p = 0.99) and French (Kappa -0.03, 95% CI: -0.07, 0, p = 0.81).

CONCLUSION

ChatGPT-3.5's responses to common LKD queries demonstrated limited agreement among nephrologists and kidney donors/donor candidates, highlighting its lack of reliability as a supplement to existing educational materials for living kidney donor programs in English and French.
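
For readers unfamiliar with the two statistics named in METHODS, the following is a minimal sketch of how a raw agreement percentage and a chance-corrected kappa can be computed from 5-point Likert ratings. It implements the standard Fleiss' kappa, not the modified version the authors used, which the abstract does not specify. The definition of percentage agreement here (the share of raters choosing the most common rating for an item) is also an assumption, and the function names and the example ratings are hypothetical, not data from the study.

```python
from collections import Counter

LIKERT = (1, 2, 3, 4, 5)  # 5-point scale, as described in METHODS


def rating_counts(ratings, categories=LIKERT):
    """Convert per-item lists of ratings into per-item category counts."""
    return [[Counter(item)[c] for c in categories] for item in ratings]


def modal_share(counts):
    """Assumed agreement measure: mean share of raters who chose the
    most common rating for each item (not necessarily the paper's)."""
    return sum(max(row) / sum(row) for row in counts) / len(counts)


def fleiss_kappa(counts):
    """Standard (unmodified) Fleiss' kappa for a fixed number of raters."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item needs the same number of ratings")

    # Observed agreement: proportion of agreeing rater pairs per item.
    p_items = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(p_items) / n_items

    # Expected agreement from the overall category proportions.
    n_categories = len(counts[0])
    p_category = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in p_category)

    if p_expected == 1.0:
        raise ValueError("kappa is undefined when all ratings are identical")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical ratings: 4 items, each scored by 4 raters.
example = rating_counts([[5, 5, 5, 5], [5, 5, 5, 4], [5, 5, 5, 5], [5, 5, 4, 5]])
raw_agreement = modal_share(example)   # 0.875
kappa = fleiss_kappa(example)          # about -0.143
```

In this made-up example the raw agreement is high (0.875) while kappa is slightly negative, because nearly all ratings fall in one category and the expected chance agreement is therefore also high. This is a general property of kappa-type statistics and is offered only as background for reading the RESULTS figures, not as the authors' explanation.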
