
Evaluating ChatGPT's Performance in Answering Questions About Allergic Rhinitis and Chronic Rhinosinusitis.

Affiliations

Department of Otolaryngology-Head and Neck Surgery, The Third Affiliated Hospital of Sun Yat-Sen University, Guangzhou, China.

Department of Allergy, The Third Affiliated Hospital of Sun Yat-Sen University, Guangzhou, China.

Publication Information

Otolaryngol Head Neck Surg. 2024 Aug;171(2):571-577. doi: 10.1002/ohn.832. Epub 2024 May 26.

Abstract

OBJECTIVE

This study aims to evaluate the accuracy of ChatGPT in answering allergic rhinitis (AR) and chronic rhinosinusitis (CRS) related questions.

STUDY DESIGN

This is a cross-sectional study.

SETTING

Each question was inputted as a separate, independent prompt.

METHODS

Responses to AR (n = 189) and CRS (n = 242) related questions, generated by GPT-3.5 and GPT-4, were independently graded for accuracy by 2 senior rhinology professors, with disagreements adjudicated by a third reviewer.

RESULTS

Overall, ChatGPT demonstrated satisfactory performance, accurately answering over 80% of questions across all categories. Specifically, GPT-4.0's accuracy in responding to AR-related questions significantly exceeded that of GPT-3.5, but this distinction was not evident for CRS-related questions. When GPT-4.0 responded to AR-related questions, patient-originated questions were answered with significantly higher accuracy than doctor-originated questions; this discrepancy was not observed with GPT-3.5 or for CRS-related questions. Across content types, ChatGPT excelled at questions on basic knowledge, prevention, and emotional concerns for both AR and CRS. However, it struggled with questions about recent advancements, a trend consistent across both the GPT-3.5 and GPT-4.0 iterations. Importantly, the accuracy of responses was unaffected when questions were posed in Chinese.

CONCLUSION

Our findings suggest that ChatGPT can convey accurate information to AR and CRS patients. They also offer insights into its performance across various domains, guiding its use and improvement.

