School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
Department of Primary Healthcare and Family Medicine, Faculty of Medicine, Universidad de Chile, Santiago, Chile.
BMC Womens Health. 2024 Sep 2;24(1):482. doi: 10.1186/s12905-024-03320-8.
Cervical cancer (CC) and breast cancer (BC) threaten women's well-being; health-related stigma and a lack of reliable information can lead to delayed diagnosis and premature death. ChatGPT is likely to become a key source of health information, although concerns about its quality could also influence health-seeking behaviours.
This cross-sectional online survey compared ChatGPT's responses with those of five physicians specializing in mammography and five specializing in gynaecology. Twenty frequently asked questions about CC and BC were posed on 26 and 29 April 2023. A panel of seven experts assessed the accuracy, consistency, and relevance of ChatGPT's responses using a 7-point Likert scale. Responses were also analyzed for readability, reliability, and efficiency. ChatGPT's responses were synthesized, and the findings are presented as a radar chart.
ChatGPT achieved an accuracy score of 7.0 (range: 6.6-7.0) for CC and BC questions, surpassing the highest-scoring physicians (P < 0.05). ChatGPT took an average of 13.6 s (range: 7.6-24.0 s) to answer each of the 20 questions. Readability was comparable to that of the experts and physicians involved, but ChatGPT generated longer responses than the physicians. The consistency score for repeated answers was 5.2 (range: 3.4-6.7). Across the different contexts combined, ChatGPT's overall relevance score was 6.5 (range: 4.8-7.0). Radar plot analysis indicated comparably good accuracy, efficiency and, to a certain extent, relevance; however, there were apparent inconsistencies, and the reliability and readability were considered inadequate.
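To illustrate how a radar plot of these evaluation dimensions might be produced, the following is a minimal Python/matplotlib sketch (not taken from the study). It uses the reported mean scores for accuracy (7.0), relevance (6.5), and consistency (5.2); the values for readability, reliability, and efficiency are hypothetical placeholders, since the abstract does not report them on the 7-point scale.

import numpy as np
import matplotlib.pyplot as plt

# Evaluation dimensions described in the abstract.
dimensions = ["Accuracy", "Relevance", "Consistency",
              "Readability", "Reliability", "Efficiency"]
# First three values are the reported means; the last three are
# illustrative placeholders only.
scores = [7.0, 6.5, 5.2, 4.0, 4.5, 6.8]

# Evenly spaced angles around the circle, closing the polygon by
# repeating the first point.
angles = np.linspace(0, 2 * np.pi, len(dimensions), endpoint=False).tolist()
angles += angles[:1]
values = scores + scores[:1]

fig, ax = plt.subplots(subplot_kw={"polar": True})
ax.plot(angles, values, linewidth=1.5)
ax.fill(angles, values, alpha=0.25)
ax.set_xticks(angles[:-1])
ax.set_xticklabels(dimensions)
ax.set_ylim(0, 7)  # 7-point Likert scale
ax.set_title("ChatGPT evaluation profile (illustrative)")
plt.tight_layout()
plt.show()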
ChatGPT shows promise as an initial source of information on CC and BC. It is highly functional, appears to be superior to physicians, and aligns with expert consensus, although there is room for improvement in readability, reliability, and consistency. Future efforts should focus on developing advanced ChatGPT models explicitly designed to improve medical practice and to support people with concerns about their symptoms.