
Exploring ChatGPT effectiveness in addressing direct patient queries on colorectal cancer screening.

Authors

Maida Marcello, Mori Yuichi, Fuccio Lorenzo, Sferrazza Sandro, Vitello Alessandro, Facciorusso Antonio, Hassan Cesare

Affiliations

Department of Medicine and Surgery, Kore University of Enna, Enna, Italy.

Gastroenterology Unit, Umberto I Hospital, Enna, Italy.

Publication

Endosc Int Open. 2025 May 12;13:a25689416. doi: 10.1055/a-2568-9416. eCollection 2025.

Abstract

BACKGROUND AND STUDY AIMS

Recent studies have shown that large language models (LLMs) can enhance understanding of colorectal cancer (CRC) screening, potentially increasing participation rates. However, a limitation of these studies is that the questions posed to LLMs were generated by experts. This study aimed to investigate the effectiveness of ChatGPT-4o in answering CRC screening queries generated directly by patients.

PATIENTS AND METHODS

Ten consecutive subjects aged 50 to 69 years who were eligible for the Italian national CRC screening program but not actively participating were enrolled. Four possible CRC screening scenarios were presented to each participant, who was asked to formulate one question per scenario to gather additional information. These questions were then posed to ChatGPT in two separate sessions. The responses were evaluated by five senior experts, who rated each answer on three criteria, accuracy, completeness, and comprehensibility, using a 5-point Likert scale. In addition, the same 10 patients who created the questions assessed the answers, rating each response as complete, understandable, and trustworthy on a dichotomous scale (yes/no).

RESULTS

Experts rated the responses with mean scores of 4.1 ± 1.0 for accuracy, 4.2 ± 1.0 for completeness, and 4.3 ± 1.0 for comprehensibility. Patients rated responses as complete in 97.5%, understandable in 95%, and trustworthy in 100% of cases. Consistency over time was confirmed by an 86.8% similarity between session responses.

CONCLUSIONS

Despite variability in questions and answers, ChatGPT showed good performance in answering CRC screening queries, even when used directly by patients.


