
Are AI chatbots concordant with evidence-based cancer screening recommendations?

Authors

Nickel Brooke, Ayre Julie, Marinovich M Luke, Smith David P, Chiam Karen, Lee Christoph I, Wilt Timothy J, Taba Melody, McCaffery Kirsten, Houssami Nehmat

Affiliations

Sydney Health Literacy Lab, Sydney School of Public Health, Faculty of Medicine and Health, The University of Sydney, Australia; Wiser Healthcare, Sydney School of Public Health, Faculty of Medicine and Health, The University of Sydney, Australia.

Sydney Health Literacy Lab, Sydney School of Public Health, Faculty of Medicine and Health, The University of Sydney, Australia.

Publication

Patient Educ Couns. 2025 May;134:108677. doi: 10.1016/j.pec.2025.108677. Epub 2025 Jan 21.

Abstract

OBJECTIVE

This study aimed to assess whether information from AI chatbots on the benefits and harms of breast and prostate cancer screening was concordant with evidence-based cancer screening recommendations.

METHODS

Seven unique prompts (four on breast cancer; three on prostate cancer) were presented to ChatGPT in March 2024. A total of 60 criteria (30 breast; 30 prostate) were used to assess the concordance of the information. Concordance was scored from 0 to 2 against the United States Preventive Services Task Force (USPSTF) breast and prostate cancer screening recommendations, independently, by international cancer screening experts.

RESULTS

Forty-three of 60 (71.7%) criteria were completely concordant, 3 (5%) were moderately concordant, and 14 (23.3%) were not concordant or not present, with most of the non-concordant criteria (9 of 14, 64.3%) coming from prompts for the oldest age groups. ChatGPT hallucinations (i.e., completely made-up, nonsensical, or irrelevant information) were found in 9 of 60 criteria (15%).

CONCLUSIONS

ChatGPT provided information mostly concordant with USPSTF breast and prostate cancer screening recommendations; however, important gaps exist. These findings provide insights into the role of AI in communicating cancer screening benefits and harms, and hold increased relevance during periods of guideline change.

PRACTICE IMPLICATIONS

AI-generated information on cancer screening should be taken in conjunction with official screening recommendations and/or information from clinicians.

