Suppr超能文献

评估ChatGPT在为老年女性提供乳腺钼靶筛查建议方面的准确性:人工智能与癌症沟通

Evaluating ChatGPT's Accuracy in Providing Screening Mammography Recommendations among Older Women: Artificial Intelligence and Cancer Communication.

作者信息

Braithwaite Dejana, Karanth Shama D, Divaker Joel, Schoenborn Nancy, Lin Kenneth, Richman Ilana, Hochhegger Bruno, O'Neill Suzanne, Schonberg Mara

出版信息

Res Sq. 2024 Jan 31:rs.3.rs-3911155. doi: 10.21203/rs.3.rs-3911155/v1.

Abstract

The U.S. Preventive Services Task Force (USPSTF) recommends biennial screening mammography through age 74. Guidelines vary as to whether or not they recommended mammography screening to women aged 75 and older. This study aims to determine the ability of ChatGPT to provide appropriate recommendations for breast cancer screening in patients aged 75 years and older. 12 questions and 4 clinical vignettes addressing fundamental concepts about breast cancer screening and prevention in patients aged 75 years and older were created and asked to ChatGPT three consecutive times to generate 3 sets of responses. The responses were graded by a multi-disciplinary panel of experts in the intersection of breast cancer screening and aging The responses were graded as 'appropriate', 'inappropriate', or 'unreliable' based on the reviewer's clinical judgment, content of the response, and whether the content was consistent across the three responses Appropriateness was determined through a majority consensus. The responses generated by ChatGPT were appropriate for 11/17 questions (64%). Three questions were graded as inappropriate (18%) and 2 questions were graded as unreliable (12%). A consensus was not reached on one question (6%) and was graded as no consensus. While recognizing the limitations of ChatGPT, it has potential to provide accurate health care information and could be utilized by healthcare professionals to assist in providing recommendations for breast cancer screening in patients age 75 years and older. Physician oversight will be necessary, due to the possibility of ChatGPT to provide inappropriate and unreliable responses, and the importance of accuracy in medicine.

摘要

美国预防服务工作组(USPSTF)建议,74岁及以下女性应每两年进行一次乳腺钼靶筛查。对于75岁及以上女性是否推荐进行乳腺钼靶筛查,各指南的规定有所不同。本研究旨在确定ChatGPT为75岁及以上患者提供乳腺癌筛查适当建议的能力。针对75岁及以上患者乳腺癌筛查和预防的基本概念,设计了12个问题和4个临床病例,并连续三次向ChatGPT提问,以生成3组回答。由乳腺癌筛查与老龄化交叉领域的多学科专家小组对这些回答进行评分。根据评审人员的临床判断、回答内容以及三个回答的内容是否一致,将回答评为“适当”、“不适当”或“不可靠”。适当性通过多数共识确定。ChatGPT生成的回答在17个问题中有11个是适当的(64%)。三个问题被评为不适当(18%),两个问题被评为不可靠(12%)。有一个问题(6%)未达成共识,被评为无共识。虽然认识到ChatGPT的局限性,但它有潜力提供准确的医疗保健信息,医疗保健专业人员可利用它协助为75岁及以上患者提供乳腺癌筛查建议。由于ChatGPT可能提供不适当和不可靠的回答,以及医学中准确性的重要性,医生的监督将是必要的。

相似文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验