• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索人工智能、大语言模型的作用:将以患者为中心的信息和临床决策支持能力与妇科肿瘤学指南进行比较。

Exploring the role of artificial intelligence, large language models: Comparing patient-focused information and clinical decision support capabilities to the gynecologic oncology guidelines.

作者信息

Reicher Lee, Lutsker Guy, Michaan Nadav, Grisaru Dan, Laskov Ido

机构信息

Department of Gynecologic Oncology, Lis Hospital for Women, Tel Aviv Medical Center, Tel Aviv, Israel.

Sackler School of Medicine, Department of Gynecology, Tel Aviv University, Tel Aviv, Israel.

出版信息

Int J Gynaecol Obstet. 2025 Feb;168(2):419-427. doi: 10.1002/ijgo.15869. Epub 2024 Aug 20.

DOI:10.1002/ijgo.15869
PMID:39161265
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11726133/
Abstract

Gynecologic cancer requires personalized care to improve outcomes. Large language models (LLMs) hold the potential to provide intelligent question-answering with reliable information about medical queries in clear and plain English, which can be understood by both healthcare providers and patients. We aimed to evaluate two freely available LLMs (ChatGPT and Google's Bard) in answering questions regarding the management of gynecologic cancer. The LLMs' performances were evaluated by developing a set questions that addressed common gynecologic oncologic findings from a patient's perspective and more complex questions to elicit recommendations from a clinician's perspective. Each question was presented to the LLM interface, and the responses generated by the artificial intelligence (AI) model were recorded. The responses were assessed based on the adherence to the National Comprehensive Cancer Network and European Society of Gynecological Oncology guidelines. This evaluation aimed to determine the accuracy and appropriateness of the information provided by LLMs. We showed that the models provided largely appropriate responses to questions regarding common cervical cancer screening tests and BRCA-related questions. Less useful answers were received to complex and controversial gynecologic oncology cases, as assessed by reviewing the common guidelines. ChatGPT and Bard lacked knowledge of regional guideline variations, However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps of management and follow up. We conclude that LLMs may have a role as an adjunct informational tool to improve outcomes.

摘要

妇科癌症需要个性化护理以改善治疗效果。大语言模型有潜力以清晰易懂的英语提供关于医疗问题的可靠信息的智能问答,医疗服务提供者和患者都能理解。我们旨在评估两个免费的大语言模型(ChatGPT和谷歌的Bard)在回答有关妇科癌症管理问题方面的表现。通过设计一系列问题来评估大语言模型的性能,这些问题从患者角度涉及常见的妇科肿瘤学发现,以及从临床医生角度提出的更复杂问题以引出建议。每个问题都呈现给大语言模型界面,并记录人工智能(AI)模型生成的回答。根据是否符合美国国立综合癌症网络和欧洲妇科肿瘤学会指南来评估这些回答。该评估旨在确定大语言模型提供信息的准确性和适当性。我们发现,这些模型对有关常见宫颈癌筛查测试和与BRCA相关问题的回答在很大程度上是恰当的。通过审查通用指南评估,对于复杂和有争议的妇科肿瘤病例,得到的有用答案较少。ChatGPT和Bard缺乏对地区指南差异的了解,然而,它为患者和护理人员提供了关于下一步管理和随访的实用且多方面的建议。我们得出结论,大语言模型可能作为辅助信息工具发挥作用以改善治疗效果。

相似文献

1
Exploring the role of artificial intelligence, large language models: Comparing patient-focused information and clinical decision support capabilities to the gynecologic oncology guidelines.探索人工智能、大语言模型的作用:将以患者为中心的信息和临床决策支持能力与妇科肿瘤学指南进行比较。
Int J Gynaecol Obstet. 2025 Feb;168(2):419-427. doi: 10.1002/ijgo.15869. Epub 2024 Aug 20.
2
Is the information provided by large language models valid in educating patients about adolescent idiopathic scoliosis? An evaluation of content, clarity, and empathy : The perspective of the European Spine Study Group.大语言模型提供的信息在对患者进行青少年特发性脊柱侧凸教育方面是否有效?内容、清晰度和同理心的评估:欧洲脊柱研究小组的观点
Spine Deform. 2025 Mar;13(2):361-372. doi: 10.1007/s43390-024-00955-3. Epub 2024 Nov 4.
3
Proficiency, Clarity, and Objectivity of Large Language Models Versus Specialists' Knowledge on COVID-19's Impacts in Pregnancy: Cross-Sectional Pilot Study.大型语言模型在新冠肺炎对妊娠影响方面的熟练度、清晰度和客观性与专家知识对比:横断面试点研究
JMIR Form Res. 2025 Feb 5;9:e56126. doi: 10.2196/56126.
4
Large language model comparisons between English and Chinese query performance for cardiovascular prevention.心血管疾病预防中英查询性能的大语言模型比较。
Commun Med (Lond). 2025 May 16;5(1):177. doi: 10.1038/s43856-025-00802-0.
5
Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study.评估生成式 AI 大语言模型 ChatGPT、Google Bard 和 Microsoft Bing Chat 在支持循证牙科方面的性能:比较混合方法研究。
J Med Internet Res. 2023 Dec 28;25:e51580. doi: 10.2196/51580.
6
Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions.人工智能在减重手术中的表现:ChatGPT-4、Bing 和 Bard 在《美国代谢与减重外科学会减重手术教科书》减重手术问题中的比较分析。
Surg Obes Relat Dis. 2024 Jul;20(7):609-613. doi: 10.1016/j.soard.2024.04.014. Epub 2024 May 8.
7
Harnessing artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in generating clinician-level bariatric surgery recommendations.利用人工智能在减重手术中的应用:ChatGPT-4、Bing 和 Bard 在生成临床医生水平的减重手术建议方面的比较分析。
Surg Obes Relat Dis. 2024 Jul;20(7):603-608. doi: 10.1016/j.soard.2024.03.011. Epub 2024 Mar 24.
8
Artificial intelligence in hepatology: a comparative analysis of ChatGPT-4, Bing, and Bard at answering clinical questions.肝病学中的人工智能:ChatGPT-4、必应和巴德在回答临床问题方面的比较分析
J Can Assoc Gastroenterol. 2025 Feb 22;8(2):58-62. doi: 10.1093/jcag/gwae055. eCollection 2025 Apr.
9
Utility of Large Language Models for Health Care Professionals and Patients in Navigating Hematopoietic Stem Cell Transplantation: Comparison of the Performance of ChatGPT-3.5, ChatGPT-4, and Bard.大型语言模型在造血干细胞移植导航中对医疗保健专业人员和患者的实用性:ChatGPT-3.5、ChatGPT-4 和 Bard 的性能比较。
J Med Internet Res. 2024 May 17;26:e54758. doi: 10.2196/54758.
10
[Evaluating the accuracy of large language models in answering mammography screening questions in Italian and English: a study based on the Eusobi guidelines.].[评估大型语言模型在回答意大利语和英语乳腺钼靶筛查问题时的准确性:一项基于尤索比指南的研究。]
Recenti Prog Med. 2025 Mar;116(3):162-167. doi: 10.1701/4460.44556.

引用本文的文献

1
Exploring the possibilities and limitations of customized large language model to support and improve cervical cancer screening.探索定制大语言模型以支持和改进宫颈癌筛查的可能性与局限性。
BMC Med Inform Decis Mak. 2025 Jul 1;25(1):242. doi: 10.1186/s12911-025-03088-3.
2
Artificial intelligence in the diagnosis and management of gynecologic cancer.人工智能在妇科癌症的诊断与管理中的应用
Int J Gynaecol Obstet. 2025 Apr 25. doi: 10.1002/ijgo.70094.
3
A bibliometric analysis of artificial intelligence applied to cervical cancer.人工智能应用于宫颈癌的文献计量分析
Front Med (Lausanne). 2025 Apr 8;12:1562818. doi: 10.3389/fmed.2025.1562818. eCollection 2025.
4
The Role of Medical Therapies in the Management of Cervical Intraepithelial Neoplasia: A Narrative Review.医学治疗在宫颈上皮内瘤变管理中的作用:一项叙述性综述
Medicina (Kaunas). 2025 Feb 13;61(2):326. doi: 10.3390/medicina61020326.
5
Real world perspectives on endometriosis disease phenotyping through surgery, omics, health data, and artificial intelligence.通过手术、组学、健康数据和人工智能对子宫内膜异位症疾病表型进行的真实世界观察。
NPJ Womens Health. 2025;3(1):8. doi: 10.1038/s44294-024-00052-w. Epub 2025 Feb 6.

本文引用的文献

1
Let's chat about cervical cancer: Assessing the accuracy of ChatGPT responses to cervical cancer questions.让我们来聊聊宫颈癌:评估 ChatGPT 对宫颈癌问题回答的准确性。
Gynecol Oncol. 2023 Dec;179:164-168. doi: 10.1016/j.ygyno.2023.11.008. Epub 2023 Nov 21.
2
Large language models in medicine.医学中的大型语言模型。
Nat Med. 2023 Aug;29(8):1930-1940. doi: 10.1038/s41591-023-02448-8. Epub 2023 Jul 17.
3
Reliability of Medical Information Provided by ChatGPT: Assessment Against Clinical Guidelines and Patient Information Quality Instrument.ChatGPT 提供的医学信息的可靠性:与临床指南和患者信息质量工具的评估。
J Med Internet Res. 2023 Jun 30;25:e47479. doi: 10.2196/47479.
4
Exploring ChatGPT's Potential in Facilitating Adaptation of Clinical Guidelines: A Case Study of Diabetic Ketoacidosis Guidelines.探索ChatGPT在促进临床指南适应方面的潜力:以糖尿病酮症酸中毒指南为例的案例研究。
Cureus. 2023 May 9;15(5):e38784. doi: 10.7759/cureus.38784. eCollection 2023 May.
5
Large language model (ChatGPT) as a support tool for breast tumor board.大语言模型(ChatGPT)作为乳腺肿瘤多学科诊疗团队的辅助工具。
NPJ Breast Cancer. 2023 May 30;9(1):44. doi: 10.1038/s41523-023-00557-8.
6
FUTURE OF THE LANGUAGE MODELS IN HEALTHCARE: THE ROLE OF CHATGPT.语言模型在医疗保健领域的未来:ChatGPT 的作用。
Arq Bras Cir Dig. 2023 May 8;36:e1727. doi: 10.1590/0102-672020230002e1727. eCollection 2023.
7
ESGO/ESTRO/ESP Guidelines for the management of patients with cervical cancer - Update 2023.ESGO/ESTRO/ESP 宫颈癌管理指南-2023 年更新版。
Int J Gynecol Cancer. 2023 May 1;33(5):649-666. doi: 10.1136/ijgc-2023-004429.
8
Assessing the Capability of ChatGPT in Answering First- and Second-Order Knowledge Questions on Microbiology as per Competency-Based Medical Education Curriculum.根据基于能力的医学教育课程评估ChatGPT回答微生物学一阶和二阶知识问题的能力。
Cureus. 2023 Mar 12;15(3):e36034. doi: 10.7759/cureus.36034. eCollection 2023 Mar.
9
Appropriateness of Breast Cancer Prevention and Screening Recommendations Provided by ChatGPT.ChatGPT提供的乳腺癌预防和筛查建议的适宜性。
Radiology. 2023 May;307(4):e230424. doi: 10.1148/radiol.230424. Epub 2023 Apr 4.
10
Uterine Neoplasms, Version 1.2023, NCCN Clinical Practice Guidelines in Oncology.子宫肿瘤,第1.2023版,美国国立综合癌症网络(NCCN)肿瘤学临床实践指南
J Natl Compr Canc Netw. 2023 Feb;21(2):181-209. doi: 10.6004/jnccn.2023.0006.