• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能可促进风险分层算法在膀胱癌患者病例场景中的应用。

Artificial Intelligence can Facilitate Application of Risk Stratification Algorithms to Bladder Cancer Patient Case Scenarios.

作者信息

Yudovich Max S, Alzubaidi Ahmad N, Raman Jay D

机构信息

Penn State Health Milton S. Hershey Medical Center, Hershey, PA, USA.

出版信息

Clin Med Insights Oncol. 2024 Nov 17;18:11795549241296781. doi: 10.1177/11795549241296781. eCollection 2024.

DOI:10.1177/11795549241296781
PMID:39559828
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11571244/
Abstract

BACKGROUND

Chat Generative Pre-Trained Transformer (ChatGPT) has previously been shown to accurately predict colon cancer screening intervals when provided with clinical data and context in the form of guidelines. The National Comprehensive Cancer Network (NCCN) guideline on non-muscle invasive bladder cancer (NMIBC) includes criteria for risk stratification into low-, intermediate-, and high-risk groups based on patient and disease characteristics. The aim of this study is to evaluate the ability of ChatGPT to apply the NCCN Guidelines to risk stratify theoretical patient scenarios related to NMIBC.

METHODS

Thirty-six hypothetical patient scenarios related to NMIBC were created and submitted to GPT-3.5 and GPT-4 at two separate time points. First, both models were prompted to risk stratify patients without any additional context provided. Custom instructions were then provided as textual context using the written versions of the NMIBC NCCN Guidelines, followed by repeat risk stratification. Finally, GPT-4 was provided with an image of the NMIBC risk groups table, and the risk stratification was again performed.

RESULTS

GPT-3.5 correctly risk stratified 68% (24.5 of 36) of scenarios without context, slightly increasing to 74% (26.5 of 36) with textual context. Using GPT-4, the model had accuracy of 83% (30 of 36) without context, reaching 100% (36 of 36) with textual context ( = .025). GPT-4 with image context maintained similar accuracy to GPT-4 without context, with accuracy 81% (29 of 36). ChatGPT generally performed poorly when stratifying intermediate risk NMIBC (33%-63%). When risk stratification was incorrect, most responses were overestimations of risk.

CONCLUSIONS

GPT-4 can accurately risk stratify patients with respect to NMIBC when provided with context containing guidelines. Overestimation of risk is more common than underestimation, and intermediate risk NMIBC is most likely to be incorrectly stratified. With further validation, GPT-4 can become a tool for risk stratification of NMIBC in clinical practice.

摘要

背景

之前的研究表明,当以指南的形式提供临床数据和背景信息时,聊天生成预训练变换器(ChatGPT)能够准确预测结肠癌筛查间隔。美国国立综合癌症网络(NCCN)关于非肌层浸润性膀胱癌(NMIBC)的指南包括根据患者和疾病特征将风险分层为低、中、高风险组的标准。本研究的目的是评估ChatGPT应用NCCN指南对与NMIBC相关的理论患者情况进行风险分层的能力。

方法

创建了36个与NMIBC相关的假设患者情况,并在两个不同时间点提交给GPT-3.5和GPT-4。首先,在不提供任何额外背景信息的情况下,促使两个模型对患者进行风险分层。然后使用NMIBC NCCN指南的书面版本作为文本背景提供自定义说明,随后再次进行风险分层。最后,向GPT-4提供NMIBC风险组表的图像,并再次进行风险分层。

结果

GPT-3.5在无背景信息的情况下正确对68%(36个中的24.5个)的情况进行了风险分层,在有文本背景信息时略有增加至74%(36个中的26.5个)。使用GPT-4时,该模型在无背景信息时的准确率为83%(36个中的30个),在有文本背景信息时达到100%(36个中的36个)(P = 0.025)。有图像背景信息的GPT-4与无背景信息的GPT-4保持相似的准确率,准确率为81%(36个中的29个)。ChatGPT在对中度风险NMIBC进行分层时总体表现较差(33%-63%)。当风险分层错误时,大多数回答是对风险的高估。

结论

当提供包含指南的背景信息时,GPT-4能够准确地对NMIBC患者进行风险分层。风险高估比低估更常见,中度风险NMIBC最容易被错误分层。经过进一步验证,GPT-4可以成为临床实践中NMIBC风险分层的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/11cba0424ed0/10.1177_11795549241296781-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/d8c4af9d9087/10.1177_11795549241296781-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/ce5839967726/10.1177_11795549241296781-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/11cba0424ed0/10.1177_11795549241296781-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/d8c4af9d9087/10.1177_11795549241296781-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/ce5839967726/10.1177_11795549241296781-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8827/11571244/11cba0424ed0/10.1177_11795549241296781-fig3.jpg

相似文献

1
Artificial Intelligence can Facilitate Application of Risk Stratification Algorithms to Bladder Cancer Patient Case Scenarios.人工智能可促进风险分层算法在膀胱癌患者病例场景中的应用。
Clin Med Insights Oncol. 2024 Nov 17;18:11795549241296781. doi: 10.1177/11795549241296781. eCollection 2024.
2
[Results of a Questionnaire-Based Study on Guideline Adherence Regarding Adjuvant Treatment Recommendations for Patients with Non-Muscle-Invasive Bladder Cancer: Just a Disturbing Sidelight?].[关于非肌层浸润性膀胱癌患者辅助治疗建议的指南依从性的问卷调查研究结果:只是一个令人不安的侧面情况?]
Aktuelle Urol. 2016 Sep;47(5):408-13. doi: 10.1055/s-0042-104785. Epub 2016 Jun 14.
3
European Association of Urology Guidelines on Non-muscle-invasive Bladder Cancer (TaT1 and Carcinoma In Situ) - 2019 Update.欧洲泌尿外科学会非肌肉浸润性膀胱癌(TaT1 和原位癌)指南 - 2019 年更新版。
Eur Urol. 2019 Nov;76(5):639-657. doi: 10.1016/j.eururo.2019.08.016. Epub 2019 Aug 20.
4
ChatGPT compared to national guidelines for management of ovarian cancer: Did ChatGPT get it right? - A Memorial Sloan Kettering Cancer Center Team Ovary study.ChatGPT 与卵巢癌管理的国家指南比较:ChatGPT 是否做对了?- 纪念斯隆凯特琳癌症中心卵巢癌团队研究。
Gynecol Oncol. 2024 Oct;189:75-79. doi: 10.1016/j.ygyno.2024.07.007. Epub 2024 Jul 22.
5
Discrepancy Between European Association of Urology Guidelines and Daily Practice in the Management of Non-muscle-invasive Bladder Cancer: Results of a European Survey.欧洲泌尿外科学会指南与非肌肉浸润性膀胱癌管理日常实践之间的差异:一项欧洲调查的结果。
Eur Urol Focus. 2019 Jul;5(4):681-688. doi: 10.1016/j.euf.2017.09.002. Epub 2017 Oct 23.
6
Performance of ChatGPT on the Peruvian National Licensing Medical Examination: Cross-Sectional Study.ChatGPT在秘鲁国家医学执照考试中的表现:横断面研究
JMIR Med Educ. 2023 Sep 28;9:e48039. doi: 10.2196/48039.
7
Advancing Medical Education: Performance of Generative Artificial Intelligence Models on Otolaryngology Board Preparation Questions With Image Analysis Insights.推进医学教育:生成式人工智能模型在耳鼻喉科委员会备考问题上的表现及图像分析见解
Cureus. 2024 Jul 9;16(7):e64204. doi: 10.7759/cureus.64204. eCollection 2024 Jul.
8
Risk Stratification Tools and Prognostic Models in Non-muscle-invasive Bladder Cancer: A Critical Assessment from the European Association of Urology Non-muscle-invasive Bladder Cancer Guidelines Panel.非肌层浸润性膀胱癌的风险分层工具和预后模型:欧洲泌尿外科学会非肌层浸润性膀胱癌指南小组的批判性评估。
Eur Urol Focus. 2020 May 15;6(3):479-489. doi: 10.1016/j.euf.2018.11.005. Epub 2018 Nov 22.
9
Appropriateness of ChatGPT in Answering Heart Failure Related Questions.ChatGPT 在回答心力衰竭相关问题方面的适宜性。
Heart Lung Circ. 2024 Sep;33(9):1314-1318. doi: 10.1016/j.hlc.2024.03.005. Epub 2024 May 31.
10
Performance of Progressive Generations of GPT on an Exam Designed for Certifying Physicians as Certified Clinical Densitometrists.GPT 各代产品在专为认证医师为认证临床骨密度技师而设计的考试中的表现。
J Clin Densitom. 2024 Apr-Jun;27(2):101480. doi: 10.1016/j.jocd.2024.101480. Epub 2024 Feb 17.

引用本文的文献

1
Comment on: "Artificial Intelligence Can Facilitate Application of Risk Stratification Algorithms to Bladder Cancer Patient Case Scenarios".评论:“人工智能可促进风险分层算法在膀胱癌患者病例场景中的应用” 。
Clin Med Insights Oncol. 2025 Jul 6;19:11795549251350242. doi: 10.1177/11795549251350242. eCollection 2025.

本文引用的文献

1
ChatGPT v4 outperforming v3.5 on cancer treatment recommendations in quality, clinical guideline, and expert opinion concordance.ChatGPT v4在癌症治疗建议方面,在质量、临床指南和专家意见一致性上优于v3.5。
Digit Health. 2024 Aug 14;10:20552076241269538. doi: 10.1177/20552076241269538. eCollection 2024 Jan-Dec.
2
The Prognostic Significance of Histological Subtypes in Patients with Muscle-Invasive Bladder Cancer: An Overview of the Current Literature.组织学亚型在肌层浸润性膀胱癌患者中的预后意义:当前文献综述
J Clin Med. 2024 Jul 25;13(15):4349. doi: 10.3390/jcm13154349.
3
The accuracy and quality of image-based artificial intelligence for muscle-invasive bladder cancer prediction.
基于图像的人工智能在预测肌肉浸润性膀胱癌方面的准确性和质量。
Insights Imaging. 2024 Aug 1;15(1):185. doi: 10.1186/s13244-024-01780-y.
4
Performance of GPT-3.5 and GPT-4 on standardized urology knowledge assessment items in the United States: a descriptive study.GPT-3.5 和 GPT-4 在标准化美国泌尿科知识评估项目中的表现:一项描述性研究。
J Educ Eval Health Prof. 2024;21:17. doi: 10.3352/jeehp.2024.21.17. Epub 2024 Jul 8.
5
Is ChatGPT ready for primetime? Performance of artificial intelligence on a simulated Canadian urology board exam.ChatGPT 准备好正式登场了吗?人工智能在模拟加拿大泌尿外科委员会考试中的表现。
Can Urol Assoc J. 2024 Oct;18(10):329-332. doi: 10.5489/cuaj.8800.
6
Comparative analysis of artificial intelligence chatbot recommendations for urolithiasis management: A study of EAU guideline compliance.人工智能聊天机器人对尿石症管理建议的比较分析:一项关于欧洲泌尿外科学会指南依从性的研究
Fr J Urol. 2024 Jul;34(7-8):102666. doi: 10.1016/j.fjurol.2024.102666. Epub 2024 Jun 5.
7
The efficacy of artificial intelligence in urology: a detailed analysis of kidney stone-related queries.人工智能在泌尿科的疗效:肾结石相关查询的详细分析。
World J Urol. 2024 Mar 14;42(1):158. doi: 10.1007/s00345-024-04847-z.
8
Evaluating ChatGPT ability to answer urinary tract Infection-Related questions.评估 ChatGPT 回答尿路感染相关问题的能力。
Infect Dis Now. 2024 Jun;54(4):104884. doi: 10.1016/j.idnow.2024.104884. Epub 2024 Mar 8.
9
Urological Cancers and ChatGPT: Assessing the Quality of Information and Possible Risks for Patients.泌尿系统癌症与ChatGPT:评估信息质量及对患者的潜在风险
Clin Genitourin Cancer. 2024 Apr;22(2):454-457.e4. doi: 10.1016/j.clgc.2023.12.017. Epub 2024 Jan 5.
10
ChatGPT on guidelines: Providing contextual knowledge to GPT allows it to provide advice on appropriate colonoscopy intervals.ChatGPT 指南:提供上下文知识可以使 GPT 提供有关适当结肠镜检查间隔的建议。
J Gastroenterol Hepatol. 2024 Jan;39(1):81-106. doi: 10.1111/jgh.16375. Epub 2023 Oct 19.