• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于乳腺癌决策支持的小型语言模型聊天机器人的概念验证研究——一种透明、源码可控、可解释且数据安全的方法。

Proof-of-concept study of a small language model chatbot for breast cancer decision support - a transparent, source-controlled, explainable and data-secure approach.

机构信息

Institute for Digital Medicine, University Hospital Giessen and Marburg, Philipps-University Marburg, Marburg, Germany.

Stanford Center for Biomedical Informatics Research, Stanford University School of Medicine, Palo Alto, CA, USA.

出版信息

J Cancer Res Clin Oncol. 2024 Oct 9;150(10):451. doi: 10.1007/s00432-024-05964-3.

DOI:10.1007/s00432-024-05964-3
PMID:39382778
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11464535/
Abstract

PURPOSE

Large language models (LLM) show potential for decision support in breast cancer care. Their use in clinical care is currently prohibited by lack of control over sources used for decision-making, explainability of the decision-making process and health data security issues. Recent development of Small Language Models (SLM) is discussed to address these challenges. This preclinical proof-of-concept study tailors an open-source SLM to the German breast cancer guideline (BC-SLM) to evaluate initial clinical accuracy and technical functionality in a preclinical simulation.

METHODS

A multidisciplinary tumor board (MTB) is used as the gold-standard to assess the initial clinical accuracy in terms of concordance of the BC-SLM with MTB and comparing it to two publicly available LLM, ChatGPT3.5 and 4. The study includes 20 fictional patient profiles and recommendations for 5 treatment modalities, resulting in 100 binary treatment recommendations (recommended or not recommended). Statistical evaluation includes concordance with MTB in % including Cohen's Kappa statistic (κ). Technical functionality is assessed qualitatively in terms of local hosting, adherence to the guideline and information retrieval.

RESULTS

The overall concordance amounts to 86% for BC-SLM (κ = 0.721, p < 0.001), 90% for ChatGPT4 (κ = 0.820, p < 0.001) and 83% for ChatGPT3.5 (κ = 0.661, p < 0.001). Specific concordance for each treatment modality ranges from 65 to 100% for BC-SLM, 85-100% for ChatGPT4, and 55-95% for ChatGPT3.5. The BC-SLM is locally functional, adheres to the standards of the German breast cancer guideline and provides referenced sections for its decision-making.

CONCLUSION

The tailored BC-SLM shows initial clinical accuracy and technical functionality, with concordance to the MTB that is comparable to publicly-available LLMs like ChatGPT4 and 3.5. This serves as a proof-of-concept for adapting a SLM to an oncological disease and its guideline to address prevailing issues with LLM by ensuring decision transparency, explainability, source control, and data security, which represents a necessary step towards clinical validation and safe use of language models in clinical oncology.

摘要

目的

大型语言模型(LLM)在乳腺癌护理决策支持方面具有潜力。由于缺乏对决策使用的来源、决策过程的可解释性和健康数据安全性问题的控制,它们目前在临床护理中的使用受到限制。最近讨论了小型语言模型(SLM)的发展,以解决这些挑战。本临床前概念验证研究根据德国乳腺癌指南(BC-SLM)对开源 SLM 进行了调整,以在临床前模拟中评估其初始临床准确性和技术功能。

方法

多学科肿瘤委员会(MTB)作为金标准,根据 BC-SLM 与 MTB 的一致性评估初始临床准确性,并将其与两个公开可用的 LLM(ChatGPT3.5 和 4)进行比较。该研究包括 20 个虚构的患者概况和 5 种治疗方式的建议,产生了 100 个二进制治疗建议(推荐或不推荐)。统计评估包括与 MTB 的一致性,以百分比表示,并包括 Cohen's Kappa 统计量(κ)。技术功能从本地托管、遵守指南和信息检索的角度进行定性评估。

结果

BC-SLM 的总体一致性为 86%(κ=0.721,p<0.001),ChatGPT4 的一致性为 90%(κ=0.820,p<0.001),ChatGPT3.5 的一致性为 83%(κ=0.661,p<0.001)。BC-SLM 对每种治疗方式的具体一致性范围为 65%至 100%,ChatGPT4 为 85%至 100%,ChatGPT3.5 为 55%至 95%。BC-SLM 具有本地功能,符合德国乳腺癌指南的标准,并为其决策提供参考部分。

结论

经过调整的 BC-SLM 显示出初始临床准确性和技术功能,与 MTB 的一致性与公开可用的 LLM(如 ChatGPT4 和 3.5)相当。这为适应 SLM 以满足 LLM 存在的问题提供了概念验证,包括确保决策透明度、可解释性、来源控制和数据安全性,这是临床验证和安全使用语言模型在临床肿瘤学中的必要步骤。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/1945b73d9369/432_2024_5964_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/bc0af05a6292/432_2024_5964_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/537e56fdf1ff/432_2024_5964_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/1945b73d9369/432_2024_5964_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/bc0af05a6292/432_2024_5964_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/537e56fdf1ff/432_2024_5964_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4a7/11464535/1945b73d9369/432_2024_5964_Fig3_HTML.jpg

相似文献

1
Proof-of-concept study of a small language model chatbot for breast cancer decision support - a transparent, source-controlled, explainable and data-secure approach.用于乳腺癌决策支持的小型语言模型聊天机器人的概念验证研究——一种透明、源码可控、可解释且数据安全的方法。
J Cancer Res Clin Oncol. 2024 Oct 9;150(10):451. doi: 10.1007/s00432-024-05964-3.
2
Evolution of publicly available large language models for complex decision-making in breast cancer care.公开可用的大型语言模型在乳腺癌护理中复杂决策方面的发展。
Arch Gynecol Obstet. 2024 Jul;310(1):537-550. doi: 10.1007/s00404-024-07565-4. Epub 2024 May 29.
3
Expert-Guided Large Language Models for Clinical Decision Support in Precision Oncology.专家指导的大型语言模型在精准肿瘤学中的临床决策支持。
JCO Precis Oncol. 2024 Oct;8:e2400478. doi: 10.1200/PO-24-00478. Epub 2024 Oct 30.
4
Challenging ChatGPT 3.5 in Senology-An Assessment of Concordance with Breast Cancer Tumor Board Decision Making.在乳腺病学中挑战ChatGPT 3.5——与乳腺癌肿瘤委员会决策的一致性评估
J Pers Med. 2023 Oct 16;13(10):1502. doi: 10.3390/jpm13101502.
5
Leveraging Large Language Models for Precision Monitoring of Chemotherapy-Induced Toxicities: A Pilot Study with Expert Comparisons and Future Directions.利用大语言模型进行化疗诱导毒性的精准监测:一项专家比较及未来方向的试点研究
Cancers (Basel). 2024 Aug 12;16(16):2830. doi: 10.3390/cancers16162830.
6
Leveraging Large Language Models for Decision Support in Personalized Oncology.利用大型语言模型为个性化肿瘤学提供决策支持。
JAMA Netw Open. 2023 Nov 1;6(11):e2343689. doi: 10.1001/jamanetworkopen.2023.43689.
7
Watson for Oncology and breast cancer treatment recommendations: agreement with an expert multidisciplinary tumor board. Watson for Oncology 与乳腺癌治疗推荐:与专家多学科肿瘤委员会的一致性。
Ann Oncol. 2018 Feb 1;29(2):418-423. doi: 10.1093/annonc/mdx781.
8
Using guideline-based clinical decision support in oncological multidisciplinary team meetings: A prospective, multicenter concordance study.在肿瘤多学科团队会议中使用基于指南的临床决策支持:一项前瞻性、多中心一致性研究。
Int J Qual Health Care. 2022 Mar 19;34(1). doi: 10.1093/intqhc/mzac007.
9
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
10
Concordance Study Between IBM Watson for Oncology and Clinical Practice for Patients with Cancer in China.IBM Watson 肿瘤解决方案在中国癌症患者中的临床应用一致性研究。
Oncologist. 2019 Jun;24(6):812-819. doi: 10.1634/theoncologist.2018-0255. Epub 2018 Sep 4.

引用本文的文献

1
A comprehensive landscape of AI applications in broad-spectrum drug interaction prediction: a systematic review.人工智能在广谱药物相互作用预测中的应用全景:一项系统综述。
J Cheminform. 2025 Sep 19;17(1):141. doi: 10.1186/s13321-025-01093-2.
2
DeepSeek-R1 vs OpenAI o1 for Ophthalmic Diagnoses and Management Plans.用于眼科诊断和管理计划的DeepSeek-R1与OpenAI o1对比
JAMA Ophthalmol. 2025 Sep 4. doi: 10.1001/jamaophthalmol.2025.2918.
3
ChatGPT-4 Responses on Ankle Cartilage Surgery Often Diverge from Expert Consensus: A Comparative Analysis.

本文引用的文献

1
NGS-Guided Precision Oncology in Breast Cancer and Gynecological Tumors-A Retrospective Molecular Tumor Board Analysis.基于二代测序技术的乳腺癌和妇科肿瘤精准肿瘤学——一项回顾性分子肿瘤专家委员会分析
Cancers (Basel). 2024 Apr 19;16(8):1561. doi: 10.3390/cancers16081561.
2
Utilizing large language models in breast cancer management: systematic review.利用大型语言模型进行乳腺癌管理:系统评价。
J Cancer Res Clin Oncol. 2024 Mar 19;150(3):140. doi: 10.1007/s00432-024-05678-6.
3
Precision therapeutic landscape for breast cancer: where are we headed?
ChatGPT-4对踝关节软骨手术的回答往往与专家共识存在分歧:一项比较分析。
Foot Ankle Orthop. 2025 Aug 13;10(3):24730114251352494. doi: 10.1177/24730114251352494. eCollection 2025 Jul.
4
Retrieval augmented generation for large language models in healthcare: A systematic review.医疗保健领域大语言模型的检索增强生成:一项系统综述。
PLOS Digit Health. 2025 Jun 11;4(6):e0000877. doi: 10.1371/journal.pdig.0000877. eCollection 2025 Jun.
乳腺癌的精准治疗前景:我们将走向何方?
Lancet Reg Health West Pac. 2024 Jan 31;42:101024. doi: 10.1016/j.lanwpc.2024.101024. eCollection 2024 Jan.
4
Insights for precision oncology from the integration of genomic and clinical data of 13,880 tumors from the 100,000 Genomes Cancer Programme.从 10 万基因组癌症计划中 13880 个肿瘤的基因组和临床数据集成中获得的精准肿瘤学见解。
Nat Med. 2024 Jan;30(1):279-289. doi: 10.1038/s41591-023-02682-0. Epub 2024 Jan 11.
5
Leveraging Large Language Models for Decision Support in Personalized Oncology.利用大型语言模型为个性化肿瘤学提供决策支持。
JAMA Netw Open. 2023 Nov 1;6(11):e2343689. doi: 10.1001/jamanetworkopen.2023.43689.
6
Liquid biopsy epigenomic profiling for cancer subtyping.液体活检的表观基因组特征分析用于癌症分型。
Nat Med. 2023 Nov;29(11):2737-2741. doi: 10.1038/s41591-023-02605-z. Epub 2023 Oct 21.
7
Developing prompts from large language model for extracting clinical information from pathology and ultrasound reports in breast cancer.利用大语言模型开发提示,以从乳腺癌的病理学和超声报告中提取临床信息。
Radiat Oncol J. 2023 Sep;41(3):209-216. doi: 10.3857/roj.2023.00633. Epub 2023 Sep 21.
8
Trends and variation in treatment of early breast cancer in European certified breast centres: an EUSOMA-based analysis.欧洲认证乳腺中心早期乳腺癌治疗的趋势和变化:基于 EUSOMA 的分析。
Eur J Cancer. 2023 Oct;192:113244. doi: 10.1016/j.ejca.2023.113244. Epub 2023 Jul 18.
9
Evaluating ChatGPT as an adjunct for the multidisciplinary tumor board decision-making in primary breast cancer cases.评估 ChatGPT 在原发性乳腺癌多学科肿瘤委员会决策中的辅助作用。
Arch Gynecol Obstet. 2023 Dec;308(6):1831-1844. doi: 10.1007/s00404-023-07130-5. Epub 2023 Jul 17.
10
Evaluating GPT as an Adjunct for Radiologic Decision Making: GPT-4 Versus GPT-3.5 in a Breast Imaging Pilot.评估 GPT 作为放射学决策辅助工具:GPT-4 与 GPT-3.5 在乳腺成像试点中的比较。
J Am Coll Radiol. 2023 Oct;20(10):990-997. doi: 10.1016/j.jacr.2023.05.003. Epub 2023 Jun 21.