
Testing and Validation of a Custom Retrained Large Language Model for the Supportive Care of HN Patients with External Knowledge Base.

Authors

Zhu Libing, Rong Yi, McGee Lisa A, Rwigema Jean-Claude M, Patel Samir H

Affiliation

Department of Radiation Oncology, Mayo Clinic, Phoenix, AZ 85054, USA.

Publication details

Cancers (Basel). 2024 Jun 24;16(13):2311. doi: 10.3390/cancers16132311.

DOI:10.3390/cancers16132311
PMID:39001375
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11240646/
Abstract

PURPOSE

This study aimed to develop a retrained large language model (LLM) tailored to the needs of head and neck (HN) cancer patients treated with radiotherapy, with emphasis on symptom management and survivorship care.

METHODS

A comprehensive external database was curated for training ChatGPT-4, integrating expert-identified consensus guidelines on supportive care for HN patients and correspondence from physicians and nurses within our institution's electronic medical records for 90 HN patients. The model's performance was evaluated using 20 patient post-treatment inquiries, with its responses assessed by three board-certified radiation oncologists (RadOncs). Responses were rated on a scale of 1 (strongly disagree) to 5 (strongly agree) for accuracy, clarity, completeness, and relevance.

RESULTS

The average scores for the 20 tested questions were 4.25 for accuracy, 4.35 for clarity, 4.22 for completeness, and 4.32 for relevance, on a 5-point scale. Overall, 91.67% (220 out of 240) of assessments received scores of 3 or higher, and 83.33% (200 out of 240) received scores of 4 or higher.
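The reported percentages can be checked with a few lines of arithmetic. The sketch below assumes that each of the three raters scored every one of the 20 questions on all four criteria, which would account for the 240 assessments; the abstract gives the total but not this breakdown, so treat it as an inference. The function and variable names are illustrative only.

```python
# Counts taken from the abstract; the 20 x 3 x 4 breakdown is an assumption
# used here to explain where the 240 assessments come from.
QUESTIONS = 20
RATERS = 3
CRITERIA = ("accuracy", "clarity", "completeness", "relevance")


def total_assessments(questions: int, raters: int, criteria: int) -> int:
    """Number of individual ratings if every rater scores every question on every criterion."""
    return questions * raters * criteria


def share_percent(count: int, total: int) -> float:
    """Percentage of assessments, rounded to two decimals as in the abstract."""
    return round(100 * count / total, 2)


total = total_assessments(QUESTIONS, RATERS, len(CRITERIA))
pct_3_or_higher = share_percent(220, total)  # assessments scored 3 or higher
pct_4_or_higher = share_percent(200, total)  # assessments scored 4 or higher

print(total, pct_3_or_higher, pct_4_or_higher)
```

Under that assumption the totals agree with the figures in the Results: 240 assessments, 91.67% scored 3 or higher, and 83.33% scored 4 or higher.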

CONCLUSION

The custom-trained model demonstrates high accuracy in providing support to HN patients, offering evidence-based information and guidance on their symptom management and survivorship care.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c498/11240646/6414e760bad4/cancers-16-02311-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c498/11240646/d455a7341fe6/cancers-16-02311-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c498/11240646/51afcebad5e2/cancers-16-02311-g002.jpg


