• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用大语言模型和荟萃分析提高肺癌质子治疗后并发症预测能力。

Improving Prediction of Complications Post-Proton Therapy in Lung Cancer Using Large Language Models and Meta-Analysis.

机构信息

Medical Physics and Informatics Laboratory of Electronics Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan.

Department of Radiation Oncology, Kaohsiung Chang Gung Memorial Hospital and Chang Gung University College of Medicine, Kaohsiung, Taiwan.

出版信息

Cancer Control. 2024 Jan-Dec;31:10732748241286749. doi: 10.1177/10732748241286749.

DOI:10.1177/10732748241286749
PMID:39307562
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11418344/
Abstract

PURPOSE

This study enhances the efficiency of predicting complications in lung cancer patients receiving proton therapy by utilizing large language models (LLMs) and meta-analytical techniques for literature quality assessment.

MATERIALS AND METHODS

We integrated systematic reviews with LLM evaluations, sourcing studies from Web of Science, PubMed, and Scopus, managed via EndNote X20. Inclusion and exclusion criteria ensured literature relevance. Techniques included meta-analysis, heterogeneity assessment using Cochran's Q test and I statistics, and subgroup analyses for different complications. Quality and bias risk were assessed using the PROBAST tool and further analyzed with models such as ChatGPT-4, Llama2-13b, and Llama3-8b. Evaluation metrics included AUC, accuracy, precision, recall, F1 score, and time efficiency (WPM).

RESULTS

The meta-analysis revealed an overall effect size of 0.78 for model predictions, with high heterogeneity observed (I = 72.88%, < 0.001). Subgroup analysis for radiation-induced esophagitis and pneumonitis revealed predictive effect sizes of 0.79 and 0.77, respectively, with a heterogeneity index (I) of 0%, indicating that there were no significant differences among the models in predicting these specific complications. A literature assessment using LLMs demonstrated that ChatGPT-4 achieved the highest accuracy at 90%, significantly outperforming the Llama3 and Llama2 models, which had accuracies ranging from 44% to 62%. Additionally, LLM evaluations were conducted 3229 times faster than manual assessments were, markedly enhancing both efficiency and accuracy. The risk assessment results identified nine studies as high risk, three as low risk, and one as unknown, confirming the robustness of the ChatGPT-4 across various evaluation metrics.

CONCLUSION

This study demonstrated that the integration of large language models with meta-analysis techniques can significantly increase the efficiency of literature evaluations and reduce the time required for assessments, confirming that there are no significant differences among models in predicting post proton therapy complications in lung cancer patients.

摘要

目的

本研究通过利用大语言模型(LLM)和文献质量评估的荟萃分析技术,提高预测接受质子治疗的肺癌患者并发症的效率。

材料与方法

我们整合了系统评价和 LLM 评估,从 Web of Science、PubMed 和 Scopus 中获取研究,通过 EndNote X20 进行管理。纳入和排除标准确保了文献的相关性。方法包括荟萃分析、使用 Cochran's Q 检验和 I 统计量评估异质性,以及针对不同并发症的亚组分析。使用 PROBAST 工具评估质量和偏倚风险,并进一步使用 ChatGPT-4、Llama2-13b 和 Llama3-8b 等模型进行分析。评估指标包括 AUC、准确性、精度、召回率、F1 得分和时间效率(WPM)。

结果

荟萃分析显示,模型预测的总体效应大小为 0.78,观察到高度异质性(I = 72.88%, < 0.001)。对放射性食管炎和放射性肺炎的亚组分析显示,预测效应大小分别为 0.79 和 0.77,异质性指数(I)为 0%,表明在预测这些特定并发症方面,模型之间没有显著差异。使用 LLM 对文献进行评估表明,ChatGPT-4 的准确率最高,为 90%,明显优于 Llama3 和 Llama2 模型,其准确率范围为 44%至 62%。此外,LLM 评估的速度比手动评估快 3229 倍,显著提高了效率和准确性。风险评估结果确定了 9 项研究为高风险,3 项为低风险,1 项为未知,证实了 ChatGPT-4 在各种评估指标下的稳健性。

结论

本研究表明,将大语言模型与荟萃分析技术相结合,可以显著提高文献评估的效率,并减少评估所需的时间,证实了在预测肺癌患者接受质子治疗后的并发症方面,模型之间没有显著差异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/c8f90bd7e789/10.1177_10732748241286749-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/be9298e5aa56/10.1177_10732748241286749-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/2213db88e0e3/10.1177_10732748241286749-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/a41390ef6fb6/10.1177_10732748241286749-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/446c4964b6c2/10.1177_10732748241286749-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/c8f90bd7e789/10.1177_10732748241286749-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/be9298e5aa56/10.1177_10732748241286749-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/2213db88e0e3/10.1177_10732748241286749-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/a41390ef6fb6/10.1177_10732748241286749-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/446c4964b6c2/10.1177_10732748241286749-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5217/11418344/c8f90bd7e789/10.1177_10732748241286749-fig5.jpg

相似文献

1
Improving Prediction of Complications Post-Proton Therapy in Lung Cancer Using Large Language Models and Meta-Analysis.利用大语言模型和荟萃分析提高肺癌质子治疗后并发症预测能力。
Cancer Control. 2024 Jan-Dec;31:10732748241286749. doi: 10.1177/10732748241286749.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare.ChatGPT及其他对话式大语言模型在医疗保健领域的系统评价
medRxiv. 2024 Apr 27:2024.04.26.24306390. doi: 10.1101/2024.04.26.24306390.
4
The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review.大型语言模型在变革急诊医学中的作用:范围综述
JMIR Med Inform. 2024 May 10;12:e53787. doi: 10.2196/53787.
5
Leveraging Large Language Models for Precision Monitoring of Chemotherapy-Induced Toxicities: A Pilot Study with Expert Comparisons and Future Directions.利用大语言模型进行化疗诱导毒性的精准监测:一项专家比较及未来方向的试点研究
Cancers (Basel). 2024 Aug 12;16(16):2830. doi: 10.3390/cancers16162830.
6
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同行用户对解释非专业患者实验室检测结果的答案质量比较:评估研究。
J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.
7
Performance of ChatGPT Across Different Versions in Medical Licensing Examinations Worldwide: Systematic Review and Meta-Analysis.ChatGPT 在全球医学执照考试不同版本中的表现:系统评价和荟萃分析。
J Med Internet Res. 2024 Jul 25;26:e60807. doi: 10.2196/60807.
8
Evidence-based potential of generative artificial intelligence large language models in orthodontics: a comparative study of ChatGPT, Google Bard, and Microsoft Bing.生成式人工智能大语言模型在正畸学中的循证潜力:ChatGPT、谷歌巴德和微软必应的比较研究
Eur J Orthod. 2024 Apr 13. doi: 10.1093/ejo/cjae017.
9
Using artificial intelligence to generate medical literature for urology patients: a comparison of three different large language models.利用人工智能为泌尿科患者生成医学文献:三种不同的大型语言模型比较。
World J Urol. 2024 Jul 29;42(1):455. doi: 10.1007/s00345-024-05146-3.
10
Use of Large Language Models to Predict Neuroimaging.大语言模型在神经影像学预测中的应用。
J Am Coll Radiol. 2023 Oct;20(10):1004-1009. doi: 10.1016/j.jacr.2023.06.008. Epub 2023 Jul 8.

本文引用的文献

1
Sa-TTCA: An SVM-based approach for tumor T-cell antigen classification using features extracted from biological sequencing and natural language processing.Sa-TTCA:一种基于 SVM 的方法,用于使用从生物测序和自然语言处理中提取的特征对肿瘤 T 细胞抗原进行分类。
Comput Biol Med. 2024 May;174:108408. doi: 10.1016/j.compbiomed.2024.108408. Epub 2024 Apr 4.
2
Can large language models reason about medical questions?大型语言模型能对医学问题进行推理吗?
Patterns (N Y). 2024 Mar 1;5(3):100943. doi: 10.1016/j.patter.2024.100943. eCollection 2024 Mar 8.
3
The Trajectory of Oral Mucositis in Head and Neck Cancer Patients Undergoing Radiotherapy and its Influencing Factors.
头颈部癌放疗患者口腔黏膜炎的轨迹及其影响因素
Ear Nose Throat J. 2025 May;104(5):NP257-NP269. doi: 10.1177/01455613241228211. Epub 2024 Feb 9.
4
Using meta-analysis and CNN-NLP to review and classify the medical literature for normal tissue complication probability in head and neck cancer.运用荟萃分析和 CNN-NLP 对头颈部癌症正常组织并发症概率的医学文献进行回顾和分类。
Radiat Oncol. 2024 Jan 9;19(1):5. doi: 10.1186/s13014-023-02381-7.
5
Predicting the Effect of Proton Beam Therapy Technology on Pulmonary Toxicities for Patients With Locally Advanced Lung Cancer Enrolled in the Proton Collaborative Group Prospective Clinical Trial.预测质子束治疗技术对质子协作组前瞻性临床试验中局部晚期肺癌患者肺部毒性的影响。
Int J Radiat Oncol Biol Phys. 2024 May 1;119(1):66-77. doi: 10.1016/j.ijrobp.2023.11.026. Epub 2023 Nov 23.
6
Prediction of radiation pneumonitis using the effective α/β of lungs and heart in NSCLC patients treated with proton beam therapy.使用质子束治疗非小细胞肺癌患者的肺和心脏有效α/β预测放射性肺炎。
Radiother Oncol. 2024 Jan;190:110013. doi: 10.1016/j.radonc.2023.110013. Epub 2023 Nov 14.
7
Combined Methylation of and Genes in Diagnosing Malignant Pleural Effusion.联合 和 基因甲基化用于诊断恶性胸腔积液
Discov Med. 2023 Oct;35(178):845-852. doi: 10.24976/Discov.Med.202335178.79.
8
Screening for primary aldosteronism on and off interfering medications.停用或不停用干扰药物时的原发性醛固酮增多症筛查。
Endocrine. 2024 Jan;83(1):178-187. doi: 10.1007/s12020-023-03520-6. Epub 2023 Oct 5.
9
Leveraging transformers-based language models in proteome bioinformatics.基于转换器的语言模型在蛋白质组生物信息学中的应用。
Proteomics. 2023 Dec;23(23-24):e2300011. doi: 10.1002/pmic.202300011. Epub 2023 Jun 29.
10
ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations.医学领域的ChatGPT:其应用、优势、局限性、未来前景及伦理考量概述
Front Artif Intell. 2023 May 4;6:1169595. doi: 10.3389/frai.2023.1169595. eCollection 2023.