• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

绘制晚期上皮性卵巢癌全景远非文字所能描述:两个大语言模型,八项任务,一段征程。

Mapping the Advanced-Stage Epithelial Ovarian Cancer Landscape Goes Beyond Words: Two Large Language Models, Eight Tasks, One Journey.

作者信息

Quaranta Michela, Laios Alexandros, Rogers Charlie, Mavromatidou Anastasia Ioanna, Thangavelu Amudha, Theophilou Georgios, Nugent David, DeJong Diederick, Kalampokis Evangelos

机构信息

Department of Gynaecologic Oncology, ESGO Centre of Excellence for Ovarian Cancer Surgery, St James's University Hospital, Leeds LS9 7TF, UK.

Information Systems Lab, Department of Business Administration, University of Macedonia, 54636 Thessaloniki, Greece.

出版信息

J Clin Med. 2025 Mar 25;14(7):2223. doi: 10.3390/jcm14072223.

DOI:10.3390/jcm14072223
PMID:40217674
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11989528/
Abstract

The advancement of natural language processing (NLP) technologies has transformed various sectors. However, their application in the healthcare domain, particularly for analysing clinical notes, remains underdeveloped. We investigated the use of deep neural networks, specifically transformer-based models, to predict intraoperative and post-operative outcomes related to advanced-stage epithelial ovarian cancer cytoreduction (aEOC) using unstructured surgical notes. We evaluated the performance of RoBERTa, a general-purpose language model, and GatorTron, a domain-specific model, across eight binary classification tasks using the same dataset. The dataset consisted of 560 surgical records from patients with aEOC who underwent cytoreductive surgery at a tertiary UK reference centre. Predictive outcomes were converted into binary features to facilitate classification tasks. To enhance the contextual information available to the models, textual data from "operative findings" and "operative notes" were concatenated. Our findings highlight the tangible benefits of employing domain-specific language models for clinical text analysis. GatorTron generally outperformed RoBERTa across most predictive tasks, underscoring the advantages of domain-specific pretraining for understanding medical terminology and context. Both models struggled to predict certain outcomes, particularly those involving post-operative events like major complications and length of hospital stay, despite adjustments in hyperparameters and training strategies. This limitation suggests that operative text alone may not sufficiently capture the complexities of post-operative recovery. These findings have valuable implications for developing medical AI systems to improve the delivery of modern aEOC healthcare.

摘要

自然语言处理(NLP)技术的进步已经改变了各个领域。然而,它们在医疗保健领域的应用,特别是在分析临床记录方面,仍然不够发达。我们研究了使用深度神经网络,特别是基于Transformer的模型,通过非结构化手术记录来预测与晚期上皮性卵巢癌细胞减灭术(aEOC)相关的术中及术后结果。我们使用相同的数据集,在八个二分类任务中评估了通用语言模型RoBERTa和领域特定模型GatorTron的性能。该数据集由560份来自在英国一家三级参考中心接受细胞减灭术的aEOC患者的手术记录组成。将预测结果转换为二元特征以方便分类任务。为了增强模型可用的上下文信息,将“手术发现”和“手术记录”中的文本数据进行了拼接。我们的研究结果突出了采用领域特定语言模型进行临床文本分析的切实好处。在大多数预测任务中,GatorTron通常优于RoBERTa,这凸显了领域特定预训练在理解医学术语和上下文方面 的优势。尽管对超参数和训练策略进行了调整,但两个模型在预测某些结果时都遇到了困难,特别是那些涉及术后事件(如重大并发症和住院时间)的结果。这一局限性表明,仅手术文本可能无法充分捕捉术后恢复的复杂性。这些发现对于开发医疗人工智能系统以改善现代aEOC医疗保健的提供具有宝贵的意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/298d0781f0b4/jcm-14-02223-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/a033ae0da48f/jcm-14-02223-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/962c390bf5ef/jcm-14-02223-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/298d0781f0b4/jcm-14-02223-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/a033ae0da48f/jcm-14-02223-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/962c390bf5ef/jcm-14-02223-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/482f/11989528/298d0781f0b4/jcm-14-02223-g003.jpg

相似文献

1
Mapping the Advanced-Stage Epithelial Ovarian Cancer Landscape Goes Beyond Words: Two Large Language Models, Eight Tasks, One Journey.绘制晚期上皮性卵巢癌全景远非文字所能描述:两个大语言模型,八项任务,一段征程。
J Clin Med. 2025 Mar 25;14(7):2223. doi: 10.3390/jcm14072223.
2
RoBERTa-Assisted Outcome Prediction in Ovarian Cancer Cytoreductive Surgery Using Operative Notes.基于手术记录的 RoBERTa 辅助卵巢癌细胞减灭术预后预测
Cancer Control. 2023 Jan-Dec;30:10732748231209892. doi: 10.1177/10732748231209892.
3
A large language model for electronic health records.用于电子健康记录的大型语言模型。
NPJ Digit Med. 2022 Dec 26;5(1):194. doi: 10.1038/s41746-022-00742-2.
4
Contextualized medication information extraction using Transformer-based deep learning architectures.基于 Transformer 的深度学习架构的上下文药物信息提取。
J Biomed Inform. 2023 Jun;142:104370. doi: 10.1016/j.jbi.2023.104370. Epub 2023 Apr 24.
5
Critical assessment of transformer-based AI models for German clinical notes.基于变压器的德国临床记录人工智能模型的批判性评估。
JAMIA Open. 2022 Nov 15;5(4):ooac087. doi: 10.1093/jamiaopen/ooac087. eCollection 2022 Dec.
6
Comparison of Pretraining Models and Strategies for Health-Related Social Media Text Classification.与健康相关的社交媒体文本分类的预训练模型和策略比较。
Healthcare (Basel). 2022 Aug 5;10(8):1478. doi: 10.3390/healthcare10081478.
7
From admission to discharge: a systematic review of clinical natural language processing along the patient journey.从入院到出院:患者就诊流程中临床自然语言处理的系统评价。
BMC Med Inform Decis Mak. 2024 Aug 29;24(1):238. doi: 10.1186/s12911-024-02641-w.
8
Identification of Semantically Similar Sentences in Clinical Notes: Iterative Intermediate Training Using Multi-Task Learning.临床笔记中语义相似句子的识别:使用多任务学习的迭代中间训练
JMIR Med Inform. 2020 Nov 27;8(11):e22508. doi: 10.2196/22508.
9
Evaluating large language models for health-related text classification tasks with public social media data.利用公共社交媒体数据评估用于健康相关文本分类任务的大型语言模型。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2181-2189. doi: 10.1093/jamia/ocae210.
10
Detection of Personal and Family History of Suicidal Thoughts and Behaviors using Deep Learning and Natural Language Processing: A Multi-Site Study.使用深度学习和自然语言处理检测自杀念头和行为的个人及家族史:一项多中心研究
Res Sq. 2024 Mar 11:rs.3.rs-4014472. doi: 10.21203/rs.3.rs-4014472/v1.

本文引用的文献

1
The Potential of Gemini and GPTs for Structured Report Generation based on Free-Text F-FDG PET/CT Breast Cancer Reports.基于自由文本F-FDG PET/CT乳腺癌报告的Gemini和GPTs在结构化报告生成中的潜力。
Acad Radiol. 2025 Feb;32(2):624-633. doi: 10.1016/j.acra.2024.08.052. Epub 2024 Sep 7.
2
From admission to discharge: a systematic review of clinical natural language processing along the patient journey.从入院到出院:患者就诊流程中临床自然语言处理的系统评价。
BMC Med Inform Decis Mak. 2024 Aug 29;24(1):238. doi: 10.1186/s12911-024-02641-w.
3
Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports of Lung Cancer Screening Patients Using Transformer Models.
使用Transformer模型从肺癌筛查患者的放射学报告中提取肺结节及结节特征
J Healthc Inform Res. 2024 May 17;8(3):463-477. doi: 10.1007/s41666-024-00166-5. eCollection 2024 Sep.
4
ChatGPT compared to national guidelines for management of ovarian cancer: Did ChatGPT get it right? - A Memorial Sloan Kettering Cancer Center Team Ovary study.ChatGPT 与卵巢癌管理的国家指南比较:ChatGPT 是否做对了?- 纪念斯隆凯特琳癌症中心卵巢癌团队研究。
Gynecol Oncol. 2024 Oct;189:75-79. doi: 10.1016/j.ygyno.2024.07.007. Epub 2024 Jul 22.
5
Explaining the Elusive Nature of a Well-Defined Threshold for Blood Transfusion in Advanced Epithelial Ovarian Cancer Cytoreductive Surgery.解析晚期上皮性卵巢癌肿瘤细胞减灭术中明确输血阈值难以捉摸的本质。
Diagnostics (Basel). 2023 Dec 30;14(1):94. doi: 10.3390/diagnostics14010094.
6
Exploring the Potential Role of Upper Abdominal Peritonectomy in Advanced Ovarian Cancer Cytoreductive Surgery Using Explainable Artificial Intelligence.利用可解释人工智能探索上腹部腹膜切除术在晚期卵巢癌减瘤手术中的潜在作用。
Cancers (Basel). 2023 Nov 13;15(22):5386. doi: 10.3390/cancers15225386.
7
RoBERTa-Assisted Outcome Prediction in Ovarian Cancer Cytoreductive Surgery Using Operative Notes.基于手术记录的 RoBERTa 辅助卵巢癌细胞减灭术预后预测
Cancer Control. 2023 Jan-Dec;30:10732748231209892. doi: 10.1177/10732748231209892.
8
Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods.荷兰临床文本中的否定词检测:基于规则和机器学习方法的评估。
BMC Bioinformatics. 2023 Jan 9;24(1):10. doi: 10.1186/s12859-022-05130-x.
9
Natural language processing: state of the art, current trends and challenges.自然语言处理:技术现状、当前趋势与挑战。
Multimed Tools Appl. 2023;82(3):3713-3744. doi: 10.1007/s11042-022-13428-4. Epub 2022 Jul 14.
10
Ask Rosa - The making of a digital genetic conversation tool, a chatbot, about hereditary breast and ovarian cancer.向罗莎提问——制作一个关于遗传性乳腺癌和卵巢癌的数字遗传对话工具,一个聊天机器人。
Patient Educ Couns. 2022 Jun;105(6):1488-1494. doi: 10.1016/j.pec.2021.09.027. Epub 2021 Oct 6.