Uncertainty-aware automatic TNM staging classification for [18F] Fluorodeoxyglucose PET-CT reports for lung cancer utilising transformer-based language models and multi-task learning.

Author Information

Barlow Stephen H, Chicklore Sugama, He Yulan, Ourselin Sebastien, Wagner Thomas, Barnes Anna, Cook Gary J R

Affiliations

School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK.

King's College London and Guy's and St. Thomas' PET Centre, St. Thomas' Hospital, London, UK.

Publication Information

BMC Med Inform Decis Mak. 2024 Dec 18;24(1):396. doi: 10.1186/s12911-024-02814-7.

Abstract

BACKGROUND

[18F] Fluorodeoxyglucose (FDG) PET-CT is a clinical imaging modality widely used in diagnosing and staging lung cancer. The clinical findings of PET-CT studies are contained within free text reports, which can currently only be categorised by experts manually reading them. Pre-trained transformer-based language models (PLMs) have shown success in extracting complex linguistic features from text. Accordingly, we developed a multi-task 'TNMu' classifier to classify the presence/absence of tumour, node, and metastasis ('TNM') findings (as defined by the Eighth Edition of TNM Staging for Lung Cancer). This is combined with an uncertainty classification task ('u') to account for studies with ambiguous TNM status.
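
The abstract does not include code, so the sketch below is only a rough illustration of the architecture it describes: a shared pre-trained encoder with four independent binary heads, one each for T, N, and M presence/absence and one for the uncertainty flag. The class name, head layout, use of the Hugging Face transformers API, and the GatorTron checkpoint identifier are all assumptions rather than the authors' implementation.

```python
# Hypothetical sketch of a multi-task "TNMu" classifier: a shared PLM
# encoder feeding four independent binary heads (T, N, M, uncertainty).
# Checkpoint name and architecture details are assumptions, not the
# authors' code.
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class TNMuClassifier(nn.Module):
    def __init__(self, plm_name="UFNLP/gatortron-base", n_tasks=4):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(plm_name)
        hidden = self.encoder.config.hidden_size
        # One 2-class head per task: T, N, M, and uncertainty ("u").
        self.heads = nn.ModuleList([nn.Linear(hidden, 2) for _ in range(n_tasks)])

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0]          # [CLS]-token representation
        return [head(cls) for head in self.heads]  # one pair of logits per task

# Minimal usage: tokenise a report and obtain per-task logits.
tokenizer = AutoTokenizer.from_pretrained("UFNLP/gatortron-base")
model = TNMuClassifier()
batch = tokenizer(["Intensely FDG-avid right upper lobe mass ..."],
                  return_tensors="pt", truncation=True, padding=True)
logits_per_task = model(batch["input_ids"], batch["attention_mask"])
```

In a multi-task setup of this kind the training loss is typically the sum of the per-task cross-entropy losses, so a single fine-tuned encoder serves all four tasks; this is consistent with, but not taken from, the paper's description.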

METHODS

2498 reports were annotated by a nuclear medicine physician and split into train, validation, and test datasets. For additional evaluation, an external dataset (n = 461 reports) was created and annotated by two nuclear medicine physicians, with agreement reached on all examples. We trained and evaluated eleven publicly available PLMs to determine which is most effective for PET-CT reports, and compared multi-task, single-task, and traditional machine learning approaches.
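
The abstract does not say which traditional machine-learning baseline was compared against the PLM approaches. Purely for illustration, a common report-level baseline is TF-IDF features with an independent linear classifier per task, sketched below with scikit-learn; the feature settings and classifier choice are assumptions.

```python
# Illustrative single-task baseline only; the paper's actual
# "traditional machine learning" setup is not specified in the abstract.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def train_single_task_baseline(train_texts, train_labels):
    """Fit an independent TF-IDF + logistic-regression model for one task."""
    clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2), min_df=2),
                        LogisticRegression(max_iter=1000))
    clf.fit(train_texts, train_labels)
    return clf

# One such model would be trained per label (T, N, M, uncertainty),
# in contrast to the single shared encoder of the multi-task PLM.
```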

RESULTS

We find that a multi-task approach with GatorTron as the PLM achieves the best performance, with an overall accuracy (all four tasks correct) of 84% and a Hamming loss of 0.05 on the internal test dataset, and 79% and 0.07 on the external test dataset. Performance on the individual T, N, and M tasks approached expert performance, with macro average F1 scores of 0.91, 0.95, and 0.90 respectively on external data. For the uncertainty task, an F1 of 0.77 was achieved.
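
For concreteness, the reported metrics can be computed as in the sketch below, assuming the four task labels for each report are collected into an (n_reports × 4) binary matrix ordered T, N, M, u; this is not the authors' evaluation code.

```python
# Minimal metric sketch: y_true and y_pred are (n_reports, 4) binary
# numpy arrays ordered T, N, M, u. Assumed layout, not the authors' code.
import numpy as np
from sklearn.metrics import f1_score, hamming_loss

def evaluate(y_true, y_pred, tasks=("T", "N", "M", "u")):
    # "Overall accuracy": fraction of reports with all four labels correct.
    exact_match = np.mean(np.all(y_true == y_pred, axis=1))
    # Hamming loss: fraction of individual labels that are wrong.
    h_loss = hamming_loss(y_true, y_pred)
    # Per-task macro-averaged F1 (mean of the F1 for each of the two classes).
    per_task_f1 = {t: f1_score(y_true[:, i], y_pred[:, i], average="macro")
                   for i, t in enumerate(tasks)}
    return exact_match, h_loss, per_task_f1
```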

CONCLUSIONS

Our 'TNMu' classifier successfully extracts TNM staging information from internal and external PET-CT reports. We conclude that multi-task approaches give the best performance and better computational efficiency than single-task PLM approaches. We believe these models can improve PET-CT services by assisting in auditing, creating research cohorts, and developing decision support systems. Our approach to handling uncertainty represents a novel first step but has room for further refinement.
