大型语言模型在分类分支导管内乳头状黏液性肿瘤中的效能。

Efficacy of a large language model in classifying branch-duct intraductal papillary mucinous neoplasms.

作者信息

Sato Mai, Yasaka Koichiro, Abe Shimon, Kurashima Joji, Asari Yusuke, Kiryu Shigeru, Abe Osamu

机构信息

The University of Tokyo, Tokyo, Japan.

International University of Health and Welfare, Ōtawara, Japan.

出版信息

Abdom Radiol (NY). 2025 Jun 11. doi: 10.1007/s00261-025-05062-z.

DOI:10.1007/s00261-025-05062-z

PMID:40498341

Abstract

OBJECTIVES

Appropriate categorization based on magnetic resonance imaging (MRI) findings is important for managing intraductal papillary mucinous neoplasms (IPMNs). In this study, a large language model (LLM) that classifies IPMNs based on MRI findings was developed, and its performance was compared with that of less experienced human readers.

METHODS

The medical image management and processing systems of our hospital were searched to identify MRI reports of branch-duct IPMNs (BD-IPMNs). They were assigned to the training, validation, and testing datasets in chronological order. The model was trained on the training dataset, and the best-performing model on the validation dataset was evaluated on the test dataset. Furthermore, two radiology residents (Readers 1 and 2) and an intern (Reader 3) manually sorted the reports in the test dataset. The accuracy, sensitivity, and time required for categorizing were compared between the model and readers.

RESULTS

The accuracy of the fine-tuned LLM for the test dataset was 0.966, which was comparable to that of Readers 1 and 2 (0.931-0.972) and significantly better than that of Reader 3 (0.907). The fine-tuned LLM had an area under the receiver operating characteristic curve of 0.982 for the classification of cyst diameter ≥ 10 mm, which was significantly superior to that of Reader 3 (0.944). Furthermore, the fine-tuned LLM (25 s) completed the test dataset faster than the readers (1,887-2,646 s).

CONCLUSION

The fine-tuned LLM classified BD-IPMNs based on MRI findings with comparable performance to that of radiology residents and significantly reduced the time required.

摘要

目的

基于磁共振成像（MRI）结果进行恰当分类对于导管内乳头状黏液性肿瘤（IPMN）的管理很重要。在本研究中，开发了一种基于MRI结果对IPMN进行分类的大语言模型（LLM），并将其性能与经验较少的人类读者的性能进行比较。

方法

检索我院的医学图像管理和处理系统，以识别分支导管IPMN（BD-IPMN）的MRI报告。它们按时间顺序被分配到训练、验证和测试数据集。该模型在训练数据集上进行训练，并在测试数据集上评估验证数据集中表现最佳的模型。此外，两名放射科住院医师（读者1和读者2）和一名实习生（读者3）对测试数据集中的报告进行人工分类。比较了模型和读者在分类准确性、敏感性和所需时间方面的差异。

结果

针对测试数据集，微调后的LLM的准确率为0.966，与读者1和读者2的准确率（0.931 - 0.972）相当，且显著优于读者3的准确率（0.907）。对于囊肿直径≥10 mm的分类，微调后的LLM的受试者操作特征曲线下面积为0.982，显著优于读者3的（0.944）。此外，微调后的LLM（25秒）完成测试数据集的速度比读者（1887 - 2646秒）快。

结论

微调后的LLM基于MRI结果对BD-IPMN进行分类，其性能与放射科住院医师相当，并显著减少了所需时间。

相似文献

Efficacy of a large language model in classifying branch-duct intraductal papillary mucinous neoplasms.

Abdom Radiol (NY). 2025 Jun 11. doi: 10.1007/s00261-025-05062-z.

Automated classification of brain MRI reports using fine-tuned large language models.

Neuroradiology. 2024 Dec;66(12):2177-2183. doi: 10.1007/s00234-024-03427-7. Epub 2024 Jul 12.

Classification of Interventional Radiology Reports into Technique Categories with a Fine-Tuned Large Language Model.

J Imaging Inform Med. 2024 Dec 13. doi: 10.1007/s10278-024-01370-w.

Fine-Tuned Large Language Model for Extracting Patients on Pretreatment for Lung Cancer from a Picture Archiving and Communication System Based on Radiological Reports.

J Imaging Inform Med. 2025 Feb;38(1):327-334. doi: 10.1007/s10278-024-01186-8. Epub 2024 Jul 2.

Radiomic nomogram based on MRI to predict grade of branching type intraductal papillary mucinous neoplasms of the pancreas: a multicenter study.

Cancer Imaging. 2021 Mar 9;21(1):26. doi: 10.1186/s40644-021-00395-6.

The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports.

J Imaging Inform Med. 2025 Apr;38(2):865-872. doi: 10.1007/s10278-024-01242-3. Epub 2024 Aug 26.

Development and Validation of a Multi-institutional Preoperative Nomogram for Predicting Grade of Dysplasia in Intraductal Papillary Mucinous Neoplasms (IPMNs) of the Pancreas: A Report from The Pancreatic Surgery Consortium.

Ann Surg. 2018 Jan;267(1):157-163. doi: 10.1097/SLA.0000000000002015.

Determining Malignant Potential of Intraductal Papillary Mucinous Neoplasm of the Pancreas: CT versus MRI by Using Revised 2017 International Consensus Guidelines.

Radiology. 2019 Oct;293(1):134-143. doi: 10.1148/radiol.2019190144. Epub 2019 Sep 3.

An imaging-based model to predict the malignant potential of intraductal papillary mucinous neoplasm of the pancreas.

Eur Radiol. 2025 Feb;35(2):700-711. doi: 10.1007/s00330-024-11003-z. Epub 2024 Aug 7.

Proper management and follow-up strategy of branch duct intraductal papillary mucinous neoplasms of the pancreas.

Dig Liver Dis. 2012 Mar;44(3):257-60. doi: 10.1016/j.dld.2011.09.010. Epub 2011 Oct 24.

本文引用的文献

Open-Source Large Language Models in Radiology: A Review and Tutorial for Practical Research and Clinical Deployment.

Radiology. 2025 Jan;314(1):e241073. doi: 10.1148/radiol.241073.

Data set terminology of deep learning in medicine: a historical review and recommendation.

Jpn J Radiol. 2024 Oct;42(10):1100-1109. doi: 10.1007/s11604-024-01608-1. Epub 2024 Jun 10.

Computed tomography-based radiomics diagnostic approach for differential diagnosis between early- and late-stage pancreatic ductal adenocarcinoma.

World J Gastrointest Oncol. 2024 Apr 15;16(4):1256-1267. doi: 10.4251/wjgo.v16.i4.1256.

Clinical Impact of Deep Learning Reconstruction in MRI.

Radiographics. 2023 Jun;43(6):e220133. doi: 10.1148/rg.220133.

Improving detection performance of hepatocellular carcinoma and interobserver agreement for liver imaging reporting and data system on CT using deep learning reconstruction.

Abdom Radiol (NY). 2023 Apr;48(4):1280-1289. doi: 10.1007/s00261-023-03834-z. Epub 2023 Feb 9.

Number of Worrisome Features and Risk of Malignancy in Intraductal Papillary Mucinous Neoplasm.

J Am Coll Surg. 2022 Jun 1;234(6):1021-1030. doi: 10.1097/XCS.0000000000000176. Epub 2022 Mar 22.

Application of Unenhanced Computed Tomography Texture Analysis to Differentiate Pancreatic Adenosquamous Carcinoma from Pancreatic Ductal Adenocarcinoma.

Curr Med Sci. 2022 Feb;42(1):217-225. doi: 10.1007/s11596-022-2535-2. Epub 2022 Jan 28.

Pancreatic intraductal papillary mucinous neoplasms: Current diagnosis and management.

World J Gastrointest Oncol. 2021 Dec 15;13(12):1880-1895. doi: 10.4251/wjgo.v13.i12.1880.

Impact of deep learning reconstruction on intracranial 1.5 T magnetic resonance angiography.

Jpn J Radiol. 2022 May;40(5):476-483. doi: 10.1007/s11604-021-01225-2. Epub 2021 Dec 1.

Deep Learning for MR Angiography: Automated Detection of Cerebral Aneurysms.

Radiology. 2019 Jan;290(1):187-194. doi: 10.1148/radiol.2018180901. Epub 2018 Oct 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

大型语言模型在分类分支导管内乳头状黏液性肿瘤中的效能。

Efficacy of a large language model in classifying branch-duct intraductal papillary mucinous neoplasms.

作者信息

机构信息

出版信息

OBJECTIVES

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献