Warner Joshua D, Hartman Robert P, Blezek Daniel J, Thomas John V
Department of Radiology, University of Wisconsin-Madison School of Medicine & Public Health, 600 Highland Ave, Madison, WI, 53792-3252, USA.
Department of Radiology, Mayo Clinic, Rochester, MN, USA.
J Imaging Inform Med. 2025 Jan 30. doi: 10.1007/s10278-025-01395-9.
Exam protocoling is a significant non-interpretive task burden for radiologists. The purpose of this work was to develop a natural language processing (NLP) artificial intelligence (AI) solution for automated protocoling of standard abdomen and pelvic magnetic resonance imaging (MRI) exams from basic associated order information and patient metadata. This Institutional Review Board exempt retrospective study used de-identified metadata from consecutive adult abdominal and pelvic MRI scans performed at our institution spanning 2.5 years from 2019 to 2021 to fine-tune an AI model to predict the exam protocol. The NLP algorithm Bidirectional Encoder Representations from Transformers (BERT) was employed in sequence classification mode. Twelve months of data from the COVID pandemic were excluded to avoid bias from known practice and referral pattern disruptions, with approximately 46,000 MRI exams in the resulting cohort. The final trained model had an accuracy of 88.5% with a Matthews correlation coefficient of 0.874, a true positive rate of 0.872, and a true negative rate of 0.995. Subsequent expert review of the errors performed to satisfy departmental leadership showed 81.9% were in fact correct or reasonable alternative protocols, yielding real-world performance accuracy of 97.9%. We conclude that NLP algorithms, including "smaller" large language models like the BERT family often overlooked today, can predict MRI imaging protocols for the abdomen and pelvis with high real-world performance, offering to decrease radiologists' non-interpretive task load and increasing departmental efficiency.
检查方案制定对于放射科医生来说是一项重大的非解释性任务负担。这项工作的目的是开发一种自然语言处理(NLP)人工智能(AI)解决方案,用于根据基本相关医嘱信息和患者元数据自动制定标准腹部和盆腔磁共振成像(MRI)检查方案。这项机构审查委员会豁免的回顾性研究使用了2019年至2021年在我们机构进行的连续成人腹部和盆腔MRI扫描的去识别元数据,以微调一个AI模型来预测检查方案。NLP算法双向编码器表征变换器(BERT)被用于序列分类模式。为避免已知实践和转诊模式中断造成的偏差,排除了新冠疫情期间12个月的数据,最终队列中有大约46000例MRI检查。最终训练的模型准确率为88.5%,马修斯相关系数为0.874,真阳性率为0.872,真阴性率为0.995。随后为满足部门领导要求而对错误进行的专家审查显示,81.9%实际上是正确的或合理的替代方案,实际性能准确率为97.9%。我们得出结论,NLP算法,包括如今经常被忽视的像BERT家族这样的“较小”大语言模型,可以以较高的实际性能预测腹部和盆腔的MRI成像方案,有望减轻放射科医生的非解释性任务负担并提高部门效率。