Suppr超能文献

标准与人工智能辅助PET/CT阅片流程在淋巴瘤治疗前分期中的比较有效性:一项多机构阅片者研究评估

Comparative effectiveness of standard vs. AI-assisted PET/CT reading workflow for pre-treatment lymphoma staging: a multi-institutional reader study evaluation.

作者信息

Frood Russell, Willaime Julien M Y, Miles Brad, Chambers Greg, Al-Chalabi H'ssein, Ali Tamir, Hougham Natasha, Brooks Naomi, Petrides George, Naylor Matthew, Ward Daniel, Sulkin Tom, Chaytor Richard, Strouhal Peter, Patel Chirag, Scarsbrook Andrew F

机构信息

Department of Radiology, Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom.

Leeds Institute of Health Research, University of Leeds, Leeds, United Kingdom.

出版信息

Front Nucl Med. 2024 Jan 11;3:1327186. doi: 10.3389/fnume.2023.1327186. eCollection 2023.

Abstract

BACKGROUND

Fluorine-18 fluorodeoxyglucose (FDG)-positron emission tomography/computed tomography (PET/CT) is widely used for staging high-grade lymphoma, with the time to evaluate such studies varying depending on the complexity of the case. Integrating artificial intelligence (AI) within the reporting workflow has the potential to improve quality and efficiency. The aims of the present study were to evaluate the influence of an integrated research prototype segmentation tool implemented within diagnostic PET/CT reading software on the speed and quality of reporting with variable levels of experience, and to assess the effect of the AI-assisted workflow on reader confidence and whether this tool influenced reporting behaviour.

METHODS

Nine blinded reporters (three trainees, three junior consultants and three senior consultants) from three UK centres participated in a two-part reader study. A total of 15 lymphoma staging PET/CT scans were evaluated twice: first, using a standard PET/CT reporting workflow; then, after a 6-week gap, with AI assistance incorporating pre-segmentation of disease sites within the reading software. An even split of PET/CT segmentations with gold standard (GS), false-positive (FP) over-contour or false-negative (FN) under-contour were provided. The read duration was calculated using file logs, while the report quality was independently assessed by two radiologists with >15 years of experience. Confidence in AI assistance and identification of disease was assessed via online questionnaires for each case.

RESULTS

There was a significant decrease in time between non-AI and AI-assisted reads (median 15.0 vs. 13.3 min,  < 0.001). Sub-analysis confirmed this was true for both junior (14.5 vs. 12.7 min,  = 0.03) and senior consultants (15.1 vs. 12.2 min,  = 0.03) but not for trainees (18.1 vs. 18.0 min,  = 0.2). There was no significant difference between report quality between reads. AI assistance provided a significant increase in confidence of disease identification ( < 0.001). This held true when splitting the data into FN, GS and FP. In 19/88 cases, participants did not identify either FP (31.8%) or FN (11.4%) segmentations. This was significantly greater for trainees (13/30, 43.3%) than for junior (3/28, 10.7%,  = 0.05) and senior consultants (3/30, 10.0%,  = 0.05).

CONCLUSIONS

The study findings indicate that an AI-assisted workflow achieves comparable performance to humans, demonstrating a marginal enhancement in reporting speed. Less experienced readers were more influenced by segmentation errors. An AI-assisted PET/CT reading workflow has the potential to increase reporting efficiency without adversely affecting quality, which could reduce costs and report turnaround times. These preliminary findings need to be confirmed in larger studies.

摘要

背景

氟-18氟脱氧葡萄糖(FDG)-正电子发射断层扫描/计算机断层扫描(PET/CT)广泛用于高级别淋巴瘤的分期,评估此类检查的时间因病例复杂性而异。将人工智能(AI)整合到报告工作流程中有可能提高质量和效率。本研究的目的是评估诊断性PET/CT阅读软件中实施的综合研究原型分割工具对不同经验水平报告速度和质量的影响,并评估人工智能辅助工作流程对读者信心的影响以及该工具是否影响报告行为。

方法

来自英国三个中心的九名盲法报告者(三名实习生、三名初级顾问和三名高级顾问)参与了一项分为两部分的读者研究。共对15例淋巴瘤分期PET/CT扫描进行了两次评估:首先,使用标准PET/CT报告工作流程;然后,在间隔6周后,在人工智能辅助下,阅读软件内对病变部位进行预分割。提供了与金标准(GS)、假阳性(FP)轮廓过度或假阴性(FN)轮廓不足的PET/CT分割的均匀分布。使用文件日志计算阅读时间,而报告质量由两名经验超过15年的放射科医生独立评估。通过针对每个病例的在线问卷评估对人工智能辅助和疾病识别的信心。

结果

非人工智能和人工智能辅助阅读之间的时间显著减少(中位数15.0分钟对13.3分钟,<0.001)。亚分析证实,初级顾问(14.5分钟对12.7分钟,=0.03)和高级顾问(15.1分钟对12.2分钟,=0.03)均如此,但实习生(18.1分钟对18.0分钟,=0.2)并非如此。两次阅读之间的报告质量没有显著差异。人工智能辅助显著提高了疾病识别的信心(<0.001)。将数据分为FN、GS和FP时也是如此。在19/88例病例中,参与者未识别出FP(31.8%)或FN(11.4%)分割。实习生(13/30,43.3%)的这一比例显著高于初级顾问(3/28,10.7%,=0.05)和高级顾问(3/30,10.0%,=0.05)。

结论

研究结果表明,人工智能辅助工作流程的表现与人类相当,报告速度略有提高。经验较少的读者受分割错误的影响更大。人工智能辅助的PET/CT阅读工作流程有可能提高报告效率而不会对质量产生不利影响,这可以降低成本和缩短报告周转时间。这些初步发现需要在更大规模的研究中得到证实。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60e9/11440880/0ebbba28c930/fnume-03-1327186-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验