Thompson Yee Lam Elim, Levine Gary M, Chen Weijie, Sahiner Berkman, Li Qin, Petrick Nicholas, Delfino Jana G, Lago Miguel A, Cao Qian, Samuelson Frank W
The U.S. Food and Drug Administration, White Oak, MD USA.
Queueing Syst. 2024;108(3-4):579-610. doi: 10.1007/s11134-024-09927-w. Epub 2024 Sep 21.
In the past decade, artificial intelligence (AI) algorithms have made promising impacts in many areas of healthcare. One application is AI-enabled prioritization software known as computer-aided triage and notification (CADt). This type of software as a medical device is intended to prioritize reviews of radiological images with time-sensitive findings, thus shortening the waiting time for patients with these findings. While many CADt devices have been deployed into clinical workflows and have been shown to improve patient treatment and clinical outcomes, quantitative methods to evaluate the wait-time-savings from their deployment are not yet available. In this paper, we apply queueing theory methods to evaluate the wait-time-savings of a CADt by calculating the average waiting time per patient image without and with a CADt device being deployed. We study two workflow models with one or multiple radiologists (servers) for a range of AI diagnostic performances, radiologist's reading rates, and patient image (customer) arrival rates. To evaluate the time-saving performance of a CADt, we use the difference in the mean waiting time between the diseased patient images in the with-CADt scenario and that in the without-CADt scenario as our performance metric. As part of this effort, we have developed and also share a software tool to simulate the radiology workflow around medical image interpretation, to verify theoretical results, and to provide confidence intervals for the performance metric we defined. We show quantitatively that a CADt triage device is more effective in a busy, short-staffed reading setting, which is consistent with our clinical intuition and simulation results. Although this work is motivated by the need for evaluating CADt devices, the evaluation methodology presented in this paper can be applied to assess the time-saving performance of other types of algorithms that prioritize a subset of customers based on binary outputs.
在过去十年中,人工智能(AI)算法在医疗保健的许多领域都产生了令人瞩目的影响。其中一个应用是名为计算机辅助分诊与通知(CADt)的人工智能驱动的优先级排序软件。这种作为医疗设备的软件旨在对具有时间敏感性检查结果的放射图像审查进行优先级排序,从而缩短有这些检查结果的患者的等待时间。虽然许多CADt设备已被部署到临床工作流程中,并已证明可改善患者治疗和临床结果,但尚未有定量方法来评估其部署所节省的等待时间。在本文中,我们应用排队论方法,通过计算在未部署和已部署CADt设备的情况下每张患者图像的平均等待时间,来评估CADt节省的等待时间。我们研究了两种工作流程模型,分别有一名或多名放射科医生(服务器),涵盖了一系列人工智能诊断性能、放射科医生的阅读速度和患者图像(客户)到达率。为了评估CADt的节省时间性能,我们将有CADt场景下患病患者图像的平均等待时间与无CADt场景下的平均等待时间之差作为我们的性能指标。作为这项工作的一部分,我们开发并分享了一个软件工具,用于模拟围绕医学图像解读的放射科工作流程,以验证理论结果,并为我们定义的性能指标提供置信区间。我们定量地表明,CADt分诊设备在繁忙、人员短缺的阅读环境中更有效,这与我们的临床直觉和模拟结果一致。尽管这项工作的动机是评估CADt设备,但本文提出的评估方法可应用于评估其他基于二元输出对一部分客户进行优先级排序的算法的节省时间性能。