Martinez Stanford, Ramirez-Tamayo Carolina, Akhter Faruqui Syed Hasib, Clark Kal, Alaeddini Adel, Czarnek Nicholas, Aggarwal Aarushi, Emamzadeh Sahra, Mock Jeffrey R, Golob Edward J
Department of Mechanical Engineering, The University of Texas at San Antonio, San Antonio, TX, United States.
Department of Engineering Technology, Sam Houston State University, Huntsville, TX, United States.
JMIR Form Res. 2025 Jan 22;9:e53928. doi: 10.2196/53928.
Perception-related errors comprise most diagnostic mistakes in radiology. To mitigate this problem, radiologists use personalized and high-dimensional visual search strategies, otherwise known as search patterns. Qualitative descriptions of these search patterns, which involve the physician verbalizing or annotating the order he or she analyzes the image, can be unreliable due to discrepancies in what is reported versus the actual visual patterns. This discrepancy can interfere with quality improvement interventions and negatively impact patient care.
The objective of this study is to provide an alternative method for distinguishing between radiologists by means of captured eye-tracking data such that the raw gaze (or processed fixation data) can be used to discriminate users based on subconscious behavior in visual inspection.
We present a novel discretized feature encoding based on spatiotemporal binning of fixation data for efficient geometric alignment and temporal ordering of eye movement when reading chest x-rays. The encoded features of the eye-fixation data are used by machine learning classifiers to discriminate between faculty and trainee radiologists. A clinical trial case study was conducted using metrics such as the area under the curve, accuracy, F-score, sensitivity, and specificity to evaluate the discriminability between the 2 groups regarding their level of experience. The classification performance was then compared with state-of-the-art methodologies. In addition, a repeatability experiment using a separate dataset, experimental protocol, and eye tracker was performed with 8 participants to evaluate the robustness of the proposed approach.
The numerical results from both experiments demonstrate that classifiers using the proposed feature encoding methods outperform the current state-of-the-art in differentiating between radiologists in terms of experience level. An average performance gain of 6.9% is observed compared with traditional features while classifying experience levels of radiologists. This gain in accuracy is also substantial across different eye tracker-collected datasets, with improvements of 6.41% using the Tobii eye tracker and 7.29% using the EyeLink eye tracker. These results signify the potential impact of the proposed method for identifying radiologists' level of expertise and those who would benefit from additional training.
The effectiveness of the proposed spatiotemporal discretization approach, validated across diverse datasets and various classification metrics, underscores its potential for objective evaluation, informing targeted interventions and training strategies in radiology. This research advances reliable assessment tools, addressing challenges in perception-related errors to enhance patient care outcomes.
在放射学中,与感知相关的错误构成了大多数诊断失误。为了缓解这一问题,放射科医生会使用个性化的高维视觉搜索策略,也就是所谓的搜索模式。这些搜索模式的定性描述,即医生口头说出或标注其分析图像的顺序,可能由于报告内容与实际视觉模式之间的差异而不可靠。这种差异会干扰质量改进措施,并对患者护理产生负面影响。
本研究的目的是提供一种通过捕获的眼动追踪数据来区分放射科医生的替代方法,以便原始注视(或处理后的注视数据)可用于根据视觉检查中的潜意识行为来区分用户。
我们提出了一种基于注视数据时空分箱的新型离散特征编码方法,用于在读取胸部X光片时对眼动进行高效的几何对齐和时间排序。机器学习分类器使用眼注视数据的编码特征来区分放射科教员和实习医生。使用曲线下面积、准确率、F分数、灵敏度和特异性等指标进行了一项临床试验案例研究,以评估两组在经验水平方面的可区分性。然后将分类性能与最先进的方法进行比较。此外,还使用单独的数据集、实验方案和眼动追踪器对8名参与者进行了重复性实验,以评估所提出方法的稳健性。
两个实验的数值结果均表明,使用所提出的特征编码方法的分类器在区分放射科医生的经验水平方面优于当前的最先进方法。在对放射科医生的经验水平进行分类时,与传统特征相比,平均性能提升了6.9%。在不同的眼动追踪器收集的数据集中,这种准确率的提升也很显著,使用Tobii眼动追踪器时提高了6.41%,使用EyeLink眼动追踪器时提高了7.29%。这些结果表明所提出的方法在识别放射科医生的专业水平以及确定哪些人将从额外培训中受益方面具有潜在影响。
所提出的时空离散化方法的有效性在不同数据集和各种分类指标上得到了验证,突出了其在客观评估方面的潜力,为放射学中的针对性干预和培训策略提供了依据。这项研究推进了可靠的评估工具,应对了与感知相关错误方面的挑战,以提高患者护理结果。