一种用于眼动追踪校准的新型稳健多变量模式估计器。

A new robust multivariate mode estimator for eye-tracking calibration.

作者信息

Brilhault Adrien, Neuenschwander Sergio, Rios Ricardo Araujo

机构信息

Department of Computer Science, Federal University of Bahia, Salvador, Brazil.

Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil.

出版信息

Behav Res Methods. 2023 Feb;55(2):516-553. doi: 10.3758/s13428-022-01809-4. Epub 2022 Mar 16.

DOI:10.3758/s13428-022-01809-4

Abstract

We propose in this work a new method for estimating the main mode of multivariate distributions, with application to eye-tracking calibration. When performing eye-tracking experiments with poorly cooperative subjects, such as infants or monkeys, the calibration data generally suffer from high contamination. Outliers are typically organized in clusters, corresponding to fixations in the time intervals when subjects were not looking at the calibration points. In this type of multimodal distributions, most central tendency measures fail at estimating the principal fixation coordinates (the first mode), resulting in errors and inaccuracies when mapping the gaze to the screen coordinates. Here, we developed a new algorithm to identify the first mode of multivariate distributions, named BRIL, which relies on recursive depth-based filtering. This novel approach was tested on artificial mixtures of Gaussian and Uniform distributions, and compared to existing methods (conventional depth medians, robust estimators of location and scatter, and clustering-based approaches). We obtained outstanding performances, even for distributions containing very high proportions of outliers, both grouped in clusters and randomly distributed. Finally, we demonstrate the strength of our method in a real-world scenario using experimental data from eye-tracking calibrations with Capuchin monkeys, especially for highly contaminated distributions where other algorithms typically lack accuracy.

摘要

在这项工作中，我们提出了一种估计多元分布主模态的新方法，并将其应用于眼动追踪校准。在用合作性较差的受试者（如婴儿或猴子）进行眼动追踪实验时，校准数据通常受到高度污染。异常值通常聚集成簇，对应于受试者未注视校准点的时间间隔内的注视。在这种多峰分布类型中，大多数集中趋势度量在估计主要注视坐标（第一模态）时都会失败，从而在将注视映射到屏幕坐标时导致误差和不准确。在此，我们开发了一种新算法来识别多元分布的第一模态，名为BRIL，它依赖于基于递归深度的滤波。这种新方法在高斯分布和均匀分布的人工混合数据上进行了测试，并与现有方法（传统深度中位数、位置和离散度的稳健估计器以及基于聚类的方法）进行了比较。即使对于包含非常高比例异常值（无论是成簇分组还是随机分布）的分布，我们也取得了出色的性能。最后，我们使用卷尾猴眼动追踪校准的实验数据，在实际场景中展示了我们方法的优势，特别是对于其他算法通常缺乏准确性的高度污染分布。