Tsotsos John K
Department of Electrical Engineering and Computer Science, York University, Toronto, ON M3J 1P3, Canada.
J Imaging. 2022 Jul 31;8(8):212. doi: 10.3390/jimaging8080212.
When we study the human ability to attend, what exactly do we seek to understand? It is not clear what the answer might be to this question. There is still so much to know, while acknowledging the tremendous progress of past decades of research. It is as if each new study adds a tile to the mosaic that, when viewed from a distance, we hope will reveal the big picture of attention. However, there is no map as to how each tile might be placed nor any guide as to what the overall picture might be. It is like digging up bits of mosaic tile at an ancient archeological site with no key as to where to look and then not only having to decide which picture it belongs to but also where exactly in that puzzle it should be placed. I argue that, although the unearthing of puzzle pieces is very important, so is their placement, but this seems much less emphasized. We have mostly unearthed a treasure trove of puzzle pieces but they are all waiting for cleaning and reassembly. It is an activity that is scientifically far riskier, but with great risk comes a greater reward. Here, I will look into two areas of broad agreement, specifically regarding visual attention, and dig deeper into their more nuanced meanings, in the hope of sketching a starting point for the guide to the attention mosaic. The goal is to situate visual attention as a purely computational problem and not as a data explanation task; it may become easier to place the puzzle pieces once you understand why they exist in the first place.
当我们研究人类的注意力时,我们究竟试图理解什么?这个问题的答案尚不清楚。尽管过去几十年的研究取得了巨大进展,但仍有许多需要了解的地方。就好像每一项新研究都为一幅镶嵌画增添了一块瓷砖,从远处看,我们希望这幅镶嵌画能揭示出注意力的全貌。然而,对于每一块瓷砖该如何摆放,或者整体画面会是什么样,并没有地图或指引。这就好比在一个古老的考古遗址挖掘镶嵌画瓷砖碎片,却没有关于从何处寻找的线索,然后不仅要确定它属于哪幅画,还要确定它在这幅拼图中的确切位置。我认为,虽然挖掘拼图碎片非常重要,但它们的摆放同样重要,然而这一点似乎很少被强调。我们大多挖掘出了大量的拼图碎片,但它们都有待清理和重新组装。这是一项在科学上风险大得多的活动,但风险越大,回报也越大。在这里,我将探讨两个广泛达成共识的领域,特别是关于视觉注意力的领域,并深入挖掘它们更细微的含义,希望能勾勒出注意力镶嵌画指南的起点。目标是将视觉注意力定位为一个纯粹的计算问题,而不是一个数据解释任务;一旦你理解了这些拼图碎片为何首先存在,或许就更容易摆放它们了。