Sun Jennifer J, Kennedy Ann, Zhan Eric, Anderson David J, Yue Yisong, Perona Pietro
Caltech.
Northwestern University.
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2021 Jun;2021:2875-2884. doi: 10.1109/cvpr46437.2021.00290. Epub 2021 Nov 2.
Specialized domain knowledge is often necessary to accurately annotate training sets for in-depth analysis, but can be burdensome and time-consuming to acquire from domain experts. This issue arises prominently in automated behavior analysis, in which agent movements or actions of interest are detected from video tracking data. To reduce annotation effort, we present TREBA: a method to learn annotation-sample efficient trajectory embedding for behavior analysis, based on multi-task self-supervised learning. The tasks in our method can be efficiently engineered by domain experts through a process we call "task programming", which uses programs to explicitly encode structured knowledge from domain experts. Total domain expert effort can be reduced by exchanging data annotation time for the construction of a small number of programmed tasks. We evaluate this trade-off using data from behavioral neuroscience, in which specialized domain knowledge is used to identify behaviors. We present experimental results in three datasets across two domains: mice and fruit flies. Using embeddings from TREBA, we reduce annotation burden by up to a factor of 10 without compromising accuracy compared to state-of-the-art features. Our results thus suggest that task programming and self-supervision can be an effective way to reduce annotation effort for domain experts.
对于深入分析而言,准确标注训练集往往需要专业领域知识,但从领域专家那里获取这些知识既繁琐又耗时。这个问题在自动行为分析中尤为突出,在自动行为分析中,要从视频跟踪数据中检测感兴趣的智能体运动或动作。为了减少标注工作量,我们提出了TREBA:一种基于多任务自监督学习来学习用于行为分析的标注样本高效轨迹嵌入的方法。我们方法中的任务可以由领域专家通过一个我们称为“任务编程”的过程来有效设计,该过程使用程序来明确编码领域专家的结构化知识。通过用构建少量编程任务的时间来换取数据标注时间,可以减少领域专家的总体工作量。我们使用行为神经科学的数据来评估这种权衡,在行为神经科学中,专业领域知识用于识别行为。我们在两个领域的三个数据集中展示了实验结果:小鼠和果蝇。与最先进的特征相比,使用TREBA的嵌入,我们在不影响准确性的情况下将标注负担降低了多达10倍。因此,我们的结果表明,任务编程和自监督可以成为减少领域专家标注工作量的有效方法。