Sullivan Jessica, Mei Michelle, Perfors Andrew, Wojcik Erica, Frank Michael C
Skidmore College.
University of Melbourne.
Open Mind (Camb). 2021 May 26;5:20-29. doi: 10.1162/opmi_a_00039. eCollection 2021.
We introduce a new resource: the SAYCam corpus. Infants aged 6-32 months wore a head-mounted camera for approximately 2 hr per week, over the course of approximately two-and-a-half years. The result is a large, naturalistic, longitudinal dataset of infant- and child-perspective videos. Over 200,000 words of naturalistic speech have already been transcribed. In addition, the dataset is searchable using a number of criteria (e.g., age of participant, location, setting, objects present). The resulting dataset will be of broad use to psychologists, linguists, and computer scientists.