School of Computer and Communication Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.
Dev Sci. 2023 Jan;26(1):e13278. doi: 10.1111/desc.13278. Epub 2022 Jun 7.
Towards the end of the second trimester of gestation, a human fetus is able to register environmental sounds. This in utero auditory experience is characterized by comprising strongly low-pass-filtered versions of sounds from the external world. Here, we present computational tests of the hypothesis that this early exposure to severely degraded auditory inputs serves an adaptive purpose-it may induce the neural development of extended temporal integration. Such integration can facilitate the detection of information carried by low-frequency variations in the auditory signal, including emotional or other prosodic content. To test this prediction, we characterized the impact of several training regimens, biomimetic and otherwise, on a computational model system trained and tested on the task of emotion recognition. We find that training with an auditory trajectory recapitulating that of a neurotypical infant in the pre-to-postnatal period results in temporally extended receptive field structures and yields the best subsequent accuracy and generalization performance on the task of emotion recognition. This strongly suggests that the progression from low-pass-filtered to full-frequency inputs is likely to be an adaptive feature of our development, conferring significant benefits to later auditory processing abilities relying on temporally extended analyses. Additionally, this finding can help explain some of the auditory impairments associated with preterm births, suggests guidelines for the design of auditory environments in neonatal care units, and points to enhanced training procedures for computational models.
在妊娠中期结束时,人类胎儿能够记录环境声音。这种宫内听觉体验的特点是包含来自外部世界的强烈低通滤波版本的声音。在这里,我们提出了一个计算测试假设,即这种早期接触严重退化听觉输入的情况是适应性的——它可能诱导神经发育扩展的时间整合。这种整合可以促进对听觉信号低频变化所携带信息的检测,包括情感或其他韵律内容。为了验证这一预测,我们对一个计算模型系统进行了特征描述,该系统在情感识别任务上进行了训练和测试,包括生物模拟和其他训练方案。我们发现,用类似于神经典型婴儿在产前到产后期间的听觉轨迹进行训练,会导致时间上扩展的感受野结构,并在情感识别任务上获得最佳的后续准确性和泛化性能。这强烈表明,从低通滤波到全频输入的转变很可能是我们发育的一个适应性特征,为依赖时间扩展分析的后期听觉处理能力提供了显著的益处。此外,这一发现可以帮助解释与早产儿相关的一些听觉障碍,为新生儿护理单元的听觉环境设计提供了指导,并为计算模型的增强训练程序指明了方向。