• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

双向生成对抗性表示学习用于自然刺激合成。

Bidirectional generative adversarial representation learning for natural stimulus synthesis.

机构信息

Department of Bioengineering, Imperial College London, London, United Kingdom.

出版信息

J Neurophysiol. 2024 Oct 1;132(4):1156-1169. doi: 10.1152/jn.00421.2023. Epub 2024 Aug 28.

DOI:10.1152/jn.00421.2023
PMID:39196986
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11495180/
Abstract

Thousands of species use vocal signals to communicate with one another. Vocalizations carry rich information, yet characterizing and analyzing these complex, high-dimensional signals is difficult and prone to human bias. Moreover, animal vocalizations are ethologically relevant stimuli whose representation by auditory neurons is an important subject of research in sensory neuroscience. A method that can efficiently generate naturalistic vocalization waveforms would offer an unlimited supply of stimuli with which to probe neuronal computations. Although unsupervised learning methods allow for the projection of vocalizations into low-dimensional latent spaces learned from the waveforms themselves, and generative modeling allows for the synthesis of novel vocalizations for use in downstream tasks, we are not aware of any model that combines these tasks to synthesize naturalistic vocalizations in the waveform domain for stimulus playback. In this paper, we demonstrate BiWaveGAN: a bidirectional generative adversarial network (GAN) capable of learning a latent representation of ultrasonic vocalizations (USVs) from mice. We show that BiWaveGAN can be used to generate, and interpolate between, realistic vocalization waveforms. We then use these synthesized stimuli along with natural USVs to probe the sensory input space of mouse auditory cortical neurons. We show that stimuli generated from our method evoke neuronal responses as effectively as real vocalizations, and produce receptive fields with the same predictive power. BiWaveGAN is not restricted to mouse USVs but can be used to synthesize naturalistic vocalizations of any animal species and interpolate between vocalizations of the same or different species, which could be useful for probing categorical boundaries in representations of ethologically relevant auditory signals. A new type of artificial neural network is presented that can be used to generate animal vocalization waveforms and interpolate between them to create new vocalizations. We find that our synthetic naturalistic stimuli drive auditory cortical neurons in the mouse equally well and produce receptive field features with the same predictive power as those obtained with natural mouse vocalizations, confirming the quality of the stimuli produced by the neural network.

摘要

数千种物种使用声音信号相互交流。声音信号携带着丰富的信息,但对这些复杂的高维信号进行特征描述和分析是困难的,并且容易受到人为偏见的影响。此外,动物的声音信号是具有生态意义的刺激物,听觉神经元对其的表示是感觉神经科学研究的一个重要课题。一种能够有效地生成自然声音波形的方法将提供无限数量的刺激物,用于探测神经元计算。尽管无监督学习方法允许将声音信号投影到从波形本身学习到的低维潜在空间中,生成模型允许合成用于下游任务的新声音信号,但我们不知道有任何模型可以将这些任务结合起来,以在波形域中合成自然声音信号用于刺激回放。在本文中,我们展示了 BiWaveGAN:一种能够从老鼠的超声波声音(USV)中学习潜在表示的双向生成对抗网络(GAN)。我们表明,BiWaveGAN 可用于生成和在真实声音信号之间进行插值。然后,我们使用这些合成的刺激物以及自然 USV 来探测老鼠听觉皮层神经元的感觉输入空间。我们表明,我们的方法生成的刺激物可以像真实声音一样有效地引起神经元反应,并产生具有相同预测能力的感受野。BiWaveGAN 不仅限于老鼠 USV,还可以用于合成任何动物物种的自然声音信号,并在同一物种或不同物种的声音信号之间进行插值,这对于探测与生态相关的听觉信号的表示中的类别边界可能很有用。提出了一种新的人工神经网络类型,可用于生成动物声音信号并在它们之间进行插值以创建新的声音。我们发现,我们的合成自然刺激物同样能够驱动老鼠的听觉皮层神经元,并产生与用自然老鼠声音获得的感受野特征具有相同预测能力的特征,从而确认了神经网络产生的刺激物的质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/53108b8ed174/jn.00421.2023_f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/7af85a90b314/jn-00421-2023r01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/d7120280f7d8/jn.00421.2023_f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/21de8e50b63b/jn.00421.2023_f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/be3405bc02df/jn.00421.2023_f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/c6d17627cd5a/jn.00421.2023_f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/f526b8833995/jn.00421.2023_f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/53108b8ed174/jn.00421.2023_f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/7af85a90b314/jn-00421-2023r01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/d7120280f7d8/jn.00421.2023_f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/21de8e50b63b/jn.00421.2023_f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/be3405bc02df/jn.00421.2023_f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/c6d17627cd5a/jn.00421.2023_f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/f526b8833995/jn.00421.2023_f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ca/11495180/53108b8ed174/jn.00421.2023_f006.jpg

相似文献

1
Bidirectional generative adversarial representation learning for natural stimulus synthesis.双向生成对抗性表示学习用于自然刺激合成。
J Neurophysiol. 2024 Oct 1;132(4):1156-1169. doi: 10.1152/jn.00421.2023. Epub 2024 Aug 28.
2
Short-Term Memory Impairment短期记忆障碍
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Vocal usage learning and vocal comprehension learning in harbor seals.海豹的发声使用学习和发声理解学习。
BMC Neurosci. 2024 Oct 4;25(1):48. doi: 10.1186/s12868-024-00899-4.
5
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
6
Developmental encoding of natural sounds in the mouse auditory cortex.在小鼠听觉皮层中自然声音的发育编码。
Cereb Cortex. 2024 Nov 5;34(11). doi: 10.1093/cercor/bhae438.
7
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.
8
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
9
Sexual Harassment and Prevention Training性骚扰与预防培训
10
Comparison of cellulose, modified cellulose and synthetic membranes in the haemodialysis of patients with end-stage renal disease.纤维素、改性纤维素和合成膜在终末期肾病患者血液透析中的比较。
Cochrane Database Syst Rev. 2001(3):CD003234. doi: 10.1002/14651858.CD003234.

本文引用的文献

1
Spike sorting with Kilosort4.Kilosort4 进行尖峰分类。
Nat Methods. 2024 May;21(5):914-921. doi: 10.1038/s41592-024-02232-7. Epub 2024 Apr 8.
2
Composite receptive fields in the mouse auditory cortex.鼠类听觉皮层的复合感受野。
J Physiol. 2023 Sep;601(18):4091-4104. doi: 10.1113/JP285003. Epub 2023 Aug 14.
3
Computational bioacoustics with deep learning: a review and roadmap.深度学习的计算生物声学:综述与路线图。
PeerJ. 2022 Mar 21;10:e13152. doi: 10.7717/peerj.13152. eCollection 2022.
4
Cortical representation of group social communication in bats.蝙蝠群体社交通讯的皮层表征。
Science. 2021 Oct 22;374(6566):eaba9584. doi: 10.1126/science.aba9584.
5
Primary visual cortex straightens natural video trajectories.初级视皮层使自然视频轨迹变直。
Nat Commun. 2021 Oct 13;12(1):5982. doi: 10.1038/s41467-021-25939-z.
6
Neurally driven synthesis of learned, complex vocalizations.神经驱动的学习型复杂发声合成。
Curr Biol. 2021 Aug 9;31(15):3419-3425.e5. doi: 10.1016/j.cub.2021.05.035. Epub 2021 Jun 16.
7
Low-dimensional learned feature spaces quantify individual and group differences in vocal repertoires.低维习得特征空间定量个体和群体在声音曲目上的差异。
Elife. 2021 May 14;10:e67855. doi: 10.7554/eLife.67855.
8
Analysis of ultrasonic vocalizations from mice using computer vision and machine learning.利用计算机视觉和机器学习分析小鼠的超声发声。
Elife. 2021 Mar 31;10:e59161. doi: 10.7554/eLife.59161.
9
Inception loops discover what excites neurons most using deep predictive models.Inception 循环使用深度预测模型发现最能激发神经元的事物。
Nat Neurosci. 2019 Dec;22(12):2060-2065. doi: 10.1038/s41593-019-0517-x. Epub 2019 Nov 4.
10
Parallels in the sequential organization of birdsong and human speech.鸟鸣和人类言语在顺序组织上的相似性。
Nat Commun. 2019 Aug 12;10(1):3636. doi: 10.1038/s41467-019-11605-y.