A pipeline for stochastic and controlled generation of realistic language input for simulating infant language acquisition.

Authors

Räsänen Okko, Kocharov Daniil

Affiliation

Signal Processing Research Centre, Tampere University, Tampere, Finland.

Publication

Behav Res Methods. 2025 Sep 4;57(10):275. doi: 10.3758/s13428-025-02772-6.

DOI:10.3758/s13428-025-02772-6
PMID:40908330
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12411597/

Abstract

Computational models of early language development involve implementing theories of learning as functional learning algorithms, exposing these models to realistic language input, and comparing learning outcomes to those in infants. While recent research has made major strides in developing more powerful learning models and evaluation protocols grounded in infant data, models are still predominantly trained with non-naturalistic input data, such as crowd-sourced read speech or text transcripts. This is due to the lack of suitable child-directed speech (CDS) corpora in terms of scale and quality. In parallel, the question of how properties and individual variability in language input affect learning outcomes is an active area of empirical research, underlining the need for realistic yet controllable data for modeling such phenomena. This paper presents a solution to the training data problem through stochastic generation of naturalistic CDS data using statistical models, thereby enabling controlled computational simulations with naturalistic input. We provide a proof-of-concept demonstration of the approach by showing how naturalistic CDS transcripts can be generated with a language model conditioned on recipient information (here, infant age), and how text-to-speech systems can be used to convert the transcripts to high-quality speech with a controllable speaking style. We also conduct modeling experiments with generated speech corpora by varying different aspects of the data, showing how this maps into different learning outcomes, thereby demonstrating the feasibility of the approach for controlled language learning simulations. Finally, we discuss the limitations of using synthetic data in general, and of the present proof-of-concept pipeline in particular.
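
The idea of stochastic yet controlled transcript generation conditioned on infant age can be illustrated with a very small sketch. This is not the authors' pipeline: the abstract does not specify the language model or the text-to-speech system, so the example swaps in a toy bigram sampler, and the corpus, the age values, and the names `TOY_CORPUS`, `train_bigrams`, and `generate` are all assumptions made for illustration. It only shows how a conditioning variable (age) selects the statistics that utterances are sampled from, and how a seeded random generator keeps the sampling reproducible.

```python
import random
from collections import defaultdict

# Hypothetical toy corpus of (infant age in months, utterance) pairs.
# These examples are invented for illustration and are not data from the paper.
TOY_CORPUS = [
    (6, "look at the doggy"),
    (6, "where is the ball"),
    (6, "look at the ball"),
    (24, "can you put the ball in the box"),
    (24, "do you want to read the book"),
    (24, "put the book in the box"),
]

BOS, EOS = "<s>", "</s>"


def train_bigrams(corpus):
    """Collect bigram successors separately for each age condition."""
    model = defaultdict(lambda: defaultdict(list))
    for age, utterance in corpus:
        tokens = [BOS] + utterance.split() + [EOS]
        for prev, nxt in zip(tokens, tokens[1:]):
            model[age][prev].append(nxt)
    return model


def generate(model, age, rng, max_len=20):
    """Sample one utterance from the bigram statistics of the given age."""
    token, words = BOS, []
    while len(words) < max_len:
        token = rng.choice(model[age][token])
        if token == EOS:
            break
        words.append(token)
    return " ".join(words)


model = train_bigrams(TOY_CORPUS)
rng = random.Random(0)  # fixed seed: stochastic but repeatable
for age in (6, 24):
    print(age, [generate(model, age, rng) for _ in range(3)])
```

In a fuller simulation, the sampled transcripts would then be passed to a text-to-speech system with a chosen speaking style, which this sketch leaves out.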

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d638/12411597/e5dbea71a276/13428_2025_2772_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d638/12411597/fd9e509b1eb3/13428_2025_2772_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d638/12411597/1f1a36a72eeb/13428_2025_2772_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d638/12411597/db28acf14d5d/13428_2025_2772_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d638/12411597/47f18e0188b0/13428_2025_2772_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d638/12411597/8ac8d20bd863/13428_2025_2772_Fig6_HTML.jpg
