Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world?

Authors

Törö Tuukka, Suni Antti, Šimko Juraj

Affiliation

Department of Digital Humanities, University of Helsinki, Helsinki, Finland.

Publication information

PLoS One. 2025 Aug 25;20(8):e0330755. doi: 10.1371/journal.pone.0330755. eCollection 2025.

DOI:10.1371/journal.pone.0330755
PMID:40853958
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12377560/
Abstract

Investigating linguistic relationships on a global scale requires analyzing diverse features such as syntax, phonology and prosody, which evolve at varying rates influenced by internal diversification, language contact, and sociolinguistic factors. Recent advances in machine learning (ML) offer complementary alternatives to traditional historical and typological approaches. Instead of relying on expert labor in analyzing specific linguistic features, these new methods enable the exploration of linguistic variation through embeddings derived directly from speech, opening new avenues for large-scale, data-driven analyses. This study employs embeddings from the fine-tuned XLS-R self-supervised language identification model voxlingua107-xls-r-300m-wav2vec to analyze relationships between 106 world languages based on speech recordings. Using linear discriminant analysis (LDA), language embeddings are clustered and compared with genealogical, lexical, and geographical distances. The results demonstrate that embedding-based distances align closely with traditional measures, effectively capturing both global and local typological patterns. Challenges in visualizing relationships, particularly with hierarchical clustering and network-based methods, highlight the dynamic nature of language change. The findings show potential for scalable analyses of language variation based on speech embeddings, providing new perspectives on relationships among languages. By addressing methodological considerations such as corpus size and latent space dimensionality, this approach opens avenues for studying low-resource languages and bridging macro- and micro-level linguistic variation. Future work aims to extend these methods to underrepresented languages and integrate sociolinguistic variation for a more comprehensive understanding of linguistic diversity.
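The comparison between embedding-based and geographical distances described in the abstract can be illustrated with a small sketch. It is not the authors' pipeline: it skips the XLS-R embedding extraction and the LDA step entirely, and it assumes that one centroid vector per language is already available. The language names, centroid values, and coordinates below are made-up placeholders, and the helper names (`distance_agreement`, `haversine_km`, and so on) are hypothetical. The sketch only shows one plausible way to score agreement between two sets of pairwise distances, using a Spearman rank correlation over all language pairs.

```python
import math
from itertools import combinations


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def distance_agreement(centroids, coords):
    """Rank correlation between embedding and geographic pairwise distances."""
    pairs = list(combinations(sorted(centroids), 2))
    emb = [euclidean(centroids[a], centroids[b]) for a, b in pairs]
    geo = [haversine_km(coords[a], coords[b]) for a, b in pairs]
    return spearman(emb, geo)


# Placeholder data (assumption): four hypothetical languages with 2-D centroids
# and rough (lat, lon) locations. Real centroids would come from the model.
toy_centroids = {
    "lang_a": (0.0, 0.0),
    "lang_b": (1.0, 0.0),
    "lang_c": (5.0, 5.0),
    "lang_d": (6.0, 5.0),
}
toy_coords = {
    "lang_a": (60.0, 25.0),
    "lang_b": (59.0, 18.0),
    "lang_c": (-1.0, 37.0),
    "lang_d": (-6.0, 35.0),
}

rho = distance_agreement(toy_centroids, toy_coords)
print(f"Spearman correlation over language pairs: {rho:.3f}")
```

Pairwise distances are not independent observations, so a correlation computed this way should not be read with an ordinary p-value; a permutation scheme such as a Mantel test is the usual choice for judging significance. The same function could be reused with genealogical or lexical distances in place of the geographic ones.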


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/4d407f32f628/pone.0330755.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/4641a7391a49/pone.0330755.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/1d357b1b20a4/pone.0330755.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/cfc56c12090f/pone.0330755.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/59eb7caa3055/pone.0330755.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/ab5db61e08b4/pone.0330755.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/2f9035539299/pone.0330755.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/0fd17d5877e1/pone.0330755.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/aa2d89f18ba8/pone.0330755.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/90d928c22283/pone.0330755.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d12/12377560/7ef2e78e3772/pone.0330755.g011.jpg