• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

作为说话者相似度度量的欧几里得距离,包括同卵双胞胎对:一项使用源和滤波器语音特征的法医调查。

Euclidean Distances as measures of speaker similarity including identical twin pairs: A forensic investigation using source and filter voice characteristics.

作者信息

San Segundo Eugenia, Tsanas Athanasios, Gómez-Vilda Pedro

机构信息

Department of Language and Linguistic Science, University of York, Heslington, York, YO10 5DD, UK.

Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK; Wolfson Centre for Mathematical Biology, Mathematical Institute, University of Oxford, Oxford, UK; Sleep and Circadian Neuroscience Institute, Nuffield Department of Medicine, University of Oxford, UK.

出版信息

Forensic Sci Int. 2017 Jan;270:25-38. doi: 10.1016/j.forsciint.2016.11.020. Epub 2016 Nov 17.

DOI:10.1016/j.forsciint.2016.11.020
PMID:27912151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5698260/
Abstract

There is a growing consensus that hybrid approaches are necessary for successful speaker characterization in Forensic Speaker Comparison (FSC); hence this study explores the forensic potential of voice features combining source and filter characteristics. The former relate to the action of the vocal folds while the latter reflect the geometry of the speaker's vocal tract. This set of features have been extracted from pause fillers, which are long enough for robust feature estimation while spontaneous enough to be extracted from voice samples in real forensic casework. Speaker similarity was measured using standardized Euclidean Distances (ED) between pairs of speakers: 54 different-speaker (DS) comparisons, 54 same-speaker (SS) comparisons and 12 comparisons between monozygotic twins (MZ). Results revealed that the differences between DS and SS comparisons were significant in both high quality and telephone-filtered recordings, with no false rejections and limited false acceptances; this finding suggests that this set of voice features is highly speaker-dependent and therefore forensically useful. Mean ED for MZ pairs lies between the average ED for SS comparisons and DS comparisons, as expected according to the literature on twin voices. Specific cases of MZ speakers with very high ED (i.e. strong dissimilarity) are discussed in the context of sociophonetic and twin studies. A preliminary simplification of the Vocal Profile Analysis (VPA) Scheme is proposed, which enables the quantification of voice quality features in the perceptual assessment of speaker similarity, and allows for the calculation of perceptual-acoustic correlations. The adequacy of z-score normalization for this study is also discussed, as well as the relevance of heat maps for detecting the so-called phantoms in recent approaches to the biometric menagerie.

摘要

越来越多的人达成共识,即混合方法对于法医语音比较(FSC)中成功的说话人特征描述是必要的;因此,本研究探讨了结合源特征和滤波器特征的语音特征的法医潜力。前者与声带的动作有关,而后者反映了说话人声道的几何形状。这组特征是从停顿填充词中提取的,停顿填充词足够长以便进行稳健的特征估计,同时又足够自然,可以从实际法医案件工作中的语音样本中提取。使用说话人对之间的标准化欧几里得距离(ED)来测量说话人相似度:54组不同说话人(DS)比较、54组同一说话人(SS)比较以及12组同卵双胞胎(MZ)之间的比较。结果显示,在高质量录音和电话滤波录音中,DS和SS比较之间的差异均显著,没有错误拒绝且错误接受有限;这一发现表明,这组语音特征高度依赖于说话人,因此在法医方面很有用。正如关于双胞胎声音的文献所预期的那样,MZ对的平均ED介于SS比较和DS比较的平均ED之间。在社会语音学和双胞胎研究的背景下讨论了MZ说话人ED非常高(即非常不相似)的具体案例。提出了语音特征分析(VPA)方案的初步简化方法,该方法能够在说话人相似度的感知评估中对语音质量特征进行量化,并允许计算感知声学相关性。还讨论了本研究中z分数归一化的适用性,以及热图在检测生物特征库最新方法中所谓“幻影”方面的相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/9fa7059c84d8/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/25b311e5f94e/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/ec598df0a793/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/0d3921430f65/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/471c875ea251/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/2b8dca40d5bb/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/31b957292f92/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/6603bd6f7f82/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/f9ed123aafb2/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/9fa7059c84d8/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/25b311e5f94e/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/ec598df0a793/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/0d3921430f65/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/471c875ea251/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/2b8dca40d5bb/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/31b957292f92/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/6603bd6f7f82/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/f9ed123aafb2/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/5698260/9fa7059c84d8/gr9.jpg

相似文献

1
Euclidean Distances as measures of speaker similarity including identical twin pairs: A forensic investigation using source and filter voice characteristics.作为说话者相似度度量的欧几里得距离,包括同卵双胞胎对:一项使用源和滤波器语音特征的法医调查。
Forensic Sci Int. 2017 Jan;270:25-38. doi: 10.1016/j.forsciint.2016.11.020. Epub 2016 Nov 17.
2
A Simplified Vocal Profile Analysis Protocol for the Assessment of Voice Quality and Speaker Similarity.一种用于评估语音质量和说话者相似度的简化语音特征分析方案。
J Voice. 2017 Sep;31(5):644.e11-644.e27. doi: 10.1016/j.jvoice.2017.01.005. Epub 2017 Feb 15.
3
Acoustic analysis of vowel formant frequencies in genetically-related and non-genetically related speakers with implications for forensic speaker comparison.对具有遗传和非遗传关系的发音人元音共振峰频率的声学分析及其对法庭说话人比较的影响。
PLoS One. 2021 Feb 18;16(2):e0246645. doi: 10.1371/journal.pone.0246645. eCollection 2021.
4
A test of the effectiveness of speaker verification for differentiating between identical twins.一项关于语音识别技术区分同卵双胞胎有效性的测试。
Sci Justice. 2008 Dec;48(4):182-6. doi: 10.1016/j.scijus.2008.02.002.
5
Do long-term acoustic-phonetic features and mel-frequency cepstral coefficients provide complementary speaker-specific information for forensic voice comparison?长期声学-语音特征和梅尔频率倒谱系数是否为法医语音比较提供了互补的说话人特异性信息?
Forensic Sci Int. 2024 Oct;363:112199. doi: 10.1016/j.forsciint.2024.112199. Epub 2024 Aug 22.
6
Speaker-individuality in suprasegmental temporal features: Implications for forensic voice comparison.超音段时间特征中的说话者个体性:对法医语音比较的启示。
Forensic Sci Int. 2014 May;238:59-67. doi: 10.1016/j.forsciint.2014.02.019. Epub 2014 Mar 5.
7
Multi-parametric analysis of speech timing in inter-talker identical twin pairs and cross-pair comparisons: Some forensic implications.多参数分析说话人时间在说话者相同的双胞胎对和交叉对比较中的表现:一些法医学上的启示。
PLoS One. 2022 Jan 21;17(1):e0262800. doi: 10.1371/journal.pone.0262800. eCollection 2022.
8
Inter-speaker articulatory variability during vowel-consonant-vowel sequences in twins and unrelated speakers.双胞胎与非双胞胎说话人在 VC 连续语中发音的个体间可变性。
J Acoust Soc Am. 2013 Nov;134(5):3766-80. doi: 10.1121/1.4822480.
9
[Similarity of monozygotic twins regarding vocal performance and acoustic markers and possible clinical significance].[单卵双胞胎在发声表现和声学标记方面的相似性及可能的临床意义]
HNO. 2000 Jun;48(6):462-9. doi: 10.1007/s001060050598.
10
Automatic source speaker selection for voice conversion.用于语音转换的自动源说话人选择。
J Acoust Soc Am. 2009 Jan;125(1):480-91. doi: 10.1121/1.3027445.

引用本文的文献

1
Novel Targets in a High-Altitude Pulmonary Hypertension Rat Model Based on RNA-seq and Proteomics.基于RNA测序和蛋白质组学的高原肺动脉高压大鼠模型中的新靶点
Front Med (Lausanne). 2021 Nov 3;8:742436. doi: 10.3389/fmed.2021.742436. eCollection 2021.
2
Assessing Parkinson's Disease at Scale Using Telephone-Recorded Speech: Insights from the Parkinson's Voice Initiative.利用电话录音语音大规模评估帕金森病:帕金森语音倡议的见解
Diagnostics (Basel). 2021 Oct 14;11(10):1892. doi: 10.3390/diagnostics11101892.
3
Machine learning approach for automatic recognition of tomato-pollinating bees based on their buzzing-sounds.

本文引用的文献

1
A guideline for the validation of likelihood ratio methods used for forensic evidence evaluation.用于法医证据评估的似然比方法验证指南。
Forensic Sci Int. 2017 Jul;276:142-153. doi: 10.1016/j.forsciint.2016.03.048. Epub 2016 Apr 26.
2
Adaptive Multi-Rate Compression Effects on Vowel Analysis.自适应多速率压缩对元音分析的影响。
Front Bioeng Biotechnol. 2015 Aug 20;3:118. doi: 10.3389/fbioe.2015.00118. eCollection 2015.
3
Objective Automatic Assessment of Rehabilitative Speech Treatment in Parkinson's Disease.帕金森病康复性言语治疗的客观评估
基于嗡嗡声的番茄授粉蜂自动识别的机器学习方法。
PLoS Comput Biol. 2021 Sep 16;17(9):e1009426. doi: 10.1371/journal.pcbi.1009426. eCollection 2021 Sep.
4
Acoustic analysis of vowel formant frequencies in genetically-related and non-genetically related speakers with implications for forensic speaker comparison.对具有遗传和非遗传关系的发音人元音共振峰频率的声学分析及其对法庭说话人比较的影响。
PLoS One. 2021 Feb 18;16(2):e0246645. doi: 10.1371/journal.pone.0246645. eCollection 2021.
5
A random forest classifier predicts recurrence risk in patients with ovarian cancer.随机森林分类器预测卵巢癌患者的复发风险。
Mol Med Rep. 2018 Sep;18(3):3289-3297. doi: 10.3892/mmr.2018.9300. Epub 2018 Jul 19.
6
Prognostic significance of microsatellite instability‑associated pathways and genes in gastric cancer.微卫星不稳定性相关通路和基因在胃癌中的预后意义。
Int J Mol Med. 2018 Jul;42(1):149-160. doi: 10.3892/ijmm.2018.3643. Epub 2018 Apr 26.
IEEE Trans Neural Syst Rehabil Eng. 2014 Jan;22(1):181-90. doi: 10.1109/TNSRE.2013.2293575.
4
Robust fundamental frequency estimation in sustained vowels: detailed algorithmic comparisons and information fusion with adaptive Kalman filtering.持续元音中的稳健基频估计:详细的算法比较及与自适应卡尔曼滤波的信息融合
J Acoust Soc Am. 2014 May;135(5):2885-901. doi: 10.1121/1.4870484.
5
Speech tempo and fundamental frequency patterns: a case study of male monozygotic twins and an age- and sex-matched sibling.言语节奏和基频模式:以男性同卵双胞胎及年龄和性别匹配的同胞为案例的研究
Logoped Phoniatr Vocol. 2013 Dec;38(4):173-81. doi: 10.3109/14015439.2012.742562. Epub 2012 Nov 29.
6
Hierarchical clustering analysis of blood plasma lipidomics profiles from mono- and dizygotic twin families.对单卵和双卵双胞胎家庭的血浆脂质组学图谱进行层次聚类分析。
Eur J Hum Genet. 2013 Jan;21(1):95-101. doi: 10.1038/ejhg.2012.110. Epub 2012 Jun 20.
7
Novel speech signal processing algorithms for high-accuracy classification of Parkinson's disease.新型语音信号处理算法可实现帕金森病的高精度分类。
IEEE Trans Biomed Eng. 2012 May;59(5):1264-71. doi: 10.1109/TBME.2012.2183367. Epub 2012 Jan 9.
8
Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson's disease symptom severity.非线性语音分析算法映射到标准指标,可实现对帕金森病平均症状严重程度的临床有用量化。
J R Soc Interface. 2011 Jun 6;8(59):842-55. doi: 10.1098/rsif.2010.0456. Epub 2010 Nov 17.
9
Local-learning-based feature selection for high-dimensional data analysis.基于局部学习的高维数据分析特征选择。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1610-26. doi: 10.1109/TPAMI.2009.190.
10
The biometric menagerie.生物识别动物园。
IEEE Trans Pattern Anal Mach Intell. 2010 Feb;32(2):220-30. doi: 10.1109/TPAMI.2008.291.