• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过对多倒谱特征约简的广泛分析改进语音欺骗检测

Improving Voice Spoofing Detection Through Extensive Analysis of Multicepstral Feature Reduction.

作者信息

Souza Leonardo Mendes de, Guido Rodrigo Capobianco, Contreras Rodrigo Colnago, Viana Monique Simplicio, Bongarti Marcelo Adriano Dos Santos

机构信息

Department of Computer Science and Statistics, Institute of Biosciences, Letters and Exact Sciences, São Paulo State University, São José do Rio Preto 15054-000, SP, Brazil.

Department of Science and Technology, Federal University of São Paulo, São José dos Campos 12247-014, SP, Brazil.

出版信息

Sensors (Basel). 2025 Aug 5;25(15):4821. doi: 10.3390/s25154821.

DOI:10.3390/s25154821
PMID:40807985
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12349442/
Abstract

Voice biometric systems play a critical role in numerous security applications, including electronic device authentication, banking transaction verification, and confidential communications. Despite their widespread utility, these systems are increasingly targeted by sophisticated spoofing attacks that leverage advanced artificial intelligence techniques to generate realistic synthetic speech. Addressing the vulnerabilities inherent to voice-based authentication systems has thus become both urgent and essential. This study proposes a novel experimental analysis that extensively explores various dimensionality reduction strategies in conjunction with supervised machine learning models to effectively identify spoofed voice signals. Our framework involves extracting multicepstral features followed by the application of diverse dimensionality reduction methods, such as Principal Component Analysis (PCA), Truncated Singular Value Decomposition (SVD), statistical feature selection (ANOVA F-value, Mutual Information), Recursive Feature Elimination (RFE), regularization-based LASSO selection, Random Forest feature importance, and Permutation Importance techniques. Empirical evaluation using the ASVSpoof 2017 v2.0 dataset measures the classification performance with the Equal Error Rate (EER) metric, achieving values of approximately 10%. Our comparative analysis demonstrates significant performance gains when dimensionality reduction methods are applied, underscoring their value in enhancing the security and effectiveness of voice biometric verification systems against emerging spoofing threats.

摘要

语音生物识别系统在众多安全应用中发挥着关键作用,包括电子设备认证、银行交易验证和机密通信。尽管这些系统具有广泛的实用性,但它们越来越多地成为复杂欺骗攻击的目标,这些攻击利用先进的人工智能技术生成逼真的合成语音。因此,解决基于语音的认证系统固有的漏洞变得既紧迫又至关重要。本研究提出了一种新颖的实验分析方法,该方法广泛探索各种降维策略,并结合监督机器学习模型,以有效识别被欺骗的语音信号。我们的框架包括提取多谱特征,然后应用各种降维方法,如主成分分析(PCA)、截断奇异值分解(SVD)、统计特征选择(方差分析F值、互信息)、递归特征消除(RFE)、基于正则化的套索选择、随机森林特征重要性和排列重要性技术。使用ASVSpoof 2017 v2.0数据集进行的实证评估以等错误率(EER)指标衡量分类性能,得到的值约为10%。我们的比较分析表明,应用降维方法时性能有显著提升,突出了它们在增强语音生物识别验证系统抵御新出现的欺骗威胁的安全性和有效性方面的价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/e291abf86358/sensors-25-04821-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/e6a210b4e8b1/sensors-25-04821-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/bc61ee4fc29b/sensors-25-04821-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/8829d5921e5d/sensors-25-04821-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/c39a2eb18de4/sensors-25-04821-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/e291abf86358/sensors-25-04821-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/e6a210b4e8b1/sensors-25-04821-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/bc61ee4fc29b/sensors-25-04821-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/8829d5921e5d/sensors-25-04821-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/c39a2eb18de4/sensors-25-04821-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72cb/12349442/e291abf86358/sensors-25-04821-g005.jpg

相似文献

1
Improving Voice Spoofing Detection Through Extensive Analysis of Multicepstral Feature Reduction.通过对多倒谱特征约简的广泛分析改进语音欺骗检测
Sensors (Basel). 2025 Aug 5;25(15):4821. doi: 10.3390/s25154821.
2
DeepLASD countermeasure for logical access audio spoofing.针对逻辑访问音频欺骗的深度LASD对策。
Sci Rep. 2025 Jul 1;15(1):20839. doi: 10.1038/s41598-025-04808-5.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
5
Exploring Multi-Channel GPS Receivers for Detecting Spoofing Attacks on UAVs Using Machine Learning.探索用于使用机器学习检测无人机欺骗攻击的多通道GPS接收器。
Sensors (Basel). 2025 Jun 28;25(13):4045. doi: 10.3390/s25134045.
6
A novel double machine learning approach for detecting early breast cancer using advanced feature selection and dimensionality reduction techniques.一种使用先进特征选择和降维技术检测早期乳腺癌的新型双机器学习方法。
Sci Rep. 2025 Jul 2;15(1):22971. doi: 10.1038/s41598-025-06426-7.
7
Multimodal biometric authentication system leveraging optimally trained ensemble classifier using feature-level fusion.利用经过优化训练的集成分类器并采用特征级融合的多模态生物特征认证系统。
Technol Health Care. 2025 Jul 31:9287329251363424. doi: 10.1177/09287329251363424.
8
TabNet and TabTransformer: Novel Deep Learning Models for Chemical Toxicity Prediction in Comparison With Machine Learning.TabNet和TabTransformer:与机器学习相比用于化学毒性预测的新型深度学习模型。
J Appl Toxicol. 2025 May 1. doi: 10.1002/jat.4803.
9
Classification of finger movements through optimal EEG channel and feature selection.通过最优脑电图通道和特征选择对手指运动进行分类。
Front Hum Neurosci. 2025 Jul 16;19:1633910. doi: 10.3389/fnhum.2025.1633910. eCollection 2025.
10
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.

本文引用的文献

1
An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.多倒谱投影表示策略在嗓音障碍检测中的实验分析。
Sensors (Basel). 2023 May 30;23(11):5196. doi: 10.3390/s23115196.
2
High-Dimensional Brain in a High-Dimensional World: Blessing of Dimensionality.高维世界中的高维大脑:维度之幸
Entropy (Basel). 2020 Jan 9;22(1):82. doi: 10.3390/e22010082.
3
Effective dysphonia detection using feature dimension reduction and kernel density estimation for patients with Parkinson's disease.
基于特征降维和核密度估计的帕金森病患者有效嗓音障碍检测。
PLoS One. 2014 Feb 20;9(2):e88825. doi: 10.1371/journal.pone.0088825. eCollection 2014.
4
Masculine voices signal men's threat potential in forager and industrial societies.男性的声音在狩猎采集社会和工业社会中传达了他们的威胁潜力。
Proc Biol Sci. 2012 Feb 7;279(1728):601-9. doi: 10.1098/rspb.2011.0829. Epub 2011 Jul 13.
5
The prioritization of voice fundamental frequency or formants in listeners' assessments of speaker size, masculinity, and attractiveness.在听众对说话者体型、男性化程度和吸引力的评估中,语音基频或共振峰的优先级。
J Acoust Soc Am. 2011 Apr;129(4):2201-12. doi: 10.1121/1.3552866.
6
Suitability of dysphonia measurements for telemonitoring of Parkinson's disease.发声障碍测量用于帕金森病远程监测的适用性
IEEE Trans Biomed Eng. 2009 Apr;56(4):1015. doi: 10.1109/TBME.2008.2005954.
7
What is a support vector machine?什么是支持向量机?
Nat Biotechnol. 2006 Dec;24(12):1565-7. doi: 10.1038/nbt1206-1565.
8
Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors.通过短期倒谱参数和基于神经网络的检测器自动检测语音损伤。
IEEE Trans Biomed Eng. 2004 Feb;51(2):380-4. doi: 10.1109/TBME.2003.820386.
9
Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques.恒河猴的声道长度和共振峰频率离散与体型相关。
J Acoust Soc Am. 1997 Aug;102(2 Pt 1):1213-22. doi: 10.1121/1.421048.
10
Perceptual linear predictive (PLP) analysis of speech.语音的感知线性预测(PLP)分析
J Acoust Soc Am. 1990 Apr;87(4):1738-52. doi: 10.1121/1.399423.