• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于感知小波包熵和卷积神经网络的身份向量提取用于语音认证

Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication.

作者信息

Lei Lei, She Kun

机构信息

School of Information and Software Engineering, University of Electrical and Science and Technology of China, Chengdu 610054, China.

出版信息

Entropy (Basel). 2018 Aug 13;20(8):600. doi: 10.3390/e20080600.

DOI:10.3390/e20080600
PMID:33265689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7513125/
Abstract

Recently, the accuracy of voice authentication system has increased significantly due to the successful application of the identity vector (i-vector) model. This paper proposes a new method for i-vector extraction. In the method, a perceptual wavelet packet transform (PWPT) is designed to convert speech utterances into wavelet entropy feature vectors, and a Convolutional Neural Network (CNN) is designed to estimate the frame posteriors of the wavelet entropy feature vectors. In the end, i-vector is extracted based on those frame posteriors. TIMIT and VoxCeleb speech corpus are used for experiments and the experimental results show that the proposed method can extract appropriate i-vector which reduces the equal error rate () and improve the accuracy of voice authentication system in clean and noisy environment.

摘要

近年来,由于身份向量(i-vector)模型的成功应用,语音认证系统的准确性显著提高。本文提出了一种新的i-vector提取方法。该方法中,设计了一种感知小波包变换(PWPT)将语音话语转换为小波熵特征向量,并设计了一个卷积神经网络(CNN)来估计小波熵特征向量的帧后验概率。最后,基于这些帧后验概率提取i-vector。使用TIMIT和VoxCeleb语音语料库进行实验,实验结果表明,该方法能够提取合适的i-vector,降低等错误率(EER),并提高语音认证系统在干净和嘈杂环境下的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/72ecd69e3d43/entropy-20-00600-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/048bf512d5ad/entropy-20-00600-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/32f74d63715b/entropy-20-00600-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/2c694bbe3af1/entropy-20-00600-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/5378c4fadc6f/entropy-20-00600-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/67441beab409/entropy-20-00600-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/bb2080abacfa/entropy-20-00600-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/fd7d1446bea5/entropy-20-00600-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/171cec1f9d82/entropy-20-00600-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/2111045730bf/entropy-20-00600-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/72ecd69e3d43/entropy-20-00600-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/048bf512d5ad/entropy-20-00600-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/32f74d63715b/entropy-20-00600-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/2c694bbe3af1/entropy-20-00600-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/5378c4fadc6f/entropy-20-00600-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/67441beab409/entropy-20-00600-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/bb2080abacfa/entropy-20-00600-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/fd7d1446bea5/entropy-20-00600-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/171cec1f9d82/entropy-20-00600-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/2111045730bf/entropy-20-00600-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e153/7513125/72ecd69e3d43/entropy-20-00600-g010.jpg

相似文献

1
Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication.基于感知小波包熵和卷积神经网络的身份向量提取用于语音认证
Entropy (Basel). 2018 Aug 13;20(8):600. doi: 10.3390/e20080600.
2
Entropy-based feature extraction technique in conjunction with wavelet packet transform for multi-mental task classification.结合小波包变换的基于熵的特征提取技术用于多心理任务分类。
Biomed Tech (Berl). 2019 Sep 25;64(5):529-542. doi: 10.1515/bmt-2018-0105.
3
Medical Image Authentication Method Based on the Wavelet Packet and Energy Entropy.基于小波包和能量熵的医学图像认证方法
Entropy (Basel). 2022 Jun 8;24(6):798. doi: 10.3390/e24060798.
4
[Epileptic EEG signal classification based on wavelet packet transform and multivariate multiscale entropy].基于小波包变换和多变量多尺度熵的癫痫脑电信号分类
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2013 Oct;30(5):1073-8, 1090.
5
Voice activity detection algorithm using perceptual wavelet entropy neighbor slope.基于感知小波熵邻域斜率的语音活动检测算法
Biomed Mater Eng. 2014;24(6):3295-301. doi: 10.3233/BME-141152.
6
A Novel Method for Classifying Liver and Brain Tumors Using Convolutional Neural Networks, Discrete Wavelet Transform and Long Short-Term Memory Networks.基于卷积神经网络、离散小波变换和长短时记忆网络的肝脏和脑肿瘤分类新方法。
Sensors (Basel). 2019 Apr 28;19(9):1992. doi: 10.3390/s19091992.
7
Pathological brain detection in MRI scanning by wavelet packet Tsallis entropy and fuzzy support vector machine.基于小波包Tsallis熵和模糊支持向量机的MRI扫描中病理性脑检测
Springerplus. 2015 Nov 24;4(1):716. doi: 10.1186/s40064-015-1523-4. eCollection 2015.
8
Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients' speech signal with unilateral vocal fold paralysis.基于遗传算法的小波包特征优化选择在单侧声带麻痹患者语音信号病理评估中的应用
Comput Biol Med. 2007 Apr;37(4):474-85. doi: 10.1016/j.compbiomed.2006.08.016. Epub 2006 Oct 10.
9
ECG authentication system design incorporating a convolutional neural network and generalized S-Transformation.结合卷积神经网络和广义 S 变换的心电图身份验证系统设计。
Comput Biol Med. 2018 Nov 1;102:168-179. doi: 10.1016/j.compbiomed.2018.09.027. Epub 2018 Sep 29.
10
Combining Multi-Scale Wavelet Entropy and Kernelized Classification for Bearing Multi-Fault Diagnosis.结合多尺度小波熵与核分类的轴承多故障诊断
Entropy (Basel). 2019 Feb 5;21(2):152. doi: 10.3390/e21020152.

引用本文的文献

1
Accurate and early prediction of the wound healing outcome of burn injuries using the wavelet Shannon entropy of terahertz time-domain waveforms.利用太赫兹时域波形的小波香农熵对烧伤伤口愈合结果进行准确和早期预测。
J Biomed Opt. 2022 Nov;27(11). doi: 10.1117/1.JBO.27.11.116001.
2
Machine Learning Techniques for THz Imaging and Time-Domain Spectroscopy.太赫兹成像和时域光谱的机器学习技术。
Sensors (Basel). 2021 Feb 8;21(4):1186. doi: 10.3390/s21041186.
3
Automatic recognition of breast invasive ductal carcinoma based on terahertz spectroscopy with wavelet packet transform and machine learning.

本文引用的文献

1
The mammalian cochlear map is optimally warped.哺乳动物的耳蜗图谱呈最佳扭曲状态。
J Acoust Soc Am. 2003 Aug;114(2):896-906. doi: 10.1121/1.1587150.
基于小波包变换和机器学习的太赫兹光谱技术自动识别乳腺浸润性导管癌
Biomed Opt Express. 2020 Jan 21;11(2):971-981. doi: 10.1364/BOE.381623. eCollection 2020 Feb 1.