• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BPCNN:用于说话人伪造检测的卷积神经网络的双点输入。

BPCNN: Bi-Point Input for Convolutional Neural Networks in Speaker Spoofing Detection.

机构信息

Department of Artificial Intelligence, Kongju National University, Cheonan 31080, Korea.

School of Computer Science, University of Seoul, Seoul 02504, Korea.

出版信息

Sensors (Basel). 2022 Jun 14;22(12):4483. doi: 10.3390/s22124483.

DOI:10.3390/s22124483
PMID:35746264
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9227335/
Abstract

We propose a method, called bi-point input, for convolutional neural networks (CNNs) that handle variable-length input features (e.g., speech utterances). Feeding input features into a CNN in a mini-batch unit requires that all features in each mini-batch have the same shape. A set of variable-length features cannot be directly fed into a CNN because they commonly have different lengths. Feature segmentation is a dominant method for CNNs to handle variable-length features, where each feature is decomposed into fixed-length segments. A CNN receives one segment as an input at one time. However, a CNN can consider only the information of one segment at one time, not the entire feature. This drawback limits the amount of information available at one time and consequently results in suboptimal solutions. Our proposed method alleviates this problem by increasing the amount of information available at one time. With the proposed method, a CNN receives a pair of two segments obtained from a feature as an input at one time. Each of the two segments generally covers different time ranges and therefore has different information. We also propose various combination methods and provide a rough guidance to set a proper segment length without evaluation. We evaluate the proposed method on the spoofing detection tasks using the ASVspoof 2019 database under various conditions. The experimental results reveal that the proposed method reduces the relative equal error rate (EER) by approximately 17.2% and 43.8% on average for the logical access (LA) and physical access (PA) tasks, respectively.

摘要

我们提出了一种方法,称为双点输入,用于处理可变长度输入特征(例如,语音话语)的卷积神经网络(CNN)。在批量单元中将输入特征馈送到 CNN 中需要使每个批量中的所有特征具有相同的形状。由于它们通常具有不同的长度,因此一组可变长度特征不能直接馈送到 CNN 中。特征分割是 CNN 处理可变长度特征的主要方法,其中每个特征分解为固定长度的段。CNN 一次接收一个段作为输入。但是,CNN 一次只能考虑一个段的信息,而不是整个特征。这个缺点限制了一次可用的信息量,从而导致次优的解决方案。我们提出的方法通过增加一次可用的信息量来缓解这个问题。在我们的方法中,CNN 一次接收来自特征的两个段对作为输入。这两个段通常覆盖不同的时间范围,因此具有不同的信息。我们还提出了各种组合方法,并提供了一个大致的指导,无需评估即可设置适当的段长。我们在各种条件下使用 ASVspoof 2019 数据库的欺骗检测任务上评估了所提出的方法。实验结果表明,所提出的方法分别平均将逻辑访问(LA)和物理访问(PA)任务的相对等错误率(EER)降低了约 17.2%和 43.8%。

相似文献

1
BPCNN: Bi-Point Input for Convolutional Neural Networks in Speaker Spoofing Detection.BPCNN:用于说话人伪造检测的卷积神经网络的双点输入。
Sensors (Basel). 2022 Jun 14;22(12):4483. doi: 10.3390/s22124483.
2
An efficient brain tumor image classifier by combining multi-pathway cascaded deep neural network and handcrafted features in MR images.基于多通路级联深度神经网络和磁共振图像手工特征融合的脑肿瘤图像高效分类器
Med Biol Eng Comput. 2021 Aug;59(7-8):1495-1527. doi: 10.1007/s11517-021-02370-6. Epub 2021 Jun 29.
3
Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction.使用卷积神经网络从患者多变量时间序列数据中学习隐藏模式:以医疗保健成本预测为例。
J Biomed Inform. 2020 Nov;111:103565. doi: 10.1016/j.jbi.2020.103565. Epub 2020 Sep 25.
4
Attention-Based Temporal-Frequency Aggregation for Speaker Verification.基于注意力的时频聚合在说话人验证中的应用。
Sensors (Basel). 2022 Mar 10;22(6):2147. doi: 10.3390/s22062147.
5
Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level Visual Speech Recognition.基于多个卷积神经网络的句子级唇读识别架构。
Sensors (Basel). 2021 Dec 23;22(1):72. doi: 10.3390/s22010072.
6
fMRI volume classification using a 3D convolutional neural network robust to shifted and scaled neuronal activations.使用对移位和缩放神经元激活具有鲁棒性的 3D 卷积神经网络进行 fMRI 体积分类。
Neuroimage. 2020 Dec;223:117328. doi: 10.1016/j.neuroimage.2020.117328. Epub 2020 Sep 5.
7
Multichannel Two-Dimensional Convolutional Neural Network Based on Interactive Features and Group Strategy for Chinese Sentiment Analysis.基于交互特征和分组策略的多通道二维卷积神经网络的中文情感分析。
Sensors (Basel). 2022 Jan 18;22(3):714. doi: 10.3390/s22030714.
8
A comparative study of pre-trained convolutional neural networks for semantic segmentation of breast tumors in ultrasound.用于超声乳腺肿瘤语义分割的预训练卷积神经网络的比较研究
Comput Biol Med. 2020 Nov;126:104036. doi: 10.1016/j.compbiomed.2020.104036. Epub 2020 Oct 8.
9
Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。
Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.
10
Fast and fully-automated detection and segmentation of pulmonary nodules in thoracic CT scans using deep convolutional neural networks.使用深度卷积神经网络快速、全自动检测和分割胸部 CT 扫描中的肺结节。
Comput Med Imaging Graph. 2019 Jun;74:25-36. doi: 10.1016/j.compmedimag.2019.02.003. Epub 2019 Mar 22.