Keyword spotting techniques to improve the recognition accuracy of user-defined keywords.

Affiliations

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China.


Publication information

Neural Netw. 2021 Jul;139:237-245. doi: 10.1016/j.neunet.2021.03.012. Epub 2021 Mar 18.

DOI: 10.1016/j.neunet.2021.03.012
PMID: 33794426
Abstract

Existing keyword spotting (KWS) techniques recognize pre-defined keywords well but have poor recognition accuracy for user-defined keywords. In real use cases, users often want to define their own keywords for various reasons. To address this problem, this work proposes three techniques: incremental training with a revised loss function, data augmentation, and fine-grained training. Together they aim to improve accuracy for user-defined keywords while maintaining high accuracy for pre-defined keywords. The proposed techniques are applied to a classical KWS model (cnn-trad-fpool3) and a state-of-the-art KWS model (res15), respectively. The experimental results show that the proposed techniques achieve better recognition accuracy than several existing methods for user-defined keywords. With the proposed techniques, the recognition accuracy of user-defined keywords on cnn-trad-fpool3 and res15 is significantly improved, by 21.78% and 24.42%, respectively.

Similar articles

1
Keyword spotting techniques to improve the recognition accuracy of user-defined keywords.
Neural Netw. 2021 Jul;139:237-245. doi: 10.1016/j.neunet.2021.03.012. Epub 2021 Mar 18.
2
End-to-end keyword search system based on attention mechanism and energy scorer for low resource languages.
Neural Netw. 2021 Jul;139:326-334. doi: 10.1016/j.neunet.2021.04.002. Epub 2021 Apr 10.
3
A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents.
J Imaging. 2020 Nov 3;6(11):117. doi: 10.3390/jimaging6110117.
4
Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting.
Sensors (Basel). 2021 Jul 7;21(14):4648. doi: 10.3390/s21144648.
5
A novel word spotting method based on recurrent neural networks.
IEEE Trans Pattern Anal Mach Intell. 2012 Feb;34(2):211-24. doi: 10.1109/TPAMI.2011.113.
6
Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution.
Neural Netw. 2022 Jun;150:28-42. doi: 10.1016/j.neunet.2022.03.003. Epub 2022 Mar 10.
7
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance.
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10674-10686. doi: 10.1109/TNNLS.2023.3243259. Epub 2024 Aug 5.
8
Keyword Spotting Using Human Electrocorticographic Recordings.
Front Neurosci. 2019 Feb 19;13:60. doi: 10.3389/fnins.2019.00060. eCollection 2019.
9
Physical Reservoir Computing Using van der Waals Ferroelectrics for Acoustic Keyword Spotting.
ACS Nano. 2024 Aug 27;18(34):23265-23276. doi: 10.1021/acsnano.4c06144. Epub 2024 Aug 14.
10
A unified framework for image retrieval using keyword and visual features.
IEEE Trans Image Process. 2005 Jul;14(7):979-89. doi: 10.1109/tip.2005.847289.

Cited by

1
Global status and trends in type 2 diabetes remission from 2002 to 2022: A bibliometric and visual analysis.
Medicine (Baltimore). 2025 May 2;104(18):e42257. doi: 10.1097/MD.0000000000042257.
2
Worldwide Productivity and Research Trend of Publications Concerning Cancer-Related Neuropathic Pain: A Bibliometric Study.
J Pain Res. 2022 Sep 8;15:2747-2759. doi: 10.2147/JPR.S378119. eCollection 2022.