• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Voice-based user interface for hands-free data entry and automation at workplaces.

作者信息

Harihar Daiwiik, Shrivastava Vedansh, Talele Pratvina, Jahagirdar Aditi

机构信息

School of Computer Science and Engineering, Dr. Vishwanath Karad MIT World Peace University, Paud Road, Kothrud, Pune, Maharashtra 411038, India.

出版信息

MethodsX. 2025 Aug 28;15:103596. doi: 10.1016/j.mex.2025.103596. eCollection 2025 Dec.

DOI:10.1016/j.mex.2025.103596
PMID:40949828
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12423416/
Abstract

The increasing demand for hands-free interaction in modern workplaces has led to the development of Voice-Based User Interfaces (VUIs) that enhance accessibility, efficiency and automation. This research presents the Voice-Based User Interface for Hands-Free Data Entry and Automation at Workplaces. The system enables real-time speech-to-text transcription, allowing users to interact with workplace applications without manual input, making it intuitive, user-friendly and capable of enhancing efficiency and convenience in various workplace scenarios. Through extensive testing and evaluation, the study demonstrates the practicality and benefits of the Voice-Based User Interface for hands-free data entry and automation. :•Utilized WIT.AI API for speech-to-text transcription.•Implemented chunking, caching, and concurrency control to optimize processing.•Evaluated performance using Word Error Rate (WER), Levenshtein Distance and Cosine Similarity on real world datasets.The system proves to be upto 88.8% accurate in recognizing spoken commands and efficiently converting them into text with best performance achieved when the audio was divided into 7 optimal chunks. Cosine Similarity for these chunks is more accurate than that of sizeable file and approximately 2. Moreover, the integration of real-time updates across different domains (educational, legal, medical) and data synchronization enhances productivity and usability. In conclusion, the Voice-Based User Interface offers a viable solution for hands-free data entry and automation at workplaces.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/23a1073ce756/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/17248a112533/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/4cad2cb2a716/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/3f5572a5639b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/02906357fe01/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/5b73e5e26f4c/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/23a1073ce756/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/17248a112533/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/4cad2cb2a716/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/3f5572a5639b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/02906357fe01/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/5b73e5e26f4c/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5089/12423416/23a1073ce756/gr5.jpg

相似文献

1
Voice-based user interface for hands-free data entry and automation at workplaces.
MethodsX. 2025 Aug 28;15:103596. doi: 10.1016/j.mex.2025.103596. eCollection 2025 Dec.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Sexual Harassment and Prevention Training性骚扰与预防培训
4
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
5
Automation of medical imaging business reporting workflows in Ontario for quantitative and qualitative process improvement.安大略省医学影像业务报告工作流程的自动化,以实现定量和定性的流程改进。
J Med Imaging Radiat Sci. 2025 Apr 2;56(4):101891. doi: 10.1016/j.jmir.2025.101891.
6
Effectiveness of voice rehabilitation on vocalisation in postlaryngectomy patients: a systematic review.喉切除术后患者的嗓音康复对发声效果的影响:系统评价。
Int J Evid Based Healthc. 2010 Dec;8(4):256-8. doi: 10.1111/j.1744-1609.2010.00177.x.
7
Auditory-Perceptual Evaluation of Situationally-Bound Judgements of Listener Comfort for Postlaryngectomy Voice and Speech.喉切除术后嗓音和言语情境性听觉舒适度判断的听觉感知评估
Int J Lang Commun Disord. 2025 Sep-Oct;60(5):e70114. doi: 10.1111/1460-6984.70114.
8
Optimizing Clinical Decision Support System Functionality by Leveraging Specific Human-Computer Interaction Elements: Insights From a Systematic Review.通过利用特定人机交互元素优化临床决策支持系统功能:系统评价的见解
JMIR Hum Factors. 2025 May 6;12:e69333. doi: 10.2196/69333.
9
Empowering inclusive education: a multi-modal android application for accessible transliteration of Indian languages into Braille script.
Disabil Rehabil Assist Technol. 2025 Oct;20(7):2392-2406. doi: 10.1080/17483107.2025.2539439. Epub 2025 Aug 20.
10
Automatic Image Recognition Meal Reporting Among Young Adults: Randomized Controlled Trial.年轻人中自动图像识别膳食报告:随机对照试验。
JMIR Mhealth Uhealth. 2025 Aug 14;13:e60070. doi: 10.2196/60070.

本文引用的文献

1
Transforming industrial automation: voice recognition control via containerized PLC device.变革工业自动化:通过集装箱式可编程逻辑控制器设备实现语音识别控制
Sci Rep. 2024 Nov 26;14(1):29387. doi: 10.1038/s41598-024-81172-w.
2
Investigating the Accessibility of Voice Assistants With Impaired Users: Mixed Methods Study.调查残障用户对语音助手的可及性:混合方法研究。
J Med Internet Res. 2020 Sep 25;22(9):e18431. doi: 10.2196/18431.