• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

让对打印文稿进行人工评分成为过去式:对赫尔曼(2025年)的评论

Making manual scoring of typed transcripts a thing of the past: a commentary on Herrmann (2025).

作者信息

Bosker Hans Rutger

机构信息

Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands.

出版信息

Speech Lang Hear. 2025 Jun 9;28(1):2514395. doi: 10.1080/2050571X.2025.2514395. eCollection 2025.

DOI:10.1080/2050571X.2025.2514395
PMID:40757149
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12312738/
Abstract

Coding the accuracy of typed transcripts from experiments testing speech intelligibility is an arduous endeavour. A recent study in this journal [Herrmann, B. 2025. Leveraging natural language processing models to automate speech-intelligibility scoring. (1)] presents a novel approach for automating the scoring of such listener transcripts, leveraging Natural Language Processing (NLP) models. It involves the calculation of the semantic similarity between transcripts and target sentences using high-dimensional vectors, generated by such NLP models as ADA2, GPT2, BERT, and USE. This approach demonstrates exceptional accuracy, with negligible underestimation of intelligibility scores (by about 2-4%), numerically outperforming simpler computational tools like Autoscore and TSR. The method uniquely relies on semantic representations generated by large language models. At the same time, these models also form the Achilles heel of the technique: the transparency, accessibility, data security, ethical framework, and cost of the selected model directly impact the suitability of the NLP-based scoring method. Hence, working with such models can raise serious risks regarding the reproducibility of scientific findings. This in turn emphasises the need for fair, ethical, and evidence-based open source models. With such models, Herrmann's new tool represents a valuable addition to the speech scientist's toolbox.

摘要

对测试语音清晰度的实验中的打字记录准确性进行编码是一项艰巨的工作。本期刊最近的一项研究[赫尔曼,B. 2025年。利用自然语言处理模型实现语音清晰度评分自动化。(1)]提出了一种新颖的方法,利用自然语言处理(NLP)模型实现对此类听众记录的评分自动化。它涉及使用由ADA2、GPT2、BERT和USE等NLP模型生成的高维向量来计算记录与目标句子之间的语义相似度。这种方法显示出极高的准确性,对清晰度分数的低估可以忽略不计(约2 - 4%),在数值上优于Autoscore和TSR等更简单的计算工具。该方法独特地依赖于大语言模型生成的语义表示。与此同时,这些模型也构成了该技术的致命弱点:所选模型的透明度、可访问性、数据安全性、道德框架和成本直接影响基于NLP的评分方法的适用性。因此,使用此类模型可能会给科学发现的可重复性带来严重风险。这反过来强调了对公平、道德且基于证据的开源模型的需求。有了这样的模型,赫尔曼的新工具成为语音科学家工具箱中的一项宝贵补充。

相似文献

1
Making manual scoring of typed transcripts a thing of the past: a commentary on Herrmann (2025).让对打印文稿进行人工评分成为过去式:对赫尔曼(2025年)的评论
Speech Lang Hear. 2025 Jun 9;28(1):2514395. doi: 10.1080/2050571X.2025.2514395. eCollection 2025.
2
Short-Term Memory Impairment短期记忆障碍
3
Stench of Errors or the Shine of Potential: The Challenge of (Ir)Responsible Use of ChatGPT in Speech-Language Pathology.错误的恶臭还是潜力的光辉:言语病理学中(不)负责任地使用ChatGPT的挑战。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70088. doi: 10.1111/1460-6984.70088.
4
Predicting Drug-Side Effect Relationships From Parametric Knowledge Embedded in Biomedical BERT Models: Methodological Study With a Natural Language Processing Approach.从生物医学BERT模型中嵌入的参数知识预测药物副作用关系:一种自然语言处理方法的方法学研究
JMIR Med Inform. 2025 Jul 10;13:e67513. doi: 10.2196/67513.
5
Non-speech oral motor treatment for children with developmental speech sound disorders.针对发育性语音障碍儿童的非言语口腔运动治疗。
Cochrane Database Syst Rev. 2015 Mar 25;2015(3):CD009383. doi: 10.1002/14651858.CD009383.pub2.
6
Neonatal Nurses' Understanding of the Factors That Enhance and Hinder Early Communication Between Preterm Infants and Their Parents: A Narrative Inquiry Study.新生儿护士对促进和阻碍早产儿与其父母早期沟通因素的理解:一项叙事探究研究。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70093. doi: 10.1111/1460-6984.70093.
7
Grommets (ventilation tubes) for hearing loss associated with otitis media with effusion in children.用于治疗儿童渗出性中耳炎所致听力损失的鼓膜通气管(通风管)
Cochrane Database Syst Rev. 2005 Jan 25(1):CD001801. doi: 10.1002/14651858.CD001801.pub2.
8
Sexual Harassment and Prevention Training性骚扰与预防培训
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
Automated Scoring of the Speech Intelligibility Test Using Autoscore.使用自动评分系统对言语清晰度测试进行自动评分。
Am J Speech Lang Pathol. 2025 Jul 29;34(4S):2397-2408. doi: 10.1044/2024_AJSLP-24-00276. Epub 2024 Dec 12.

本文引用的文献

1
Intelligibility as a measure of speech perception: Current approaches, challenges, and recommendations.可懂度作为言语感知的衡量标准:当前方法、挑战与建议。
J Acoust Soc Am. 2023 Jan;153(1):68. doi: 10.1121/10.0016806.
2
Using fuzzy string matching for automated assessment of listener transcripts in speech intelligibility studies.使用模糊字符串匹配实现言语可懂度研究中听众记录的自动评估。
Behav Res Methods. 2021 Oct;53(5):1945-1953. doi: 10.3758/s13428-021-01542-4. Epub 2021 Mar 10.
3
Autoscore: An open-source automated tool for scoring listener perception of speech.Autoscore:一个用于对语音的听众感知进行评分的开源自动化工具。
J Acoust Soc Am. 2019 Jan;145(1):392. doi: 10.1121/1.5087276.