• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

失语症交流中交易成功简短评估的信度

Reliability of the Brief Assessment of Transactional Success in Communication in Aphasia.

作者信息

Kurland Jacquie, Liu Anna, Varadharaju Vishnupriya, Stokes Polly, Cavanaugh Rob

机构信息

University of Massachusetts Amherst, Department of Speech, Language, and Hearing Sciences.

University of Massachusetts Amherst, Department of Mathematics and Statistics.

出版信息

Aphasiology. 2025;39(3):363-384. doi: 10.1080/02687038.2024.2351029. Epub 2024 May 16.

DOI:10.1080/02687038.2024.2351029
PMID:40160198
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11949443/
Abstract

BACKGROUND

While many measures exist for assessing discourse in aphasia, manual transcription, editing, and scoring are prohibitively labor intensive, a major obstacle to their widespread use by clinicians (Bryant et al., 2017; Cruice et al., 2020). Many tools also lack rigorous psychometric evidence of reliability and validity (Azios et al., 2022; Carragher et al., 2023). Establishing test reliability is the first step in our long-term goal of automating the Brief Assessment of Transactional Success in aphasia (BATS; Kurland et al., 2021) and making it accessible to clinicians and clinical researchers.

AIMS

We evaluated multiple aspects of test reliability of the BATS by examining correlations between human/machine and human/human interrater edited transcripts, raw vs. edited transcripts, interrater scoring of main concepts, and test-retest performance. We hypothesized that automated methods of transcription and discourse analysis would demonstrate sufficient reliability to move forward with test development.

METHODS & PROCEDURES: We examined 576 story retelling narratives from a sample of 24 persons with aphasia and familiar and unfamiliar conversation partners (CP). Participants with aphasia (PWA) retold stories immediately after watching/listening to short video/audio clips. CP retold stories after six-minute topic-constrained conversations with a PWA in which the dyad co-constructed the stories. We utilized two macrostructural measures to analyze the automated speech-to-text transcripts of story retells: 1) a modified version of a semi-automated tool for measuring main concepts (mainConcept: Cavanaugh et al., 2021); and 2) an automated natural language processing 'pipeline' to assess topic similarity.

OUTCOMES & RESULTS: Correlations between raw and edited scores were excellent, interrater reliability on transcripts and main concept scoring were acceptable. Test-retest on repeated stimuli was acceptable. This was especially true of aphasic story retellings where there were actual within subject repeated stimuli.

CONCLUSIONS

Results suggest that automated speech-to-text was generally sufficient in most cases to avoid the time-consuming, labor intensive step of transcribing and editing discourse. Overall, our study results suggest that natural language processing automated methods such as text vectorization and cosine similarity are a fast, efficient way to obtain a measure of topic similarity between two discourse samples. Although test-retest reliability for the semi-automated mainConcept method was generally higher than for automated methods of measuring topic similarity, we found no evidence of a difference between machine automated and human-reliant scoring.

摘要

背景

虽然存在多种评估失语症话语的方法,但人工转录、编辑和评分的劳动强度极大,这是临床医生广泛使用这些方法的主要障碍(布莱恩特等人,2017年;克鲁斯等人,2020年)。许多工具也缺乏关于可靠性和有效性的严格心理测量证据(阿齐奥斯等人,2022年;卡拉赫等人,2023年)。建立测试可靠性是我们实现失语症交易成功简短评估(BATS;库兰德等人,2021年)自动化并使其可供临床医生和临床研究人员使用这一长期目标的第一步。

目的

我们通过检查人机和人人间评分者编辑转录本之间的相关性、原始转录本与编辑后转录本之间的相关性、主要概念的评分者间评分以及重测表现,评估了BATS测试可靠性的多个方面。我们假设转录和话语分析的自动化方法将显示出足够的可靠性,以便推进测试开发。

方法与程序

我们检查了来自24名失语症患者以及熟悉和不熟悉的对话伙伴(CP)样本的576个故事复述叙述。失语症患者(PWA)在观看/收听短视频/音频片段后立即复述故事。CP在与PWA进行六分钟主题受限对话后复述故事,在对话中二人共同构建故事。我们使用两种宏观结构测量方法来分析故事复述的自动语音转文本转录本:1)一种用于测量主要概念的半自动工具(主要概念:卡瓦诺等人,2021年)的修改版本;2)一种自动自然语言处理“管道”来评估主题相似性。

结果与结论

原始分数与编辑后分数之间呈极好的相关性,转录本和主要概念评分的评分者间可靠性是可接受的。对重复刺激的重测是可接受的。在失语症故事复述中尤其如此,其中存在实际的受试者内重复刺激。

结论

结果表明,在大多数情况下,自动语音转文本通常足以避免转录和编辑话语这一耗时且劳动强度大的步骤。总体而言,我们的研究结果表明,诸如文本向量化和余弦相似度等自然语言处理自动化方法是获得两个话语样本之间主题相似性度量的快速、有效方法。虽然半自动主要概念方法的重测可靠性通常高于测量主题相似性的自动化方法,但我们没有发现机器自动化评分和人工评分之间存在差异的证据。

相似文献

1
Reliability of the Brief Assessment of Transactional Success in Communication in Aphasia.失语症交流中交易成功简短评估的信度
Aphasiology. 2025;39(3):363-384. doi: 10.1080/02687038.2024.2351029. Epub 2024 May 16.
2
Large Language Models' Ability to Assess Main Concepts in Story Retelling: A Proof-of-Concept Comparison of Human Versus Machine Ratings.
Am J Speech Lang Pathol. 2025 Mar 31:1-11. doi: 10.1044/2025_AJSLP-24-00400.
3
The development of a novel, standardized, norm-referenced Arabic Discourse Assessment Tool (ADAT), including an examination of psychometric properties of discourse measures in aphasia.开发一种新型、标准化、基于常模的阿拉伯语语篇评估工具(ADAT),包括评估失语症患者语篇测量的心理测量特性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2103-2117. doi: 10.1111/1460-6984.13083. Epub 2024 Jun 18.
4
Development and Real-Time Clinical Application of New Transcription-Less Discourse Assessment Approaches for Arabic Speakers With Aphasia.针对失语症阿拉伯语患者的新型无转录语篇评估方法的开发与实时临床应用
Int J Lang Commun Disord. 2025 May-Jun;60(3):e70043. doi: 10.1111/1460-6984.70043.
5
A scoping review of transcription-less practices for analysis of aphasic discourse and implications for future research.无转录语料分析的范围综述及对未来研究的启示
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1734-1762. doi: 10.1111/1460-6984.13028. Epub 2024 Mar 23.
6
Comprehensive Assessment of Reading in Aphasia (CARA) reading questionnaire-German version.失语症阅读综合评估(CARA)问卷 - 德语版
Int J Lang Commun Disord. 2023 Sep-Oct;58(5):1588-1609. doi: 10.1111/1460-6984.12884. Epub 2023 Apr 19.
7
Reviewing the quality of discourse information measures in aphasia.失语症中话语信息测量质量的综述。
Int J Lang Commun Disord. 2017 Nov;52(6):689-732. doi: 10.1111/1460-6984.12318. Epub 2017 May 31.
8
Adaptation of The Scenario Test for Greek-speaking people with aphasia: A reliability and validity study.适应希腊语失语症人群的情景测试:一项可靠性和有效性研究。
Int J Lang Commun Disord. 2022 Jul;57(4):865-880. doi: 10.1111/1460-6984.12727. Epub 2022 May 12.
9
A study on the reliability and validity of the Japanese version of the Scenario Test for people with chronic stroke-induced aphasia: A cross-sectional study.一项关于慢性脑卒中性失语症患者情景测试日语版的信度和效度的研究:一项横断面研究。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1878-1892. doi: 10.1111/1460-6984.13040. Epub 2024 May 1.
10
Cultural adaptation and psychometric testing of The Scenario Test UK for people with aphasia.针对失语症患者的英国情景测试的文化适应性与心理测量测试
Int J Lang Commun Disord. 2018 Jul;53(4):748-760. doi: 10.1111/1460-6984.12379. Epub 2018 Mar 3.

引用本文的文献

1
Expanding the scope: multimodal dimensions in aphasia discourse analysis-preliminary findings.拓展范围:失语症话语分析中的多模态维度——初步研究结果
Front Hum Neurosci. 2024 Sep 25;18:1419311. doi: 10.3389/fnhum.2024.1419311. eCollection 2024.

本文引用的文献

1
Main Concepts for Three Different Discourse Tasks in a Large Non-Clinical Sample.来自大量非临床样本的三种不同话语任务的主要概念。
Aphasiology. 2016;30(1):45-73. doi: 10.1080/02687038.2015.1057891. Epub 2015 Jun 18.
2
Towards efficient, ecological assessment of interaction: A scoping review of co-constructed communication.走向高效、生态的互动评估:协同构建沟通的范围综述。
Int J Lang Commun Disord. 2024 May-Jun;59(3):831-875. doi: 10.1111/1460-6984.12957. Epub 2023 Oct 21.
3
Automation of Language Sample Analysis.语言样本分析自动化。
J Speech Lang Hear Res. 2023 Jul 12;66(7):2421-2433. doi: 10.1044/2023_JSLHR-22-00642. Epub 2023 Jun 22.
4
Measuring communication as a core outcome in aphasia trials: Results of the ROMA-2 international core outcome set development meeting.测量失语症试验中的沟通情况作为核心结局指标:ROMAA-2 国际核心结局集开发会议的结果。
Int J Lang Commun Disord. 2023 Jul-Aug;58(4):1017-1028. doi: 10.1111/1460-6984.12840. Epub 2022 Dec 30.
5
Conversation as an Outcome of Aphasia Treatment: A Systematic Scoping Review.对话作为失语症治疗的结果:系统范围综述。
Am J Speech Lang Pathol. 2022 Nov 16;31(6):2920-2942. doi: 10.1044/2022_AJSLP-22-00011. Epub 2022 Nov 10.
6
A Monte Carlo Study of Confidence Interval Methods for Generalizability Coefficient.可推广性系数置信区间方法的蒙特卡罗研究
Educ Psychol Meas. 2022 Aug;82(4):705-718. doi: 10.1177/00131644211033899. Epub 2021 Aug 7.
7
Main concepts for two picture description tasks: An addition to.两个图片描述任务的主要概念:补充内容。
Aphasiology. 2020;34(1):119-136. doi: 10.1080/02687038.2018.1561417. Epub 2019 Jan 6.
8
Standardizing Assessment of Spoken Discourse in Aphasia: A Working Group With Deliverables.标准化失语症口语评估:具有可交付成果的工作组。
Am J Speech Lang Pathol. 2021 Feb 11;30(1S):491-502. doi: 10.1044/2020_AJSLP-19-00093. Epub 2020 Jun 25.
9
Videoconference Administration of the Western Aphasia Battery-Revised: Feasibility and Validity.视频会议管理西方失语症成套测验修订版:可行性和有效性。
Am J Speech Lang Pathol. 2020 May 8;29(2):673-687. doi: 10.1044/2019_AJSLP-19-00023. Epub 2020 Mar 19.
10
UK speech and language therapists' views and reported practices of discourse analysis in aphasia rehabilitation.英国言语治疗师在失语症康复中对话语分析的看法和报告的实践。
Int J Lang Commun Disord. 2020 May;55(3):417-442. doi: 10.1111/1460-6984.12528. Epub 2020 Feb 24.