• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器评分句法:CLAN 自动评分程序与人工评分的比较。

Machine-Scored Syntax: Comparison of the CLAN Automatic Scoring Program to Manual Scoring.

机构信息

Department of Speech-Language-Hearing Sciences, Hofstra University, Hempstead, NY.

School of Languages and Social Sciences, Aston University, Birmingham, United Kingdom.

出版信息

Lang Speech Hear Serv Sch. 2020 Apr 7;51(2):479-493. doi: 10.1044/2019_LSHSS-19-00056. Epub 2020 Mar 18.

DOI:10.1044/2019_LSHSS-19-00056
PMID:32186956
Abstract

Purpose The results of automatic machine scoring of the Index of Productive Syntax from the Computerized Language ANalysis (CLAN) tools of the Child Language Data Exchange System of TalkBank (MacWhinney, 2000) were compared to manual scoring to determine the accuracy of the machine-scored method. Method Twenty transcripts of 10 children from archival data of the Weismer Corpus from the Child Language Data Exchange System at 30 and 42 months were examined. Measures of absolute point difference and point-to-point accuracy were compared, as well as points erroneously given and missed. Two new measures for evaluating automatic scoring of the Index of Productive Syntax were introduced: Machine Item Accuracy (MIA) and Cascade Failure Rate- these measures further analyze points erroneously given and missed. Differences in total scores, subscale scores, and individual structures were also reported. Results Mean absolute point difference between machine and hand scoring was 3.65, point-to-point agreement was 72.6%, and MIA was 74.9%. There were large differences in subscales, with Noun Phrase and Verb Phrase subscales generally providing greater accuracy and agreement than Question/Negation and Sentence Structures subscales. There were significantly more erroneous than missed items in machine scoring, attributed to problems of mistagging of elements, imprecise search patterns, and other errors. Cascade failure resulted in an average of 4.65 points lost per transcript. Conclusions The CLAN program showed relatively inaccurate outcomes in comparison to manual scoring on both traditional and new measures of accuracy. Recommendations for improvement of the program include accounting for second exemplar violations and applying cascaded credit, among other suggestions. It was proposed that research on machine-scored syntax routinely report accuracy measures detailing erroneous and missed scores, including MIA, so that researchers and clinicians are aware of the limitations of a machine-scoring program. Supplemental Material https://doi.org/10.23641/asha.11984364.

摘要

目的

比较计算机语言分析(CLAN)工具中的生产语法索引的自动机器评分与手动评分的结果,以确定机器评分方法的准确性。方法:检查来自 TalkBank 的儿童语言数据交换系统(MacWhinney,2000)的 Weismer 语料库档案数据中 10 个儿童的 20 个转录本,年龄分别为 30 个月和 42 个月。比较了绝对点差和逐点准确性、误给和漏给的分数。引入了两种评估生产语法索引自动评分的新度量标准:机器项目准确性(MIA)和级联失败率——这些度量标准进一步分析了误给和漏给的分数。还报告了总分数、子量表分数和个别结构的差异。结果:机器评分与手动评分之间的平均绝对点差为 3.65,逐点一致性为 72.6%,MIA 为 74.9%。子量表的差异很大,名词短语和动词短语子量表通常比问题/否定和句子结构子量表提供更高的准确性和一致性。机器评分中误给的项目明显多于漏给的项目,这归因于元素标记错误、搜索模式不精确以及其他错误。级联故障导致每个转录本平均损失 4.65 分。结论:与手动评分相比,CLAN 程序在传统和新的准确性度量上的结果都相对不准确。该程序的改进建议包括考虑第二个示例违规并应用级联信用,以及其他建议。有人建议,对机器评分语法的研究应定期报告准确性度量标准,详细说明错误和遗漏的分数,包括 MIA,以便研究人员和临床医生了解机器评分程序的局限性。补充材料:https://doi.org/10.23641/asha.11984364。

相似文献

1
Machine-Scored Syntax: Comparison of the CLAN Automatic Scoring Program to Manual Scoring.机器评分句法:CLAN 自动评分程序与人工评分的比较。
Lang Speech Hear Serv Sch. 2020 Apr 7;51(2):479-493. doi: 10.1044/2019_LSHSS-19-00056. Epub 2020 Mar 18.
2
Promises and pitfalls of machine scoring of the Index of Productive Syntax.《生产性句法指数机器评分的前景与陷阱》
Clin Linguist Phon. 2016;30(6):433-48. doi: 10.3109/02699206.2016.1139184. Epub 2016 Feb 25.
3
Improving Automatic IPSyn Coding.改进自动 IPSyn 编码。
Lang Speech Hear Serv Sch. 2020 Oct 2;51(4):1187-1189. doi: 10.1044/2020_LSHSS-20-00090. Epub 2020 Sep 21.
4
The Index of Productive Syntax: Psychometric Properties and Suggested Modifications.生产语法指数:心理测量学特性和建议的修正。
Am J Speech Lang Pathol. 2022 Jan 18;31(1):239-256. doi: 10.1044/2021_AJSLP-21-00084. Epub 2021 Nov 8.
5
Young Children's Structure Production: A Revision of the Index of Productive Syntax.幼儿的结构生成:生产性句法指数的修订版
Lang Speech Hear Serv Sch. 2018 Oct 24;49(4):995-1008. doi: 10.1044/2018_LSHSS-17-0092.
6
Using Free Computer-Assisted Language Sample Analysis to Evaluate and Set Treatment Goals for Children Who Speak African American English.使用免费的计算机辅助语言样本分析评估和设定讲非裔美国英语的儿童的治疗目标。
Lang Speech Hear Serv Sch. 2021 Jan 19;52(1):31-50. doi: 10.1044/2020_LSHSS-19-00107. Epub 2021 Jan 18.
7
Using Computer Programs for Language Sample Analysis.使用计算机程序进行语言样本分析。
Lang Speech Hear Serv Sch. 2020 Jan 8;51(1):103-114. doi: 10.1044/2019_LSHSS-18-0148. Epub 2019 Nov 7.
8
Automation of the Northwestern Narrative Language Analysis System.西北叙事语言分析系统的自动化
J Speech Lang Hear Res. 2020 Jun 22;63(6):1835-1844. doi: 10.1044/2020_JSLHR-19-00267. Epub 2020 May 28.
9
Manual Versus Automated Narrative Analysis of Agrammatic Production Patterns: The Northwestern Narrative Language Analysis and Computerized Language Analysis.语法缺失性产出模式的人工叙事分析与自动叙事分析对比:西北叙事语言分析与计算机化语言分析
J Speech Lang Hear Res. 2018 Feb 15;61(2):373-385. doi: 10.1044/2017_JSLHR-L-17-0185.
10
Using Computerized Language Analysis to Evaluate Grammatical Skills.使用计算机语言分析评估语法技能。
Lang Speech Hear Serv Sch. 2020 Apr 7;51(2):184-204. doi: 10.1044/2019_LSHSS-19-00032.

引用本文的文献

1
The Index of Productive Syntax: Psychometric Properties and Suggested Modifications.生产语法指数:心理测量学特性和建议的修正。
Am J Speech Lang Pathol. 2022 Jan 18;31(1):239-256. doi: 10.1044/2021_AJSLP-21-00084. Epub 2021 Nov 8.
2
Tracking Child Language Development With Neural Network Language Models.利用神经网络语言模型追踪儿童语言发展
Front Psychol. 2021 Jul 8;12:674402. doi: 10.3389/fpsyg.2021.674402. eCollection 2021.
3
Using Free Computer-Assisted Language Sample Analysis to Evaluate and Set Treatment Goals for Children Who Speak African American English.
使用免费的计算机辅助语言样本分析评估和设定讲非裔美国英语的儿童的治疗目标。
Lang Speech Hear Serv Sch. 2021 Jan 19;52(1):31-50. doi: 10.1044/2020_LSHSS-19-00107. Epub 2021 Jan 18.
4
Improving Automatic IPSyn Coding.改进自动 IPSyn 编码。
Lang Speech Hear Serv Sch. 2020 Oct 2;51(4):1187-1189. doi: 10.1044/2020_LSHSS-20-00090. Epub 2020 Sep 21.