Utility of Kolmogorov complexity measures: Analysis of L2 groups and L1 backgrounds.

Affiliation

King Saud University, Riyadh, Saudi Arabia.

Publication information

PLoS One. 2024 Apr 18;19(4):e0301806. doi: 10.1371/journal.pone.0301806. eCollection 2024.

Abstract

The proliferation of automated syntactic complexity tools has made it possible to analyze larger amounts of learner writing. However, existing tools tend to be language-specific or depend on segmenting learner production into native-based units of analysis. This study examined the utility of a language-general, unsupervised linguistic complexity metric, Kolmogorov complexity, in discriminating between L2 proficiency levels within several languages (Czech, German, Italian, English) and across various L1 backgrounds (N = 10), using two large CEFR-rated learner corpora. Kolmogorov complexity was measured at three levels: syntax, morphology, and overall linguistic complexity. Pairwise comparisons indicated that all Kolmogorov complexity measures discriminated among the proficiency levels within the L2s. L1-based variation in complexity was also observed. Distinct syntactic and morphological complexity patterns were found when L2 English writings were analyzed across versus within L1 backgrounds. These results indicate that Kolmogorov complexity could serve as a valuable metric in L2 writing research because of its cross-linguistic flexibility and holistic nature.
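Kolmogorov complexity itself is not computable, so in practice it is usually approximated by how well a text compresses. The abstract does not describe the exact procedure, so the following is only a minimal sketch of one common compression-based approach, under stated assumptions: `zlib` stands in for the compressor, random character deletion stands in for a morphological distortion, and random word deletion for a syntactic one. The function names, the 10% deletion rate, and the number of trials are all hypothetical choices for illustration, not values taken from the study.

```python
import random
import zlib


def compressed_size(text: str) -> int:
    """Bytes needed for the zlib-compressed UTF-8 encoding of text."""
    return len(zlib.compress(text.encode("utf-8"), 9))


def overall_complexity(text: str) -> float:
    """Compressed size relative to raw size; lower means more redundancy."""
    raw = len(text.encode("utf-8"))
    return compressed_size(text) / raw if raw else 0.0


def delete_characters(text: str, rate: float, rng: random.Random) -> str:
    """Assumed morphological distortion: drop non-space characters at random."""
    return "".join(ch for ch in text if ch.isspace() or rng.random() >= rate)


def delete_words(text: str, rate: float, rng: random.Random) -> str:
    """Assumed syntactic distortion: drop whole word tokens at random."""
    return " ".join(tok for tok in text.split() if rng.random() >= rate)


def distortion_ratio(text, distort, rate=0.1, trials=50, seed=0):
    """Mean compressed size after distortion, relative to the undistorted text."""
    rng = random.Random(seed)  # fixed seed keeps the estimate reproducible
    base = compressed_size(text)
    sizes = [compressed_size(distort(text, rate, rng)) for _ in range(trials)]
    return sum(sizes) / (trials * base)


if __name__ == "__main__":
    sample = "The students wrote essays and the teachers rated the essays. " * 20
    print("overall   :", overall_complexity(sample))
    print("morphology:", distortion_ratio(sample, delete_characters))
    print("syntax    :", distortion_ratio(sample, delete_words))
```

Because the distortions are random, the ratios are averaged over several trials. The three numbers are only meaningful when compared across texts of similar length processed with the same settings.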

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8348/11026146/44701a0c6795/pone.0301806.g001.jpg
