Dawdy-Hesterberg Lisa Garnand, Pierrehumbert Janet Breckenridge
Department of Linguistics, Northwestern University , 2016 Sheridan Road, Evanston , IL 60202-0854 , USA.
Lang Cogn Neurosci. 2014 Nov 26;29(10):1268-1282. doi: 10.1080/23273798.2014.899377. Epub 2014 Mar 18.
The noun plural system in Modern Standard Arabic lies at a nexus of critical issues in morphological learnability. The suffixing "sound" plural competes with as many as 31 non-concatenative "broken" plural patterns. Our computational analysis of singular-plural pairs in the Corpus of Contemporary Arabic explores what types of linguistic information are statistically relevant to morphological generalisation for this highly complex system. We show that an analogical approach with the generalised context model is highly successful in predicting the plural form for any given singular form. This model proves to be robust to variation, as evidenced by its stability across 10 rounds of cross-validation. The predictive power is carried almost entirely by the CV template, a representation which specifies a segment's status as a consonant or vowel only, providing further support for the abstraction of prosodic templates in the Arabic morphological system as proposed by McCarthy and Prince.
现代标准阿拉伯语中的名词复数系统处于形态可学性关键问题的交汇点。后缀式“规则”复数形式与多达31种非连接式“破碎”复数形式相互竞争。我们对当代阿拉伯语语料库中单数 - 复数对进行的计算分析,探究了对于这个高度复杂的系统而言,哪些类型的语言信息在统计上与形态泛化相关。我们表明,采用广义语境模型的类推方法在预测任何给定单数形式的复数形式方面非常成功。该模型被证明对变化具有鲁棒性,这在其十轮交叉验证中的稳定性得到了证明。预测能力几乎完全由CV模板承载,该表示仅指定一个音段为辅音或元音的状态,为麦卡锡和普林斯提出的阿拉伯语形态系统中韵律模板的抽象提供了进一步支持。