School of Foreign Languages, Ludong University, Yantai, China.
Department of Linguistics, Zhejiang University, Hangzhou, China.
PLoS One. 2021 Nov 11;16(11):e0259818. doi: 10.1371/journal.pone.0259818. eCollection 2021.
The models of linguistic networks and their analytical tools constitute a potential methodology for investigating the formation of structural patterns in actual language use. Research with this methodology has just started, which can hopefully shed light on the emergent nature of linguistic structure. This study attempts to employ linguistic networks to investigate the formation of modern Chinese two-character words (as structural units based on the chunking of their component characters) in the actual use of modern Chinese, which manifests itself as continuous streams of Chinese characters. Network models were constructed based on authentic Chinese language data, with Chinese characters as nodes, their co-occurrence relations as directed links, and the co-occurrence frequencies as link weights. Quantitative analysis of the network models has shown that a Chinese two-character word can highlight itself as a two-node island, i.e., a cohesive sub-network with its two component characters co-occurring more frequently than they co-occur with the other characters. This highlighting mechanism may play a vital role in the formation and acquisition of two-character words in actual language use. Moreover, this mechanism may also throw some light on the emergence of other structural phenomena (with the chunking of specific linguistic units as their basis).
语言网络模型及其分析工具构成了一种研究实际语言使用中结构模式形成的潜在方法。该方法的研究才刚刚开始,有望揭示语言结构的突现性质。本研究试图运用语言网络来研究现代汉语双字符词(基于其组成字符的分块的结构单元)在现代汉语实际使用中的形成,这种形式表现为汉字的连续流。基于真实的汉语语言数据构建网络模型,以汉字为节点,它们的共现关系为有向链接,共现频率为链接权重。对网络模型的定量分析表明,一个现代汉语双字符词可以突出自身为一个两节点孤岛,即其两个组成字符比与其他字符共现更频繁的有凝聚力的子网。这种突出机制可能在实际语言使用中双字符词的形成和习得中发挥重要作用。此外,这种机制也可能为其他结构现象的出现(以特定语言单位的分块为基础)提供一些启示。