Vasishth Shravan, Nicenboim Bruno, Beckman Mary E, Li Fangfang, Kong Eun Jong
Department of Linguistics, University of Potsdam.
Department of Linguistics, Ohio State University.
J Phon. 2018 Nov;71:147-161. doi: 10.1016/j.wocn.2018.07.008. Epub 2018 Aug 29.
This tutorial analyzes voice onset time (VOT) data from Dongbei (Northeastern) Mandarin Chinese and North American English to demonstrate how Bayesian linear mixed models can be fit using the programming language Stan via the R package brms. Through this case study, we demonstrate some of the advantages of the Bayesian framework: researchers can (i) flexibly define the underlying process that they believe to have generated the data; (ii) obtain direct information regarding the uncertainty about the parameter that relates the data to the theoretical question being studied; and (iii) incorporate prior knowledge into the analysis. Getting started with Bayesian modeling can be challenging, especially when one is trying to model one's own (often unique) data. It is difficult to see how one can apply general principles described in textbooks to one's own specific research problem. We address this barrier to using Bayesian methods by providing three detailed examples, with source code to allow easy reproducibility. The examples presented are intended to give the reader a flavor of the process of model-fitting; suggestions for further study are also provided. All data and code are available from: https://osf.io/g4zpv.
本教程分析了东北官话和北美英语的语音起始时间(VOT)数据,以展示如何通过R包brms使用编程语言Stan拟合贝叶斯线性混合模型。通过这个案例研究,我们展示了贝叶斯框架的一些优点:研究人员可以(i)灵活定义他们认为产生数据的潜在过程;(ii)获得有关将数据与正在研究的理论问题相关的参数不确定性的直接信息;以及(iii)将先验知识纳入分析。开始进行贝叶斯建模可能具有挑战性,尤其是当一个人试图对自己(通常是独特的)数据进行建模时。很难看出如何将教科书中描述的一般原则应用于自己的特定研究问题。我们通过提供三个详细示例及源代码以实现轻松再现,来解决使用贝叶斯方法的这一障碍。所呈现的示例旨在让读者领略模型拟合的过程;还提供了进一步学习的建议。所有数据和代码可从以下网址获取:https://osf.io/g4zpv 。