Han Shengtong, Zhang Hongmei, Sheng Wenhui, Arshad Hasan
Joseph J. Zilber School of Public Health, University of Wisconsin, Milwaukee, Milwaukee, WI.
School of Public Health, University of Memphis, Memphis, TN.
J Stat Comput Simul. 2019;89(5):815-830. doi: 10.1080/00949655.2019.1572756. Epub 2019 Jan 28.
This article focuses on the clustering problem based on Dirichlet process (DP) mixtures. To model both time invariant and temporal patterns, different from other existing clustering methods, the proposed semi-parametric model is flexible in that both the common and unique patterns are taken into account simultaneously. Furthermore, by jointly clustering subjects and the associated variables, the intrinsic complex shared patterns among subjects and among variables are expected to be captured. The number of clusters and cluster assignments are directly inferred with the use of DP. Simulation studies illustrate the effectiveness of the proposed method. An application to wheal size data is discussed with an aim of identifying novel temporal patterns among allergens within subject clusters.
本文聚焦于基于狄利克雷过程(DP)混合模型的聚类问题。为了对时不变模式和时间模式进行建模,与其他现有聚类方法不同,所提出的半参数模型具有灵活性,因为它能同时考虑共同模式和独特模式。此外,通过对个体和相关变量进行联合聚类,有望捕捉个体之间以及变量之间内在的复杂共享模式。利用DP直接推断聚类的数量和聚类分配。模拟研究说明了所提方法的有效性。讨论了在风团大小数据上的应用,目的是识别个体聚类内过敏原之间新的时间模式。