Fang Hua
Department of Computer and Information Science, University of Massachusetts Dartmouth, Department of Quantitative Health Sciences, University of Massachusetts Medical School, Worcester, MA 01655.
Smart Health (Amst). 2017 Jun;1-2:50-65. doi: 10.1016/j.smhl.2017.04.002. Epub 2017 Apr 27.
Missing data are common in longitudinal observational and randomized controlled trials in smart health studies. Multiple-imputation based fuzzy clustering is an emerging non-parametric soft computing method, used for either semi-supervised or unsupervised learning. Multiple imputation (MI) has been widely-used in missing data analyses, but has not yet been scrutinized for unsupervised learning methods, although they are important for explaining the heterogeneity of treatment effects. Built upon our previous work on MIfuzzy clustering, this paper introduces the MIFuzzy concepts and performance, theoretically, empirically and numerically demonstrate how MI-based approach can reduce the uncertainty of clustering accuracy in comparison to non- and single-imputation based clustering approach. This paper advances our understanding of the utility and strength of MIFuzzy clustering approach to processing incomplete longitudinal behavioral intervention data.
在智能健康研究的纵向观察性试验和随机对照试验中,缺失数据很常见。基于多重填补的模糊聚类是一种新兴的非参数软计算方法,用于半监督学习或无监督学习。多重填补(MI)已广泛应用于缺失数据分析,但尚未针对无监督学习方法进行仔细研究,尽管这些方法对于解释治疗效果的异质性很重要。基于我们之前关于MI模糊聚类的工作,本文介绍了MI模糊的概念和性能,从理论、实证和数值方面证明了与基于非填补和单一填补的聚类方法相比,基于MI的方法如何能够降低聚类准确性的不确定性。本文加深了我们对MI模糊聚类方法处理不完整纵向行为干预数据的效用和优势的理解。