Department of Statistics, TU Dortmund University, Dortmund, Germany.
Institute of Biometry and Clinical Epidemiology, Charitè-Universitätsmedizin Berlin, Berlin, Germany.
Biom J. 2024 Dec;66(8):e70008. doi: 10.1002/bimj.70008.
In many life science experiments or medical studies, subjects are repeatedly observed and measurements are collected in factorial designs with multivariate data. The analysis of such multivariate data is typically based on multivariate analysis of variance (MANOVA) or mixed models, requiring complete data, and certain assumption on the underlying parametric distribution such as continuity or a specific covariance structure, for example, compound symmetry. However, these methods are usually not applicable when discrete data or even ordered categorical data are present. In such cases, nonparametric rank-based methods that do not require stringent distributional assumptions are the preferred choice. However, in the multivariate case, most rank-based approaches have only been developed for complete observations. It is the aim of this work to develop asymptotic correct procedures that are capable of handling missing values, allowing for singular covariance matrices and are applicable for ordinal or ordered categorical data. This is achieved by applying a wild bootstrap procedure in combination with quadratic form-type test statistics. Beyond proving their asymptotic correctness, extensive simulation studies validate their applicability for small samples. Finally, two real data examples are analyzed.
在许多生命科学实验或医学研究中,通常采用析因设计,对重复观测的对象进行多元数据收集。此类多元数据分析通常基于多元方差分析(MANOVA)或混合模型,需要完整的数据,以及对潜在参数分布的某些假设,例如连续性或特定的协方差结构,例如复合对称。然而,当存在离散数据甚至有序分类数据时,这些方法通常不适用。在这种情况下,不要求严格分布假设的非参数基于秩的方法是首选。然而,在多元情况下,大多数基于秩的方法仅针对完整观测进行了开发。本工作的目的是开发具有处理缺失值能力的渐近正确程序,允许奇异协方差矩阵,并适用于有序或有序分类数据。这是通过应用野 Bootstrap 程序与二次型检验统计量相结合来实现的。除了证明其渐近正确性外,广泛的模拟研究还验证了它们在小样本下的适用性。最后,分析了两个真实数据示例。