Brazauskas Ruta, Le-Rademacher Jennifer
Division of Biostatistics, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI 53226, USA.
Department of Health Sciences Research, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA.
Comput Methods Programs Biomed. 2016 Oct;135:199-207. doi: 10.1016/j.cmpb.2016.07.027. Epub 2016 Jul 25.
Clustered competing risks data arise often in genetic studies, multicenter investigations, and matched-pairs studies. In the last two decades, major advances in competing risks theory had been made. Many new statistical methods need to be evaluated via simulation studies. Some mechanisms for simulating clustered competing risks data have been considered in the literature. However, most of them produce data where the strength of the dependence between individuals within a cluster is not clear. In this article, we aim to examine various techniques for generating bivariate competing risks data.
Theoretical framework for simulating dependent competing risks data using latent failure time approach, multistate models, and shared frailty models is described. The steps needed to implement each method are outlined. Properties of each technique are discussed and standard measures of association are provided in order to assess the degree of dependence in simulated paired competing risks data.
In addition to describing a variety of techniques to generate dependent competing risks data, the cross-hazard ratios from multiple scenarios for each method are computed. The cross-hazard ratios provide a means to compare the level of dependence of the generated data across methods. This acts as a guide for researchers to select an approach and the parameters needed to achieve the desired degree of dependence for their simulation studies.
聚类竞争风险数据常见于基因研究、多中心调查及配对研究中。在过去二十年里,竞争风险理论取得了重大进展。许多新的统计方法需要通过模拟研究进行评估。文献中已考虑了一些模拟聚类竞争风险数据的机制。然而,其中大多数生成的数据中,聚类内个体间的依赖强度并不明确。在本文中,我们旨在研究生成双变量竞争风险数据的各种技术。
描述了使用潜在失效时间方法、多状态模型和共享脆弱性模型模拟相关竞争风险数据的理论框架。概述了实施每种方法所需的步骤。讨论了每种技术的特性,并提供了关联的标准度量,以评估模拟的配对竞争风险数据中的依赖程度。
除了描述生成相关竞争风险数据的各种技术外,还计算了每种方法在多种情况下的交叉风险比。交叉风险比提供了一种比较不同方法生成数据的依赖程度的方法。这为研究人员在模拟研究中选择一种方法以及实现所需依赖程度所需的参数提供了指导。