Zhang Haixiang, Li Yang, Wang HaiYing
Center for Applied Mathematics and KL-AAGDM, 12605 Tianjin University , Tianjin 300072, China.
Department of Biostatistics and Health Data Science, Indiana University School of Medicine and Richard M. Fairbanks School of Public Health, Indianapolis, IN 46202, USA.
Int J Biostat. 2025 Feb 4;21(1):53-65. doi: 10.1515/ijb-2024-0042. eCollection 2025 May 1.
To ensure privacy protection and alleviate computational burden, we propose a fast subsmaling procedure for the Cox model with massive survival datasets from multi-centered, decentralized sources. The proposed estimator is computed based on optimal subsampling probabilities that we derived and enables transmission of subsample-based summary level statistics between different storage sites with only one round of communication. For inference, the asymptotic properties of the proposed estimator were rigorously established. An extensive simulation study demonstrated that the proposed approach is effective. The methodology was applied to analyze a large dataset from the U.S. airlines.