Pan Wenliang, Wang Xueqin, Zhang Heping, Zhu Hongtu, Zhu Jin
Department of Statistical Science, School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, China (
Department of Statistical Science, School of Mathematics, Southern China Center for Statistical Science, Sun Yat-Sen University, Guangzhou, 510275, China; Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, 510080, China; and Xinhua College, Sun Yat-Sen University, Guangzhou, 510520, China (
J Am Stat Assoc. 2020;115(529):307-317. doi: 10.1080/01621459.2018.1543600. Epub 2019 Apr 11.
Technological advances in science and engineering have led to the routine collection of large and complex data objects, where the dependence structure among those objects is often of great interest. Those complex objects (e.g, different brain subcortical structures) often reside in some Banach spaces, and hence their relationship cannot be well characterized by most of the existing measures of dependence such as correlation coefficients developed in Hilbert spaces. To overcome the limitations of the existing measures, we propose Ball Covariance as a generic measure of dependence between two random objects in two possibly different Banach spaces. Our Ball Covariance possesses the following attractive properties: (i) It is nonparametric and model-free, which make the proposed measure robust to model mis-specification; (ii) It is nonnegative and equal to zero if and only if two random objects in two separable Banach spaces are independent; (iii) Empirical Ball Covariance is easy to compute and can be used as a test statistic of independence. We present both theoretical and numerical results to reveal the potential power of the Ball Covariance in detecting dependence. Also importantly, we analyze two real datasets to demonstrate the usefulness of Ball Covariance in the complex dependence detection.
科学与工程领域的技术进步使得大规模复杂数据对象的常规收集成为可能,而这些对象之间的依赖结构往往备受关注。那些复杂对象(例如,不同的脑皮质下结构)通常存在于某些巴拿赫空间中,因此它们之间的关系无法用大多数现有的依赖度量(如在希尔伯特空间中开发的相关系数)很好地描述。为了克服现有度量的局限性,我们提出球协方差作为两个可能不同的巴拿赫空间中两个随机对象之间依赖关系的通用度量。我们的球协方差具有以下吸引人的特性:(i)它是非参数且无模型的,这使得所提出的度量对模型误设具有鲁棒性;(ii)它是非负的,并且当且仅当两个可分巴拿赫空间中的随机对象相互独立时等于零;(iii)经验球协方差易于计算,并且可以用作独立性的检验统计量。我们给出了理论和数值结果,以揭示球协方差在检测依赖关系方面的潜在能力。同样重要的是,我们分析了两个真实数据集,以证明球协方差在复杂依赖关系检测中的有用性。